Theses (Computer Science)

Permanent URI for this collection

https://dspace.library.uvic.ca/handle/1828/76

Bandwidth tomography
(2025) An, Jianwei; Wu, Kui
Bandwidth tomography—inferring the bandwidth of internal network links from end-to-end path bandwidth measurements—is a long-standing open problem in network tomography. The core challenge arises from the fact that no existing mathematical framework directly addresses the inverse problem formulated as a set of min-equations. To systematically tackle this challenge, we design a polynomial-time algorithm that accurately determines the bandwidth of all identifiable links and derives the tightest possible error bounds for unidentifiable links based on a given set of measurement paths. Furthermore, when additional information on link correlations is available, we leverage the extra information to refine our error bounds. Specifically, we explore two key types of link correlations: fairness constraints and total capacity constraints among a node's adjacent links. We provide theoretical guarantees on how these correlations enhance the precision of bandwidth tomography and develop algorithms to address two fundamental challenges in refining these bounds: (i) the impact of synchronous vs. asynchronous updates and (ii) the cascading effects during bound updates. Having developed algorithms to derive the tightest possible performance bounds for a given set of measurement paths, we then tackle the next major challenge: constructing optimal measurement paths that minimize the global error bounds for unidentifiable links. We prove the hardness of this problem and, in response, propose a reinforcement learning (RL) approach for measurement path construction. Our solution leverages domain-specific knowledge in bandwidth tomography and integrates both offline training and online prediction to build suitable measurement paths. We evaluate our proposed methods using real-world ISP topologies and simulated networks. 
Experimental results show that compared to existing path construction methods—Random and Diversity Preferred—our RL-based approach significantly reduces the average error bound of inferred link bandwidths. In addition, our performance bound computation algorithms improve the state-of-the-art techniques by substantially tightening the performance bounds in bandwidth tomography.
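As a rough illustration of the min-equation formulation, the sketch below derives simple capacity bounds from path measurements. It is not the thesis's algorithm: the function name `link_bounds`, the input layout, and the single-candidate identification rule are assumptions made for this example, and the link correlations described above are ignored.

```python
from typing import Dict, List, Tuple


def link_bounds(
    paths: Dict[str, List[str]],
    measured: Dict[str, float],
) -> Dict[str, Tuple[float, float]]:
    """Return (lower, upper) capacity bounds per link.

    Assumes each path p satisfies measured[p] == min(capacity[e] for e in paths[p])
    and that the measurements are noise-free (hypothetical simplification).
    """
    # Every link on a path has at least that path's measured bandwidth.
    lower: Dict[str, float] = {}
    for p, links in paths.items():
        for e in links:
            lower[e] = max(lower.get(e, 0.0), measured[p])

    upper = {e: float("inf") for e in lower}

    # A link whose lower bound exceeds the path bandwidth cannot be the
    # bottleneck of that path. If exactly one candidate remains, its
    # capacity must equal the measurement, so the link is identified.
    for p, links in paths.items():
        candidates = [e for e in set(links) if lower[e] == measured[p]]
        if len(candidates) == 1:
            upper[candidates[0]] = measured[p]

    return {e: (lower[e], upper[e]) for e in lower}
```

With paths `{"p1": ["a", "b"], "p2": ["b", "c"], "p3": ["a", "c"]}` and measurements 10, 10, and 20, link `b` is pinned to 10, while `a` and `c` keep only the lower bound 20: one of them equals 20, but the measurements cannot say which.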
Three ethical dimensions of AI: Fairness in social recommenders, bias detection in LLMs, and privacy in NLP
(2025) Potka, Shera; Thomo, Alex
This thesis investigates three foundational challenges in the development of responsible Artificial Intelligence (AI): fairness in social recommender systems, demographic bias in large language models (LLMs), and privacy-preserving techniques for Natural Language Processing (NLP). Though these problems differ in technical scope and application domain, they share a common thread: vector-based representations (embeddings of users, words, and tokens) fundamentally shape how AI systems behave, make decisions, and affect people. Across these three dimensions, this work introduces new methods for measuring, interpreting, and mitigating risk, offering solutions grounded in both empirical analysis and practical utility. The first part of the thesis (Chapter 2) examines fairness in algorithmic link recommendation, with a focus on how structural minority communities (groups defined by network topology rather than identity) are represented in evolving social graphs. Standard recommenders tend to amplify popular users, reinforcing visibility gaps over time. We propose MinWalk, a fairness-aware algorithm that improves minority visibility while maintaining network stability. Simulations on real-world networks show that fairness- and diversity-aware algorithms vary widely in long-term impact, and that MinWalk offers a balanced, effective solution. This work underscores the importance of evaluating fairness dynamically and provides tools for designing more inclusive recommendation systems. The second part (Chapters 3 and 4) turns to demographic bias in LLM behavior. We analyze gender and race associations in contextual embeddings from five leading models developed by OpenAI, Google, Microsoft, Cohere, and BGE. Using the SC-WEAT metric and clustering techniques, we show that stereotypical associations persist and are amplified in modern embeddings. We also examine how these biases appear in real-world applications, focusing on consumer product recommendations.
Using prompt engineering and computational linguistics methods (including Marked Words, SVM classification, and distributional divergence), we find that LLMs generate demographically skewed suggestions that reinforce social stereotypes. These findings highlight the risks of bias in LLM outputs and offer concrete tools for auditing fairness in generative systems. The final part (Chapter 5) addresses privacy in NLP, where the challenge lies in removing sensitive information from text without damaging meaning or fluency. Existing approaches either prioritize privacy but degrade text quality, or preserve fluency at the cost of weaker guarantees. To address this, we propose CluSanT, a flexible framework that uses token clustering and controlled replacement mechanisms to balance privacy and utility. Unlike prior methods, CluSanT retains strong privacy protection while producing more natural, semantically faithful text. We evaluate it using a range of metrics (including coherence, grammar, and semantic similarity), showing that it consistently improves over baselines on a legal benchmark dataset. Our results demonstrate that text sanitization can be both effective and intelligible to human readers. Taken together, this thesis presents a unified perspective on ethical AI through the lens of embeddings. In social networks, language generation, and privacy-preserving NLP, vector representations are not neutral: they encode power dynamics, preferences, and access. By examining how these embeddings influence visibility, bias, and confidentiality, this work contributes both practical algorithms and conceptual frameworks for designing fair, inclusive, and trustworthy AI systems.
Single mutation effects on protein secondary structure
(2025) Perez Martell, Raul Ivan; Stege, Ulrike; Jabbari, Hosna
Human diversity often manifests through single nucleotide polymorphisms (SNPs). Among these polymorphisms, SNPs that alter amino acids can modify a protein's three-dimensional structure. Such single amino acid mutations can impact the protein's function and potentially elicit diseases or affect drug interactions. Thus, understanding protein single point mutations is crucial for precision medicine, as it helps tailor treatments based on individual genetic variations. Protein tertiary structure prediction models like AlphaFold2 have revolutionized the field with unprecedented accuracy, yet predicting structural changes arising from single amino acid mutations remains a challenge. The complexity introduced by these mutations calls for models that can incorporate mutational information into their predictions. As atomic locations can be susceptible to any number of changes that might or might not affect function, we focus on the secondary structure to provide concrete results on possible protein structural deformation that may occur from single amino acid mutations. We assess state-of-the-art structure prediction methods regarding backbone deformations caused by single amino acid mutations. We categorize these deformations as local, distant, or global based on the proximity of structural changes to the mutation site. Our analysis utilizes a diverse dataset from the Protein Data Bank, comprising over 500 protein clusters with experimentally determined structures and documented mutations. Our findings indicate that single amino acid mutations can significantly affect the accuracy of structure prediction methods. These mutations often lead to predicted structural changes even when the actual secondary structures remain unchanged, suggesting that current methods overestimate the impact of single amino acid mutations. This issue is particularly evident in advanced prediction algorithms, which struggle to accurately model proteins with stable mutations. 
We also found that the addition of low-performing prediction methods during structural analysis can positively impact the results on some proteins, particularly those with low levels of homology. Furthermore, proteins that form complexes or bind ligands—such as membrane and transport proteins—are inaccurately predicted due to the absence of extra-molecular interaction data in the models, highlighting how single amino acid mutations can complicate accurate structure prediction. Due to these findings, we propose a novel refinement strategy for protein secondary structure prediction that leverages single amino acid mutational data. As part of this strategy, we introduce Mut2Dens, a model that not only yields more consistent predictions for mutational data but also maintains robust predictive performance on non-mutational datasets. These refined models take multiple predicted secondary structures and generate a mutation-aware secondary structure. In particular, Mut2Dens employs the extremely randomized trees algorithm to avoid overfitting and make effective use of the limited mutational data available from experimentally determined three-dimensional structures. By combining predictions from highly accurate structure prediction models, we create an ensemble that integrates their strengths while enhancing mutational capabilities. This refinement strategy also improves the non-mutational performance of state-of-the-art methods by addressing their most inaccurate and least confident predictions. Moreover, our refinement strategy reduces improbable outcomes in mutated protein structures—such as transforming π-helices into β-sheets—that can still occur in current prediction models. Finally, by using interpretable machine learning algorithms, we can reveal the underlying biological knowledge from the refinement model. 
The insights gained from Mut2Dens can be corroborated with known mutational outcomes, helping users pinpoint discrepancies across structure prediction models and make more informed decisions regarding the predicted structures.
Toward an extensible quantum platform-agnostic combinatorial optimization library
(2025) Ossorio Trochez, Jose; Muller, Hausi A.; Villegas, Norha M.
Combinatorial optimization (CO) problems are computationally challenging as evidenced in various industry and research domains. With recent advances in quantum computing hardware and algorithms, such problems represent an excellent case study for these technologies. Nevertheless, current software tools for CO lack platform-agnostic abstractions to enable researchers and practitioners to utilize quantum resources effectively. This thesis aims to validate and extend the QPLEX Python library, a platform-agnostic CO package built on DOcplex which integrates execution across multiple quantum providers using various algorithms. We focus on two key software quality attributes: completeness, examining the quantum providers QPLEX supports to look for features that could be added to our library, enhancing its capabilities for handling CO problems; and extensibility, making the library more adaptable for future expansions. We first compile a high-level workflow for solving CO problems to ensure that our elicited software requirements align with the actual process practitioners follow when solving these problems. Subsequently, we evaluate QPLEX through a comprehensive analysis of its completeness by comparing features against alternative solutions including platform-specific SDKs, and its extensibility by examining how easily new features can be integrated without disrupting existing functionality. Based on the identified functional and non-functional requirements, we design and implement several extensions to QPLEX, including support for Qiskit Runtime Sessions, integration with D-Wave's quantum solvers and implementation of the QAOAnsatz algorithm. Furthermore, we enhance the extensibility of the library through comprehensive documentation, automated testing, and CI/CD pipelines to ensure smooth integration of future open-source contributions. 
Validation results demonstrate that these enhancements successfully extend QPLEX's capabilities for solving CO problems using quantum resources, providing a more comprehensive suite of features for quantum-based CO while establishing robust foundations for future development. This work contributes to the evolving field of quantum software engineering by advancing an abstraction layer that shields practitioners from low-level quantum details, allowing them to focus on problem formulation. As quantum hardware and algorithms continue to advance, such platform-agnostic libraries will play a crucial role in broadening quantum computing adoption, enabling domain experts to leverage quantum resources without requiring deep quantum computing knowledge.
Thresholded linear bandits
(2025) Nguyen, Trang Thu; Mehta, Nishant
Thresholded linear bandits is a novel bandit problem that lies at the intersection of several important multi-armed bandit (MAB) variants, including active learning, structured bandits, and learning halfspaces. To achieve sublinear regret in the presence of exponentially many arms, one method is to exploit the structure of the reward function. However, the presence of an unknown threshold component makes previously known algorithms for structured bandits unsuitable. Moreover, the threshold introduces a discontinuity to the reward function, making the problem significantly more difficult. In this thesis, we study the union-of-axis-parallel-halfspaces variant of the thresholded linear bandits problem. We suggest an algorithm that achieves sublinear regret and provide theoretical guarantees on the performance of the algorithm.
AI-driven security in software-defined networks: A unified framework for intrusion detection and mitigation
(2025) El Gadal, Walid; Ganti, Sudhakar
Over the past decade, data networks have evolved from static resource deployment to a more dynamic and adaptive paradigm. Software-Defined Networking (SDN) is one of the most creative network technologies where network control is separated from forwarding. It is directly programmable and has been proposed as a way to programmatically control networks, facilitating the deployment of new applications and services, as well as tuning network policies and performance. However, various challenges have hindered achieving strong cybersecurity within the dynamic network configurations of Software-Defined Networking. Traditional cybersecurity measures, especially in programmable and dynamic network infrastructures like SDNs, are not sufficient to mitigate cyber threats. This dissertation explores the capabilities of SDN and examines how AI-driven methods can enhance intrusion detection and mitigation. The study begins by providing a comprehensive introduction to SDN, outlining its fundamental capabilities and comparative advantages over traditional network architectures. In addition, it explores SDN vulnerabilities and addresses complex security challenges. The objective of this thesis work is to improve the detection and mitigation of threats in SDN environments. For this, we first present a dynamic defense framework that includes Machine Learning and Deep Learning techniques for attack detection and mitigation. Furthermore, a novel hybrid Coot-Lyrebird optimization algorithm is developed to specifically choose the most impactful features in the network. The selected features are given to the proposed hybrid network that combines Convolutional Neural Network (CNN), SE-ResNeXt, and Long Short-Term Memory (LSTM) networks. Finally, the proposed Deep Q-Network (DQN) model performs attack mitigation measures. The results indicate that the proposed dynamic defense has an accuracy of 0.999571%. In addition, we extended our study to include more complex environments. 
Software-Defined Internet of Things (SD-IoT) networks enable intelligent network management through their dynamic features, but expose centralized infrastructure to complex cyberattacks that put the system in great danger. To address this, a novel federated, secure, intelligent intrusion detection and mitigation framework with automated attack reporting for SD-IoT networks is presented.
MountainScape semantic segmentation of historical and repeat images
(2025) Mahindrakar, Aniket; Tzanetakis, George; Higgs, Eric
Semantic segmentation of ultra-high resolution images is challenging due to high memory and computation requirements. Current approaches to this problem involve cropping the ultra-high resolution image into small patches for individual processing in order to provide local context, or under-sampling the images to provide global context, or following a combination of both which gives rise to global-local refinement pipelines. In this thesis, we present the MountainScape Segmentation Dataset (MS2D) which comprises high-resolution historic (grayscale) manually segmented images of Canadian mountain landscapes captured from 1861 to 1958 and their corresponding modern (colour) repeat images. Additionally, we analyze the characteristics of the dataset, define evaluation criteria, and provide a baseline to serve as a reference benchmark for automated land cover classification using the Python Landscape Classification Tool (PyLC), an existing software tool. The main contribution of this thesis is the experimental exploration of various deep learning architectures to address the tiling artifacts and spatial context loss faced by PyLC in its tile-based processing of ultra-high-resolution images, alongside a comprehensive investigation using a larger dataset than that employed in the original PyLC study to solve this tiling problem.
Two views of cryptography and the gap in-between
(2025) Wu, Zehou; Kapron, Bruce; Lu, Yun
There are two popular views of cryptography. One is formal (symbolic), which uses expressions to model the ideal functionality of encryption functions and is easy to verify. The other is computational, which is what cryptographic assumptions rely on and is used for most security definitions. The challenge of reconciling these two views of cryptography lies in security under the presence of encryption cycles. In this thesis, we provide a proof of completeness for Abadi-Rogaway symbolic logic with respect to KDM security, a strong form of circular security. Further, we provide a larger set of expressions for which Micciancio's symbolic logic is complete with respect to CPA security, extending Micciancio's completeness, which holds only for the set of acyclic expressions. We also give an alternate characterization of Micciancio's logic. On the computational side, we give a proof that circular insecurity is maintained as cycle length decreases, which is not a previously shown result.
Dynamic and cost-efficient deployment of large language models using uplift modeling and multi armed bandits
(2025) Tongay, Ninad; Chester, Sean
The rapid advancement of large language models (LLMs) has brought about a new class of challenges in balancing performance, cost, and scalability. As organizations seek to deploy these models in production environments, a key question arises: how can we maintain the quality of responses delivered by advanced LLMs while reducing the significant computational and financial costs associated with them? Relying entirely on high-end models like GPT-4 can ensure quality but often proves economically unsustainable, while defaulting to smaller, cheaper models may sacrifice performance and user satisfaction. This tension calls for more intelligent decision-making strategies, ones that dynamically allocate queries to the most appropriate model depending on the task’s complexity and expected value. To address this, we propose a hybrid decision-making framework that brings together causal uplift modeling and multi-armed bandits to drive cost-aware, adaptive model selection. Uplift modeling enables the system to reason causally about the benefit of using a stronger model for a specific query, thereby offering interpretable, feature-informed decisions from the outset. These predictions serve as a strong offline prior. The bandit component builds on this by adapting the policy in real time: learning from feedback, correcting for model mispredictions, and responding to shifts in query distribution or underlying model performance. This fusion of causal inference and online learning results in a system that is not only efficient and scalable, but also interpretable and responsive to real-world variability. We validate the approach through controlled simulations that mimic real deployment conditions, including concept drift, shifts in user query types, and the emergence of unseen domains. Across these scenarios, the hybrid consistently achieves a more favorable balance between quality and cost than baseline strategies.
Furthermore, the system is designed to expose its decision-making logic, offering transparency through uplift scores and feature-based justifications, a critical requirement for high-stakes AI deployments. By combining performance, cost awareness, and explainability, this work contributes a practical solution to the growing need for intelligent model orchestration in the multi-LLM landscape.
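The interplay between an offline prior and online adaptation can be sketched with a small cost-aware bandit. This is an illustrative simplification and not the thesis's system: the `Arm` and `Router` classes, the pseudo-count prior, the UCB-style bonus, and all numeric values are assumptions, and the per-query features used by uplift modeling are omitted.

```python
import math
from dataclasses import dataclass
from typing import Dict


@dataclass
class Arm:
    cost: float                # hypothetical per-query cost of this model
    prior_mean: float          # offline quality estimate (e.g. informed by uplift)
    prior_weight: float = 5.0  # pseudo-observations backing the prior (must be > 0)
    pulls: int = 0
    total_reward: float = 0.0

    def mean(self) -> float:
        # Blend the offline prior with observed online feedback.
        return (self.prior_mean * self.prior_weight + self.total_reward) / (
            self.prior_weight + self.pulls
        )


class Router:
    def __init__(self, arms: Dict[str, Arm], cost_weight: float = 1.0, explore: float = 0.2):
        self.arms = arms
        self.cost_weight = cost_weight
        self.explore = explore

    def choose(self) -> str:
        t = 1 + sum(a.pulls for a in self.arms.values())

        def score(a: Arm) -> float:
            # Exploration bonus shrinks as an arm accumulates evidence.
            bonus = self.explore * math.sqrt(math.log(t + 1) / (a.prior_weight + a.pulls))
            return a.mean() - self.cost_weight * a.cost + bonus

        return max(self.arms, key=lambda name: score(self.arms[name]))

    def update(self, name: str, reward: float) -> None:
        arm = self.arms[name]
        arm.pulls += 1
        arm.total_reward += reward
```

In this sketch, repeated poor feedback on the cheaper arm lowers its blended mean until the stronger, costlier arm scores higher, which mirrors the idea of correcting an offline prior with online evidence.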
Morphology agnostic multi-agent character control
(2025) Zhang, Rui; Haworth, Brandon
Crowd simulation plays a crucial role in various applications, from urban planning to virtual reality, by modeling realistic pedestrian behavior and interactions. Traditional approaches typically utilize simplified agent representations such as particles, whereas recent advancements have introduced fully physical character models in crowds, which rely on morphology-specific motion control, limiting their applicability to heterogeneous agents with diverse body structures and movement capabilities. This thesis introduces a morphology-agnostic multi-agent character control framework that integrates physics-based locomotion with hierarchical reinforcement learning. A low-level locomotion controller utilizes generalized goal conditioning to enable robust and adaptable movement across agents with different morphologies through parameter sharing, eliminating the need for predefined gait cycles or morphology-specific trajectory planning. A high-level navigation controller processes morphology-agnostic state observations and integrates visual attention sampling to improve decision-making. The navigation controller provides goal conditioning to the locomotion controller, guiding agents toward their target positions in dynamic environments. The proposed system improves generalizability in multi-agent settings by decoupling locomotion control from agent-specific kinematics while maintaining stability and responsiveness.
Fast trips: A scalable insertion operator approach for ridesharing over time-dependent road networks
(2025) Mukherjee, Aaditya; Chester, Sean; Nascimento, Mario A.
Effective route planning for shared mobility (RPSM) is crucial for optimizing the goals of transportation services such as ridesharing, logistics, and food delivery. Route planning requires online integration of new transportation requests into existing routes of transportation workers while accounting for real-world conditions such as traffic congestion, variable travel speeds, and changing demand patterns. A core component of route-planning systems is the insertion operator, a state-of-the-art method that integrates new transportation requests into existing worker routes with minimal additional travel time. Although the operator is effective and fast for route-planning simulations on static road networks, these simulations experience significant performance degradation when applied to real-world, time-dependent road networks (TDRNs), where travel times between roads fluctuate due to varying traffic conditions. This thesis addresses this scalability challenge by introducing an informed approach to partitioning the data used by the insertion operator into separate, disjoint batches. I propose a partitioning method utilizing K-means clustering complemented by an opportunistic allocation of workers to clusters. This method reduces the large number of shortest path query invocations inherent to time-dependent insertions, significantly decreasing RPSM simulation times without sacrificing, and in some cases even improving, the quality of the solutions. Through extensive experimental evaluations using large-scale, real-world datasets from major Chinese cities, the proposed method is compared against the sequential time-dependent insertion operator. The results indicate a minimum of 7X acceleration in RPSM simulation times and maximum speedups up to 24X, while consistently matching or surpassing the original insertion operator in terms of solution quality.
Furthermore, the flexibility of the clustering approach allows for customizable trade-offs between simulation speed and service quality, ensuring adaptability to diverse operational goals of transportation services. Ultimately, this thesis offers a scalable, adaptable, and computationally efficient insertion operator framework capable of handling realistic scenarios in dynamic shared mobility environments, providing valuable tools for transportation and logistics companies seeking operational optimization.
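To make the role of the insertion operator concrete, the sketch below tries every pickup and drop-off position in an existing route and keeps the cheapest result. It is a naive baseline written for illustration, not the thesis's method: the names `route_time` and `best_insertion` and the `travel_time(a, b, depart)` callback are assumptions, and capacity and deadline constraints are ignored.

```python
from typing import Callable, List, Optional, Tuple

TravelTime = Callable[[int, int, float], float]  # (origin, destination, departure time)


def route_time(stops: List[int], travel_time: TravelTime, start: float = 0.0) -> float:
    """Total time to traverse stops in order; each leg depends on its departure time."""
    t = start
    for a, b in zip(stops, stops[1:]):
        t += travel_time(a, b, t)
    return t - start


def best_insertion(
    route: List[int],
    pickup: int,
    dropoff: int,
    travel_time: TravelTime,
    start: float = 0.0,
) -> Optional[Tuple[float, List[int]]]:
    """Try all positions with pickup before dropoff; route[0] is the worker's location."""
    best: Optional[Tuple[float, List[int]]] = None
    for i in range(1, len(route) + 1):
        for j in range(i, len(route) + 1):
            candidate = route[:i] + [pickup] + route[i:j] + [dropoff] + route[j:]
            cost = route_time(candidate, travel_time, start)
            if best is None or cost < best[0]:
                best = (cost, candidate)
    return best
```

Every candidate re-evaluates each leg through `travel_time`. On a time-dependent network each such call would stand in for a shortest path query, which is why the number of invocations grows so quickly.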
Fast database join on ray-tracing core equipped GPU
(2025) Wu, Yijie; Chester, Sean
With the increase in GPU memory and computing power, GPU databases have become more popular, driving extensive research on GPU-based indexing. One study introduced a novel approach called RTX (Ray-tracing Index), which utilizes ray-tracing cores (RT cores) to accelerate GPU indexing. However, RTX suffers from a large build size and slow range queries. A follow-up work called cgRX (Coarse-granular Indexing) optimized the construction and range query algorithms, reporting 1.5x–3x higher throughput in relation to memory footprint, 2x faster range queries, and 5.5x faster updates compared to RTX. However, the experimental results of cgRX may be inaccurate because RTX was not properly optimized as a baseline in cgRX, at least for the range query. To optimize RTX, this thesis explores multiple OptiX (Nvidia's Ray-tracing Software API) optimization strategies, including a revised range query algorithm, BVH partitioning, reverse mapping, and spatially closed query mapping. Additionally, the best configurations are applied to other baselines, including cgRX. All these improvements together are used to reproduce the experiments in cgRX. The evaluation is first based on the impact of each optimization technique on RTX. These optimizations reduce RTX's memory usage during construction and improve range query performance. Then, cgRX, optimized RTX, and other baselines are compared using the same experimental setup as cgRX, all using their best configurations. The re-evaluated results differ significantly from those in cgRX. In summary, this thesis contributes to RTX optimization by exploring the effects of multiple optimization techniques. The optimized RTX and baselines configured with optimized settings collectively aim to develop a high-performance GPU database index.
A power-aware IoT-fog-cloud architecture for telehealth applications
(2025) Guo, Yunyong; Ganti, Sudhakar
This dissertation presents an energy-efficient model for integrating Internet of Things (IoT) devices with fog and cloud computing platforms, specifically designed for telehealth applications. As the deployment of telehealth IoT devices continues to grow, the demand for efficient, real-time data processing and energy conservation becomes increasingly critical. This research addresses these challenges by proposing a hybrid architecture that combines the low-latency benefits of fog computing with the scalable resources of cloud computing. The model reduces energy consumption by processing data locally through fog nodes, minimizing the need for constant communication with cloud servers. This not only decreases latency but also optimizes the use of computational resources, making the system more adaptable to the dynamic demands of telehealth services. The model is further enhanced by an adaptive resource scaling algorithm, which dynamically adjusts processing capacity based on workload, ensuring both efficiency and reliability in critical healthcare applications. Simulation studies demonstrate the effectiveness of the model in reducing energy consumption and improving system performance for real-time telehealth monitoring. The results show significant improvements in data processing speed, energy efficiency, and resource utilization compared to traditional cloud-only architectures. This work contributes to the ongoing development of sustainable telehealth solutions by providing a robust framework for IoT-fog-cloud integration that meets the stringent demands of modern healthcare systems.
CNN-based models for pitch estimation, modification, and auto-tuning
(2024) Jiang, Jiazhuo; Tzanetakis, George
Pitch estimation and pitch modification are fundamental audio processing tasks that are used in a variety of applications. An important example is the auto-tuning of vocals, in which pitch estimation is applied, deviations from a desired target pitch are calculated, and the pitch of the input vocal signal is modified to match the target pitch. Most existing approaches to auto-tuning are based on traditional digital signal processing (DSP) techniques for both the pitch detection and the pitch modification of the signal. In this thesis, the use of Convolutional Neural Networks (CNNs) is explored as a possible replacement for traditional DSP methods for pitch estimation, pitch modification, and end-to-end auto-tuning. CNNs can model complex input-output relationships and are more efficient than deep learning methods that take into account time/sequence information, such as Long Short-Term Memory (LSTM) networks and Recurrent Neural Networks (RNNs). The results show the potential of this approach as well as some of the challenges that need to be overcome. The experimental results indicate that larger data sets can result in better accuracy but they also tend to bring in more noise.
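The deviation step of auto-tuning can be illustrated with a few lines of arithmetic. The sketch assumes that the target is the nearest equal-tempered semitone relative to A4 = 440 Hz. The abstract only speaks of a "desired target pitch", so this target rule and the function names are assumptions made for the example.

```python
import math


def nearest_semitone_hz(f_hz: float, a4_hz: float = 440.0) -> float:
    """Frequency of the equal-tempered semitone closest to f_hz (assumed target)."""
    semitones_from_a4 = round(12 * math.log2(f_hz / a4_hz))
    return a4_hz * 2 ** (semitones_from_a4 / 12)


def correction_cents(f_hz: float, a4_hz: float = 440.0) -> float:
    """Signed pitch shift, in cents, that moves f_hz onto the target pitch."""
    return 1200 * math.log2(nearest_semitone_hz(f_hz, a4_hz) / f_hz)
```

A sung note estimated at 445 Hz maps to a 440 Hz target, so the correction is a downward shift of a little under 20 cents. A pitch modification stage would then apply that shift to the signal.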
Real-time gesture-based sound control system
(2024) Khazaei, Mahya; Tzanetakis, George
This thesis presents a real-time, human-in-the-loop music control and manipulation system that dynamically adapts audio outputs based on the analysis of human movement captured via live-stream video. This project creates a responsive link between visual and auditory stimuli, fostering an interactive experience where dancers not only respond to music but dynamically influence it through their movements. The system enhances live performances, interactive installations, and personal entertainment, creating an immersive experience where users’ movements directly shape the music in real time. This project demonstrates how machine learning and signal processing techniques can create responsive audio-visual systems that evolve with each movement, bridging human interaction and machine response in a seamless loop. The system leverages computer vision techniques and machine learning tools to track and interpret the motion of individuals dancing or moving, enabling them to participate actively in shaping audio adjustments, such as tempo, pitch, effects, and playback sequence in real time. Constantly improving through ongoing training, the system allows users to generalize models for user-independent use by providing varied samples; around 50–80 samples are typically sufficient to label a simple gesture. Through an integrated pipeline of gesture training, cue mapping, and audio manipulation, this human-centered system continuously adapts to user input. Gestures are trained as signals from human to model, mapped to sound control commands, and then used to naturally manipulate audio elements.
Policy-value concordance for deep actor-critic reinforcement learning algorithms
(2024) Buro, Jonas; Haworth, Brandon
Designing general agents to optimize sequential decision-making under uncertainty has long been central to artificial intelligence research. Recent advances in deep reinforcement learning (RL) have made progress in this pursuit, achieving superhuman performance in a collection of challenging and visually complex domains, in a tabula rasa fashion without embedding human domain knowledge. Although they make progress towards designing general problem-solving agents, these methods require significant amounts of data to learn effective decision-making policies relative to humans, preventing their application to most real-world problems for which no simulator exists. The question of how best to learn models intended for downstream purposes such as planning in this context remains unresolved. Motivated by this gap in the literature, we propose a novel learning objective for RL algorithms with deep actor-critic architectures, with the goal of further investigating the efficacy of such methods as autonomous general problem solvers. These algorithms employ artificial neural networks as parameterized policy and value functions, which guide their decision-making processes. Our approach introduces a learning signal that explicitly captures desirable properties of the policy function in terms of the value function from the perspective of a downstream reward-maximizing agent. Specifically, the signal encourages the policy to favour actions in a manner that is concordant with the relative ordering of value function estimates during training. We hypothesize that when correctly balanced with other learning objectives, RL algorithms incorporating our method will converge to policies of comparable strength using less real-world data relative to their original instantiations.
To empirically investigate this hypothesis, we incorporate our technique into state-of-the-art RL algorithms, ranging from simple policy gradient actor-critic methods to more complex model-based architectures, deploy them on standard deep RL benchmark tasks, and perform statistical analysis on the resulting performance data.
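To make the notion of a policy being "concordant with the relative ordering of value function estimates" concrete, the sketch below computes a simple pairwise concordance rate for one state. It is an illustrative diagnostic only, assuming per-action policy probabilities and per-action value estimates are available as plain lists. It is not the learning objective proposed in the thesis, whose exact form the abstract does not give, and the function name is hypothetical.

```python
from itertools import combinations


def concordance_rate(policy_probs, value_estimates):
    """Fraction of action pairs that the policy ranks in the same order as the values.

    Pairs with tied value estimates carry no ordering information and are skipped.
    Returns 1.0 when no pair is comparable.
    """
    if len(policy_probs) != len(value_estimates):
        raise ValueError("one probability and one value estimate per action")
    concordant = 0
    comparable = 0
    for i, j in combinations(range(len(policy_probs)), 2):
        dv = value_estimates[i] - value_estimates[j]
        if dv == 0:
            continue
        comparable += 1
        dp = policy_probs[i] - policy_probs[j]
        if dp * dv > 0:  # same sign: the policy prefers the higher-valued action
            concordant += 1
    return concordant / comparable if comparable else 1.0
```

A rate of 1.0 means the policy's preferences fully agree with the value ordering for that state, and a rate of 0.0 means they are fully reversed.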
FTRL-WRR: Learning-based two-path scheduler for LEO networks
(2024) Li, Daoping; Pan, Jianping
Multipath QUIC is inspired by the resource pooling principle, aiming to make a collection of resources behave as a single pool. However, current multipath schedulers tend to prioritize specific metrics like Round-Trip Time (RTT) or congestion window, often overlooking strategies that enhance overall resource usage and reduce flow completion time. This can lead to resource underutilization in highly dynamic settings, such as those involving Low Earth Orbit (LEO) satellites. Addressing this challenge requires efficient traffic allocation to maximize bandwidth utilization. In this thesis, we verify that the relationship between traffic distribution and throughput in a two-path scenario resembles a quasi-concave function. Accordingly, we formulate the traffic allocation across two paths as a 1-dimensional optimization problem. To solve the two-path scheduling problem in dynamic environments, we introduce the FTRL-WRR algorithm. This approach integrates a Follow The Regularized Leader (FTRL) learner, an ADWIN2 distribution change detector, and a Weighted Round Robin (WRR) scheduler to enhance bandwidth utilization. We validate the effectiveness of the algorithm through extensive emulation and real-world testbed experiments, demonstrating a consistent reduction in completion time across a range of scenarios. Additionally, we discuss the algorithm's limitations and suggest directions for future research.
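The Weighted Round Robin component mentioned above can be sketched in isolation. The example below is a generic smooth WRR over integer path weights, offered as an assumption about how a traffic split might be turned into per-packet path choices. It does not include the FTRL learner or the ADWIN2 detector, it is not taken from the thesis implementation, and the function name is hypothetical.

```python
def wrr_schedule(weights, n_packets):
    """Assign n_packets to paths in proportion to non-negative integer weights.

    Smooth weighted round robin: each round every path earns its weight in
    credit, the path with the most credit sends, and it pays back the total.
    Returns the list of chosen path indices, one per packet.
    """
    total = sum(weights)
    if total <= 0 or any(w < 0 for w in weights):
        raise ValueError("weights must be non-negative with a positive sum")
    credit = [0] * len(weights)
    order = []
    for _ in range(n_packets):
        for i, w in enumerate(weights):
            credit[i] += w
        best = max(range(len(weights)), key=lambda i: credit[i])  # ties go to the lowest index
        credit[best] -= total
        order.append(best)
    return order
```

With weights (3, 1), three of every four packets go to path 0, and the path 1 packets are interleaved rather than sent in a burst. In a learning-based scheduler, the weights would be updated over time by the learner.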
Exploring text-based support for designing weave drafts
(2024) Nayar, Chehak; Somanath, Sowmya
We present the design and evaluation of Textere, a tool that helps weavers use text inputs to design weave drafts. Our research lies at the intersection of two areas: (i) text-based design tools, and (ii) design tools in weaving. Text-based design tools have been explored by researchers in various domains such as garment design, 3D modeling, and data visualization, showing benefits for expanding creative possibilities, enabling rapid prototyping, and making design processes more accessible to a broader range of users. Motivated by such benefits, we explore how text-based tools can help with designing weave drafts. Weaving is a design and production activity wherein weavers map ideas, inspiration, or client requirements to visual elements such as pattern, color, and weave structure to design a weave draft. The drafts are then physically produced using a loom. Weave drafts are designed before production to convey what the final product will look like. Design tools in weaving use different modalities, such as audio and tactile, to make the design process more accessible, creative, and efficient, benefits that design tools in other domains have achieved using text support. Several text-based scenarios in weaving require interpretation of words from text inputs. Yet current text-based techniques and design tools in weaving are limited to mapping individual letters to specific weaving elements, or incorporating text as-is in the weave draft. We extend this research space by exploring how weave drafts can be designed using the meanings or interpretations of words. We developed Textere, a text-based tool for designing weave drafts, using the open source AdaCAD weaving platform. Using Textere, weavers can map text inputs to visual elements such as color, weave structure, and patterns based on meanings and interpretations.
We curated the text-to-visual mappings used in our system from existing user studies that describe how people associate words with visual elements. To evaluate Textere, we first used the evaluation-by-demonstration method to produce four physical woven samples designed using our tool. We then conducted a qualitative study with 12 weavers to evaluate the opportunities and limitations of using Textere, comparing workflows against the tool we extended, AdaCAD, which has no explicit text-to-visual support. From our study we learned about the strengths and limitations of Textere. Informed by our results, we further discuss how text-based design tools like Textere can enable reflective decision making, the generation and broadening of ideas, and different perspectives on what visual elements represent, and how they can contribute to an ecosystem of tools for designing weave drafts. This thesis makes three contributions: i) a novel tool for designing weave drafts using text inputs, ii) empirical findings on the benefits and limitations of text-based interactions for designing weave drafts, and iii) a set of design implications for future text-based design technologies.
Multi-agent footstep steering with deep reinforcement learning
(2024) Peng, Kun; Haworth, Brandon
Crowd simulation plays a crucial role in a wide range of fields, from digital media to urban planning. However, traditional particle-based algorithms often lack the information needed to represent realistic human bipedal locomotion. This research proposes a more realistic and efficient steering model for crowd simulation by combining Multi-Agent Reinforcement Learning (MARL) with bipedal locomotion modelling. The study explores the advantages of MARL and analyzes a mathematical approach to simplifying complex bipedal locomotion. The approach uses the Proximal Policy Optimization algorithm and trains the model in adjustable, randomized maze-like environments. Evaluation results indicate that the model learns goal-reaching behaviours and learns to avoid static and dynamic obstacles. Furthermore, the agents can simulate complex steering behaviours such as side-stepping and turning-like behaviours with two feet. This research contributes to the advancement of crowd simulation through a flexible and realistic approach to modelling human steering behaviours in complex and dynamic environments.
Enhancing fact-checking in large language models: Cost-effective claim verification through first-order logic reformulation
(2024) Asghari, Sara; Thomo, Alex; Srinivasan, Venkatesh
In the realm of Large Language Models (LLMs), the ability to accurately perform Fact Checking (FC) tasks, which involve verifying complex claims against challenging evidence from multiple sources, remains a crucial yet under-explored area. Our study presents a comprehensive benchmarking of various LLMs, including GPT-4, on this critical task. We utilize a modern, challenging dataset designed explicitly for fact-checking, HOVER, which comprises thousands of evidence-claim pairs covering diverse aspects of life, history, and entertainment. This dataset differs from common datasets that evaluate the reading comprehension capabilities of LLMs, which are primarily composed of sets of question-and-answer pairs. Our findings demonstrate not only that GPT-4 decisively surpasses the current state-of-the-art (SOTA) models in FC tasks but also that other open-source LLMs (e.g., Mixtral and Llama-3) exhibit close-to-SOTA performance out of the box. This implies that simply presenting these models with the evidence text and claim allows them to infer the claim’s veracity effectively. We contrast this with the existing SOTA methods, which involve complex, multi-step solutions, including the use of multiple LLMs to verify claims, a process that necessitates continuous updates and local execution, making it less accessible for regular users. Furthermore, we explore the impact of claim formulation on the FC task’s effectiveness. By converting complex claims into first-order logic (FOL) and then back into natural language, we observe improved performance in some LLMs, particularly with more challenging dataset subsets. This method, although it relies on GPT-4 for the FOL breakdown, serves as a practical guideline for users: more formally structured claims yield more reliable responses.
