Skip to content

Automatically Update Arxiv Papers about Path Planning, LLM and Autonomous Driving using Github Actions since 2024.2.

Notifications You must be signed in to change notification settings

XuzhaoLi/ro-arxiv-daily

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Updated on 2024.09.21

Table of Contents
  1. Path Planning
  2. Large Language Model
  3. Autonomous Driving

Path Planning

Publish Date Title Authors PDF Code
2024-09-18 Differential dynamic programming with stagewise equality and inequality constraints using interior point method Siddharth Prabhu et.al. 2409.12048 null
2024-09-18 Second-Order Constrained Dynamic Optimization Yuichiro Aoyama et.al. 2409.11649 null
2024-09-18 Multi-stage stochastic linear programming for shared autonomous vehicle system operation and design with on-demand and pre-booked requests Riki Kawase et.al. 2409.11611 null
2024-09-17 Optimal Investment with Costly Expert Opinions Christoph Knochenhauer et.al. 2409.11569 null
2024-09-17 Exact Wavefront Propagation for Globally Optimal One-to-All Path Planning on 2D Cartesian Grids Ibrahim Ibrahim et.al. 2409.11545 null
2024-09-17 Neural Networks for Vehicle Routing Problem László Kovács et.al. 2409.11290 null
2024-09-17 Selective algorithm processing of subset sum distributions Nick Dawes et.al. 2409.11076 null
2024-09-17 Local discontinuous Galerkin method for nonlinear BSPDEs of Neumann boundary conditions with deep backward dynamic programming time-marching Yixiang Dai et.al. 2409.11004 null
2024-09-17 Relationship between stochastic maximum principle and dynamic programming principle under convex expectation Xiaojuan Li et.al. 2409.10987 null
2024-09-16 Direct Data-Driven Discounted Infinite Horizon Linear Quadratic Regulator with Robustness Guarantees Ramin Esmzad et.al. 2409.10703 null
2024-09-16 Motion Forecasting via Model-Based Risk Minimization Aron Distelzweig et.al. 2409.10585 null
2024-09-16 Estimates for Optimal Multistage Group Partition Testing Guojiang Shao et.al. 2409.10410 null
2024-09-16 Pareto Sums of Pareto Sets: Lower Bounds and Algorithms Daniel Funke et.al. 2409.10232 null
2024-09-12 Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning Teng Yan et.al. 2409.08062 null
2024-09-12 Super Monotonic Alignment Search Junhyeok Lee et.al. 2409.07704 link
2024-09-10 Design of Threshold-Constrained Indirect Quantizers Ariel Doubchak et.al. 2409.06839 null
2024-09-10 Cooptimizing Safety and Performance with a Control-Constrained Formulation Hao Wang et.al. 2409.06696 null
2024-09-12 Valuation Model of Chinese Convertible Bonds Based on Monte Carlo Simulation Yu Liu et.al. 2409.06496 null
2024-09-09 OTFS-MDMA: An Elastic Multi-Domain Resource Utilization Mechanism for High Mobility Scenarios Jie Chen et.al. 2409.05724 null
2024-09-09 Enhancing Empathic Accuracy: Penalized Functional Alignment Method to Correct Misalignment in Emotional Perception Linh H Nghiem et.al. 2409.05343 null
2024-09-08 Cooperative Learning-Based Framework for VNF Caching and Placement Optimization over Low Earth Orbit Satellite Networks Khai Doan et.al. 2409.05025 null
2024-09-08 Fast Deep Predictive Coding Networks for Videos Feature Extraction without Labels Wenqian Xue et.al. 2409.04945 null
2024-09-17 Second-Order Stein Variational Dynamic Optimization Yuichiro Aoyama et.al. 2409.04644 null
2024-09-06 Refined Bounds on Near Optimality Finite Window Policies in POMDPs and Their Reinforcement Learning Yunus Emre Demirci et.al. 2409.04351 null
2024-09-05 Space-Efficient Algorithm for Integer Programming with Few Constraints Lars Rohwedder et.al. 2409.03681 null
2024-09-05 Fine-Grained Equivalence for Problems Related to Integer Linear Programming Lars Rohwedder et.al. 2409.03675 null
2024-09-06 Revenue Management with Calendar-Aware and Dependent Demands: Asymptotically Tight Fluid Approximations Weiyuan Li et.al. 2409.02637 null
2024-09-03 FuzzCoder: Byte-level Fuzzing Test via Large Language Model Liqun Yang et.al. 2409.01944 null
2024-09-03 Quantum Algorithms for One-Sided Crossing Minimization Susanna Caroppo et.al. 2409.01942 null
2024-09-02 Solving Integrated Process Planning and Scheduling Problem via Graph Neural Network Based Deep Reinforcement Learning Hongpei Li et.al. 2409.00968 null
2024-09-02 Multistage Robust Average Randomized Spectral Risk Optimization Qiong Wu et.al. 2409.00892 null
2024-09-01 An Optimized Binning and Probabilistic Slice Sharing Algorithm for Motion Correction in Abdominal DW-MRI Michelle Su et.al. 2409.00798 null
2024-09-01 Cooperative Path Planning with Asynchronous Multiagent Reinforcement Learning Jiaming Yin et.al. 2409.00754 null
2024-09-01 The landscape of deterministic and stochastic optimal control problems: One-shot Optimization versus Dynamic Programming Jihun Kim et.al. 2409.00655 null
2024-08-31 Foundations of Multivariate Distributional Reinforcement Learning Harley Wiltzer et.al. 2409.00328 null
2024-08-30 Approximation Algorithms for Anchored Multiwatchman Routes Joseph S. B. Mitchell et.al. 2408.17343 null
2024-08-30 Stationary Policies are Optimal in Risk-averse Total-reward MDPs with EVaR Xihong Su et.al. 2408.17286 null
2024-08-30 A Two-Timescale Decision-Hazard-Decision Formulation for Storage Usage Values Calculation Camila Martinez Parra et.al. 2408.17113 null
2024-08-29 Optimization Models for the Quadratic Traveling Salesperson Problem Yuxiao Chen et.al. 2408.16680 null
2024-08-27 On the parameterized complexity of computing good edge-labelings Davi de Andrade et.al. 2408.15181 null
2024-08-26 Achieving designed texture and flows in bulk active nematics using optimal control theory Saptorshi Ghosh et.al. 2408.14596 null
2024-08-25 Decentralized Stochastic Control in Standard Borel Spaces: Centralized MDP Reductions, Near Optimality of Finite Window Local Information, and Q-Learning Omar Mrani-Zentar et.al. 2408.13828 null
2024-08-23 The Ultimate Guide to Fine-Tuning LLMs from Basics to Breakthroughs: An Exhaustive Review of Technologies, Research, Best Practices, Applied Research Challenges and Opportunities Venkatesh Balavadhani Parthasarathy et.al. 2408.13296 null
2024-08-18 An Introduction to Cognidynamics Marco Gori et.al. 2408.13112 null
2024-08-20 Optimal Guarantees for Online Selection Over Time Sebastian Perez-Salazar et.al. 2408.11224 null
2024-08-20 Fault Tolerant Dynamic Task Assignment for UAV-based Search Teams Ali Nasir et.al. 2408.10564 null
2024-08-19 Efficient Exploration in Deep Reinforcement Learning: A Novel Bayesian Actor-Critic Algorithm Nikolai Rozanov et.al. 2408.10055 null
2024-08-19 Continuous-Time Dynamic Decision Making with Costly Information Christoph Knochenhauer et.al. 2408.09693 null
2024-08-19 Solving stochastic climate-economy models: A deep least-squares Monte Carlo approach Aleksandar Arandjelović et.al. 2408.09642 null
2024-08-18 Exploratory Optimal Stopping: A Singular Control Formulation Jodi Dianetti et.al. 2408.09335 null
2024-08-17 Optimal Strip Attitude Command of Earth Observation Satellite using Differential Dynamic Programming Seungyeop Han et.al. 2408.09244 null
2024-08-17 Twin Sorting Dynamic Programming Assisted User Association and Wireless Bandwidth Allocation for Hierarchical Federated Learning Rung-Hung Gau et.al. 2408.09076 null
2024-08-17 Atlas: Hierarchical Partitioning for Quantum Circuit Simulation on GPUs (Extended Version) Mingkuan Xu et.al. 2408.09055 null
2024-08-15 Optimal control problems with generalized mean-field dynamics and viscosity solution to Master Bellman equation Rainer Buckdahn et.al. 2408.08046 null
2024-08-14 Differentiating Policies for Non-Myopic Bayesian Optimization Darian Nwankwo et.al. 2408.07812 null
2024-08-11 Moderate Exponential-time Quantum Dynamic Programming Across the Subsets for Scheduling Problems Camille Grange et.al. 2408.05741 null
2024-08-10 Convergence Guarantee of Dynamic Programming for LTL Surrogate Reward Zetong Xuan et.al. 2408.05438 null
2024-08-09 MIDI-to-Tab: Guitar Tablature Inference via Masked Language Modeling Drew Edwards et.al. 2408.05024 null
2024-08-09 A Comprehensive System Architecture using Field Programmable Gate Arrays Technology, Dijkstra's Algorithm, and Edge Computing for Emergency Response in Smart Cities Mahamat Abdel Aziz Assoul et.al. 2408.04924 null
2024-08-08 Mathematical Programming For Adaptive Experiments Ethan Che et.al. 2408.04570 null
2024-08-08 Non-maximizing policies that fulfill multi-criterion aspirations in expectation Simon Dima et.al. 2408.04385 null
2024-08-08 Enhanced Traffic Flow Prediction with Multi-Segment Fusion Tensor Graph Convolutional Networks Wei Zhang et.al. 2408.04232 null
2024-08-06 A Course in Dynamic Optimization Bar Light et.al. 2408.03034 null
2024-08-05 Positive Dynamic Programming: A Critique Aaqib Peerzada et.al. 2408.02809 null
2024-08-05 Multi-level Traffic-Responsive Tilt Camera Surveillance through Predictive Correlated Online Learning Tao Li et.al. 2408.02208 null
2024-08-04 Non-local Hamilton-Jacobi-Bellman equations for the stochastic optimal control of path-dependent piecewise deterministic processes Elena Bandini et.al. 2408.02147 null
2024-08-03 Leveraging GNSS and Onboard Visual Data from Consumer Vehicles for Robust Road Network Estimation Balázs Opra et.al. 2408.01640 null
2024-08-02 Occasionally Observed Piecewise-deterministic Markov Processes Marissa Gee et.al. 2408.01335 null
2024-08-02 The Impact of Program Reduction on Automated Program Repair Linas Vidziunas et.al. 2408.01134 null
2024-08-11 Deep Learning Approach for Changepoint Detection: Penalty Parameter Optimization Tung L Nguyen et.al. 2408.00856 link
2024-07-31 Tractable and Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation Taehyun Cho et.al. 2407.21260 null
2024-07-30 A Machine Learning Approach to Boost the Vehicle-2-Grid Scheduling Gabriele Agliardi et.al. 2407.20802 null
2024-07-30 Generalized replicator dynamics based on mean-field pairwise comparison dynamic Hidekazu Yoshioka et.al. 2407.20751 null
2024-08-10 A UAV-Enabled Time-Sensitive Data Collection Scheme for Grassland Monitoring Edge Networks Dongbin Jiao et.al. 2407.20585 null
2024-07-29 A Differential Dynamic Programming Framework for Inverse Reinforcement Learning Kun Cao et.al. 2407.19902 null
2024-07-27 Map-Matching Queries under Fréchet Distance on Low-Density Spanners Kevin Buchin et.al. 2407.19304 null
2024-07-26 RRO: A Regularized Routing Optimization Algorithm for Enhanced Throughput and Low Latency with Efficient Complexity David Zenati et.al. 2407.18683 null
2024-07-26 Mean-field control of non exchangeable systems Anna De Crescenzo et.al. 2407.18635 null
2024-08-01 Stochastic Games with Minimally Bounded Action Costs David Mguni et.al. 2407.18010 null
2024-07-25 Personalized and Context-aware Route Planning for Edge-assisted Vehicles Dinesh Cyril Selvaraj et.al. 2407.17980 null
2024-07-23 Data-Driven Optimal Feedback Laws via Kernel Mean Embeddings Petar Bevanda et.al. 2407.16407 null
2024-07-23 Data-driven Multistage Distributionally Robust Linear Optimization with Nested Distance Rui Gao et.al. 2407.16346 null
2024-07-22 Faster Optimal Coalition Structure Generation via Offline Coalition Selection and Graph-Based Search Redha Taguelmimt et.al. 2407.16092 null
2024-07-22 Scheduling on a Stochastic Number of Machines Moritz Buchem et.al. 2407.15737 null
2024-07-20 Interdiction of minimum spanning trees and other matroid bases Noah Weninger et.al. 2407.14906 link
2024-07-20 A Tale of Two Scales: Reconciling Horizontal and Vertical Scaling for Inference Serving Systems Kamran Razavi et.al. 2407.14843 null
2024-07-19 Dynamic Programming Techniques for Planar Orbital Transfer of Low Earth Orbit Satellites C. Ciancarelli et.al. 2407.14675 null
2024-07-19 Generalization Error Analysis of Deep Backward Dynamic Programming for Solving Nonlinear PDEs Du Ouyang et.al. 2407.14566 null
2024-07-19 On Policy Evaluation Algorithms in Distributional Reinforcement Learning Julian Gerstenberg et.al. 2407.14175 null
2024-07-18 Shaded Route Planning Using Active Segmentation and Identification of Satellite Images Longchao Da et.al. 2407.13689 null
2024-07-18 The Madness of Multiple Entries in March Madness Jeff Decary et.al. 2407.13438 null
2024-07-18 Double interdiction problem on trees on the sum of root-leaf distances by upgrading edges Xiao Li et.al. 2407.13391 null
2024-07-18 Deterministic Trajectory Optimization through Probabilistic Optimal Control Mohammad Mahmoudi Filabadi et.al. 2407.13316 null
2024-07-18 Integrated Hardware Architecture and Device Placement Search Irene Wang et.al. 2407.13143 link
2024-07-18 Multiobjective Vehicle Routing Optimization with Time Windows: A Hybrid Approach Using Deep Reinforcement Learning and NSGA-II Rixin Wu et.al. 2407.13113 null
2024-07-17 Dynamic Programming Principle and Hamilton-Jacobi-Bellman Equation for Optimal Control Problems with Uncertainty M. Soledad Aronna et.al. 2407.13045 null
2024-07-17 Estimating the Potential Impact of Combined Race and Ethnicity Reporting on Long-Term Earnings Statistics Kevin L. McKinney et.al. 2407.12775 null
2024-07-16 Enabling MCTS Explainability for Sequential Planning Through Computation Tree Logic Ziyan An et.al. 2407.10820 null
2024-07-14 Fine Grained Lower Bounds for Multidimensional Knapsack Ilan Doron-Arad et.al. 2407.10146 null
2024-07-12 Investigating the Interplay of Prioritized Replay and Generalization Parham Mohammad Panahi et.al. 2407.09702 null
2024-07-12 An efficient algorithm to compute the minimum free energy of interacting nucleic acid strands Ahmed Shalaby et.al. 2407.09676 null
2024-07-12 Hamilton-Jacobi Reachability in Reinforcement Learning: A Survey Milan Ganai et.al. 2407.09645 null
2024-07-12 Integer programs with nearly totally unimodular matrices: the cographic case Manuel Aprile et.al. 2407.09477 null
2024-07-12 A new approach to principal-agent problems with volatility control Alessandro Chiusolo et.al. 2407.09471 null
2024-07-12 CAACS: A Carbon Aware Ant Colony System Marina Lin et.al. 2407.09404 null
2024-07-12 Structure and Independence in Hyperbolic Uniform Disk Graphs Thomas Bläsius et.al. 2407.09362 null
2024-07-12 KUNPENG: An Embodied Large Model for Intelligent Maritime Naiyao Wang et.al. 2407.09048 link
2024-07-09 Trajectory Data Mining and Trip Travel Time Prediction on Specific Roads Muhammad Awais Amin et.al. 2407.07030 null
2024-07-08 Solving Multi-Model MDPs by Coordinate Ascent and Dynamic Programming Xihong Su et.al. 2407.06329 link
2024-07-08 Narrowing the Gap between Adversarial and Stochastic MDPs via Policy Optimization Daniil Tiapkin et.al. 2407.05704 null
2024-07-06 Advancing Algorithmic Approaches to Probabilistic Argumentation under the Constellation Approach Andrei Popescu et.al. 2407.05058 null
2024-07-05 Re-Tuning: Overcoming the Compositionality Limits of Large Language Models with Recursive Tuning Eric Pasewark et.al. 2407.04787 link
2024-07-05 GOALPlace: Begin with the End in Mind Anthony Agnesina et.al. 2407.04579 null
2024-07-04 Advanced Artificial Intelligence Strategy for Optimizing Urban Rail Network Design using Nature-Inspired Algorithms Hariram Sampath Kumar et.al. 2407.04087 null
2024-07-04 Multi-Time Scale Service Caching and Pricing in MEC Systems with Dynamic Program Popularity Yiming Chen et.al. 2407.03804 null
2024-07-03 Reconsidering utility: unveiling the limitations of synthetic mobility data generation algorithms in real-life scenarios Alexandra Kapp et.al. 2407.03237 null
2024-07-12 A Two-stage Identification Method for Switched Linear Systems Zheng Wenju et.al. 2407.02743 null
2024-07-02 DM3D: Distortion-Minimized Weight Pruning for Lossless 3D Object Detection Kaixin Xu et.al. 2407.02098 null
2024-06-28 Edge-DIRECT: A Deep Reinforcement Learning-based Method for Solving Heterogeneous Electric Vehicle Routing Problem with Time Window Constraints Arash Mozhdehi et.al. 2407.01615 null
2024-07-02 Contractual Reinforcement Learning: Pulling Arms with Invisible Hands Jibang Wu et.al. 2407.01458 null
2024-07-01 Exact statistical analysis for response-adaptive clinical trials: a general and computationally tractable approach Stef Baas et.al. 2407.01055 null
2024-06-30 Maximum Entropy Inverse Reinforcement Learning of Diffusion Models with Energy-Based Models Sangwoong Yoon et.al. 2407.00626 link
2024-06-30 Your Car Tells Me Where You Drove: A Novel Path Inference Attack via CAN Bus and OBD-II Data Tommaso Bianchi et.al. 2407.00585 null
2024-06-29 A Two-stage Reinforcement Learning-based Approach for Multi-entity Task Allocation Aicheng Gong et.al. 2407.00496 link
2024-06-29 Vector-valued robust stochastic control Igor Cialenco et.al. 2407.00266 null
2024-06-28 Leveraging Fixed-Parameter Tractability for Robot Inspection Planning Yosuke Mizutani et.al. 2407.00251 null
2024-06-28 Approximate Solutions for Multi-Trip Route Planning in Time-Sensitive Situations Bahar Cavdar et.al. 2407.00173 null
2024-06-28 Online Optimization of DNN Inference Network Utility in Collaborative Edge Computing Rui Li et.al. 2406.19613 null
2024-06-27 Efficient and Distributed Large-Scale 3D Map Registration using Tomographic Features Halil Utku Unlu et.al. 2406.19461 link
2024-06-27 Cuts in Graphs with Matroid Constraints Aritra Banik et.al. 2406.19134 null
2024-06-27 State and Input Constrained Output-Feedback Adaptive Optimal Control of Affine Nonlinear Systems Tochukwu Elijah Ogri et.al. 2406.18804 null
2024-06-26 Markov Decision Process and Approximate Dynamic Programming for a Patient Assignment Scheduling problem Malgorzata M. O'Reilly et.al. 2406.18618 null
2024-06-26 Tiered Service Architecture for Remote Patient Monitoring Siddharth Chandak et.al. 2406.18000 null
2024-06-25 Splitting Guarantees for Prophet Inequalities via Nonlinear Systems Johannes Brustle et.al. 2406.17767 null
2024-06-25 Using iterated local alignment to aggregate GPS trajectories into a traffic flow map Tarn Duong et.al. 2406.17500 null
2024-06-24 A multiplicative surface signature through its Magnus expansion Ilya Chevyrev et.al. 2406.16856 null
2024-06-24 Stochastic Path-Dependent Volatility Models for Price-Storage Dynamics in Natural Gas Markets and Discrete-Time Swing Option Pricing Jinniao Qiu et.al. 2406.16400 null
2024-06-21 Exact discovery is polynomial for sparse causal Bayesian networks Felix L. Rios et.al. 2406.15012 link
2024-06-19 A programmable wafer-scale chiroptical heterostructure of twisted aligned carbon nanotubes and phase change materials Jichao Fan et.al. 2406.13190 null
2024-06-14 Interpretable Cascading Mixture-of-Experts for Urban Traffic Congestion Prediction Wenzhao Jiang et.al. 2406.12923 null
2024-06-26 LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging Jinuk Kim et.al. 2406.12837 link
2024-06-17 LibProf: A Python Profiler for Improving Cold Start Performance in Serverless Applications Syed Salauddin Mohammad Tariq et.al. 2406.11734 null
2024-06-17 Statistical Learning of Distributionally Robust Stochastic Control in Continuous State Spaces Shengbo Wang et.al. 2406.11281 null
2024-06-16 WeShap: Weak Supervision Source Evaluation with Shapley Values Naiqing Guan et.al. 2406.11010 null
2024-06-16 Solving Co-Path/Cycle Packing Faster than $3^k$ Yuxi Liu et.al. 2406.10829 null
2024-06-15 Scheduling two types of jobs with minimum makespan Song Cao et.al. 2406.10467 null
2024-06-14 CycleTrajectory: An End-to-End Pipeline for Enriching and Analyzing GPS Trajectories to Understand Cycling Behavior and Environment Meihui Wang et.al. 2406.10069 link
2024-06-13 Optimal Control of Agent-Based Dynamics under Deep Galerkin Feedback Laws Frederik Kelbel et.al. 2406.09141 link
2024-06-13 Coordinated Trading Strategies for Battery Storage in Reserve and Spot Markets Paul E. Seifert et.al. 2406.08390 null
2024-06-11 Flow Map Matching Nicholas M. Boffi et.al. 2406.07507 null
2024-06-11 Variational inequalities and smooth-fit principle for singular stochastic control problems in Hilbert spaces Salvatore Federico et.al. 2406.07242 null
2024-06-10 Stochastic Guidance of Buoyancy Controlled Vehicles under Ice Shelves using Ocean Currents Federico Rossi et.al. 2406.06724 null
2024-06-10 Leveraging Hyperscanning EEG and VR Omnidirectional Treadmill to Explore Inter-Brain Synchrony in Collaborative Spatial Navigation Chun-Hsiang Chuang et.al. 2406.06327 null
2024-06-09 Production and distribution planning, scheduling, and routing optimization in a yogurt supply chain under demand uncertainty: A case study Babak Javadi et.al. 2406.05803 null
2024-06-09 Heart Sound Segmentation Using Deep Learning Techniques Manas Madine et.al. 2406.05653 null
2024-06-11 Bisimulation Metrics are Optimal Transport Distances, and Can be Computed Efficiently Sergio Calo et.al. 2406.04056 null
2024-06-04 GrootVL: Tree Topology is All You Need in State Space Model Yicheng Xiao et.al. 2406.02395 link
2024-06-21 Branches: A Fast Dynamic Programming and Branch & Bound Algorithm for Optimal Decision Trees Ayman Chaouki et.al. 2406.02175 link
2024-06-03 An efficient solution to Hidden Markov Models on trees with coupled branches Farzan Vafa et.al. 2406.01663 null
2024-06-03 A New View on Planning in Online Reinforcement Learning Kevin Roice et.al. 2406.01562 null
2024-06-02 Dual Policy Reinforcement Learning for Real-time Rebalancing in Bike-sharing Systems Jiaqi Liang et.al. 2406.00868 null
2024-06-02 Computing Optimal Equilibria in Repeated Games with Restarts Ratip Emin Berker et.al. 2406.00851 null
2024-06-02 A Lazy Abstraction Algorithm for Markov Decision Processes: Theory and Initial Evaluation Dániel Szekeres et.al. 2406.00824 null
2024-06-10 Model Predictive Control and Reinforcement Learning: A Unified Framework Based on Dynamic Programming Dimitri P. Bertsekas et.al. 2406.00592 null
2024-06-01 Optimal Transmission Power Scheduling for Networked Control System under DoS Attack Siyi Wang et.al. 2406.00540 null
2024-06-01 A Single-Loop Robust Policy Gradient Method for Robust Markov Decision Processes Zhenwei Lin et.al. 2406.00274 link
2024-05-31 Finding Diverse Solutions Parameterized by Cliquewidth Karolina Drabik et.al. 2405.20931 null
2024-05-29 A numerical algorithm with linear complexity for Multi-marginal Optimal Transport with $L^1$ Cost Chunhui Chen et.al. 2405.19246 null
2024-05-28 A Pontryagin Perspective on Reinforcement Learning Onno Eberhard et.al. 2405.18100 null
2024-05-27 Q-value Regularized Transformer for Offline Reinforcement Learning Shengchao Hu et.al. 2405.17098 null
2024-05-25 A Bi-Objective Approach to Last-Mile Delivery Routing Considering Driver Preferences Juan Pablo Mesa et.al. 2405.16051 null
2024-06-03 Inference of Utilities and Time Preference in Sequential Decision-Making Haoyang Cao et.al. 2405.15975 null
2024-05-31 Stability and Performance Analysis of Model Predictive Control of Uncertain Linear Systems Changrui Liu et.al. 2405.15552 link
2024-05-24 An Approximate Dynamic Programming Framework for Occlusion-Robust Multi-Object Tracking Pratyusha Musunuru et.al. 2405.15137 null
2024-05-23 Two-Stage ML-Guided Decision Rules for Sequential Decision Making under Uncertainty Andrew Rosemberg et.al. 2405.14973 null
2024-05-23 A rolling horizon heuristic approach for a multi-stage stochastic waste collection problem Andrea Spinelli et.al. 2405.14499 link
2024-05-23 EdgeShard: Efficient LLM Inference via Collaborative Edge Computing Mingjin Zhang et.al. 2405.14371 null
2024-05-23 Optimal Whole Body Trajectory Planning for Mobile Manipulators in Planetary Exploration and Construction Federica Storiale et.al. 2405.14363 null
2024-05-23 Deterministic Policies for Constrained Reinforcement Learning in Polynomial-Time Jeremy McMahan et.al. 2405.14183 null
2024-05-22 Tackling Decision Processes with Non-Cumulative Objectives using Reinforcement Learning Maximilian Nägele et.al. 2405.13609 link
2024-05-21 Parallel Algorithm for Optimal Threshold Labeling of Ordinal Regression Methods Ryoya Yamasaki et.al. 2405.12756 link
2024-05-21 Short and simple introduction to Bellman filtering and smoothing Rutger-Jan Lange et.al. 2405.12668 null
2024-05-21 Data-driven Coordinated AC/DC Control Strategy for Frequency Safety Qianni Cao et.al. 2405.12546 null
2024-05-20 Semantic Trajectory Data Mining with LLM-Informed POI Classification Yifan Liu et.al. 2405.11715 null
2024-05-18 On the Trajectory Regularity of ODE-based Diffusion Sampling Defang Chen et.al. 2405.11326 link
2024-05-15 Harmonizing Human Insights and AI Precision: Hand in Hand for Advancing Knowledge Graph Task Shurong Wang et.al. 2405.09477 null
2024-05-14 Treatment Effect Estimation for User Interest Exploration on Recommender Systems Jiaju Chen et.al. 2405.08582 link
2024-05-27 Dynamic Programming for Symbolic Boolean Realizability and Synthesis Yi Lin et.al. 2405.07975 null
2024-05-13 Space Domain based Ecological Cooperative and Adaptive Cruise Control on Rolling Terrain Mingyue Lei et.al. 2405.07553 null
2024-05-12 Deciding regular games: a playground for exponential time algorithms Zihui Liang et.al. 2405.07188 null
2024-05-12 Trade execution games in a Markovian environment Masamitsu Ohnishi et.al. 2405.07184 null
2024-05-10 Dynamic programming principle and computable prices in financial market models with transaction costs Emmanuel Lepinette et.al. 2405.06623 null
2024-05-09 Change point localisation and inference in fragmented functional data Gengyu Xue et.al. 2405.05730 link
2024-05-09 Infinite horizon stochastic recursive control problems with jumps: dynamic programming and stochastic verification theorems Sheng Luo et.al. 2405.05561 null
2024-05-14 Robust Reward Placement under Uncertainty Petros Petsinis et.al. 2405.05433 null
2024-05-06 Novel Tour Construction Heuristic for Pick-Up and Delivery Routing Problems Mithun Goutham et.al. 2405.03774 null
2024-05-05 TSP Escapes the $O(2^n n^2)$ Curse Mihail Stoian et.al. 2405.03018 link
2024-05-02 DiffusionPipe: Training Large Diffusion Models with Efficient Pipelines Ye Tian et.al. 2405.01248 null
2024-05-02 Lipschitz constant estimation for general neural network architectures using control tools Patricia Pauli et.al. 2405.01125 link
2024-05-01 A biased random-key genetic algorithm with variable mutants to solve a vehicle routing problem Paola Festa et.al. 2405.00268 null
2024-04-28 Bi-objective optimization of a VRP problem applied to urban solid waste collection through a model that includes the visual attraction of routes Diego Rossit et.al. 2405.00068 null
2024-04-26 Energy Storage Arbitrage in Two-settlement Markets: A Transformer-Based Approach Saud Alghumayjan et.al. 2404.17683 null
2024-04-25 Path integral control under McKean-Vlasov dynamics Timothy Bennett et.al. 2404.17006 null
2024-04-25 Parallel and (Nearly) Work-Efficient Dynamic Programming Xiangyun Ding et.al. 2404.16314 link
2024-04-23 Prediction from compression for models with infinite memory, with applications to hidden Markov and renewal processes Yanjun Han et.al. 2404.15454 null
2024-04-26 Variational Dynamic Programming for Stochastic Optimal Control Marc Lambert et.al. 2404.14806 link
2024-04-22 Tile-Weighted Rate-Distortion Optimized Packet Scheduling for 360 $^\circ$ VR Video Streaming Haopeng Wang et.al. 2404.14573 null
2024-04-21 Stochastic Multi-round Submodular Optimization with Budget Vincenzo Auletta et.al. 2404.13737 null
2024-04-21 Planning of Truck Platooning for Road-Network Capacitated Vehicle Routing Problem Yilang Hao et.al. 2404.13512 null
2024-04-20 Liquidity Pool Design on Automated Market Makers Xue Dong He et.al. 2404.13291 null
2024-04-19 Decentralized Coordination of Distributed Energy Resources through Local Energy Markets and Deep Reinforcement Learning Daniel May et.al. 2404.13142 null
2024-04-18 NLP-enabled trajectory map-matching in urban road networks using transformer sequence-to-sequence model Sevin Mohammadi et.al. 2404.12460 null
2024-04-18 Recursive stochastic differential games with non-Lipschitzian generators and viscosity solutions of Hamilton-Jacobi-Bellman-Isaacs equation Guangchen Wang et.al. 2404.12129 null
2024-04-18 Actor-Critic Reinforcement Learning with Phased Actor Ruofan Wu et.al. 2404.11834 null
2024-04-18 Itō and Itō-Wentzell chain rule for flows of conditional laws of continuous semimartingales: an easy approach Assil Fadle et.al. 2404.11010 null
2024-04-16 Zero-Sum Games for Volterra Integral Equations and Viscosity Solutions of Path-Dependent Hamilton-Jacobi Equations Mikhail I. Gomoyunov et.al. 2404.10428 null
2024-04-16 Urban Water Sprinkler Routing: A Multi-Depot Mixed Capacitated Arc Routing Problem Incorporating Real-Time Demands Hongtai Yang et.al. 2404.10230 null
2024-04-13 Fast Gradient Computation for Gromov-Wasserstein Distance Wei Zhang et.al. 2404.08970 null
2024-04-12 A Parametric Approach for Solving Convex Quadratic Optimization with Indicators Over Trees Aaresh Bhathena et.al. 2404.08178 link
2024-04-06 Viscosity solutions for mean field optimal switching with a two-time-scale Markov chain Tian Chen et.al. 2404.07998 null
2024-04-11 Parameterized Fast and Safe Tracking (FaSTrack) using Deepreach Hyun Joe Jeong et.al. 2404.07431 null
2024-04-09 Inexact Policy Iteration Methods for Large-Scale Markov Decision Processes Matilde Gargiani et.al. 2404.06136 null
2024-04-09 fastcpd: Fast Change Point Detection in R Xingchi Li et.al. 2404.05933 link
2024-04-08 Non-concave distributionally robust stochastic control in a discrete time finite horizon setting Ariel Neufeld et.al. 2404.05230 link
2024-04-07 Percentile Criterion Optimization in Offline Reinforcement Learning Elita A. Lobo et.al. 2404.05055 link
2024-04-05 A Ground Mobile Robot for Autonomous Terrestrial Laser Scanning-Based Field Phenotyping Javier Rodriguez-Sanchez et.al. 2404.04404 null
2024-04-04 Forecasting with Neuro-Dynamic Programming Pedro Afonso Fernandes et.al. 2404.03737 null
2024-04-03 Reinforcement Learning in Categorical Cybernetics Jules Hedges et.al. 2404.02688 null
2024-04-03 Transformer-based Stagewise Decomposition for Large-Scale Multistage Stochastic Optimization Chanyeong Kim et.al. 2404.02583 null
2024-04-01 Versatile Navigation under Partial Observability via Value-guided Diffusion Policy Gengyu Zhang et.al. 2404.02176 null
2024-03-31 Adversarially-Robust Inference on Trees via Belief Propagation Samuel B. Hopkins et.al. 2404.00768 null
2024-03-28 A Faster Algorithm for Pigeonhole Equal Sums Ce Jin et.al. 2403.19117 null
2024-03-27 Policy iteration for discrete-time systems with discounted costs: stability and near-optimality guarantees Jonathan de Brusse et.al. 2403.19007 null
2024-03-27 A Dynamic Programming Approach for Road Traffic Estimation Mattia Laurini et.al. 2403.18561 null
2024-03-26 Generalized Maximum Entropy Differential Dynamic Programming Yuichiro Aoyama et.al. 2403.18130 null
2024-03-26 Accuracy enhancement method for speech emotion recognition from spectrogram using temporal frequency correlation and positional information learning through knowledge transfer Jeong-Yoon Kim et.al. 2403.17327 null
2024-03-25 State-Augmented Linear Games with Antagonistic Error for High-Dimensional, Nonlinear Hamilton-Jacobi Reachability Will Sharpless et.al. 2403.16982 link
2024-03-25 Semantic-Aware Remote Estimation of Multiple Markov Sources Under Constraints Jiping Luo et.al. 2403.16855 null
2024-03-24 On the Navier-Stokes equations and the Hamilton-Jacobi-Bellman equation on the group of volume preserving diffeomorphisms Xiang-Dong Li et.al. 2403.15997 null
2024-03-23 On Merton's Optimal Portfolio Problem under Sporadic Bankruptcy Yaacov Kopeliovich et.al. 2403.15923 link
2024-03-22 Transactive Local Energy Markets Enable Community-Level Resource Coordination Using Individual Rewards Daniel C. May et.al. 2403.15617 null
2024-03-19 Most Likely Sequence Generation for $n$ -Grams, Transformers, HMMs, and Markov Chains, by Using Rollout Algorithms Yuchao Li et.al. 2403.15465 null
2024-03-21 Conservative Linear Envelopes for High-Dimensional, Hamilton-Jacobi Reachability for Nonlinear Systems via the Hopf Formula Will Sharpless et.al. 2403.14184 null
2024-03-20 Optimal control of continuous-time symmetric systems with unknown dynamics and noisy measurements Hamed Taghavian et.al. 2403.13605 null
2024-03-19 Solving Combinatorial Pricing Problems using Embedded Dynamic Programming Models Quang Minh Bui et.al. 2403.12923 null
2024-03-18 AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition SooHwan Eom et.al. 2403.11578 null
2024-03-17 Multiscale Quantile Regression with Local Error Control Zhi Liu et.al. 2403.11356 link
2024-03-15 Fast Generation of Feasible Trajectories in Direct Optimal Control David Kiessling et.al. 2403.10115 link
2024-03-14 Is Data All That Matters? The Role of Control Frequency for Learning-Based Sampled-Data Control of Uncertain Systems Ralf Römer et.al. 2403.09504 link
2024-03-14 Quantum Dynamic Programming Jeongrak Son et.al. 2403.09187 null
2024-03-15 Relationship between General MP and DPP for the Stochastic Recursive Optimal Control Problem With Jumps: Viscosity Solution Framework Bin Wang et.al. 2403.09044 null
2024-03-13 Model-free Resilient Controller Design based on Incentive Feedback Stackelberg Game and Q-learning Jiajun Shen et.al. 2403.08948 null
2024-03-13 Online Multi-Contact Feedback Model Predictive Control for Interactive Robotic Tasks Seo Wook Han et.al. 2403.08302 null
2024-03-12 Optimal Design and Implementation of an Open-source Emulation Platform for User-Centric Shared E-mobility Services Maqsood Hussain Shah et.al. 2403.07964 null
2024-03-12 The Primal Pathwidth SETH Michael Lampis et.al. 2403.07239 null
2024-03-10 A Unified Model for Spatio-Temporal Prediction Queries with Arbitrary Modifiable Areal Units Liyue Chen et.al. 2403.07022 link
2024-03-11 Domain-Independent Dynamic Programming and Constraint Programming Approaches for Assembly Line Balancing Problems with Setups Jiachen Zhang et.al. 2403.06780 null
2024-03-11 Balanced Substructures in Bicolored Graphs P. S. Ardra et.al. 2403.06608 null
2024-03-11 An Efficient Solution to the 2D Visibility Problem in Cartesian Grid Maps and its Application in Heuristic Path Planning Ibrahim Ibrahim et.al. 2403.06494 link
2024-03-11 AGAThA: Fast and Efficient GPU Acceleration of Guided Sequence Alignment for Long Read Mapping Seongyeon Park et.al. 2403.06478 link
2024-03-09 Spatial Clustering Approach for Vessel Path Identification Mohamed Abuella et.al. 2403.05778 link
2024-03-07 On $[1,2]$ -Domination in Interval and Circle Graphs Mohsen Alambardar Meybodi et.al. 2403.04694 null
2024-03-07 Fill-and-Spill: Deep Reinforcement Learning Policy Gradient Methods for Reservoir Operation Decision and Control Sadegh Sadeghi Tabas et.al. 2403.04195 null
2024-03-06 Global Geolocated Realtime Data of Interfleet Urban Transit Bus Idling Nicholas Kunz et.al. 2403.03489 link
2024-03-06 SalienTime: User-driven Selection of Salient Time Steps for Large-Scale Geospatial Data Visualization Juntong Chen et.al. 2403.03449 link
2024-03-06 Leveraging The Finite States of Emotion Processing to Study Late-Life Mental Health Yuanzhe Huang et.al. 2403.03414 null
2024-03-04 Dynamic programming principle in cost-efficient sequential design: application to switching measurements Jeongmin Han et.al. 2403.02245 null
2024-03-04 Cooperative and Interaction-aware Driver Model for Lane Change Maneuver Jemin Woo et.al. 2403.01752 null
2024-03-01 DyPyBench: A Benchmark of Executable Python Software Islem Bouzenia et.al. 2403.00539 link
2024-03-01 Graph Construction with Flexible Nodes for Traffic Demand Prediction Jinyan Hou et.al. 2403.00276 link
2024-02-29 Lifelong Benchmarks: Efficient Model Evaluation in an Era of Rapid Progress Ameya Prabhu et.al. 2402.19472 link
2024-02-27 Globally Convergent Distributed Sequential Quadratic Programming with Overlapping Decomposition and Exact Augmented Lagrangian Merit Function Runxin Ni et.al. 2402.17170 null
2024-02-24 Selective Task offloading for Maximum Inference Accuracy and Energy efficient Real-Time IoT Sensing Systems Abdelkarim Ben Sada et.al. 2402.16904 null
2024-02-25 IKLink: End-Effector Trajectory Tracking with Minimal Reconfigurations Yeping Wang et.al. 2402.16154 link
2024-02-25 Evolving E-commerce Logistics Planning- Integrating Embedded Technology and Ant Colony Algorithm for Enhanced Efficiency Lynn Huang et.al. 2402.15965 null
2024-02-25 Budget-Constrained Tool Learning with Planning Yuanhang Zheng et.al. 2402.15960 link
2024-02-23 Neural optimal controller for stochastic systems via pathwise HJB operator Zhe Jiao et.al. 2402.15592 null
2024-02-23 Curve fitting on a quantum annealer for an advanced navigation method Philipp Isserstedt et.al. 2402.15308 null
2024-02-22 Quantum Markov Decision Processes Part II: Optimal Solutions and Algorithms Naci Saldi et.al. 2402.14651 null
2024-02-22 Quantum Markov Decision Processes Part I: General Theory, Approximations, and Classes of Policies Naci Saldi et.al. 2402.14649 null
2024-02-21 Quantum Annealing and Graph Neural Networks for Solving TSP with QUBO Haoqi He et.al. 2402.14036 null
2024-02-21 Do Efficient Transformers Really Save Computation? Kai Yang et.al. 2402.13934 null
2024-02-21 Benchmarking and Dissecting the Nvidia Hopper GPU Architecture Weile Luo et.al. 2402.13499 null
2024-02-20 An Improved Lower Bound on the Number of Pseudoline Arrangements Fernando Cortés Kühnast et.al. 2402.13107 null
2024-02-20 Smart Mobility Digital Twin Based Automated Vehicle Navigation System: A Proof of Concept Kui Wang et.al. 2402.12682 null
2024-02-19 An algorithm for counting number of all (normal) fuzzy subgroups in $U_{6n}$ Marek Hyčko et.al. 2402.12543 null
2024-02-29 Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding Zhuoming Chen et.al. 2402.12374 link
2024-02-19 Scalable Virtual Valuations Combinatorial Auction Design by Combining Zeroth-Order and First-Order Optimization Method Zhijian Duan et.al. 2402.11904 null
2024-02-19 Two Online Map Matching Algorithms Based on Analytic Hierarchy Process and Fuzzy Logic Jeremy J. Lin et.al. 2402.11866 null
2024-02-18 A Fisher Information based Receding Horizon Control Method for Signal Strength Model Estimation Yancheng Zhu et.al. 2402.11483 null
2024-02-16 Optimal Savings and Value of Population in A Stochastic Environment: Transient Behavior Hao Liu et.al. 2402.10768 null
2024-02-15 Engraving Oriented Joint Estimation of Pitch Spelling and Local and Global Keys Augustin Bouquillard et.al. 2402.10247 null
2024-02-14 Analyzing the Impact of Computation in Adaptive Dynamic Programming for Stochastic LQR Problem Wenhan Cao et.al. 2402.09575 null
2024-02-13 Approximate Sequential Optimization for Informative Path Planning Joshua Ott et.al. 2402.08841 link
2024-02-13 Sequence graphs realizations and ambiguity in language models Sammy Khalife et.al. 2402.08830 null
2024-02-11 GenSTL: General Sparse Trajectory Learning via Auto-regressive Generation of Feature Domains Yan Lin et.al. 2402.07232 link
2024-02-09 High-Precision Geosteering via Reinforcement Learning and Particle Filters Ressi Bonti Muhammad et.al. 2402.06377 null
2024-02-09 Bellman Conformal Inference: Calibrating Prediction Intervals For Time Series Zitong Yang et.al. 2402.05203 link
2024-02-04 Empowering Computing and Networks Convergence System with Distributed Cooperative Routing Yujiao Hu et.al. 2402.02381 null
2024-02-03 Multiple sequences Prophet Inequality Under Observation Constraints Aristomenis Tsopelakos et.al. 2402.02059 null
2024-02-02 Capturing waste collection planning expert knowledge in a fitness function through preference learning Laura Fernández Díaz et.al. 2402.01849 null
2024-02-02 Dynamic programming for the stochastic matching model on general graphs: the case of the `N-graph' Loïc Jean et.al. 2402.01803 null
2024-02-01 AlphaRank: An Artificial Intelligence Approach for Ranking and Selection Problems Ruihan Zhou et.al. 2402.00907 null
2024-02-01 Cocco: Hardware-Mapping Co-Exploration towards Memory Capacity-Communication Optimization Zhanhong Tan et.al. 2402.00629 null
2024-02-02 Branch and Price for the Length-Constrained Cycle Partition Problem Mohammed Ghannam et.al. 2401.17937 link
2024-01-31 Revisiting speech segmentation and lexicon learning with better features Herman Kamper et.al. 2401.17902 null
2024-02-16 The computation of approximate feedback Stackelberg equilibria in multi-player nonlinear constrained dynamic games Jingqi Li et.al. 2401.15745 link
2024-01-28 HappyRouting: Learning Emotion-Aware Route Trajectories for Scalable In-The-Wild Navigation David Bethge et.al. 2401.15695 null
2024-01-28 Constrained Markov decision processes for response-adaptive procedures in clinical trials with binary outcomes Stef Baas et.al. 2401.15694 null
2024-01-27 Fair and Efficient Ridesharing: A Dynamic Programming-based Relocation Approach Aqsa Ashraf Makhdomi et.al. 2401.15363 null
2024-01-27 Optimal Sparse Survival Trees Rui Zhang et.al. 2401.15330 link
2024-01-25 Domain-Independent Dynamic Programming Ryo Kuroiwa et.al. 2401.13883 link
2024-01-27 Deep multitask neural networks for solving some stochastic optimal control problems Christian Yeo et.al. 2401.12923 link
2024-01-23 Optimal Stopping of Branching Diffusion Processes Idris Kharroubi et.al. 2401.12811 null
2024-01-22 On a class of interdiction problems with partition matroids: complexity and polynomial-time algorithms Sergey S. Ketkov et.al. 2401.12010 null
2024-01-22 Finite horizon optimal control of reaction-diffusion SIV epidemic system with stochastic environment Zong Wang et.al. 2401.11744 null
2024-01-20 Closing the Gap between TD Learning and Supervised Learning -- A Generalisation Point of View Raj Ghugare et.al. 2401.11237 link

(back to top)

Large Language Model

Publish Date Title Authors PDF Code
2024-09-19 Gender Representation and Bias in Indian Civil Service Mock Interviews Somonnoy Banerjee et.al. 2409.12194 null
2024-09-18 Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution Peng Wang et.al. 2409.12191 link
2024-09-18 To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning Zayne Sprague et.al. 2409.12183 null
2024-09-18 A Controlled Study on Long Context Extension and Generalization in LLMs Yi Lu et.al. 2409.12181 link
2024-09-18 Finetuning Language Models to Emit Linguistic Expressions of Uncertainty Arslan Chaudhry et.al. 2409.12180 null
2024-09-18 Decoding Style: Efficient Fine-Tuning of LLMs for Image-Guided Outfit Recommendation with Preference Najmeh Forouzandehmehr et.al. 2409.12150 null
2024-09-18 MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning Justin Chih-Yao Chen et.al. 2409.12147 link
2024-09-18 MoRAG -- Multi-Fusion Retrieval Augmented Generation for Human Motion Kalakonda Sai Shashank et.al. 2409.12140 null
2024-09-18 Takin: A Cohort of Superior Quality Zero-shot Speech Generation Models EverestAI et.al. 2409.12139 null
2024-09-18 GRIN: GRadient-INformed MoE Liyuan Liu et.al. 2409.12136 null
2024-09-18 Linguini: A benchmark for language-agnostic linguistic reasoning Eduardo Sánchez et.al. 2409.12126 null
2024-09-18 Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement An Yang et.al. 2409.12122 null
2024-09-18 Low Frame-rate Speech Codec: a Codec Designed for Fast High-quality Speech LLM Training and Inference Edresson Casanova et.al. 2409.12117 null
2024-09-18 Measuring Human and AI Values based on Generative Psychometrics with Large Language Models Haoran Ye et.al. 2409.12106 link
2024-09-19 Skill matching at scale: freelancer-project alignment for efficient multilingual candidate retrieval Warren Jouanneau et.al. 2409.12097 null
2024-09-19 The Impact of Element Ordering on LM Agent Performance Wayne Chi et.al. 2409.12089 link
2024-09-18 Dual-Layer Training and Decoding of Large Language Model with Simultaneously Thinking and Speaking Ningyuan Xi et.al. 2409.12059 null
2024-09-19 Using Large Language Models to Generate Clinical Trial Tables and Figures Yumeng Yang et.al. 2409.12046 null
2024-09-18 All-in-one foundational models learning across quantum chemical levels Yuxinxin Chen et.al. 2409.12015 link
2024-09-18 Mixture of Prompt Learning for Vision Language Models Yu Du et.al. 2409.12011 null
2024-09-17 AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs Basel Mousi et.al. 2409.11404 null
2024-09-17 NVLM: Open Frontier-Class Multimodal LLMs Wenliang Dai et.al. 2409.11402 null
2024-09-17 Says Who? Effective Zero-Shot Annotation of Focalization Rebecca M. M. Hicke et.al. 2409.11390 null
2024-09-17 Diversify and Conquer: Diversity-Centric Data Selection with Iterative Refinement Simon Yu et.al. 2409.11378 null
2024-09-17 Towards Time Series Reasoning with LLMs Winnie Chow et.al. 2409.11376 null
2024-09-17 Multi-OCT-SelfNet: Integrating Self-Supervised Learning with Multi-Source Data Fusion for Enhanced Multi-Class Retinal Disease Classification Fatema-E- Jannat et.al. 2409.11375 null
2024-09-17 Learning Spatially-Aware Language and Audio Embedding Bhavika Devnani et.al. 2409.11369 null
2024-09-17 CoCA: Regaining Safety-awareness of Multimodal Large Language Models with Constitutional Calibration Jiahui Gao et.al. 2409.11365 null
2024-09-17 CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibility Agent Benchmark Zachary S. Siegel et.al. 2409.11363 link
2024-09-17 AI Suggestions Homogenize Writing Toward Western Styles and Diminish Cultural Nuances Dhruv Agarwal et.al. 2409.11360 null
2024-09-17 THaMES: An End-to-End Tool for Hallucination Mitigation and Evaluation in Large Language Models Mengfei Liang et.al. 2409.11353 null
2024-09-17 LPT++: Efficient Training on Mixture of Long-tailed Experts Bowen Dong et.al. 2409.11323 null
2024-09-17 SOAP: Improving and Stabilizing Shampoo using Adam Nikhil Vyas et.al. 2409.11321 link
2024-09-17 Beyond LoRA: Exploring Efficient Fine-Tuning Techniques for Time Series Foundational Models Divij Gupta et.al. 2409.11302 null
2024-09-17 Leveraging Distillation Techniques for Document Understanding: A Case Study with FLAN-T5 Marcel Lamott et.al. 2409.11282 null
2024-09-17 P-RAG: Progressive Retrieval Augmented Generation For Planning on Embodied Everyday Task Weiye Xu et.al. 2409.11279 null
2024-09-17 Hackphyr: A Local Fine-Tuned LLM Agent for Network Security Environments Maria Rigaki et.al. 2409.11276 null
2024-09-17 Task Arithmetic for Language Expansion in Speech Translation Yao-Fei Cheng et.al. 2409.11274 null
2024-09-18 LOLA -- An Open-Source Massively Multilingual Large Language Model Nikit Srivastava et.al. 2409.11272 link
2024-09-17 Bio-Inspired Mamba: Temporal Locality and Bioplausible Learning in Selective State Space Models Jiahao Qin et.al. 2409.11263 null
2024-09-16 RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval Di Liu et.al. 2409.10516 null
2024-09-16 Context-aware Code Segmentation for C-to-Rust Translation using Large Language Models Momoko Shiraishi et.al. 2409.10506 null
2024-09-16 DILA: Dictionary Label Attention for Mechanistic Interpretability in High-dimensional Multi-label Medical Coding Prediction John Wu et.al. 2409.10504 null
2024-09-16 Causal Language Modeling Can Elicit Search and Reasoning Capabilities on Logic Puzzles Kulin Shah et.al. 2409.10502 null
2024-09-16 Code Vulnerability Detection: A Comparative Analysis of Emerging Large Language Models Shaznin Sultana et.al. 2409.10490 null
2024-09-16 Do Pre-trained Vision-Language Models Encode Object States? Kaleb Newman et.al. 2409.10488 null
2024-09-16 XLM for Autonomous Driving Systems: A Comprehensive Review Sonda Fourati et.al. 2409.10484 null
2024-09-17 Schrodinger's Memory: Large Language Models Wei Wang et.al. 2409.10482 null
2024-09-16 Towards Semantic Versioning of Open Pre-trained Language Model Releases on Hugging Face Adekunle Ajibode et.al. 2409.10472 null
2024-09-16 LLM as BT-Planner: Leveraging LLMs for Behavior Tree Generation in Robot Task Planning Jicong Ao et.al. 2409.10444 null
2024-09-16 CtRNet-X: Camera-to-Robot Pose Estimation in Real-world Conditions Using a Single Camera Jingpei Lu et.al. 2409.10441 null
2024-09-16 HiFi-CS: Towards Open Vocabulary Visual Grounding For Robotic Grasping Using Vision-Language Models Vineet Bhat et.al. 2409.10419 null
2024-09-16 A Large-Scale Privacy Assessment of Android Third-Party SDKs Mark Huasong Meng et.al. 2409.10411 null
2024-09-16 A Knowledge-Enhanced Disease Diagnosis Method Based on Prompt Learning and BERT Integration Zhang Zheng et.al. 2409.10403 null
2024-09-17 Learnings from a Large-Scale Deployment of an LLM-Powered Expert-in-the-Loop Healthcare Chatbot Bhuvan Sachdeva et.al. 2409.10354 null
2024-09-16 Large Language Model Enhanced Hard Sample Identification for Denoising Recommendation Tianrui Song et.al. 2409.10343 null
2024-09-16 The 20 questions game to distinguish large language models Gurvan Richardeau et.al. 2409.10338 null
2024-09-16 MGSA: Multi-granularity Graph Structure Attention for Knowledge Graph-to-Text Generation Shanshan Wang et.al. 2409.10294 null
2024-09-16 ReflectDiffu: Reflect between Emotion-intent Contagion and Mimicry for Empathetic Response Generation via a RL-Diffusion Framework Jiahao Yuan et.al. 2409.10289 null
2024-09-16 ComplexCodeEval: A Benchmark for Evaluating Large Code Models on More Complex Code Jia Feng et.al. 2409.10280 null
2024-09-13 Agents in Software Engineering: Survey, Landscape, and Vision Yanxian Huang et.al. 2409.09030 link
2024-09-13 Contri(e)ve: Context + Retrieve for Scholarly Question Answering Kanchan Shivashankar et.al. 2409.09010 null
2024-09-13 Safeguarding Decentralized Social Media: LLM Agents for Automating Community Rule Compliance Lucio La Cava et.al. 2409.08963 null
2024-09-13 Emerging Reliance Behaviors in Human-AI Text Generation: Hallucinations, Data Quality Assessment, and Cognitive Forcing Functions Zahra Ashktorab et.al. 2409.08937 null
2024-09-13 SynSUM -- Synthetic Benchmark with Structured and Unstructured Medical Records Paloma Rabaey et.al. 2409.08936 link
2024-09-13 LLM-based Weak Supervision Framework for Query Intent Classification in Video Search Farnoosh Javadi et.al. 2409.08931 null
2024-09-13 Affective Computing Has Changed: The Foundation Model Disruption Björn Schuller et.al. 2409.08907 null
2024-09-13 AnyBipe: An End-to-End Framework for Training and Deploying Bipedal Robots Guided by Large Language Models Yifei Yao et.al. 2409.08904 null
2024-09-13 A Market for Lemons? Strategic Directions for a Vigilant Application of Artificial Intelligence in Entrepreneurship Research Martin Obschonka et.al. 2409.08890 null
2024-09-13 Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark Xuchen Li et.al. 2409.08887 null
2024-09-13 Exploring Graph Structure Comprehension Ability of Multimodal Large Language Models: Case Studies Zhiqiang Zhong et.al. 2409.08864 null
2024-09-13 FP-VEC: Fingerprinting Large Language Models via Efficient Vector Addition Zhenhua Xu et.al. 2409.08846 null
2024-09-13 AIPO: Improving Training Objective for Iterative Preference Optimization Yaojie Shen et.al. 2409.08845 null
2024-09-13 A RAG Approach for Generating Competency Questions in Ontology Engineering Xueli Pan et.al. 2409.08820 null
2024-09-13 Your Weak LLM is Secretly a Strong Teacher for Alignment Leitian Tao et.al. 2409.08813 null
2024-09-13 Mutual Theory of Mind in Human-AI Collaboration: An Empirical Study with LLM-driven AI Agents in a Real-time Shared Workspace Task Shao Zhang et.al. 2409.08811 null
2024-09-13 LLaQo: Towards a Query-Based Coach in Expressive Music Performance Assessment Huan Zhang et.al. 2409.08795 null
2024-09-13 Optimizing Ingredient Substitution Using Large Language Models to Enhance Phytochemical Content in Recipes Luis Rita et.al. 2409.08792 null
2024-09-13 Electrocardiogram Report Generation and Question Answering via Retrieval-Augmented Self-Supervised Modeling Jialu Tang et.al. 2409.08788 null
2024-09-13 Uncertainty and Generalizability in Foundation Models for Earth Observation Raul Ramos-Pollan et.al. 2409.08744 null
2024-09-12 Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale Rogerio Bonatti et.al. 2409.08264 link
2024-09-12 OmniQuery: Contextually Augmenting Captured Multimodal Memory to Enable Personal Question Answering Jiahao Nick Li et.al. 2409.08250 null
2024-09-12 Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources Alisia Lupidi et.al. 2409.08239 null
2024-09-12 LLM Honeypot: Leveraging Large Language Models as Advanced Interactive Honeypot Systems Hakan T. Otal et.al. 2409.08234 link
2024-09-12 Adaptive Language-Guided Abstraction from Contrastive Explanations Andi Peng et.al. 2409.08212 null
2024-09-12 ComAlign: Compositional Alignment in Vision-Language Models Ali Abdollah et.al. 2409.08206 null
2024-09-12 What Makes a Maze Look Like a Maze? Joy Hsu et.al. 2409.08202 null
2024-09-12 AudioBERT: Audio Knowledge Augmented Language Model Hyunjong Ok et.al. 2409.08199 link
2024-09-12 Fine-tuning Large Language Models for Entity Matching Aaron Steiner et.al. 2409.08185 link
2024-09-12 On the Role of Context in Reading Time Prediction Andreas Opedal et.al. 2409.08160 link
2024-09-12 Faster Speech-LLaMA Inference with Multi-token Prediction Desh Raj et.al. 2409.08148 null
2024-09-12 LLM-POTUS Score: A Framework of Analyzing Presidential Debates with Large Language Models Zhengliang Liu et.al. 2409.08147 null
2024-09-12 Towards a graph-based foundation model for network traffic analysis Louis Van Langendonck et.al. 2409.08111 null
2024-09-12 The Faetar Benchmark: Speech Recognition in a Very Under-Resourced Language Michael Ong et.al. 2409.08103 null
2024-09-12 The CLC-UKET Dataset: Benchmarking Case Outcome Prediction for the UK Employment Tribunal Huiyuan Xie et.al. 2409.08098 null
2024-09-12 Securing Large Language Models: Addressing Bias, Misinformation, and Prompt Attacks Benji Peng et.al. 2409.08087 null
2024-09-12 SimMAT: Exploring Transferability from Vision Foundation Models to Any Image Modality Chenyang Lei et.al. 2409.08083 link
2024-09-12 SoVAR: Building Generalizable Scenarios from Accident Reports for Autonomous Driving Testing An Guo et.al. 2409.08081 null
2024-09-12 TravelAgent: An AI Assistant for Personalized Travel Planning Aili Chen et.al. 2409.08069 null
2024-09-12 An Evaluation Framework for Attributed Information Retrieval using Large Language Models Hanane Djeddal et.al. 2409.08014 link
2024-09-11 "My Grade is Wrong!": A Contestable AI Framework for Interactive Feedback in Evaluating Student Essays Shengxin Hong et.al. 2409.07453 null
2024-09-11 StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos Sijie Zhao et.al. 2409.07447 null
2024-09-11 SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories Ben Bogin et.al. 2409.07440 link
2024-09-11 A Suite for Acoustic Language Model Evaluation Gallil Maimon et.al. 2409.07437 link
2024-09-11 Synthetic continued pretraining Zitong Yang et.al. 2409.07431 link
2024-09-11 Agent Workflow Memory Zora Zhiruo Wang et.al. 2409.07429 link
2024-09-11 CLNX: Bridging Code and Natural Language for C/C++ Vulnerability-Contributing Commits Identification Zeqing Qin et.al. 2409.07407 null
2024-09-11 AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge Han Wang et.al. 2409.07394 link
2024-09-11 Awaking the Slides: A Tuning-free and Knowledge-regulated AI Tutoring System via Language Model Coordination Daniel Zhang-Li et.al. 2409.07372 null
2024-09-11 Demo: SGCode: A Flexible Prompt-Optimizing System for Secure Generation of Code Khiem Ton et.al. 2409.07368 null
2024-09-11 Think Together and Work Better: Combining Humans' and LLMs' Think-Aloud Outcomes for Effective Text Evaluation SeongYeub Chu et.al. 2409.07355 link
2024-09-11 Securing Vision-Language Models with a Robust Encoder Against Jailbreak and Adversarial Attacks Md Zarif Hossain et.al. 2409.07353 link
2024-09-11 Explanation, Debate, Align: A Weak-to-Strong Framework for Language Model Generalization Mehrdad Zakershahrak et.al. 2409.07335 null
2024-09-11 Learning to Compress Contexts for Efficient Knowledge-based Visual Question Answering Weixi Weng et.al. 2409.07331 null
2024-09-11 MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications Praveen K Kanithi et.al. 2409.07314 null
2024-09-11 Exploring User-level Gradient Inversion with a Diffusion Prior Zhuohang Li et.al. 2409.07291 null
2024-09-11 STORE: Streamlining Semantic Tokenization and Generative Recommendation with A Single LLM Qijiong Liu et.al. 2409.07276 null
2024-09-11 MiniDrive: More Efficient Vision-Language Models with Multi-Level 2D Features as Text Tokens for Autonomous Driving Enming Zhang et.al. 2409.07267 link
2024-09-12 Alignment of Diffusion Models: Fundamentals, Challenges, and Future Buhua Liu et.al. 2409.07253 link
2024-09-11 PiTe: Pixel-Temporal Alignment for Large Video-Language Model Yang Liu et.al. 2409.07239 link
2024-09-10 Benchmarking Sub-Genre Classification For Mainstage Dance Music Hongzhi Shu et.al. 2409.06690 null
2024-09-10 E2LLM: Encoder Elongated Large Language Models for Long-Context Understanding and Reasoning Zihan Liao et.al. 2409.06679 null
2024-09-10 LLaMA-Omni: Seamless Speech Interaction with Large Language Models Qingkai Fang et.al. 2409.06666 link
2024-09-10 Human Perception of LLM-generated Text Content in Social Media Environments Kristina Radivojevic et.al. 2409.06653 null
2024-09-10 Optimal Workload Placement on Multi-Instance GPUs Bekir Turkkan et.al. 2409.06646 null
2024-09-11 EyeCLIP: A visual-language foundation model for multi-modal ophthalmic image analysis Danli Shi et.al. 2409.06644 null
2024-09-11 Segmenting sea ice floes in close-range optical imagery with active contour and foundation models Giulio Passerotti et.al. 2409.06641 null
2024-09-10 TeXBLEU: Automatic Metric for Evaluate LaTeX Format Kyudan Jung et.al. 2409.06639 link
2024-09-10 MoWE-Audio: Multitask AudioLLMs with Mixture of Weak Encoders Wenyu Zhang et.al. 2409.06635 null
2024-09-10 A Practice of Post-Training on Llama-3 70B with Optimal Selection of Additional Language Mixture Ratio Ningyuan Xi et.al. 2409.06624 null
2024-09-10 Exploring Italian sentence embeddings properties through multi-tasking Vivi Nastase et.al. 2409.06622 null
2024-09-10 Alleviating Hallucinations in Large Language Models with Scepticism Modeling Yetao Wu et.al. 2409.06601 null
2024-09-10 GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering Sacha Muller et.al. 2409.06595 link
2024-09-10 Quantifying and Enabling the Interpretability of CLIP-like Models Avinash Madasu et.al. 2409.06579 null
2024-09-10 Exploring syntactic information in sentence embeddings through multilingual subject-verb agreement Vivi Nastase et.al. 2409.06567 null
2024-09-10 MAPS: Energy-Reliability Tradeoff Management in Autonomous Vehicles Through LLMs Penetrated Science Mahdieh Aliazam et.al. 2409.06558 null
2024-09-10 Questioning Internal Knowledge Structure of Large Language Models Through the Lens of the Olympic Games Juhwan Choi et.al. 2409.06518 link
2024-09-10 Aligning Machine and Human Visual Representations across Abstraction Levels Lukas Muttenthaler et.al. 2409.06509 null
2024-09-10 Mitigating Hallucination in Visual-Language Models via Re-Balancing Contrastive Decoding Xiaoyu Liang et.al. 2409.06485 null
2024-09-10 Multimodal Large Language Model Driven Scenario Testing for Autonomous Vehicles Qiujing Lu et.al. 2409.06450 null
2024-09-09 MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct Run Luo et.al. 2409.05840 null
2024-09-09 Are Large Language Models a Threat to Programming Platforms? An Exploratory Study Md Mustakim Billah et.al. 2409.05824 null
2024-09-09 VFA: Vision Frequency Analysis of Foundation Models and Human Mohammad-Javad Darvishi-Bayazi et.al. 2409.05817 null
2024-09-09 Improving Pretraining Data Using Perplexity Correlations Tristan Thrush et.al. 2409.05816 null
2024-09-09 Benchmarking Chinese Knowledge Rectification in Large Language Models Tianhe Lu et.al. 2409.05806 link
2024-09-09 Evidence from fMRI Supports a Two-Phase Abstraction Process in Language Models Emily Cheng et.al. 2409.05771 null
2024-09-09 Model Input Verification of Large Scale Simulations Rumyana Neykova et.al. 2409.05768 null
2024-09-09 A Novel Idea Generation Tool using a Structured Conversational AI (CAI) System B. Sankar et.al. 2409.05747 null
2024-09-09 LLMs Will Always Hallucinate, and We Need to Live With This Sourav Banerjee et.al. 2409.05746 null
2024-09-09 A System and Benchmark for LLM-based Q&A on Heterogeneous Data Achille Fokoue et.al. 2409.05735 null
2024-09-09 Towards Democratizing Multilingual Large Language Models For Medicine Through A Two-Stage Instruction Fine-tuning Approach Meng Zhou et.al. 2409.05732 null
2024-09-09 The Influence of Task and Group Disparities over Users' Attitudes Toward Using Large Language Models for Psychotherapy Qihang He et.al. 2409.05703 null
2024-09-09 Segmentation by Factorization: Unsupervised Semantic Segmentation for Pathology by Factorizing Foundation Model Features Jacob Gildenblat et.al. 2409.05697 null
2024-09-09 Zero-shot Outlier Detection via Prior-data Fitted Networks: Model Selection Bygone! Yuchen Shen et.al. 2409.05672 null
2024-09-09 Revisiting English Winogender Schemas for Consistency, Coverage, and Grammatical Case Vagrant Gautam et.al. 2409.05653 link
2024-09-10 MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery Hongjin Qian et.al. 2409.05591 link
2024-09-09 Leveraging Content and Acoustic Representations for Efficient Speech Emotion Recognition Soumya Dutta et.al. 2409.05566 null
2024-09-09 CauseJudger: Identifying the Cause with LLMs for Abductive Logical Reasoning Jinwei He et.al. 2409.05559 null
2024-09-09 SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoning Alireza Ghafarollahi et.al. 2409.05556 link
2024-09-09 Harmonic Reasoning in Large Language Models Anna Kruspe et.al. 2409.05521 null
2024-09-06 VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation Yecheng Wu et.al. 2409.04429 null
2024-09-06 Exploring Foundation Models for Synthetic Medical Imaging: A Study on Chest X-Rays and Fine-Tuning Techniques Davide Clode da Silva et.al. 2409.04424 null
2024-09-06 RLPF: Reinforcement Learning from Prediction Feedback for User Summarization with LLMs Jiaxing Wu et.al. 2409.04421 null
2024-09-06 Question-Answering Dense Video Events Hangyu Qin et.al. 2409.04388 null
2024-09-06 Learning vs Retrieval: The Role of In-Context Examples in Regression with LLMs Aliakbar Nafar et.al. 2409.04318 link
2024-09-06 An optically accelerated extreme learning machine using hot atomic vapors Pierre Azam et.al. 2409.04312 null
2024-09-06 Using Large Language Models to Generate Authentic Multi-agent Knowledge Work Datasets Desiree Heim et.al. 2409.04286 null
2024-09-06 Advancing Automated Knowledge Transfer in Evolutionary Multitasking via Large Language Models Yuxiao Huang et.al. 2409.04270 null
2024-09-06 An overview of domain-specific foundation model: key technologies, applications and challenges Haolong Chen et.al. 2409.04267 null
2024-09-06 UniDet3D: Multi-dataset Indoor 3D Object Detection Maksim Kolodiazhnyi et.al. 2409.04234 link
2024-09-06 Fast Forwarding Low-Rank Training Adir Rahamim et.al. 2409.04206 null
2024-09-06 Residual Stream Analysis with Multi-Layer SAEs Tim Lawson et.al. 2409.04185 link
2024-09-06 GALLa: Graph Aligned Large Language Models for Improved Source Code Understanding Ziyin Zhang et.al. 2409.04183 null
2024-09-06 Combining LLMs and Knowledge Graphs to Reduce Hallucinations in Question Answering Larissa Pusch et.al. 2409.04181 null
2024-09-06 From Calculation to Adjudication: Examining LLM judges on Mathematical Reasoning Tasks Andreas Stephan et.al. 2409.04168 null
2024-09-06 Can OpenSource beat ChatGPT? -- A Comparative Study of Large Language Models for Text-to-Code Generation Luis Mayer et.al. 2409.04164 null
2024-09-06 Prompt-based Personality Profiling: Reinforcement Learning for Relevance Filtering Jan Hofmann et.al. 2409.04122 null
2024-09-06 Multi-Programming Language Ensemble for Code Generation in Large Language Model Tengfei Xue et.al. 2409.04114 link
2024-09-06 Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers Chenglei Si et.al. 2409.04109 link
2024-09-06 UI-JEPA: Towards Active Perception of User Intent through Onscreen User Activity Yicheng Fu et.al. 2409.04081 null
2024-09-05 Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding Yunze Man et.al. 2409.03757 link
2024-09-05 Foundation Model or Finetune? Evaluation of few-shot semantic segmentation for river pollution Marga Don et.al. 2409.03754 link
2024-09-05 Attention Heads of Large Language Models: A Survey Zifan Zheng et.al. 2409.03752 link
2024-09-05 LLM-CI: Assessing Contextual Integrity Norms in Language Models Yan Shvartzshnaider et.al. 2409.03735 null
2024-09-05 Safety vs. Performance: How Multi-Objective Learning Reduces Barriers to Market Entry Meena Jagadeesan et.al. 2409.03734 null
2024-09-05 Planning In Natural Language Improves LLM Search For Code Generation Evan Wang et.al. 2409.03733 null
2024-09-06 RAG based Question-Answering for Contextual Response Prediction System Sriram Veturi et.al. 2409.03708 null
2024-09-05 LAST: Language Model Aware Speech Tokenization Arnon Turetzky et.al. 2409.03701 null
2024-09-05 TRACE-cs: Trustworthy Reasoning for Contrastive Explanations in Course Scheduling Problems Stylianos Loukas Vasileiou et.al. 2409.03671 null
2024-09-05 A Fused Large Language Model for Predicting Startup Success Abdurahman Maarouf et.al. 2409.03668 null
2024-09-05 The representation landscape of few-shot learning and fine-tuning in large language models Diego Doimo et.al. 2409.03662 link
2024-09-06 LLM-based multi-agent poetry generation in non-cooperative environments Ran Zhang et.al. 2409.03659 link
2024-09-05 On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference Optimization Yong Lin et.al. 2409.03650 null
2024-09-05 Text-Guided Mixup Towards Long-Tailed Image Categorization Richard Franklin et.al. 2409.03583 link
2024-09-05 FrozenSeg: Harmonizing Frozen Foundation Models for Open-Vocabulary Segmentation Xi Chen et.al. 2409.03525 null
2024-09-05 Have Large Vision-Language Models Mastered Art History? Ombretta Strafforello et.al. 2409.03521 null
2024-09-05 Tissue Concepts: supervised foundation models in computational pathology Till Nicke et.al. 2409.03519 link
2024-09-05 From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents Jifan Yu et.al. 2409.03512 null
2024-09-05 LLM-based event abstraction and integration for IoT-sourced logs Mohsen Shirali et.al. 2409.03478 link
2024-09-05 How Much Data is Enough Data? Fine-Tuning Large Language Models for In-House Translation: Performance Evaluation Across Multiple Dataset Sizes Inacio Vieira et.al. 2409.03454 null
2024-09-04 RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version) Yao Mu et.al. 2409.02920 null
2024-09-04 Can LVLMs Obtain a Driver's License? A Benchmark Towards Reliable AGI for Autonomous Driving Yuhang Lu et.al. 2409.02914 null
2024-09-04 Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling Kaiwen Zheng et.al. 2409.02908 null
2024-09-05 LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA Jiajie Zhang et.al. 2409.02897 link
2024-09-04 LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture Xidong Wang et.al. 2409.02889 link
2024-09-04 CanvOI, an Oncology Intelligence Foundation Model: Scaling FLOPS Differently Jonathan Zalach et.al. 2409.02885 null
2024-09-04 Benchmarking Spurious Bias in Few-Shot Image Classifiers Guangtao Zheng et.al. 2409.02882 link
2024-09-04 Configurable Foundation Models: Building LLMs from a Modular Perspective Chaojun Xiao et.al. 2409.02877 null
2024-09-04 Historical German Text Normalization Using Type- and Token-Based Language Modeling Anton Ehrmanntraut et.al. 2409.02841 null
2024-09-04 Exploring Sentiment Dynamics and Predictive Behaviors in Cryptocurrency Discussions by Few-Shot Learning with Large Language Models Moein Shahiki Tash et.al. 2409.02836 null
2024-09-04 CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models Wentao Liu et.al. 2409.02834 null
2024-09-04 ExpLLM: Towards Chain of Thought for Facial Expression Recognition Xing Lan et.al. 2409.02828 null
2024-09-04 Design Contradictions: Help or Hindrance? Aron E. Owen et.al. 2409.02823 null
2024-09-04 Language Understanding as a Constraint on Consensus Size in LLM Societies Giordano De Marzo et.al. 2409.02822 null
2024-09-04 Towards a Unified View of Preference Learning for Large Language Models: A Survey Bofei Gao et.al. 2409.02795 link
2024-09-05 Pooling And Attention: What Are Effective Designs For LLM-Based Embedding Models? Yixuan Tang et.al. 2409.02727 link
2024-09-04 Pre-training data selection for biomedical domain adaptation using journal impact metrics Mathieu Laï-king et.al. 2409.02725 null
2024-09-04 Alignment-Aware Model Extraction Attacks on Large Language Models Zi Liang et.al. 2409.02718 link
2024-09-04 Creating a Gen-AI based Track and Trace Assistant MVP (SuperTracy) for PostNL Mohammad Reshadati et.al. 2409.02711 null
2024-09-04 LLM-Assisted Visual Analytics: Opportunities and Challenges Maeve Hutchinson et.al. 2409.02691 null
2024-08-30 SYNTHEVAL: Hybrid Behavioral Testing of NLP Models with Synthetic CheckLists Raoyuan Zhao et.al. 2408.17437 link
2024-08-30 DARES: Depth Anything in Robotic Endoscopic Surgery with Self-supervised Vector-LoRA of the Foundation Model Mona Sheikh Zeinoddin et.al. 2408.17433 link
2024-08-30 Advancing Multi-talker ASR Performance with Large Language Models Mohan Shi et.al. 2408.17431 null
2024-08-30 CLOCR-C: Context Leveraging OCR Correction with Pre-trained Language Models Jonathan Bourne et.al. 2408.17428 null
2024-09-03 Open-vocabulary Temporal Action Localization using VLMs Naoki Wake et.al. 2408.17422 null
2024-08-30 Getting Inspiration for Feature Elicitation: App Store- vs. LLM-based Approach Jialiang Wei et.al. 2408.17404 null
2024-08-30 EMPOWER: Embodied Multi-role Open-vocabulary Planning with Online Grounding and Execution Francesco Argenziano et.al. 2408.17379 null
2024-08-30 NDP: Next Distribution Prediction as a More Broad Target Junhao Ruan et.al. 2408.17377 null
2024-08-30 Assessing Generative Language Models in Classification Tasks: Performance and Self-Evaluation Capabilities in the Environmental and Climate Change Domain Francesca Grasso et.al. 2408.17362 link
2024-08-30 Forget to Flourish: Leveraging Machine-Unlearning on Pretrained Language Models for Privacy Leakage Md Rafi Ur Rashid et.al. 2408.17354 null
2024-09-02 LSMS: Language-guided Scale-aware MedSegmentor for Medical Image Referring Segmentation Shuyi Ouyang et.al. 2408.17347 null
2024-08-30 Investigating Neuron Ablation in Attention Heads: The Case for Peak Activation Centering Nicholas Pochinkov et.al. 2408.17322 link
2024-08-30 Bridging Domain Knowledge and Process Discovery Using Large Language Models Ali Norouzifar et.al. 2408.17316 link
2024-08-30 Flexible and Effective Mixing of Large Language Models into a Mixture of Domain Experts Rhui Dih Lee et.al. 2408.17280 null
2024-08-30 Joint Estimation and Prediction of City-wide Delivery Demand: A Large Language Model Empowered Graph-based Learning Approach Tong Nie et.al. 2408.17258 null
2024-08-30 VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series Forecasters Mouxiang Chen et.al. 2408.17253 link
2024-08-30 Improving Extraction of Clinical Event Contextual Properties from Electronic Health Records: A Comparative Study Shubham Agarwal et.al. 2408.17181 null
2024-08-30 Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model Zhen Ye et.al. 2408.17175 link
2024-08-30 Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning Xiaoye Qu et.al. 2408.17150 link
2024-08-30 Reasoning AI Performance Degradation in 6G Networks with Large Language Models Liming Huang et.al. 2408.17097 null
2024-08-29 PromptSmooth: Certifying Robustness of Medical Vision-Language Models via Prompt Learning Noor Hussein et.al. 2408.16769 link
2024-08-29 How Far Can Cantonese NLP Go? Benchmarking Cantonese Capabilities of Large Language Models Jiyue Jiang et.al. 2408.16756 null
2024-08-29 Reinforcement Learning without Human Feedback for Last Mile Fine-Tuning of Large Language Models Alec Solway et.al. 2408.16753 null
2024-08-29 A Gradient Analysis Framework for Rewarding Good and Penalizing Bad Examples in Language Models Yi-Lin Tuan et.al. 2408.16751 null
2024-08-29 Assessing Large Language Models for Online Extremism Research: Identification, Explanation, and New Knowledge Beidi Dong et.al. 2408.16749 null
2024-08-29 Theoretical and Methodological Framework for Studying Texts Produced by Large Language Models Jiří Milička et.al. 2408.16740 null
2024-08-29 Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling Hritik Bansal et.al. 2408.16737 null
2024-08-29 VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation Shiwei Wu et.al. 2408.16730 null
2024-08-30 Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming Zhifei Xie et.al. 2408.16725 link
2024-08-29 GradBias: Unveiling Word Influence on Bias in Text-to-Image Generative Models Moreno D'Incà et.al. 2408.16700 link
2024-08-29 Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity Ziniu Li et.al. 2408.16673 null
2024-08-29 Space3D-Bench: Spatial 3D Question Answering Benchmark Emilia Szymanska et.al. 2408.16662 null
2024-08-29 DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving Yongjie Fu et.al. 2408.16647 null
2024-08-29 Examination of Code generated by Large Language Models Robin Beer et.al. 2408.16601 link
2024-08-29 Enhancing Dialogue Generation in Werewolf Game Through Situation Analysis and Persuasion Strategies Zhiyang Qi et.al. 2408.16586 null
2024-08-29 WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling Shengpeng Ji et.al. 2408.16532 link
2024-08-29 CNIMA: A Universal Evaluation Framework and Automated Approach for Assessing Second Language Dialogues Rena Gao et.al. 2408.16518 link
2024-08-29 LLMs vs Established Text Augmentation Techniques for Classification: When do the Benefits Outweight the Costs? Jan Cegin et.al. 2408.16502 null
2024-08-29 CogVLM2: Visual Language Models for Image and Video Understanding Wenyi Hong et.al. 2408.16500 link
2024-08-29 A Survey on Evaluating Large Language Models in Code Generation Tasks Liguo Chen et.al. 2408.16498 null
2024-08-28 Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders Min Shi et.al. 2408.15998 link
2024-08-29 Spatio-Temporal Context Prompting for Zero-Shot Action Detection Wei-Jhe Huang et.al. 2408.15996 null
2024-08-28 Perceive-IR: Learning to Perceive Degradation Better for All-in-One Image Restoration Xu Zhang et.al. 2408.15994 null
2024-08-28 BattleAgentBench: A Benchmark for Evaluating Cooperation and Competition Capabilities of Language Models in Multi-Agent Systems Wei Wang et.al. 2408.15971 null
2024-08-28 More Text, Less Point: Towards 3D Data-Efficient Point-Language Understanding Yuan Tang et.al. 2408.15966 link
2024-08-28 Atari-GPT: Investigating the Capabilities of Multimodal Large Language Models as Low-Level Policies for Atari Games Nicholas R. Waytowich et.al. 2408.15950 null
2024-08-28 DeMoBot: Deformable Mobile Manipulation with Vision-based Sub-goal Retrieval Yuying Zhang et.al. 2408.15919 null
2024-08-28 Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models Yuncheng Yang et.al. 2408.15915 link
2024-08-28 Decentralized LLM Inference over Edge Networks with Energy Harvesting Aria Khoshsirat et.al. 2408.15907 null
2024-08-28 LLM-Based Multi-Hop Question Answering with Knowledge Graph Integration in Evolving Environments Ruirui Chen et.al. 2408.15903 null
2024-08-28 Nexus: Specialization meets Adaptability for Efficiently Training Mixture of Experts Nikolas Gritsch et.al. 2408.15901 null
2024-08-28 Bias in LLMs as Annotators: The Effect of Party Cues on Labelling Decision by Large Language Models Sebastian Vallejo Vera et.al. 2408.15895 null
2024-08-28 LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation Fangxun Shu et.al. 2408.15881 link
2024-08-28 Persuasion Games using Large Language Models Ganesh Prasath Ramani et.al. 2408.15879 null
2024-08-28 Retrieval-Augmented Instruction Tuning for Automated Process Engineering Calculations : A Tool-Chaining Problem-Solving Framework with Attributable Reflection Sagar Srinivas Sakhinana et.al. 2408.15866 null
2024-08-28 Benchmarking foundation models as feature extractors for weakly-supervised computational pathology Peter Neidlinger et.al. 2408.15823 null
2024-08-28 Visual Prompt Engineering for Medical Vision Language Models in Radiology Stefan Denner et.al. 2408.15802 null
2024-08-28 Scaling Up Summarization: Leveraging Large Language Models for Long Text Extractive Summarization Léo Hemamou et.al. 2408.15801 null
2024-08-28 Evaluating Named Entity Recognition Using Few-Shot Prompting with Large Language Models Hédi Zhegidi et.al. 2408.15796 link
2024-08-28 Efficient LLM Scheduling by Learning to Rank Yichao Fu et.al. 2408.15792 null
2024-08-27 Generative Verifiers: Reward Modeling as Next-Token Prediction Lunjun Zhang et.al. 2408.15240 null
2024-08-27 The Mamba in the Llama: Distilling and Accelerating Hybrid Models Junxiong Wang et.al. 2408.15237 link
2024-08-27 Into the Unknown Unknowns: Engaged Human Learning through Participation in Language Model Agent Conversations Yucheng Jiang et.al. 2408.15232 null
2024-08-27 LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks Yet Nathaniel Li et.al. 2408.15221 null
2024-08-27 Investigating Coverage Criteria in Large Language Models: An In-Depth Study Through Jailbreak Attacks Shide Zhou et.al. 2408.15207 null
2024-08-27 Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation Jian Hu et.al. 2408.15205 link
2024-08-27 Can Unconfident LLM Annotations Be Used for Confident Conclusions? Kristina Gligorić et.al. 2408.15204 link
2024-08-27 Infusing Acoustic Pause Context into Text-Based Dementia Assessment Franziska Braun et.al. 2408.15188 null
2024-08-27 Unlocking Potential in Pre-Trained Music Language Models for Versatile Multi-Track Music Arrangement Longshen Ou et.al. 2408.15176 null
2024-08-27 X-Reflect: Cross-Reflection Prompting for Multimodal Recommendation Hanjia Lyu et.al. 2408.15172 null
2024-08-27 Measuring text summarization factuality using atomic facts entailment metrics in the context of retrieval augmented generation N. E. Kriman et.al. 2408.15171 null
2024-08-27 How transformers learn structured data: insights from hierarchical filtering Jerome Garnier-Brun et.al. 2408.15138 null
2024-08-27 CLIP-AGIQA: Boosting the Performance of AI-Generated Image Quality Assessment with CLIP Zhenchen Tang et.al. 2408.15098 null
2024-08-27 Relation Also Knows: Rethinking the Recall and Editing of Factual Associations in Auto-Regressive Transformer Language Models Xiyu Liu et.al. 2408.15091 null
2024-08-27 BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline Guosheng Dong et.al. 2408.15079 null
2024-08-27 Constraining Participation: Affordances of Feedback Features in Interfaces to Large Language Models Ned Cooper et.al. 2408.15066 null
2024-08-27 The Benefits of Balance: From Information Projections to Variance Reduction Lang Liu et.al. 2408.15065 null
2024-08-28 DocLayLLM: An Efficient and Effective Multi-modal Extension of Large Language Models for Text-rich Document Understanding Wenhui Liao et.al. 2408.15045 null
2024-08-28 A Survey of Large Language Models for European Languages Wazir Ali et.al. 2408.15040 null
2024-08-27 Speech Recognition Transformers: Topological-lingualism Perspective Shruti Singh et.al. 2408.14991 null
2024-08-26 A Practitioner's Guide to Continual Multimodal Pretraining Karsten Roth et.al. 2408.14471 link
2024-08-27 Step-by-Step Unmasking for Parameter-Efficient Fine-tuning of Large Language Models Aradhye Agarwal et.al. 2408.14470 link
2024-08-26 Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos Qirui Chen et.al. 2408.14469 null
2024-08-26 Explicit Inductive Inference using Large Language Models Tianyang Liu et.al. 2408.14467 null
2024-08-26 Evaluating Large Language Models on Spatial Tasks: A Multi-Task Benchmarking Study Liuchang Xu Shuo Zhao et.al. 2408.14438 null
2024-08-26 Social perception of faces in a vision-language model Carina I. Hausladen et.al. 2408.14435 link
2024-08-26 CHARTOM: A Visual Theory-of-Mind Benchmark for Multimodal Large Language Models Shubham Bharti et.al. 2408.14419 null
2024-08-26 MEDSAGE: Enhancing Robustness of Medical Dialogue Summarization to ASR Errors with LLM-generated Synthetic Dialogues Kuluhan Binici et.al. 2408.14418 null
2024-08-26 Hyperdimensional Computing Empowered Federated Foundation Model over Wireless Networks for Metaverse Yahao Ding et.al. 2408.14416 null
2024-08-26 Language-specific Calibration for Pruning Multilingual Language Models Simon Kurz et.al. 2408.14398 null
2024-08-26 Reprogramming Foundational Large Language Models(LLMs) for Enterprise Adoption for Spatio-Temporal Forecasting Applications: Unveiling a New Era in Copilot-Guided Cross-Modal Time Series Representation Learning Sakhinana Sagar Srinivas et.al. 2408.14387 null
2024-08-26 Probing Causality Manipulation of Large Language Models Chenyang Zhang et.al. 2408.14380 link
2024-08-26 An Embedding is Worth a Thousand Noisy Labels Francesco Di Salvo et.al. 2408.14358 link
2024-08-26 SWE-bench-java: A GitHub Issue Resolving Benchmark for Java Daoguang Zan et.al. 2408.14354 link
2024-08-26 Assessing Contamination in Large Language Models: Introducing the LogProber method Nicolas Yax et.al. 2408.14352 null
2024-08-27 Foundation Models for Music: A Survey Yinghao Ma et.al. 2408.14340 link
2024-08-26 Claim Verification in the Age of Large Language Models: A Survey Alphaeus Dmonte et.al. 2408.14317 null
2024-08-26 LLM-3D Print: Large Language Models To Monitor and Control 3D Printing Yayati Jadhav et.al. 2408.14307 null
2024-08-26 Investigating the Effectiveness of Bayesian Spam Filters in Detecting LLM-modified Spam Mails Malte Josten et.al. 2408.14293 link
2024-08-26 Predictability and Causality in Spanish and English Natural Language Generation Andrea Busto-Castiñeira et.al. 2408.14283 null
2024-08-23 MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans? Yi-Fan Zhang et.al. 2408.13257 null
2024-08-23 Domain-specific long text classification from sparse relevant information Célia D'Cruz et.al. 2408.13253 null
2024-08-23 Foundational Model for Electron Micrograph Analysis: Instruction-Tuning Small-Scale Language-and-Vision Assistant for Enterprise Adoption Sakhinana Sagar Srinivas et.al. 2408.13248 null
2024-08-23 Multi-Layer Transformers Gradient Can be Approximated in Almost Linear Time Yingyu Liang et.al. 2408.13233 null
2024-08-23 EUR-USD Exchange Rate Forecasting Based on Information Fusion with Large Language Models and Deep Learning Methods Hongcheng Ding et.al. 2408.13214 null
2024-08-23 DOMAINEVAL: An Auto-Constructed Benchmark for Multi-Domain Code Generation Qiming Zhu et.al. 2408.13204 null
2024-08-23 Can LLM be a Good Path Planner based on Prompt Engineering? Mitigating the Hallucination for Path Planning Hourui Deng et.al. 2408.13184 null
2024-08-23 IntelliCare: Improving Healthcare Analysis with Variance-Controlled Patient-Level Knowledge from Large Language Models Zhihao Yu et.al. 2408.13073 link
2024-08-23 Guiding IoT-Based Healthcare Alert Systems with Large Language Models Yulan Gao et.al. 2408.13071 null
2024-08-23 SpeechPrompt: Prompting Speech Language Models for Speech Processing Tasks Kai-Wei Chang et.al. 2408.13040 null
2024-08-23 VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models Wentao Wu et.al. 2408.13031 link
2024-08-23 In-Context Learning with Reinforcement Learning for Incomplete Utterance Rewriting Haowei Du et.al. 2408.13028 null
2024-08-23 A Web-Based Solution for Federated Learning with LLM-Based Automation Chamith Mawela et.al. 2408.13010 null
2024-08-23 Systematic Evaluation of LLM-as-a-Judge in LLM Alignment Tasks: Explainable Metrics and Diverse Prompt Templates Hui Wei et.al. 2408.13006 link
2024-08-23 CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution Ruiyang Xu et.al. 2408.13001 null
2024-08-23 Open Llama2 Model for the Lithuanian Language Artūras Nakvosas et.al. 2408.12963 null
2024-08-23 Multimodal Contrastive In-Context Learning Yosuke Miyanishi et.al. 2408.12959 null
2024-08-23 Image Segmentation in Foundation Model Era: A Survey Tianfei Zhou et.al. 2408.12957 null
2024-08-23 E-code: Mastering Efficient Code Generation through Pretrained Models and Expert Encoder Group Yue Pan et.al. 2408.12948 null
2024-08-23 Causal-Guided Active Learning for Debiasing Large Language Models Zhouhao Sun et.al. 2408.12942 link
2024-08-22 Controllable Text Generation for Large Language Models: A Survey Xun Liang et.al. 2408.12599 link
2024-08-23 Non-Homophilic Graph Pre-Training and Prompt Learning Xingtong Yu et.al. 2408.12594 null
2024-08-22 RuleAlign: Making Large Language Models Better Physicians with Diagnostic Rule Alignment Xiaohan Wang et.al. 2408.12579 null
2024-08-22 MuMA-ToM: Multi-modal Multi-Agent Theory of Mind Haojun Shi et.al. 2408.12574 link
2024-08-22 Jamba-1.5: Hybrid Transformer-Mamba Models at Scale Jamba Team et.al. 2408.12570 null
2024-08-22 ssProp: Energy-Efficient Training for Convolutional Neural Networks with Scheduled Sparse Back Propagation Lujia Zhong et.al. 2408.12561 link
2024-08-22 Towards Evaluating and Building Versatile Large Language Models for Medicine Chaoyi Wu et.al. 2408.12547 link
2024-08-22 Show-o: One Single Transformer to Unify Multimodal Understanding and Generation Jinheng Xie et.al. 2408.12528 null
2024-08-22 MEDCO: Medical Education Copilots Based on A Multi-Agent Framework Hao Wei et.al. 2408.12496 null
2024-08-22 GenderCARE: A Comprehensive Framework for Assessing and Reducing Gender Bias in Large Language Models Kunsheng Tang et.al. 2408.12494 link
2024-08-23 Vintern-1B: An Efficient Multimodal Large Language Model for Vietnamese Khang T. Doan et.al. 2408.12480 null
2024-08-22 Frame Order Matters: A Temporal Sequence-Aware Model for Few-Shot Action Recognition Bozheng Li et.al. 2408.12475 null
2024-08-22 DLCRec: A Novel Approach for Managing Diversity in LLM-Based Recommender Systems Jiaju Chen et.al. 2408.12470 null
2024-08-22 Envisioning Class Entity Reasoning by Large Language Models for Few-shot Learning Mushui Liu et.al. 2408.12469 null
2024-08-22 Enhancing Multi-hop Reasoning through Knowledge Erasure in Large Language Model Editing Mengqi Zhang et.al. 2408.12456 null
2024-08-22 Positional Description for Numerical Normalization Deepanshu Gupta et.al. 2408.12430 null
2024-08-22 FlexEdit: Marrying Free-Shape Masks to VLLM for Flexible Image Editing Jue Wang et.al. 2408.12429 link
2024-08-22 Enhanced Infield Agriculture with Interpretable Machine Learning Approaches for Crop Classification Sudi Murindanyi et.al. 2408.12426 null
2024-08-22 Unlearning Trojans in Large Language Models: A Comparison Between Natural Language and Source Code Mahdi Kazemi et.al. 2408.12416 null
2024-08-22 Generalized SAM: Efficient Fine-Tuning of SAM for Variable Input Image Sizes Sota Kato et.al. 2408.12406 link
2024-08-21 Great Memory, Shallow Reasoning: Limits of $k$ NN-LMs Shangyi Geng et.al. 2408.11815 link
2024-08-21 SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs Yuanyang Yin et.al. 2408.11813 null
2024-08-21 EmbodiedSAM: Online Segment Any 3D Thing in Real Time Xiuwei Xu et.al. 2408.11811 null
2024-08-21 Approaching Deep Learning through the Spectral Dynamics of Weights David Yunis et.al. 2408.11804 link
2024-08-21 Story3D-Agent: Exploring 3D Storytelling Visualization with Large Language Models Yuzhou Huang et.al. 2408.11801 null
2024-08-21 PermitQA: A Benchmark for Retrieval Augmented Generation in Wind Siting and Permitting domain Rounak Meyur et.al. 2408.11800 null
2024-08-21 Practical token pruning for foundation models in few-shot conversational virtual assistant systems Haode Qi et.al. 2408.11799 null
2024-08-21 EE-MLLM: A Data-Efficient and Compute-Efficient Multimodal Large Language Model Feipeng Ma et.al. 2408.11795 null
2024-08-21 Leveraging Chemistry Foundation Models to Facilitate Structure Focused Retrieval Augmented Generation in Multi-Agent Workflows for Catalyst and Materials Design Nathaniel H. Park et.al. 2408.11793 null
2024-08-21 Critique-out-Loud Reward Models Zachary Ankner et.al. 2408.11791 link
2024-08-21 DreamFactory: Pioneering Multi-Scene Long Video Generation with a Multi-Agent Framework Zhifei Xie et.al. 2408.11788 null
2024-08-21 Personality Alignment of Large Language Models Minjun Zhu et.al. 2408.11779 link
2024-08-21 Leveraging Fine-Tuned Retrieval-Augmented Generation with Long-Context Support: For 3GPP Standards Omar Erak et.al. 2408.11775 link
2024-08-21 Against All Odds: Overcoming Typology, Script, and Language Confusion in Multilingual Embedding Inversion Attacks Yiyi Chen et.al. 2408.11749 link
2024-08-21 DH-Bench: Probing Depth and Height Perception of Large Visual-Language Models Shehreen Azad et.al. 2408.11748 link
2024-08-21 Open-Ended 3D Point Cloud Instance Segmentation Phuc D. A. Nguyen et.al. 2408.11747 null
2024-08-21 Mixed Sparsity Training: Achieving 4 $\times$ FLOP Reduction for Transformer Pretraining Pihe Hu et.al. 2408.11746 null
2024-08-21 FocusLLM: Scaling LLM's Context by Parallel Decoding Zhenyu Li et.al. 2408.11745 null
2024-08-21 MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models Elias Frantar et.al. 2408.11743 link
2024-08-21 CluMo: Cluster-based Modality Fusion Prompt for Continual Learning in Visual Question Answering Yuliang Cai et.al. 2408.11742 link
2024-08-20 Prompt-Guided Image-Adaptive Neural Implicit Lookup Tables for Interpretable Image Enhancement Satoshi Kosugi et.al. 2408.11055 link
2024-08-20 Revisiting VerilogEval: Newer LLMs, In-Context Learning, and Specification-to-RTL Tasks Nathaniel Pinckney et.al. 2408.11053 link
2024-08-20 FLAME: Learning to Navigate with Multimodal LLM in Urban Environments Yunzhe Xu et.al. 2408.11051 link
2024-08-21 MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding Jian Chen et.al. 2408.11049 link
2024-08-20 Inside the Black Box: Detecting Data Leakage in Pre-trained Language Encoders Yuan Xin et.al. 2408.11046 null
2024-08-20 Reconciling Methodological Paradigms: Employing Large Language Models as Novice Qualitative Research Assistants in Talent Management Research Sreyoshi Bhaduri et.al. 2408.11043 null
2024-08-20 Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model Chunting Zhou et.al. 2408.11039 null
2024-08-20 Scaling Law with Learning Rate Annealing Howe Tissue et.al. 2408.11029 null
2024-08-20 Athena: Safe Autonomous Agents with Verbal Contrastive Learning Tanmana Sadhu et.al. 2408.11021 null
2024-08-20 While GitHub Copilot Excels at Coding, Does It Ensure Responsible Output? Wen Cheng et.al. 2408.11006 link
2024-08-20 SenPa-MAE: Sensor Parameter Aware Masked Autoencoder for Multi-Satellite Self-Supervised Pretraining Jonathan Prexl et.al. 2408.11000 null
2024-08-20 CTP-LLM: Clinical Trial Phase Transition Prediction Using Large Language Models Michael Reinisch et.al. 2408.10995 null
2024-08-20 Dr.Academy: A Benchmark for Evaluating Questioning Capability in Education for Large Language Models Yuyan Chen et.al. 2408.10947 null
2024-08-20 Large Language Model Driven Recommendation Anton Korikov et.al. 2408.10946 null
2024-08-20 HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Models in Resource-Constrained Environments Kazi Hasan Ibn Arif et.al. 2408.10945 link
2024-08-20 SysBench: Can Large Language Models Follow System Messages? Yanzhao Qin et.al. 2408.10943 link
2024-08-20 Proxona: Leveraging LLM-Driven Personas to Enhance Creators' Understanding of Their Audience Yoonseo Choi et.al. 2408.10937 null
2024-08-21 LBC: Language-Based-Classifier for Out-Of-Variable Generalization Kangjun Noh et.al. 2408.10923 link
2024-08-21 BEYOND DIALOGUE: A Profile-Dialogue Alignment Framework Towards General Role-Playing Language Model Yeyong Yu et.al. 2408.10903 link
2024-08-20 Soda-Eval: Open-Domain Dialogue Evaluation in the age of LLMs John Mendonça et.al. 2408.10902 null
2024-08-19 SANER: Annotation-free Societal Attribute Neutralizer for Debiasing CLIP Yusuke Hirota et.al. 2408.10202 null
2024-08-19 Demystifying the Communication Characteristics for Distributed Transformer Models Quentin Anthony et.al. 2408.10197 null
2024-08-19 Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models Aviv Bick et.al. 2408.10189 null
2024-08-19 LongVILA: Scaling Long-Context Visual Language Models for Long Videos Fuzhao Xue et.al. 2408.10188 link
2024-08-19 SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models Anke Tang et.al. 2408.10174 link
2024-08-19 Customizing Language Models with Instance-wise LoRA for Sequential Recommendation Xiaoyu Kong et.al. 2408.10159 null
2024-08-19 Multilingual Needle in a Haystack: Investigating Long-Context Behavior of Multilingual Large Language Models Amey Hengle et.al. 2408.10151 link
2024-08-19 In-Context Learning with Representations: Contextual Generalization of Trained Transformers Tong Yang et.al. 2408.10147 null
2024-08-19 Instruction Finetuning for Leaderboard Generation from Empirical AI Research Salomon Kabongo et.al. 2408.10141 null
2024-08-19 Rhyme-aware Chinese lyric generator based on GPT Yixiao Yuan et.al. 2408.10130 null
2024-08-19 Video Object Segmentation via SAM 2: The 4th Solution for LSVOS Challenge VOS Track Feiyu Pan et.al. 2408.10125 null
2024-08-19 Molecular Graph Representation Learning Integrating Large Language Models with Domain-specific Small Models Tianyu Zhang et.al. 2408.10124 link
2024-08-19 Geometry Informed Tokenization of Molecules for Language Model Generation Xiner Li et.al. 2408.10120 null
2024-08-19 GLIMMER: Incorporating Graph and Lexical Features in Unsupervised Multi-Document Summarization Ran Liu et.al. 2408.10115 link
2024-08-20 PLUTUS: A Well Pre-trained Large Unified Transformer can Unveil Financial Time Series Regularities Yuanjian Xu et.al. 2408.10111 null
2024-08-19 ARMADA: Attribute-Based Multimodal Data Augmentation Xiaomeng Jin et.al. 2408.10086 null
2024-08-19 Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning Sriyash Poddar et.al. 2408.10075 null
2024-08-19 FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis Assistant Zhengchao Huang et.al. 2408.10072 null
2024-08-19 Privacy Checklist: Privacy Violation Detection Grounding on Contextual Integrity Theory Haoran Li et.al. 2408.10053 null
2024-08-19 Defense Priorities in the Open-Source AI Debate: A Preliminary Assessment Masao Dahlgren et.al. 2408.10026 null
2024-08-16 SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image Segmentation Xinyu Xiong et.al. 2408.08870 link
2024-08-16 PEDAL: Enhancing Greedy Decoding with Large Language Models using Diverse Exemplars Sumanth Prabhu et.al. 2408.08869 null
2024-08-16 A Hassle-free Algorithm for Private Learning in Practice: Don't Use Tree Aggregation, Use BLTs H. Brendan McMahan et.al. 2408.08868 null
2024-08-16 Visual Agents as Fast and Slow Thinkers Guangyan Sun et.al. 2408.08862 link
2024-08-16 DPA: Dual Prototypes Alignment for Unsupervised Adaptation of Vision-Language Models Eman Ali et.al. 2408.08855 null
2024-08-16 GeoTransformer: Enhancing Urban Forecasting with Geospatial Attention Mechanisms Yuhao Jia et.al. 2408.08852 null
2024-08-16 ECG-Chat: A Large ECG-Language Model for Cardiac Disease Diagnosis Yubao Zhao et.al. 2408.08849 null
2024-08-16 PsychoLex: Unveiling the Psychological Mind of Large Language Models Mohammad Amin Abbasi et.al. 2408.08848 null
2024-08-16 FLEXTAF: Enhancing Table Reasoning with Flexible Tabular Formats Xuanliang Zhang et.al. 2408.08841 link
2024-08-16 EasyRec: Simple yet Effective Language Models for Recommendation Xubin Ren et.al. 2408.08821 link
2024-08-16 Retrieval-augmented Few-shot Medical Image Segmentation with Foundation Models Lin Zhao et.al. 2408.08813 null
2024-08-16 Artificial Intelligence and Strategic Decision-Making: Evidence from Entrepreneurs and Investors Felipe A. Csaszar et.al. 2408.08811 null
2024-08-16 Constructing Domain-Specific Evaluation Sets for LLM-as-a-judge Ravi Raju et.al. 2408.08808 null
2024-08-16 CIKMar: A Dual-Encoder Approach to Prompt-Based Reranking in Educational Dialogue Systems Joanito Agili Lopo et.al. 2408.08805 null
2024-08-16 A Disease-Specific Foundation Model Using Over 100K Fundus Images: Release and Validation for Abnormality and Multi-Disease Classification on Downstream Tasks Boa Jang et.al. 2408.08790 link
2024-08-16 EmoDynamiX: Emotional Support Dialogue Strategy Prediction by Modelling MiXed Emotions and Discourse Dynamics Chenwei Wan et.al. 2408.08782 link
2024-08-16 Large Language Models Might Not Care What You Are Saying: Prompt Format Beats Descriptions Chenming Tang et.al. 2408.08780 null
2024-08-16 DAC: Decomposed Automation Correction for Text-to-SQL Dingzirui Wang et.al. 2408.08779 link
2024-08-16 Lower Layer Matters: Alleviating Hallucination via Multi-Layer Fusion Contrastive Decoding with Truthfulness Refocused Dingwei Chen et.al. 2408.08769 null
2024-08-16 Rethinking Generative Semantic Communication for Multi-User Systems with Multi-Modal LLM Wanting Yang et.al. 2408.08765 null
2024-08-15 Can Large Language Models Understand Symbolic Graphics Programs? Zeju Qiu et.al. 2408.08313 null
2024-08-15 ScalingFilter: Assessing Data Quality through Inverse Utilization of Scaling Laws Ruihang Li et.al. 2408.08310 null
2024-08-15 Towards Flexible Visual Relationship Segmentation Fangrui Zhu et.al. 2408.08305 null
2024-08-15 Benchmarking the Capabilities of Large Language Models in Transportation System Engineering: Accuracy, Consistency, and Reasoning Behaviors Usman Syed et.al. 2408.08302 null
2024-08-15 VLPG-Nav: Object Navigation Using Visual Language Pose Graph and Object Localization Probability Maps Senthil Hariharan Arul et.al. 2408.08301 null
2024-08-15 HELP: Hierarchical Embeddings-based Log Parsing Andy Xu et.al. 2408.08300 null
2024-08-15 The ShareLM Collection and Plugin: Contributing Human-Model Chats for the Benefit of the Community Shachar Don-Yehiya et.al. 2408.08291 null
2024-08-15 Autonomous Behavior Planning For Humanoid Loco-manipulation Through Grounded Language Model Jin Wang et.al. 2408.08282 null
2024-08-15 BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts Qizhen Zhang et.al. 2408.08274 null
2024-08-15 DaRec: A Disentangled Alignment Framework for Large Language Model and Recommender System Xihong Yang et.al. 2408.08231 null
2024-08-15 RED-CT: A Systems Design Methodology for Using LLM-labeled Data to Train and Deploy Edge Classifiers for Computational Social Science David Farr et.al. 2408.08217 null
2024-08-15 Does Reasoning Emerge? Examining the Probabilities of Causation in Large Language Models Javier González et.al. 2408.08210 null
2024-08-15 LLM4DSR: Leveraing Large Language Model for Denoising Sequential Recommendation Bohao Wang et.al. 2408.08208 null
2024-08-15 Heavy Labels Out! Dataset Distillation with Label Space Lightening Ruonan Yu et.al. 2408.08201 null
2024-08-15 Scaling Up Natural Language Understanding for Multi-Robots Through the Lens of Hierarchy Shaojun Xu et.al. 2408.08188 null
2024-08-15 General-purpose Clothes Manipulation with Semantic Keypoints Yuhong Deng et.al. 2408.08160 null
2024-08-15 EmBARDiment: an Embodied AI Agent for Productivity in XR Riccardo Bovo et.al. 2408.08158 null
2024-08-15 DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search Huajian Xin et.al. 2408.08152 link
2024-08-15 P/D-Serve: Serving Disaggregated Large Language Model at Scale Yibo Jin et.al. 2408.08147 null
2024-08-15 KOALA: Enhancing Speculative Decoding for LLM via Multi-Layer Draft Heads with Adversarial Learning Kaiqi Zhang et.al. 2408.08146 null
2024-08-14 The Death of Schema Linking? Text-to-SQL in the Age of Well-Reasoned Language Models Karime Maamari et.al. 2408.07702 null
2024-08-15 Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities Enneng Yang et.al. 2408.07666 link
2024-08-14 Spoken Stereoset: On Evaluating Social Bias Toward Speaker in Speech Large Language Models Yi-Cheng Lin et.al. 2408.07665 link
2024-08-14 Alignment-Enhanced Decoding:Defending via Token-Level Adaptive Refining of Probability Distributions Quan Liu et.al. 2408.07663 link
2024-08-14 WeKnow-RAG: An Adaptive Approach for Retrieval-Augmented Generation Integrating Web Search and Knowledge Graphs Weijian Xie et.al. 2408.07611 null
2024-08-14 Transformers and Large Language Models for Efficient Intrusion Detection Systems: A Comprehensive Survey Hamza Kheddar et.al. 2408.07583 null
2024-08-15 MathScape: Evaluating MLLMs in multimodal Math Scenarios through a Hierarchical Benchmark Minxuan Zhou et.al. 2408.07543 link
2024-08-15 Usefulness of data flow diagrams and large language models for security threat validation: a registered report Winnie Bahati Mbaka et.al. 2408.07537 null
2024-08-14 Development of a Multi-Agent Clinical Decision Support System for Korean Triage and Acuity Scale (KTAS)-Based Triage and Treatment Planning in Emergency Departments Seungjun Han et.al. 2408.07531 null
2024-08-14 Large Language Models Know What Makes Exemplary Contexts Quanyu Long et.al. 2408.07505 null
2024-08-14 Cross-Platform Video Person ReID: A New Benchmark Dataset and Adaptation Approach Shizhou Zhang et.al. 2408.07500 link
2024-08-14 QirK: Question Answering via Intermediate Representation on Knowledge Graphs Jan Luca Scheerer et.al. 2408.07494 null
2024-08-14 Training Overhead Ratio: A Practical Reliability Metric for Large Language Model Training Systems Ning Lu et.al. 2408.07482 null
2024-08-14 Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization Yuxin Jiang et.al. 2408.07471 null
2024-08-14 Domain-invariant Representation Learning via Segment Anything Model for Blood Cell Classification Yongcheng Li et.al. 2408.07467 link
2024-08-14 Large Language Models Prompting With Episodic Memory Dai Do et.al. 2408.07465 null
2024-08-14 From Brazilian Portuguese to European Portuguese João Sanches et.al. 2408.07457 null
2024-08-14 Fact or Fiction? Improving Fact Verification with Knowledge Graphs through Simplified Subgraph Retrievals Tobias A. Opsahl et.al. 2408.07453 link
2024-08-15 BAPLe: Backdoor Attacks on Medical Foundational Models using Prompt Learning Asif Hanif et.al. 2408.07440 link
2024-08-14 Beyond Inter-Item Relations: Dynamic Adaptive Mixture-of-Experts for LLM-Based Sequential Recommendation CanYi Liu et.al. 2408.07427 null
2024-08-13 Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents Kexun Zhang et.al. 2408.07060 null
2024-08-13 LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs Yushi Bai et.al. 2408.07055 link
2024-08-13 Casper: Prompt Sanitization for Protecting User Privacy in Web-Based Large Language Models Chun Jie Chong et.al. 2408.07004 null
2024-08-13 LLMs can Schedule Henrik Abgaryan et.al. 2408.06993 link
2024-08-13 DyG-Mamba: Continuous State Space Modeling on Dynamic Graphs Dongyuan Li et.al. 2408.06966 null
2024-08-13 Towards Holistic Disease Risk Prediction using Small Language Models Liv Björkdahl et.al. 2408.06943 null
2024-08-13 OpenResearcher: Unleashing AI for Accelerated Scientific Research Yuxiang Zheng et.al. 2408.06941 link
2024-08-13 The advantages of context specific language models: the case of the Erasmian Language Model João Gonçalves et.al. 2408.06931 link
2024-08-13 Evaluating Cultural Adaptability of a Large Language Model via Simulation of Synthetic Personas Louis Kwok et.al. 2408.06929 link
2024-08-13 SceneGPT: A Language Model for 3D Scene Understanding Shivam Chandhok et.al. 2408.06926 null
2024-08-13 Re-TASK: Revisiting LLM Tasks from Capability, Skill, and Knowledge Perspectives Zhihu Wang et.al. 2408.06904 null
2024-08-13 Leveraging Language Models for Emotion and Behavior Analysis in Education Kaito Tanaka et.al. 2408.06874 null
2024-08-13 LoRA $^2$ : Multi-Scale Low-Rank Approximations for Fine-Tuning Large Language Models Jia-Chen Zhang et.al. 2408.06854 null
2024-08-13 Causal Agent based on Large Language Model Kairong Han et.al. 2408.06849 link
2024-08-13 DracoGPT: Extracting Visualization Design Preferences from Large Language Models Huichen Will Wang et.al. 2408.06845 null
2024-08-13 How Aligned are Human Chart Takeaways and LLM Predictions? A Case Study on Bar Charts with Varying Layouts Huichen Will Wang et.al. 2408.06837 null
2024-08-13 Efficient Search for Customized Activation Functions with Gradient Descent Lukas Strack et.al. 2408.06820 link
2024-08-13 MAQA: Evaluating Uncertainty Quantification in LLMs Regarding Data Uncertainty Yongjin Yang et.al. 2408.06816 null
2024-08-13 HLSPilot: LLM-based High-Level Synthesis Chenwei Xiong et.al. 2408.06810 link
2024-08-13 Layerwise Recurrent Router for Mixture-of-Experts Zihan Qiu et.al. 2408.06793 link
2024-08-12 FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection Yufei Huang et.al. 2408.06333 link
2024-08-12 Animate, or Inanimate, That is the Question for Large Language Models Leonardo Ranaldi et.al. 2408.06332 null
2024-08-12 Can We Rely on LLM Agents to Draft Long-Horizon Plans? Let's Take TravelPlanner as an Example Yanan Chen et.al. 2408.06318 null
2024-08-12 Long-Form Answers to Visual Questions from Blind and Low Vision People Mina Huh et.al. 2408.06303 null
2024-08-12 The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Chris Lu et.al. 2408.06292 link
2024-08-12 MovieSum: An Abstractive Summarization Dataset for Movie Screenplays Rohit Saxena et.al. 2408.06281 link
2024-08-13 Review-driven Personalized Preference Reasoning with Large Language Models for Recommendation Jieyong Kim et.al. 2408.06276 null
2024-08-13 FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data Haoran Sun et.al. 2408.06273 link
2024-08-12 A RAG-Based Question-Answering Solution for Cyber-Attack Investigation and Attribution Sampath Rajapaksha et.al. 2408.06272 null
2024-08-12 Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment Karel D'Oosterlinck et.al. 2408.06266 link
2024-08-12 Context-aware Visual Storytelling with Visual Prefix Tuning and Contrastive Learning Yingjin Song et.al. 2408.06259 null
2024-08-12 On Effects of Steering Latent Representation for Large Language Model Unlearning Dang Huu-Tien et.al. 2408.06223 null
2024-08-12 Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers Zhenting Qi et.al. 2408.06195 link
2024-08-12 FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework Lukas Meyer et.al. 2408.06190 link
2024-08-12 Improving Structural Diversity of Blackbox LLMs via Chain-of-Specification Prompting Halley Young et.al. 2408.06186 null
2024-08-12 OmniCLIP: Adapting CLIP for Video Recognition with Spatial-Temporal Omni-Scale Feature Learning Mushui Liu et.al. 2408.06158 link
2024-08-12 LipidBERT: A Lipid Language Model Pre-trained on METiS de novo Lipid Library Tianhao Yu et.al. 2408.06150 null
2024-08-12 Self-Supervised Learning on MeerKAT Wide-Field Continuum Images Erica Lastufka et.al. 2408.06147 link
2024-08-12 Med42-v2: A Suite of Clinical LLMs Clément Christophe et.al. 2408.06142 null
2024-08-12 Utilize Transformers for translating Wikipedia category names Hoang-Thang Ta et.al. 2408.06124 null
2024-08-10 Preserving Privacy in Large Language Models: A Survey on Current Threats and Solutions Michele Miranda et.al. 2408.05212 link
2024-08-09 VITA: Towards Open-Source Interactive Omni Multimodal LLM Chaoyou Fu et.al. 2408.05211 null
2024-08-09 Evaluating the capability of large language models to personalize science texts for diverse middle-school-age learners Michael Vaccaro Jr et.al. 2408.05204 null
2024-08-09 TaSL: Task Skill Localization and Consolidation for Language Model Continual Learning Yujie Feng et.al. 2408.05200 null
2024-08-09 ECG-FM: An Open Electrocardiogram Foundation Model Kaden McKeen et.al. 2408.05178 link
2024-08-09 Weak-Annotation of HAR Datasets using Vision Foundation Models Marius Bock et.al. 2408.05169 link
2024-08-09 AttackER: Towards Enhancing Cyber-Attack Attribution with a Named Entity Recognition Dataset Pritam Deka et.al. 2408.05149 null
2024-08-09 A Hybrid RAG System with Comprehensive Enhancement on Complex Reasoning Ye Yuan et.al. 2408.05141 null
2024-08-09 Is ChatGPT a Good Software Librarian? An Exploratory Study on the Use of ChatGPT for Software Library Recommendations Jasmine Latendresse et.al. 2408.05128 null
2024-08-09 Large Language Models and Thematic Analysis: Human-AI Synergy in Researching Hate Speech on Social Media Petre Breazu et.al. 2408.05126 null
2024-08-09 Sportify: Question Answering with Embedded Visualizations and Personified Narratives for Sports Video Chunggi Lee et.al. 2408.05123 null
2024-08-09 A Survey of NL2SQL with Large Language Models: Where are we, and where are we going? Xinyu Liu et.al. 2408.05109 link
2024-08-09 Depth Helps: Improving Pre-trained RGB-based Policy with Depth Information Injection Xincheng Pang et.al. 2408.05107 null
2024-08-09 How Well Do LLMs Identify Cultural Unity in Diversity? Jialin Li et.al. 2408.05102 link
2024-08-09 Hyperbolic Learning with Multimodal Large Language Models Paolo Mandica et.al. 2408.05097 null
2024-08-09 Unlocking Decoding-time Controllability: Gradient-Free Multi-Objective Alignment with Contrastive Prompts Tingchen Fu et.al. 2408.05094 null
2024-08-09 Order Matters in Hallucination: Reasoning Order as Benchmark and Reflexive Prompting for Large-Language-Models Zikai Xie et.al. 2408.05093 link
2024-08-09 Generating novel experimental hypotheses from language models: A case study on cross-dative generalization Kanishka Misra et.al. 2408.05086 link
2024-08-09 RT-Surv: Improving Mortality Prediction After Radiotherapy with Large Language Model Structuring of Large-Scale Unstructured Electronic Health Records Sangjoon Park et.al. 2408.05074 null
2024-08-09 Examining the Behavior of LLM Architectures Within the Framework of Standardized National Exams in Brazil Marcelo Sartori Locatelli et.al. 2408.05035 null
2024-08-08 Better Alignment with Instruction Back-and-Forth Translation Thao Nguyen et.al. 2408.04614 null
2024-08-08 Code-switching in text and speech reveals information-theoretic audience design Debasmita Bhattacharya et.al. 2408.04596 null
2024-08-09 Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models Qirui Jiao et.al. 2408.04594 link
2024-08-08 Towards Resilient and Efficient LLMs: A Comparative Study of Efficiency, Performance, and Adversarial Robustness Xiaojing Fan et.al. 2408.04585 null
2024-08-08 SAM2-Adapter: Evaluating & Adapting Segment Anything 2 in Downstream Tasks: Camouflage, Shadow, Medical Image Segmentation, and More Tianrun Chen et.al. 2408.04579 null
2024-08-08 SCENE: Evaluating Explainable AI Techniques Using Soft Counterfactuals Haoran Zheng et.al. 2408.04575 null
2024-08-08 Learning Fine-Grained Grounded Citations for Attributed Large Language Models Lei Huang et.al. 2408.04568 link
2024-08-08 Bias-Aware Low-Rank Adaptation: Mitigating Catastrophic Inheritance of Large Language Models Yupeng Chang et.al. 2408.04556 link
2024-08-08 Depth Any Canopy: Leveraging Depth Foundation Models for Canopy Height Estimation Daniele Rege Cambrin et.al. 2408.04523 link
2024-08-08 Compromesso! Italian Many-Shot Jailbreaks Undermine the Safety of Large Language Models Fabio Pernisi et.al. 2408.04522 null
2024-08-08 What You Need is What You Get: Theory of Mind for an LLM-Based Code Understanding Assistant Jonan Richards et.al. 2408.04477 null
2024-08-08 Can LLMs Beat Humans in Debating? A Dynamic Multi-agent Framework for Competitive Debate Yiqun Zhang et.al. 2408.04472 link
2024-08-08 RiskAwareBench: Towards Evaluating Physical Risk Awareness for High-level Planning of LLM-based Embodied Agents Zihao Zhu et.al. 2408.04449 null
2024-08-08 Large Language Models for cross-language code clone detection Micheline Bénédicte Moumoula et.al. 2408.04430 null
2024-08-08 Recognizing Emotion Regulation Strategies from Human Behavior with Large Language Models Philipp Müller et.al. 2408.04420 null
2024-08-08 Enhancing Robustness of Retrieval-Augmented Language Models with In-Context Learning Seong-Il Park et.al. 2408.04414 null
2024-08-08 Deeploy: Enabling Energy-Efficient Deployment of Small Language Models On Heterogeneous Microcontrollers Moritz Scherer et.al. 2408.04413 null
2024-08-08 Exploring Reasoning Biases in Large Language Models Through Syllogism: Insights from the NeuBAROCO Dataset Kentaro Ozeki et.al. 2408.04403 link
2024-08-08 Automated Educational Question Generation at Different Bloom's Skill Levels using Large Language Models: Strategies and Evaluation Nicy Scaria et.al. 2408.04394 null
2024-08-08 Open-domain Implicit Format Control for Large Language Model Generation Yiqun Yao et.al. 2408.04392 link
2024-08-07 How Well Can Vision Language Models See Image Details? Chenhui Gou et.al. 2408.03940 null
2024-08-07 SLIM-RAFT: A Novel Fine-Tuning Approach to Improve Cross-Linguistic Performance for Mercosur Common Nomenclature Vinícius Di Oliveira et.al. 2408.03936 null
2024-08-07 CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases Xiangyan Liu et.al. 2408.03910 link
2024-08-07 Decoding Biases: Automated Methods and LLM Judges for Gender Bias Detection in Language Models Shachi H Kumar et.al. 2408.03907 null
2024-08-07 Speech-MASSIVE: A Multilingual Speech Dataset for SLU and Beyond Beomseok Lee et.al. 2408.03900 link
2024-08-07 Simplifying Scholarly Abstracts for Accessible Digital Libraries Haining Wang et.al. 2408.03899 link
2024-08-07 From Data to Story: Towards Automatic Animated Data Video Creation with LLM-based Multi-Agent Systems Leixian Shen et.al. 2408.03876 null
2024-08-07 PackMamba: Efficient Processing of Variable-Length Sequences in Mamba training Haoran Xu et.al. 2408.03865 null
2024-08-07 GAIA -- A Large Language Model for Advanced Power Dispatch Yuheng Cheng et.al. 2408.03847 null
2024-08-07 MaxMind: A Memory Loop Network to Enhance Software Productivity based on Large Language Models Yuchen Dong et.al. 2408.03841 null
2024-08-07 WalledEval: A Comprehensive Safety Evaluation Toolkit for Large Language Models Prannaya Gupta et.al. 2408.03837 link
2024-08-07 Target Prompting for Information Extraction with Vision Language Model Dipankar Medhi et.al. 2408.03834 null
2024-08-07 Leveraging Variation Theory in Counterfactual Data Augmentation for Optimized Active Learning Simret Araya Gebreegziabher et.al. 2408.03819 null
2024-08-07 Generative Language Models with Retrieval Augmented Generation for Automated Short Answer Scoring Zifan Wang et.al. 2408.03811 null
2024-08-07 'Finance Wizard' at the FinLLM Challenge Task: Financial Text Summarization Meisin Lee et.al. 2408.03762 null
2024-08-07 MMSummary: Multimodal Summary Generation for Fetal Ultrasound Video Xiaoqing Guo et.al. 2408.03761 null
2024-08-07 Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation Jingjing Xie et.al. 2408.03735 link
2024-08-07 Question Rephrasing for Quantifying Uncertainty in Large Language Models: Applications in Molecular Chemistry Tasks Zizhang Chen et.al. 2408.03732 null
2024-08-07 A Convex-optimization-based Layer-wise Post-training Pruner for Large Language Models Pengxiang Zhao et.al. 2408.03728 null
2024-08-07 Local Topology Measures of Contextual Language Model Latent Spaces With Applications to Dialogue Term Extraction Benjamin Matthias Ruppik et.al. 2408.03706 null
2024-08-06 CoverBench: A Challenging Benchmark for Complex Claim Verification Alon Jacovi et.al. 2408.03325 null
2024-08-06 Segment Anything in Medical Images and Videos: Benchmark and Deployment Jun Ma et.al. 2408.03322 link
2024-08-06 TextIM: Part-aware Interactive Motion Synthesis from Text Siyuan Fan et.al. 2408.03302 null
2024-08-06 KaPO: Knowledge-aware Preference Optimization for Controllable Knowledge Selection in Retrieval-Augmented Language Models Ruizhe Zhang et.al. 2408.03297 null
2024-08-06 Biomedical SAM 2: Segment Anything in Biomedical Images and Videos Zhiling Yan et.al. 2408.03286 null
2024-08-07 StructEval: Deepen and Broaden Large Language Model Assessment via Structured Evaluation Boxi Cao et.al. 2408.03281 link
2024-08-06 Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments Angie Boggust et.al. 2408.03274 null
2024-08-06 Synthesizing Text-to-SQL Data from Weak and Strong LLMs Jiaxi Yang et.al. 2408.03256 null
2024-08-06 Unveiling Factual Recall Behaviors of Large Language Models through Knowledge Neurons Yifei Wang et.al. 2408.03247 null
2024-08-06 Making Long-Context Language Models Better Multi-Hop Reasoners Yanyang Li et.al. 2408.03246 link
2024-08-06 Leveraging Parameter Efficient Training Methods for Low Resource Text Classification: A Case Study in Marathi Pranita Deshmukh et.al. 2408.03172 null
2024-08-06 Conditioning LLMs with Emotion in Neural Machine Translation Charles Brazier et.al. 2408.03150 null
2024-08-06 Leveraging Entity Information for Cross-Modality Correlation Learning: The Entity-Guided Multimodal Summarization Yanghai Zhang et.al. 2408.03149 link
2024-08-06 Inference Optimizations for Large Language Models: Effects, Challenges, and Practical Considerations Leo Donisch et.al. 2408.03130 null
2024-08-06 Lisbon Computational Linguists at SemEval-2024 Task 2: Using A Mistral 7B Model and Data Augmentation Artur Guimarães et.al. 2408.03127 link
2024-08-06 Evaluating the Translation Performance of Large Language Models Based on Euas-20 Yan Huang et.al. 2408.03119 null
2024-08-06 Topic Modeling with Fine-tuning LLMs and Bag of Sentences Johannes Schneider et.al. 2408.03099 link
2024-08-07 TestART: Improving LLM-based Unit Test via Co-evolution of Automated Generation and Repair Iteration Siqi Gu et.al. 2408.03095 null
2024-08-06 500xCompressor: Generalized Prompt Compression for Large Language Models Zongqian Li et.al. 2408.03094 link
2024-08-06 Extend Model Merging from Fine-Tuned to Pre-Trained Large Language Models via Weight Disentanglement Le Yu et.al. 2408.03092 link
2024-08-05 Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining Dongyang Liu et.al. 2408.02657 link
2024-08-05 Can Reinforcement Learning Unlock the Hidden Dangers in Aligned Large Language Models? Mohammad Bahrami Karkevandi et.al. 2408.02651 null
2024-08-05 Command-line Obfuscation Detection using Small Language Models Vojtech Outrata et.al. 2408.02637 null
2024-08-05 SEAS: Self-Evolving Adversarial Safety Optimization for Large Language Models Muxi Diao et.al. 2408.02632 null
2024-08-05 Language Model Can Listen While Speaking Ziyang Ma et.al. 2408.02622 null
2024-08-05 Progressively Selective Label Enhancement for Language Model Alignment Biao Liu et.al. 2408.02599 null
2024-08-05 Modelling Visual Semantics via Image Captioning to extract Enhanced Multi-Level Cross-Modal Semantic Incongruity Representation with Attention for Multimodal Sarcasm Detection Sajal Aggarwal et.al. 2408.02595 null
2024-08-05 Leveraging the Power of LLMs: A Fine-Tuning Approach for High-Quality Aspect-Based Summarization Ankan Mullick et.al. 2408.02584 null
2024-08-05 DanModCap: Designing a Danmaku Moderation Tool for Video-Sharing Platforms that Leverages Impact Captions Siying Hu et.al. 2408.02574 null
2024-08-05 Evaluating and Enhancing LLMs Agent based on Theory of Mind in Guandan: A Multi-Player Cooperative Game under Imperfect Information Yauwai Yim et.al. 2408.02559 null
2024-08-05 Generative AI as a Service in 6G Edge-Cloud: Generation Task Offloading by In-context Learning Hao Zhou et.al. 2408.02549 null
2024-08-05 RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation Daniel Fleischer et.al. 2408.02545 link
2024-08-05 Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions Xinbei Ma et.al. 2408.02544 link
2024-08-05 Towards Coarse-grained Visual Language Navigation Task Planning Enhanced by Event Knowledge Graph Zhao Kaichen et.al. 2408.02535 null
2024-08-05 Practical Attacks against Black-box Code Completion Engines Slobodan Jenko et.al. 2408.02509 null
2024-08-05 UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model Zhaowei Li et.al. 2408.02503 link
2024-08-05 Context Conquers Parameters: Outperforming Proprietary LLM in Commit Message Generation Aaron Imani et.al. 2408.02502 null
2024-08-05 A First Look at License Compliance Capability of LLMs in Code Generation Weiwei Xu et.al. 2408.02487 link
2024-08-05 Exploring Conditional Multi-Modal Prompts for Zero-shot HOI Detection Ting Lei et.al. 2408.02484 link
2024-08-05 From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future Haolin Jin et.al. 2408.02479 null
2024-08-02 Prompt Recursive Search: A Living Framework with Adaptive Growth in LLM Auto-Prompting Xiangyu Zhao et.al. 2408.01423 null
2024-08-02 Mission Impossible: A Statistical Perspective on Jailbreaking LLMs Jingtong Su et.al. 2408.01420 null
2024-08-02 DebateQA: Evaluating Question Answering on Debatable Knowledge Rongwu Xu et.al. 2408.01419 link
2024-08-02 Talk Less, Interact Better: Evaluating In-context Conversational Adaptation in Multimodal LLMs Yilun Hua et.al. 2408.01417 null
2024-08-02 Pre-trained Language Models Improve the Few-shot Prompt Ability of Decision Transformer Yu Yang et.al. 2408.01402 null
2024-08-02 Coalitions of Large Language Models Increase the Robustness of AI Agents Prattyush Mangal et.al. 2408.01380 null
2024-08-02 Toward Automatic Relevance Judgment using Vision--Language Models for Image--Text Retrieval Evaluation Jheng-Hong Yang et.al. 2408.01363 null
2024-08-02 Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed Inputs Peng Ding et.al. 2408.01355 link
2024-08-02 MCGMark: An Encodable and Robust Online Watermark for LLM-Generated Malicious Code Kaiwen Ning et.al. 2408.01354 link
2024-08-02 Prompt Refinement or Fine-tuning? Best Practices for using LLMs in Computational Social Science Tasks Anders Giovanni Møller et.al. 2408.01346 null
2024-08-02 MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language Models Benno Weck et.al. 2408.01337 link
2024-08-02 A Backbone for Long-Horizon Robot Task Understanding Xiaoshuai Chen et.al. 2408.01334 null
2024-08-02 FANNO: Augmenting High-Quality Instruction Data with Open-Sourced LLMs Only He Zhu et.al. 2408.01323 null
2024-08-02 A Comprehensive Review of Multimodal Large Language Models: Performance and Challenges Across Different Tasks Jiaqi Wang et.al. 2408.01319 null
2024-08-02 Reconsidering Token Embeddings with the Definitions for Pre-trained Language Models Ying Zhang et.al. 2408.01308 null
2024-08-02 The Mismeasure of Man and Models: Evaluating Allocational Harms in Large Language Models Hannah Chen et.al. 2408.01285 null
2024-08-02 RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework Kunlun Zhu et.al. 2408.01262 link
2024-08-02 The Phantom Menace: Unmasking Privacy Leakages in Vision-Language Models Simone Caldarella et.al. 2408.01228 null
2024-08-02 High-Throughput Phenotyping of Clinical Text Using Large Language Models Daniel B. Hier et.al. 2408.01214 null
2024-08-02 Misinforming LLMs: vulnerabilities, challenges and opportunities Bo Zhou et.al. 2408.01168 null
2024-08-01 AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation Mengkang Hu et.al. 2408.00764 null
2024-08-01 UniTalker: Scaling up Audio-Driven 3D Facial Animation through A Unified Model Xiangyu Fan et.al. 2408.00762 null
2024-08-01 Tamper-Resistant Safeguards for Open-Weight LLMs Rishub Tamirisa et.al. 2408.00761 link
2024-08-01 Thermal Conductivity Predictions with Foundation Atomistic Models Balázs Póta et.al. 2408.00755 link
2024-08-01 Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model Benlin Liu et.al. 2408.00754 null
2024-08-01 Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation Siyu Jiao et.al. 2408.00744 link
2024-08-01 DynamoLLM: Designing LLM Inference Clusters for Performance and Energy Efficiency Jovan Stojkovic et.al. 2408.00741 null
2024-08-01 Virchow 2: Scaling Self-Supervised Mixed Magnification Models in Pathology Eric Zimmermann et.al. 2408.00738 null
2024-08-01 Improving Retrieval-Augmented Generation in Medicine with Iterative Follow-up Questions Guangzhi Xiong et.al. 2408.00727 null
2024-08-01 An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models Yangzhen Wu et.al. 2408.00724 null
2024-08-01 Pathway to Secure and Trustworthy 6G for LLMs: Attacks, Defense, and Opportunities Sunder Ali Khowaja et.al. 2408.00722 null
2024-08-01 SAM 2: Segment Anything in Images and Videos Nikhila Ravi et.al. 2408.00714 null
2024-08-01 Point-supervised Brain Tumor Segmentation with Box-prompted MedSAM Xiaofeng Liu et.al. 2408.00706 null
2024-08-02 Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning Trapoom Ukarapol et.al. 2408.00690 link
2024-08-01 Can Developers Prompt? A Controlled Experiment for Code Documentation Generation Hans-Alexander Kruse et.al. 2408.00686 null
2024-08-01 ExpertAF: Expert Actionable Feedback from Video Kumar Ashutosh et.al. 2408.00672 null
2024-08-01 AutoM3L: An Automated Multimodal Machine Learning Framework with Large Language Models Daqin Luo et.al. 2408.00665 link
2024-08-01 Disentangling Dense Embeddings with Sparse Autoencoders Charles O'Neill et.al. 2408.00657 null
2024-08-02 SentenceVAE: Faster, Longer and More Accurate Inference with Next-sentence Prediction for Large Language Models Hongjun An et.al. 2408.00655 link
2024-08-01 Towards End-to-End Explainable Facial Action Unit Recognition via Vision-Language Joint Learning Xuri Ge et.al. 2408.00644 null
2024-07-31 Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey Atsuyuki Miyai et.al. 2407.21794 null
2024-07-31 Vision-Language Model Based Handwriting Verification Mihir Chauhan et.al. 2407.21788 null
2024-07-31 Large Language Monkeys: Scaling Inference Compute with Repeated Sampling Bradley Brown et.al. 2407.21787 null
2024-07-31 The Llama 3 Herd of Models Abhimanyu Dubey et.al. 2407.21783 null
2024-07-31 Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs Shi Liu et.al. 2407.21771 null
2024-07-31 MoMa: Efficient Early-Fusion Pre-training with Mixture of Modality-Aware Experts Xi Victoria Lin et.al. 2407.21770 null
2024-07-31 ReplanVLM: Replanning Robotic Tasks with Visual Language Models Aoran Mei et.al. 2407.21762 null
2024-07-31 Learning Video Context as Interleaved Multimodal Sequences Kevin Qinghong Lin et.al. 2407.21757 link
2024-07-31 A Federated Learning-Friendly Approach for Parameter-Efficient Fine-Tuning of SAM in 3D Segmentation Mothilal Asokan et.al. 2407.21739 null
2024-07-31 Open-Vocabulary Audio-Visual Semantic Segmentation Ruohao Guo et.al. 2407.21721 null
2024-07-31 Adaptive Retrieval-Augmented Generation for Conversational Systems Xi Wang et.al. 2407.21712 null
2024-07-31 CEAR: Automatic construction of a knowledge graph of chemical entities and roles from scientific literature Stefan Langer et.al. 2407.21708 null
2024-07-31 TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities Ming Zhang et.al. 2407.21693 link
2024-07-31 Synth-Empathy: Towards High-Quality Synthetic Empathy Data Hao Liang et.al. 2407.21669 link
2024-08-01 Defending Jailbreak Attack in VLMs via Cross-modality Information Detector Yue Xu et.al. 2407.21659 null
2024-07-31 MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment Anurag Das et.al. 2407.21654 null
2024-07-31 Zero-Shot Cross-Domain Dialogue State Tracking via Dual Low-Rank Adaptation Xiang Luo et.al. 2407.21633 link
2024-07-31 TAROT: Task-Oriented Authorship Obfuscation Using Policy Optimization Methods Gabriel Loiseau et.al. 2407.21630 link
2024-07-31 LLM-for-X: Application-agnostic Integration of Large Language Models to Support Personal Writing Workflows Lukas Teufelberger et.al. 2407.21593 null
2024-07-31 A Performance Study of LLM-Generated Code on Leetcode Tristan Coignion et.al. 2407.21579 null
2024-07-30 ThinK: Thinner Key Cache by Query-Driven Pruning Yuhui Xu et.al. 2407.21018 null
2024-07-30 CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning Yuexi Du et.al. 2407.21011 link
2024-07-30 GABInsight: Exploring Gender-Activity Binding Bias in Vision-Language Models Ali Abdollahi et.al. 2407.21001 null
2024-07-31 MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning Yupeng Chen et.al. 2407.20999 null
2024-07-30 From Feature Importance to Natural Language Explanations Using LLMs with RAG Sule Tekkesinoglu et.al. 2407.20990 link
2024-07-30 Large Language Models (LLMs) for Semantic Communication in Edge-based IoT Networks Alakesh Kalita et.al. 2407.20970 null
2024-07-30 MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions Xiaowei Chi et.al. 2407.20962 link
2024-07-30 UniProcessor: A Text-induced Unified Low-level Image Processor Huiyu Duan et.al. 2407.20928 link
2024-07-30 SSPA: Split-and-Synthesize Prompting with Gated Alignments for Multi-Label Image Recognition Hao Tan et.al. 2407.20920 null
2024-07-30 Automated Review Generation Method Based on Large Language Models Shican Wu et.al. 2407.20906 link
2024-07-30 Faithful and Plausible Natural Language Explanations for Image Classification: A Pipeline Approach Adam Wojciechowski et.al. 2407.20899 null
2024-07-30 ThinkRepair: Self-Directed Automated Program Repair Xin Yin et.al. 2407.20898 link
2024-07-30 Effective Black Box Testing of Sentiment Analysis Classification Networks Parsa Karbasizadeh et.al. 2407.20884 null
2024-07-30 Breaking Agents: Compromising Autonomous LLM Agents Through Malfunction Amplification Boyang Zhang et.al. 2407.20859 null
2024-07-30 Learn by Selling: Equipping Large Language Models with Product Knowledge for Context-Driven Recommendations Sarthak Anand et.al. 2407.20856 null
2024-07-30 Large Language Model (LLM)-enabled Graphs in Dynamic Networking Geng Sun et.al. 2407.20840 null
2024-07-30 How to Measure the Intelligence of Large Language Models? Nils Körber et.al. 2407.20828 null
2024-07-30 Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning Norman Di Palo et.al. 2407.20798 null
2024-07-30 Interpretable Pre-Trained Transformers for Heart Time-Series Data Harry J. Davies et.al. 2407.20775 link
2024-07-30 OmniBal: Towards Fast Instruct-tuning for Vision-Language Models via Omniverse Computation Balance Yongqiang Yao et.al. 2407.20761 link
2024-07-29 Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing Ekaterina Iakovleva et.al. 2407.20232 null
2024-07-29 Improving 2D Feature Representations by 3D-Aware Fine-Tuning Yuanwen Yue et.al. 2407.20229 null
2024-07-29 FlexAttention for Efficient High-Resolution Vision-Language Models Junyan Li et.al. 2407.20228 null
2024-07-29 Can Editing LLMs Inject Harm? Canyu Chen et.al. 2407.20224 null
2024-07-29 SANGRIA: Surgical Video Scene Graph Optimization for Surgical Workflow Prediction Çağhan Köksal et.al. 2407.20214 null
2024-07-29 QAEA-DR: A Unified Text Augmentation Framework for Dense Retrieval Hongming Tan et.al. 2407.20207 null
2024-07-29 MindSearch: Mimicking Human Minds Elicits Deep AI Searcher Zehui Chen et.al. 2407.20183 link
2024-07-29 Theia: Distilling Diverse Vision Foundation Models for Robot Learning Jinghuan Shang et.al. 2407.20179 link
2024-07-29 AutoScale: Automatic Prediction of Compute-optimal Data Composition for Training LLMs Feiyang Kang et.al. 2407.20177 null
2024-07-29 Advancing Multimodal Large Language Models in Chart Question Answering with Visualization-Referenced Instruction Tuning Xingchen Zeng et.al. 2407.20174 link
2024-07-29 Diffusion Feedback Helps CLIP See Better Wenxuan Wang et.al. 2407.20171 link
2024-07-29 Language-Conditioned Offline RL for Multi-Robot Navigation Steven Morad et.al. 2407.20164 null
2024-07-29 rLLM: Relational Table Learning with LLMs Weichen Li et.al. 2407.20157 link
2024-07-29 ByteCheckpoint: A Unified Checkpointing System for LLM Development Borui Wan et.al. 2407.20143 null
2024-07-29 Strong Copyright Protection for Language Models via Adaptive Model Fusion Javier Abad et.al. 2407.20105 null
2024-07-29 Orca: Ocean Significant Wave Height Estimation with Spatio-temporally Aware Large Language Models Zhe Li et.al. 2407.20053 null
2024-07-29 Exploring Large Language Models to generate Easy to Read content Paloma Martínez et.al. 2407.20046 null
2024-07-29 MaskInversion: Localized Embeddings via Optimization of Explainability Maps Walid Bousselham et.al. 2407.20034 null
2024-07-29 Efficient Training of Large Language Models on Distributed Infrastructures: A Survey Jiangfei Duan et.al. 2407.20018 null
2024-07-29 Rosetta Statements: Lowering the Barrier for Semantic Parsing and Increasing the Cognitive Interoperability of Knowledge Graphs Lars Vogt et.al. 2407.20007 null
2024-07-26 Wolf: Captioning Everything with a World Summarization Framework Boyi Li et.al. 2407.18908 null
2024-07-26 SHIC: Shape-Image Correspondences with no Keypoint Supervision Aleksandar Shtedritski et.al. 2407.18907 null
2024-07-26 A Flexible and Scalable Approach for Collecting Wildlife Advertisements on the Web Juliana Barbosa et.al. 2407.18898 link
2024-07-26 Small Molecule Optimization with Large Language Models Philipp Guevorguian et.al. 2407.18897 link
2024-07-26 Human-artificial intelligence teaming for scientific information extraction from data-driven additive manufacturing research using large language models Mutahar Safdar et.al. 2407.18827 null
2024-07-26 Automatic Detection of Moral Values in Music Lyrics Vjosa Preniqi et.al. 2407.18787 link
2024-07-26 The power of Prompts: Evaluating and Mitigating Gender Bias in MT with LLMs Aleix Sant et.al. 2407.18786 null
2024-07-26 Foundation Models for the Digital Twin Creation of Cyber-Physical Systems Shaukat Ali et.al. 2407.18779 null
2024-07-26 TAGIFY: LLM-powered Tagging Interface for Improved Data Findability on OGD portals Kevin Kliimask et.al. 2407.18764 null
2024-07-26 Knowledge Graph Structure as Prompt: Improving Small Language Models Capabilities for Knowledge-based Causal Discovery Yuni Susanti et.al. 2407.18752 link
2024-07-26 Towards Effective and Efficient Continual Pre-training of Large Language Models Jie Chen et.al. 2407.18743 null
2024-07-26 Towards Generalized Offensive Language Identification Alphaeus Dmonte et.al. 2407.18738 null
2024-07-26 LLASP: Fine-tuning Large Language Models for Answer Set Programming Erica Coppolillo et.al. 2407.18723 null
2024-07-26 Neurosymbolic AI for Enhancing Instructability in Generative AI Amit Sheth et.al. 2407.18722 null
2024-07-26 Cluster-norm for Unsupervised Probing of Knowledge Walter Laurito et.al. 2407.18712 link
2024-07-26 Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Generation Esteban Garces Arias et.al. 2407.18698 null
2024-07-26 Collaborative Evolving Strategy for Automatic Data-Centric Development Xu Yang et.al. 2407.18690 null
2024-07-26 The BIAS Detection Framework: Bias Detection in Word Embeddings and Language Models for European Languages Alexandre Puttick et.al. 2407.18689 link
2024-07-26 Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift Seongho Son et.al. 2407.18676 null
2024-07-26 Every Part Matters: Integrity Verification of Scientific Figures Based on Multimodal Large Language Models Xiang Shi et.al. 2407.18626 link
2024-07-25 Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning Tianduo Wang et.al. 2407.18248 link
2024-07-25 LoRA-Pro: Are Low-Rank Adapters Properly Optimized? Zhengbo Wang et.al. 2407.18242 link
2024-07-26 Recursive Introspection: Teaching Language Model Agents How to Self-Improve Yuxiao Qu et.al. 2407.18219 null
2024-07-26 Exploring Scaling Trends in LLM Robustness Nikolaus Howe et.al. 2407.18213 null
2024-07-25 AsEP: Benchmarking Deep Learning Methods for Antibody-specific Epitope Prediction Chunan Liu et.al. 2407.18184 link
2024-07-25 Gene Regulatory Network Inference from Pre-trained Single-Cell Transcriptomics Transformer with Joint Graph Learning Sindhura Kommu et.al. 2407.18181 null
2024-07-25 Unlocking Tokens as Data Points for Generalization Bounds on Larger Language Models Sanae Lotfi et.al. 2407.18158 null
2024-07-25 $\mathbb{X}$ -Sample Contrastive Loss: Improving Contrastive Learning with Sample Similarity Graphs Vlad Sobal et.al. 2407.18134 null
2024-07-26 Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic Fakhraddin Alwajih et.al. 2407.18129 null
2024-07-25 Efficient Inference of Vision Instruction-Following Models with Elastic Cache Zuyan Liu et.al. 2407.18121 link
2024-07-25 Multi-Resolution Histopathology Patch Graphs for Ovarian Cancer Subtyping Jack Breen et.al. 2407.18105 link
2024-07-25 Fine-Tuning Large Language Models for Stock Return Prediction Using Newsflow Tian Guo et.al. 2407.18103 null
2024-07-25 PEFT-U: Parameter-Efficient Fine-Tuning for User Personalization Christopher Clarke et.al. 2407.18078 link
2024-07-25 C2P: Featuring Large Language Models with Causal Reasoning Abdolmahdi Bagheri et.al. 2407.18069 null
2024-07-25 ComPeer: A Generative Conversational Agent for Proactive Peer Support Tianjian Liu et.al. 2407.18064 null
2024-07-25 Audio Entailment: Assessing Deductive Reasoning for Audio Understanding Soham Deshmukh et.al. 2407.18062 link
2024-07-25 Difficulty Estimation and Simplification of French Text Using LLMs Henri Jamet et.al. 2407.18061 null
2024-07-25 The Geometry of Queries: Query-Based Innovations in Retrieval-Augmented Generation Eric Yang et.al. 2407.18044 null
2024-07-25 RestoreAgent: Autonomous Image Restoration Agent via Multimodal Large Language Models Haoyu Chen et.al. 2407.18035 null
2024-07-25 GermanPartiesQA: Benchmarking Commercial Large Language Models for Political Bias and Sycophancy Jan Batzner et.al. 2407.18008 null
2024-07-24 I Could've Asked That: Reformulating Unanswerable Questions Wenting Zhao et.al. 2407.17469 link
2024-07-24 WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries Wenting Zhao et.al. 2407.17468 null
2024-07-24 CMR Scaling Law: Predicting Critical Mixture Ratios for Continual Pre-training of Language Models Jiawei Gu et.al. 2407.17467 null
2024-07-24 $VILA^2$ : VILA Augmented VILA Yunhao Fang et.al. 2407.17453 null
2024-07-24 Fluent Student-Teacher Redteaming T. Ben Thompson et.al. 2407.17447 link
2024-07-24 Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data? Michael-Andrei Panaitescu-Liess et.al. 2407.17417 null
2024-07-24 (PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork Tianjin Huang et.al. 2407.17412 null
2024-07-24 Dependency Transformer Grammars: Integrating Dependency Structures into Transformer Language Models Yida Zhao et.al. 2407.17406 link
2024-07-24 Grammar-based Game Description Generation using Large Language Models Tsunehiko Tanaka et.al. 2407.17404 null
2024-07-24 3D Question Answering for City Scene Understanding Penglei Sun et.al. 2407.17398 null
2024-07-24 PERSONA: A Reproducible Testbed for Pluralistic Alignment Louis Castricato et.al. 2407.17387 null
2024-07-24 A Comprehensive Approach to Misspelling Correction with BERT and Levenshtein Distance Amirreza Naziri et.al. 2407.17383 null
2024-07-24 MMRA: A Benchmark for Multi-granularity Multi-image Relational Association Siwei Wu et.al. 2407.17379 link
2024-07-24 ViPer: Visual Personalization of Generative Models via Individual Preference Learning Sogand Salehi et.al. 2407.17365 null
2024-07-24 Gradient-based inference of abstract task representations for generalization in neural networks Ali Hummos et.al. 2407.17356 null
2024-07-24 Scalify: scale propagation for efficient low-precision LLM training Paul Balança et.al. 2407.17353 link
2024-07-24 Boosting Large Language Models with Socratic Method for Conversational Mathematics Teaching Yuyang Ding et.al. 2407.17349 link
2024-07-24 DexGANGrasp: Dexterous Generative Adversarial Grasping Synthesis for Task-Oriented Manipulation Qian Feng et.al. 2407.17348 null
2024-07-24 Label Alignment and Reassignment with Generalist Large Language Model for Enhanced Cross-Domain Named Entity Recognition Ke Bao et.al. 2407.17344 null
2024-07-24 How Good (Or Bad) Are LLMs at Detecting Misleading Visualizations? Leo Yu-Ho Lo et.al. 2407.17291 null
2024-07-23 PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects Junyi Li et.al. 2407.16696 link
2024-07-23 Stress-Testing Long-Context Language Models with Lifelong ICL and Task Haystack Xiaoyue Xu et.al. 2407.16695 link
2024-07-23 Can Large Language Models Automatically Jailbreak GPT-4V? Yuanwei Wu et.al. 2407.16686 null
2024-07-23 SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation Pengfei Chen et.al. 2407.16682 null
2024-07-23 RedAgent: Red Teaming Large Language Models with Context-aware Autonomous Language Agent Huiyu Xu et.al. 2407.16667 null
2024-07-23 Course-Correction: Safety Alignment Using Synthetic Preferences Rongwu Xu et.al. 2407.16637 link
2024-07-23 Lawma: The Power of Specialization for Legal Tasks Ricardo Dominguez-Olmedo et.al. 2407.16615 null
2024-07-23 Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data? Jonathan Hayase et.al. 2407.16607 link
2024-07-23 Shared Imagination: LLMs Hallucinate Alike Yilun Zhou et.al. 2407.16604 null
2024-07-23 A Comparative Study on Patient Language across Therapeutic Domains for Effective Patient Voice Classification in Online Health Discussions Giorgos Lysandrou et.al. 2407.16593 null
2024-07-23 Exploring Automatic Cryptographic API Misuse Detection in the Era of LLMs Yifan Xia et.al. 2407.16576 null
2024-07-23 TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback Eunseop Yoon et.al. 2407.16574 null
2024-07-23 Retrieve, Generate, Evaluate: A Case Study for Medical Paraphrases Generation with Small Language Models Ioana Buhnila et.al. 2407.16565 link
2024-07-23 Patched RTC: evaluating LLMs for diverse software development tasks Asankhaya Sharma et.al. 2407.16557 link
2024-07-24 MicroEmo: Time-Sensitive Multimodal Emotion Recognition with Micro-Expression Dynamics in Video Dialogues Liyun Zhang et.al. 2407.16552 null
2024-07-23 Quantifying the Role of Textual Predictability in Automatic Speech Recognition Sean Robertson et.al. 2407.16537 null
2024-07-23 Imperfect Vision Encoders: Efficient and Robust Tuning for Vision-Language Models Aristeidis Panos et.al. 2407.16526 null
2024-07-24 AMONGAGENTS: Evaluating Large Language Models in the Interactive Text-Based Social Deduction Game Yizhou Chi et.al. 2407.16521 null
2024-07-23 Language-Based Security for Low-Level MPC Christian Skalka et.al. 2407.16504 null
2024-07-23 Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models Kenza Benkirane et.al. 2407.16470 null
2024-07-22 AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description Junyu Xie et.al. 2407.15850 link
2024-07-22 LLMmap: Fingerprinting For Large Language Models Dario Pasquini et.al. 2407.15847 null
2024-07-22 SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models Mingze Xu et.al. 2407.15841 null
2024-07-22 MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity Yangzhou Liu et.al. 2407.15838 link
2024-07-22 dMel: Speech Tokenization made Simple He Bai et.al. 2407.15835 null
2024-07-22 J-CHAT: Japanese Large-scale Spoken Dialogue Corpus for Spoken Dialogue Language Modeling Wataru Nakata et.al. 2407.15828 null
2024-07-22 Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight Ziyuan Huang et.al. 2407.15819 null
2024-07-22 Perceptions of Linguistic Uncertainty by Language Models and Humans Catarina G Belem et.al. 2407.15814 link
2024-07-22 AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection Yunkang Cao et.al. 2407.15795 link
2024-07-22 CLIP with Generative Latent Replay: a Strong Baseline for Incremental Learning Emanuele Frascaroli et.al. 2407.15793 link
2024-07-22 Extracting Structured Insights from Financial News: An Augmented LLM Driven Approach Rian Dolphin et.al. 2407.15788 null
2024-07-22 Concept-Based Interpretable Reinforcement Learning with Limited to No Human Labels Zhuorui Ye et.al. 2407.15786 null
2024-07-22 Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning Kaiwen Wang et.al. 2407.15762 null
2024-07-22 MoRSE: Bridging the Gap in Cybersecurity Expertise with Retrieval Augmented Generation Marco Simoni et.al. 2407.15748 null
2024-07-22 OMoS-QA: A Dataset for Cross-Lingual Extractive Question Answering in a German Migration Context Steffen Kleinle et.al. 2407.15736 null
2024-07-22 TaskGen: A Task-Based, Memory-Infused Agentic Framework using StrictJSON John Chong Min Tan et.al. 2407.15734 link
2024-07-22 Zero-Shot Embeddings Inform Learning and Forgetting with Vision-Language Encoders Laura Niss et.al. 2407.15731 null
2024-07-22 SAM2CLIP2SAM: Vision Language Model for Segmentation of 3D CT Scans for Covid-19 Detection Dimitrios Kollias et.al. 2407.15728 null
2024-07-22 DStruct2Design: Data and Benchmarks for Data Structure Driven Generative Floor Plan Design Zhi Hao Luo et.al. 2407.15723 link
2024-07-22 Do Large Language Models Have Compositional Ability? An Investigation into Limitations and Scalability Zhuoyan Xu et.al. 2407.15720 link
2024-07-19 Internal Consistency and Self-Feedback in Large Language Models: A Survey Xun Liang et.al. 2407.14507 link
2024-07-19 On Pre-training of Multimodal Language Models Customized for Chart Understanding Wan-Cyuan Fan et.al. 2407.14506 null
2024-07-19 PD-TPE: Parallel Decoder with Text-guided Position Encoding for 3D Visual Grounding Chenshu Hou et.al. 2407.14491 null
2024-07-19 Evaluating the Reliability of Self-Explanations in Large Language Models Korbinian Randl et.al. 2407.14487 link
2024-07-19 Data-Centric Human Preference Optimization with Rationales Hoang Anh Just et.al. 2407.14477 link
2024-07-19 Contrastive Learning with Counterfactual Explanations for Radiology Report Generation Mingjie Li et.al. 2407.14474 null
2024-07-19 Check-Eval: A Checklist-based Approach for Evaluating Text Quality Jayr Pereira et.al. 2407.14467 null
2024-07-19 Undermining Mental Proof: How AI Can Make Cooperation Harder by Making Thinking Easier Zachary Wojtowicz et.al. 2407.14452 null
2024-07-19 Token-level Correlation-guided Compression for Efficient Multimodal Document Understanding Renshan Zhang et.al. 2407.14439 link
2024-07-19 Jumping Ahead: Improving Reconstruction Fidelity with JumpReLU Sparse Autoencoders Senthooran Rajamanoharan et.al. 2407.14435 null
2024-07-19 Mixture of Experts with Mixture of Precisions for Tuning Quality of Service HamidReza Imani et.al. 2407.14417 null
2024-07-19 System-1.x: Learning to Balance Fast and Slow Planning with Language Models Swarnadeep Saha et.al. 2407.14414 link
2024-07-19 DEAL: Disentangle and Localize Concept-level Explanations for VLMs Tang Li et.al. 2407.14412 link
2024-07-19 The Vision of Autonomic Computing: Can LLMs Make It a Reality? Zhiyang Zhang et.al. 2407.14402 null
2024-07-19 Frontiers of Deep Learning: From Novel Application to Real-World Deployment Rui Xie et.al. 2407.14386 null
2024-07-19 Open Artificial Knowledge Vadim Borisov et.al. 2407.14371 null
2024-07-19 Enhancing Zero-shot Audio Classification using Sound Attribute Knowledge from Large Language Models Xuenan Xu et.al. 2407.14355 link
2024-07-19 Improving Retrieval in Sponsored Search by Leveraging Query Context Signals Akash Kumar Mohankumar et.al. 2407.14346 null
2024-07-19 LLMs left, right, and center: Assessing GPT's capabilities to label political bias from web domains Raphael Hernandes et.al. 2407.14344 null
2024-07-19 Multimodal Misinformation Detection using Large Vision-Language Models Sahar Tahmasebi et.al. 2407.14321 null
2024-07-18 Latent Causal Probing: A Formal Perspective on Probing with Causal Models of Data Charles Jin et.al. 2407.13765 null
2024-07-18 SegPoint: Segment Any Point Cloud via Large Language Model Shuting He et.al. 2407.13761 null
2024-07-18 Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models Zhuo Chen et.al. 2407.13757 null
2024-07-18 CellularLint: A Systematic Approach to Identify Inconsistent Behavior in Cellular Network Specifications Mirza Masfiqur Rahman et.al. 2407.13742 null
2024-07-18 Baba Is AI: Break the Rules to Beat the Benchmark Nathan Cloos et.al. 2407.13729 null
2024-07-18 CoDefeater: Using LLMs To Find Defeaters in Assurance Cases Usman Gohar et.al. 2407.13717 link
2024-07-18 Understanding Reference Policies in Direct Preference Optimization Yixin Liu et.al. 2407.13709 link
2024-07-18 A Comprehensive Review of Recommender Systems: Transitioning from Theory to Practice Shaina Raza et.al. 2407.13699 null
2024-07-18 Benchmark Agreement Testing Done Right: A Guide for LLM Benchmark Evaluation Yotam Perlitz et.al. 2407.13696 link
2024-07-18 Prover-Verifier Games improve legibility of LLM outputs Jan Hendrik Kirchner et.al. 2407.13692 null
2024-07-18 Shaded Route Planning Using Active Segmentation and Identification of Satellite Images Longchao Da et.al. 2407.13689 null
2024-07-18 FuLG: 150B Romanian Corpus for Language Model Pretraining Vlad-Andrei Bădoiu et.al. 2407.13657 null
2024-07-18 COMCAT: Leveraging Human Judgment to Improve Automatic Documentation and Summarization Skyler Grandel et.al. 2407.13648 null
2024-07-18 Weak-to-Strong Reasoning Yuqing Yang et.al. 2407.13647 link
2024-07-18 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies Chaofan Tao et.al. 2407.13623 link
2024-07-18 KNOWNET: Guided Health Information Seeking from LLMs via Knowledge Graph Integration Youfu Yan et.al. 2407.13598 null
2024-07-18 PLANTS: A Novel Problem and Dataset for Summarization of Planning-Like (PL) Tasks Vishal Pallagani et.al. 2407.13597 null
2024-07-18 EarthMarker: A Visual Prompt Learning Framework for Region-level and Point-level Remote Sensing Imagery Comprehension Wei Zhang et.al. 2407.13596 link
2024-07-18 Robust Calibration of Large Vision-Language Adapters Balamurali Murugesan et.al. 2407.13588 link
2024-07-18 Towards Zero-Shot Multimodal Machine Translation Matthieu Futeral et.al. 2407.13579 link
2024-07-17 LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models Kaichen Zhang et.al. 2407.12772 link
2024-07-17 EchoSight: Advancing Visual-Language Models with Wiki Knowledge Yibin Yan et.al. 2407.12735 null
2024-07-17 NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model Zhongqun Zhang et.al. 2407.12727 null
2024-07-17 Is Sarcasm Detection A Step-by-Step Reasoning Process in Large Language Models? Ben Yao et.al. 2407.12725 null
2024-07-17 The Future of Learning: Large Language Models through the Lens of Students He Zhang et.al. 2407.12723 null
2024-07-17 MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models Leyang Shen et.al. 2407.12709 link
2024-07-17 Subgraph-Aware Training of Text-based Methods for Knowledge Graph Completion Youmin Ko et.al. 2407.12703 null
2024-07-17 Patch-Level Training for Large Language Models Chenze Shao et.al. 2407.12665 link
2024-07-17 Zero-shot Text-guided Infinite Image Synthesis with LLM guidance Soyeong Kwon et.al. 2407.12642 null
2024-07-17 Domain-specific or Uncertainty-aware models: Does it really make a difference for biomedical text classification? Aman Sinha et.al. 2407.12626 null
2024-07-17 Harnessing the Power of Artificial Intelligence to Vitalize Endangered Indigenous Languages: Technologies and Experiences Claudio Pinhanez et.al. 2407.12620 null
2024-07-17 AudienceView: AI-Assisted Interpretation of Audience Feedback in Journalism William Brannon et.al. 2407.12613 link
2024-07-17 VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding Ofir Abramovich et.al. 2407.12594 null
2024-07-18 Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks Antoni Kowalczuk et.al. 2407.12588 link
2024-07-17 E5-V: Universal Embeddings with Multimodal Large Language Models Ting Jiang et.al. 2407.12580 link
2024-07-17 Audio Conditioning for Music Generation via Discrete Bottleneck Features Simon Rouard et.al. 2407.12563 null
2024-07-17 Conspiracy theories and where to find them on TikTok Francesco Corso et.al. 2407.12545 null
2024-07-17 Abstraction Alignment: Comparing Model and Human Conceptual Relationships Angie Boggust et.al. 2407.12543 link
2024-07-17 Towards Collaborative Intelligence: Propagating Intentions and Reasoning for Multi-Agent Coordination with Large Language Models Xihe Qiu et.al. 2407.12532 null
2024-07-17 Crafting the Path: Robust Query Rewriting for Information Retrieval Ingeol Baek et.al. 2407.12529 null
2024-07-16 UrbanWorld: An Urban World Model for 3D City Generation Yu Shang et.al. 2407.11965 null
2024-07-16 NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window? Mo Li et.al. 2407.11963 link
2024-07-16 Code Documentation and Analysis to Secure Software Development Paul Attie et.al. 2407.11934 null
2024-07-16 What's Wrong? Refining Meeting Summaries with LLM Feedback Frederic Kirstein et.al. 2407.11919 null
2024-07-16 GraphFM: A Scalable Framework for Multi-Graph Pretraining Divyansha Lachi et.al. 2407.11907 null
2024-07-16 Ascend-CC: Confidential Computing on Heterogeneous NPU for Emerging Generative AI Workloads Aritra Dhar et.al. 2407.11888 null
2024-07-16 Zero-shot Cross-Lingual Transfer for Synthetic Data Generation in Grammatical Error Detection Gaetan Lopez Latouche et.al. 2407.11854 null
2024-07-16 Schema Matching with Large Language Models: an Experimental Study Marcel Parciak et.al. 2407.11852 link
2024-07-16 LoFTI: Localization and Factuality Transfer to Indian Locales Sona Elza Simon et.al. 2407.11833 link
2024-07-16 GPT Assisted Annotation of Rhetorical and Linguistic Features for Interpretable Propaganda Technique Detection in News Text Kyle Hamilton et.al. 2407.11827 null
2024-07-16 PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation Branden Butler et.al. 2407.11798 null
2024-07-16 Large Language Models as Misleading Assistants in Conversation Betty Li Hou et.al. 2407.11789 null
2024-07-16 SwitchCIT: Switching for Continual Instruction Tuning of Large Language Models Xinbo Wu et.al. 2407.11780 null
2024-07-16 Sharif-MGTD at SemEval-2024 Task 8: A Transformer-Based Approach to Detect Machine Generated Text Seyedeh Fatemeh Ebrahimi et.al. 2407.11774 null
2024-07-16 Educational Personalized Learning Path Planning with Large Language Models Chee Ng et.al. 2407.11773 null
2024-07-16 XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach Truong Thanh Hung Nguyen et.al. 2407.11771 null
2024-07-16 Robust Utility-Preserving Text Anonymization Based on Large Language Models Tianyu Yang et.al. 2407.11770 link
2024-07-16 Vectoring Languages Joseph Chen et.al. 2407.11766 null
2024-07-16 Exploring Quantization for Efficient Pre-Training of Transformer Language Models Kamran Chitsaz et.al. 2407.11722 link
2024-07-17 Harnessing Large Language Models for Multimodal Product Bundling Xiaohao Liu et.al. 2407.11712 null
2024-07-15 VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation Bocheng Zou et.al. 2407.10972 link
2024-07-15 Q-Sparse: All Large Language Models can be Fully Sparsely-Activated Hongyu Wang et.al. 2407.10969 null
2024-07-15 Fast Matrix Multiplications for Lookup Table-Quantized LLMs Han Guo et.al. 2407.10960 link
2024-07-15 Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows? Ruisheng Cao et.al. 2407.10956 link
2024-07-15 MMM: Multilingual Mutual Reinforcement Effect Mix Datasets & Test with Open-domain Information Extraction Large Language Models Chengguang Gan et.al. 2407.10953 null
2024-07-15 Can Textual Semantics Mitigate Sounding Object Segmentation Preference? Yaoting Wang et.al. 2407.10947 link
2024-07-15 Learning from Naturally Occurring Feedback Shachar Don-Yehiya et.al. 2407.10944 link
2024-07-15 GRUtopia: Dream General Robots in a City at Scale Hanqing Wang et.al. 2407.10943 link
2024-07-15 Fine-Tuning and Prompt Optimization: Two Great Steps that Work Better Together Dilara Soylu et.al. 2407.10930 null
2024-07-15 Benchmarking Vision Language Models for Cultural Understanding Shravan Nayak et.al. 2407.10920 null
2024-07-15 FinDKG: Dynamic Knowledge Graphs with Large Language Models for Detecting Global Trends in Financial Markets Xiaohui Victor Li et.al. 2407.10909 link
2024-07-15 Hey, That's My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique Mark Russinovich et.al. 2407.10887 null
2024-07-15 SLIP: Securing LLMs IP Using Weights Decomposition Yehonathan Refael et.al. 2407.10886 null
2024-07-15 Understanding the Importance of Evolutionary Search in Automated Heuristic Design with Large Language Models Rui Zhang et.al. 2407.10873 null
2024-07-15 GPT Sonograpy: Hand Gesture Decoding from Forearm Ultrasound Images via VLM Keshav Bimbraw et.al. 2407.10870 null
2024-07-15 Physics-Inspired Generative Models in Medical Imaging: A Review Dennis Hein et.al. 2407.10856 null
2024-07-15 Weighted Grouped Query Attention in Transformers Sai Sena Chinnakonduru et.al. 2407.10855 null
2024-07-15 An Actionable Framework for Assessing Bias and Fairness in Large Language Model Use Cases Dylan Bouchard et.al. 2407.10853 null
2024-07-15 MetaLLM: A High-performant and Cost-efficient Dynamic Framework for Wrapping LLMs Quang H. Nguyen et.al. 2407.10834 null
2024-07-15 BiasScanner: Automatic Detection and Classification of News Bias to Strengthen Democracy Tim Menzner et.al. 2407.10829 null
2024-07-12 FairyLandAI: Personalized Fairy Tales utilizing ChatGPT and DALLE-3 Georgios Makridis et.al. 2407.09467 null
2024-07-12 Human-like Episodic Memory for Infinite Context LLMs Zafeirios Fountas et.al. 2407.09450 null
2024-07-12 ASTPrompter: Weakly Supervised Automated Language Model Red-Teaming to Identify Likely Toxic Prompts Amelia F. Hardy et.al. 2407.09447 link
2024-07-12 MUSCLE: A Model Update Strategy for Compatible LLM Evolution Jessica Echterhoff et.al. 2407.09435 null
2024-07-12 A Perspective on Foundation Models for the Electric Power Grid Hendrik F. Hamann et.al. 2407.09434 null
2024-07-12 Open (Clinical) LLMs are Sensitive to Instruction Phrasings Alberto Mario Ceballos Arroyo et.al. 2407.09429 link
2024-07-12 TelecomGPT: A Framework to Build Telecom-Specfic Large Language Models Hang Zou et.al. 2407.09424 null
2024-07-12 Mitigating Entity-Level Hallucination in Large Language Models Weihang Su et.al. 2407.09417 link
2024-07-12 SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers Shraman Pramanick et.al. 2407.09413 link
2024-07-12 Deep Bag-of-Words Model: An Efficient and Interpretable Relevance Architecture for Chinese E-Commerce Zhe Lin et.al. 2407.09395 null
2024-07-12 PersonaRAG: Enhancing Retrieval-Augmented Generation Systems with User-Centric Agents Saber Zerhoudi et.al. 2407.09394 link
2024-07-12 GAVEL: Generating Games Via Evolution and Language Models Graham Todd et.al. 2407.09388 null
2024-07-12 Is Contrasting All You Need? Contrastive Learning for the Detection and Attribution of AI-generated Text Lucio La Cava et.al. 2407.09364 null
2024-07-12 Good Intentions, Risky Inventions: A Method for Assessing the Risks and Benefits of AI in Mobile and Wearable Uses Marios Constantinides et.al. 2407.09322 link
2024-07-12 Scalability of Bayesian Network Structure Elicitation with Large Language Models: a Novel Methodology and Comparative Analysis Nikolay Babakov et.al. 2407.09311 null
2024-07-12 Transformer Layers as Painters Qi Sun et.al. 2407.09298 link
2024-07-12 Security Matrix for Multimodal Agents on Mobile Devices: A Systematic and Proof of Concept Study Yulong Yang et.al. 2407.09295 null
2024-07-12 CEIPA: Counterfactual Explainable Incremental Prompt Attack Analysis on Large Language Models Dong Shu et.al. 2407.09292 null
2024-07-12 Structuring Authenticity Assessments on Historical Documents using LLMs Andrea Schimmenti et.al. 2407.09290 null
2024-07-12 WSESeg: Introducing a Dataset for the Segmentation of Winter Sports Equipment with a Baseline for Interactive Segmentation Robin Schön et.al. 2407.09288 link
2024-07-11 MAVIS: Mathematical Visual Instruction Tuning Renrui Zhang et.al. 2407.08739 link
2024-07-11 Real-Time Anomaly Detection and Reactive Planning with Large Language Models Rohan Sinha et.al. 2407.08735 null
2024-07-11 Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist Zihao Zhou et.al. 2407.08733 null
2024-07-11 A Taxonomy for Data Contamination in Large Language Models Medha Palavalli et.al. 2407.08716 null
2024-07-11 GTA: A Benchmark for General Tool Agents Jize Wang et.al. 2407.08713 link
2024-07-11 eyeballvul: a future-proof benchmark for vulnerability detection in the wild Timothee Chauvin et.al. 2407.08708 link
2024-07-11 Extracting Training Data from Document-Based VQA Models Francesco Pinto et.al. 2407.08707 null
2024-07-11 HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models Runhui Huang et.al. 2407.08706 null
2024-07-11 Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models Zhening Xing et.al. 2407.08701 null
2024-07-11 Mitigating Catastrophic Forgetting in Language Transfer via Model Merging Anton Alexandrov et.al. 2407.08699 null
2024-07-11 Cloud Atlas: Efficient Fault Localization for Cloud Systems using Language Models and Causal Insight Zhiqiang Xie et.al. 2407.08694 null
2024-07-11 Robotic Control via Embodied Chain-of-Thought Reasoning Zawalski Michał et.al. 2407.08693 null
2024-07-11 SEED-Story: Multimodal Long Story Generation with Large Language Model Shuai Yang et.al. 2407.08683 link
2024-07-11 NODE-Adapter: Neural Ordinary Differential Equations for Better Vision-Language Reasoning Yi Zhang et.al. 2407.08672 null
2024-07-11 Uncertainty Estimation of Large Language Models in Medical Question Answering Jiaxin Wu et.al. 2407.08662 null
2024-07-11 Towards Building Specialized Generalist AI with System 1 and System 2 Fusion Kaiyan Zhang et.al. 2407.08642 null
2024-07-11 $β$-DPO: Direct Preference Optimization with Dynamic $β$ Junkang Wu et.al. 2407.08639 link
2024-07-11 RoboMorph: Evolving Robot Morphology using Large Language Models Kevin Qiu et.al. 2407.08626 null
2024-07-11 Tamil Language Computing: the Present and the Future Kengatharaiyer Sarveswaran et.al. 2407.08618 null
2024-07-11 FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision Jay Shah et.al. 2407.08608 null
2024-07-10 Training on the Test Task Confounds Evaluation and Emergence Ricardo Dominguez-Olmedo et.al. 2407.07890 link
2024-07-10 Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization Junkang Wu et.al. 2407.07880 link
2024-07-11 Toto: Time Series Optimized Transformer for Observability Ben Cohen et.al. 2407.07874 null
2024-07-10 FACTS About Building Retrieval Augmented Generation-based Chatbots Rama Akkiraju et.al. 2407.07858 null
2024-07-10 OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training Sami Jaghouar et.al. 2407.07852 link
2024-07-10 Natural Language Mechanisms via Self-Resolution with Foundation Models Nicolas Della Penna et.al. 2407.07845 null
2024-07-10 Benchmarking Embedding Aggregation Methods in Computational Pathology: A Clinical Data Perspective Shengjia Chen et.al. 2407.07841 link
2024-07-10 Decompose and Compare Consistency: Measuring VLMs' Answer Reliability via Task-Decomposition Consistency Comparison Qian Yang et.al. 2407.07840 null
2024-07-10 Transformer Alignment in Large Language Models Murdock Aubry et.al. 2407.07810 null
2024-07-11 AVCap: Leveraging Audio-Visual Features as Text Tokens for Captioning Jongsuk Kim et.al. 2407.07801 link
2024-07-10 Attribute or Abstain: Large Language Models as Long Document Assistants Jan Buchmann et.al. 2407.07799 link
2024-07-11 Evaluating Large Language Models with Grid-Based Game Competitions: An Extensible LLM Benchmark and Leaderboard Oguzhan Topsakal et.al. 2407.07796 link
2024-07-10 Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities Tianjie Ju et.al. 2407.07791 link
2024-07-10 WorldAPIs: The World Is Worth How Many APIs? A Thought Experiment Jiefu Ou et.al. 2407.07778 null
2024-07-10 Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs Hao-Tien Lewis Chiang et.al. 2407.07775 null
2024-07-10 Can ChatGPT Pass a Theory of Computing Course? Matei A. Golesteanu et.al. 2407.07757 null
2024-07-10 Fine-Tuning Large Language Models with User-Level Differential Privacy Zachary Charles et.al. 2407.07737 null
2024-07-10 PaliGemma: A versatile 3B VLM for transfer Lucas Beyer et.al. 2407.07726 link
2024-07-10 Why should we ever automate moral decision making? Vincent Conitzer et.al. 2407.07671 null
2024-07-10 A Proposed S.C.O.R.E. Evaluation Framework for Large Language Models : Safety, Consensus, Objectivity, Reproducibility and Explainability Ting Fang Tan et.al. 2407.07666 null
2024-07-09 AnyTaskTune: Advanced Domain-Specific Solutions through Task-Fine-Tuning Jiaxi Cui et.al. 2407.07094 link
2024-07-09 FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation Liqun Ma et.al. 2407.07093 link
2024-07-09 CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation Tong Chen et.al. 2407.07087 link
2024-07-09 Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models Logan Cross et.al. 2407.07086 link
2024-07-09 Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities Shaltiel Shmidman et.al. 2407.07080 null
2024-07-09 Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps Yung-Sung Chuang et.al. 2407.07071 link
2024-07-09 Prompting Techniques for Secure Code Generation: A Systematic Investigation Catherine Tony et.al. 2407.07064 null
2024-07-10 Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence Weize Chen et.al. 2407.07061 link
2024-07-10 Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model Wenqi Zhang et.al. 2407.07053 link
2024-07-09 ProtoSAM -- One Shot Medical Image Segmentation With Foundational Models Lev Ayzenberg et.al. 2407.07042 link
2024-07-09 Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models Yue Zhang et.al. 2407.07035 null
2024-07-09 Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization Jeongseok Hyun et.al. 2407.07024 link
2024-07-09 Using Large Language Models for Generating Smart Contracts for Health Insurance from Textual Policies Inwon Kang et.al. 2407.07019 null
2024-07-09 End-To-End Causal Effect Estimation from Unstructured Natural Language Data Nikita Dhawan et.al. 2407.07018 null
2024-07-09 Is Large Language Model All You Need to Predict the Synthesizability and Precursors of Crystal Structures? Zhilong Song et.al. 2407.07016 null
2024-07-09 Induction Heads as an Essential Mechanism for Pattern Matching in In-context Learning J. Crosbie et.al. 2407.07011 null
2024-07-09 Metron: Holistic Performance Evaluation Framework for LLM Inference Systems Amey Agrawal et.al. 2407.07000 link
2024-07-09 Robust Neural Information Retrieval: An Adversarial and Out-of-distribution Perspective Yu-An Liu et.al. 2407.06992 link
2024-07-09 Segment-Based Interactive Machine Translation for Pre-trained Models Angel Navarro et.al. 2407.06990 null
2024-07-09 Listen and Speak Fairly: A Study on Semantic Gender Bias in Speech Integrated Large Language Models Yi-Cheng Lin et.al. 2407.06957 link
2024-07-08 Multi-Object Hallucination in Vision-Language Models Xuweiyi Chen et.al. 2407.06192 null
2024-07-08 4D Contrastive Superflows are Dense 3D Representation Learners Xiang Xu et.al. 2407.06190 link
2024-07-08 Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision Orr Zohar et.al. 2407.06189 link
2024-07-08 CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation Xinying Guo et.al. 2407.06188 null
2024-07-08 JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation Yu Zeng et.al. 2407.06187 null
2024-07-08 Vision-Language Models under Cultural and Inclusive Considerations Antonia Karamolegkou et.al. 2407.06177 null
2024-07-08 On Speeding Up Language Model Evaluation Jin Peng Zhou et.al. 2407.06172 null
2024-07-08 What's Wrong with Your Code Generated by Large Language Models? An Extensive Study Shihan Dou et.al. 2407.06153 null
2024-07-08 Using Grammar Masking to Ensure Syntactic Validity in LLM-based Modeling Tasks Lukas Netz et.al. 2407.06146 null
2024-07-08 ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation Ethan Chern et.al. 2407.06135 link
2024-07-08 Evaluating the Semantic Profiling Abilities of LLMs for Natural Language Utterances in Data Visualization Hannah K. Bako et.al. 2407.06129 link
2024-07-08 Depression Detection and Analysis using Large Language Models on Textual and Audio-Visual Modalities Avinash Anand et.al. 2407.06125 null
2024-07-08 Enhancing Language Model Rationality with Bi-Directional Deliberation Reasoning Yadong Zhang et.al. 2407.06112 null
2024-07-08 Artificial Intuition: Efficient Classification of Scientific Abstracts Harsh Sakhrani et.al. 2407.06093 null
2024-07-08 Merge, Ensemble, and Cooperate! A Survey on Collaborative Strategies in the Era of Large Language Models Jinliang Lu et.al. 2407.06089 null
2024-07-08 From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty Maor Ivgi et.al. 2407.06071 link
2024-07-08 Variational Best-of-N Alignment Afra Amini et.al. 2407.06057 null
2024-07-08 MST5 -- Multilingual Question Answering over Knowledge Graphs Nikit Srivastava et.al. 2407.06041 link
2024-07-08 PAS: Data-Efficient Plug-and-Play Prompt Augmentation System Miao Zheng et.al. 2407.06027 null
2024-07-08 iLLM-TSC: Integration reinforcement learning and large language model for traffic signal control policy improvement Aoyu Pang et.al. 2407.06025 link
2024-07-05 Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs Rudolf Laine et.al. 2407.04694 link
2024-07-05 ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models Yuzhe Gu et.al. 2407.04693 link
2024-07-05 Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge Yuanze Lin et.al. 2407.04681 null
2024-07-05 Lost in Translation: The Algorithmic Gap Between LMs and the Brain Tommaso Tosato et.al. 2407.04680 null
2024-07-05 Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition Ye Bai et.al. 2407.04675 null
2024-07-05 Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement Yongji Wu et.al. 2407.04656 null
2024-07-05 Speculative Speech Recognition by Audio-Prefixed Low-Rank Adaptation of Language Models Bolaji Yusuf et.al. 2407.04641 null
2024-07-05 Entity Decomposition with Filtering: A Zero-Shot Clinical Named Entity Recognition Framework Reza Averly et.al. 2407.04629 null
2024-07-05 On scalable oversight with weak LLMs judging strong LLMs Zachary Kenton et.al. 2407.04622 null
2024-07-05 CountGD: Multi-Modal Open-World Counting Niki Amini-Naieni et.al. 2407.04619 null
2024-07-05 ARM: Efficient Guided Decoding with Autoregressive Reward Models Sergey Troshin et.al. 2407.04615 null
2024-07-05 AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation Yuhan Zhu et.al. 2407.04603 null
2024-07-05 Written Term Detection Improves Spoken Term Detection Bolaji Yusuf et.al. 2407.04601 link
2024-07-05 Testing learning hypotheses using neural networks by manipulating learning data Cara Su-Yi Leong et.al. 2407.04593 null
2024-07-05 Leveraging Large Language Models for Integrated Satellite-Aerial-Terrestrial Networks: Recent Advances and Future Directions Shumaila Javaid et.al. 2407.04581 null
2024-07-05 VRSD: Rethinking Similarity and Diversity for Retrieval in Large Language Models Hang Gao et.al. 2407.04573 null
2024-07-05 Not (yet) the whole story: Evaluating Visual Storytelling Requires More than Measuring Coherence, Grounding, and Repetition Aditya K Surikuchi et.al. 2407.04559 link
2024-07-05 Spontaneous Reward Hacking in Iterative Self-Refinement Jane Pan et.al. 2407.04549 null
2024-07-05 PoPreRo: A New Dataset for Popularity Prediction of Romanian Reddit Posts Ana-Cristina Rogoz et.al. 2407.04541 link
2024-07-05 GPT vs RETRO: Exploring the Intersection of Retrieval and Parameter-Efficient Fine-Tuning Aleksander Ficek et.al. 2407.04528 null
2024-07-03 Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages Max Zuo et.al. 2407.03321 link
2024-07-03 InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output Pan Zhang et.al. 2407.03320 link
2024-07-03 BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations Zhantao Yang et.al. 2407.03314 null
2024-07-03 Universal Length Generalization with Turing Programs Kaiying Hou et.al. 2407.03310 null
2024-07-03 Large Language Models for JSON Schema Discovery Michael J. Mior et.al. 2407.03286 null
2024-07-03 LLM Internal States Reveal Hallucination Risk Faced With a Query Ziwei Ji et.al. 2407.03282 null
2024-07-03 STF: Sentence Transformer Fine-Tuning For Topic Categorization With Limited Data Kheir Eddine Daouadi et.al. 2407.03253 null
2024-07-03 Improving Retrieval-augmented Text-to-SQL with AST-based Ranking and Schema Pruning Zhili Shen et.al. 2407.03227 null
2024-07-03 How Does Quantization Affect Multilingual LLMs? Kelly Marchisio et.al. 2407.03211 null
2024-07-03 TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts Ruida Wang et.al. 2407.03203 link
2024-07-03 Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models Haritz Puerto et.al. 2407.03181 link
2024-07-03 Investigating Decoder-only Large Language Models for Speech-to-text Translation Chao-Wei Huang et.al. 2407.03169 null
2024-07-03 SOS! Soft Prompt Attack Against Open-Source Large Language Models Ziqing Yang et.al. 2407.03160 null
2024-07-03 Let the Code LLM Edit Itself When You Edit the Code Zhenyu He et.al. 2407.03157 null
2024-07-03 Reinforcement Learning for Sequence Design Leveraging Protein Language Models Jithendaraa Subramanian et.al. 2407.03154 null
2024-07-03 Enhancing Translation Accuracy of Large Language Models through Continual Pre-Training on Parallel Data Minato Kondo et.al. 2407.03145 null
2024-07-03 Social Bias Evaluation for Large Language Models Requires Prompt Variations Rem Hida et.al. 2407.03129 link
2024-07-03 KeyVideoLLM: Towards Large-scale Video Keyframe Selection Hao Liang et.al. 2407.03104 null
2024-07-03 Cactus: Towards Psychological Counseling Conversations using Cognitive Behavioral Theory Suyeon Lee et.al. 2407.03103 link
2024-07-03 ScreenTK: Seamless Detection of Time-Killing Moments Using Continuous Mobile Screen Text Monitoring Le Fang et.al. 2407.03063 null
2024-07-02 MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention Huiqiang Jiang et.al. 2407.02490 link
2024-07-02 Neurocache: Efficient Vector Retrieval for Long-range Language Modeling Ali Safaya et.al. 2407.02486 link
2024-07-02 RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs Yue Yu et.al. 2407.02485 null
2024-07-02 MMedAgent: Learning to Use Medical Tools with Multi-modal Agent Binxu Li et.al. 2407.02483 null
2024-07-02 Understanding Alignment in Multimodal LLMs: A Comprehensive Study Elmira Amirloo et.al. 2407.02477 null
2024-07-02 Open Scene Graphs for Open World Object-Goal Navigation Joel Loo et.al. 2407.02473 null
2024-07-02 ValueScope: Unveiling Implicit Norms and Values via Return Potential Model of Social Interactions Chan Young Park et.al. 2407.02472 link
2024-07-02 Reliable Confidence Intervals for Information Retrieval Evaluation Using Generative A.I Harrie Oosterhuis et.al. 2407.02464 null
2024-07-02 Ensemble of pre-trained language models and data augmentation for hate speech detection from Arabic tweets Kheir Eddine Daouadi et.al. 2407.02448 null
2024-07-03 Video Watermarking: Safeguarding Your Video from (Unauthorized) Annotations by Video-based LLMs Jinmin Li et.al. 2407.02411 null
2024-07-02 CEB: Compositional Evaluation Benchmark for Fairness in Large Language Models Song Wang et.al. 2407.02408 null
2024-07-02 Assessing the Code Clone Detection Capability of Large Language Models Zixian Zhang et.al. 2407.02402 null
2024-07-02 Learning to Refine with Fine-Grained Natural Language Feedback Manya Wadhwa et.al. 2407.02397 link
2024-07-02 Is Your AI-Generated Code Really Secure? Evaluating Large Language Models on Secure Code Generation with CodeSecEval Jiexin Wang et.al. 2407.02395 null
2024-07-02 TokenPacker: Efficient Visual Projector for Multimodal LLM Wentong Li et.al. 2407.02392 link
2024-07-02 Talking to Machines: do you read me? Lina M. Rojas-Barahona et.al. 2407.02354 null
2024-07-02 Pelican: Correcting Hallucination in Vision-LLMs via Claim Decomposition and Program of Thought Verification Pritish Sahu et.al. 2407.02352 null
2024-07-02 Generative Large Language Models in Automated Fact-Checking: A Survey Ivan Vykopal et.al. 2407.02351 null
2024-07-02 Conceptual Codebook Learning for Vision-Language Models Yi Zhang et.al. 2407.02350 null
2024-07-02 MORPHEUS: Modeling Role from Personalized Dialogue History by Exploring and Utilizing Latent Space Yihong Tang et.al. 2407.02345 null
2024-06-28 Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs Sukmin Yun et.al. 2406.20098 link
2024-06-28 LLaRA: Supercharging Robot Learning Data for Vision-Language Policy Xiang Li et.al. 2406.20095 link
2024-06-28 Scaling Synthetic Data Creation with 1,000,000,000 Personas Xin Chan et.al. 2406.20094 link
2024-06-28 LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression Jieneng Chen et.al. 2406.20092 link
2024-06-28 ProgressGym: Alignment with a Millennium of Moral Progress Tianyi Qiu et.al. 2406.20087 null
2024-06-28 Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language Yicheng Chen et.al. 2406.20085 null
2024-06-28 Molecular Facts: Desiderata for Decontextualization in LLM Fact Verification Anisha Gunjal et.al. 2406.20079 link
2024-06-28 EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model Yuxuan Zhang et.al. 2406.20076 link
2024-06-28 To Word Senses and Beyond: Inducing Concepts with Contextualized Language Models Bastien Liétard et.al. 2406.20054 null
2024-06-28 Covert Malicious Finetuning: Challenges in Safeguarding LLM Adaptation Danny Halawi et.al. 2406.20053 null
2024-07-02 BMW Agents -- A Framework For Task Automation Through Multi-Agent Collaboration Noel Crawford et.al. 2406.20041 null
2024-06-28 BioMNER: A Dataset for Biomedical Method Entity Recognition Chen Tang et.al. 2406.20038 null
2024-06-28 LEMoE: Advanced Mixture of Experts Adaptor for Lifelong Model Editing of Large Language Models Renzhi Wang et.al. 2406.20030 null
2024-06-28 ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models Yuxiang Zhang et.al. 2406.20015 link
2024-06-28 The SIFo Benchmark: Investigating the Sequential Instruction Following Ability of Large Language Models Xinyi Chen et.al. 2406.19999 link
2024-06-28 Single Parent Family: A Spectrum of Family Members from a Single Pre-Trained Foundation Model Habib Hajimolahoseini et.al. 2406.19995 null
2024-06-28 ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting Rui Pan et.al. 2406.19976 null
2024-06-28 STLLaVA-Med: Self-Training Large Language and Vision Assistant for Medical Guohao Sun et.al. 2406.19973 null
2024-06-28 Into the Unknown: Generating Geospatial Descriptions for New Environments Tzuf Paz-Argaman et.al. 2406.19967 null
2024-06-28 Simulating Financial Market via Large Language Model based Agents Shen Gao et.al. 2406.19966 null
2024-06-27 ReXTime: A Benchmark Suite for Reasoning-Across-Time in Videos Jr-Jen Chen et.al. 2406.19392 link
2024-06-27 The Remarkable Robustness of LLMs: Stages of Inference? Vedang Lad et.al. 2406.19384 link
2024-06-27 The Model Arena for Cross-lingual Sentiment Analysis: A Comparative Study in the Era of Large Language Models Xiliang Zhu et.al. 2406.19358 null
2024-06-27 DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice Questions Nigel Fernandez et.al. 2406.19356 null
2024-06-27 Fundamental Problems With Model Editing: How Should Rational Belief Revision Work in LLMs? Peter Hase et.al. 2406.19354 null
2024-06-27 IndoToxic2024: A Demographically-Enriched Dataset of Hate Speech and Toxicity Types for Indonesian Language Lucky Susanto et.al. 2406.19349 null
2024-06-27 Jump Starting Bandits with LLM-Generated Prior Knowledge Parand A. Alamdari et.al. 2406.19317 null
2024-06-27 MCNC: Manifold Constrained Network Compression Chayne Thrash et.al. 2406.19301 null
2024-06-27 From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data Zheyang Xiong et.al. 2406.19292 null
2024-06-27 PhysioLLM: Supporting Personalized Health Insights with Wearables and Large Language Models Cathy Mengying Fang et.al. 2406.19283 null
2024-06-27 HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale Junying Chen et.al. 2406.19280 link
2024-06-27 VERISCORE: Evaluating the factuality of verifiable claims in long-form text generation Yixiao Song et.al. 2406.19276 link
2024-06-27 AutoPureData: Automated Filtering of Web Data for LLM Fine-tuning Praneeth Vadlapati et.al. 2406.19271 link
2024-06-27 Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding Yue Fan et.al. 2406.19263 link
2024-06-27 Enhancing Video-Language Representations with Structural Spatio-Temporal Alignment Hao Fei et.al. 2406.19255 null
2024-06-27 AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation Jia Fu et.al. 2406.19251 null
2024-06-27 Revealing Fine-Grained Values and Opinions in Large Language Models Dustin Wright et.al. 2406.19238 link
2024-06-28 FlowVQA: Mapping Multimodal Logic in Visual Question Answering with Flowcharts Shubhankar Singh et.al. 2406.19237 null
2024-06-27 Seeing Is Believing: Black-Box Membership Inference Attacks Against Retrieval Augmented Generation Yuying Li et.al. 2406.19234 null
2024-06-28 RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs Ekaterina Taktasheva et.al. 2406.19232 link
2024-06-26 Towards Compositionality in Concept Learning Adam Stein et.al. 2406.18534 link
2024-06-26 Symbolic Learning Enables Self-Evolving Agents Wangchunshu Zhou et.al. 2406.18532 link
2024-06-26 PrExMe! Large Scale Prompt Exploration of Open Source LLMs for Machine Translation and Summarization Evaluation Christoph Leiter et.al. 2406.18528 link
2024-06-26 CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs Zirui Wang et.al. 2406.18521 link
2024-06-26 "Is ChatGPT a Better Explainer than My Professor?": Evaluating the Explanation Capabilities of LLMs in Conversation Compared to a Human Baseline Grace Li et.al. 2406.18512 null
2024-06-26 WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models Liwei Jiang et.al. 2406.18510 link
2024-06-26 Mental Modeling of Reinforcement Learning Agents by Language Models Wenhao Lu et.al. 2406.18505 null
2024-06-26 Is In-Context Learning a Type of Gradient-Based Learning? Evidence from the Inverse Frequency Effect in Structural Priming Zhenghao Zhou et.al. 2406.18501 null
2024-06-26 Role-Play Zero-Shot Prompting with Large Language Models for Open-Domain Human-Machine Conversation Ahmed Njifenjou et.al. 2406.18460 null
2024-06-26 Cascading Large Language Models for Salient Event Graph Generation Xingwei Tan et.al. 2406.18449 link
2024-06-26 New intelligent empowerment for digital transformation Peng Yifeng et.al. 2406.18440 null
2024-06-26 IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying and Reweighting Context-Aware Neurons Dan Shi et.al. 2406.18406 null
2024-06-26 Do LLMs dream of elephants (when told not to)? Latent concept association and associative memory in transformers Yibo Jiang et.al. 2406.18400 null
2024-06-26 Adversarial Search Engine Optimization for Large Language Models Fredrik Nestaas et.al. 2406.18382 null
2024-06-26 MALSIGHT: Exploring Malicious Source Code and Benign Pseudocode for Iterative Binary Malware Summarization Haolang Lu et.al. 2406.18379 null
2024-06-26 Themis: Towards Flexible and Interpretable NLG Evaluation Xinyu Hu et.al. 2406.18365 link
2024-06-26 AI Alignment through Reinforcement Learning from Human Feedback? Contradictions and Limitations Adam Dahlgren Lindström et.al. 2406.18346 null
2024-06-26 PDFA Distillation via String Probability Queries {PDFA Distillation via String Probability Queries} Robert Baumgartner et.al. 2406.18328 link
2024-06-26 PaCoST: Paired Confidence Significance Testing for Benchmark Contamination Detection in Large Language Models Huixuan Zhang et.al. 2406.18326 null
2024-06-26 MathOdyssey: Benchmarking Mathematical Problem-Solving Skills in Large Language Models Using Odyssey Math Data Meng Fang et.al. 2406.18321 null
2024-06-25 MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning Xiangyu Zhao et.al. 2406.17770 link
2024-06-25 EXTRACT: Efficient Policy Learning by Extracting Transferrable Robot Skills from Offline Data Jesse Zhang et.al. 2406.17768 null
2024-06-25 BMIKE-53: Investigating Cross-Lingual Knowledge Editing with In-Context Learning Ercong Nie et.al. 2406.17764 null
2024-06-25 CaLMQA: Exploring culturally specific long-form question answering across 23 languages Shane Arora et.al. 2406.17761 link
2024-06-25 Accelerating Clinical Evidence Synthesis with Large Language Models Zifeng Wang et.al. 2406.17755 null
2024-06-25 Measuring and Benchmarking Large Language Models' Capabilities to Generate Persuasive Language Amalie Brogaard Pauli et.al. 2406.17753 null
2024-06-25 Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon USVSN Sai Prashanth et.al. 2406.17746 link
2024-06-25 Point-SAM: Promptable 3D Segmentation Model for Point Clouds Yuchen Zhou et.al. 2406.17741 link
2024-06-25 Find Parent then Label Children: A Two-stage Taxonomy Completion Method with Pre-trained Language Model Fei Xia et.al. 2406.17739 null
2024-06-25 LLM Targeted Underperformance Disproportionately Impacts Vulnerable Users Elinor Poole-Dayan et.al. 2406.17737 null
2024-06-25 FedBiOT: LLM Local Fine-tuning in Federated Learning without Full Model Feijie Wu et.al. 2406.17706 link
2024-06-25 From Distributional to Overton Pluralism: Investigating Large Language Model Alignment Thom Lake et.al. 2406.17692 link
2024-06-26 VarBench: Robust Language Model Benchmarking Through Dynamic Variable Perturbation Kun Qian et.al. 2406.17681 link
2024-06-25 Quantifying AI Psychology: A Psychometrics Benchmark for Large Language Models Yuan Li et.al. 2406.17675 null
2024-06-25 LaTable: Towards Large Tabular Models Boris van Breugel et.al. 2406.17673 null
2024-06-25 LLM-ARC: Enhancing LLMs with an Automated Reasoning Critic Aditya Kalyanpur et.al. 2406.17663 null
2024-06-25 Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients Aashiq Muhamed et.al. 2406.17660 link
2024-06-25 DKPROMPT: Domain Knowledge Prompting Vision-Language Models for Open-World Planning Xiaohan Zhang et.al. 2406.17659 null
2024-06-25 Leveraging Large Language Models for Software Model Completion: Results from Industrial and Public Datasets Christof Tinnes et.al. 2406.17651 null
2024-06-25 Variationist: Exploring Multifaceted Variation and Bias in Written Language Data Alan Ramponi et.al. 2406.17647 link
2024-06-24 Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs Shengbang Tong et.al. 2406.16860 link
2024-06-24 EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees Yuhui Li et.al. 2406.16858 link
2024-06-24 Long Context Transfer from Language to Vision Peiyuan Zhang et.al. 2406.16852 link
2024-06-24 Losing Visual Needles in Image Haystacks: Vision Language Models are Easily Distracted in Short and Long Contexts Aditya Sharma et.al. 2406.16851 null
2024-06-24 RaTEScore: A Metric for Radiology Report Generation Weike Zhao et.al. 2406.16845 null
2024-06-24 From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models Sean Welleck et.al. 2406.16838 null
2024-06-24 USDC: A Dataset of $\underline{U}$ser $\underline{S}$tance and $\underline{D}$ogmatism in Long $\underline{C}$ onversations Mounika Marreddy et.al. 2406.16833 null
2024-06-24 Understanding and Mitigating Tokenization Bias in Language Models Buu Phan et.al. 2406.16829 null
2024-06-24 Ragnarök: A Reusable RAG Framework and Baselines for TREC 2024 Retrieval-Augmented Generation Track Ronak Pradeep et.al. 2406.16828 link
2024-06-24 GPT-4V Explorations: Mining Autonomous Driving Zixuan Li et.al. 2406.16817 null
2024-06-24 RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale Beck LaBash et.al. 2406.16801 link
2024-06-25 Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs Ashwinee Panda et.al. 2406.16797 link
2024-06-24 Adam-mini: Use Fewer Learning Rates To Gain More Yushun Zhang et.al. 2406.16793 link
2024-06-24 M2Lingual: Enhancing Multilingual, Multi-Turn Instruction Alignment in Large Language Models Rishabh Maheshwary et.al. 2406.16783 null
2024-06-24 It Is Not About What You Say, It Is About How You Say It: A Surprisingly Simple Approach for Improving Reading Comprehension Sagi Shaier et.al. 2406.16779 null
2024-06-24 Finding Transformer Circuits with Edge Pruning Adithya Bhaskar et.al. 2406.16778 link
2024-06-24 Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 2024 Sai Koneru et.al. 2406.16777 null
2024-06-24 WARP: On the Benefits of Weight Averaged Rewarded Policies Alexandre Ramé et.al. 2406.16768 null
2024-06-24 The GPT-WritingPrompts Dataset: A Comparative Analysis of Character Portrayal in Short Stories Xi Yu Huang et.al. 2406.16767 link
2024-06-24 Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters Euiin Yi et.al. 2406.16758 link
2024-06-21 GenoTEX: A Benchmark for Evaluating LLM-Based Exploration of Gene Expression Data in Alignment with Bioinformaticians Haoyang Liu et.al. 2406.15341 link
2024-06-21 Gradient-Mask Tuning Elevates the Upper Limits of LLM Performance Haoling Li et.al. 2406.15330 null
2024-06-21 Bug In the Code Stack: Can LLMs Find Bugs in Large Python Code Stacks Hokyung Lee et.al. 2406.15325 link
2024-06-21 Cognitive Map for Language Models: Optimal Planning via Verbally Representing the World Model Doyoung Kim et.al. 2406.15275 null
2024-06-21 Towards Fine-Grained Citation Evaluation in Generated Text: A Comparative Analysis of Faithfulness Metrics Weijia Zhang et.al. 2406.15264 null
2024-06-21 Unsupervised Morphological Tree Tokenizer Qingyang Zhu et.al. 2406.15245 null
2024-06-21 Large Batch Analysis for Adagrad Under Anisotropic Smoothness Yuxing Liu et.al. 2406.15244 null
2024-06-21 Detecting Synthetic Lyrics with Few-Shot Inference Yanis Labrak et.al. 2406.15231 null
2024-06-21 A LLM-Based Ranking Method for the Evaluation of Automatic Counter-Narrative Generation Irune Zubiaga et.al. 2406.15227 null
2024-06-21 Unsupervised Extraction of Dialogue Policies from Conversations Makesh Narsimhan Sreedhar et.al. 2406.15214 null
2024-06-21 Prompting Whisper for QA-driven Zero-shot End-to-end Spoken Language Understanding Mohan Li et.al. 2406.15209 null
2024-06-21 Exploring the Efficacy of Robotic Assistants with ChatGPT and Claude in Enhancing ADHD Therapy: Innovating Treatment Paradigms Santiago Berrezueta-Guzman et.al. 2406.15198 null
2024-06-21 UDA: A Benchmark Suite for Retrieval Augmented Generation in Real-world Document Analysis Yulong Hui et.al. 2406.15187 link
2024-06-21 Hybrid Alignment Training for Large Language Models Chenglong Wang et.al. 2406.15178 link
2024-06-21 EmpathyEar: An Open-source Avatar Multimodal Empathetic Chatbot Hao Fei et.al. 2406.15177 link
2024-06-21 Enhancing Idiomatic Representation in Multiple Languages via an Adaptive Contrastive Triplet Loss Wei He et.al. 2406.15175 null
2024-06-21 Évaluation des capacités de réponse de larges modèles de langage (LLM) pour des questions d'historiens Mathieu Chartier et.al. 2406.15173 null
2024-06-21 Assessing Good, Bad and Ugly Arguments Generated by ChatGPT: a New Dataset, its Methodology and Associated Tasks Victor Hugo Nascimento Rocha et.al. 2406.15130 link
2024-06-21 Brain-Like Language Processing via a Shallow Untrained Multihead Attention Network Badr AlKhamissi et.al. 2406.15109 link
2024-06-21 PARIKSHA : A Large-Scale Investigation of Human-LLM Evaluator Agreement on Multilingual and Multi-Cultural Data Ishaan Watts et.al. 2406.15053 null
2024-06-20 Model Merging and Safety Alignment: One Bad Model Spoils the Bunch Hasan Abed Al Kader Hammoud et.al. 2406.14563 null
2024-06-20 Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities Sachit Menon et.al. 2406.14562 null
2024-06-20 How to Compute the Probability of a Word Tiago Pimentel et.al. 2406.14561 null
2024-06-21 Asynchronous Large Language Model Enhanced Planner for Autonomous Driving Yuan Chen et.al. 2406.14556 link
2024-06-20 GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models Shilong Li et.al. 2406.14550 null
2024-06-20 Uncovering Latent Memories: Assessing Data Leakage and Memorization Patterns in Large Language Models Sunny Duan et.al. 2406.14549 null
2024-06-20 Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data Johannes Treutlein et.al. 2406.14546 link
2024-06-20 Unmasking Database Vulnerabilities: Zero-Knowledge Schema Inference Attacks in Text-to-SQL Systems Đorđe Klisura et.al. 2406.14545 null
2024-06-20 Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs Yuxuan Qiao et.al. 2406.14544 link
2024-06-21 Are LLMs Naturally Good at Synthetic Tabular Data Generation? Shengzhe Xu et.al. 2406.14541 link
2024-06-20 PostMark: A Robust Blackbox Watermark for Large Language Models Yapei Chang et.al. 2406.14517 link
2024-06-20 MMBench-Video: A Long-Form Multi-Shot Benchmark for Holistic Video Understanding Xinyu Fang et.al. 2406.14515 link
2024-06-20 Evidence of a log scaling law for political persuasion with large language models Kobi Hackenburg et.al. 2406.14508 link
2024-06-20 Overview of the CAIL 2023 Argument Mining Track Jingcong Liang et.al. 2406.14503 null
2024-06-20 Improving Expert Radiology Report Summarization by Prompting Large Language Models with a Layperson Summary Xingmeng Zhao et.al. 2406.14500 null
2024-06-20 LLaSA: Large Multimodal Agent for Human Activity Analysis Through Wearable Sensors Sheikh Asif Imran et.al. 2406.14498 link
2024-06-20 CodeRAG-Bench: Can Retrieval Augment Code Generation? Zora Zhiruo Wang et.al. 2406.14497 link
2024-06-20 African or European Swallow? Benchmarking Large Vision-Language Models for Fine-Grained Object Classification Gregor Geigle et.al. 2406.14496 link
2024-06-20 Does Object Grounding Really Reduce Hallucination of Large Vision-Language Models? Gregor Geigle et.al. 2406.14492 null
2024-06-20 Instruction Pre-Training: Language Models are Supervised Multitask Learners Daixuan Cheng et.al. 2406.14491 link
2024-06-18 DrVideo: Document Retrieval Based Long Video Understanding Ziyu Ma et.al. 2406.12846 null
2024-06-18 Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts Haoxiang Wang et.al. 2406.12845 link
2024-06-18 Synergizing Foundation Models and Federated Learning: A Survey Shenghui Li et.al. 2406.12844 null
2024-06-18 GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation Ci-Siang Lin et.al. 2406.12834 null
2024-06-18 LaMDA: Large Model Fine-Tuning via Spectrally Decomposed Low-Dimensional Adaptation Seyedarmin Azizi et.al. 2406.12832 link
2024-06-18 What Are the Odds? Language Models Are Capable of Probabilistic Reasoning Akshay Paruchuri et.al. 2406.12830 null
2024-06-18 From RAGs to rich parameters: Probing how language models utilize external knowledge over parametric information for factual queries Hitesh Wadhwa et.al. 2406.12824 null
2024-06-18 Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models? Pinzhen Chen et.al. 2406.12822 null
2024-06-18 Adversarial Attacks on Multimodal Agents Chen Henry Wu et.al. 2406.12814 link
2024-06-18 Can Large Language Models Always Solve Easy Problems if They Can Solve Harder Ones? Zhe Yang et.al. 2406.12809 null
2024-06-18 Identifying Performance-Sensitive Configurations in Software Systems through Code Analysis with LLM Agents Zehao Wang et.al. 2406.12806 null
2024-06-18 Supporting Human Raters with the Detection of Harmful Content using Large Language Models Kurt Thomas et.al. 2406.12800 null
2024-06-18 ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools Team GLM et.al. 2406.12793 link
2024-06-18 In-Context Learning of Energy Functions Rylan Schaeffer et.al. 2406.12785 null
2024-06-18 UBENCH: Benchmarking Uncertainty in Large Language Models with Multiple Choice Questions Xunzhi Wang et.al. 2406.12784 link
2024-06-18 Hopping Too Late: Exploring the Limitations of Large Language Models on Multi-Hop Queries Eden Biran et.al. 2406.12775 link
2024-06-18 Towards Exact Gradient-based Training on Analog In-memory Computing Zhaoxian Wu et.al. 2406.12774 null
2024-06-18 GFM4MPM: Towards Geospatial Foundation Models for Mineral Prospectivity Mapping Angel Daruna et.al. 2406.12756 null
2024-06-18 OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI Zhen Huang et.al. 2406.12753 link
2024-06-18 Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning Bingchen Zhao et.al. 2406.12742 link
2024-06-17 LLaNA: Large Language and NeRF Assistant Andrea Amaduzzi et.al. 2406.11840 null
2024-06-17 mDPO: Conditional Preference Optimization for Multimodal Large Language Models Fei Wang et.al. 2406.11839 null
2024-06-17 MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs Ziyu Liu et.al. 2406.11833 link
2024-06-17 Unveiling Encoder-Free Vision-Language Models Haiwen Diao et.al. 2406.11832 link
2024-06-17 Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models Bingqi Ma et.al. 2406.11831 null
2024-06-17 Language Modeling with Editable External Knowledge Belinda Z. Li et.al. 2406.11830 link
2024-06-17 WPO: Enhancing RLHF with Weighted Preference Optimization Wenxuan Zhou et.al. 2406.11827 link
2024-06-17 On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning Geewook Kim et.al. 2406.11823 link
2024-06-17 MegaScenes: Scene-Level View Synthesis at Scale Joseph Tung et.al. 2406.11819 link
2024-06-17 Embodied Instruction Following in Unknown Environments Zhenyu Wu et.al. 2406.11818 null
2024-06-17 Iterative Length-Regularized Direct Preference Optimization: A Case Study on Improving 7B Language Models to GPT-4 Level Jie Liu et.al. 2406.11817 null
2024-06-17 VideoLLM-online: Online Video Large Language Model for Streaming Video Joya Chen et.al. 2406.11816 null
2024-06-17 How Do Large Language Models Acquire Factual Knowledge During Pretraining? Hoyeon Chang et.al. 2406.11813 null
2024-06-17 RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content Joao Monteiro et.al. 2406.11811 null
2024-06-17 Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations Rima Hazra et.al. 2406.11801 link
2024-06-17 DataComp-LM: In search of the next generation of training sets for language models Jeffrey Li et.al. 2406.11794 null
2024-06-17 CELL your Model: Contrastive Explanation Methods for Large Language Models Ronny Luss et.al. 2406.11785 null
2024-06-17 Split, Unlearn, Merge: Leveraging Data Attributes for More Effective Unlearning in LLMs Swanand Ravindra Kadhe et.al. 2406.11780 null
2024-06-17 Improving Multi-Agent Debate with Sparse Communication Topology Yunxuan Li et.al. 2406.11776 null
2024-06-17 Task Me Anything Jieyu Zhang et.al. 2406.11775 link
2024-06-14 Quantifying Variance in Evaluation Benchmarks Lovish Madaan et.al. 2406.10229 null
2024-06-14 EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models Julian Straub et.al. 2406.10224 null
2024-06-14 Short Film Dataset (SFD): A Benchmark for Story-Level Video Understanding Ridouane Ghermi et.al. 2406.10221 link
2024-06-14 Semantic Membership Inference Attack against Large Language Models Hamid Mozaffari et.al. 2406.10218 null
2024-06-14 Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs Rui Yang et.al. 2406.10216 null
2024-06-14 DevBench: A multimodal developmental benchmark for language learning Alvin Wei Ming Tan et.al. 2406.10215 link
2024-06-14 Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs Abhimanyu Hans et.al. 2406.10209 link
2024-06-14 A Fundamental Trade-off in Aligned Language Models and its Relation to Sampling Adaptors Naaman Tan et.al. 2406.10203 link
2024-06-14 TRIP-PAL: Travel Planning with Guarantees by Combining Large Language Models and Automated Planners Tomas de la Rosa et.al. 2406.10196 null
2024-06-14 Detecting and Evaluating Medical Hallucinations in Large Vision Language Models Jiawei Chen et.al. 2406.10185 null
2024-06-14 Practical offloading for fine-tuning LLM on commodity GPU via learned subspace projectors Siyuan Chen et.al. 2406.10181 null
2024-06-14 Let the Poem Hit the Rhythm: Using a Byte-Based Transformer for Beat-Aligned Poetry Generation Mohamad Elzohbi et.al. 2406.10174 link
2024-06-14 IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Language Models in E-commerce Wenxuan Ding et.al. 2406.10173 link
2024-06-14 Datasets for Multilingual Answer Sentence Selection Matteo Gabburo et.al. 2406.10172 null
2024-06-14 CarLLaVA: Vision language models for camera-only closed-loop driving Katrin Renz et.al. 2406.10165 null
2024-06-14 Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models Carson Denison et.al. 2406.10162 link
2024-06-14 RoboGolf: Mastering Real-World Minigolf with a Reflective Multi-Modality Vision-Language Model Hantao Zhou et.al. 2406.10157 null
2024-06-14 BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack Yuri Kuratov et.al. 2406.10149 link
2024-06-14 Evaluation of Large Language Models: STEM education and Gender Stereotypes Smilla Due et.al. 2406.10133 null
2024-06-14 The Devil is in the Neurons: Interpreting and Mitigating Social Biases in Pre-trained Language Models Yan Liu et.al. 2406.10130 link
2024-06-13 VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding Muhammad Maaz et.al. 2406.09418 link
2024-06-13 Explore the Limits of Omni-modal Pretraining at Scale Yiyuan Zhang et.al. 2406.09412 link
2024-06-13 4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities Roman Bachmann et.al. 2406.09406 null
2024-06-13 Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models Yushi Hu et.al. 2406.09403 null
2024-06-13 OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation Junke Wang et.al. 2406.09399 link
2024-06-13 Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms Miaosen Zhang et.al. 2406.09397 null
2024-06-13 Too Many Frames, not all Useful:Efficient Strategies for Long-Form Video QA Jongwoo Park et.al. 2406.09396 link
2024-06-13 Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition Youngtaek Oh et.al. 2406.09388 link
2024-06-13 Towards Vision-Language Geo-Foundation Model: A Survey Yue Zhou et.al. 2406.09385 link
2024-06-13 Reflecting on the State of Rehearsal-free Continual Learning with Pretrained Models Lukas Thede et.al. 2406.09384 null
2024-06-13 Needle In A Video Haystack: A Scalable Synthetic Framework for Benchmarking Video MLLMs Zijia Zhao et.al. 2406.09367 link
2024-06-13 ElicitationGPT: Text Elicitation Mechanisms via Language Models Yifan Wu et.al. 2406.09363 null
2024-06-13 Enhancing Domain Adaptation through Prompt Gradient Alignment Hoang Phan et.al. 2406.09353 null
2024-06-13 Separations in the Representational Capabilities of Transformers and Recurrent Architectures Satwik Bhattamishra et.al. 2406.09347 null
2024-06-13 DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding Suwon Shon et.al. 2406.09345 null
2024-06-13 ProxyLM: Predicting Language Model Performance on Multilingual Tasks via Proxy Models David Anugraha et.al. 2406.09334 link
2024-06-13 REVS: Unlearning Sensitive Information in Language Models via Rank Editing in the Vocabulary Space Tomer Ashuach et.al. 2406.09325 null
2024-06-13 Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMs Zhao Xu et.al. 2406.09324 link
2024-06-13 JailbreakEval: An Integrated Toolkit for Evaluating Jailbreak Attempts Against Large Language Models Delong Ran et.al. 2406.09321 link
2024-06-13 Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases Meng Wang et.al. 2406.09317 link
2024-06-12 What If We Recaption Billions of Web Images with LLaMA-3? Xianhang Li et.al. 2406.08478 null
2024-06-12 Improving LLMs for Recommendation with Out-Of-Vocabulary Tokens Ting-Ji Huang et.al. 2406.08477 null
2024-06-12 Real2Code: Reconstruct Articulated Objects via Code Generation Zhao Mandi et.al. 2406.08474 null
2024-06-12 PAL: Pluralistic Alignment Framework for Learning from Heterogeneous Preferences Daiwei Chen et.al. 2406.08469 null
2024-06-12 Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing Zhangchen Xu et.al. 2406.08464 link
2024-06-12 AToM-Bot: Embodied Fulfillment of Unspoken Human Needs with Affective Theory of Mind Wei Ding et.al. 2406.08455 null
2024-06-12 OLMES: A Standard for Language Model Evaluations Yuling Gu et.al. 2406.08446 null
2024-06-12 SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models Chun Yin et.al. 2406.08445 null
2024-06-12 TasTe: Teaching Large Language Models to Translate through Self-Reflection Yutong Wang et.al. 2406.08434 link
2024-06-12 Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL Zijin Hong et.al. 2406.08426 null
2024-06-12 OmniCorpus: An Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text Qingyun Li et.al. 2406.08418 link
2024-06-12 Discovering Preference Optimization Algorithms with and for Large Language Models Chris Lu et.al. 2406.08414 link
2024-06-12 Memory Is All You Need: An Overview of Compute-in-Memory Architectures for Accelerating Large Language Model Inference Christopher Wolters et.al. 2406.08413 null
2024-06-13 MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos Xuehai He et.al. 2406.08407 link
2024-06-12 Understanding Sounds, Missing the Questions: The Challenge of Object Hallucination in Large Audio-Language Models Chun-Yi Kuan et.al. 2406.08402 link
2024-06-12 cPAPERS: A Dataset of Situated and Multimodal Interactive Conversations in Scientific Papers Anirudh Sundar et.al. 2406.08398 null
2024-06-12 VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks Jiannan Wu et.al. 2406.08394 link
2024-06-12 Large Language Models Must Be Taught to Know What They Don't Know Sanyam Kapoor et.al. 2406.08391 link
2024-06-12 Banal Deception Human-AI Ecosystems: A Study of People's Perceptions of LLM-generated Deceptive Behaviour Xiao Zhan et.al. 2406.08386 null
2024-06-13 APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation Weizhao He et.al. 2406.08372 null
2024-06-11 A3VLM: Actionable Articulation-Aware Vision Language Model Siyuan Huang et.al. 2406.07549 link
2024-06-11 Image and Video Tokenization with Binary Spherical Quantization Yue Zhao et.al. 2406.07548 link
2024-06-11 Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena Aidar Myrzakhan et.al. 2406.07545 link
2024-06-11 QuickLLaMA: Query-aware Inference Acceleration for Large Language Models Jingyao Li et.al. 2406.07528 link
2024-06-11 Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo et.al. 2406.07524 link
2024-06-11 Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Liliang Ren et.al. 2406.07522 link
2024-06-11 Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement Yunzhen Feng et.al. 2406.07515 null
2024-06-11 THaLLE: Text Hyperlocally Augmented Large Language Extension -- Technical Report KBTG Labs et.al. 2406.07505 null
2024-06-11 Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions Renjie Pi et.al. 2406.07502 link
2024-06-11 TextGrad: Automatic "Differentiation" via Text Mert Yuksekgonul et.al. 2406.07496 link
2024-06-12 CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization Frederic Kirstein et.al. 2406.07494 null
2024-06-11 Paraphrasing in Affirmative Terms Improves Negation Understanding MohammadHossein Rezaei et.al. 2406.07492 null
2024-06-11 PITCH: Productivity and Mental Well-being Coaching through Daily Conversational Interaction Adnan Abbas et.al. 2406.07485 null
2024-06-11 Advancing Annotation of Stance in Social Media Posts: A Comparative Analysis of Large Language Models and Crowd Sourcing Mao Li et.al. 2406.07483 null
2024-06-11 VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs Zesen Cheng et.al. 2406.07476 link
2024-06-11 Anomaly Detection on Unstable Logs with GPT Models Fatemeh Hadadi et.al. 2406.07467 null
2024-06-11 Estimating the Hallucination Rate of Generative AI Andrew Jesson et.al. 2406.07457 null
2024-06-11 Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis Qining Zhang et.al. 2406.07455 null
2024-06-11 On the Robustness of Document-Level Relation Extraction Models to Entity Name Variations Shiao Meng et.al. 2406.07444 link
2024-06-11 McEval: Massively Multilingual Code Evaluation Linzheng Chai et.al. 2406.07436 null
2024-06-10 Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation Peize Sun et.al. 2406.06525 link
2024-06-10 UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance Assessor Shivani Upadhyay et.al. 2406.06519 link
2024-06-10 Merlin: A Vision Language Foundation Model for 3D Computed Tomography Louis Blankemeier et.al. 2406.06512 null
2024-06-10 NarrativeBridge: Enhancing Video Captioning with Causal-Temporal Narrative Asmar Nadeem et.al. 2406.06499 null
2024-06-10 Direct Preference Optimization for Suppressing Hallucinated Prior Exams in Radiology Report Generation Oishi Banerjee et.al. 2406.06496 null
2024-06-10 Can Language Models Serve as Text-Based World Simulators? Ruoyao Wang et.al. 2406.06485 null
2024-06-10 Parallelizing Linear Transformers with the Delta Rule over Sequence Length Songlin Yang et.al. 2406.06484 link
2024-06-10 Towards a Personal Health Large Language Model Justin Cosentino et.al. 2406.06474 null
2024-06-10 AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction Zhen Xing et.al. 2406.06465 null
2024-06-10 Transforming Wearable Data into Health Insights using Large Language Model Agents Mike A. Merrill et.al. 2406.06464 null
2024-06-10 VCR: Visual Caption Restoration Tianyu Zhang et.al. 2406.06462 link
2024-06-11 Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies Junlin Wang et.al. 2406.06461 null
2024-06-10 Evaluating the Retrieval Component in LLM-Based Question Answering Systems Ashkan Alinejad et.al. 2406.06458 null
2024-06-10 A Large Language Model Pipeline for Breast Cancer Oncology Tristen Pool et.al. 2406.06455 null
2024-06-10 Insights from Social Shaping Theory: The Appropriation of Large Language Models in an Undergraduate Programming Course Aadarsh Padiyath et.al. 2406.06451 null
2024-06-10 LLM Dataset Inference: Did you train on my dataset? Pratyush Maini et.al. 2406.06443 link
2024-06-10 Interpretability of Language Models via Task Spaces Lucas Weber et.al. 2406.06441 null
2024-06-10 Language Models are Alignable Decision-Makers: Dataset and Application to the Medical Triage Domain Brian Hu et.al. 2406.06435 link
2024-06-10 Multivariate Stochastic Dominance via Optimal Transport and Applications to Models Benchmarking Gabriel Rioux et.al. 2406.06425 null
2024-06-10 An Empirical Design Justice Approach to Identifying Ethical Considerations in the Intersection of Large Language Models and Social Robotics Alva Markelius et.al. 2406.06400 null
2024-06-07 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs Jianing Yang et.al. 2406.05132 link
2024-06-07 An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models Xiongtao Zhou et.al. 2406.05130 null
2024-06-07 Towards Semantic Equivalence of Tokenization in Multimodal LLM Shengqiong Wu et.al. 2406.05127 null
2024-06-07 Large Generative Graph Models Yu Wang et.al. 2406.05109 null
2024-06-07 LINX: A Language Driven Generative System for Goal-Oriented Automated Data Exploration Tavor Lipman et.al. 2406.05107 null
2024-06-07 Corpus Poisoning via Approximate Greedy Gradient Descent Jinyan Su et.al. 2406.05087 link
2024-06-07 Multi-Head RAG: Solving Multi-Aspect Problems with LLMs Maciej Besta et.al. 2406.05085 link
2024-06-07 SUMIE: A Synthetic Benchmark for Incremental Entity Summarization Eunjeong Hwang et.al. 2406.05079 null
2024-06-07 Are Large Language Models More Empathetic than Humans? Anuradha Welivita et.al. 2406.05063 null
2024-06-07 Robustness Assessment of Mathematical Reasoning in the Presence of Missing and Contradictory Conditions Shi-Yu Tian et.al. 2406.05055 null
2024-06-07 Hints-In-Browser: Benchmarking Language Models for Programming Feedback Generation Nachiket Kotalwar et.al. 2406.05053 null
2024-06-07 Bootstrapping Referring Multi-Object Tracking Yani Zhang et.al. 2406.05039 link
2024-06-07 Scenarios and Approaches for Situated Natural Language Explanations Pengshuo Qiu et.al. 2406.05035 null
2024-06-07 CHIQ: Contextual History Enhancement for Improving Query Rewriting in Conversational Search Fengran Mo et.al. 2406.05013 link
2024-06-07 Compositional Generalization with Grounded Language Models Sondre Wold et.al. 2406.04989 link
2024-06-07 Language models emulate certain cognitive profiles: An investigation of how predictability measures interact with individual differences Patrick Haller et.al. 2406.04988 link
2024-06-07 MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter Jitai Hao et.al. 2406.04984 link
2024-06-07 CityCraft: A Real Crafter for 3D City Generation Jie Deng et.al. 2406.04983 null
2024-06-07 Quantifying Geospatial in the Common Crawl Corpus Ilya Ilyankou et.al. 2406.04952 null
2024-06-07 BAMO at SemEval-2024 Task 9: BRAINTEASER: A Novel Task Defying Common Sense Baktash Ansari et.al. 2406.04947 link
2024-06-06 Verbalized Machine Learning: Revisiting Machine Learning with Language Models Tim Z. Xiao et.al. 2406.04344 null
2024-06-06 Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image Stanislaw Szymanowicz et.al. 2406.04343 link
2024-06-06 Learning 1D Causal Visual Representation with De-focus Attention Networks Chenxin Tao et.al. 2406.04342 link
2024-06-06 RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation Jiaming Liu et.al. 2406.04339 null
2024-06-06 Coherent Zero-Shot Visual Instruction Generation Quynh Phung et.al. 2406.04337 null
2024-06-06 DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs Lingchen Meng et.al. 2406.04334 null
2024-06-06 PaCE: Parsimonious Concept Engineering for Large Language Models Jinqi Luo et.al. 2406.04331 link
2024-06-06 Parameter-Inverted Image Pyramid Networks Xizhou Zhu et.al. 2406.04330 link
2024-06-06 Simplified and Generalized Masked Diffusion for Discrete Data Jiaxin Shi et.al. 2406.04329 null
2024-06-06 Causal Estimation of Memorisation Profiles Pietro Lesci et.al. 2406.04327 link
2024-06-06 ShareGPT4Video: Improving Video Understanding and Generation with Better Captions Lin Chen et.al. 2406.04325 null
2024-06-06 Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step Zhanhao Liang et.al. 2406.04314 null
2024-06-06 Improving Alignment and Robustness with Short Circuiting Andy Zou et.al. 2406.04313 link
2024-06-06 Semantically Diverse Language Generation for Uncertainty Estimation in Language Models Lukas Aichberger et.al. 2406.04306 link
2024-06-06 Quixer: A Quantum Transformer Model Nikhil Khatri et.al. 2406.04305 null
2024-06-06 Text-to-Drive: Diverse Driving Behavior Synthesis via Large Language Models Phat Nguyen et.al. 2406.04300 null
2024-06-06 VISTA: Visualized Text Embedding For Universal Multi-Modal Retrieval Junjie Zhou et.al. 2406.04292 link
2024-06-06 Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation Adam Fisch et.al. 2406.04291 null
2024-06-07 What Languages are Easy to Language-Model? A Perspective from Learning Probabilistic Regular Languages Nadav Borenstein et.al. 2406.04289 null
2024-06-06 Characterizing Similarities and Divergences in Conversational Tones in Humans and LLMs by Sampling with People Dun-Ming Huang et.al. 2406.04278 link
2024-06-05 Wings: Learning Multimodal LLMs without Text-only Forgetting Yi-Kai Zhang et.al. 2406.03496 null
2024-06-06 Seq1F1B: Efficient Sequence-Level Pipeline Parallelism for Large Language Model Training Ao Sun et.al. 2406.03488 link
2024-06-05 Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends Sanjana Ramprasad et.al. 2406.03487 null
2024-06-05 BIPED: Pedagogically Informed Tutoring System for ESL Education Soonwoo Kwon et.al. 2406.03486 null
2024-06-05 Does your data spark joy? Performance gains from domain upsampling at the end of training Cody Blakeney et.al. 2406.03476 null
2024-06-05 AD-H: Autonomous Driving with Hierarchical Agents Zaibin Zhang et.al. 2406.03474 null
2024-06-05 What is the Best Way for ChatGPT to Translate Poetry? Shanshan Wang et.al. 2406.03450 null
2024-06-05 Pre-trained Large Language Models Use Fourier Features to Compute Addition Tianyi Zhou et.al. 2406.03445 null
2024-06-05 Are language models rational? The case of coherence norms and belief revision Thomas Hofweber et.al. 2406.03442 null
2024-06-05 Cycles of Thought: Measuring LLM Confidence through Stable Explanations Evan Becker et.al. 2406.03441 null
2024-06-05 Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis Moein Heidari et.al. 2406.03430 link
2024-06-05 Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach Saehyung Lee et.al. 2406.03411 link
2024-06-05 Automating Turkish Educational Quiz Generation Using Large Language Models Kamyar Zeinalipour et.al. 2406.03397 link
2024-06-05 Log Parsing with Self-Generated In-Context Learning and Self-Correction Yifan Wu et.al. 2406.03376 null
2024-06-05 IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models David Ifeoluwa Adelani et.al. 2406.03368 null
2024-06-05 CLMASP: Coupling Large Language Models with Answer Set Programming for Robotic Task Planning Xinrui Lin et.al. 2406.03367 null
2024-06-05 LLM-based Rewriting of Inappropriate Argumentation using Reinforcement Learning from Machine Feedback Timon Ziegenbein et.al. 2406.03363 null
2024-06-05 Save It for the "Hot" Day: An LLM-Empowered Visual Analytics System for Heat Risk Management Haobo Li et.al. 2406.03317 null
2024-06-05 The Good, the Bad, and the Hulk-like GPT: Analyzing Emotional Decisions of Large Language Models in Cooperation and Bargaining Games Mikhail Mozikov et.al. 2406.03299 null
2024-06-05 SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking Mechanisms Xingrun Xing et.al. 2406.03287 link
2024-06-04 Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasks Tianyu He et.al. 2406.02550 link
2024-06-04 Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation Mohamed El Amine Boudjoghra et.al. 2406.02548 link
2024-06-04 Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning Alex Jinpeng Wang et.al. 2406.02547 link
2024-06-04 To Believe or Not to Believe Your LLM Yasin Abbasi Yadkori et.al. 2406.02543 null
2024-06-04 Loki: Low-Rank Keys for Efficient Sparse Attention Prajwal Singhania et.al. 2406.02542 null
2024-06-04 Parrot: Multilingual Visual Instruction Tuning Hai-Long Sun et.al. 2406.02539 link
2024-06-04 TopViewRS: Vision-Language Models as Top-View Spatial Reasoners Chengzu Li et.al. 2406.02537 link
2024-06-04 Mitigate Position Bias in Large Language Models via Scaling a Single Dimension Yijiong Yu et.al. 2406.02536 link
2024-06-04 SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices Ruslan Svirschevski et.al. 2406.02532 link
2024-06-04 Scalable MatMul-free Language Modeling Rui-Jie Zhu et.al. 2406.02528 link
2024-06-04 CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks Maciej Besta et.al. 2406.02524 link
2024-06-04 RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots Soroush Nasiriany et.al. 2406.02523 null
2024-06-04 Demystifying the Compression of Mixture-of-Experts Through a Unified Framework Shwai He et.al. 2406.02500 link
2024-06-04 Hiding Text in Large Language Models: Introducing Unconditional Token Forcing Confusion Jakub Hoscilowicz et.al. 2406.02481 link
2024-06-04 Analyzing Temporal Complex Events with Large Language Models? A Benchmark towards Temporal, Long Context Understanding Zhihan Zhang et.al. 2406.02472 link
2024-06-04 Meta-Designing Quantum Experiments with Language Models Sören Arlt et.al. 2406.02470 null
2024-06-04 Seed-TTS: A Family of High-Quality Versatile Speech Generation Models Philip Anastassiou et.al. 2406.02430 link
2024-06-04 Self-Supervised Singing Voice Pre-Training towards Speech-to-Singing Conversion Ruiqi Li et.al. 2406.02429 null
2024-06-04 GrootVL: Tree Topology is All You Need in State Space Model Yicheng Xiao et.al. 2406.02395 link
2024-06-04 Multiple Choice Questions and Large Languages Models: A Case Study with Fictional Medical Data Maxime Griot et.al. 2406.02394 link
2024-05-31 Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis Chaoyou Fu et.al. 2405.21075 null
2024-05-31 Code Pretraining Improves Entity Tracking Abilities of Language Models Najoung Kim et.al. 2405.21068 null
2024-05-31 Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality Tri Dao et.al. 2405.21060 link
2024-05-31 RydbergGPT David Fitzek et.al. 2405.21052 link
2024-05-31 Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling Jiatao Gu et.al. 2405.21048 null
2024-05-31 Grammar-Aligned Decoding Kanghee Park et.al. 2405.21047 null
2024-05-31 Exploratory Preference Optimization: Harnessing Implicit Q-Approximation for Sample-Efficient RLHF* Tengyang Xie et.al. 2405.21046 null
2024-05-31 Direct Alignment of Language Models via Quality-Aware Self-Refinement Runsheng Yu et.al. 2405.21040 null
2024-05-31 Standards for Belief Representations in LLMs Daniel A. Herrmann et.al. 2405.21030 null
2024-05-31 LACIE: Listener-Aware Finetuning for Confidence Calibration in Large Language Models Elias Stengel-Eskin et.al. 2405.21028 link
2024-05-31 You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet Zhen Qin et.al. 2405.21022 null
2024-05-31 Improved Techniques for Optimization-Based Jailbreaking on Large Language Models Xiaojun Jia et.al. 2405.21018 link
2024-06-04 StrucTexTv3: An Efficient Vision-Language Model for Text-rich Image Perception, Comprehension, and Beyond Pengyuan Lyu et.al. 2405.21013 null
2024-05-31 Hard Cases Detection in Motion Prediction by Vision-Language Foundation Models Yi Yang et.al. 2405.20991 link
2024-05-31 DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models Linli Yao et.al. 2405.20985 link
2024-05-31 Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training Feiteng Fang et.al. 2405.20978 link
2024-05-31 SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales Tianyang Xu et.al. 2405.20974 link
2024-05-31 LCQ: Low-Rank Codebook based Quantization for Large Language Models Wen-Pu Cai et.al. 2405.20973 null
2024-06-03 Large Language Models are Zero-Shot Next Location Predictors Ciro Beneduce et.al. 2405.20962 link
2024-06-03 A Robot Walks into a Bar: Can Language Models Serve as Creativity Support Tools for Comedy? An Evaluation of LLMs' Humour Alignment with Comedians Piotr Wojciech Mirowski et.al. 2405.20956 null
2024-05-30 MotionLLM: Understanding Human Behaviors from Human Motions and Videos Ling-Hao Chen et.al. 2405.20340 link
2024-05-30 Visual Perception by Large Language Model's Weights Feipeng Ma et.al. 2405.20339 null
2024-05-30 Xwin-LM: Strong and Scalable Alignment Practice for LLMs Bolin Ni et.al. 2405.20335 link
2024-05-31 ParSEL: Parameterized Shape Editing with Language Aditya Ganeshan et.al. 2405.20319 null
2024-05-30 CausalQuest: Collecting Natural Causal Questions for AI Agents Roberto Ceraolo et.al. 2405.20318 link
2024-05-30 ANAH: Analytical Annotation of Hallucinations in Large Language Models Ziwei Ji et.al. 2405.20315 link
2024-05-30 Sequence-Augmented SE(3)-Flow Matching For Conditional Protein Backbone Generation Guillaume Huguet et.al. 2405.20313 null
2024-05-30 Large Language Models Can Self-Improve At Web Agent Tasks Ajay Patel et.al. 2405.20309 link
2024-05-30 Can't make an Omelette without Breaking some Eggs: Plausible Action Anticipation using Large Video-Language Models Himangi Mittal et.al. 2405.20305 null
2024-05-30 Group Robust Preference Optimization in Reward-free RLHF Shyam Sundhar Ramesh et.al. 2405.20304 link
2024-05-30 Who Writes the Review, Human or AI? Panagiotis C. Theocharopoulos et.al. 2405.20285 null
2024-05-30 ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections Massimo Bini et.al. 2405.20271 link
2024-05-30 Evaluating Large Language Model Biases in Persona-Steered Generation Andy Liu et.al. 2405.20253 link
2024-05-30 Towards Hierarchical Multi-Agent Workflows for Zero-Shot Prompt Optimization Yuchi Liu et.al. 2405.20252 link
2024-05-30 Retrieval Augmented Structured Generation: Business Document Information Extraction As Tool Use Franz Louis Cesista et.al. 2405.20245 null
2024-05-30 Context Injection Attacks on Large Language Models Cheng'an Wei et.al. 2405.20234 null
2024-05-30 Data-efficient fine-tuning of foundational models for first-principles quality sublimation enthalpies Harveen Kaur et.al. 2405.20217 null
2024-05-30 TS-Align: A Teacher-Student Collaborative Framework for Scalable Iterative Finetuning of Large Language Models Chen Zhang et.al. 2405.20215 null
2024-05-30 One QuantLLM for ALL: Fine-tuning Quantized LLMs Once for Efficient Deployments Ke Yi et.al. 2405.20202 null
2024-05-31 Using Large Language Models for Humanitarian Frontline Negotiation: Opportunities and Considerations Zilin Ma et.al. 2405.20195 null
2024-05-29 X-VILA: Cross-Modality Alignment for Large Language Model Hanrong Ye et.al. 2405.19335 null
2024-05-29 LLMs Meet Multimodal Generation and Editing: A Survey Yingqing He et.al. 2405.19334 link
2024-05-29 Multi-Modal Generative Embedding Model Feipeng Ma et.al. 2405.19333 null
2024-05-29 Self-Exploring Language Models: Active Preference Elicitation for Online Alignment Shenao Zhang et.al. 2405.19332 link
2024-05-29 Normative Modules: A Generative Agent Architecture for Learning Norms that Supports Multi-Agent Cooperation Atrisha Sarkar et.al. 2405.19328 null
2024-05-29 MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series Ge Zhang et.al. 2405.19327 link
2024-05-29 Reasoning3D -- Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models Tianrun Chen et.al. 2405.19326 null
2024-05-29 Nearest Neighbor Speculative Decoding for LLM Generation and Attribution Minghan Li et.al. 2405.19325 null
2024-05-29 Are Large Language Models Chameleons? Mingmeng Geng et.al. 2405.19323 null
2024-05-29 Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF Shicong Cen et.al. 2405.19320 null
2024-05-29 Robust Preference Optimization through Reward Model Distillation Adam Fisch et.al. 2405.19316 null
2024-05-29 Matryoshka Query Transformer for Large Vision-Language Models Wenbo Hu et.al. 2405.19315 link
2024-05-29 Language Models Trained to do Arithmetic Predict Human Risky and Intertemporal Choice Jian-Qiao Zhu et.al. 2405.19313 null
2024-05-29 Expert-Guided Extinction of Toxic Tokens for Debiased Generation Xueyao Sun et.al. 2405.19299 null
2024-05-29 MASSIVE Multilingual Abstract Meaning Representation: A Dataset and Baselines for Hallucination Detection Michael Regan et.al. 2405.19285 null
2024-05-29 Optimizing Foundation Model Inference on a Many-tiny-core Open-source RISC-V Platform Viviane Potocnik et.al. 2405.19284 null
2024-05-29 Programmable Motion Generation for Open-Set Motion Control Tasks Hanchao Liu et.al. 2405.19283 null
2024-05-29 PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications Dingkang Yang et.al. 2405.19266 null
2024-05-29 AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data Zifan Song et.al. 2405.19265 link
2024-05-29 Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models Zhanhui Zhou et.al. 2405.19262 link
2024-05-28 Why are Visually-Grounded Language Models Bad at Image Classification? Yuhui Zhang et.al. 2405.18415 link
2024-05-28 Don't Forget to Connect! Improving RAG with Graph-based Reranking Jialin Dong et.al. 2405.18414 null
2024-05-28 WIDIn: Wording Image for Domain-Invariant Representation in Single-Source Domain Generalization Jiawei Ma et.al. 2405.18405 null
2024-05-29 Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass Ethan Shen et.al. 2405.18400 link
2024-05-28 Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning Yixiao Zhang et.al. 2405.18386 link
2024-05-28 OwLore: Outlier-weighed Layerwise Sampled Low-Rank Projection for Memory-Efficient LLM Fine-tuning Pengxiang Li et.al. 2405.18380 link
2024-05-28 LLaMA-NAS: Efficient Neural Architecture Search for Large Language Models Anthony Sarah et.al. 2405.18377 null
2024-05-28 Empowering Source-Free Domain Adaptation with MLLM-driven Curriculum Learning Dongjie Chen et.al. 2405.18376 link
2024-05-28 Thai Winograd Schemas: A Benchmark for Thai Commonsense Reasoning Phakphum Artkaew et.al. 2405.18375 link
2024-05-28 PromptWizard: Task-Aware Agent-driven Prompt Optimization Framework Eshaan Agarwal et.al. 2405.18369 null
2024-05-28 Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving? Yifan Bai et.al. 2405.18361 null
2024-05-28 Bridging the Gap: Dynamic Learning Strategies for Improving Multilingual Performance in LLMs Somnath Kumar et.al. 2405.18359 null
2024-05-28 MMCTAgent: Multi-modal Critical Thinking Agent Framework for Complex Visual Reasoning Somnath Kumar et.al. 2405.18358 null
2024-05-28 Faithful Logical Reasoning via Symbolic Chain-of-Thought Jundong Xu et.al. 2405.18357 link
2024-05-28 Universal and Extensible Language-Vision Models for Organ Segmentation and Tumor Detection from Abdominal Computed Tomography Jie Liu et.al. 2405.18356 link
2024-05-28 Intelligent Clinical Documentation: Harnessing Generative AI for Patient-Centric Clinical Note Generation Anjanava Biswas et.al. 2405.18346 null
2024-05-28 The Battle of LLMs: A Comparative Study in Conversational QA Tasks Aryan Rangapur et.al. 2405.18344 null
2024-05-28 Frustratingly Easy Test-Time Adaptation of Vision-Language Models Matteo Farina et.al. 2405.18330 link
2024-05-28 Multi-modal Generation via Cross-Modal In-Context Learning Amandeep Kumar et.al. 2405.18304 link
2024-05-28 Semantic are Beacons: A Semantic Perspective for Unveiling Parameter-Efficient Fine-Tuning in Knowledge Learning Renzhi Wang et.al. 2405.18292 null
2024-05-27 Matryoshka Multimodal Models Mu Cai et.al. 2405.17430 null
2024-05-27 NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models Chankyu Lee et.al. 2405.17428 null
2024-05-27 Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model Kuan-Chih Huang et.al. 2405.17427 link
2024-05-27 LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence Zhuoling Li et.al. 2405.17424 null
2024-05-27 Privacy-Aware Visual Language Models Laurens Samson et.al. 2405.17423 null
2024-05-27 Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation Jiaming Liu et.al. 2405.17418 null
2024-05-27 THREAD: Thinking Deeper with Recursive Spawning Philip Schroeder et.al. 2405.17402 link
2024-05-27 The Expressive Capacity of State Space Models: A Formal Language Perspective Yash Sarrof et.al. 2405.17394 null
2024-05-27 MindMerger: Efficient Boosting LLM Reasoning in non-English Languages Zixian Huang et.al. 2405.17386 link
2024-05-27 Unlocking the Secrets of Linear Complexity Sequence Model from A Unified Perspective Zhen Qin et.al. 2405.17383 null
2024-05-27 ReMoDetect: Reward Models Recognize Aligned LLM's Generations Hyunseok Lee et.al. 2405.17382 null
2024-05-27 Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention Zhen Qin et.al. 2405.17381 link
2024-05-27 RTL-Repo: A Benchmark for Evaluating LLMs on Large-Scale RTL Design Projects Ahmed Allam et.al. 2405.17378 link
2024-05-28 Navigating the Safety Landscape: Measuring Risks in Finetuning Large Language Models ShengYun Peng et.al. 2405.17374 null
2024-05-27 Prompt Optimization with Human Feedback Xiaoqiang Lin et.al. 2405.17346 link
2024-05-27 Exploring and steering the moral compass of Large Language Models Alejandro Tlaie et.al. 2405.17345 link
2024-05-27 Cost-efficient Knowledge-based Question Answering with Large Language Models Junnan Dong et.al. 2405.17337 null
2024-05-27 XFormParser: A Simple and Effective Multimodal Multilingual Semi-structured Form Parser Xianfu Cheng et.al. 2405.17336 null
2024-05-27 FedHPL: Efficient Heterogeneous Federated Learning with Prompt Tuning and Logit Distillation Yuting Ma et.al. 2405.17267 null
2024-05-27 On the Noise Robustness of In-Context Learning for Text Generation Hongfu Gao et.al. 2405.17264 null
2024-05-24 Scaling Laws for Discriminative Classification in Large Language Models Dean Wyatte et.al. 2405.15765 null
2024-05-24 Filtered Corpus Training (FiCT) Shows that Language Models can Generalize from Indirect Evidence Abhinav Patil et.al. 2405.15750 link
2024-05-24 Sparse maximal update parameterization: A holistic approach to sparse training dynamics Nolan Dey et.al. 2405.15743 null
2024-05-24 Large Language Models Reflect Human Citation Patterns with a Heightened Citation Bias Andres Algaba et.al. 2405.15739 link
2024-05-24 LM4LV: A Frozen Large Language Model for Low-level Vision Tasks Boyang Zheng et.al. 2405.15734 link
2024-05-24 Understanding the differences in Foundation Models: Attention, State Space Models, and Recurrent Neural Networks Jerome Sieber et.al. 2405.15731 link
2024-05-24 Optimizing Large Language Models for OpenAPI Code Completion Bohdan Petryshyn et.al. 2405.15729 link
2024-05-24 Disease-informed Adaptation of Vision-Language Models Jiajin Zhang et.al. 2405.15728 link
2024-05-24 The Impact of Geometric Complexity on Neural Collapse in Transfer Learning Michael Munn et.al. 2405.15706 null
2024-05-24 Prompt-Aware Adapter: Towards Learning Adaptive Visual Tokens for Multimodal Large Language Models Yue Zhang et.al. 2405.15684 null
2024-05-24 VDGD: Mitigating LVLM Hallucinations in Cognitive Prompts by Bridging the Visual Perception Gap Sreyan Ghosh et.al. 2405.15683 null
2024-05-24 What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language Models Abdelrahman Abdelhamed et.al. 2405.15668 null
2024-05-24 Class Machine Unlearning for Complex Data via Concepts Inference and Data Poisoning Wenhan Chang et.al. 2405.15662 null
2024-05-24 $$\mathbf{L^2\cdot M = C^2}$$ Large Language Models as Covert Channels... a Systematic Analysis Simen Gaure et.al. 2405.15652 null
2024-05-24 LLM-based Robot Task Planning with Exceptional Handling for General Purpose Service Robots Ruoyu Wang et.al. 2405.15646 null
2024-05-24 GECKO: Generative Language Model for English, Code and Korean Sungwoo Oh et.al. 2405.15640 null
2024-05-24 M4U: Evaluating Multilingual Understanding and Reasoning for Large Multimodal Models Hongyu Wang et.al. 2405.15638 link
2024-05-24 GPTZoo: A Large-scale Dataset of GPTs for the Research Community Xinyi Hou et.al. 2405.15630 link
2024-05-24 A Comparative Analysis of Distributed Training Strategies for GPT-2 Ishan Patwardhan et.al. 2405.15628 null
2024-05-24 Inverse-RLignment: Inverse Reinforcement Learning from Demonstrations for LLM Alignment Hao Sun et.al. 2405.15624 null
2024-05-23 PuzzleAvatar: Assembling 3D Avatars from Personal Albums Yuliang Xiu et.al. 2405.14869 null
2024-05-23 A Nurse is Blue and Elephant is Rugby: Cross Domain Alignment in Large Language Models Reveal Human-like Patterns Asaf Yehudai et.al. 2405.14863 null
2024-05-23 Bitune: Bidirectional Instruction-Tuning Dawid J. Kopiczko et.al. 2405.14862 null
2024-05-23 Not All Language Model Features Are Linear Joshua Engels et.al. 2405.14860 link
2024-05-23 PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression Vladimir Malinovskii et.al. 2405.14852 link
2024-05-23 A Textbook Remedy for Domain Shifts: Knowledge Priors for Medical Image Analysis Yue Yang et.al. 2405.14839 null
2024-05-23 From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by Step Yuntian Deng et.al. 2405.14838 link
2024-05-23 HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models Bernal Jiménez Gutiérrez et.al. 2405.14831 link
2024-05-23 Designing A Sustainable Marine Debris Clean-up Framework without Human Labels Raymond Wang et.al. 2405.14815 link
2024-05-23 As an AI Language Model, "Yes I Would Recommend Calling the Police'': Norm Inconsistency in LLM Decision-Making Shomik Jain et.al. 2405.14812 null
2024-05-23 Implicit Personalization in Language Models: A Systematic Study Zhijing Jin et.al. 2405.14808 link
2024-05-23 Can LLMs Solve longer Math Word Problems Better? Xin Xu et.al. 2405.14804 null
2024-05-23 Lessons from the Trenches on Reproducible Evaluation of Language Models Stella Biderman et.al. 2405.14782 null
2024-05-23 WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models Peng Wang et.al. 2405.14768 link
2024-05-23 FinRobot: An Open-Source AI Agent Platform for Financial Applications using Large Language Models Hongyang Yang et.al. 2405.14767 link
2024-05-23 Evaluating Large Language Models for Public Health Classification and Extraction Tasks Joshua Harris et.al. 2405.14766 null
2024-05-23 Large language models can be zero-shot anomaly detectors for time series? Sarah Alnegheimish et.al. 2405.14755 link
2024-05-23 A Transformer-Based Approach for Smart Invocation of Automatic Code Completion Aral de Moor et.al. 2405.14753 link
2024-05-23 MultiCast: Zero-Shot Multivariate Time Series Forecasting Using LLMs Georgios Chatzigeorgakidis et.al. 2405.14748 null
2024-05-23 Exploring Prosocial Irrationality for LLM Agents: A Social Cognition View Xuan Liu et.al. 2405.14744 null
2024-05-21 Reducing Transformer Key-Value Cache Size with Cross-Layer Attention William Brandon et.al. 2405.12981 null
2024-05-21 OmniGlue: Generalizable Feature Matching with Foundation Model Guidance Hanwen Jiang et.al. 2405.12979 link
2024-05-21 BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once Theodore Zhao et.al. 2405.12971 null
2024-05-21 Energy Rank Alignment: Using Preference Optimization to Search Chemical Space at Scale Shriram Chennakesavalu et.al. 2405.12961 link
2024-05-21 Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models Zhangyue Yin et.al. 2405.12939 link
2024-05-21 Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs Bilgehan Sel et.al. 2405.12933 null
2024-05-21 Code-mixed Sentiment and Hate-speech Prediction Anjali Yadav et.al. 2405.12929 null
2024-05-21 Streamlining Software Reviews: Efficient Predictive Modeling with Minimal Examples Tim Menzies et.al. 2405.12920 link
2024-05-21 G-DIG: Towards Gradient-based DIverse and hiGh-quality Instruction Data Selection for Machine Translation Xingyuan Pan et.al. 2405.12915 link
2024-05-21 An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation Zhiyu Tan et.al. 2405.12914 link
2024-05-21 Topic Modelling Case Law Using a Large Language Model and a New Taxonomy for UK Law: AI Insights into Summary Judgment Holli Sargeant et.al. 2405.12910 link
2024-05-21 Adversarial DPO: Harnessing Harmful Data for Reducing Toxicity with Minimal Impact on Coherence and Evasiveness in Dialogue Agents San Kim et.al. 2405.12900 null
2024-05-21 Investigating Persuasion Techniques in Arabic: An Empirical Study Leveraging Large Language Models Abdurahmman Alzahrani et.al. 2405.12884 null
2024-05-21 LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language James Requeima et.al. 2405.12856 link
2024-05-21 OpenCarbonEval: A Unified Carbon Emission Estimation Framework in Large-Scale AI Models Zhaojian Yu et.al. 2405.12843 link
2024-05-21 SmartFlow: Robotic Process Automation using LLMs Arushi Jain et.al. 2405.12842 null
2024-05-21 Large Language Models Meet NLP: A Survey Libo Qin et.al. 2405.12819 link
2024-05-21 Test Oracle Automation in the era of LLMs Facundo Molina et.al. 2405.12766 null
2024-05-21 C3L: Content Correlated Vision-Language Instruction Tuning Data Generation via Contrastive Learning Ji Ma et.al. 2405.12752 null
2024-05-21 Generative AI and Large Language Models for Cyber Security: All Insights You Need Mohamed Amine Ferrag et.al. 2405.12750 null
2024-05-20 Adapting Large Multimodal Models to Distribution Shifts: The Role of In-Context Learning Guanglin Zhou et.al. 2405.12217 link
2024-05-20 MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark Hongwei Liu et.al. 2405.12209 link
2024-05-20 Developers' Perceptions on the Impact of ChatGPT in Software Development: A Survey Thiago S. Vaillant et.al. 2405.12195 link
2024-05-20 CT-Eval: Benchmarking Chinese Text-to-Table Performance in Large Language Models Haoxiang Shi et.al. 2405.12174 null
2024-05-20 Fennec: Fine-grained Language Model Evaluation and Correction Extended through Branching and Bridging Xiaobo Liang et.al. 2405.12163 link
2024-05-20 Eliciting Problem Specifications via Large Language Models Robert E. Wray et.al. 2405.12147 null
2024-05-20 DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM Xuchen Li et.al. 2405.12139 null
2024-05-20 MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning Ting Jiang et.al. 2405.12130 link
2024-05-20 Reindex-Then-Adapt: Improving Large Language Models for Conversational Recommendation Zhankui He et.al. 2405.12119 null
2024-05-20 Imp: Highly Capable Large Multimodal Models for Mobile Devices Zhenwei Shao et.al. 2405.12107 link
2024-05-20 DOP: Diagnostic-Oriented Prompting for Large Language Models in Mathematical Correction Hao Chen et.al. 2405.12100 null
2024-05-20 Distributional Semantics, Holism, and the Instability of Meaning Jumbly Grindrod et.al. 2405.12084 null
2024-05-20 PARALLELGPUOS: A Concurrent OS-level GPU Checkpoint and Restore System using Validated Speculation Zhuobin Huang et.al. 2405.12079 null
2024-05-20 CLAMBER: A Benchmark of Identifying and Clarifying Ambiguous Information Needs in Large Language Models Tong Zhang et.al. 2405.12063 link
2024-05-20 STYLE: Improving Domain Transferability of Asking Clarification Questions in Large Language Model Powered Conversational Agents Yue Chen et.al. 2405.12059 null
2024-05-20 KG-RAG: Bridging the Gap Between Knowledge and Creativity Diego Sanmartin et.al. 2405.12035 null
2024-05-20 Can AI Relate: Testing Large Language Model Response for Mental Health Support Saadia Gabriel et.al. 2405.12021 null
2024-05-20 MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering Jingqun Tang et.al. 2405.11985 link
2024-05-20 A review on the use of large language models as virtual tutors Silvia García-Méndez et.al. 2405.11983 null
2024-05-20 Position-Guided Prompt Learning for Anomaly Detection in Chest X-Rays Zhichao Sun et.al. 2405.11976 link
2024-05-17 Observational Scaling Laws and the Predictability of Language Model Performance Yangjun Ruan et.al. 2405.10938 link
2024-05-17 A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers Kaiyu Huang et.al. 2405.10936 link
2024-05-17 The Local Interaction Basis: Identifying Computationally-Relevant and Sparsely Interacting Features in Neural Networks Lucius Bushnaq et.al. 2405.10928 link
2024-05-17 Blackbox Adaptation for Medical Image Segmentation Jay N. Paranjape et.al. 2405.10913 link
2024-05-17 COGNET-MD, an evaluation framework and dataset for Large Language Model benchmarks in the medical domain Dimitrios P. Panagoulias et.al. 2405.10893 null
2024-05-17 Application of Artificial Intelligence in Schizophrenia Rehabilitation Management: Systematic Literature Review Hongyi Yang et.al. 2405.10883 null
2024-05-17 ECR-Chain: Advancing Generative Language Models to Better Emotion-Cause Reasoners through Reasoning Chains Zhaopei Huang et.al. 2405.10860 link
2024-05-17 The Future of Large Language Model Pre-training is Federated Lorenzo Sani et.al. 2405.10853 null
2024-05-17 Open-Vocabulary Spatio-Temporal Action Detection Tao Wu et.al. 2405.10832 null
2024-05-17 Large Language Model (LLM) for Telecommunications: A Comprehensive Survey on Principles, Key Techniques, and Opportunities Hao Zhou et.al. 2405.10825 null
2024-05-17 ActiveLLM: Large Language Model-based Active Learning for Textual Few-Shot Scenarios Markus Bayer et.al. 2405.10808 null
2024-05-17 The Relational Machine Calculus Chris Barrett et.al. 2405.10801 null
2024-05-17 Empowering Small-Scale Knowledge Graphs: A Strategy of Leveraging General-Purpose Knowledge Graphs for Enriched Embeddings Albert Sawczyn et.al. 2405.10745 null
2024-05-17 Efficient Multimodal Large Language Models: A Survey Yizhang Jin et.al. 2405.10739 link
2024-05-17 INDUS: Effective and Efficient Language Models for Scientific Applications Bishwaranjan Bhattacharjee et.al. 2405.10725 null
2024-05-17 SignLLM: Sign Languages Production Large Language Models Sen Fang et.al. 2405.10718 null
2024-05-17 Persian Pronoun Resolution: Leveraging Neural Networks and Language Models Hassan Haji Mohammadi et.al. 2405.10714 null
2024-05-17 SynDy: Synthetic Dynamic Dataset Generation Framework for Misinformation Tasks Michael Shliselberg et.al. 2405.10700 null
2024-05-17 Revolutionizing Process Mining: A Novel Architecture for ChatGPT Integration and Enhanced User Experience through Optimized Prompt Engineering Mehrdad Agha Mohammad Ali Kermani et.al. 2405.10689 null
2024-05-17 Realistic Evaluation of Toxicity in Large Language Models Tinh Son Luong et.al. 2405.10659 null
2024-05-16 UniRAG: Universal Retrieval Augmentation for Multi-Modal Large Language Models Sahel Sharifymoghaddam et.al. 2405.10311 null
2024-05-16 4D Panoptic Scene Graph Generation Jingkang Yang et.al. 2405.10305 link
2024-05-16 Conformal Alignment: Knowing When to Trust Foundation Models with Guarantees Yu Gui et.al. 2405.10301 null
2024-05-16 HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models Rhea Sanjay Sukthanker et.al. 2405.10299 link
2024-05-17 Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning Yuexiang Zhai et.al. 2405.10292 null
2024-05-16 Timeline-based Sentence Decomposition with In-Context Learning for Temporal Fact Extraction Jianhao Chen et.al. 2405.10288 link
2024-05-16 FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language models Adrian Bulat et.al. 2405.10286 null
2024-05-16 Revisiting OPRO: The Limitations of Small-Scale LLMs as Optimizers Tuo Zhang et.al. 2405.10276 null
2024-05-16 Keep It Private: Unsupervised Privatization of Online Text Calvin Bao et.al. 2405.10260 link
2024-05-16 When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models Xianzheng Ma et.al. 2405.10255 link
2024-05-16 PRISM: A Multi-Modal Generative Foundation Model for Slide-Level Histopathology George Shaikovski et.al. 2405.10254 null
2024-05-16 A Systematic Evaluation of Large Language Models for Natural Language Generation Tasks Xuanfan Ni et.al. 2405.10251 null
2024-05-16 IntelliExplain: Enhancing Interactive Code Generation through Natural Language Explanations for Non-Professional Programmers Hao Yan et.al. 2405.10250 null
2024-05-16 A Foundation Model for Brain Lesion Segmentation with Mixture of Modality Experts Xinru Zhang et.al. 2405.10246 link
2024-05-16 DocuMint: Docstring Generation for Python using Small Language Models Bibek Poudel et.al. 2405.10243 link
2024-05-16 Low-Rank Adaptation of Time Series Foundational Models for Out-of-Domain Modality Forecasting Divij Gupta et.al. 2405.10216 null
2024-05-16 CPsyExam: A Chinese Benchmark for Evaluating Psychology using Examinations Jiahao Zhao et.al. 2405.10212 null
2024-05-16 LFED: A Literary Fiction Evaluation Dataset for Large Language Models Linhao Yu et.al. 2405.10166 link
2024-05-16 PIR: Remote Sensing Image-Text Retrieval with Prior Instruction Representation Learning Jiancheng Pan et.al. 2405.10160 link
2024-05-16 Speaker Verification in Agent-Generated Conversations Yizhe Yang et.al. 2405.10150 null
2024-05-15 Modeling Bilingual Sentence Processing: Evaluating RNN and Transformer Architectures for Cross-Language Structural Priming Bushi Xiao et.al. 2405.09508 null
2024-05-15 Constrained Learning for Causal Inference and Semiparametric Statistics Tiffany Tianhui Cai et.al. 2405.09493 null
2024-05-15 Beyond Flesch-Kincaid: Prompt-based Metrics Improve Difficulty Classification of Educational Texts Donya Rooein et.al. 2405.09482 null
2024-05-15 Tell Me Why: Explainable Public Health Fact-Checking with Large Language Models Majid Zarharan et.al. 2405.09454 link
2024-05-15 M $^4$ oE: A Foundation Model for Medical Multimodal Image Segmentation with Mixture of Experts Yufeng Jiang et.al. 2405.09446 link
2024-05-15 Facilitating Opinion Diversity through Hybrid NLP Approaches Michiel van der Meer et.al. 2405.09439 null
2024-05-15 A Survey On Text-to-3D Contents Generation In The Wild Chenhan Jiang et.al. 2405.09431 null
2024-05-15 MicroPython Testbed for Federated Learning Algorithms Miroslav Popovic et.al. 2405.09423 link
2024-05-15 Matching domain experts by training from scratch on domain knowledge Xiaoliang Luo et.al. 2405.09395 null
2024-05-15 Compositional imprecise probability Jack Liell-Cock et.al. 2405.09391 null
2024-05-15 PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language Models Devansh Jain et.al. 2405.09373 link
2024-05-15 SARATR-X: A Foundation Model for Synthetic Aperture Radar Images Target Recognition Weijie L et.al. 2405.09365 null
2024-05-15 Large Language Model Bias Mitigation from the Perspective of Knowledge Editing Ruizhe Chen et.al. 2405.09341 null
2024-05-15 Prompting-based Synthetic Data Generation for Few-Shot Question Answering Maximilian Schmidt et.al. 2405.09335 link
2024-05-15 Transfer Learning in Pre-Trained Large Language Models for Malware Detection Based on System Calls Pedro Miguel Sánchez Sánchez et.al. 2405.09318 null
2024-05-15 Comparing the Efficacy of GPT-4 and Chat-GPT in Mental Health Care: A Blind Assessment of Large Language Models for Psychological Support Birger Moell et.al. 2405.09300 null
2024-05-15 Do language models capture implied discourse meanings? An investigation with exhaustivity implicatures of Korean morphology Hagyeong Shin et.al. 2405.09293 null
2024-05-15 Sign of the Times: Evaluating the use of Large Language Models for Idiomaticity Detection Dylan Phelps et.al. 2405.09279 null
2024-05-15 Dynamic Activation Pitfalls in LLaMA Models: An Empirical Study Chi Ma et.al. 2405.09274 null
2024-05-15 New Textual Corpora for Serbian Language Modeling Mihailo Škorić et.al. 2405.09250 null
2024-05-14 Efficient Vision-Language Pre-training by Cluster Masking Zihao Wei et.al. 2405.08815 link
2024-05-14 Towards Enhanced RAC Accessibility: Leveraging Datasets and LLMs Edison Jair Bejarano Sepulveda et.al. 2405.08792 link
2024-05-14 Incorporating Clinical Guidelines through Adapting Multi-modal Large Language Model for Prostate Cancer PI-RADS Scoring Tiantian Zhang et.al. 2405.08786 link
2024-05-14 Is the Pope Catholic? Yes, the Pope is Catholic. Generative Evaluation of Intent Resolution in LLMs Akhila Yerukola et.al. 2405.08760 link
2024-05-14 Distributed Threat Intelligence at the Edge Devices: A Large Language Model-Driven Approach Syed Mhamudul Hasan et.al. 2405.08755 null
2024-05-14 Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding Zhimin Li et.al. 2405.08748 link
2024-05-14 Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory Xueyan Niu et.al. 2405.08707 null
2024-05-14 EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic Camera Beilei Cui et.al. 2405.08672 link
2024-05-14 Promoting AI Equity in Science: Generalized Domain Prompt Learning for Accessible VLM Research Qinglong Cao et.al. 2405.08668 link
2024-05-14 Thinking Tokens for Language Modeling David Herel et.al. 2405.08644 null
2024-05-15 ALMol: Aligned Language-Molecule Translation LLMs through Offline Preference Contrastive Optimisation Dimitris Gkoumas et.al. 2405.08619 null
2024-05-14 A Comprehensive Survey of Large Language Models and Multimodal Large Language Models in Medicine Hanguang Xiao et.al. 2405.08603 null
2024-05-15 EVDA: Evolving Deepfake Audio Detection Continual Learning Benchmark Xiaohui Zhang et.al. 2405.08596 link
2024-05-14 Open-Vocabulary Object Detection via Neighboring Region Attention Alignment Sunyuan Qiang et.al. 2405.08593 null
2024-05-14 Improving Transformers with Dynamically Composable Multi-Head Attention Da Xiao et.al. 2405.08553 link
2024-05-14 Self-Distillation Improves DNA Sequence Inference Tong Yu et.al. 2405.08538 link
2024-05-14 Falcon 7b for Software Mention Detection in Scholarly Documents AmeerAli Khan et.al. 2405.08514 null
2024-05-14 Archimedes-AUEB at SemEval-2024 Task 5: LLM explains Civil Procedure Odysseas S. Chlapanis et.al. 2405.08502 link
2024-05-14 Is Less More? Quality, Quantity and Context in Idiom Processing with Natural Language Models Agne Knietaite et.al. 2405.08497 link
2024-05-14 Enhancing Gender-Inclusive Machine Translation with Neomorphemes and Large Language Models Andrea Piergentili et.al. 2405.08477 null
2024-05-13 Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots Chengyue Wu et.al. 2405.07990 null
2024-05-13 A Generalist Learner for Multifaceted Medical Image Interpretation Hong-Yu Zhou et.al. 2405.07988 null
2024-05-13 The Platonic Representation Hypothesis Minyoung Huh et.al. 2405.07987 link
2024-05-13 Investigating the Semantic Robustness of CLIP-based Zero-Shot Anomaly Segmentation Kevin Stangl et.al. 2405.07969 null
2024-05-13 PyZoBot: A Platform for Conversational Information Extraction and Synthesis from Curated Zotero Reference Libraries through Advanced Retrieval-Augmented Generation Suad Alshammari et.al. 2405.07963 link
2024-05-13 AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments Samuel Schmidgall et.al. 2405.07960 null
2024-05-13 EconLogicQA: A Question-Answering Benchmark for Evaluating Large Language Models in Economic Sequential Reasoning Yinzhu Quan et.al. 2405.07938 link
2024-05-14 PARDEN, Can You Repeat That? Defending against Jailbreaks via Repetition Ziyang Zhang et.al. 2405.07932 link
2024-05-13 Stable Diffusion-based Data Augmentation for Federated Learning with Non-IID Data Mahdi Morafah et.al. 2405.07925 null
2024-05-13 Can Better Text Semantics in Prompt Tuning Improve VLM Generalization? Hari Chandana Kuchibhotla et.al. 2405.07921 null
2024-05-13 A Systematic Investigation of Distilling Large Language Models into Cross-Encoders for Passage Re-ranking Ferdinand Schlatt et.al. 2405.07920 link
2024-05-13 PLUTO: Pathology-Universal Transformer Dinkar Juyal et.al. 2405.07905 null
2024-05-13 Russian-Language Multimodal Dataset for Automatic Summarization of Scientific Papers Alena Tsanda et.al. 2405.07886 link
2024-05-13 Zero-Shot Tokenizer Transfer Benjamin Minixhofer et.al. 2405.07883 link
2024-05-13 RLHF Workflow: From Reward Modeling to Online RLHF Hanze Dong et.al. 2405.07863 link
2024-05-13 Can LLMs Help Predict Elections? (Counter)Evidence from the World's Largest Democracy Pratik Gujral et.al. 2405.07828 null
2024-05-13 A View of How Language Models Will Transform Law Frank Fagan et.al. 2405.07826 null
2024-05-13 FreeVA: Offline MLLM as Training-Free Video Assistant Wenhao Wu et.al. 2405.07798 link
2024-05-13 DEPTH: Discourse Education through Pre-Training Hierarchically Zachary Bamberger et.al. 2405.07788 link
2024-05-13 Generating Human Motion in 3D Scenes from Text Descriptions Zhi Cen et.al. 2405.07784 null
2024-05-10 Linearizing Large Language Models Jean Mercat et.al. 2405.06640 link
2024-05-10 Value Augmented Sampling for Language Model Alignment and Personalization Seungwook Han et.al. 2405.06639 link
2024-05-10 Multimodal LLMs Struggle with Basic Visual Network Analysis: a VNA Benchmark Evan M. Williams et.al. 2405.06634 link
2024-05-10 Characterizing the Accuracy - Efficiency Trade-off of Low-rank Decomposition in Language Models Chakshu Moar et.al. 2405.06626 null
2024-05-10 Explaining Text Similarity in Transformer Models Alexandros Vasileiou et.al. 2405.06604 link
2024-05-10 Enhancing Weakly Supervised Semantic Segmentation with Multi-modal Foundation Models: An End-to-End Approach Elham Ravanbakhsh et.al. 2405.06586 null
2024-05-10 What Can Natural Language Processing Do for Peer Review? Ilia Kuznetsov et.al. 2405.06563 link
2024-05-10 Mitigating Hallucinations in Large Language Models via Self-Refinement-Enhanced Knowledge Retrieval Mengjia Niu et.al. 2405.06545 null
2024-05-10 Prompting Large Language Models with Knowledge Graphs for Question Answering Involving Long-tail Facts Wenyu Huang et.al. 2405.06524 null
2024-05-10 UniDM: A Unified Framework for Data Manipulation with Large Language Models Yichen Qian et.al. 2405.06510 null
2024-05-10 Storypark: Leveraging Large Language Models to Enhance Children Story Learning Through Child-AI collaboration Storytelling Lyumanshan Ye et.al. 2405.06495 null
2024-05-10 Pseudo-Prompt Generating in Pre-trained Vision-Language Models for Multi-Label Medical Image Classification Yaoqin Ye et.al. 2405.06468 link
2024-05-10 Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation JoonHo Lee et.al. 2405.06424 link
2024-05-10 Can Large Language Models Replicate ITS Feedback on Open-Ended Math Questions? Hunter McNichols et.al. 2405.06414 link
2024-05-10 Potential and Limitations of LLMs in Capturing Structured Semantics: A Case Study on SRL Ning Cheng et.al. 2405.06410 null
2024-05-10 Program Synthesis using Inductive Logic Programming for the Abstraction and Reasoning Corpus Filipe Marinho Rocha et.al. 2405.06399 null
2024-05-10 Memory Mosaics Jianyu Zhang et.al. 2405.06394 link
2024-05-10 LLM Discussion: Enhancing the Creativity of Large Language Models via Discussion Framework and Role-Play Li-Chun Lu et.al. 2405.06373 link
2024-05-10 LMD3: Language Model Data Density Dependence John Kirchenbauer et.al. 2405.06331 null
2024-05-10 Correlation Dimension of Natural Language in a Statistical Manifold Xin Du et.al. 2405.06321 null
2024-05-09 Natural Language Processing RELIES on Linguistics Juri Opitz et.al. 2405.05966 null
2024-05-09 OpenBA-V2: Reaching 77.3% High Compression Ratio with Fast Multi-Stage Pruning Dan Qiao et.al. 2405.05957 link
2024-05-09 Probing Multimodal LLMs as World Models for Driving Shiva Sreeram et.al. 2405.05956 link
2024-05-09 Smurfs: Leveraging Multiple Proficiency Agents with Context-Efficiency for Tool Planning Junzhi Chen et.al. 2405.05955 link
2024-05-09 CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts Jiachen Li et.al. 2405.05949 link
2024-05-09 DOLOMITES: Domain-Specific Long-Form Methodical Tasks Chaitanya Malaviya et.al. 2405.05938 null
2024-05-09 Trustworthy AI-Generative Content in Intelligent 6G Network: Adversarial, Privacy, and Fairness Siyuan Li et.al. 2405.05930 null
2024-05-09 Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? Zorik Gekhman et.al. 2405.05904 null
2024-05-09 Co-driver: VLM-based Autonomous Driving Assistant with Human-like Behavior and Understanding for Complex Road Scenes Ziang Guo et.al. 2405.05885 null
2024-05-09 FlockGPT: Guiding UAV Flocking with Linguistic Orchestration Artem Lykov et.al. 2405.05872 null
2024-05-09 Pre-trained Text-to-Image Diffusion Models Are Versatile Representation Learners for Control Gunshi Gupta et.al. 2405.05852 link
2024-05-09 Robots Can Feel: LLM-based Framework for Robot Ethical Reasoning Artem Lykov et.al. 2405.05824 link
2024-05-09 Boosting Multimodal Large Language Models with Visual Tokens Withdrawal for Rapid Inference Zhihang Lin et.al. 2405.05803 link
2024-05-09 Towards a More Inclusive AI: Progress and Perspectives in Large Language Model Training for the Sámi Language Ronny Paul et.al. 2405.05777 null
2024-05-09 Experimental Pragmatics with Machines: Testing LLM Predictions for the Inferences of Plain and Embedded Disjunctions Polina Tsvilodub et.al. 2405.05776 null
2024-05-09 Large Language Model-Aided Evolutionary Search for Constrained Multiobjective Optimization Zeyi Wang et.al. 2405.05767 null
2024-05-09 Similarity Guided Multimodal Fusion Transformer for Semantic Location Prediction in Social Media Zhizhen Zhang et.al. 2405.05760 null
2024-05-09 Exploring the Potential of Human-LLM Synergy in Advancing Qualitative Analysis: A Case Study on Mental-Illness Stigma Han Meng et.al. 2405.05758 null
2024-05-09 Can large language models understand uncommon meanings of common words? Jinyang Wu et.al. 2405.05741 null
2024-05-09 Evaluating Dialect Robustness of Language Models via Conversation Understanding Dipankar Srirag et.al. 2405.05688 link
2024-05-08 THRONE: An Object-based Hallucination Benchmark for the Free-form Generations of Large Vision-Language Models Prannay Kaul et.al. 2405.05256 null
2024-05-09 You Only Cache Once: Decoder-Decoder Architectures for Language Models Yutao Sun et.al. 2405.05254 link
2024-05-08 Open Source Language Models Can Provide Feedback: Evaluating LLMs' Ability to Help Students Using GPT-4-As-A-Judge Charles Koutcheme et.al. 2405.05253 link
2024-05-09 LLMs with Personalities in Multi-issue Negotiation Games Sean Noh et.al. 2405.05248 null
2024-05-08 EVA-X: A Foundation Model for General Chest X-ray Analysis with Self-supervised Learning Jingfeng Yao et.al. 2405.05237 link
2024-05-08 SuFIA: Language-Guided Augmented Dexterity for Robotic Surgical Assistants Masoud Moghani et.al. 2405.05226 null
2024-05-08 Conv-Basis: A New Paradigm for Efficient Attention Inference and Gradient Computation in Transformers Jiuxiang Gu et.al. 2405.05219 null
2024-05-08 FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models Jinglin Xu et.al. 2405.05216 link
2024-05-08 MIDGARD: Self-Consistency Using Minimum Description Length for Structured Commonsense Reasoning Inderjeet Nair et.al. 2405.05189 link
2024-05-08 Encoder-Decoder Framework for Interactive Free Verses with Generation with Controllable High-Quality Rhyming Tommaso Pasini et.al. 2405.05176 null
2024-05-08 Air Gap: Protecting Privacy-Conscious Conversational Agents Eugene Bagdasaryan et.al. 2405.05175 null
2024-05-08 XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples Peiqin Lin et.al. 2405.05116 link
2024-05-08 QFMTS: Generating Query-Focused Summaries over Multi-Table Inputs Weijia Zhang et.al. 2405.05109 null
2024-05-08 Concerns on Bias in Large Language Models when Creating Synthetic Personae Helena A. Haxvig et.al. 2405.05080 null
2024-05-08 Impact of Tone-Aware Explanations in Recommender Systems Ayano Okoso et.al. 2405.05061 null
2024-05-08 Conversational Topic Recommendation in Counseling and Psychotherapy with Decision Transformer and Large Language Models Aylin Gunal et.al. 2405.05060 null
2024-05-08 Seeds of Stereotypes: A Large-Scale Textual Analysis of Race and Gender Associations with Diseases in Online Sources Lasse Hyldig Hansen et.al. 2405.05049 null
2024-05-08 ${M^2D}$ NeRF: Multi-Modal Decomposition NeRF with 3D Feature Fields Ning Wang et.al. 2405.05010 null
2024-05-08 ADELIE: Aligning Large Language Models on Information Extraction Yunjia Qi et.al. 2405.05008 link
2024-05-08 NAVRepair: Node-type Aware C/C++ Code Vulnerability Repair Ruoke Wang et.al. 2405.04994 null
2024-05-07 ChatHuman: Language-driven 3D Human Understanding with Retrieval-Augmented Tool Reasoning Jing Lin et.al. 2405.04533 null
2024-05-07 QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving Yujun Lin et.al. 2405.04532 link
2024-05-07 NaturalCodeBench: Examining Coding Performance Mismatch on HumanEval and Natural User Prompts Shudan Zhang et.al. 2405.04520 null
2024-05-07 xLSTM: Extended Long Short-Term Memory Maximilian Beck et.al. 2405.04517 null
2024-05-07 A Transformer with Stack Attention Jiaoda Li et.al. 2405.04515 link
2024-05-08 Unveiling Disparities in Web Task Handling Between Human and Web Agent Kihoon Son et.al. 2405.04497 null
2024-05-07 Toward In-Context Teaching: Adapting Examples to Students' Misconceptions Alexis Ross et.al. 2405.04495 null
2024-05-07 Representation Learning of Daily Movement Data Using Text Encoders Alexander Capstick et.al. 2405.04494 link
2024-05-08 DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model DeepSeek-AI et.al. 2405.04434 link
2024-05-07 The Silicone Ceiling: Auditing GPT's Race and Gender Biases in Hiring Lena Armstrong et.al. 2405.04412 null
2024-05-07 Learning To See But Forgetting To Follow: Visual Instruction Tuning Makes LLMs More Prone To Jailbreak Attacks Georgios Pantazopoulos et.al. 2405.04403 link
2024-05-07 Large Language Models Cannot Explain Themselves Advait Sarkar et.al. 2405.04382 null
2024-05-07 A Fourth Wave of Open Data? Exploring the Spectrum of Scenarios for Open Data and Generative AI Hannah Chafetz et.al. 2405.04333 null
2024-05-07 Deception in Reinforced Autonomous Agents: The Unconventional Rabbit Hat Trick in Legislation Atharvan Dogra et.al. 2405.04325 null
2024-05-07 Granite Code Models: A Family of Open Foundation Models for Code Intelligence Mayank Mishra et.al. 2405.04324 link
2024-05-07 Accelerating Speculative Decoding using Dynamic Speculation Length Jonathan Mamou et.al. 2405.04304 null
2024-05-07 Enhancing the Efficiency and Accuracy of Underlying Asset Reviews in Structured Finance: The Application of Multi-agent Framework Xiangpeng Wan et.al. 2405.04294 link
2024-05-07 Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore Junchao Wu et.al. 2405.04286 null
2024-05-07 On the Foundations of Earth and Climate Foundation Models Xiao Xiang Zhu et.al. 2405.04285 null
2024-05-07 Semantic API Alignment: Linking High-level User Goals to APIs Robert Feldt et.al. 2405.04236 null
2024-05-06 Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs Muhammad Uzair Khattak et.al. 2405.03690 null
2024-05-06 Pose Priors from Language Models Sanjay Subramanian et.al. 2405.03689 null
2024-05-06 Large Language Models Reveal Information Operation Goals, Tactics, and Narrative Frames Keith Burghardt et.al. 2405.03688 link
2024-05-06 Language-Image Models with 3D Understanding Jang Hyun Cho et.al. 2405.03685 null
2024-05-06 AtomGPT: Atomistic Generative Pre-trained Transformer for Forward and Inverse Materials Design Kamal Choudhary et.al. 2405.03680 link
2024-05-06 When LLMs Meet Cybersecurity: A Systematic Literature Review Jie Zhang et.al. 2405.03644 link
2024-05-06 A Controlled Experiment on the Energy Efficiency of the Source Code Generated by Code Llama Vlad-Andrei Cursaru et.al. 2405.03616 null
2024-05-06 GREEN: Generative Radiology Report Evaluation and Error Notation Sophie Ostmeier et.al. 2405.03595 null
2024-05-06 Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment Abhinav Agarwalla et.al. 2405.03594 null
2024-05-06 Liberating Seen Classes: Boosting Few-Shot and Zero-Shot Text Classification via Anchor Generation and Classification Reframing Han Liu et.al. 2405.03565 null
2024-05-07 ID-centric Pre-training for Recommendation Yiqing Wu et.al. 2405.03562 null
2024-05-06 AlphaMath Almost Zero: process Supervision without process Guoxin Chen et.al. 2405.03553 link
2024-05-06 MAmmoTH2: Scaling Instructions from the Web Xiang Yue et.al. 2405.03548 null
2024-05-06 Position Paper: Leveraging Foundational Models for Black-Box Optimization: Benefits, Challenges, and Future Directions Xingyou Song et.al. 2405.03547 null
2024-05-06 Are Human Rules Necessary? Generating Reusable APIs with CoT Reasoning and In-Context Learning Yubo Mai et.al. 2405.03509 null
2024-05-06 UnsafeBench: Benchmarking Image Safety Classifiers on Real-World and AI-Generated Images Yiting Qu et.al. 2405.03486 null
2024-05-06 LGTM: Local-to-Global Text-Driven Human Motion Diffusion Model Haowen Sun et.al. 2405.03485 link
2024-05-06 Doing Personal LAPS: LLM-Augmented Dialogue Construction for Personalized Multi-Session Conversational Search Hideaki Joko et.al. 2405.03480 link
2024-05-07 Large Language Models (LLMs) as Agents for Augmented Democracy Jairo Gudiño-Rosero et.al. 2405.03452 null
2024-05-06 SEvenLLM: Benchmarking, Eliciting, and Enhancing Abilities of Large Language Models in Cyber Threat Intelligence Hangyuan Ji et.al. 2405.03446 link
2024-05-03 Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models Piotr Padlewski et.al. 2405.02287 link
2024-05-03 Structural Pruning of Pre-trained Language Models via Neural Architecture Search Aaron Klein et.al. 2405.02267 link
2024-05-03 On the test-time zero-shot generalization of vision-language models: Do we really need prompt learning? Maxime Zanella et.al. 2405.02266 link
2024-05-03 Leveraging Large Language Models to Enhance Domain Expert Inclusion in Data Science Workflows Jasmine Y. Shih et.al. 2405.02260 null
2024-05-03 What matters when building vision-language models? Hugo Laurençon et.al. 2405.02246 null
2024-05-03 REASONS: A benchmark for REtrieval and Automated citationS Of scieNtific Sentences using Public and Proprietary LLMs Deepa Tilwani et.al. 2405.02228 null
2024-05-03 Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks Lujing Zhang et.al. 2405.02225 null
2024-05-03 FairEvalLLM. A Comprehensive Framework for Benchmarking Fairness in Large Language Model Recommender Systems Yashar Deldjoo et.al. 2405.02219 null
2024-05-03 Automatic Programming: Large Language Models and Beyond Michael R. Lyu et.al. 2405.02213 null
2024-05-03 Assessing and Verifying Task Utility in LLM-Powered Applications Negar Arabzadeh et.al. 2405.02178 null
2024-05-03 Hoaxpedia: A Unified Wikipedia Hoax Articles Dataset Hsuvas Borkakoty et.al. 2405.02175 link
2024-05-03 Mapping the Unseen: Unified Promptable Panoptic Mapping with Dynamic Labeling using Foundation Models Mohamad Al Mdfaa et.al. 2405.02162 null
2024-05-03 Neural Context Flows for Learning Generalizable Dynamical Systems Roussel Desmond Nzoyem et.al. 2405.02154 link
2024-05-03 The AI Review Lottery: Widespread AI-Assisted Peer Reviews Boost Paper Scores and Acceptance Rates Giuseppe Russo Latona et.al. 2405.02150 link
2024-05-03 MedReadMe: A Systematic Study for Fine-grained Sentence Readability in Medical Domain Chao Jiang et.al. 2405.02144 null
2024-05-03 Optimising Calls to Large Language Models with Uncertainty-Based Two-Tier Selection Guillem Ramírez et.al. 2405.02134 null
2024-05-03 Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets Xuelong Geng et.al. 2405.02132 null
2024-05-03 Evaluating Large Language Models for Structured Science Summarization in the Open Research Knowledge Graph Vladyslav Nechakhin et.al. 2405.02105 null
2024-05-03 Argumentative Large Language Models for Explainable and Contestable Decision-Making Gabriel Freedman et.al. 2405.02079 null
2024-05-03 Comparative Analysis of Retrieval Systems in the Real World Dmytro Mozolevskyi et.al. 2405.02048 null
2024-05-02 Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Seungone Kim et.al. 2405.01535 link
2024-05-02 Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks Murtaza Dalal et.al. 2405.01534 null
2024-05-02 OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning Shihao Wang et.al. 2405.01533 link
2024-05-02 FLAME: Factuality-Aware Alignment for Large Language Models Sheng-Chieh Lin et.al. 2405.01525 null
2024-05-03 A separability-based approach to quantifying generalization: which layer is best? Luciano Dyballa et.al. 2405.01524 null
2024-05-02 Transformer-Aided Semantic Communications Matin Mortaheb et.al. 2405.01521 null
2024-05-02 D2PO: Discriminator-Guided DPO with Response Evaluation Models Prasann Singhal et.al. 2405.01511 link
2024-05-02 Analyzing the Role of Semantic Representations in the Era of Large Language Models Zhijing Jin et.al. 2405.01502 link
2024-05-02 Supporting Business Document Workflows via Collection-Centric Information Foraging with Large Language Models Raymond Fok et.al. 2405.01501 null
2024-05-02 Controllable Text Generation in the Instruction-Tuning Era Dhananjay Ashok et.al. 2405.01490 null
2024-05-02 MANTIS: Interleaved Multi-Image Instruction Tuning Dongfu Jiang et.al. 2405.01483 link
2024-05-02 NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment Gerald Shen et.al. 2405.01481 link
2024-05-02 V-FLUTE: Visual Figurative Language Understanding with Textual Explanations Arkadiy Saakyan et.al. 2405.01474 link
2024-05-02 Advancing human-centric AI for robust X-ray analysis through holistic self-supervised learning Théo Moutakanni et.al. 2405.01469 null
2024-05-02 Understanding Retrieval-Augmented Task Adaptation for Vision-Language Models Yifei Ming et.al. 2405.01468 null
2024-05-02 A Systematic Literature Review on Large Language Models for Automated Program Repair Quanjun Zhang et.al. 2405.01466 link
2024-05-02 Natural Language to Verilog: Design of a Recurrent Spiking Neural Network using Large Language Models and ChatGPT Paola Vitolo et.al. 2405.01419 null
2024-05-02 MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors Yuan Tang et.al. 2405.01413 link
2024-05-02 Verification and Refinement of Natural Language Explanations through LLM-Symbolic Theorem Proving Xin Quan et.al. 2405.01379 null
2024-05-02 GAIA: A General AI Assistant for Intelligent Accelerator Operations Frank Mayet et.al. 2405.01359 null
2024-05-01 Self-Play Preference Optimization for Language Model Alignment Yue Wu et.al. 2405.00675 link
2024-05-01 Is Bigger Edit Batch Size Always Better? -- An Empirical Study on Model Editing with Llama-3 Junsang Yoon et.al. 2405.00664 link
2024-05-01 HalluVault: A Novel Logic Programming-aided Metamorphic Testing Framework for Detecting Fact-Conflicting Hallucinations in Large Language Models Ningke Li et.al. 2405.00648 null
2024-05-01 When Quantization Affects Confidence of Large Language Models? Irina Proskurina et.al. 2405.00632 link
2024-05-01 "I'm Not Sure, But...": Examining the Impact of Large Language Models' Uncertainty Expression on User Reliance and Trust Sunnie S. Y. Kim et.al. 2405.00623 null
2024-05-01 Causal Evaluation of Language Models Sirui Chen et.al. 2405.00622 link
2024-05-01 Addressing Topic Granularity and Hallucination in Large Language Models for Topic Modelling Yida Mu et.al. 2405.00611 link
2024-05-01 Investigating Automatic Scoring and Feedback using Large Language Models Gloria Ashiya Katuka et.al. 2405.00602 null
2024-05-01 Are Models Biased on Text without Gender-related Language? Catarina G Belém et.al. 2405.00588 link
2024-05-01 The Real, the Better: Aligning Large Language Models with Online Human Behaviors Guanying Jiang et.al. 2405.00578 null
2024-05-01 EALD-MLLM: Emotion Analysis in Long-sequential and De-identity videos with Multi-modal Large Language Model Deng Li et.al. 2405.00574 null
2024-05-01 NumLLM: Numeric-Sensitive Large Language Model for Chinese Finance Huan-Yi Su et.al. 2405.00566 null
2024-05-01 Mixture of insighTful Experts (MoTE): The Synergy of Thought Chains and Expert Mixtures in Self-Alignment Zhili Liu et.al. 2405.00557 null
2024-05-01 Long-Term Human Trajectory Prediction using 3D Dynamic Scene Graphs Nicolas Gorlo et.al. 2405.00552 link
2024-05-01 ChatBI: Towards Natural Language to Complex Business Intelligence SQL Jinqing Lian et.al. 2405.00527 null
2024-05-01 CookingSense: A Culinary Knowledgebase with Multidisciplinary Assertions Donghee Choi et.al. 2405.00523 null
2024-05-01 Navigating WebAI: Training Agents to Complete Web Tasks with Large Language Models and Reinforcement Learning Lucas-Andreï Thil et.al. 2405.00516 null
2024-05-01 GOLD: Geometry Problem Solver with Natural Language Description Jiaxin Zhang et.al. 2405.00494 link
2024-05-01 Is Temperature the Creativity Parameter of Large Language Models? Max Peeperkorn et.al. 2405.00492 link
2024-05-01 The Pyramid of Captions Delong Chen et.al. 2405.00485 null
2024-04-30 Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation Yunhao Ge et.al. 2404.19752 null
2024-04-30 PrivComp-KG : Leveraging Knowledge Graph and Large Language Models for Privacy Policy Compliance Verification Leon Garza et.al. 2404.19744 null
2024-04-30 Better & Faster Large Language Models via Multi-token Prediction Fabian Gloeckle et.al. 2404.19737 null
2024-04-30 A Framework for Leveraging Human Computation Gaming to Enhance Knowledge Graphs for Accuracy Critical Generative AI Applications Steph Buongiorno et.al. 2404.19729 null
2024-04-30 PANGeA: Procedural Artificial Narrative using Generative AI for Turn-Based Video Games Steph Buongiorno et.al. 2404.19721 null
2024-04-30 Assessing LLMs in Malicious Code Deobfuscation of Real-world Malware Campaigns Constantinos Patsakis et.al. 2404.19715 null
2024-04-30 Automated Generation of High-Quality Medical Simulation Scenarios Through Integration of Semi-Structured Data and Large Language Models Scott Sumpter et.al. 2404.19713 null
2024-04-30 When to Retrieve: Teaching LLMs to Utilize Information Retrieval Effectively Tiziano Labruna et.al. 2404.19705 link
2024-04-30 Naturally Supervised 3D Visual Grounding with Language-Regularized Concept Learners Chun Feng et.al. 2404.19696 null
2024-04-30 Towards Generalist Robot Learning from Internet Video: A Survey Robert McCarthy et.al. 2404.19664 null
2024-04-30 MetaCoCo: A New Few-Shot Classification Benchmark with Spurious Correlation Min Zhang et.al. 2404.19644 null
2024-04-30 On Training a Neural Network to Explain Binaries Alexander Interrante-Grant et.al. 2404.19631 null
2024-04-30 Seeing Through the Clouds: Cloud Gap Imputation with Prithvi Foundation Model Denys Godwin et.al. 2404.19609 null
2024-04-30 Transferring Troubles: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction Tuning Xuanli He et.al. 2404.19597 null
2024-04-30 RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language Processing Yucheng Hu et.al. 2404.19543 link
2024-04-30 MoST: Multi-modality Scene Tokenization for Motion Prediction Norman Mu et.al. 2404.19531 null
2024-04-30 Do Large Language Models Understand Conversational Implicature -- A case study with a chinese sitcom Shisen Yue et.al. 2404.19509 link
2024-04-30 More Compute Is What You Need Zhen Guo et.al. 2404.19484 null
2024-05-01 Neuro-Vision to Language: Image Reconstruction and Language enabled Interaction via Brain Recordings Guobin Shen et.al. 2404.19438 null
2024-04-30 Can Large Language Models put 2 and 2 together? Probing for Entailed Arithmetical Relationships D. Panas et.al. 2404.19432 null
2024-04-29 Hallucination of Multimodal Large Language Models: A Survey Zechen Bai et.al. 2404.18930 link
2024-04-29 Holmes: Benchmark the Linguistic Competence of Language Models Andreas Waldis et.al. 2404.18923 null
2024-04-29 DPO Meets PPO: Reinforced Token Optimization for RLHF Han Zhong et.al. 2404.18922 null
2024-04-29 TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation Junhao Cheng et.al. 2404.18919 link
2024-04-29 Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting Fangcheng Liu et.al. 2404.18911 link
2024-04-29 Human-in-the-Loop Synthetic Text Data Inspection with Provenance Tracking Hong Jin Kang et.al. 2404.18881 link
2024-04-29 More RLHF, More Trust? On The Impact of Human Preference Alignment On Language Model Trustworthiness Aaron J. Li et.al. 2404.18870 link
2024-04-29 Truth-value judgment in language models: belief directions are context sensitive Stefan F. Schouten et.al. 2404.18865 null
2024-04-29 Performance-Aligned LLMs for Generating Fast Code Daniel Nichols et.al. 2404.18864 null
2024-04-29 A Survey on Vision Mamba: Models, Applications and Challenges Rui Xu et.al. 2404.18861 link
2024-04-29 VERT: Verified Equivalent Rust Transpilation with Few-Shot Learning Aidan Z. H. Yang et.al. 2404.18852 null
2024-04-30 FeDeRA:Efficient Fine-tuning of Language Models in Federated Learning Leveraging Weight Decomposition Yuxuan Yan et.al. 2404.18848 null
2024-04-29 It's Difficult to be Neutral -- Human and LLM-based Sentiment Annotation of Patient Comments Petter Mæhlum et.al. 2404.18832 null
2024-04-29 Benchmarking Benchmark Leakage in Large Language Models Ruijie Xu et.al. 2404.18824 link
2024-04-29 AppPoet: Large Language Model based Android malware detection via multi-view prompt engineering Wenxiang Zhao et.al. 2404.18816 null
2024-04-29 Unknown Script: Impact of Script on Cross-Lingual Transfer Wondimagegnhue Tsegaye Tufa et.al. 2404.18810 link
2024-04-29 Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models Pat Verga et.al. 2404.18796 null
2024-04-29 PECC: Problem Extraction and Coding Challenges Patrick Haller et.al. 2404.18766 link
2024-04-29 Transitive Vision-Language Prompt Learning for Domain Generalization Liyuan Wang et.al. 2404.18758 null
2024-04-29 Enhancing Interactive Image Retrieval With Query Rewriting Using Large Language Models and Vision Language Models Hongyi Zhu et.al. 2404.18746 null
2024-04-26 Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo Stephen Zhao et.al. 2404.17546 link
2024-04-26 Exploring the Distinctiveness and Fidelity of the Descriptions Generated by Large Vision-Language Models Yuhang Huang et.al. 2404.17534 null
2024-04-26 Large Language Model Agent as a Mechanical Designer Yayati Jadhav et.al. 2404.17525 null
2024-04-26 On the Use of Large Language Models to Generate Capability Ontologies Luis Miguel Vieira da Silva et.al. 2404.17524 link
2024-04-26 Enhancing Legal Compliance and Regulation Analysis with Large Language Models Shabnam Hassani et.al. 2404.17522 null
2024-04-26 A Comprehensive Evaluation on Event Reasoning of Large Language Models Zhengwei Tao et.al. 2404.17513 link
2024-04-26 CEval: A Benchmark for Evaluating Counterfactual Text Generation Van Bach Nguyen et.al. 2404.17475 link
2024-04-26 Ruffle&Riley: Insights from Designing and Evaluating a Large Language Model-Based Conversational Tutoring System Robin Schmucker et.al. 2404.17460 null
2024-04-26 "ChatGPT Is Here to Help, Not to Replace Anybody" -- An Evaluation of Students' Opinions On Integrating ChatGPT In CS Courses Bruno Pereira Cipriano et.al. 2404.17443 null
2024-04-26 PromptCIR: Blind Compressed Image Restoration with Prompt Learning Bingchen Li et.al. 2404.17433 link
2024-04-26 Evaluation of Geographical Distortions in Language Models: A Crucial Step Towards Equitable Representations Rémy Decoupes et.al. 2404.17401 null
2024-04-26 UniRGB-IR: A Unified Framework for Visible-Infrared Downstream Tasks via Adapter Tuning Maoxun Yuan et.al. 2404.17360 null
2024-04-26 InspectorRAGet: An Introspection Platform for RAG Evaluation Kshitij Fadnis et.al. 2404.17347 link
2024-04-26 Introducing cosmosGPT: Monolingual Training for Turkish Language Models H. Toprak Kesgin et.al. 2404.17336 null
2024-04-26 A Novel Spike Transformer Network for Depth Estimation from Event Cameras via Cross-modality Knowledge Distillation Xin Zhang et.al. 2404.17335 null
2024-04-26 An Extendable Cloud-Native Alloy Property Explorer Zhuoyuan Li et.al. 2404.17330 link
2024-04-26 When to Trust LLMs: Aligning Confidence with Response Quality Shuchang Tao et.al. 2404.17287 null
2024-04-26 Reinforcement Retrieval Leveraging Fine-grained Feedback for Fact Checking News Claims with Black-Box LLM Xuan Zhang et.al. 2404.17283 link
2024-04-26 Prompting Towards Alleviating Code-Switched Data Scarcity in Under-Resourced Languages with GPT as a Pivot Michelle Terblanche et.al. 2404.17216 null
2024-04-26 Low-Rank Knowledge Decomposition for Medical Foundation Models Yuhang Zhou et.al. 2404.17184 link
2024-04-25 The Third Monocular Depth Estimation Challenge Jaime Spencer et.al. 2404.16831 null
2024-04-25 Make-it-Real: Unleashing Large Multimodal Model's Ability for Painting 3D Objects with Realistic Materials Ye Fang et.al. 2404.16829 null
2024-04-25 V2A-Mark: Versatile Deep Visual-Audio Watermarking for Manipulation Localization and Copyright Protection Xuanyu Zhang et.al. 2404.16824 null
2024-04-25 How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites Zhe Chen et.al. 2404.16821 link
2024-04-25 IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages Harman Singh et.al. 2404.16816 link
2024-04-26 Make Your LLM Fully Utilize the Context Shengnan An et.al. 2404.16811 link
2024-04-25 Improving Diversity of Commonsense Generation by Large Language Models via In-Context Learning Tianhui Zhang et.al. 2404.16807 null
2024-04-25 AAPL: Adding Attributes to Prompt Learning for Vision-Language Models Gahyeon Kim et.al. 2404.16804 link
2024-04-25 Weak-to-Strong Extrapolation Expedites Alignment Chujie Zheng et.al. 2404.16792 link
2024-04-25 SEED-Bench-2-Plus: Benchmarking Multimodal Large Language Models with Text-Rich Visual Comprehension Bohao Li et.al. 2404.16790 link
2024-04-25 Continual Learning of Large Language Models: A Comprehensive Survey Haizhou Shi et.al. 2404.16789 link
2024-04-25 Modeling Selective Feature Attention for Representation-based Siamese Text Matching Jianxiang Zang et.al. 2404.16776 link
2024-04-25 REBEL: Reinforcement Learning via Regressing Relative Rewards Zhaolin Gao et.al. 2404.16767 link
2024-04-25 Prefix Text as a Yarn: Eliciting Non-English Alignment in Foundation Language Model Runzhe Zhan et.al. 2404.16766 null
2024-04-25 RadGenome-Chest CT: A Grounded Vision-Language Dataset for Chest CT Analysis Xiaoman Zhang et.al. 2404.16754 link
2024-04-25 Embracing Diversity: Interpretable Zero-shot classification beyond one vector per class Mazda Moayeri et.al. 2404.16717 null
2024-04-25 Layer Skip: Enabling Early Exit Inference and Self-Speculative Decoding Mostafa Elhoushi et.al. 2404.16710 null
2024-04-25 Cooperate or Collapse: Emergence of Sustainability Behaviors in a Society of LLM Agents Giorgio Piatti et.al. 2404.16698 link
2024-04-25 Influence of Solution Efficiency and Valence of Instruction on Additive and Subtractive Solution Strategies in Humans and GPT-4 Lydia Uhler et.al. 2404.16692 null
2024-04-25 EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning Hongxia Xie et.al. 2404.16670 link
2024-04-24 Hybrid LLM/Rule-based Approaches to Business Insights Generation from Structured Data Aliaksei Vertsel et.al. 2404.15604 null
2024-04-24 ImplicitAVE: An Open-Source Dataset and Multimodal LLMs Benchmark for Implicit Attribute Value Extraction Henry Peng Zou et.al. 2404.15592 link
2024-04-24 MiM: Mask in Mask Self-Supervised Pre-Training for 3D Medical Image Analysis Jiaxin Zhuang et.al. 2404.15580 null
2024-04-24 Can Foundational Large Language Models Assist with Conducting Pharmaceuticals Manufacturing Investigations? Hossein Salami et.al. 2404.15578 null
2024-04-24 Retrieval Head Mechanistically Explains Long-Context Factuality Wenhao Wu et.al. 2404.15574 link
2024-04-23 PRISM: Patient Records Interpretation for Semantic Clinical Trial Matching using Large Language Models Shashi Kant Gupta et.al. 2404.15549 null
2024-04-23 BattleAgent: Multi-modal Dynamic Emulation on Historical Battles to Complement Historical Analysis Shuhang Lin et.al. 2404.15532 link
2024-04-23 Towards Systematic Evaluation of Logical Reasoning Ability of Large Language Models Mihir Parmar et.al. 2404.15522 link
2024-04-23 Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval Young Kyun Jang et.al. 2404.15516 null
2024-04-23 ToM-LM: Delegating Theory Of Mind Reasoning to External Symbolic Executors in Large Language Models Weizhi Tang et.al. 2404.15515 null
2024-04-23 IryoNLP at MEDIQA-CORR 2024: Tackling the Medical Error Detection & Correction Task On the Shoulders of Medical Agents Jean-Philippe Corbeil et.al. 2404.15488 link
2024-04-23 Large Language Models Spot Phishing Emails with Surprising Accuracy: A Comparative Analysis of Performance Het Patel et.al. 2404.15485 null
2024-04-23 Can Large Language Models Learn the Physics of Metamaterials? An Empirical Study with ChatGPT Darui Lu et.al. 2404.15458 null
2024-04-23 XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference João Monteiro et.al. 2404.15420 null
2024-04-23 Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs Davide Caffagni et.al. 2404.15406 null
2024-04-23 Aligning LLM Agents by Learning Latent Preference from User Edits Ge Gao et.al. 2404.15269 link
2024-04-23 XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts Yifeng Ding et.al. 2404.15247 link
2024-04-23 CultureBank: An Online Community-Driven Knowledge Base Towards Culturally Aware Language Technologies Weiyan Shi et.al. 2404.15238 link
2024-04-23 Revisiting Unnaturalness for Automated Program Repair in the Era of Large Language Models Aidan Z. H. Yang et.al. 2404.15236 null
2024-04-23 Re-Thinking Inverse Graphics With Large Language Models Peter Kulits et.al. 2404.15228 null
2024-04-23 Does Instruction Tuning Make LLMs More Consistent? Constanza Fierro et.al. 2404.15206 null
2024-04-23 Setting up the Data Printer with Improved English to Ukrainian Machine Translation Yurii Paniv et.al. 2404.15196 link
2024-04-23 Regressive Side Effects of Training Language Models to Mimic Student Misconceptions Shashank Sonkar et.al. 2404.15156 null
2024-04-23 Bias patterns in the application of LLMs for clinical decision support: A comprehensive study Raphael Poulain et.al. 2404.15149 link
2024-04-23 Rethinking LLM Memorization through the Lens of Adversarial Compression Avi Schwarzschild et.al. 2404.15146 null
2024-04-23 MedDr: Diagnosis-Guided Bootstrapping for Large-Scale Medical Vision-Language Learning Sunan He et.al. 2404.15127 null
2024-04-23 Identifying Fairness Issues in Automatically Generated Testing Content Kevin Stowe et.al. 2404.15104 null
2024-04-23 Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation Xun Wu et.al. 2404.15100 null
2024-04-23 Detection of circular permutations by Protein Language Models Yue Hu et.al. 2404.15087 link
2024-04-23 Multi-Head Mixture-of-Experts Xun Wu et.al. 2404.15045 null
2024-04-23 TAXI: Evaluating Categorical Knowledge Editing for Language Models Derek Powell et.al. 2404.15004 link
2024-04-23 Transformers Can Represent $n$ -gram Language Models Anej Svete et.al. 2404.14994 null
2024-04-23 A Short Review for Ontology Learning from Text: Stride from Shallow Learning, Deep Learning to Large Language Models Trend Rick Du et.al. 2404.14991 null
2024-04-23 $\texttt{MiniMol}$ : A Parameter-Efficient Foundation Model for Molecular Learning Kerstin Kläser et.al. 2404.14986 null
2024-04-23 Social Media and Artificial Intelligence for Sustainable Cities and Societies: A Water Quality Analysis Use-case Muhammad Asif Auyb et.al. 2404.14977 null
2024-04-22 AutoAD III: The Prequel -- Back to the Pixels Tengda Han et.al. 2404.14412 null
2024-04-22 SpaceByte: Towards Deleting Tokenization from Large Language Modeling Kevin Slagle et.al. 2404.14408 link
2024-04-22 RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios? Adrian de Wynter et.al. 2404.14397 link
2024-04-22 SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation Yuying Ge et.al. 2404.14396 link
2024-04-22 PARAMANU-GANITA: Language Model with Mathematical Capabilities Mitodru Niyogi et.al. 2404.14395 null
2024-04-22 A Multimodal Automated Interpretability Agent Tamar Rott Shaham et.al. 2404.14394 null
2024-04-22 A Survey on Self-Evolution of Large Language Models Zhengwei Tao et.al. 2404.14387 link
2024-04-22 Beyond Scaling: Predicting Patent Approval with Domain-specific Fine-grained Claim Dependency Graph Xiaochen Kev Gao et.al. 2404.14372 link
2024-04-23 Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data Fahim Tajwar et.al. 2404.14367 link
2024-04-22 Better Synthetic Data by Retrieving and Transforming Existing Datasets Saumya Gandhi et.al. 2404.14361 link
2024-04-22 Rethinking Legal Compliance Automation: Opportunities with Large Language Models Shabnam Hassani et.al. 2404.14356 null
2024-04-22 Calc-CMU at SemEval-2024 Task 7: Pre-Calc -- Learning to Use the Calculator Improves Numeracy in Language Models Vishruth Veerendranath et.al. 2404.14355 link
2024-04-22 Automated Long Answer Grading with RiceChem Dataset Shashank Sonkar et.al. 2404.14316 link
2024-04-22 Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference Labels Jan-Philipp Fränken et.al. 2404.14313 link
2024-04-22 Explaining Arguments' Strength: Unveiling the Role of Attacks and Supports (Technical Report) Xiang Yin et.al. 2404.14304 link
2024-04-22 Marking: Visual Grading with Highlighting Errors and Annotating Missing Bits Shashank Sonkar et.al. 2404.14301 null
2024-04-22 Does Your Neural Code Completion Model Use My Code? A Membership Inference Approach Yao Wan et.al. 2404.14296 link
2024-04-22 A Survey on Efficient Inference for Large Language Models Zixuan Zhou et.al. 2404.14294 null
2024-04-22 LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots Dongge Han et.al. 2404.14285 null
2024-04-22 Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback Wenyi Xiao et.al. 2404.14233 null
2024-04-19 MoVA: Adapting Mixture of Vision Experts to Multimodal Context Zhuofan Zong et.al. 2404.13046 link
2024-04-19 Unified Scene Representation and Reconstruction for 3D Large Language Models Tao Chu et.al. 2404.13044 null
2024-04-19 Data Alignment for Zero-Shot Concept Generation in Dermatology AI Soham Gadgil et.al. 2404.13043 null
2024-04-19 Sample Design Engineering: An Empirical Study of What Makes Good Downstream Fine-Tuning Samples for LLMs Biyang Guo et.al. 2404.13033 link
2024-04-19 When Life gives you LLMs, make LLM-ADE: Large Language Models with Adaptive Data Engineering Stephen Choi et.al. 2404.13028 null
2024-04-19 Stronger Random Baselines for In-Context Learning Gregory Yauney et.al. 2404.13020 link
2024-04-19 Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models Chuofan Ma et.al. 2404.13013 link
2024-04-19 Rethinking the Evaluation of Dialogue Systems: Effects of User Feedback on Crowdworkers and LLMs Clemencia Siro et.al. 2404.12994 link
2024-04-19 FineRec:Exploring Fine-grained Sequential Recommendation Xiaokun Zhang et.al. 2404.12975 link
2024-04-19 Eyes Can Deceive: Benchmarking Counterfactual Reasoning Abilities of Multi-modal Large Language Models Yian Li et.al. 2404.12966 null
2024-04-19 Towards Reliable Latent Knowledge Estimation in LLMs: In-Context Learning vs. Prompting Based Factual Knowledge Extraction Qinyuan Wu et.al. 2404.12957 null
2024-04-19 Zero-Shot Medical Phrase Grounding with Off-the-shelf Diffusion Models Konstantinos Vilouras et.al. 2404.12920 null
2024-04-19 Physical Backdoor Attack can Jeopardize Driving with Vision-Large-Language Models Zhenyang Ni et.al. 2404.12916 link
2024-04-19 Large Language Models for Networking: Workflow, Advances and Challenges Chang Liu et.al. 2404.12901 null
2024-04-19 Enabling Natural Zero-Shot Prompting on Encoder Models via Statement-Tuning Ahmed Elshabrawy et.al. 2404.12897 null
2024-04-19 Unlocking Multi-View Insights in Knowledge-Dense Retrieval-Augmented Generation Guanhua Chen et.al. 2404.12879 null
2024-04-19 LLM-R2: A Large Language Model Enhanced Rule-based Rewrite System for Boosting Query Efficiency Zhaodonghui Li et.al. 2404.12872 link
2024-04-19 How Does the Textual Information Affect the Retrieval of Multimodal In-Context Learning? Yang Luo et.al. 2404.12866 null
2024-04-19 Foundation Model assisted Weakly Supervised LiDAR Semantic Segmentation Yilong Chen et.al. 2404.12861 null
2024-04-19 TartuNLP @ SIGTYP 2024 Shared Task: Adapting XLM-RoBERTa for Ancient and Historical Languages Aleksei Dorkin et.al. 2404.12845 null
2024-04-18 BLINK: Multimodal Large Language Models Can See but Not Perceive Xingyu Fu et.al. 2404.12390 null
2024-04-18 Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models Aitor Ormazabal et.al. 2404.12387 null
2024-04-18 MedThink: Explaining Medical Visual Question Answering via Multimodal Decision-Making Rationale Xiaotang Gai et.al. 2404.12372 null
2024-04-18 When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes Asaf Yehudai et.al. 2404.12365 link
2024-04-18 From $r$ to $Q^*$ : Your Language Model is Secretly a Q-Function Rafael Rafailov et.al. 2404.12358 null
2024-04-18 Towards a Foundation Model for Partial Differential Equation: Multi-Operator Learning and Extrapolation Jingmin Sun et.al. 2404.12355 link
2024-04-18 V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning Hang Hua et.al. 2404.12353 null
2024-04-18 Evaluating AI for Law: Bridging the Gap with Open-Source Solutions Rohan Bhambhoria et.al. 2404.12349 null
2024-04-18 Large Language Models in Targeted Sentiment Analysis Nicolay Rusnachenko et.al. 2404.12342 link
2024-04-18 Normative Requirements Operationalization with Large Language Models Nick Feng et.al. 2404.12335 null
2024-04-18 Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment Zhaofeng Wu et.al. 2404.12318 null
2024-04-18 Large Language Models for Synthetic Participatory Planning of Shared Automated Electric Mobility Systems Jiangbo Yu et.al. 2404.12317 null
2024-04-18 Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair Yusuke Sakai et.al. 2404.12299 null
2024-04-18 Augmenting emotion features in irony detection with Large language modeling Yucheng Lin et.al. 2404.12291 null
2024-04-18 Performance Evaluation of Segment Anything Model with Variational Prompting for Application to Non-Visible Spectrum Imagery Yona Falinie A. Gaus et.al. 2404.12285 null
2024-04-18 Enhancing Embedding Performance through Large Language Model-based Text Enrichment and Rewriting Nicholas Harris et.al. 2404.12283 null
2024-04-18 Advancing the Robustness of Large Language Models through Self-Denoised Smoothing Jiabao Ji et.al. 2404.12274 link
2024-04-18 FedEval-LLM: Federated Evaluation of Large Language Models on Downstream Tasks with Collective Wisdom Yuanqin He et.al. 2404.12273 null
2024-04-18 Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences Shreya Shankar et.al. 2404.12272 null
2024-04-18 Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM Michelle S. Lam et.al. 2404.12259 link
2024-04-18 Private federated discovery of out-of-vocabulary words for Gboard Ziteng Sun et.al. 2404.11607 null
2024-04-17 VG4D: Vision-Language Model Goes 4D Video Recognition Zhichao Deng et.al. 2404.11605 link
2024-04-17 A Deep Dive into Large Language Models for Automated Bug Localization and Repair Soneya Binta Hossain et.al. 2404.11595 null
2024-04-17 Prompt Optimizer of Text-to-Image Diffusion Models for Abstract Concept Understanding Zezhong Fan et.al. 2404.11589 null
2024-04-17 LLMTune: Accelerate Database Knob Tuning with Large Language Models Xinmei Huang et.al. 2404.11581 link
2024-04-17 On the Scalability of GNNs for Molecular Graphs Maciej Sypetkowski et.al. 2404.11568 null
2024-04-17 MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation Kuan-Chieh et.al. 2404.11565 null
2024-04-17 Quantifying Multilingual Performance of Large Language Models Across Languages Zihao Li et.al. 2404.11553 null
2024-04-17 Evaluating Span Extraction in Generative Paradigm: A Reflection on Aspect-Based Sentiment Analysis Soyoung Yang et.al. 2404.11539 null
2024-04-17 FedPFT: Federated Proxy Fine-Tuning of Foundation Models Zhaopeng Peng et.al. 2404.11536 link
2024-04-17 Select and Reorder: A Novel Approach for Neural Sign Language Production Harry Walsh et.al. 2404.11532 null
2024-04-17 Pack of LLMs: Model Fusion at Test-Time via Perplexity Optimization Costas Mavromatis et.al. 2404.11531 link
2024-04-17 Embedding Privacy in Computational Social Science and Artificial Intelligence Research Keenan Jones et.al. 2404.11515 null
2024-04-17 Towards Coarse-to-Fine Evaluation of Inference Efficiency for Large Language Models Yushuo Chen et.al. 2404.11502 link
2024-04-17 Paraphrase and Solve: Exploring and Exploiting the Impact of Surface Form on Mathematical Reasoning in Large Language Models Yue Zhou et.al. 2404.11500 link
2024-04-18 Octopus v3: Technical Report for On-device Sub-billion Multimodal AI Agent Wei Chen et.al. 2404.11459 null
2024-04-17 Unifying Bias and Unfairness in Information Retrieval: A Survey of Challenges and Opportunities with Large Language Models Sunhao Dai et.al. 2404.11457 link
2024-04-17 AI-Enhanced Cognitive Behavioral Therapy: Deep Learning and Large Language Models for Extracting Cognitive Pathways from Social Media Texts Meng Jiang et.al. 2404.11449 null
2024-04-17 Open-Ended Wargames with Large Language Models Daniel P. Hogan et.al. 2404.11446 link
2024-04-17 DUPE: Detection Undermining via Prompt Engineering for Deepfake Text James Weichert et.al. 2404.11408 null
2024-04-16 Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback Qiwei Di et.al. 2404.10776 null
2024-04-16 COMBO: Compositional World Models for Embodied Multi-Agent Cooperation Hongxin Zhang et.al. 2404.10775 null
2024-04-16 Deep Learning and LLM-based Methods Applied to Stellar Lightcurve Classification Yu-Yang Li et.al. 2404.10757 link
2024-04-16 Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study Shusheng Xu et.al. 2404.10719 null
2024-04-17 Dual Modalities of Text: Visual and Textual Generative Pre-training Yekun Chai et.al. 2404.10710 null
2024-04-16 Question Difficulty Ranking for Multiple-Choice Reading Comprehension Vatsal Raina et.al. 2404.10704 null
2024-04-16 An empirical study on code review activity prediction in practice Doriane Olewicki et.al. 2404.10703 null
2024-04-16 Automating REST API Postman Test Cases Using LLM S Deepika Sri et.al. 2404.10678 null
2024-04-16 Self-playing Adversarial Language Game Enhances LLM Reasoning Pengyu Cheng et.al. 2404.10642 link
2024-04-16 HLAT: High-quality Large Language Model Pre-trained on AWS Trainium Haozheng Fan et.al. 2404.10630 null
2024-04-16 Private Attribute Inference from Images with Vision-Language Models Batuhan Tömekçe et.al. 2404.10618 null
2024-04-16 Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases Yanze Li et.al. 2404.10595 null
2024-04-16 Construction of Domain-specified Japanese Large Language Model for Finance through Continual Pre-training Masanori Hirano et.al. 2404.10555 null
2024-04-16 Unveiling the Misuse Potential of Base Large Language Models via In-Context Learning Xiao Wang et.al. 2404.10552 null
2024-04-16 Capturing the Macroscopic Behaviour of Molecular Dynamics with Membership Functions Alexander Sikorski et.al. 2404.10523 link
2024-04-16 CoTAR: Chain-of-Thought Attribution Reasoning with Multi-level Granularity Moshe Berchansky et.al. 2404.10513 null
2024-04-16 White Men Lead, Black Women Help: Uncovering Gender, Racial, and Intersectional Bias in Language Agency Yixin Wan et.al. 2404.10508 null
2024-04-16 Self-Supervised Visual Preference Alignment Ke Zhu et.al. 2404.10501 link
2024-04-16 When Emotional Stimuli meet Prompt Designing: An Auto-Prompt Graphical Paradigm Chenggian Ma et.al. 2404.10500 null
2024-04-16 Spiral of Silences: How is Large Language Model Killing Information Retrieval? -- A Case Study on Open Domain Question Answering Xiaoyang Chen et.al. 2404.10496 link
2024-04-15 KG-CTG: Citation Generation through Knowledge Graph-guided Large Language Models Avinash Anand et.al. 2404.09763 null
2024-04-15 Resilience of Large Language Models for Noisy Instructions Bin Wang et.al. 2404.09754 null
2024-04-15 Personalized Collaborative Fine-Tuning for On-Device Large Language Models Nicolas Wagner et.al. 2404.09753 link
2024-04-15 AMPCliff: quantitative definition and benchmarking of activity cliffs in antimicrobial peptides Kewei Li et.al. 2404.09738 link
2024-04-15 Quantization of Large Language Models with an Overdetermined Basis Daniil Merkulov et.al. 2404.09737 null
2024-04-15 Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models Ziwei Luo et.al. 2404.09732 link
2024-04-15 Unveiling Imitation Learning: Exploring the Impact of Data Falsity to Large Language Model Hyunsoo Cho et.al. 2404.09717 null
2024-04-15 Enhancing Robot Explanation Capabilities through Vision-Language Models: a Preliminary Study by Interpreting Visual Inputs for Improved Human-Robot Interaction David Sobrín-Hidalgo et.al. 2404.09705 null
2024-04-15 Generative AI for Game Theory-based Mobile Networking Long He et.al. 2404.09699 null
2024-04-15 Are Large Language Models Reliable Argument Quality Annotators? Nailia Mirzakhmedova et.al. 2404.09696 link
2024-04-15 LoRAP: Transformer Sub-Layers Deserve Differentiated Structured Compression for Large Language Models Guangyan Li et.al. 2404.09695 null
2024-04-15 Multi-News+: Cost-efficient Dataset Cleansing via LLM-based Data Annotation Juhwan Choi et.al. 2404.09682 null
2024-04-15 Learn Your Reference Model for Real Good Alignment Alexey Gorbatovski et.al. 2404.09656 null
2024-04-15 Do LLMs Understand Visual Anomalies? Uncovering LLM Capabilities in Zero-shot Anomaly Detection Jiaqi Zhu et.al. 2404.09654 null
2024-04-15 Bridging Vision and Language Spaces with Assignment Prediction Jungin Park et.al. 2404.09632 link
2024-04-15 AesExpert: Towards Multi-modality Foundation Model for Image Aesthetics Perception Yipo Huang et.al. 2404.09624 link
2024-04-15 UNIAA: A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark Zhaokun Zhou et.al. 2404.09619 null
2024-04-15 A Self-feedback Knowledge Elicitation Approach for Chemical Reaction Predictions Pengfei Liu et.al. 2404.09606 link
2024-04-15 Improving Recall of Large Language Models: A Model Collaboration Approach for Relational Triple Extraction Zepeng Ding et.al. 2404.09593 null
2024-04-15 Modelling Language Jumbly Grindrod et.al. 2404.09579 null
2024-04-15 Transformers, Contextualism, and Polysemy Jumbly Grindrod et.al. 2404.09577 null
2024-04-15 Large language models and linguistic intentionality Jumbly Grindrod et.al. 2404.09576 null
2024-04-12 Probing the 3D Awareness of Visual Foundation Models Mohamed El Banani et.al. 2404.08636 link
2024-04-12 Pre-training Small Base LMs with Fewer Tokens Sunny Sanyal et.al. 2404.08634 link
2024-04-12 FCert: Certifiably Robust Few-Shot Classification in the Era of Foundation Models Yanting Wang et.al. 2404.08631 link
2024-04-12 Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation Yanhao Zheng et.al. 2404.08603 link
2024-04-12 Enhancing Visual Question Answering through Question-Driven Image Captions as Prompts Övgü Özdemir et.al. 2404.08589 link
2024-04-12 Pathological Primitive Segmentation Based on Visual Foundation Model with Zero-Shot Mask Generation Abu Bakor Hayat Arnob et.al. 2404.08584 link
2024-04-12 FashionFail: Addressing Failure Cases in Fashion Object Detection and Segmentation Riza Velioglu et.al. 2404.08582 link
2024-04-12 Lossy Image Compression with Foundation Diffusion Models Lucas Relic et.al. 2404.08580 null
2024-04-12 Enhancing Autonomous Vehicle Training with Language Model Integration and Critical Scenario Generation Hanlin Tian et.al. 2404.08570 link
2024-04-12 RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs Shreyas Chaudhari et.al. 2404.08555 null
2024-04-12 Memory Traces: Are Transformers Tulving Machines? Jean-Marie Chauvet et.al. 2404.08543 null
2024-04-12 Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path Forward Xuan Xie et.al. 2404.08517 null
2024-04-12 ChatGPT and general-purpose AI count fruits in pictures surprisingly well Konlavach Mengsuwan et.al. 2404.08515 null
2024-04-12 Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction Haoran Qiu et.al. 2404.08509 link
2024-04-12 LaSagnA: Language-based Segmentation Assistant for Complex Queries Cong Wei et.al. 2404.08506 link
2024-04-12 Strategic Interactions between Large Language Models-based Agents in Beauty Contests Siting Lu et.al. 2404.08492 null
2024-04-12 Mitigating Language-Level Performance Disparity in mPLMs via Teacher Language Selection and Cross-lingual Self-Distillation Haozhe Zhao et.al. 2404.08491 link
2024-04-12 Thematic Analysis with Large Language Models: does it work with languages other than English? A targeted test in Italian Stefano De Paoli et.al. 2404.08488 null
2024-04-12 Comparing Apples to Oranges: LLM-powered Multimodal Intention Prediction in an Object Categorization Task Hassan Ali et.al. 2404.08424 null
2024-04-12 Adapting the Segment Anything Model During Usage in Novel Situations Robin Schön et.al. 2404.08421 null
2024-04-11 OpenBias: Open-set Bias Detection in Text-to-Image Generative Models Moreno D'Incà et.al. 2404.07990 link
2024-04-11 Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding Yiwen Tang et.al. 2404.07989 link
2024-04-11 Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-Language Representation Learning Simon Schrodi et.al. 2404.07983 null
2024-04-11 Language Imbalance Can Boost Cross-lingual Generalisation Anton Schäfer et.al. 2404.07982 link
2024-04-11 Manipulating Large Language Models to Increase Product Visibility Aounon Kumar et.al. 2404.07981 link
2024-04-11 LLoCO: Learning Long Contexts Offline Sijun Tan et.al. 2404.07979 link
2024-04-11 Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models Haotian Zhang et.al. 2404.07973 null
2024-04-11 Rho-1: Not All Tokens Are What You Need Zhenghao Lin et.al. 2404.07965 link
2024-04-11 On Unified Prompt Tuning for Request Quality Assurance in Public Code Review Xinyu Chen et.al. 2404.07942 null
2024-04-11 Leveraging Large Language Models (LLMs) to Support Collaborative Human-AI Online Risk Data Annotation Jinkyung Park et.al. 2404.07926 null
2024-04-11 LaVy: Vietnamese Multimodal Large Language Model Chi Tran et.al. 2404.07922 link
2024-04-11 AmpleGCG: Learning a Universal and Transferable Generative Model of Adversarial Suffixes for Jailbreaking Both Open and Closed LLMs Zeyi Liao et.al. 2404.07921 link
2024-04-11 DesignQA: A Multimodal Benchmark for Evaluating Large Language Models' Understanding of Engineering Documentation Anna C. Doris et.al. 2404.07917 link
2024-04-11 HGRN2: Gated Linear RNNs with State Expansion Zhen Qin et.al. 2404.07904 link
2024-04-11 High-Dimension Human Value Representation in Large Language Models Samuel Cahyawijaya et.al. 2404.07900 link
2024-04-11 Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations Dayeon Ki et.al. 2404.07851 link
2024-04-11 On Training Data Influence of GPT Models Qingyi Liu et.al. 2404.07840 link
2024-04-11 RecurrentGemma: Moving Past Transformers for Efficient Open Language Models Aleksandar Botev et.al. 2404.07839 link
2024-04-11 Streamlined Photoacoustic Image Processing with Foundation Models: A Training-Free Solution Handi Deng et.al. 2404.07833 null
2024-04-11 Heron-Bench: A Benchmark for Evaluating Vision Language Models in Japanese Yuichi Inoue et.al. 2404.07824 link
2024-04-10 BRAVE: Broadening the visual encoding of vision-language models Oğuzhan Fatih Kar et.al. 2404.07204 null
2024-04-10 UMBRAE: Unified Multimodal Decoding of Brain Signals Weihao Xia et.al. 2404.07202 link
2024-04-10 Scaling Laws for Data Filtering -- Data Curation cannot be Compute Agnostic Sachin Goyal et.al. 2404.07177 link
2024-04-10 Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention Tsendsuren Munkhdalai et.al. 2404.07143 null
2024-04-10 Open reaction-diffusion systems: bridging probabilistic theory across scales Mauricio J. del Razo et.al. 2404.07119 null
2024-04-10 Continuous Language Model Interpolation for Dynamic and Controllable Text Generation Sara Kangaslahti et.al. 2404.07117 link
2024-04-11 From Model-centered to Human-Centered: Revision Distance as a Metric for Text Evaluation in LLMs-based Applications Yongqiang Ma et.al. 2404.07108 null
2024-04-10 Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs Bowen Jin et.al. 2404.07103 link
2024-04-10 Dynamic Generation of Personalities with Large Language Models Jianzhi Liu et.al. 2404.07084 link
2024-04-10 VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning Alexandros Xenos et.al. 2404.07078 link
2024-04-10 Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers? Mingyu Jin et.al. 2404.07066 link
2024-04-10 Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study Alessandro Stolfo et.al. 2404.07060 null
2024-04-10 Meta4XNLI: A Crosslingual Parallel Corpus for Metaphor Detection and Interpretation Elisa Sanchez-Bayona et.al. 2404.07053 link
2024-04-10 ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling Ege Özsoy et.al. 2404.07031 link
2024-04-10 Improving Language Model Reasoning with Self-motivated Learning Yunlong Feng et.al. 2404.07017 null
2024-04-10 A Mathematical Theory for Learning Semantic Languages by Abstract Learners Kuo-Yu Liao et.al. 2404.07009 null
2024-04-10 WordDecipher: Enhancing Digital Workspace Communication with Explainable AI for Non-native English Speakers Yuexi Chen et.al. 2404.07005 null
2024-04-10 LM Transparency Tool: Interactive Tool for Analyzing Transformer Language Models Igor Tufanov et.al. 2404.07004 null
2024-04-10 Event Grounded Criminal Court View Generation withCooperative (Large) Language Models Linan Yue et.al. 2404.07001 link
2024-04-10 Advancing Real-time Pandemic Forecasting Using Large Language Models: A COVID-19 Case Study Hongru Du et.al. 2404.06962 link
2024-04-09 InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD Xiaoyi Dong et.al. 2404.06512 link
2024-04-09 Can Feedback Enhance Semantic Grounding in Large Vision-Language Models? Yuan-Hong Liao et.al. 2404.06510 null
2024-04-09 On the Effect of (Near) Duplicate Subwords in Language Modelling Anton Schäfer et.al. 2404.06508 link
2024-04-09 Pitfalls of Conversational LLMs on News Debiasing Ipek Baris Schlicht et.al. 2404.06488 null
2024-04-10 Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks Chonghua Wang et.al. 2404.06480 link
2024-04-10 Text-Based Reasoning About Vector Graphics Zhenhailong Wang et.al. 2404.06479 null
2024-04-09 Automated Federated Pipeline for Parameter-Efficient Fine-Tuning of Large Language Models Zihan Fang et.al. 2404.06448 null
2024-04-09 Large Language Models to the Rescue: Deadlock Resolution in Multi-Robot Systems Kunal Garg et.al. 2404.06413 null
2024-04-09 AgentQuest: A Modular Benchmark Framework to Measure Progress and Improve LLM Agents Luca Gioacchini et.al. 2404.06411 link
2024-04-09 Take a Look at it! Rethinking How to Evaluate Language Model Jailbreak Hongyu Cai et.al. 2404.06407 link
2024-04-09 Apprentices to Research Assistants: Advancing Research with Large Language Models M. Namvarpour et.al. 2404.06404 null
2024-04-09 MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies Shengding Hu et.al. 2404.06395 link
2024-04-10 MuPT: A Generative Symbolic Music Pretrained Transformer Xingwei Qu et.al. 2404.06393 null
2024-04-09 Event Extraction in Basque: Typologically motivated Cross-Lingual Transfer-Learning Analysis Mikel Zubillaga et.al. 2404.06392 null
2024-04-09 Latent Distance Guided Alignment Training for Large Language Models Haotian Luo et.al. 2404.06390 null
2024-04-09 Model Generation from Requirements with LLMs: an Exploratory Study Alessio Ferrari et.al. 2404.06371 null
2024-04-09 Enhancing Decision Analysis with a Large Language Model: pyDecision a Comprehensive Library of MCDA Methods in Python Valdecy Pereira et.al. 2404.06370 link
2024-04-09 VISION2UI: A Real-World Dataset with Layout for Code Generation from UI Designs Yi Gui et.al. 2404.06369 null
2024-04-09 ClinLinker: Medical Entity Linking of Clinical Concept Mentions in Spanish Fernando Gallego et.al. 2404.06367 null
2024-04-09 Test-Time Adaptation with SaLIP: A Cascade of SAM and CLIP for Zero shot Medical Image Segmentation Sidra Aleem et.al. 2404.06362 link
2024-04-08 MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding Bo He et.al. 2404.05726 link
2024-04-08 Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs Keen You et.al. 2404.05719 null
2024-04-08 Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding Ahmad Idrissi-Yaghir et.al. 2404.05694 null
2024-04-08 Evaluating Mathematical Reasoning Beyond Accuracy Shijie Xia et.al. 2404.05692 link
2024-04-08 Retrieval-Augmented Open-Vocabulary Object Detection Jooyeon Kim et.al. 2404.05687 link
2024-04-08 MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation Kunpeng Song et.al. 2404.05674 link
2024-04-08 CoReS: Orchestrating the Dance of Reasoning and Segmentation Xiaoyi Bao et.al. 2404.05673 null
2024-04-09 Fighting crime with Transformers: Empirical analysis of address parsing methods in payment data Haitham Hammami et.al. 2404.05632 link
2024-04-08 LTNER: Large Language Model Tagging for Named Entity Recognition with Contextualized Entity Marking Faren Yan et.al. 2404.05624 null
2024-04-08 MULTIFLOW: Shifting Towards Task-Agnostic Vision-Language Pruning Matteo Farina et.al. 2404.05621 link
2024-04-08 SpeechAlign: Aligning Speech Generation to Human Preferences Dong Zhang et.al. 2404.05600 link
2024-04-08 MedExpQA: Multilingual Benchmarking of Large Language Models for Medical Question Answering Iñigo Alonso et.al. 2404.05590 null
2024-04-08 Enhancing Software Related Information Extraction with Generative Language Models through Single-Choice Question Answering Wolfgang Otto et.al. 2404.05587 null
2024-04-08 Towards More General Video-based Deepfake Detection through Facial Feature Guided Adaptation for Foundation Model Yue-Hua Han et.al. 2404.05583 null
2024-04-08 360°REA: Towards A Reusable Experience Accumulation with 360° Assessment for Multi-Agent System Shen Gao et.al. 2404.05569 link
2024-04-08 Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models Bowen Pan et.al. 2404.05567 null
2024-04-08 Chinese Sequence Labeling with Semi-Supervised Boundary-Aware Language Model Pre-training Longhui Zhang et.al. 2404.05560 link
2024-04-08 Evaluating Interventional Reasoning Capabilities of Large Language Models Tejas Kasetty et.al. 2404.05545 null
2024-04-08 OPSD: an Offensive Persian Social media Dataset and its baseline evaluations Mehran Safayani et.al. 2404.05540 null
2024-04-08 Best-of-Venom: Attacking RLHF by Injecting Poisoned Preference Data Tim Baumgärtner et.al. 2404.05530 null
2024-04-05 Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2) Michael Saxon et.al. 2404.04251 link
2024-04-05 Physical Property Understanding from Language-Embedded Feature Fields Albert J. Zhai et.al. 2404.04242 null
2024-04-05 Cleared for Takeoff? Compositional & Conditional Reasoning may be the Achilles Heel to (Flight-Booking) Language Agents Harsh Kohli et.al. 2404.04237 null
2024-04-05 player2vec: A Language Modeling Approach to Understand Player Behavior in Games Tianze Wang et.al. 2404.04234 null
2024-04-05 Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation Ji-Jia Wu et.al. 2404.04231 link
2024-04-05 Unlocking Parameter-Efficient Fine-Tuning for Low-Resource Language Translation Tong Su et.al. 2404.04212 null
2024-04-05 Social Skill Training with Large Language Models Diyi Yang et.al. 2404.04204 null
2024-04-05 Do Sentence Transformers Learn Quasi-Geospatial Concepts from General Text? Ilya Ilyankou et.al. 2404.04169 null
2024-04-05 Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model Xinrun Du et.al. 2404.04167 null
2024-04-05 Dwell in the Beginning: How Language Models Embed Long Documents for Dense Retrieval João Coelho et.al. 2404.04163 null
2024-04-05 BEAR: A Unified Framework for Evaluating Relational Knowledge in Causal and Masked Language Models Jacek Wiland et.al. 2404.04113 link
2024-04-05 Large language models as oracles for instantiating ontologies with domain-specific knowledge Giovanni Ciatto et.al. 2404.04108 link
2024-04-05 Robust Preference Optimization with Provable Noise Tolerance for LLMs Xize Liang et.al. 2404.04102 null
2024-04-05 Label Propagation for Zero-shot Classification with Vision-Language Models Vladan Stojnić et.al. 2404.04072 link
2024-04-05 Assessing the quality of information extraction Filip Seitl et.al. 2404.04068 null
2024-04-05 CLUE: A Clinical Language Understanding Evaluation for LLMs Amin Dada et.al. 2404.04067 link
2024-04-05 VoicePilot: Harnessing LLMs as Speech Interfaces for Physically Assistive Robots Akhil Padmanabha et.al. 2404.04066 null
2024-04-05 A Comparison of Methods for Evaluating Generative IR Negar Arabzadeh et.al. 2404.04044 link
2024-04-05 Teaching Llama a New Language Through Cross-Lingual Knowledge Transfer Hele-Andra Kuulmets et.al. 2404.04042 link
2024-04-05 Willkommens-Merkel, Chaos-Johnson, and Tore-Klose: Modeling the Evaluative Meaning of German Personal Name Compounds Annerose Eichel et.al. 2404.04031 link
2024-04-04 OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views Francis Engelmann et.al. 2404.03650 null
2024-04-04 AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent Hanyu Lai et.al. 2404.03648 link
2024-04-04 Capabilities of Large Language Models in Control Engineering: A Benchmark Study on GPT-4, Claude 3 Opus, and Gemini 1.0 Ultra Darioush Kevian et.al. 2404.03647 null
2024-04-04 Locating and Editing Factual Associations in Mamba Arnab Sen Sharma et.al. 2404.03646 link
2024-04-04 Training LLMs over Neurally Compressed Text Brian Lester et.al. 2404.03626 null
2024-04-04 Standardizing Knowledge Engineering Practices with a Reference Architecture Bradley P. Allen et.al. 2404.03624 null
2024-04-04 Unveiling LLMs: The Evolution of Latent Representations in a Temporal Knowledge Graph Marco Bronzini et.al. 2404.03623 link
2024-04-04 Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models Wenshan Wu et.al. 2404.03622 null
2024-04-04 DeViDe: Faceted medical knowledge for improved medical vision-language pre-training Haozhe Luo et.al. 2404.03618 null
2024-04-04 Sailor: Open Language Models for South-East Asia Longxu Dou et.al. 2404.03608 link
2024-04-04 Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization Aniruddha Nrusimha et.al. 2404.03605 link
2024-04-04 Evaluating LLMs at Detecting Errors in LLM Responses Ryo Kamoi et.al. 2404.03602 link
2024-04-04 Intent Detection and Entity Extraction from BioMedical Literature Ankan Mullick et.al. 2404.03598 link
2024-04-04 ReFT: Representation Finetuning for Language Models Zhengxuan Wu et.al. 2404.03592 link
2024-04-04 SemGrasp: Semantic Grasp Generation via Language Aligned Discretization Kailin Li et.al. 2404.03590 null
2024-04-04 Untangle the KNOT: Interweaving Conflicting Knowledge and Reasoning Skills in Large Language Models Yantao Liu et.al. 2404.03577 link
2024-04-04 Embodied AI with Two Arms: Zero-shot Learning, Safety and Modularity Jake Varley et.al. 2404.03570 null
2024-04-04 Personalized LLM Response Generation with Parameterized Memory Injection Kai Zhang et.al. 2404.03565 null
2024-04-04 Select and Summarize: Scene Saliency for Movie Script Summarization Rohit Saxena et.al. 2404.03561 link
2024-04-04 How does Multi-Task Training Affect Transformer In-Context Capabilities? Investigations with Function Classes Harmon Bhasin et.al. 2404.03558 link
2024-04-03 ALOHa: A New Measure for Hallucination in Captioning Models Suzanne Petryk et.al. 2404.02904 null
2024-04-03 MatAtlas: Text-driven Consistent Geometry Texturing and Material Assignment Duygu Ceylan et.al. 2404.02899 null
2024-04-03 ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline Yifan Xu et.al. 2404.02893 link
2024-04-03 MODNO: Multi Operator Learning With Distributed Neural Operators Zecheng Zhang et.al. 2404.02892 null
2024-04-03 Linear Attention Sequence Parallelism Weigao Sun et.al. 2404.02882 link
2024-04-03 Integrating Explanations in Learning LTL Specifications from Demonstrations Ashutosh Gupta et.al. 2404.02872 null
2024-04-03 Toward Inference-optimal Mixture-of-Expert Large Language Models Longfei Yun et.al. 2404.02852 null
2024-04-03 I-Design: Personalized LLM Interior Designer Ata Çelen et.al. 2404.02838 null
2024-04-03 Cherry on Top: Parameter Heterogeneity and Quantization in Large Language Models Wanyun Cui et.al. 2404.02837 null
2024-04-03 Retrieving Examples from Memory for Retrieval Augmented Neural Machine Translation: A Systematic Comparison Maxime Bouthors et.al. 2404.02835 null
2024-04-03 Empowering Biomedical Discovery with AI Agents Shanghua Gao et.al. 2404.02831 null
2024-04-03 BAdam: A Memory Efficient Full Parameter Training Method for Large Language Models Qijun Luo et.al. 2404.02827 link
2024-04-03 Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models Haoran Sun et.al. 2404.02823 link
2024-04-03 A Survey of Optimization-based Task and Motion Planning: From Classical To Learning Approaches Zhigen Zhao et.al. 2404.02817 null
2024-04-03 The RealHumanEval: Evaluating Large Language Models' Abilities to Support Programmers Hussein Mozannar et.al. 2404.02806 link
2024-04-03 Efficient Multi-Vector Dense Retrieval Using Bit Vectors Franco Maria Nardini et.al. 2404.02805 link
2024-04-03 AI and personalized learning: bridging the gap with modern educational goals Kristjan-Julius Laak et.al. 2404.02798 null
2024-04-03 CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech Jaehyeon Kim et.al. 2404.02781 null
2024-04-03 FPT: Feature Prompt Tuning for Few-shot Readability Assessment Ziyang Wang et.al. 2404.02772 link
2024-04-03 DIBS: Enhancing Dense Video Captioning with Unlabeled Videos via Pseudo Boundary Enrichment and Online Refinement Hao Wu et.al. 2404.02755 null
2024-04-02 Segment Any 3D Object with Language Seungjun Lee et.al. 2404.02157 null
2024-04-02 Iterated Learning Improves Compositionality in Large Vision-Language Models Chenhao Zheng et.al. 2404.02145 null
2024-04-02 Topic-based Watermarks for LLM-Generated Text Alexander Nemecek et.al. 2404.02138 null
2024-04-02 ViTamin: Designing Scalable Vision Models in the Vision-Language Era Jienneg Chen et.al. 2404.02132 link
2024-04-02 FLawN-T5: An Empirical Examination of Effective Instruction-Tuning Data Mixtures for Legal Reasoning Joel Niklaus et.al. 2404.02127 link
2024-04-02 Exploring Automated Distractor Generation for Math Multiple-choice Questions via Large Language Models Wanyong Feng et.al. 2404.02124 link
2024-04-02 GINopic: Topic Modeling with Graph Isomorphism Network Suman Adhya et.al. 2404.02115 link
2024-04-02 CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions for RAG systems Sara Rosenthal et.al. 2404.02103 link
2024-04-02 Advancing LLM Reasoning Generalists with Preference Trees Lifan Yuan et.al. 2404.02078 link
2024-04-02 Red-Teaming Segment Anything Model Krzysztof Jankowski et.al. 2404.02067 link
2024-04-02 Digital Forgetting in Large Language Models: A Survey of Unlearning Methods Alberto Blanco-Justicia et.al. 2404.02062 null
2024-04-02 Long-context LLMs Struggle with Long In-context Learning Tianle Li et.al. 2404.02060 link
2024-04-02 IISAN: Efficiently Adapting Multimodal Representation for Sequential Recommendation with Decoupled PEFT Junchen Fu et.al. 2404.02059 link
2024-04-02 Deconstructing In-Context Learning: Understanding Prompts via Corruption Namrata Shivagunde et.al. 2404.02054 link
2024-04-02 A Survey on Large Language Model-Based Game Agents Sihao Hu et.al. 2404.02039 link
2024-04-02 MultiParaDetox: Extending Text Detoxification with Parallel Data to New Languages Daryna Dementieva et.al. 2404.02037 null
2024-04-02 Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts Zhuo Chen et.al. 2404.02022 link
2024-04-02 Large Language Models for Orchestrating Bimanual Robots Kun Chu et.al. 2404.02018 null
2024-04-02 MuxServe: Flexible Multiplexing for Efficient Multiple LLM Serving Jiangfei Duan et.al. 2404.02015 link
2024-04-02 Dissecting Paraphrases: The Impact of Prompt Syntax and supplementary Information on Knowledge Retrieval from Pretrained Language Models Stephan Linzbach et.al. 2404.01992 null
2024-03-29 Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models Atsuyuki Miyai et.al. 2403.20331 link
2024-03-29 Are We on the Right Way for Evaluating Large Vision-Language Models? Lin Chen et.al. 2403.20330 link
2024-03-29 ReALM: Reference Resolution As Language Modeling Joel Ruben Antony Moniz et.al. 2403.20329 null
2024-03-29 Gecko: Versatile Text Embeddings Distilled from Large Language Models Jinhyuk Lee et.al. 2403.20327 null
2024-03-29 Convolutional Prompting meets Language Models for Continual Learning Anurag Roy et.al. 2403.20317 null
2024-03-29 Learn "No" to Say "Yes" Better: Improving Vision-Language Models via Negations Jaisidh Singh et.al. 2403.20312 link
2024-03-29 Towards Greener LLMs: Bringing Energy-Efficiency to the Forefront of LLM Inference Jovan Stojkovic et.al. 2403.20306 null
2024-03-29 Can LLMs Correct Physicians, Yet? Investigating Effective Interaction Methods in the Medical Domain Burcu Sayin et.al. 2403.20288 link
2024-03-29 LUQ: Long-text Uncertainty Quantification for LLMs Caiqi Zhang et.al. 2403.20279 null
2024-04-01 Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want Weifeng Lin et.al. 2403.20271 link
2024-03-29 Latxa: An Open Language Model and Evaluation Suite for Basque Julen Etxaniz et.al. 2403.20266 link
2024-03-29 ELITR-Bench: A Meeting Assistant Benchmark for Long-Context Language Models Thibaut Thonet et.al. 2403.20262 link
2024-03-29 MedCLIP-SAM: Bridging Text and Image Towards Universal Medical Image Segmentation Taha Koleilat et.al. 2403.20253 link
2024-03-29 Using LLMs to Model the Beliefs and Preferences of Targeted Populations Keiichi Namikoshi et.al. 2403.20252 null
2024-03-29 Long-Tailed Anomaly Detection with Learnable Class Names Chih-Hui Ho et.al. 2403.20236 null
2024-03-29 H2RSVLM: Towards Helpful and Honest Remote Sensing Large Vision Language Model Chao Pang et.al. 2403.20213 link
2024-03-29 Unleashing the Potential of Large Language Models for Predictive Tabular Tasks in Data Science Yazheng Yang et.al. 2403.20208 null
2024-03-29 The Future of Combating Rumors? Retrieval, Discrimination, and Generation Junhao Xu et.al. 2403.20204 null
2024-03-29 ConvBench: A Multi-Turn Conversation Evaluation Benchmark with Hierarchical Capability for Large Vision-Language Models Shuo Liu et.al. 2403.20194 null
2024-03-29 HARMamba: Efficient Wearable Sensor Human Activity Recognition Based on Bidirectional Selective SSM Shuangjian Li et.al. 2403.20183 null
2024-03-28 RSMamba: Remote Sensing Image Classification with State Space Model Keyan Chen et.al. 2403.19654 link
2024-03-28 InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction Sirui Xu et.al. 2403.19652 null
2024-03-28 MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions Kai Zhang et.al. 2403.19651 link
2024-03-28 Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models Samuel Marks et.al. 2403.19647 link
2024-03-28 Change-Agent: Towards Interactive Comprehensive Change Interpretation and Analysis from Change Detection and Change Captioning Chenyang Liu et.al. 2403.19646 link
2024-03-28 Retrieval-Enhanced Knowledge Editing for Multi-Hop Question Answering in Language Models Yucheng Shi et.al. 2403.19631 link
2024-03-28 RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents Zeren Chen et.al. 2403.19622 null
2024-03-28 SAID-NeRF: Segmentation-AIDed NeRF for Depth Completion of Transparent Objects Avinash Ummadisingu et.al. 2403.19607 null
2024-03-28 Img2Loc: Revisiting Image Geolocalization using Multi-modality Foundation Models and Image-based Retrieval-Augmented Generation Zhongliang Zhou et.al. 2403.19584 link
2024-03-28 Keypoint Action Tokens Enable In-Context Imitation Learning in Robotics Norman Di Palo et.al. 2403.19578 null
2024-03-28 WaterJudge: Quality-Detection Trade-off when Watermarking Large Language Models Piotr Molenda et.al. 2403.19548 null
2024-03-28 Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Models Ang Lv et.al. 2403.19521 link
2024-03-28 Improving Clinical NLP Performance through Language Model-Generated Synthetic Clinical Data Shan Chen et.al. 2403.19511 link
2024-03-28 LLMs as Academic Reading Companions: Extending HCI Through Synthetic Personae Celia Chen et.al. 2403.19506 null
2024-03-28 Evolving Assembly Code in an Adversarial Environment Irina Maliukov et.al. 2403.19489 link
2024-03-28 JDocQA: Japanese Document Question Answering Dataset for Generative Language Models Eri Onami et.al. 2403.19454 link
2024-03-28 Mixed Preference Optimization: Reinforcement Learning with Data Selection and Better Reference Model Qi Gou et.al. 2403.19443 null
2024-03-28 OAKINK2: A Dataset of Bimanual Hands-Object Manipulation in Complex Task Completion Xinyu Zhan et.al. 2403.19417 null
2024-03-28 BP4ER: Bootstrap Prompting for Explicit Reasoning in Medical Dialogue Generation Yuhong He et.al. 2403.19414 null
2024-03-28 Checkpoint Merging via Bayesian Optimization in LLM Pretraining Deyuan Liu et.al. 2403.19390 null
2024-03-27 Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models Yanwei Li et.al. 2403.18814 link
2024-03-28 ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation Suraj Patni et.al. 2403.18807 link
2024-03-27 Is Modularity Transferable? A Case Study through the Lens of Knowledge Distillation Mateusz Klimaszewski et.al. 2403.18804 link
2024-03-27 Projective Methods for Mitigating Gender Bias in Pre-trained Language Models Hillary Dawkins et.al. 2403.18803 link
2024-03-27 Long-form factuality in large language models Jerry Wei et.al. 2403.18802 link
2024-03-27 Towards a World-English Language Model for On-Device Virtual Assistants Rricha Jalota et.al. 2403.18783 null
2024-03-27 3P-LLM: Probabilistic Path Planning using Large Language Model for Autonomous Robot Navigation Ehsan Latif et.al. 2403.18778 null
2024-03-27 ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object Chenshuang Zhang et.al. 2403.18775 link
2024-03-27 CheckEval: Robust Evaluation Framework using Large Language Model via Checklist Yukyung Lee et.al. 2403.18771 null
2024-03-27 MLDT: Multi-Level Decomposition for Complex Long-Horizon Robotic Task Planning with Open-Source Large Language Model Yike Wu et.al. 2403.18760 link
2024-03-27 CYCLE: Learning to Self-Refine the Code Generation Yangruibo Ding et.al. 2403.18746 link
2024-03-27 Understanding the Learning Dynamics of Alignment with Human Feedback Shawn Im et.al. 2403.18742 link
2024-03-27 PhysicsAssistant: An LLM-Powered Interactive Learning Robot for Physics Lab Investigations Ehsan Latif et.al. 2403.18721 null
2024-03-27 Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding Xintong Wang et.al. 2403.18715 null
2024-03-27 The Invalsi Benchmark: measuring Language Models Mathematical and Language understanding in Italian Andrea Esuli et.al. 2403.18697 null
2024-03-27 NL-ITI: Optimizing Probing and Intervention for Improvement of ITI Method Jakub Hoscilowicz et.al. 2403.18680 link
2024-03-27 An Exploratory Study on Upper-Level Computing Students' Use of Large Language Models as Tools in a Semester-Long Project Ben Arie Tanay et.al. 2403.18679 null
2024-03-27 SDSAT: Accelerating LLM Inference through Speculative Decoding with Semantic Adaptive Tokens Chengbo Liu et.al. 2403.18647 link
2024-03-27 To Recommend or Not: Recommendability Identification in Conversations with Pre-trained Language Models Zhefan Wang et.al. 2403.18628 link
2024-03-27 Vulnerability Detection with Code Language Models: How Far Are We? Yangruibo Ding et.al. 2403.18624 link
2024-03-26 OmniVid: A Generative Framework for Universal Video Understanding Junke Wang et.al. 2403.17935 link
2024-03-26 Track Everything Everywhere Fast and Robustly Yunzhou Song et.al. 2403.17931 null
2024-03-26 MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution Wei Tao et.al. 2403.17927 null
2024-03-26 **LISA: Layerwise Import

About

Automatically Update Arxiv Papers about Path Planning, LLM and Autonomous Driving using Github Actions since 2024.2.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages