Saved in:
| Main Authors: | Qin, Yulu, Wang, Wentao, Lake, Brenden M. |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.07899 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Rapid Word Learning Through Meta In-Context Learning
by: Wang, Wentao, et al.
Published: (2025)
by: Wang, Wentao, et al.
Published: (2025)
On the robustness of modeling grounded word learning through a child's egocentric input
by: Vong, Wai Keen, et al.
Published: (2025)
by: Vong, Wai Keen, et al.
Published: (2025)
An explainable transformer circuit for compositional generalization
by: Tang, Cheng, et al.
Published: (2025)
by: Tang, Cheng, et al.
Published: (2025)
Do different prompting methods yield a common task representation in language models?
by: Davidson, Guy, et al.
Published: (2025)
by: Davidson, Guy, et al.
Published: (2025)
Self-supervised learning of video representations from a child's perspective
by: Orhan, A. Emin, et al.
Published: (2024)
by: Orhan, A. Emin, et al.
Published: (2024)
Vocabulary shapes cross-lingual variation of word-order learnability in language models
by: Martins, Jonas Mayer, et al.
Published: (2026)
by: Martins, Jonas Mayer, et al.
Published: (2026)
Compositional learning of functions in humans and machines
by: Zhou, Yanli, et al.
Published: (2024)
by: Zhou, Yanli, et al.
Published: (2024)
From learnable objects to learnable random objects
by: Anderson, Aaron, et al.
Published: (2025)
by: Anderson, Aaron, et al.
Published: (2025)
Are they human? Detecting large language models by probing human memory constraints
by: Schug, Simon, et al.
Published: (2026)
by: Schug, Simon, et al.
Published: (2026)
From Distributional to Overton Pluralism: Investigating Large Language Model Alignment
by: Lake, Thom, et al.
Published: (2024)
by: Lake, Thom, et al.
Published: (2024)
Multiple output samples per input in a single-output Gaussian process
by: Wong, Jeremy H. M., et al.
Published: (2023)
by: Wong, Jeremy H. M., et al.
Published: (2023)
Injecting linguistic knowledge into BERT for Dialogue State Tracking
by: Feng, Xiaohan, et al.
Published: (2023)
by: Feng, Xiaohan, et al.
Published: (2023)
Overcoming classic challenges for artificial neural networks by providing incentives and practice
by: Irie, Kazuki, et al.
Published: (2024)
by: Irie, Kazuki, et al.
Published: (2024)
LoRA-SP: Streamlined Partial Parameter Adaptation for Resource-Efficient Fine-Tuning of Large Language Models
by: Wu, Yichao, et al.
Published: (2024)
by: Wu, Yichao, et al.
Published: (2024)
Probing self-attention in self-supervised speech models for cross-linguistic differences
by: Gopinath, Sai, et al.
Published: (2024)
by: Gopinath, Sai, et al.
Published: (2024)
CoLLEGe: Concept Embedding Generation for Large Language Models
by: Teehan, Ryan, et al.
Published: (2024)
by: Teehan, Ryan, et al.
Published: (2024)
Dimensional Collapse in Transformer Attention Outputs: A Challenge for Sparse Dictionary Learning
by: Wang, Junxuan, et al.
Published: (2025)
by: Wang, Junxuan, et al.
Published: (2025)
Talking with Oompa Loompas: A novel framework for evaluating linguistic acquisition of LLM agents
by: Swain, Sankalp Tattwadarshi, et al.
Published: (2025)
by: Swain, Sankalp Tattwadarshi, et al.
Published: (2025)
Deep Learning Approaches for Improving Question Answering Systems in Hepatocellular Carcinoma Research
by: Huo, Shuning, et al.
Published: (2024)
by: Huo, Shuning, et al.
Published: (2024)
Direct Multi-Turn Preference Optimization for Language Agents
by: Shi, Wentao, et al.
Published: (2024)
by: Shi, Wentao, et al.
Published: (2024)
Classification errors distort findings in automated speech processing: examples and solutions from child-development research
by: Gautheron, Lucas, et al.
Published: (2025)
by: Gautheron, Lucas, et al.
Published: (2025)
SUS backprop: linear backpropagation algorithm for long inputs in transformers
by: Pankov, Sergey, et al.
Published: (2025)
by: Pankov, Sergey, et al.
Published: (2025)
Building Korean linguistic resource for NLU data generation of banking app CS dialog system
by: Yoon, Jeongwoo, et al.
Published: (2026)
by: Yoon, Jeongwoo, et al.
Published: (2026)
Automatically Identifying Local and Global Circuits with Linear Computation Graphs
by: Ge, Xuyang, et al.
Published: (2024)
by: Ge, Xuyang, et al.
Published: (2024)
CAMPHOR: Collaborative Agents for Multi-input Planning and High-Order Reasoning On Device
by: Fu, Yicheng, et al.
Published: (2024)
by: Fu, Yicheng, et al.
Published: (2024)
Overcoming linguistic barriers in code assistants: creating a QLoRA adapter to improve support for Russian-language code writing instructions
by: Pronin, C. B., et al.
Published: (2024)
by: Pronin, C. B., et al.
Published: (2024)
True Knowledge Comes from Practice: Aligning LLMs with Embodied Environments via Reinforcement Learning
by: Tan, Weihao, et al.
Published: (2024)
by: Tan, Weihao, et al.
Published: (2024)
Self-Improvement Towards Pareto Optimality: Mitigating Preference Conflicts in Multi-Objective Alignment
by: Li, Moxin, et al.
Published: (2025)
by: Li, Moxin, et al.
Published: (2025)
From Parameters to Data: A Task-Parameter-Guided Fine-Tuning Pipeline for Efficient LLM Alignment
by: Chen, Hao, et al.
Published: (2026)
by: Chen, Hao, et al.
Published: (2026)
Inference time LLM alignment in single and multidomain preference spectrum
by: Shahriar, Sadat, et al.
Published: (2024)
by: Shahriar, Sadat, et al.
Published: (2024)
Towards Understanding the Nature of Attention with Low-Rank Sparse Decomposition
by: He, Zhengfu, et al.
Published: (2025)
by: He, Zhengfu, et al.
Published: (2025)
From Chat Logs to Collective Insights: Aggregative Question Answering
by: Zhang, Wentao, et al.
Published: (2025)
by: Zhang, Wentao, et al.
Published: (2025)
Learning Multiplex Representations on Text-Attributed Graphs with One Language Model Encoder
by: Jin, Bowen, et al.
Published: (2023)
by: Jin, Bowen, et al.
Published: (2023)
pQuant: Towards Effective Low-Bit Language Models via Decoupled Linear Quantization-Aware Training
by: Zhang, Wenzheng, et al.
Published: (2026)
by: Zhang, Wenzheng, et al.
Published: (2026)
Bridging the Gap between 2D and 3D Visual Question Answering: A Fusion Approach for 3D VQA
by: Mo, Wentao, et al.
Published: (2024)
by: Mo, Wentao, et al.
Published: (2024)
Density estimation with LLMs: a geometric investigation of in-context learning trajectories
by: Liu, Toni J. B., et al.
Published: (2024)
by: Liu, Toni J. B., et al.
Published: (2024)
Supervised Fine-Tuning Needs to Unlock the Potential of Token Priority
by: Shen, Zhanming, et al.
Published: (2026)
by: Shen, Zhanming, et al.
Published: (2026)
Interactive Training: Feedback-Driven Neural Network Optimization
by: Zhang, Wentao, et al.
Published: (2025)
by: Zhang, Wentao, et al.
Published: (2025)
BEATS: Optimizing LLM Mathematical Capabilities with BackVerify and Adaptive Disambiguate based Efficient Tree Search
by: Sun, Linzhuang, et al.
Published: (2024)
by: Sun, Linzhuang, et al.
Published: (2024)
Machine-assisted writing evaluation: Exploring pre-trained language models in analyzing argumentative moves
by: Qin, Wenjuan, et al.
Published: (2025)
by: Qin, Wenjuan, et al.
Published: (2025)
Similar Items
-
Rapid Word Learning Through Meta In-Context Learning
by: Wang, Wentao, et al.
Published: (2025) -
On the robustness of modeling grounded word learning through a child's egocentric input
by: Vong, Wai Keen, et al.
Published: (2025) -
An explainable transformer circuit for compositional generalization
by: Tang, Cheng, et al.
Published: (2025) -
Do different prompting methods yield a common task representation in language models?
by: Davidson, Guy, et al.
Published: (2025) -
Self-supervised learning of video representations from a child's perspective
by: Orhan, A. Emin, et al.
Published: (2024)