| Main Authors: | Tan, Bowen, Zhu, Yun, Liu, Lijuan, Wang, Hongyi, Zhuang, Yonghao, Chen, Jindong, Xing, Eric, Hu, Zhiting |
|---|---|
| Format: | Preprint |
| Published: | 2023 |
| Online Access: | https://arxiv.org/abs/2310.16355 |
Similar Items
Synthesizing Privacy-Preserving Text Data via Finetuning without Finetuning Billion-Scale LLMs
by: Tan, Bowen, et al.
Published: (2025)
Benchmarking Quantum Red TEA on CPUs, GPUs, and TPUs
by: Jaschke, Daniel, et al.
Published: (2024)
Toward Inference-optimal Mixture-of-Expert Large Language Models
by: Yun, Longfei, et al.
Published: (2024)
Critiques of World Models
by: Xing, Eric, et al.
Published: (2025)
General Agentic Planning Through Simulative Reasoning with World Models
by: Deng, Mingkai, et al.
Published: (2025)
Efficient Long-context Language Model Training by Core Attention Disaggregation
by: Zhuang, Yonghao, et al.
Published: (2025)
RLAX: Large-Scale, Distributed Reinforcement Learning for Large Language Models on TPUs
by: Zhou, Runlong, et al.
Published: (2025)
Leveraging Compute-in-Memory for Efficient Generative Model Inference in TPUs
by: Zhu, Zhantong, et al.
Published: (2025)
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings
by: Hao, Shibo, et al.
Published: (2023)
The Host Galaxy (If Any) of the Little Red Dots
by: Chen, Chang-Hao, et al.
Published: (2024)
BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs
by: Fan, Zhiting, et al.
Published: (2024)
Response to Promises and Pitfalls of Deep Kernel Learning
by: Wilson, Andrew Gordon, et al.
Published: (2025)
Architectural Limits of Cloud TPUs in Finite-Field Cryptography
by: Dang, Hung, et al.
Published: (2026)
AGILE: Lightweight and Efficient Asynchronous GPU-SSD Integration
by: Yang, Zhuoping, et al.
Published: (2025)
SlimPajama-DC: Understanding Data Combinations for LLM Training
by: Shen, Zhiqiang, et al.
Published: (2023)
Ep. 758: AI Surveillance: Mastering Frigate, YOLO, and TPUs
by: Rosehill, Daniel, et al.
Published: (2026)
Adapting AlphaEvolve to Optimize Fully Homomorphic Encryption on TPUs
by: Gorantala, Shruthi, et al.
Published: (2026)
SCALE-Sim TPU: Validating and Extending SCALE-Sim for TPUs
by: Dang, Jingtian, et al.
Published: (2026)
Fully Bio‐Based TPUs From PLA/PTMC: Tunable Strength and Degradation Profiles
by: Qingshan Zhao, et al.
Published: (2025)
Enhancing Perception Capabilities of Multimodal LLMs with Training-Free Fusion
by: Chen, Zhuokun, et al.
Published: (2024)
FairMT-Bench: Benchmarking Fairness for Multi-turn Dialogue in Conversational LLMs
by: Fan, Zhiting, et al.
Published: (2024)
Collaborative Processing for Multi-Tenant Inference on Memory-Constrained Edge TPUs
by: Ng, Nathan, et al.
Published: (2026)
AnyPos: Automated Task-Agnostic Actions for Bimanual Manipulation
by: Tan, Hengkai, et al.
Published: (2025)
Emergence of Superposition: Unveiling the Training Dynamics of Chain of Continuous Thought
by: Zhu, Hanlin, et al.
Published: (2025)
Juvenile Species in Beach Seine Bycatch Along the Coast of Ghana: Any Implications for Fisheries?
by: Margaret Fafa Awushie Akwetey, et al.
Published: (2025)
BiasGuard: A Reasoning-enhanced Bias Detection Tool For Large Language Models
by: Fan, Zhiting, et al.
Published: (2025)
Poplar: Efficient Scaling of Distributed DNN Training on Heterogeneous GPU Clusters
by: Zhang, WenZheng, et al.
Published: (2024)
Dynamic Rewarding with Prompt Optimization Enables Tuning-free Self-Alignment of Language Models
by: Singla, Somanshu, et al.
Published: (2024)
Research on a Camera Position Measurement Method based on a Parallel Perspective Error Transfer Model
by: Hu, Ning, et al.
Published: (2026)
scPilot: Large Language Model Reasoning Toward Automated Single-Cell Analysis and Discovery
by: Gao, Yiming, et al.
Published: (2026)
TTrace: Lightweight Error Checking and Diagnosis for Distributed Training
by: Jiang, Haitian, et al.
Published: (2025)
EvoRL: A GPU-accelerated Framework for Evolutionary Reinforcement Learning
by: Zheng, Bowen, et al.
Published: (2025)
DISTFLASHATTN: Distributed Memory-efficient Attention for Long-context LLMs Training
by: Li, Dacheng, et al.
Published: (2023)
Fusion-Eval: Integrating Assistant Evaluators with LLMs
by: Shu, Lei, et al.
Published: (2023)
RedCoder: Automated Multi-Turn Red Teaming for Code LLMs
by: Mo, Wenjie Jacky, et al.
Published: (2025)
CuPyMag: GPU-Accelerated Finite-Element Micromagnetics with Magnetostriction
by: Guan, Hongyi, et al.
Published: (2025)
HARP: Orchestrating Automated Parallel Training on Heterogeneous GPU Clusters
by: Liang, Antian, et al.
Published: (2025)
Lightweight Model Editing for LLMs to Correct Deprecated API Recommendations
by: Lin, Guancheng, et al.
Published: (2025)
Logit Poisoning Attack in Distillation-based Federated Learning and its Countermeasures
by: Yu, Yonghao, et al.
Published: (2024)
Cornfigurator: Automated Planning for Any-to-Any Multimodal Model Serving
by: Ma, Jeff J., et al.
Published: (2025)