Updated on 2025.10.22

Table of Contents

<a href=#peft>PEFT</a>
<a href=#text-to-image-generation>Text-to-Image Generation</a>
<a href=#vision-language-models>Vision-Language Models</a>
<a href=#generative-weight-space-modeling>Generative Weight Space Modeling</a>
<a href=#data-distillation>Data Distillation</a>
<a href=#schrodinger-bridge>Schrodinger Bridge</a>
<a href=#dataset-distillation>Dataset Distillation</a>
<a href=#synthetic-data-generation>Synthetic Data Generation</a>

PEFT

Publish Date	Title	Authors	PDF	Code
2025-07-23	Swin-TUNA : A Novel PEFT Approach for Accurate Food Image Segmentation	Haotian Chen et.al.	2507.17347	null
2025-07-21	Regularized Low-Rank Adaptation for Few-Shot Organ Segmentation	Ghassen Baklouti et.al.	2507.15793	null
2025-07-20	Parameter-Efficient Fine-Tuning of Foundation Models for CLP Speech Classification	Susmita Bhattacharjee et.al.	2507.14898	null
2025-07-18	Solo Connection: A Parameter Efficient Fine-Tuning Technique for Transformers	Harsh Nilesh Pathak et.al.	2507.14353	null
2025-07-18	PRIDE – Parameter-Efficient Reduction of Identity Discrimination for Equality in LLMs	Maluna Menke et.al.	2507.13743	null
2025-07-17	Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy	Yiting Yang et.al.	2507.13260	null
2025-07-14	Enhancing Chain-of-Thought Reasoning with Critical Representation Fine-tuning	Chenxi Huang et.al.	2507.10085	null
2025-07-13	CKAA: Cross-subspace Knowledge Alignment and Aggregation for Robust Continual Learning	Lingfeng He et.al.	2507.09471	null
2025-07-09	ConsNoTrainLoRA: Data-driven Weight Initialization of Low-rank Adapters using Constraints	Debasmit Das et.al.	2507.08044	null
2025-07-07	EXPOTION: Facial Expression and Motion Control for Multimodal Music Generation	Fathinah Izzati et.al.	2507.04955	null
2025-07-06	AdS: Adapter-state Sharing Framework for Multimodal Sarcasm Detection	Soumyadeep Jana et.al.	2507.04508	null
2025-07-08	LoSiA: Efficient High-Rank Fine-Tuning via Subnet Localization and Optimization	Xujia Wang et.al.	2507.04487	null
2025-07-05	Large Language Models for Zero-Shot Multicultural Name Recognition	Thanakorn Phonchai et.al.	2507.04149	null
2025-07-16	Symbiosis: Multi-Adapter Inference and Fine-Tuning	Saransh Gupta et.al.	2507.03220	null
2025-07-03	Preserving Privacy, Increasing Accessibility, and Reducing Cost: An On-Device Artificial Intelligence Model for Medical Transcription and Note Generation	Johnson Thomas et.al.	2507.03033	null
2025-07-03	DoMIX: An Efficient Framework for Exploiting Domain Knowledge in Fine-Tuning	Dohoon Kim et.al.	2507.02302	null
2025-06-25	WaRA: Wavelet Low Rank Adaptation	Moein Heidari et.al.	2506.24092	null
2025-06-29	MoMa: Modulating Mamba for Adapting Image Foundation Models to Video Recognition	Yuhuan Yang et.al.	2506.23283	null
2025-06-26	Exploring Adapter Design Tradeoffs for Low Resource Music Generation	Atharva Mehta et.al.	2506.21298	null
2025-06-26	WordCon: Word-level Typography Control in Scene Text Rendering	Wenda Shi et.al.	2506.21276	null
2025-06-25	LKA: Large Kernel Adapter for Enhanced Medical Image Classification	Ziquan Zhu et.al.	2506.19118	null
2025-06-22	Memba: Membrane-driven Parameter-Efficient Fine-Tuning for Mamba	Donghyun Lee et.al.	2506.18184	null
2025-06-19	Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights	Zhiyuan Liang et.al.	2506.16406	null
2025-06-18	Targeted Lexical Injection: Unlocking Latent Cross-Lingual Alignment in Lugha-Llama via Early-Layer LoRA Fine-Tuning	Stanley Ngugi et.al.	2506.15415	null
2025-06-18	Singular Value Decomposition on Kronecker Adaptation for Large Language Model	Yee Hin Chong et.al.	2506.15251	null
2025-06-17	GuiLoMo: Allocating Expert Number and Rank for LoRA-MoE via Bilevel Optimization with GuidedSelection Vectors	Hengyuan Zhang et.al.	2506.14646	link
2025-06-17	Sharp Generalization Bounds for Foundation Models with Asymmetric Randomized Low-Rank Adapters	Anastasis Kratsios et.al.	2506.14530	null
2025-06-17	Prefix-Tuning+: Modernizing Prefix-Tuning by Decoupling the Prefix from Attention	Haonan Wang et.al.	2506.13674	null
2025-06-16	Dynamic Context-oriented Decomposition for Task-aware Low-rank Adaptation with Less Forgetting and Faster Convergence	Yibo Yang et.al.	2506.13187	null
2025-06-14	LARGO: Low-Rank Regulated Gradient Projection for Robust Parameter Efficient Fine-Tuning	Haotian Zhang et.al.	2506.12394	null
2025-06-14	EKPC: Elastic Knowledge Preservation and Compensation for Class-Incremental Learning	Huaijie Wang et.al.	2506.12351	null
2025-06-13	Personalized LLM Decoding via Contrasting Personal Preference	Hyungjune Bu et.al.	2506.12109	null
2025-06-13	LoRA Users Beware: A Few Spurious Tokens Can Manipulate Your Finetuned Model	Pradyut Sekhsaria et.al.	2506.11402	link
2025-06-12	Text to Image for Multi-Label Image Recognition with Joint Prompt-Adapter Learning	Chun-Mei Feng et.al.	2506.10575	null
2025-06-10	FZOO: Fast Zeroth-Order Optimizer for Fine-Tuning Large Language Models towards Adam-Scale Speed	Sizhe Dang et.al.	2506.09034	null
2025-06-09	PrunePEFT: Iterative Hybrid Pruning for Parameter-Efficient Fine-tuning of LLMs	Tongzhou Yu et.al.	2506.07587	null
2025-06-07	Adapt Once, Thrive with Updates: Transferable Parameter-Efficient Fine-Tuning on Evolving Base Models	Naibin Gu et.al.	2506.06844	null
2025-06-06	InstantFT: An FPGA-Based Runtime Subsecond Fine-tuning of CNN Models	Keisuke Sugiura et.al.	2506.06505	null
2025-06-06	Mitigating Catastrophic Forgetting with Adaptive Transformer Block Expansion in Federated Fine-Tuning	Yujia Huo et.al.	2506.05977	null
2025-06-06	MoA: Heterogeneous Mixture of Adapters for Parameter-Efficient Fine-Tuning of Large Language Models	Jie Cao et.al.	2506.05928	null
2025-06-04	Gradient Inversion Attacks on Parameter-Efficient Fine-Tuning	Hasin Us Sami et.al.	2506.04453	link
2025-06-03	DiaBlo: Diagonal Blocks Are Sufficient For Finetuning	Selcuk Gurses et.al.	2506.03230	link
2025-06-01	Taming LLMs by Scaling Learning Rates with Gradient Grouping	Siyuan Li et.al.	2506.01049	null
2025-06-01	FedQuad: Adaptive Layer-wise LoRA Deployment and Activation Quantization for Federated Fine-Tuning	Rukuo Li et.al.	2506.01001	null
2025-06-01	Uni-LoRA: One Vector is All You Need	Kaiyang Li et.al.	2506.00799	null
2025-05-31	Assortment of Attention Heads: Accelerating Federated PEFT with Head Pruning and Strategic Client Selection	Yeshwanth Venkatesha et.al.	2506.00743	null
2025-05-31	FLoE: Fisher-Based Layer Selection for Efficient Sparse Adaptation of Low-Rank Experts	Xinyi Wang et.al.	2506.00495	null
2025-05-30	Localized LoRA: A Structured Low-Rank Approximation for Efficient Fine-Tuning	Babak Barazandeh et.al.	2506.00236	null
2025-05-30	CL-LoRA: Continual Low-Rank Adaptation for Rehearsal-Free Class-Incremental Learning	Jiangpeng He et.al.	2505.24816	link
2025-06-03	DLP: Dynamic Layerwise Pruning in Large Language Models	Yuli Chen et.al.	2505.23807	link
2025-05-29	Boosting Domain Incremental Learning: Selecting the Optimal Parameters is All You Need	Qiang Wang et.al.	2505.23744	null
2025-05-29	SC-LoRA: Balancing Efficient Fine-tuning and Knowledge Preservation via Subspace-Constrained LoRA	Minrui Luo et.al.	2505.23724	null
2025-06-01	DA-VPT: Semantic-Guided Visual Prompt Tuning for Vision Transformers	Li Ren et.al.	2505.23694	link
2025-05-29	Weight Spectra Induced Efficient Model Adaptation	Chongjie Si et.al.	2505.23099	null
2025-05-29	MAP: Revisiting Weight Decomposition for Low-Rank Adaptation	Chongjie Si et.al.	2505.23094	null
2025-05-28	MoRE: A Mixture of Low-Rank Experts for Adaptive Multi-Task Learning	Dacao Zhang et.al.	2505.22694	link
2025-05-28	On Geometry-Enhanced Parameter-Efficient Fine-Tuning for 3D Scene Segmentation	Liyao Tang et.al.	2505.22444	null
2025-05-28	Look Within or Look Beyond? A Theoretical Comparison Between Parameter-Efficient and Full Fine-Tuning	Yongkang Liu et.al.	2505.22355	null
2025-05-28	LoKI: Low-damage Knowledge Implanting of Large Language Models	Runyu Wang et.al.	2505.22120	link
2025-06-03	InfoSAM: Fine-Tuning the Segment Anything Model from An Information-Theoretic Perspective	Yuanhong Zhang et.al.	2505.21920	null
2025-05-26	GraLoRA: Granular Low-Rank Adaptation for Parameter-Efficient Fine-Tuning	Yeonjoon Jung et.al.	2505.20355	null
2025-05-26	Parameter-Efficient Fine-Tuning with Column Space Projection	Junseo Hwang et.al.	2505.20211	null
2025-05-26	UORA: Uniform Orthogonal Reinitialization Adaptation in Parameter-Efficient Fine-Tuning of Large Models	Xueyan Zhang et.al.	2505.20154	null
2025-05-25	Optimization-Inspired Few-Shot Adaptation for Large Language Models	Boyan Gao et.al.	2505.19107	null
2025-05-27	Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs	Jaemin Kim et.al.	2505.19075	link
2025-05-24	HD-PiSSA: High-Rank Distributed Orthogonal Adaptation	Yiding Wang et.al.	2505.18777	null
2025-05-24	AuroRA: Breaking Low-Rank Bottleneck of LoRA with Nonlinear Mapping	Haonan Dong et.al.	2505.18738	null
2025-05-24	LLM-QFL: Distilling Large Language Model for Quantum Federated Learning	Dev Gurung et.al.	2505.18656	link
2025-05-24	Knowledge Grafting of Large Language Models	Guodong Du et.al.	2505.18502	link
2025-05-22	Representation Discrepancy Bridging Method for Remote Sensing Image-Text Retrieval	Hailong Ning et.al.	2505.16756	null
2025-05-28	Larger Is Not Always Better: Exploring Small Open-source Language Models in Logging Statement Generation	Renyi Zhong et.al.	2505.16590	null
2025-05-21	VP Lab: a PEFT-Enabled Visual Prompting Laboratory for Semantic Segmentation	Niccolo Avogaro et.al.	2505.15592	null
2025-05-21	CoLA: Collaborative Low-Rank Adaptation	Yiyun Zhou et.al.	2505.15471	link
2025-05-21	Gated Integration of Low-Rank Adaptation for Continual Learning of Language Models	Yan-Shuo Liang et.al.	2505.15424	link
2025-05-21	Parameter-Efficient Fine-Tuning of Multispectral Foundation Models for Hyperspectral Image Classification	Bernardin Ligan et.al.	2505.15334	null
2025-05-21	Few-Shot Adversarial Low-Rank Fine-Tuning of Vision-Language Models	Sajjad Ghiasvand et.al.	2505.15130	null
2025-05-21	Dual Decomposition of Weights and Singular Value Low Rank Adaptation	Jialong Han et.al.	2505.14367	null
2025-05-21	OSoRA: Output-Dimension and Singular-Value Initialized Low-Rank Adaptation	Jialong Han et.al.	2505.14350	null
2025-05-23	ABBA: Highly Expressive Hadamard Product Adaptation for Large Language Models	Raghav Singhal et.al.	2505.14238	link
2025-05-18	Adaptive parameter-efficient fine-tuning via Hessian-informed subset selection	Shiyun Xu et.al.	2505.12579	null
2025-05-18	Exploring Sparsity for Parameter Efficient Fine Tuning Using Wavelets	Ahmet Bilican et.al.	2505.12532	link
2025-05-18	SRLoRA: Subspace Recomposition in Low-Rank Adaptation via Importance-Based Fusion and Reinitialization	Haodong Yang et.al.	2505.12433	null
2025-05-16	Memory-Efficient Orthogonal Fine-Tuning with Principal Subspace Adaptation	Fei Wu et.al.	2505.11235	null
2025-05-15	Multi-Token Prediction Needs Registers	Anastasios Gerontopoulos et.al.	2505.10518	link
2025-05-14	PT-MoE: An Efficient Finetuning Framework for Integrating Mixture-of-Experts into Prompt Tuning	Zongqian Li et.al.	2505.09519	link
2025-05-13	Parameter-Efficient Fine-Tuning of Vision Foundation Model for Forest Floor Segmentation from UAV Imagery	Mohammad Wasil et.al.	2505.08932	link
2025-05-10	Efficient Telecom Specific LLM: TSLAM-Mini with QLoRA and Digital Twin Data	Vignesh Ethiraj et.al.	2505.07877	null
2025-05-11	DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models	Junhao Xia et.al.	2505.07057	null
2025-05-10	Enfoque Odychess: Un método dialéctico, constructivista y adaptativo para la enseñanza del ajedrez con inteligencias artificiales generativas	Ernesto Giralt Hernandez et.al.	2505.06652	null
2025-05-07	GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model	Zixiang Ai et.al.	2505.04119	link
2025-05-05	HSplitLoRA: A Heterogeneous Split Parameter-Efficient Fine-Tuning Framework for Large Language Models	Zheng Lin et.al.	2505.02795	null
2025-05-05	Parameter-Efficient Fine-Tuning with Attributed Patch Semantic Graph for Automated Patch Correctness Assessment	Zhenyu Yang et.al.	2505.02629	link
2025-05-01	AdCare-VLM: Leveraging Large Vision Language Model (LVLM) to Monitor Long-Term Medication Adherence and Care	Md Asaduzzaman Jabin et.al.	2505.00275	link
2025-04-30	Enhancing Health Mention Classification Performance: A Study on Advancements in Parameter Efficient Tuning	Reem Abdel-Salam et.al.	2504.21685	null
2025-05-09	A Systematic Literature Review of Parameter-Efficient Fine-Tuning for Large Code Models	Md Zahidul Haque et.al.	2504.21569	link
2025-04-29	TT-LoRA MoE: Unifying Parameter-Efficient Fine-Tuning and Sparse Mixture-of-Experts	Pradip Kunwar et.al.	2504.21190	null
2025-04-29	A Survey on Parameter-Efficient Fine-Tuning for Foundation Models in Federated Learning	Jieming Bian et.al.	2504.21099	null
2025-04-29	ReCIT: Reconstructing Full Private Data from Gradient in Parameter-Efficient Fine-Tuning of Large Language Models	Jin Xie et.al.	2504.20570	null
2025-04-23	Parameter-Efficient Checkpoint Merging via Metrics-Weighted Averaging	Shi Jie Yu et.al.	2504.18580	null
2025-04-24	Fine-tune Smarter, Not Harder: Parameter-Efficient Fine-Tuning for Geospatial Foundation Models	Francesc Marti-Escofet et.al.	2504.17397	null
2025-04-22	PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud Learning	Song Wang et.al.	2504.16023	link
2025-04-21	What Lurks Within? Concept Auditing for Shared Diffusion Models at Scale	Xiaoyong Yuan et.al.	2504.14815	null
2025-04-20	Harnessing Generative LLMs for Enhanced Financial Event Entity Extraction Performance	Soo-joon Choi et.al.	2504.14633	null
2025-04-20	Vision-Centric Representation-Efficient Fine-Tuning for Robust Universal Foreground Segmentation	Guoyi Zhang et.al.	2504.14481	null
2025-04-19	PEFT A2Z: Parameter-Efficient Fine-Tuning Survey for Large Language and Vision Models	Nusrat Jahan Prottasha et.al.	2504.14117	null
2025-04-18	Parameter-Efficient Continual Fine-Tuning: A Survey	Eric Nuertey Coleman et.al.	2504.13822	null
2025-04-17	All-in-One Transferring Image Compression from Human Perception to Multi-Machine Perception	Jiancheng Zhao et.al.	2504.12997	null
2025-04-15	A Decade of Wheat Mapping for Lebanon	Hasan Wehbi et.al.	2504.11366	null
2025-04-14	CROSSAN: Towards Efficient and Effective Adaptation of Multiple Multimodal Foundation Models for Sequential Recommendation	Junchen Fu et.al.	2504.10307	link
2025-04-10	LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation	Juzheng Zhang et.al.	2504.07448	link
2025-04-14	DUKAE: DUal-level Knowledge Accumulation and Ensemble for Pre-Trained Model-Based Continual Learning	Songze Li et.al.	2504.06521	null
2025-04-16	Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation	Xiaoxing Hu et.al.	2504.06220	link
2025-04-11	AROMA: Autonomous Rank-one Matrix Adaptation	Hao Nan Sheng et.al.	2504.05343	link
2025-04-05	FISH-Tuning: Enhancing PEFT Methods with Fisher Information	Kang Xue et.al.	2504.04050	null
2025-04-02	CLIP-SLA: Parameter-Efficient CLIP Adaptation for Continuous Sign Language Recognition	Sarah Alyami et.al.	2504.01666	link
2025-04-01	Generalized Tensor-based Parameter-Efficient Fine-Tuning via Lie Group Transformations	Chongjie Si et.al.	2504.00851	null
2025-04-01	DynMoLE: Boosting Mixture of LoRA Experts Fine-Tuning with a Hybrid Routing Mechanism	Dengchun Li et.al.	2504.00661	link
2025-03-31	ElaLoRA: Elastic & Learnable Low-Rank Adaptation for Efficient Model Fine-Tuning	Huandong Chang et.al.	2504.00254	null
2025-03-31	Order Matters: On Parameter-Efficient Image-to-Video Probing for Recognizing Nearly Symmetric Actions	Thinesh Thiyakesan Ponbagavathi et.al.	2503.24298	null
2025-03-29	Efficient Adaptation For Remote Sensing Visual Grounding	Hasan Moughnieh et.al.	2503.23083	null
2025-03-27	MSPLoRA: A Multi-Scale Pyramid Low-Rank Adaptation for Efficient Model Fine-Tuning	Jiancheng Zhao et.al.	2503.21838	link
2025-03-26	Enhancing Multi-modal Models with Heterogeneous MoE Adapters for Fine-tuning	Sashuai Zhou et.al.	2503.20633	null
2025-03-26	IAP: Improving Continual Learning of Vision-Language Models via Instance-Aware Prompting	Hao Fu et.al.	2503.20612	link
2025-03-26	Unlocking the Hidden Potential of CLIP in Generalizable Deepfake Detection	Andrii Yermakov et.al.	2503.19683	link
2025-03-25	VectorFit : Adaptive Singular & Bias Vector Fine-Tuning of Pre-trained Foundation Models	Suhas G Hegde et.al.	2503.19530	null
2025-03-24	MoST: Efficient Monarch Sparse Tuning for 3D Representation Learning	Xu Han et.al.	2503.18368	link
2025-03-24	Coeff-Tuning: A Graph Filter Subspace View for Tuning Attention-Based Large Models	Zichen Miao et.al.	2503.18337	link
2025-03-23	Decoupling Angles and Strength in Low-rank Adaptation	Massimo Bini et.al.	2503.18225	link
2025-03-22	Visual Variational Autoencoder Prompt Tuning	Xi Xiao et.al.	2503.17650	null
2025-03-21	PE-CLIP: A Parameter-Efficient Fine-Tuning of Vision Language Models for Dynamic Facial Expression Recognition	Ibtissam Saadi et.al.	2503.16945	null
2025-03-20	VP-NTK: Exploring the Benefits of Visual Prompting in Differentially Private Data Synthesis	Chia-Yi Hsu et.al.	2503.16195	null
2025-03-20	SALT: Singular Value Adaptation with Low-Rank Transformation	Abdelrahman Elsayed et.al.	2503.16055	link
2025-03-19	FedSCA: Federated Tuning with Similarity-guided Collaborative Aggregation for Heterogeneous Medical Image Segmentation	Yumin Zhang et.al.	2503.15390	null
2025-03-18	MAST-Pro: Dynamic Mixture-of-Experts for Adaptive Segmentation of Pan-Tumors with Knowledge-Driven Prompts	Runqi Meng et.al.	2503.14355	null
2025-03-15	A Survey on Federated Fine-tuning of Large Language Models	Yebo Wu et.al.	2503.12016	link
2025-03-14	Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages	Matteo Farina et.al.	2503.11609	link
2025-03-14	MoLEx: Mixture of Layer Experts for Finetuning with Sparse Upcycling	Rachel S. Y. Teo et.al.	2503.11144	link
2025-03-13	Efficient Federated Fine-Tuning of Large Language Models with Layer Dropout	Shilong Wang et.al.	2503.10217	null
2025-03-13	Singular Value Fine-tuning for Few-Shot Class-Incremental Learning	Zhiwu Wang et.al.	2503.10214	null
2025-03-12	Revisiting semi-supervised learning in the era of foundation models	Ping Zhang et.al.	2503.09707	link
2025-03-11	1LoRA: Summation Compression for Very Low-Rank Adaptation	Alessio Quercia et.al.	2503.08333	null
2025-03-11	Adapting Large Language Models for Parameter-Efficient Log Anomaly Detection	Ying Fu Lim et.al.	2503.08045	null
2025-03-09	MoFE: Mixture of Frozen Experts Architecture	Jean Seo et.al.	2503.06491	null
2025-03-08	Lifelong Learning with Task-Specific Adaptation: Addressing the Stability-Plasticity Dilemma	Ruiyu Wang et.al.	2503.06213	null
2025-03-07	Quantum-PEFT: Ultra parameter-efficient fine-tuning	Toshiaki Koike-Akino et.al.	2503.05431	null
2025-03-07	Personalized Text Generation with Contrastive Activation Steering	Jinghao Zhang et.al.	2503.05213	null
2025-03-06	TableLoRA: Low-rank Adaptation on Table Structure Understanding for Large Language Models	Xinyi He et.al.	2503.04396	null
2025-03-05	State-offset Tuning: State-based Parameter-Efficient Fine-Tuning for State Space Models	Wonjun Kang et.al.	2503.03499	link
2025-03-11	PaCA: Partial Connection Adaptation for Efficient Fine-Tuning	Sunghyeon Woo et.al.	2503.01905	null
2025-03-03	Parameter-Efficient Fine-Tuning of Large Language Models via Deconvolution in Subspace	Jia-Chen Zhang et.al.	2503.01419	null
2025-03-03	PROPER: A Progressive Learning Framework for Personalized Large Language Models with Group-Level Adaptation	Linhai Zhang et.al.	2503.01303	null
2025-03-03	Beyond QA Pairs: Assessing Parameter-Efficient Fine-Tuning for Fact Embedding in LLMs	Shivam Ratnakar et.al.	2503.01131	null
2025-03-09	Re-Imagining Multimodal Instruction Tuning: A Representation View	Yiyang Liu et.al.	2503.00723	link
2025-02-27	MobiLLM: Enabling LLM Fine-Tuning on the Mobile Device via Server Assisted Side Tuning	Liang Li et.al.	2502.20421	null
2025-02-26	LORENZA: Enhancing Generalization in Low-Rank Gradient LLM Training via Efficient Zeroth-Order Adaptive SAM	Yehonathan Refael et.al.	2502.19571	null
2025-02-22	ELBA-Bench: An Efficient Learning Backdoor Attacks Benchmark for Large Language Models	Xuxu Liu et.al.	2502.18511	null
2025-03-04	SECURA: Sigmoid-Enhanced CUR Decomposition with Uninterrupted Retention and Low-Rank Adaptation in Large Language Models	Yuxuan Zhang et.al.	2502.18168	null
2025-02-21	Sparsity May Be All You Need: Sparse Random Parameter Adaptation	Jesus Rios et.al.	2502.15975	link
2025-02-19	Black Sheep in the Herd: Playing with Spuriously Correlated Attributes for Vision-Language Recognition	Xinyu Tian et.al.	2502.15809	null
2025-02-21	R-LoRA: Random Initialization of Multi-Head LoRA for Multi-Task Learning	Jinda Liu et.al.	2502.15455	link
2025-02-20	Generative Modeling of Individual Behavior at Scale	Nabil Omi et.al.	2502.14998	null
2025-02-20	LoRA-GGPO: Mitigating Double Descent in LoRA Fine-Tuning via Gradient-Guided Perturbation Optimization	Yupeng Chang et.al.	2502.14538	link
2025-02-20	NLoRA: Nyström-Initiated Low-Rank Adaptation for Large Language Models	Chenlu Guo et.al.	2502.14482	link
2025-02-21	Token Adaptation via Side Graph Convolution for Efficient Fine-tuning of 3D Point Cloud Transformers	Takahiko Furuya et.al.	2502.14142	link
2025-02-19	LSR-Adapt: Ultra-Efficient Parameter Tuning with Matrix Low Separation Rank Kernel Adaptation	Xin Li et.al.	2502.13568	null
2025-02-24	GSQ-Tuning: Group-Shared Exponents Integer in Fully Quantized Training for LLMs On-Device Fine-tuning	Sifan Zhou et.al.	2502.12913	null
2025-02-17	Mitigating Visual Knowledge Forgetting in MLLM Instruction-tuning via Modality-decoupled Gradient Descent	Junda Wu et.al.	2502.11740	null
2025-02-13	DiffoRA: Enabling Parameter-Efficient LLM Fine-Tuning via Differential Low-Rank Matrix Adaptation	Tangyu Jiang et.al.	2502.08905	null
2025-02-12	LowRA: Accurate and Efficient LoRA Fine-Tuning of LLMs under 2 Bits	Zikai Zhou et.al.	2502.08141	null
2025-02-12	Music for All: Exploring Multicultural Representations in Music Generation Models	Atharva Mehta et.al.	2502.07328	link
2025-02-10	Model Diffusion for Certifiable Few-shot Transfer Learning	Fady Rezk et.al.	2502.06970	null
2025-02-10	Hyper Compressed Fine-Tuning of Large Foundation Models with Quantum Inspired Adapters	Snehal Raj et.al.	2502.06916	null
2025-02-10	KARST: Multi-Kernel Kronecker Adaptation with Re-Scaling Transmission for Visual Classification	Yue Zhu et.al.	2502.06779	null
2025-02-10	FunduSAM: A Specialized Deep Learning Model for Enhanced Optic Disc and Cup Segmentation in Fundus Images	Jinchen Yu et.al.	2502.06220	null
2025-02-08	SSH: Sparse Spectrum Adaptation via Discrete Hartley Transformation	Yixian Shen et.al.	2502.05539	null
2025-02-07	SSMLoRA: Enhancing Low-Rank Adaptation with State Space Model	Jiayang Yu et.al.	2502.04958	link
2025-02-05	FedP $^2$ EFT: Federated Learning to Personalize Parameter Efficient Fine-Tuning for Multilingual LLMs	Royson Lee et.al.	2502.04387	null
2025-02-06	Rank Also Matters: Hierarchical Configuration for Mixture of Adapter Experts in LLM Fine-Tuning	Peizhuang Cong et.al.	2502.03884	null
2025-02-05	Bilevel ZOFO: Bridging Parameter-Efficient and Zeroth-Order Techniques for Efficient LLM Fine-Tuning and Meta-Training	Reza Shirkavand et.al.	2502.03604	null
2025-02-05	RepLoRA: Reparameterizing Low-Rank Adaptation via the Perspective of Mixture of Experts	Tuan Truong et.al.	2502.03044	null
2025-02-13	Robust Federated Finetuning of LLMs via Alternating Optimization of LoRA	Shuangyi Chen et.al.	2502.01755	null
2025-02-03	Joint Localization and Activation Editing for Low-Resource Fine-Tuning	Wen Lai et.al.	2502.01179	link
2025-02-03	PARA: Parameter-Efficient Fine-tuning with Prompt Aware Representation Adjustment	Zequan Liu et.al.	2502.01033	null
2025-02-01	Parameter Efficient Fine-Tuning of Segment Anything Model	Carolin Teuber et.al.	2502.00418	link
2025-02-01	Sparse Gradient Compression for Fine-Tuning Large Language Models	David H. Yang et.al.	2502.00311	null
2025-01-30	Enhancing Large Language Model Efficiencyvia Symbolic Compression: A Formal Approach Towards Interpretability	Lumen AI et.al.	2501.18657	null
2025-01-23	Low-Rank Adapters Meet Neural Architecture Search for LLM Compression	J. Pablo Muñoz et.al.	2501.16372	link
2025-01-26	Fine Tuning without Catastrophic Forgetting via Selective Low Rank Adaptation	Reza Akbarian Bafghi et.al.	2501.15377	null
2025-02-09	Decentralized Low-Rank Fine-Tuning of Large Language Models	Sajjad Ghiasvand et.al.	2501.15361	null
2025-01-25	Complementary Subspace Low-Rank Adaptation of Vision-Language Models for Few-Shot Classification	Zhongqi Wang et.al.	2501.15040	null
2025-01-24	Domain Expansion: Parameter-Efficient Modules as Building Blocks for Composite Domains	Mann Patel et.al.	2501.14321	link
2025-01-23	Parameter-Efficient Fine-Tuning for Foundation Models	Dan Zhang et.al.	2501.13787	link
2025-01-21	EDoRA: Efficient Weight-Decomposed Low-Rank Adaptation via Singular Value Decomposition	Hamid Nasiri et.al.	2501.12067	link
2025-01-21	Is your LLM trapped in a Mental Set? Investigative study on how mental sets affect the reasoning capabilities of LLMs	Saiful Haq et.al.	2501.11833	null
2025-01-17	OMoE: Diversifying Mixture of Low-Rank Adaptation by Orthogonal Finetuning	Jinyuan Feng et.al.	2501.10062	null
2025-01-15	Transformed Low-rank Adaptation via Tensor Decomposition and Its Applications to Text-to-image Models	Zerui Tao et.al.	2501.08727	null
2025-01-14	TriAdaptLoRA: Brain-Inspired Triangular Adaptive Low-Rank Adaptation for Parameter-Efficient Fine-Tuning	Yao Liang et.al.	2501.08008	null
2025-01-14	Optimizing Language Models for Grammatical Acceptability: A Comparative Study of Fine-Tuning Techniques	Shobhit Ratan et.al.	2501.07853	null
2025-01-12	A Hessian-informed hyperparameter optimization for differential learning rate	Shiyun Xu et.al.	2501.06954	null
2025-01-10	Aggregating Low Rank Adapters in Federated Fine-tuning	Evelyn Trautmann et.al.	2501.06332	null
2025-01-10	How to Tune a Multilingual Encoder Model for Germanic Languages: A Study of PEFT, Full Fine-Tuning, and Language Adapters	Romina Oji et.al.	2501.06025	link
2025-01-08	TADFormer : Task-Adaptive Dynamic Transformer for Efficient Multi-Task Learning	Seungmin Baek et.al.	2501.04293	null
2025-01-20	Spectral-Aware Low-Rank Adaptation for Speaker Verification	Zhe Li et.al.	2501.03829	link
2025-01-06	ADePT: Adaptive Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning	Pengwei Tang et.al.	2501.03291	link
2025-01-05	HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning	Saleh Ashkboos et.al.	2501.02625	link
2025-01-05	Efficient Deployment of Large Language Models on Resource-constrained Devices	Zhiwei Yao et.al.	2501.02438	null
2025-01-09	tCURLoRA: Tensor CUR Decomposition Based Low-Rank Parameter Adaptation and Its Application in Medical Image Segmentation	Guanghua He et.al.	2501.02227	null
2025-01-03	SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation	Mingjie Li et.al.	2501.01765	null
2025-01-07	Practical Secure Inference Algorithm for Fine-tuned Large Language Model Based on Fully Homomorphic Encryption	Zhang Ruoyan et.al.	2501.01672	null
2024-12-30	Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment	Jianfei Zhang et.al.	2412.20834	link
2024-12-28	VELoRA: A Low-Rank Adaptation Approach for Efficient RGB-Event based Recognition	Lan Chen et.al.	2412.20064	link
2025-01-05	Gradient Weight-normalized Low-rank Projection for Efficient LLM Training	Jia-Hong Huang et.al.	2412.19616	link
2024-12-27	Parameter Efficient Fine-Tuning for Deep Learning-Based Full-Waveform Inversion	Koustav Ghosal et.al.	2412.19510	null
2024-12-24	Multi-Point Positional Insertion Tuning for Small Object Detection	Kanoko Goto et.al.	2412.18090	null
2024-12-23	Interweaving Memories of a Siamese Large Language Model	Xin Song et.al.	2412.17383	link
2024-12-26	LLMsAgainstHate @ NLU of Devanagari Script Languages 2025: Hate Speech Detection and Target Identification in Devanagari Languages via Parameter Efficient Fine-Tuning of LLMs	Rushendra Sidibomma et.al.	2412.17131	link
2024-12-21	Label Privacy in Split Learning for Large Models with Parameter-Efficient Training	Philip Zmushko et.al.	2412.16669	link
2024-12-19	FedPIA – Permuting and Integrating Adapters leveraging Wasserstein Barycenters for Finetuning Foundation Models in Multi-Modal Federated Learning	Pramit Saha et.al.	2412.14424	null
2024-12-18	Parameter-efficient Fine-tuning for improved Convolutional Baseline for Brain Tumor Segmentation in Sub-Saharan Africa Adult Glioma Dataset	Bijay Adhikari et.al.	2412.14100	link
2024-12-18	A Comprehensive Evaluation of Parameter-Efficient Fine-Tuning on Method-Level Code Smell Detection	Beiqi Zhang et.al.	2412.13801	link
2024-12-18	Refining Salience-Aware Sparse Fine-Tuning Strategies for Language Models	Xinxin Liu et.al.	2412.13488	null
2024-12-17	Train More Parameters But Mind Their Placement: Insights into Language Adaptation with PEFT	Jenny Kunz et.al.	2412.12674	link
2024-12-16	Visual Instruction Tuning with 500x Fewer Parameters through Modality Linear Representation-Steering	Jinhe Bi et.al.	2412.12359	link
2024-12-16	A LoRA is Worth a Thousand Pictures	Chenxi Liu et.al.	2412.12048	null
2024-12-11	Adaptive Principal Components Allocation with the $\ell_{2,g}$ -regularized Gaussian Graphical Model for Efficient Fine-Tuning Large Models	Jingjing Zheng et.al.	2412.08592	link
2024-12-10	PETALface: Parameter Efficient Transfer Learning for Low-resolution Face Recognition	Kartik Narayan et.al.	2412.07771	null
2024-12-10	MoDULA: Mixture of Domain-Specific and Universal LoRA for Multi-Task Learning	Yufei Ma et.al.	2412.07405	null
2024-12-13	Crack-EdgeSAM Self-Prompting Crack Segmentation System for Edge Devices	Yingchu Wang et.al.	2412.07205	null
2024-12-08	Taming Sensitive Weights : Noise Perturbation Fine-tuning for Robust LLM Quantization	Dongwei Wang et.al.	2412.06858	null
2024-12-09	BoRA: Bi-dimensional Weight-Decomposed Low-Rank Adaptation	Qiushi Wang et.al.	2412.06441	null
2024-12-19	S $^{2}$ FT: Efficient, Scalable and Generalizable LLM Fine-tuning by Structured Sparsity	Xinyu Yang et.al.	2412.06289	null
2024-12-08	KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models	Fan Wang et.al.	2412.06071	link
2024-12-07	Training-Free Bayesianization for Low-Rank Adapters of Large Language Models	Haizhou Shi et.al.	2412.05723	link
2024-12-06	PETapter: Leveraging PET-style classification heads for modular few-shot parameter-efficient fine-tuning	Jonas Rieger et.al.	2412.04975	null
2024-12-04	Prompting Large Language Models for Clinical Temporal Relation Extraction	Jianping He et.al.	2412.04512	null
2024-12-05	SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation Learning	Seokju Yun et.al.	2412.04077	link
2024-12-04	Improving Linguistic Diversity of Large Language Models with Possibility Exploration Fine-Tuning	Long Mai et.al.	2412.03343	link
2024-12-03	Mixture of Physical Priors Adapter for Parameter-Efficient Fine-Tuning	Zhaozhi Wang et.al.	2412.02759	null
2024-12-03	CPP-UT-Bench: Can LLMs Write Complex Unit Tests in C++?	Vaishnavi Bhargava et.al.	2412.02735	null
2024-12-03	LoRA Diffusion: Zero-Shot LoRA Synthesis for Diffusion Model Personalization	Ethan Smith et.al.	2412.02352	null
2024-12-03	A Comprehensive Evaluation of Large Language Models on Aspect-Based Sentiment Analysis	Changzhi Zhou et.al.	2412.02279	null
2024-11-30	Unified Parameter-Efficient Unlearning for LLMs	Chenlu Ding et.al.	2412.00383	link
2024-11-29	SURE-VQA: Systematic Understanding of Robustness Evaluation in Medical VQA Tasks	Kim-Celine Kahl et.al.	2411.19688	link
2024-11-28	Parameter-Efficient Transfer Learning for Music Foundation Models	Yiwei Ding et.al.	2411.19371	link
2024-11-28	PEFT-as-an-Attack! Jailbreaking Language Models during Federated Parameter-Efficient Fine-Tuning	Shenghui Li et.al.	2411.19335	null
2024-11-28	Enhancing Parameter-Efficient Fine-Tuning of Vision Transformers through Frequency-Based Adaptation	Son Thai Ly et.al.	2411.19297	link
2024-11-27	Challenges in Adapting Multilingual LLMs to Low-Resource Languages using LoRA PEFT Tuning	Omkar Khade et.al.	2411.18571	null
2024-11-26	PEFTGuard: Detecting Backdoor Attacks Against Parameter-Efficient Fine-Tuning	Zhen Sun et.al.	2411.17453	null
2024-11-29	Promptable Anomaly Segmentation with SAM Through Self-Perception Tuning	Hui-Yue Yang et.al.	2411.17217	null
2024-11-25	Towards Efficient Model-Heterogeneity Federated Learning for Large Models	Ruofan Jia et.al.	2411.16796	null
2024-11-25	Parameter Efficient Instruction Tuning: An Empirical Study	Pengfei He et.al.	2411.16775	link
2024-11-25	Graph Adapter of EEG Foundation Models for Parameter Efficient Fine Tuning	Toyotaro Suzumura et.al.	2411.16155	null
2024-11-24	Efficient and Private: Memorisation under differentially private parameter-efficient fine-tuning in language models	Olivia Ma et.al.	2411.15831	null
2024-11-21	Parameter Efficient Mamba Tuning via Projector-targeted Diagonal-centric Linear Transformation	Seokil Ham et.al.	2411.15224	null
2024-11-22	LoRA-FAIR: Federated LoRA Fine-Tuning with Aggregation and Initialization Refinement	Jieming Bian et.al.	2411.14961	null
2024-11-21	Multi LoRA Meets Vision: Merging multiple adapters to create a multi task model	Ege Kesim et.al.	2411.14064	null
2024-11-17	F $^3$ OCUS – Federated Finetuning of Vision-Language Foundation Models with Optimal Client Layer Updating Strategy via Multi-objective Meta-Heuristics	Pramit Saha et.al.	2411.11912	null
2024-11-16	HELENE: Hessian Layer-wise Clipping and Gradient Annealing for Accelerating Fine-tuning LLM with Zeroth-order Optimization	Huaqin Zhao et.al.	2411.10696	null
2024-11-12	PERFT: Parameter-Efficient Routed Fine-Tuning for Mixture-of-Expert Model	Yilun Liu et.al.	2411.08212	null
2024-11-10	Prompt-Efficient Fine-Tuning for GPT-like Deep Models to Reduce Hallucination and to Improve Reproducibility in Scientific Text Generation Using Stochastic Optimisation Techniques	Daniil Sulimov et.al.	2411.06445	null
2024-11-06	MambaPEFT: Exploring Parameter-Efficient Fine-Tuning for Mamba	Masakazu Yoshimura et.al.	2411.03855	link
2024-11-04	PipeLLM: Fast and Confidential Large Language Model Services with Speculative Pipelined Encryption	Yifan Tan et.al.	2411.03357	null
2024-11-05	Efficient and Effective Adaptation of Multimodal Foundation Models in Sequential Recommendation	Junchen Fu et.al.	2411.02992	null
2024-11-04	Parameter-Efficient Fine-Tuning of Large Language Models for Unit Test Generation: An Empirical Study	André Storhaug et.al.	2411.02462	null
2024-11-04	Expanding Sparse Tuning for Low Memory Usage	Shufan Shen et.al.	2411.01800	link
2024-11-15	Visual Fourier Prompt Tuning	Runjia Zeng et.al.	2411.01327	link
2024-10-31	CleaR: Towards Robust and Generalized Parameter-Efficient Fine-Tuning for Noisy Label Learning	Yeachan Kim et.al.	2411.00873	null
2024-10-30	FPE-LLM: Highly Intelligent Time-Series Forecasting and Language Interaction LLM in Energy Systems	Zihang Qiu et.al.	2411.00852	null
2024-11-01	Dual Low-Rank Adaptation for Continual Learning with Pre-Trained Models	Huancheng Chen et.al.	2411.00623	null
2024-11-01	Is Multiple Object Tracking a Matter of Specialization?	Gianluca Mancusi et.al.	2411.00553	null
2024-11-01	C2A: Client-Customized Adaptation for Parameter-Efficient Federated Learning	Yeachan Kim et.al.	2411.00311	link
2024-10-29	Preserving Pre-trained Representation Space: On Effectiveness of Prefix-tuning for Large Multi-modal Models	Donghoon Kim et.al.	2411.00029	null
2024-10-30	Efficient Adaptation of Pre-trained Vision Transformer via Householder Transformation	Wei Dong et.al.	2410.22952	null
2024-10-30	MALoRA: Mixture of Asymmetric Low-Rank Adaptation for Enhanced Multi-Task Learning	Xujia Wang et.al.	2410.22782	null
2024-10-29	Meta-Learning Adaptable Foundation Models	Jacob L. Block et.al.	2410.22264	null
2024-10-29	Capacity Control is an Effective Memorization Mitigation Mechanism in Text-Conditional Diffusion Models	Raman Dutt et.al.	2410.22149	link
2024-10-30	IntLoRA: Integral Low-rank Adaptation of Quantized Diffusion Models	Hang Guo et.al.	2410.21759	link
2024-10-28	KD-LoRA: A Hybrid Approach to Efficient Fine-Tuning with LoRA and Knowledge Distillation	Rambod Azimi et.al.	2410.20777	link
2024-10-27	Get Large Language Models Ready to Speak: A Late-fusion Approach for Speech Generation	Maohao Shen et.al.	2410.20336	null
2024-11-01	Parameter-Efficient Fine-Tuning in Large Models: A Survey of Methodologies	Luping Wang et.al.	2410.19878	null
2024-10-23	MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning	Jingfan Zhang et.al.	2410.18035	null
2024-10-22	Towards Real Zero-Shot Camouflaged Object Segmentation without Camouflaged Annotations	Cheng Lei et.al.	2410.16953	null
2024-10-22	MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report	Samrajya Thapa et.al.	2410.16239	link
2024-10-21	Natural GaLore: Accelerating GaLore for memory-efficient LLM Training and Fine-tuning	Arijit Das et.al.	2410.16029	link
2024-10-18	Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation	Shuai Zhao et.al.	2410.14425	link
2024-10-17	LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-Tuning	Yiming Shi et.al.	2410.13618	link
2024-10-16	Communication-Efficient and Tensorized Federated Fine-Tuning of Large Language Models	Sajjad Ghiasvand et.al.	2410.13097	null
2024-10-17	Prompt Compression for Large Language Models: A Survey	Zongqian Li et.al.	2410.12388	link
2024-10-15	Layer-wise Importance Matters: Less Memory for Better Performance in Parameter-efficient Fine-tuning of Large Language Models	Kai Yao et.al.	2410.11772	link
2024-10-15	LoKO: Low-Rank Kalman Optimizer for Online Fine-Tuning of Large Models	Hossein Abdi et.al.	2410.11551	null
2024-10-15	RoCoFT: Efficient Finetuning of Large Language Models with Row-Column Updates	Md Kowsher et.al.	2410.10075	link
2024-10-13	BiDoRA: Bi-level Optimization-Based Weight-Decomposed Low-Rank Adaptation	Peijia Qin et.al.	2410.09758	null
2024-10-12	Towards Efficient Visual-Language Alignment of the Q-Former for Visual Reasoning Tasks	Sungkyung Kim et.al.	2410.09489	link
2024-10-15	MTL-LoRA: Low-Rank Adaptation for Multi-Task Learning	Yaming Yang et.al.	2410.09437	link
2024-10-09	Parameter-Efficient Fine-Tuning via Selective Discrete Cosine Transform	Yixian Shen et.al.	2410.09103	null
2024-10-04	BIPEFT: Budget-Guided Iterative Search for Parameter Efficient Fine-Tuning of Large Pretrained Language Models	Aofei Chang et.al.	2410.09079	null
2024-10-11	Parameter-Efficient Fine-Tuning of State Space Models	Kevin Galim et.al.	2410.09016	link
2024-10-10	Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud Learning	Dingkang Liang et.al.	2410.08114	link
2024-10-10	SLIM: Let LLM Learn More and Forget Less with Soft LoRA and Identity Mixture	Jiayi Han et.al.	2410.07739	null
2024-10-10	Enhancing Zeroth-order Fine-tuning for Language Models with Low-rank Structures	Yiming Chen et.al.	2410.07698	link
2024-10-09	SparseGrad: A Selective Method for Efficient Fine-tuning of MLP Layers	Viktoriia Chekalina et.al.	2410.07383	link
2024-10-09	Functional-level Uncertainty Quantification for Calibrated Fine-tuning on LLMs	Ruijia Niu et.al.	2410.06431	null
2024-10-08	Are Large Language Models State-of-the-art Quality Estimators for Machine Translation of User-generated Content?	Shenbin Qian et.al.	2410.06338	link
2024-10-15	LoRTA: Low Rank Tensor Adaptation of Large Language Models	Ignacio Hounie et.al.	2410.04060	null
2024-10-03	Llama SLayer 8B: Shallow Layers Hold the Key to Knowledge Injection	Tianxiang Chen et.al.	2410.02330	link
2024-10-02	TPP-LLM: Modeling Temporal Point Processes by Efficiently Fine-Tuning Large Language Models	Zefang Liu et.al.	2410.02062	link
2024-10-02	NEAT: Nonlinear Parameter-efficient Adaptation of Pre-trained Models	Yibo Zhong et.al.	2410.01870	null
2024-09-27	A GEN AI Framework for Medical Note Generation	Hui Yi Leong et.al.	2410.01841	null
2024-10-02	DLP-LoRA: Efficient Task-Specific LoRA Fusion with a Dynamic, Lightweight Plugin for Large Language Models	Yuxuan Zhang et.al.	2410.01497	link
2024-10-01	PrivTuner with Homomorphic Encryption and LoRA: A P3EFT Scheme for Privacy-Preserving Parameter-Efficient Fine-Tuning of AI Foundation Models	Yang Li et.al.	2410.00433	null
2024-09-30	Adapting LLMs for the Medical Domain in Portuguese: A Study on Fine-Tuning and Model Evaluation	Pedro Henrique Paiola et.al.	2410.00163	null
2024-09-30	Resource Allocation for Stable LLM Training in Mobile Edge Computing	Chang Liu et.al.	2409.20247	null
2024-09-30	Reference Trustable Decoding: A Training-Free Augmentation Paradigm for Large Language Models	Luohe Shi et.al.	2409.20181	link
2024-09-28	FINE: Factorizing Knowledge for Initialization of Variable-sized Diffusion Models	Yucheng Xie et.al.	2409.19289	null
2024-10-01	Backdoor Attacks for LLMs with Weak-To-Strong Knowledge Distillation	Shuai Zhao et.al.	2409.17946	null
2024-09-26	PEDRO: Parameter-Efficient Fine-tuning with Prompt DEpenDent Representation MOdification	Tianfang Xie et.al.	2409.17834	null
2024-09-30	Efficient In-Domain Question Answering for Resource-Constrained Environments	Isaac Chung et.al.	2409.17648	null
2024-10-07	PACE: marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularization	Yao Ni et.al.	2409.17137	link
2024-09-25	Parameter-efficient Bayesian Neural Networks for Uncertainty-aware Depth Estimation	Richard D. Paul et.al.	2409.17085	null
2024-10-02	Bone: Block Affine Transformation as Parameter Efficient Fine-tuning Methods for Large Language Models	Jiale Kang et.al.	2409.15371	link
2024-09-22	Flat-LoRA: Low-Rank Adaption over a Flat Loss Landscape	Tao Li et.al.	2409.14396	null
2024-10-01	Obliviate: Neutralizing Task-agnostic Backdoors within the Parameter-efficient Fine-tuning Paradigm	Jaehan Kim et.al.	2409.14119	link
2024-09-20	HUT: A More Computation Efficient Fine-Tuning Method With Hadamard Updated Transformation	Geyuan Zhang et.al.	2409.13501	null
2024-09-17	THaMES: An End-to-End Tool for Hallucination Mitigation and Evaluation in Large Language Models	Mengfei Liang et.al.	2409.11353	link
2024-09-17	LPT++: Efficient Training on Mixture of Long-tailed Experts	Bowen Dong et.al.	2409.11323	null
2024-09-17	Beyond LoRA: Exploring Efficient Fine-Tuning Techniques for Time Series Foundational Models	Divij Gupta et.al.	2409.11302	null
2024-09-18	Propulsion: Steering LLM with Tiny Fine-Tuning	Md Kowsher et.al.	2409.10927	link
2024-09-16	From Text to Emoji: How PEFT-Driven Personality Manipulation Unleashes the Emoji Potential in LLMs	Navya Jain et.al.	2409.10245	null
2024-09-14	COMFORT: A Continual Fine-Tuning Framework for Foundation Models Targeted at Consumer Healthcare	Chia-Hao Li et.al.	2409.09549	null
2024-09-14	Comparing Retrieval-Augmentation and Parameter-Efficient Fine-Tuning for Privacy-Preserving Personalization of Large Language Models	Alireza Salemi et.al.	2409.09510	link
2024-09-13	Risks When Sharing LoRA Fine-Tuned Diffusion Model Weights	Dixi Yao et.al.	2409.08482	null
2024-09-12	Do Vision Foundation Models Enhance Domain Generalization in Medical Image Segmentation?	Kerem Cekmeceli et.al.	2409.07960	link
2024-09-11	Efficient Localized Adaptation of Neural Weather Forecasting: A Case Study in the MENA Region	Muhammad Akhtar Munir et.al.	2409.07585	link
2024-09-10	Sam2Rad: A Segmentation Model for Medical Images with Learnable Prompts	Assefa Seyoum Wahd et.al.	2409.06821	link
2024-09-11	Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models	Yao Shu et.al.	2409.06277	link
2024-09-09	SVFit: Parameter-Efficient Fine-Tuning of Large Pre-Trained Models Using Singular Values	Chengwei Sun et.al.	2409.05926	null
2024-09-10	Improving Multimodal Emotion Recognition by Leveraging Acoustic Adaptation and Visual Alignment	Zhixian Zhao et.al.	2409.05015	null
2024-09-06	Customizing Large Language Model Generation Style using Parameter-Efficient Finetuning	Xinyue Liu et.al.	2409.04574	null
2024-09-04	iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation	Hayeon Jo et.al.	2409.02838	null
2024-09-04	Deconfounded Causality-aware Parameter-Efficient Fine-Tuning for Problem-Solving Improvement of LLMs	Ruoyu Wang et.al.	2409.02686	null
2024-09-04	Robust Federated Finetuning of Foundation Models via Alternating Minimization of LoRA	Shuangyi Chen et.al.	2409.02346	null
2024-09-02	Unleashing the Power of Task-Specific Directions in Parameter Efficient Fine-tuning	Chongjie Si et.al.	2409.01035	link
2024-08-28	3-in-1: 2D Rotary Adaptation for Efficient Finetuning, Efficient Batching and Composability	Baohao Liao et.al.	2409.00119	link
2024-08-21	SORSA: Singular Values and Orthonormal Regularized Singular Vectors Adaptation of Large Language Models	Yang Cao et.al.	2409.00055	link
2024-08-30	MoRe Fine-Tuning with 10x Fewer Parameters	Wenxuan Tan et.al.	2408.17383	link
2024-09-02	Instant Adversarial Purification with Adversarial Consistency Distillation	Chun Tong Lei et.al.	2408.17064	null
2024-08-28	Scaling Up Summarization: Leveraging Large Language Models for Long Text Extractive Summarization	Léo Hemamou et.al.	2408.15801	null
2024-08-27	GIFT-SW: Gaussian noise Injected Fine-Tuning of Salient Weights for LLMs	Maxim Zhelnin et.al.	2408.15300	link
2024-08-27	Pre-training Everywhere: Parameter-Efficient Fine-Tuning for Medical Image Analysis via Target Parameter Pre-training	Xingliang Lei et.al.	2408.15011	null
2024-08-27	CVPT: Cross-Attention help Visual Prompt Tuning adapt visual task	Lingyun Huang et.al.	2408.14961	link
2024-08-27	Step-by-Step Unmasking for Parameter-Efficient Fine-tuning of Large Language Models	Aradhye Agarwal et.al.	2408.14470	link
2024-08-24	Advancing Enterprise Spatio-Temporal Forecasting Applications: Data Mining Meets Instruction Tuning of Language Models For Multi-modal Time Series Analysis in Low-Resource Settings	Sagar Srinivas Sakhinana et.al.	2408.13622	null
2024-08-21	Positional Prompt Tuning for Efficient 3D Representation Learning	Shaochen Zhang et.al.	2408.11567	link
2024-08-20	Pluto and Charon: A Time and Memory Efficient Collaborative Edge AI Framework for Personal LLMs Fine-Tuning	Bei Ouyang et.al.	2408.10746	null
2024-08-20	TDS-CLIP: Temporal Difference Side Network for Image-to-Video Transfer Learning	Bin Wang et.al.	2408.10688	link
2024-08-19	TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition	Tianwei Lin et.al.	2408.09856	link
2024-08-16	Learning to Route for Dynamic Adapter Composition in Continual Learning with Language Models	Vladimir Araujo et.al.	2408.09053	null
2024-08-14	KIND: Knowledge Integration and Diversion in Diffusion Models	Yucheng Xie et.al.	2408.07337	link
2024-08-30	TaSL: Task Skill Localization and Consolidation for Language Model Continual Learning	Yujie Feng et.al.	2408.05200	link
2024-08-08	Bias-Aware Low-Rank Adaptation: Mitigating Catastrophic Inheritance of Large Language Models	Yupeng Chang et.al.	2408.04556	link
2024-08-06	SARA: Singular-Value Based Adaptive Low-Rank Adaption	Jihao Gu et.al.	2408.03290	null
2024-08-06	Leveraging Parameter Efficient Training Methods for Low Resource Text Classification: A Case Study in Marathi	Pranita Deshmukh et.al.	2408.03172	null
2024-08-03	TS-SAM: Fine-Tuning Segment-Anything Model for Downstream Tasks	Yang Yu et.al.	2408.01835	link
2024-08-02	MoDE: Effective Multi-task Parameter Efficient Fine-Tuning with a Mixture of Dyadic Experts	Lin Ning et.al.	2408.01505	null
2024-08-02	Tensor Train Low-rank Approximation (TT-LoRA): Democratizing AI with Accelerated LLMs	Afia Anjum et.al.	2408.01008	null
2024-07-31	A Federated Learning-Friendly Approach for Parameter-Efficient Fine-Tuning of SAM in 3D Segmentation	Mothilal Asokan et.al.	2407.21739	null
2024-07-28	Forecast-PEFT: Parameter-Efficient Fine-Tuning for Pre-trained Motion Forecasting Models	Jifeng Wang et.al.	2407.19564	link
2024-07-24	Parameter-Efficient Fine-Tuning for Continual Learning: A Neural Tangent Kernel Perspective	Jingren Liu et.al.	2407.17120	null
2024-07-22	Zero-Shot Embeddings Inform Learning and Forgetting with Vision-Language Encoders	Laura Niss et.al.	2407.15731	null
2024-07-21	Learn to Preserve and Diversify: Parameter-Efficient Group with Orthogonal Regularization for Domain Generalization	Jiajun Hu et.al.	2407.15085	link
2024-07-16	InstructAV: Instruction Fine-tuning Large Language Models for Authorship Verification	Yujia Hu et.al.	2407.12882	link
2024-07-18	Turning Generative Models Degenerate: The Power of Data Poisoning Attacks	Shuli Jiang et.al.	2407.12281	null
2024-07-16	Probing the Efficacy of Federated Parameter-Efficient Fine-Tuning of Vision Transformers for Medical Image Classification	Naif Alkhunaizi et.al.	2407.11573	null
2024-07-16	An efficient framework based on large foundation model for cervical cytopathology whole slide image screening	Jialong Huang et.al.	2407.11486	link
2024-07-10	RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization	Xijie Huang et.al.	2407.08044	link
2024-07-10	ROSA: Random Subspace Adaptation for Efficient Fine-Tuning	Marawan Gamal Abdel Hameed et.al.	2407.07802	link
2024-07-10	Parameter Efficient Fine Tuning for Multi-scanner PET to PET Reconstruction	Yumin Kim et.al.	2407.07517	null
2024-07-09	Reprogramming Distillation for Medical Foundation Models	Yuhang Zhou et.al.	2407.06504	link
2024-07-07	See Further for Parameter Efficient Fine-tuning by Standing on the Shoulders of Decomposition	Chongjie Si et.al.	2407.05417	link
2024-07-16	LoRA-GA: Low-Rank Adaptation with Gradient Approximation	Shaowen Wang et.al.	2407.05000	link
2024-07-05	GPT vs RETRO: Exploring the Intersection of Retrieval and Parameter-Efficient Fine-Tuning	Aleksander Ficek et.al.	2407.04528	null
2024-07-04	Deep Content Understanding Toward Entity and Aspect Target Sentiment Analysis on Foundation Models	Vorakit Vorakitphan et.al.	2407.04050	link
2024-07-04	ASteISR: Adapting Single Image Super-resolution Pre-trained Model for Efficient Stereo Image Super-resolution	Yuanbo Zhou et.al.	2407.03598	link
2024-07-03	Knowledge Composition using Task Vectors with Learned Anisotropic Scaling	Frederic Z. Zhang et.al.	2407.02880	link
2024-07-03	Exploring the Capabilities of LLMs for Code Change Related Tasks	Lishui Fan et.al.	2407.02824	link
2024-07-02	FineCLIPER: Multi-modal Fine-grained CLIP for Dynamic Facial Expression Recognition with AdaptERs	Haodong Chen et.al.	2407.02157	null
2024-07-02	CatMemo at the FinLLM Challenge Task: Fine-Tuning Large Language Models using Data Fusion in Financial Applications	Yupeng Cao et.al.	2407.01953	null
2024-07-05	Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models	Zihan Wang et.al.	2407.01906	link
2024-07-01	A Fingerprint for Large Language Models	Zhiguang Yang et.al.	2407.01235	null
2024-07-02	Embedded Prompt Tuning: Towards Enhanced Calibration of Pretrained Models for Medical Images	Wenqiang Zu et.al.	2407.01003	link
2024-06-25	Structured Unrestricted-Rank Matrices for Parameter Efficient Fine-tuning	Arijit Sehanobish et.al.	2406.17740	link
2024-06-19	Parameter Training Efficiency Aware Resource Allocation for AIGC in Space-Air-Ground Integrated Networks	Liangxin Qian et.al.	2406.13602	null
2024-06-19	Sparse High Rank Adapters	Kartikeya Bhardwaj et.al.	2406.13175	null
2024-06-18	Bayesian-LoRA: LoRA based Parameter Efficient Fine-Tuning using Optimal Quantization levels and Rank Values trough Differentiable Bayesian Gates	Cristian Meo et.al.	2406.13046	null
2024-06-18	Fighting Randomness with Randomness: Mitigating Optimisation Instability of Fine-Tuning using Delayed Ensemble and Noisy Interpolation	Branislav Pecher et.al.	2406.12471	link
2024-06-17	A Semantic-based Layer Freezing Approach to Efficient Fine-Tuning of Language Models	Jian Gu et.al.	2406.11753	null
2024-06-16	ExPLoRA: Parameter-Efficient Extended Pre-Training to Adapt Vision Transformers under Domain Shifts	Samar Khanna et.al.	2406.10973	null
2024-06-16	ShareLoRA: Parameter Efficient and Robust Large Language Model Fine-tuning via Shared Low-Rank Adaptation	Yurun Song et.al.	2406.10785	link
2024-06-16	RoseLoRA: Row and Column-wise Sparse Low-rank Adaptation of Pre-trained Language Model for Knowledge Editing and Fine-tuning	Haoyu Wang et.al.	2406.10777	link
2024-06-15	Benchmarking Children’s ASR with Supervised and Self-supervised Speech Foundation Models	Ruchao Fan et.al.	2406.10507	link
2024-06-15	Personalized Pieces: Efficient Personalized Large Language Models through Collaborative Efforts	Zhaoxuan Tan et.al.	2406.10471	link
2024-06-13	Reflecting on the State of Rehearsal-free Continual Learning with Pretrained Models	Lukas Thede et.al.	2406.09384	null
2024-06-12	Exploring Fact Memorization and Style Imitation in LLMs Using QLoRA: An Experimental Study and Quality Assessment Methods	Eugene Vyborov et.al.	2406.08582	null
2024-06-12	The Impact of Initialization on LoRA Finetuning Dynamics	Soufiane Hayou et.al.	2406.08447	null
2024-06-20	Low-Rank Quantization-Aware Training for LLMs	Yelysei Bondarenko et.al.	2406.06385	link
2024-06-10	A Parameter-efficient Language Extension Framework for Multilingual ASR	Wei Liu et.al.	2406.06329	null
2024-06-09	A Comprehensive Evaluation of Parameter-Efficient Fine-Tuning on Automated Program Repair	Guochang Li et.al.	2406.05639	link
2024-06-07	Efficient Differentially Private Fine-Tuning of Diffusion Models	Jing Liu et.al.	2406.05257	null
2024-06-07	CorDA: Context-Oriented Decomposition Adaptation of Large Language Models	Yibo Yang et.al.	2406.05223	link
2024-06-07	An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models	Xiongtao Zhou et.al.	2406.05130	link
2024-06-07	MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter	Jitai Hao et.al.	2406.04984	link
2024-06-06	Time Sensitive Knowledge Editing through Efficient Finetuning	Xiou Ge et.al.	2406.04496	link
2024-06-06	VHDL-Eval: A Framework for Evaluating Large Language Models in VHDL Code Generation	Prashanth Vijayaraghavan et.al.	2406.04379	null
2024-06-10	Hypernetworks for Personalizing ASR to Atypical Speech	Max Müller-Eberstein et.al.	2406.04240	null
2024-06-06	Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning	Naibin Gu et.al.	2406.03792	link
2024-06-05	Choice of PEFT Technique in Continual Learning: Prompt Tuning is Not All You Need	Martin Wistuba et.al.	2406.03216	null
2024-06-06	Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision	Minglei Li et.al.	2406.03051	null
2024-05-31	Mamba State-Space Models Can Be Strong Downstream Learners	John T. Halloran et.al.	2406.00209	null
2024-05-30	ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections	Massimo Bini et.al.	2405.20271	link
2024-05-30	SVFT: Parameter-Efficient Fine-Tuning with Singular Vectors	Vijay Lingam et.al.	2405.19597	link
2024-05-29	MemControl: Mitigating Memorization in Medical Diffusion Models via Automated Parameter Selection	Raman Dutt et.al.	2405.19458	link
2024-05-29	MLAE: Masked LoRA Experts for Parameter-Efficient Fine-Tuning	Junjie Wang et.al.	2405.18897	link
2024-05-29	Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation	Zelin Peng et.al.	2405.18840	null
2024-06-01	Low-Rank Few-Shot Adaptation of Vision-Language Models	Maxime Zanella et.al.	2405.18541	null
2024-05-28	Semantic are Beacons: A Semantic Perspective for Unveiling Parameter-Efficient Fine-Tuning in Knowledge Learning	Renzhi Wang et.al.	2405.18292	null
2024-05-28	VeLoRA: Memory Efficient Training using Rank-1 Sub-Token Projections	Roy Miles et.al.	2405.17991	link
2024-05-28	Sparsity- and Hybridity-Inspired Visual Parameter-Efficient Fine-Tuning for Medical Diagnosis	Mingyuan Liu et.al.	2405.17877	null
2024-05-27	LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters	Klaudia Bałazy et.al.	2405.17604	link
2024-05-23	EMR-Merging: Tuning-Free High-Performance Model Merging	Chenyu Huang et.al.	2405.17461	link
2024-05-28	DoRA: Enhancing Parameter-Efficient Fine-Tuning with Dynamic Rank Distribution	Yulong Mao et.al.	2405.17357	link
2024-05-27	$\textit{Trans-LoRA}$ : towards data-free Transferable Parameter Efficient Finetuning	Runqian Wang et.al.	2405.17258	null
2024-05-30	Sparse Matrix in Large Language Model Fine-tuning	Haoze He et.al.	2405.15525	null
2024-05-24	Prompt Tuning Strikes Back: Customizing Foundation Models with Low-Rank Prompt Adaptation	Abhinav Jain et.al.	2405.15282	link
2024-05-27	VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks	Yang Li et.al.	2405.15179	link
2024-05-23	Bitune: Bidirectional Instruction-Tuning	Dawid J. Kopiczko et.al.	2405.14862	null
2024-05-23	Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference	Ting Liu et.al.	2405.14700	link
2024-05-22	Spectral Adapter: Fine-Tuning in Spectral Space	Fangzhao Zhang et.al.	2405.13952	link
2024-05-24	MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models	Jingwei Xu et.al.	2405.13053	link
2024-05-20	FeTT: Continual Class Incremental Learning via Feature Transformation Tuning	Sunyuan Qiang et.al.	2405.11822	null
2024-05-21	HARIS: Human-Like Attention for Reference Image Segmentation	Mengxi Zhang et.al.	2405.10707	null
2024-05-28	DP-DyLoRA: Fine-Tuning Transformer-Based Models On-Device under Differentially Private Federated Learning using Dynamic Low-Rank Adaptation	Jie Xu et.al.	2405.06368	null
2024-05-09	Selective Fine-tuning on LLM-labeled Data May Reduce Reliance on Human Annotation: A Case Study Using Schedule-of-Event Table Detection	Bhawesh Kumar et.al.	2405.06093	null
2024-05-09	Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning	Shibo Jie et.al.	2405.05615	link
2024-05-07	Refining Joint Text and Source Code Embeddings for Retrieval Task with Parameter-Efficient Fine-Tuning	Karim Galliamov et.al.	2405.04126	link
2024-05-04	Random Masking Finds Winning Tickets for Parameter Efficient Fine-tuning	Jing Xu et.al.	2405.02596	link
2024-03-16	Empirical Studies of Parameter Efficient Methods for Large Language Models of Code and Knowledge Transfer to R	Amirreza Esmaeili et.al.	2405.01553	link
2024-05-02	NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment	Gerald Shen et.al.	2405.01481	link
2024-04-29	LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report	Justin Zhao et.al.	2405.00732	link
2024-05-01	Investigating Automatic Scoring and Feedback using Large Language Models	Gloria Ashiya Katuka et.al.	2405.00602	null
2024-05-01	MoPEFT: A Mixture-of-PEFTs for the Segment Anything Model	Rajat Sahay et.al.	2405.00293	null
2024-04-30	SPAFIT: Stratified Progressive Adaptation Fine-tuning for Pre-trained Large Language Models	Samir Arora et.al.	2405.00201	null
2024-05-23	HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning	Chunlin Tian et.al.	2404.19245	link
2024-05-25	FeDeRA:Efficient Fine-tuning of Language Models in Federated Learning Leveraging Weight Decomposition	Yuxuan Yan et.al.	2404.18848	null
2024-04-25	Efficiency in Focus: LayerNorm as a Catalyst for Fine-tuning Medical Visual Language Pre-trained Models	Jiawei Chen et.al.	2404.16385	null
2024-05-23	MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of Experts	Dengchun Li et.al.	2404.15159	link
2024-04-22	ColA: Collaborative Adaptation with Gradient Learning	Enmao Diao et.al.	2404.13844	link
2024-04-23	Parameter Efficient Fine Tuning: A Comprehensive Analysis Across Applications	Charith Chandra Sai Balne et.al.	2404.13506	null
2024-04-18	SKIP: Skill-Localized Prompt Tuning for Inference Speed Boost-Up	Nakyeong Yang et.al.	2404.11916	null
2024-04-16	Shears: Unstructured Sparsity with Neural Low-rank Adapter Search	J. Pablo Muñoz et.al.	2404.10934	link
2024-04-16	Exact and Efficient Unlearning for Large Language Model-based Recommendation	Zhiyu Hu et.al.	2404.10327	null
2024-04-15	LoRA Dropout as a Sparsity Regularizer for Overfitting Control	Yang Lin et.al.	2404.09610	null
2024-04-21	Analyzing the Impact of Data Selection and Fine-Tuning on Economic and Political Biases in LLMs	Ahmed Agiza et.al.	2404.08699	link
2024-04-08	Certified PEFTSmoothing: Parameter-Efficient Fine-Tuning with Randomized Smoothing	Chengyan Fu et.al.	2404.05350	null
2024-04-08	DLoRA: Distributed Parameter-Efficient Fine-Tuning Solution for Large Language Model	Chao Gao et.al.	2404.05182	null
2024-04-12	Q-PEFT: Query-dependent Parameter Efficient Fine-tuning for Text Reranking with Large Language Models	Zhiyuan Peng et.al.	2404.04522	null
2024-04-05	Unlocking Parameter-Efficient Fine-Tuning for Low-Resource Language Translation	Tong Su et.al.	2404.04212	null
2024-05-22	ReFT: Representation Finetuning for Language Models	Zhengxuan Wu et.al.	2404.03592	link
2024-06-11	Personalized LLM Response Generation with Parameterized Memory Injection	Kai Zhang et.al.	2404.03565	link
2024-06-20	Eigenpruning: an Interpretability-Inspired PEFT Method	Tomás Vergara-Browne et.al.	2404.03147	link
2024-05-28	PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models	Fanxu Meng et.al.	2404.02948	link
2024-04-03	Enhancing Low-Resource LLMs Classification with PEFT and Synthetic Data	Parth Patwa et.al.	2404.02422	null
2024-04-11	IISAN: Efficiently Adapting Multimodal Representation for Sequential Recommendation with Decoupled PEFT	Junchen Fu et.al.	2404.02059	link
2024-03-31	Query-driven Relevant Paragraph Extraction from Legal Judgments	T. Y. S. S Santosh et.al.	2404.00595	null
2024-03-30	Edinburgh Clinical NLP at SemEval-2024 Task 2: Fine-tune your model unless you have access to GPT-4	Aryo Pradipta Gema et.al.	2404.00484	link
2024-04-03	InfLoRA: Interference-Free Low-Rank Adaptation for Continual Learning	Yan-Shuo Liang et.al.	2404.00228	link
2024-03-27	Is Modularity Transferable? A Case Study through the Lens of Knowledge Distillation	Mateusz Klimaszewski et.al.	2403.18804	link
2024-03-26	The Unreasonable Ineffectiveness of the Deeper Layers	Andrey Gromov et.al.	2403.17887	null
2024-04-15	ALoRA: Allocating Low-Rank Adaptation for Fine-tuning Large Language Models	Zequan Liu et.al.	2403.16187	null
2024-03-22	KnowLA: Enhancing Parameter-efficient Finetuning with Knowledgeable Adaptation	Xindi Luo et.al.	2403.14950	link
2024-03-22	A Single Linear Layer Yields Task-Adapted Low-Rank Matrices	Hwichan Kim et.al.	2403.14946	null
2024-03-21	AutoRE: Document-Level Relation Extraction with Large Language Models	Xue Lilong et.al.	2403.14888	link
2024-04-29	Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey	Zeyu Han et.al.	2403.14608	null
2024-03-20	Harnessing Large Language Models for Text-Rich Sequential Recommendation	Zhi Zheng et.al.	2403.13325	link
2024-04-16	AFLoRA: Adaptive Freezing of Low Rank Adaptation in Parameter Efficient Fine-Tuning of Large Models	Zeyu Liu et.al.	2403.13269	null
2024-03-18	Improving LoRA in Privacy-preserving Federated Learning	Youbang Sun et.al.	2403.12313	null
2024-03-18	Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation	Wangbo Zhao et.al.	2403.11808	link
2024-03-18	Let’s Focus on Neuron: Neuron-Level Supervised Fine-tuning for Large Language Model	Haoyun Xu et.al.	2403.11621	null
2024-03-19	JORA: JAX Tensor-Parallel LoRA Library for Retrieval Augmented Fine-Tuning	Anique Tahir et.al.	2403.11366	link
2024-03-14	Introducing Routing Functions to Vision-Language Parameter-Efficient Fine-Tuning with Low-Rank Bottlenecks	Tingyu Qu et.al.	2403.09377	link
2024-03-14	PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation	Yizhe Xiong et.al.	2403.09192	link
2024-03-13	Data-oriented Dynamic Fine-tuning Parameter Selection Strategy for FISH Mask based Efficient Fine-tuning	Ming Dong et.al.	2403.08484	null

Text-to-Image Generation

Publish Date	Title	Authors	PDF	Code
2025-07-23	Yume: An Interactive World Generation Model	Xiaofeng Mao et.al.	2507.17744	null
2025-07-23	DataWink: Reusing and Adapting SVG-based Visualization Examples with Large Multimodal Models	Liwenhan Xie et.al.	2507.17734	null
2025-07-23	Flow Matching Meets Biology and Life Science: A Survey	Zihao Li et.al.	2507.17731	null
2025-07-23	The Scaling of Triboelectric Charging Powder Drops for Industrial Applications	Tom F. O’Hara et.al.	2507.17701	null
2025-07-23	Mammo-Mamba: A Hybrid State-Space and Transformer Architecture with Sequential Mixture of Experts for Multi-View Mammography	Farnoush Bayatmakou et.al.	2507.17662	null
2025-07-23	CNS-Bench: Benchmarking Image Classifier Robustness Under Continuous Nuisance Shifts	Olaf Dünkel et.al.	2507.17651	null
2025-07-23	Dual-branch Prompting for Multimodal Machine Translation	Jie Wang et.al.	2507.17588	null
2025-07-23	An h-space Based Adversarial Attack for Protection Against Few-shot Personalization	Xide Xu et.al.	2507.17554	null
2025-07-23	Enabling Cyber Security Education through Digital Twins and Generative AI	Vita Santa Barletta et.al.	2507.17518	null
2025-07-23	HOTA: Hamiltonian framework for Optimal Transport Advection	Nazar Buzun et.al.	2507.17513	null
2025-07-23	Accelerating Parallel Diffusion Model Serving with Residual Compression	Jiajun Luo et.al.	2507.17511	null
2025-07-23	Illicit object detection in X-ray imaging using deep learning techniques: A comparative evaluation	Jorgen Cani et.al.	2507.17508	null
2025-07-23	Exact results for active particle models: from long-range interactions to first-passage properties	Léo Touzo et.al.	2507.17504	null
2025-07-23	Unsupervised anomaly detection using Bayesian flow networks: application to brain FDG PET in the context of Alzheimer’s disease	Hugues Roy et.al.	2507.17486	null
2025-07-23	IndoorBEV: Joint Detection and Footprint Completion of Objects via Mask-based Prediction in Indoor Scenarios for Bird’s-Eye View Perception	Haichuan Li et.al.	2507.17445	null
2025-07-22	Stellar Mass-Dispersion Measure Correlations Constrain Baryonic Feedback in Fast Radio Burst Host Galaxies	Calvin Leung et.al.	2507.16816	null
2025-07-22	Uncertainty-Aware Knowledge Transformers for Peer-to-Peer Energy Trading with Multi-Agent Reinforcement Learning	Mian Ibad Ali Shah et.al.	2507.16796	null
2025-07-22	Enhancing Domain Diversity in Synthetic Data Face Recognition with Dataset Fusion	Anjith George et.al.	2507.16790	null
2025-07-22	Generative Diffusion Models for Wireless Networks: Fundamental, Architecture, and State-of-the-Art	Dayu Fan et.al.	2507.16733	null
2025-07-22	HarmonPaint: Harmonized Training-Free Diffusion Inpainting	Ying Li et.al.	2507.16732	null
2025-07-22	Enhancing Remote Sensing Vision-Language Models Through MLLM and LLM-Based High-Quality Image-Text Dataset Generation	Yiguo He et.al.	2507.16716	null
2025-07-22	Custom Algorithm-based Fault Tolerance for Attention Layers in Transformers	Vasileios Titopoulos et.al.	2507.16676	null
2025-07-22	Towards Automated Regulatory Compliance Verification in Financial Auditing with Large Language Models	Armin Berger et.al.	2507.16642	null
2025-07-22	Automatic Fine-grained Segmentation-assisted Report Generation	Frederic Jonske et.al.	2507.16623	null
2025-07-22	A Target-based Multi-LiDAR Multi-Camera Extrinsic Calibration System	Lorenzo Gentilini et.al.	2507.16621	null
2025-07-22	Pyramid Hierarchical Masked Diffusion Model for Imaging Synthesis	Xiaojiao Xiao et.al.	2507.16579	null
2025-07-22	Alternative Loss Function in Evaluation of Transformer Models	Jakub Michańków et.al.	2507.16548	null
2025-07-22	Robust Noisy Pseudo-label Learning for Semi-supervised Medical Image Segmentation Using Diffusion Model	Lin Xi et.al.	2507.16429	null
2025-07-22	Knowledge-aware Diffusion-Enhanced Multimedia Recommendation	Xian Mo et.al.	2507.16396	null
2025-07-23	Ironman: Accelerating Oblivious Transfer Extension for Privacy-Preserving AI with Near-Memory Processing	Chenqi Lin et.al.	2507.16391	null
2025-07-21	Diffusion Beats Autoregressive in Data-Constrained Settings	Mihir Prabhudesai et.al.	2507.15857	null
2025-07-21	Latent Denoising Makes Good Visual Tokenizers	Jiawei Yang et.al.	2507.15856	null
2025-07-21	Optimized Fabrication Procedure for High-Quality Graphene-based Moiré Superlattice Devices	Shuwen Sun et.al.	2507.15853	null
2025-07-21	Can Your Model Separate Yolks with a Water Bottle? Benchmarking Physical Commonsense Understanding in Video Generation Models	Enes Sanli et.al.	2507.15824	null
2025-07-21	Diffusion models for multivariate subsurface generation and efficient probabilistic inversion	Roberto Miele et.al.	2507.15809	null
2025-07-22	Supernova: Achieving More with Less in Transformer Architectures	Andrei-Valentin Tanase et.al.	2507.15773	null
2025-07-21	Deep-Learning Investigation of Vibrational Raman Spectra for Plant-Stress Analysis	Anoop C. Patil et.al.	2507.15772	null
2025-07-21	DiffuMeta: Algebraic Language Models for Inverse Design of Metamaterials via Diffusion Transformers	Li Zheng et.al.	2507.15753	null
2025-07-21	DialogueForge: LLM Simulation of Human-Chatbot Dialogue	Ruizhe Zhu et.al.	2507.15752	null
2025-07-21	TokensGen: Harnessing Condensed Tokens for Long Video Generation	Wenqi Ouyang et.al.	2507.15728	null
2025-07-21	A Practical Investigation of Spatially-Controlled Image Generation with Transformers	Guoxuan Xia et.al.	2507.15724	null
2025-07-21	DiffPF: Differentiable Particle Filtering with Generative Sampling via Conditional Diffusion Models	Ziyu Wan et.al.	2507.15716	null
2025-07-21	Estimating Rate-Distortion Functions Using the Energy-Based Model	Shitong Wu et.al.	2507.15700	null
2025-07-21	BugScope: Learn to Find Bugs Like Human	Jinyao Guo et.al.	2507.15671	null
2025-07-21	SustainDiffusion: Optimising the Social and Environmental Sustainability of Stable Diffusion Models	Giordano d’Aloisio et.al.	2507.15663	null
2025-07-18	NoHumansRequired: Autonomous High-Quality Image Editing Triplet Mining	Maksim Kuprashevich et.al.	2507.14119	null
2025-07-18	D2IP: Deep Dynamic Image Prior for 3D Time-sequence Pulmonary Impedance Imaging	Hao Fang et.al.	2507.14046	null
2025-07-18	TGIF: Talker Group-Informed Familiarization of Target Speaker Extraction	Tsun-An Hsieh et.al.	2507.14044	null
2025-07-18	CSD-VAR: Content-Style Decomposition in Visual Autoregressive Models	Quang-Binh Nguyen et.al.	2507.13984	null
2025-07-18	Generalist Forecasting with Frozen Video Models via Latent Diffusion	Jacob C Walker et.al.	2507.13942	null
2025-07-18	PositionIC: Unified Position and Identity Consistency for Image Customization	Junjie Hu et.al.	2507.13861	null
2025-07-18	DynFaceRestore: Balancing Fidelity and Quality in Diffusion-Guided Blind Face Restoration with Dynamic Blur-Level Mapping and Guidance	Huu-Phu Do et.al.	2507.13797	null
2025-07-18	Learning Spectral Diffusion Prior for Hyperspectral Image Reconstruction	Mingyang Yu et.al.	2507.13769	null
2025-07-18	MolPIF: A Parameter Interpolation Flow Model for Molecule Generation	Yaowei Jin et.al.	2507.13762	null
2025-07-18	Can Synthetic Images Conquer Forgetting? Beyond Unexplored Doubts in Few-Shot Class-Incremental Learning	Junsu Kim et.al.	2507.13739	null
2025-07-18	The Judge Variable: Challenging Judge-Agnostic Legal Judgment Prediction	Guillaume Zambrano et.al.	2507.13732	null
2025-07-18	Solving wave equation problems on D-Wave quantum annealers	Aigerim Bazarkhanova et.al.	2507.13724	null
2025-07-18	PoemTale Diffusion: Minimising Information Loss in Poem to Image Generation with Multi-Stage Prompt Refinement	Sofia Jamil et.al.	2507.13708	null
2025-07-18	Gaussian kernel-based motion measurement	Hongyi Liu et.al.	2507.13693	null
2025-07-18	CU-ICU: Customizing Unsupervised Instruction-Finetuned Language Models for ICU Datasets via Text-to-Text Transfer Transformer	Teerapong Panboonyuen et.al.	2507.13655	null
2025-07-17	VideoITG: Multimodal Video Understanding with Instructed Temporal Grounding	Shihao Wang et.al.	2507.13353	null
2025-07-17	Hierarchical Rectified Flow Matching with Mini-Batch Couplings	Yichi Zhang et.al.	2507.13350	null
2025-07-17	Imbalance in Balance: Online Concept Balancing in Generation Models	Yukai Shi et.al.	2507.13345	null
2025-07-17	Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models	Yudong Jin et.al.	2507.13344	null
2025-07-17	FashionPose: Text to Pose to Relight Image Generation for Personalized Fashion Visualization	Chuancheng Shi et.al.	2507.13311	null
2025-07-17	FocusView: Understanding and Customizing Informational Video Watching Experiences for Viewers with ADHD	Hanxiu ‘Hazel’ Zhu et.al.	2507.13309	null
2025-07-17	The Making of a Community Dark Matter Dataset with the National Science Data Fabric	Amy Roberts et.al.	2507.13297	null
2025-07-17	DiffClean: Diffusion-based Makeup Removal for Accurate Age Estimation	Ekta Balkrishna Gavas et.al.	2507.13292	null
2025-07-17	Evaluating Reinforcement Learning Algorithms for Navigation in Simulated Robotic Quadrupeds: A Comparative Study Inspired by Guide Dog Behaviour	Emma M. A. Harrison et.al.	2507.13277	null
2025-07-17	RemVerse: Supporting Reminiscence Activities for Older Adults through AI-Assisted Virtual Reality	Ruohao Li et.al.	2507.13247	null
2025-07-17	VITA: Vision-to-Action Flow Matching Policy	Dechen Gao et.al.	2507.13231	null
2025-07-17	SHIELD: A Secure and Highly Enhanced Integrated Learning for Robust Deepfake Detection against Adversarial Attacks	Kutub Uddin et.al.	2507.13170	null
2025-07-17	Online Rounding for Set Cover under Subset Arrivals	Jarosław Byrka et.al.	2507.13159	null
2025-07-17	Multi-population GAN Training: Analyzing Co-Evolutionary Algorithms	Walter P. Casas et.al.	2507.13157	null
2025-07-17	fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting	Alicia Durrer et.al.	2507.13146	null
2025-07-17	PhysX: Physical-Grounded 3D Asset Generation	Ziang Cao et.al.	2507.12465	null
2025-07-16	Characterizing State Space Model (SSM) and SSM-Transformer Hybrid Language Model Performance with Long Context Length	Saptarshi Mitra et.al.	2507.12442	null
2025-07-17	High-Performance Pipelined NTT Accelerators with Homogeneous Digit-Serial Modulo Arithmetic	George Alexakis et.al.	2507.12418	null
2025-07-16	Modeling Feasible Locomotion of Nanobots for Cancer Detection and Treatment	Noble Harasha et.al.	2507.12400	null
2025-07-16	Developing Visual Augmented Q&A System using Scalable Vision Embedding Retrieval & Late Interaction Re-ranker	Rachna Saxena et.al.	2507.12378	null
2025-07-16	Unsupervised Monocular 3D Keypoint Discovery from Multi-View Diffusion Priors	Subin Jeon et.al.	2507.12336	null
2025-07-17	Compositional Discrete Latent Code for High Fidelity, Productive Diffusion Models	Samuel Lavoie et.al.	2507.12318	null
2025-07-16	FADE: Adversarial Concept Erasure in Flow Models	Zixuan Fu et.al.	2507.12283	null
2025-07-16	Infherno: End-to-end Agent-based FHIR Resource Synthesis from Free-form Clinical Notes	Johann Frei et.al.	2507.12261	null
2025-07-16	Generate to Ground: Multimodal Text Conditioning Boosts Phrase Grounding in Medical Vision-Language Models	Felix Nützel et.al.	2507.12236	null
2025-07-16	Sparse Autoencoders for Sequential Recommendation Models: Interpretation and Flexible Control	Anton Klenitskiy et.al.	2507.12202	null
2025-07-16	RODS: Robust Optimization Inspired Diffusion Sampling for Detecting and Reducing Hallucination in Generative Models	Yiqi Tian et.al.	2507.12201	null
2025-07-16	Shape Adaptation for 3D Hairstyle Retargeting	Lu Yu et.al.	2507.12168	null
2025-07-16	RadioDiff-3D: A 3D $\times$ 3D Radio Map Dataset and Generative Diffusion Based Benchmark for 6G Environment-Aware Communication	Xiucheng Wang et.al.	2507.12166	null
2025-07-16	Multi-Component VAE with Gaussian Markov Random Field	Fouad Oubari et.al.	2507.12165	null
2025-07-15	DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering	Yinsheng Li et.al.	2507.11527	null
2025-07-15	CATVis: Context-Aware Thought Visualization	Tariq Mehmood et.al.	2507.11522	null
2025-07-15	HUG-VAS: A Hierarchical NURBS-Based Generative Model for Aortic Geometry Synthesis and Controllable Editing	Pan Du et.al.	2507.11474	null
2025-07-15	Implementing Adaptations for Vision AutoRegressive Model	Kaif Shaikh et.al.	2507.11441	null
2025-07-15	FLsim: A Modular and Library-Agnostic Simulation Framework for Federated Learning	Arnab Mukherjee et.al.	2507.11430	null
2025-07-15	Towards Reliable Objective Evaluation Metrics for Generative Singing Voice Separation Models	Paul A. Bereuter et.al.	2507.11427	null
2025-07-15	Mapping Diffuse Radio Sources Using TUNA: A Transformer-Based Deep Learning Approach	Nicoletta Sanvitale et.al.	2507.11320	null
2025-07-15	Ocean Diviner: A Diffusion-Augmented Reinforcement Learning for AUV Robust Control in the Underwater Tasks	Weiyi Liu et.al.	2507.11283	null
2025-07-15	YOLOatr : Deep Learning Based Automatic Target Detection and Localization in Thermal Infrared Imagery	Aon Safdar et.al.	2507.11267	null
2025-07-15	Strategic Customer Behavior in an M/M/1 Feedback Queue with General Payoffs	Peter Taylor et.al.	2507.11263	null
2025-07-15	MFGDiffusion: Mask-Guided Smoke Synthesis for Enhanced Forest Fire Detection	Guanghao Wu et.al.	2507.11252	null
2025-07-15	Generative Click-through Rate Prediction with Applications to Search Advertising	Lingwei Kong et.al.	2507.11246	null
2025-07-15	NarrLV: Towards a Comprehensive Narrative-Centric Evaluation for Long Video Generation Models	X. Feng et.al.	2507.11245	null
2025-07-15	Latent Space Consistency for Sparse-View CT Reconstruction	Duoyou Chen et.al.	2507.11152	null
2025-07-15	EditGen: Harnessing Cross-Attention Control for Instruction-Based Auto-Regressive Audio Editing	Vassilis Sioros et.al.	2507.11096	null
2025-07-14	MP1: Mean Flow Tames Policy Learning in 1-step for Robotic Manipulation	Juyi Sheng et.al.	2507.10543	null
2025-07-14	Accurate generation of chemical reaction transition states by conditional flow matching	Ping Tuo et.al.	2507.10530	null
2025-07-14	Solving the compute crisis with physics-based ASICs	Maxwell Aifer et.al.	2507.10463	null
2025-07-14	TAT: Temporal-Aligned Transformer for Multi-Horizon Peak Demand Forecasting	Zhiyuan Zhao et.al.	2507.10349	null
2025-07-14	Parallel Sampling of Diffusion Models on $SO(3)$	Yan-Ting Chen et.al.	2507.10347	null
2025-07-15	Text Embedding Knows How to Quantize Text-Guided Diffusion Models	Hongjae Lee et.al.	2507.10340	null
2025-07-14	Mind the Gap: Aligning Vision Foundation Models to Image Feature Matching	Yuhan Liu et.al.	2507.10318	null
2025-07-14	Synthesizing Near-Boundary OOD Samples for Out-of-Distribution Detection	Jinglun Li et.al.	2507.10225	null
2025-07-14	From Wardrobe to Canvas: Wardrobe Polyptych LoRA for Part-level Controllable Human Image Generation	Jeongho Kim et.al.	2507.10217	null
2025-07-14	History Matching under Uncertainty of Geological Scenarios with Implicit Geological Realism Control with Generative Deep Learning and Graph Convolutions	Gleb Shishaev et.al.	2507.10201	null
2025-07-14	Robust RL Control for Bipedal Locomotion with Closed Kinematic Chains	Egor Maslennikov et.al.	2507.10164	null
2025-07-14	FIX-CLIP: Dual-Branch Hierarchical Contrastive Learning via Synthetic Captions for Better Understanding of Long Text	Bingchao Wang et.al.	2507.10095	null
2025-07-14	Towards High Supervised Learning Utility Training Data Generation: Data Pruning and Column Reordering	Tung Sum Thomas Kwok et.al.	2507.10088	null
2025-07-14	Frequency Regulation for Exposure Bias Mitigation in Diffusion Models	Meng Yu et.al.	2507.10072	null
2025-07-14	MEDebiaser: A Human-AI Feedback System for Mitigating Bias in Multi-label Medical Image Classification	Shaohan Shi et.al.	2507.10044	null
2025-07-11	NeuralOS: Towards Simulating Operating Systems via Neural Generative Models	Luke Rivard et.al.	2507.08800	null
2025-07-11	From One to More: Contextual Part Latents for 3D Generation	Shaocong Dong et.al.	2507.08772	null
2025-07-11	On Fair Epsilon Net and Geometric Hitting Set	Mohsen Dehghankar et.al.	2507.08758	null
2025-07-11	A Neutron Sensitive Detector Using 3D-Printed Scintillators	Adam Barr et.al.	2507.08663	null
2025-07-11	Robust inference under Benford’s law	Lucio Barabesi et.al.	2507.08650	null
2025-07-11	Adaptive Framework for Ambient Intelligence in Rehabilitation Assistance	Gábor Baranyi et.al.	2507.08624	null
2025-07-11	FreeAudio: Training-Free Timing Planning for Controllable Long-Form Text-to-Audio Generation	Yuxuan Jiang et.al.	2507.08557	null
2025-07-11	Anisotropic Diffusion of $e^\pm$ in Pulsar Halos over Multiple Coherence of Magnetic Field	Kai Yan et.al.	2507.08526	null
2025-07-11	Advancing Multimodal LLMs by Large-Scale 3D Visual Instruction Dataset Generation	Liu He et.al.	2507.08513	null
2025-07-11	SynBridge: Bridging Reaction States via Discrete Flow for Bidirectional Reaction Prediction	Haitao Lin et.al.	2507.08475	null
2025-07-11	Modulation of energy and angular momentum radiation of two-dimensional altermagnets	Yong-Mei Zhang et.al.	2507.08450	null
2025-07-11	Generative artificial intelligence and hybrid models to accelerate LES in reactive flows: Application to hydrogen/methane combustion	Xiangrui Zou et.al.	2507.08426	null
2025-07-11	Upsample What Matters: Region-Adaptive Latent Sampling for Accelerated Diffusion Transformers	Wongi Jeong et.al.	2507.08422	null
2025-07-11	InstaScene: Towards Complete 3D Instance Decomposition and Reconstruction from Cluttered Scenes	Zesong Yang et.al.	2507.08416	null
2025-07-11	Inference-Time Scaling of Diffusion Language Models with Particle Gibbs Sampling	Meihua Dang et.al.	2507.08390	null
2025-07-10	Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs	Ziyue Li et.al.	2507.07996	null
2025-07-10	Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling	Haoyu Wu et.al.	2507.07982	null
2025-07-10	Scaling RL to Long Videos	Yukang Chen et.al.	2507.07966	null
2025-07-10	Dynamic Chunking for End-to-End Hierarchical Sequence Modeling	Sukjun Hwang et.al.	2507.07955	null
2025-07-10	Low Resource Reconstruction Attacks Through Benign Prompts	Sol Yarkoni et.al.	2507.07947	null
2025-07-10	Multimodal Framework for Explainable Autonomous Driving: Integrating Video, Sensor, and Textual Data for Enhanced Decision-Making and Transparency	Abolfazl Zarghani et.al.	2507.07938	null
2025-07-10	Towards Continuous Home Cage Monitoring: An Evaluation of Tracking and Identification Strategies for Laboratory Mice	Juan Pablo Oberhauser et.al.	2507.07929	null
2025-07-11	Single-Step Latent Diffusion for Underwater Image Restoration	Jiayi Wu et.al.	2507.07878	null
2025-07-10	Mitigating Watermark Stealing Attacks in Generative Models via Multi-Key Watermarking	Toluwani Aremu et.al.	2507.07871	null
2025-07-10	Re-Bottleneck: Latent Re-Structuring for Neural Audio Autoencoders	Dimitrios Bralios et.al.	2507.07867	null
2025-07-10	ROS Help Desk: GenAI Powered, User-Centric Framework for ROS Error Diagnosis and Debugging	Kavindie Katuwandeniya et.al.	2507.07846	null
2025-07-10	Benchmarking Content-Based Puzzle Solvers on Corrupted Jigsaw Puzzles	Richard Dirauf et.al.	2507.07828	null
2025-07-10	A Unified Empirical Risk Minimization Framework for Flexible N-Tuples Weak Supervision	Shuying Huang et.al.	2507.07771	null
2025-07-10	Phase-Space Synchronization Driven by Moon-Magnetosphere Coupling in Gas Giants	Adnane Osmane et.al.	2507.07739	null
2025-07-10	AI Human Impact: Toward a Model for Ethical Investing in AI-Intensive Companies	James Brusseau et.al.	2507.07703	null
2025-07-09	Towards Multimodal Understanding via Stable Diffusion as a Task-Aware Feature Extractor	Vatsal Agarwal et.al.	2507.07106	null
2025-07-09	4KAgent: Agentic Any Image to 4K Super-Resolution	Yushen Zuo et.al.	2507.07105	null
2025-07-09	Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models	Tiezheng Zhang et.al.	2507.07104	null
2025-07-09	Lifetime study of the ColdADC for the Deep Underground Neutrino Experiment	Wenjie Wu et.al.	2507.07086	null
2025-07-09	Evaluating Attribute Confusion in Fashion Text-to-Image Generation	Ziyue Liu et.al.	2507.07079	null
2025-07-09	ZKTorch: Compiling ML Inference to Zero-Knowledge Proofs via Parallel Proof Accumulation	Bing-Jyue Chen et.al.	2507.07031	null
2025-07-09	Design and Implementation of an OCR-Powered Pipeline for Table Extraction from Invoices	Parshva Dhilankumar Patel et.al.	2507.07029	null
2025-07-09	Exact Evaluation of the Accuracy of Diffusion Models for Inverse Problems with Gaussian Data Distributions	Emile Pierret et.al.	2507.07008	null
2025-07-10	Hallucinating 360°: Panoramic Street-View Generation via Local Scenes Diffusion and Probabilistic Prompting	Fei Teng et.al.	2507.06971	null
2025-07-09	DiffSpectra: Molecular Structure Elucidation from Spectra using Diffusion Models	Liang Wang et.al.	2507.06853	null
2025-07-09	Physics-Grounded Motion Forecasting via Equation Discovery for Trajectory-Guided Image-to-Video Generation	Tao Feng et.al.	2507.06830	null
2025-07-09	Democratizing High-Fidelity Co-Speech Gesture Video Generation	Xu Yang et.al.	2507.06812	null
2025-07-09	Electric-field-assisted phase switching for crystal phase quantum dot fabrication in GaAs nanowires	Qiang Yu et.al.	2507.06699	null
2025-07-09	Enhancing Diffusion Model Stability for Image Restoration via Gradient Management	Hongjie Wu et.al.	2507.06656	null
2025-07-09	Diff $^2$ I2P: Differentiable Image-to-Point Cloud Registration with Diffusion Prior	Juncheng Mu et.al.	2507.06651	null
2025-07-08	Modern Methods in Associative Memory	Dmitry Krotov et.al.	2507.06211	null
2025-07-08	CultureCLIP: Empowering CLIP with Cultural Awareness through Synthetic Images and Contextualized Captions	Yuchen Huang et.al.	2507.06210	null
2025-07-08	A Survey on Latent Reasoning	Rui-Jie Zhu et.al.	2507.06203	null
2025-07-08	SQLBarber: A System Leveraging Large Language Models to Generate Customized and Realistic SQL Workloads	Jiale Lao et.al.	2507.06192	null
2025-07-08	Prompt-Free Conditional Diffusion for Multi-object Image Augmentation	Haoyu Wang et.al.	2507.06146	null
2025-07-08	OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety	Sanidhya Vijayvargiya et.al.	2507.06134	null
2025-07-08	Bridging Sequential Deep Operator Network and Video Diffusion: Residual Refinement of Spatio-Temporal PDE Solutions	Jaewan Park et.al.	2507.06133	null
2025-07-08	Unconditional Diffusion for Generative Sequential Recommendation	Yimeng Bai et.al.	2507.06121	null
2025-07-09	Omni-Video: Democratizing Unified Video Understanding and Generation	Zhiyu Tan et.al.	2507.06119	null
2025-07-08	Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis	Xintong Hu et.al.	2507.06116	null
2025-07-08	Fun with flags: How Compilers Break and Fix Constant-Time Code	Antoine Geimer et.al.	2507.06112	null
2025-07-08	ScoreAdv: Score-based Targeted Generation of Natural Adversarial Examples via Diffusion Models	Chihan Huang et.al.	2507.06078	null
2025-07-08	Kernel Trace Distance: Quantum Statistical Metric between Measures through RKHS Density Operators	Arturo Castellanos et.al.	2507.06055	null
2025-07-08	TextPixs: Glyph-Conditioned Diffusion with Character-Aware Attention and OCR-Guided Supervision	Syeda Anshrah Gillani et.al.	2507.06033	null
2025-07-08	KnowIt: Deep Time Series Modeling and Interpretation	M. W. Theunissen et.al.	2507.06009	null
2025-07-07	Modeling Latent Partner Strategies for Adaptive Zero-Shot Human-Agent Collaboration	Benjamin Li et.al.	2507.05244	null
2025-07-08	SciMaster: Towards General-Purpose Scientific AI Agents, Part I. X-Master as Foundation: Can We Lead on Humanity’s Last Exam?	Jingyi Chai et.al.	2507.05241	null
2025-07-08	MedGemma Technical Report	Andrew Sellergren et.al.	2507.05201	null
2025-07-07	EmbodieDreamer: Advancing Real2Sim2Real Transfer for Policy Training via Embodied World Modeling	Boyuan Wang et.al.	2507.05198	null
2025-07-07	Viscoelastic Characterization of Melanoma Cells Using Brillouin Spectroscopy	Mykyta Kizilov et.al.	2507.05186	null
2025-07-07	A Dynamical Systems Perspective on the Analysis of Neural Networks	Dennis Chemnitz et.al.	2507.05164	null
2025-07-07	4DSloMo: 4D Reconstruction for High Speed Scene with Asynchronous Capture	Yutian Chen et.al.	2507.05163	null
2025-07-07	AI Generated Text Detection Using Instruction Fine-tuned Large Language and Transformer-Based Models	Chinnappa Guggilla et.al.	2507.05157	null
2025-07-07	SV-DRR: High-Fidelity Novel View X-Ray Synthesis Using Diffusion Model	Chun Xie et.al.	2507.05148	null
2025-07-07	VERITAS: Verification and Explanation of Realness in Images for Transparency in AI Systems	Aadi Srivastava et.al.	2507.05146	null
2025-07-07	DICE: Discrete inverse continuity equation for learning population dynamics	Tobias Blickhan et.al.	2507.05107	null
2025-07-07	MoDiT: Learning Highly Consistent 3D Motion Coefficients with Diffusion Transformer for Talking Head Generation	Yucheng Wang et.al.	2507.05092	null
2025-07-07	ICAS: Detecting Training Data from Autoregressive Image Generative Models	Hongyao Yu et.al.	2507.05068	null
2025-07-07	Replacing thinking with tool usage enables reasoning in small language models	Corrado Rainone et.al.	2507.05065	null
2025-07-07	AI-Driven Cytomorphology Image Synthesis for Medical Diagnostics	Jan Carreras Boada et.al.	2507.05063	null
2025-07-03	MultiGen: Using Multimodal Generation in Simulation to Learn Multimodal Policies in Real	Renhao Wang et.al.	2507.02864	null
2025-07-03	RefTok: Reference-Based Tokenization for Video Generation	Xiang Fan et.al.	2507.02862	null
2025-07-03	Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching	Xin Zhou et.al.	2507.02860	null
2025-07-03	AnyI2V: Animating Any Conditional Image with Motion Control	Ziye Li et.al.	2507.02857	null
2025-07-03	LLM-Driven Treatment Effect Estimation Under Inference Time Text Confounding	Yuchen Ma et.al.	2507.02843	null
2025-07-03	USAD: An Unsupervised Data Augmentation Spatio-Temporal Attention Diffusion Network	Ying Yu et.al.	2507.02827	null
2025-07-03	Towards Perception-Informed Latent HRTF Representations	You Zhang et.al.	2507.02815	null
2025-07-03	LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion	Fangfu Liu et.al.	2507.02813	null
2025-07-03	RichControl: Structure- and Appearance-Rich Training-Free Spatial Control for Text-to-Image Generation	Liheng Zhang et.al.	2507.02792	null
2025-07-03	Grounding Intelligence in Movement	Melanie Segado et.al.	2507.02771	null
2025-07-03	FairHuman: Boosting Hand and Face Quality in Human Image Generation with Minimum Potential Delay Fairness in Diffusion Models	Yuxuan Wang et.al.	2507.02714	null
2025-07-04	UniMC: Taming Diffusion Transformer for Unified Keypoint-Guided Multi-Class Image Generation	Qin Guo et.al.	2507.02713	null
2025-07-03	XPPLORE: Import, visualize, and analyze XPPAUT data in MATLAB	Matteo Martin et.al.	2507.02709	null
2025-07-03	APT: Adaptive Personalized Training for Diffusion Models with Limited Data	JungWoo Chae et.al.	2507.02687	null
2025-07-03	Learning few-step posterior samplers by unfolding and distillation of diffusion models	Charlesquin Kemajou Mbakam et.al.	2507.02686	null
2025-07-02	FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model	Yukang Cao et.al.	2507.01953	null
2025-07-02	Test-Time Scaling with Reflective Generative Model	Zixiao Wang et.al.	2507.01951	null
2025-07-02	LongAnimation: Long Animation Generation with Dynamic Global-Local Memory	Nan Chen et.al.	2507.01945	null
2025-07-02	IC-Custom: Diverse Image Customization via In-Context Learning	Yaowei Li et.al.	2507.01926	null
2025-07-02	Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning	Qingdong He et.al.	2507.01908	null
2025-07-02	Improving GANs by leveraging the quantum noise from real hardware	Hongni Jin et.al.	2507.01886	null
2025-07-02	Towards Foundation Auto-Encoders for Time-Series Anomaly Detection	Gastón García González et.al.	2507.01875	null
2025-07-02	Graded phononic metamaterials: Scalable design meets scalable microfabrication	Charles Dorn et.al.	2507.01874	null
2025-07-02	DIY-MKG: An LLM-Based Polyglot Language Learning System	Kenan Tang et.al.	2507.01872	null
2025-07-02	Bridging UI Design and chatbot Interactions: Applying Form-Based Principles to Conversational Agents	Sanjay Krishna Anbalagan et.al.	2507.01862	null
2025-07-02	MoIRA: Modular Instruction Routing Architecture for Multi-Task Robotics	Dmytro Kuzmenko et.al.	2507.01843	null
2025-07-02	Out-of-Distribution Detection Methods Answer the Wrong Questions	Yucen Lily Li et.al.	2507.01831	null
2025-07-02	FreeLoRA: Enabling Training-Free LoRA Fusion for Autoregressive Multi-Subject Personalization	Peng Zheng et.al.	2507.01792	null
2025-07-02	Frontiers of Generative AI for Network Optimization: Theories, Limits, and Visions	Bo Yang et.al.	2507.01773	null
2025-07-02	Enhanced Generative Model Evaluation with Clipped Density and Coverage	Nicolas Salvy et.al.	2507.01761	null
2025-06-30	Calligrapher: Freestyle Text Image Customization	Yue Ma et.al.	2506.24123	null
2025-06-30	TextMesh4D: High-Quality Text-to-4D Mesh Generation	Sisi Dai et.al.	2506.24121	null
2025-06-30	Epona: Autoregressive Diffusion World Model for Autonomous Driving	Kaiwen Zhang et.al.	2506.24113	null
2025-06-30	Navigating with Annealing Guidance Scale in Diffusion Space	Shai Yehezkel et.al.	2506.24108	null
2025-06-30	Imagine for Me: Creative Conceptual Blending of Real Images and Text via Blended Attention	Wonwoong Cho et.al.	2506.24085	null
2025-06-30	Scout-Dose-TCM: Direct and Prospective Scout-Based Estimation of Personalized Organ Doses from Tube Current Modulated CT Exams	Maria Jose Medrano et.al.	2506.24062	null
2025-06-30	Faster Diffusion Models via Higher-Order Approximation	Gen Li et.al.	2506.24042	null
2025-06-30	Unsupervised Sparse Coding-based Spiking Neural Network for Real-time Spike Sorting	Alexis Melot et.al.	2506.24041	null
2025-06-30	Supervised Diffusion-Model-Based PET Image Reconstruction	George Webber et.al.	2506.24034	null
2025-06-30	Minimally dissipative multi-bit logical operations	Jérémie Klinger et.al.	2506.24021	null
2025-06-30	High-precision polarization measurements with Lumped Element Kinetic Inductance Detectors	Sofia Savorgnano et.al.	2506.23983	null
2025-06-30	Vortex Detection from Quantum Data	Chelsea A. Williams et.al.	2506.23976	null
2025-06-30	World4Omni: A Zero-Shot Framework from Image Generation World Model to Robotic Manipulation	Haonan Chen et.al.	2506.23919	null
2025-06-30	Scaling Self-Supervised Representation Learning for Symbolic Piano Performance	Louis Bradshaw et.al.	2506.23869	null
2025-06-30	VMoBA: Mixture-of-Block Attention for Video Diffusion Models	Jianzong Wu et.al.	2506.23858	null
2025-06-27	Shape-for-Motion: Precise and Consistent Video Editing with 3D Proxy	Yuhao Liu et.al.	2506.22432	null
2025-06-30	Can Large Language Models Help Students Prove Software Correctness? An Experimental Study with Dafny	Carolina Carreira et.al.	2506.22370	null
2025-06-27	DiffSoundStream: Efficient Speech Tokenization via Diffusion Decoding	Yang Yang et.al.	2506.22362	null
2025-06-27	Unfolding Generative Flows with Koopman Operators: Fast and Interpretable Sampling	Erkan Turan et.al.	2506.22304	null
2025-06-27	OutDreamer: Video Outpainting with a Diffusion Transformer	Linhao Zhong et.al.	2506.22298	null
2025-06-27	Design and Evaluation of IEEE 802.11ax Uplink Orthogonal Frequency Division Multiple Random Access in ns-3	Douglas Dziedzorm Agbeve et.al.	2506.22260	null
2025-06-27	Hybrid Generative Modeling for Incomplete Physics: Deep Grey-Box Meets Optimal Transport	Gurjeet Sangra Singh et.al.	2506.22204	null
2025-06-27	Autonomic Microservice Management via Agentic AI and MAPE-K Integration	Matteo Esposito et.al.	2506.22185	null
2025-06-27	Few-Shot Identity Adaptation for 3D Talking Heads via Global Gaussian Field	Hong Nie et.al.	2506.22044	null
2025-06-27	Noise-Inspired Diffusion Model for Generalizable Low-Dose CT Reconstruction	Qi Gao et.al.	2506.22012	null
2025-06-27	RoboEnvision: A Long-Horizon Video Generation Model for Multi-Task Robot Manipulation	Liudi Yang et.al.	2506.22007	null
2025-06-27	StableCodec: Taming One-Step Diffusion for Extreme Image Compression	Tianyu Zhang et.al.	2506.21977	null
2025-06-27	Optimal Return-to-Go Guided Decision Transformer for Auto-Bidding in Advertisement	Hao Jiang et.al.	2506.21956	null
2025-06-27	CAL-RAG: Retrieval-Augmented Multi-Agent Generation for Content-Aware Layout Design	Najmeh Forouzandehmehr et.al.	2506.21934	null
2025-06-27	Joint Task Offloading and Resource Allocation in Low-Altitude MEC via Graph Attention Diffusion	Yifan Xue et.al.	2506.21933	null
2025-06-26	PsyLite Technical Report	Fangjun Ding et.al.	2506.21536	null
2025-06-26	WAFT: Warping-Alone Field Transforms for Optical Flow	Yihan Wang et.al.	2506.21526	null
2025-06-26	G $^{2}$ D: Boosting Multimodal Learning with Gradient-Guided Distillation	Mohammed Rakib et.al.	2506.21514	null
2025-06-26	GGTalker: Talking Head Systhesis with Generalizable Gaussian Priors and Identity-Specific Adaptation	Wentao Hu et.al.	2506.21513	null
2025-06-26	SmoothSinger: A Conditional Diffusion Model for Singing Voice Synthesis with Multi-Resolution Architecture	Kehan Sui et.al.	2506.21478	null
2025-06-26	Rethinking Oversaturation in Classifier-Free Guidance via Low Frequency	Kaiyu Song et.al.	2506.21452	null
2025-06-26	Controllable 3D Placement of Objects with Scene-Aware Diffusion Models	Mohamed Omran et.al.	2506.21446	null
2025-06-26	Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning	Prajwal Koirala et.al.	2506.21427	null
2025-06-26	XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation	Bowen Chen et.al.	2506.21416	null
2025-06-26	GenFlow: Interactive Modular System for Image Generation	Duc-Hung Nguyen et.al.	2506.21369	null
2025-06-26	CoPa-SG: Dense Scene Graphs with Parametric and Proto-Relations	Julian Lorenz et.al.	2506.21357	null
2025-06-26	Exploring Adapter Design Tradeoffs for Low Resource Music Generation	Atharva Mehta et.al.	2506.21298	null
2025-06-26	HieraSurg: Hierarchy-Aware Diffusion Model for Surgical Video Generation	Diego Biagini et.al.	2506.21287	null
2025-06-26	Hyperspherical Variational Autoencoders Using Efficient Spherical Cauchy Distribution	Lukas Sablica et.al.	2506.21278	null
2025-06-27	FairyGen: Storied Cartoon Video from a Single Child-Drawn Character	Jiayi Zheng et.al.	2506.21272	null
2025-06-25	EditP23: 3D Editing via Propagation of Image Prompts to Multi-View	Roi Bar-On et.al.	2506.20652	null
2025-06-25	Experimental demonstration of high compression of space by optical spaceplates	Ryan Hogan et.al.	2506.20647	null
2025-06-25	Telegrapher’s Generative Model via Kac Flows	Richard Duong et.al.	2506.20641	null
2025-06-26	DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation	Shansan Gong et.al.	2506.20639	null
2025-06-25	MC for Agriculture: A Framework for Nature-inspired Sustainable Pest Control	Fardad Vakilipoor et.al.	2506.20637	null
2025-06-25	Shape2Animal: Creative Animal Generation from Natural Silhouettes	Quoc-Duy Tran et.al.	2506.20616	null
2025-06-25	AI Assistants to Enhance and Exploit the PETSc Knowledge Base	Barry Smith et.al.	2506.20608	null
2025-06-25	Video Perception Models for 3D Scene Synthesis	Rui Huang et.al.	2506.20601	null
2025-06-25	SFNet: Fusion of Spatial and Frequency-Domain Features for Remote Sensing Image Forgery Detection	Ji Qi et.al.	2506.20599	null
2025-06-25	Pay Less Attention to Deceptive Artifacts: Robust Detection of Compressed Deepfakes on Online Social Networks	Manyi Li et.al.	2506.20548	null
2025-06-25	Demonstration of effective UCB-based routing in skill-based queues on real-world data	Sanne van Kempen et.al.	2506.20543	null
2025-06-25	Fast ground penetrating radar dual-parameter full waveform inversion method accelerated by hybrid compilation of CUDA kernel function and PyTorch	Lei Liu et.al.	2506.20513	null
2025-06-25	HiWave: Training-Free High-Resolution Image Generation via Wavelet-Based Diffusion Sampling	Tobias Vontobel et.al.	2506.20452	null
2025-06-25	Med-Art: Diffusion Transformer for 2D Medical Text-to-Image Generation	Changlu Guo et.al.	2506.20449	null
2025-06-26	That’s Not the Feedback I Need! – Student Engagement with GenAI Feedback in the Tutor Kai	Sven Jacobs et.al.	2506.20433	null
2025-06-24	Radial Attention: $O(n\log n)$ Sparse Attention with Energy Decay for Long Video Generation	Xingyang Li et.al.	2506.19852	null
2025-06-24	AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models	Zehuan Huang et.al.	2506.19851	null
2025-06-24	GenHSI: Controllable Generation of Human-Scene Interaction Videos	Zekun Li et.al.	2506.19840	null
2025-06-24	Improving Progressive Generation with Decomposable Flow Matching	Moayed Haji-Ali et.al.	2506.19839	null
2025-06-24	SimpleGVR: A Simple Baseline for Latent-Cascaded Video Super-Resolution	Liangbin Xie et.al.	2506.19838	null
2025-06-24	Machine Learning with Privacy for Protected Attributes	Saeed Mahloujifar et.al.	2506.19836	null
2025-06-24	A standard transformer and attention with linear biases for molecular conformer generation	Viatcheslav Gurev et.al.	2506.19834	null
2025-06-24	ProxelGen: Generating Proteins as 3D Densities	Felix Faltings et.al.	2506.19820	null
2025-06-24	CoCo4D: Comprehensive and Complex 4D Scene Generation	Junwei Zhou et.al.	2506.19798	null
2025-06-24	Line ratio identification of external photoevaporation	Tyger Peake et.al.	2506.19788	null
2025-06-24	Alleviating User-Sensitive bias with Fair Generative Sequential Recommendation Model	Yang Liu et.al.	2506.19777	null
2025-06-24	Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation	Jun Wang et.al.	2506.19774	null
2025-06-24	Noise Consistency Training: A Native Approach for One-Step Generator in Learning Additional Controls	Yihong Luo et.al.	2506.19741	null
2025-06-24	Integrated Balanced and Staggered Routing in Autonomous Mobility-on-Demand Systems	Antonio Coppola et.al.	2506.19722	null
2025-06-24	Guidance in the Frequency Domain Enables High-Fidelity Sampling at Low CFG Scales	Seyedmorteza Sadat et.al.	2506.19713	null
2025-06-23	Audit & Repair: An Agentic Framework for Consistent Story Visualization in Text-to-Image Diffusion Models	Kiymet Akdemir et.al.	2506.18900	null
2025-06-23	FilMaster: Bridging Cinematic Principles and Generative AI for Automated Film Generation	Kaiyi Huang et.al.	2506.18899	null
2025-06-23	MinD: Unified Visual Imagination and Control via Hierarchical World Models	Xiaowei Chi et.al.	2506.18897	null
2025-06-23	Let Your Video Listen to Your Music!	Xinyu Zhang et.al.	2506.18881	null
2025-06-23	OmniGen2: Exploration to Advanced Multimodal Generation	Chenyuan Wu et.al.	2506.18871	null
2025-06-23	OmniAvatar: Efficient Audio-Driven Avatar Video Generation with Adaptive Body Animation	Qijun Gan et.al.	2506.18866	null
2025-06-23	Comparative analysis of machine learning techniques for feature selection and classification of Fast Radio Bursts	Ailton J. B. Júnior et.al.	2506.18854	null
2025-06-23	Context-Aware CodeLLM Eviction for AI-assisted Coding	Kishanthan Thangarajah et.al.	2506.18796	null
2025-06-23	ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs	Michal Nazarczuk et.al.	2506.18792	null
2025-06-23	3D Arena: An Open Platform for Generative 3D Evaluation	Dylan Ebert et.al.	2506.18787	null
2025-06-23	DefFusionNet: Learning Multimodal Goal Shapes for Deformable Object Manipulation via a Diffusion-based Probabilistic Model	Bao Thach et.al.	2506.18779	null
2025-06-23	ContinualFlow: Learning and Unlearning with Neural Flow Matching	Lorenzo Simone et.al.	2506.18747	null
2025-06-24	MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners	Fang-Duo Tsai et.al.	2506.18729	null
2025-06-23	TCDiff++: An End-to-end Trajectory-Controllable Diffusion Model for Harmonious Music-Driven Group Choreography	Yuqin Dai et.al.	2506.18671	null
2025-06-23	Simulation-Free Differential Dynamics through Neural Conservation Laws	Mengjian Hua et.al.	2506.18604	null
2025-06-23	Emergent Temporal Correspondences from Video Diffusion Transformers	Jisu Nam et.al.	2506.17220	link
2025-06-20	DreamCube: 3D Panorama Generation via Multi-plane Synchronization	Yukun Huang et.al.	2506.17206	null
2025-06-20	Dex1B: Learning with 1B Demonstrations for Dexterous Manipulation	Jianglong Ye et.al.	2506.17198	null
2025-06-20	Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres	Samuel Howard et.al.	2506.17197	null
2025-06-20	Deep generative models as the probability transformation functions	Vitalii Bondar et.al.	2506.17171	null
2025-06-20	Proportional Sensitivity in Generative Adversarial Network (GAN)-Augmented Brain Tumor Classification Using Convolutional Neural Network	Mahin Montasir Afif et.al.	2506.17165	null
2025-06-20	MeDi: Metadata-Guided Diffusion Models for Mitigating Biases in Tumor Classification	David Jacob Drexlin et.al.	2506.17140	null
2025-06-20	Consistent Sampling and Simulation: Molecular Dynamics with Energy-Based Diffusion Models	Michael Plainer et.al.	2506.17139	link
2025-06-20	Assembler: Scalable 3D Part Assembly via Anchor Point Diffusion	Wang Zhao et.al.	2506.17074	null
2025-06-23	Generative Modeling of Full-Atom Protein Conformations using Latent Diffusion on Graph Embeddings	Aditya Sengar et.al.	2506.17064	link
2025-06-20	LSCD: Lomb-Scargle Conditioned Diffusion for Time series Imputation	Elizabeth Fons et.al.	2506.17039	null
2025-06-20	The Hidden Cost of an Image: Quantifying the Energy Consumption of AI Image Generation	Giulia Bertazzini et.al.	2506.17016	null
2025-06-20	Reversing Flow for Image Restoration	Haina Qin et.al.	2506.16961	null
2025-06-20	Wi-Fi Sensing Tool Release: Gathering 802.11ax Channel State Information from a Commercial Wi-Fi Access Point	Zisheng Wang et.al.	2506.16957	null
2025-06-20	RCNet: $ΔΣ$ IADCs as Recurrent AutoEncoders	Arnaud Verdant et.al.	2506.16903	null
2025-06-18	Nabla-R2D3: Effective and Efficient 3D Diffusion Alignment with 2D Rewards	Qingming Liu et.al.	2506.15684	null
2025-06-18	Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model	Anirud Aggarwal et.al.	2506.15682	link
2025-06-18	UniRelight: Learning Joint Decomposition and Synthesis for Video Relighting	Kai He et.al.	2506.15673	null
2025-06-18	deepSURF: Detecting Memory Safety Vulnerabilities in Rust Through Fuzzing LLM-Augmented Harnesses	Georgios Androutsopoulos et.al.	2506.15648	null
2025-06-18	Demystifying the Visual Quality Paradox in Multimodal Large Language Models	Shuo Xing et.al.	2506.15645	null
2025-06-18	HOIDiNi: Human-Object Interaction through Diffusion Noise Optimization	Roey Ron et.al.	2506.15625	null
2025-06-18	From Block to Byte: Transforming PCIe SSDs with CXL Memory Protocol and Instruction Annotation	Miryeong Kwon et.al.	2506.15613	null
2025-06-18	CXL-GPU: Pushing GPU Memory Boundaries with the Integration of CXL Technologies	Donghyun Gouk et.al.	2506.15601	null
2025-06-18	From Model to Classroom: Evaluating Generated MCQs for Portuguese with Narrative and Difficulty Concerns	Bernardo Leite et.al.	2506.15598	null
2025-06-18	LiteGD: Lightweight and dynamic GPU Dispatching for Large-scale Heterogeneous Clusters	Kunming Zhang et.al.	2506.15595	null
2025-06-18	One-Step Diffusion for Detail-Rich and Temporally Consistent Video Super-Resolution	Yujing Sun et.al.	2506.15591	link
2025-06-18	Gender Inclusivity Fairness Index (GIFI): A Multilevel Framework for Evaluating Gender Diversity in Large Language Models	Zhengyang Shan et.al.	2506.15568	link
2025-06-18	Control and Realism: Best of Both Worlds in Layout-to-Image without Training	Bonan Li et.al.	2506.15563	null
2025-06-18	Diff-TONE: Timestep Optimization for iNstrument Editing in Text-to-Music Diffusion Models	Teysir Baoueb et.al.	2506.15530	null
2025-06-18	GenHOI: Generalizing Text-driven 4D Human-Object Interaction Synthesis for Unseen Objects	Shujia Li et.al.	2506.15483	null
2025-06-17	CDP: Towards Robust Autoregressive Visuomotor Policy Learning via Causal Diffusion	Jiahua Ma et.al.	2506.14769	null
2025-06-17	Gravitational-wave background detection using machine learning	Hugo Einsle et.al.	2506.14764	null
2025-06-17	Cost-Aware Routing for Efficient Text-To-Image Generation	Qinchan et.al.	2506.14753	null
2025-06-17	Adaptive Accompaniment with ReaLchords	Yusong Wu et.al.	2506.14723	null
2025-06-17	Iterative Camera-LiDAR Extrinsic Optimization via Surrogate Diffusion	Ni Ou et.al.	2506.14706	null
2025-06-17	StreetLens: Enabling Human-Centered AI Agents for Neighborhood Assessment from Street View Imagery	Jina Kim et.al.	2506.14670	null
2025-06-17	Align Your Flow: Scaling Continuous-Time Flow Map Distillation	Amirmojtaba Sabour et.al.	2506.14603	null
2025-06-17	NetRoller: Interfacing General and Specialized Models for End-to-End Autonomous Driving	Ren Xin et.al.	2506.14589	link
2025-06-17	Risk Estimation of Knee Osteoarthritis Progression via Predictive Multi-task Modelling from Efficient Diffusion Model using X-ray Images	David Butler et.al.	2506.14560	null
2025-06-17	Accurate Depth-Resolved Temperature Profiling via Thermal-Radiation Spectroscopy: Numerical Methods vs Machine Learning	Dmitrii Shymkiv et.al.	2506.14554	null
2025-06-17	DreamLight: Towards Harmonious and Consistent Image Relighting	Yong Liu et.al.	2506.14549	null
2025-06-17	Using BDF schemes in the temporal integration of POD-ROM methods	Bosco García-Archilla et.al.	2506.14543	null
2025-06-17	Reimagining Target-Aware Molecular Generation through Retrieval-Enhanced Aligned Diffusion	Dong Xu et.al.	2506.14488	null
2025-06-17	SimSpark: Interactive Simulation of Social Media Behaviors	Ziyue Lin et.al.	2506.14476	null
2025-06-17	Active Digital Twins via Active Inference	Matteo Torzoni et.al.	2506.14453	null
2025-06-16	Diagnosing and Improving Diffusion Models by Estimating the Optimal Loss Value	Yixian Xu et.al.	2506.13763	null
2025-06-16	AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning	Zewei Zhou et.al.	2506.13757	link
2025-06-16	UltraZoom: Generating Gigapixel Images from Regular Photos	Jingwei Ma et.al.	2506.13756	null
2025-06-17	VideoPDE: Unified Generative PDE Solving via Video Inpainting Diffusion Models	Edward Li et.al.	2506.13754	null
2025-06-16	Vid-CamEdit: Video Camera Trajectory Editing with Generative Rendering from Estimated Geometry	Junyoung Seo et.al.	2506.13697	null
2025-06-16	UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions	Zhucun Xue et.al.	2506.13691	null
2025-06-16	Enforcing tail calibration when training probabilistic forecast models	Jakob Benjamin Wessel et.al.	2506.13687	link
2025-06-16	MultiViT2: A Data-augmented Multimodal Neuroimaging Prediction Framework via Latent Diffusion Model	Bi Yuda et.al.	2506.13667	null
2025-06-16	Exploiting the Exact Denoising Posterior Score in Training-Free Guidance of Diffusion Models	Gregory Bellchambers et.al.	2506.13614	null
2025-06-16	Dive3D: Diverse Distillation-based Text-to-3D Generation via Score Implicit Matching	Weimin Bai et.al.	2506.13594	null
2025-06-16	Flexible-length Text Infilling for Discrete Diffusion Models	Andrew Zhang et.al.	2506.13579	null
2025-06-16	X-Scene: Large-Scale Driving Scene Generation with High Fidelity and Flexible Controllability	Yu Yang et.al.	2506.13558	null
2025-06-16	Let’s play POLO: Integrating the probability of lesion origin into proton treatment plan optimization for low-grade glioma patients	Tim Ortkamp et.al.	2506.13539	null
2025-06-16	Seismic Acoustic Impedance Inversion Framework Based on Conditional Latent Generative Diffusion Model	Jie Chen et.al.	2506.13529	null
2025-06-16	UAV Object Detection and Positioning in a Mining Industrial Metaverse with Custom Geo-Referenced Data	Vasiliki Balaska et.al.	2506.13505	null
2025-06-13	A Robust Local Fréchet Regression Using Unbalanced Neural Optimal Transport with Applications to Dynamic Single-cell Genomics Data	Binghao Yan et.al.	2506.11969	null
2025-06-13	Automated Treatment Planning for Interstitial HDR Brachytherapy for Locally Advanced Cervical Cancer using Deep Reinforcement Learning	Mohammadamin Moradi et.al.	2506.11957	null
2025-06-13	Aligned Novel View Image and Geometry Synthesis via Cross-modal Attention Instillation	Min-Seop Kwak et.al.	2506.11924	null
2025-06-13	Measurement-aligned Flow for Inverse Problem	Shaorong Zhang et.al.	2506.11893	null
2025-06-13	Beyond Homogeneous Attention: Memory-Efficient LLMs via Fourier-Approximated KV Cache	Xiaoran Liu et.al.	2506.11886	null
2025-06-13	Radar Ranging Using Rydberg Atomic Homodyne Receiver	Minze Chen et.al.	2506.11833	null
2025-06-13	CLIP Meets Diffusion: A Synergistic Approach to Anomaly Detection	Byeongchan Lee et.al.	2506.11772	null
2025-06-13	DiffFuSR: Super-Resolution of all Sentinel-2 Multispectral Bands using Diffusion Models	Muhammad Sarmad et.al.	2506.11764	link
2025-06-13	Exploring the Effectiveness of Deep Features from Domain-Specific Foundation Models in Retinal Image Synthesis	Zuzanna Skorniewska et.al.	2506.11753	null
2025-06-13	Simulating realistic radio continuum survey maps with diffusion models	Tobias Vičánek Martínez et.al.	2506.11715	link
2025-06-13	Fusion of multi-source precipitation records via coordinate-based generative model	Sencan Sun et.al.	2506.11698	null
2025-06-13	FAA Framework: A Large Language Model-Based Approach for Credit Card Fraud Investigations	Shaun Shuster et.al.	2506.11635	null
2025-06-13	Amplifying Artifacts with Speech Enhancement in Voice Anti-spoofing	Thanapat Trachu et.al.	2506.11542	null
2025-06-13	Robust Filtering – Novel Statistical Learning and Inference Algorithms with Applications	Aamir Hussain Chughtai et.al.	2506.11530	null
2025-06-13	Foundation Models in Autonomous Driving: A Survey on Scenario Generation and Scenario Analysis	Yuan Gao et.al.	2506.11526	link
2025-06-12	SceneCompleter: Dense 3D Scene Completion for Generative Novel View Synthesis	Weiliang Chen et.al.	2506.10981	null
2025-06-12	InstaInpaint: Instant 3D-Scene Inpainting with Masked Large Reconstruction Model	Junqi You et.al.	2506.10980	null
2025-06-12	Fine-Grained Perturbation Guidance via Attention Head Selection	Donghoon Ahn et.al.	2506.10978	null
2025-06-12	GenWorld: Towards Detecting AI-generated Real-world Simulation Videos	Weiliang Chen et.al.	2506.10975	null
2025-06-12	What Exactly Does Guidance Do in Masked Discrete Diffusion Models	He Ye et.al.	2506.10971	null
2025-06-13	MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning	Yuxuan Luo et.al.	2506.10963	null
2025-06-12	SpectralAR: Spectral Autoregressive Visual Generation	Yuanhui Huang et.al.	2506.10962	null
2025-06-12	ReGuidance: A Simple Diffusion Wrapper for Boosting Sample Quality on Hard Inverse Problems	Aayush Karan et.al.	2506.10955	null
2025-06-12	SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks	Lianghong Guo et.al.	2506.10954	link
2025-06-12	Building a Media Ecosystem Observatory from Scratch: Infrastructure, Methodology, and Insights	Zeynep Pehlivan et.al.	2506.10942	null
2025-06-12	Sequential-Parallel Duality in Prefix Scannable Models	Morris Yau et.al.	2506.10918	null
2025-06-12	AIR: Zero-shot Generative Model Adaptation with Iterative Refinement	Guimeng Liu et.al.	2506.10895	link
2025-06-12	The Diffusion Duality	Subham Sekhar Sahoo et.al.	2506.10892	link
2025-06-12	MultiCoSim: A Python-based Multi-Fidelity Co-Simulation Framework	Quinn Thibeault et.al.	2506.10869	null
2025-06-12	LLM-Driven Personalized Answer Generation and Evaluation	Mohammadreza Molavi et.al.	2506.10829	null
2025-06-11	Text-Aware Image Restoration with Diffusion Models	Jaewon Min et.al.	2506.09993	null
2025-06-11	Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation	Xinyu Yang et.al.	2506.09991	null
2025-06-11	When Detection Fails: The Power of Fine-Tuned Models to Generate Human-Like Social Media Text	Hillary Dawkins et.al.	2506.09975	null
2025-06-11	A Branch-and-Cut Algorithm for the Optimal Design of Parking Lots with One-way and Two-way Lanes	Helen Thomas et.al.	2506.09961	null
2025-06-11	Canonical Latent Representations in Conditional Diffusion Models	Yitao Xu et.al.	2506.09955	null
2025-06-11	Microservices and Real-Time Processing in Retail IT: A Review of Open-Source Toolchains and Deployment Strategies	Aaditaa Vashisht et.al.	2506.09938	null
2025-06-11	Fluoroscopic Shape and Pose Tracking of Catheters with Custom Radiopaque Markers	Jared Lawson et.al.	2506.09934	null
2025-06-11	HadaNorm: Diffusion Transformer Quantization through Mean-Centered Transformations	Marco Federici et.al.	2506.09932	null
2025-06-11	On the Linear Programming Model for Dynamic Stochastic Matching and Its Application on Pricing	Junlin Chen et.al.	2506.09924	null
2025-06-12	Aspect-Based Opinion Summarization with Argumentation Schemes	Wendi Zhou et.al.	2506.09917	null
2025-06-11	Stakeholder Participation for Responsible AI Development: Disconnects Between Guidance and Current Practice	Emma Kallina et.al.	2506.09873	null
2025-06-11	A Deep Generative Model for the Simulation of Discrete Karst Networks	Dany Lauzon et.al.	2506.09832	null
2025-06-11	EmoNet-Voice: A Fine-Grained, Expert-Verified Benchmark for Speech Emotion Detection	Christoph Schuhmann et.al.	2506.09827	null
2025-06-11	ComfyUI-R1: Exploring Reasoning Models for Workflow Generation	Zhenran Xu et.al.	2506.09790	link
2025-06-11	ELBO-T2IAlign: A Generic ELBO-Based Method for Calibrating Pixel-level Text-Image Alignment in Diffusion Models	Qin Zhou et.al.	2506.09740	null
2025-06-10	MagCache: Fast Video Generation with Magnitude-Aware Cache	Zehong Ma et.al.	2506.09045	link
2025-06-10	Diffuse and Disperse: Image Generation with Representation Regularization	Runqian Wang et.al.	2506.09027	null
2025-06-10	Edit Flows: Flow Matching with Edit Operations	Marton Havasi et.al.	2506.09018	null
2025-06-10	Branched Schrödinger Bridge Matching	Sophia Tang et.al.	2506.09007	null
2025-06-10	Do Concept Replacement Techniques Really Erase Unacceptable Concepts?	Anudeep Das et.al.	2506.08991	null
2025-06-10	Yau-YauAL: A computer tool for solving nonlinear filtering problems	Yu Wang et.al.	2506.08976	null
2025-06-10	ORIDa: Object-centric Real-world Image Composition Dataset	Jinwoo Kim et.al.	2506.08964	null
2025-06-10	Evaluating Generative Vehicle Trajectory Models for Traffic Intersection Dynamics	Yash Ranjan et.al.	2506.08963	null
2025-06-10	IntTrajSim: Trajectory Prediction for Simulating Multi-Vehicle driving at Signalized Intersections	Yash Ranjan et.al.	2506.08957	null
2025-06-10	Striking Back At Cobalt: Using Network Traffic Metadata To Detect Cobalt Strike Masquerading Command and Control Channels	Clément Parssegny et.al.	2506.08922	link
2025-06-10	Quantifying Mix Network Privacy Erosion with Generative Models	Vasilios Mavroudis et.al.	2506.08918	null
2025-06-10	Product of Experts for Visual Generation	Yunzhi Zhang et.al.	2506.08894	null
2025-06-10	InfoDPCCA: Information-Theoretic Dynamic Probabilistic Canonical Correlation Analysis	Shiqin Tang et.al.	2506.08884	link
2025-06-10	FreqPolicy: Efficient Flow-based Visuomotor Policy via Frequency Consistency	Yifei Su et.al.	2506.08822	null
2025-06-10	Direct interferometric measurement of non-reciprocity induced by a plasmonic metasurface with false chirality	Ahmed Lafeef Ettapuram Naduvilepurayil et.al.	2506.08815	null
2025-06-09	StableMTL: Repurposing Latent Diffusion Models for Multi-Task Learning from Partially Annotated Synthetic Datasets	Anh-Quan Cao et.al.	2506.08013	link
2025-06-09	Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion	Xun Huang et.al.	2506.08009	null
2025-06-09	Dreamland: Controllable World Creation with Simulator and Generative Models	Sicheng Mo et.al.	2506.08006	null
2025-06-09	Dynamic View Synthesis as an Inverse Problem	Hidir Yesiltepe et.al.	2506.08004	null
2025-06-09	MADFormer: Mixed Autoregressive and Diffusion Transformers for Continuous Image Generation	Junhao Chen et.al.	2506.07999	null
2025-06-09	Generative Modeling of Weights: Generalization or Memorization?	Boya Zeng et.al.	2506.07998	link
2025-06-09	Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers	Zhengyao Lv et.al.	2506.07986	link
2025-06-09	Exposing Hidden Backdoors in NFT Smart Contracts: A Static Security Analysis of Rug Pull Patterns	Chetan Pathade et.al.	2506.07974	null
2025-06-09	Gradients: When Markets Meet Fine-tuning – A Distributed Approach to Model Optimisation	Christopher Subia-Waud et.al.	2506.07940	null
2025-06-09	Squeeze3D: Your 3D Generation Model is Secretly an Extreme Neural Compressor	Rishit Dagli et.al.	2506.07932	null
2025-06-09	Efficient Seismic Data Interpolation via Sparse Attention Transformer and Diffusion Model	Xiaoli Wei et.al.	2506.07923	null
2025-06-09	Ants3 toolkit: front-end for Geant4 with interactive GUI and Python scripting	A. Morozov et.al.	2506.07922	null
2025-06-09	Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces	Kevin Rojas et.al.	2506.07903	link
2025-06-09	FunDiff: Diffusion Models over Function Spaces for Physics-Informed Generative Modeling	Sifan Wang et.al.	2506.07902	link
2025-06-09	GaussianVAE: Adaptive Learning Dynamics of 3D Gaussians for High-Fidelity Super-Resolution	Shuja Khalid et.al.	2506.07897	null
2025-06-06	STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis	Jiatao Gu et.al.	2506.06276	null
2025-06-06	Model-Driven Graph Contrastive Learning	Ali Azizpour et.al.	2506.06212	null
2025-06-06	Antithetic Noise in Diffusion Models	Jing Jia et.al.	2506.06185	null
2025-06-06	ENMA: Tokenwise Autoregression for Generative Neural PDE Operators	Armand Kassaï Koupaï et.al.	2506.06158	null
2025-06-06	Masked Language Models are Good Heterogeneous Graph Generalizers	Jinyu Yang et.al.	2506.06157	link
2025-06-06	Joint-GCG: Unified Gradient-Based Poisoning Attacks on Retrieval-Augmented Generation Systems	Haowei Wang et.al.	2506.06151	link
2025-06-06	Feedback Guidance of Diffusion Models	Koulischer Felix et.al.	2506.06085	null
2025-06-06	HAVIR: HierArchical Vision to Image Reconstruction using CLIP-Guided Versatile Diffusion	Shiyi Zhang et.al.	2506.06035	null
2025-06-06	End-to-End Framework for Robot Lawnmower Coverage Path Planning using Cellular Decomposition	Nikunj Shah et.al.	2506.06028	null
2025-06-06	On Inverse Problems, Parameter Estimation, and Domain Generalization	Deborah Pereg et.al.	2506.06024	null
2025-06-06	Restereo: Diffusion stereo video generation and restoration	Xingchang Huang et.al.	2506.06023	null
2025-06-06	Optimization-Free Universal Watermark Forgery with Regenerative Diffusion Models	Chaoyi Zhu et.al.	2506.06018	link
2025-06-06	Gradual Transition from Bellman Optimality Operator to Bellman Operator in Online Reinforcement Learning	Motoki Omura et.al.	2506.05968	link
2025-06-09	AQUATIC-Diff: Additive Quantization for Truly Tiny Compressed Diffusion Models	Adil Hasan et.al.	2506.05960	null
2025-06-06	Exponential Family Variational Flow Matching for Tabular Data Generation	Andrés Guzmán-Cordero et.al.	2506.05940	null
2025-06-05	Contrastive Flow Matching	George Stoica et.al.	2506.05350	link
2025-06-05	ContentV: Efficient Training of Video Generation Models with Limited Compute	Wenfeng Lin et.al.	2506.05343	null
2025-06-05	Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning	Xingjian Ran et.al.	2506.05341	null
2025-06-05	Exploring Diffusion Transformer Designs via Grafting	Keshigeyan Chandrasegaran et.al.	2506.05340	link
2025-06-05	LSM-2: Learning from Incomplete Wearable Sensor Data	Maxwell A. Xu et.al.	2506.05321	null
2025-06-05	Learning normalized image densities via dual score matching	Florentin Guth et.al.	2506.05310	null
2025-06-05	Video World Models with Long-term Spatial Memory	Tong Wu et.al.	2506.05284	null
2025-06-05	How to Unlock Time Series Editing? Diffusion-Driven Approach with Multi-Grained Control	Hao Yu et.al.	2506.05276	null
2025-06-05	Aligning Latent Spaces with Flow Priors	Yizhuo Li et.al.	2506.05240	null
2025-06-05	Progressive Tempering Sampler with Diffusion	Severi Rissanen et.al.	2506.05231	link
2025-06-05	DSG-World: Learning a 3D Gaussian World Model from Dual State Videos	Wenhao Hu et.al.	2506.05217	null
2025-06-05	OGGSplat: Open Gaussian Growing for Generalizable Reconstruction with Expanded Field-of-View	Yanbo Wang et.al.	2506.05204	link
2025-06-05	Quantifying Cross-Modality Memorization in Vision-Language Models	Yuxin Wen et.al.	2506.05198	null
2025-06-05	Associative Memory and Generative Diffusion in the Zero-noise Limit	Joshua Hess et.al.	2506.05178	null
2025-06-05	Neural Jumps for Option Pricing	Duosi Zheng et.al.	2506.05137	null
2025-06-04	Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation	Tianyu Huang et.al.	2506.04225	null
2025-06-04	UNIC: Unified In-Context Video Editing	Zixuan Ye et.al.	2506.04216	null
2025-06-04	Sounding that Object: Interactive Object-Aware Image to Audio Generation	Tingle Li et.al.	2506.04214	null
2025-06-04	Diffusion Domain Teacher: Diffusion Guided Domain Adaptive Object Detector	Boyong He et.al.	2506.04211	link
2025-06-04	Physics-Constrained Flow Matching: Sampling Generative Models with Hard Constraints	Utkarsh Utkarsh et.al.	2506.04171	null
2025-06-04	Image Editing As Programs with Diffusion Models	Yujia Hu et.al.	2506.04158	null
2025-06-04	SLAC: Simulation-Pretrained Latent Action Space for Whole-Body Real-World RL	Jiaheng Hu et.al.	2506.04147	null
2025-06-04	Person Re-Identification System at Semantic Level based on Pedestrian Attributes Ontology	Ngoc Q. Ly et.al.	2506.04143	null
2025-06-04	Plant Bioelectric Early Warning Systems: A Five-Year Investigation into Human-Plant Electromagnetic Communication	Peter A. Gloor et.al.	2506.04132	link
2025-06-04	Global convergence rates in the relaxation limits for the compressible Euler and Euler-Maxwell systems in Sobolev spaces	Timothée Crin-Barat et.al.	2506.04103	null
2025-06-04	A Generative Adaptive Replay Continual Learning Model for Temporal Knowledge Graph Reasoning	Zhiyu Zhang et.al.	2506.04083	null
2025-06-04	A Statistics-Driven Differentiable Approach for Sound Texture Synthesis and Analysis	Esteban Gutiérrez et.al.	2506.04073	null
2025-06-04	Progressive Mastery: Customized Curriculum Learning with Guided Prompting for Mathematical Reasoning	Muling Wu et.al.	2506.04065	null
2025-06-04	Towards generating more interpretable counterfactuals via concept vectors: a preliminary study on chest X-rays	Bulat Maksudov et.al.	2506.04058	link
2025-06-04	Explainability-Based Token Replacement on LLM-Generated Text	Hadi Mohammadi et.al.	2506.04050	null
2025-06-03	Native-Resolution Image Synthesis	Zidong Wang et.al.	2506.03131	null
2025-06-03	AnimeShooter: A Multi-Shot Animation Dataset for Reference-Guided Video Generation	Lu Qiu et.al.	2506.03126	null
2025-06-03	DCM: Dual-Expert Consistency Model for Efficient and High-Quality Video Generation	Zhengyao Lv et.al.	2506.03123	null
2025-06-03	Rectified Flows for Fast Multiscale Fluid Flow Modeling	Victor Armegioiu et.al.	2506.03111	null
2025-06-03	TalkingMachines: Real-Time Audio-Driven FaceTime-Style Video via Autoregressive Diffusion Models	Chetwin Low et.al.	2506.03099	null
2025-06-03	SG2VID: Scene Graphs Enable Fine-Grained Control for Video Synthesis	Ssharvien Kumar Sivakumar et.al.	2506.03082	null
2025-06-03	ORV: 4D Occupancy-centric Robot Video Generation	Xiuyu Yang et.al.	2506.03079	link
2025-06-03	EDITOR: Effective and Interpretable Prompt Inversion for Text-to-Image Diffusion Models	Mingzhe Li et.al.	2506.03067	null
2025-06-03	Sample complexity of Schrödinger potential estimation	Nikita Puchkin et.al.	2506.03043	null
2025-06-03	TestAgent: An Adaptive and Intelligent Expert for Human Assessment	Junhao Yu et.al.	2506.03032	null
2025-06-03	DFBench: Benchmarking Deepfake Image Detection Capability of Large Multimodal Models	Jiarui Wang et.al.	2506.03007	null
2025-06-03	PartComposer: Learning and Composing Part-Level Concepts from Single-Image Examples	Junyu Liu et.al.	2506.03004	null
2025-06-03	Astrophotography turbulence mitigation via generative models	Joonyeoup Kim et.al.	2506.02981	null
2025-06-03	UniConFlow: A Unified Constrained Generalization Framework for Certified Motion Planning with Flow Matching Models	Zewen Yang et.al.	2506.02955	null
2025-06-03	Elasticity of substitution and general model of economic growth	Constantin Chilarescu et.al.	2506.02936	null
2025-05-30	AdaHuman: Animatable Detailed 3D Human Generation with Compositional Multiview Diffusion	Yangyi Huang et.al.	2505.24877	null
2025-05-30	ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RL	Yu Zhang et.al.	2505.24875	null
2025-05-30	MiniMax-Remover: Taming Bad Noise Helps Video Object Removal	Bojia Zi et.al.	2505.24873	null
2025-05-30	GenSpace: Benchmarking Spatially-Aware Image Generation	Zehan Wang et.al.	2505.24870	null
2025-05-30	TalkingHeadBench: A Multi-Modal Benchmark & Analysis of Talking-Head DeepFake Detection	Xinqi Xiong et.al.	2505.24866	null
2025-05-30	ViStoryBench: Comprehensive Benchmark Suite for Story Visualization	Cailin Zhuang et.al.	2505.24862	link
2025-05-30	Accelerated Sampling from Masked Diffusion Models via Entropy Bounded Unmasking	Heli Ben-Hamu et.al.	2505.24857	null
2025-05-30	RealDrive: Retrieval-Augmented Driving with Diffusion Models	Wenhao Ding et.al.	2505.24808	null
2025-05-30	Inference Acceleration of Autoregressive Normalizing Flows by Selective Jacobi Decoding	Jiaru Zhang et.al.	2505.24791	null
2025-05-30	AXIOM: Learning to Play Games in Minutes with Expanding Object-Centric Models	Conor Heins et.al.	2505.24784	null
2025-05-30	QGAN-based data augmentation for hybrid quantum-classical neural networks	Run-Ze He et.al.	2505.24780	null
2025-06-03	EVA-MILP: Towards Standardized Evaluation of MILP Instance Generation	Yidong Luo et.al.	2505.24779	link
2025-05-30	Diffusion-Based Symbolic Regression	Zachary Bastiani et.al.	2505.24776	null
2025-05-30	Supporting product launching decisions with adversarial risk analysis	Pablo G. Arce et.al.	2505.24771	link
2025-05-30	Generalization Dynamics of Linear Diffusion Models	Claudia Merger et.al.	2505.24769	null
2025-05-29	LoRAShop: Training-Free Multi-Concept Image Generation and Editing with Rectified Flow Transformers	Yusuf Dalva et.al.	2505.23758	null
2025-05-29	DarkDiff: Advancing Low-Light Raw Enhancement by Retasking Diffusion Models for Camera ISP	Amber Yijia Zheng et.al.	2505.23743	null
2025-05-29	MAGREF: Masked Guidance for Any-Reference Video Generation	Yufan Deng et.al.	2505.23742	link
2025-05-29	LayerPeeler: Autoregressive Peeling for Layer-wise Image Vectorization	Ronghuan Wu et.al.	2505.23740	null
2025-05-29	How Animals Dance (When You’re Not Looking)	Xiaojuan Wang et.al.	2505.23738	null
2025-05-29	SC-LoRA: Balancing Efficient Fine-tuning and Knowledge Preservation via Subspace-Constrained LoRA	Minrui Luo et.al.	2505.23724	null
2025-05-29	DiffER: Categorical Diffusion for Chemical Retrosynthesis	Sean Current et.al.	2505.23721	link
2025-05-29	DiCoFlex: Model-agnostic diverse counterfactuals with flexible control	Oleksii Furman et.al.	2505.23700	null
2025-05-29	ImmunoDiff: A Diffusion Model for Immunotherapy Response Prediction in Lung Cancer	Moinak Bhattacharya et.al.	2505.23675	null
2025-05-30	OpenUni: A Simple Baseline for Unified Multimodal Understanding and Generation	Size Wu et.al.	2505.23661	link
2025-05-29	VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models	Xiangdong Zhang et.al.	2505.23656	link
2025-05-29	Optimization-Free Diffusion Model – A Perturbation Theory Approach	Yuehaw Khoo et.al.	2505.23652	null
2025-05-29	ZeroSep: Separate Anything in Audio with Zero Training	Chao Huang et.al.	2505.23625	null
2025-05-29	Few-Shot Speech Deepfake Detection Adaptation with Gaussian Processes	Neta Glazer et.al.	2505.23619	link
2025-05-29	Inference-time Scaling of Diffusion Models through Classical Search	Xiangcheng Zhang et.al.	2505.23614	null
2025-05-28	SPIRAL: Semantic-Aware Progressive LiDAR Scene Generation	Dekai Zhu et.al.	2505.22643	null
2025-05-28	Principled Out-of-Distribution Generalization via Simplicity	Jiawei Ge et.al.	2505.22622	null
2025-05-28	Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding	Chengyue Wu et.al.	2505.22618	null
2025-05-28	TPDE: A Fast Adaptable Compiler Back-End Framework	Tobias Schwarz et.al.	2505.22610	link
2025-05-28	GitGoodBench: A Novel Benchmark For Evaluating Agentic Performance On Git	Tobias Lindenbauer et.al.	2505.22583	link
2025-05-28	ImageReFL: Balancing Quality and Diversity in Human-Aligned Diffusion Models	Dmitrii Sorokin et.al.	2505.22569	null
2025-05-28	TabularQGAN: A Quantum Generative Model for Tabular Data	Pallavi Bhardwaj et.al.	2505.22533	null
2025-05-28	Symplectic Generative Networks (SGNs): A Hamiltonian Framework for Invertible Deep Generative Modeling	Agnideep Aich et.al.	2505.22527	null
2025-05-28	Test-Time Alignment of Discrete Diffusion Models with Sequential Monte Carlo	Chinmay Pani et.al.	2505.22524	null
2025-05-28	PrismLayers: Open Data for High-Quality Multi-Layer Transparent Image Generative Models	Junwen Chen et.al.	2505.22523	null
2025-05-28	ProSpero: Active Learning for Robust Protein Design Beyond Wild-Type Neighborhoods	Michal Kmicikiewicz et.al.	2505.22494	null
2025-05-28	Cascaded 3D Diffusion Models for Whole-body 3D 18-F FDG PET/CT synthesis from Demographics	Siyeop Yoon et.al.	2505.22489	null
2025-05-28	Understanding Adversarial Training with Energy-based Models	Mujtaba Hussain Mirza et.al.	2505.22486	null
2025-05-28	CPINN-ABPI: Physics-Informed Neural Networks for Accurate Power Estimation in MPSoCs	Mohamed R. Elshamy et.al.	2505.22469	null
2025-05-29	Topological Structure Learning Should Be A Research Priority for LLM-Based Multi-Agent Systems	Jiaxi Yang et.al.	2505.22467	null
2025-05-27	Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers	Wei Pang et.al.	2505.21497	link
2025-05-27	Be Decisive: Noise-Induced Layouts for Multi-Subject Generation	Omer Dahary et.al.	2505.21488	null
2025-05-27	PropMolFlow: Property-guided Molecule Generation with Geometry-Complete Flow Matching	Cheng Zeng et.al.	2505.21469	null
2025-05-27	Accelerating Diffusion Language Model Inference via Efficient KV Caching and Guided Diffusion	Zhanqiu Hu et.al.	2505.21467	null
2025-05-27	Designing Cyclic Peptides via Harmonic SDE with Atom-Bond Modeling	Xiangxin Zhou et.al.	2505.21452	null
2025-05-27	CoDA: Coordinated Diffusion Noise Optimization for Whole-Body Manipulation of Articulated Objects	Huaijin Pi et.al.	2505.21437	null
2025-05-27	Learning Individual Behavior in Agent-Based Models with Graph Diffusion Networks	Francesco Cozzi et.al.	2505.21426	link
2025-05-27	GUARD:Dual-Agent based Backdoor Defense on Chain-of-Thought in Neural Code Generation	Naizhu Jin et.al.	2505.21425	null
2025-05-27	A Framework for Adversarial Analysis of Decision Support Systems Prior to Deployment	Brett Bissey et.al.	2505.21414	null
2025-05-27	A Convergence Theory for Diffusion Language Models: An Information-Theoretic Perspective	Gen Li et.al.	2505.21400	null
2025-05-28	OVERT: A Benchmark for Over-Refusal Evaluation on Text-to-Image Models	Ziheng Cheng et.al.	2505.21347	link
2025-05-28	MagicTryOn: Harnessing Diffusion Transformer for Garment-Preserving Video Virtual Try-on	Guangyuan Li et.al.	2505.21325	null
2025-05-27	Evaluation of LLMs in Medical Text Summarization: The Role of Vocabulary Adaptation in High OOV Settings	Gunjan Balde et.al.	2505.21242	null
2025-05-28	Custom Representations of Inductive Families	Constantine Theocharis et.al.	2505.21225	null
2025-05-27	Simulations of the churning mode: toroidally symmetric plasma convection and turbulence around the X-points in a snowflake divertor	D Power et.al.	2505.21223	null
2025-05-26	Multimodal Federated Learning With Missing Modalities through Feature Imputation Network	Pranav Poudel et.al.	2505.20232	null
2025-05-26	Continuous Learning for Children’s ASR: Overcoming Catastrophic Forgetting with Elastic Weight Consolidation and Synaptic Intelligence	Edem Ahadzi et.al.	2505.20216	null
2025-05-26	Adaptive Classifier-Free Guidance via Dynamic Low-Confidence Masking	Pengxiang Li et.al.	2505.20199	link
2025-05-26	Private Geometric Median in Nearly-Linear Time	Syamantak Kumar et.al.	2505.20189	null
2025-05-26	Exposing Go’s Hidden Bugs: A Novel Concolic Framework	Karolina Gorna et.al.	2505.20183	link
2025-05-26	Long-Context State-Space Video World Models	Ryan Po et.al.	2505.20171	null
2025-05-26	MolEditRL: Structure-Preserving Molecular Editing via Discrete Diffusion and Reinforcement Learning	Yuanxin Zhuang et.al.	2505.20131	null
2025-05-26	Understanding Generalization in Diffusion Models via Probability Flow Distance	Huijie Zhang et.al.	2505.20123	null
2025-05-26	Proxy-Free GFlowNet	Ruishuo Chen et.al.	2505.20110	null
2025-05-26	Refining Few-Step Text-to-Multiview Diffusion via Reinforcement Learning	Ziyi Zhang et.al.	2505.20107	link
2025-05-26	Safety Through Reasoning: An Empirical Study of Reasoning Guardrail Models	Makesh Narsimhan Sreedhar et.al.	2505.20087	null
2025-05-26	PAMD: Plausibility-Aware Motion Diffusion Model for Long Dance Generation	Hongsong Wang et.al.	2505.20056	null
2025-05-26	Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion	Zheqi Lv et.al.	2505.20053	link
2025-05-26	The Many Challenges of Human-Like Agents in Virtual Game Environments	Maciej Świechowski et.al.	2505.20011	null
2025-05-26	ICDM: Interference Cancellation Diffusion Models for Wireless Semantic Communications	Tong Wu et.al.	2505.19983	null
2025-05-26	Rethinking Probabilistic Circuit Parameter Learning	Anji Liu et.al.	2505.19982	null
2025-05-26	UltraVSR: Achieving Ultra-Realistic Video Super-Resolution with Efficient One-Step Diffusion Space	Yong Liu et.al.	2505.19958	null
2025-05-26	Underwater Diffusion Attention Network with Contrastive Language-Image Joint Learning for Underwater Image Enhancement	Afrah Shaahid et.al.	2505.19895	null
2025-05-26	A fully automated urban PV parameterization framework for improved estimation of energy production profiles	Bowen Tian et.al.	2505.19876	null
2025-05-26	StyleAR: Customizing Multimodal Autoregressive Model for Style-Aligned Text-to-Image Generation	Yi Wu et.al.	2505.19874	null
2025-05-26	Harnessing the Power of Training-Free Techniques in Text-to-2D Generation for Text-to-3D Generation via Score Distillation Sampling	Junhong Lee et.al.	2505.19868	null
2025-05-23	Generative Distribution Embeddings	Nic Fishman et.al.	2505.18150	link
2025-05-23	Stochastic agent-based Monte Carlo simulations for reaction-diffusion models, population dynamics, and epidemic spreading	Mohamed Swailem et.al.	2505.18145	null
2025-05-26	TokBench: Evaluating Your Visual Tokenizer before Visual Generation	Junfeng Wu et.al.	2505.18142	null
2025-05-23	One RL to See Them All: Visual Triple Unified Reinforcement Learning	Yan Ma et.al.	2505.18129	null
2025-05-23	Towards more transferable adversarial attack in black-box manner	Chun Tong Lei et.al.	2505.18097	null
2025-05-23	DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations	Ziqiao Peng et.al.	2505.18096	null
2025-05-23	SpikeGen: Generative Framework for Visual Spike Stream Processing	Gaole Dai et.al.	2505.18049	null
2025-05-23	RestoreVAR: Visual Autoregressive Generation for All-in-One Image Restoration	Sudarshan Rajagopalan et.al.	2505.18047	null
2025-05-26	Strictly Constrained Generative Modeling via Split Augmented Langevin Sampling	Matthieu Blanke et.al.	2505.18017	link
2025-05-23	Segment Anyword: Mask Prompt Inversion for Open-Set Grounded Segmentation	Zhihua Liu et.al.	2505.17994	null
2025-05-23	To Glue or Not to Glue? Classical vs Learned Image Matching for Mobile Mapping Cameras to Textured Semantic 3D Building Models	Simone Gaisbauer et.al.	2505.17973	link
2025-05-23	Diffusion Classifiers Understand Compositionality, but Conditions Apply	Yujin Jeong et.al.	2505.17955	link
2025-05-23	SplatCo: Structure-View Collaborative Gaussian Splatting for Detail-Preserving Rendering of Large-Scale Unbounded Scenes	Haihong Xiao et.al.	2505.17951	null
2025-05-23	Survival Games: Human-LLM Strategic Showdowns under Severe Resource Scarcity	Zhihong Chen et.al.	2505.17937	link
2025-05-23	Flexible MOF Generation with Torsion-Aware Flow Matching	Nayoung Kim et.al.	2505.17914	null
2025-05-22	GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning	Chengqi Duan et.al.	2505.17022	link
2025-05-22	When Are Concepts Erased From Diffusion Models?	Kevin Lu et.al.	2505.17013	link
2025-05-22	Guided Diffusion Sampling on Function Spaces with Applications to PDEs	Jiachen Yao et.al.	2505.17004	link
2025-05-22	Pursuing Temporal-Consistent Video Virtual Try-On via Dynamic Pose Interaction	Dong Li et.al.	2505.16980	null
2025-05-22	Incorporating Visual Correspondence into Diffusion Model for Virtual Try-On	Siqi Wan et.al.	2505.16977	link
2025-05-22	Creatively Upscaling Images with Global-Regional Priors	Yurui Qian et.al.	2505.16976	null
2025-05-22	Bigger Isn’t Always Memorizing: Early Stopping Overparameterized Diffusion Models	Alessandro Favero et.al.	2505.16959	null
2025-05-22	From Reality to Virtual Worlds: The Role of Photogrammetry in Game Development	Santiago Berrezueta-Guzman et.al.	2505.16951	null
2025-05-22	LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning	Zebin You et.al.	2505.16933	null
2025-05-22	Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks	Hongyuan Tao et.al.	2505.16901	null
2025-05-22	T2I-ConBench: Text-to-Image Benchmark for Continual Post-training	Zhehao Huang et.al.	2505.16875	null
2025-05-22	Training-Free Efficient Video Generation via Dynamic Token Carving	Yuechen Zhang et.al.	2505.16864	link
2025-05-22	Conditional Panoramic Image Generation via Masked Autoregressive Modeling	Chaoyang Wang et.al.	2505.16862	null
2025-05-23	LaViDa: A Large Diffusion Language Model for Multimodal Understanding	Shufan Li et.al.	2505.16839	link
2025-05-22	From EduVisBench to EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization	Haonian Ji et.al.	2505.16832	link
2025-05-21	Leveraging the Powerful Attention of a Pre-trained Diffusion Model for Exemplar-based Image Colorization	Satoshi Kosugi et.al.	2505.15812	link
2025-05-21	On the creation of narrow AI: hierarchy and nonlocality of neural network skills	Eric J. Michaud et.al.	2505.15811	link
2025-05-21	Neural Conditional Transport Maps	Carlos Rodriguez-Pardo et.al.	2505.15808	null
2025-05-21	Interspatial Attention for Efficient 4D Human Video Generation	Ruizhi Shao et.al.	2505.15800	null
2025-05-21	VARD: Efficient and Dense Fine-Tuning for Diffusion Models with Value-based RL	Fengyuan Dai et.al.	2505.15791	null
2025-05-21	Exploring the Innovation Opportunities for Pre-trained Models	Minjung Park et.al.	2505.15790	null
2025-05-21	IA-T2I: Internet-Augmented Text-to-Image Generation	Chuanhao Li et.al.	2505.15779	null
2025-05-21	Constructing a 3D Town from a Single Image	Kaizhi Zheng et.al.	2505.15765	null
2025-05-21	HybridProver: Augmenting Theorem Proving with LLM-Driven Proof Synthesis and Refinement	Jilin Hu et.al.	2505.15740	null
2025-05-21	Distributionally Robust Planning of Hydrogen-Electrical Microgrids for Sea Islands	Yuchen Dong et.al.	2505.15733	null
2025-05-21	Can Large Language Models be Effective Online Opinion Miners?	Ryang Heo et.al.	2505.15695	link
2025-05-21	SwarmDiff: Swarm Robotic Trajectory Planning in Cluttered Environments via Diffusion Transformer	Kang Ding et.al.	2505.15679	null
2025-05-21	Graph Conditional Flow Matching for Relational Data Generation	Davide Scassola et.al.	2505.15668	link
2025-05-21	FragFake: A Dataset for Fine-Grained Detection of Edited Images with Vision Language Models	Zhen Sun et.al.	2505.15644	link
2025-05-21	Trial and Return Option Strategy in Omnichannel Retailing	Yasuyuki Kusuda et.al.	2505.15597	null
2025-05-20	NExT-Search: Rebuilding User Feedback Ecosystem for Generative AI Search	Sunhao Dai et.al.	2505.14680	null
2025-05-20	Training-Free Watermarking for Autoregressive Image Generation	Yu Tong et.al.	2505.14673	link
2025-05-21	General-Reasoner: Advancing LLM Reasoning Across All Domains	Xueguang Ma et.al.	2505.14652	null
2025-05-20	Enhancing Learned Knowledge in LoRA Adapters Through Efficient Contrastive Decoding on Ascend NPUs	Morgan Lindsay Heisler et.al.	2505.14620	null
2025-05-20	Towards a Foundation Model for Communication Systems	Davide Buffelli et.al.	2505.14603	null
2025-05-20	Neural Inverse Scattering with Score-based Regularization	Yuan Gao et.al.	2505.14560	null
2025-05-20	Dynadiff: Single-stage Decoding of Images from Continuously Evolving fMRI	Marlène Careil et.al.	2505.14556	link
2025-05-20	GUARD: Constructing Realistic Two-Player Matrix and Security Games for Benchmarking Game-Theoretic Algorithms	Noah Krever et.al.	2505.14547	link
2025-05-20	NavBench: A Unified Robotics Benchmark for Reinforcement Learning-Based Autonomous Navigation	Matteo El-Hariry et.al.	2505.14526	null
2025-05-21	Sparc3D: Sparse Representation and Construction for High-Resolution 3D Shapes Modeling	Zhihao Li et.al.	2505.14521	null
2025-05-20	Learning to Integrate Diffusion ODEs by Averaging the Derivatives	Wenze Liu et.al.	2505.14502	null
2025-05-20	A Direct Comparison of Simultaneously Recorded Scalp, Around-Ear, and In-Ear EEG for Neural Selective Auditory Attention Decoding to Speech	Simon Geirnaert et.al.	2505.14478	null
2025-05-20	Enhancing Interpretability of Sparse Latent Representations with Class Information	Farshad Sangari Abiz et.al.	2505.14476	null
2025-05-20	CtrlDiff: Boosting Large Diffusion Language Models with Dynamic Block Prediction and Controllable Generation	Chihan Huang et.al.	2505.14455	null
2025-05-20	Compositional amortized inference for large-scale hierarchical Bayesian models	Jonas Arruda et.al.	2505.14429	null
2025-05-19	Mean Flows for One-step Generative Modeling	Zhengyang Geng et.al.	2505.13447	null
2025-05-19	Synthetic-Powered Predictive Inference	Meshi Bashari et.al.	2505.13432	link
2025-05-20	A Practical Guide for Incorporating Symmetry in Diffusion Policy	Dian Wang et.al.	2505.13431	null
2025-05-19	Faster Video Diffusion with Trainable Sparse Attention	Peiyuan Zhang et.al.	2505.13389	null
2025-05-19	Restoration Score Distillation: From Corrupted Diffusion Pretraining to One-Step High-Quality Generation	Yasi Zhang et.al.	2505.13377	null
2025-05-20	Minimum-Excess-Work Guidance	Christopher Kolloff et.al.	2505.13375	null
2025-05-20	One-Step Offline Distillation of Diffusion-based Models via Koopman Modeling	Nimrod Berman et.al.	2505.13358	link
2025-05-19	Frequency-Dependent Power Consumption Modeling of CMOS Transmitters for WNoC Architectures	Mohammad Shahmoradi et.al.	2505.13310	null
2025-05-19	FlowPure: Continuous Normalizing Flows for Adversarial Purification	Elias Collaert et.al.	2505.13280	link
2025-05-19	Seeing the Unseen: How EMoE Unveils Bias in Text-to-Image Diffusion Models	Lucas Berry et.al.	2505.13273	null
2025-05-19	Distilling a speech and music encoder with task arithmetic	Fabian Ritter-Gutierrez et.al.	2505.13270	null
2025-05-19	Correlation between U/Th and Pb/Os abundance ratios and its application in nuclear cosmochronology	Y. Y. Huang et.al.	2505.13269	null
2025-05-19	JNLP at SemEval-2025 Task 11: Cross-Lingual Multi-Label Emotion Detection Using Generative Models	Jieying Xue et.al.	2505.13244	link
2025-05-19	Conformalized Decision Risk Assessment	Wenbin Zhou et.al.	2505.13243	null
2025-05-19	Diffusion Models with Double Guidance: Generate with aggregated datasets	Yanfeng Yang et.al.	2505.13213	null
2025-05-16	Evolution of granular salty ice analogs for Europa: Sublimation and Irradiation	Rafael Ottersberg et.al.	2505.11498	null
2025-05-16	QVGen: Pushing the Limit of Quantized Video Generative Models	Yushi Huang et.al.	2505.11497	null
2025-05-16	Unsupervised Detection of Distribution Shift in Inverse Problems using Diffusion Models	Shirin Shoushtari et.al.	2505.11482	null
2025-05-16	PSDiffusion: Harmonized Multi-Layer Image Generation via Layout and Appearance Alignment	Dingbang Huang et.al.	2505.11468	null
2025-05-16	Exploiting Radiance Fields for Grasp Generation on Novel Synthetic Views	Abhishek Kashyap et.al.	2505.11467	null
2025-05-16	Disentangling Reasoning and Knowledge in Medical Large Language Models	Rahul Thapa et.al.	2505.11462	null
2025-05-16	A Generative Framework for Causal Estimation via Importance-Weighted Diffusion Distillation	Xinran Song et.al.	2505.11444	null
2025-05-19	MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production	Chao Jin et.al.	2505.11432	null
2025-05-16	Diff-Unfolding: A Model-Based Score Learning Framework for Inverse Problems	Yuanhao Wang et.al.	2505.11393	null
2025-05-16	LipDiffuser: Lip-to-Speech Generation with Conditional Diffusion Models	Danilo de Oliveira et.al.	2505.11391	null
2025-05-16	MARRS: Masked Autoregressive Unit-based Reaction Synthesis	Y. B. Wang et.al.	2505.11334	null
2025-05-16	Decomposing stimulus-specific sensory neural information via diffusion models	Steeve Laquitaine et.al.	2505.11309	null
2025-05-16	Effective Probabilistic Time Series Forecasting with Fourier Adaptive Noise-Separated Diffusion	Xinyan Wang et.al.	2505.11306	null
2025-05-16	A Fourier Space Perspective on Diffusion Models	Fabian Falck et.al.	2505.11278	null
2025-05-16	DRAGON: A Large-Scale Dataset of Realistic Images Generated by Diffusion Models	Giulia Bertazzini et.al.	2505.11257	null
2025-05-15	3D-Fixup: Advancing Photo Editing with 3D Priors	Yen-Chi Cheng et.al.	2505.10566	null
2025-05-15	T2A-Feedback: Improving Basic Capabilities of Text-to-Audio Generation via Fine-grained AI Feedback	Zehan Wang et.al.	2505.10561	null
2025-05-15	Style Customization of Text-to-Vector Generation with Image Diffusion Priors	Peiying Zhang et.al.	2505.10558	null
2025-05-15	Flowing Through Hilbert Space: Quantum-Enhanced Generative Models for Lattice Field Theory	Jehu Martinez et.al.	2505.10553	null
2025-05-15	Does Feasibility Matter? Understanding the Impact of Feasibility on Synthetic Training Data	Yiwen Liu et.al.	2505.10551	link
2025-05-15	Pharmacophore-Conditioned Diffusion Model for Ligand-Based De Novo Drug Design	Amira Alakhdar et.al.	2505.10545	null
2025-05-15	LibIQ: Toward Real-Time Spectrum Classification in O-RAN dApps	Filippo Olimpieri et.al.	2505.10537	link
2025-05-15	Optimal Pricing With Impatient Customers	Jieqi Di et.al.	2505.10514	null
2025-05-15	CheXGenBench: A Unified Benchmark For Fidelity, Privacy and Utility of Synthetic Chest Radiographs	Raman Dutt et.al.	2505.10496	link
2025-05-15	Campus AI vs Commercial AI: A Late-Breaking Study on How LLM As-A-Service Customizations Shape Trust and Usage Patterns	Leon Hannig et.al.	2505.10490	null
2025-05-15	UniEval: Unified Holistic Evaluation for Unified Multimodal Understanding and Generation	Yi Li et.al.	2505.10483	null
2025-05-15	Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps	Ningyuan Yang et.al.	2505.10482	null
2025-05-15	AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenge	Ranjan Sapkota et.al.	2505.10468	null
2025-05-15	Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models	Zemin Huang et.al.	2505.10446	null
2025-05-15	Score-based diffusion nowcasting of GOES imagery	Randy J. Chase et.al.	2505.10432	null
2025-05-14	Customizing a Large Language Model for VHDL Design of High-Performance Microprocessors	Nicolas Dupuis et.al.	2505.09610	null
2025-05-14	LightLab: Controlling Light Sources in Images with Diffusion Models	Nadav Magar et.al.	2505.09608	null
2025-05-14	Don’t Forget your Inverse DDIM for Image Editing	Guillermo Gomez-Trenado et.al.	2505.09571	null
2025-05-14	BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset	Jiuhai Chen et.al.	2505.09568	link
2025-05-14	CXMArena: Unified Dataset to benchmark performance in realistic CXM Scenarios	Raghav Garg et.al.	2505.09436	link
2025-05-14	Efficient Modelling of Lyman-α opacity fluctuations during late EoR	Barun Maity et.al.	2505.09369	null
2025-05-14	Diffusion Recommender Models and the Illusion of Progress: A Concerning Study of Reproducibility and a Conceptual Mismatch	Michael Benigni et.al.	2505.09364	null
2025-05-14	Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis	Bingxin Ke et.al.	2505.09358	link
2025-05-14	APR-Transformer: Initial Pose Estimation for Localization in Complex Environments through Absolute Pose Regression	Srinivas Ravuri et.al.	2505.09356	link
2025-05-14	Access Controls Will Solve the Dual-Use Dilemma	Evžen Wybitul et.al.	2505.09341	null
2025-05-14	DCSNet: A Lightweight Knowledge Distillation-Based Model with Explainable AI for Lung Cancer Diagnosis from Histopathological Images	Sadman Sakib Alif et.al.	2505.09334	null
2025-05-14	TransDiffuser: End-to-end Trajectory Generation with Decorrelated Multi-modal Representation for Autonomous Driving	Xuefeng Jiang et.al.	2505.09315	null
2025-05-14	Generating Full-field Evolution of Physical Dynamics from Irregular Sparse Observations	Panqi Chen et.al.	2505.09284	null
2025-05-14	A Note on Semantic Diffusion	Alexander P. Ryjov et.al.	2505.09283	null
2025-05-14	Few-Shot Anomaly-Driven Generation for Anomaly Classification and Segmentation	Guan Gui et.al.	2505.09263	link
2025-05-13	PCS-UQ: Uncertainty Quantification via the Predictability-Computability-Stability Framework	Abhineet Agarwal et.al.	2505.08784	null
2025-05-13	Generative Molecular Design with Steerable and Granular Synthesizability Control	Jeff Guo et.al.	2505.08774	link
2025-05-13	Controllable Image Colorization with Instance-aware Texts and Masks	Yanru An et.al.	2505.08705	null
2025-05-13	A Survey of Deep Learning for Complex Speech Spectrograms	Yuying Xie et.al.	2505.08694	null
2025-05-13	A Machine Learning Pipeline for Molecular Property Prediction using ChemXploreML	Aravindh Nivas Marimuthu et.al.	2505.08688	null
2025-05-13	Comparison of laser system designs for quantum technologies: BECCAL flight system vs. BECCAL ground test bed	Victoria A. Henderson et.al.	2505.08680	null
2025-05-13	A Study of Data-driven Methods for Inventory Optimization	Lee Yeung Ping et.al.	2505.08673	null
2025-05-13	WixQA: A Multi-Dataset Benchmark for Enterprise Retrieval-Augmented Generation	Dvir Cohen et.al.	2505.08643	null
2025-05-13	Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language Models	Donghoon Kim et.al.	2505.08622	null
2025-05-13	Boosting Zero-shot Stereo Matching using Large-scale Mixed Images Sources in the Real World	Yuran Wang et.al.	2505.08607	null
2025-05-13	Extract the Best, Discard the Rest: CSI Feedback with Offline Large AI Models	Jialin Zhuang et.al.	2505.08566	null
2025-05-13	DFA-CON: A Contrastive Learning Approach for Detecting Copyright Infringement in DeepFake Art	Haroon Wahab et.al.	2505.08552	null
2025-05-13	Diffusion-assisted Model Predictive Control Optimization for Power System Real-Time Operation	Linna Xu et.al.	2505.08535	null
2025-05-13	Building-Block Aware Generative Modeling for 3D Crystals of Metal Organic Frameworks	Chenru Duan et.al.	2505.08531	link
2025-05-14	Improving Data Fidelity via Diffusion Model-based Correction and Super-Resolution	Wuzhe Xu et.al.	2505.08526	null
2025-05-12	H $^{\mathbf{3}}$ DP: Triply-Hierarchical Diffusion Policy for Visuomotor Learning	Yiyang Lu et.al.	2505.07819	null
2025-05-12	DanceGRPO: Unleashing GRPO on Visual Generation	Zeyue Xue et.al.	2505.07818	null
2025-05-12	Pixel Motion as Universal Representation for Robot Control	Kanchana Ranasinghe et.al.	2505.07817	null
2025-05-12	Continuous Visual Autoregressive Generation via Score Maximization	Chenze Shao et.al.	2505.07812	link
2025-05-12	Improving Trajectory Stitching with Flow Models	Reece O’Mahoney et.al.	2505.07802	null
2025-05-12	Learning Dynamics in Continual Pre-Training for Large Language Models	Xingjin Wang et.al.	2505.07796	null
2025-05-12	Synthesizing Diverse Network Flow Datasets with Scalable Dynamic Multigraph Generation	Arya Grayeli et.al.	2505.07777	null
2025-05-12	LAMM-ViT: AI Face Detection via Layer-Aware Modulation of Region-Guided Attention	Jiangling Zhang et.al.	2505.07734	null
2025-05-12	ShotAdapter: Text-to-Multi-Shot Video Generation with Diffusion Models	Ozgur Kara et.al.	2505.07652	null
2025-05-12	Markov Modelling Approach for Queues with Correlated Service Times – the $M/M_D/2$ Model	Suman Thapa et.al.	2505.07648	null
2025-05-12	Diffused Responsibility: Analyzing the Energy Consumption of Generative Text-to-Audio Diffusion Models	Riccardo Passoni et.al.	2505.07615	null
2025-05-12	SecReEvalBench: A Multi-turned Security Resilience Evaluation Benchmark for Large Language Models	Huining Cui et.al.	2505.07584	null
2025-05-12	Noise Optimized Conditional Diffusion for Domain Adaptation	Lingkun Luo et.al.	2505.07548	null
2025-05-12	RAI: Flexible Agent Framework for Embodied AI	Kajetan Rachwał et.al.	2505.07532	link
2025-05-13	FLUXSynID: A Framework for Identity-Controlled Synthetic Face Generation with Document and Live Images	Raul Ismayilov et.al.	2505.07530	link
2025-05-09	Long time behaviour of Mean Field Games with fractional diffusion	Olav Ersland et.al.	2505.06183	null
2025-05-09	DiffLocks: Generating 3D Hair from a Single Image using Diffusion Models	Radu Alexandru Rosu et.al.	2505.06166	null
2025-05-09	Can Prompting LLMs Unlock Hate Speech Detection across Languages? A Zero-shot and Few-shot Study	Faeze Ghorbanpour et.al.	2505.06149	null
2025-05-09	Constraints to Lorentz violation and ultrahigh-energy electrons in D-foamy space-times	Chengyi Li et.al.	2505.06121	null
2025-05-09	Photovoltaic Defect Image Generator with Boundary Alignment Smoothing Constraint for Domain Shift Mitigation	Dongying Li et.al.	2505.06117	null
2025-05-09	FIC-TSC: Learning Time Series Classification with Fisher Information Constraint	Xiwen Chen et.al.	2505.06114	null
2025-05-09	Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation	Kunpeng Qiu et.al.	2505.06068	link
2025-05-09	Droplet Outbursts from Onion Cutting	Zixuan Wu et.al.	2505.06016	null
2025-05-09	Offline Multi-agent Reinforcement Learning via Score Decomposition	Dan Qiao et.al.	2505.05968	null
2025-05-09	GEORCE: A Fast New Control Algorithm for Computing Geodesics	Frederik Möbius Rygaard et.al.	2505.05961	link
2025-05-09	Summarisation of German Judgments in conjunction with a Class-based Evaluation	Bianca Steffes et.al.	2505.05947	link
2025-05-09	Autoencoder-Based Hybrid Replay for Class-Incremental Learning	Milad Khademi Nori et.al.	2505.05926	null
2025-05-09	A 3D pocket-aware and evolutionary conserved interaction guided diffusion model for molecular optimization	Anjie Qiao et.al.	2505.05874	null
2025-05-09	Screening Mechanisms on White Dwarfs: Symmetron & Dilaton	Joan Bachs-Esteban et.al.	2505.05871	null
2025-05-09	Generative Discovery of Partial Differential Equations by Learning from Math Handbooks	Hao Xu et.al.	2505.05869	null
2025-05-08	SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation	Yonwoo Choi et.al.	2505.05475	link
2025-05-08	3D Scene Generation: A Survey	Beichen Wen et.al.	2505.05474	link
2025-05-08	DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion	Qitao Zhao et.al.	2505.05473	null
2025-05-08	Mogao: An Omni Foundation Model for Interleaved Multi-Modal Generation	Chao Liao et.al.	2505.05472	null
2025-05-08	Denoising Diffusion Probabilistic Models for Coastal Inundation Forecasting	Kazi Ashik Islam et.al.	2505.05381	null
2025-05-08	SDR-RDMA: Software-Defined Reliability Architecture for Planetary Scale RDMA Communication	Mikhail Khalilov et.al.	2505.05366	null
2025-05-08	Modelling and Verifying Neuronal Archetypes in Coq	Abdorrahim Bahrami et.al.	2505.05362	link
2025-05-08	SmartTrap: Automated Precision Experiments with Optical Tweezers	Martin Selin et.al.	2505.05290	null
2025-05-08	Diffusion Model Quantization: A Review	Qian Zeng et.al.	2505.05215	link
2025-05-08	EAM: Enhancing Anything with Diffusion Transformers for Blind Super-Resolution	Haizhen Xie et.al.	2505.05209	null
2025-05-08	Societal and technological progress as sewing an ever-growing, ever-changing, patchy, and polychrome quilt	Joel Z. Leibo et.al.	2505.05197	null
2025-05-08	Overcoming Dimensional Factorization Limits in Discrete Diffusion Models through Quantum Joint Distribution Learning	Chuangtao Chen et.al.	2505.05151	link
2025-05-08	Research on Anomaly Detection Methods Based on Diffusion Models	Yi Chen et.al.	2505.05137	null
2025-05-08	Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach	Xuyang Chen et.al.	2505.05126	null
2025-05-08	MDAA-Diff: CT-Guided Multi-Dose Adaptive Attention Diffusion Model for PET Denoising	Xiaolong Niu et.al.	2505.05112	null
2025-05-07	Score Distillation Sampling for Audio: Source Separation, Synthesis, and Beyond	Jessie Richter-Powell et.al.	2505.04621	null
2025-05-08	Flexing RISC-V Instruction Subset Processors (RISPs) to Extreme Edge	Alireza Raisiardali et.al.	2505.04567	null
2025-05-07	Risk-sensitive Reinforcement Learning Based on Convex Scoring Functions	Shanyu Han et.al.	2505.04553	null
2025-05-07	Text2CT: Towards 3D CT Volume Generation from Free-text Descriptions Using Diffusion Model	Pengfei Guo et.al.	2505.04522	null
2025-05-08	HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation	Teng Hu et.al.	2505.04512	null
2025-05-07	Detecting Spelling and Grammatical Anomalies in Russian Poetry Texts	Ilya Koziev et.al.	2505.04507	null
2025-05-07	Uncovering Key Features for Model-Driven Engineering of Complex Performance Indicators: A Scoping Review	Benito Giunta et.al.	2505.04498	null
2025-05-08	Defining and Quantifying Creative Behavior in Popular Image Generators	Aditi Ramaswamy et.al.	2505.04497	null
2025-05-07	Efficient Flow Matching using Latent Variables	Anirban Samaddar et.al.	2505.04486	null
2025-05-08	FA-KPConv: Introducing Euclidean Symmetries to KPConv via Frame Averaging	Ali Alawieh et.al.	2505.04485	null
2025-05-07	Miipher-2: A Universal Speech Restoration Model for Million-Hour Scale Data Restoration	Shigeki Karita et.al.	2505.04457	link
2025-05-07	An Asynchronous Distributed-Memory Parallel Algorithm for k-mer Counting	Souvadra Hati et.al.	2505.04431	link
2025-05-07	Recognizing Ornaments in Vocal Indian Art Music with Active Annotation	Sumit Kumar et.al.	2505.04419	null
2025-05-07	Localized Diffusion Models for High Dimensional Distributions Generation	Georg A. Gottwald et.al.	2505.04417	null
2025-05-07	The Aloe Family Recipe for Open and Specialized Healthcare LLMs	Dario Garcia-Gasulla et.al.	2505.04388	null
2025-05-06	FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios	Shiyi Zhang et.al.	2505.03730	null
2025-05-06	Demonstrating ViSafe: Vision-enabled Safety for High-speed Detect and Avoid	Parv Kapoor et.al.	2505.03694	null
2025-05-06	CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting	Huawei Sun et.al.	2505.03679	null
2025-05-06	Distribution-Conditional Generation: From Class Distribution to Creative Generation	Fu Feng et.al.	2505.03667	null
2025-05-06	Bounding Box-Guided Diffusion for Synthesizing Industrial Images and Segmentation Map	Alessandro Simoni et.al.	2505.03623	link
2025-05-07	PAHA: Parts-Aware Audio-Driven Human Animation with Diffusion Model	Y. B. Wang et.al.	2505.03603	null
2025-05-06	From Pixels to Polygons: A Survey of Deep Learning Approaches for Medical Image-to-Mesh Reconstruction	Fengming Lin et.al.	2505.03599	null
2025-05-06	Real-Time Person Image Synthesis Using a Flow Matching Model	Jiwoo Jeong et.al.	2505.03562	link
2025-05-06	A Comprehensive Survey of Large AI Models for Future Communications: Foundations, Applications and Challenges	Feibo Jiang et.al.	2505.03556	link
2025-05-06	Efficient Training of Physics-enhanced Neural ODEs via Direct Collocation and Nonlinear Programming	Linus Langenkamp et.al.	2505.03552	null
2025-05-06	Causal Intervention Framework for Variational Auto Encoder Mechanistic Interpretability	Dip Roy et.al.	2505.03530	null
2025-05-06	Modality-Guided Dynamic Graph Fusion and Temporal Diffusion for Self-Supervised RGB-T Tracking	Shenglan Li et.al.	2505.03507	link
2025-05-06	A new membership inference attack that spots memorization in generative and predictive models: Loss-Based with Reference Model algorithm (LBRM)	Faiz Taleb et.al.	2505.03490	null
2025-05-06	Wasserstein Convergence of Score-based Generative Models under Semiconvexity and Discontinuous Gradients	Stefano Bruno et.al.	2505.03432	null
2025-05-06	Phenotype-Guided Generative Model for High-Fidelity Cardiac MRI Synthesis: Advancing Pretraining and Clinical Applications	Ziyu Li et.al.	2505.03426	null
2025-05-05	Towards Dataset Copyright Evasion Attack against Personalized Text-to-Image Diffusion Models	Kuofeng Gao et.al.	2505.02824	link
2025-05-05	MUSAR: Exploring Multi-Subject Customization from Single-Subject Dataset via Attention Routing	Zinan Guo et.al.	2505.02823	link
2025-05-05	Advancing Generalizable Tumor Segmentation with Anomaly-Aware Open-Vocabulary Attention Maps and Frozen Foundation Diffusion Models	Yankai Jiang et.al.	2505.02753	link
2025-05-05	The use of Artificial Intelligence for Intervention and Assessment in Individuals with ASD	Aggeliki Sideraki et.al.	2505.02747	null
2025-05-05	Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play	Yemin Shi et.al.	2505.02707	link
2025-05-05	Hierarchical random measures without tables	Marta Catalano et.al.	2505.02653	null
2025-05-06	MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation	Mingcheng Li et.al.	2505.02648	null
2025-05-05	Towards Cross-Modality Modeling for Time Series Analytics: A Survey in the LLM Era	Chenxi Liu et.al.	2505.02583	link
2025-05-05	Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities	Xinjie Zhang et.al.	2505.02567	link
2025-05-05	Bielik v3 Small: Technical Report	Krzysztof Ociepa et.al.	2505.02550	null
2025-05-06	Resolving Memorization in Empirical Diffusion Model for Manifold Data in High-Dimensional Spaces	Yang Lyu et.al.	2505.02508	null
2025-05-05	Hypothesis testing and Stein’s lemma in general probability theories with Euclidean Jordan algebra and its quantum realization	Kanta Sonoda et.al.	2505.02487	null
2025-05-05	Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal Interaction	Biao Gong et.al.	2505.02471	link
2025-05-05	Data Augmentation With Back translation for Low Resource languages: A case of English and Luganda	Richard Kimera et.al.	2505.02463	null
2025-05-05	Predicting the Dynamics of Complex System via Multiscale Diffusion Autoencoder	Ruikun Li et.al.	2505.02450	null
2025-05-02	GENMO: A GENeralist Model for Human MOtion	Jiefeng Li et.al.	2505.01425	null
2025-05-02	Computational, Data-Driven, and Physics-Informed Machine Learning Approaches for Microstructure Modeling in Metal Additive Manufacturing	D. Patel et.al.	2505.01424	null
2025-05-02	VIDSTAMP: A Temporally-Aware Watermark for Ownership and Integrity in Video Diffusion Models	Mohammadreza Teymoorianfard et.al.	2505.01406	link
2025-05-02	Provable Efficiency of Guidance in Diffusion Models for General Data Distribution	Gen Li et.al.	2505.01382	null
2025-05-02	Binamix – A Python Library for Generating Binaural Audio Datasets	Dan Barry et.al.	2505.01369	link
2025-05-02	FreeInsert: Disentangled Text-Guided Object Insertion in 3D Gaussian Scene without Spatial Priors	Chenxi Li et.al.	2505.01322	null
2025-05-02	Model See Model Do: Speech-Driven Facial Animation with Style Control	Yifang Pan et.al.	2505.01319	null
2025-05-02	ViSA-Flow: Accelerating Robot Skill Learning via Large-Scale Video Semantic Action Flow	Changhe Chen et.al.	2505.01288	null
2025-05-02	Scoring-Assisted Generative Exploration for Proteins (SAGE-Prot): A Framework for Multi-Objective Protein Optimization via Iterative Sequence Generation and Evaluation	Hocheol Lim et.al.	2505.01277	link
2025-05-02	Enhancing Obsolescence Forecasting with Deep Generative Data Augmentation: A Semi-Supervised Framework for Low-Data Industrial Applications	Elie Saad et.al.	2505.01261	null
2025-05-05	Enabling Training-Free Semantic Communication Systems with Generative Diffusion Models	Shunpu Tang et.al.	2505.01209	null
2025-05-02	A Secured Triad of IoT, Machine Learning, and Blockchain for Crop Forecasting in Agriculture	Najmus Sakib Sizan et.al.	2505.01196	null
2025-05-02	A Combinatorial Proof of Universal Optimality for Computing a Planar Convex Hull	Ivor van der Hoog et.al.	2505.01194	null
2025-05-02	FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Principal Component Analysis	Jiangtong Tan et.al.	2505.01172	link
2025-05-02	Retrieval-Augmented Generation in Biomedicine: A Survey of Technologies, Datasets, and Clinical Applications	Jiawei He et.al.	2505.01146	null
2025-05-01	Controllable Weather Synthesis and Removal with Video Diffusion Models	Chih-Hao Lin et.al.	2505.00704	null
2025-05-01	T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT	Dongzhi Jiang et.al.	2505.00703	link
2025-05-01	GuideSR: Rethinking Guidance for One-Step High-Fidelity Diffusion-Based Super-Resolution	Aditya Arora et.al.	2505.00687	null
2025-05-01	Visual Trajectory Prediction of Vessels for Inland Navigation	Alexander Puzicha et.al.	2505.00599	null
2025-05-01	ParkDiffusion: Heterogeneous Multi-Agent Multi-Modal Trajectory Prediction for Automated Parking using Diffusion Models	Jiarong Wei et.al.	2505.00586	null
2025-05-01	Safety-Critical Traffic Simulation with Guided Latent Diffusion Model	Mingxing Peng et.al.	2505.00515	null
2025-05-01	A General Model for Linearly Polarized Optical Vector Beams	Jonathan Nichols et.al.	2505.00471	null
2025-05-01	A Neural Network Mode for PX4 on Embedded Flight Controllers	Sindre M. Hegre et.al.	2505.00432	link
2025-05-01	Over-the-Air Inference over Multi-hop MIMO Networks	Chenghong Bian et.al.	2505.00430	null
2025-05-01	Leveraging Pretrained Diffusion Models for Zero-Shot Part Assembly	Ruiyuan Zhang et.al.	2505.00426	null
2025-05-01	CSE-SFP: Enabling Unsupervised Sentence Representation Learning via a Single Forward Pass	Bowen Zhang et.al.	2505.00389	link
2025-05-01	Towards Lightweight Hyperspectral Image Super-Resolution with Depthwise Separable Dilated Convolutional Network	Usman Muhammad et.al.	2505.00374	link
2025-05-01	Denoising weak lensing mass maps with diffusion model: systematic comparison with generative adversarial network	Shohei D. Aoyama et.al.	2505.00345	null
2025-05-01	T2VPhysBench: A First-Principles Benchmark for Physical Consistency in Text-to-Video Generation	Xuyang Guo et.al.	2505.00337	null
2025-05-01	Quaternion Wavelet-Conditioned Diffusion Models for Image Super-Resolution	Luigi Sigillo et.al.	2505.00334	null
2025-04-30	ReVision: High-Quality, Low-Cost Video Generation with Explicit 3D Physics Modeling for Complex Motion and Interaction	Qihao Liu et.al.	2504.21855	null
2025-04-30	3D Stylization via Large Reconstruction Model	Ipek Oztas et.al.	2504.21836	null
2025-04-30	From Aesthetics to Human Preferences: Comparative Perspectives of Evaluating Text-to-Music Systems	Huan Zhang et.al.	2504.21815	null
2025-04-30	Anatomical Similarity as a New Metric to Evaluate Brain Generative Models	Bahram Jafrasteh et.al.	2504.21771	null
2025-04-30	MovementVR: An open-source tool for the study of motor control and learning in virtual reality	Cristina Rossi et.al.	2504.21696	null
2025-04-30	HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation	Haiyang Zhou et.al.	2504.21650	link
2025-04-30	Diffusion-based Adversarial Identity Manipulation for Facial Privacy Protection	Liqin Wang et.al.	2504.21646	null
2025-04-30	ODE and PDE models for COVID-19, with reinfection and vaccination process for Cameroon and Germany	Hamadjam Abboubakar et.al.	2504.21613	null
2025-04-30	Latent Feature-Guided Conditional Diffusion for High-Fidelity Generative Image Semantic Communication	Zehao Chen et.al.	2504.21577	null
2025-04-30	Generative AI in Financial Institution: A Global Survey of Opportunities, Threats, and Regulation	Bikash Saha et.al.	2504.21574	null
2025-04-30	FreeBeacon: Efficient Communication and Data Aggregation in Battery-Free IoT	Gaosheng Liu et.al.	2504.21571	null
2025-04-30	MagicPortrait: Temporally Consistent Face Reenactment with 3D Geometric Guidance	Mengting Wei et.al.	2504.21497	link
2025-04-30	DGSolver: Diffusion Generalist Solver with Universal Posterior Sampling for Image Restoration	Hebaixu Wang et.al.	2504.21487	link
2025-04-30	GarmentDiffusion: 3D Garment Sewing Pattern Generation with Multimodal Diffusion Transformers	Xinyu Li et.al.	2504.21476	null
2025-04-30	SimPRIVE: a Simulation framework for Physical Robot Interaction with Virtual Environments	Federico Nesti et.al.	2504.21454	null
2025-04-29	YoChameleon: Personalized Vision and Language Generation	Thao Nguyen et.al.	2504.20998	null
2025-04-29	TesserAct: Learning 4D Embodied World Models	Haoyu Zhen et.al.	2504.20995	null
2025-04-29	Trace-of-Thought: Enhanced Arithmetic Problem Solving via Reasoning Distillation From Large to Small Language Models	Tyler McDonald et.al.	2504.20946	null
2025-04-30	End-to-end Audio Deepfake Detection from RAW Waveforms: a RawNet-Based Approach with Cross-Dataset Evaluation	Andrea Di Pierno et.al.	2504.20923	link
2025-04-29	Evaluating Generative Models for Tabular Data: Novel Metrics and Benchmarking	Dayananda Herurkar et.al.	2504.20900	null
2025-04-29	The Leaderboard Illusion	Shivalika Singh et.al.	2504.20879	null
2025-04-29	AI-GenBench: A New Ongoing Benchmark for AI-Generated Image Detection	Lorenzo Pellegrini et.al.	2504.20865	null
2025-04-29	Universal language model with the intervention of quantum theory	D. -F. Qin et.al.	2504.20839	null
2025-04-29	SoccerDiffusion: Toward Learning End-to-End Humanoid Robot Soccer from Gameplay Recordings	Florian Vahl et.al.	2504.20808	null
2025-04-29	JTreeformer: Graph-Transformer via Latent-Diffusion Model for Molecular Generation	Ji Shi et.al.	2504.20770	null
2025-04-29	DDPS: Discrete Diffusion Posterior Sampling for Paths in Layered Graphs	Hao Luan et.al.	2504.20754	null
2025-04-29	Learning a General Model: Folding Clothing with Topological Dynamics	Yiming Liu et.al.	2504.20720	null
2025-04-29	What’s Wrong with Your Synthetic Tabular Data? Using Explainable AI to Evaluate Generative Models	Jan Kapar et.al.	2504.20687	link
2025-04-29	DiffLiB: High-fidelity differentiable modeling of lithium-ion batteries and efficient gradient-based parameter identification	Weipeng Xu et.al.	2504.20674	link
2025-04-29	LDPoly: Latent Diffusion for Polygonal Road Outline Extraction in Large-Scale Topographic Mapping	Weiqin Jiao et.al.	2504.20645	null
2025-04-28	Shopformer: Transformer-Based Framework for Detecting Shoplifting via Human Pose	Narges Rashvand et.al.	2504.19970	null
2025-04-28	Warm-Starting QAOA with XY Mixers: A Novel Approach for Quantum-Enhanced Vehicle Routing Optimization	Rafael S. do Carmo et.al.	2504.19934	null
2025-04-28	CineVerse: Consistent Keyframe Synthesis for Cinematic Scene Composition	Quynh Phung et.al.	2504.19894	null
2025-04-28	Queue or lounge: strategic design for strategic customer	Riya Sultana et.al.	2504.19889	null
2025-04-28	DeeCLIP: A Robust and Generalizable Transformer-Based Framework for Detecting AI-Generated Images	Mamadou Keita et.al.	2504.19876	link
2025-04-28	CoherenDream: Boosting Holistic Text Coherence in 3D Generation via Multimodal Large Language Models Feedback	Chenhan Jiang et.al.	2504.19860	null
2025-04-28	Automated Generation of Precedence Graphs in Digital Value Chains for Automotive Production	Cornelius Hake et.al.	2504.19835	null
2025-04-28	Contextures: The Mechanism of Representation Learning	Runtian Zhai et.al.	2504.19792	null
2025-04-28	Heterophily-informed Message Passing	Haishan Wang et.al.	2504.19785	null
2025-04-28	Crafting a Personal Journaling Practice: Negotiating Ecosystems of Materials, Personal Context, and Community in Analog Journaling	Katherine Lin et.al.	2504.19767	null
2025-04-28	Lossy Beyond Diagonal Reconfigurable Intelligent Surfaces: Modeling and Optimization	Yiyang Peng et.al.	2504.19744	null
2025-04-28	RepText: Rendering Visual Text via Replicating	Haofan Wang et.al.	2504.19724	null
2025-04-28	$\texttt{SAGE}$ : A Generic Framework for LLM Safety Evaluation	Madhur Jindal et.al.	2504.19674	link
2025-04-28	Multimodal Conditioned Diffusive Time Series Forecasting	Chen Su et.al.	2504.19669	null
2025-04-28	Hardware/Software Co-Design of RISC-V Extensions for Accelerating Sparse DNNs on FPGAs	Muhammad Sabih et.al.	2504.19659	null
2025-04-25	Eval3D: Interpretable and Fine-grained Evaluation for 3D Generation	Shivam Duggal et.al.	2504.18509	null
2025-04-25	Action-Minimization Meets Generative Modeling: Efficient Transition Path Sampling with the Onsager-Machlup Functional	Sanjeev Raja et.al.	2504.18506	null
2025-04-25	LaRI: Layered Ray Intersections for Single-view 3D Geometric Reasoning	Rui Li et.al.	2504.18424	null
2025-04-25	HepatoGEN: Generating Hepatobiliary Phase MRI with Perceptual and Adversarial Models	Jens Hooge et.al.	2504.18405	null
2025-04-25	Paradigm shift on Coding Productivity Using GenAI	Liang Yu et.al.	2504.18404	null
2025-04-25	The Foundation for Developing an Exoskeleton for the Rehabilitation of Temporomandibular Disorders	Paul-Otto Müller et.al.	2504.18379	link
2025-04-25	Enhanced Sampling, Public Dataset and Generative Model for Drug-Protein Dissociation Dynamics	Maodong Li et.al.	2504.18367	null
2025-04-25	SSD-Poser: Avatar Pose Estimation with State Space Duality from Sparse Observations	Shuting Zhao et.al.	2504.18332	null
2025-04-25	STP4D: Spatio-Temporal-Prompt Consistent Modeling for Text-to-4D Gaussian Splatting	Yunze Deng et.al.	2504.18318	null
2025-04-25	Seeing Soundscapes: Audio-Visual Generation and Separation from Soundscapes Using Audio-Visual Separator	Minjae Kang et.al.	2504.18283	null
2025-04-25	TextTIGER: Text-based Intelligent Generation with Entity Prompt Refinement for Text-to-Image Generation	Shintaro Ozaki et.al.	2504.18269	null
2025-04-25	Efficient Single-Pass Training for Multi-Turn Reasoning	Ritesh Goru et.al.	2504.18246	null
2025-04-25	Optimizing Multi-Round Enhanced Training in Diffusion Models for Improved Preference Understanding	Kun Li et.al.	2504.18204	null
2025-04-25	Generative AI for Physical-Layer Authentication	Rui Meng et.al.	2504.18175	null
2025-04-25	Offline Learning of Controllable Diverse Behaviors	Mathieu Petitbois et.al.	2504.18160	null
2025-04-24	LiDPM: Rethinking Point Diffusion for Lidar Scene Completion	Tetiana Martyniuk et.al.	2504.17791	null
2025-04-24	Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models	Xu Ma et.al.	2504.17789	null
2025-04-24	WI2easy: warm inflation dynamics made easy	Gabriel S. Rodrigues et.al.	2504.17760	null
2025-04-24	User Profiles: The Achilles’ Heel of Web Browsers	Dolière Francis Somé et.al.	2504.17692	null
2025-04-24	DiMeR: Disentangled Mesh Reconstruction Model	Lutao Jiang et.al.	2504.17670	link
2025-04-24	polyGen: A Learning Framework for Atomic-level Polymer Structure Generation	Ayush Jain et.al.	2504.17656	null
2025-04-24	Beyond Labels: Zero-Shot Diabetic Foot Ulcer Wound Segmentation with Self-attention Diffusion Models and the Potential for Text-Guided Customization	Abderrachid Hamrani et.al.	2504.17628	null
2025-04-24	Likelihood-Free Variational Autoencoders	Chen Xu et.al.	2504.17622	null
2025-04-24	Enhancing CNNs robustness to occlusions with bioinspired filters for border completion	Catarina P. Coutinho et.al.	2504.17619	null
2025-04-24	Mitigating xApp conflicts for efficient network slicing in 6G O-RAN: a graph convolutional-based attention network approach	Sihem Bakri et.al.	2504.17590	null
2025-04-24	TileLang: A Composable Tiled Programming Model for AI Systems	Lei Wang et.al.	2504.17577	null
2025-04-24	ESDiff: Encoding Strategy-inspired Diffusion Model with Few-shot Learning for Color Image Inpainting	Junyan Zhang et.al.	2504.17524	null
2025-04-24	Unveiling Hidden Vulnerabilities in Digital Human Generation via Adversarial Attacks	Zhiying Li et.al.	2504.17457	null
2025-04-24	3DV-TON: Textured 3D-Guided Consistent Video Try-on via Diffusion Models	Min Wei et.al.	2504.17414	null
2025-04-24	DRC: Enhancing Personalized Image Generation via Disentangled Representation Composition	Yiyan Xu et.al.	2504.17349	null
2025-04-23	Generalized Neighborhood Attention: Multi-dimensional Sparse Attention at the Speed of Light	Ali Hassani et.al.	2504.16922	link
2025-04-23	DreamO: A Unified Framework for Image Customization	Chong Mou et.al.	2504.16915	null
2025-04-23	BadVideo: Stealthy Backdoor Attack against Text-to-Video Generation	Ruotong Wang et.al.	2504.16907	null
2025-04-23	Practical approaches for crystal structure predictions with inpainting generation and universal interatomic potentials	Peichen Zhong et.al.	2504.16893	null
2025-04-23	Situational Preparedness Dynamics for Sequential Tropical Cyclone Hazards	Tianle Duan et.al.	2504.16878	null
2025-04-23	Planning with Diffusion Models for Target-Oriented Dialogue Systems	Hanwen Du et.al.	2504.16858	null
2025-04-23	Physically Consistent Humanoid Loco-Manipulation using Latent Diffusion Models	Ilyass Taouil et.al.	2504.16843	null
2025-04-23	Snorkeling in dark waters: A longitudinal surface exploration of unique Tor Hidden Services (Extended Version)	Alfonso Rodriguez Barredo-Valenzuela et.al.	2504.16836	null
2025-04-23	Evaluating Autoencoders for Parametric and Invertible Multidimensional Projections	Frederik L. Dennig et.al.	2504.16831	null
2025-04-23	Advanced Chest X-Ray Analysis via Transformer-Based Image Descriptors and Cross-Model Attention Mechanism	Lakshita Agarwal et.al.	2504.16774	null
2025-04-23	How Effective are Generative Large Language Models in Performing Requirements Classification?	Waad Alhoshan et.al.	2504.16768	null
2025-04-23	Tri-FusionNet: Enhancing Image Description Generation with Transformer-based Fusion Network and Dual Attention Mechanism	Lakshita Agarwal et.al.	2504.16761	null
2025-04-23	Feature Mixing Approach for Detecting Intraoperative Adverse Events in Laparoscopic Roux-en-Y Gastric Bypass Surgery	Rupak Bose et.al.	2504.16749	null
2025-04-24	Simple Graph Contrastive Learning via Fractional-order Neural Diffusion Networks	Yanan Zhao et.al.	2504.16748	null
2025-04-23	MOSAIC: A Skill-Centric Algorithmic Framework for Long-Horizon Manipulation Planning	Itamar Mishani et.al.	2504.16738	null
2025-04-22	Survey of Video Diffusion Models: Foundations, Implementations, and Applications	Yimu Wang et.al.	2504.16081	link
2025-04-22	From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning	Le Zhuo et.al.	2504.16080	null
2025-04-22	Intent-aware Diffusion with Contrastive Learning for Sequential Recommendation	Yuanpeng Qu et.al.	2504.16077	link
2025-04-22	High-performance training and inference for deep equivariant interatomic potentials	Chuin Wei Tan et.al.	2504.16068	link
2025-04-22	Boosting Generative Image Modeling via Joint Image-Feature Synthesis	Theodoros Kouzelis et.al.	2504.16064	null
2025-04-22	Evaluating Vision Language Models (VLMs) for Radiology: A Comprehensive Analysis	Frank Li et.al.	2504.16047	null
2025-04-22	Efficient Temporal Consistency in Diffusion-Based Video Editing with Adaptor Modules: A Theoretical Framework	Xinyuan Song et.al.	2504.16016	null
2025-04-22	Deep learning of point processes for modeling high-frequency data	Yoshihiro Gyotoku et.al.	2504.15944	null
2025-04-22	Adversarial Observations in Weather Forecasting	Erik Imgrund et.al.	2504.15942	link
2025-04-22	Text-based Animatable 3D Avatars with Morphable Model Alignment	Yiqian Wu et.al.	2504.15835	link
2025-04-22	Satellite to GroundScape – Large-scale Consistent Ground View Generation from Satellite Views	Ningli Xu et.al.	2504.15786	null
2025-04-22	Clifford Group Equivariant Diffusion Models for 3D Molecular Generation	Cong Liu et.al.	2504.15773	null
2025-04-22	Stochastic Programming for Dynamic Temperature Control of Refrigerated Road Transport	Francesco Giliberto et.al.	2504.15741	null
2025-04-22	Riemannian Neural Geodesic Interpolant	Jiawen Wu et.al.	2504.15736	null
2025-04-22	Structure-Preserving Zero-Shot Image Editing via Stage-Wise Latent Injection in Diffusion Models	Dasol Jeong et.al.	2504.15723	null
2025-04-21	Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction	Vaishnavh Nagarajan et.al.	2504.15266	link
2025-04-21	Bringing Diversity from Diffusion Models to Semantic-Guided Face Asset Generation	Yunxuan Cai et.al.	2504.15259	null
2025-04-21	Evaluating Judges as Evaluators: The JETTS Benchmark of LLM-as-Judges as Test-Time Scaling Evaluators	Yilun Zhou et.al.	2504.15253	link
2025-04-21	DRAGON: Distributional Rewards Optimize Diffusion Generative Models	Yatong Bai et.al.	2504.15217	null
2025-04-21	Integrating Symbolic Execution into the Fine-Tuning of Code-Generating LLMs	Marina Sakharova et.al.	2504.15210	null
2025-04-21	Tiger200K: Manually Curated High Visual Quality Video Dataset from UGC Platform	Xianpan Zhou et.al.	2504.15182	null
2025-04-21	FaceCraft4D: Animated 3D Facial Avatar Generation from a Single Image	Fei Yin et.al.	2504.15179	null
2025-04-21	DSPO: Direct Semantic Preference Optimization for Real-World Image Super-Resolution	Miaomiao Cai et.al.	2504.15176	null
2025-04-21	Automatic Generation of Aerobatic Flight in Complex Environments via Diffusion Models	Yuhang Zhong et.al.	2504.15138	null
2025-04-21	Robust and Real-time Surface Normal Estimation from Stereo Disparities using Affine Transformations	Csongor Csanad Kariko et.al.	2504.15121	null
2025-04-22	VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation	Mingxia Zhan et.al.	2504.15095	null
2025-04-21	Generative Artificial Intelligence for Beamforming in Low-Altitude Economy	Geng Sun et.al.	2504.15079	null
2025-04-21	SOLIDO: A Robust Watermarking Method for Speech Synthesis via Low-Rank Adaptation	Yue Li et.al.	2504.15035	null
2025-04-21	Gaussian Shading++: Rethinking the Realistic Deployment Challenge of Performance-Lossless Image Watermark for Diffusion Models	Zijin Yang et.al.	2504.15026	null
2025-04-21	PIV-FlowDiffuser:Transfer-learning-based denoising diffusion models for PIV	Qianyu Zhu et.al.	2504.14952	link
2025-04-18	Decoding Vision Transformers: the Diffusion Steering Lens	Ryota Takatsuki et.al.	2504.13763	link
2025-04-18	ESPLoRA: Enhanced Spatial Precision with Low-Rank Adaption in Text-to-Image Diffusion Models for High-Definition Synthesis	Andrea Rigo et.al.	2504.13745	null
2025-04-18	MLEP: Multi-granularity Local Entropy Patterns for Universal AI-generated Image Detection	Lin Yuan et.al.	2504.13726	null
2025-04-18	Magnecko: Design and Control of a Quadrupedal Magnetic Climbing Robot	Stefan Leuthard et.al.	2504.13672	null
2025-04-18	Word Embedding Techniques for Classification of Star Ratings	Hesham Abdelmotaleb et.al.	2504.13653	null
2025-04-18	Simulating Before Planning: Constructing Intrinsic User World Model for User-Tailored Dialogue Policy Planning	Tao He et.al.	2504.13643	null
2025-04-18	SupResDiffGAN a new approach for the Super-Resolution task	Dawid Kopeć et.al.	2504.13622	null
2025-04-18	Entropic Time Schedulers for Generative Diffusion Models	Dejan Stancevic et.al.	2504.13612	null
2025-04-18	WeatherGen: A Unified Diverse Weather Generator for LiDAR Point Clouds via Spider Mamba Diffusion	Yang Wu et.al.	2504.13561	link
2025-04-18	Task Assignment and Exploration Optimization for Low Altitude UAV Rescue via Generative AI Enhanced Multi-agent Reinforcement Learning	Xin Tang et.al.	2504.13554	null
2025-04-18	Beyond One-Hot Labels: Semantic Mixing for Model Calibration	Haoyang Luo et.al.	2504.13548	link
2025-04-18	Enhancing Multilingual Sentiment Analysis with Explainability for Sinhala, English, and Code-Mixed Content	Azmarah Rizvi et.al.	2504.13545	null
2025-04-18	MusFlow: Multimodal Music Generation via Conditional Flow Matching	Jiahao Song et.al.	2504.13535	null
2025-04-18	U-Shape Mamba: State Space Model for faster diffusion	Alex Ergasti et.al.	2504.13499	link
2025-04-18	Early Timestep Zero-Shot Candidate Selection for Instruction-Guided Image Editing	Joowon Kim et.al.	2504.13490	null
2025-04-17	Aligning Constraint Generation with Design Intent in Parametric CAD	Evan Casey et.al.	2504.13178	null
2025-04-17	SemCORE: A Semantic-Enhanced Generative Cross-Modal Retrieval Framework with MLLMs	Haoxuan Li et.al.	2504.13172	null
2025-04-17	Personalized Text-to-Image Generation with Auto-Regressive Models	Kaiyue Sun et.al.	2504.13162	link
2025-04-17	Science-T2I: Addressing Scientific Illusions in Image Synthesis	Jialuo Li et.al.	2504.13129	null
2025-04-17	UniEdit-Flow: Unleashing Inversion and Editing in the Era of Flow Models	Guanlong Jiao et.al.	2504.13109	null
2025-04-17	RF-DETR Object Detection vs YOLOv12 : A Study of Transformer-based and CNN-based Architectures for Single-Class and Multi-Class Greenfruit Detection in Complex Orchard Environments Under Label Ambiguity	Ranjan Sapkota et.al.	2504.13099	null
2025-04-17	An All-Atom Generative Model for Designing Protein Complexes	Ruizhe Chen et.al.	2504.13075	link
2025-04-18	SkyReels-V2: Infinite-length Film Generative Model	Guibin Chen et.al.	2504.13074	link
2025-04-17	ArtistAuditor: Auditing Artist Style Pirate in Text-to-Image Generation Models	Linkang Du et.al.	2504.13061	link
2025-04-17	Design Topological Materials by Reinforcement Fine-Tuned Generative Model	Haosheng Xu et.al.	2504.13048	null
2025-04-17	Evidence for sulfur chemistry in the atmosphere of the warm sub-Neptune TOI-270 d	Lukas Felix et.al.	2504.13039	null
2025-04-17	TTRD3: Texture Transfer Residual Denoising Dual Diffusion Model for Remote Sensing Image Super-Resolution	Yide Liu et.al.	2504.13026	link
2025-04-17	GSAC: Leveraging Gaussian Splatting for Photorealistic Avatar Creation with Unity Integration	Rendong Zhang et.al.	2504.12999	link
2025-04-17	QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning?	Zhouyang Jiang et.al.	2504.12961	null
2025-04-17	Systemic risk mitigation in supply chains through network rewiring	Giacomo Zelbi et.al.	2504.12955	null
2025-04-16	VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate	Zhihang Yuan et.al.	2504.12259	link
2025-04-16	Cobra: Efficient Line Art COlorization with BRoAder References	Junhao Zhuang et.al.	2504.12240	null
2025-04-16	Coding-Prior Guided Diffusion Network for Video Deblurring	Yike Liu et.al.	2504.12222	null
2025-04-16	Validating and monitoring bibliographic and citation data in OpenCitations collections	Ivan Heibi et.al.	2504.12195	null
2025-04-16	Deep Generative Models for Bayesian Inference on High-Rate Sensor Data: Applications in Automotive Radar and Medical Imaging	Tristan S. W. Stevens et.al.	2504.12154	null
2025-04-16	Anti-Aesthetics: Protecting Facial Privacy against Customized Text-to-Image Synthesis	Songping Wang et.al.	2504.12129	null
2025-04-16	A Diffusion-Based Framework for Terrain-Aware Remote Sensing Image Reconstruction	Zhenyu Yu et.al.	2504.12112	null
2025-04-16	Generalized Visual Relation Detection with Diffusion Models	Kaifeng Gao et.al.	2504.12100	null
2025-04-16	Generative Deep Learning Framework for Inverse Design of Fuels	Kiran K. Yalamanchi et.al.	2504.12075	null
2025-04-16	Modular-Cam: Modular Dynamic Camera-view Video Generation with LLM	Zirui Pan et.al.	2504.12048	null
2025-04-17	Understanding Attention Mechanism in Video Diffusion Models	Bingyan Liu et.al.	2504.12027	null
2025-04-16	Instruction-augmented Multimodal Alignment for Image-Text and Element Matching	Xinli Yue et.al.	2504.12018	null
2025-04-17	Dual-Energy Cone-Beam CT Using Two Orthogonal Projection Views: A Phantom Study	Junbo Peng et.al.	2504.12010	null
2025-04-16	Generative Recommendation with Continuous-Token Diffusion	Haohao Qu et.al.	2504.12007	null
2025-04-16	R-Meshfusion: Reinforcement Learning Powered Sparse-View Mesh Reconstruction with Diffusion Priors	Haoyang Wang et.al.	2504.11946	null
2025-04-15	Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception	Ziqi Pang et.al.	2504.11457	link
2025-04-16	Elucidating the Design Space of Multimodal Protein Language Models	Cheng-Yen Hsieh et.al.	2504.11454	null
2025-04-16	Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion	An Zhao et.al.	2504.11447	link
2025-04-15	NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors	Yanrui Bin et.al.	2504.11427	null
2025-04-15	ADT: Tuning Diffusion Models with Adversarial Supervision	Dazhong Shen et.al.	2504.11423	null
2025-04-15	VideoPanda: Video Panoramic Diffusion with Multi-view Attention	Kevin Xie et.al.	2504.11389	null
2025-04-15	Ring Artifacts Correction Based on Global-Local Features Interaction Guidance in the Projection Domain	Yunze Liu et.al.	2504.11375	null
2025-04-15	Evaluating DAO Sustainability and Longevity Through On-Chain Governance Metrics	Silvio Meneguzzo et.al.	2504.11341	null
2025-04-15	Autoregressive Distillation of Diffusion Transformers	Yeongmin Kim et.al.	2504.11295	link
2025-04-15	DeepSelective: Feature Gating and Representation Matching for Interpretable Clinical Prediction	Ruochi Zhang et.al.	2504.11264	null
2025-04-15	VEXP: A Low-Cost RISC-V ISA Extension for Accelerated Softmax Computation in Transformers	Run Wang et.al.	2504.11227	null
2025-04-15	Focal Split: Untethered Snapshot Depth from Differential Defocus	Junjie Luo et.al.	2504.11202	null
2025-04-15	DMAGaze: Gaze Estimation Based on Feature Disentanglement and Multi-Scale Attention	Haohan Chen et.al.	2504.11160	null
2025-04-15	SAR-to-RGB Translation with Latent Diffusion for Earth Observation	Kaan Aydin et.al.	2504.11154	null
2025-04-15	Taming Consistency Distillation for Accelerated Human Image Animation	Xiang Wang et.al.	2504.11143	null
2025-04-14	REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers	Xingjian Leng et.al.	2504.10483	null
2025-04-14	Online Advanced Labs in Physics	Peter A. Bennett et.al.	2504.10470	null
2025-04-14	Art3D: Training-Free 3D Generation from Flat-Colored Illustration	Xiaoyan Cong et.al.	2504.10466	null
2025-04-14	Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing	Taihang Hu et.al.	2504.10434	link
2025-04-14	MonoDiff9D: Monocular Category-Level 9D Object Pose Estimation via Diffusion Model	Jian Liu et.al.	2504.10433	link
2025-04-14	AI-Driven Code Refactoring: Using Graph Neural Networks to Enhance Software Maintainability	Gopichand Bandarupalli et.al.	2504.10412	null
2025-04-14	LLM-driven Constrained Copy Generation through Iterative Refinement	Varun Vasudevan et.al.	2504.10391	null
2025-04-14	Improving diffusion modeling in all-solid-state lithium batteries: a novel approach for grain boundary effects	Lena Scholz et.al.	2504.10348	null
2025-04-14	$α$ -Flow: A Unified Framework for Continuous-State Discrete Flow Matching Models	Chaoran Cheng et.al.	2504.10283	null
2025-04-14	DiffMOD: Progressive Diffusion Point Denoising for Moving Object Detection in Remote Sensing	Jinyue Zhang et.al.	2504.10278	null
2025-04-14	When Technologies Are Not Enough: Understanding How Domestic Workers Employ (and Avoid) Online Technologies in Their Work Practices	Mariana Fernandez-Espinosa et.al.	2504.10265	null
2025-04-14	A Model Zoo of Vision Transformers	Damian Falk et.al.	2504.10231	link
2025-04-14	Localized Cultural Knowledge is Conserved and Controllable in Large Language Models	Veniamin Veselovsky et.al.	2504.10191	null
2025-04-14	Efficient Generative Model Training via Embedded Representation Warmup	Deyuan Liu et.al.	2504.10188	link
2025-04-14	A New Paradigm in IBR Modeling for Power Flow and Short Circuit Analysis	Zahid Javid et.al.	2504.10181	null
2025-04-11	Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model	Team Seawead et.al.	2504.08685	null
2025-04-11	Safe Flow Matching: Robot Motion Planning with Control Barrier Functions	Xiaobing Dai et.al.	2504.08661	null
2025-04-11	Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization	Jialu Li et.al.	2504.08641	null
2025-04-11	Quantum Fluctuation-enhanced Milli-Kelvin Magnetic Refrigeration in Triangular Lattice Magnet GdBO3	Weijie Lin et.al.	2504.08636	null
2025-04-11	Discretization Error Analysis of a High Order Unfitted Space-Time Method for moving domain problems	Fabian Heimann et.al.	2504.08608	null
2025-04-11	Neural Fidelity Calibration for Informative Sim-to-Real Adaptation	Youwei Yu et.al.	2504.08604	null
2025-04-11	ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration	Yongsheng Yu et.al.	2504.08591	null
2025-04-11	COP-GEN-Beta: Unified Generative Modelling of COPernicus Imagery Thumbnails	Miguel Espinosa et.al.	2504.08548	null
2025-04-11	Slicing the Gaussian Mixture Wasserstein Distance	Moritz Piening et.al.	2504.08544	link
2025-04-11	Discriminator-Free Direct Preference Optimization for Video Diffusion	Haoran Cheng et.al.	2504.08542	null
2025-04-11	On The Landscape of Spoken Language Models: A Comprehensive Survey	Siddhant Arora et.al.	2504.08528	null
2025-04-11	TickIt: Leveraging Large Language Models for Automated Ticket Escalation	Fengrui Liu et.al.	2504.08475	null
2025-04-11	Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data Generation	Bram Vanherle et.al.	2504.08473	link
2025-04-11	On the Design of Diffusion-based Neural Speech Codecs	Pietro Foti et.al.	2504.08470	null
2025-04-11	Muon-Accelerated Attention Distillation for Real-Time Edge Synthesis via Optimized Latent Diffusion	Weiye Chen et.al.	2504.08451	link
2025-04-10	PixelFlow: Pixel-Space Generative Models with Flow	Shoufa Chen et.al.	2504.07963	link
2025-04-10	Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction	Zeren Jiang et.al.	2504.07961	link
2025-04-10	VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning	Zhong-Yu Li et.al.	2504.07960	null
2025-04-10	Activating high-power parametric oscillation in photonic-crystal resonators	Grant M. Brodnik et.al.	2504.07947	null
2025-04-10	GenEAva: Generating Cartoon Avatars with Fine-Grained Facial Expressions from Realistic Diffusion-based Faces	Hao Yu et.al.	2504.07945	null
2025-04-10	Echo: An Open-Source, Low-Cost Teleoperation System with Force Feedback for Dataset Collection in Robot Learning	Artem Bazhenov et.al.	2504.07939	null
2025-04-10	Optimal Control For Anti-Abeta Treatment in Alzheimer’s Disease using a Reaction-Diffusion Model	Wenrui Hao et.al.	2504.07913	null
2025-04-10	DiverseFlow: Sample-Efficient Diverse Mode Coverage in Flows	Mashrur M. Morshed et.al.	2504.07894	null
2025-04-10	QubitHammer Attacks: Qubit Flipping Attacks in Multi-tenant Superconducting Quantum Computers	Yizhuo Tan et.al.	2504.07875	null
2025-04-11	Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs	Yichun Yin et.al.	2504.07866	null
2025-04-10	A Review of HPC-Accelerated CFD in National Security and Defense	James Afful et.al.	2504.07837	null
2025-04-10	The ISC Creator: Human-Centered Design of Learning Analytics Interactive Indicator Specification Cards	Shoeb Joarder et.al.	2504.07811	null
2025-04-10	Revisiting Likelihood-Based Out-of-Distribution Detection by Modeling Representations	Yifan Ding et.al.	2504.07793	link
2025-04-10	Characterization of the Electronic Noise in the Readout of Resistive Micromegas in the High-Angle Time Projection Chambers of the T2K Experiment	D. Attié et.al.	2504.07759	null
2025-04-10	Virtual-mask Informed Prior for Sparse-view Dual-Energy CT Reconstruction	Zini Chen et.al.	2504.07753	null
2025-04-09	Identifying Unknown Stochastic Dynamics via Finite expression methods	Senwei Liang et.al.	2504.07085	null
2025-04-09	Evaluating Retrieval Augmented Generative Models for Document Queries in Transportation Safety	Chad Melton et.al.	2504.07022	null
2025-04-09	Latent Diffusion U-Net Representations Contain Positional Embeddings and Anomalies	Jonas Loos et.al.	2504.07008	link
2025-04-09	A Comparison of Deep Learning Methods for Cell Detection in Digital Cytology	Marco Acerbis et.al.	2504.06957	link
2025-04-09	PathSegDiff: Pathology Segmentation using Diffusion model representations	Sachin Kumar Danisetty et.al.	2504.06950	null
2025-04-09	The Importance of Being Discrete: Measuring the Impact of Discretization in End-to-End Differentially Private Synthetic Data	Georgi Ganev et.al.	2504.06923	null
2025-04-09	Data Augmentation for Fake Reviews Detection in Multiple Languages and Multiple Domains	Ming Liu et.al.	2504.06917	null
2025-04-09	MedSegFactory: Text-Guided Generation of Medical Image-Mask Pairs	Jiawei Mao et.al.	2504.06897	null
2025-04-09	EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation	Diljeet Jagpal et.al.	2504.06861	null
2025-04-09	CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading	Mishan Aliev et.al.	2504.06856	null
2025-04-09	Open Problems and a Hypothetical Path Forward in LLM Knowledge Paradigms	Xiaotian Ye et.al.	2504.06823	null
2025-04-09	DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation	Wangbo Zhao et.al.	2504.06803	link
2025-04-09	A Meaningful Perturbation Metric for Evaluating Explainability Methods	Danielle Cohen et.al.	2504.06800	null
2025-04-09	FedMerge: Federated Personalization via Model Merging	Shutong Chen et.al.	2504.06768	null
2025-04-09	DIMA: DIffusing Motion Artifacts for unsupervised correction in brain MRI images	Paolo Angella et.al.	2504.06767	null
2025-04-08	OmniSVG: A Unified Scalable Vector Graphics Generation Model	Yiying Yang et.al.	2504.06263	null
2025-04-08	Transfer between Modalities with MetaQueries	Xichen Pan et.al.	2504.06256	null
2025-04-08	Electronic Structure Guided Inverse Design Using Generative Models	Shuyi Jia et.al.	2504.06249	link
2025-04-08	From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models	Chejian Xu et.al.	2504.06214	null
2025-04-08	WoundAmbit: Bridging State-of-the-Art Semantic Segmentation and Real-World Wound Care	Vanessa Borst et.al.	2504.06185	null
2025-04-08	Deploying Chatbots in Customer Service: Adoption Hurdles and Simple Remedies	Evgeny Kagan et.al.	2504.06145	null
2025-04-08	QGen Studio: An Adaptive Question-Answer Generation, Training and Evaluation Platform	Movina Moses et.al.	2504.06136	null
2025-04-08	FaceCloak: Learning to Protect Face Templates	Sudipta Banerjee et.al.	2504.06131	link
2025-04-08	OSDM-MReg: Multimodal Image Registration based One Step Diffusion Model	Xiaochen Wei et.al.	2504.06027	null
2025-04-08	CamContextI2V: Context-aware Controllable Video Generation	Luis Denninger et.al.	2504.06022	link
2025-04-08	Note on the Universality of Parameterized IQP Circuits with Hidden Units for Generating Probability Distributions	Andrii Kurkin et.al.	2504.05997	null
2025-04-08	An Empirical Study of GPT-4o Image Generation Capabilities	Sixiang Chen et.al.	2504.05979	link
2025-04-08	Diffusion Based Ambiguous Image Segmentation	Jakob Lønborg Christensen et.al.	2504.05977	null
2025-04-08	Adaptive Extended Kalman Filtering for Battery State of Charge Estimation on STM32	António Barros et.al.	2504.05936	null
2025-04-08	Pushing JWST to the extremes: search and scrutiny of bright galaxy candidates at z $\simeq$ 15-30	M. Castellano et.al.	2504.05893	null
2025-04-07	CREA: A Collaborative Multi-Agent Framework for Creative Content Generation with Diffusion Models	Kavana Venkatesh et.al.	2504.05306	null
2025-04-07	Gaussian Mixture Flow Matching Models	Hansheng Chen et.al.	2504.05304	link
2025-04-07	Dimension-Free Convergence of Diffusion Models for Approximate Gaussian Mixtures	Gen Li et.al.	2504.05300	null
2025-04-07	Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling	Hengran Zhang et.al.	2504.05216	null
2025-04-07	P2Mark: Plug-and-play Parameter-intrinsic Watermarking for Neural Speech Generation	Yong Ren et.al.	2504.05197	null
2025-04-07	Learning symmetries in datasets	Veronica Sanz et.al.	2504.05174	null
2025-04-07	DDPM Score Matching and Distribution Learning	Sinho Chewi et.al.	2504.05161	null
2025-04-07	DA2Diff: Exploring Degradation-aware Adaptive Diffusion Priors for All-in-One Weather Restoration	Jiamei Xiong et.al.	2504.05135	null
2025-04-07	Graph-based Diffusion Model for Collaborative Filtering	Xuan Zhang et.al.	2504.05029	null
2025-04-07	RS-RAG: Bridging Remote Sensing Imagery and Comprehensive Knowledge with a Multi-Modal Dataset and Retrieval-Augmented Generation Model	Congcong Wen et.al.	2504.04988	null
2025-04-07	Low-Rate Semantic Communication with Codebook-based Conditional Generative Models	Kailang Ye et.al.	2504.04977	null
2025-04-08	REWIND: Real-Time Egocentric Whole-Body Motion Diffusion with Exemplar-Based Identity Conditioning	Jihyun Lee et.al.	2504.04956	null
2025-04-07	A Unified Pairwise Framework for RLHF: Bridging Generative Reward Modeling and Policy Optimization	Wenyuan Xu et.al.	2504.04950	null
2025-04-07	One Quantizer is Enough: Toward a Lightweight Audio Codec	Linwei Zhai et.al.	2504.04949	link
2025-04-07	Video-Bench: Human-Aligned Video Generation Benchmark	Hui Han et.al.	2504.04907	null
2025-04-04	MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models	Wulin Xie et.al.	2504.03641	null
2025-04-04	Enhancing Causal Effect Estimation with Diffusion-Generated Data	Li Chen et.al.	2504.03630	null
2025-04-04	Quantifying the uncertainty of model-based synthetic image quality metrics	Ciaran Bench et.al.	2504.03623	null
2025-04-04	VISTA-OCR: Towards generative and interactive end to end OCR models	Laziz Hamdi et.al.	2504.03621	null
2025-04-04	Autonomous and Self-Adapting System for Synthetic Media Detection and Attribution	Aref Azizpour et.al.	2504.03615	null
2025-04-04	Multimodal Diffusion Bridge with Attention-Based SAR Fusion for Satellite Image Cloud Removal	Yuyang Hu et.al.	2504.03607	null
2025-04-04	HumanDreamer-X: Photorealistic Single-image Human Avatars Reconstruction via Gaussian Restoration	Boyuan Wang et.al.	2504.03536	null
2025-04-04	Diffusion Active Learning: Towards Data-Driven Experimental Design in Computed Tomography	Luis Barba et.al.	2504.03491	null
2025-04-04	BUFF: Bayesian Uncertainty Guided Diffusion Probabilistic Model for Single Image Super-Resolution	Zihao He et.al.	2504.03490	null
2025-04-04	Structured Legal Document Generation in India: A Model-Agnostic Wrapper Approach with VidhikDastaavej	Shubham Kumar Nigam et.al.	2504.03486	null
2025-04-04	Dynamic Importance in Diffusion U-Net for Enhanced Image Synthesis	Xi Wang et.al.	2504.03471	link
2025-04-04	D-Garment: Physics-Conditioned Latent Diffusion for Dynamic Garment Deformations	Antoine Dumoulin et.al.	2504.03468	null
2025-04-04	Generating ensembles of spatially-coherent in-situ forecasts using flow matching	David Landry et.al.	2504.03463	null
2025-04-04	Conditioning Diffusions Using Malliavin Calculus	Jakiw Pidstrigach et.al.	2504.03461	null
2025-04-04	QuinID: Enabling FDMA-Based Fully Parallel RFID with Frequency-Selective Antenna	Xin Na et.al.	2504.03412	link
2025-04-03	Concept Lancet: Image Editing with Compositional Representation Transplant	Jinqi Luo et.al.	2504.02828	null
2025-04-03	Efficient Autoregressive Shape Generation via Octree-Based Adaptive Tokenization	Kangle Deng et.al.	2504.02817	null
2025-04-03	F-ViTA: Foundation Model Guided Visible to Thermal Translation	Jay N. Paranjape et.al.	2504.02801	link
2025-04-03	Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model	Shengjun Zhang et.al.	2504.02764	null
2025-04-03	MD-ProjTex: Texturing 3D Shapes with Multi-Diffusion Projection	Ahmet Burak Yildirim et.al.	2504.02762	null
2025-04-03	Echoes of the hidden: Uncovering coordination beyond network structure	Shahar Somin et.al.	2504.02757	null
2025-04-04	RBT4DNN: Requirements-based Testing of Neural Networks	Nusrat Jahan Mozumder et.al.	2504.02737	link
2025-04-03	Pushing the Limit of PPG Sensing in Sedentary Conditions by Addressing Poor Skin-sensor Contact	Manh Pham Hung et.al.	2504.02735	null
2025-04-03	RoSMM: A Robust and Secure Multi-Modal Watermarking Framework for Diffusion Models	ZhongLi Fang et.al.	2504.02640	null
2025-04-03	Variational Online Mirror Descent for Robust Learning in Schrödinger Bridge	Dong-Sig Han et.al.	2504.02618	null
2025-04-03	Fine-Tuning Visual Autoregressive Models for Subject-Driven Generation	Jiwoo Chung et.al.	2504.02612	link
2025-04-03	Bridging the Gap between Gaussian Diffusion Models and Universal Quantization for Image Compression	Lucas Relic et.al.	2504.02579	null
2025-04-03	MAD: Makeup All-in-One with Cross-Domain Diffusion Model	Bo-Kai Ruan et.al.	2504.02545	null
2025-04-03	High Numerical Aperture Achromatic Meta-Devices through Dispersion Compensation	Yuzhong Wang et.al.	2504.02535	null
2025-04-04	ARCANE: Adaptive RISC-V Cache Architecture for Near-memory Extensions	Vincenzo Petrolo et.al.	2504.02533	null
2025-04-02	Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis	Niluthpol Chowdhury Mithun et.al.	2504.01960	null
2025-04-03	VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step	Hanyang Wang et.al.	2504.01956	null
2025-04-02	A Unified Approach to Analysis and Design of Denoising Markov Models	Yinuo Ren et.al.	2504.01938	null
2025-04-03	ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement	Runhui Huang et.al.	2504.01934	null
2025-04-02	Gen-C: Populating Virtual Worlds with Generative Crowds	Andreas Panayiotou et.al.	2504.01924	null
2025-04-03	Bridging the Linguistic Divide: A Survey on Leveraging Large Language Models for Machine Translation	Baban Gain et.al.	2504.01919	null
2025-04-02	Multi-fidelity Parameter Estimation Using Conditional Diffusion Models	Caroline Tatsuoka et.al.	2504.01894	null
2025-04-02	A Diffusion-Based Framework for Occluded Object Movement	Zheng-Peng Duan et.al.	2504.01873	null
2025-04-02	Interpreting Emergent Planning in Model-Free Reinforcement Learning	Thomas Bush et.al.	2504.01871	null
2025-04-02	BOGausS: Better Optimized Gaussian Splatting	Stéphane Pateux et.al.	2504.01844	null
2025-04-02	YourBench: Easy Custom Evaluation Sets for Everyone	Sumuk Shashidhar et.al.	2504.01833	link
2025-04-02	Implicit Bias Injection Attacks against Text-to-Image Diffusion Models	Huayang Huang et.al.	2504.01819	link
2025-04-02	DISINFOX: an open-source threat exchange platform serving intelligence on disinformation and influence operations	Felipe Sánchez González et.al.	2504.01803	null
2025-04-02	The protein escape process at the ribosomal exit tunnel has conserved mechanisms across the domains of life	Phuong Thuy Bui et.al.	2504.01731	null
2025-04-02	An Adaptive Proximal Inexact Gradient Framework and Its Application to Per-Antenna Constrained Joint Beamforming and Compression Design	Xilai Fan et.al.	2504.01721	null
2025-03-31	Consistent Subject Generation via Contrastive Instantiated Concepts	Lee Hsin-Ying et.al.	2503.24387	null
2025-03-31	Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation	Shengqiong Wu et.al.	2503.24379	null
2025-03-31	InstructRestore: Region-Customized Image Restoration with Human Instructions	Shuaizheng Liu et.al.	2503.24357	link
2025-03-31	Enhancing Image Resolution of Solar Magnetograms: A Latent Diffusion Model Approach	Francesco Pio Ramunno et.al.	2503.24271	link
2025-04-01	Visual Acoustic Fields	Yuelei Li et.al.	2503.24270	null
2025-03-31	Pre-training with 3D Synthetic Data: Learning 3D Point Cloud Instance Segmentation from 3D Synthetic Scenes	Daichi Otsuka et.al.	2503.24229	null
2025-03-31	AI-Assisted Colonoscopy: Polyp Detection and Segmentation using Foundation Models	Uxue Delaquintana-Aramendi et.al.	2503.24138	link
2025-03-31	Grounding Agent Reasoning in Image Schemas: A Neurosymbolic Approach to Embodied Cognition	François Olivier et.al.	2503.24110	null
2025-03-31	Controlled Latent Diffusion Models for 3D Porous Media Reconstruction	Danilo Naiff et.al.	2503.24083	link
2025-03-31	COSMO: Combination of Selective Memorization for Low-cost Vision-and-Language Navigation	Siqi Zhang et.al.	2503.24065	null
2025-03-31	ReaLM: Reliable and Efficient Large Language Model Inference with Statistical Algorithm-Based Fault Tolerance	Tong Xie et.al.	2503.24053	link
2025-03-31	Automated Discovery of Tactic Libraries for Interactive Theorem Proving	Yutong Xin et.al.	2503.24036	null
2025-03-31	DenseFormer: Learning Dense Depth Map from Sparse Depth and Image via Conditional Diffusion Model	Ming Yuan et.al.	2503.23993	null
2025-03-31	Two-wheel-driven Electric Superbike Powertrain Optimization	Adelmo Niccolai et.al.	2503.23984	null
2025-04-02	Machine Learning-assisted High-speed Combinatorial Optimization with Ising Machines for Dynamically Changing Problems	Yohei Hamakawa et.al.	2503.23966	null
2025-03-28	DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness	Ruining Li et.al.	2503.22677	null
2025-03-28	Zero4D: Training-Free 4D Video Generation From Single Video Using Off-the-Shelf Video Diffusion Model	Jangho Park et.al.	2503.22622	null
2025-03-28	Generative Latent Neural PDE Solver using Flow Matching	Zijie Li et.al.	2503.22600	null
2025-03-28	RELD: Regularization by Latent Diffusion Models for Image Restoration	Pasquale Cascarano et.al.	2503.22563	null
2025-03-28	Deterministic Medical Image Translation via High-fidelity Brownian Bridges	Qisheng He et.al.	2503.22531	null
2025-03-28	Automated UX Insights from User Research Videos by Integrating Facial Emotion and Text Sentiment	Simran Kaur Ghatoray et.al.	2503.22510	null
2025-03-28	Scenario Dreamer: Vectorized Latent Diffusion for Generating Driving Simulation Environments	Luke Rowe et.al.	2503.22496	null
2025-03-28	GAITGen: Disentangled Motion-Pathology Impaired Gait Generative Model – Bringing Motion Generation to the Clinical Domain	Vida Adeli et.al.	2503.22397	null
2025-03-28	Volumetric Material Decomposition Using Spectral Diffusion Posterior Sampling with a Compressed Polychromatic Forward Model	Xiao Jiang et.al.	2503.22392	null
2025-03-28	Meta-LoRA: Meta-Learning LoRA Components for Domain-Aware ID Personalization	Barış Batuhan Topal et.al.	2503.22352	null
2025-03-28	GCRayDiffusion: Pose-Free Surface Reconstruction via Geometric Consistent Ray Diffusion	Li-Heng Chen et.al.	2503.22349	null
2025-03-28	Semantix: An Energy Guided Sampler for Semantic Style Transfer	Huiang He et.al.	2503.22344	null
2025-03-28	SKDU at De-Factify 4.0: Natural Language Features for AI-Generated Text-Detection	Shrikant Malviya et.al.	2503.22338	link
2025-03-28	Imperceptible but Forgeable: Practical Invisible Watermark Forgery via Diffusion Models	Ziping Dong et.al.	2503.22330	null
2025-03-28	BanglAssist: A Bengali-English Generative AI Chatbot for Code-Switching and Dialect-Handling in Customer Service	Francesco Kruk et.al.	2503.22283	null
2025-03-27	VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models	Chi-Pin Huang et.al.	2503.21781	null
2025-03-27	StyleMotif: Multi-Modal Motion Stylization using Style-Content Cross Fusion	Ziyu Guo et.al.	2503.21775	null
2025-03-27	Optimal Stepsize for Diffusion Sampling	Jianning Pei et.al.	2503.21774	link
2025-03-27	A Unified Image-Dense Annotation Generation Model for Underwater Scenes	Hongkai Lin et.al.	2503.21771	link
2025-03-27	Exploring the Evolution of Physics Cognition in Video Generation: A Survey	Minghui Lin et.al.	2503.21765	link
2025-03-27	A Unified Framework for Diffusion Bridge Problems: Flow Matching and Schrödinger Matching into One	Minyoung Kim et.al.	2503.21756	null
2025-03-27	VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness	Dian Zheng et.al.	2503.21755	link
2025-03-27	3DGen-Bench: Comprehensive Benchmark Suite for 3D Generative Models	Yuhan Zhang et.al.	2503.21745	null
2025-03-27	Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data	Zhiyuan Ma et.al.	2503.21694	link
2025-03-27	A Comprehensive Benchmark for RNA 3D Structure-Function Modeling	Luis Wyss et.al.	2503.21681	link
2025-03-27	A friendly introduction to triangular transport	Maximilian Ramgraber et.al.	2503.21673	null
2025-03-27	Audio-driven Gesture Generation via Deviation Feature in the Latent Space	Jiahui Chen et.al.	2503.21616	null
2025-03-27	Critical Iterative Denoising: A Discrete Generative Model Applied to Graphs	Yoann Boget et.al.	2503.21592	null
2025-03-27	AlignDiff: Learning Physically-Grounded Camera Alignment via Diffusion	Liuyue Xie et.al.	2503.21581	null
2025-03-27	SyncSDE: A Probabilistic Framework for Diffusion Synchronization	Hyunjun Lee et.al.	2503.21555	null
2025-03-26	Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency	Tianqi Liu et.al.	2503.20785	link
2025-03-26	FB-4D: Spatial-Temporal Coherent Dynamic 3D Content Generation with Feature Banks	Jinwei Li et.al.	2503.20784	link
2025-03-26	PUREPath-B: A Tessellated Bayesian Model for Recovering CMB B-modes over Large Angular Scales of the Sky	Vipin Sudevan et.al.	2503.20774	null
2025-03-26	Reliable algorithm selection for machine learning-guided design	Clara Fannjiang et.al.	2503.20767	null
2025-03-26	RecTable: Fast Modeling Tabular Data with Rectified Flow	Masane Fuchi et.al.	2503.20731	link
2025-03-26	Continual learning via probabilistic exchangeable sequence modelling	Hanwen Xing et.al.	2503.20725	null
2025-03-26	Dynamic Motion Blending for Versatile Motion Editing	Nan Jiang et.al.	2503.20724	null
2025-03-26	From Annotation to Adaptation: Metrics, Synthetic Data, and Aspect Extraction for Aspect-Based Sentiment Analysis with Large Language Models	Nikita Neveditsin et.al.	2503.20715	null
2025-03-26	Flow of a two-dimensional liquid foam: Impact of surfactant type and boundary conditions	Farshad Nazari et.al.	2503.20710	null
2025-03-26	BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation	Yuyang Peng et.al.	2503.20672	null
2025-03-26	ARMO: Autoregressive Rigging for Multi-Category Objects	Mingze Sun et.al.	2503.20663	null
2025-03-26	MMGen: Unified Multi-modal Image Generation and Understanding in One Go	Jiepeng Wang et.al.	2503.20644	null
2025-03-26	Diffusion Counterfactuals for Image Regressors	Trung Duc Ha et.al.	2503.20595	link
2025-03-26	Supply chain network rewiring dynamics at the firm-level	Tobias Reisch et.al.	2503.20594	link
2025-03-26	Stochastic Transport Maps in Diffusion Models and Sampling	Xicheng Zhang et.al.	2503.20573	null
2025-03-25	Learning 3D Object Spatial Relationships from Pre-trained 2D Diffusion Models	Sangwon Beak et.al.	2503.19914	null
2025-03-25	PartRM: Modeling Part-Level Dynamics with Large Cross-State Reconstruction Model	Mingju Gao et.al.	2503.19913	null
2025-03-26	AvatarArtist: Open-Domain 4D Avatarization	Hongyu Liu et.al.	2503.19906	null
2025-03-25	ICE: Intrinsic Concept Extraction from a Single Image via Diffusion Models	Fernando Julio Cendra et.al.	2503.19902	null
2025-03-25	Scaling Down Text Encoders of Text-to-Image Diffusion Models	Lifu Wang et.al.	2503.19897	link
2025-03-25	Visuo-Tactile Object Pose Estimation for a Multi-Finger Robot Hand with Low-Resolution In-Hand Tactile Sensing	Lukas Mack et.al.	2503.19893	null
2025-03-25	FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model	Jun Zhou et.al.	2503.19839	null
2025-03-25	TopoGEN: topology-driven microstructure generation for in silico modeling of fiber network mechanics	Sara Cardona et.al.	2503.19832	null
2025-03-25	IgCraft: A versatile sequence generation framework for antibody discovery and engineering	Matthew Greenig et.al.	2503.19821	link
2025-03-25	Unpaired Object-Level SAR-to-Optical Image Translation for Aircraft with Keypoints-Guided Diffusion Models	Ruixi You et.al.	2503.19798	null
2025-03-26	In the Blink of an Eye: Instant Game Map Editing using a Generative-AI Smart Brush	Vitaly Gnatyuk et.al.	2503.19793	null
2025-03-25	SITA: Structurally Imperceptible and Transferable Adversarial Attacks for Stylized Image Generation	Jingdan Kang et.al.	2503.19791	link
2025-03-25	Fine-Grained Erasure in Text-to-Image Diffusion-based Foundation Models	Kartik Thakral et.al.	2503.19783	null
2025-03-25	PCM : Picard Consistency Model for Fast Parallel Sampling of Diffusion Models	Junhyuk So et.al.	2503.19731	null
2025-03-25	CoSimGen: Controllable Diffusion Model for Simultaneous Image and Mask Generation	Rupak Bose et.al.	2503.19661	null
2025-03-24	Target-Aware Video Diffusion Models	Taeksoo Kim et.al.	2503.18950	null
2025-03-24	Equivariant Image Modeling	Ruixiao Dong et.al.	2503.18948	link
2025-03-25	Aether: Geometric-Aware Unified World Modeling	Aether Team et.al.	2503.18945	null
2025-03-24	Video-T1: Test-Time Scaling for Video Generation	Fangfu Liu et.al.	2503.18942	null
2025-03-24	Training-free Diffusion Acceleration with Bottleneck Sampling	Ye Tian et.al.	2503.18940	null
2025-03-24	SyncVP: Joint Diffusion for Synchronous Multi-Modal Video Prediction	Enrico Pallotta et.al.	2503.18933	link
2025-03-24	Entanglement swapping systems toward a quantum internet	Samantha I. Davis et.al.	2503.18906	null
2025-03-24	3DSwapping: Texture Swapping For 3D Object From Single Reference Image	Xiao Cao et.al.	2503.18853	null
2025-03-24	Dual-domain Multi-path Self-supervised Diffusion Model for Accelerated MRI Reconstruction	Yuxuan Zhang et.al.	2503.18836	null
2025-03-24	Blind structured illumination microscopy via generalized Richardson-Lucy method	Valentina Capalbo et.al.	2503.18786	null
2025-03-24	Duality Symmetry in Causality Constraints for Enhanced Acoustic Absorption	Sichao Qu et.al.	2503.18740	null
2025-03-24	RoboEngine: Plug-and-Play Robot Data Augmentation with Semantic Robot Segmentation and Background Generation	Chengbo Yuan et.al.	2503.18738	null
2025-03-24	Thermalizer: Stable autoregressive neural emulation of spatiotemporal chaos	Chris Pedersen et.al.	2503.18731	null
2025-03-24	NullSwap: Proactive Identity Cloaking Against Deepfake Face Swapping	Tianyi Wang et.al.	2503.18678	null
2025-03-24	Human Motion Unlearning	Edoardo De Matteis et.al.	2503.18674	null
2025-03-21	Position: Interactive Generative Video as Next-Generation Game Engine	Jiwen Yu et.al.	2503.17359	null
2025-03-21	Predicting Potential Customer Support Needs and Optimizing Search Ranking in a Two-Sided Marketplace	Do-kyum Kim et.al.	2503.17329	null
2025-03-21	Preference-Guided Diffusion for Multi-Objective Offline Optimization	Yashas Annadani et.al.	2503.17299	null
2025-03-21	Cross-Band Modulation Design for Hybrid RF-Optical Systems	Thrassos K. Oikonomou et.al.	2503.17296	null
2025-03-21	Offline Model-Based Optimization: Comprehensive Review	Minsu Kim et.al.	2503.17286	link
2025-03-21	Unsupervised Joint Learning of Optical Flow and Intensity with Event Cameras	Shuang Guo et.al.	2503.17262	link
2025-03-21	Deep End-to-End Posterior ENergy (DEEPEN) for image recovery	Jyothi Rikhab Chand et.al.	2503.17244	null
2025-03-21	Leveraging Text-to-Image Generation for Handling Spurious Correlation	Aryan Yazdan Parast et.al.	2503.17226	null
2025-03-21	Neuro-Symbolic Scene Graph Conditioning for Synthetic Image Dataset Generation	Giacomo Savazzi et.al.	2503.17224	null
2025-03-21	UniCon: Unidirectional Information Flow for Effective Control of Large-Scale Diffusion Models	Fanghua Yu et.al.	2503.17221	null
2025-03-21	FreeUV: Ground-Truth-Free Realistic Facial UV Texture Recovery via Cross-Assembly Inference Strategy	Xingchao Yang et.al.	2503.17197	null
2025-03-21	TreeSynth: Synthesizing Diverse Data from Scratch via Tree-Guided Subspace Partitioning	Sheng Wang et.al.	2503.17195	null
2025-03-21	ExplainitAI: When do we trust artificial intelligence? The influence of content and explainability in a cross-cultural comparison	Sora Kang et.al.	2503.17158	null
2025-03-21	D2C: Unlocking the Potential of Continuous Autoregressive Image Generation with Discrete Tokens	Panpan Wang et.al.	2503.17155	null
2025-03-21	R2LDM: An Efficient 4D Radar Super-Resolution Framework Leveraging Diffusion Model	Boyuan Zheng et.al.	2503.17097	null
2025-03-20	Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation	Yuqing Wang et.al.	2503.16430	null
2025-03-20	SynCity: Training-Free Generation of 3D Worlds	Paul Engstler et.al.	2503.16420	null
2025-03-20	DreamTexture: Shape from Virtual Texture with Analysis by Augmentation	Ananta R. Bhattarai et.al.	2503.16412	null
2025-03-20	VerbDiff: Text-Only Diffusion Models with Enhanced Interaction Awareness	SeungJu Cha et.al.	2503.16406	link
2025-03-20	ScalingNoise: Scaling Inference-Time Search for Generating Infinite Videos	Haolin Yang et.al.	2503.16400	null
2025-03-20	Scale-wise Distillation of Diffusion Models	Nikita Starodubcev et.al.	2503.16397	null
2025-03-21	SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation	Chun-Han Yao et.al.	2503.16396	null
2025-03-20	Do Visual Imaginations Improve Vision-and-Language Navigation Agents?	Akhil Perincherry et.al.	2503.16394	null
2025-03-20	LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images	Leyang Wang et.al.	2503.16376	null
2025-03-20	Heat transfer and mixing in initiated Chemical Vapor Deposition analyzed by in-situ gas composition sensing	Simon Shindler et.al.	2503.16373	null
2025-03-20	Ultra-Resolution Adaptation with Ease	Ruonan Yu et.al.	2503.16322	link
2025-03-20	Rapid patient-specific neural networks for intraoperative X-ray to volume registration	Vivek Gopalakrishnan et.al.	2503.16309	link
2025-03-20	Unleashing Vecset Diffusion Model for Fast Shape Generation	Zeqiang Lai et.al.	2503.16302	link
2025-03-20	Diffusion-augmented Graph Contrastive Learning for Collaborative Filter	Fan Huang et.al.	2503.16290	null
2025-03-20	SceneMI: Motion In-betweening for Modeling Human-Scene Interactions	Inwoo Hwang et.al.	2503.16289	null
2025-03-19	FP4DiT: Towards Effective Floating Point Quantization for Diffusion Transformers	Ruichen Chen et.al.	2503.15465	link
2025-03-19	Di $\mathtt{[M]}$ O: Distilling Masked Diffusion Models into One-step Generator	Yuanzhi Zhu et.al.	2503.15457	null
2025-03-19	MotionStreamer: Streaming Motion Generation via Diffusion-based Autoregressive Model in Causal Latent Space	Lixing Xiao et.al.	2503.15451	null
2025-03-19	LIFT: Latent Implicit Functions for Task- and Data-Agnostic Encoding	Amirhossein Kazerouni et.al.	2503.15420	null
2025-03-19	Temporal Regularization Makes Your Video Generator Stronger	Harold Haodong Chen et.al.	2503.15417	null
2025-03-19	Visual Persona: Foundation Model for Full-Body Human Customization	Jisu Nam et.al.	2503.15406	null
2025-03-19	HQNN-FSP: A Hybrid Classical-Quantum Neural Network for Regression-Based Financial Stock Market Prediction	Prashant Kumar Choudhary et.al.	2503.15403	null
2025-03-19	Online Matching under KIID: Enhanced Competitive Analysis through Ordinary Differential Equation Systems	Pan Xu et.al.	2503.15399	null
2025-03-19	CCDP: Composition of Conditional Diffusion Policies with Guided Sampling	Amirreza Razmjoo et.al.	2503.15386	null
2025-03-19	Material Decomposition in Photon-Counting Computed Tomography with Diffusion Models: Comparative Study and Hybridization with Variational Regularizers	Corentin Vazia et.al.	2503.15383	null
2025-03-19	Real-world validation of a multimodal LLM-powered pipeline for High-Accuracy Clinical Trial Patient Matching leveraging EHR data	Anatole Callies et.al.	2503.15374	link
2025-03-19	SPILL: Domain-Adaptive Intent Clustering based on Selection and Pooling with Large Language Models	I-Fan Lin et.al.	2503.15351	null
2025-03-19	Euclid Quick Data Release (Q1). Active galactic nuclei identification using diffusion-based inpainting of Euclid VIS images	Euclid Collaboration et.al.	2503.15321	null
2025-03-19	SENAI: Towards Software Engineering Native Generative Artificial Intelligence	Mootez Saad et.al.	2503.15282	null
2025-03-19	ImputeGAP: A Comprehensive Library for Time Series Imputation	Quentin Nater et.al.	2503.15250	null
2025-03-18	MusicInfuser: Making Video Diffusion Listen and Dance	Susung Hong et.al.	2503.14505	null
2025-03-18	The Power of Context: How Multimodality Improves Image Super-Resolution	Kangfu Mei et.al.	2503.14503	null
2025-03-18	Deeply Supervised Flow-Based Generative Models	Inkyu Shin et.al.	2503.14494	null
2025-03-18	Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control	NVIDIA et.al.	2503.14492	link
2025-03-18	Stable Virtual Camera: Generative View Synthesis with Diffusion Models	Jensen et.al.	2503.14489	null
2025-03-18	DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers	Minglei Shi et.al.	2503.14487	null
2025-03-18	Lux Post Facto: Learning Portrait Performance Relighting with Conditional Video Diffusion and a Hybrid Dataset	Yiqun Mei et.al.	2503.14485	null
2025-03-18	ICE-Bench: A Unified and Comprehensive Benchmark for Image Creating and Editing	Yulin Pan et.al.	2503.14482	null
2025-03-18	SIR-DIFF: Sparse Image Sets Restoration with Multi-View Diffusion Model	Yucheng Mao et.al.	2503.14463	null
2025-03-18	The Atacama Cosmology Telescope: DR6 Constraints on Extended Cosmological Models	Erminia Calabrese et.al.	2503.14454	null
2025-03-18	Bolt3D: Generating 3D Scenes in Seconds	Stanislaw Szymanowicz et.al.	2503.14445	null
2025-03-18	MagicComp: Training-free Dual-Phase Refinement for Compositional Video Generation	Hongyu Zhang et.al.	2503.14428	null
2025-03-18	Diffusion-based Facial Aesthetics Enhancement with 3D Structure Guidance	Lisha Li et.al.	2503.14402	null
2025-03-18	A Comprehensive Scatter Correction Model for Micro-Focus Dual-Source Imaging Systems: Combining Ambient, Cross, and Forward Scatter	Jianing Sun et.al.	2503.14386	null
2025-03-18	Impossible Videos	Zechen Bai et.al.	2503.14378	null
2025-03-17	Amodal3R: Amodal 3D Reconstruction from Occluded 2D Images	Tianhao Wu et.al.	2503.13439	null
2025-03-17	Infinite Mobility: Scalable High-Fidelity Synthesis of Articulated Objects via Procedural Generation	Xinyu Lian et.al.	2503.13424	null
2025-03-17	Securing Virtual Reality Experiences: Unveiling and Tackling Cybersickness Attacks with Explainable AI	Ripan Kumar Kundu et.al.	2503.13419	null
2025-03-17	Cream of the Crop: Harvesting Rich, Scalable and Transferable Multi-Modal Data for Instruction Fine-Tuning	Mengyao Lyu et.al.	2503.13383	null
2025-03-17	One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation	Daniil Selikhanovych et.al.	2503.13358	null
2025-03-17	A 1.8 m class pathfinder Raman LIDAR for the Northern Site of the Cherenkov Telescope Array Observatory – Technical Design	Otger Ballester et.al.	2503.13349	null
2025-03-17	Artificial Intelligence-Driven Prognostic Classification of COVID-19 Using Chest X-rays: A Deep Learning Approach	Alfred Simbun et.al.	2503.13277	null
2025-03-17	Generative Gaussian Splatting: Generating 3D Scenes with Video Diffusion Priors	Katja Schwarz et.al.	2503.13272	null
2025-03-17	Graph Generative Models Evaluation with Masked Autoencoder	Chengen Wang et.al.	2503.13271	null
2025-03-17	FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis	Luxi Chen et.al.	2503.13265	null
2025-03-17	Dense Policy: Bidirectional Autoregressive Learning of Actions	Yue Su et.al.	2503.13217	null
2025-03-17	MedLoRD: A Medical Low-Resource Diffusion Model for High-Resolution 3D CT Image Synthesis	Marvin Seyfarth et.al.	2503.13211	null
2025-03-17	Patient-specific radiomic feature selection with reconstructed healthy persona of knee MR images	Yaxi Chen et.al.	2503.13131	null
2025-03-17	3D Human Interaction Generation: A Survey	Siyuan Fan et.al.	2503.13120	null
2025-03-17	DTGBrepGen: A Novel B-rep Generative Model through Decoupling Topology and Geometry	Jing Li et.al.	2503.13110	link
2025-03-14	From few to many maps: A fast map-level emulator for extreme augmentation of CMB systematics datasets	P. Campeti et.al.	2503.11643	link
2025-03-14	Gradient-bridged Posterior: Bayesian Inference for Models with Implicit Functions	Cheng Zeng et.al.	2503.11637	null
2025-03-14	Pathology Image Compression with Pre-trained Autoencoders	Srikar Yellapragada et.al.	2503.11591	null
2025-03-14	Dynamics of a coupled nonlocal PDE-ODE system with spatial memory: well-posedness, stability, and bifurcation analysis	Yurij Salmaniw et.al.	2503.11550	null
2025-03-14	AugGen: Synthetic Augmentation Can Improve Discriminative Models	Parsa Rahimi et.al.	2503.11544	null
2025-03-14	Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models	Hao Cheng et.al.	2503.11519	null
2025-03-14	Perfect Stabilization of Biomolecular Adhesions under Load	Anton F. Burnet et.al.	2503.11510	null
2025-03-14	Exponential Quantum Advantage for Simulating Open Classical Systems	Agi Villanyi et.al.	2503.11483	null
2025-03-14	T2I-FineEval: Fine-Grained Compositional Metric for Text-to-Image Evaluation	Seyed Mohammad Hadi Hosseini et.al.	2503.11481	null
2025-03-14	Integrating LLMs in Gamified Systems	Carlos J. Costa et.al.	2503.11458	null
2025-03-14	Extending Ambient Pressure X-ray Photoelectron Spectroscopy to Plasma Studies: A novel and flexible plasma gun approach	Yang Gu et.al.	2503.11446	null
2025-03-14	TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation	Hongxiang Zhao et.al.	2503.11423	null
2025-03-14	MTV-Inpaint: Multi-Task Long Video Inpainting	Shiyuan Yang et.al.	2503.11412	null
2025-03-14	Towards A Correct Usage of Cryptography in Semantic Watermarks for Diffusion Models	Jonas Thietke et.al.	2503.11404	null
2025-03-14	BEVDiffLoc: End-to-End LiDAR Global Localization in BEV View based on Diffusion Model	Ziyue Wang et.al.	2503.11372	link
2025-03-13	GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing	Rongyao Fang et.al.	2503.10639	link
2025-03-13	Studying Classifier(-Free) Guidance From a Classifier-Centric Perspective	Xiaoming Zhao et.al.	2503.10638	null
2025-03-14	Distilling Diversity and Control in Diffusion Models	Rohit Gandikota et.al.	2503.10637	null
2025-03-13	HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model	Jiaming Liu et.al.	2503.10631	null
2025-03-13	NIL: No-data Imitation Learning by Leveraging Pre-trained Video Diffusion Models	Mert Albaba et.al.	2503.10626	null
2025-03-13	DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation	Chen Chen et.al.	2503.10618	null
2025-03-13	MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction	Yingshuang Zou et.al.	2503.10604	null
2025-03-13	CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models	Hao He et.al.	2503.10592	null
2025-03-13	Long Context Tuning for Video Generation	Yuwei Guo et.al.	2503.10589	null
2025-03-13	Sample and Map from a Single Convex Potential: Generation using Conjugate Moment Measures	Nina Vesseron et.al.	2503.10576	null
2025-03-13	MASQUE: A Text-Guided Diffusion-Based Framework for Localized and Customized Adversarial Makeup	Youngjin Kwon et.al.	2503.10549	null
2025-03-13	Conformal Prediction Sets for Deep Generative Models via Reduction to Conformal Regression	Hooman Shahrokhi et.al.	2503.10512	null
2025-03-13	Streaming Generation of Co-Speech Gestures via Accelerated Rolling Diffusion	Evgeniia Vu et.al.	2503.10488	null
2025-03-13	Applying Tabular Deep Learning Models to Estimate Crash Injury Types of Young Motorcyclists	Shriyank Somvanshi et.al.	2503.10474	null
2025-03-13	Finetuning Generative Trajectory Model with Reinforcement Learning from Human Feedback	Derun Li et.al.	2503.10434	null
2025-03-12	PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop	Chenyu Li et.al.	2503.09595	link
2025-03-12	Minimax Optimality of the Probability Flow ODE for Diffusion Models	Changxiao Cai et.al.	2503.09583	null
2025-03-12	Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models	Marianne Arriola et.al.	2503.09573	link
2025-03-12	TPDiff: Temporal Pyramid Video Diffusion Model	Lingmin Ran et.al.	2503.09566	null
2025-03-12	FCaS: Fine-grained Cardiac Image Synthesis based on 3D Template Conditional Diffusion Model	Jiahao Xia et.al.	2503.09560	null
2025-03-12	GenHPE: Generative Counterfactuals for 3D Human Pose Estimation with Radio Frequency Signals	Shuokang Huang et.al.	2503.09537	null
2025-03-12	Total Ionizing Dose Measurements in Small Satellites in LEO using LabOSat-01	Lucas Finazzi et.al.	2503.09520	null
2025-03-12	CM-Diff: A Single Generative Network for Bidirectional Cross-Modality Translation Diffusion Model Between Infrared and Visible Images	Bin Hu et.al.	2503.09514	null
2025-03-12	DAMM-Diffusion: Learning Divergence-Aware Multi-Modal Diffusion Model for Nanoparticles Distribution Prediction	Junjie Zhou et.al.	2503.09491	link
2025-03-12	Hybrid Rendering for Multimodal Autonomous Driving: Merging Neural and Physics-Based Simulation	Máté Tóth et.al.	2503.09464	null
2025-03-12	How Well Does Your Tabular Generator Learn the Structure of Tabular Data?	Xiangjian Jiang et.al.	2503.09453	link
2025-03-12	Sparse Autoencoder as a Zero-Shot Classifier for Concept Erasing in Text-to-Image Diffusion Models	Zhihua Tian et.al.	2503.09446	link
2025-03-12	SuperCarver: Texture-Consistent 3D Geometry Super-Resolution for High-Fidelity Surface Detail Generation	Qijian Zhang et.al.	2503.09439	null
2025-03-12	Alias-Free Latent Diffusion Models:Improving Fractional Shift Equivariance of Diffusion Latent Space	Yifan Zhou et.al.	2503.09419	link
2025-03-12	Diff-CL: A Novel Cross Pseudo-Supervision Method for Semi-supervised Medical Image Segmentation	Xiuzhen Guo et.al.	2503.09408	null
2025-03-11	OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models	Jialv Zou et.al.	2503.08686	link
2025-03-11	GarmentCrafter: Progressive Novel View Synthesis for Single-View 3D Garment Reconstruction and Editing	Yuanhao Wang et.al.	2503.08678	null
2025-03-12	OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting	Yongsheng Yu et.al.	2503.08677	null
2025-03-11	Language-Depth Navigated Thermal and Visible Image Fusion	Jinchang Zhang et.al.	2503.08676	null
2025-03-11	Keypoint Detection and Description for Raw Bayer Images	Jiakai Lin et.al.	2503.08673	null
2025-03-11	Modeling Stock Return Distributions and Pricing Options	Xinxin Jiang et.al.	2503.08666	null
2025-03-11	REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder	Yitian Zhang et.al.	2503.08665	null
2025-03-11	MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention	Yuhan Wang et.al.	2503.08664	link
2025-03-11	MF-VITON: High-Fidelity Mask-Free Virtual Try-On with Minimal Input	Zhenchen Wan et.al.	2503.08650	null
2025-03-11	Rethinking Diffusion Model in High Dimension	Zhenxin Zheng et.al.	2503.08643	link
2025-03-11	Efficient Many-Shot In-Context Learning with Dynamic Block-Sparse Attention	Emily Xiao et.al.	2503.08640	link
2025-03-11	LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization	Xianfeng Wu et.al.	2503.08619	link
2025-03-11	Tuning-Free Multi-Event Long Video Generation via Synchronized Coupled Sampling	Subin Kim et.al.	2503.08605	null
2025-03-11	3D Point Cloud Generation via Autoregressive Up-sampling	Ziqiao Meng et.al.	2503.08594	null
2025-03-11	Proc4Gem: Foundation models for physical agency through procedural generation	Yixin Lin et.al.	2503.08593	null
2025-03-10	GenAIReading: Augmenting Human Cognition with Interactive Digital Textbooks Using Large Language Models and Image Generation Models	Ryugo Morita et.al.	2503.07463	null
2025-03-10	Advancing our Understanding of Optoionic Effects for the Design of Solar Batteries: A Theoretical Perspective	Matteo Rinaldi et.al.	2503.07460	null
2025-03-10	Is a Good Foundation Necessary for Efficient Reinforcement Learning? The Computational Role of the Base Model in Exploration	Dylan J. Foster et.al.	2503.07453	null
2025-03-10	DRESS: Diffusion Reasoning-based Reward Shaping Scheme For Intelligent Networks	Feiran You et.al.	2503.07433	link
2025-03-10	AR-Diffusion: Asynchronous Video Generation with Auto-Regressive Diffusion	Mingzhen Sun et.al.	2503.07418	null
2025-03-10	TimeStep Master: Asymmetrical Mixture of Timestep LoRA Experts for Versatile and Efficient Diffusion Models in Vision	Shaobin Zhuang et.al.	2503.07416	null
2025-03-10	SPEED: Scalable, Precise, and Efficient Concept Erasure for Diffusion Models	Ouxiang Li et.al.	2503.07392	link
2025-03-10	PersonaBooth: Personalized Text-to-Motion Generation	Boeun Kim et.al.	2503.07390	null
2025-03-10	TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models	Ruidong Chen et.al.	2503.07389	link
2025-03-10	RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing	Yiqing Xie et.al.	2503.07358	link
2025-03-10	AttenST: A Training-Free Attention-Driven Style Transfer Framework with Pre-Trained Diffusion Models	Bo Huang et.al.	2503.07307	link
2025-03-10	Cool-3D: An End-to-End Thermal-Aware Framework for Early-Phase Design Space Exploration of Microfluidic-Cooled 3DICs	Runxi Wang et.al.	2503.07297	link
2025-03-10	Efficient Distillation of Classifier-Free Guidance using Adapters	Cristian Perez Jensen et.al.	2503.07274	link
2025-03-10	Customized SAM 2 for Referring Remote Sensing Image Segmentation	Fu Rong et.al.	2503.07266	null
2025-03-11	AnomalyPainter: Vision-Language-Diffusion Synergy for Zero-Shot Realistic and Diverse Industrial Anomaly Synthesis	Zhangyu Lai et.al.	2503.07253	null
2025-03-07	AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data	Zengqun Zhao et.al.	2503.05665	link
2025-03-07	TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models	Mark YU et.al.	2503.05638	null
2025-03-07	A functional approach for curve alignment and shape analysis	Issam-Ali Moindjié et.al.	2503.05632	null
2025-03-07	Geometric Optimization of Patterned Conductive Polymer Composite-based Strain Sensors Toward Enhanced Sensing Performance	Jia-Chen Shang et.al.	2503.05603	null
2025-03-07	Diffusion Models for Cayley Graphs	Michael R. Douglas et.al.	2503.05558	null
2025-03-07	Radio Frequency from Optical with Instabilities below $10^{-15}$ - Generation and Measurement	A. Hati et.al.	2503.05547	null
2025-03-10	*Accelerating db-A for Kinodynamic Motion Planning Using Diffusion**	Julius Franke et.al.	2503.05539	null
2025-03-07	Post-Hoc Concept Disentanglement: From Correlated to Isolated Concept Representations	Eren Erogullari et.al.	2503.05522	link
2025-03-07	Noise-Robust Radio Frequency Fingerprint Identification Using Denoise Diffusion Model	Guolin Yin et.al.	2503.05514	null
2025-03-07	Localized necking under global compression in two-scale metallic hierarchical solids	Naresh Chockalingam S. et.al.	2503.05498	null
2025-03-07	Umbilical Choir: Automated Live Testing for Edge-To-Cloud FaaS Applications	Mohammadreza Malekabbasi et.al.	2503.05495	link
2025-03-07	Statistical Deficiency for Task Inclusion Estimation	Loïc Fosse et.al.	2503.05491	null
2025-03-07	De Novo Design of Protein-Binding Peptides by Quantum Computing	Lars Meuser et.al.	2503.05458	null
2025-03-07	VLMs Play StarCraft II: A Benchmark and Multimodal Decision Method	Weiyu Ma et.al.	2503.05383	link
2025-03-07	PhysicsGen: Can Generative Models Learn from Images to Predict Complex Physical Relations?	Martin Spitznagel et.al.	2503.05333	null
2025-03-06	Compositional World Knowledge leads to High Utility Synthetic data	Sachit Gaudi et.al.	2503.04687	null
2025-03-06	What Are You Doing? A Closer Look at Controllable Human Video Generation	Emanuele Bugliarello et.al.	2503.04666	null
2025-03-06	Risk-aware Trading Portfolio Optimization	Marco Bianchetti et.al.	2503.04662	null
2025-03-06	IFIR: A Comprehensive Benchmark for Evaluating Instruction-Following in Expert-Domain Information Retrieval	Tingyu Song et.al.	2503.04644	null
2025-03-06	Simulating the Real World: A Unified Survey of Multimodal Generative Models	Yuqi Hu et.al.	2503.04641	link
2025-03-06	3HANDS Dataset: Learning from Humans for Generating Naturalistic Handovers with Supernumerary Robotic Limbs	Artin Saberpour Abadian et.al.	2503.04635	null
2025-03-06	The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation	Aoxiong Yin et.al.	2503.04606	link
2025-03-07	Method for recovering data on unreported low-severity crashes	Alberto Morando et.al.	2503.04529	null
2025-03-06	Learning Object Placement Programs for Indoor Scene Synthesis with Iterative Self Training	Adrian Chang et.al.	2503.04496	null
2025-03-06	InfoSEM: A Deep Generative Model with Informative Priors for Gene Regulatory Network Inference	Tianyu Cui et.al.	2503.04483	null
2025-03-06	ToolFuzz – Automated Agent Tool Testing	Ivan Milev et.al.	2503.04479	null
2025-03-06	Semantic Alignment of Unimodal Medical Text and Vision Representations	Maxime Di Folco et.al.	2503.04478	null
2025-03-06	PALo: Learning Posture-Aware Locomotion for Quadruped Robots	Xiangyu Miao et.al.	2503.04462	null
2025-03-06	Polling on a circle with non-uniform batch arrivals	Tim Engels et.al.	2503.04448	null
2025-03-06	Can Large Language Models Predict Antimicrobial Resistance Gene?	Hyunwoo Yoo et.al.	2503.04413	null
2025-03-05	Rethinking Video Tokenization: A Conditioned Diffusion-based Approach	Nianzu Yang et.al.	2503.03708	link
2025-03-05	DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance	Zhao Yang et.al.	2503.03689	link
2025-03-05	Attentive Reasoning Queries: A Systematic Method for Optimizing Instruction-Following in Large Language Models	Bar Karov et.al.	2503.03669	link
2025-03-05	A Generative Approach to High Fidelity 3D Reconstruction from Text Data	Venkat Kumar R et.al.	2503.03664	null
2025-03-05	DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles	Rui Zhao et.al.	2503.03651	link
2025-03-05	Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias	Rui Lu et.al.	2503.03595	null
2025-03-05	Generative Artificial Intelligence in Robotic Manipulation: A Survey	Kun Zhang et.al.	2503.03464	null
2025-03-05	Predicting Practically? Domain Generalization for Predictive Analytics in Real-world Environments	Hanyu Duan et.al.	2503.03399	link
2025-03-05	Top-K Maximum Intensity Projection Priors for 3D Liver Vessel Segmentation	Xiaotong Zhang et.al.	2503.03367	null
2025-03-05	Video Super-Resolution: All You Need is a Video Diffusion Model	Zhihao Zhan et.al.	2503.03355	null
2025-03-05	Label-Efficient LiDAR Semantic Segmentation with 2D-3D Vision Transformer Adapters	Julia Hindel et.al.	2503.03299	null
2025-03-05	Group Delay Dispersion Measurements of Novel Multilayer Interference Coatings in the Mid-Infrared Spectral Regime	Ulrich Galander et.al.	2503.03289	null
2025-03-06	Optimizing for the Shortest Path in Denoising Diffusion Model	Ping Chen et.al.	2503.03265	link
2025-03-05	Mean Field Game of Controls with State Reflections: Existence and Limit Theory	Lijun Bo et.al.	2503.03253	null
2025-03-05	GenColor: Generative Color-Concept Association in Visual Design	Yihan Hou et.al.	2503.03236	null
2025-03-04	ARINAR: Bi-Level Autoregressive Feature-by-Feature Generative Models	Qinyu Zhao et.al.	2503.02883	link
2025-03-04	SeqFusion: Sequential Fusion of Pre-Trained Models for Zero-Shot Time-Series Forecasting	Ting-Ji Huang et.al.	2503.02836	link
2025-03-04	A Multimodal Symphony: Integrating Taste and Sound through Generative AI	Matteo Spanio et.al.	2503.02823	link
2025-03-04	Feynman-Kac Correctors in Diffusion: Annealing, Guidance, and Product of Experts	Marta Skreta et.al.	2503.02819	link
2025-03-04	“What If Smart Homes Could See Our Homes?”: Exploring DIY Smart Home Building Experiences with VLM-Based Camera Sensors	Sojeong Yun et.al.	2503.02816	null
2025-03-04	Generating Reliable Initial Velocity Models for Full-waveform Inversion with Well and Structural Constraints	Qingchen Zhang et.al.	2503.02815	null
2025-03-04	Applying Computational Engineering Modelling to Analyse the Social Impact of Conflict and Violent Events	Felix Schwebel et.al.	2503.02771	null
2025-03-04	Revolutionizing Command Interface: Maximizing Control Efficiency in INO ICAL Experiment with UDP Protocol	Yuvaraj Elangovan et.al.	2503.02751	null
2025-03-04	Seeded Poisson Factorization: Leveraging domain knowledge to fit topic models	Bernd Prostmaier et.al.	2503.02741	link
2025-03-04	Variable-Friction In-Hand Manipulation for Arbitrary Objects via Diffusion-Based Imitation Learning	Qiyang Yan et.al.	2503.02738	null
2025-03-04	Zero-Shot Complex Question-Answering on Long Scientific Documents	Wanting Wang et.al.	2503.02695	link
2025-03-04	Generative Modeling of Microweather Wind Velocities for Urban Air Mobility	Tristan A. Shah et.al.	2503.02690	link
2025-03-04	A user-friendly SPARQL query editor powered by lightweight metadata	Vincent Emonet et.al.	2503.02688	link
2025-03-04	Cellular Automaton With CNN	Valery Ashu et.al.	2503.02652	link
2025-03-04	Xavier: Toward Better Coding Assistance in Authoring Tabular Data Wrangling Scripts	Yunfan Zhou et.al.	2503.02639	null
2025-02-28	How far can we go with ImageNet for Text-to-Image generation?	L. Degeorge et.al.	2502.21318	null
2025-02-28	Raccoon: Multi-stage Diffusion Training with Coarse-to-Fine Curating Videos	Zhiyu Tan et.al.	2502.21314	null
2025-02-28	Does Generation Require Memorization? Creative Diffusion Models using Ambient Diffusion	Kulin Shah et.al.	2502.21278	null
2025-02-28	Dynamic Markov Blanket Detection for Macroscopic Physics Discovery	Jeff Beck et.al.	2502.21217	link
2025-02-28	AMPLE: Event-Driven Accelerator for Mixed-Precision Inference of Graph Neural Networks	Pedro Gimenes et.al.	2502.21196	null
2025-02-28	Joint Modeling in Recommendations: A Survey	Xiangyu Zhao et.al.	2502.21195	null
2025-02-28	SYN-LUNGS: Towards Simulating Lung Nodules with Anatomy-Informed Digital Twins for AI Training	Fakrul Islam Tushar et.al.	2502.21187	null
2025-02-28	A Review on Generative AI For Text-To-Image and Image-To-Image Generation and Implications To Scientific Images	Zineb Sordo et.al.	2502.21151	null
2025-02-28	Rare event modeling with self-regularized normalizing flows: what can we learn from a single failure?	Charles Dawson et.al.	2502.21110	null
2025-02-28	Spatial Reasoning with Denoising Models	Christopher Wewer et.al.	2502.21075	null
2025-02-28	GUIDE: LLM-Driven GUI Generation Decomposition for Automated Prototyping	Kristian Kolthoff et.al.	2502.21068	null
2025-02-28	Synthesizing Individualized Aging Brains in Health and Disease with Generative Models and Parallel Transport	Jingru Fu et.al.	2502.21049	link
2025-02-28	Toward interoperable representation and sharing of disinformation incidents in cyber threat intelligence	Felipe Sánchez González et.al.	2502.20997	link
2025-02-28	Generative Uncertainty in Diffusion Models	Metod Jazbec et.al.	2502.20946	null
2025-02-28	DiffBrush:Just Painting the Art by Your Hands	Jiaming Chu et.al.	2502.20904	null
2025-02-27	InterMimic: Towards Universal Whole-Body Control for Physics-Based Human-Object Interactions	Sirui Xu et.al.	2502.20390	link
2025-02-27	Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation	Sucheng Ren et.al.	2502.20388	link
2025-02-27	Tight Inversion: Image-Conditioned Inversion for Real Image Editing	Edo Kadosh et.al.	2502.20376	null
2025-02-27	Constrained Generative Modeling with Manually Bridged Diffusion Models	Saeid Naderiparizi et.al.	2502.20371	null
2025-02-27	ACCORD: Application Context-aware Cross-layer Optimization and Resource Design for 5G/NextG Machine-centric Applications	Azuka Chiejina et.al.	2502.20320	null
2025-02-27	FlexVAR: Flexible Visual Autoregressive Modeling without Residual Prediction	Siyu Jiao et.al.	2502.20313	link
2025-02-27	Mobius: Text to Seamless Looping Video Generation via Latent Shift	Xiuli Bi et.al.	2502.20307	link
2025-02-27	Explainable, Multi-modal Wound Infection Classification from Images Augmented with Generated Captions	Palawat Busaranuvong et.al.	2502.20277	null
2025-02-27	Do computer vision foundation models learn the low-level characteristics of the human visual system?	Yancheng Cai et.al.	2502.20256	null
2025-02-28	Beyond Natural Language Perplexity: Detecting Dead Code Poisoning in Code Generation Datasets	Chi-Chien Tsai et.al.	2502.20246	null
2025-02-27	From Retrieval to Generation: Comparing Different Approaches	Abdelrahman Abdallah et.al.	2502.20245	null
2025-02-27	Attention Distillation: A Unified Approach to Visual Characteristics Transfer	Yang Zhou et.al.	2502.20235	link
2025-02-27	AI Will Always Love You: Studying Implicit Biases in Romantic AI Companions	Clare Grogan et.al.	2502.20231	link
2025-02-27	Model Checking Linear Temporal Logic with Standpoint Modalities	Rajab Aghamov et.al.	2502.20193	null
2025-02-27	Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think	Liang Chen et.al.	2502.20172	link
2025-02-26	Multi-modal Contrastive Learning for Tumor-specific Missing Modality Synthesis	Minjoo Lim et.al.	2502.19390	null
2025-02-26	Deep Learning For Time Series Analysis With Application On Human Motion	Ali Ismail-Fawaz et.al.	2502.19364	null
2025-02-26	Shh, don’t say that! Domain Certification in LLMs	Cornelius Emde et.al.	2502.19320	null
2025-02-26	AI-Powered Bayesian Inference	Veronika Ročková et.al.	2502.19231	null
2025-02-26	HDM: Hybrid Diffusion Model for Unified Image Anomaly Detection	Zekang Weng et.al.	2502.19200	null
2025-02-27	INFO-SEDD: Continuous Time Markov Chains as Scalable Information Metrics Estimators	Alberto Foresti et.al.	2502.19183	null
2025-02-26	A Model-Centric Review of Deep Learning for Protein Design	Gregory W. Kyro et.al.	2502.19173	null
2025-02-27	RetinaRegen: A Hybrid Model for Readability and Detail Restoration in Fundus Images	Yuhan Tang et.al.	2502.19153	null
2025-02-26	Identification Under the Semantic Effective Secrecy Constraint	Abdalla Ibrahim et.al.	2502.19142	null
2025-02-26	Improving customer service with automatic topic detection in user emails	Bojana Bašaragin et.al.	2502.19115	null
2025-02-26	Modulation of the galactic cosmic ray spectrum in an anisotropic diffusion approach	V. D. Borisov et.al.	2502.19062	null
2025-02-26	A Dual-Purpose Framework for Backdoor Defense and Backdoor Amplification in Diffusion Models	Vu Tuan Truong Long et.al.	2502.19047	null
2025-02-26	OneRec: Unifying Retrieve and Rank with Generative Recommender and Iterative Preference Alignment	Jiaxin Deng et.al.	2502.18965	null
2025-02-26	DualSpec: Text-to-spatial-audio Generation via Dual-Spectrogram Guided Diffusion Model	Lei Zhao et.al.	2502.18952	null
2025-02-26	A Novel Topology Recovery Method for Low Voltage Distribution Networks	Sina Mohammadi et.al.	2502.18939	null
2025-02-25	K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs	Ziheng Ouyang et.al.	2502.18461	null
2025-02-25	ToMCAT: Theory-of-Mind for Cooperative Agents in Teams via Multiagent Diffusion Policies	Pedro Sequeira et.al.	2502.18438	null
2025-02-25	Sparse Bayesian Generative Modeling for Joint Parameter and Channel Estimation	Benedikt Böck et.al.	2502.18369	null
2025-02-25	ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation	Yifan Pu et.al.	2502.18364	null
2025-02-25	Stretchable Capacitive and Resistive Strain Sensors: Accessible Manufacturing Using Direct Ink Writing	Lukas Cha et.al.	2502.18363	null
2025-02-25	Towards softerware: Enabling personalization of interactive data representations for users with disabilities	Frank Elavsky et.al.	2502.18348	link
2025-02-25	LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation	Pengzhi Li et.al.	2502.18302	null
2025-02-26	Bayesian Computation in Deep Learning	Wenlong Chen et.al.	2502.18300	null
2025-02-26	Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support	Guoxin Wang et.al.	2502.18274	link
2025-02-25	Imperfect Knowledge Management (IKM) in GEFRED (GENeralized model for Fuzzy RElational Databases)	Leoncio Jimenez et.al.	2502.18255	null
2025-02-25	A 3D Printed Quad-Ridged Flared Horn Antenna Feeder for Radio-Telescopes	Andreas Hofmann et.al.	2502.18243	null
2025-02-25	Causal AI-based Root Cause Identification: Research to Practice at Scale	Saurabh Jha et.al.	2502.18240	null
2025-02-25	Beyond the convexity assumption: Realistic tabular data generation under quantifier-free real linear constraints	Mihaela Cătălina Stoian et.al.	2502.18237	link
2025-02-25	Principled priors for Bayesian inference of circular models	Xiang Ye et.al.	2502.18223	null
2025-02-25	UASTrack: A Unified Adaptive Selection Framework with Modality-Customization in Single Object Tracking	He Wang et.al.	2502.18220	null
2025-02-24	Fractal Generative Models	Tianhong Li et.al.	2502.17437	link
2025-02-24	GCC: Generative Color Constancy via Diffusing a Color Checker	Chen-Wei Chang et.al.	2502.17435	null
2025-02-24	S4S: Solving for a Diffusion Model Solver	Eric Frankel et.al.	2502.17423	null
2025-02-24	X-Dancer: Expressive Music to Human Dance Video Generation	Zeyuan Chen et.al.	2502.17414	null
2025-02-24	What is a Good Question? Utility Estimation with LLM-based Simulations	Dong-Ho Lee et.al.	2502.17383	null
2025-02-25	KV-Edit: Training-Free Image Editing for Precise Background Preservation	Tianrui Zhu et.al.	2502.17363	link
2025-02-24	RELICT: A Replica Detection Framework for Medical Image Generation	Orhun Utku Aydin et.al.	2502.17360	link
2025-02-24	How Scientists Use Large Language Models to Program	Gabrielle O’Brien et.al.	2502.17348	null
2025-02-24	AnyTop: Character Animation Diffusion with Any Topology	Inbar Gat et.al.	2502.17327	link
2025-02-24	Turning Conversations into Workflows: A Framework to Extract and Evaluate Dialog Workflows for Service AI Agents	Prafulla Kumar Choubey et.al.	2502.17321	null
2025-02-24	Robust Federated Learning in Unreliable Wireless Networks: A Client Selection Approach	Yanmeng Wang et.al.	2502.17260	null
2025-02-24	VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing	Xiangpeng Yang et.al.	2502.17258	null
2025-02-24	Learning Image Fractals Using Chaotic Differentiable Point Splatting	Adarsh Djeacoumar et.al.	2502.17230	null
2025-02-24	Dimitra: Audio-driven Diffusion model for Expressive Talking Head Generation	Baptiste Chopin et.al.	2502.17198	null
2025-02-24	Unsupervised Accelerated MRI Reconstruction via Ground-Truth-Free Flow Matching	Xinzhe Luo et.al.	2502.17174	null
2025-02-21	One-step Diffusion Models with $f$ -Divergence Distribution Matching	Yilun Xu et.al.	2502.15681	null
2025-02-21	VaViM and VaVAM: Autonomous Driving through Video Generative Modeling	Florent Bartoccioni et.al.	2502.15672	link
2025-02-21	Overview of the data acquisition system architecture for the DarkSide-20k experiment	Maria Adriana Sabia et.al.	2502.15651	null
2025-02-21	WorldCraft: Photo-Realistic 3D World Creation and Customization via LLM Agents	Xinhang Liu et.al.	2502.15601	null
2025-02-21	Chats-Grid: An Iterative Retrieval Q&A Optimization Scheme Leveraging Large Model and Retrieval Enhancement Generation in smart grid	Yunfeng Li et.al.	2502.15583	null
2025-02-21	Enhancing RWKV-based Language Models for Long-Sequence Text Generation	Xinghan Pan et.al.	2502.15485	link
2025-02-21	Development and Performance Validation of a Versatile VLBI Digital Backend Using the ROACH2 Platform	Jiyun Li et.al.	2502.15446	null
2025-02-21	Modeling Infectious Diseases: From SIR Models to Diffusion-Based Approaches and Numerical Solutions	Ayesha Baig et.al.	2502.15439	null
2025-02-21	Efficiently Solving Discounted MDPs with Predictions on Transition Matrices	Lixing Lyu et.al.	2502.15345	null
2025-02-21	Bridging Bug Localization and Issue Fixing: A Hierarchical Localization Framework Leveraging Large Language Models	Jianming Chang et.al.	2502.15292	null
2025-02-21	BundleFlow: Deep Menus for Combinatorial Auctions by Diffusion-Based Optimization	Tonghan Wang et.al.	2502.15283	null
2025-02-21	CopyJudge: Automated Copyright Infringement Identification and Mitigation in Text-to-Image Diffusion Models	Shunchang Liu et.al.	2502.15278	null
2025-02-21	On the (In)Security of Non-resettable Device Identifiers in Custom Android Systems	Zikan Dong et.al.	2502.15270	null
2025-02-21	User Experience with LLM-powered Conversational Recommendation Systems: A Case of Music Recommendation	Sojeong Yun et.al.	2502.15229	null
2025-02-21	Lung-DDPM: Semantic Layout-guided Diffusion Models for Thoracic CT Image Synthesis	Yifan Jiang et.al.	2502.15204	link
2025-02-20	Improving the Diffusability of Autoencoders	Ivan Skorokhodov et.al.	2502.14831	null
2025-02-20	A Survey on Text-Driven 360-Degree Panorama Generation	Hai Wang et.al.	2502.14799	null
2025-02-20	Real-Time Device Reach Forecasting Using HLL and MinHash Data Sketches	Chandrashekar Muniyappa et.al.	2502.14785	null
2025-02-20	DC-ControlNet: Decoupling Inter- and Intra-Element Conditions in Image Generation with Diffusion Models	Hongji Yang et.al.	2502.14779	null
2025-02-20	Multi-dataset synergistic in supervised learning to pre-label structural components in point clouds from shell construction scenes	Lukas Rauch et.al.	2502.14721	null
2025-02-20	ReQFlow: Rectified Quaternion Flow for Efficient and High-Quality Protein Backbone Generation	Angxiao Yue et.al.	2502.14637	link
2025-02-20	A Theory for Conditional Generative Modeling on Multiple Data Sources	Rongzhen Wang et.al.	2502.14583	link
2025-02-20	Multiscale Byte Language Models – A Hierarchical Architecture for Causal Million-Length Sequence Modeling	Eric Egli et.al.	2502.14553	link
2025-02-20	Dynamic Preference-based Multi-modal Trip Planning of Public Transport and Shared Mobility	Yimeng Zhang et.al.	2502.14528	null
2025-02-20	How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?	Sergey Pletenev et.al.	2502.14502	link
2025-02-20	StructFlowBench: A Structured Flow Benchmark for Multi-turn Instruction Following	Jinnan Li et.al.	2502.14494	link
2025-02-20	How Jailbreak Defenses Work and Ensemble? A Mechanistic Investigation	Zhuohang Long et.al.	2502.14486	null
2025-02-20	Algorithms for min-buying in networks	Aaditya Bhardwaj et.al.	2502.14459	null
2025-02-20	PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data	Shijie Huang et.al.	2502.14397	link
2025-02-20	Enhancing Portuguese Variety Identification with Cross-Domain Approaches	Hugo Sousa et.al.	2502.14394	null
2025-02-19	IP-Composer: Semantic Composition of Visual Concepts	Sara Dorfman et.al.	2502.13951	null
2025-02-19	Image compositing is all you need for data augmentation	Ang Jia Ning Shermaine et.al.	2502.13936	null
2025-02-19	TESS 2: A Large-Scale Generalist Diffusion Language Model	Jaesung Tae et.al.	2502.13917	link
2025-02-19	DataSciBench: An LLM Agent Benchmark for Data Science	Dan Zhang et.al.	2502.13897	link
2025-02-19	Performance Comparison of Graph Representations Which Support Dynamic Graph Updates	Subhajit Sahu et.al.	2502.13862	link
2025-02-19	Reverse Markov Learning: Multi-Step Generative Models for Complex Distributions	Xinwei Shen et.al.	2502.13747	null
2025-02-19	Deep Learning for VWAP Execution in Crypto Markets: Beyond the Volume Curve	Remi Genet et.al.	2502.13722	link
2025-02-19	Beyond One-Size-Fits-All: Tailored Benchmarks for Efficient Evaluation	Peiwen Yuan et.al.	2502.13576	link
2025-02-19	ETS: Efficient Tree Search for Inference-Time Scaling	Coleman Hooper et.al.	2502.13575	link
2025-02-19	RestoreGrad: Signal Restoration Using Conditional Denoising Diffusion Models with Jointly Learned Prior	Ching-Hua Lee et.al.	2502.13574	null
2025-02-19	Diffusion Model Agnostic Social Influence Maximization in Hyperbolic Space	Hongliang Qiao et.al.	2502.13571	null
2025-02-19	Extracting Social Connections from Finnish Karelian Refugee Interviews Using LLMs	Joonatan Laato et.al.	2502.13566	null
2025-02-19	Controlling deposition and characterising dynamics of thin liquid films with high temporal and spatial resolution	G Le Lay et.al.	2502.13552	null
2025-02-19	VLAS: Vision-Language-Action Model With Speech Instructions For Customized Robot Manipulation	Wei Zhao et.al.	2502.13508	link
2025-02-19	Towards Lightweight, Adaptive and Attribute-Aware Multi-Aspect Controllable Text Generation with Large Language Models	Chenyu Zhu et.al.	2502.13474	null
2025-02-18	AV-Flow: Transforming Text to Audio-Visual Human-like Interactions	Aggelina Chatziagapi et.al.	2502.13133	null
2025-02-18	Is Noise Conditioning Necessary for Denoising Generative Models?	Qiao Sun et.al.	2502.13129	null
2025-02-18	HARP: A Taxonomy for Heterogeneous and Hierarchical Processors for Mixed-reuse Workloads	Raveesh Garg et.al.	2502.13113	null
2025-02-18	Score Matching Riemannian Diffusion Means	Frederik Möbius Rygaard et.al.	2502.13106	null
2025-02-18	tn4ml: Tensor Network Training and Customization for Machine Learning	Ema Puljak et.al.	2502.13090	link
2025-02-18	A Neural Difference-of-Entropies Estimator for Mutual Information	Haoran Ni et.al.	2502.13085	null
2025-02-18	Personalized Image Generation with Deep Generative Models: A Decade Survey	Yuxiang Wei et.al.	2502.13081	link
2025-02-18	Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs	Longxu Dou et.al.	2502.12982	null
2025-02-18	Towards Variational Flow Matching on General Geometries	Olga Zaghen et.al.	2502.12981	null
2025-02-18	Does Training with Synthetic Data Truly Protect Privacy?	Yunpeng Zhao et.al.	2502.12976	link
2025-02-18	CooLBM: A Collaborative Open-Source Reactive Multi-Phase/Component Simulation Code via Lattice Boltzmann Method	R. Alamian et.al.	2502.12955	null
2025-02-18	Guaranteed Conditional Diffusion: 3D Block-based Models for Scientific Data Compression	Jaemoon Lee et.al.	2502.12951	null
2025-02-18	A Simplified and Numerically Stable Approach to the BG/NBD Churn Prediction model	Dylan Zammit et.al.	2502.12912	null
2025-02-18	Probabilistic neural operators for functional uncertainty quantification	Christopher Bülte et.al.	2502.12902	link
2025-02-18	CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image	Kaixin Yao et.al.	2502.12894	null
2025-02-17	Diffusion Models without Classifier-free Guidance	Zhicong Tang et.al.	2502.12154	link
2025-02-17	Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening	Ye Tian et.al.	2502.12146	link
2025-02-17	Correlative X-ray and electron tomography for scale-bridging, quantitative analysis of complex, hierarchical particle systems	Alexander Götz et.al.	2502.12140	null
2025-02-17	LaM-SLidE: Latent Space Modeling of Spatial Dynamical Systems via Linked Entities	Florian Sestak et.al.	2502.12128	link
2025-02-17	Descriminative-Generative Custom Tokens for Vision-Language Models	Pramuditha Perera et.al.	2502.12095	null
2025-02-17	How compositional generalization and creativity improve as diffusion models are trained	Alessandro Favero et.al.	2502.12089	null
2025-02-17	AdaSplash: Adaptive Sparse Flash Attention	Nuno Gonçalves et.al.	2502.12082	link
2025-02-17	HumanGif: Single-View Human Diffusion with Generative Prior	Shoukang Hu et.al.	2502.12080	link
2025-02-17	A Survey on Bridging EEG Signals and Generative AI: From Image and Text to Beyond	Shreya Shukla et.al.	2502.12048	null
2025-02-17	Unsupervised Structural-Counterfactual Generation under Domain Shift	Krishn Vishwas Kher et.al.	2502.12013	null
2025-02-17	Characterizing Photorealism and Artifacts in Diffusion Model-Generated Images	Negar Kamali et.al.	2502.11989	link
2025-02-17	Design Considerations Based on Stability for a Class of TCP Algorithms	Sreekanth Prabhakar et.al.	2502.11983	null
2025-02-17	Image Inversion: A Survey from GANs to Diffusion and Beyond	Yinan Chen et.al.	2502.11974	link
2025-02-17	Generating Text from Uniform Meaning Representation	Emma Markle et.al.	2502.11973	link
2025-02-17	Massively Scaling Explicit Policy-conditioned Value Functions	Nico Bohlinger et.al.	2502.11949	null
2025-02-14	Region-Adaptive Sampling for Diffusion Transformers	Ziming Liu et.al.	2502.10389	null
2025-02-14	ReStyle3D: Scene-Level Appearance Transfer with Semantic Correspondences	Liyuan Zhu et.al.	2502.10377	null
2025-02-14	AffinityFlow: Guided Flows for Antibody Affinity Maturation	Can Chen et.al.	2502.10365	null
2025-02-14	Dimension-free Score Matching and Time Bootstrapping for Diffusion Models	Syamantak Kumar et.al.	2502.10354	null
2025-02-14	DiOpt: Self-supervised Diffusion for Constrained Optimization	Shutong Ding et.al.	2502.10330	null
2025-02-14	Generalised Parallel Tempering: Flexible Replica Exchange via Flows and Diffusions	Leo Zhang et.al.	2502.10328	null
2025-02-14	Analysis and Prediction of Coverage and Channel Rank for UAV Networks in Rural Scenarios with Foliage	Donggu Lee et.al.	2502.10324	null
2025-02-14	Probabilistic Super-Resolution for High-Fidelity Physical System Simulations with Uncertainty Quantification	Pengyu Zhang et.al.	2502.10280	null
2025-02-14	Dark Matter Attenuation Effects: Sensitivity Ceilings for Spin-Dependent and Spin-Independent Interactions	QUEST-DMC Collaboration et.al.	2502.10251	null
2025-02-14	Shaping Inductive Bias in Diffusion Models through Frequency-Based Noise Control	Thomas Jiralerspong et.al.	2502.10236	null
2025-02-14	Integrated Multi-Simulation Environments for Aerial Robotics Research	Pascal Goldschmid et.al.	2502.10218	link
2025-02-14	VideoDiff: Human-AI Video Co-Creation with Alternatives	Mina Huh et.al.	2502.10190	null
2025-02-14	Agentic End-to-End De Novo Protein Design for Tailored Dynamics Using a Language Diffusion Model	Bo Ni et.al.	2502.10173	null
2025-02-14	Modeling biases in binary decision-making within the generalized nonlinear q-voter model	Maciej Doniec et.al.	2502.10172	link
2025-02-14	Modeling and Simulating Emerging Memory Technologies: A Tutorial	Yun-Chih Chen et.al.	2502.10167	null
2025-02-13	Theoretical Benefit and Limitation of Diffusion Language Model	Guhao Feng et.al.	2502.09622	null
2025-02-13	RigAnything: Template-Free Autoregressive Rigging for Diverse 3D Assets	Isabella Liu et.al.	2502.09615	null
2025-02-13	Designing a Conditional Prior Distribution for Flow-Based Generative Models	Noam Issachar et.al.	2502.09611	null
2025-02-14	Score-of-Mixture Training: Training One-Step Generative Models Made Simple via Score Estimation of Mixture Distributions	Tejas Jayashankar et.al.	2502.09609	null
2025-02-13	Rolling Ahead Diffusion for Traffic Scene Simulation	Yunpeng Liu et.al.	2502.09587	null
2025-02-13	Memorization and Generalization in Generative Diffusion under the Manifold Hypothesis	Beatrice Achilli et.al.	2502.09578	null
2025-02-13	Wireless and passive pressure detection using magneto-mechanical resonances in process engineering	Timo Merbach et.al.	2502.09575	null
2025-02-13	DiffMS: Diffusion Generation of Molecules Conditioned on Mass Spectra	Montgomery Bohde et.al.	2502.09571	link
2025-02-13	Diffusing DeBias: a Recipe for Turning a Bug into a Feature	Massimiliano Ciranni et.al.	2502.09564	null
2025-02-13	Cryogenic SiPMs for the Optical Readout of DarkSide-20k	Giuseppe Matteucci et.al.	2502.09558	null
2025-02-13	Long-Term TalkingFace Generation via Motion-Prior Conditional Diffusion Model	Fei Shen et.al.	2502.09533	null
2025-02-13	SQ-GAN: Semantic Image Communications Using Masked Vector Quantization	Francesco Pezone et.al.	2502.09520	link
2025-02-13	Diffusion Models for Molecules: A Survey of Methods and Tasks	Liang Wang et.al.	2502.09511	link
2025-02-14	EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling	Theodoros Kouzelis et.al.	2502.09509	null
2025-02-13	AttentionSmithy: A Modular Framework for Rapid Transformer Development and Customization	Caleb Cranney et.al.	2502.09503	null
2025-02-12	SwiftSketch: A Diffusion Model for Image-to-Vector Sketch Generation	Ellie Arar et.al.	2502.08642	null
2025-02-12	CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation	Qinghe Wang et.al.	2502.08639	null
2025-02-12	Learning Selection Cuts With Gradients	Mike Hance et.al.	2502.08615	null
2025-02-12	An Initial Condition-Dependent Neural Network Approach for Optimal Control Problems	Mominul Rubel et.al.	2502.08607	null
2025-02-12	Chasing Charge Carriers: Diffusion Dynamics in Mixed-n Quasi-Two-Dimensional Colloidal MAPbBr3 Perovskites	Ronja Maria Piehler et.al.	2502.08601	null
2025-02-12	Enhancing Diffusion Models Efficiency by Disentangling Total-Variance and Signal-to-Noise Ratio	Khaled Kahouli et.al.	2502.08598	link
2025-02-12	Light-A-Video: Training-free Video Relighting via Progressive Light Fusion	Yujie Zhou et.al.	2502.08590	link
2025-02-12	Ultrasound Image Generation using Latent Diffusion Models	Benoit Freiche et.al.	2502.08580	null
2025-02-12	Mapping the Landscape of Generative AI in Network Monitoring and Management	Giampaolo Bovenzi et.al.	2502.08576	null
2025-02-12	Statistically validated projection of bipartite signed networks	Anna Gallo et.al.	2502.08567	null
2025-02-12	Human-Centric Foundation Models: Perception, Generation and Agentic Modeling	Shixiang Tang et.al.	2502.08556	link
2025-02-12	BCDDM: Branch-Corrected Denoising Diffusion Model for Black Hole Image Generation	Ao liu et.al.	2502.08528	null
2025-02-12	FedMHO: Heterogeneous One-Shot Federated Learning Towards Resource-Constrained Edge Devices	Dezhong Yao et.al.	2502.08518	link
2025-02-12	One-Shot Federated Learning with Classifier-Free Diffusion Models	Obaidullah Zaland et.al.	2502.08488	null
2025-02-12	Computed fingertip touch for the instrumental control of musical sound with an excursion on the computed retinal afterimage	Staas de Jong et.al.	2502.08471	null
2025-02-11	Pippo: High-Resolution Multi-View Humans from a Single Image	Yash Kant et.al.	2502.07785	null
2025-02-11	MatSwap: Light-aware material transfers in images	Ivan Lopes et.al.	2502.07784	null
2025-02-11	Stay-Positive: A Case for Ignoring Real Image Features in Fake Image Detection	Anirudh Sundara Rajan et.al.	2502.07778	null
2025-02-11	The Economics of Large Language Models: Token Allocation, Fine-Tuning, and Optimal Pricing	Dirk Bergemann et.al.	2502.07736	null
2025-02-11	Revisiting Non-Acyclic GFlowNets in Discrete Environments	Nikita Morozov et.al.	2502.07735	link
2025-02-11	DOGlove: Dexterous Manipulation with a Low-Cost Open-Source Haptic Force Feedback Glove	Han Zhang et.al.	2502.07730	null
2025-02-11	Near-Optimal Sample Complexity in Reward-Free Kernel-Based Reinforcement Learning	Aya Kayal et.al.	2502.07715	null
2025-02-11	Magic 1-For-1: Generating One Minute Video Clips within One Minute	Hongwei Yi et.al.	2502.07701	link
2025-02-11	Steering Protein Family Design through Profile Bayesian Flow	Jingjing Gong et.al.	2502.07671	null
2025-02-11	Guiding Time-Varying Generative Models with Natural Gradients on Exponential Family Manifold	Song Liu et.al.	2502.07650	null
2025-02-11	Distributional Instrumental Variable Method	Anastasiia Holovchak et.al.	2502.07641	link
2025-02-11	Consistency Training with Physical Constraints	Che-Chia Chang et.al.	2502.07636	null
2025-02-11	Tractable Transformers for Flexible Conditional Generation	Anji Liu et.al.	2502.07616	null
2025-02-11	YOLO Network For Defect Detection In Optical lenses	Habib Yaseen et.al.	2502.07592	null
2025-02-11	Generative Modeling with Bayesian Sample Inference	Marten Lienen et.al.	2502.07580	link
2025-02-10	Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT	Dongyang Liu et.al.	2502.06782	null
2025-02-10	Learning an Optimal Assortment Policy under Observational Data	Yuxuan Han et.al.	2502.06777	null
2025-02-10	Enhancing Performance of Explainable AI Models with Constrained Concept Refinement	Geyu Liang et.al.	2502.06775	null
2025-02-10	Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked Diffusions	Jaeyeon Kim et.al.	2502.06768	null
2025-02-10	History-Guided Video Diffusion	Kiwhan Song et.al.	2502.06764	null
2025-02-10	Señorita-2M: A High-Quality Instruction-based Dataset for General Video Editing by Video Specialists	Bojia Zi et.al.	2502.06734	null
2025-02-10	RSAttAE: An Information-Aware Attention-based Autoencoder Recommender System	Amirhossein Dadashzadeh Taromi et.al.	2502.06705	null
2025-02-10	No Trick, No Treat: Pursuits and Challenges Towards Simulation-free Training of Neural Samplers	Jiajun He et.al.	2502.06685	null
2025-02-10	Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene	Tai-Yu Pan et.al.	2502.06682	null
2025-02-10	Filling a gap in materials mechanics: Nanoindentation at high constant strain rates upto $10^5 s^{-1}$	Lalith Kumar Bhaskar et.al.	2502.06668	null
2025-02-11	Unleashing the Potential of Pre-Trained Diffusion Models for Generalizable Person Re-Identification	Jiachen Li et.al.	2502.06619	link
2025-02-10	MaterialFusion: High-Quality, Zero-Shot, and Controllable Material Transfer with Diffusion Models	Kamil Garifullin et.al.	2502.06606	null
2025-02-10	Joint parameter and state estimation for regularized time-discrete multibody dynamics	Hannes Marklund et.al.	2502.06599	null
2025-02-10	A Large-scale AI-generated Image Inpainting Benchmark	Paschalis Giakoumoglou et.al.	2502.06593	null
2025-02-10	Optimizing Energy Efficiency in Subthreshold RISC-V Cores	Asbjørn Djupdal et.al.	2502.06588	null
2025-02-07	FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation	Shilong Zhang et.al.	2502.05179	link
2025-02-07	Fillerbuster: Multi-View Scene Completion for Casual Captures	Ethan Weber et.al.	2502.05175	null
2025-02-07	Multitwine: Multi-Object Compositing with Text and Layout Control	Gemma Canet Tarrés et.al.	2502.05165	null
2025-02-07	Hummingbird: High Fidelity Image Generation via Multimodal Context Alignment	Minh-Quan Le et.al.	2502.05153	null
2025-02-07	Latent Swap Joint Diffusion for Long-Form Audio Generation	Yusheng Dai et.al.	2502.05130	null
2025-02-07	Beautiful Images, Toxic Words: Understanding and Addressing Offensive Text in Generated Images	Aditya Kumar et.al.	2502.05066	link
2025-02-07	Prospects for detecting generic fast-time features in the neutrino lightcurve of nearby supernovae in neutrino telescopes	Jakob Beise et.al.	2502.05024	null
2025-02-07	Seasonal Station-Keeping of Short Duration High Altitude Balloons using Deep Reinforcement Learning	Tristan K. Schuler et.al.	2502.05014	null
2025-02-07	Robust Graph Learning Against Adversarial Evasion Attacks via Prior-Free Diffusion-Based Structure Purification	Jiayi Luo et.al.	2502.05000	link
2025-02-07	C2GM: Cascading Conditional Generation of Multi-scale Maps from Remote Sensing Images Constrained by Geographic Features	Chenxing Sun et.al.	2502.04991	null
2025-02-07	FF7: A Code Package for High-throughput Calculations and Constructing Materials Database	Tiancheng Ma et.al.	2502.04984	null
2025-02-07	Generative-enhanced optimization for knapsack problems: an industry-relevant study	Yelyzaveta Vodovozova et.al.	2502.04928	null
2025-02-07	ARTInp: CBCT-to-CT Image Inpainting and Image Translation in Radiotherapy	Ricardo Coimbra Brioso et.al.	2502.04898	null
2025-02-07	Goku: Flow Based Video Generative Foundation Models	Shoufa Chen et.al.	2502.04896	null
2025-02-07	Training-free Task-oriented Grasp Generation	Jiaming Wang et.al.	2502.04873	null
2025-02-06	Can Grammarly and ChatGPT accelerate language change? AI-powered technologies and their impact on the English language: wordiness vs. conciseness	Karolina Rudnicka et.al.	2502.04324	null
2025-02-06	HOG-Diff: Higher-Order Guided Diffusion for Graph Generation	Yiming Huang et.al.	2502.04308	link
2025-02-06	MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation	Jinbo Xing et.al.	2502.04299	null
2025-02-06	Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression	Lirui Wang et.al.	2502.04296	null
2025-02-06	Breaking the Vault: A Case Study of the 2022 LastPass Data Breach	Jessica Gentles et.al.	2502.04287	null
2025-02-06	Non-Variational Quantum Random Access Optimization with Alternating Operator Ansatz	Zichang He et.al.	2502.04277	null
2025-02-06	Digital Gatekeeping: An Audit of Search Engine Results shows tailoring of queries on the Israel-Palestine Conflict	Íris Damião et.al.	2502.04266	null
2025-02-06	Realistic Image-to-Image Machine Unlearning via Decoupling and Knowledge Retention	Ayush K. Varshney et.al.	2502.04260	null
2025-02-06	TriNER: A Series of Named Entity Recognition Models For Hindi, Bengali & Marathi	Mohammed Amaan Dhamaskar et.al.	2502.04245	null
2025-02-06	NLP-Based .NET CLR Event Logs Analyzer	Maxim Stavtsev et.al.	2502.04219	link
2025-02-06	MRAMG-Bench: A BeyondText Benchmark for Multimodal Retrieval-Augmented Multimodal Generation	Qinhan Yu et.al.	2502.04176	link
2025-02-06	Diffusion-based mass map reconstruction from weak lensing data	Supranta S. Boruah et.al.	2502.04158	null
2025-02-06	Synthetic Datasets for Machine Learning on Spatio-Temporal Graphs using PDEs	Jost Arndt et.al.	2502.04140	link
2025-02-06	Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis	Zhen Ye et.al.	2502.04128	link
2025-02-06	Generative Adversarial Networks Bridging Art and Machine Intelligence	Junhao Song et.al.	2502.04116	null
2025-02-05	Dress-1-to-3: Single Image to Simulation-Ready 3D Outfit with Diffusion Prior and Differentiable Physics	Xuan Li et.al.	2502.03449	null
2025-02-05	Masked Autoencoders Are Effective Tokenizers for Diffusion Models	Hao Chen et.al.	2502.03444	null
2025-02-05	Taking a Big Step: Large Learning Rates in Denoising Score Matching Prevent Memorization	Yu-Han Wu et.al.	2502.03435	null
2025-02-05	A Temporal Convolutional Network-Based Approach and a Benchmark Dataset for Colonoscopy Video Temporal Segmentation	Carlo Biffi et.al.	2502.03430	null
2025-02-05	TruePose: Human-Parsing-guided Attention Diffusion for Full-ID Preserving Pose Transfer	Zhihong Xu et.al.	2502.03426	null
2025-02-05	Can Text-to-Image Generative Models Accurately Depict Age? A Comparative Study on Synthetic Portrait Generation and Age Estimation	Alexey A. Novikov et.al.	2502.03420	null
2025-02-05	A Mixture-Based Framework for Guiding Diffusion Models	Yazid Janati et.al.	2502.03332	link
2025-02-05	An efficient end-to-end computational framework for the generation of ECG calibrated volumetric models of human atrial electrophysiology	Elena Zappon et.al.	2502.03322	null
2025-02-05	Simplifying Formal Proof-Generating Models with ChatGPT and Basic Searching Techniques	Sangjun Han et.al.	2502.03321	null
2025-02-05	Electronic properties and transport in metal/2D material/metal vertical junctions	Gaëlle Bigeard et.al.	2502.03318	null
2025-02-05	Posterior SBC: Simulation-Based Calibration Checking Conditional on Data	Teemu Säilynoja et.al.	2502.03279	link
2025-02-05	General Time-series Model for Universal Knowledge Representation of Multivariate Time-Series data	Cheng He et.al.	2502.03264	null
2025-02-05	Practical Introduction to FEM with GMSH: A MATLAB/Octave Perspective	Victor Dominguez et.al.	2502.03248	null
2025-02-05	MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent	Xinyao Liao et.al.	2502.03207	null
2025-02-05	Low-cost analog signal chain for transmit-receive circuits of passive induction-based resonators	Fabian Mohn et.al.	2502.03202	null
2025-02-04	COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation	Xueqing Deng et.al.	2502.02589	null
2025-02-04	Calibrated Multi-Preference Optimization for Aligning Diffusion Models	Kyungmin Lee et.al.	2502.02588	null
2025-02-04	Open Materials Generation with Stochastic Interpolants	Philipp Hoellmer et.al.	2502.02582	null
2025-02-04	A Family-Based Approach to Safety Cases for Controlled Airspaces in Small Uncrewed Aerial Systems	Michael C. Hunter et.al.	2502.02559	null
2025-02-04	Diff9D: Diffusion-Based Domain-Generalized Category-Level 9-DoF Object Pose Estimation	Jian Liu et.al.	2502.02525	link
2025-02-04	Privacy Attacks on Image AutoRegressive Models	Antoni Kowalczuk et.al.	2502.02514	link
2025-02-04	Generative Modeling on Lie Groups via Euclidean Generalized Score Matching	Marco Bertolini et.al.	2502.02513	null
2025-02-04	Learning to generate physical ocean states: Towards hybrid climate modeling	Etienne Meunier et.al.	2502.02499	null
2025-02-04	Do Graph Diffusion Models Accurately Capture and Generate Substructure Distributions?	Xiyuan Wang et.al.	2502.02488	null
2025-02-04	Distributional Diffusion Models with Scoring Rules	Valentin De Bortoli et.al.	2502.02483	null
2025-02-04	Style transfer as data augmentation: evaluating unpaired image-to-image translation models in mammography	Emir Ahmed et.al.	2502.02475	null
2025-02-04	Towards Consistent and Controllable Image Synthesis for Face Editing	Mengting Wei et.al.	2502.02465	null
2025-02-04	Personalization Toolkit: Training Free Personalization of Large Vision Language Models	Soroush Seifi et.al.	2502.02452	null
2025-02-04	Sparse Data Generation Using Diffusion Models	Phil Ostheimer et.al.	2502.02448	null
2025-02-04	TransformDAS: Mapping Φ-OTDR Signals to Riemannian Manifold for Robust Classification	Jiaju Kang et.al.	2502.02428	null
2025-01-31	LiDAR Loop Closure Detection using Semantic Graphs with Graph Attention Networks	Liudi Yang et.al.	2501.19382	link
2025-01-31	Creative Problem-Solving: A Study with Blind and Low Vision Software Professionals	Karina Kohl et.al.	2501.19380	null
2025-01-31	Beyond Fixed Horizons: A Theoretical Framework for Adaptive Denoising Diffusions	Sören Christensen et.al.	2501.19373	null
2025-01-31	Addressing the correlation of Stokes-shifted photons emitted from two quantum emitters	Adrián Juan-Delgado et.al.	2501.19356	null
2025-01-31	Do Large Multimodal Models Solve Caption Generation for Scientific Figures? Lessons Learned from SCICAP Challenge 2023	Ting-Yao E. Hsu et.al.	2501.19353	null
2025-01-31	Low-cost Microfluidic Testbed for Molecular Communications with Integrated Hydrodynamic Gating and Screen-printed Sensors	Maide Miray Albay et.al.	2501.19341	null
2025-01-31	Pathological MRI Segmentation by Synthetic Pathological Data Generation in Fetuses and Neonates	Misha P. T Kaandorp et.al.	2501.19338	null
2025-01-31	Analysis of LLMs vs Human Experts in Requirements Engineering	Cory Hymel et.al.	2501.19297	null
2025-01-31	Medical Semantic Segmentation with Diffusion Pretrain	David Li et.al.	2501.19265	null
2025-01-31	Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search	Yuta Oshima et.al.	2501.19252	null
2025-01-31	Single cell resolution 3D imaging and segmentation within intact live tissues	G. Paci et.al.	2501.19203	link
2025-01-31	A Variational Perspective on Generative Protein Fitness Optimization	Lea Bogensperger et.al.	2501.19200	null
2025-01-31	PSyDUCK: Training-Free Steganography for Latent Diffusion	Georgia Channing et.al.	2501.19172	null
2025-01-31	RMDM: Radio Map Diffusion Model with Physics Informed	Haozhe Jia et.al.	2501.19160	link
2025-01-31	A theoretical framework for overfitting in energy-based modeling	Giovanni Catania et.al.	2501.19158	null
2025-01-30	Diffusion Autoencoders are Scalable Image Tokenizers	Yinbo Chen et.al.	2501.18593	null
2025-01-30	DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models	Ruofan Liang et.al.	2501.18590	null
2025-01-30	WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training	Benjamin Feuer et.al.	2501.18511	link
2025-01-30	Examining the Expanding Role of Synthetic Data Throughout the AI Development Pipeline	Shivani Kapania et.al.	2501.18493	null
2025-01-30	CodeBrain: Impute Any Brain MRI via Instance-specific Scalar-quantized Codes	Yicheng Wu et.al.	2501.18328	null
2025-01-30	How to Select Datapoints for Efficient Human Evaluation of NLG Models?	Vilém Zouhar et.al.	2501.18251	link
2025-01-30	Free-T2M: Frequency Enhanced Text-to-Motion Diffusion Model With Consistency Loss	Wenshuo Chen et.al.	2501.18232	link
2025-01-30	Inverse source problem of sub-diffusion of variable exponent	Zhiyuan Li et.al.	2501.18228	null
2025-01-30	Behavior Modeling Space Reconstruction for E-Commerce Search	Yejing Wang et.al.	2501.18216	null
2025-01-30	Joint Design and Pricing of Extended Warranties for Multiple Automobiles with Different Price Bands	Yajing Chen et.al.	2501.18203	null
2025-01-30	Advancing Personalized Federated Learning: Integrative Approaches with AI for Enhanced Privacy and Customization	Kevin Cooper et.al.	2501.18174	null
2025-01-31	RepoAudit: An Autonomous LLM-Agent for Repository-Level Code Auditing	Jinyao Guo et.al.	2501.18160	link
2025-01-30	The Dilemma of Building Do-It-Yourself (DIY) Solutions for Workplace Accessibility	Yoonha Cha et.al.	2501.18148	null
2025-01-30	HyperZero: A Customized End-to-End Auto-Tuning System for Recommendation with Hourly Feedback	Xufeng Cai et.al.	2501.18126	null
2025-01-29	SAeUron: Interpretable Concept Unlearning in Diffusion Models with Sparse Autoencoders	Bartosz Cywiński et.al.	2501.18052	link
2025-01-29	Enriched Immersed Finite Element and Isogeometric Analysis – Algorithms and Data Structures	Nils Wunsch et.al.	2501.17853	null
2025-01-29	acoupi: An Open-Source Python Framework for Deploying Bioacoustic AI Models on Edge Devices	Aude Vuilliomenet et.al.	2501.17841	link
2025-01-29	Atomic Transfer Graphs: Secure-by-design Protocols for Heterogeneous Blockchain Ecosystems	Stephan Dübler et.al.	2501.17786	null
2025-01-29	Generative Unordered Flow for Set-Structured Data Generation	Yangming Li et.al.	2501.17770	null
2025-01-29	Formally Verified Binary-level Pointer Analysis	Freek Verbeek et.al.	2501.17766	null
2025-01-29	In-IDE Programming Courses: Learning Software Development in a Real-World Setting	Anastasiia Birillo et.al.	2501.17747	null
2025-01-29	Testing Research Software: An In-Depth Survey of Practices, Methods, and Tools	Nasir U. Eisty et.al.	2501.17739	null
2025-01-29	A technical review of multi-omics data integration methods: from classical statistical to deep generative approaches	Ana R. Baião et.al.	2501.17729	null
2025-01-29	VICCA: Visual Interpretation and Comprehension of Chest X-ray Anomalies in Generated Report Without Human Feedback	Sayeh Gholipour Picha et.al.	2501.17726	link
2025-01-29	Source-Channel Separation Theorems for Distortion Perception Coding	Chao Tian et.al.	2501.17706	null
2025-01-29	Distinguished Quantized Guidance for Diffusion-based Sequence Recommendation	Wenyu Mao et.al.	2501.17670	null
2025-01-29	In-Context Meta LoRA Generation	Yihua Shao et.al.	2501.17635	null
2025-01-29	Semantic Consistency Regularization with Large Language Models for Semi-supervised Sentiment Analysis	Kunrong Li et.al.	2501.17598	null
2025-01-29	Music2Latent2: Audio Compression with Summary Embeddings and Autoregressive Decoding	Marco Pasini et.al.	2501.17578	null
2025-01-29	Exploring the Potential of Wireless-enabled Multi-Chip AI Accelerators	Emmanuel Irabor et.al.	2501.17567	null
2025-01-28	CubeDiff: Repurposing Diffusion-Based Image Models for Panorama Generation	Nikolai Kalischek et.al.	2501.17162	null
2025-01-28	IC-Portrait: In-Context Matching for View-Consistent Personalized Portrait	Han Yang et.al.	2501.17159	null
2025-01-28	First Axion-Like Particle Results from a Broadband Search for Wave-Like Dark Matter in the 44 to 52 $μ$ eV Range with a Coaxial Dish Antenna	Gabe Hoshino et.al.	2501.17119	null
2025-01-28	Goodness of Fit for Bayesian Generative Models with Applications in Population Genetics	Guillaume Le Mailloux et.al.	2501.17107	link
2025-01-28	DataLens: ML-Oriented Interactive Tabular Data Quality Dashboard	Mohamed Abdelaal et.al.	2501.17074	null
2025-01-28	Generative diffusion models from a PDE perspective	Fei Cao et.al.	2501.17054	null
2025-01-28	MIDI-GPT: A Controllable Generative Model for Computer-Assisted Multitrack Music Composition	Philippe Pasquier et.al.	2501.17011	null
2025-01-28	Generative quantum combinatorial optimization by means of a novel conditional generative quantum eigensolver	Shunya Minami et.al.	2501.16986	null
2025-01-28	A totally non-compensatory multi-criteria method for evaluating and improving level of satisfaction (LoS): proposal and application on Airport Terminal of Passengers	Phelipe Medeiros da Rocha et.al.	2501.16979	null
2025-01-28	Adversarial Masked Autoencoder Purifier with Defense Transferability	Yuan-Chih Chen et.al.	2501.16904	null
2025-01-28	Extending Information Bottleneck Attribution to Video Sequences	Veronika Solopova et.al.	2501.16889	link
2025-01-28	DIRIGENt: End-To-End Robotic Imitation of Human Demonstrations Based on a Diffusion Model	Josua Spisak et.al.	2501.16800	null
2025-01-28	Algorithm for Automatic Legislative Text Consolidation	Matias Etcheverry et.al.	2501.16794	null
2025-01-28	Exponential Family Attention	Kevin Christian Wibisono et.al.	2501.16790	link
2025-01-28	FlexMotion: Lightweight, Physics-Aware, and Controllable Human Motion Generation	Arvin Tashakori et.al.	2501.16778	null
2025-01-27	RelightVid: Temporal-Consistent Diffusion Model for Video Relighting	Ye Fang et.al.	2501.16330	null
2025-01-27	Movement- and Traffic-based User Identification in Commercial Virtual Reality Applications: Threats and Opportunities	Sara Baldoni et.al.	2501.16326	link
2025-01-27	Evaluating The Performance of Using Large Language Models to Automate Summarization of CT Simulation Orders in Radiation Oncology	Meiyun Cao et.al.	2501.16309	null
2025-01-27	RAPID: Retrieval-Augmented Parallel Inference Drafting for Text-Based Video Event Retrieval	Long Nguyen et.al.	2501.16303	null
2025-01-27	Congested Crossing Pedestrian Traffic Flow : Dispersion vs. Transport in Crowded Areas	Mariam Al Khatib et.al.	2501.16275	null
2025-01-27	Improving DBMS Scheduling Decisions with Fine-grained Performance Prediction on Concurrent Queries – Extended	Ziniu Wu et.al.	2501.16256	null
2025-01-27	A foundation model for human-AI collaboration in medical literature mining	Zifeng Wang et.al.	2501.16255	null
2025-01-27	UDBE: Unsupervised Diffusion-based Brightness Enhancement in Underwater Images	Tatiana Taís Schein et.al.	2501.16211	link
2025-01-27	HERITRACE: A User-Friendly Semantic Data Editor with Change Tracking and Provenance Management for Cultural Heritage Institutions	Arcangelo Massari et.al.	2501.16197	null
2025-01-27	Multi-front dynamics in spatially inhomogeneous Allen-Cahn equations	Robbin Bastiaansen et.al.	2501.16195	null
2025-01-27	BAG: Body-Aligned 3D Wearable Asset Generation	Zhongjin Luo et.al.	2501.16177	null
2025-01-27	Efficient Portrait Matte Creation With Layer Diffusion and Connectivity Priors	Zhiyuan Lu et.al.	2501.16147	null
2025-01-27	Disruption-aware Microservice Re-orchestration for Cost-efficient Multi-cloud Deployments	Marco Zambianco et.al.	2501.16143	null
2025-01-27	Using Generative Models to Produce Realistic Populations of UK Windstorms	Yee Chun Tsoi et.al.	2501.16110	null
2025-01-27	ARFlow: Autogressive Flow with Hybrid Linear Attention	Mude Hui et.al.	2501.16085	null
2025-01-24	An Attentive Graph Agent for Topology-Adaptive Cyber Defence	Ilya Orson Sandoval et.al.	2501.14700	link
2025-01-24	Diffusion based Text-to-Music Generationwith Global and Local Text based Conditioning	Jisi Zhang et.al.	2501.14680	null
2025-01-24	End-to-end workflow for machine learning-based qubit readout with QICK and hls4ml	Giuseppe Di Guglielmo et.al.	2501.14663	null
2025-01-24	Towards Scalable Topological Regularizers	Hiu-Tung Wong et.al.	2501.14641	null
2025-01-24	Single-neuron deep generative model uncovers underlying physics of neuronal activity in Ca imaging data	Jordi Abante et.al.	2501.14615	null
2025-01-24	Visual Localization via Semantic Structures in Autonomous Photovoltaic Power Plant Inspection	Viktor Kozák et.al.	2501.14587	null
2025-01-24	Training-Free Style and Content Transfer by Leveraging U-Net Skip Connections in Stable Diffusion 2.*	Ludovica Schaerf et.al.	2501.14524	null
2025-01-24	Pesti-Gen: Unleashing a Generative Molecule Approach for Toxicity Aware Pesticide Design	Taehan Kim et.al.	2501.14469	null
2025-01-24	CENTS: Generating synthetic electricity consumption time series for rare and unseen scenarios	Michael Fuest et.al.	2501.14426	null
2025-01-24	DeepFlow: Serverless Large Language Model Serving at Scale	Junhao Hu et.al.	2501.14417	null
2025-01-24	Uncovering the bias in the evidence for dynamical dark energy through minimal and generalized modeling approaches	Ziad Sakr et.al.	2501.14366	null
2025-01-24	Advancing data-driven broadband seismic wavefield simulation with multi-conditional diffusion model	Zhengfa Bi et.al.	2501.14348	null
2025-01-24	HorNets: Learning from Discrete and Continuous Signals with Routing Neural Networks	Boshko koloski et.al.	2501.14346	link
2025-01-24	Stochastic Method for Delayed Neutron Precursors Transport in Liquid Fuel	Mathis Caprais et.al.	2501.14332	null
2025-01-24	PAID: A Framework of Product-Centric Advertising Image Design	Hongyu Chen et.al.	2501.14316	null
2025-01-23	IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models	Jiayi Lei et.al.	2501.13920	null
2025-01-23	Improving Video Generation with Human Feedback	Jie Liu et.al.	2501.13918	null
2025-01-23	Binary Diffusion Probabilistic Model	Vitaliy Kinakh et.al.	2501.13915	null
2025-01-23	Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models	Linh Tran et.al.	2501.13904	null
2025-01-23	A RAG-Based Institutional Assistant	Gustavo Kuratomi et.al.	2501.13880	null
2025-01-23	Unveiling the Power of Noise Priors: Enhancing Diffusion Models for Mobile Traffic Prediction	Zhi Sheng et.al.	2501.13794	null
2025-01-23	An Efficient Diffusion-based Non-Autoregressive Solver for Traveling Salesman Problem	Mingzhao Wang et.al.	2501.13767	link
2025-01-23	A Mutual Information Perspective on Multiple Latent Variable Generative Models for Positive View Generation	Dario Serez et.al.	2501.13718	null
2025-01-23	YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-ID	Iñaki Erregue et.al.	2501.13710	link
2025-01-23	Training-Free Consistency Pipeline for Fashion Repose	Potito Aghilar et.al.	2501.13692	null
2025-01-23	A Transformer-based Autoregressive Decoder Architecture for Hierarchical Text Classification	Younes Yousef et.al.	2501.13598	link
2025-01-23	Funnelling super-resolution STED microscopy through multimode fibres	André Gomes et.al.	2501.13572	null
2025-01-24	One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt	Tao Liu et.al.	2501.13554	link
2025-01-23	Diffusion-based Perceptual Neural Video Compression with Temporal Diffusion Information Reuse	Wenzhuo Ma et.al.	2501.13528	null
2025-01-23	LDR-Net: A Novel Framework for AI-generated Image Detection via Localized Discrepancy Representation	JiaXin Chen et.al.	2501.13475	null
2025-01-22	Accelerate High-Quality Diffusion Models with Inner Loop Feedback	Matthew Gwilliam et.al.	2501.13107	null
2025-01-22	Robust Representation Consistency Model via Contrastive Denoising	Jiachen Lei et.al.	2501.13094	link
2025-01-22	Innovative Web Tool for Remote Data Acquisition and Analysis: Customized for SKA Low frequency Beamforming Test Bed LPDA Array at Gauribidanur Radio Observatory	Anumanchi Agastya Sai Ram Likhit et.al.	2501.13090	null
2025-01-22	Orchid: Image Latent Diffusion for Joint Appearance and Geometry Generation	Akshay Krishnan et.al.	2501.13087	null
2025-01-22	Robust Body Composition Analysis by Generating 3D CT Volumes from Limited 2D Slices	Lianrui Zuo et.al.	2501.13071	null
2025-01-22	Beyond the Lungs: Extending the Field of View in Chest CT with Latent Diffusion Models	Lianrui Zuo et.al.	2501.13068	null
2025-01-22	Neural network enhanced cross entropy benchmark for monitored circuits	Yangrui Hu et.al.	2501.13005	null
2025-01-22	Low-dimensional adaptation of diffusion models: Convergence in total variation	Jiadong Liang et.al.	2501.12982	null
2025-01-22	Accessible Smart Contracts Verification: Synthesizing Formal Models with Tamed LLMs	Jan Corazza et.al.	2501.12972	link
2025-01-22	Observation of Strong Nonreciprocal Thermal Emission	Zhenong Zhang et.al.	2501.12947	null
2025-01-22	3D Object Manipulation in a Single Image using Generative Models	Ruisi Zhao et.al.	2501.12935	null
2025-01-22	Reinforcement learning Based Automated Design of Differential Evolution Algorithm for Black-box Optimization	Xu Yang et.al.	2501.12881	null
2025-01-22	CrossDiff: Diffusion Probabilistic Model With Cross-conditional Encoder-Decoder for Crack Segmentation	Xianglong Shi et.al.	2501.12860	null
2025-01-22	AMM-Diff: Adaptive Multi-Modality Diffusion Network for Missing Modality Imputation	Aghiles Kebaili et.al.	2501.12840	null
2025-01-22	Inverse Design of Chiral Structures for Giant Helical Dichroism	Chia-Chun Pan et.al.	2501.12825	null
2025-01-21	Towards Affordance-Aware Articulation Synthesis for Rigged Objects	Yu-Chu Yu et.al.	2501.12393	null
2025-01-22	GPS as a Control Signal for Image Generation	Chao Feng et.al.	2501.12390	null
2025-01-21	Audio Texture Manipulation by Exemplar-Based Analogy	Kan Jen Cheng et.al.	2501.12385	null
2025-01-21	Accelerating Pulsar Parameter Estimation Using Convolutional Neural Networks	Greg Olmschenk et.al.	2501.12383	null
2025-01-21	DiffDoctor: Diagnosing Image Diffusion Models Before Treating	Yiyang Wang et.al.	2501.12382	null
2025-01-22	Video Depth Anything: Consistent Depth Estimation for Super-Long Videos	Sili Chen et.al.	2501.12375	null
2025-01-21	FuocChuVIP123 at CoMeDi Shared Task: Disagreement Ranking with XLM-Roberta Sentence Embeddings and Deep Neural Regression	Phuoc Duong Huy Chu et.al.	2501.12336	null
2025-01-21	VipDiff: Towards Coherent and Diverse Video Inpainting via Training-free Denoising Diffusion Models	Chaohao Xie et.al.	2501.12267	null
2025-01-21	Joint Reconstruction and Motion Estimation in Sparse-View 4DCT Using Diffusion Models within a Blind Inverse Problem Framework	Antoine De Paepe et.al.	2501.12249	null
2025-01-21	InsTALL: Context-aware Instructional Task Assistance with Multi-modal Large Language Models	Pha Nguyen et.al.	2501.12231	null
2025-01-21	TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space	Daniel Garibi et.al.	2501.12224	null
2025-01-21	Early Detection and Classification of Breast Cancer Using Deep Learning Techniques	Mst. Mumtahina Labonno et.al.	2501.12217	null
2025-01-22	Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation	Zibo Zhao et.al.	2501.12202	link
2025-01-21	An End-to-End Approach for Korean Wakeword Systems with Speaker Authentication	Geonwoo Seo et.al.	2501.12194	link
2025-01-21	ComposeAnyone: Controllable Layout-to-Human Generation with Decoupled Multimodal Conditions	Shiyue Zhang et.al.	2501.12173	link
2025-01-17	Zero-Shot Monocular Scene Flow Estimation in the Wild	Yiqing Liang et.al.	2501.10357	null
2025-01-17	Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems	Weibo Gao et.al.	2501.10332	link
2025-01-17	DiffStereo: High-Frequency Aware Diffusion Model for Stereo Image Restoration	Huiyun Cao et.al.	2501.10325	null
2025-01-17	SEANN: A Domain-Informed Neural Network for Epidemiological Insights	Jean-Baptiste Guimbaud et.al.	2501.10273	null
2025-01-17	Drift time calibration of the ultra-Low material budget GEM-based TPC for MIXE	X. Zhao et.al.	2501.10249	null
2025-01-17	Over-the-Air Multi-Sensor Inference with Neural Networks Using Memristor-Based Analog Computing	Busra Tegin et.al.	2501.10245	null
2025-01-17	Modelling Activity Scheduling Behaviour with Deep Generative Machine Learning	Fred Shone et.al.	2501.10221	null
2025-01-17	Adaptive Clustering for Efficient Phenotype Segmentation of UAV Hyperspectral Data	Ciem Cornelissen et.al.	2501.10199	null
2025-01-17	Optimizing Structured-Sparse Matrix Multiplication in RISC-V Vector Processors	Vasileios Titopoulos et.al.	2501.10189	null
2025-01-17	Convex Physics Informed Neural Networks for the Monge-Ampère Optimal Transport Problem	Alexandre Caboussat et.al.	2501.10162	null
2025-01-17	AI-Generated Music Detection and its Challenges	Darius Afchar et.al.	2501.10111	link
2025-01-17	DiffVSR: Enhancing Real-World Video Super-Resolution with Diffusion Models for Advanced Visual Quality and Temporal Consistency	Xiaohui Li et.al.	2501.10110	null
2025-01-17	landmarker: a Toolkit for Anatomical Landmark Localization in 2D/3D Images	Jef Jonkers et.al.	2501.10098	link
2025-01-17	Conditional Latent Diffusion-Based Speech Enhancement Via Dual Context Learning	Shengkui Zhao et.al.	2501.10052	link
2025-01-17	DiffuEraser: A Diffusion Model for Video Inpainting	Xiaowen Li et.al.	2501.10018	link
2025-01-16	SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces	Sumit Chaturvedi et.al.	2501.09756	null
2025-01-16	Learnings from Scaling Visual Tokenizers for Reconstruction and Generation	Philippe Hansen-Estruch et.al.	2501.09755	null
2025-01-16	KU AIGEN ICL EDI@BC8 Track 3: Advancing Phenotype Named Entity Recognition and Normalization for Dysmorphology Physical Examination Reports	Hajung Kim et.al.	2501.09744	null
2025-01-16	Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps	Nanye Ma et.al.	2501.09732	null
2025-01-16	Comparative Insights from 12 Machine Learning Models in Extracting Economic Ideology from Political Text	Jihed Ncib et.al.	2501.09719	null
2025-01-16	Reward-Guided Controlled Generation for Inference-Time Alignment in Diffusion Models: Tutorial and Review	Masatoshi Uehara et.al.	2501.09685	null
2025-01-16	A Survey of Research in Large Language Models for Electronic Design Automation	Jingyu Pan et.al.	2501.09655	null
2025-01-16	Fabrication of Mode-Matched, Low-Loss Optical Resonators by Combination of FIB-Milling and CO $_2$ Laser Ablation	Patrick Maier et.al.	2501.09577	null
2025-01-16	AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation	Junjie He et.al.	2501.09503	link
2025-01-16	Pruning for Sparse Diffusion Models based on Gradient Flow	Ben Wan et.al.	2501.09464	null
2025-01-16	“A Great Start, But…”: Evaluating LLM-Generated Mind Maps for Information Mapping in Video-Based Design	Tianhao He et.al.	2501.09457	null
2025-01-16	CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation	Hwan Heo et.al.	2501.09433	link
2025-01-16	Towards a Framework for Enterprise Architecture in Mobile Government: A Case Study	Son Pham et.al.	2501.09401	null
2025-01-16	Contract-Inspired Contest Theory for Controllable Image Generation in Mobile Edge Metaverse	Guangyuan Liu et.al.	2501.09391	null
2025-01-16	Identification of Traditional Medicinal Plant Leaves Using an effective Deep Learning model and Self-Curated Dataset	Deepjyoti Chetia et.al.	2501.09363	null
2025-01-15	How Do Generative Models Draw a Software Engineer? A Case Study on Stable Diffusion Bias	Tosin Fadahunsi et.al.	2501.09014	link
2025-01-15	SimGen: A Diffusion-Based Framework for Simultaneous Surgical Image and Segmentation Mask Generation	Aditya Bhat et.al.	2501.09008	null
2025-01-15	CrystalGRW: Generative Modeling of Crystal Structures with Targeted Properties via Geodesic Random Walks	Krit Tangsongcharoen et.al.	2501.08998	link
2025-01-15	VECT-GAN: A variationally encoded generative model for overcoming data scarcity in pharmaceutical science	Youssef Abdalla et.al.	2501.08995	link
2025-01-15	RepVideo: Rethinking Cross-Layer Representation for Video Generation	Chenyang Si et.al.	2501.08994	null
2025-01-15	CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities	Haozhe Xie et.al.	2501.08983	link
2025-01-15	Learning to Extract Cross-Domain Aspects and Understanding Sentiments Using Large Language Models	Karukriti Kaushik Ghosh et.al.	2501.08974	null
2025-01-15	Karatsuba Matrix Multiplication and its Efficient Custom Hardware Implementations	Trevor E. Pogue et.al.	2501.08889	link
2025-01-15	Connecting SPDE to SGMs	Junsu Seo et.al.	2501.08877	null
2025-01-16	Silent Abandonment in Text-Based Contact Centers: Identifying, Quantifying, and Mitigating its Operational Impacts	Antonio Castellanos et.al.	2501.08869	null
2025-01-15	Boosting Diffusion Guidance via Learning Degradation-Aware Models for Blind Super Resolution	Shao-Hao Lu et.al.	2501.08819	link
2025-01-15	Securities Transaction Settlement Optimization on superconducting quantum devices	Francesco Martini et.al.	2501.08794	null
2025-01-15	Near-Field ISAC: Synergy of Dual-Purpose Codebooks and Space-Time Adaptive Processing	Ahmed Hussain et.al.	2501.08776	null
2025-01-15	Adaptive Approximation Schemes for Matching Queues	Alireza AmaniHamedani et.al.	2501.08775	null
2025-01-15	An Ultra-Wideband Dual Polarization Antenna Array for the Detection and Localization of Bright Fast Radio Transients in the Milky Way	Diego Gallardo et.al.	2501.08764	null
2025-01-14	DAViD: Modeling Dynamic Affordance of 3D Objects using Pre-trained Video Diffusion Models	Hyeonwoo Kim et.al.	2501.08333	null
2025-01-14	MangaNinja: Line Art Colorization with Precise Reference Following	Zhiheng Liu et.al.	2501.08332	null
2025-01-14	Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise	Ryan Burgert et.al.	2501.08331	link
2025-01-14	GameFactory: Creating New Games with Generative Interactive Videos	Jiwen Yu et.al.	2501.08325	null
2025-01-14	Diffusion Adversarial Post-Training for One-Step Video Generation	Shanchuan Lin et.al.	2501.08316	null
2025-01-14	LayerAnimate: Layer-specific Control for Animation	Yuxue Yang et.al.	2501.08295	null
2025-01-14	HALoGEN: Fantastic LLM Hallucinations and Where to Find Them	Abhilasha Ravichander et.al.	2501.08292	null
2025-01-14	FDPP: Fine-tune Diffusion Policy with Human Preference	Yuxin Chen et.al.	2501.08259	null
2025-01-14	Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints	Jonathan Nöther et.al.	2501.08246	null
2025-01-14	Engineering LLM Powered Multi-agent Framework for Autonomous CloudOps	Kannan Parthasarathy et.al.	2501.08243	null
2025-01-14	CodecFake-Omni: A Large-Scale Codec-based Deepfake Speech Dataset	Jiawei Du et.al.	2501.08238	null
2025-01-14	FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors	Yabo Zhang et.al.	2501.08225	link
2025-01-14	D $^2$ -DPM: Dual Denoising for Quantized Diffusion Probabilistic Models	Qian Zeng et.al.	2501.08180	link
2025-01-14	DM-Mamba: Dual-domain Multi-scale Mamba for MRI reconstruction	Yucong Meng et.al.	2501.08163	link
2025-01-14	Multiple-Input Variational Auto-Encoder for Anomaly Detection in Heterogeneous Data	Phai Vu Dinh et.al.	2501.08149	null
2025-01-13	Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss	Xinyu Zhang et.al.	2501.07563	null
2025-01-13	Confident Pseudo-labeled Diffusion Augmentation for Canine Cardiomegaly Detection	Shiman Zhang et.al.	2501.07533	link
2025-01-13	IP-FaceDiff: Identity-Preserving Facial Video Editing with Diffusion	Tharun Anand et.al.	2501.07530	null
2025-01-13	LitmusKt: Concurrency Stress Testing for Kotlin	Denis Lochmelis et.al.	2501.07472	link
2025-01-13	PrecipDiff: Leveraging image diffusion models to enhance satellite-based precipitation observations	Ting-Yu Dai et.al.	2501.07447	null
2025-01-13	Diff-Ensembler: Learning to Ensemble 2D Diffusion Models for Volume-to-Volume Medical Image Translation	Xiyue Zhu et.al.	2501.07430	null
2025-01-13	OCORD: Open-Campus Object Removal Dataset	Shuo Zhang et.al.	2501.07397	null
2025-01-13	Bigger Isn’t Always Better: Towards a General Prior for Medical Image Reconstruction	Lukas Glaszner et.al.	2501.07376	link
2025-01-13	Simulating the Hubbard Model with Equivariant Normalizing Flows	Dominic Schuh et.al.	2501.07371	null
2025-01-13	Multimodal semantic retrieval for product search	Dong Liu et.al.	2501.07365	null
2025-01-13	Predicting System Dynamics of Universal Growth Patterns in Complex Systems	Leila Hedayatifar et.al.	2501.07349	null
2025-01-13	The Spectrum of C/2023 A3 Indicates A Depleted Composition	Yunyi Tang et.al.	2501.07340	null
2025-01-13	Foundation Models at Work: Fine-Tuning for Fairness in Algorithmic Hiring	Buse Sibel Korkmaz et.al.	2501.07324	link
2025-01-13	ViewVR: Visual Feedback Modes to Achieve Quality of VR-based Telemanipulation	A. Erkhov et.al.	2501.07299	link
2025-01-13	Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion	Li Liang et.al.	2501.07260	link
2025-01-10	ScooterLab: A Programmable and Participatory Sensing Research Testbed using Micromobility Vehicles	Ubaidullah Khan et.al.	2501.06177	null
2025-01-10	VideoAuteur: Towards Long Narrative Video Generation	Junfei Xiao et.al.	2501.06173	null
2025-01-10	GenMol: A Drug Discovery Generalist with Discrete Diffusion	Seul Lee et.al.	2501.06158	null
2025-01-10	From discrete-time policies to continuous-time diffusion samplers: Asymptotic equivalences and faster training	Julius Berner et.al.	2501.06148	link
2025-01-10	The interplay of user preference and precision in different gaze-based interaction methods	Björn Rene Severitt et.al.	2501.06073	null
2025-01-10	Photokinetics of Photothermal Reactions	Mounir Maafi et.al.	2501.06057	null
2025-01-10	Nonisotropic Gaussian Diffusion for Realistic 3D Human Motion Prediction	Cecilia Curreli et.al.	2501.06035	null
2025-01-10	Resiliency metrics quantifying emergency response in a distribution system	Shikhar Pandey et.al.	2501.06030	null
2025-01-10	RPKI-Based Location-Unaware Tor Guard Relay Selection Algorithms	Zhifan Lu et.al.	2501.06010	link
2025-01-10	CamCtrl3D: Single-Image Scene Exploration with Precise 3D Camera Control	Stefan Popov et.al.	2501.06006	null
2025-01-10	Model Inversion in Split Learning for Personalized LLMs: New Insights from Information Bottleneck Theory	Yunmeng Shu et.al.	2501.05965	null
2025-01-10	Estimation and Restoration of Unknown Nonlinear Distortion using Diffusion	Michal Švento et.al.	2501.05959	link
2025-01-10	DiffuSETS: 12-lead ECG Generation Conditioned on Clinical Text Reports and Patient-Specific Information	Yongfan Lai et.al.	2501.05932	link
2025-01-10	Beyond Flat Text: Dual Self-inherited Guidance for Visual Text Generation	Minxing Luo et.al.	2501.05892	null
2025-01-10	Poetry in Pixels: Prompt Tuning for Poem Image Generation via Diffusion Models	Sofia Jamil et.al.	2501.05839	link
2025-01-09	Decentralized Diffusion Models	David McAllister et.al.	2501.05450	null
2025-01-09	Consistent Flow Distillation for Text-to-3D Generation	Runjie Yan et.al.	2501.05445	null
2025-01-09	Progressive Growing of Video Tokenizers for Highly Compressed Latent Spaces	Aniruddha Mahapatra et.al.	2501.05442	null
2025-01-09	The GAN is dead; long live the GAN! A Modern GAN Baseline	Yiwen Huang et.al.	2501.05441	link
2025-01-09	Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation	Xuyi Meng et.al.	2501.05427	null
2025-01-09	Seeing Sound: Assembling Sounds from Visuals for Audio-to-Image Generation	Darius Petermann et.al.	2501.05413	null
2025-01-09	TimeDP: Learning to Generate Multi-Domain Time Series with Domain Prompts	Yu-Hao Huang et.al.	2501.05403	link
2025-01-09	Integrating Explainable AI for Effective Malware Detection in Encrypted Network Traffic	Sileshi Nibret Zeleke et.al.	2501.05387	null
2025-01-09	Accelerated Diffusion Models via Speculative Sampling	Valentin De Bortoli et.al.	2501.05370	null
2025-01-09	CROPS: Model-Agnostic Training-Free Framework for Safe Image Synthesis with Latent Diffusion Models	Junha Park et.al.	2501.05359	null
2025-01-09	Video-Conferencing Beyond Screen-Sharing and Thumbnail Webcam Videos: Gesture-Aware Augmented Reality Video for Data-Rich Remote Presentations	Matthew Brehmer et.al.	2501.05345	null
2025-01-09	The Bakers and Millers Game with Restricted Locations	Simon Krogmann et.al.	2501.05334	null
2025-01-09	Patch-GAN Transfer Learning with Reconstructive Models for Cloud Removal	Wanli Ma et.al.	2501.05265	null
2025-01-09	Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes	Ludwic Leonard et.al.	2501.05226	link
2025-01-09	A Novel Approach to Scalable and Automatic Topic-Controlled Question Generation in Education	Ziqing Li et.al.	2501.05220	null
2025-01-08	EditAR: Unified Conditional Generation with Autoregressive Models	Jiteng Mu et.al.	2501.04699	null
2025-01-08	ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning	Yuzhou Huang et.al.	2501.04698	null
2025-01-08	SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images	Zixuan Huang et.al.	2501.04689	null
2025-01-08	URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics	Ruilin Luo et.al.	2501.04686	link
2025-01-08	Integrating IPbus ALFRED into the ALICE-FIT setup	Krystian Roslon et.al.	2501.04685	null
2025-01-08	Enhancing Financial VQA in Vision Language Models using Intermediate Structured Representations	Archita Srivastava et.al.	2501.04675	null
2025-01-08	A Statistical Theory of Contrastive Pre-training and Multimodal Generative AI	Kazusato Oko et.al.	2501.04641	link
2025-01-08	Knowledge Retrieval Based on Generative AI	Te-Lun Yang et.al.	2501.04635	null
2025-01-08	Disentangled Clothed Avatar Generation with Layered Representation	Weitian Zhang et.al.	2501.04631	null
2025-01-09	MedCoDi-M: A Multi-Prompt Foundation Model for Multimodal Medical Data Generation	Daniele Molino et.al.	2501.04614	null
2025-01-08	Enhancing Low-Cost Video Editing with Lightweight Adaptors and Temporal-Aware Inversion	Yangfan He et.al.	2501.04606	link
2025-01-08	Understanding Expectations for a Robotic Guide Dog for Visually Impaired People	J. Taery Kim et.al.	2501.04594	null
2025-01-08	Improving Image Captioning by Mimicking Human Reformulation Feedback at Inference-time	Uri Berger et.al.	2501.04513	null
2025-01-08	Simultaneous MOKE imaging and measurement of magneto-resistance with vector magnet: a low noise customized setup for low field magnetic devices and thin films characterization	Imtiaz Noor Bhatti et.al.	2501.04431	null
2025-01-08	End-to-End Bangla AI for Solving Math Olympiad Problem Benchmark: Leveraging Large Language Model Using Integrated Approach	H. M. Shadman Tabib et.al.	2501.04425	null
2025-01-07	WAPTS: A Weighted Allocation Probability Adjusted Thompson Sampling Algorithm for High-Dimensional and Sparse Experiment Settings	Haochen Song et.al.	2501.03999	null
2025-01-07	Synthetic Data for Portfolios: A Throw of the Dice Will Never Abolish Chance	Adil Rengim Cetingoz et.al.	2501.03993	null
2025-01-07	NeuralSVG: An Implicit Representation for Text-to-Vector Generation	Sagi Polaczek et.al.	2501.03992	null
2025-01-07	Stabilising effect of generic anomalous diffusion independent of the Rayleigh number	Antonio Barletta et.al.	2501.03990	null
2025-01-07	Synthetic Data Privacy Metrics	Amy Steier et.al.	2501.03941	null
2025-01-07	Visual question answering: from early developments to recent advances – a survey	Ngoc Dung Huynh et.al.	2501.03939	null
2025-01-07	A precise asymptotic analysis of learning diffusion models: theory and insights	Hugo Cui et.al.	2501.03937	link
2025-01-07	Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers	Yuechen Zhang et.al.	2501.03931	link
2025-01-07	HYB-VITON: A Hybrid Approach to Virtual Try-On Combining Explicit and Implicit Warping	Kosuke Takemoto et.al.	2501.03910	link
2025-01-07	mFabric: An Efficient and Scalable Fabric for Mixture-of-Experts Training	Xudong Liao et.al.	2501.03905	null
2025-01-07	Rendezfood: A Design Case Study of a Conversational Location-based Approach in Restaurants	Philip Weber et.al.	2501.03862	null
2025-01-07	Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control	Zekai Gu et.al.	2501.03847	link
2025-01-07	Deep Sylvester Posterior Inference for Adaptive Compressed Sensing in Ultrasound Imaging	Simon W. Penninga et.al.	2501.03825	null
2025-01-07	Impact of diffusion mechanisms on persistence and spreading	Nathanaël Boutillon et.al.	2501.03816	null
2025-01-07	Private, Auditable, and Distributed Ledger for Financial Institutes	Shaltiel Eloul et.al.	2501.03808	link
2025-01-06	MObI: Multimodal Object Inpainting Using Diffusion Models	Alexandru Buburuzan et.al.	2501.03173	null
2025-01-06	Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches	Alhassan Mumuni et.al.	2501.03151	null
2025-01-06	DDRM-PR: Fourier Phase Retrieval using Denoising Diffusion Restoration Models	Mehmet Onurcan Kaya et.al.	2501.03030	link
2025-01-06	TransPixar: Advancing Text-to-Video Generation with Transparency	Luozhou Wang et.al.	2501.03006	link
2025-01-06	STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution	Rui Xie et.al.	2501.02976	null
2025-01-06	Leader Rotation Is Not Enough: Scrutinizing Leadership Democracy of Chained BFT Consensus	Yining Tang et.al.	2501.02970	null
2025-01-07	SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild	Jiawei Liu et.al.	2501.02962	null
2025-01-06	Inhibition of bacterial growth by antibiotics	Barnabe Ledoux et.al.	2501.02944	null
2025-01-06	Deep Generative Model-Aided Power System Dynamic State Estimation and Reconstruction with Unknown Control Inputs or Data Distributions	Jianhua Pei et.al.	2501.02928	null
2025-01-06	Pointmap-Conditioned Diffusion for Consistent Novel View Synthesis	Thang-Anh-Quan Nguyen et.al.	2501.02913	null
2025-01-06	Sim-to-Real Transfer for Mobile Robots with Reinforcement Learning: from NVIDIA Isaac Sim to Gazebo and Real ROS 2 Robots	Sahar Salimpour et.al.	2501.02902	link
2025-01-06	Conditional Mutual Information Based Diffusion Posterior Sampling for Solving Inverse Problems	Shayan Mohajer Hamidi et.al.	2501.02880	null
2025-01-06	Towards HRTF Personalization using Denoising Diffusion Models	Juan Camilo Albarracín Sánchez et.al.	2501.02871	null
2025-01-07	Diff-Lung: Diffusion-Based Texture Synthesis for Enhanced Pathological Tissue Segmentation in Lung CT Scans	Rezkellah Noureddine Khiati et.al.	2501.02867	null
2025-01-06	Large Language Models for Video Surveillance Applications	Ulindu De Silva et.al.	2501.02850	null
2025-01-03	Metadata Conditioning Accelerates Language Model Pre-training	Tianyu Gao et.al.	2501.01956	link
2025-01-03	MADGEN – Mass-Spec attends to De Novo Molecular generation	Yinkai Wang et.al.	2501.01950	link
2025-01-03	Bridging Classification and Segmentation in Osteosarcoma Assessment via Foundation and Discrete Diffusion Models	Manh Duong Nguyen et.al.	2501.01932	link
2025-01-03	EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation	Siyuan Huang et.al.	2501.01895	null
2025-01-03	Exploring Equality: An Investigation into Custom Loss Functions for Fairness Definitions	Gordon Lee et.al.	2501.01889	null
2025-01-03	LCFed: An Efficient Clustered Federated Learning Framework for Heterogeneous Data	Yuxin Zhang et.al.	2501.01850	null
2025-01-03	MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning	Pu Yang et.al.	2501.01834	null
2025-01-03	Creating Artificial Students that Never Existed: Leveraging Large Language Models and CTGANs for Synthetic Data Generation	Mohammad Khalil et.al.	2501.01793	link
2025-01-03	Ingredients: Blending Custom Photos with Video Diffusion Transformers	Zhengcong Fei et.al.	2501.01790	link
2025-01-03	Nonparametric estimation of a factorizable density using diffusion models	Hyeok Kyu Kwon et.al.	2501.01783	null
2025-01-03	Customizing pseudospin unidirectional states of acoustic and electromagnetic waves in two-dimensional phoxonic topological insulators via multi-objective strategies	Gang-Gang Xu et.al.	2501.01766	null
2025-01-03	Constrained Pricing in Choice-based Revenue Management	Qian Shao et.al.	2501.01764	null
2025-01-03	Adverse Weather Conditions Augmentation of LiDAR Scenes with Latent Diffusion Models	Andrea Matteazzi et.al.	2501.01761	null
2025-01-03	MusicGen-Stem: Multi-stem music generation and edition through autoregressive modeling	Simon Rouard et.al.	2501.01757	null
2025-01-03	Combined Hyper-Extensible Extremely-Secured Zero-Trust CIAM-PAM architecture	Shivom Aggarwal et.al.	2501.01732	null
2025-01-02	Object-level Visual Prompts for Compositional Image Generation	Gaurav Parmar et.al.	2501.01424	null
2025-01-02	Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models	Jingfeng Yao et.al.	2501.01423	link
2025-01-02	Multi-Modal Video Feature Extraction for Popularity Prediction	Haixu Liu et.al.	2501.01422	null
2025-01-02	Deep Discrete Encoders: Identifiable Deep Generative Models for Rich Data with Discrete Latent Layers	Seunghyun Lee et.al.	2501.01414	null
2025-01-02	On Unifying Video Generation and Camera Pose Estimation	Chun-Hao Paul Huang et.al.	2501.01409	null
2025-01-02	Test-time Controllable Image Generation by Explicit Spatial Constraint Enforcement	Z. Zhang et.al.	2501.01368	null
2025-01-02	Contrastive Learning from Exploratory Actions: Leveraging Natural Interactions for Preference Elicitation	Nathaniel Dennler et.al.	2501.01367	null
2025-01-03	Conditional Consistency Guided Image Translation and Enhancement	Amil Bhagat et.al.	2501.01223	link
2025-01-03	TabTreeFormer: Tabular Data Generation Using Hybrid Tree-Transformer	Jiayu Li et.al.	2501.01216	null
2025-01-02	Range-Only Localization System for Small-Scale Flapping-Wing Robots	Raul Tapia et.al.	2501.01213	link
2025-01-02	LayeringDiff: Layered Image Synthesis via Generation, then Disassembly with Generative Knowledge	Kyoungkook Kang et.al.	2501.01197	null
2025-01-02	TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions	Vriksha Srihari et.al.	2501.01156	null
2025-01-02	Semantics-Guided Diffusion for Deep Joint Source-Channel Coding in Wireless Image Transmission	Maojun Zhang et.al.	2501.01138	link
2025-01-02	Co-Design of a Robot Controller Board and Indoor Positioning System for IoT-Enabled Applications	Ali Safa et.al.	2501.01115	null
2025-01-02	MalCL: Leveraging GAN-Based Generative Replay to Combat Catastrophic Forgetting in Malware Classification	Jimin Park et.al.	2501.01110	link
2024-12-30	The Gaussian Kicked Rotor: Periodic forcing with finite-width pulses and the role of shifting the kick	Jonathan Berkheim et.al.	2412.21186	null
2024-12-30	Unified dimensionality reduction techniques in chronic liver disease detection	Anand Karna et.al.	2412.21156	null
2025-01-02	Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation	Yuanbo Yang et.al.	2412.21117	null
2024-12-30	Impact of Fourth Industrial Revolution (4IR) on Small and Medium Enterprises (SMEs) and Employment in Bangladesh: Opportunities and Challenges	Toukir Ahammed et.al.	2412.21106	null
2024-12-30	Quantum Diffusion Model for Quark and Gluon Jet Generation	Mariia Baidachna et.al.	2412.21082	link
2025-01-02	Edicho: Consistent Image Editing in the Wild	Qingyan Bai et.al.	2412.21079	link
2024-12-30	Varformer: Adapting VAR’s Generative Prior for Image Restoration	Siyang Wang et.al.	2412.21063	link
2024-12-30	VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation	Jiazheng Xu et.al.	2412.21059	link
2024-12-30	E2EDiff: Direct Mapping from Noise to Data for Enhanced Diffusion Models	Zhiyu Tan et.al.	2412.21044	null
2024-12-30	Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration	Wanglong Lu et.al.	2412.21042	link
2024-12-30	TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization	Chia-Yu Hung et.al.	2412.21037	link
2024-12-30	Verified Lifting of Deep learning Operators	Qi Zhan et.al.	2412.20992	null
2024-12-30	AlignAb: Pareto-Optimal Energy Alignment for Designing Nature-Like Antibodies	Yibo Wen et.al.	2412.20984	null
2024-12-30	AGON: Automated Design Framework for Customizing Processors from ISA Documents	Chongxiao Li et.al.	2412.20954	null
2024-12-30	AI-Supported Data Analysis Boosts Student Motivation and Reduces Stress in Physics Education	Jannik Henze et.al.	2412.20951	null
2024-12-27	Tensor Network Estimation of Distribution Algorithms	John Gardiner et.al.	2412.19780	null
2024-12-27	Generative Video Propagation	Shaoteng Liu et.al.	2412.19761	null
2024-12-27	Complement or substitute? How AI increases the demand for human skills	Elina Mäkelä et.al.	2412.19754	null
2024-12-27	Text2Insight: Transform natural language text into insights seamlessly using multi-model architecture	Pradeep Sain et.al.	2412.19718	null
2024-12-27	From Elements to Design: A Layered Approach for Automatic Graphic Design Composition	Jiawei Lin et.al.	2412.19712	null
2024-12-27	An Integrated Optimization and Deep Learning Pipeline for Predicting Live Birth Success in IVF Using Feature Optimization and Transformer-Based Models	Arezoo Borji et.al.	2412.19696	null
2024-12-27	From prediction to explanation: managing influential negative reviews through explainable AI	Rongping Shen et.al.	2412.19692	null
2024-12-27	VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models	Tao Wu et.al.	2412.19645	null
2024-12-27	Diverse Rare Sample Generation with Pretrained GANs	Subeen Lee et.al.	2412.19543	link
2024-12-27	Scalable Hierarchical Reinforcement Learning for Hyper Scale Multi-Robot Task Planning	Xuan Zhou et.al.	2412.19538	null
2024-12-27	StyleRWKV: High-Quality and High-Efficiency Style Transfer with RWKV-like Architecture	Miaomiao Dai et.al.	2412.19535	null
2024-12-27	Lévy Score Function and Score-Based Particle Algorithm for Nonlinear Lévy–Fokker–Planck Equations	Yuanfei Huang et.al.	2412.19520	link
2024-12-27	Estimation of System Parameters Including Repeated Cross-Sectional Data through Emulator-Informed Deep Generative Model	Hyunwoo Cho et.al.	2412.19517	null
2024-12-27	DrivingWorld: ConstructingWorld Model for Autonomous Driving via Video GPT	Xiaotao Hu et.al.	2412.19505	link
2024-12-27	RobotDiffuse: Motion Planning for Redundant Manipulator based on Diffusion Model	Xiaohan Zhang et.al.	2412.19500	link
2024-12-24	PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models	Minghao Chen et.al.	2412.18608	null
2024-12-24	DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers	Yuntao Chen et.al.	2412.18607	null
2024-12-24	Explaining in Diffusion: Explaining a Classifier Through Hierarchical Semantics with Text-to-Image Diffusion Models	Tahira Kazimi et.al.	2412.18604	null
2024-12-24	Long-Form Speech Generation with Spoken Language Models	Se Jin Park et.al.	2412.18603	link
2024-12-24	ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation	Hongjie Li et.al.	2412.18600	null
2024-12-24	DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation	Minghong Cai et.al.	2412.18597	link
2024-12-24	LatentCRF: Continuous CRF for Efficient Latent Diffusion	Kanchana Ranasinghe et.al.	2412.18596	null
2024-12-24	Resolution-Robust 3D MRI Reconstruction with 2D Diffusion Priors: Diverse-Resolution Training Outperforms Interpolation	Anselm Krainovic et.al.	2412.18584	null
2024-12-24	3DEnhancer: Consistent Multi-View Diffusion for 3D Enhancement	Yihang Luo et.al.	2412.18565	null
2024-12-24	Elevating Information System Performance: A Deep Dive into Quality Metrics	Dana A Abdullah et.al.	2412.18512	null
2024-12-24	A region-wide, multi-year set of crop field boundary labels for Africa	L. D. Estes et.al.	2412.18483	null
2024-12-24	GeFL: Model-Agnostic Federated Learning with Generative Models	Honggu Kang et.al.	2412.18460	null
2024-12-24	Gaussian entropic optimal transport: Schrödinger bridges and the Sinkhorn algorithm	O. Deniz Akyildiz et.al.	2412.18432	null
2024-12-24	Fashionability-Enhancing Outfit Image Editing with Conditional Diffusion Models	Qice Qin et.al.	2412.18421	null
2024-12-24	Discovery of 2D Materials via Symmetry-Constrained Diffusion Model	Shihang Xu et.al.	2412.18414	null
2024-12-23	FaceLift: Single Image to 3D Head with View Generation and GS-LRM	Weijie Lyu et.al.	2412.17812	null
2024-12-23	PepTune: De Novo Generation of Therapeutic Peptides with Multi-Objective-Guided Discrete Diffusion	Sophia Tang et.al.	2412.17780	null
2024-12-23	The Superposition of Diffusion Models Using the Itô Density Estimator	Marta Skreta et.al.	2412.17762	null
2024-12-23	Superconductivity in Nanosystems: A Fruitful Path to New Phenomenology in Quantum Materials	M. V. Ramallo et.al.	2412.17722	null
2024-12-23	A Bias-Free Training Paradigm for More General AI-generated Image Detection	Fabrizio Guillaro et.al.	2412.17671	null
2024-12-23	Benchmarking Generative AI Models for Deep Learning Test Input Generation	Maryam et.al.	2412.17652	link
2024-12-23	DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder	Ente Lin et.al.	2412.17644	null
2024-12-23	ANID: How Far Are We? Evaluating the Discrepancies Between AI-synthesized Images and Natural Images through Multimodal Guidance	Renyang Liu et.al.	2412.17632	link
2024-12-23	Be More Diverse than the Most Diverse: Online Selection of Diverse Mixtures of Generative Models	Parham Rezaei et.al.	2412.17622	link
2024-12-23	Empathetic Response in Audio-Visual Conversations Using Emotion Preference Optimization and MambaCompressor	Yeonju Kim et.al.	2412.17572	null
2024-12-23	The Dynamic Duo of Collaborative Masking and Target for Advanced Masked Autoencoder Learning	Shentong Mo et.al.	2412.17566	null
2024-12-23	S-INF: Towards Realistic Indoor Scene Synthesis via Scene Implicit Neural Field	Zixi Liang et.al.	2412.17561	link
2024-12-23	Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing	Prakash Aryan et.al.	2412.17548	link
2024-12-23	Retention Score: Quantifying Jailbreak Risks for Vision Language Models	Zaitang Li et.al.	2412.17544	null
2024-12-23	CiteBART: Learning to Generate Citations for Local Citation Recommendation	Ege Yiğit Çelik et.al.	2412.17534	link
2024-12-20	Personalized Representation from Personalized Generation	Shobhita Sundaram et.al.	2412.16156	link
2024-12-20	Can Generative Video Models Help Pose Estimation?	Ruojin Cai et.al.	2412.16155	null
2024-12-20	FedGAT: A Privacy-Preserving Federated Approximation Algorithm for Graph Attention Networks	Siddharth Ambekar et.al.	2412.16144	null
2024-12-20	NeRF-To-Real Tester: Neural Radiance Fields as Test Image Generators for Vision of Autonomous Systems	Laura Weihl et.al.	2412.16141	null
2024-12-20	Predicting human cooperation: sensitizing drift-diffusion model to interaction and external stimuli	Lucila G. Alvarez-Zuzek et.al.	2412.16121	null
2024-12-20	Differentially Private Federated Learning of Diffusion Models for Synthetic Tabular Data Generation	Timur Sattarov et.al.	2412.16083	null
2024-12-20	Label-Efficient Data Augmentation with Video Diffusion Models for Guidewire Segmentation in Cardiac Fluoroscopy	Shaoyan Pan et.al.	2412.16050	null
2024-12-20	SafeCFG: Redirecting Harmful Classifier-Free Guidance for Safe Generation	Jiadong Pan et.al.	2412.16039	null
2024-12-20	Electric Vehicle Charging Stations Placement Optimization in Vietnam Using Mixed-Integer Nonlinear Programming Model	Quynh Vu Truc et.al.	2412.16025	link
2024-12-20	Data-Centric Improvements for Enhancing Multi-Modal Understanding in Spoken Conversation Modeling	Maximillian Chen et.al.	2412.15995	null
2024-12-20	Optimization of Beyond Diagonal RIS: A Universal Framework Applicable to Arbitrary Architectures	Zheyu Wu et.al.	2412.15965	null
2024-12-20	Reframing Image Difference Captioning with BLIP2IDC and Synthetic Augmentation	Gautier Evennou et.al.	2412.15939	link
2024-12-20	RiTTA: Modeling Event Relations in Text-to-Audio Generation	Yuhang He et.al.	2412.15922	link
2024-12-20	Less is More: Towards Green Code Large Language Models via Unified Structural Pruning	Guang Yang et.al.	2412.15921	null
2024-12-20	Semi-Supervised Adaptation of Diffusion Models for Handwritten Text Generation	Kai Brandenbusch et.al.	2412.15853	null
2024-12-19	LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis	Hanlin Wang et.al.	2412.15214	link
2024-12-19	Flowing from Words to Pixels: A Framework for Cross-Modality Evolution	Qihao Liu et.al.	2412.15213	null
2024-12-19	Generative Multiview Relighting for 3D Reconstruction under Extreme Illumination Variation	Hadi Alzayer et.al.	2412.15211	null
2024-12-19	AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation	Moayed Haji-Ali et.al.	2412.15191	null
2024-12-19	LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation	Weijia Shi et.al.	2412.15188	null
2024-12-19	Tiled Diffusion	Or Madar et.al.	2412.15185	null
2024-12-19	SqueezeMe: Efficient Gaussian Avatars for VR	Shunsuke Saito et.al.	2412.15171	null
2024-12-19	OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization	Jiacheng Zhang et.al.	2412.15159	null
2024-12-19	Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM	Yatai Ji et.al.	2412.15156	link
2024-12-19	Jet: A Modern Transformer-Based Normalizing Flow	Alexander Kolesnikov et.al.	2412.15129	null
2024-12-19	Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation	Yang Tian et.al.	2412.15109	link
2024-12-19	Learning Disentangled Equivariant Representation for Explicitly Controllable 3D Molecule Generation	Haoran Liu et.al.	2412.15086	null
2024-12-19	Eigenstate Preparation on Quantum Computers	Joey Bonitati et.al.	2412.15081	null
2024-12-19	Uni-Renderer: Unifying Rendering and Inverse Rendering Via Dual Stream Diffusion	Zhifei Chen et.al.	2412.15050	null
2024-12-19	DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space	Mang Ning et.al.	2412.15032	link
2024-12-18	AniDoc: Animation Creation Made Easier	Yihao Meng et.al.	2412.14173	null
2024-12-19	E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling	Zhihang Yuan et.al.	2412.14170	null
2024-12-18	Autoregressive Video Generation without Vector Quantization	Haoge Deng et.al.	2412.14169	link
2024-12-18	VideoDPO: Omni-Preference Alignment for Video Diffusion Generation	Runtao Liu et.al.	2412.14167	null
2024-12-18	MetaMorph: Multimodal Understanding and Generation via Instruction Tuning	Shengbang Tong et.al.	2412.14164	null
2024-12-18	MCMat: Multiview-Consistent and Physically Accurate PBR Material Generation	Shenhao Zhu et.al.	2412.14148	null
2024-12-18	Event-based Photometric Bundle Adjustment	Shuang Guo et.al.	2412.14111	link
2024-12-18	Future Research Avenues for Artificial Intelligence in Digital Gaming: An Exploratory Report	Markus Dablander et.al.	2412.14085	null
2024-12-18	SurgSora: Decoupled RGBD-Flow Diffusion Model for Controllable Surgical Video Generation	Tong Chen et.al.	2412.14018	null
2024-12-18	Comparative Analysis of Machine Learning-Based Imputation Techniques for Air Quality Datasets with High Missing Data Rates	Sen Yan et.al.	2412.13966	null
2024-12-18	A Rose by Any Other Name: LLM-Generated Explanations Are Good Proxies for Human Explanations to Collect Label Distributions on NLI	Beiduo Chen et.al.	2412.13942	link
2024-12-18	Development of a High-Resolution, High-Dynamic-Range Charge Detector for Ion Beam Monitoring	O. Adriani et.al.	2412.13934	null
2024-12-18	Investigating the Effects of Diffusion-based Conditional Generative Speech Models Used for Speech Enhancement on Dysarthric Speech	Joanna Reszka et.al.	2412.13933	null
2024-12-18	Graph-Driven Models for Gas Mixture Identification and Concentration Estimation on Heterogeneous Sensor Array Signals	Ding Wang et.al.	2412.13891	null
2024-12-18	Navigating limitations with precision: A fine-grained ensemble approach to wrist pathology recognition on a limited x-ray dataset	Ammar Ahmed et.al.	2412.13884	null
2024-12-17	CoMPaSS: Enhancing Spatial Understanding in Text-to-Image Diffusion Models	Gaoyang Zhang et.al.	2412.13195	link
2024-12-17	StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models	Yunzhi Yan et.al.	2412.13188	null
2024-12-17	Move-in-2D: 2D-Conditioned Human Motion Generation	Hsin-Ping Huang et.al.	2412.13185	null
2024-12-17	F-Bench: Rethinking Human Preference Evaluation Metrics for Benchmarking Face Generation, Customization, and Restoration	Lu Liu et.al.	2412.13155	null
2024-12-17	Prompt Augmentation for Self-supervised Text-guided Image Manipulation	Rumeysa Bodur et.al.	2412.13081	null
2024-12-17	3D MedDiffusion: A 3D Medical Diffusion Model for Controllable and High-quality Medical Image Generation	Haoshen Wang et.al.	2412.13059	null
2024-12-17	Guiding Generative Protein Language Models with Reinforcement Learning	Filippo Stocco et.al.	2412.12979	link
2024-12-18	Attentive Eraser: Unleashing Diffusion Model’s Object Removal Potential via Self-Attention Redirection Guidance	Wenhao Sun et.al.	2412.12974	link
2024-12-17	ArchesWeather & ArchesWeatherGen: a deterministic and generative model for efficient ML weather forecasting	Guillaume Couairon et.al.	2412.12971	link
2024-12-17	Modified UNIFAC 2.0 – A Group-Contribution Method Completed with Machine Learning	Nicolas Hayer et.al.	2412.12962	null
2024-12-17	MOPO: Multi-Objective Prompt Optimization for Affective Text Generation	Yarik Menchaca Resendiz et.al.	2412.12948	null
2024-12-17	Generation of cosmic ray trajectories by a Diffusion Model trained on test particles in 3D magnetohydrodynamic turbulence	Johannes Martin et.al.	2412.12923	null
2024-12-17	Unsupervised Region-Based Image Editing of Denoising Diffusion Models	Zixiang Li et.al.	2412.12912	null
2024-12-18	ArtAug: Enhancing Text-to-Image Generation through Synthesis-Understanding Interaction	Zhongjie Duan et.al.	2412.12888	link
2024-12-17	Memory-minimal quantum generation of stochastic processes: spectral invariants of quantum hidden Markov models	Magdalini Zonnios et.al.	2412.12812	null
2024-12-16	Causal Diffusion Transformers for Generative Modeling	Chaorui Deng et.al.	2412.12095	link
2024-12-16	CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models	Felix Taubner et.al.	2412.12093	null
2024-12-16	Wonderland: Navigating 3D Scenes from a Single Image	Hanwen Liang et.al.	2412.12091	null
2024-12-16	A LoRA is Worth a Thousand Pictures	Chenxi Liu et.al.	2412.12048	null
2024-12-16	LLMs for Cold-Start Cutting Plane Separator Configuration	Connor Lawless et.al.	2412.12038	link
2024-12-16	Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps	Linfeng Zhao et.al.	2412.12024	null
2024-12-16	The entropic optimal (self-)transport problem: Limit distributions for decreasing regularization with application to score function estimation	Gilles Mordant et.al.	2412.12007	null
2024-12-16	Controllable Shadow Generation with Single-Step Diffusion Models from Synthetic Data	Onur Tasar et.al.	2412.11972	null
2024-12-16	The Erdős unit distance problem for small point sets	Boris Alexeev et.al.	2412.11914	null
2024-12-16	CharacterBench: Benchmarking Character Customization of Large Language Models	Jinfeng Zhou et.al.	2412.11912	link
2024-12-16	Towards Understanding Systems Trade-offs in Retrieval-Augmented Generation Model Inference	Michael Shen et.al.	2412.11854	null
2024-12-16	ColorFlow: Retrieval-Augmented Image Sequence Colorization	Junhao Zhuang et.al.	2412.11815	null
2024-12-16	InterDyn: Controllable Interactive Dynamics with Video Diffusion Models	Rick Akkerman et.al.	2412.11785	null
2024-12-16	Joint Reconstruction of the Activity and the Attenuation in PET by Diffusion Posterior Sampling: a Feasibility Study	Clémentine Phung-Ngoc et.al.	2412.11776	null
2024-12-17	No More Adam: Learning Rate Scaling at Initialization is All You Need	Minghao Xu et.al.	2412.11768	link
2024-12-13	Towards a foundation model for heavy-ion collision experiments through point cloud diffusion	Manjunath Omana Kuttan et.al.	2412.10352	null
2024-12-13	BrushEdit: All-In-One Image Inpainting and Editing	Yaowei Li et.al.	2412.10316	null
2024-12-13	Iterating the Transient Light Transport Matrix for Non-Line-of-Sight Imaging	Talha Sultan et.al.	2412.10300	null
2024-12-13	Coherent 3D Scene Diffusion From a Single RGB Image	Manuel Dahnert et.al.	2412.10294	null
2024-12-13	Adversarial Robustness of Bottleneck Injected Deep Neural Networks for Task-Oriented Communication	Alireza Furutanpey et.al.	2412.10265	null
2024-12-13	Targeted Angular Reversal of Weights (TARS) for Knowledge Removal in Large Language Models	Harry J. Davies et.al.	2412.10257	null
2024-12-13	Exploring the Frontiers of Animation Video Generation in the Sora Era: Method, Dataset and Benchmark	Yudong Jiang et.al.	2412.10255	link
2024-12-13	Radiator Tailoring for Enhanced Performance in InAs-Based Near-Field Thermophotovoltaics	Mathieu Giroux et.al.	2412.10217	null
2024-12-13	GAF: Gaussian Avatar Reconstruction from Monocular Videos via Multi-view Diffusion	Jiapeng Tang et.al.	2412.10209	null
2024-12-13	Efficient Generative Modeling with Residual Vector Quantization-Based Tokens	Jaehyeon Kim et.al.	2412.10208	null
2024-12-13	Simple Guidance Mechanisms for Discrete Diffusion Models	Yair Schiff et.al.	2412.10193	link
2024-12-13	SwiftTry: Fast and Consistent Video Virtual Try-On with Diffusion Models	Hung Nguyen et.al.	2412.10178	null
2024-12-13	Learning payoffs while routing in skill-based queues	Sanne van Kempen et.al.	2412.10168	null
2024-12-13	The Art of Deception: Color Visual Illusions and Diffusion Models	Alex Gomez-Villa et.al.	2412.10122	null
2024-12-13	Familiarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training Data	Jonas Golde et.al.	2412.10121	link
2024-12-12	FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion	Haonan Qiu et.al.	2412.09626	null
2024-12-12	Illusion3D: 3D Multiview Illusion with 2D Diffusion Priors	Yue Feng et.al.	2412.09625	null
2024-12-12	GenEx: Generating an Explorable World	Taiming Lu et.al.	2412.09624	null
2024-12-12	OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation	Weiqi Li et.al.	2412.09623	null
2024-12-12	LoRACLR: Contrastive Adaptation for Customization of Diffusion Models	Enis Simsar et.al.	2412.09622	null
2024-12-12	SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training	Dongting Hu et.al.	2412.09619	null
2024-12-12	EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM	Zhuofan Zong et.al.	2412.09618	null
2024-12-12	Context Canvas: Enhancing Text-to-Image Diffusion Models with Knowledge Graph-Based RAG	Kavana Venkatesh et.al.	2412.09614	null
2024-12-13	Olympus: A Universal Task Router for Computer Vision Tasks	Yuanze Lin et.al.	2412.09612	link
2024-12-12	Owl-1: Omni World Model for Consistent Long Video Generation	Yuanhui Huang et.al.	2412.09600	link
2024-12-12	LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors	Yabo Chen et.al.	2412.09597	null
2024-12-12	Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion	Zexin He et.al.	2412.09593	null
2024-12-12	Improving the Reliability of Cable Broadband Networks via Proactive Network Maintenance	Jiyao Hu et.al.	2412.09564	null
2024-12-12	Meshtron: High-Fidelity, Artist-Like 3D Mesh Generation at Scale	Zekun Hao et.al.	2412.09548	null
2024-12-12	SimAvatar: Simulation-Ready Avatars with Layered Hair and Clothing	Xueting Li et.al.	2412.09545	null
2024-12-11	Generative Semantic Communication: Architectures, Technologies, and Applications	Jinke Ren et.al.	2412.08642	null
2024-12-11	DMin: Scalable Training Data Influence Estimation for Diffusion Models	Huawei Lin et.al.	2412.08637	link
2024-12-11	Multimodal Latent Language Modeling with Next-Token Diffusion	Yutao Sun et.al.	2412.08635	link
2024-12-11	An SDR-Based Monostatic Wi-Fi System with Analog Self-Interference Cancellation for Sensing	Andreas Toftegaard Kristensen et.al.	2412.08612	null
2024-12-12	Design2GarmentCode: Turning Design Concepts to Tangible Garments Through Program Synthesis	Feng Zhou et.al.	2412.08603	null
2024-12-11	TryOffAnyone: Tiled Cloth Generation from a Dressed Person	Ioannis Xarchakos et.al.	2412.08573	link
2024-12-12	Watermarking Training Data of Music Generation Models	Pascal Epple et.al.	2412.08549	null
2024-12-11	Orderly Management of Packets in RDMA by Eunomia	Sana Mahmood et.al.	2412.08540	null
2024-12-11	Ensemble-Based Quantum-Token Protocol Benchmarked on IBM Quantum Processors	Lucas Tsunaki et.al.	2412.08530	link
2024-12-11	Comparative Opinion Mining in Product Reviews: Multi-perspective Prompt-based Learning	Hai-Yen Thi Nguyen et.al.	2412.08508	null
2024-12-11	Open-Loop and Model Predictive Control for Electric Vehicle Charging to Manage Excess Renewable Energy Supply in Texas	Kelsey M. Nelson et.al.	2412.08505	null
2024-12-11	Learning Flow Fields in Attention for Controllable Person Image Generation	Zijian Zhou et.al.	2412.08486	link
2024-12-11	InvDiff: Invariant Guidance for Bias Mitigation in Diffusion Models	Min Hou et.al.	2412.08480	link
2024-12-11	CC-Diff: Enhancing Contextual Coherence in Remote Sensing Image Synthesis	Mu Zhang et.al.	2412.08464	null
2024-12-11	Federated Learning for Traffic Flow Prediction with Synthetic Data Augmentation	Fermin Orozco et.al.	2412.08460	null
2024-12-10	Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets	Zhen Liu et.al.	2412.07775	null
2024-12-10	UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics	Xi Chen et.al.	2412.07774	null
2024-12-10	From Slow Bidirectional to Fast Causal Video Generators	Tianwei Yin et.al.	2412.07772	null
2024-12-10	Make-A-Texture: Fast Shape-Aware Texture Generation in 3 Seconds	Xiaoyu Xiang et.al.	2412.07766	null
2024-12-10	Bayesian Optimization of Antibodies Informed by a Generative Model of Evolving Sequences	Alan Nawzad Amin et.al.	2412.07763	link
2024-12-10	Repurposing Pre-trained Video Diffusion Models for Event-based Video Interpolation	Jingxi Chen et.al.	2412.07761	null
2024-12-10	SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints	Jianhong Bai et.al.	2412.07760	link
2024-12-10	PortraitTalk: Towards Customizable One-Shot Audio-to-Talking Face Generation	Fatemeh Nazarieh et.al.	2412.07754	null
2024-12-10	Multi-Shot Character Consistency for Text-to-Video Generation	Yuval Atzmon et.al.	2412.07750	null
2024-12-10	StyleMaster: Stylize Your Video with Artistic Generation and Translation	Zixuan Ye et.al.	2412.07744	null
2024-12-10	STIV: Scalable Text and Image Conditioned Video Generation	Zongyu Lin et.al.	2412.07730	null
2024-12-10	ObjCtrl-2.5D: Training-free Object Control with Camera Poses	Zhouxia Wang et.al.	2412.07721	null
2024-12-10	ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer	Jinyi Hu et.al.	2412.07720	link
2024-12-10	Privacy-Preserving Customer Support: A Framework for Secure and Scalable Interactions	Anant Prakash Awasthi et.al.	2412.07687	null
2024-12-10	Optimizing Sensor Redundancy in Sequential Decision-Making Problems	Jonas Nüßlein et.al.	2412.07686	null
2024-12-10	[MASK] is All You Need	Vincent Tao Hu et.al.	2412.06787	link
2024-12-09	Tactile DreamFusion: Exploiting Tactile Sensing for 3D Generation	Ruihan Gao et.al.	2412.06785	link
2024-12-09	Diverse Score Distillation	Yanbo Xu et.al.	2412.06780	null
2024-12-09	Visual Lexicon: Rich Image Features in Language Space	XuDong Wang et.al.	2412.06774	null
2024-12-09	InstantRestore: Single-Step Personalized Face Restoration with Shared-Image Attention	Howard Zhang et.al.	2412.06753	null
2024-12-09	ONEBench to Test Them All: Sample-Level Benchmarking Over Open-Ended Capabilities	Adhiraj Ghosh et.al.	2412.06745	null
2024-12-10	ContRail: A Framework for Realistic Railway Image Synthesis using ControlNet	Andrei-Robert Alexandrescu et.al.	2412.06742	null
2024-12-09	Take Fake as Real: Realistic-like Robust Black-box Adversarial Attack to Evade AIGC Detection	Caiyun Xie et.al.	2412.06727	link
2024-12-09	You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale	Baorui Ma et.al.	2412.06699	link
2024-12-09	Gen-3Diffusion: Realistic Image-to-3D Generation via 2D & 3D Diffusion Synergy	Yuxuan Xue et.al.	2412.06698	null
2024-12-09	Diff5T: Benchmarking Human Brain Diffusion MRI with an Extensive 5.0 Tesla K-Space and Spatial Dataset	Shanshan Wang et.al.	2412.06666	null
2024-12-09	Efficiency Meets Fidelity: A Novel Quantization Framework for Stable Diffusion	Shuaiting Li et.al.	2412.06661	null
2024-12-09	MVReward: Better Aligning and Evaluating Multi-View Diffusion Models with Human Preferences	Weitao Wang et.al.	2412.06614	null
2024-12-09	Augmented reality for upper limb rehabilitation: real-time kinematic feedback with HoloLens 2	Beatrice Luciani et.al.	2412.06596	null
2024-12-09	EmoSpeech: A Corpus of Emotionally Rich and Contextually Detailed Speech Annotations	Weizhen Bian et.al.	2412.06581	null
2024-12-06	Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model	Lening Wang et.al.	2412.05280	link
2024-12-06	Perturb-and-Revise: Flexible 3D Editing with Generative Trajectories	Susung Hong et.al.	2412.05279	null
2024-12-06	Birth and Death of a Rose	Chen Geng et.al.	2412.05278	null
2024-12-06	MotionFlow: Attention-Driven Motion Transfer in Video Diffusion Models	Tuna Han Salih Meral et.al.	2412.05275	null
2024-12-06	Go-or-Grow Models in Biology: a Monster on a Leash	R. Thiessen et.al.	2412.05191	null
2024-12-06	Privacy Drift: Evolving Privacy Concerns in Incremental Learning	Sayyed Farid Ahamed et.al.	2412.05183	null
2024-12-06	DNF: Unconditional 4D Generation with Dictionary-based Neural Fields	Xinyi Zhang et.al.	2412.05161	null
2024-12-06	A text-to-tabular approach to generate synthetic patient data using LLMs	Margaux Tornqvist et.al.	2412.05153	link
2024-12-06	LoRA.rar: Learning to Merge LoRAs via Hypernetworks for Subject-Style Conditioned Image Generation	Donald Shenaj et.al.	2412.05148	link
2024-12-06	How to Squeeze An Explanation Out of Your Model	Tiago Roxo et.al.	2412.05134	null
2024-12-06	Probabilistic Galaxy Field Generation with Diffusion Models	Tanner Sether et.al.	2412.05131	null
2024-12-06	The Silent Prompt: Initial Noise as Implicit Guidance for Goal-Driven Image Generation	Ruoyu Wang et.al.	2412.05101	null
2024-12-06	Reconstructing Quantitative Cerebral Perfusion Images Directly From Measured Sinogram Data Acquired Using C-arm Cone-Beam CT	Haotian Zhao et.al.	2412.05084	null
2024-12-06	ReF-LDM: A Latent Diffusion Model for Reference-based Face Image Restoration	Chi-Wei Hsiao et.al.	2412.05043	null
2024-12-06	Get It Right: Improving Comprehensibility with Adaptable Speech Expression of a Humanoid Service Robot	Thomas Sievers et.al.	2412.05022	null
2024-12-05	PaintScene4D: Consistent 4D Scene Generation from Text Prompts	Vinayak Gupta et.al.	2412.04471	null
2024-12-05	LayerFusion: Harmonized Multi-Layer Text-to-Image Generation with Generative Priors	Yusuf Dalva et.al.	2412.04460	null
2024-12-05	Four-Plane Factorized Video Autoencoders	Mohammed Suhail et.al.	2412.04452	null
2024-12-05	MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation	Longtao Zheng et.al.	2412.04448	null
2024-12-05	DiCoDe: Diffusion-Compressed Deep Tokens for Autoregressive Video Generation with Language Models	Yizhuo Li et.al.	2412.04446	null
2024-12-05	Learning Artistic Signatures: Symmetry Discovery and Style Transfer	Emma Finn et.al.	2412.04441	null
2024-12-05	GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration	Kaiyi Huang et.al.	2412.04440	null
2024-12-05	Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation	Yuying Ge et.al.	2412.04432	link
2024-12-05	Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis	Jian Han et.al.	2412.04431	link
2024-12-05	Reversible molecular simulation for training classical and machine learning force fields	Joe G Greener et.al.	2412.04374	link
2024-12-05	Machine Theory of Mind for Autonomous Cyber-Defence	Luke Swaby et.al.	2412.04367	null
2024-12-05	ActFusion: a Unified Diffusion Model for Action Segmentation and Anticipation	Dayoung Gong et.al.	2412.04353	null
2024-12-05	RMD: A Simple Baseline for More General Human Motion Generation via Training-free Retrieval-Augmented Motion Diffuse	Zhouyingcheng Liao et.al.	2412.04343	null
2024-12-05	Likelihood-Scheduled Score-Based Generative Modeling for Fully 3D PET Image Reconstruction	George Webber et.al.	2412.04339	null
2024-12-05	Multi-Subject Image Synthesis as a Generative Prior for Single-Subject PET Image Reconstruction	George Webber et.al.	2412.04324	null
2024-12-04	Navigation World Models	Amir Bar et.al.	2412.03572	null
2024-12-04	MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation	Zehuan Huang et.al.	2412.03558	null
2024-12-04	NODE-AdvGAN: Improving the transferability and perceptual similarity of adversarial examples by dynamic-system-driven adversarial generative model	Xinheng Xie et.al.	2412.03539	null
2024-12-04	NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images	Lingen Li et.al.	2412.03517	null
2024-12-04	Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion	Shengyuan Zhang et.al.	2412.03515	link
2024-12-04	Data Fusion of Semantic and Depth Information in the Context of Object Detection	Md Abu Yusuf et.al.	2412.03490	null
2024-12-04	Flow Matching with General Discrete Paths: A Kinetic-Optimal Perspective	Neta Shaul et.al.	2412.03487	null
2024-12-04	Pre-trained Multiple Latent Variable Generative Models are good defenders against Adversarial Attacks	Dario Serez et.al.	2412.03453	link
2024-12-04	CleanDIFT: Diffusion Features without Noise	Nick Stracke et.al.	2412.03439	link
2024-12-04	SINGER: Vivid Audio-driven Singing Video Generation with Multi-scale Spectral Diffusion Model	Yan Li et.al.	2412.03430	null
2024-12-04	Skel3D: Skeleton Guided Novel View Synthesis	Aron Fóthi et.al.	2412.03407	null
2024-12-04	Identifiability implies consistency of MLE in partially observed diffusions on a torus	Ibrahim Ekren et.al.	2412.03380	null
2024-12-04	TASR: Timestep-Aware Diffusion Model for Image Super-Resolution	Qinwei Lin et.al.	2412.03355	link
2024-12-04	DIVE: Taming DINO for Subject-Driven Video Editing	Yi Huang et.al.	2412.03347	null
2024-12-04	Geometry-guided Cross-view Diffusion for One-to-many Cross-view Image Synthesis	Tao Jun Lin et.al.	2412.03315	null
2024-12-03	Motion Prompting: Controlling Video Generation with Motion Trajectories	Daniel Geng et.al.	2412.02700	null
2024-12-03	Diffusion-based Visual Anagram as Multi-task Learning	Zhiyuan Xu et.al.	2412.02693	link
2024-12-03	FoundHand: Large-Scale Domain-Specific Learning for Controllable Hand Image Generation	Kefan Chen et.al.	2412.02690	null
2024-12-04	SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance	Viet Nguyen et.al.	2412.02687	null
2024-12-03	AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction	Lingteng Qiu et.al.	2412.02684	null
2024-12-03	Sharp-It: A Multi-view to Multi-view Diffusion Model for 3D Synthesis and Manipulation	Yiftach Edelstein et.al.	2412.02631	null
2024-12-03	The effect of priors on Learning with Restricted Boltzmann Machines	Gianluca Manzan et.al.	2412.02623	null
2024-12-03	ComPair-2: A Next Generation Medium Energy Gamma-ray Telescope Prototype	Regina Caputo et.al.	2412.02562	null
2024-12-03	The Two-Center Problem of Uncertain Points on Cactus Graphs	Haitao Xu et.al.	2412.02559	null
2024-12-03	ShadowHack: Hacking Shadows via Luminance-Color Divide and Conquer	Jin Hu et.al.	2412.02545	link
2024-12-03	Unveiling Concept Attribution in Diffusion Models	Quang H. Nguyen et.al.	2412.02542	link
2024-12-03	LLMForecaster: Improving Seasonal Event Forecasts with Unstructured Textual Data	Hanyu Zhang et.al.	2412.02525	null
2024-12-03	GerPS-Compare: Comparing NER methods for legal norm analysis	Sarah T. Bachinger et.al.	2412.02427	null
2024-12-03	It Takes Two: Real-time Co-Speech Two-person’s Interaction Generation via Reactive Auto-regressive Diffusion Model	Mingyi Shi et.al.	2412.02419	null
2024-12-03	A Multi-Agent Framework for Extensible Structured Text Generation in PLCs	Donghao Yang et.al.	2412.02410	null
2024-11-29	Nanostructured micrometric-pore membranes for nanofiltration: Micrometric geometry may optimize performance, energy efficiency and operational lifetime	J. C. Verde et.al.	2411.19900	null
2024-11-29	Input-Output Optics as a Causal Time Series Mapping: A Generative Machine Learning Solution	Abhijit Sen et.al.	2411.19897	null
2024-11-29	MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks	Yiming Wu et.al.	2411.19786	null
2024-11-29	Riemannian Denoising Score Matching for Molecular Structure Optimization with Accurate Energy	Jeheon Woo et.al.	2411.19769	null
2024-11-29	JetFormer: An Autoregressive Generative Model of Raw Images and Text	Michael Tschannen et.al.	2411.19722	link
2024-11-29	Inverse Design of Mechanical Metamaterials Using a Point-Cloud-Based Deep Generative Model	Seungwook Hong et.al.	2411.19681	null
2024-11-29	TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting	Bojun Xiong et.al.	2411.19654	link
2024-11-29	Uniform Attention Maps: Boosting Image Fidelity in Reconstruction and Editing	Wenyi Mo et.al.	2411.19652	link
2024-11-29	Enhancing Security in Third-Party Library Reuse – Comprehensive Detection of 1-day Vulnerability through Code Patch Analysis	Shangzhi Xu et.al.	2411.19648	null
2024-11-29	Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings	Qiong Wu et.al.	2411.19628	link
2024-11-29	Unimib Assistant: designing a student-friendly RAG-based chatbot for all their needs	Chiara Antico et.al.	2411.19554	null
2024-11-29	Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook	Florinel-Alin Croitoru et.al.	2411.19537	link
2024-11-29	Quantized Delta Weight Is Safety Keeper	Yule Liu et.al.	2411.19530	null
2024-12-02	DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding	Jungbin Cho et.al.	2411.19527	null
2024-11-29	Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis	Tianqi Li et.al.	2411.19509	link
2024-11-27	Textured Gaussians for Enhanced 3D Scene Appearance Modeling	Brian Chao et.al.	2411.18625	null
2024-11-27	GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data	Wentao Wang et.al.	2411.18624	null
2024-11-27	Diffusion Self-Distillation for Zero-Shot Customized Image Generation	Shengqu Cai et.al.	2411.18616	null
2024-11-27	CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models	Rundi Wu et.al.	2411.18613	null
2024-11-27	Evaluating and Improving the Effectiveness of Synthetic Chest X-Rays for Medical Image Analysis	Eva Prakash et.al.	2411.18602	null
2024-11-27	Bit symmetry entails the symmetry of the quantum transition probability	Gerd Niestegge et.al.	2411.18589	null
2024-11-27	Building Confidence in Deep Generative Protein Design	Tianyuan Zheng et.al.	2411.18568	link
2024-11-27	High-throughput antibody screening with high-quality factor nanophotonics and bioprinting	Sajjad Abdollahramezani et.al.	2411.18557	null
2024-11-27	FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion	Haosen Yang et.al.	2411.18552	null
2024-11-28	Enhancing weed detection performance by means of GenAI-based image augmentation	Sourav Modak et.al.	2411.18513	null
2024-11-27	GATE OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation	Pengfei Zhou et.al.	2411.18499	null
2024-11-27	Synthetic ECG Generation for Data Augmentation and Transfer Learning in Arrhythmia Classification	José Fernando Núñez et.al.	2411.18456	null
2024-11-27	Is my Meeting Summary Good? Estimating Quality with a Multi-LLM Evaluator	Frederic Kirstein et.al.	2411.18444	null
2024-11-27	Learning the Evolution of Physical Structure of Galaxies via Diffusion Models	Andrew Lizarraga et.al.	2411.18440	link
2024-11-27	Search for heavy scalar or pseudoscalar states in $\mathrm{t \bar{t}}$ events at CMS	Laurids Jeppe et.al.	2411.18414	null
2024-11-27	StableAnimator: High-Quality Identity-Preserving Human Image Animation	Shuyuan Tu et.al.	2411.17697	link
2024-11-26	ScribbleLight: Single Image Indoor Relighting with Scribbles	Jun Myeong Choi et.al.	2411.17696	null
2024-11-26	Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis	Akshita Gupta et.al.	2411.17690	null
2024-11-26	GenDeg: Diffusion-Based Degradation Synthesis for Generalizable All-in-One Image Restoration	Sudarshan Rajagopalan et.al.	2411.17687	null
2024-11-26	Semi-analytical model for the calculation of solar radiation pressure and its effects on a LEO satellite with predicting the change in position vectors using machine learning techniques	Pranava Seth et.al.	2411.17626	null
2024-11-26	Accelerating Vision Diffusion Transformers with Skip Branches	Guanjie Chen et.al.	2411.17616	link
2024-11-26	Mixed-State Quantum Denoising Diffusion Probabilistic Model	Gino Kwun et.al.	2411.17608	null
2024-11-26	Making History Readable	Bipasha Banerjee et.al.	2411.17600	null
2024-11-26	VideoDirector: Precise Video Editing via Text-to-Video Models	Yukun Wang et.al.	2411.17592	null
2024-11-26	Rapid Deployment of Domain-specific Hyperspectral Image Processors with Application to Autonomous Driving	Jon Gutiérrez-Zaballa et.al.	2411.17543	null
2024-11-26	Metaverse Innovation Canvas: A Tool for Extended Reality Product/Service Development	Amir Reza Asadi et.al.	2411.17541	null
2024-11-26	IMPROVE: Improving Medical Plausibility without Reliance on HumanValidation – An Enhanced Prototype-Guided Diffusion Framework	Anurag Shandilya et.al.	2411.17535	null
2024-11-26	FTMoMamba: Motion Generation with Frequency and Text State Space Models	Chengjian Li et.al.	2411.17532	null
2024-11-26	Exact and Heuristic Approaches for the Covering Tour Location Routing Problem	Andreas Hagn et.al.	2411.17510	link
2024-11-26	WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model	Zongjian Li et.al.	2411.17459	link
2024-11-25	Generative Omnimatte: Learning to Decompose Video into Layers	Yao-Chih Lee et.al.	2411.16683	null
2024-11-25	Diffusion Features for Zero-Shot 6DoF Object Pose Estimation	Bernd Von Gimborn et.al.	2411.16668	null
2024-11-25	DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation	Zun Wang et.al.	2411.16657	null
2024-11-25	Exploring Discrete Flow Matching for 3D De Novo Molecule Generation	Ian Dunn et.al.	2411.16644	link
2024-11-25	LegoPET: Hierarchical Feature Guided Conditional Diffusion for PET Image Reconstruction	Yiran Sun et.al.	2411.16629	link
2024-11-25	Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models	Ronghuan Wu et.al.	2411.16602	null
2024-11-25	Unlocking The Potential of Adaptive Attacks on Diffusion-Based Purification	Andre Kassis et.al.	2411.16598	link
2024-11-25	Rethinking Diffusion for Text-Driven Human Motion Generation	Zichong Meng et.al.	2411.16575	null
2024-11-25	Representation Collapsing Problems in Vector Quantization	Wenhao Zhao et.al.	2411.16550	null
2024-11-25	ADOBI: Adaptive Diffusion Bridge For Blind Inverse Problems with Application to MRI Reconstruction	Yuyang Hu et.al.	2411.16535	null
2024-11-25	PriorPath: Coarse-To-Fine Approach for Controlled De-Novo Pathology Semantic Masks Generation	Nati Daniel et.al.	2411.16515	null
2024-11-25	Noise Diffusion for Enhancing Semantic Faithfulness in Text-to-Image Synthesis	Boming Miao et.al.	2411.16503	null
2024-11-25	Multi-Resolution Generative Modeling of Human Motion from Limited Data	David Eduardo Moreno-Villamarín et.al.	2411.16498	null
2024-11-25	Learning by Analogy: Enhancing Few-Shot Prompting for Math Word Problem Solving with Computational Graph-Based Retrieval	Xiaocong Yang et.al.	2411.16454	null
2024-11-25	Model-based reinforcement corrosion prediction: Continuous calibration with Bayesian optimization and corrosion wire sensor data	A. Potnis et.al.	2411.16447	null
2024-11-22	DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving	Bencheng Liao et.al.	2411.15139	link
2024-11-22	Material Anything: Generating Materials for Any 3D Object via Diffusion	Xin Huang et.al.	2411.15138	null
2024-11-22	VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement	Daeun Lee et.al.	2411.15115	null
2024-11-22	RE-Bench: Evaluating frontier AI R&D capabilities of language model agents against human experts	Hjalmar Wijk et.al.	2411.15114	link
2024-11-22	Efficient Pruning of Text-to-Image Models: Insights from Pruning Stable Diffusion	Samarth N Ramesh et.al.	2411.15113	null
2024-11-22	Leapfrog Latent Consistency Model (LLCM) for Medical Images Generation	Lakshmikar R. Polamreddy et.al.	2411.15084	link
2024-11-22	Towards Speaker Identification with Minimal Dataset and Constrained Resources using 1D-Convolution Neural Network	Irfan Nafiz Shahan et.al.	2411.15082	link
2024-11-22	Empowering Clients: Transformation of Design Processes Due to Generative AI	Johannes Schneider et.al.	2411.15061	null
2024-11-22	The 1D nonlocal Fisher-KPP equation with a top hat kernel. Part 3. The effect of perturbations in the kernel	David John Needham et.al.	2411.15054	null
2024-11-22	FloAt: Flow Warping of Self-Attention for Clothing Animation Generation	Swasti Shreya Mishra et.al.	2411.15028	null
2024-11-22	Enhancing Exploration with Diffusion Policies in Hybrid Off-Policy RL: Application to Non-Prehensile Manipulation	Huy Le et.al.	2411.14913	null
2024-11-22	Dynamically Encircled Higher-order Exceptional Points in an Optical Fiber	Arpan Roy et.al.	2411.14874	null
2024-11-22	Prioritize Denoising Steps on Diffusion Model Preference Alignment via Explicit Denoised Distribution Estimation	Dingyuan Shi et.al.	2411.14871	null
2024-11-22	Latent Schrodinger Bridge: Prompting Latent Diffusion for Fast Unpaired Image-to-Image Translation	Jeongsol Kim et.al.	2411.14863	null
2024-11-22	Style-Friendly SNR Sampler for Style-Driven Generation	Jooyoung Choi et.al.	2411.14793	null
2024-11-21	Stable Flow: Vital Layers for Training-Free Image Editing	Omri Avrahami et.al.	2411.14430	link
2024-11-21	Transformer-based Heuristic for Advanced Air Mobility Planning	Jun Xiang et.al.	2411.14427	null
2024-11-21	A Python-Based Approach to Sputter Deposition Simulations in Combinatorial Materials Science	Felix Thelen et.al.	2411.14413	null
2024-11-21	Multi-Agent Environments for Vehicle Routing Problems	Ricardo Gama et.al.	2411.14411	link
2024-11-21	Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation	Yuanhao Cai et.al.	2411.14384	null
2024-11-21	CoNFiLD-inlet: Synthetic Turbulence Inflow Using Generative Latent Diffusion Models with Neural Fields	Xin-Yang Liu et.al.	2411.14378	null
2024-11-21	Enhancing Medical Image Segmentation with Deep Learning and Diffusion Models	Houze Liu et.al.	2411.14353	null
2024-11-21	DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding	Tianhe Ren et.al.	2411.14347	link
2024-11-21	Lower Dimensional Spherical Representation of Medium Voltage Load Profiles for Visualization, Outlier Detection, and Generative Modelling	Edgar Mauricio Salazar Duque et.al.	2411.14346	null
2024-11-21	StereoCrafter-Zero: Zero-Shot Stereo Video Generation with Noisy Restart	Jian Shi et.al.	2411.14295	link
2024-11-21	Efficient Aspect-Based Summarization of Climate Change Reports with Small Language Models	Iacopo Ghinassi et.al.	2411.14272	link
2024-11-21	Guided MRI Reconstruction via Schrödinger Bridge	Yue Wang et.al.	2411.14269	null
2024-11-21	Regional Attention for Shadow Removal	Hengxing Liu et.al.	2411.14201	link
2024-11-21	TaQ-DiT: Time-aware Quantization for Diffusion Transformers	Xinyan Liu et.al.	2411.14172	null
2024-11-21	Creating a Formally Verified Neural Network for Autonomous Navigation: An Experience Report	Syed Ali Asadullah Bukhari et.al.	2411.14163	link
2024-11-20	REDUCIO! Generating 1024 $\times$ 1024 Video within 16 Seconds using Extremely Compressed Motion Latents	Rui Tian et.al.	2411.13552	link
2024-11-20	Identity Preserving 3D Head Stylization with Multiview Score Distillation	Bahri Batuhan Bilecen et.al.	2411.13536	null
2024-11-20	VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models	Ziqi Huang et.al.	2411.13503	link
2024-11-20	LIMBA: An Open-Source Framework for the Preservation and Valorization of Low-Resource Languages using Generative Models	Salvatore Mario Carta et.al.	2411.13453	null
2024-11-20	Heuristically Adaptive Diffusion-Model Evolutionary Strategy	Benedikt Hartl et.al.	2411.13420	null
2024-11-20	Energy-based generative models for monoclonal antibodies	Paul Pereira et.al.	2411.13390	link
2024-11-20	Small and Close-In Planets are Uncommon around A-type Stars	Steven Giacalone et.al.	2411.13363	null
2024-11-20	Vertical Validation: Evaluating Implicit Generative Models for Graphs on Thin Support Regions	Mai Elkady et.al.	2411.13358	null
2024-11-20	A CSI Feedback Framework based on Transmitting the Important Values and Generating the Others	Zhilin Du et.al.	2411.13298	null
2024-11-21	Structure-Based Molecule Optimization via Gradient-Guided Bayesian Update	Keyue Qiu et.al.	2411.13280	link
2024-11-20	XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation	Ziyi Wang et.al.	2411.13243	link
2024-11-20	BIPro: Zero-shot Chinese Poem Generation via Block Inverse Prompting Constrained Generation Framework	Xu Zou et.al.	2411.13237	null
2024-11-20	Building music with Lego bricks and Raspberry Pi	Ana M. Barbancho et.al.	2411.13224	null
2024-11-20	A computational framework for integrating Predictive processes with evidence Accumulation Models (PAM)	Antonino Visalli et.al.	2411.13203	link
2024-11-20	OpenMS WebApps: Building User-Friendly Solutions for MS Analysis	Tom David Müller et.al.	2411.13189	link
2024-11-19	Enhancing Multi-Class Disease Classification: Neoplasms, Cardiovascular, Nervous System, and Digestive Disorders Using Advanced LLMs	Ahmed Akib Jawad Karim et.al.	2411.12712	null
2024-11-19	OrigamiPlot: An R Package and Shiny Web App Enhanced Visualizations for Multivariate Data	Yiwen Lu et.al.	2411.12674	null
2024-11-19	Auto-Evaluation with Few Labels through Post-hoc Regression	Benjamin Eyre et.al.	2411.12665	null
2024-11-19	PoM: Efficient Image and Video Generation with the Polynomial Mixer	David Picard et.al.	2411.12663	link
2024-11-19	Optimizing Airline Reservation Systems with Edge-Enabled Microservices: A Framework for Real-Time Data Processing and Enhanced User Responsiveness	Biman Barua et.al.	2411.12650	null
2024-11-19	DLBacktrace: A Model Agnostic Explainability for any Deep Learning Models	Vinay Kumar Sankarapu et.al.	2411.12643	link
2024-11-19	Improving Controllability and Editability for Pretrained Text-to-Music Generation Models	Yixiao Zhang et.al.	2411.12641	null
2024-11-19	Universal programmable waveguide arrays	Akram Youssry et.al.	2411.12610	null
2024-11-19	Whisper Finetuning on Nepali Language	Sanjay Rijal et.al.	2411.12587	null
2024-11-19	Predicting Customer Satisfaction by Replicating the Survey Response Distribution	Etienne Manderscheid et.al.	2411.12539	null
2024-11-19	Data Pruning in Generative Diffusion Models	Rania Briq et.al.	2411.12523	link
2024-11-19	Probe-Me-Not: Protecting Pre-trained Encoders from Malicious Probing	Ruyi Ding et.al.	2411.12508	null
2024-11-19	Empirical Privacy Evaluations of Generative and Predictive Machine Learning Models – A review and challenges for practice	Flavio Hafner et.al.	2411.12451	null
2024-11-19	Frequency-Aware Guidance for Blind Image Restoration via Diffusion Models	Jun Xiao et.al.	2411.12450	null
2024-11-19	A general modeling and simulation framework for dynamic vehicle routing	Markó Horváth et.al.	2411.12406	link
2024-11-18	QARM: Quantitative Alignment Multi-Modal Recommendation at Kuaishou	Xinchen Luo et.al.	2411.11739	null
2024-11-18	Aligning Few-Step Diffusion Models with Dense Reward Difference Learning	Ziyi Zhang et.al.	2411.11727	link
2024-11-18	Multiscale nonlinear integration drives accurate encoding of input information	Giorgio Nicoletti et.al.	2411.11710	null
2024-11-18	Robust Reinforcement Learning under Diffusion Models for Data with Jumps	Chenyang Jiang et.al.	2411.11697	null
2024-11-18	Active droplets controlled by enzymatic reactions	Jacques Fries et.al.	2411.11696	null
2024-11-18	Do Captioning Metrics Reflect Music Semantic Alignment?	Jinwoo Lee et.al.	2411.11692	null
2024-11-18	Conceptwm: A Diffusion Model Watermark for Concept Protection	Liangqi Lei et.al.	2411.11688	null
2024-11-19	GNN-Based Code Annotation Logic for Establishing Security Boundaries in C Code	Varun Gadey et.al.	2411.11567	null
2024-11-19	Cascaded Diffusion Models for 2D and 3D Microscopy Image Synthesis to Enhance Cell Segmentation	Rüveyda Yilmaz et.al.	2411.11515	link
2024-11-18	Collaborative Contrastive Network for Click-Through Rate Prediction	Chen Gao et.al.	2411.11508	null
2024-11-18	LaVin-DiT: Large Vision Diffusion Transformer	Zhaoqing Wang et.al.	2411.11505	null
2024-11-18	Alien Recombination: Exploring Concept Blends Beyond Human Cognitive Availability in Visual Art	Alejandro Hernandez et.al.	2411.11494	null
2024-11-18	MVLight: Relightable Text-to-3D Generation via Light-conditioned Multi-View Diffusion	Dongseok Shim et.al.	2411.11475	null
2024-11-18	GLDesigner: Leveraging Multi-Modal LLMs as Designer for Enhanced Aesthetic Text Glyph Layouts	Junwen He et.al.	2411.11435	null
2024-11-18	CLUE-MARK: Watermarking Diffusion Models using CLWE	Kareem Shehata et.al.	2411.11434	null
2024-11-15	M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation	Sucheng Ren et.al.	2411.10433	link
2024-11-15	Mitigating Parameter Degeneracy using Joint Conditional Diffusion Model for WECC Composite Load Model in Power Systems	Feiqin Zhu et.al.	2411.10431	null
2024-11-15	Multiscale Dubuc: A New Similarity Measure for Time Series	Mahsa Khazaei et.al.	2411.10418	link
2024-11-15	Experimental generation of extreme electron beams for advanced accelerator applications	Claudio Emma et.al.	2411.10413	null
2024-11-15	How to Build a Quantum Supercomputer: Scaling Challenges and Opportunities	Masoud Mohseni et.al.	2411.10406	null
2024-11-15	Nonlinearity-Driven Morphing and Control of Topological Modes in Non-Hermitian Systems	Zhao-Fan Cai et.al.	2411.10398	null
2024-11-15	Towards High-Fidelity 3D Portrait Generation with Rich Details by Cross-View Prior-Aware Diffusion	Haoran Wei et.al.	2411.10369	null
2024-11-15	Safe Text-to-Image Generation: Simply Sanitize the Prompt Embedding	Huming Qiu et.al.	2411.10329	null
2024-11-15	Probabilistic Prior Driven Attention Mechanism Based on Diffusion Model for Imaging Through Atmospheric Turbulence	Guodong Sun et.al.	2411.10321	null
2024-11-15	Assortment Optimization under the Multinomial Logit Model with Covering Constraints	Omar El Housni et.al.	2411.10310	null
2024-11-15	Modification Takes Courage: Seamless Image Stitching via Reference-Driven Inpainting	Ziqi Xie et.al.	2411.10309	link
2024-11-15	MDHP-Net: Detecting Injection Attacks on In-vehicle Network using Multi-Dimensional Hawkes Process and Temporal Model	Qi Liu et.al.	2411.10258	null
2024-11-15	The Unreasonable Effectiveness of Guidance for Diffusion Models	Tim Kaiser et.al.	2411.10257	null
2024-11-15	Smooth transport map via diffusion process	Arthur Stéphanovitch et.al.	2411.10235	null
2024-11-15	ColorEdit: Training-free Image-Guided Color editing with diffusion model	Xingxi Yin et.al.	2411.10232	null
2024-11-14	A Bayesian Optimization Approach to Machine Translation Reranking	Julius Cheng et.al.	2411.09694	link
2024-11-14	SimTube: Generating Simulated Video Comments through Multimodal AI and User Personas	Yu-Kai Hung et.al.	2411.09577	null
2024-11-14	Golden Noise for Diffusion Models: A Learning Framework	Zikai Zhou et.al.	2411.09502	link
2024-11-14	Sparse Bayesian Generative Modeling for Compressive Sensing	Benedikt Böck et.al.	2411.09483	link
2024-11-14	DiffRoad: Realistic and Diverse Road Scenario Generation for Autonomous Vehicle Testing	Junjie Zhou et.al.	2411.09451	null
2024-11-14	Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models	Chutian Meng et.al.	2411.09449	null
2024-11-14	A survey of probabilistic generative frameworks for molecular simulations	Richard John et.al.	2411.09388	link
2024-11-14	Multi-scale Generative Modeling for Fast Sampling	Xiongye Xiao et.al.	2411.09356	null
2024-11-14	ParaLBench: A Large-Scale Benchmark for Computational Paralinguistics over Acoustic Foundation Models	Zixing Zhang et.al.	2411.09349	null
2024-11-15	Approximate Probabilistic Inference for Time-Series Data A Robust Latent Gaussian Model With Temporal Awareness	Anton Johansson et.al.	2411.09312	null
2024-11-14	EEG-Based Speech Decoding: A Novel Approach Using Multi-Kernel Ensemble Diffusion Models	Soowon Kim et.al.	2411.09302	null
2024-11-14	LES-Talker: Fine-Grained Emotion Editing for Talking Head Generation in Linear Emotion Space	Guanwen Feng et.al.	2411.09268	null
2024-11-14	Jailbreak Attacks and Defenses against Multimodal Generative Models: A Survey	Xuannan Liu et.al.	2411.09259	link
2024-11-14	RibCageImp: A Deep Learning Framework for 3D Ribcage Implant Generation	Gyanendra Chaubey et.al.	2411.09204	null
2024-11-14	Improvement and Implementation of a Speech Emotion Recognition Model Based on Dual-Layer LSTM	Xiaoran Yang et.al.	2411.09189	null
2024-11-13	4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization	Mijeong Kim et.al.	2411.08879	null
2024-11-13	A generalized software framework for consolidation of radiotherapy planning and delivery data from diverse data sources	Yasin Abdulkadir et.al.	2411.08876	null
2024-11-13	Offline Adaptation of Quadruped Locomotion using Diffusion Models	Reece O’Mahoney et.al.	2411.08832	link
2024-11-13	SANDWICH: Towards an Offline, Differentiable, Fully-Trainable Wireless Neural Ray-Tracing Surrogate	Yifei Jin et.al.	2411.08767	null
2024-11-13	Analyst Reports and Stock Performance: Evidence from the Chinese Market	Rui Liu et.al.	2411.08726	null
2024-11-14	Reducing ADC Front-end Costs During Training of On-sensor Printed Multilayer Perceptrons	Florentia Afentaki et.al.	2411.08674	link
2024-11-13	Joint Model Caching and Resource Allocation in Generative AI-Enabled Wireless Edge Networks	Zhang Liu et.al.	2411.08672	null
2024-11-13	Toward Human Understanding with Controllable Synthesis	Hanz Cuevas-Velasquez et.al.	2411.08663	null
2024-11-13	The Galactica database: an open, generic and versatile tool for the dissemination of simulation data in astrophysics	Damien Chapon et.al.	2411.08647	null
2024-11-13	Towards More Accurate Fake Detection on Images Generated from Advanced Generative and Neural Rendering Models	Chengdong Dong et.al.	2411.08642	null
2024-11-13	Deep Generative Demand Learning for Newsvendor and Pricing	Shijin Gong et.al.	2411.08631	null
2024-11-13	LG-Gaze: Learning Geometry-aware Continuous Prompts for Language-Guided Gaze Estimation	Pengwei Yin et.al.	2411.08606	null
2024-11-13	CorrSynth – A Correlated Sampling Method for Diverse Dataset Generation from LLMs	Suhas S Kowshik et.al.	2411.08553	null
2024-11-13	Explainers’ Mental Representations of Explainees’ Needs in Everyday Explanations	Michael Erol Schaffer et.al.	2411.08514	null
2024-11-13	HyperFace: Generating Synthetic Face Recognition Datasets by Exploring Face Embedding Hypersphere	Hatef Otroshi Shahreza et.al.	2411.08470	null
2024-11-12	Scaling Properties of Diffusion Models for Perceptual Tasks	Rahul Ravishankar et.al.	2411.08034	null
2024-11-12	GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation	Yushi Lan et.al.	2411.08033	null
2024-11-12	Wavelet Latent Diffusion (Wala): Billion-Parameter 3D Generative Model with Compact Wavelet Encodings	Aditya Sanghi et.al.	2411.08017	link
2024-11-12	JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation	Yiyang Ma et.al.	2411.07975	link
2024-11-12	Diverse capability and scaling of diffusion and auto-regressive models when learning abstract rules	Binxu Wang et.al.	2411.07873	null
2024-11-12	Trustful LLMs: Customizing and Grounding Text Generation with Knowledge Bases and Dual Decoders	Xiaofeng Zhu et.al.	2411.07870	null
2024-11-12	CDXFormer: Boosting Remote Sensing Change Detection with Extended Long Short-Term Memory	Zhenkai Wu et.al.	2411.07863	link
2024-11-12	Sparsity-Aware Optimization of In-Memory Bayesian Binary Neural Network Accelerators	Prabodh Katti et.al.	2411.07842	null
2024-11-12	Novel View Synthesis with Pixel-Space Diffusion Models	Noam Elata et.al.	2411.07765	null
2024-11-12	Nanosecond nanothermometry in an electron microscope	Florian Castioni et.al.	2411.07764	null
2024-11-12	LapGSR: Laplacian Reconstructive Network for Guided Thermal Super-Resolution	Aditya Kasliwal et.al.	2411.07750	null
2024-11-12	The relationship between general equilibrium models with infinite-lived agents and overlapping generations models, and some applications	Ngoc-Sang Pham et.al.	2411.07674	null
2024-11-12	Evaluating the Generation of Spatial Relations in Text and Image Generative Models	Shang Hong Sim et.al.	2411.07664	null
2024-11-12	Leveraging Previous Steps: A Training-free Fast Solver for Flow Diffusion	Kaiyu Song et.al.	2411.07627	null
2024-11-12	Unraveling the Connections between Flow Matching and Diffusion Probabilistic Models in Training-free Conditional Generation	Kaiyu Song et.al.	2411.07625	null
2024-11-11	Score-based generative diffusion with “active” correlated noise sources	Alexandra Lamtyugina et.al.	2411.07233	null
2024-11-12	Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models	Yoad Tewel et.al.	2411.07232	null
2024-11-11	Learning from Limited and Imperfect Data	Harsh Rangwani et.al.	2411.07229	null
2024-11-11	TempCharBERT: Keystroke Dynamics for Continuous Access Control Based on Pre-trained Language Models	Matheus Simão et.al.	2411.07224	null
2024-11-11	DLCR: A Generative Data Expansion Framework via Diffusion for Clothes-Changing Person Re-ID	Nyle Siddiqui et.al.	2411.07205	link
2024-11-11	Crossover from inhomogeneous to homogeneous response of a resonantly driven hBN quantum emitter	Domitille Gérard et.al.	2411.07202	null
2024-11-11	OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision	Cong Wei et.al.	2411.07199	null
2024-11-11	More Expressive Attention with Negative Weights	Ang Lv et.al.	2411.07176	link
2024-11-11	Edify 3D: Scalable High-Quality 3D Asset Generation	NVIDIA et.al.	2411.07135	null
2024-11-11	Benchmarking LLMs’ Judgments with No Gold Standard	Shengwei Xu et.al.	2411.07127	link
2024-11-11	Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models	NVIDIA et.al.	2411.07126	null
2024-11-11	Decoding Visual Experience and Mapping Semantics through Whole-Brain Analysis Using fMRI Foundation Models	Yanchen Wang et.al.	2411.07121	link
2024-11-11	Scaling Mesh Generation via Compressive Tokenization	Haohan Weng et.al.	2411.07025	link
2024-11-11	An Electrocardiogram Monitoring Device Based on STM32	Wenqi Guan et.al.	2411.06962	null
2024-11-11	Generative Feature Training of Thin 2-Layer Networks	Johannes Hertrich et.al.	2411.06848	link
2024-11-08	StdGEN: Semantic-Decomposed 3D Character Generation from Single Images	Yuze He et.al.	2411.05738	null
2024-11-08	Image2Text2Image: A Novel Framework for Label-Free Evaluation of Image-to-Text Generation with Text-to-Image Diffusion Models	Jia-Hong Huang et.al.	2411.05706	null
2024-11-08	Improving Molecular Graph Generation with Flow Matching and Optimal Transport	Xiaoyang Hou et.al.	2411.05676	null
2024-11-08	Towards Lifelong Few-Shot Customization of Text-to-Image Diffusion	Nan Song et.al.	2411.05544	null
2024-11-08	Improving image synthesis with diffusion-negative sampling	Alakh Desai et.al.	2411.05473	null
2024-11-08	Bridging the Gap between Learning and Inference for Diffusion-Based Molecule Generation	Peidong Liu et.al.	2411.05472	link
2024-11-08	IntellBot: Retrieval Augmented LLM Chatbot for Cyber Threat Knowledge Delivery	Dincy R. Arikkat et.al.	2411.05442	link
2024-11-08	RED: Residual Estimation Diffusion for Low-Dose PET Sinogram Reconstruction	Xingyu Ai et.al.	2411.05354	link
2024-11-08	Electro-diffusive modeling and the role of spine geometry on action potential propagation in neurons	Rahul Gulati et.al.	2411.05329	null
2024-11-08	Social balance in directed networks	Bingjie Hao et.al.	2411.05327	null
2024-11-08	SeqRFM: Fast RFM Analysis in Sequence Data	Yanxin Zheng et.al.	2411.05317	link
2024-11-08	Differentiable Calibration of Inexact Stochastic Simulation Models via Kernel Score Minimization	Ziwei Su et.al.	2411.05315	null
2024-11-08	A Real-time Face Mask Detection and Social Distancing System for COVID-19 using Attention-InceptionV3 Model	Abdullah Al Asif et.al.	2411.05312	null
2024-11-08	Adaptive Whole-Body PET Image Denoising Using 3D Diffusion Models with ControlNet	Boxiao Yu et.al.	2411.05302	null
2024-11-08	GPT Semantic Cache: Reducing LLM Costs and Latency via Semantic Embedding Caching	Sajal Regmi et.al.	2411.05276	null
2024-11-07	SVDQunat: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models	Muyang Li et.al.	2411.05007	link
2024-11-07	ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing	Jun-Kun Chen et.al.	2411.05006	null
2024-11-07	Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models	Shuhong Zheng et.al.	2411.05005	null
2024-11-07	ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning	David Junhao Zhang et.al.	2411.05003	null
2024-11-07	SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation	Koichi Namekata et.al.	2411.04989	null
2024-11-07	Few-Shot Task Learning through Inverse Generative Modeling	Aviv Netanyahu et.al.	2411.04987	null
2024-11-07	How fast does the WallGo? A package for computing wall velocities in first-order phase transitions	Andreas Ekstedt et.al.	2411.04970	link
2024-11-07	VAIR: Visuo-Acoustic Implicit Representations for Low-Cost, Multi-Modal Transparent Surface Reconstruction in Indoor Scenes	Advaith V. Sethuraman et.al.	2411.04963	null
2024-11-07	Uncovering Hidden Subspaces in Video Diffusion Models Using Re-Identification	Mischa Dombrowski et.al.	2411.04956	null
2024-11-07	Fed-LDR: Federated Local Data-infused Graph Creation with Node-centric Model Refinement	Jiechao Gao et.al.	2411.04936	null
2024-11-07	DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion	Wenqiang Sun et.al.	2411.04928	null
2024-11-07	StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration	Panwen Hu et.al.	2411.04925	null
2024-11-07	Stem-OB: Generalizable Visual Imitation Learning with Stem-Like Convergent Observation through Diffusion Inversion	Kaizhe Hu et.al.	2411.04919	link
2024-11-07	GASE: Generatively Augmented Sentence Encoding	Manuel Frank et.al.	2411.04914	null
2024-11-07	Controlling Human Shape and Pose in Text-to-Image Diffusion Models via Domain Adaptation	Benito Buchheim et.al.	2411.04724	null
2024-11-06	Community Forensics: Using Thousands of Generators to Train Fake Image Detectors	Jeongsoo Park et.al.	2411.04125	link
2024-11-06	Stepping Forward on the Last Mile	Chen Feng et.al.	2411.04036	null
2024-11-06	Prototyping O-RAN Enabled UAV Experimentation for the AERPAW Testbed	Joshua Moore et.al.	2411.04027	null
2024-11-06	Object-Centric Dexterous Manipulation from Human Motion Data	Yuanpei Chen et.al.	2411.04005	null
2024-11-06	Synomaly Noise and Multi-Stage Diffusion: A Novel Approach for Unsupervised Anomaly Detection in Ultrasound Imaging	Yuan Bi et.al.	2411.04004	link
2024-11-06	ET-SEED: Efficient Trajectory-Level SE(3) Equivariant Diffusion Policy	Chenrui Tie et.al.	2411.03990	null
2024-11-06	ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models	Ashutosh Srivastava et.al.	2411.03982	null
2024-11-06	Customized Multiple Clustering via Multi-Modal Subspace Proxy Learning	Jiawei Yao et.al.	2411.03978	link
2024-11-06	Bayesian algorithmic perfumery: A Hierarchical Relevance Vector Machine for the Estimation of Personalized Fragrance Preferences based on Three Sensory Layers and Jungian Personality Archetypes	Rolando Gonzales Martinez et.al.	2411.03965	null
2024-11-06	Long-Form Text-to-Music Generation with Adaptive Prompts: A Case of Study in Tabletop Role-Playing Games Soundtracks	Felipe Marra et.al.	2411.03948	link
2024-11-06	Can Custom Models Learn In-Context? An Exploration of Hybrid Architecture Performance on In-Context Learning Tasks	Ryan Campbell et.al.	2411.03945	link
2024-11-06	GUIDE-VAE: Advancing Data Generation with User Information and Pattern Dictionaries	Kutay Bölat et.al.	2411.03936	link
2024-11-06	Large Generative Model-assisted Talking-face Semantic Communication System	Feibo Jiang et.al.	2411.03876	null
2024-11-06	ROBIN: Robust and Invisible Watermarks for Diffusion Models with Adversarial Optimization	Huayang Huang et.al.	2411.03862	link
2024-11-06	Sub-DM:Subspace Diffusion Model with Orthogonal Decomposition for MRI Reconstruction	Yu Guan et.al.	2411.03758	link
2024-11-05	MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning	Ziliang Gan et.al.	2411.03314	null
2024-11-05	LLMs for Domain Generation Algorithm Detection	Reynier Leyva La O et.al.	2411.03307	null
2024-11-05	DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models	Ying Zhou et.al.	2411.03250	null
2024-11-05	On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models	Tariq Berrada Ifriqi et.al.	2411.03177	null
2024-11-05	Unleashing the power of novel conditional generative approaches for new materials discovery	Lev Novitskiy et.al.	2411.03156	link
2024-11-05	Local Lesion Generation is Effective for Capsule Endoscopy Image Data Augmentation in a Limited Data Setting	Adrian B. Chłopowiec et.al.	2411.03098	null
2024-11-05	Gradient-Guided Conditional Diffusion Models for Private Image Reconstruction: Analyzing Adversarial Impacts of Differential Privacy and Denoising	Tao Huang et.al.	2411.03053	null
2024-11-05	GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details	Zhongjin Luo et.al.	2411.03047	null
2024-11-05	Speaker Emotion Recognition: Leveraging Self-Supervised Models for Feature Extraction Using Wav2Vec2 and HuBERT	Pourya Jafarzadeh et.al.	2411.02964	null
2024-11-05	IMUDiffusion: A Diffusion Model for Multivariate Time Series Synthetisation for Inertial Motion Capturing Systems	Heiko Oppel et.al.	2411.02954	null
2024-11-05	LDPM: Towards undersampled MRI reconstruction with MR-VAE and Latent Diffusion Prior	Xingjian Tang et.al.	2411.02951	null
2024-11-05	A scalable generative model for dynamical system reconstruction from neuroimaging data	Eric Volkmann et.al.	2411.02949	link
2024-11-05	Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey	Ao Fu et.al.	2411.02914	null
2024-11-05	The Unreasonable Effectiveness of LLMs for Query Optimization	Peter Akioyamen et.al.	2411.02862	link
2024-11-05	ADOPT: Modified Adam Can Converge with Any $β_2$ with the Optimal Rate	Shohei Taniguchi et.al.	2411.02853	link
2024-11-04	Training-free Regional Prompting for Diffusion Transformers	Anthony Chen et.al.	2411.02395	link
2024-11-04	How Far is Video Generation from World Model: A Physical Law Perspective	Bingyi Kang et.al.	2411.02385	null
2024-11-04	Virgo Filaments IV: Using WISE to Measure the Modification of Star-Forming Disks in the Extended Regions Around the Virgo Cluster	Kim Conger et.al.	2411.02352	null
2024-11-04	Diffusion-based Generative Multicasting with Intent-aware Semantic Decomposition	Xinkai Liu et.al.	2411.02334	null
2024-11-05	PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance	Ruyang Liu et.al.	2411.02327	link
2024-11-04	LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation	Mufei Li et.al.	2411.02322	link
2024-11-04	CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments	Kung-Hsiang Huang et.al.	2411.02305	link
2024-11-04	Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation	Xianghui Yang et.al.	2411.02293	null
2024-11-04	Counterfactual Explanations via Riemannian Latent Space Traversal	Paraskevas Pegios et.al.	2411.02259	null
2024-11-04	FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training	Ruihong Yin et.al.	2411.02229	null
2024-11-04	Recursive Learning of Asymptotic Variational Objectives	Alessandro Mastrototaro et.al.	2411.02217	null
2024-11-04	Digi2Real: Bridging the Realism Gap in Synthetic Data Face Recognition via Foundation Models	Anjith George et.al.	2411.02188	null
2024-11-04	Touch-to-Touch Translation – Learning the Mapping Between Heterogeneous Tactile Sensing Technologies	Francesco Grella et.al.	2411.02187	null
2024-11-04	CleAR: Robust Context-Guided Generative Lighting Estimation for Mobile Augmented Reality	Yiqin Zhao et.al.	2411.02179	null
2024-11-04	CryptoEL: A Novel Experiential Learning Tool for Enhancing K-12 Cryptography Education	Pranathi Rayavaram et.al.	2411.02143	null
2024-10-31	Bridging Geometric States via Geometric Diffusion Bridge	Shengjie Luo et.al.	2410.24220	null
2024-10-31	Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning	Penghui Ruan et.al.	2410.24219	link
2024-10-31	DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion	Weicai Ye et.al.	2410.24203	link
2024-10-31	Multi-Attribute Linguistic Tuning for Controlled Paraphrase Generation	Mohamed Elgaar et.al.	2410.24199	null
2024-10-31	Generative modelling for mass-mapping with fast uncertainty quantification	Jessica J. Whitney et.al.	2410.24197	link
2024-10-31	AR-Pro: Counterfactual Explanations for Anomaly Repair with Formal Properties	Xiayan Ji et.al.	2410.24178	link
2024-10-31	Redefining in Dictionary: Towards a Enhanced Semantic Understanding of Creative Generation	Fu Feng et.al.	2410.24160	null
2024-10-31	Scaling Concept With Text-Guided Diffusion Models	Chao Huang et.al.	2410.24151	null
2024-10-31	Repository-Level Compositional Code Translation and Validation	Ali Reza Ibrahimzada et.al.	2410.24117	link
2024-10-31	Extended electrochemical monitoring of biomolecular binding using commercially available, reusable electrodes in microliter volumes	Jeremy Mendez et.al.	2410.24110	null
2024-10-31	Sparsh: Self-supervised touch representations for vision-based tactile sensing	Carolina Higuera et.al.	2410.24090	null
2024-10-31	Understanding Generalizability of Diffusion Models Requires Rethinking the Hidden Gaussian Structure	Xiang Li et.al.	2410.24060	link
2024-10-31	TPC: Test-time Procrustes Calibration for Diffusion-based Human Image Animation	Sunjae Yoon et.al.	2410.24037	null
2024-10-31	Unveiling Synthetic Faces: How Synthetic Datasets Can Expose Real Identities	Hatef Otroshi Shahreza et.al.	2410.24015	null
2024-10-31	DiffPAD: Denoising Diffusion-based Adversarial Patch Decontamination	Jia Fu et.al.	2410.24006	link
2024-10-30	ReferEverything: Towards Segmenting Everything We Can Speak of in Videos	Anurag Bagchi et.al.	2410.23287	null
2024-10-30	Provable acceleration for diffusion models under minimal assumptions	Gen Li et.al.	2410.23285	null
2024-10-30	RelationBooth: Towards Relation-Aware Customized Object Generation	Qingyu Shi et.al.	2410.23280	null
2024-10-30	SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation	Yining Hong et.al.	2410.23277	null
2024-10-30	Multi-student Diffusion Distillation for Better One-step Generators	Yanke Song et.al.	2410.23274	null
2024-10-30	ReaWristic: Remote Touch Sensation to Fingers from a Wristband via Visually Augmented Electro-Tactile Feedback	Yudai Tanaka et.al.	2410.23193	null
2024-10-30	Real-Time Personalization for LLM-based Recommendation with Customized In-Context Learning	Keqin Bao et.al.	2410.23136	link
2024-10-30	Educating for Hardware Specialization in the Chiplet Era: A Path for the HPC Community	Kazutomo Yoshii et.al.	2410.23127	null
2024-10-30	CausalDiff: Causality-Inspired Disentanglement via Diffusion Model for Adversarial Defense	Mingkun Zhang et.al.	2410.23091	link
2024-10-30	General Bayesian quantile regression for counts via generative modeling	Yuta Yamauchi et.al.	2410.23081	null
2024-10-30	Controlling Language and Diffusion Models by Transporting Activations	Pau Rodriguez et.al.	2410.23054	link
2024-10-30	Dispersion kinks from electronic correlations in an unconventional iron-based superconductor	Ming-Hua Chang et.al.	2410.23044	null
2024-10-30	Improving Musical Accompaniment Co-creation via Diffusion Transformers	Javier Nistal et.al.	2410.23005	null
2024-10-30	DexGraspNet 2.0: Learning Generative Dexterous Grasping in Large-scale Synthetic Cluttered Scenes	Jialiang Zhang et.al.	2410.23004	null
2024-10-30	LumiSculpt: A Consistency Lighting Control Network for Video Generation	Yuxin Zhang et.al.	2410.22979	null
2024-10-29	CaStL: Constraints as Specifications through LLM Translation for Long-Horizon Task and Motion Planning	Weihang Guo et.al.	2410.22225	null
2024-10-29	A Gaussian Process Generative Model for QCD Equation of State	Jiaxuan Gong et.al.	2410.22160	null
2024-10-29	Capacity Control is an Effective Memorization Mitigation Mechanism in Text-Conditional Diffusion Models	Raman Dutt et.al.	2410.22149	link
2024-10-29	AmpleGCG-Plus: A Strong Generative Model of Adversarial Suffixes to Jailbreak LLMs with Higher Success Rates in Fewer Attempts	Vishal Kumar et.al.	2410.22143	null
2024-10-29	Infrared photometry with InGaAs detectors: First light with SPECULOOS	Peter P. Pedersen et.al.	2410.22140	link
2024-10-29	SimRec: Mitigating the Cold-Start Problem in Sequential Recommendation by Integrating Item Similarity	Shaked Brody et.al.	2410.22136	link
2024-10-29	Protecting Privacy in Multimodal Large Language Models with MLLMU-Bench	Zheyuan Liu et.al.	2410.22108	link
2024-10-29	Variational inference for pile-up removal at hadron colliders with diffusion models	Malte Algren et.al.	2410.22074	null
2024-10-29	PACA: Perspective-Aware Cross-Attention Representation for Zero-Shot Scene Rearrangement	Shutong Jin et.al.	2410.22059	null
2024-10-29	Dual Conditional Diffusion Models for Sequential Recommendation	Hongtao Huang et.al.	2410.21967	null
2024-10-29	PrefPaint: Aligning Image Inpainting Diffusion Model with Human Preference	Kendong Liu et.al.	2410.21966	null
2024-10-29	CT to PET Translation: A Large-scale Dataset and Domain-Knowledge-Guided Diffusion Approach	Dac Thai Nguyen et.al.	2410.21932	link
2024-10-29	Guided Diffusion-based Counterfactual Augmentation for Robust Session-based Recommendation	Muskan Gupta et.al.	2410.21892	null
2024-10-29	On the study of the limit cycles for a class of population models with time-varying factors	Renhao Tian et.al.	2410.21848	null
2024-10-29	Diffusion as Reasoning: Enhancing Object Goal Navigation with LLM-Biased Diffusion Model	Yiming Ji et.al.	2410.21842	null
2024-10-28	On Inductive Biases That Enable Generalization of Diffusion Transformers	Jie An et.al.	2410.21273	link
2024-10-28	EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation	Shih-Yang Liu et.al.	2410.21271	null
2024-10-28	LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior	Hanyu Wang et.al.	2410.21264	null
2024-10-28	One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation	Zhendong Wang et.al.	2410.21257	null
2024-10-28	On learning higher-order cumulants in diffusion models	Gert Aarts et.al.	2410.21212	null
2024-10-28	The VSPEC Collection: A suite of utilities to model spectroscopic phase curves of 3D exoplanet atmospheres in the presence of stellar variability	Ted M Johnson et.al.	2410.21190	null
2024-10-28	Trajectory Flow Matching with Applications to Clinical Time Series Modeling	Xi Zhang et.al.	2410.21154	link
2024-10-28	Synthetica: Large Scale Synthetic Data for Robot Perception	Ritvik Singh et.al.	2410.21153	null
2024-10-28	Extrapolating Prospective Glaucoma Fundus Images through Diffusion Model in Irregular Longitudinal Sequences	Zhihao Zhao et.al.	2410.21130	null
2024-10-28	Shallow Diffuse: Robust and Invisible Watermarking through Low-Dimensional Subspaces in Diffusion Models	Wenda Li et.al.	2410.21088	link
2024-10-28	Federated Time Series Generation on Feature and Temporally Misaligned Data	Chenrui Fan et.al.	2410.21072	null
2024-10-28	Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative Framework	Vladimir Arkhipkin et.al.	2410.21061	link
2024-10-28	Beyond Autoregression: Fast LLMs via Self-Distillation Through Time	Justin Deschenaux et.al.	2410.21035	link
2024-10-29	EEG-Driven 3D Object Reconstruction with Color Consistency and Diffusion Prior	Xin Xiang et.al.	2410.20981	null
2024-10-28	MovieCharacter: A Tuning-Free Framework for Controllable Character Video Synthesis	Di Qiu et.al.	2410.20974	null
2024-10-25	Model merging with SVD to tie the Knots	George Stoica et.al.	2410.19735	link
2024-10-25	Adversarial Environment Design via Regret-Guided Diffusion Models	Hojun Chung et.al.	2410.19715	null
2024-10-25	Perception, Control and Hardware for In-Hand Slip-Aware Object Manipulation with Parallel Grippers	Gabriel Arslan Waltersson et.al.	2410.19660	null
2024-10-25	DiffGS: Functional Gaussian Splatting Diffusion	Junsheng Zhou et.al.	2410.19657	null
2024-10-25	VARS: Vision-based Assessment of Risk in Security Systems	Pranav Gupta et.al.	2410.19642	null
2024-10-25	Diffusion models for lattice gauge field simulations	Qianteng Zhu et.al.	2410.19602	null
2024-10-25	Energy Efficient Dual Designs of FeFET-Based Analog In-Memory Computing with Inherent Shift-Add Capability	Zeyu Yang et.al.	2410.19593	null
2024-10-25	Hybrid Memetic Search for Electric Vehicle Routing with Time Windows, Simultaneous Pickup-Delivery, and Partial Recharges	Zubin Zheng et.al.	2410.19580	null
2024-10-25	Utilizing Image Transforms and Diffusion Models for Generative Modeling of Short and Long Time Series	Ilan Naiman et.al.	2410.19538	null
2024-10-25	Ensemble Data Assimilation for Particle-based Methods	Marius Duvillard et.al.	2410.19525	null
2024-10-25	Marked Temporal Bayesian Flow Point Processes	Hui Chen et.al.	2410.19512	null
2024-10-25	EDGE: Enhanced Grounded GUI Understanding with Enriched Multi-Granularity Synthetic Data	Xuetian Chen et.al.	2410.19461	null
2024-10-28	NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction	Zixuan Gong et.al.	2410.19452	link
2024-10-25	Learned Reference-based Diffusion Sampling for multi-modal distributions	Maxence Noble et.al.	2410.19449	null
2024-10-25	Generative Diffusion Models for Sequential Recommendations	Sharare Zolghadr et.al.	2410.19429	null
2024-10-24	Framer: Interactive Frame Interpolation	Wen Wang et.al.	2410.18978	null
2024-10-24	MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms	Ling-Hao Chen et.al.	2410.18977	null
2024-10-24	Unbounded: A Generative Infinite Game of Character Life Simulation	Jialu Li et.al.	2410.18975	null
2024-10-24	3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation	Hansheng Chen et.al.	2410.18974	link
2024-10-24	On the Crucial Role of Initialization for Matrix Factorization	Bingcong Li et.al.	2410.18965	null
2024-10-24	Stable Consistency Tuning: Understanding and Improving Consistency Models	Fu-Yun Wang et.al.	2410.18958	link
2024-10-24	Generation of synthetic financial time series by diffusion models	Tomonori Takahashi et.al.	2410.18897	null
2024-10-24	Diff-Instruct++: Training One-step Text-to-image Generator Model to Align with Human Preferences	Weijian Luo et.al.	2410.18881	null
2024-10-24	The Cat and Mouse Game: The Ongoing Arms Race Between Diffusion Models and Detection Methods	Linda Laurier et.al.	2410.18866	null
2024-10-24	From Efficiency to Equity: Measuring Fairness in Preference Learning	Shreeyash Gowaikar et.al.	2410.18841	null
2024-10-24	From English-Centric to Effective Bilingual: LLMs with Custom Tokenizers for Underrepresented Languages	Artur Kiulian et.al.	2410.18836	null
2024-10-24	Multi-Scale Diffusion: Enhancing Spatial Layout in High-Resolution Panoramic Image Generation	Xiaoyu Zhang et.al.	2410.18830	null
2024-10-24	Towards Visual Text Design Transfer Across Languages	Yejin Choi et.al.	2410.18823	null
2024-10-24	Fast constrained sampling in pre-trained diffusion models	Alexandros Graikos et.al.	2410.18804	null
2024-10-24	Large Generative AI Models meet Open Networks for 6G: Integration, Platform, and Monetization	Peizheng Li et.al.	2410.18790	null
2024-10-23	DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes	Hengwei Bian et.al.	2410.18084	null
2024-10-23	Prioritized Generative Replay	Renhao Wang et.al.	2410.18082	null
2024-10-23	WorldSimBench: Towards Video Generation Models as World Simulators	Yiran Qin et.al.	2410.18072	null
2024-10-23	TP-Eval: Tap Multimodal LLMs’ Potential in Evaluation by Customizing Prompts	Yuxuan Xie et.al.	2410.18071	null
2024-10-23	Training Free Guided Flow Matching with Optimal Control	Luran Wang et.al.	2410.18070	null
2024-10-23	Spectrally shaped THz pulses from tapered dielectric waveguides	Karel Peetermans et.al.	2410.17975	null
2024-10-23	Optical Generative Models	Shiqi Chen et.al.	2410.17970	null
2024-10-23	A Wavelet Diffusion GAN for Image Super-Resolution	Lorenzo Aloisi et.al.	2410.17966	null
2024-10-23	Addressing Asynchronicity in Clinical Multimodal Fusion via Individualized Chest X-ray Generation	Wenfang Yao et.al.	2410.17918	link
2024-10-23	regAL: Python Package for Active Learning of Regression Problems	Elizaveta Surzhikova et.al.	2410.17917	null
2024-10-23	Scaling Diffusion Language Models via Adaptation from Autoregressive Models	Shansan Gong et.al.	2410.17891	link
2024-10-23	Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech	Danilo de Oliveira et.al.	2410.17834	null
2024-10-23	PGDiffSeg: Prior-Guided Denoising Diffusion Model with Parameter-Shared Attention for Breast Cancer Segmentation	Feiyan Feng et.al.	2410.17812	null
2024-10-23	GenUDC: High Quality 3D Mesh Generation with Unsigned Dual Contouring Representation	Ruowei Wang et.al.	2410.17802	link
2024-10-23	Regularized autoregressive modeling and its application to audio signal declipping	Ondřej Mokrý et.al.	2410.17790	link
2024-10-22	Large Language Models Empowered Personalized Web Agents	Hongru Cai et.al.	2410.17236	null
2024-10-22	Creativity in AI: Progresses and Challenges	Mete Ismayilzada et.al.	2410.17218	link
2024-10-22	Audio-to-Score Conversion Model Based on Whisper methodology	Hongyao Zhang et.al.	2410.17209	null
2024-10-22	Reinforcement learning on structure-conditioned categorical diffusion for protein inverse folding	Yasha Ektefaie et.al.	2410.17173	link
2024-10-22	Performance of the CMS high-level trigger during LHC Run 2	CMS Collaboration et.al.	2410.17038	null
2024-10-22	Hybrid Generative AI for De Novo Design of Co-Crystals with Enhanced Tabletability	Nina Gubina et.al.	2410.17005	link
2024-10-22	DiP-GO: A Diffusion Pruner via Few-step Gradient Optimization	Haowei Zhu et.al.	2410.16942	null
2024-10-22	Hierarchical Clustering for Conditional Diffusion in Image Generation	Jorge da Silva Goncalves et.al.	2410.16910	link
2024-10-22	Bayes without Underfitting: Fully Correlated Deep Learning Posteriors via Alternating Projections	Marco Miani et.al.	2410.16901	null
2024-10-22	VistaDream: Sampling multiview consistent images for single-view scene reconstruction	Haiping Wang et.al.	2410.16892	null
2024-10-22	CK4Gen: A Knowledge Distillation Framework for Generating High-Utility Synthetic Survival Datasets in Healthcare	Nicholas I-Hsien Kuo et.al.	2410.16872	null
2024-10-22	MPDS: A Movie Posters Dataset for Image Generation with Diffusion Model	Meng Xu et.al.	2410.16840	null
2024-10-22	Bridging Search and Recommendation in Generative Retrieval: Does One Task Help the Other?	Gustavo Penha et.al.	2410.16823	null
2024-10-22	Evaluating the Effectiveness of Attack-Agnostic Features for Morphing Attack Detection	Laurent Colbois et.al.	2410.16802	link
2024-10-22	One-Step Diffusion Distillation through Score Implicit Matching	Weijian Luo et.al.	2410.16794	link
2024-10-21	MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors	Honghua Chen et.al.	2410.16272	null
2024-10-21	Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos	Gengshan Yang et.al.	2410.16259	null
2024-10-21	Distribution Learning with Valid Outputs Beyond the Worst-Case	Nick Rittler et.al.	2410.16253	null
2024-10-21	Building A Coding Assistant via the Retrieval-Augmented Language Model	Xinze Li et.al.	2410.16229	link
2024-10-21	CiteClick: A Browser Extension for Real-Time Scholar Citation Tracking	Nishat Raihan et.al.	2410.16211	null
2024-10-21	A Framework for Evaluating Predictive Models Using Synthetic Image Covariates and Longitudinal Data	Simon Deltadahl et.al.	2410.16177	null
2024-10-22	Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models	Giannis Daras et.al.	2410.16152	null
2024-10-21	Modelling Structured Data Learning with Restricted Boltzmann Machines in the Teacher-Student Setting	Robin Thériault et.al.	2410.16150	null
2024-10-21	SeaDAG: Semi-autoregressive Diffusion for Conditional Directed Acyclic Graph Generation	Xinyi Zhou et.al.	2410.16119	null
2024-10-21	Critical Example Mining for Vehicle Trajectory Prediction using Flow-based Generative Models	Zhezhang Ding et.al.	2410.16083	null
2024-10-21	Continuous Speech Synthesis using per-token Latent Diffusion	Arnon Turetzky et.al.	2410.16048	null
2024-10-21	Some generalizations of the convective model of jet generation	S. N. Artekha et.al.	2410.16035	null
2024-10-21	ComPO: Community Preferences for Language Model Personalization	Sachin Kumar et.al.	2410.16027	null
2024-10-21	Massimo: Public Queue Monitoring and Management using Mass-Spring Model	Abhijeet Kumar et.al.	2410.16012	null
2024-10-21	AI-Driven Innovations in Modern Cloud Computing	Animesh Kumar et.al.	2410.15960	null
2024-10-18	BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities	Shaozhe Hao et.al.	2410.14672	link
2024-10-18	How Does Data Diversity Shape the Weight Landscape of Neural Networks?	Yang Ba et.al.	2410.14602	null
2024-10-18	Bayesian Multi-wavelength Imaging of the LMC SN1987A with SRG/eROSITA	Vincent Eberle et.al.	2410.14599	null
2024-10-18	Neuro-Symbolic Traders: Assessing the Wisdom of AI Crowds in Markets	Namid R. Stillman et.al.	2410.14587	null
2024-10-18	Reimagining partial thickness keratoplasty: An eye mountable robot for autonomous big bubble needle insertion	Y. Wang et.al.	2410.14577	null
2024-10-18	Multi-modal Pose Diffuser: A Multimodal Generative Conditional Pose Prior	Calvin-Khang Ta et.al.	2410.14540	null
2024-10-18	Blockchain-Based Trust and Transparency in Airline Reservation Systems using Microservices Architecture	Biman Barua et.al.	2410.14518	null
2024-10-18	LEAD: Latent Realignment for Human Motion Diffusion	Nefeli Andreou et.al.	2410.14508	null
2024-10-18	Reinforcement Learning in Non-Markov Market-Making	Luca Lalor et.al.	2410.14504	null
2024-10-18	Data-driven topology design with persistent homology for enhancing population diversity	Taisei Kii et.al.	2410.14496	null
2024-10-18	ANT: Adaptive Noise Schedule for Time Series Diffusion Models	Seunghan Lee et.al.	2410.14488	link
2024-10-21	CaTs and DAGs: Integrating Directed Acyclic Graphs with Transformers and Fully-Connected Neural Networks for Causally Constrained Predictions	Matthew J. Vowels et.al.	2410.14485	link
2024-10-18	DRL Optimization Trajectory Generation via Wireless Network Intent-Guided Diffusion Models for Optimizing Resource Allocation	Junjie Wu et.al.	2410.14481	null
2024-10-18	Flow-based Sampling for Entanglement Entropy and the Machine Learning of Defects	Andrea Bulgarelli et.al.	2410.14466	null
2024-10-18	FashionR2R: Texture-preserving Rendered-to-Real Image Translation with Diffusion Models	Rui Hu et.al.	2410.14429	null
2024-10-17	Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens	Lijie Fan et.al.	2410.13863	null
2024-10-17	Diffusing States and Matching Scores: A New Framework for Imitation Learning	Runzhe Wu et.al.	2410.13855	link
2024-10-17	Influence Functions for Scalable Data Attribution in Diffusion Models	Bruno Mlodozeniec et.al.	2410.13850	null
2024-10-17	VidPanos: Generative Panoramic Videos from Casual Panning Videos	Jingwei Ma et.al.	2410.13832	null
2024-10-17	DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control	Yujie Wei et.al.	2410.13830	null
2024-10-17	Deep Generative Models Unveil Patterns in Medical Images Through Vision-Language Conditioning	Xiaodan Xing et.al.	2410.13823	link
2024-10-17	ConsisSR: Delving Deep into Consistency in Diffusion-based Image Super-Resolution	Junhao Gu et.al.	2410.13807	null
2024-10-17	Probing the Latent Hierarchical Structure of Data via Diffusion Models	Antonio Sclocchi et.al.	2410.13770	null
2024-10-17	Theory on Score-Mismatched Diffusion Models and Zero-Shot Conditional Samplers	Yuchen Liang et.al.	2410.13746	null
2024-10-17	Improved Convergence Rate for Diffusion Probabilistic Models	Gen Li et.al.	2410.13738	null
2024-10-17	Optimizing Probabilistic Conformal Prediction with Vectorized Non-Conformity Scores	Minxing Zheng et.al.	2410.13735	null
2024-10-18	DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation	Hanbo Cheng et.al.	2410.13726	link
2024-10-17	Movie Gen: A Cast of Media Foundation Models	Adam Polyak et.al.	2410.13720	link
2024-10-18	Diffusion Curriculum: Synthetic-to-Real Generative Curriculum Learning via Image-Guided Diffusion	Yijun Liang et.al.	2410.13674	link
2024-10-17	Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design	Chenyu Wang et.al.	2410.13643	link
2024-10-16	Geometry-Aware Generative Autoencoders for Warped Riemannian Metric Learning and Generative Modeling on Data Manifolds	Xingzhi Sun et.al.	2410.12779	null
2024-10-16	Meta-Unlearning on Diffusion Models: Preventing Relearning Unlearned Concepts	Hongcheng Gao et.al.	2410.12777	link
2024-10-16	SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation	Jaehong Yoon et.al.	2410.12761	null
2024-10-16	Signature of Vertical Mixing in Hydrogen-dominated Exoplanet Atmospheres	Vikas Soni et.al.	2410.12737	null
2024-10-16	Counterfactual Generative Modeling with Variational Causal Inference	Yulun Wu et.al.	2410.12730	link
2024-10-16	FusionLLM: A Decentralized LLM Training System on Geo-distributed GPUs with Adaptive Compression	Zhenheng Tang et.al.	2410.12707	null
2024-10-16	Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization	Xingqi Wang et.al.	2410.12700	link
2024-10-16	AdaptiveDrag: Semantic-Driven Dragging on Diffusion-Based Image Editing	DuoSheng Chen et.al.	2410.12696	link
2024-10-16	3DIS: Depth-Driven Decoupled Instance Synthesis for Text-to-Image Generation	Dewei Zhou et.al.	2410.12669	link
2024-10-16	Towards Designing Scalable Quantum-Enhanced Generative Networks for Neutrino Physics Experiments with Liquid Argon Time Projection Chambers	Andrea Delgado et.al.	2410.12650	null
2024-10-16	A Robo-Advisor System: expected utility modeling via pairwise comparisons	Bo Chen et.al.	2410.12570	null
2024-10-16	One Step Diffusion via Shortcut Models	Kevin Frans et.al.	2410.12557	link
2024-10-16	Disentangling data distribution for Federated Learning	Xinyuan Zhao et.al.	2410.12530	null
2024-10-16	Shaping a Stabilized Video by Mitigating Unintended Changes for Concept-Augmented Video Editing	Mingce Guo et.al.	2410.12526	null
2024-10-16	MING: A Functional Approach to Learning Molecular Generative Models	Van Khoa Nguyen et.al.	2410.12522	null
2024-10-15	High-Resolution Frame Interpolation with Patch-based Cascaded Diffusion	Junhwa Hur et.al.	2410.11838	null
2024-10-15	On the Effectiveness of Dataset Alignment for Fake Image Detection	Anirudh Sundara Rajan et.al.	2410.11835	null
2024-10-15	Bayesian Experimental Design via Contrastive Diffusions	Jacopo Iollo et.al.	2410.11826	link
2024-10-15	KITTEN: A Knowledge-Intensive Evaluation of Image Generation on Visual Entities	Hsin-Ping Huang et.al.	2410.11824	null
2024-10-15	Improving Long-Text Alignment for Text-to-Image Diffusion Models	Luping Liu et.al.	2410.11817	link
2024-10-15	SGEdit: Bridging LLM with Text2Image Generative Model for Scene Graph-based Image Editing	Zhiyuan Zhang et.al.	2410.11815	null
2024-10-16	Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices	Zhiyuan Ma et.al.	2410.11795	null
2024-10-15	G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks	Guibin Zhang et.al.	2410.11782	null
2024-10-15	Technical Report of 1:10 Scale Autonomous Vehicle Robot	Amirhossein Kheiri Holighi et.al.	2410.11746	null
2024-10-15	Probabilistic Principles for Biophysics and Neuroscience: Entropy Production, Bayesian Mechanics & the Free-Energy Principle	Lancelot Da Costa et.al.	2410.11735	null
2024-10-15	Patch-Based Diffusion Models Beat Whole-Image Models for Mismatched Distribution Inverse Problems	Jason Hu et.al.	2410.11730	null
2024-10-15	Parameter estimation of structural dynamics with neural operators enabled surrogate modeling	Mingyuan Zhou et.al.	2410.11712	null
2024-10-15	Findings of the WMT 2024 Shared Task on Chat Translation	Wafaa Mohammed et.al.	2410.11624	null
2024-10-15	DeformPAM: Data-Efficient Learning for Long-horizon Deformable Object Manipulation via Preference-based Action Alignment	Wendi Chen et.al.	2410.11584	link
2024-10-15	A Data-Driven Aggressive Autonomous Racing Framework Utilizing Local Trajectory Planning with Velocity Prediction	Zhouheng Li et.al.	2410.11570	link
2024-10-14	Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models	Jingzhi Bao et.al.	2410.10821	link
2024-10-15	TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models	Mu Cai et.al.	2410.10818	link
2024-10-14	LVD-2M: A Long-take Video Dataset with Temporally Dense Captions	Tianwei Xiong et.al.	2410.10816	link
2024-10-14	Depth Any Video with Scalable Synthetic Data	Honghui Yang et.al.	2410.10815	link
2024-10-14	HART: Efficient Visual Generation with Hybrid Autoregressive Transformer	Haotian Tang et.al.	2410.10812	link
2024-10-14	TrajDiffuse: A Conditional Diffusion Model for Environment-Aware Trajectory Prediction	Qingze et.al.	2410.10804	link
2024-10-14	Boosting Camera Motion Control for Video Diffusion Transformers	Soon Yau Cheong et.al.	2410.10802	null
2024-10-14	Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations	Litu Rout et.al.	2410.10792	null
2024-10-14	ControlMM: Controllable Masked Motion Generation	Ekkasit Pinyoanuntapong et.al.	2410.10780	null
2024-10-14	Adaptive Diffusion Terrain Generator for Autonomous Uneven Terrain Navigation	Youwei Yu et.al.	2410.10766	link
2024-10-14	DragEntity: Trajectory Guided Video Generation using Entity and Positional Relationships	Zhang Wan et.al.	2410.10751	null
2024-10-14	CosForce: A Force-Based General Model for Simulating Pedestrian Anticipation and Reaction Mechanisms	Jinghui Wang et.al.	2410.10746	null
2024-10-14	FlexGen: Flexible Multi-View Generation from Text and Image Inputs	Xinli Xu et.al.	2410.10745	null
2024-10-14	Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models	Junyu Chen et.al.	2410.10733	link
2024-10-14	Large Language Models Are Active Critics in NLG Evaluation	Shuying Xu et.al.	2410.10724	null
2024-10-11	SceneCraft: Layout-Guided 3D Scene Generation	Xiuyu Yang et.al.	2410.09049	link
2024-10-11	Linear Convergence of Diffusion Models Under the Manifold Hypothesis	Peter Potaptchik et.al.	2410.09046	null
2024-10-11	PEAR: A Robust and Flexible Automation Framework for Ptychography Enabled by Multiple Large Language Model Agents	Xiangyu Yin et.al.	2410.09034	link
2024-10-11	Semantic Score Distillation Sampling for Compositional Text-to-3D Generation	Ling Yang et.al.	2410.09009	link
2024-10-11	WaveDiffusion: Exploring Full Waveform Inversion via Joint Diffusion in the Latent Space	Hanchen Wang et.al.	2410.09002	null
2024-10-11	Maximizing the Potential of Synthetic Data: Insights from Random Matrix Theory	Aymane El Firdoussi et.al.	2410.08942	null
2024-10-11	DiffPO: A causal diffusion model for learning distributions of potential outcomes	Yuchen Ma et.al.	2410.08924	null
2024-10-11	An End-to-End Deep Learning Method for Solving Nonlocal Allen-Cahn and Cahn-Hilliard Phase-Field Models	Yuwei Geng et.al.	2410.08914	null
2024-10-11	Conditional Generative Models for Contrast-Enhanced Synthesis of T1w and T1 Maps in Brain MRI	Moritz Piening et.al.	2410.08894	link
2024-10-11	MATCH: Model-Aware TVM-based Compilation for Heterogeneous Edge Devices	Mohamed Amine Hamdi et.al.	2410.08855	link
2024-10-14	LIME-Eval: Rethinking Low-light Image Enhancement Evaluation via Object Detection	Mingjia Li et.al.	2410.08810	link
2024-10-11	Bad Neighbors: On Understanding VPN Provider Networks	Teemu Rytilahti et.al.	2410.08737	link
2024-10-11	5G as Enabler for Industrie 4.0 Use Cases: Challenges and Concepts	M. Gundall et.al.	2410.08726	null
2024-10-11	Investigating Human-Computer Interaction and Visual Comprehension in Text Generation Process of Natural Language Generation Models	Yunchao Wang et.al.	2410.08723	null
2024-10-11	Impact of Surface Reflections in Maritime Obstacle Detection	Samed Yalçın et.al.	2410.08713	link
2024-10-10	LatteCLIP: Unsupervised CLIP Fine-Tuning via LMM-Synthetic Texts	Anh-Quan Cao et.al.	2410.08211	null
2024-10-10	DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models	Xiaoxiao He et.al.	2410.08207	null
2024-10-10	HybridBooth: Hybrid Prompt Inversion for Efficient Subject-Driven Generation	Shanyan Guan et.al.	2410.08192	null
2024-10-10	DifFRelight: Diffusion-Based Facial Performance Relighting	Mingming He et.al.	2410.08188	null
2024-10-10	RGM: Reconstructing High-fidelity 3D Car Assets with Relightable 3D-GS Generative Model from a Single Image	Xiaoxue Chen et.al.	2410.08181	null
2024-10-10	ZeroComp: Zero-shot Object Compositing from Image Intrinsics via Diffusion	Zitian Zhang et.al.	2410.08168	link
2024-10-10	DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation	Jiatao Gu et.al.	2410.08159	null
2024-10-10	Progressive Autoregressive Video Diffusion Models	Desai Xie et.al.	2410.08151	link
2024-10-10	Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction	Jarrid Rector-Brooks et.al.	2410.08134	null
2024-10-10	Robust AI-Generated Text Detection by Restricted Embeddings	Kristian Kuznetsov et.al.	2410.08113	link
2024-10-10	LiPO: LiDAR Inertial Odometry for ICP Comparison	Darwin Mick et.al.	2410.08097	null
2024-10-10	Unstable Unlearning: The Hidden Risk of Concept Resurgence in Diffusion Models	Vinith M. Suriyakumar et.al.	2410.08074	null
2024-10-10	Reversible Decoupling Network for Single Image Reflection Removal	Hao Zhao et.al.	2410.08063	link
2024-10-10	A Target-Aware Analysis of Data Augmentation for Hate Speech Detection	Camilla Casula et.al.	2410.08053	null
2024-10-10	LADIMO: Face Morph Generation through Biometric Template Inversion with Latent Diffusion	Marcel Grimmer et.al.	2410.07988	link
2024-10-09	IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation	Xinchen Zhang et.al.	2410.07171	link
2024-10-09	Sylber: Syllabic Embedding Representation of Speech from Raw Audio	Cheol Jun Cho et.al.	2410.07168	link
2024-10-09	AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation	Yukang Cao et.al.	2410.07164	null
2024-10-09	InstructG2I: Synthesizing Images from Multimodal Attributed Graphs	Bowen Jin et.al.	2410.07157	link
2024-10-09	Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis	Bohan Zeng et.al.	2410.07155	link
2024-10-10	EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models	Rui Zhao et.al.	2410.07133	link
2024-10-09	Personalized Visual Instruction Tuning	Renjie Pi et.al.	2410.07113	link
2024-10-09	A Gentle Introduction and Tutorial on Deep Generative Models in Transportation Research	Seongjin Choi et.al.	2410.07066	link
2024-10-09	Efficient Distribution Matching of Representations via Noise-Injected Deep InfoMax	Ivan Butakov et.al.	2410.06993	null
2024-10-09	Diffusion Density Estimators	Akhil Premkumar et.al.	2410.06986	null
2024-10-09	Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control	Shimon Vainer et.al.	2410.06985	null
2024-10-09	Structure-Centric Robust Monocular Depth Estimation via Knowledge Distillation	Runze Chen et.al.	2410.06982	null
2024-10-09	Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think	Sihyun Yu et.al.	2410.06940	link
2024-10-09	VEC-Sim: A Simulation Platform for Evaluating Service Caching and Computation Offloading Policies in Vehicular Edge Networks	Fan Wu et.al.	2410.06934	null
2024-10-09	Generative Model for Less-Resourced Language with 1 billion parameters	Domen Vreš et.al.	2410.06898	null
2024-10-07	DART: A Diffusion-Based Autoregressive Motion Model for Real-Time Text-Driven Motion Control	Kaifeng Zhao et.al.	2410.05260	null
2024-10-07	GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting	Yukang Cao et.al.	2410.05259	null
2024-10-07	SePPO: Semi-Policy Preference Optimization for Diffusion Alignment	Daoan Zhang et.al.	2410.05255	link
2024-10-07	DiffuseReg: Denoising Diffusion Model for Obtaining Deformation Fields in Unsupervised Deformable Image Registration	Yongtai Zhuo et.al.	2410.05234	link
2024-10-07	Density estimation with LLMs: a geometric investigation of in-context learning trajectories	Toni J. B. Liu et.al.	2410.05218	null
2024-10-07	Avoiding Deadlocks via Weak Deadlock Sets	Gianpaolo Oriolo et.al.	2410.05175	null
2024-10-07	Presto! Distilling Steps and Layers for Accelerating Music Generation	Zachary Novack et.al.	2410.05167	null
2024-10-08	A Simulation-Free Deep Learning Approach to Stochastic Optimal Control	Mengjian Hua et.al.	2410.05163	null
2024-10-07	Smart Jamming Attack and Mitigation on Deep Transfer Reinforcement Learning Enabled Resource Allocation for Network Slicing	Shavbo Salehi et.al.	2410.05153	null
2024-10-07	Leveraging Multimodal Diffusion Models to Accelerate Imaging with Side Information	Timofey Efimov et.al.	2410.05143	null
2024-10-07	Agnostic Smoothed Online Learning	Moïse Blanchard et.al.	2410.05124	null
2024-10-07	Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning	Ayano Hiranaka et.al.	2410.05116	null
2024-10-07	Synthetic Generation of Dermatoscopic Images with GAN and Closed-Form Factorization	Rohan Reddy Mekala et.al.	2410.05114	null
2024-10-07	Hyper-Representations: Learning from Populations of Neural Networks	Konstantin Schürholt et.al.	2410.05107	link
2024-10-07	DreamSat: Towards a General 3D Model for Novel View Synthesis of Space Objects	Nidhi Mathihalli et.al.	2410.05097	link
2024-10-04	Estimating Body and Hand Motion in an Ego-sensed World	Brent Yi et.al.	2410.03665	null
2024-10-04	Enhance Reasoning by Learning from Mistakes: Peer-Review Knowledge Distillation from Multiple Large Language Models	Zhuochun Li et.al.	2410.03663	link
2024-10-04	Geometric Representation Condition Improves Equivariant Molecule Generation	Zian Li et.al.	2410.03655	null
2024-10-04	Aligning LLMs with Individual Preferences via Interaction	Shujin Wu et.al.	2410.03642	link
2024-10-04	Real-World Benchmarks Make Membership Inference Attacks Fail on Diffusion Models	Chumeng Liang et.al.	2410.03640	link
2024-10-04	Conditional Enzyme Generation Using Protein Language Models with Adapters	Jason Yang et.al.	2410.03634	null
2024-10-04	How Discrete and Continuous Diffusion Meet: Comprehensive Analysis of Discrete Diffusion Models via a Stochastic Integral Framework	Yinuo Ren et.al.	2410.03601	null
2024-10-04	Teaching Transformers Modular Arithmetic at Scale	Eshika Saxena et.al.	2410.03569	null
2024-10-04	Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features	Benyuan Meng et.al.	2410.03558	link
2024-10-04	Loading Ceramics: Visualising Possibilities of Robotics in Ceramics	Varvara Guljajeva et.al.	2410.03550	null
2024-10-04	NRGBoost: Energy-Based Generative Boosted Trees	João Bravo et.al.	2410.03535	link
2024-10-04	Generative Artificial Intelligence for Navigating Synthesizable Chemical Space	Wenhao Gao et.al.	2410.03494	link
2024-10-04	SeBS-Flow: Benchmarking Serverless Cloud Function Workflows	Larissa Schmid et.al.	2410.03480	null
2024-10-04	Formalizing MLTL Formula Progression in Isabelle/HOL	Katherine Kosaian et.al.	2410.03465	null
2024-10-04	Diffusion State-Guided Projected Gradient for Inverse Problems	Rayhan Zirvi et.al.	2410.03463	link
2024-10-03	SIEVE: General Purpose Data Filtering System Matching GPT-4o Accuracy at 1% the Cost	Jifan Zhang et.al.	2410.02755	null
2024-10-03	CriSPO: Multi-Aspect Critique-Suggestion-guided Automatic Prompt Optimization for Text Generation	Han He et.al.	2410.02748	link
2024-10-03	Salient Information Prompting to Steer Content in Prompt-based Abstractive Summarization	Lei Xu et.al.	2410.02741	link
2024-10-03	Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models	Zhengfeng Lai et.al.	2410.02740	null
2024-10-03	Custom Non-Linear Model Predictive Control for Obstacle Avoidance in Indoor and Outdoor Environments	Lara Laban et.al.	2410.02732	link
2024-10-03	A Photonic Parameter-shift Rule: Enabling Gradient Computation for Photonic Quantum Computers	Axel Pappalardo et.al.	2410.02726	null
2024-10-03	AlzhiNet: Traversing from 2DCNN to 3DCNN, Towards Early Detection and Diagnosis of Alzheimer’s Disease	Romoke Grace Akindele et.al.	2410.02714	null
2024-10-03	SteerDiff: Steering towards Safe Text-to-Image Diffusion Models	Hongxiang Zhang et.al.	2410.02710	null
2024-10-03	ControlAR: Controllable Image Generation with Autoregressive Models	Zongming Li et.al.	2410.02705	link
2024-10-03	User-centric Immersive Communications in 6G: A Data-oriented Approach via Digital Twin	Conghao Zhou et.al.	2410.02688	null
2024-10-03	GUD: Generation with Unified Diffusion	Mathis Gerdes et.al.	2410.02667	null
2024-10-03	Grounded Answers for Multi-agent Decision-making Problem through Generative World Model	Zeyang Liu et.al.	2410.02664	null
2024-10-03	Scalable Simulation-free Entropic Unbalanced Optimal Transport	Jaemoo Choi et.al.	2410.02656	null
2024-10-03	Measuring and Improving Persuasiveness of Generative Models	Somesh Singh et.al.	2410.02653	null
2024-10-03	Efficient calibration of the shifted square-root diffusion model to credit default swap spreads using asymptotic approximations	Ankush Agarwal et.al.	2410.02645	null
2024-10-02	FabricDiffusion: High-Fidelity Texture Transfer for 3D Garments Generation from In-The-Wild Clothing Images	Cheng Zhang et.al.	2410.01801	null
2024-10-02	Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space	Yangming Li et.al.	2410.01796	null
2024-10-02	Dynamical-generative downscaling of climate model ensembles	Ignacio Lopez-Gomez et.al.	2410.01776	null
2024-10-02	Towards deep learning sequence-structure co-generation for protein design	Chentong Wang et.al.	2410.01773	null
2024-10-02	ImageFolder: Autoregressive Image Generation with Folded Tokens	Xiang Li et.al.	2410.01756	link
2024-10-02	AssessITS: Integrating procedural guidelines and practical evaluation metrics for organizational IT and Cybersecurity risk assessment	Mir Mehedi Rahman et.al.	2410.01750	null
2024-10-02	VitaGlyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion Models	Kailai Feng et.al.	2410.01738	link
2024-10-02	HarmoniCa: Harmonizing Training and Inference for Better Feature Cache in Diffusion Transformer Acceleration	Yushi Huang et.al.	2410.01723	link
2024-10-02	Towards a Theoretical Understanding of Synthetic Data in LLM Post-Training: A Reverse-Bottleneck Perspective	Zeyu Gan et.al.	2410.01720	link
2024-10-02	COMUNI: Decomposing Common and Unique Video Signals for Diffusion-based Video Generation	Mingzhen Sun et.al.	2410.01718	null
2024-10-02	A Mathematics-Inspired Learning-to-Optimize Framework for Decentralized Optimization	Yutong He et.al.	2410.01700	null
2024-10-02	Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding	Yao Teng et.al.	2410.01699	link
2024-10-02	Lossy Semantic Communication for the Logical Deduction of the State of the World	Ahmet Faruk Saz et.al.	2410.01676	link
2024-10-02	Conformal Generative Modeling with Improved Sample Efficiency through Sequential Greedy Filtering	Klaus-Rudolf Kladny et.al.	2410.01660	null
2024-10-02	On The Adaptation of Unlimiformer for Decoder-Only Transformers	Kian Ahrabian et.al.	2410.01637	null
2024-09-30	SpaceMesh: A Continuous Representation for Learning Manifold Surface Meshes	Tianchang Shen et.al.	2409.20562	null
2024-09-30	Annealing Flow Generative Model Towards Sampling High-Dimensional and Multi-Modal Distributions	Dongze Wu et.al.	2409.20547	link
2024-09-30	A Compact Quantum Random Number Generator Based on Balanced Detection of Shot Noise	Jaideep Singh et.al.	2409.20515	null
2024-09-30	NUTRIVISION: A System for Automatic Diet Management in Smart Healthcare	Madhumita Veeramreddy et.al.	2409.20508	null
2024-09-30	COLLAGE: Collaborative Human-Agent Interaction Generation using Hierarchical Latent Diffusion and Language Models	Divyanshu Daiya et.al.	2409.20502	null
2024-09-30	FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing	Lingling Cai et.al.	2409.20500	null
2024-09-30	All-optical autoencoder machine learning framework using diffractive processors	Peijie Feng et.al.	2409.20346	null
2024-09-30	Devil is in Details: Locality-Aware 3D Abdominal CT Volume Generation for Self-Supervised Organ Segmentation	Yuran Wang et.al.	2409.20332	link
2024-09-30	UIR-LoRA: Achieving Universal Image Restoration through Multiple Low-Rank Adaptation	Cheng Zhang et.al.	2409.20197	link
2024-09-30	Ensemble Kalman Diffusion Guidance: A Derivative-free Method for Inverse Problems	Hongkai Zheng et.al.	2409.20175	null
2024-09-30	Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model	Fulong Ma et.al.	2409.20164	null
2024-09-30	Conditional Diffusion Models are Minimax-Optimal and Manifold-Adaptive for Conditional Distribution Estimation	Rong Tang et.al.	2409.20124	null
2024-09-30	Training a Computer Vision Model for Commercial Bakeries with Primarily Synthetic Images	Thomas H. Schmitt et.al.	2409.20122	null
2024-09-30	Reaction-diffusion model for a population structured in phenotype and space I – Criterion for persistence	Nathanaël Boutillon et.al.	2409.20118	null
2024-09-30	Near-Field Coupling Coil System: A Novel Radiofrequency Coil Solution for MRI	Zhiguang Mo et.al.	2409.20095	null
2024-09-27	$O(d/T)$ Convergence Theory for Diffusion Probabilistic Models under Minimal Assumptions	Gen Li et.al.	2409.18959	null
2024-09-27	ReviveDiff: A Universal Diffusion Model for Restoring Images in Adverse Weather Conditions	Wenfeng Huang et.al.	2409.18932	null
2024-09-27	Unsupervised Low-light Image Enhancement with Lookup Tables and Diffusion Priors	Yunlong Lin et.al.	2409.18899	null
2024-09-27	Detecting Dataset Abuse in Fine-Tuning Stable Diffusion Models for Text-to-Image Synthesis	Songrui Wang et.al.	2409.18897	null
2024-09-27	HM3: Hierarchical Multi-Objective Model Merging for Pretrained Models	Yu Zhou et.al.	2409.18893	null
2024-09-27	Explainable Artifacts for Synthetic Western Blot Source Attribution	João Phillipe Cardenuto et.al.	2409.18881	link
2024-09-27	Emu3: Next-Token Prediction is All You Need	Xinlong Wang et.al.	2409.18869	null
2024-09-27	Challenges of Generating Structurally Diverse Graphs	Fedor Velikonivtsev et.al.	2409.18859	link
2024-09-27	Moldable Development Patterns	Oscar Nierstrasz et.al.	2409.18811	null
2024-09-27	Convergence of Diffusion Models Under the Manifold Hypothesis in High-Dimensions	Iskander Azangulov et.al.	2409.18804	null
2024-09-27	Student-Oriented Teacher Knowledge Refinement for Knowledge Distillation	Chaomin Shen et.al.	2409.18785	null
2024-09-27	Geometric deep learning for galaxy-halo connection: a case study for galaxy intrinsic alignments	Yesukhei Jagvaral et.al.	2409.18761	null
2024-09-27	Cottention: Linear Transformers With Cosine Attention	Gabriel Mongaras et.al.	2409.18747	link
2024-09-27	Read Over the Lines: Attacking LLMs and Toxicity Detection Systems with ASCII Art to Mask Profanity	Sergey Berezin et.al.	2409.18708	link
2024-09-27	MG-Net: Learn to Customize QAOA with Circuit Depth Awareness	Yang Qian et.al.	2409.18692	link
2024-09-26	FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner	Wenliang Zhao et.al.	2409.18128	link
2024-09-26	Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction	Jing He et.al.	2409.18124	null
2024-09-26	EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation	Jiaxiang Tang et.al.	2409.18114	null
2024-09-26	MALPOLON: A Framework for Deep Species Distribution Modeling	Theo Larcher et.al.	2409.18102	link
2024-09-26	StackGen: Generating Stable Structures from Silhouettes via Diffusion	Luzhe Sun et.al.	2409.18098	null
2024-09-26	DiffSSC: Semantic LiDAR Scan Completion using Denoising Diffusion Probabilistic Models	Helin Cao et.al.	2409.18092	null
2024-09-26	Stable Video Portraits	Mirela Ostrek et.al.	2409.18083	null
2024-09-26	LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field	Huan Wang et.al.	2409.18057	link
2024-09-26	Automated Detection and Analysis of Power Words in Persuasive Text Using Natural Language Processing	Sahil Garje et.al.	2409.18033	null
2024-09-26	PhoCoLens: Photorealistic and Consistent Reconstruction in Lensless Imaging	Xin Cai et.al.	2409.17996	null
2024-09-26	Joint Localization and Planning using Diffusion	L. Lao Beyer et.al.	2409.17995	null
2024-09-26	Manufacturing, processing, applications, and advancements of Fe-based shape memory alloys	Anwar Algamal et.al.	2409.17973	null
2024-09-26	CNCA: Toward Customizable and Natural Generation of Adversarial Camouflage for Vehicle Detectors	Linye Lyu et.al.	2409.17963	link
2024-09-26	Relativistic diffusion model for hadron production in p-Pb collisions at the LHC	Philipp Schulz et.al.	2409.17960	null
2024-09-26	Perturb, Attend, Detect and Localize (PADL): Robust Proactive Image Defense	Filippo Bartolucci et.al.	2409.17941	null
2024-09-25	DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion	Yukun Huang et.al.	2409.17145	link
2024-09-25	Language-oriented Semantic Communication for Image Transmission with Fine-Tuned Diffusion Model	Xinfeng Wei et.al.	2409.17104	null
2024-09-25	Accumulator-Aware Post-Training Quantization	Ian Colbert et.al.	2409.17092	null
2024-09-25	Ctrl-GenAug: Controllable Generative Augmentation for Medical Sequence Classification	Xinrui Zhou et.al.	2409.17091	null
2024-09-25	Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors	Aiping Zhang et.al.	2409.17058	link
2024-09-25	ControlCity: A Multimodal Diffusion Model Based Approach for Accurate Geospatial Data Generation and Urban Morphology Analysis	Fangshuo Zhou et.al.	2409.17049	link
2024-09-25	GeoBiked: A Dataset with Geometric Features and Automated Labeling Techniques to Enable Deep Generative Models in Engineering Design	Phillip Mueller et.al.	2409.17045	null
2024-09-25	CNN Mixture-of-Depths	Rinor Cakaj et.al.	2409.17016	null
2024-09-25	Single Image, Any Face: Generalisable 3D Face Generation	Wenqing Wang et.al.	2409.16990	null
2024-09-25	Dynamic Obstacle Avoidance through Uncertainty-Based Adaptive Planning with Diffusion	Vineet Punyamoorty et.al.	2409.16950	null
2024-09-25	DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling	Kyuheon Jung et.al.	2409.16949	link
2024-09-25	Divergence asymmetry and connected components in a general duplication-divergence graph model	Dario Borrelli et.al.	2409.16943	null
2024-09-25	Generative Object Insertion in Gaussian Splatting with a Multi-View Diffusion Model	Hongliang Zhong et.al.	2409.16938	link
2024-09-25	Linking in Style: Understanding learned features in deep learning models	Maren H. Wehrheim et.al.	2409.16865	link
2024-09-25	A Versatile and Differentiable Hand-Object Interaction Representation	Théo Morales et.al.	2409.16855	null
2024-09-18	Massively Multi-Person 3D Human Motion Forecasting with Scene Context	Felix B Mueller et.al.	2409.12189	link
2024-09-18	MoRAG – Multi-Fusion Retrieval Augmented Generation for Human Motion	Kalakonda Sai Shashank et.al.	2409.12140	link
2024-09-24	Takin: A Cohort of Superior Quality Zero-shot Speech Generation Models	Sijing Chen et.al.	2409.12139	null
2024-09-18	Brain-Streams: fMRI-to-Image Reconstruction with Multi-modal Guidance	Jaehoon Joo et.al.	2409.12099	null
2024-09-19	Skill matching at scale: freelancer-project alignment for efficient multilingual candidate retrieval	Warren Jouanneau et.al.	2409.12097	null
2024-09-18	Design of Ligand-Binding Proteins with Atomic Flow Matching	Junqi Liu et.al.	2409.12080	null
2024-09-18	Denoising diffusion models for high-resolution microscopy image restoration	Pamela Osuna-Vargas et.al.	2409.12078	null
2024-09-19	Using Large Language Models to Generate Clinical Trial Tables and Figures	Yumeng Yang et.al.	2409.12046	null
2024-09-18	LEMON: Localized Editing with Mesh Optimization and Neural Shaders	Furkan Mert Algan et.al.	2409.12024	null
2024-09-18	Promise and Peril of Collaborative Code Generation Models: Balancing Effectiveness and Memorization	Zhi Chen et.al.	2409.12020	null
2024-09-18	Towards Global Localization using Multi-Modal Object-Instance Re-Identification	Aneesh Chavan et.al.	2409.12002	link
2024-09-18	Tracking Any Point with Frame-Event Fusion Network at High Frame Rate	Jiaxiong Liu et.al.	2409.11953	null
2024-09-18	Generation of Complex 3D Human Motion by Temporal and Spatial Composition of Diffusion Models	Lorenzo Mandelli et.al.	2409.11920	null
2024-09-18	AlignBot: Aligning VLM-powered Customized Task Planning with User Reminders Through Fine-Tuning for Household Robots	Zhaxizhuoma et.al.	2409.11905	null
2024-09-18	Finding the Subjective Truth: Collecting 2 Million Votes for Comprehensive Gen-AI Model Evaluation	Dimitrios Christodoulou et.al.	2409.11904	null
2024-09-17	Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion	Zhenwei Wang et.al.	2409.11406	null
2024-09-17	Teaching dark matter simulations to speak the halo language	Shivam Pandey et.al.	2409.11401	link
2024-09-17	Ultrasound Image Enhancement with the Variance of Diffusion Models	Yuxin Zhang et.al.	2409.11380	link
2024-09-17	OSV: One Step is Enough for High-Quality Image to Video Generation	Xiaofeng Mao et.al.	2409.11367	null
2024-09-17	Ping! Your Food is Ready: Comparing Different Notification Techniques in 3D AR Cooking Environment	Aditya Raikwar et.al.	2409.11357	null
2024-09-17	Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think	Gonzalo Martin Garcia et.al.	2409.11355	link
2024-09-17	OmniGen: Unified Image Generation	Shitao Xiao et.al.	2409.11340	link
2024-09-17	fMRI-3D: A Comprehensive Dataset for Enhancing fMRI-based 3D Reconstruction	Jianxiong Gao et.al.	2409.11315	null
2024-09-17	SpMis: An Investigation of Synthetic Spoken Misinformation Detection	Peizhuo Liu et.al.	2409.11308	null
2024-09-17	Measurement of top-quark pair production in association with charm quarks in proton-proton collisions at $\sqrt{s}=13$ TeV with the ATLAS detector	ATLAS Collaboration et.al.	2409.11305	null
2024-09-17	NirvaWave: An Accurate and Efficient Near Field Wave Propagation Simulator for 6G and Beyond	Vahid Yazdnian et.al.	2409.11293	link
2024-09-17	DroneDiffusion: Robust Quadrotor Dynamics Learning with Diffusion Models	Avirup Das et.al.	2409.11292	null
2024-09-17	Neural Networks for Vehicle Routing Problem	László Kovács et.al.	2409.11290	null
2024-09-17	Attacking Slicing Network via Side-channel Reinforcement Learning Attack	Wei Shao et.al.	2409.11258	null
2024-09-17	Learning Source Disentanglement in Neural Audio Codec	Xiaoyu Bie et.al.	2409.11228	null
2024-09-16	Pennsieve - A Collaborative Platform for Translational Neuroscience and Beyond	Zack Goldblum et.al.	2409.10509	null
2024-09-16	Torres funerarias chullpa en el valle del río Lauca: un primer análisis arqueoastronómico	Alejandro Gangui et.al.	2409.10497	null
2024-09-16	Incorporating Classifier-Free Guidance in Diffusion Model-Based Recommendation	Noah Buchanan et.al.	2409.10494	null
2024-09-16	SimInversion: A Simple Framework for Inversion-Based Text-to-Image Editing	Qi Qian et.al.	2409.10476	null
2024-09-16	MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion	Lehong Wu et.al.	2409.10473	null
2024-09-16	Signed Graph Autoencoder for Explainable and Polarization-Aware Network Embeddings	Nikolaos Nakis et.al.	2409.10452	null
2024-09-16	Mamba-ST: State Space Model for Efficient Style Transfer	Filippo Botti et.al.	2409.10385	link
2024-09-16	2D or not 2D: How Does the Dimensionality of Gesture Representation Affect 3D Co-Speech Gesture Generation?	Téo Guichoux et.al.	2409.10357	null
2024-09-16	Taming Diffusion Models for Image Restoration: A Review	Ziwei Luo et.al.	2409.10353	null
2024-09-16	MEGS: Morphological Evaluation of Galactic Structure	Ufuk Çakır et.al.	2409.10346	link
2024-09-16	VAE-QWGAN: Improving Quantum GANs for High Resolution Image Generation	Aaron Mark Thomas et.al.	2409.10339	null
2024-09-16	Research and Design of a Financial Intelligent Risk Control Platform Based on Big Data Analysis and Deep Machine Learning	Shuochen Bi et.al.	2409.10331	null
2024-09-16	Fairness, not Emotion, Drives Socioeconomic Decision Making	Rudra Mukhopadhyay et.al.	2409.10322	null
2024-09-16	On Synthetic Texture Datasets: Challenges, Creation, and Curation	Blaine Hoak et.al.	2409.10297	null
2024-09-16	DreamHead: Learning Spatial-Temporal Correspondence via Hierarchical Diffusion for Audio-driven Talking Head Synthesis	Fa-Ting Hong et.al.	2409.10281	null
2024-09-13	Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation	Qingwen Bu et.al.	2409.09016	link
2024-09-13	A Diffusion Approach to Radiance Field Relighting using Multi-Illumination Synthesis	Yohan Poirier-Ginter et.al.	2409.08947	null
2024-09-13	Emerging Reliance Behaviors in Human-AI Text Generation: Hallucinations, Data Quality Assessment, and Cognitive Forcing Functions	Zahra Ashktorab et.al.	2409.08937	null
2024-09-13	Latent Space Score-based Diffusion Model for Probabilistic Multivariate Time Series Imputation	Guojun Liang et.al.	2409.08917	link
2024-09-13	Gaussian is All You Need: A Unified Framework for Solving Inverse Problems via Diffusion Posterior Sampling	Nebiyou Yismaw et.al.	2409.08906	link
2024-09-13	Adjoint Matching: Fine-tuning Flow and Diffusion Generative Models with Memoryless Stochastic Optimal Control	Carles Domingo-Enrich et.al.	2409.08861	null
2024-09-13	The Line-Based Dial-a-Ride Problem	Kendra Reiter et.al.	2409.08860	link
2024-09-13	InstantDrag: Improving Interactivity in Drag-based Image Editing	Joonghyuk Shin et.al.	2409.08857	null
2024-09-13	DX2CT: Diffusion Model for 3D CT Reconstruction from Bi or Mono-planar 2D X-ray(s)	Yun Su Jeong et.al.	2409.08850	null
2024-09-13	Development of a Compton Imager Setup	Anuraag Arya et.al.	2409.08822	null
2024-09-13	LLaQo: Towards a Query-Based Coach in Expressive Music Performance Assessment	Huan Zhang et.al.	2409.08795	link
2024-09-13	What You Say = What You Want? Teaching Humans to Articulate Requirements for LLMs	Qianou Ma et.al.	2409.08775	link
2024-09-13	A Hybrid Meta-Learning and Multi-Armed Bandit Approach for Context-Specific Multi-Objective Recommendation Optimization	Tiago Cunha et.al.	2409.08752	null
2024-09-13	Adaptive Sampling for Continuous Group Equivariant Neural Networks	Berfin Inal et.al.	2409.08741	null
2024-09-13	DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset	Jiawei Du et.al.	2409.08731	link
2024-09-12	DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors	Thomas Hanwen Zhu et.al.	2409.08278	null
2024-09-12	Hand-Object Interaction Pretraining from Videos	Himanshu Gaurav Singh et.al.	2409.08273	null
2024-09-12	Click2Mask: Local Editing with Dynamic Mask Generation	Omer Regev et.al.	2409.08272	link
2024-09-12	DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer	Runjia Li et.al.	2409.08271	null
2024-09-12	Touch2Touch: Cross-Modal Tactile Generation for Object Manipulation	Samanta Rodriguez et.al.	2409.08269	null
2024-09-12	Improving Text-guided Object Inpainting with Semantic Pre-inpainting	Yifu Chen et.al.	2409.08260	link
2024-09-12	Improving Virtual Try-On with Garment-focused Diffusion Models	Siqi Wan et.al.	2409.08258	link
2024-09-12	LoRID: Low-Rank Iterative Diffusion for Adversarial Purification	Geigh Zollicoffer et.al.	2409.08255	null
2024-09-12	Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding	Hongyu Li et.al.	2409.08251	null
2024-09-12	IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation	Yinwei Wu et.al.	2409.08240	null
2024-09-12	Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources	Alisia Lupidi et.al.	2409.08239	null
2024-09-12	LT3SD: Latent Trees for 3D Scene Diffusion	Quan Meng et.al.	2409.08215	null
2024-09-12	VI3DRM:Towards meticulous 3D Reconstruction from Sparse Views via Photo-Realistic Novel View Synthesis	Hao Chen et.al.	2409.08207	null
2024-09-12	High-Frequency Anti-DreamBooth: Robust Defense Against Image Synthesis	Takuto Onikubo et.al.	2409.08167	link
2024-09-12	MagicStyle: Portrait Stylization Based on Reference Image	Zhaoli Deng et.al.	2409.08156	null
2024-09-11	DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation	Haibo Yang et.al.	2409.07454	null
2024-09-11	Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models	Haibo Yang et.al.	2409.07452	link
2024-09-11	FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process	Yang Luo et.al.	2409.07451	null
2024-09-11	Efficient One-Step Diffusion Refinement for Snapshot Compressive Imaging	Yunzhen Wang et.al.	2409.07417	null
2024-09-11	Extracting TCPIP Headers at High Speed for the Anonymized Network Traffic Graph Challenge	Zhaoyang Han et.al.	2409.07374	null
2024-09-11	Awaking the Slides: A Tuning-free and Knowledge-regulated AI Tutoring System via Language Model Coordination	Daniel Zhang-Li et.al.	2409.07372	null
2024-09-11	Event-based Mosaicing Bundle Adjustment	Shuang Guo et.al.	2409.07365	link
2024-09-11	Training-Free Guidance for Discrete Diffusion Models for Molecular Generation	Thomas J. Kerby et.al.	2409.07359	null
2024-09-11	Learning Robotic Manipulation Policies from Point Clouds with Conditional Flow Matching	Eugenio Chisari et.al.	2409.07343	null
2024-09-11	Efficient and Unbiased Sampling of Boltzmann Distributions via Consistency Models	Fengzhe Zhang et.al.	2409.07323	null
2024-09-11	Optimizing Neural Network Performance and Interpretability with Diophantine Equation Encoding	Ronald Katende et.al.	2409.07310	null
2024-09-11	Exploring User-level Gradient Inversion with a Diffusion Prior	Zhuohang Li et.al.	2409.07291	null
2024-09-11	CCFExp: Facial Image Synthesis with Cycle Cross-Fusion Diffusion Model for Facial Paralysis Individuals	Weixiang Gao et.al.	2409.07271	link
2024-09-11	Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models	Sanoojan Baliah et.al.	2409.07269	link
2024-09-11	EMOdiffhead: Continuously Emotional Control in Talking Head Generation via Diffusion	Jian Zhang et.al.	2409.07255	link
2024-09-10	Technical Report of Mobile Manipulator Robot for Industrial Environments	Erfan Amoozad Khalili et.al.	2409.06693	null
2024-09-10	SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation	Teng Hu et.al.	2409.06633	null
2024-09-10	MVGaussian: High-Fidelity text-to-3D Content Generation with Multi-View Guidance and Surface Densification	Phu Pham et.al.	2409.06620	null
2024-09-10	A Primer on Variational Inference for Physics-Informed Deep Generative Modelling	Alex Glyn-Davies et.al.	2409.06560	null
2024-09-10	From LIMA to DeepLIMA: following a new path of interoperability	Victor Bocharov et.al.	2409.06550	null
2024-09-10	Enhancing Emotional Text-to-Speech Controllability with Natural Language Guidance through Contrastive Learning and Diffusion Models	Xin Jing et.al.	2409.06451	null
2024-09-10	Prompt2Fashion: An automatically generated fashion dataset	Georgia Argyro et.al.	2409.06442	link
2024-09-10	Fast nonparametric inference of network backbones for graph sparsification	Alec Kirkley et.al.	2409.06417	link
2024-09-10	Distilling Generative-Discriminative Representations for Very Low-Resolution Face Recognition	Junzheng Zhang et.al.	2409.06371	null
2024-09-10	What happens to diffusion model likelihood when your model is conditional?	Mattias Cross et.al.	2409.06364	null
2024-09-10	DiffQRCoder: Diffusion-based Aesthetic QR Code Generation with Scanning Robustness Guided Iterative Refinement	Jia-Wei Liao et.al.	2409.06355	null
2024-09-10	Improving Conditional Level Generation using Automated Validation in Match-3 Games	Monica Villanueva Aylagas et.al.	2409.06349	null
2024-09-10	Foragax: An Agent Based Modelling framework based on JAX	Siddharth Chaturvedi et.al.	2409.06345	link
2024-09-10	G3PT: Unleash the power of Autoregressive Modeling in 3D Generation via Cross-scale Querying Transformer	Jinzhi Zhang et.al.	2409.06322	null
2024-09-10	Learning Augmentation Policies from A Model Zoo for Time Series Forecasting	Haochen Yuan et.al.	2409.06282	null
2024-09-09	Fast Generation of Custom Floating-Point Spatial Filters on FPGAs	Nelson Campos et.al.	2409.05837	null
2024-09-09	Enhancing Preference-based Linear Bandits via Human Response Time	Shen Li et.al.	2409.05798	null
2024-09-09	Predicting Critical Heat Flux with Uncertainty Quantification and Domain Generalization Using Conditional Variational Autoencoders and Deep Neural Networks	Farah Alsafadi et.al.	2409.05790	null
2024-09-09	Vector Quantized Diffusion Model Based Speech Bandwidth Extension	Yuan Fang et.al.	2409.05784	null
2024-09-09	AS-Speech: Adaptive Style For Speech Synthesis	Zhipeng Li et.al.	2409.05730	null
2024-09-09	pFedGPA: Diffusion-based Generative Parameter Aggregation for Personalized Federated Learning	Jiahao Lai et.al.	2409.05701	null
2024-09-09	Citizen-Led Personalization of User Interfaces: Investigating How People Customize Interfaces for Themselves and Others	Sérgio Alves et.al.	2409.05696	null
2024-09-09	Unlearning or Concealment? A Critical Analysis and Evaluation Metrics for Unlearning in Diffusion Models	Aakash Sen Sharma et.al.	2409.05668	null
2024-09-09	Forward KL Regularized Preference Optimization for Aligning Diffusion Policies	Zhao Shan et.al.	2409.05622	null
2024-09-09	CustomContrast: A Multilevel Contrastive Perspective For Subject-Driven Text-to-Image Customization	Nan Chen et.al.	2409.05606	null
2024-09-09	Latent 3D Brain MRI Counterfactual	Wei Peng et.al.	2409.05585	null
2024-09-09	Spatially-Aware Speaker for Vision-and-Language Navigation Instruction Generation	Muraleekrishna Gopinathan et.al.	2409.05583	link
2024-09-09	Design and Implementation of TAO DAQ System	Shuihan Zhang et.al.	2409.05522	null
2024-09-09	A Taxonomy of Miscompressions: Preparing Image Forensics for Neural Compression	Nora Hofer et.al.	2409.05490	null
2024-09-09	DriveScape: Towards High-Resolution Controllable Multi-View Driving Video Generation	Wei Wu et.al.	2409.05463	null
2024-09-06	VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation	Yecheng Wu et.al.	2409.04429	link
2024-09-06	Exploring Foundation Models for Synthetic Medical Imaging: A Study on Chest X-Rays and Fine-Tuning Techniques	Davide Clode da Silva et.al.	2409.04424	null
2024-09-06	Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation	Zhuoyan Luo et.al.	2409.04410	link
2024-09-06	Enhancing Skin Lesion Diagnosis with Ensemble Learning	Xiaoyi Liu et.al.	2409.04381	null
2024-09-06	How Fair is Your Diffusion Recommender Model?	Daniele Malitesta et.al.	2409.04339	null
2024-09-06	Random effects estimation in a fractional diffusion model based on continuous observations	Nesrine Chebli et.al.	2409.04331	null
2024-09-06	Advancing Automated Knowledge Transfer in Evolutionary Multitasking via Large Language Models	Yuxiao Huang et.al.	2409.04270	null
2024-09-06	An overview of domain-specific foundation model: key technologies, applications and challenges	Haolong Chen et.al.	2409.04267	null
2024-09-06	UniDet3D: Multi-dataset Indoor 3D Object Detection	Maksim Kolodiazhnyi et.al.	2409.04234	link
2024-09-06	Generative Modelling via Quantile Regression	Johannes Schmidt-Hieber et.al.	2409.04231	null
2024-09-06	Breaking the Brownian Barrier: Models and Manifestations of Molecular Diffusion in Complex Fluids	Harish Srinivasan et.al.	2409.04199	null
2024-09-06	GST: Precise 3D Human Body from a Single Image with Gaussian Splatting Transformers	Lorenza Prospero et.al.	2409.04196	link
2024-09-06	Subsampling of Correlated Graph Signals	Rishabh Ravi et.al.	2409.04107	null
2024-09-06	Estimation of service value parameters for a queue with unobserved balking	Daniel Podorojnyi et.al.	2409.04090	null
2024-09-06	D4: Text-guided diffusion model-based domain adaptive data augmentation for vineyard shoot detection	Kentaro Hirahara et.al.	2409.04060	null
2024-09-05	Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding	Yunze Man et.al.	2409.03757	link
2024-09-05	WildVis: Open Source Visualizer for Million-Scale Chat Logs in the Wild	Yuntian Deng et.al.	2409.03753	null
2024-09-05	ArtiFade: Learning to Generate High-quality Subject from Blemished Images	Shuya Yang et.al.	2409.03745	null
2024-09-06	RAG based Question-Answering for Contextual Response Prediction System	Sriram Veturi et.al.	2409.03708	null
2024-09-05	RealisHuman: A Two-Stage Approach for Refining Malformed Human Parts in Generated Images	Benzhi Wang et.al.	2409.03644	link
2024-09-05	DiffEVC: Any-to-Any Emotion Voice Conversion with Expressive Guidance	Hsing-Hang Chou et.al.	2409.03636	null
2024-09-05	Generalizing Linear Graphs and Bond Graph Models with Hetero-functional Graphs for System-of-Systems Engineering Applications	Ehsanoddin Ghorbanichemazkati et.al.	2409.03630	null
2024-09-05	TCDiff: Triple Condition Diffusion Model with 3D Constraints for Stylizing Synthetic Faces	Bernardo Biesseck et.al.	2409.03600	link
2024-09-05	DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture	Qianlong Xiang et.al.	2409.03550	link
2024-09-05	Euclid preparation. Simulations and nonlinearities beyond $Λ$ CDM. 2. Results from non-standard simulations	Euclid Collaboration et.al.	2409.03523	null
2024-09-05	Blended Latent Diffusion under Attention Control for Real-World Video Editing	Deyin Liu et.al.	2409.03514	null
2024-09-05	Physical Modelling of Piano Sound	Haifan Xie et.al.	2409.03481	null
2024-09-05	Data-free Distillation with Degradation-prompt Diffusion for Multi-weather Image Restoration	Pei Wang et.al.	2409.03455	null
2024-09-05	Rx Strategist: Prescription Verification using LLM Agents System	Phuc Phan Van et.al.	2409.03440	null
2024-09-05	KiloBot: A Programming Language for Deploying Perception-Guided Industrial Manipulators at Scale	Wei Gao et.al.	2409.03439	null
2024-09-04	HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts	Xinyu Liu et.al.	2409.02919	link
2024-09-04	Latent Watermarking of Audio Generative Models	Robin San Roman et.al.	2409.02915	null
2024-09-04	Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling	Kaiwen Zheng et.al.	2409.02908	null
2024-09-04	Configurable Foundation Models: Building LLMs from a Modular Perspective	Chaojun Xiao et.al.	2409.02877	null
2024-09-04	Look Into the LITE in Deep Learning for Time Series Classification	Ali Ismail-Fawaz et.al.	2409.02869	link
2024-09-04	Building a Scalable, Effective, and Steerable Search and Ranking Platform	Marjan Celikik et.al.	2409.02856	null
2024-09-04	Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models	Zhibin Liu et.al.	2409.02851	link
2024-09-04	Anomaly Detection in Offshore Open Radio Access Network Using Long Short-Term Memory Models on a Novel Artificial Intelligence-Driven Cloud-Native Data Platform	Abdelrahim Ahmad et.al.	2409.02849	null
2024-09-04	Multi-Track MusicLDM: Towards Versatile Music Generation with Latent Diffusion Model	Tornike Karchkhadze et.al.	2409.02845	null
2024-09-04	SNNAX – Spiking Neural Networks in JAX	Jamie Lohoff et.al.	2409.02842	null
2024-09-04	Experimental Framework for Generating Reliable Ground Truth for Laryngeal Spatial Segmentation Tasks	Hamzeh Ghasemzadeh et.al.	2409.02809	null
2024-09-04	Creating a Gen-AI based Track and Trace Assistant MVP (SuperTracy) for PostNL	Mohammad Reshadati et.al.	2409.02711	null
2024-09-04	Rethinking HTG Evaluation: Bridging Generation and Recognition	Konstantina Nikolaidou et.al.	2409.02683	link
2024-09-04	Introduction to Machine Learning	Laurent Younes et.al.	2409.02668	null
2024-09-04	Creating Domain-Specific Translation Memories for Machine Translation Fine-tuning: The TRENCARD Bilingual Cardiology Corpus	Gokhan Dogru et.al.	2409.02667	null
2024-08-30	Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes	Li Zhang et.al.	2408.17421	link
2024-08-30	Assessing Generative Language Models in Classification Tasks: Performance and Self-Evaluation Capabilities in the Environmental and Climate Change Domain	Francesca Grasso et.al.	2408.17362	link
2024-08-30	Subspace Diffusion Posterior Sampling for Travel-Time Tomography	Xiang Cao et.al.	2408.17333	null
2024-08-30	Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations	Ahmed Hammam et.al.	2408.17311	null
2024-08-30	Leveraging Deep Generative Model For Computational Protein Design And Optimization	Boqiao Lai et.al.	2408.17241	null
2024-08-30	Towards Symbolic XAI – Explanation Through Human Understandable Logical Relationships Between Features	Thomas Schnake et.al.	2408.17198	null
2024-09-02	Leveraging Blockchain and ANFIS for Optimal Supply Chain Management	Amirfarhad Farhadi et.al.	2408.17161	null
2024-08-30	Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning	Xiaoye Qu et.al.	2408.17150	link
2024-08-30	Flow Matching for Optimal Reaction Coordinates of Biomolecular System	Mingyuan Zhang et.al.	2408.17139	link
2024-08-30	Temporal and Interactive Modeling for Efficient Human-Human Motion Generation	Yabiao Wang et.al.	2408.17135	null
2024-09-02	RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance	Avideep Mukherjee et.al.	2408.17095	null
2024-08-30	FissionVAE: Federated Non-IID Image Generation with Latent Space and Decoder Decomposition	Chen Hu et.al.	2408.17090	link
2024-08-30	Approximately Invertible Neural Network for Learned Image Compression	Yanbo Gao et.al.	2408.17073	null
2024-09-02	Instant Adversarial Purification with Adversarial Consistency Distillation	Chun Tong Lei et.al.	2408.17064	null
2024-08-30	Text-to-Image Generation Via Energy-Based CLIP	Roy Ganz et.al.	2408.17046	null
2024-08-29	ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model	Fangfu Liu et.al.	2408.16767	null
2024-08-29	CSGO: Content-Style Composition in Text-to-Image Generation	Peng Xing et.al.	2408.16766	null
2024-08-29	A Score-Based Density Formula, with Applications in Diffusion Generative Models	Gen Li et.al.	2408.16765	null
2024-08-29	UV-free Texture Generation with Denoising and Geodesic Heat Diffusions	Simone Foti et.al.	2408.16762	link
2024-08-29	One-Shot Learning Meets Depth Diffusion in Multi-Object Videos	Anisha Jain et.al.	2408.16704	null
2024-08-29	VMC: A Grammar for Visualizing Statistical Model Checks	Ziyang Guo et.al.	2408.16702	null
2024-08-29	GradBias: Unveiling Word Influence on Bias in Text-to-Image Generative Models	Moreno D’Incà et.al.	2408.16700	link
2024-08-29	Optimization Models for the Quadratic Traveling Salesperson Problem	Yuxiao Chen et.al.	2408.16680	null
2024-08-29	DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving	Yongjie Fu et.al.	2408.16647	null
2024-08-29	RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model	Zhuan Shi et.al.	2408.16634	null
2024-08-28	TEDRA: Text-based Editing of Dynamic and Photoreal Actors	Basavaraj Sunagad et.al.	2408.15995	null
2024-08-28	Distribution Backtracking Builds A Faster Convergence Trajectory for One-step Diffusion Distillation	Shengyuan Zhang et.al.	2408.15991	link
2024-08-28	Thoughtseeds: Evolutionary Priors, Nested Markov Blankets, and the Emergence of Embodied Cognition	Prakash Chandra Kavi et.al.	2408.15982	null
2024-08-28	Stability of Primal-Dual Gradient Flow Dynamics for Multi-Block Convex Optimization Problems	Ibrahim K. Ozaslan et.al.	2408.15969	null
2024-08-28	MetaGFN: Exploring Distant Modes with Adapted Metadynamics for Continuous GFlowNets	Dominic Phillips et.al.	2408.15905	null
2024-08-28	Gen-Swarms: Adapting Deep Generative Models to Swarms of Drones	Carlos Plou et.al.	2408.15899	null
2024-08-28	Airfoil Diffusion: Denoising Diffusion Model For Conditional Airfoil Generation	Reid Graves et.al.	2408.15898	link
2024-08-28	Disentangled Diffusion Autoencoder for Harmonization of Multi-site Neuroimaging Data	Ayodeji Ijishakin et.al.	2408.15890	null
2024-08-29	Recent Decade’s Power Outage Data Reveals the Increasing Vulnerability of U.S. Power Infrastructure	Bo Li et.al.	2408.15882	null
2024-08-28	GenDDS: Generating Diverse Driving Video Scenarios with Prompt-to-Video Generative Model	Yongjie Fu et.al.	2408.15868	null
2024-08-27	GenRec: Unifying Video Generation and Recognition with Diffusion Models	Zejia Weng et.al.	2408.15241	link
2024-08-27	Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation	Xiaojuan Wang et.al.	2408.15239	null
2024-08-27	Simulation of Stochastic Discrete Dislocation Dynamics in Ductile Vs Brittle Materials	Santosh Chhetri et.al.	2408.15157	null
2024-08-27	How transformers learn structured data: insights from hierarchical filtering	Jerome Garnier-Brun et.al.	2408.15138	link
2024-08-27	DIFR3CT: Latent Diffusion for Probabilistic 3D CT Reconstruction from Few Planar X-Rays	Yiran Sun et.al.	2408.15118	link
2024-08-27	Data-Driven Nonlinear Deformation Design of 3D-Printable Shells	Samuel Silverman et.al.	2408.15097	link
2024-08-27	Constrained Diffusion Models via Dual Training	Shervin Khalafi et.al.	2408.15094	null
2024-08-27	LN-Gen: Rectal Lymph Nodes Generation via Anatomical Features	Weidong Guo et.al.	2408.14977	null
2024-08-27	MegActor- $Σ$ : Unlocking Flexible Mixed-Modal Control in Portrait Animation with Diffusion Transformer	Shurong Yang et.al.	2408.14975	null
2024-08-27	Integrated Bundling and Pricing of Unique Items	Maxime Bouscary et.al.	2408.14913	null
2024-08-26	K-Sort Arena: Efficient and Reliable Benchmarking for Generative Models via K-wise Human Preferences	Zhikai Li et.al.	2408.14468	null
2024-08-26	Uncovering Knowledge Gaps in Radiology Report Generation Models through Knowledge Graphs	Xiaoman Zhang et.al.	2408.14397	link
2024-08-26	Reprogramming Foundational Large Language Models(LLMs) for Enterprise Adoption for Spatio-Temporal Forecasting Applications: Unveiling a New Era in Copilot-Guided Cross-Modal Time Series Representation Learning	Sakhinana Sagar Srinivas et.al.	2408.14387	null
2024-08-26	GR-MG: Leveraging Partially Annotated Data via Multi-Modal Goal Conditioned Policy	Peiyan Li et.al.	2408.14368	link
2024-08-27	Foundation Models for Music: A Survey	Yinghao Ma et.al.	2408.14340	link
2024-08-26	Automated Machine Learning in Insurance	Panyi Dong et.al.	2408.14331	link
2024-08-26	LLM-3D Print: Large Language Models To Monitor and Control 3D Printing	Yayati Jadhav et.al.	2408.14307	null
2024-08-26	Learning Local Pattern Modularization for Point Cloud Reconstruction from Unseen Classes	Chao Chen et.al.	2408.14279	null
2024-08-26	Towards Synthetic Trace Generation of Modeling Operations using In-Context Learning Approach	Vittoriano Muttillo et.al.	2408.14259	null
2024-08-27	Text3DAug – Prompted Instance Augmentation for LiDAR Perception	Laurenz Reichardt et.al.	2408.14253	link
2024-08-23	How Diffusion Models Learn to Factorize and Compose	Qiyao Liang et.al.	2408.13256	null
2024-08-23	Foundational Model for Electron Micrograph Analysis: Instruction-Tuning Small-Scale Language-and-Vision Assistant for Enterprise Adoption	Sakhinana Sagar Srinivas et.al.	2408.13248	null
2024-08-23	CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities	Tao Wu et.al.	2408.13239	link
2024-08-23	Social Welfare Maximization for Federated Learning with Network Effects	Xiang Li et.al.	2408.13223	null
2024-08-23	Instruct-DeBERTa: A Hybrid Approach for Aspect-based Sentiment Analysis on Textual Reviews	Dineth Jayakody et.al.	2408.13202	null
2024-08-23	IFH: a Diffusion Framework for Flexible Design of Graph Generative Models	Samuel Cognolato et.al.	2408.13194	link
2024-08-23	Deep Learning for Lung Disease Classification Using Transfer Learning and a Customized CNN Architecture with Attention	Xiaoyi Liu et.al.	2408.13180	null
2024-08-26	Focus on Neighbors and Know the Whole: Towards Consistent Dense Multiview Text-to-Image Generator for 3D Creation	Bonan Li et.al.	2408.13149	null
2024-08-23	Diffusion-based Episodes Augmentation for Offline Multi-Agent Reinforcement Learning	Jihwan Oh et.al.	2408.13092	null
2024-08-23	General Intelligent Imaging and Uncertainty Quantification by Deterministic Diffusion Model	Weiru Fan et.al.	2408.13061	null
2024-08-22	xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations	Can Qin et.al.	2408.12590	null
2024-08-22	ssProp: Energy-Efficient Training for Convolutional Neural Networks with Scheduled Sparse Back Propagation	Lujia Zhong et.al.	2408.12561	link
2024-08-22	Show-o: One Single Transformer to Unify Multimodal Understanding and Generation	Jinheng Xie et.al.	2408.12528	null
2024-08-22	FlexEdit: Marrying Free-Shape Masks to VLLM for Flexible Image Editing	Jue Wang et.al.	2408.12429	link
2024-08-22	Enhanced Infield Agriculture with Interpretable Machine Learning Approaches for Crop Classification	Sudi Murindanyi et.al.	2408.12426	null
2024-08-22	4D Diffusion for Dynamic Protein Structure Prediction with Reference Guided Motion Alignment	Kaihui Cheng et.al.	2408.12419	null
2024-08-22	CODE: Confident Ordinary Differential Editing	Bastien van Delft et.al.	2408.12418	link
2024-08-22	Dynamic PDB: A New Dataset and a SE(3) Model Extension by Integrating Dynamic Behaviors and Physical Properties in Protein Structures	Ce Liu et.al.	2408.12413	null
2024-08-22	A Stable Polygamy Approach to Spectrum Access with Channel Reuse	Dan Ben Ami et.al.	2408.12402	null
2024-08-22	Multi-Style Facial Sketch Synthesis through Masked Generative Modeling	Bowen Sun et.al.	2408.12400	null
2024-08-21	Pixel Is Not A Barrier: An Effective Evasion Attack for Pixel-Domain Diffusion Models	Chun-Yen Shih et.al.	2408.11810	null
2024-08-21	ACE: A Cross-Platform Visual-Exoskeletons System for Low-Cost Dexterous Teleoperation	Shiqi Yang et.al.	2408.11805	null
2024-08-21	DreamFactory: Pioneering Multi-Scene Long Video Generation with a Multi-Agent Framework	Zhifei Xie et.al.	2408.11788	null
2024-08-21	Timeline and Boundary Guided Diffusion Network for Video Shadow Detection	Haipeng Zhou et.al.	2408.11785	link
2024-08-21	Sum of Squares Circuits	Lorenzo Loconte et.al.	2408.11778	link
2024-08-21	Leveraging Fine-Tuned Retrieval-Augmented Generation with Long-Context Support: For 3GPP Standards	Omar Erak et.al.	2408.11775	link
2024-08-21	D-RMGPT: Robot-assisted collaborative tasks driven by large multimodal models	M. Forlini et.al.	2408.11761	null
2024-08-21	JieHua Paintings Style Feature Extracting Model using Stable Diffusion with ControlNet	Yujia Gu et.al.	2408.11744	null
2024-08-21	Enhancing Cross-Modal Medical Image Segmentation through Compositionality	Aniek Eijpe et.al.	2408.11733	link
2024-08-21	AI-assisted Automated Short Answer Grading of Handwritten University Level Mathematics Exams	Tianyi Liu et.al.	2408.11728	null
2024-08-20	Reconciling Methodological Paradigms: Employing Large Language Models as Novice Qualitative Research Assistants in Talent Management Research	Sreyoshi Bhaduri et.al.	2408.11043	null
2024-08-20	Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model	Chunting Zhou et.al.	2408.11039	null
2024-08-20	Full Detector Simulation of a Projective Dual-Readout Segmented Crystal Electromagnetic Calorimeter with Precision Timing	Wonyong Chung et.al.	2408.11027	null
2024-08-20	MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning	Haoning Wu et.al.	2408.11001	link
2024-08-20	GreediRIS: Scalable Influence Maximization using Distributed Streaming Maximum Cover	Reet Barik et.al.	2408.10982	null
2024-08-21	Assortment Optimization Under History-Dependent Effects	Taotao He et.al.	2408.10967	null
2024-08-20	Kilometer-Scale Convection Allowing Model Emulation using Generative Diffusion Modeling	Jaideep Pathak et.al.	2408.10958	null
2024-08-20	SysBench: Can Large Language Models Follow System Messages?	Yanzhao Qin et.al.	2408.10943	link
2024-08-20	A Closer Look at Data Augmentation Strategies for Finetuning-Based Low/Few-Shot Object Detection	Vladislav Li et.al.	2408.10940	null
2024-08-20	Large Point-to-Gaussian Model for Image-to-3D Generation	Longfei Lu et.al.	2408.10935	null
2024-08-19	MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model	Minghua Liu et.al.	2408.10198	null
2024-08-19	SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views	Chao Xu et.al.	2408.10195	null
2024-08-19	Customizing Language Models with Instance-wise LoRA for Sequential Recommendation	Xiaoyu Kong et.al.	2408.10159	link
2024-08-19	Advancing Voice Cloning for Nepali: Leveraging Transfer Learning in a Low-Resource Language	Manjil Karki et.al.	2408.10128	null
2024-08-19	Learning Precise Affordances from Egocentric Videos for Robotic Manipulation	Gen Li et.al.	2408.10123	null
2024-08-19	Convert and Speak: Zero-shot Accent Conversion with Minimum Supervision	Zhijun Jia et.al.	2408.10096	null
2024-08-19	Stacked Intelligent Metasurfaces for Integrated Sensing and Communications	Haoxian Niu et.al.	2408.10043	null
2024-08-19	General Impedance Modeling for Modular Multilevel Converter with Grid-forming and Grid-following Control	Chu Sun et.al.	2408.10017	null
2024-08-19	Uniting contrastive and generative learning for event sequences models	Aleksandr Yugay et.al.	2408.09995	null
2024-08-19	Multi-layer diffusion model of photovoltaic installations	Tomasz Weron et.al.	2408.09904	null
2024-08-16	Automated High-throughput Organic Crystal Structure Prediction via Population-based Sampling	Qiang Zhu et.al.	2408.08843	link
2024-08-16	PFDiff: Training-free Acceleration of Diffusion Models through the Gradient Guidance of Past and Future	Guangyi Wang et.al.	2408.08822	link
2024-08-16	A Unified Automata-Theoretic Approach to LTLf Modulo Theories (Extended Version)	Marco Faella et.al.	2408.08817	null
2024-08-16	EmoDynamiX: Emotional Support Dialogue Strategy Prediction by Modelling MiXed Emotions and Discourse Dynamics	Chenwei Wan et.al.	2408.08782	link
2024-08-16	Comparative Analysis of Generative Models: Enhancing Image Synthesis with VAEs, GANs, and Stable Diffusion	Sanchayan Vivekananthan et.al.	2408.08751	null
2024-08-16	The Blessing of Strategic Customers in Personalized Pricing	Zhi Chen et.al.	2408.08738	null
2024-08-16	ChatZero:Zero-shot Cross-Lingual Dialogue Generation via Pseudo-Target Language	Yongkang Liu et.al.	2408.08724	null
2024-08-16	An End-to-End Model for Photo-Sharing Multi-modal Dialogue Generation	Peiming Guo et.al.	2408.08650	link
2024-08-16	Modeling the Neonatal Brain Development Using Implicit Neural Representations	Florentin Bieder et.al.	2408.08647	link
2024-08-16	Sampling effects on Lasso estimation of drift functions in high-dimensional diffusion processes	Chiara Amorino et.al.	2408.08638	null
2024-08-15	Understanding the Local Geometry of Generative Model Manifolds	Ahmed Imtiaz Humayun et.al.	2408.08307	null
2024-08-15	Accelerated Image-Aware Generative Diffusion Modeling	Tanmay Asthana et.al.	2408.08306	null
2024-08-15	Marker or Markerless? Mode-Switchable Optical Tactile Sensing for Diverse Robot Tasks	Ni Ou et.al.	2408.08276	null
2024-08-15	mhGPT: A Lightweight Generative Pre-Trained Transformer for Mental Health Text Analysis	Dae-young Kim et.al.	2408.08261	null
2024-08-15	Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding	Xiner Li et.al.	2408.08252	link
2024-08-15	Picosecond laser pulses for quantum dot-microcavity based single photon generation by cascaded electro-optic modulation of a narrow-linewidth laser	Mio Poortvliet et.al.	2408.08213	null
2024-08-15	Not Every Image is Worth a Thousand Words: Quantifying Originality in Stable Diffusion	Adi Haviv et.al.	2408.08184	null
2024-08-15	Impact of Comprehensive Data Preprocessing on Predictive Modelling of COVID-19 Mortality	Sangita Das et.al.	2408.08142	link
2024-08-15	Decoding Memes: A Comparative Study of Machine Learning Models for Template Identification	Levente Murgás et.al.	2408.08126	link
2024-08-15	When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding	Pingping Zhang et.al.	2408.08093	null
2024-08-14	Detecting Near-Duplicate Face Images	Sudipta Banerjee et.al.	2408.07689	link
2024-08-14	Composing Automatic Differentiation with Custom Derivatives of Higher-Order Functions	Sam Estep et.al.	2408.07683	null
2024-08-14	Drug Discovery SMILES-to-Pharmacokinetics Diffusion Models with Deep Molecular Understanding	Bing Hu et.al.	2408.07636	null
2024-08-14	Anisotropic Diffusion Model of Communication in 2D Biofilm	Yanahan Paramalingam et.al.	2408.07626	null
2024-08-14	Neural Quantum States and Peaked Molecular Wave Functions: Curse or Blessing?	Aleksei Malyshev et.al.	2408.07625	null
2024-08-14	MatterGPT: A Generative Transformer for Multi-Property Inverse Design of Solid-State Materials	Yan Chen et.al.	2408.07608	null
2024-08-14	PeriodWave: Multi-Period Flow Matching for High-Fidelity Waveform Generation	Sang-Hoon Lee et.al.	2408.07547	link
2024-08-14	New Curriculum, New Chance – Retrieval Augmented Generation for Lesson Planning in Ugandan Secondary Schools. Prototype Quality Evaluation	Simon Kloker et.al.	2408.07542	null
2024-08-14	DifuzCam: Replacing Camera Lens with a Mask and a Diffusion Model	Erez Yosef et.al.	2408.07541	null
2024-08-14	Towards Real-time Video Compressive Sensing on Mobile Devices	Miao Cao et.al.	2408.07530	link
2024-08-13	Imagen 3	Imagen-Team-Google et.al.	2408.07009	null
2024-08-13	Low-Bitwidth Floating Point Quantization for Efficient High-Quality Diffusion Models	Cheng Chen et.al.	2408.06995	null
2024-08-13	DCMSA: Multi-Head Self-Attention Mechanism Based on Deformable Convolution For Seismic Data Denoising	Wang Mingwei et.al.	2408.06963	null
2024-08-13	Neural Speech and Audio Coding	Minje Kim et.al.	2408.06954	null
2024-08-13	Diffusion Model for Slate Recommendation	Federico Tomasi et.al.	2408.06883	null
2024-08-13	Efficient Search for Customized Activation Functions with Gradient Descent	Lukas Strack et.al.	2408.06820	link
2024-08-13	Enhancing Diabetic Retinopathy Diagnosis: A Lightweight CNN Architecture for Efficient Exudate Detection in Retinal Fundus Images	Mujadded Al Rabbani Alif et.al.	2408.06784	null
2024-08-13	Improving Synthetic Image Detection Towards Generalization: An Image Transformation Perspective	Ouxiang Li et.al.	2408.06741	link
2024-08-13	DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion	Yujia Wu et.al.	2408.06740	null
2024-08-13	Multimodal Analysis of White Blood Cell Differentiation in Acute Myeloid Leukemia Patients using a β-Variational Autoencoder	Gizem Mert et.al.	2408.06720	null
2024-08-12	The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery	Chris Lu et.al.	2408.06292	link
2024-08-12	Open-Source Molecular Processing Pipeline for Generating Molecules	Shreyas V et.al.	2408.06261	null
2024-08-12	3D Reconstruction of Protein Structures from Multi-view AFM Images using Neural Radiance Fields (NeRFs)	Jaydeep Rade et.al.	2408.06244	null
2024-08-12	Cislunar Constellation Design for Space Situational Awareness with Time-Expanded Facility Location Problem	Yuri Shimane et.al.	2408.06238	null
2024-08-12	Novel View Synthesis from a Single Image with Pretrained Diffusion Guidance	Taewon Kang et.al.	2408.06157	null
2024-08-12	LipidBERT: A Lipid Language Model Pre-trained on METiS de novo Lipid Library	Tianhao Yu et.al.	2408.06150	null
2024-08-12	Efficient and Scalable Point Cloud Generation with Sparse Point-Voxel Diffusion Models	Ioannis Romanelis et.al.	2408.06145	link
2024-08-12	Med42-v2: A Suite of Clinical LLMs	Clément Christophe et.al.	2408.06142	null
2024-08-12	Five Pitfalls When Assessing Synthetic Medical Images with Reference Metrics	Melanie Dohmen et.al.	2408.06075	null
2024-08-12	CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer	Zhuoyi Yang et.al.	2408.06072	link
2024-08-09	Multi-Garment Customized Model Generation	Yichen Liu et.al.	2408.05206	null
2024-08-09	TaSL: Task Skill Localization and Consolidation for Language Model Continual Learning	Yujie Feng et.al.	2408.05200	link
2024-08-09	Cell Morphology-Guided Small Molecule Generation with GFlowNets	Stephen Zhewen Lu et.al.	2408.05196	link
2024-08-09	Lithography-free patterning of chalcogenide materials for integrated photonic devices	Zhen Hu et.al.	2408.05099	null
2024-08-09	Social contagion under hybrid interactions	Xincheng Shu et.al.	2408.05050	null
2024-08-09	Infrared Beam-shaping on Demand via Tailored Geometric Phase Metasurfaces employing the Plasmonic Phase-Change Material In3SbTe2	Lukas Conrads et.al.	2408.05044	null
2024-08-09	Collaborative Static-Dynamic Teaching: A Semi-Supervised Framework for Stripe-Like Space Target Detection	Zijian Zhu et.al.	2408.05029	null
2024-08-09	Retrieval-augmented code completion for local projects using large language models	Marko Hostnik et.al.	2408.05026	null
2024-08-09	DreamCouple: Exploring High Quality Text-to-3D Generation Via Rectified Flow	Hangyu Li et.al.	2408.05008	null
2024-08-09	Pay Attention To Mean Fields For Point Cloud Generation	Benno Käch et.al.	2408.04997	link
2024-08-08	Puppet-Master: Scaling Interactive Video Generation as a Motion Prior for Part-Level Dynamics	Ruining Li et.al.	2408.04631	null
2024-08-08	Transformer Explainer: Interactive Learning of Text-Generative Models	Aeree Cho et.al.	2408.04619	null
2024-08-08	Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from User’s Casual Sketches	Yongzhi Xu et.al.	2408.04567	null
2024-08-08	Bias-Aware Low-Rank Adaptation: Mitigating Catastrophic Inheritance of Large Language Models	Yupeng Chang et.al.	2408.04556	link
2024-08-08	On the Asymptotic Convergence of Subgraph Generated Models	Xinchen Xu et.al.	2408.04541	null
2024-08-08	AExGym: Benchmarks and Environments for Adaptive Experimentation	Jimmy Wang et.al.	2408.04531	null
2024-08-08	NFDI4Health workflow and service for synthetic data generation, assessment and risk management	Sobhan Moazemi et.al.	2408.04478	null
2024-08-08	Deep Generative Models in Robotics: A Survey on Learning from Multimodal Demonstrations	Julen Urain et.al.	2408.04380	null
2024-08-08	Making sense of AI systems development	Mateusz Dolata et.al.	2408.04311	null
2024-08-08	AI-Driven Chatbot for Intrusion Detection in Edge Networks: Enhancing Cybersecurity with Ethical User Consent	Mugheez Asif et.al.	2408.04281	null
2024-08-07	Prospects for using drones to test formation-flying CubeSat concepts, and other astronomical applications	John D. Monnier et.al.	2408.03911	null
2024-08-07	Hate Speech Detection and Classification in Amharic Text with Deep Learning	Samuel Minale Gashe et.al.	2408.03849	null
2024-08-07	WalledEval: A Comprehensive Safety Evaluation Toolkit for Large Language Models	Prannaya Gupta et.al.	2408.03837	link
2024-08-07	A broken duet: multistable dynamics of dyadic interactions	Johan Medrano et.al.	2408.03809	link
2024-08-07	Navigating the Human Maze: Real-Time Robot Pathfinding with Generative Imitation Learning	Martin Moder et.al.	2408.03807	link
2024-08-07	Data Generation Scheme for Thermal Modality with Edge-Guided Adversarial Conditional Diffusion Model	Guoqing Zhu et.al.	2408.03748	link
2024-08-07	Local Topology Measures of Contextual Language Model Latent Spaces With Applications to Dialogue Term Extraction	Benjamin Matthias Ruppik et.al.	2408.03706	null
2024-08-07	Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling	Zilyu Ye et.al.	2408.03695	link
2024-08-07	Unsupervised Detection of Fetal Brain Anomalies using Denoising Diffusion Models	Markus Ditlev Sjøgren Olsen et.al.	2408.03654	null
2024-08-07	Goal-oriented Semantic Communication for the Metaverse Application	Zhe Wang et.al.	2408.03646	null
2024-08-06	MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation	Xiaofeng Mao et.al.	2408.03312	null
2024-08-06	IPAdapter-Instruct: Resolving Ambiguity in Image-based Conditioning using Instruct Prompts	Ciara Rowles et.al.	2408.03209	null
2024-08-06	Personalizing Federated Instrument Segmentation with Visual Trait Priors in Robotic Surgery	Jialang Xu et.al.	2408.03208	link
2024-08-06	An Object is Worth 64x64 Pixels: Generating 3D Object via Image Diffusion	Xingguang Yan et.al.	2408.03178	null
2024-08-06	Iterative CT Reconstruction via Latent Variable Optimization of Shallow Diffusion Models	Sho Ozaki et.al.	2408.03156	null
2024-08-06	Enhancing Twitter Bot Detection via Multimodal Invariant Representations	Jibing Gong et.al.	2408.03096	null
2024-08-06	Analysis of Argument Structure Constructions in a Deep Recurrent Language Model	Pegah Ramezani et.al.	2408.03062	null
2024-08-06	OpenOmni: A Collaborative Open Source Tool for Building Future-Ready Multimodal Conversational Agents	Qiang Sun et.al.	2408.03047	link
2024-08-06	Targeted Visual Prompting for Medical Visual Question Answering	Sergio Tascon-Morales et.al.	2408.03043	link
2024-08-06	Training-Free Condition Video Diffusion Models for single frame Spatial-Semantic Echocardiogram Synthesis	Van Phi Nguyen et.al.	2408.03035	link
2024-08-05	Command-line Obfuscation Detection using Small Language Models	Vojtech Outrata et.al.	2408.02637	null
2024-08-05	VidGen-1M: A Large-Scale Dataset for Text-to-video Generation	Zhiyu Tan et.al.	2408.02629	null
2024-08-05	YOWOv3: An Efficient and Generalized Framework for Human Action Detection and Recognition	Duc Manh Nguyen Dang et.al.	2408.02623	link
2024-08-05	LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba	Yunxiang Fu et.al.	2408.02615	link
2024-08-05	MetaParticles: Computationally engineered nanomaterials with tunable and responsive properties	Massimiliano Paesani et.al.	2408.02564	null
2024-08-05	Fairness and Bias Mitigation in Computer Vision: A Survey	Sepehr Dehdashtian et.al.	2408.02464	null
2024-08-05	TGS: Trajectory Generation and Selection using Vision Language Models in Mapless Outdoor Environments	Daeun Song et.al.	2408.02454	null
2024-08-05	Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats in Customized Large Language Models	Zi Liang et.al.	2408.02416	link
2024-08-05	Multi-weather Cross-view Geo-localization Using Denoising Diffusion Models	Tongtong Feng et.al.	2408.02408	null
2024-08-05	A Few-Shot Approach for Relation Extraction Domain Adaptation using Large Language Models	Vanni Zavarella et.al.	2408.02377	null
2024-08-02	Conditional LoRA Parameter Generation	Xiaolong Jin et.al.	2408.01415	null
2024-08-02	Autoencoders in Function Space	Justin Bunker et.al.	2408.01362	link
2024-08-02	MCGMark: An Encodable and Robust Online Watermark for LLM-Generated Malicious Code	Kaiwen Ning et.al.	2408.01354	link
2024-08-02	TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling	Dong Huo et.al.	2408.01291	null
2024-08-02	A General Framework to Boost 3D GS Initialization for Text-to-3D Generation by Lexical Richness	Lutao Jiang et.al.	2408.01269	null
2024-08-02	Exchange control in a MOS double quantum dot made using a 300 mm wafer process	Jacob F. Chittock-Wood et.al.	2408.01241	null
2024-08-02	CLIP4Sketch: Enhancing Sketch to Mugshot Matching through Dataset Augmentation using Diffusion Models	Kushal Kumar Jain et.al.	2408.01233	null
2024-08-02	Reality Fusion: Robust Real-time Immersive Mobile Robot Teleoperation with Volumetric Visual Data Fusion	Ke Li et.al.	2408.01225	link
2024-08-02	PSP-GEN: Stochastic inversion of the Process-Structure-Property chain in materials design through deep, generative probabilistic modeling	Yaohua Zang et.al.	2408.01114	null
2024-08-02	Six Dragons Fly Again: Reviving 15th-Century Korean Court Music with Transformers and Novel Encoding	Danbinaerin Han et.al.	2408.01096	link
2024-08-01	Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation	Yixiao Wang et.al.	2408.00766	null
2024-08-01	Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention	Susung Hong et.al.	2408.00760	link
2024-08-01	DynamoLLM: Designing LLM Inference Clusters for Performance and Energy Efficiency	Jovan Stojkovic et.al.	2408.00741	null
2024-08-01	TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models	Gilad Deutch et.al.	2408.00735	null
2024-08-01	A Natural Language Processing Framework for Hotel Recommendation Based on Users’ Text Reviews	Lavrentia Aravani et.al.	2408.00716	null
2024-08-02	Reinforcement Learning applied to Insurance Portfolio Pursuit	Edward James Young et.al.	2408.00713	link
2024-08-01	MotionFix: Text-Driven 3D Human Motion Editing	Nikos Athanasiou et.al.	2408.00712	null
2024-08-01	Synthetic dual image generation for reduction of labeling efforts in semantic segmentation of micrographs with a customized metric function	Matias Oscar Volman Stern et.al.	2408.00707	null
2024-08-01	AutoM3L: An Automated Multimodal Machine Learning Framework with Large Language Models	Daqin Luo et.al.	2408.00665	link
2024-08-01	Privacy-preserving datasets by capturing feature distributions with Conditional VAEs	Francesco Di Salvo et.al.	2408.00639	link
2024-07-31	Detecting, Explaining, and Mitigating Memorization in Diffusion Models	Yuxin Wen et.al.	2407.21720	link
2024-07-31	Tora: Trajectory-oriented Diffusion Transformer for Video Generation	Zhenghao Zhang et.al.	2407.21705	link
2024-07-31	Generative Diffusion Model for Seismic Imaging Improvement of Sparsely Acquired Data and Uncertainty Quantification	Xingchen Shi et.al.	2407.21683	null
2024-07-31	Quality Control for Radiology Report Generation Models via Auxiliary Auditing Components	Hermione Warr et.al.	2407.21638	null
2024-07-31	LLM-for-X: Application-agnostic Integration of Large Language Models to Support Personal Writing Workflows	Lukas Teufelberger et.al.	2407.21593	null
2024-07-31	Long-term investment and energy procurement risk management under uncertainty for an electrolytic green hydrogen producer	Owen Palmer et.al.	2407.21574	null
2024-07-31	Conditioned Prompt-Optimization for Continual Deepfake Detection	Francesco Laiti et.al.	2407.21554	link
2024-07-31	CXSimulator: A User Behavior Simulation using LLM Embeddings for Web-Marketing Campaign Assessment	Akira Kasuga et.al.	2407.21553	null
2024-07-31	Explainable and Controllable Motion Curve Guided Cardiac Ultrasound Video Generation	Junxuan Yu et.al.	2407.21490	null
2024-07-31	Maverick: Efficient and Accurate Coreference Resolution Defying Recent Trends	Giuliano Martinelli et.al.	2407.21489	link
2024-07-30	Matting by Generation	Zhixiang Wang et.al.	2407.21017	null
2024-07-30	Add-SD: Rational Generation without Manual Reference	Lingfeng Yang et.al.	2407.21016	link
2024-07-30	Integrating Agent-Based and Compartmental Models for Infectious Disease Modeling: A Novel Hybrid Approach	Inan Bostanci et.al.	2407.20993	null
2024-07-30	MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions	Xiaowei Chi et.al.	2407.20962	link
2024-07-30	Mitigating calibration errors from mutual coupling with time-domain filtering of 21 cm cosmological radio observations	N. Charles et.al.	2407.20923	null
2024-07-30	Impact of Geographical Separation on Spectrum Sharing Markets	Kangle Mu et.al.	2407.20909	null
2024-07-30	Dynamic Scene Understanding through Object-Centric Voxelization and Neural Rendering	Yanpeng Zhao et.al.	2407.20908	link
2024-07-30	Vulnerabilities in AI-generated Image Detection: The Challenge of Adversarial Attacks	Yunfeng Diao et.al.	2407.20836	null
2024-07-30	Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning	Norman Di Palo et.al.	2407.20798	null
2024-07-30	SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models	Zheng Liu et.al.	2407.20756	link
2024-07-29	Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing	Ekaterina Iakovleva et.al.	2407.20232	null
2024-07-29	LatentArtiFusion: An Effective and Efficient Histological Artifacts Restoration Framework	Zhenqi He et.al.	2407.20172	link
2024-07-29	Diffusion Feedback Helps CLIP See Better	Wenxuan Wang et.al.	2407.20171	link
2024-07-29	DDAP: Dual-Domain Anti-Personalization against Text-to-Image Diffusion Models	Jing Yang et.al.	2407.20141	null
2024-07-29	Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning	Liyuan Mao et.al.	2407.20109	null
2024-07-29	On the significance of parameters and the projective level in the Choice and Collection axioms	Vladimir Kanovei et.al.	2407.20098	null
2024-07-29	Generative Diffusion Model Bootstraps Zero-shot Classification of Fetal Ultrasound Images In Underrepresented African Populations	Fangyijie Wang et.al.	2407.20072	link
2024-07-29	ImagiNet: A Multi-Content Dataset for Generalizable Synthetic Image Detection via Contrastive Learning	Delyan Boychev et.al.	2407.20020	link
2024-07-29	Reproducibility Study of “ITI-GEN: Inclusive Text-to-Image Generation”	Daniel Gallo Fernández et.al.	2407.19996	link
2024-07-29	HeadsetOff: Enabling Photorealistic Video Conferencing on Economical VR Headsets	Yili Jin et.al.	2407.19988	null
2024-07-26	Generative Adversarial Networks for Imputing Sparse Learning Performance	Liang Zhang et.al.	2407.18875	null
2024-07-26	Unifying Visual and Semantic Feature Spaces with Diffusion Models for Enhanced Cross-Modal Alignment	Yuze Zheng et.al.	2407.18854	null
2024-07-26	Scalable Group Choreography via Variational Phase Manifold Learning	Nhat Le et.al.	2407.18839	null
2024-07-26	Revision of calcium and scandium abundances in Am stars based on NLTE calculations and comparison with diffusion stellar evolution models	L. I. Mashonkina et.al.	2407.18736	null
2024-07-26	BCTR: Bidirectional Conditioning Transformer for Scene Graph Generation	Peng Hao et.al.	2407.18715	null
2024-07-26	Q-gen: A Parameterized Quantum Circuit Generator	Yikai Mao et.al.	2407.18697	link
2024-07-26	Adversarial Robustification via Text-to-Image Diffusion Models	Daewon Choi et.al.	2407.18658	link
2024-07-26	Robust VAEs via Generating Process of Noise Augmented Data	Hiroo Irobe et.al.	2407.18632	null
2024-07-26	Denoising Lévy Probabilistic Models	Dario Shariatian et.al.	2407.18609	link
2024-07-26	How To Segment in 3D Using 2D Models: Automated 3D Segmentation of Prostate Cancer Metastatic Lesions on PET Volumes Using Multi-Angle Maximum Intensity Projections and Diffusion Models	Amirhosein Toosi et.al.	2407.18555	link
2024-07-25	RegionDrag: Fast Region-Based Image Editing with Diffusion Models	Jingyi Lu et.al.	2407.18247	null
2024-07-25	VGGHeads: A Large-Scale Synthetic Dataset for 3D Human Heads	Orest Kupyn et.al.	2407.18245	link
2024-07-25	CodedVO: Coded Visual Odometry	Sachin Shah et.al.	2407.18240	null
2024-07-25	SuperFlow: A Fully-Customized RTL-to-GDS Design Automation Flow for Adiabatic Quantum-Flux-Parametron Superconducting Circuits	Yanyue Xie et.al.	2407.18209	null
2024-07-25	Test2VA: Reusing GUI Test Cases for Voice Assistant Features Development in Mobile Applications	Garrett Weaver et.al.	2407.18155	null
2024-07-25	Self-supervised pre-training with diffusion model for few-shot landmark detection in x-ray images	Roberto Di Via et.al.	2407.18125	null
2024-07-25	Keypoint Promptable Re-Identification	Vladimir Somers et.al.	2407.18112	link
2024-07-25	SSTD: Stripe-Like Space Target Detection using Single-Point Supervision	Zijian Zhu et.al.	2407.18097	null
2024-07-25	Cross-Observatory Coordination with tilepy: A Novel Tool for Observations of Multi-Messenger Transient Events	Monica Seglar-Arroyo et.al.	2407.18076	null
2024-07-25	AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction in the Wild	Junho Park et.al.	2407.18034	link
2024-07-24	SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency	Yiming Xie et.al.	2407.17470	null
2024-07-24	BlueTempNet: A Temporal Multi-network Dataset of Social Interactions in Bluesky Social	Ujun Jeong et.al.	2407.17451	link
2024-07-24	ProvenanceWidgets: A Library of UI Control Elements to Track and Dynamically Overlay Analytic Provenance	Arpit Narechania et.al.	2407.17431	link
2024-07-24	CDDIP: Constrained Diffusion-Driven Deep Image Prior for Seismic Image Reconstruction	Paul Goyes-Peñafiel et.al.	2407.17402	link
2024-07-24	Cosmic ray susceptibility of the Terahertz Intensity Mapper detector arrays	Lun-Jun Liu et.al.	2407.17381	null
2024-07-24	ViPer: Visual Personalization of Generative Models via Individual Preference Learning	Sogand Salehi et.al.	2407.17365	null
2024-07-24	Boosting Large Language Models with Socratic Method for Conversational Mathematics Teaching	Yuyang Ding et.al.	2407.17349	link
2024-07-24	Quantum nonlocal modulation cancellation with distributed clocks	Stephen D. Chapman et.al.	2407.17330	null
2024-07-25	Enhanced Deep Learning Methodologies and MRI Selection Techniques for Dementia Diagnosis in the Elderly Population	Nikolaos Ntampakis et.al.	2407.17324	null
2024-07-24	Edge-Cloud Continuum Orchestration of Critical Services: A Smart-City Approach	Rodrigo Rosmaninho et.al.	2407.17314	null
2024-07-23	Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions	Fabio Tosi et.al.	2407.16698	link
2024-07-23	From Imitation to Refinement – Residual RL for Precise Visual Assembly	Lars Ankile et.al.	2407.16677	null
2024-07-23	RedAgent: Red Teaming Large Language Models with Context-aware Autonomous Language Agent	Huiyu Xu et.al.	2407.16667	null
2024-07-23	MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence	Canyu Zhao et.al.	2407.16655	null
2024-07-23	Unveiling and Mitigating Bias in Audio Visual Segmentation	Peiwen Sun et.al.	2407.16638	null
2024-07-23	Knowledge-driven AI-generated data for accurate and interpretable breast ultrasound diagnoses	Haojun Yu et.al.	2407.16634	null
2024-07-23	GenRec: A Flexible Data Generator for Recommendations	Erica Coppolillo et.al.	2407.16594	link
2024-07-23	COALA: A Practical and Vision-Centric Federated Learning Platform	Weiming Zhuang et.al.	2407.16560	link
2024-07-23	DreamVTON: Customizing 3D Virtual Try-on with Personalized Diffusion Models	Zhenyu Xie et.al.	2407.16511	null
2024-07-23	qMRI Diffusor: Quantitative T1 Mapping of the Brain using a Denoising Diffusion Probabilistic Model	Shishuai Wang et.al.	2407.16477	null
2024-07-22	Artist: Aesthetically Controllable Text-Driven Stylization without Training	Ruixiang Jiang et.al.	2407.15842	link
2024-07-23	A Large-scale Benchmark Dataset for Commuting Origin-destination Matrix Generation	Can Rong et.al.	2407.15823	link
2024-07-22	Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget	Vikash Sehwag et.al.	2407.15811	link
2024-07-22	Quantum Computing for Phonon Scattering Effects on Thermal Conductivity	Xiangjun Tan et.al.	2407.15808	null
2024-07-22	Enhancing Mass Customization Manufacturing: Multiobjective Metaheuristic Algorithms for flow shop Production in Smart Industry	Diego Rossit et.al.	2407.15802	null
2024-07-22	Diffusion Model Based Resource Allocation Strategy in Ultra-Reliable Wireless Networked Control Systems	Amirhassan Babazadeh Darabi et.al.	2407.15784	null
2024-07-22	A Hamilton-Jacobi approach to road-field reaction-diffusion models	Christopher Henderson et.al.	2407.15760	null
2024-07-22	Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond	Silvio Galesso et.al.	2407.15739	link
2024-07-22	DStruct2Design: Data and Benchmarks for Data Structure Driven Generative Floor Plan Design	Zhi Hao Luo et.al.	2407.15723	link
2024-07-22	Estimating Probability Densities with Transformer and Denoising Diffusion	Henry W. Leung et.al.	2407.15703	link
2024-07-19	DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks	Sarah Jabbour et.al.	2407.14509	null
2024-07-19	On Pre-training of Multimodal Language Models Customized for Chart Understanding	Wan-Cyuan Fan et.al.	2407.14506	null
2024-07-19	T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation	Kaiyue Sun et.al.	2407.14505	link
2024-07-19	M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models	Seunggeun Chi et.al.	2407.14502	null
2024-07-19	A Precision Cryogenic Positioning Stage for Detector Dithering and Flexure Compensation	Stephen A. Smee et.al.	2407.14493	null
2024-07-19	Contrastive Learning with Counterfactual Explanations for Radiology Report Generation	Mingjie Li et.al.	2407.14474	null
2024-07-19	Describe Data to get Science-Data-Ready Tooling: Awkward as a Target for Kaitai Struct YAML	Manasvi Goyal et.al.	2407.14461	null
2024-07-19	Co-synthesis of Histopathology Nuclei Image-Label Pairs using a Context-Conditioned Joint Diffusion Model	Seonghui Min et.al.	2407.14434	null
2024-07-19	Controllable and Efficient Multi-Class Pathology Nuclei Data Augmentation using Text-Conditioned Diffusion Models	Hyun-Jic Oh et.al.	2407.14426	null
2024-07-19	GLAudio Listens to the Sound of the Graph	Aurelio Sulser et.al.	2407.14387	link
2024-07-18	LogoSticker: Inserting Logos into Diffusion Models for Customized Generation	Mingkang Zhu et.al.	2407.13752	null
2024-07-18	Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review	Masatoshi Uehara et.al.	2407.13734	link
2024-07-18	Shaded Route Planning Using Active Segmentation and Identification of Satellite Images	Longchao Da et.al.	2407.13689	null
2024-07-18	PASTA: Controllable Part-Aware Shape Generation with Autoregressive Transformers	Songlin Li et.al.	2407.13677	link
2024-07-18	MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis	Ziming Zhong et.al.	2407.13675	link
2024-07-18	Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models	Xiaoyu Zhu et.al.	2407.13642	null
2024-07-18	Training-free Composite Scene Generation for Layout-to-Image Synthesis	Jiaqi Liu et.al.	2407.13609	link
2024-07-18	EnergyDiff: Universal Time-Series Energy Data Generation using Diffusion Models	Nan Lin et.al.	2407.13538	link
2024-07-18	VeriQR: A Robustness Verification Tool for Quantum Machine Learning Models	Yanling Lin et.al.	2407.13533	null
2024-07-18	All Roads Lead to Rome? Exploring Representational Similarities Between Latent Spaces of Generative Image Models	Charumathi Badrinath et.al.	2407.13449	link
2024-07-17	SMooDi: Stylized Motion Diffusion Model	Lei Zhong et.al.	2407.12783	null
2024-07-17	VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control	Sherwin Bahmani et.al.	2407.12781	null
2024-07-17	Hallucination Index: An Image Quality Metric for Generative Reconstruction Models	Matthew Tivnan et.al.	2407.12780	null
2024-07-17	GroundUp: Rapid Sketch-Based 3D City Massing	Gizem Esra Unlu et.al.	2407.12739	null
2024-07-17	EchoSight: Advancing Visual-Language Models with Wiki Knowledge	Yibin Yan et.al.	2407.12735	null
2024-07-17	NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model	Zhongqun Zhang et.al.	2407.12727	null
2024-07-17	An Evaluation of Continual Learning for Advanced Node Semiconductor Defect Inspection	Amit Prasad et.al.	2407.12724	null
2024-07-17	Unlocking planetesimal magnetic field histories: a refined, versatile model for thermal evolution and dynamo generation	Hannah R. Sanderson et.al.	2407.12721	null
2024-07-17	SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow	Yuanzhi Zhu et.al.	2407.12718	link
2024-07-17	Teleoperation in Robot-assisted MIS with Adaptive RCM via Admittance Control	Ehsan Nasiri et.al.	2407.12711	null
2024-07-16	Efficient Training with Denoised Neural Weights	Yifan Gong et.al.	2407.11966	null
2024-07-16	UrbanWorld: An Urban World Model for 3D City Generation	Yu Shang et.al.	2407.11965	link
2024-07-16	Context-Guided Diffusion for Out-of-Distribution Molecular and Protein Design	Leo Klarner et.al.	2407.11942	link
2024-07-16	Code Documentation and Analysis to Secure Software Development	Paul Attie et.al.	2407.11934	null
2024-07-16	Global Optimisation of Black-Box Functions with Generative Models in the Wasserstein Space	Tigran Ramazyan et.al.	2407.11917	link
2024-07-16	Quantised Global Autoencoder: A Holistic Approach to Representing Visual Data	Tim Elsner et.al.	2407.11913	null
2024-07-16	Data-Juicer Sandbox: A Comprehensive Suite for Multimodal Data-Model Co-development	Daoyuan Chen et.al.	2407.11784	link
2024-07-16	Diffusion-driven self-assembly of emerin nanodomains at the nuclear envelope	Carlos D. Alas et.al.	2407.11758	null
2024-07-16	Generating Multi-Modal and Multi-Attribute Single-Cell Counts with CFGen	Alessandro Palma et.al.	2407.11734	link
2024-07-16	Theoretical Insights into CycleGAN: Analyzing Approximation and Estimation Errors in Unpaired Data Generation	Luwei Sun et.al.	2407.11678	null
2024-07-15	Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusion	Yongyuan Liang et.al.	2407.10973	null
2024-07-15	Fast Matrix Multiplications for Lookup Table-Quantized LLMs	Han Guo et.al.	2407.10960	link
2024-07-15	InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models	Nirat Saini et.al.	2407.10958	null
2024-07-16	DataDream: Few-shot Guided Dataset Generation	Jae Myung Kim et.al.	2407.10910	link
2024-07-15	Optical Diffusion Models for Image Generation	Ilker Oguz et.al.	2407.10897	null
2024-07-15	R3D-AD: Reconstruction via Diffusion for 3D Anomaly Detection	Zheyuan Zhou et.al.	2407.10862	null
2024-07-15	Physics-Inspired Generative Models in Medical Imaging: A Review	Dennis Hein et.al.	2407.10856	null
2024-07-15	Inferring dark energy properties from the scale factor parametrisation	Upala Mukhopadhayay et.al.	2407.10845	null
2024-07-15	MoE-DiffIR: Task-customized Diffusion Priors for Universal Compressed Image Restoration	Yulin Ren et.al.	2407.10833	null
2024-07-15	Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation	Tu Vu et.al.	2407.10817	null
2024-07-12	StyleSplat: 3D Object Style Transfer with Gaussian Splatting	Sahil Jain et.al.	2407.09473	null
2024-07-12	FairyLandAI: Personalized Fairy Tales utilizing ChatGPT and DALLE-3	Georgios Makridis et.al.	2407.09467	null
2024-07-12	The $μ\mathcal{G}$ Language for Programming Graph Neural Networks	Matteo Belenchia et.al.	2407.09441	null
2024-07-12	Graph Neural Network Causal Explanation via Neural Causal Models	Arman Behnam et.al.	2407.09378	link
2024-07-12	Computationally Efficient Estimation of Large Probit Models	Patrick Ding et.al.	2407.09371	null
2024-07-12	Is Contrasting All You Need? Contrastive Learning for the Detection and Attribution of AI-generated Text	Lucio La Cava et.al.	2407.09364	null
2024-07-15	Any-Property-Conditional Molecule Generation with Self-Criticism using Spanning Trees	Alexia Jolicoeur-Martineau et.al.	2407.09357	link
2024-07-12	PID: Physics-Informed Diffusion Model for Infrared Image Generation	Fangyuan Mao et.al.	2407.09299	link
2024-07-12	Learning Distances from Data with Normalizing Flows and Score Matching	Peter Sorrenson et.al.	2407.09297	null
2024-07-12	Surgical Text-to-Image Generation	Chinedu Innocent Nwoye et.al.	2407.09230	null
2024-07-11	Video Diffusion Alignment via Reward Gradients	Mihir Prabhudesai et.al.	2407.08737	link
2024-07-11	Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models	Zhening Xing et.al.	2407.08701	null
2024-07-11	FAR-Trans: An Investment Dataset for Financial Asset Recommendation	Javier Sanz-Cruzado et.al.	2407.08692	null
2024-07-11	Scattering transforms on the sphere, application to large scale structure modelling	Louise Mousset et.al.	2407.08687	null
2024-07-11	CAD-Prompted Generative Models: A Pathway to Feasible and Novel Engineering Designs	Leah Chong et.al.	2407.08675	null
2024-07-11	Still-Moving: Customized Video Generation without Customized Video Data	Hila Chefer et.al.	2407.08674	null
2024-07-11	Controlling the Fidelity and Diversity of Deep Generative Models via Pseudo Density	Shuangqi Li et.al.	2407.08659	null
2024-07-11	Adaptive Smooth Non-Stationary Bandits	Joe Suk et.al.	2407.08654	link
2024-07-11	Fine-Tuning Stable Diffusion XL for Stylistic Icon Generation: A Comparison of Caption Size	Youssef Sultan et.al.	2407.08513	null
2024-07-11	Latent Conditional Diffusion-based Data Augmentation for Continuous-Time Dynamic Graph Mode	Yuxing Tian et.al.	2407.08500	null
2024-07-10	Generative Image as Action Models	Mohit Shridhar et.al.	2407.07875	link
2024-07-10	Dynamical Measure Transport and Neural PDE Solvers for Sampling	Jingtong Sun et.al.	2407.07873	null
2024-07-10	Controlling Space and Time with Diffusion Models	Daniel Watson et.al.	2407.07860	null
2024-07-10	Generic Numerical Analysis of Stochastic Reaction Diffusion Model with applications in excitable media	Yahya Alnashri et.al.	2407.07834	null
2024-07-10	Universal and non-universal signatures in the scaling functions of critical variables	Gianluca Teza et.al.	2407.07782	null
2024-07-10	Towards Human-Like Driving: Active Inference in Autonomous Vehicle Control	Elahe Delavari et.al.	2407.07684	null
2024-07-10	VEnhancer: Generative Space-Time Enhancement for Video Generation	Jingwen He et.al.	2407.07667	null
2024-07-10	A Coding-Theoretic Analysis of Hyperspherical Prototypical Learning Geometry	Martin Lindström et.al.	2407.07664	link
2024-07-10	The heterogeneous impact of the EU-Canada agreement with causal machine	Lionel Fontagné et.al.	2407.07652	null
2024-07-11	MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis	Wanggui He et.al.	2407.07614	link
2024-07-09	ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction	Shaozhe Hao et.al.	2407.07077	link
2024-07-09	Latent Space Imaging	Matheus Souza et.al.	2407.07052	null
2024-07-09	Generative models of astrophysical fields with scattering transforms on the sphere	Louise Mousset et.al.	2407.07007	link
2024-07-10	PEER: Expertizing Domain-Specific Tasks with a Multi-Agent Framework and Tuning Methods	Yiying Wang et.al.	2407.06985	link
2024-07-09	Parameter-Efficient and Memory-Efficient Tuning for Vision Transformer: A Disentangled Approach	Taolin Zhang et.al.	2407.06964	null
2024-07-09	RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models	Bowen Zhang et.al.	2407.06938	null
2024-07-09	HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance	Guian Fang et.al.	2407.06937	link
2024-07-09	Fine-grained large-scale content recommendations for MSX sellers	Manpreet Singh et.al.	2407.06910	null
2024-07-09	Enhanced Battery Degradation-Aware Scheduling for Distribution Network with Electric Vehicle Load	Vijay Babu Pamshetti et.al.	2407.06857	null
2024-07-09	A reaction-diffusion model for relapsing-remitting multiple sclerosis with a treatment term	Romina Travaglini et.al.	2407.06802	null
2024-07-08	Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images	Zhangyang Qi et.al.	2407.06191	null
2024-07-08	CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation	Xinying Guo et.al.	2407.06188	null
2024-07-08	JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation	Yu Zeng et.al.	2407.06187	null
2024-07-08	The Tug-of-War Between Deepfake Generation and Detection	Hannah Lee et.al.	2407.06174	null
2024-07-08	ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation	Ethan Chern et.al.	2407.06135	link
2024-07-08	Structured Generations: Using Hierarchical Clusters to guide Diffusion Models	Jorge da Silva Goncalves et.al.	2407.06124	link
2024-07-08	PerlDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models	Jinhua Zhang et.al.	2407.06109	link
2024-07-08	Accelerating Diffusion for SAR-to-Optical Image Translation via Adversarial Consistency Distillation	Xinyu Bai et.al.	2407.06095	null
2024-07-08	Assessing Cardiomegaly in Dogs Using a Simple CNN Model	Nikhil Deekonda et.al.	2407.06092	null
2024-07-08	Layered Diffusion Model for One-Shot High Resolution Text-to-Image Synthesis	Emaad Khwaja et.al.	2407.06079	null
2024-07-05	RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulation	Yuxuan Kuang et.al.	2407.04689	link
2024-07-05	Thermal and mechanical study of a parametrised cryostat model for optical characterisation of upcoming CMB experiments	Thomas J. L. J. Gascard et.al.	2407.04613	link
2024-07-08	PartCraft: Crafting Creative Objects by Parts	Kam Woh Ng et.al.	2407.04604	link
2024-07-05	Structural Constraint Integration in Generative Model for Discovery of Quantum Material Candidates	Ryotaro Okabe et.al.	2407.04557	null
2024-07-05	Unified continuous-time q-learning for mean-field game and mean-field control problems	Xiaoli Wei et.al.	2407.04521	null
2024-07-08	Speed-accuracy trade-off for the diffusion models: Wisdom from nonequilibrium thermodynamics and optimal transport	Kotaro Ikeda et.al.	2407.04495	null
2024-07-05	PROUD: PaRetO-gUided Diffusion Model for Multi-objective Generation	Yinghua Yao et.al.	2407.04493	link
2024-07-05	Dude: Dual Distribution-Aware Context Prompt Learning For Large Vision-Language Model	Duy M. H. Nguyen et.al.	2407.04489	null
2024-07-05	Leveraging Graph Structures to Detect Hallucinations in Large Language Models	Noa Nonkes et.al.	2407.04485	link
2024-07-05	VCD-Texture: Variance Alignment based 3D-2D Co-Denoising for Text-Guided Texturing	Shang Liu et.al.	2407.04461	null
2024-07-03	DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents	Yilun Xu et.al.	2407.03300	link
2024-07-03	Improved Noise Schedule for Diffusion Training	Tiankai Hang et.al.	2407.03297	null
2024-07-03	Anomaly-based Framework for Detecting Power Overloading Cyberattacks in Smart Grid AMI	Abdelaziz Amara Korba et.al.	2407.03264	null
2024-07-03	SOS! Soft Prompt Attack Against Open-Source Large Language Models	Ziqing Yang et.al.	2407.03160	null
2024-07-04	Spatio-Temporal Adaptive Diffusion Models for EEG Super-Resolution in Epilepsy Diagnosis	Tong Zhou et.al.	2407.03089	null
2024-07-03	Artificial Inductive Bias for Synthetic Tabular Data Generation in Data-Scarce Scenarios	Patricia A. Apellániz et.al.	2407.03080	link
2024-07-03	Electromagnetic Property Sensing Based on Diffusion Model in ISAC System	Yuhua Jiang et.al.	2407.03075	null
2024-07-03	Semantic-Aware Power Allocation for Generative Semantic Communications with Foundation Models	Chunmei Xu et.al.	2407.03050	null
2024-07-03	SlerpFace: Face Template Protection via Spherical Linear Interpolation	Zhizhou Zhong et.al.	2407.03043	null
2024-07-03	An Organism Starts with a Single Pix-Cell: A Neural Cellular Diffusion for High-Resolution Image Synthesis	Marawan Elbatel et.al.	2407.03018	link
2024-07-02	Magic Insert: Style-Aware Drag-and-Drop	Nataniel Ruiz et.al.	2407.02489	null
2024-07-02	Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models	Fei Shen et.al.	2407.02482	link
2024-07-02	A Pattern Language for Machine Learning Tasks	Benjamin Rodatz et.al.	2407.02424	null
2024-07-02	GCF: Graph Convolutional Networks for Facial Expression Recognition	Hozaifa Kassab et.al.	2407.02361	link
2024-07-02	MORPHEUS: Modeling Role from Personalized Dialogue History by Exploring and Utilizing Latent Space	Yihong Tang et.al.	2407.02345	null
2024-07-02	Choice-based time slot management in attended home delivery	Dorsa Abdolhamidi et.al.	2407.02339	null
2024-07-02	Mining Constraints from Reference Process Models for Detecting Best-Practice Violations in Event Log	Adrian Rebmann et.al.	2407.02336	link
2024-07-02	A tactical time slot management problem under mixed logit demand	Dorsa Abdolhamidi et.al.	2407.02308	null
2024-07-02	Renard: A Modular Pipeline for Extracting Character Networks from Narrative Texts	Arthur Amalvy et.al.	2407.02284	link
2024-07-03	Federated Distillation for Medical Image Classification: Towards Trustworthy Computer-Aided Diagnosis	Sufen Ren et.al.	2407.02261	null
2024-06-28	Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language	Yicheng Chen et.al.	2406.20085	null
2024-06-28	The hybrid Josephson rhombus: A superconducting element with tailored current-phase relation	L. Banszerus et.al.	2406.20082	null
2024-06-28	HouseCrafter: Lifting Floorplans to 3D Scenes with 2D Diffusion Model	Hieu T. Nguyen et.al.	2406.20077	null
2024-06-28	Modeling and LQR Control of Insect Sized Flapping Wing Robot	Daksh Dhingra et.al.	2406.20061	null
2024-06-28	Neural Differentiable Modeling with Diffusion-Based Super-resolution for Two-Dimensional Spatiotemporal Turbulence	Xiantao Fan et.al.	2406.20047	null
2024-06-28	Electrostatics-based particle sampling and approximate inference	Yongchao Huang et.al.	2406.20044	link
2024-06-28	HAITCH: A Framework for Distortion and Motion Correction in Fetal Multi-Shell Diffusion-Weighted MRI	Haykel Snoussi et.al.	2406.20042	null
2024-06-28	Concept Lens: Visually Analyzing the Consistency of Semantic Manipulation in GANs	Sangwon Jeong et.al.	2406.19987	null
2024-07-01	Text2Robot: Evolutionary Robot Design from Text Descriptions	Ryan P. Ringel et.al.	2406.19963	link
2024-06-28	Kolmogorov-Smirnov GAN	Maciej Falkiewicz et.al.	2406.19948	link
2024-06-27	Looking 3D: Anomaly Detection with 2D-3D Alignment	Ankan Bhunia et.al.	2406.19393	link
2024-06-27	Taming Data and Transformers for Audio Generation	Moayed Haji-Ali et.al.	2406.19388	null
2024-06-27	Emergence of Hidden Capabilities: Exploring Learning Dynamics in Concept Space	Core Francisco Park et.al.	2406.19370	link
2024-06-27	Accelerating Multiphase Flow Simulations with Denoising Diffusion Model Driven Initializations	Jaehong Chung et.al.	2406.19333	null
2024-06-27	Subtractive Training for Music Stem Insertion using Latent Diffusion Models	Ivan Villa-Renteria et.al.	2406.19328	null
2024-06-27	Efficient World Models with Context-Aware Tokenization	Vincent Micheli et.al.	2406.19320	link
2024-06-27	PNeRV: A Polynomial Neural Representation for Videos	Sonam Gupta et.al.	2406.19299	null
2024-06-27	Compositional Image Decomposition with Diffusion Models	Jocelin Su et.al.	2406.19298	null
2024-06-27	BISeizuRe: BERT-Inspired Seizure Data Representation to Improve Epilepsy Monitoring	Luca Benfenati et.al.	2406.19189	null
2024-06-27	On Pólya-Young urn models and growth processes	Markus Kuba et.al.	2406.19110	null
2024-06-26	MatchTime: Towards Automatic Soccer Game Commentary Generation	Jiayuan Rao et.al.	2406.18530	link
2024-06-26	MultiDiff: Consistent Novel View Synthesis from a Single Image	Norman Müller et.al.	2406.18524	null
2024-06-26	Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration	Kang Liao et.al.	2406.18516	link
2024-06-26	DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis through Structure Guidance	Younghyun Kim et.al.	2406.18459	link
2024-06-26	Cascading Large Language Models for Salient Event Graph Generation	Xingwei Tan et.al.	2406.18449	link
2024-06-26	Repeat and Concatenate: 2D to 3D Image Translation with 3D to 3D Generative Modeling	Abril Corona-Figueroa et.al.	2406.18422	link
2024-06-26	Towards diffusion models for large-scale sea-ice modelling	Tobias Sebastian Finn et.al.	2406.18417	null
2024-06-27	Stable Diffusion Segmentation for Biomedical Images with Single-step Reverse Process	Tianyu Lin et.al.	2406.18361	link
2024-06-26	Molecular Diffusion Models with Virtual Receptors	Matan Halfon et.al.	2406.18330	null
2024-06-27	Weak Reward Model Transforms Generative Models into Robust Causal Event Extraction Systems	Italo Luis da Silva et.al.	2406.18245	link
2024-06-25	DiffusionPDE: Generative PDE-Solving Under Partial Observation	Jiahe Huang et.al.	2406.17763	link
2024-06-25	MotionBooth: Motion-Aware Customized Text-to-Video Generation	Jianzong Wu et.al.	2406.17758	null
2024-06-25	Accelerating Clinical Evidence Synthesis with Large Language Models	Zifeng Wang et.al.	2406.17755	null
2024-06-25	Extensions of Panjer’s recursion for mixed compound distributions	Spyridon M. Tzaninis et.al.	2406.17726	null
2024-06-25	PANDA: A self-driving lab for studying electrodeposited polymer films	Harley Quinn et.al.	2406.17725	null
2024-06-25	Unified Auto-Encoding with Masked Diffusion	Philippe Hansen-Estruch et.al.	2406.17688	link
2024-06-25	LaTable: Towards Large Tabular Models	Boris van Breugel et.al.	2406.17673	null
2024-06-26	SpecMaskGIT: Masked Generative Modeling of Audio Spectrograms for Efficient Audio Synthesis and Beyond	Marco Comunità et.al.	2406.17672	null
2024-06-25	Banishing LLM Hallucinations Requires Rethinking Generalization	Johnny Li et.al.	2406.17642	null
2024-06-25	The experience of humans’ and robots’ mutual (im)politeness in enacted service scenarios: An empirical study	Victor Kaptelinin et.al.	2406.17641	null
2024-06-24	FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models	Haonan Qiu et.al.	2406.16863	link
2024-06-24	Dreamitate: Real-World Visuomotor Policy Learning via Video Generation	Junbang Liang et.al.	2406.16862	null
2024-06-24	DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation	Yuang Peng et.al.	2406.16855	link
2024-06-24	USDC: A Dataset of $\underline{U}$ser $\underline{S}$tance and $\underline{D}$ogmatism in Long $\underline{C}$ onversations	Mounika Marreddy et.al.	2406.16833	null
2024-06-24	General Binding Affinity Guidance for Diffusion Models in Structure-Based Drug Design	Yue Jian et.al.	2406.16821	link
2024-06-24	ClotheDreamer: Text-Guided Garment Generation with 3D Gaussians	Yufei Liu et.al.	2406.16815	null
2024-06-24	Conformal time series decomposition with component-wise exchangeability	Derck W. E. Prinzhorn et.al.	2406.16766	link
2024-06-24	Inferring stochastic low-rank recurrent neural networks from neural data	Matthijs Pals et.al.	2406.16749	link
2024-06-24	Portrait3D: 3D Head Generation from Single In-the-wild Portrait Image	Jinkun Hao et.al.	2406.16710	null
2024-06-24	Geometry-Aware Score Distillation via 3D Consistent Noising and Gradient Consistency Modeling	Min-Seop Kwak et.al.	2406.16695	null
2024-06-21	Masked Extended Attention for Zero-Shot Virtual Try-On In The Wild	Nadav Orzech et.al.	2406.15331	null
2024-06-21	Rethinking Remote Sensing Change Detection With A Mask View	Xiaowen Ma et.al.	2406.15320	link
2024-06-21	You Only Acquire Sparse-channel (YOAS): A Unified Framework for Dense-channel EEG Generation	Hongyu Chen et.al.	2406.15269	null
2024-06-21	Evaluating Diversity in Automatic Poetry Generation	Yanran Chen et.al.	2406.15267	link
2024-06-21	Fingerprint Membership and Identity Inference Against Generative Adversarial Networks	Saverio Cavasin et.al.	2406.15253	null
2024-06-21	MantisScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation	Xuan He et.al.	2406.15252	null
2024-06-21	Unsupervised Bayesian Generation of Synthetic CT from CBCT Using Patient-Specific Score-Based Prior	Junbo Peng et.al.	2406.15219	null
2024-06-21	Sound and Fury, Signifying Nothing? Impact of Data Breach Disclosure Laws	Muhammad Zia Hydari et.al.	2406.15215	null
2024-06-21	Injecting Bias in Text-To-Image Models via Composite-Trigger Backdoors	Ali Naseh et.al.	2406.15213	link
2024-06-21	Exploring the Efficacy of Robotic Assistants with ChatGPT and Claude in Enhancing ADHD Therapy: Innovating Treatment Paradigms	Santiago Berrezueta-Guzman et.al.	2406.15198	null
2024-06-20	A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models	Xincheng Shuai et.al.	2406.14555	link
2024-06-21	Advancing Fine-Grained Classification by Structure and Subject Preserving Augmentation	Eyal Michaeli et.al.	2406.14551	link
2024-06-20	Consistency Models Made Easy	Zhengyang Geng et.al.	2406.14548	link
2024-06-20	IRASim: Learning Interactive Real-Robot Action Simulators	Fangqi Zhu et.al.	2406.14540	null
2024-06-20	Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps	Nikita Starodubcev et.al.	2406.14539	null
2024-06-20	Fantastic Copyrighted Beasts and How (Not) to Generate Them	Luxi He et.al.	2406.14526	null
2024-06-20	Photoacoustic methane detection assisted by a gas-filled anti-resonant hollow-core fiber laser	Cuiling Zhang et.al.	2406.14521	null
2024-06-20	V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data	Rotem Shalev-Arkushin et.al.	2406.14510	null
2024-06-20	CodeRAG-Bench: Can Retrieval Augment Code Generation?	Zora Zhiruo Wang et.al.	2406.14497	link
2024-06-20	SafeSora: Towards Safety Alignment of Text2Video Generation via a Human Preference Dataset	Josef Dai et.al.	2406.14477	link
2024-06-20	CollaFuse: Collaborative Diffusion Models	Simeon Allmendinger et.al.	2406.14429	link
2024-06-20	Active Diffusion Subsampling	Oisin Nolan et.al.	2406.14388	link
2024-06-20	Multicoloured Hardcore Model: Fast Mixing and Queueing	Sam Olesker-Taylor et.al.	2406.14376	null
2024-06-20	FairX: A comprehensive benchmarking tool for model analysis using fairness, utility, and explainability	Md Fahim Sikder et.al.	2406.14281	link
2024-06-20	In Tree Structure Should Sentence Be Generated	Yaguang Li et.al.	2406.14189	link
2024-06-20	CriDiff: Criss-cross Injection Diffusion Framework via Generative Pre-train for Prostate Segmentation	Tingwei Liu et.al.	2406.14186	link
2024-06-20	Tractable Equilibrium Computation in Markov Games through Risk Aversion	Eric Mazumdar et.al.	2406.14156	null
2024-06-20	ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning	Zhongjie Duan et.al.	2406.14130	link
2024-06-20	Dye4AI: Assuring Data Boundary on Generative AI Services	Shu Wang et.al.	2406.14114	null
2024-06-20	HeartBeat: Towards Controllable Echocardiography Video Synthesis with Multimodal Conditions-Guided Diffusion Models	Xinrui Zhou et.al.	2406.14098	null
2024-06-20	Bridging bulk and surface: An interacting particle system towards the field-road diffusion model	Matthieu Alfaro et.al.	2406.14093	null
2024-06-20	A Practical Diffusion Path for Sampling	Omar Chehab et.al.	2406.14040	null
2024-06-20	Leveraging eBPF and AI for Ransomware Nose Out	Arjun Sekar et.al.	2406.14020	null
2024-06-20	Feature Fusion Based on Mutual-Cross-Attention Mechanism for EEG Emotion Recognition	Yimin Zhao et.al.	2406.14014	link
2024-06-20	Exploring Changes in Nation Perception with Nationality-Assigned Personas in LLMs	Mahammed Kamruzzaman et.al.	2406.13993	null
2024-06-20	The Elusive Pursuit of Replicating PATE-GAN: Benchmarking, Auditing, Debugging	Georgi Ganev et.al.	2406.13985	link
2024-06-20	Similarity-aware Syncretic Latent Diffusion Model for Medical Image Translation with Representation Learning	Tingyi Lin et.al.	2406.13977	null
2024-06-20	Synthesizing Multimodal Electronic Health Records via Predictive Diffusion Models	Yuan Zhong et.al.	2406.13942	null
2024-06-20	EnTruth: Enhancing the Traceability of Unauthorized Dataset Usage in Text-to-image Diffusion Models with Minimal and Robust Alterations	Jie Ren et.al.	2406.13933	null
2024-06-20	Generative AI for Enhancing Active Learning in Education: A Comparative Study of GPT-3.5 and GPT-4 in Crafting Customized Test Questions	Hamdireza Rouzegar et.al.	2406.13903	null
2024-06-19	INFusion: Diffusion Regularized Implicit Neural Representations for 2D and 3D accelerated MRI reconstruction	Yamin Arefeen et.al.	2406.13895	null
2024-06-19	Open Generative Large Language Models for Galician	Pablo Gamallo et.al.	2406.13893	null
2024-06-19	StackRAG Agent: Improving Developer Answers with Retrieval-Augmented Generation	Davit Abrahamyan et.al.	2406.13840	link
2024-06-19	RNA-FrameFlow: Flow Matching for de novo 3D RNA Backbone Design	Rishabh Anand et.al.	2406.13839	link
2024-06-19	COAC: Cross-layer Optimization of Accelerator Configurability for Efficient CNN Processing	Steven Colleman et.al.	2406.13752	null
2024-06-19	GenAI-Bench: Evaluating and Improving Compositional Text-to-Visual Generation	Baiqi Li et.al.	2406.13743	link
2024-06-19	Tree-Sliced Wasserstein Distance on a System of Lines	Viet-Hoang Tran et.al.	2406.13725	null
2024-06-19	Hitchhiker’s guide on Energy-Based Models: a comprehensive review on the relation with other generative models, sampling and statistical physics	Davide Carbone et.al.	2406.13661	null
2024-06-19	Towards Minimal Targeted Updates of Language Models with Targeted Negative Training	Lily H. Zhang et.al.	2406.13660	link
2024-06-19	Stability and Generalizability in SDE Diffusion Models with Measure-Preserving Dynamics	Weitong Zhang et.al.	2406.13652	null
2024-06-19	On AI-Inspired UI-Design	Jialiang Wei et.al.	2406.13631	null
2024-06-19	Can AI be enabled to dynamical downscaling? Training a Latent Diffusion Model to mimic km-scale COSMO-CLM downscaling of ERA5 over Italy	Elena Tomasi et.al.	2406.13627	link
2024-06-19	Enhance the Image: Super Resolution using Artificial Intelligence in MRI	Ziyu Li et.al.	2406.13625	null
2024-06-19	Generative Modeling by Minimizing the Wasserstein-2 Loss	Yu-Jui Huang et.al.	2406.13619	null
2024-06-19	Parameter Training Efficiency Aware Resource Allocation for AIGC in Space-Air-Ground Integrated Networks	Liangxin Qian et.al.	2406.13602	null
2024-06-19	ModSec-Learn: Boosting ModSecurity with Machine Learning	Christian Scano et.al.	2406.13547	link
2024-06-19	Towards Cyber Threat Intelligence for the IoT	Alfonso Iacovazzi et.al.	2406.13543	null
2024-06-19	Image Distillation for Safe Data Sharing in Histopathology	Zhe Li et.al.	2406.13536	link
2024-06-19	Diffusion-based Generative Modeling with Discriminative Guidance for Streamable Speech Enhancement	Chenda Li et.al.	2406.13471	null
2024-06-19	Unifying nonlinearly constrained nonconvex optimization	Charlie Vanaret et.al.	2406.13454	link
2024-06-19	Federating to Grow Transformers with Constrained Resources without Model Sharing	Shikun Shen et.al.	2406.13450	null
2024-06-19	Multi-messenger modeling of the Monogem pulsar halo	Youyou Li et.al.	2406.13426	null
2024-06-19	Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images	Haruo Fujiwara et.al.	2406.13393	null
2024-06-19	Effective Edge-wise Representation Learning in Edge-Attributed Bipartite Graphs	Hewen Wang et.al.	2406.13369	null
2024-06-19	Situational Instructions Database: Task Guidance in Dynamic Environments	Muhammad Saif Ullah Khan et.al.	2406.13302	link
2024-06-19	ARDuP: Active Region Video Diffusion for Universal Policies	Shuaiyi Huang et.al.	2406.13301	null
2024-06-19	AniFaceDiff: High-Fidelity Face Reenactment via Facial Parametric Conditioned Diffusion Models	Ken Chen et.al.	2406.13272	null
2024-06-19	Self-Supervised Diffusion Model for 3-D Seismic Data Reconstruction	Xinyang Wang et.al.	2406.13252	null
2024-06-19	Optimizing Inventory Management through Multiobjective Reverse Logistics with Environmental Impact	I. B. Wadhawan et.al.	2406.13226	null
2024-06-19	Neural Residual Diffusion Models for Deep Scalable Vision Generation	Zhiyuan Ma et.al.	2406.13215	null
2024-06-19	Surgical Triplet Recognition via Diffusion Model	Daochang Liu et.al.	2406.13210	null
2024-06-19	Diffusion Model-based FOD Restoration from High Distortion in dMRI	Shuo Huang et.al.	2406.13209	null
2024-06-19	Toward Structure Fairness in Dynamic Graph Embedding: A Trend-aware Dual Debiasing Approach	Yicong Li et.al.	2406.13201	link
2024-06-19	Synthetic Context Generation for Question Generation	Naiming Liu et.al.	2406.13188	null
2024-06-19	Conditional score-based diffusion models for solving inverse problems in mechanics	Agnimitra Dasgupta et.al.	2406.13154	null
2024-06-19	von Mises Quasi-Processes for Bayesian Circular Regression	Yarden Cohen et.al.	2406.13151	null
2024-06-19	MCAD: Multi-modal Conditioned Adversarial Diffusion Model for High-Quality PET Image Reconstruction	Jiaqi Cui et.al.	2406.13150	null
2024-06-19	GVT2RPM: An Empirical Study for General Video Transformer Adaptation to Remote Physiological Measurement	Hao Wang et.al.	2406.13136	null
2024-06-19	Thruster-Assisted Incline Walking	Kaushik Venkatesh Krishnamurthy et.al.	2406.13118	null
2024-06-18	Sampling 3D Gaussian Scenes in Seconds with Latent Diffusion Models	Paul Henderson et.al.	2406.13099	null
2024-06-18	RITA: A Real-time Interactive Talking Avatars Framework	Wuxinlin Cheng et.al.	2406.13093	null
2024-06-18	PIPPIN: Generating variable length full events from partons	Guillaume Quétant et.al.	2406.13074	link
2024-06-18	MaskPure: Improving Defense Against Text Adversaries with Stochastic Purification	Harrison Gietz et.al.	2406.13066	link
2024-06-18	Traffic Prediction considering Multiple Levels of Spatial-temporal Information: A Multi-scale Graph Wavelet-based Approach	Zilin Bian et.al.	2406.13038	null
2024-06-18	Sharp detection of low-dimensional structure in probability measures via dimensional logarithmic Sobolev inequalities	Matthew T. C. Li et.al.	2406.13036	null
2024-06-18	Data Plagiarism Index: Characterizing the Privacy Risk of Data-Copying in Tabular Generative Models	Joshua Ward et.al.	2406.13012	null
2024-06-18	Synergizing Foundation Models and Federated Learning: A Survey	Shenghui Li et.al.	2406.12844	null
2024-06-18	Evaluating the design space of diffusion-based generative models	Yuqing Wang et.al.	2406.12839	null
2024-06-18	Neural Approximate Mirror Maps for Constrained Diffusion Models	Berthy T. Feng et.al.	2406.12816	null
2024-06-19	AITTI: Learning Adaptive Inclusive Token for Text-to-Image Generation	Xinyu Hou et.al.	2406.12805	link
2024-06-18	Extracting Training Data from Unconditional Diffusion Models	Yunhao Chen et.al.	2406.12752	null
2024-06-18	Useful stochastic bounds in time-varying queues with service and patience times having general joint distribution	Shreehari Anand Bodas et.al.	2406.12745	null
2024-06-18	SUPER: Selfie Undistortion and Head Pose Editing with Identity Preservation	Polina Karpikova et.al.	2406.12700	null
2024-06-18	Speak in the Scene: Diffusion-based Acoustic Scene Transfer toward Immersive Speech Generation	Miseul Kim et.al.	2406.12688	null
2024-06-18	GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models	Yongtao Ge et.al.	2406.12671	link
2024-06-18	Research and Implementation of Data Enhancement Techniques for Graph Neural Networks	Jingzhao Gu et.al.	2406.12640	null
2024-06-18	News Without Borders: Domain Adaptation of Multilingual Sentence Embeddings for Cross-lingual News Recommendation	Andreea Iana et.al.	2406.12634	link
2024-06-18	Learning Diffusion at Lightspeed	Antonio Terpin et.al.	2406.12616	null
2024-06-18	Unmasking the Veil: An Investigation into Concept Ablation for Privacy and Copyright Protection in Images	Shivank Garg et.al.	2406.12592	link
2024-06-18	Behavior-Dependent Linear Recurrent Units for Efficient Sequential Recommendation	Chengkai Liu et.al.	2406.12580	link
2024-06-18	Training Diffusion Models with Federated Learning	Matthijs de Goede et.al.	2406.12575	null
2024-06-18	P-Tailor: Customizing Personality Traits for Language Models via Mixture of Specialized LoRA Experts	Yuhao Dan et.al.	2406.12548	null
2024-06-18	Structured Detection for Simultaneous Super-Resolution and Optical Sectioning in Laser Scanning Microscopy	Alessandro Zunino et.al.	2406.12542	link
2024-06-18	Variational Distillation of Diffusion Policies into Mixture of Experts	Hongyi Zhou et.al.	2406.12538	null
2024-06-18	HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors	Panwang Pan et.al.	2406.12459	link
2024-06-18	Planning Using Schrödinger Bridge Diffusion Models	Adarsh Srivastava et.al.	2406.12458	link
2024-06-18	Deep Temporal Deaggregation: Large-Scale Spatio-Temporal Generative Models	David Bergström et.al.	2406.12423	null
2024-06-18	ROVER: RTL Optimization via Verified E-Graph Rewriting	Samuel Coward et.al.	2406.12421	null
2024-06-18	TADM: Temporally-Aware Diffusion Model for Neurodegenerative Progression on Brain MRI	Mattia Litrico et.al.	2406.12411	null
2024-06-18	SDNIA-YOLO: A Robust Object Detection Model for Extreme Weather Conditions	Yuexiong Ding et.al.	2406.12395	null

Vision-Language Models

Publish Date	Title	Authors	PDF	Code
2025-07-23	DataWink: Reusing and Adapting SVG-based Visualization Examples with Large Multimodal Models	Liwenhan Xie et.al.	2507.17734	null
2025-07-23	RoadBench: A Vision-Language Foundation Model and Benchmark for Road Damage Understanding	Xi Xiao et.al.	2507.17353	null
2025-07-23	Met $^2$ Net: A Decoupled Two-Stage Spatio-Temporal Forecasting Model for Complex Meteorological Systems	Shaohan Li et.al.	2507.17189	null
2025-07-22	VL-CLIP: Enhancing Multimodal Recommendations via Visual Grounding and LLM-Augmented CLIP Embeddings	Ramin Giahi et.al.	2507.17080	null
2025-07-22	Machine learning-based multimodal prognostic models integrating pathology images and high-throughput omic data for overall survival prediction in cancer: a systematic review	Charlotte Jennings et.al.	2507.16876	null
2025-07-20	TD-Interpreter: Enhancing the Understanding of Timing Diagrams with Visual-Language Learning	Jie He et.al.	2507.16844	null
2025-07-22	Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning	Ang Li et.al.	2507.16746	null
2025-07-21	Applying multimodal learning to Classify transient Detections Early (AppleCiDEr) I: Data set, methods, and infrastructure	Alexandra Junell et.al.	2507.16088	null
2025-07-21	Leveraging Context for Multimodal Fallacy Classification in Political Debates	Alessio Pittiglio et.al.	2507.15641	null
2025-07-23	Privacy-Preserving Multimodal News Recommendation through Federated Learning	Mehdi Khalaj et.al.	2507.15460	null
2025-07-21	MEETI: A Multimodal ECG Dataset from MIMIC-IV-ECG with Signals, Images, Features and Interpretations	Deyun Zhang et.al.	2507.15255	null
2025-07-20	InsightX Agent: An LMM-based Agentic Framework with Integrated Tools for Reliable X-ray NDT Analysis	Jiale Liu et.al.	2507.14899	null
2025-07-20	Benchmarking Foundation Models with Multimodal Public Electronic Health Records	Kunyu Yu et.al.	2507.14824	null
2025-07-20	LeAdQA: LLM-Driven Context-Aware Temporal Grounding for Video Question Answering	Xinxin Dong et.al.	2507.14784	null
2025-07-19	On the robustness of modeling grounded word learning through a child’s egocentric input	Wai Keen Vong et.al.	2507.14749	null
2025-07-19	Docopilot: Improving Multimodal Models for Document-Level Understanding	Yuchen Duan et.al.	2507.14675	null
2025-07-19	Multimodal AI for Gastrointestinal Diagnostics: Tackling VQA in MEDVQA-GI 2025	Sujata Gaihre et.al.	2507.14544	null
2025-07-18	A million-scale dataset and generalizable foundation model for nanomaterial-protein interactions	Hengjie Yu et.al.	2507.14245	null
2025-07-18	Team of One: Cracking Complex Video QA with Model Synergy	Jun Xie et.al.	2507.13820	null
2025-07-18	MaskHOI: Robust 3D Hand-Object Interaction Estimation via Masked Pre-training	Yuechen Xie et.al.	2507.13673	null
2025-07-17	SEER: Semantic Enhancement and Emotional Reasoning Network for Multimodal Fake News Detection	Peican Zhu et.al.	2507.13415	null
2025-07-17	Differential-informed Sample Selection Accelerates Multimodal Contrastive Learning	Zihua Zhao et.al.	2507.12998	null
2025-07-17	City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning	Penglei Sun et.al.	2507.12795	null
2025-07-17	A Comprehensive Survey of Electronic Health Record Modeling: From Deep Learning Approaches to Large Language Models	Weijieying Ren et.al.	2507.12774	null
2025-07-16	Fly, Fail, Fix: Iterative Game Repair with Reinforcement Learning and Large Multimodal Models	Alex Zook et.al.	2507.12666	null
2025-07-16	Multimodal Coordinated Online Behavior: Trade-offs and Strategies	Lorenzo Mannocci et.al.	2507.12108	null
2025-07-15	Partitioner Guided Modal Learning Framework	Guimin Hu et.al.	2507.11661	null
2025-07-15	MFGDiffusion: Mask-Guided Smoke Synthesis for Enhanced Forest Fire Detection	Guanghao Wu et.al.	2507.11252	null
2025-07-15	A Robust Incomplete Multimodal Low-Rank Adaptation Approach for Emotion Recognition	Xinkui Zhao et.al.	2507.11202	null
2025-07-15	A Survey on Interpretability in Visual Recognition	Qiyang Wan et.al.	2507.11099	null
2025-07-14	Boosting Multimodal Learning via Disentangled Gradient Learning	Shicai Wei et.al.	2507.10213	null
2025-07-14	Improving Multimodal Learning via Imbalanced Learning	Shicai Wei et.al.	2507.10203	null
2025-07-14	Cross-modal Associations in Vision and Language Models: Revisiting the bouba-kiki effect	Tom Kouwenhoven et.al.	2507.10013	null
2025-07-13	ExpStar: Towards Automatic Commentary Generation for Multi-discipline Scientific Experiments	Jiali Chen et.al.	2507.09693	null
2025-07-13	Bridging Bots: from Perception to Action via Multimodal-LMs and Knowledge Graphs	Margherita Martorana et.al.	2507.09617	null
2025-07-13	HMID-Net: An Exploration of Masked Image Modeling and Knowledge Distillation in Hyperbolic Space	Changli Wang et.al.	2507.09487	null
2025-07-12	Scaling Laws for Optimal Data Mixtures	Mustafa Shukor et.al.	2507.09404	null
2025-07-11	VIP: Visual Information Protection through Adversarial Attacks on Vision-Language Models	Hanene F. Z. Brachemi Meftah et.al.	2507.08982	null
2025-07-08	Unveiling Effective In-Context Configurations for Image Captioning: An External & Internal Analysis	Li Li et.al.	2507.08021	null
2025-07-10	Impact of Pretraining Word Co-occurrence on Compositional Generalization in Multimodal Models	Helen Qu et.al.	2507.08000	null
2025-07-10	Towards Interpretable Time Series Foundation Models	Matthieu Boileau et.al.	2507.07439	null
2025-07-10	EPIC: Efficient Prompt Interaction for Text-Image Classification	Xinyao Yu et.al.	2507.07415	null
2025-07-09	LinguaMark: Do Multimodal Models Speak Fairly? A Benchmark-Based Evaluation	Ananya Raval et.al.	2507.07274	null
2025-07-09	Robust Multimodal Learning Framework For Intake Gesture Detection Using Contactless Radar and Wearable IMU Sensors	Chunzhuo Wang et.al.	2507.07261	null
2025-07-09	Explainable Artificial Intelligence in Biomedical Image Analysis: A Comprehensive Survey	Getamesay Haile Dagnaw et.al.	2507.07148	null
2025-07-09	Evaluating Large Multimodal Models for Nutrition Analysis: A Benchmark Enriched with Contextual Metadata	Bruce Coburn et.al.	2507.07048	null
2025-07-07	SPARC: Concept-Aligned Sparse Autoencoders for Cross-Model and Cross-Modal Interpretability	Ali Nasiri-Sarvi et.al.	2507.06265	null
2025-07-08	Enhancing Synthetic CT from CBCT via Multimodal Fusion and End-To-End Registration	Maximilian Tschuchnig et.al.	2507.06067	null
2025-07-08	Exploring Partial Multi-Label Learning via Integrating Semantic Co-occurrence Knowledge	Xin Wu et.al.	2507.05992	null
2025-07-08	Graph Learning	Feng Xia et.al.	2507.05636	null
2025-07-07	Cultivating Multimodal Intelligence: Interpretive Reasoning and Agentic RAG Approaches to Dermatological Diagnosis	Karishma Thakrar et.al.	2507.05520	null
2025-07-07	Transcribing Spanish Texts from the Past: Experiments with Transkribus, Tesseract and Granite	Yanco Amor Torterolo-Orta et.al.	2507.04878	null
2025-07-07	SPATIA: Multimodal Model for Prediction and Generation of Spatial Cell Phenotypes	Zhenglun Kong et.al.	2507.04704	null
2025-07-07	Trojan Horse Prompting: Jailbreaking Conversational Multimodal Models by Forging Assistant Message	Wei Duan et.al.	2507.04673	null
2025-07-07	MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding	Zhicheng Zhang et.al.	2507.04635	null
2025-07-05	Are Learning-Based Approaches Ready for Real-World Indoor Navigation? A Case for Imitation Learning	Nigitha Selvaraj et.al.	2507.04086	null
2025-07-08	BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset	Zhiheng Xi et.al.	2507.03483	null
2025-07-04	MGAA: Multi-Granular Adaptive Allocation fof Low-Rank Compression of LLMs	Guangyan Li et.al.	2507.03294	null
2025-07-03	Intelligent Histology for Tumor Neurosurgery	Xinhai Hou et.al.	2507.03037	null
2025-07-01	Gated Recursive Fusion: A Stateful Approach to Scalable Multimodal Transformers	Yusuf Shihata et.al.	2507.02985	null
2025-07-02	Activation Reward Models for Few-Shot Model Alignment	Tianning Chai et.al.	2507.01368	null
2025-07-02	PULSE: Practical Evaluation Scenarios for Large Multimodal Model Unlearning	Tatsuki Kawakami et.al.	2507.01271	null
2025-07-07	Escaping Plato’s Cave: JAM for Aligning Independently Trained Vision and Language Models	Hyoseo et.al.	2507.01201	null
2025-06-27	XxaCT-NN: Structure Agnostic Multimodal Learning for Materials Science	Jithendaraa Subramanian et.al.	2507.01054	null
2025-07-02	Just Noticeable Difference for Large Multimodal Models	Zijian Chen et.al.	2507.00490	null
2025-06-30	MotionGPT3: Human Motion as a Second Modality	Bingfan Zhu et.al.	2506.24086	null
2025-06-30	Reinforcement Fine-Tuning Enables MLLMs Learning Novel Tasks Stably	Zhihao Zhang et.al.	2506.23508	null
2025-06-29	Decoding Memes: Benchmarking Narrative Role Classification across Multilingual and Multimodal Models	Shivam Sharma et.al.	2506.23122	null
2025-06-27	Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning	Zuyao You et.al.	2506.22624	null
2025-06-27	Test-Time Consistency in Vision Language Models	Shih-Han Chou et.al.	2506.22395	null
2025-06-27	Can Video Large Multimodal Models Think Like Doubters-or Double-Down: A Study on Defeasible Video Entailment	Yue Zhang et.al.	2506.22385	null
2025-06-27	Sheaf-Based Decentralized Multimodal Learning for Next-Generation Wireless Communication Systems	Abdulmomen Ghalkha et.al.	2506.22374	null
2025-06-26	ImplicitQA: Going beyond frames towards Implicit Video Reasoning	Sirnam Swetha et.al.	2506.21742	null
2025-06-28	G $^{2}$ D: Boosting Multimodal Learning with Gradient-Guided Distillation	Mohammed Rakib et.al.	2506.21514	null
2025-06-26	LLaVA-Pose: Enhancing Human Pose and Action Understanding via Keypoint-Integrated Instruction Tuning	Dewen Zhang et.al.	2506.21317	null
2025-06-26	Agent-RewardBench: Towards a Unified Benchmark for Reward Modeling across Perception, Planning, and Safety in Real-World Multimodal Agents	Tianyi Men et.al.	2506.21252	null
2025-06-26	V2X-REALM: Vision-Language Model-Based Robust End-to-End Cooperative Autonomous Driving with Adaptive Long-Tail Modeling	Junwei You et.al.	2506.21041	null
2025-06-26	TRIDENT: Tri-Modal Molecular Representation Learning with Taxonomic Annotations and Local Correspondence	Feng Jiang et.al.	2506.21028	null
2025-06-26	Bridging Video Quality Scoring and Justification via Large Multimodal Models	Qizhi Xie et.al.	2506.21011	null
2025-06-26	Where is AIED Headed? Key Topics and Emerging Frontiers (2020-2024)	Shihui Feng et.al.	2506.20971	null
2025-06-25	MMSearch-R1: Incentivizing LMMs to Search	Jinming Wu et.al.	2506.20670	null
2025-06-25	Personalized Mental State Evaluation in Human-Robot Interaction using Federated Learning	Andrea Bussolan et.al.	2506.20212	null
2025-06-24	Emergence of Text Readability in Vision Language Models	Jaeyoo Park et.al.	2506.19389	null
2025-06-23	TAMMs: Temporal-Aware Multimodal Model for Satellite Image Change Understanding and Forecasting	Zhongbin Guo et.al.	2506.18862	null
2025-06-23	OpenEvents V1: Large-Scale Benchmark Dataset for Multimodal Event Grounding	Hieu Nguyen et.al.	2506.18372	null
2025-06-23	BrainSymphony: A Transformer-Driven Fusion of fMRI Time Series and Structural Connectivity	Moein Khajehnejad et.al.	2506.18314	null
2025-06-27	Haptic-ACT – Pseudo Oocyte Manipulation by a Robot Using Multimodal Information and Action Chunking with Transformers	Pedro Miguel Uriguen Eljuri et.al.	2506.18212	null
2025-06-22	ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation	Junying Chen et.al.	2506.18095	null
2025-06-22	MUPA: Towards Multi-Path Agentic Reasoning for Grounded Video Question Answering	Jisheng Dang et.al.	2506.18071	null
2025-06-25	PP-DocBee2: Improved Baselines with Efficient Data for Multimodal Document Understanding	Kui Huang et.al.	2506.18023	null
2025-06-25	PhysUniBench: An Undergraduate-Level Physics Reasoning Benchmark for Multimodal Models	Lintao Wang et.al.	2506.17667	null
2025-06-21	Can Generated Images Serve as a Viable Modality for Text-Centric Multimodal Learning?	Yuesheng Huang et.al.	2506.17623	null
2025-06-24	AI-based Multimodal Biometrics for Detecting Smartphone Distractions: Application to Online Learning	Alvaro Becerra et.al.	2506.17364	null
2025-06-20	MUCAR: Benchmarking Multilingual Cross-Modal Ambiguity Resolution for Multimodal Large Language Models	Xiaolong Wang et.al.	2506.17046	null
2025-06-20	With Limited Data for Multimodal Alignment, Let the STRUCTURE Guide You	Fabian Gröger et.al.	2506.16895	null
2025-06-19	Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding	Vishesh Tripathi et.al.	2506.16035	null
2025-06-18	A Strong View-Free Baseline Approach for Single-View Image Guided Point Cloud Completion	Fangzhou Lin et.al.	2506.15747	null
2025-06-20	Show-o2: Improved Native Unified Multimodal Models	Jinheng Xie et.al.	2506.15564	link
2025-06-17	MoTE: Mixture of Ternary Experts for Memory-efficient Large Multimodal Models	Hongyu Wang et.al.	2506.14435	null
2025-06-16	Comparison of ConvNeXt and Vision-Language Models for Breast Density Assessment in Screening Mammography	Yusdivia Molina-Román et.al.	2506.13964	null
2025-06-16	ASMR: Augmenting Life Scenario using Large Generative Models for Robotic Action Reflection	Shang-Chi Tsai et.al.	2506.13956	null
2025-06-16	A Survey on World Models Grounded in Acoustic Physical Information	Xiaoliang Chen et.al.	2506.13833	link
2025-06-16	Discrete Diffusion in Large Language and Multimodal Models: A Survey	Runpeng Yu et.al.	2506.13759	link
2025-06-22	Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model	Shaolei Zhang et.al.	2506.13642	link
2025-06-16	MambaMia: A State-Space-Model-Based Compression for Efficient Video Understanding in Large Multimodal Models	Geewook Kim et.al.	2506.13564	null
2025-06-16	A Survey on Imitation Learning for Contact-Rich Tasks in Robotics	Toshiaki Tsuji et.al.	2506.13498	null
2025-06-19	Hierarchical Multi-Positive Contrastive Learning for Patent Image Retrieval	Kshitij Kavimandan et.al.	2506.13496	null
2025-06-16	Efficient Medical VIE via Reinforcement Learning	Lijun Liu et.al.	2506.13363	null
2025-06-16	Fatigue-Aware Adaptive Interfaces for Wearable Devices Using Deep Learning	Yikan Wang et.al.	2506.13203	null
2025-06-16	Equitable Electronic Health Record Prediction with FAME: Fairness-Aware Multimodal Embedding	Nikkie Hooman et.al.	2506.13104	null
2025-06-16	FinLMM-R1: Enhancing Financial Reasoning in LMM through Scalable Data and Reward Design	Kai Lan et.al.	2506.13066	null
2025-06-16	Rethinking Explainability in the Era of Multimodal AI	Chirag Agarwal et.al.	2506.13060	null
2025-06-16	Stress-Testing Multimodal Foundation Models for Crystallographic Reasoning	Can Polat et.al.	2506.13051	link
2025-06-17	HKD4VLM: A Progressive Hybrid Knowledge Distillation Framework for Robust Multimodal Hallucination and Factuality Detection in VLMs	Zijian Zhang et.al.	2506.13038	null
2025-06-15	Learning to Fuse: Modality-Aware Adaptive Scheduling for Robust Multimodal Foundation Models	Liam Bennett et.al.	2506.12733	null
2025-06-15	Dynamic Modality Scheduling for Multimodal Large Models via Confidence, Uncertainty, and Semantic Consistency	Hiroshi Tanaka et.al.	2506.12724	null
2025-06-14	InverTune: Removing Backdoors from Multimodal Contrastive Learning Models via Trigger Inversion and Activation Tuning	Mengyuan Sun et.al.	2506.12411	null
2025-06-13	VFaith: Do Large Multimodal Models Really Reason on Seen Images Rather than Previous Memories?	Jiachen Yu et.al.	2506.11571	null
2025-06-16	Improving Multimodal Learning Balance and Sufficiency through Data Remixing	Xiaoyu Ma et.al.	2506.11550	null
2025-06-13	Investigating Vulnerabilities and Defenses Against Audio-Visual Attacks: A Comprehensive Survey Emphasizing Multimodal Models	Jinming Wen et.al.	2506.11521	null
2025-06-13	RollingQ: Reviving the Cooperation Dynamics in Multimodal Transformer	Haotian Ni et.al.	2506.11465	null
2025-06-13	Dynamic Double Space Tower	Weikai Sun et.al.	2506.11394	null
2025-06-13	Benchmarking Multimodal LLMs on Recognition and Understanding over Chemical Tables	Yitong Zhou et.al.	2506.11375	null
2025-06-12	Combining Log Data and Collaborative Dialogue Features to Predict Project Quality in Middle School AI Education	Conrad Borchers et.al.	2506.11326	null
2025-06-12	Multimodal Modeling of CRISPR-Cas12 Activity Using Foundation Models and Chromatin Accessibility Data	Azim Dehghani Amirabad et.al.	2506.11182	null
2025-06-12	Developing a High-performance Framework for Speech Emotion Recognition in Naturalistic Conditions Challenge for Emotional Attribute Prediction	Thanathai Lertpetchpun et.al.	2506.10930	null
2025-06-12	CreatiPoster: Towards Editable and Controllable Multi-Layer Graphic Design Generation	Zhao Zhang et.al.	2506.10890	link
2025-06-12	Prompts to Summaries: Zero-Shot Language-Guided Video Summarization	Mario Barbara et.al.	2506.10807	null
2025-06-12	Pisces: An Auto-regressive Foundation Model for Image Understanding and Generation	Zhiyang Xu et.al.	2506.10395	null
2025-06-11	One Patient, Many Contexts: Scaling Medical AI Through Contextual Intelligence	Michelle M. Li et.al.	2506.10157	null
2025-06-11	ChartReasoner: Code-Driven Modality Bridging for Long-Chain Reasoning in Chart Question Answering	Caijun Jia et.al.	2506.10116	null
2025-06-11	CausalVQA: A Physically Grounded Causal Reasoning Benchmark for Video Models	Aaron Foss et.al.	2506.09943	link
2025-06-11	Dynamic Sub-region Search in Homogeneous Collections Using CLIP	Bastian Jäckl et.al.	2506.09506	null
2025-06-11	A High-Quality Dataset and Reliable Evaluation for Interleaved Image-Text Generation	Yukang Feng et.al.	2506.09427	null
2025-06-11	Hidden in Plain Sight: Evaluation of the Deception Detection Capabilities of LLMs in Multimodal Settings	Md Messal Monem Miah et.al.	2506.09424	null
2025-06-11	Ming-Omni: A Unified Multimodal Model for Perception and Generation	Inclusion AI et.al.	2506.09344	link
2025-06-10	FlagEvalMM: A Flexible Framework for Comprehensive Multimodal Model Evaluation	Zheqi He et.al.	2506.09081	link
2025-06-09	Segment Any Architectural Facades (SAAF):An automatic segmentation model for building facades, walls and windows based on multimodal semantics guidance	Peilin Li et.al.	2506.09071	null
2025-06-10	Enhancing Synthetic CT from CBCT via Multimodal Fusion: A Study on the Impact of CBCT Quality and Alignment	Maximilian Tschuchnig et.al.	2506.08716	null
2025-06-13	LLaVA-c: Continual Improved Visual Instruction Tuning	Wenzhuo Liu et.al.	2506.08666	null
2025-06-10	MOSAIC-F: A Framework for Enhancing Students’ Oral Presentation Skills through Personalized Feedback	Alvaro Becerra et.al.	2506.08634	null
2025-06-10	Detecting Harmful Memes with Decoupled Understanding and Guided CoT Reasoning	Fengjun Pan et.al.	2506.08477	null
2025-06-09	Instruction-Tuned Video-Audio Models Elucidate Functional Specialization in the Brain	Subba Reddy Oota et.al.	2506.08277	link
2025-06-09	CXR-LT 2024: A MICCAI challenge on long-tailed, multi-label, and zero-shot disease classification from chest X-ray	Mingquan Lin et.al.	2506.07984	null
2025-06-12	Reinforcing Multimodal Understanding and Generation with Dual Self-rewards	Jixiang Hong et.al.	2506.07963	null
2025-06-09	Uncertainty-o: One Model-agnostic Framework for Unveiling Uncertainty in Large Multimodal Models	Ruiyang Zhang et.al.	2506.07575	null
2025-06-08	Speech Recognition on TV Series with Video-guided Post-Correction	Haoyuan Yang et.al.	2506.07323	null
2025-06-08	A Narrative Review on Large AI Models in Lung Cancer Screening, Diagnosis, and Treatment Planning	Jiachen Zhong et.al.	2506.07236	null
2025-06-08	Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning	Tianyi Bai et.al.	2506.07227	null
2025-06-08	Learning Compact Vision Tokens for Efficient Large Multimodal Models	Hao Tang et.al.	2506.07138	link
2025-06-08	A Layered Self-Supervised Knowledge Distillation Framework for Efficient Multimodal Learning on the Edge	Tarique Dahri et.al.	2506.07055	null
2025-06-10	Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning	LASA Team et.al.	2506.07044	null
2025-06-08	A Culturally-diverse Multilingual Multimodal Video Benchmark & Model	Bhuiyan Sanjid Shafique et.al.	2506.07032	null
2025-06-08	MAGNET: A Multi-agent Framework for Finding Audio-Visual Needles by Reasoning over Multi-Video Haystacks	Sanjoy Chowdhury et.al.	2506.07016	null
2025-06-08	LaTtE-Flow: Layerwise Timestep-Expert Flow-based Transformer	Ying Shen et.al.	2506.06952	null
2025-06-10	Meta-Adaptive Prompt Distillation for Few-Shot Visual Question Answering	Akash Gupta et.al.	2506.06905	null
2025-06-07	VisioMath: Benchmarking Figure-based Mathematical Reasoning in LMMs	Can Li et.al.	2506.06727	null
2025-06-06	Bridging Audio and Vision: Zero-Shot Audiovisual Segmentation by Connecting Pretrained Models	Seung-jae Lee et.al.	2506.06537	null
2025-06-06	Building Models of Neurological Language	Henry Watkins et.al.	2506.06208	null
2025-06-06	Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning	Sheng Chen et.al.	2506.06205	null
2025-06-06	MoralCLIP: Contrastive Alignment of Vision-and-Language Representations with Moral Foundations Theory	Ana Carolina Condez et.al.	2506.05696	null
2025-06-05	When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding	Yan Shu et.al.	2506.05551	null
2025-06-05	Refer to Anything with Vision-Language Prompts	Shengcao Cao et.al.	2506.05342	null
2025-06-05	VideoMolmo: Spatio-Temporal Grounding Meets Pointing	Ghazi Shazan Ahmad et.al.	2506.05336	link
2025-06-05	Unleashing Hour-Scale Video Training for Long Video-Language Understanding	Jingyang Lin et.al.	2506.05332	null
2025-06-05	Quantifying Cross-Modality Memorization in Vision-Language Models	Yuxin Wen et.al.	2506.05198	null
2025-06-09	Do Large Language Models Judge Error Severity Like Humans?	Diege Sun et.al.	2506.05142	null
2025-06-05	A Survey on Vietnamese Document Analysis and Recognition: Challenges and Future Directions	Anh Le et.al.	2506.05061	null
2025-06-05	Line of Sight: On Linear Representations in VLLMs	Achyuta Rajaram et.al.	2506.04706	null
2025-06-05	Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations	Linjie Li et.al.	2506.04633	link
2025-06-04	Struct2D: A Perception-Guided Framework for Spatial Reasoning in Large Multimodal Models	Fangrui Zhu et.al.	2506.04220	null
2025-06-04	Voice Activity Projection Model with Multimodal Encoders	Takeshi Saga et.al.	2506.03980	null
2025-06-04	EmoArt: A Multidimensional Dataset for Emotion-Aware Artistic Generation	Cheng Zhang et.al.	2506.03652	null
2025-06-03	Seeing the Arrow of Time in Large Multimodal Models	Zihui Xue et.al.	2506.03340	null
2025-06-03	DFBench: Benchmarking Deepfake Image Detection Capability of Large Multimodal Models	Jiarui Wang et.al.	2506.03007	null
2025-06-03	Enriching Location Representation with Detailed Semantic Information	Junyuan Liu et.al.	2506.02744	null
2025-06-02	Entity Image and Mixed-Modal Image Retrieval Datasets	Cristian-Ioan Blaga et.al.	2506.02291	null
2025-06-02	RoboEgo System Card: An Omnimodal Model with Native Full Duplexity	Yiqun Yao et.al.	2506.01934	null
2025-06-02	Is Extending Modality The Right Path Towards Omni-Modality?	Tinghui Zhu et.al.	2506.01872	null
2025-06-02	ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and Understanding	Junliang Ye et.al.	2506.01853	link
2025-06-02	Generate, Not Recommend: Personalized Multimodal Content Generation	Jiongnan Liu et.al.	2506.01704	null
2025-06-02	EarthMind: Towards Multi-Granular and Multi-Sensor Earth Observation with Large Multimodal Models	Yan Shu et.al.	2506.01667	null
2025-06-02	Confidence-Aware Self-Distillation for Multimodal Sentiment Analysis with Incomplete Modalities	Yanxi Luo et.al.	2506.01490	null
2025-06-02	Contra4: Evaluating Contrastive Cross-Modal Reasoning in Audio, Video, Image, and 3D	Artemis Panagopoulou et.al.	2506.01275	null
2025-06-01	TIME: TabPFN-Integrated Multimodal Engine for Robust Tabular-Image Learning	Jiaqi Luo et.al.	2506.00813	null
2025-05-30	Beyond Atomic Geometry Representations in Materials Science: A Human-in-the-Loop Multimodal Framework	Can Polat et.al.	2506.00302	link
2025-05-30	Mixpert: Mitigating Multimodal Learning Conflicts with Efficient Mixture-of-Vision-Experts	Xin He et.al.	2505.24541	null
2025-05-30	When Large Multimodal Models Confront Evolving Knowledge:Challenges and Pathways	Kailin Jiang et.al.	2505.24449	link
2025-05-29	Towards disentangling the contributions of articulation and acoustics in multimodal phoneme recognition	Sean Foley et.al.	2505.24059	null
2025-05-29	Semantics-Guided Generative Image Compression	Cheng-Lin Wu et.al.	2505.24015	link
2025-06-02	Jigsaw-R1: A Study of Rule-based Visual Reinforcement Learning with Jigsaw Puzzles	Zifu Wang et.al.	2505.23590	link
2025-05-29	Evaluating AI capabilities in detecting conspiracy theories on YouTube	Leonardo La Rocca et.al.	2505.23570	link
2025-05-29	OmniEarth-Bench: Towards Holistic Evaluation of Earth’s Six Spheres and Cross-Spheres Interactions with Multimodal Observational Earth Data	Fengxiang Wang et.al.	2505.23522	null
2025-05-29	Bidirectional predictive coding	Gaspard Oliviers et.al.	2505.23415	null
2025-05-29	UniRL: Self-Improving Unified Multimodal Models via Supervised and Reinforcement Learning	Weijia Mao et.al.	2505.23380	link
2025-05-29	ChartMind: A Comprehensive Benchmark for Complex Real-world Multimodal Chart Question Answering	Jingxuan Wei et.al.	2505.23242	null
2025-05-29	Elicit and Enhance: Advancing Multimodal Reasoning in Medical Scenarios	Linjie Mu et.al.	2505.23118	null
2025-05-29	Deep Modeling and Optimization of Medical Image Classification	Yihang Wu et.al.	2505.23040	link
2025-05-28	VidText: Towards Comprehensive Evaluation for Video Text Understanding	Zhoufaran Yang et.al.	2505.22810	link
2025-05-28	SAM-R1: Leveraging SAM for Reward Feedback in Multimodal Segmentation via Reinforcement Learning	Jiaqi Huang et.al.	2505.22596	null
2025-05-28	Thinking with Generated Images	Ethan Chern et.al.	2505.22525	null
2025-05-28	From Large AI Models to Agentic AI: A Tutorial on Future Intelligent Communications	Feibo Jiang et.al.	2505.22311	null
2025-05-29	YH-MINER: Multimodal Intelligent System for Natural Ecological Reef Metric Extraction	Mingzhuang Wang et.al.	2505.22250	null
2025-05-28	Flexible Tool Selection through Low-dimensional Attribute Alignment of Vision and Language	Guangfu Hao et.al.	2505.22146	null
2025-05-28	SridBench: Benchmark of Scientific Research Illustration Drawing of Image Generation Model	Yifan Chang et.al.	2505.22126	null
2025-05-28	Pearl: A Multimodal Culturally-Aware Arabic Instruction Dataset	Fakhraddin Alwajih et.al.	2505.21979	null
2025-05-27	A Cross Modal Knowledge Distillation & Data Augmentation Recipe for Improving Transcriptomics Representations through Morphological Features	Ihab Bendidi et.al.	2505.21317	null
2025-05-27	Predicting Implicit Arguments in Procedural Video Instructions	Anil Batra et.al.	2505.21068	null
2025-05-27	PARTONOMY: Large Multimodal Models with Part-Level Visual Understanding	Ansel Blume et.al.	2505.20759	null
2025-05-27	Understand, Think, and Answer: Advancing Visual Reasoning with Large Multimodal Models	Yufei Zhan et.al.	2505.20753	link
2025-05-26	CPathAgent: An Agent-based Foundation Model for Interpretable High-Resolution Pathology Image Analysis Mimicking Pathologists’ Diagnostic Logic	Yuxuan Sun et.al.	2505.20510	null
2025-05-26	MangaVQA and MangaLMM: A Benchmark and Specialized Model for Multimodal Manga Understanding	Jeonghun Baek et.al.	2505.20298	link
2025-05-26	VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction	Zhiwen Fan et.al.	2505.20279	link
2025-05-26	Ten Principles of AI Agent Economics	Ke Yang et.al.	2505.20273	null
2025-05-26	Hard Negative Contrastive Learning for Fine-Grained Geometric Understanding in Large Multimodal Models	Kai Sun et.al.	2505.20152	link
2025-05-26	FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities	Jin Wang et.al.	2505.20147	null
2025-05-26	Correlating instruction-tuning (in multimodal models) with vision-language processing (in the brain)	Subba Reddy Oota et.al.	2505.20029	link
2025-05-26	Improving Speech Emotion Recognition Through Cross Modal Attention Alignment and Balanced Stacking Model	Lucas Ueda et.al.	2505.20007	link
2025-05-26	Learning Optimal Multimodal Information Bottleneck Representations	Qilong Wu et.al.	2505.19996	null
2025-05-26	ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs	Pooneh Mousavi et.al.	2505.19937	null
2025-05-26	Modeling Beyond MOS: Quality Assessment Models Must Integrate Context, Reasoning, and Multimodality	Mohamed Amine Kerkouri et.al.	2505.19696	null
2025-05-26	Benchmarking Large Multimodal Models for Ophthalmic Visual Question Answering with OphthalWeChat	Pusheng Xu et.al.	2505.19624	null
2025-05-26	Multiplicity is an Inevitable and Inherent Challenge in Multimodal Learning	Sanghyuk Chun et.al.	2505.19614	null
2025-05-26	Rethinking Gating Mechanism in Sparse MoE: Handling Arbitrary Modality Inputs with Confidence-Guided Gate	Liangwei Nathan Zheng et.al.	2505.19525	link
2025-05-26	Benchmarking Multimodal Knowledge Conflict for Large Multimodal Models	Yifan Jia et.al.	2505.19509	link
2025-05-25	I2MoE: Interpretable Multimodal Interaction-aware Mixture-of-Experts	Jiayi Xin et.al.	2505.19190	link
2025-05-25	ASPO: Adaptive Sentence-Level Preference Optimization for Fine-Grained Multimodal Reasoning	Yeyuan Wang et.al.	2505.19100	null
2025-05-24	SAMA: Towards Multi-Turn Referential Grounded Video Chat with Large Language Models	Ye Sun et.al.	2505.18812	null
2025-05-24	OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks	Jiayu Wang et.al.	2505.18775	link
2025-05-26	TokBench: Evaluating Your Visual Tokenizer before Visual Generation	Junfeng Wu et.al.	2505.18142	null
2025-05-23	VLM Models and Automated Grading of Atopic Dermatitis	Marc Lalonde et.al.	2505.17835	null
2025-05-23	Debiasing CLIP: Interpreting and Correcting Bias in Attention Heads	Wei Jie Yeo et.al.	2505.17425	null
2025-05-22	Analyzing Fine-Grained Alignment and Enhancing Vision Understanding in Multimodal Language Models	Jiachen Jiang et.al.	2505.17316	null
2025-05-22	MDIT-Bench: Evaluating the Dual-Implicit Toxicity in Large Multimodal Models	Bohan Jin et.al.	2505.17144	null
2025-05-21	Pixels Versus Priors: Controlling Knowledge Priors in Vision-Language Models through Visual Counterfacts	Michal Golovanevsky et.al.	2505.17127	null
2025-05-22	ARB: A Comprehensive Arabic Multimodal Reasoning Benchmark	Sara Ghaboura et.al.	2505.17021	link
2025-05-22	CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms	Shilin Yan et.al.	2505.17020	link
2025-05-22	OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning	Zongyan Han et.al.	2505.16974	link
2025-05-22	ICYM2I: The illusion of multimodal informativeness under missingness	Young Sang Choi et.al.	2505.16953	link
2025-05-22	Four Eyes Are Better Than Two: Harnessing the Collaborative Potential of Large Models via Differentiated Thinking and Complementary Ensembles	Jun Xie et.al.	2505.16784	null
2025-05-22	IFEval-Audio: Benchmarking Instruction-Following Capability in Audio-based Large Language Models	Yiming Gao et.al.	2505.16774	link
2025-05-22	Multimodal Online Federated Learning with Modality Missing in Internet of Things	Heqiang Wang et.al.	2505.16138	null
2025-05-21	Streamline Without Sacrifice – Squeeze out Computation Redundancy in LMM	Penghao Wu et.al.	2505.15816	link
2025-05-21	Robust Multimodal Learning via Entropy-Gated Contrastive Fusion	Leon Chlon et.al.	2505.15417	null
2025-05-21	Exploring the Delocalization of Dark States in a Multimode Optical Cavity	Kunyang Sun et.al.	2505.15153	null
2025-05-21	Graph Foundation Models: A Comprehensive Survey	Zehong Wang et.al.	2505.15116	link
2025-05-20	Foundations of Unknown-aware Machine Learning	Xuefeng Du et.al.	2505.14933	null
2025-05-19	HR-VILAGE-3K3M: A Human Respiratory Viral Immunization Longitudinal Gene Expression Dataset for Systems Immunity	Xuejun Sun et.al.	2505.14725	link
2025-05-20	Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning	Jiaer Xia et.al.	2505.14677	null
2025-05-20	VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation	Wentao Ma et.al.	2505.14640	null
2025-05-20	Spiking Neural Networks with Temporal Attention-Guided Adaptive Fusion for imbalanced Multi-modal Learning	Jiangrong Shen et.al.	2505.14535	null
2025-05-20	Investigating and Enhancing the Robustness of Large Multimodal Models Against Temporal Inconsistency	Jiafeng Liang et.al.	2505.14405	null
2025-05-20	“Haet Bhasha aur Diskrimineshun”: Phonetic Perturbations in Code-Mixed Hinglish to Red-Team LLMs	Darpan Aswal et.al.	2505.14226	null
2025-05-20	Mixed Signals: Understanding Model Disagreement in Multimodal Empathy Detection	Maya Srikanth et.al.	2505.13979	link
2025-05-20	LoVR: A Benchmark for Long Video Retrieval in Multimodal Contexts	Qifeng Cai et.al.	2505.13928	link
2025-05-19	MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision	Lingxiao Du et.al.	2505.13427	link
2025-05-19	I’ll believe it when I see it: Images increase misinformation sharing in Vision-Language Models	Alice Plebe et.al.	2505.13302	link
2025-05-19	ORQA: A Benchmark and Foundation Model for Holistic Operating Room Modeling	Ege Özsoy et.al.	2505.12890	null
2025-05-19	AdaToken-3D: Dynamic Spatial Gating for Efficient 3D Large Multimodal-Models Reasoning	Kai Zhang et.al.	2505.12782	null
2025-05-19	Reasoning-OCR: Can Large Multimodal Models Solve Complex Logical Reasoning Problems from OCR Cues?	Haibin He et.al.	2505.12766	link
2025-05-19	Incentivizing Multimodal Reasoning in Large Models for Direct Robot Manipulation	Weiliang Tang et.al.	2505.12744	null
2025-05-19	FLASH: Latent-Aware Semi-Autoregressive Speculative Decoding for Multimodal Tasks	Zihua Wang et.al.	2505.12728	link
2025-05-18	LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?	Maoyuan Ye et.al.	2505.12307	link
2025-05-18	LLaVA-4D: Embedding SpatioTemporal Prompt into LMMs for 4D Scene Understanding	Hanyu Zhou et.al.	2505.12253	null
2025-05-18	Can Large Multimodal Models Understand Agricultural Scenes? Benchmarking with AgroMind	Qingmei Li et.al.	2505.12207	null
2025-05-17	Understanding the Capabilities of Molecular Graph Neural Networks in Materials Science Through Multimodal Learning and Physical Context Encoding	Can Polat et.al.	2505.12137	link
2025-05-17	LOVE: Benchmarking and Evaluating Text-to-Video Generation and Video-to-Text Interpretation	Jiarui Wang et.al.	2505.12098	link
2025-05-17	IQBench: How “Smart’’ Are Vision-Language Models? A Study with Human IQ Tests	Tan-Hanh Pham et.al.	2505.12000	null
2025-05-17	SafeVid: Toward Safety Aligned Video Large Multimodal Models	Yixu Wang et.al.	2505.11926	null
2025-05-16	HumaniBench: A Human-Centric Framework for Large Multimodal Models Evaluation	Shaina Raza et.al.	2505.11454	link
2025-05-16	A Step towards Interpretable Multimodal AI Models with MultiFIX	Mafalda Malafaia et.al.	2505.11262	null
2025-05-16	Seeing Sound, Hearing Sight: Uncovering Modality Bias and Conflict of AI models in Sound Localization	Yanhao Jia et.al.	2505.11217	null
2025-05-16	GeoMM: On Geodesic Perspective for Multi-modal Learning	Shibin Mei et.al.	2505.11216	null
2025-05-19	Creating General User Models from Computer Use	Omar Shaikh et.al.	2505.10831	null
2025-05-15	MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning	Ke Wang et.al.	2505.10557	link
2025-05-15	Enhancing Multi-Image Question Answering via Submodular Subset Selection	Aaryan Sharma et.al.	2505.10533	null
2025-05-15	UniEval: Unified Holistic Evaluation for Unified Multimodal Understanding and Generation	Yi Li et.al.	2505.10483	null
2025-05-15	Incorporating brain-inspired mechanisms for multimodal learning in artificial intelligence	Xiang He et.al.	2505.10176	link
2025-05-15	ChronoSteer: Bridging Large Language Model and Time Series Foundation Model via Synthetic Data	Chengsen Wang et.al.	2505.10083	null
2025-05-15	PsOCR: Benchmarking Large Multimodal Models for Optical Character Recognition in Low-resource Pashto Language	Ijazul Haq et.al.	2505.10055	link
2025-05-16	PointArena: Probing Multimodal Grounding Through Language-Guided Pointing	Long Cheng et.al.	2505.09990	null
2025-05-14	Variational Visual Question Answering	Tobias Jan Wieczorek et.al.	2505.09591	null
2025-05-14	BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset	Jiuhai Chen et.al.	2505.09568	link
2025-05-14	GlobalMood: A cross-cultural benchmark for music emotion recognition	Harin Lee et.al.	2505.09539	null
2025-05-13	Multimodal Modeling of Ultradian Rhythms Using the Hankel Alternative View of Koopman (HAVOK) Analysis	Emmanuel Molefi et.al.	2505.08953	null
2025-05-12	Towards SFW sampling for diffusion models via external conditioning	Camilo Carvajal Reyes et.al.	2505.08817	link
2025-05-13	Aya Vision: Advancing the Frontier of Multilingual Multimodality	Saurabh Dash et.al.	2505.08751	null
2025-05-13	Advancing Food Nutrition Estimation via Visual-Ingredient Feature Fusion	Huiyan Qi et.al.	2505.08747	null
2025-05-13	ORACLE-Grasp: Zero-Shot Task-Oriented Robotic Grasping using Large Multimodal Models	Avihai Giuili et.al.	2505.08417	null
2025-05-13	Decoupled Multimodal Prototypes for Visual Recognition with Missing Modalities	Jueqing Lu et.al.	2505.08283	null
2025-05-12	MilChat: Introducing Chain of Thought Reasoning and GRPO to a Multimodal Small Language Model for Remote Sensing	Aybora Koksal et.al.	2505.07984	null
2025-05-12	Gameplay Highlights Generation	Vignesh Edithal et.al.	2505.07721	null
2025-05-15	A systematic review of challenges and proposed solutions in modeling multimodal data	Maryam Farhadizadeh et.al.	2505.06945	null
2025-05-11	MMiC: Mitigating Modality Incompleteness in Clustered Federated Learning	Lishan Yang et.al.	2505.06911	null
2025-05-10	Emotion-Qwen: Training Hybrid Experts for Unified Emotion and General Vision-Language Understanding	Dawei Huang et.al.	2505.06685	link
2025-05-10	Batch Augmentation with Unimodal Fine-tuning for Multimodal Learning	H M Dipu Kabir et.al.	2505.06592	link
2025-05-09	Understanding and Mitigating Toxicity in Image-Text Pretraining Datasets: A Case Study on LLaVA	Karthik Reddy Kanjula et.al.	2505.06356	null
2025-05-09	NSF-MAP: Neurosymbolic Multimodal Fusion for Robust and Interpretable Anomaly Prediction in Assembly Pipelines	Chathurangi Shyalika et.al.	2505.06333	link
2025-05-09	Multimodal Sentiment Analysis on CMU-MOSEI Dataset using Transformer-based Models	Jugal Gajjar et.al.	2505.06110	link
2025-05-08	The Moon’s Many Faces: A Single Unified Transformer for Multimodal Lunar Reconstruction	Tom Sander et.al.	2505.05644	null
2025-05-08	Looking Beyond Language Priors: Enhancing Visual Comprehension and Attention in Multimodal Models	Aarti Ghatkesar et.al.	2505.05626	null
2025-05-08	Does CLIP perceive art the same way we do?	Andrea Asperti et.al.	2505.05229	null
2025-05-08	A Benchmark Dataset and a Framework for Urdu Multimodal Named Entity Recognition	Hussain Ahmad et.al.	2505.05148	null
2025-05-13	FG-CLIP: Fine-Grained Visual and Textual Alignment	Chunyu Xie et.al.	2505.05071	link
2025-05-07	OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning	Xianhang Li et.al.	2505.04601	null
2025-05-07	VideoPath-LLaVA: Pathology Diagnostic Reasoning Through Video Instruction Tuning	Trinh T. L. Vuong et.al.	2505.04192	link
2025-05-07	Breaking Annotation Barriers: Generalized Video Quality Assessment via Ranking-based Self-Supervision	Linhan Cao et.al.	2505.03631	link
2025-05-06	A Comprehensive Survey of Large AI Models for Future Communications: Foundations, Applications and Challenges	Feibo Jiang et.al.	2505.03556	link
2025-05-06	Reinforced Correlation Between Vision and Language for Precise Medical AI Assistant	Haonan Wang et.al.	2505.03380	null
2025-05-06	A Vision-Language Model for Focal Liver Lesion Classification	Song Jian et.al.	2505.03350	null
2025-05-06	SonicRAG : High Fidelity Sound Effects Synthesis Based on Retrival Augmented Generation	Yu-Ren Guo et.al.	2505.03244	null
2025-05-06	Adversarial Attacks in Multimodal Systems: A Practitioner’s Survey	Shashank Kapoor et.al.	2505.03084	null
2025-05-05	The Multimodal Paradox: How Added and Missing Modalities Shape Bias and Performance in Multimodal AI	Kishore Sampath et.al.	2505.03020	null
2025-05-05	AOR: Anatomical Ontology-Guided Reasoning for Medical Large Multimodal Model in Chest X-Ray Interpretation	Qingqiu Li et.al.	2505.02830	null
2025-05-07	Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities	Xinjie Zhang et.al.	2505.02567	link
2025-05-05	Task-Oriented Semantic Communication in Large Multimodal Models-based Vehicle Networks	Baoxia Du et.al.	2505.02413	null
2025-05-04	Robust AI-Generated Face Detection with Imbalanced Data	Yamini Sri Krubha et.al.	2505.02182	link
2025-05-04	TeMTG: Text-Enhanced Multi-Hop Temporal Graph Modeling for Audio-Visual Video Parsing	Yaru Chen et.al.	2505.02096	null
2025-05-04	R-Bench: Graduate-level Multi-disciplinary Benchmarks for LLM & MLLM Complex Reasoning Evaluation	Meng-Hao Guo et.al.	2505.02018	null
2025-05-07	RoBridge: A Hierarchical Architecture Bridging Cognition and Execution for General Robotic Manipulation	Kaidong Zhang et.al.	2505.01709	null
2025-05-02	Grounding Task Assistance with Multimodal Cues from a Single Demonstration	Gabriel Sarch et.al.	2505.01578	null
2025-05-02	Dual-Forecaster: A Multimodal Time Series Model Integrating Descriptive and Predictive Texts	Wenfa Wu et.al.	2505.01135	null
2025-05-02	Aggregation of Dependent Expert Distributions in Multimodal Variational Autoencoders	Rogelio A Mancisidor et.al.	2505.01134	null
2025-05-01	SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models	Wufei Ma et.al.	2505.00788	null
2025-05-01	MINERVA: Evaluating Complex Video Reasoning	Arsha Nagrani et.al.	2505.00681	link
2025-05-01	LightEMMA: Lightweight End-to-End Multimodal Model for Autonomous Driving	Zhijie Qiao et.al.	2505.00284	link
2025-04-30	Investigating Zero-Shot Diagnostic Pathology in Vision-Language Models with Efficient Prompt Design	Vasudev Sharma et.al.	2505.00134	null
2025-04-30	Diff-Prompt: Diffusion-Driven Prompt Generator with Mask Supervision	Weicai Yan et.al.	2504.21423	null
2025-04-30	AGHI-QA: A Subjective-Aligned Dataset and Metric for AI-Generated Human Images	Yunhao Li et.al.	2504.21308	null
2025-04-28	AGATE: Stealthy Black-box Watermarking for Multimodal Model Copyright Protection	Jianbo Gao et.al.	2504.21044	null
2025-04-28	What’s Pulling the Strings? Evaluating Integrity and Attribution in AI Training and Inference through Concept Shift	Jiamin Chang et.al.	2504.21042	null
2025-04-29	YoChameleon: Personalized Vision and Language Generation	Thao Nguyen et.al.	2504.20998	null
2025-04-29	X-Fusion: Introducing New Modality to Frozen Large Language Models	Sicheng Mo et.al.	2504.20996	null
2025-05-05	LMME3DHF: Benchmarking and Evaluating Multimodal 3D Human Face Generation with LMMs	Woo Yi Yang et.al.	2504.20466	null
2025-04-28	NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks	Chia-Yu Hung et.al.	2504.19854	null
2025-04-27	Platonic Grounding for Efficient Multimodal Language Models	Moulik Choraria et.al.	2504.19327	null
2025-04-27	VIST-GPT: Ushering in the Era of Visual Storytelling with LLMs?	Mohamed Gado et.al.	2504.19267	null
2025-04-27	DeepSPG: Exploring Deep Semantic Prior Guidance for Low-light Image Enhancement with Multimodal Learning	Jialang Lu et.al.	2504.19127	null
2025-04-25	ClassComet: Exploring and Designing AI-generated Danmaku in Educational Videos to Enhance Online Learning	Zipeng Ji et.al.	2504.18189	null
2025-04-25	ActionArt: Advancing Multimodal Large Models for Fine-Grained Human-Centric Video Understanding	Yi-Xing Peng et.al.	2504.18152	null
2025-04-24	Token Sequence Compression for Efficient Multimodal Computing	Yasmine Omri et.al.	2504.17892	null
2025-04-23	A multi-scale vision transformer-based multimodal GeoAI model for mapping Arctic permafrost thaw	Wenwen Li et.al.	2504.17822	null
2025-04-28	Step1X-Edit: A Practical Framework for General Image Editing	Shiyu Liu et.al.	2504.17761	link
2025-04-24	FRAG: Frame Selection Augmented Generation for Long Video and Long Document Understanding	De-An Huang et.al.	2504.17447	link
2025-04-24	DRC: Enhancing Personalized Image Generation via Disentangled Representation Composition	Yiyan Xu et.al.	2504.17349	null
2025-04-23	Detecting and Understanding Hateful Contents in Memes Through Captioning and Visual Question-Answering	Ali Anaissi et.al.	2504.16723	null
2025-04-22	CLIP-IT: CLIP-based Pairing for Histology Images Classification	Banafsheh Karimian et.al.	2504.16181	link
2025-04-22	SAGA: Semantic-Aware Gray color Augmentation for Visible-to-Thermal Domain Adaptation across Multi-View Drone and Ground-Based Vision Systems	Manjunath D et.al.	2504.15728	link
2025-04-24	Vidi: Large Multimodal Models for Video Understanding and Editing	Vidi Team et.al.	2504.15681	null
2025-04-21	Event2Vec: Processing neuromorphic events directly by representations in vector space	Wei Fang et.al.	2504.15371	link
2025-04-21	VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models	Weiye Xu et.al.	2504.15279	null
2025-04-21	Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models	Guo Chen et.al.	2504.15271	null
2025-04-21	An LMM for Efficient Video Understanding via Reinforced Compression of Video Cubes	Ji Qi et.al.	2504.15270	null
2025-04-21	Zero-Shot, But at What Cost? Unveiling the Hidden Overhead of MILS’s LLM-CLIP Framework for Image Captioning	Yassir Benhammou et.al.	2504.15199	null
2025-04-21	DyFo: A Training-Free Dynamic Focus Visual Search for Enhancing LMMs in Fine-Grained Visual Understanding	Geng Li et.al.	2504.14920	link
2025-04-21	IoT-AMLHP: Aligned Multimodal Learning of Header-Payload Representations for Resource-Efficient Malicious IoT Traffic Classification	Fengyuan Nie et.al.	2504.14833	null
2025-04-20	Video-MMLU: A Massive Multi-Discipline Lecture Understanding Benchmark	Enxin Song et.al.	2504.14693	link
2025-04-20	Learning from Reasoning Failures via Synthetic Data Generation	Gabriela Ben Melech Stan et.al.	2504.14523	null
2025-04-19	Text-Audio-Visual-conditioned Diffusion Model for Video Saliency Prediction	Li Yu et.al.	2504.14267	null
2025-04-19	PipeWeaver: Addressing Data Dynamicity in Large Multimodal Model Training with Dynamic Interleaved Pipeline	Zhenliang Xue et.al.	2504.14145	null
2025-04-19	PEFT A2Z: Parameter-Efficient Fine-Tuning Survey for Large Language and Vision Models	Nusrat Jahan Prottasha et.al.	2504.14117	null
2025-04-18	Are you SURE? Enhancing Multimodal Pretraining with Missing Modalities through Uncertainty Estimation	Duy A. Nguyen et.al.	2504.13465	null
2025-04-14	Building Trustworthy Multimodal AI: A Review of Fairness, Transparency, and Ethics in Vision-Language Tasks	Mohammad Saleha et.al.	2504.13199	null
2025-04-17	A Survey on Cross-Modal Interaction Between Music and Multimodal Data	Sifei Li et.al.	2504.12796	null
2025-04-17	LAD-Reasoner: Tiny Multimodal Models are Good Reasoners for Logical Anomaly Detection	Weijia Li et.al.	2504.12749	null
2025-04-17	SmartFreeEdit: Mask-Free Spatial-Aware Image Editing with Complex Instruction Understanding	Qianqian Sun et.al.	2504.12704	null
2025-04-16	FedEPA: Enhancing Personalization and Modality Alignment in Multimodal Federated Learning	Yu Zhang et.al.	2504.12025	null
2025-04-15	Leveraging multimodal explanatory annotations for video interpretation with Modality Specific Dataset	Elisa Ancarani et.al.	2504.11232	null
2025-04-15	TerraMind: Large-Scale Generative Multimodality for Earth Observation	Johannes Jakubik et.al.	2504.11171	null
2025-04-15	PuzzleBench: A Fully Dynamic Evaluation Framework for Large Multimodal Models on Puzzle Solving	Zeyu Zhang et.al.	2504.10885	null
2025-04-19	InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models	Jinguo Zhu et.al.	2504.10479	link
2025-04-21	InstructEngine: Instruction-driven Text-to-Image Alignment	Xingyu Lu et.al.	2504.10329	null
2025-04-14	Improving Multimodal Hateful Meme Detection Exploiting LMM-Generated Knowledge	Maria Tzelepi et.al.	2504.09914	link
2025-04-13	Automatic Detection of Intro and Credits in Video using CLIP and Multihead Attention	Vasilii Korolkov et.al.	2504.09738	null
2025-04-13	InfoMAE: Pair-Efficient Cross-Modal Alignment for Multimodal Time-Series Sensing Signals	Tomoyoshi Kimura et.al.	2504.09707	null
2025-04-13	TinyLLaVA-Video-R1: Towards Smaller LMMs for Video Reasoning	Xingjian Zhang et.al.	2504.09641	link
2025-04-13	Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation	Yongchao Feng et.al.	2504.09480	link
2025-04-13	Don’t Deceive Me: Mitigating Gaslighting through Attention Reallocation in LMMs	Pengkun Jiao et.al.	2504.09456	null
2025-04-12	Application of Contrastive Learning on ECG Data: Evaluating Performance in Japanese and Classification with Around 100 Labels	Junichiro Takahashi et.al.	2504.09302	null
2025-04-12	PathVLM-R1: A Reinforcement Learning-Driven Reasoning Model for Pathology Visual-Language Tasks	Jianyu Wu et.al.	2504.09258	null
2025-04-12	FVQ: A Large-Scale Dataset and A LMM-based Method for Face Video Quality Assessment	Sijing Wu et.al.	2504.09255	link
2025-04-11	Mimic In-Context Learning for Multimodal Tasks	Yuchu Jiang et.al.	2504.08851	link
2025-04-11	LMM4LMM: Benchmarking and Evaluating Large-multimodal Image Generation with LMMs	Jiarui Wang et.al.	2504.08358	link
2025-04-11	Scaling Laws for Native Multimodal Models	Mustafa Shukor et.al.	2504.07951	null
2025-04-10	Unveiling the Impact of Multimodal Features on Chinese Spelling Correction: From Analysis to Design	Xiaowu Zhang et.al.	2504.07661	link
2025-04-10	Leveraging LLMs for Multimodal Retrieval-Augmented Radiology Report Generation via Key Phrase Extraction	Kyoyun Choi et.al.	2504.07415	null
2025-04-09	Zeus: Zero-shot LLM Instruction for Union Segmentation in Multimodal Medical Imaging	Siyuan Dai et.al.	2504.07336	null
2025-04-09	A Unified Agentic Framework for Evaluating Conditional Image Generation	Jifang Wang et.al.	2504.07046	link
2025-04-09	Classifying the Unknown: In-Context Learning for Open-Vocabulary Text and Symbol Recognition	Tom Simon et.al.	2504.06841	null
2025-04-09	Domain-Conditioned Scene Graphs for State-Grounded Task Planning	Jonas Herzog et.al.	2504.06661	null
2025-04-09	SCI-Reason: A Dataset with Chain-of-Thought Rationales for Complex Multimodal Reasoning in Academic Areas	Chenghao Ma et.al.	2504.06637	null
2025-04-08	Transfer between Modalities with MetaQueries	Xichen Pan et.al.	2504.06256	null
2025-04-07	SmolVLM: Redefining small and efficient multimodal models	Andrés Marafioti et.al.	2504.05299	null
2025-04-07	Mapping biodiversity at very-high resolution in Europe	César Leblanc et.al.	2504.05231	null
2025-04-07	Resource-Efficient Beam Prediction in mmWave Communications with Multimodal Realistic Simulation Framework	Yu Min Park et.al.	2504.05187	null
2025-04-07	The 1st Solution for 4th PVUW MeViS Challenge: Unleashing the Potential of Large Multimodal Models for Referring Video Segmentation	Hao Fang et.al.	2504.05178	null
2025-04-06	Foundation Models for Software Engineering of Cyber-Physical Systems: the Road Ahead	Chengjie Lu et.al.	2504.04630	null
2025-04-06	FluentLip: A Phonemes-Based Two-stage Approach for Audio-Driven Lip Synthesis with Optical Flow Consistency	Shiyan Liu et.al.	2504.04427	null
2025-04-05	Could AI Trace and Explain the Origins of AI-Generated Images and Text?	Hongchao Fang et.al.	2504.04279	link
2025-04-10	VideoComp: Advancing Fine-Grained Compositional and Temporal Alignment in Video-Text Models	Dahun Kim et.al.	2504.03970	link
2025-04-04	Interpretable Multimodal Learning for Tumor Protein-Metal Binding: Progress, Challenges, and Perspectives	Xiaokun Liu et.al.	2504.03847	null
2025-04-04	DML-RAM: Deep Multimodal Learning Framework for Robotic Arm Manipulation using Pre-trained Models	Sathish Kumar et.al.	2504.03423	null
2025-04-04	Scaling Open-Vocabulary Action Detection	Zhen Hao Sia et.al.	2504.03096	link
2025-04-03	VEGAS: Towards Visually Explainable and Grounded Artificial Social Intelligence	Hao Li et.al.	2504.02227	link
2025-04-02	One Pic is All it Takes: Poisoning Visual Document Retrieval Augmented Generation with a Single Image	Ezzeldin Shereen et.al.	2504.02132	null
2025-04-02	Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness	Haochen Wang et.al.	2504.01901	null
2025-04-02	Deep Learning-Driven Protein Structure Prediction and Design: Key Model Developments by Nobel Laureates and Multi-Domain Applications	Wanqing Yang et.al.	2504.01490	null
2025-04-01	SViQA: A Unified Speech-Vision Multimodal Model for Textless Visual Question Answering	Bingxin Li et.al.	2504.01049	null
2025-03-29	Who Owns the Output? Bridging Law and Technology in LLMs Attribution	Emanuele Mezzi et.al.	2504.01032	null
2025-04-01	Experiential Semantic Information and Brain Alignment: Are Multimodal Models Better than Language Models?	Anna Bavaresco et.al.	2504.00942	null
2025-03-31	FakeScope: Large Multimodal Expert Model for Transparent AI-Generated Image Forensics	Yixuan Li et.al.	2503.24267	null
2025-03-31	Predicting Targeted Therapy Resistance in Non-Small Cell Lung Cancer Using Multimodal Machine Learning	Peiying Hua et.al.	2503.24165	null
2025-03-31	H2VU-Benchmark: A Comprehensive Benchmark for Hierarchical Holistic Video Understanding	Qi Wu et.al.	2503.24008	null
2025-03-31	Unimodal-driven Distillation in Multimodal Emotion Recognition with Dynamic Fusion	Jiagen Li et.al.	2503.23721	null
2025-03-29	Evaluating Compositional Scene Understanding in Multimodal Generative Models	Shuhao Fu et.al.	2503.23125	link
2025-03-27	LeForecast: Enterprise Hybrid Forecast by Time Series Intelligence	Zheng Tan et.al.	2503.22747	null
2025-03-28	A Survey on Remote Sensing Foundation Models: From Vision to Multimodality	Ziyue Huang et.al.	2503.22081	link
2025-03-27	On Large Multimodal Models as Open-World Image Classifiers	Alessandro Conti et.al.	2503.21851	link
2025-03-27	Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model	Abdelrahman Shaker et.al.	2503.21782	link
2025-03-27	MAVERIX: Multimodal Audio-Visual Evaluation Reasoning IndeX	Liuyue Xie et.al.	2503.21699	null
2025-03-27	FusionSegReID: Advancing Person Re-Identification with Multimodal Retrieval and Precise Segmentation	Jincheng Yan et.al.	2503.21595	null
2025-03-27	Keyword-Oriented Multimodal Modeling for Euphemism Identification	Yuxue Hu et.al.	2503.21504	link
2025-03-27	Graph-to-Vision: Multi-graph Understanding and Reasoning using Vision-Language Models	Ruizhou Li et.al.	2503.21435	null
2025-03-27	UGen: Unified Autoregressive Multimodal Model with Progressive Vocabulary Learning	Hongxuan Tang et.al.	2503.21193	null
2025-03-27	AdaMHF: Adaptive Multimodal Hierarchical Fusion for Survival Prediction	Shuaiyu Zhang et.al.	2503.21124	link
2025-03-28	Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields	Shijie Zhou et.al.	2503.20776	null
2025-03-26	ADS-Edit: A Multimodal Knowledge Editing Dataset for Autonomous Driving Systems	Chenxi Wang et.al.	2503.20756	link
2025-03-26	Qwen2.5-Omni Technical Report	Jin Xu et.al.	2503.20215	null
2025-03-25	Zero-Shot Human-Object Interaction Synthesis with Multimodal Priors	Yuke Lou et.al.	2503.20118	null
2025-03-25	Gemini Robotics: Bringing AI into the Physical World	Gemini Robotics Team et.al.	2503.20020	null
2025-03-25	CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning	Hao Yu et.al.	2503.19900	link
2025-03-27	RGB-Th-Bench: A Dense benchmark for Visual-Thermal Understanding of Vision Language Models	Mehdi Moshtaghi et.al.	2503.19654	null
2025-03-25	Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation	Hongcheng Gao et.al.	2503.19622	link
2025-03-25	VGAT: A Cancer Survival Analysis Framework Transitioning from Generative Visual Question Answering to Genomic Reconstruction	Zizhi Chen et.al.	2503.19367	link
2025-03-25	Membership Inference Attacks on Large-Scale Models: A Survey	Hengyu Wu et.al.	2503.19338	null
2025-03-25	LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer Text	Weizhi Chen et.al.	2503.19311	link
2025-03-24	Adaptive Unimodal Regulation for Balanced Multimodal Information Acquisition	Chengxiang Huang et.al.	2503.18595	link
2025-03-27	Image-to-Text for Medical Reports Using Adaptive Co-Attention and Triple-LSTM Module	Yishen Liu et.al.	2503.18297	null
2025-03-21	Audio-Enhanced Vision-Language Modeling with Latent Space Broadening for High Quality Data Expansion	Yu Sun et.al.	2503.17551	null
2025-03-21	Modeled vortex dynamics on a Bose-Einstein condensate in a rotating lattice	P. Capuzzi et.al.	2503.17317	null
2025-03-21	MTBench: A Multimodal Time Series Benchmark for Temporal Reasoning and Question Answering	Jialin Chen et.al.	2503.16858	link
2025-03-18	Do Multimodal Large Language Models Understand Welding?	Grigorii Khvatskii et.al.	2503.16537	null
2025-03-24	GAEA: A Geolocation Aware Conversational Model	Ron Campos et.al.	2503.16423	null
2025-03-20	Disentangled and Interpretable Multimodal Attention Fusion for Cancer Survival Prediction	Aniek Eijpe et.al.	2503.16069	null
2025-03-20	What can Off-the-Shelves Large Multi-Modal Models do for Dynamic Scene Graph Generation?	Xuanming Cui et.al.	2503.15846	null
2025-03-19	EarthScape: A Multimodal Dataset for Surficial Geologic Mapping and Earth Surface Analysis	Matthew Massey et.al.	2503.15625	link
2025-03-19	A Review on Large Language Models for Visual Analytics	Navya Sonal Agarwal et.al.	2503.15176	null
2025-03-19	Machine Unlearning in Hyperbolic vs. Euclidean Multimodal Contrastive Learning: Adapting Alignment Calibration to MERU	Àlex Pujol Vidal et.al.	2503.15166	null
2025-03-19	Optimal Transport Adapter Tuning for Bridging Modality Gaps in Few-Shot Remote Sensing Scene Classification	Zhong Ji et.al.	2503.14938	null
2025-03-19	Spot the Fake: Large Multimodal Model-Based Synthetic Image Detection with Artifact Explanation	Siwei Wen et.al.	2503.14905	null
2025-03-19	MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models	Chejian Xu et.al.	2503.14827	null
2025-03-18	Tracking Meets Large Multimodal Models for Driving Scenario Understanding	Ayesha Ishaq et.al.	2503.14498	link
2025-03-22	VisEscape: A Benchmark for Evaluating Exploration-driven Decision-making in Virtual Escape Rooms	Seungwon Lim et.al.	2503.14427	link
2025-03-18	HySurvPred: Multimodal Hyperbolic Embedding with Angle-Aware Hierarchical Contrastive Learning and Uncertainty Constraints for Survival Prediction	Jiaqi Yang et.al.	2503.13862	null
2025-03-17	Triad: Empowering LMM-based Anomaly Detection with Vision Expert-guided Visual Tokenizer and Manufacturing Process	Yuanze Li et.al.	2503.13184	link
2025-03-17	HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model	Tao Wang et.al.	2503.13026	null
2025-03-17	Exploring 3D Activity Reasoning and Planning: From Implicit Human Intentions to Route-Aware Planning	Xueying Jiang et.al.	2503.12974	null
2025-03-17	Task-Oriented Feature Compression for Multimodal Understanding via Device-Edge Co-Inference	Cheng Yuan et.al.	2503.12926	null
2025-03-17	Federated Continual Instruction Tuning	Haiyang Guo et.al.	2503.12897	null
2025-03-16	Logic-RAG: Augmenting Large Multimodal Models with Visual-Spatial Knowledge for Road Scene Understanding	Imran Kabir et.al.	2503.12663	link
2025-03-16	PEBench: A Fictitious Dataset to Benchmark Machine Unlearning for Multimodal Large Language Models	Zhaopan Xu et.al.	2503.12545	null
2025-03-16	BREEN: Bridge Data-Efficient Encoder-Free Multimodal Learning with Learnable Queries	Tianle Li et.al.	2503.12446	null
2025-03-15	TLAC: Two-stage LMM Augmented CLIP for Zero-Shot Classification	Ans Munir et.al.	2503.12206	link
2025-03-14	Visualizing Thought: Conceptual Diagrams Enable Robust Planning in LMMs	Nasim Borazjanizadeh et.al.	2503.11790	null
2025-03-14	Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers	Weiming Ren et.al.	2503.11579	null
2025-03-14	Exploring the Potential of Large Multimodal Models as Effective Alternatives for Pronunciation Assessment	Ke Wang et.al.	2503.11229	null
2025-03-14	Cross-Modal Learning for Music-to-Music-Video Description Generation	Zhuoyuan Mao et.al.	2503.11190	null
2025-03-14	Augmenting Image Annotation: A Human-LMM Collaborative Framework for Efficient Object Selection and Label Generation	He Zhang et.al.	2503.11096	null
2025-03-14	Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models	Hongyang Wei et.al.	2503.11073	link
2025-03-13	Towards Understanding Graphical Perception in Large Multimodal Models	Kai Zhang et.al.	2503.10857	link
2025-03-13	DriveLMM-o1: A Step-by-Step Reasoning Dataset and Large Multimodal Model for Driving Scenario Understanding	Ayesha Ishaq et.al.	2503.10621	link
2025-03-13	ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning	Pengfei Luo et.al.	2503.10166	link
2025-03-13	ExtremeAIGC: Benchmarking LMM Vulnerability to AI-Generated Extremist Content	Bhavik Chandna et.al.	2503.09964	null
2025-03-12	MOAT: Evaluating LMMs for Capability Integration and Instruction Grounding	Zhoutong Ye et.al.	2503.09348	link
2025-03-12	Teaching LMMs for Image Quality Scoring and Interpreting	Zicheng Zhang et.al.	2503.09197	link
2025-03-11	Proc4Gem: Foundation models for physical agency through procedural generation	Yixin Lin et.al.	2503.08593	null
2025-03-11	ComicsPAP: understanding comic strips by picking the correct panel	Emanuele Vivoli et.al.	2503.08561	null
2025-03-14	Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens	Qingsong Xie et.al.	2503.08377	null
2025-03-11	Uni $\textbf{F}^2$ ace: Fine-grained Face Understanding and Generation with Unified Multimodal Models	Junzhe Li et.al.	2503.08120	null
2025-03-10	Video Action Differencing	James Burgess et.al.	2503.07860	null
2025-03-10	Federated Multimodal Learning with Dual Adapters and Selective Pruning for Communication and Computational Efficiency	Duy Phuong Nguyen et.al.	2503.07552	link
2025-03-11	LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL	Yingzhe Peng et.al.	2503.07536	null
2025-03-10	VisRL: Intention-Driven Visual Perception via Reinforced Reasoning	Zhangquan Chen et.al.	2503.07523	link
2025-03-10	WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation	Yuwei Niu et.al.	2503.07265	link
2025-03-10	A Multimodal Benchmark Dataset and Model for Crop Disease Diagnosis	Xiang Liu et.al.	2503.06973	link
2025-03-10	LLaFEA: Frame-Event Complementary Fusion for Fine-Grained Spatiotemporal Understanding in LMMs	Hanyu Zhou et.al.	2503.06934	null
2025-03-10	HiSTF Mamba: Hierarchical Spatiotemporal Fusion with Multi-Granular Body-Spatial Modeling for High-Fidelity Text-to-Motion Generation	Xingzu Zhan et.al.	2503.06897	null
2025-03-10	Towards Generalization of Tactile Image Generation: Reference-Free Evaluation in a Leakage-Free Setting	Cagri Gungor et.al.	2503.06860	null
2025-03-09	SP3D: Boosting Sparsely-Supervised 3D Object Detection via Accurate Cross-Modal Semantic Prompts	Shijia Zhao et.al.	2503.06467	link
2025-03-09	DynCIM: Dynamic Curriculum for Imbalanced Multimodal Learning	Chengxuan Qian et.al.	2503.06456	link
2025-03-08	UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban Spaces	Baining Zhao et.al.	2503.06157	null
2025-03-07	CASP: Compression of Large Multimodal Models Based on Attention Sparsity	Mohsen Gholami et.al.	2503.05936	link
2025-03-07	Robust Multimodal Learning for Ophthalmic Disease Grading via Disentangled Representation	Xinkun Wang et.al.	2503.05319	link
2025-03-06	Large Language Models in Bioinformatics: A Survey	Zhenyu Wang et.al.	2503.04490	null
2025-03-06	Semantic Alignment of Unimodal Medical Text and Vision Representations	Maxime Di Folco et.al.	2503.04478	null
2025-03-06	ToFu: Visual Tokens Reduction via Fusion for Multi-modal, Multi-patch, Multi-image Task	Vittorio Pippi et.al.	2503.04444	null
2025-03-05	Rebalanced Multimodal Learning with Data-aware Unimodal Sampling	Qingyuan Jiang et.al.	2503.03792	null
2025-03-06	DongbaMIE: A Multimodal Information Extraction Dataset for Evaluating Semantic Understanding of Dongba Pictograms	Xiaojun Bi et.al.	2503.03644	link
2025-03-05	See What You Are Told: Visual Attention Sink in Large Multimodal Models	Seil Kang et.al.	2503.03321	null
2025-03-04	Multimodal Deep Learning for Subtype Classification in Breast Cancer Using Histopathological Images and Gene Expression Data	Amin Honarmandi Shandiz et.al.	2503.02849	link
2025-03-04	Multimodal AI predicts clinical outcomes of drug combinations from preclinical data	Yepeng Huang et.al.	2503.02781	link
2025-03-04	DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models	Saeed Ranjbar Alvar et.al.	2503.02175	link
2025-03-03	V $^2$ Dial: Unification of Video and Visual Dialog via Multimodal Experts	Adnen Abdessaied et.al.	2503.02063	null
2025-03-03	Abn-BLIP: Abnormality-aligned Bootstrapping Language-Image Pre-training for Pulmonary Embolism Diagnosis and Report Generation from CTPA	Zhusi Zhong et.al.	2503.02034	null
2025-03-07	Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs	Microsoft et.al.	2503.01743	null
2025-03-03	DeepSuM: Deep Sufficient Modality Learning Framework	Zhe Gao et.al.	2503.01728	null
2025-03-04	HarmonySet: A Comprehensive Dataset for Understanding Video-Music Semantic Alignment and Temporal Synchronization	Zitang Zhou et.al.	2503.01725	null
2025-03-03	Learning to Generate Long-term Future Narrations Describing Activities of Daily Living	Ramanathan Rajendiran et.al.	2503.01416	null
2025-03-03	Dementia Insights: A Context-Based MultiModal Approach	Sahar Sinene Mehdoui et.al.	2503.01226	null
2025-03-03	HOP: Heterogeneous Topology-based Multimodal Entanglement for Co-Speech Gesture Generation	Hongye Cheng et.al.	2503.01175	null
2025-03-02	Re-Imagining Multimodal Instruction Tuning: A Representation View	Yiyang Liu et.al.	2503.00723	link
2025-03-01	Urban Safety Perception Through the Lens of Large Multimodal Models: A Persona-based Approach	Ciro Beneduce et.al.	2503.00610	null
2025-03-01	Taming Large Multimodal Agents for Ultra-low Bitrate Semantically Disentangled Image Compression	Juan Song et.al.	2503.00399	link
2025-02-28	Solar Multimodal Transformer: Intraday Solar Irradiance Predictor using Public Cameras and Time Series	Yanan Niu et.al.	2503.00250	null
2025-02-28	Multimodal Learning for Just-In-Time Software Defect Prediction in Autonomous Driving Systems	Faisal Mohammad et.al.	2502.20806	null
2025-02-28	Towards General Visual-Linguistic Face Forgery Detection(V2)	Ke Sun et.al.	2502.20698	link
2025-02-27	Chitranuvad: Adapting Multi-Lingual LLMs for Multimodal Translation	Shaharukh Khan et.al.	2502.20420	null
2025-03-01	R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts	Zhongyang Li et.al.	2502.20395	link
2025-02-27	Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think	Liang Chen et.al.	2502.20172	link
2025-02-27	Rethinking Multimodal Learning from the Perspective of Mitigating Classification Ability Disproportion	QingYuan Jiang et.al.	2502.20120	null
2025-03-01	MMKE-Bench: A Multimodal Editing Benchmark for Diverse Visual Knowledge	Yuntao Du et.al.	2502.19870	link
2025-02-27	Knowledge Bridger: Towards Training-free Missing Multi-modality Completion	Guanzhou Ke et.al.	2502.19834	link
2025-02-27	MICINet: Multi-Level Inter-Class Confusing Information Removal for Reliable Multimodal Classification	Tong Zhang et.al.	2502.19674	null
2025-02-25	Mind the Gap: Bridging the Divide Between AI Aspirations and the Reality of Autonomous Characterization	Grace Guinan et.al.	2502.18604	null
2025-02-25	Problem Solved? Information Extraction Design Space for Layout-Rich Documents using LLMs	Gaye Colakoglu et.al.	2502.18179	link
2025-02-25	CPVis: Evidence-based Multimodal Learning Analytics for Evaluation in Collaborative Programming	Gefei Zhang et.al.	2502.17835	null
2025-02-24	Contrastive Visual Data Augmentation	Yu Zhou et.al.	2502.17709	null
2025-02-22	A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models	Zihao Lin et.al.	2502.17516	null
2025-02-22	SAE-V: Interpreting Multimodal Models for Enhanced Alignment	Hantao Lou et.al.	2502.17514	null
2025-02-24	Applications of Large Models in Medicine	YunHe Su et.al.	2502.17132	null
2025-02-24	Shakti-VLMs: Scalable Vision-Language Models for Enterprise AI	Syed Abdul Gaffar Shakhadri et.al.	2502.17092	null
2025-02-24	Systematic Weight Evaluation for Pruning Large Language Models: Enhancing Performance and Sustainability	Ashhadul Islam et.al.	2502.17071	null
2025-02-24	DUNIA: Pixel-Sized Embeddings via Cross-Modal Alignment for Earth Observation Applications	Ibrahim Fayad et.al.	2502.17066	null
2025-02-23	Category-Selective Neurons in Deep Networks: Comparing Purely Visual and Visual-Language Models	Zitong Lu et.al.	2502.16456	null
2025-02-23	A Survey on Industrial Anomalies Synthesis	Xichen Xu et.al.	2502.16412	link
2025-02-22	Understanding the Emergence of Multimodal Representation Alignment	Megan Tjandrasuwita et.al.	2502.16282	link
2025-02-22	Beyond Visual Perception: Insights from Smartphone Interaction of Visually Impaired Users with Large Multimodal Models	Jingyi Xie et.al.	2502.16098	null
2025-02-21	Multi-Agent Multimodal Models for Multicultural Text to Image Generation	Parth Bhalerao et.al.	2502.15972	link
2025-02-21	Bridging Domain Gaps between Pretrained Multimodal Models and Recommendations	Wenyu Zhang et.al.	2502.15542	null
2025-02-21	LongCaptioning: Unlocking the Power of Long Caption Generation in Large Multimodal Models	Hongchen Wei et.al.	2502.15393	null
2025-02-21	M2LADS Demo: A System for Generating Multimodal Learning Analytics Dashboards	Alvaro Becerra et.al.	2502.15363	null
2025-02-21	Social Genome: Grounded Social Reasoning Abilities of Multimodal Models	Leena Mathur et.al.	2502.15109	null
2025-02-20	InterFeedback: Unveiling Interactive Intelligence of Large Multimodal Models via Human Feedback	Henry Hengyuan Zhao et.al.	2502.15027	null
2025-02-18	Beyond Words: Exploring Cultural Value Sensitivity in Multimodal Models	Srishti Yadav et.al.	2502.14906	null
2025-02-20	Time Travel: A Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifacts	Sara Ghaboura et.al.	2502.14865	link
2025-02-20	FetalCLIP: A Visual-Language Foundation Model for Fetal Ultrasound Image Analysis	Fadillah Maani et.al.	2502.14807	link
2025-02-20	Harnessing PDF Data for Improving Japanese Large Multimodal Models	Jeonghun Baek et.al.	2502.14778	null
2025-02-20	Determining Layer-wise Sparsity for Large Language Models Through a Theoretical Perspective	Weizhong Huang et.al.	2502.14770	null
2025-02-19	Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data	Yucheng Shi et.al.	2502.14044	link
2025-02-19	Beyond Single Frames: Can LMMs Comprehend Temporal and Contextual Narratives in Image Sequences?	Xiaochen Wang et.al.	2502.13925	null
2025-02-19	Pretrained Image-Text Models are Secretly Video Captioners	Chunhui Zhang et.al.	2502.13363	link
2025-02-18	Magma: A Foundation Model for Multimodal AI Agents	Jianwei Yang et.al.	2502.13130	link
2025-02-18	Improved Fine-Tuning of Large Multimodal Models for Hateful Meme Detection	Jingbiao Mei et.al.	2502.13061	link
2025-02-18	Robust Disentangled Counterfactual Learning for Physical Audiovisual Commonsense Reasoning	Mengshi Qi et.al.	2502.12425	link
2025-02-14	ClusMFL: A Cluster-Enhanced Framework for Modality-Incomplete Multimodal Federated Learning in Brain Imaging Analysis	Xinpeng Wang et.al.	2502.12180	null
2025-02-17	How to Upscale Neural Networks with Scaling Law? A Survey and Practical Guidelines	Ayan Sengupta et.al.	2502.12051	null
2025-02-17	HintsOfTruth: A Multimodal Checkworthiness Detection Dataset with Real and Synthetic Claims	Michiel van der Meer et.al.	2502.11753	null
2025-02-19	Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents	Vardaan Pahuja et.al.	2502.11357	null
2025-02-16	AudioSpa: Spatializing Sound Events with Text	Linfeng Feng et.al.	2502.11219	null
2025-02-16	UNITE-FND: Reframing Multimodal Fake News Detection through Unimodal Scene Translation	Arka Mukherjee et.al.	2502.11132	null
2025-02-16	Demystifying Hateful Content: Leveraging Large Multimodal Models for Hateful Meme Detection with Explainable Decisions	Ming Shan Hee et.al.	2502.11073	null
2025-02-18	BalanceBenchmark: A Survey for Imbalanced Learning	Shaoxuan Xu et.al.	2502.10816	link
2025-02-14	PolyPath: Adapting a Large Multimodal Model for Multi-slide Pathology Report Generation	Faruk Ahmed et.al.	2502.10536	null
2025-02-14	Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence	Granite Vision Team et.al.	2502.09927	null
2025-02-13	ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models	Jonathan Roberts et.al.	2502.09696	null
2025-02-13	MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency	Dongzhi Jiang et.al.	2502.09621	null
2025-02-13	Exploring the Potential of Encoder-free Architectures in 3D LMMs	Yiwen Tang et.al.	2502.09620	link
2025-02-17	Ask in Any Modality: A Comprehensive Survey on Multimodal Retrieval-Augmented Generation	Mohammad Mahdi Abootorabi et.al.	2502.08826	link
2025-02-17	SB-Bench: Stereotype Bias Benchmark for Large Multimodal Models	Vishal Narnaware et.al.	2502.08779	null
2025-02-13	PulseCheck457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models	Xingrui Wang et.al.	2502.08636	link
2025-02-12	What Is That Talk About? A Video-to-Text Summarization Dataset for Scientific Presentations	Dongqi Liu et.al.	2502.08279	link
2025-02-12	UniCoRN: Unified Commented Retrieval Network with LMMs	Maximilian Jaritz et.al.	2502.08254	null
2025-02-13	NanoVLMs: How small can we go and still make coherent Vision Language Models?	Mukund Agarwalla et.al.	2502.07838	null
2025-02-11	Advancing Precision Oncology Through Modeling of Longitudinal and Multimodal Data	Luoting Zhuang et.al.	2502.07836	null
2025-02-10	Generative Distribution Prediction: A Unified Approach to Multimodal Learning	Xinyu Tian et.al.	2502.07090	null
2025-02-06	CAST: Cross Attention based multimodal fusion of Structure and Text for materials property prediction	Jaewan Lee et.al.	2502.06836	null
2025-02-10	Learning Musical Representations for Music Performance Question Answering	Xingjian Diao et.al.	2502.06710	null
2025-02-08	Large Multimodal Models for Low-Resource Languages: A Survey	Marian Lupascu et.al.	2502.05568	null
2025-02-06	Color in Visual-Language Models: CLIP deficiencies	Guillem Arias et.al.	2502.04470	null
2025-02-06	Transforming Multimodal Models into Action Models for Radiotherapy	Matteo Ferrante et.al.	2502.04408	null
2025-02-05	DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization	Zhenglin Zhou et.al.	2502.04370	null
2025-02-05	Tell2Reg: Establishing spatial correspondence between images by the same language prompts	Wen Yan et.al.	2502.03118	link
2025-02-04	Medical Multimodal Model Stealing Attacks via Adversarial Domain Alignment	Yaling Shen et.al.	2502.02438	null
2025-02-03	MemPal: Leveraging Multimodal AI and LLMs for Voice-Activated Object Retrieval in Homes of Older Adults	Natasha Maniar et.al.	2502.01801	null
2025-02-03	Efficiently Integrate Large Language Models with Visual Perception: A Survey from the Training Paradigm Perspective	Xiaorui Ma et.al.	2502.01524	null
2025-02-03	MIND: Modality-Informed Knowledge Distillation Framework for Multimodal Clinical Prediction Tasks	Alejandro Guerra-Manzanares et.al.	2502.01158	null
2025-02-03	CLIP-UP: A Simple and Efficient Mixture-of-Experts CLIP Training Recipe with Sparse Upcycling	Xinze Wang et.al.	2502.00965	null
2025-02-02	Towards Efficient Large Multimodal Model Serving	Haoran Qiu et.al.	2502.00937	null
2025-02-02	“I am bad”: Interpreting Stealthy, Universal and Robust Audio Jailbreaks in Audio-Language Models	Isha Gupta et.al.	2502.00718	null
2025-02-02	MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models	Huanqia Cai et.al.	2502.00698	link
2025-02-01	Embodied Intelligence for 3D Understanding: A Survey on 3D Scene Question Answering	Zechuan Li et.al.	2502.00342	null
2025-02-01	Mordal: Automated Pretrained Model Selection for Vision Language Models	Shiqi He et.al.	2502.00241	null
2025-01-31	Multimodal MRI-Ultrasound AI for Prostate Cancer Detection Outperforms Radiologist MRI Interpretation: A Multi-Center Study	Hassan Jahanandish et.al.	2502.00146	null
2025-02-04	AIN: The Arabic INclusive Large Multimodal Model	Ahmed Heakl et.al.	2502.00094	link
2025-01-31	Do Large Multimodal Models Solve Caption Generation for Scientific Figures? Lessons Learned from SCICAP Challenge 2023	Ting-Yao E. Hsu et.al.	2501.19353	null
2025-01-30	Integrating LMM Planners and 3D Skill Policies for Generalizable Manipulation	Yuelei Li et.al.	2501.18733	null
2025-01-30	AGAV-Rater: Adapting Large Multimodal Model for AI-Generated Audio-Visual Quality Assessment	Yuqin Cao et.al.	2501.18314	null
2025-01-30	Efficient Audiovisual Speech Processing via MUTUD: Multimodal Training and Unimodal Deployment	Joanna Hong et.al.	2501.18157	null
2025-01-29	U2A: Unified Unimodal Adaptation for Robust and Efficient Multimodal Learning	Md Kaykobad Reza et.al.	2501.17823	null
2025-01-28	Molecular-driven Foundation Model for Oncologic Pathology	Anurag Vaidya et.al.	2501.16652	link
2025-01-26	TinyLLaVA-Video: A Simple Framework of Small-scale Large Multimodal Models for Video Understanding	Xingjian Zhang et.al.	2501.15513	link
2025-01-25	Deep Multimodal Learning for Real-Time DDoS Attacks Detection in Internet of Vehicles	Mohamed Ababsa et.al.	2501.15252	link
2025-01-25	PatentLMM: Large Multimodal Model for Generating Descriptions for Patent Figures	Shreya Shukla et.al.	2501.15074	link
2025-01-23	GeoPixel: Pixel Grounding Large Multimodal Model in Remote Sensing	Akashah Shabbir et.al.	2501.13925	link
2025-01-30	Temporal Preference Optimization for Long-Form Video Understanding	Rui Li et.al.	2501.13919	null
2025-01-23	Pix2Cap-COCO: Advancing Visual Comprehension via Pixel-Level Captioning	Zuyao You et.al.	2501.13893	link
2025-01-23	Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos	Kairui Hu et.al.	2501.13826	null
2025-01-23	Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge	Haomiao Xiong et.al.	2501.13468	link
2025-01-21	Multi-stage intermediate fusion for multimodal learning to classify non-small cell lung cancer subtypes from CT and PET	Fatih Aksu et.al.	2501.12425	null
2025-01-21	Vision-Language Models for Automated Chest X-ray Interpretation: Leveraging ViT and GPT-2	Md. Rakibul Islam et.al.	2501.12356	null
2025-01-28	Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks	Zhenhailong Wang et.al.	2501.11733	null
2025-01-20	ITCFN: Incomplete Triple-Modal Co-Attention Fusion Network for Mild Cognitive Impairment Conversion Prediction	Xiangyang Hu et.al.	2501.11276	link
2025-01-19	Multimodal Techniques for Malware Classification	Jonathan Jiang et.al.	2501.10956	null
2025-01-18	Fake Advertisements Detection Using Automated Multimodal Learning: A Case Study for Vietnamese Real Estate Data	Duy Nguyen et.al.	2501.10848	null
2025-01-17	FiLo++: Zero-/Few-Shot Anomaly Detection by Fused Fine-Grained Descriptions and Deformable Localization	Zhaopeng Gu et.al.	2501.10067	link
2025-01-17	TeamVision: An AI-powered Learning Analytics System for Supporting Reflection in Team-based Healthcare Simulation	Vanessa Echeverria et.al.	2501.09930	null
2025-01-19	IDEA: Image Description Enhanced CLIP-Adapter	Zhipeng Ye et.al.	2501.08816	link
2025-01-12	SCOT: Self-Supervised Contrastive Pretraining For Zero-Shot Compositional Retrieval	Bhavin Jawade et.al.	2501.08347	null
2025-01-14	Benchmarking Multimodal Models for Fine-Grained Image Analysis: A Comparative Study Across Diverse Visual Features	Evgenii Evstafev et.al.	2501.08170	null
2025-01-13	Exploring the Use of Contrastive Language-Image Pre-Training for Human Posture Classification: Insights from Yoga Pose Analysis	Andrzej D. Dobrzycki et.al.	2501.07221	null
2025-01-13	Boosting Text-To-Image Generation via Multilingual Prompting in Large Multimodal Models	Yongyu Mu et.al.	2501.07086	link
2025-01-13	Unveiling the Potential of Text in High-Dimensional Time Series Forecasting	Xin Zhou et.al.	2501.07048	link
2025-01-10	MinMo: A Multimodal Large Language Model for Seamless Voice Interaction	Qian Chen et.al.	2501.06282	null
2025-01-08	Generative AI for Cel-Animation: A Survey	Yunlong Tang et.al.	2501.06250	link
2025-01-07	Detection, Retrieval, and Explanation Unified: A Violence Detection System Based on Knowledge Graphs and GAT	Wen-Dong Jiang et.al.	2501.06224	null
2025-01-13	Valley2: Exploring Multimodal Models with Scalable Vision-Language Design	Ziheng Wu et.al.	2501.05901	link
2025-01-09	V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer	Hangzhou He et.al.	2501.04975	link
2025-01-08	Are They the Same? Exploring Visual Correspondence Shortcomings of Multimodal LLMs	Yikang Zhou et.al.	2501.04670	link
2025-01-07	LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token	Shaolei Zhang et.al.	2501.03895	link
2025-01-07	CL3DOR: Contrastive Learning for 3D Large Multimodal Models via Odds Ratio on High-Resolution Point Clouds	Keonwoo Kim et.al.	2501.03879	null
2025-01-06	CM3T: Framework for Efficient Multimodal Learning for Inhomogeneous Interaction Datasets	Tanay Agrawal et.al.	2501.03332	null
2025-01-06	EAGLE: Enhanced Visual Grounding Minimizes Hallucinations in Instructional Multimodal Models	Andrés Villa et.al.	2501.02699	null
2025-01-02	Asymmetric Reinforcing against Multi-modal Representation Bias	Xiyuan Gao et.al.	2501.01240	link
2025-01-02	3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer	Jiajun Deng et.al.	2501.01163	null
2025-01-02	Retrieval-Augmented Dynamic Prompt Tuning for Incomplete Multimodal Learning	Jian Lang et.al.	2501.01120	link
2025-01-10	Image-based Multimodal Models as Intruders: Transferable Multimodal Attacks on Video-based MLLMs	Linhao Huang et.al.	2501.01042	null
2025-01-11	Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models	Bin Wang et.al.	2501.01034	link
2025-01-01	Negative to Positive Co-learning with Aggressive Modality Dropout	Nicholas Magal et.al.	2501.00865	link
2024-12-31	OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning	Ling Fu et.al.	2501.00321	link
2024-12-31	Dual Diffusion for Unified Image Generation and Understanding	Zijie Li et.al.	2501.00289	null
2024-12-30	Hierarchical Banzhaf Interaction for General Video-Language Representation Learning	Peng Jin et.al.	2412.20964	link
2024-12-30	Dialogue Director: Bridging the Gap in Dialogue Visualization for Multimodal Storytelling	Min Zhang et.al.	2412.20725	null
2024-12-30	YOLO-UniOW: Efficient Universal Open-World Object Detection	Lihao Liu et.al.	2412.20645	link
2024-12-29	Audiopedia: Audio QA with Knowledge	Abhirama Subramanyam Penamakuri et.al.	2412.20619	link
2024-12-29	Diff4MMLiTS: Advanced Multimodal Liver Tumor Segmentation via Diffusion-Based Image Synthesis and Alignment	Shiyun Chen et.al.	2412.20418	null
2024-12-27	From Elements to Design: A Layered Approach for Automatic Graphic Design Composition	Jiawei Lin et.al.	2412.19712	null
2024-12-26	Multi-Head Attention Driven Dynamic Visual-Semantic Embedding for Enhanced Image-Text Matching	Wenjing Chen et.al.	2412.19184	null
2024-12-26	CLIP-GS: Unifying Vision-Language Representation with 3D Gaussian Splatting	Siyu Jiao et.al.	2412.19142	null
2024-12-24	MixMAS: A Framework for Sampling-Based Mixer Architecture Search for Multimodal Fusion and Learning	Abdelmadjid Chergui et.al.	2412.18437	link
2024-12-30	Molar: Multimodal LLMs with Collaborative Filtering Alignment for Enhanced Sequential Recommendation	Yucong Luo et.al.	2412.18176	null
2024-12-24	VisionLLM-based Multimodal Fusion Network for Glottic Carcinoma Early Detection	Zhaohui Jin et.al.	2412.18124	null
2024-12-23	Multimodal Learning with Uncertainty Quantification based on Discounted Belief Fusion	Grigor Bezirganyan et.al.	2412.18024	link
2024-12-23	Survey of Large Multimodal Model Datasets, Application Categories and Taxonomy	Priyaranjan Pattnayak et.al.	2412.17759	null
2024-12-25	Reasoning to Attend: Try to Understand How Token Works	Rui Qian et.al.	2412.17741	link
2024-12-23	EPE-P: Evidence-based Parameter-efficient Prompting for Multimodal Learning with Missing Modalities	Zhe Chen et.al.	2412.17677	link
2024-12-23	V $^2$ -SfMLearner: Learning Monocular Depth and Ego-motion for Multimodal Wireless Capsule Endoscopy	Long Bai et.al.	2412.17595	null
2024-12-23	More is Less? A Simulation-Based Approach to Dynamic Interactions between Biases in Multimodal Models	Mounia Drissi et.al.	2412.17505	null
2024-12-23	Diving into Self-Evolving Training for Multimodal Reasoning	Wei Liu et.al.	2412.17451	null
2024-12-23	VidCtx: Context-aware Video Question Answering with Image Models	Andreas Goulas et.al.	2412.17415	link
2024-12-22	Where am I? Cross-View Geo-localization with Natural Language Descriptions	Junyan Ye et.al.	2412.17007	null
2024-12-22	PromptDresser: Improving the Quality and Controllability of Virtual Try-On via Generative Textual Prompt and Prompt-aware Mask	Jeongho Kim et.al.	2412.16978	link
2024-12-21	SilVar: Speech Driven Multimodal Model for Reasoning Visual Question Answering and Object Localization	Tan-Hanh Pham et.al.	2412.16771	null
2024-12-21	LLaVA-SLT: Visual Language Tuning for Sign Language Translation	Han Liang et.al.	2412.16524	null
2024-12-20	A High-Quality Text-Rich Image Instruction Tuning Dataset via Hybrid Instruction Generation	Shijie Zhou et.al.	2412.16364	link
2024-12-20	Measuring Cross-Modal Interactions in Multimodal Models	Laura Wenderoth et.al.	2412.15828	link
2024-12-20	Precision ICU Resource Planning: A Multimodal Model for Brain Surgery Outcomes	Maximilian Fischer et.al.	2412.15818	null
2024-12-20	Error-driven Data-efficient Large Multimodal Model Tuning	Barry Menglong Yao et.al.	2412.15652	null
2024-12-19	OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving	Shuo Xing et.al.	2412.15208	link
2024-12-19	LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation	Weijia Shi et.al.	2412.15188	null
2024-12-19	Qwen2.5 Technical Report	Qwen et.al.	2412.15115	link
2024-12-19	Progressive Multimodal Reasoning via Active Retrieval	Guanting Dong et.al.	2412.14835	null
2024-12-21	Explainable Tampered Text Detection via Multimodal Large Models	Chenfan Qu et.al.	2412.14816	null
2024-12-18	Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception	Yanpeng Sun et.al.	2412.14233	link
2024-12-18	AnySat: An Earth Observation Model for Any Resolutions, Scales, and Modalities	Guillaume Astruc et.al.	2412.14123	link
2024-12-19	G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4o	Tony Cheng Tong et.al.	2412.13647	link
2024-12-18	Detecting Machine-Generated Music with Explainability – A Challenge and Early Benchmarks	Yupei Li et.al.	2412.13421	null
2024-12-17	DoPTA: Improving Document Layout Analysis using Patch-Text Alignment	Nikitha SR et.al.	2412.12902	null
2024-12-17	Multi-Dimensional Insights: Benchmarking Real-World Personalization in Large Multimodal Models	YiFan Zhang et.al.	2412.12606	null
2024-12-17	PBVS 2024 Solution: Self-Supervised Learning and Sampling Strategies for SAR Classification in Extreme Long-Tail Distribution	Yuhyun Kim et.al.	2412.12565	null
2024-12-17	Causal Diffusion Transformers for Generative Modeling	Chaorui Deng et.al.	2412.12095	link
2024-12-16	CPath-Omni: A Unified Multimodal Foundation Model for Patch and Whole Slide Image Analysis in Computational Pathology	Yuxuan Sun et.al.	2412.12077	null
2024-12-16	Gramian Multimodal Representation Learning and Alignment	Giordano Cicchetti et.al.	2412.11959	link
2024-12-16	LMM-Regularized CLIP Embeddings for Image Classification	Maria Tzelepi et.al.	2412.11663	null
2024-12-15	Seeing the Forest and the Trees: Solving Visual Graph and Tree Based Data Structure Problems using Large Multimodal Models	Sebastian Gutierrez et.al.	2412.11088	null
2024-12-13	Apollo: An Exploration of Video Understanding in Large Multimodal Models	Orr Zohar et.al.	2412.10360	null
2024-12-13	Performance of ChatGPT on tasks involving physics visual representations: the case of the Brief Electricity and Magnetism Assessment	Giulia Polverini et.al.	2412.10019	null
2024-12-12	Vision-Language Models Represent Darker-Skinned Black Individuals as More Homogeneous than Lighter-Skinned Black Individuals	Messi H. J. Lee et.al.	2412.09668	null
2024-12-12	Exemplar Masking for Multimodal Incremental Learning	Yi-Lun Lee et.al.	2412.09549	link
2024-12-12	Embeddings are all you need! Achieving High Performance Medical Image Classification through Training-Free Embedding Analysis	Raj Hansini Khoiwal et.al.	2412.09445	null
2024-12-12	Enhancing Modality Representation and Alignment for Multimodal Cold-start Active Learning	Meng Shen et.al.	2412.09126	null
2024-12-12	A Wander Through the Multimodal Landscape: Efficient Transfer Learning via Low-rank Sequence Multimodal Adapter	Zirun Guo et.al.	2412.08979	link
2024-12-11	StreamChat: Chatting with Streaming Video	Jihao Liu et.al.	2412.08646	null
2024-12-11	Multimodal Latent Language Modeling with Next-Token Diffusion	Yutao Sun et.al.	2412.08635	link
2024-12-12	Design2GarmentCode: Turning Design Concepts to Tangible Garments Through Program Synthesis	Feng Zhou et.al.	2412.08603	null
2024-12-11	Illusory VQA: Benchmarking and Enhancing Multimodal Models on Visual Illusions	Mohammadmostafa Rostamkhani et.al.	2412.08169	link
2024-12-10	Explaining and Mitigating the Modality Gap in Contrastive Multimodal Learning	Can Yaras et.al.	2412.07909	null
2024-12-10	BiMediX2: Bio-Medical EXpert LMM for Diverse Medical Modalities	Sahal Shaji Mullappilly et.al.	2412.07769	link
2024-12-10	ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer	Jinyi Hu et.al.	2412.07720	link
2024-12-13	DriveMM: All-in-One Large Multimodal Model for Autonomous Driving	Zhijian Huang et.al.	2412.07689	link
2024-12-10	Driving with InternVL: Oustanding Champion in the Track on Driving with Language of the Autonomous Grand Challenge at CVPR 2024	Jiahan Li et.al.	2412.07247	null
2024-12-10	Maya: An Instruction Finetuned Multilingual Multimodal Model	Nahid Alam et.al.	2412.07112	link
2024-12-09	How to Merge Your Multimodal Models Over Time?	Sebastian Dziadzio et.al.	2412.06712	link
2024-12-09	Ranked from Within: Ranking Large Multimodal Models for Visual Question Answering Without Labels	Weijie Tu et.al.	2412.06461	null
2024-12-09	iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models	Lianyu Hu et.al.	2412.06263	link
2024-12-08	A Self-Learning Multimodal Approach for Fake News Detection	Hao Chen et.al.	2412.05843	null
2024-12-08	SILMM: Self-Improving Large Multimodal Models for Compositional Text-to-Image Generation	Leigang Qu et.al.	2412.05818	null
2024-12-07	WavFusion: Towards wav2vec 2.0 Multimodal Speech Emotion Recognition	Feng Li et.al.	2412.05558	null
2024-12-07	Comprehensive Evaluation of Multimodal AI Models in Medical Imaging Diagnosis: From Data Augmentation to Preference-Based Comparison	Cailian Ruan et.al.	2412.05536	null
2024-12-06	Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling	Zhe Chen et.al.	2412.05271	link
2024-12-05	Lattice Lingo: Effect of Textual Detail on Multimodal Learning for Property Prediction of Crystals	Mrigi Munjal et.al.	2412.04670	null
2024-12-05	BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks	Juan Rodriguez et.al.	2412.04626	null
2024-12-05	MageBench: Bridging Large Multimodal Models to Agents	Miaosen Zhang et.al.	2412.04531	link
2024-12-04	Video Quality Assessment: A Comprehensive Survey	Qi Zheng et.al.	2412.04508	link
2024-12-05	SIDA: Social Media Image Deepfake Detection, Localization and Explanation with Large Multimodal Model	Zhenglin Huang et.al.	2412.04292	null
2024-12-05	CALMM-Drive: Confidence-Aware Autonomous Driving with Large Multimodal Model	Ruoyu Yao et.al.	2412.04209	null
2024-12-05	AIpparel: A Large Multimodal Generative Model for Digital Garments	Kiyohiro Nakayama et.al.	2412.03937	null
2024-12-05	MegaCOIN: Enhancing Medium-Grained Color Perception for Vision-Language Models	Ming-Chang Chiu et.al.	2412.03927	link
2024-12-04	Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning	Wujian Peng et.al.	2412.03565	link
2024-12-04	Training-Free Mitigation of Language Reasoning Degradation After Multimodal Instruction Tuning	Neale Ratzlaff et.al.	2412.03467	null
2024-12-06	SJTU:Spatial judgments in multimodal models towards unified segmentation through coordinate detection	Joongwon Chae et.al.	2412.02565	link
2024-12-03	Initial Study On Improving Segmentation By Combining Preoperative CT And Intraoperative CBCT Using Synthetic Data	Maximilian E. Tschuchnig et.al.	2412.02294	null
2024-12-05	CC-OCR: A Comprehensive and Challenging OCR Benchmark for Evaluating Large Multimodal Models in Literacy	Zhibo Yang et.al.	2412.02210	null
2024-12-03	VideoICL: Confidence-based Iterative In-context Learning for Out-of-Distribution Video Understanding	Kangsan Kim et.al.	2412.02186	link
2024-12-04	Agri-LLaVA: Knowledge-Infused Large Multimodal Assistant on Agricultural Pests and Diseases	Liqiong Wang et.al.	2412.02158	link
2024-12-02	Attacks on multimodal models	Viacheslav Iablochnikov et.al.	2412.01725	link
2024-12-02	LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant	Yikun Liu et.al.	2412.01720	null
2024-12-01	VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation	Weiming Ren et.al.	2412.00927	null
2024-11-30	MaintAGT:Sim2Real-Guided Multimodal Large Model for Intelligent Maintenance with Chain-of-Thought Reasoning	Hongliang He et.al.	2412.00481	null
2024-11-30	Approximate Fiber Product: A Preliminary Algebraic-Geometric Perspective on Multimodal Embedding Alignment	Dongfang Zhao et.al.	2412.00373	null
2024-12-04	ROSE: Revolutionizing Open-Set Dense Segmentation with Patch-Wise Perceptual Large Multimodal Model	Kunyang Han et.al.	2412.00153	null
2024-11-28	Sparse Attention Vectors: Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers	Chancharik Mitra et.al.	2412.00142	null
2024-12-02	LUMIA: Linear probing for Unimodal and MultiModal Membership Inference Attacks leveraging internal LLM states	Luis Ibanez-Lissen et.al.	2411.19876	null
2024-11-29	SDR-GNN: Spectral Domain Reconstruction Graph Neural Network for Incomplete Multimodal Learning in Conversational Emotion Recognition	Fangze Fu et.al.	2411.19822	link
2024-11-29	JetFormer: An Autoregressive Generative Model of Raw Images and Text	Michael Tschannen et.al.	2411.19722	link
2024-11-28	Beyond Logit Lens: Contextual Embeddings for Robust Hallucination Detection & Grounding in VLMs	Anirudh Phukan et.al.	2411.19187	null
2024-11-28	Examining Multimodal Gender and Content Bias in ChatGPT-4o	Roberto Balestri et.al.	2411.19140	null
2024-11-28	ScratchEval: Are GPT-4o Smarter than My Child? Evaluating Large Multimodal Models with Visual Programming Challenges	Rao Fu et.al.	2411.18932	link
2024-11-27	Active Data Curation Effectively Distills Large-Scale Multimodal Models	Vishaal Udandarao et.al.	2411.18674	null
2024-11-27	AMPS: ASR with Multimodal Paraphrase Supervision	Amruta Parulekar et.al.	2411.18368	null
2024-12-03	Large Language Model-Brained GUI Agents: A Survey	Chaoyun Zhang et.al.	2411.18279	link
2024-11-27	Grid-augumented vision: A simple yet effective approach for enhanced spatial understanding in multi-modal agents	Joongwon Chae et.al.	2411.18270	link
2024-11-27	Multimodal Integration of Longitudinal Noninvasive Diagnostics for Survival Prediction in Immunotherapy Using Deep Learning	Melda Yeghaian et.al.	2411.18253	link
2024-11-26	NEMO: Can Multimodal LLMs Identify Attribute-Modified Objects?	Jiaxuan Li et.al.	2411.17794	null
2024-11-26	Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis	Akshita Gupta et.al.	2411.17690	null
2024-11-26	AIGV-Assessor: Benchmarking and Evaluating the Perceptual Quality of Text-to-Video Generation with LMM	Jiarui Wang et.al.	2411.17221	link
2024-11-26	Learning Robust Anymodal Segmentor with Unimodal and Cross-modal Distillation	Xu Zheng et.al.	2411.17141	link
2024-11-26	Relations, Negations, and Numbers: Looking for Logic in Generative Text-to-Image Models	Colin Conwell et.al.	2411.17066	link
2024-11-26	Multimodal Alignment and Fusion: A Survey	Songtao Li et.al.	2411.17040	null
2024-11-27	SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE	Yongwei Chen et.al.	2411.16856	null
2024-11-23	Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents	Jun Chen et.al.	2411.16740	link
2024-11-26	All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages	Ashmal Vayani et.al.	2411.16508	link
2024-11-25	Boosting 3D Object Generation through PBR Materials	Yitong Wang et.al.	2411.16080	null
2024-11-24	M3-CVC: Controllable Video Compression with Multimodal Generative Models	Rui Wan et.al.	2411.15798	null
2024-11-23	Knowledge Transfer Across Modalities with Natural Language Supervision	Carlo Alberto Barbano et.al.	2411.15611	null
2024-11-23	From Complexity to Parsimony: Integrating Latent Class Analysis to Uncover Multimodal Learning Patterns in Collaborative Learning	Lixiang Yan et.al.	2411.15590	null
2024-11-23	Botfip-LLM: An Enhanced Multimodal Scientific Computing Framework Leveraging Knowledge Distillation from Large Language Models	Tianhao Chen et.al.	2411.15525	null
2024-11-23	MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking	Xinqi Liu et.al.	2411.15459	null
2024-11-23	freePruner: A Training-free Approach for Large Multimodal Model Acceleration	Bingxin Xu et.al.	2411.15446	null
2024-11-22	PRIMUS: Pretraining IMU Encoders with Multimodal Self-Supervision	Arnav M. Das et.al.	2411.15127	link
2024-11-22	Large Multi-modal Models Can Interpret Features in Large Multi-modal Models	Kaichen Zhang et.al.	2411.14982	link
2024-11-25	Information Extraction from Heterogeneous Documents without Ground Truth Labels using Synthetic Label Generation and Knowledge Distillation	Aniket Bhattacharyya et.al.	2411.14957	null
2024-11-22	Benchmarking Multimodal Models for Ukrainian Language Understanding Across Academic and Cultural Domains	Yurii Paniv et.al.	2411.14647	null
2024-11-21	Generative AI for Music and Audio	Hao-Wen Dong et.al.	2411.14627	null
2024-11-21	FuseGPT: Learnable Layers Fusion of Generative Pre-trained Transformers	Zehua Pei et.al.	2411.14507	null
2024-11-21	MMGenBench: Evaluating the Limits of LMMs from the Text-to-Image Generation Perspective	Hailang Huang et.al.	2411.14062	link
2024-11-21	Multimodal 3D Reasoning Segmentation with Complex Scenes	Xueying Jiang et.al.	2411.13927	null
2024-11-20	VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation	Ziyang Luo et.al.	2411.13281	null
2024-11-19	VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge	Vishwesh Nath et.al.	2411.12915	null
2024-11-19	Mitigating Perception Bias: A Training-Free Approach to Enhance LMM for Image Quality Assessment	Siyi Pan et.al.	2411.12791	null
2024-11-18	MMBind: Unleashing the Potential of Distributed and Heterogeneous Data for Multimodal Learning in IoT	Xiaomin Ouyang et.al.	2411.12126	null
2024-11-17	SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization	Hongrui Jia et.al.	2411.11909	link
2024-11-18	The Power of Many: Multi-Agent Multimodal Models for Cultural Image Captioning	Longju Bai et.al.	2411.11758	link
2024-11-18	Artificial Scientific Discovery	Antonio Norelli et.al.	2411.11672	null
2024-11-18	InstruGen: Automatic Instruction Generation for Vision-and-Language Navigation Via Large Multimodal Models	Yu Yan et.al.	2411.11394	null
2024-11-19	SoK: Unifying Cybersecurity and Cybersafety of Multimodal Foundation Models with an Information Theory Approach	Ruoxi Sun et.al.	2411.11195	null
2024-11-16	ViBe: A Text-to-Video Benchmark for Evaluating Hallucination in Large Multimodal Models	Vipula Rawte et.al.	2411.10867	null
2024-11-19	MLAN: Language-Based Instruction Tuning Improves Zero-Shot Generalization of Multimodal Large Language Models	Jianhong Tu et.al.	2411.10557	link
2024-11-15	Everything is a Video: Unifying Modalities through Next-Frame Prediction	G. Thomas Hudson et.al.	2411.10503	null
2024-11-15	Weakly-Supervised Multimodal Learning on MIMIC-CXR	Andrea Agostini et.al.	2411.10356	link
2024-11-21	Instruction-Guided Editing Controls for Images and Multimedia: A Survey in LLM era	Thanh Tam Nguyen et.al.	2411.09955	link
2024-11-14	Cross-Modal Consistency in Multimodal Large Language Models	Xiang Zhang et.al.	2411.09273	null
2024-11-14	SmartInv: Multimodal Learning for Smart Contract Invariant Inference	Sally Junsong Wang et.al.	2411.09217	null
2024-11-13	Multimodal Object Detection using Depth and Image Data for Manufacturing Parts	Nazanin Mahjourian et.al.	2411.09062	null
2024-11-13	Bridging the Visual Gap: Fine-Tuning Multimodal Models with Knowledge-Adapted Captions	Moran Yanuka et.al.	2411.09018	link
2024-11-13	AstroM $^3$ : A self-supervised multimodal model for astronomy	Mariia Rizhko et.al.	2411.08842	null
2024-11-13	Multimodal Instruction Tuning with Hybrid State Space Models	Jianing Zhou et.al.	2411.08840	null
2024-11-13	Retrieval Augmented Recipe Generation	Guoshan Liu et.al.	2411.08715	null
2024-11-12	DPU: Dynamic Prototype Updating for Multimodal Out-of-Distribution Detection	Shawn Li et.al.	2411.08227	link
2024-11-12	Leveraging Multimodal Models for Enhanced Neuroimaging Diagnostics in Alzheimer’s Disease	Francesco Chiumento et.al.	2411.07871	null
2024-11-12	SparrowVQE: Visual Question Explanation for Course Content Understanding	Jialu Li et.al.	2411.07516	link
2024-11-12	BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions	Anas Awadalla et.al.	2411.07461	null
2024-11-11	Multimodal Fusion Balancing Through Game-Theoretic Regularization	Konstantinos Kontras et.al.	2411.07335	null
2024-11-11	OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision	Cong Wei et.al.	2411.07199	null
2024-11-09	M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework	Yew Ken Chia et.al.	2411.06176	null
2024-11-09	An Empirical Analysis on Spatial Reasoning Capabilities of Large Multimodal Models	Fatemeh Shiri et.al.	2411.06048	link
2024-11-08	Towards Low-Resource Harmful Meme Detection with LMM Agents	Jianzhao Huang et.al.	2411.05383	link
2024-11-08	Exploring the Alignment Landscape: LLMs and Geometric Deep Models in Protein Representation	Dong Shu et.al.	2411.05316	link
2024-11-07	HourVideo: 1-Hour Video-Language Understanding	Keshigeyan Chandrasegaran et.al.	2411.04998	link
2024-11-07	VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos	Shehan Munasinghe et.al.	2411.04923	null
2024-11-07	Exploring Hierarchical Molecular Graph Representation in Multimodal LLMs	Chengxin Hu et.al.	2411.04708	null
2024-11-06	AutoGameUI: Constructing High-Fidelity Game UIs via Multimodal Learning and Interactive Web-Based Tool	Zhongliang Tang et.al.	2411.03709	null
2024-11-05	MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning	Ziliang Gan et.al.	2411.03314	null
2024-11-05	HumanVLM: Foundation for Human-Scene Vision-Language Model	Dawei Dai et.al.	2411.03034	null
2024-11-05	Toward Robust Incomplete Multimodal Sentiment Analysis via Hierarchical Representation Learning	Mingcheng Li et.al.	2411.02793	null
2024-11-11	INQUIRE: A Natural World Text-to-Image Retrieval Benchmark	Edward Vendrow et.al.	2411.02537	link
2024-11-04	See it, Think it, Sorted: Large Multimodal Models are Few-shot Time Series Anomaly Analyzers	Jiaxin Zhuang et.al.	2411.02465	null
2024-11-07	TableGPT2: A Large Multimodal Model with Tabular Data Integration	Aofeng Su et.al.	2411.02059	link
2024-11-04	Foundations and Recent Trends in Multimodal Mobile Agents: A Survey	Biao Wu et.al.	2411.02006	link
2024-11-04	KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension	Jie Yang et.al.	2411.01846	null
2024-11-03	EEE-Bench: A Comprehensive Multimodal Electrical And Electronics Engineering Benchmark	Ming Li et.al.	2411.01492	null
2024-11-03	Classifier-guided Gradient Modulation for Enhanced Multimodal Learning	Zirun Guo et.al.	2411.01409	link
2024-11-02	LoRA-Contextualizing Adaptation of Large Multimodal Models for Long Document Understanding	Jian Chen et.al.	2411.01106	null
2024-11-01	Text2Freq: Learning Series Patterns from Text via Frequency Domain	Ming-Chih Lo et.al.	2411.00929	null
2024-11-01	V-LoRA: An Efficient and Flexible System Boosts Vision Applications with LoRA LMM	Liang Mi et.al.	2411.00915	null
2024-11-01	Analyzing Multimodal Integration in the Variational Autoencoder from an Information-Theoretic Perspective	Carlotta Langer et.al.	2411.00522	null
2024-10-31	TurtleBench: A Visual Programming Benchmark in Turtle Geometry	Sina Rismanchian et.al.	2411.00264	link
2024-10-31	ResiDual Transformer Alignment with Spectral Decomposition	Lorenzo Basile et.al.	2411.00246	null
2024-10-31	Nearest Neighbor Normalization Improves Multimodal Retrieval	Neil Chowdhury et.al.	2410.24114	link
2024-11-04	AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents	Yifan Xu et.al.	2410.24024	link
2024-10-31	Audio Is the Achilles’ Heel: Red Teaming Audio Large Multimodal Models	Hao Yang et.al.	2410.23861	link
2024-10-30	CLIPErase: Efficient Unlearning of Visual-Textual Associations in CLIP	Tianyu Yang et.al.	2410.23330	null
2024-10-30	EMMA: End-to-End Multimodal Model for Autonomous Driving	Jyh-Jing Hwang et.al.	2410.23262	null
2024-10-29	ProMQA: Question Answering Dataset for Multimodal Procedural Activity Understanding	Kimihiro Hasegawa et.al.	2410.22211	link
2024-10-29	Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications	Monica Riedler et.al.	2410.21943	link
2024-10-28	AiSciVision: A Framework for Specializing Large Multimodal Models in Scientific Image Classification	Brendan Hogan et.al.	2410.21480	link
2024-10-27	Mind Your Step (by Step): Chain-of-Thought can Reduce Performance on Tasks where Thinking Makes Humans Worse	Ryan Liu et.al.	2410.21333	null
2024-10-28	IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream Tasks	Manjunath D et.al.	2410.20953	link
2024-10-27	Generator Matching: Generative modeling with arbitrary Markov processes	Peter Holderrieth et.al.	2410.20587	null
2024-10-27	PaPaGei: Open Foundation Models for Optical Physiological Signals	Arvind Pillai et.al.	2410.20542	link
2024-10-25	Turn-by-Turn Indoor Navigation for the Visually Impaired	Santosh Srinivasaiah et.al.	2410.19954	null
2024-10-25	A Multimodal Approach For Endoscopic VCE Image Classification Using BiomedCLIP-PubMedBERT	Nagarajan Ganapathy et.al.	2410.19944	link
2024-10-25	OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization	Hongliang He et.al.	2410.19609	link
2024-10-24	Visual Text Matters: Improving Text-KVQA with Visual Text Entity Knowledge-aware Large Multimodal Assistant	Abhirama Subramanyam Penamakuri et.al.	2410.19144	link
2024-10-24	VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks	Lawrence Jang et.al.	2410.19100	null
2024-10-24	CAMEL-Bench: A Comprehensive Arabic LMM Benchmark	Sara Ghaboura et.al.	2410.18976	link
2024-10-24	Deep Insights into Cognitive Decline: A Survey of Leveraging Non-Intrusive Modalities with Deep Learning Techniques	David Ortiz-Perez et.al.	2410.18972	null
2024-10-24	OSCAR: Operating System Control via State-Aware Reasoning and Re-Planning	Xiaoqiang Wang et.al.	2410.18963	null
2024-10-24	A Survey of Multimodal Sarcasm Detection	Shafkat Farabi et.al.	2410.18882	null
2024-10-27	R-CoT: Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models	Linger Deng et.al.	2410.17885	link
2024-10-22	JMMMU: A Japanese Massive Multi-discipline Multimodal Understanding Benchmark for Culture-aware Evaluation	Shota Onohara et.al.	2410.17250	null
2024-10-22	An Eye for an AI: Evaluating GPT-4o’s Visual Perception Skills and Geometric Reasoning Skills Using Computer Graphics Questions	Tony Haoran Feng et.al.	2410.16991	null
2024-10-21	DocEdit-v2: Document Structure Editing Via Multimodal LLM Grounding	Manan Suri et.al.	2410.16472	null
2024-10-21	Promoting cross-modal representations to improve multimodal foundation models for physiological signals	Ching Fang et.al.	2410.16424	null
2024-10-22	Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance	Zhangwei Gao et.al.	2410.16261	link
2024-10-22	MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report	Samrajya Thapa et.al.	2410.16239	link
2024-10-21	Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models	Yufei Zhan et.al.	2410.16163	link
2024-10-21	LMHaze: Intensity-aware Image Dehazing with a Large-scale Multi-intensity Real Haze Dataset	Ruikun Zhang et.al.	2410.16095	link
2024-10-21	How to Build a Pre-trained Multimodal model for Simultaneously Chatting and Decision-making?	Zuojin Tang et.al.	2410.15885	null
2024-10-21	Multimodal Learning for Embryo Viability Prediction in Clinical IVF	Junsik Kim et.al.	2410.15581	null
2024-10-20	IPO: Interpretable Prompt Optimization for Vision-Language Models	Yingjun Du et.al.	2410.15397	link
2024-10-20	Modality-Fair Preference Optimization for Trustworthy MLLM Alignment	Songtao Jiang et.al.	2410.15334	null
2024-10-19	ChitroJera: A Regionally Relevant Visual Question Answering Dataset for Bangla	Deeparghya Dutta Barua et.al.	2410.14991	null
2024-10-19	SemiHVision: Enhancing Medical Multimodal Models with a Semi-Human Annotated Dataset and Fine-Tuned Instruction Generation	Junda Wang et.al.	2410.14948	link
2024-10-18	Croc: Pretraining Large Multimodal Models with Cross-Modal Comprehension	Yin Xie et.al.	2410.14332	link
2024-10-18	Personalized Image Generation with Large Multimodal Models	Yiyan Xu et.al.	2410.14170	link
2024-10-18	Coherence-Driven Multimodal Safety Dialogue with Active Learning for Embodied Agents	Sabit Hassan et.al.	2410.14141	null
2024-10-17	Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation	Chengyue Wu et.al.	2410.13848	link
2024-10-18	Harnessing Webpage UIs for Text-Rich Visual Understanding	Junpeng Liu et.al.	2410.13824	null
2024-10-17	Parameter-efficient Adaptation of Multilingual Multimodal Models for Low-resource ASR	Abhishek Gupta et.al.	2410.13445	null
2024-10-16	The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio	Sicong Leng et.al.	2410.12787	null
2024-10-16	HumanEval-V: Evaluating Visual Understanding and Reasoning Abilities of Large Multimodal Models Through Coding Tasks	Fengji Zhang et.al.	2410.12381	link
2024-10-15	CtrlSynth: Controllable Image Text Synthesis for Data-Efficient Multimodal Learning	Qingqing Cao et.al.	2410.11963	null
2024-10-15	Generalizable Spacecraft Trajectory Generation via Multimodal Learning with Transformers	Davide Celestini et.al.	2410.11723	null
2024-10-15	Unveiling the Mystery of Visual Attributes of Concrete and Abstract Concepts: Variability, Nearest Neighbors, and Challenging Categories	Tarun Tater et.al.	2410.11657	link
2024-10-15	On-the-fly Modulation for Balanced Multimodal Learning	Yake Wei et.al.	2410.11582	link
2024-10-15	Enhancing Unimodal Latent Representations in Multimodal VAEs through Iterative Amortized Inference	Yuta Oshima et.al.	2410.11403	null
2024-10-14	Saliency Guided Optimization of Diffusion Latents	Xiwen Wang et.al.	2410.10257	null
2024-10-14	MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models	Peng Xia et.al.	2410.10139	link
2024-10-13	LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models	Junyan Ye et.al.	2410.09732	null
2024-10-12	Reconstructive Visual Instruction Tuning	Haochen Wang et.al.	2410.09575	null
2024-10-11	Can GPTs Evaluate Graphic Design Based on Design Principles?	Daichi Haraguchi et.al.	2410.08885	null
2024-10-11	VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understanding	Houlun Chen et.al.	2410.08593	link
2024-10-10	ElasticTok: Adaptive Tokenization for Image and Video	Wilson Yan et.al.	2410.08368	null
2024-10-10	Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-Experts	Sukwon Yun et.al.	2410.08245	link
2024-10-10	LatteCLIP: Unsupervised CLIP Fine-Tuning via LMM-Synthetic Texts	Anh-Quan Cao et.al.	2410.08211	null
2024-10-10	Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision	Shengcao Cao et.al.	2410.08209	null
2024-10-10	MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models	Wenbo Hu et.al.	2410.08182	null
2024-10-10	Generated Bias: Auditing Internal Bias Dynamics of Text-To-Image Generative Models	Abhishek Mandal et.al.	2410.07884	null
2024-10-09	The Cognitive Capabilities of Generative AI: A Comparative Analysis with Human Benchmarks	Isaac R. Galatzer-Levy et.al.	2410.07391	null
2024-10-12	Deep Correlated Prompting for Visual Recognition with Missing Modalities	Lianyu Hu et.al.	2410.06558	link
2024-10-11	Chip-Tuning: Classify Before Language Models Say	Fangwei Zhu et.al.	2410.06541	link
2024-10-09	Does Spatial Cognition Emerge in Frontier Models?	Santhosh Kumar Ramakrishnan et.al.	2410.06468	link
2024-10-08	Multimodal Representation Learning using Adaptive Graph Construction	Weichen Huang et.al.	2410.06395	null
2024-10-08	Temporal Image Caption Retrieval Competition – Description and Results	Jakub Pokrywka et.al.	2410.06314	null
2024-10-08	PDF-WuKong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling	Xudong Xie et.al.	2410.05970	link
2024-10-08	ModalPrompt:Dual-Modality Guided Prompt for Continual Learning of Large Multimodal Models	Fanhu Zeng et.al.	2410.05849	null
2024-10-08	Multimodal Large Language Models and Tunings: Vision, Language, Sensors, Audio, and Beyond	Soyeon Caren Han et.al.	2410.05608	link
2024-10-08	TeaserGen: Generating Teasers for Long Documentaries	Weihan Xu et.al.	2410.05586	null
2024-10-07	R-Bench: Are your Large Multimodal Model Robust to Real-world Corruptions?	Chunyi Li et.al.	2410.05474	link
2024-10-07	RespLLM: Unifying Audio and Text with Multimodal LLMs for Generalized Respiratory Health Prediction	Yuwei Zhang et.al.	2410.05361	null
2024-10-07	Patch is Enough: Naturalistic Adversarial Patch against Vision-Language Pre-training Models	Dehong Kong et.al.	2410.04884	null
2024-10-06	VISTA: A Visual and Textual Attention Dataset for Interpreting Multimodal Models	Harshit et.al.	2410.04609	null
2024-10-06	UniMuMo: Unified Text, Music and Motion Generation	Han Yang et.al.	2410.04534	link
2024-10-08	Gamified crowd-sourcing of high-quality data for visual fine-tuning	Shashank Yadav et.al.	2410.04038	null
2024-10-07	Multimodal Point-of-Interest Recommendation	Yuta Kanzawa et.al.	2410.03265	null
2024-10-04	Bridging the Gap between Text, Audio, Image, and Any Sequence: A Novel Approach using Gloss-based Annotation	Sen Fang et.al.	2410.03146	null
2024-10-04	AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark	Wenhao Chai et.al.	2410.03051	null
2024-10-07	CPFD: Confidence-aware Privileged Feature Distillation for Short Video Classification	Jinghao Shi et.al.	2410.03038	null
2024-10-07	MMP: Towards Robust Multi-Modal Learning with Masked Modality Projection	Niki Nezakati et.al.	2410.03010	null
2024-10-03	Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos	Jianrui Zhang et.al.	2410.02763	null
2024-10-03	Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models	Zhengfeng Lai et.al.	2410.02740	null
2024-10-04	Video Instruction Tuning With Synthetic Data	Yuanhan Zhang et.al.	2410.02713	null
2024-10-03	LLaVA-Critic: Learning to Evaluate Multimodal Models	Tianyi Xiong et.al.	2410.02712	null
2024-10-03	Plots Unlock Time-Series Understanding in Multimodal Models	Mayank Daswani et.al.	2410.02637	null
2024-10-02	Anchors Aweigh! Sail for Optimal Unified Multi-Modal Representations	Minoh Jeong et.al.	2410.02086	null
2024-10-02	Toward a Holistic Evaluation of Robustness in CLIP Models	Weijie Tu et.al.	2410.01534	null
2024-10-02	SHAP-CAT: A interpretable multi-modal framework enhancing WSI classification via virtual staining and shapley-value-based multimodal fusion	Jun Wang et.al.	2410.01408	null
2024-10-02	Backdooring Vision-Language Models with Out-Of-Distribution Data	Weimin Lyu et.al.	2410.01264	null
2024-10-02	OCC-MLLM:Empowering Multimodal Large Language Model For the Understanding of Occluded Objects	Wenmo Qiu et.al.	2410.01261	null
2024-09-30	Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning	Weitai Kang et.al.	2410.00255	link
2024-09-30	Using Large Multimodal Models to Extract Knowledge Components for Knowledge Tracing from Multimedia Question Information	Hyeongdon Moon et.al.	2409.20167	link
2024-10-02	Visual Context Window Extension: A New Perspective for Long Video Understanding	Hongchen Wei et.al.	2409.20018	null
2024-09-30	Towards Robust Multimodal Sentiment Analysis with Incomplete Data	Haoyu Zhang et.al.	2409.20012	link
2024-09-28	FairPIVARA: Reducing and Assessing Biases in CLIP-Based Multimodal Models	Diego A. B. Moreira et.al.	2409.19474	link
2024-09-28	From Unimodal to Multimodal: Scaling up Projectors to Align Modalities	Mayug Maniparambil et.al.	2409.19425	null
2024-10-02	CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet Upcycling	Jihai Zhang et.al.	2409.19291	link
2024-09-28	TrojVLM: Backdoor Attack Against Vision Language Models	Weimin Lyu et.al.	2409.19232	null
2024-09-27	Multimodal Markup Document Models for Graphic Design Completion	Kotaro Kikuchi et.al.	2409.19051	null
2024-09-27	Emu3: Next-Token Prediction is All You Need	Xinlong Wang et.al.	2409.18869	null
2024-09-27	Data Analysis in the Era of Generative AI	Jeevana Priya Inala et.al.	2409.18475	null
2024-09-26	MultiClimate: Multimodal Stance Detection on Climate Change Videos	Jiawen Wang et.al.	2409.18346	link
2024-09-26	LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness	Chenming Zhu et.al.	2409.18125	null
2024-09-26	GSON: A Group-based Social Navigation Framework with Large Multimodal Model	Shangyi Luo et.al.	2409.18084	null
2024-09-26	A Multimodal Single-Branch Embedding Network for Recommendation in Cold-Start and Missing Modality Scenarios	Christian Ganhör et.al.	2409.17864	link
2024-09-26	Harnessing Shared Relations via Multimodal Mixup Contrastive Learning for Multimodal Classification	Raja Kumar et.al.	2409.17777	link
2024-09-26	MIO: A Foundation Model on Multimodal Tokens	Zekun Wang et.al.	2409.17692	link
2024-09-25	Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models	Matt Deitke et.al.	2409.17146	link
2024-09-24	CDChat: A Large Multimodal Model for Remote Sensing Change Description	Mubashir Noman et.al.	2409.16261	link
2024-09-24	CLSP: High-Fidelity Contrastive Language-State Pre-training for Agent State Representation	Fuxian Huang et.al.	2409.15806	null
2024-09-18	Recommendation with Generative Models	Yashar Deldjoo et.al.	2409.15173	null
2024-09-23	With Ears to See and Eyes to Hear: Sound Symbolism Experiments with Multimodal Large Language Models	Tyler Loakman et.al.	2409.14917	link
2024-09-22	Patch Ranking: Efficient CLIP by Learning to Rank Local Patches	Cheng-En Wu et.al.	2409.14607	link
2024-09-22	Can-Do! A Dataset and Neuro-Symbolic Grounded Framework for Embodied Planning with Large Multimodal Models	Yew Ken Chia et.al.	2409.14277	null
2024-09-20	Brain-Cognition Fingerprinting via Graph-GCCA with Contrastive Learning	Yixin Wang et.al.	2409.13887	null
2024-09-20	Instruction-guided Multi-Granularity Segmentation and Captioning with Large Multimodal Model	Li Zhou et.al.	2409.13407	link
2024-09-20	A Novel Adaptive Fine-Tuning Algorithm for Multimodal Models: Self-Optimizing Classification and Selection of High-Quality Datasets in Remote Sensing	Yi Ren et.al.	2409.13345	null
2024-09-20	ChemDFM-X: Towards Large Multimodal Model for Chemistry	Zihan Zhao et.al.	2409.13194	null
2024-09-19	MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines	Dongzhi Jiang et.al.	2409.12959	null
2024-09-24	TinyVLA: Towards Fast, Data-Efficient Vision-Language-Action Models for Robotic Manipulation	Junjie Wen et.al.	2409.12514	null
2024-09-18	Qwen2-VL: Enhancing Vision-Language Model’s Perception of the World at Any Resolution	Peng Wang et.al.	2409.12191	link
2024-09-18	All-in-one foundational models learning across quantum chemical levels	Yuxinxin Chen et.al.	2409.12015	link
2024-09-18	LMMCoDrive: Cooperative Driving with Large Multimodal Model	Haichao Liu et.al.	2409.11981	link
2024-09-16	MusicLIME: Explainable Multimodal Music Understanding	Theodoros Sotirou et.al.	2409.10496	link
2024-09-19	IRIS: Interactive Responsive Intelligent Segmentation for 3D Affordance Analysis	Meng Chu et.al.	2409.10078	null
2024-09-16	AceParse: A Comprehensive Dataset with Diverse Structured Texts for Academic Literature Parsing	Huawei Ji et.al.	2409.10016	link
2024-09-14	Keypoints-Integrated Instruction-Following Data Generation for Enhanced Human Pose Understanding in Multimodal Models	Dewen Zhang et.al.	2409.09306	null
2024-09-13	Interactive Masked Image Modeling for Multimodal Object Detection in Remote Sensing	Minh-Duc Vu et.al.	2409.08885	null
2024-09-13	A Multimodal Approach for Fluid Overload Prediction: Integrating Lung Ultrasound and Clinical Data	Tianqi Yang et.al.	2409.08790	null
2024-09-13	Dynamics of Collective Group Affect: Group-level Annotations and the Multimodal Modeling of Convergence and Divergence	Navin Raj Prabhu et.al.	2409.08578	null
2024-09-13	A Comprehensive Survey on Deep Multimodal Learning with Missing Modality	Renjie Wu et.al.	2409.07825	null
2024-09-12	Top-down Activity Representation Learning for Video Question Answering	Yanan Wang et.al.	2409.07748	null
2024-09-11	What to align in multimodal contrastive learning?	Benoit Dufumier et.al.	2409.07402	null
2024-09-11	MVLLaVA: An Intelligent Agent for Unified and Flexible Novel View Synthesis	Hanyu Jiang et.al.	2409.07129	null
2024-09-11	FSMDet: Vision-guided feature diffusion for fully sparse 3D detector	Tianran Liu et.al.	2409.06945	null
2024-09-16	Scaling Law Hypothesis for Multimodal Model	Qingyun Sun et.al.	2409.06754	null
2024-09-10	Multiclass Arrhythmia Classification using Smartwatch Photoplethysmography Signals Collected in Real-life Settings	Dong Han et.al.	2409.06147	null
2024-09-11	A Survey of Multimodal Composite Editing and Retrieval	Suyan Li et.al.	2409.05405	link
2024-09-05	Learning in Order! A Sequential Strategy to Learn Invariant Features for Multimodal Sentiment Analysis	Xianbing Zhao et.al.	2409.04473	null
2024-09-06	Generating Faithful and Salient Text from Multimodal Data	Tahsina Hashem et.al.	2409.03961	link
2024-09-06	CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models	Wentao Liu et.al.	2409.02834	link
2024-09-10	MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark	Xiang Yue et.al.	2409.02813	null
2024-09-04	Understanding eGFR Trajectories and Kidney Function Decline via Large Multimodal Models	Chih-Yuan Li et.al.	2409.02530	null
2024-09-03	Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models	Bin Fu et.al.	2409.01560	null
2024-09-03	Think Twice Before Recognizing: Large Multimodal Models for General Fine-grained Traffic Sign Recognition	Yaozong Gan et.al.	2409.01534	null
2024-09-02	Towards General Industrial Intelligence: A Survey on IIoT-Enhanced Continual Large Models	Jiao Chen et.al.	2409.01207	null
2024-09-02	Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided by Text Information	Yi Chen et.al.	2409.01179	link
2024-08-31	Comparative Analysis of Modality Fusion Approaches for Audio-Visual Person Identification and Verification	Aref Farhadipour et.al.	2409.00562	null
2024-08-30	UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios	Baichuan Zhou et.al.	2408.17267	link
2024-08-29	Seeking the Sufficiency and Necessity Causal Features in Multimodal Representation Learning	Boyu Chen et.al.	2408.16577	null
2024-08-29	Toward Robust Early Detection of Alzheimer’s Disease via an Integrated Multimodal Learning Approach	Yifei Chen et.al.	2408.16343	link
2024-08-28	Meta-Learn Unimodal Signals with Weak Supervision for Multimodal Sentiment Analysis	Sijie Mai et.al.	2408.16029	null
2024-08-28	ModalityMirror: Improving Audio Classification in Modality Heterogeneity Federated Learning with Multimodal Distillation	Tiantian Feng et.al.	2408.15803	null
2024-08-28	Visual Prompt Engineering for Medical Vision Language Models in Radiology	Stefan Denner et.al.	2408.15802	null
2024-08-27	X-Reflect: Cross-Reflection Prompting for Multimodal Recommendation	Hanjia Lyu et.al.	2408.15172	null
2024-08-27	The Benefits of Balance: From Information Projections to Variance Reduction	Lang Liu et.al.	2408.15065	null
2024-08-27	NeuralOOD: Improving Out-of-Distribution Generalization Performance with Brain-machine Fusion Learning Framework	Shuangchen Zhao et.al.	2408.14950	null
2024-08-26	MMR: Evaluating Reading Ability of Large Multimodal Models	Jian Chen et.al.	2408.14594	null
2024-09-03	Foundation Models for Music: A Survey	Yinghao Ma et.al.	2408.14340	link
2024-08-26	LMM-VQA: Advancing Video Quality Assessment with Large Multimodal Models	Qihang Ge et.al.	2408.14008	null
2024-08-27	Quantum Multimodal Contrastive Learning Framework	Chi-Sheng Chen et.al.	2408.13919	null
2024-08-25	Tangram: A Challenging Benchmark for Geometric Element Recognizing	Jiamin Tang et.al.	2408.13854	null
2024-08-25	Multimodal Ensemble with Conditional Feature Fusion for Dysgraphia Diagnosis in Children from Handwriting Samples	Jayakanth Kunhoth et.al.	2408.13754	null
2024-08-24	Preliminary Investigations of a Multi-Faceted Robust and Synergistic Approach in Semiconductor Electron Micrograph Analysis: Integrating Vision Transformers with Large Language and Multimodal Models	Sakhinana Sagar Srinivas et.al.	2408.13621	null
2024-08-23	Foundational Model for Electron Micrograph Analysis: Instruction-Tuning Small-Scale Language-and-Vision Assistant for Enterprise Adoption	Sakhinana Sagar Srinivas et.al.	2408.13248	null
2024-08-23	Indoor scene recognition from images under visual corruptions	Willams de Lima Costa et.al.	2408.13029	null
2024-08-23	Ada2I: Enhancing Modality Balance for Multimodal Conversational Emotion Recognition	Cam-Van Thi Nguyen et.al.	2408.12895	null
2024-08-23	Has Multimodal Learning Delivered Universal Intelligence in Healthcare? A Comprehensive Survey	Qika Lin et.al.	2408.12880	link
2024-08-22	Assessing Modality Bias in Video Question Answering Benchmarks with Multimodal Large Language Models	Jean Park et.al.	2408.12763	null
2024-08-22	Integrating Audio, Visual, and Semantic Information for Enhanced Multimodal Speaker Diarization	Luyao Cheng et.al.	2408.12102	null
2024-08-22	Mental-Perceiver: Audio-Textual Multimodal Learning for Mental Health Assessment	Jinghui Qin et.al.	2408.12088	null
2024-08-21	GRAB: A Challenging GRaph Analysis Benchmark for Large Multimodal Models	Jonathan Roberts et.al.	2408.11817	null
2024-08-21	D-RMGPT: Robot-assisted collaborative tasks driven by large multimodal models	M. Forlini et.al.	2408.11761	null
2024-08-21	UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation	Xiangyu Zhao et.al.	2408.11305	link
2024-08-21	BearLLM: A Prior Knowledge-Enhanced Bearing Health Management Framework with Unified Vibration Signal Representation	Haotian Peng et.al.	2408.11281	link
2024-08-20	Exploring the use of Generative AI to Support Automated Just-in-Time Programming for Visual Scene Displays	Cynthia Zastudil et.al.	2408.11137	null
2024-08-21	SZTU-CMU at MER2024: Improving Emotion-LLaMA with Conv-Attention for Multimodal Emotion Recognition	Zebang Cheng et.al.	2408.10500	link
2024-08-19	Enhance Modality Robustness in Text-Centric Multimodal Alignment with Adversarial Prompting	Yun-Da Tsai et.al.	2408.09798	null
2024-08-19	Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation	Yunxin Li et.al.	2408.09787	link
2024-08-18	PA-LLaVA: A Large Language-Vision Assistant for Human Pathology Image Understanding	Dawei Dai et.al.	2408.09530	link
2024-08-17	Measuring Visual Sycophancy in Multimodal Models	Jaehyuk Lim et.al.	2408.09111	link
2024-08-16	AdaRank: Disagreement Based Module Rank Prediction for Low-rank Adaptation	Yihe Dong et.al.	2408.09015	link
2024-08-16	xGen-MM (BLIP-3): A Family of Open Large Multimodal Models	Le Xue et.al.	2408.08872	null
2024-08-16	Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs	Jinming Liu et.al.	2408.08575	null
2024-08-15	LLaVA-Surg: Towards Multimodal Surgical Assistant via Structured Surgical Video Learning	Jiajie Li et.al.	2408.07981	null
2024-08-15	MathScape: Evaluating MLLMs in multimodal Math Scenarios through a Hierarchical Benchmark	Minxuan Zhou et.al.	2408.07543	link
2024-08-14	Modality Invariant Multimodal Learning to Handle Missing Modalities: A Single-Branch Approach	Muhammad Saad Saeed et.al.	2408.07445	null
2024-08-14	Robust Semi-supervised Multimodal Medical Image Segmentation via Cross Modality Collaboration	Xiaogen Zhon et.al.	2408.07341	link
2024-08-14	Enhancing Visual Question Answering through Ranking-Based Hybrid Training and Multimodal Fusion	Peiyuan Chen et.al.	2408.07303	null
2024-08-13	PathInsight: Instruction Tuning of Multimodal Datasets and Models for Intelligence Assisted Diagnosis in Histopathology	Xiaomin Wu et.al.	2408.07037	null
2024-08-13	EditScribe: Non-Visual Image Editing with Natural Language Verification Loops	Ruei-Che Chang et.al.	2408.06632	null
2024-08-13	CROME: Cross-Modal Adapters for Efficient Multimodal LLM	Sayna Ebrahimi et.al.	2408.06610	null
2024-08-13	Prioritizing Modalities: Flexible Importance Scheduling in Federated Multimodal Learning	Jieming Bian et.al.	2408.06549	null
2024-08-12	VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents	Xiao Liu et.al.	2408.06327	link
2024-08-11	HateSieve: A Contrastive Learning Framework for Detecting and Segmenting Hateful Content in Multimodal Memes	Xuanyu Su et.al.	2408.05794	null
2024-08-08	Enhancing Journalism with AI: A Study of Contextualized Image Captioning for News Articles using LLMs and LMMs	Aliki Anagnostopoulou et.al.	2408.04331	null
2024-08-06	LLaVA-OneVision: Easy Visual Task Transfer	Bo Li et.al.	2408.03326	link
2024-08-06	Multitask and Multimodal Neural Tuning for Large Models	Hao Sun et.al.	2408.03001	null
2024-08-06	Body of Her: A Preliminary Study on End-to-End Humanoid Agent	Tenglong Ao et.al.	2408.02879	null
2024-08-04	Distribution-Level Memory Recall for Continual Learning: Preserving Knowledge and Avoiding Confusion	Shaoxu Cheng et.al.	2408.02695	null
2024-08-02	A Systematic Review of Intermediate Fusion in Multimodal Deep Learning for Biomedical Applications	Valerio Guarrasi et.al.	2408.02686	null
2024-08-05	REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models	Agneet Chatterjee et.al.	2408.02231	null
2024-08-04	CACE-Net: Co-guidance Attention and Contrastive Enhancement for Effective Audio-Visual Event Localization	Xiang He et.al.	2408.01952	link
2024-08-02	MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language Models	Benno Weck et.al.	2408.01337	link
2024-08-05	Dissecting Dissonance: Benchmarking Large Multimodal Models Against Self-Contradictory Instructions	Jin Gao et.al.	2408.01091	link
2024-08-02	GraphAge: Unleashing the power of Graph Neural Network to Decode Epigenetic Aging	Saleh Sakib Ahmed et.al.	2408.00984	link
2024-08-01	MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities	Weihao Yu et.al.	2408.00765	link
2024-08-01	GalleryGPT: Analyzing Paintings with Large Multimodal Models	Yi Bin et.al.	2408.00491	link
2024-08-01	Everything We Hear: Towards Tackling Misinformation in Podcasts	Sachin Pathiyan Cherumanal et.al.	2408.00292	null
2024-08-01	OmniParser for Pure Vision Based GUI Agent	Yadong Lu et.al.	2408.00203	null
2024-07-30	Evolver: Chain-of-Evolution Prompting to Boost Large Multimodal Models for Hateful Meme Detection	Jinfa Huang et.al.	2407.21004	link
2024-07-30	HyperMM : Robust Multimodal Learning with Varying-sized Inputs	Hava Chaptoukaev et.al.	2407.20768	null
2024-07-30	Effectively Leveraging CLIP for Generating Situational Summaries of Images and Videos	Dhruv Verma et.al.	2407.20642	link
2024-07-29	Adversarial Robustness in RGB-Skeleton Action Recognition: Leveraging Attention Modality Reweighter	Chao Liu et.al.	2407.19981	null
2024-07-29	ML-Mamba: Efficient Multi-Modal Large Language Model Utilizing Mamba-2	Wenjun Huang et.al.	2407.19832	null
2024-08-02	XLIP: Cross-modal Attention Masked Modelling for Medical Language-Image Pre-Training	Biao Wu et.al.	2407.19546	link
2024-07-28	Detached and Interactive Multimodal Learning	Yunfeng Fan et.al.	2407.19514	link
2024-07-27	Data Processing Techniques for Modern Multimodal Models	Yinheng Li et.al.	2407.19180	null
2024-07-26	MangaUB: A Manga Understanding Benchmark for Large Multimodal Models	Hikaru Ikuta et.al.	2407.19034	null
2024-07-26	Unifying Visual and Semantic Feature Spaces with Diffusion Models for Enhanced Cross-Modal Alignment	Yuze Zheng et.al.	2407.18854	null
2024-07-26	ChatSchema: A pipeline of extracting structured information with Large Multimodal Models based on schema	Fei Wang et.al.	2407.18716	null
2024-07-25	Sparse vs Contiguous Adversarial Pixel Perturbations in Multimodal Models: An Empirical Analysis	Cristian-Alexandru Botocan et.al.	2407.18251	link
2024-07-25	$\mathbb{X}$ -Sample Contrastive Loss: Improving Contrastive Learning with Sample Similarity Graphs	Vlad Sobal et.al.	2407.18134	null
2024-07-25	Cross-Vendor Reproducibility of Radiomics-based Machine Learning Models for Computer-aided Diagnosis	Jatin Chaudhary et.al.	2407.18060	null
2024-07-25	What does Kiki look like? Cross-modal associations between speech sounds and visual shapes in vision-and-language models	Tessa Verhoef et.al.	2407.17974	null
2024-07-25	Shapley Value-based Contrastive Alignment for Multimodal Information Extraction	Wen Luo et.al.	2407.17854	null
2024-07-25	Enhancing Model Performance: Another Approach to Vision-Language Instruction Tuning	Vedanshu et.al.	2407.17813	null
2024-07-25	KiVA: Kid-inspired Visual Analogies for Testing Large Multimodal Models	Eunice Yiu et.al.	2407.17773	link
2024-07-24	Testing Large Language Models on Driving Theory Knowledge and Skills for Connected Autonomous Vehicles	Zuoyin Tang et.al.	2407.17211	null
2024-07-23	Chameleon: Images Are What You Need For Multimodal Learning Robust To Missing Modalities	Muhammad Irzam Liaqat et.al.	2407.16243	null
2024-07-22	LongVideoBench: A Benchmark for Long-context Interleaved Video-Language Understanding	Haoning Wu et.al.	2407.15754	link
2024-07-22	Resource-Efficient Federated Multimodal Learning via Layer-wise and Progressive Training	Ye Lin Tun et.al.	2407.15426	null
2024-07-21	VideoGameBunny: Towards vision assistants for video games	Mohammad Reza Taesiri et.al.	2407.15295	null
2024-07-22	Patch-based Intuitive Multimodal Prototypes Network (PIMPNet) for Alzheimer’s Disease classification	Lisa Anita De Santi et.al.	2407.14277	link
2024-07-18	Visual Haystacks: Answering Harder Questions About Sets of Images	Tsung-Han Wu et.al.	2407.13766	link
2024-07-17	Text- and Feature-based Models for Compound Multimodal Emotion Recognition in the Wild	Nicolas Richet et.al.	2407.12927	link
2024-07-16	ChatBCG: Can AI Read Your Slide Deck?	Nikita Singh et.al.	2407.12875	null
2024-07-17	LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models	Kaichen Zhang et.al.	2407.12772	link
2024-07-17	Missing Modality Prediction for Unpaired Multimodal Learning via Joint Embedding of Unimodal Models	Donggeun Kim et.al.	2407.12616	null
2024-07-17	E5-V: Universal Embeddings with Multimodal Large Language Models	Ting Jiang et.al.	2407.12580	link
2024-07-16	FIRE: A Dataset for Feedback Integration and Refinement Evaluation of Multimodal Models	Pengxiang Li et.al.	2407.11522	null
2024-07-16	COMET: “Cone of experience” enhanced large multimodal model for mathematical problem generation	Sannyuya Liu et.al.	2407.11315	null
2024-07-15	OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models	Zijian Zhou et.al.	2407.11213	link
2024-07-15	FabGPT: An Efficient Large Multimodal Model for Complex Wafer Defect Knowledge Queries	Yuqi Jiang et.al.	2407.10810	null
2024-07-15	Scaling 3D Reasoning with LMMs to Large Robot Mission Environments Using Datagraphs	W. J. Meijer et.al.	2407.10743	null
2024-07-16	Qwen2 Technical Report	An Yang et.al.	2407.10671	link
2024-07-15	How and where does CLIP process negation?	Vincent Quantmeyer et.al.	2407.10488	null
2024-07-12	Diagnosing and Re-learning for Balanced Multimodal Learning	Yake Wei et.al.	2407.09705	link
2024-07-12	Unifying Sequences, Structures, and Descriptions for Any-to-Any Protein Generation with the Large Multimodal Model HelixProtX	Zhiyuan Chen et.al.	2407.09274	link
2024-07-12	DART: An Automated End-to-End Object Detection Pipeline with Data Diversification, Open-Vocabulary Bounding Box Annotation, Pseudo-Label Review, and Model Training	Chen Xin et.al.	2407.09174	link
2024-07-11	Emerging Practices for Large Multimodal Model (LMM) Assistance for People with Visual Impairments: Implications for Design	Jingyi Xie et.al.	2407.08882	null
2024-07-10	RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization	Xijie Huang et.al.	2407.08044	link
2024-07-10	LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models	Feng Li et.al.	2407.07895	link
2024-07-11	InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with Semantic Graph Prior	Chenguo Lin et.al.	2407.07580	null
2024-07-10	Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model	Wenqi Zhang et.al.	2407.07053	link
2024-07-08	ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation	Ethan Chern et.al.	2407.06135	link
2024-07-07	Multimodal Language Models for Domain-Specific Procedural Video Summarization	Nafisa Hussain et.al.	2407.05419	null
2024-07-07	Multimodal Prompt Learning with Missing Modalities for Sentiment Analysis and Emotion Recognition	Zirun Guo et.al.	2407.05374	link
2024-07-06	Enhance the Robustness of Text-Centric Multimodal Alignments	Ting-Yu Yen et.al.	2407.05036	null
2024-07-06	Completed Feature Disentanglement Learning for Multimodal MRIs Analysis	Tianling Liu et.al.	2407.04916	link
2024-07-06	MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension	Zekun Li et.al.	2407.04903	link
2024-07-05	VCoME: Verbal Video Composition with Multimodal Editing Effects	Weibo Gong et.al.	2407.04697	null
2024-07-05	Multimodal Classification via Modal-Aware Interactive Enhancement	Qing-Yuan Jiang et.al.	2407.04587	null
2024-07-05	Robust Multimodal Learning via Representation Decoupling	Shicai Wei et.al.	2407.04458	null
2024-07-05	Smart Vision-Language Reasoners	Denisa Roberts et.al.	2407.04212	link
2024-07-04	Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks	Amit Parekh et.al.	2407.03967	link
2024-07-04	ADAPT: Multimodal Learning for Detecting Physiological Changes under Missing Modalities	Julie Mordacq et.al.	2407.03836	link
2024-07-04	M $\mathbf5$ – A Diverse Benchmark to Assess the Performance of Large Multimodal Models Across Multilingual and Multicultural Vision-Language Tasks	Florian Schneider et.al.	2407.03791	null
2024-07-03	HEMM: Holistic Evaluation of Multimodal Foundation Models	Paul Pu Liang et.al.	2407.03418	link
2024-07-02	Multi-Peptide: Multimodality Leveraged Language-Graph Learning of Peptide Properties	Srivathsan Badrinarayanan et.al.	2407.03380	link
2024-07-02	Understanding Alignment in Multimodal LLMs: A Comprehensive Study	Elmira Amirloo et.al.	2407.02477	null
2024-07-02	Synthetic Multimodal Question Generation	Ian Wu et.al.	2407.02233	null
2024-07-02	Crossroads of Continents: Automated Artifact Extraction for Cultural Adaptation with Large Multimodal Models	Anjishnu Mukherjee et.al.	2407.02067	link
2024-07-01	Empathic Grounding: Explorations using Multimodal Interaction and Large Language Models with Conversational Agents	Mehdi Arjmand et.al.	2407.01824	link
2024-07-01	We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning?	Runqi Qiao et.al.	2407.01284	link
2024-07-01	Unaligning Everything: Or Aligning Any Text to Any Image in Multimodal Models	Shaeke Salman et.al.	2407.01157	null
2024-06-29	AI-powered multimodal modeling of personalized hemodynamics in aortic stenosis	Caglar Ozturk et.al.	2407.00535	null
2024-06-29	MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation	Jinsheng Huang et.al.	2407.00468	link
2024-06-29	How to Train Your Fact Verifier: Knowledge Transfer with Multimodal Open Models	Jaeyoung Lee et.al.	2407.00369	null
2024-06-28	PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent Collaboration	Yuxuan Sun et.al.	2407.00203	null
2024-06-28	EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model	Yuxuan Zhang et.al.	2406.20076	link
2024-06-28	InfiniBench: A Comprehensive Benchmark for Large Multimodal Models in Very Long Video Understanding	Kirolos Ataallah et.al.	2406.19875	link
2024-06-28	MetaDesigner: Advancing Artistic Typography through AI-Driven, User-Centric, and Multilingual WordArt Synthesis	Jun-Yan He et.al.	2406.19859	null
2024-06-28	MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment	Jihao Liu et.al.	2406.19736	link
2024-06-28	Enhancing Radiological Diagnosis: A Collaborative Approach Integrating AI and Human Expertise for Visual Miss Correction	Akash Awasthi et.al.	2406.19686	null
2024-06-28	SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs	Xin Su et.al.	2406.19593	null
2024-06-27	OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding	Tao Zhang et.al.	2406.19389	null
2024-06-28	FlowVQA: Mapping Multimodal Logic in Visual Question Answering with Flowcharts	Shubhankar Singh et.al.	2406.19237	null
2024-06-27	RAVEN: Multitask Retrieval Augmented Vision-Language Learning	Varun Nagaraj Rao et.al.	2406.19150	null
2024-06-27	DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming	Jiaxin Zhang et.al.	2406.19101	null
2024-06-27	Fairness and Bias in Multimodal AI: A Survey	Tosin Adewumi et.al.	2406.19097	null
2024-06-27	MissionGNN: Hierarchical Multimodal GNN-based Weakly Supervised Video Anomaly Recognition with Mission-Specific Knowledge Graph Generation	Sanggeon Yun et.al.	2406.18815	null
2024-06-26	MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data	William Berman et.al.	2406.18790	null
2024-06-26	S3: A Simple Strong Sample-effective Multimodal Dialog System	Elisei Rykov et.al.	2406.18305	link
2024-06-26	EHR-Based Mobile and Web Platform for Chronic Disease Risk Prediction Using Large Language Multimodal Models	Chun-Chieh Liao et.al.	2406.18087	null
2024-06-26	Speech2UnifiedExpressions: Synchronous Synthesis of Co-Speech Affective Face and Body Expressions from Affordable Inputs	Uttaran Bhattacharya et.al.	2406.18068	null
2024-06-25	Human-centered In-building Embodied Delivery Benchmark	Zhuoqun Xu et.al.	2406.17898	link
2024-06-25	InFiConD: Interactive No-code Fine-tuning with Concept-based Knowledge Distillation	Jinbin Huang et.al.	2406.17838	null
2024-06-25	Data curation via joint example selection further accelerates multimodal learning	Talfan Evans et.al.	2406.17711	null
2024-06-25	Towards Probing Speech-Specific Risks in Large Multimodal Models: A Taxonomy, Benchmark, and Insights	Hao Yang et.al.	2406.17430	link
2024-06-24	At First Sight: Zero-Shot Classification of Astronomical Images with Large Multimodal Models	Dimitrios Tanoglidis et.al.	2406.17057	null
2024-06-24	Revisiting Referring Expression Comprehension Evaluation in the Era of Large Multimodal Models	Jierun Chen et.al.	2406.16866	link
2024-06-24	Long Context Transfer from Language to Vision	Peiyuan Zhang et.al.	2406.16852	link
2024-06-24	QuadrupedGPT: Towards a Versatile Quadruped Agent in Open-ended Worlds	Ye Wang et.al.	2406.16578	null
2024-06-21	Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning	Brandon Huang et.al.	2406.15334	link
2024-06-21	Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models	Jiayu Wang et.al.	2406.14852	link
2024-06-20	Evaluating vision-capable chatbots in interpreting kinematics graphs: a comparative study of free and subscription-based models	Giulia Polverini et.al.	2406.14685	null
2024-06-20	Revealing Vision-Language Integration in the Brain with Multimodal Networks	Vighnesh Subramaniam et.al.	2406.14481	link
2024-06-25	iWISDM: Assessing instruction following in multimodal models at scale	Xiaoxuan Lei et.al.	2406.14343	link
2024-06-20	Two Giraffes in a Dirt Field: Using Game Play to Investigate Situation Modelling in Large Multimodal Models	Sherzod Hakimov et.al.	2406.14035	null
2024-06-20	Knowledge-driven Subspace Fusion and Gradient Coordination for Multi-modal Learning	Yupei Zhang et.al.	2406.13979	link
2024-06-20	PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents	Junjie Wang et.al.	2406.13923	null
2024-06-19	Through the Theory of Mind’s Eye: Reading Minds with Multimodal Video Large Language Models	Zhawnen Chen et.al.	2406.13763	null
2024-06-19	GUI Action Narrator: Where and When Did That Action Take Place?	Qinchen Wu et.al.	2406.13719	null
2024-06-19	Is AI fun? HumorDB: a curated dataset and benchmark to investigate graphical humor	Veedant Jain et.al.	2406.13564	null
2024-06-19	VisualRWKV: Exploring Recurrent Neural Networks for Visual Language Models	Haowen Hou et.al.	2406.13362	link
2024-06-19	Learnable In-Context Vector for Visual Question Answering	Yingzhe Peng et.al.	2406.13185	link
2024-06-18	Synergizing Foundation Models and Federated Learning: A Survey	Shenghui Li et.al.	2406.12844	null
2024-06-18	OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI	Zhen Huang et.al.	2406.12753	link
2024-06-18	Disturbing Image Detection Using LMM-Elicited Emotion Embeddings	Maria Tzelepi et.al.	2406.12668	null
2024-06-18	Automatic benchmarking of large multimodal models via iterative experiment programming	Alessandro Conti et.al.	2406.12321	link
2024-06-18	Language and Multimodal Models in Sports: A Survey of Datasets and Applications	Haotian Xia et.al.	2406.12252	null
2024-06-17	VideoLLM-online: Online Video Large Language Model for Streaming Video	Joya Chen et.al.	2406.11816	null
2024-06-17	LLARVA: Vision-Action Instruction Tuning Enhances Robot Learning	Dantong Niu et.al.	2406.11815	null
2024-06-17	Multimodal Learning To Improve Segmentation With Intraoperative CBCT & Preoperative CT	Maximilian E. Tschuchnig et.al.	2406.11650	null
2024-06-17	Program Synthesis Benchmark for Visual Programming in XLogoOnline Environment	Chao Wen et.al.	2406.11334	null
2024-06-17	VideoVista: A Versatile Benchmark for Video Understanding and Reasoning	Yunxin Li et.al.	2406.11303	null
2024-06-17	i-SRT: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective Judgment	Daechul Ahn et.al.	2406.11280	link
2024-06-17	MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens	Anas Awadalla et.al.	2406.11271	link
2024-06-17	Generative Visual Instruction Tuning	Jefferson Hernandez et.al.	2406.11262	link
2024-06-17	Relational Learning in Pre-Trained Models: A Theory from Hypergraph Recovery Perspective	Yang Chen et.al.	2406.11249	null
2024-06-16	Investigating Video Reasoning Capability of Large Language Models with Tropes in Movies	Hung-Ting Su et.al.	2406.10923	null
2024-06-15	Beyond Raw Videos: Understanding Edited Videos with Large Multimodal Model	Lu Xu et.al.	2406.10484	link
2024-06-12	MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases	Rithesh Murthy et.al.	2406.10290	null
2024-06-14	VideoGUI: A Benchmark for GUI Automation from Instructional Videos	Kevin Qinghong Lin et.al.	2406.10227	null
2024-06-14	ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation	Chufan Shi et.al.	2406.09961	link
2024-06-14	BiVLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image Retrieval	Imanol Miranda et.al.	2406.09952	link
2024-06-13	VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding	Muhammad Maaz et.al.	2406.09418	link
2024-06-13	Explore the Limits of Omni-modal Pretraining at Scale	Yiyuan Zhang et.al.	2406.09412	link
2024-06-14	4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities	Roman Bachmann et.al.	2406.09406	null
2024-06-13	Yo’LLaVA: Your Personalized Language and Vision Assistant	Thao Nguyen et.al.	2406.09400	link
2024-06-13	CMC-Bench: Towards a New Paradigm of Visual Signal Compression	Chunyi Li et.al.	2406.09356	link
2024-06-13	Comparison Visual Instruction Tuning	Wei Lin et.al.	2406.09240	null
2024-06-13	Zoom and Shift are All You Need	Jiahao Qin et.al.	2406.08866	null
2024-06-11	Embedding-based Multimodal Learning on Pan-Squamous Cell Carcinomas for Improved Survival Outcomes	Asim Waqas et.al.	2406.08521	null
2024-06-14	Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models	Yi-Fan Zhang et.al.	2406.08487	link
2024-06-13	OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text	Qingyun Li et.al.	2406.08418	link
2024-06-12	A Concept-Based Explainability Framework for Large Multimodal Models	Jayneel Parekh et.al.	2406.08074	link
2024-06-12	LVBench: An Extreme Long Video Understanding Benchmark	Weihan Wang et.al.	2406.08035	link
2024-06-11	Cognitive Insights Across Languages: Enhancing Multimodal Interview Analysis	David Ortiz-Perez et.al.	2406.07542	link
2024-06-11	Understanding Visual Concepts Across Models	Brandon Trabucco et.al.	2406.07506	link
2024-06-11	Unified Modeling Enhanced Multimodal Learning for Precision Neuro-Oncology	Huahui Yi et.al.	2406.07078	link
2024-06-14	BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification	June-Woo Kim et.al.	2406.06786	link
2024-06-10	Vript: A Video Is Worth Thousands of Words	Dongjie Yang et.al.	2406.06040	link
2024-06-10	FLEUR: An Explainable Reference-Free Evaluation Metric for Image Captioning Using a Large Multimodal Model	Yebin Lee et.al.	2406.06004	link
2024-06-10	CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark	David Romero et.al.	2406.05967	null
2024-06-09	Stealthy Targeted Backdoor Attacks against Image Captioning	Wenshu Fan et.al.	2406.05874	link
2024-06-09	F-LMM: Grounding Frozen Large Multimodal Models	Size Wu et.al.	2406.05821	link
2024-06-08	Generalist Multimodal AI: A Review of Architectures, Challenges and Opportunities	Sai Munikoti et.al.	2406.05496	null
2024-06-07	Semantic Segmentation on VSPW Dataset through Masked Video Consistency	Chen Liang et.al.	2406.04979	null
2024-06-07	Predictive Dynamic Fusion	Bing Cao et.al.	2406.04802	link
2024-06-07	MGIMM: Multi-Granularity Instruction Multimodal Model for Attribute-Guided Remote Sensing Image Detailed Description	Cong Yang et.al.	2406.04716	link
2024-06-07	AICoderEval: Improving AI Domain Code Generation of Large Language Models	Yinghui Xia et.al.	2406.04712	null
2024-06-06	GenAI Arena: An Open Evaluation Platform for Generative Models	Dongfu Jiang et.al.	2406.04485	null
2024-06-06	MAIRA-2: Grounded Radiology Report Generation	Shruthi Bannur et.al.	2406.04449	link
2024-06-06	DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs	Lingchen Meng et.al.	2406.04334	null
2024-06-06	BLSP-Emo: Towards Empathetic Large Speech-Language Models	Chen Wang et.al.	2406.03872	link
2024-06-05	Identification of Stone Deterioration Patterns with Large Multimodal Models	Daniele Corradetti et.al.	2406.03207	link
2024-06-05	Exploiting LMM-based knowledge for image classification tasks	Maria Tzelepi et.al.	2406.03071	null
2024-06-02	Multimodal Deep Learning for Low-Resource Settings: A Vector Embedding Alignment Approach for Healthcare Applications	David Restrepo et.al.	2406.02601	null
2024-06-04	Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning	Alex Jinpeng Wang et.al.	2406.02547	link
2024-06-04	Dealing with All-stage Missing Modality: Towards A Universal Model with Robust Reconstruction and Personalization	Yunpeng Zhao et.al.	2406.01987	null
2024-06-03	Automatic Fused Multimodal Deep Learning for Plant Identification	Alfreds Lapkovskis et.al.	2406.01455	link
2024-06-05	Pulmonary Embolism Mortality Prediction Using Multimodal Learning Based on Computed Tomography Angiography and Clinical Data	Zhusi Zhong et.al.	2406.01302	null
2024-06-03	Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model	Kezhen Chen et.al.	2406.00977	link
2024-06-02	Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient	Zechu Li et.al.	2406.00681	null
2024-06-04	StrucTexTv3: An Efficient Vision-Language Model for Text-rich Image Perception, Comprehension, and Beyond	Pengyuan Lyu et.al.	2405.21013	null
2024-05-31	Don’t Buy it! Reassessing the Ad Understanding Abilities of Contrastive Multimodal Models	A. Bavaresco et.al.	2405.20846	link
2024-06-17	Ovis: Structural Embedding Alignment for Multimodal Large Language Model	Shiyin Lu et.al.	2405.20797	link
2024-05-31	Vision-Language Meets the Skeleton: Progressively Distillation with Cross-Modal Knowledge for 3D Action Representation Learning	Yang Chen et.al.	2405.20606	link
2024-05-30	Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA	Qianqi Yan et.al.	2405.20421	link
2024-05-30	Retrieval Augmented Structured Generation: Business Document Information Extraction As Tool Use	Franz Louis Cesista et.al.	2405.20245	null
2024-05-31	Visual Attention Analysis in Online Learning	Miriam Navarro et.al.	2405.20091	null
2024-05-30	MM-Lego: Modular Biomedical Multimodal Models with Minimal Fine-Tuning	Konstantin Hemker et.al.	2405.19950	link
2024-05-30	Instruction-Guided Visual Masking	Jinliang Zheng et.al.	2405.19783	link
2024-05-29	Thermodynamically Informed Multimodal Learning of High-Dimensional Free Energy Models in Molecular Coarse Graining	Blake R. Duschatko et.al.	2405.19386	null
2024-06-09	LLMs Meet Multimodal Generation and Editing: A Survey	Yingqing He et.al.	2405.19334	link
2024-05-29	Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare	Hanwei Zhu et.al.	2405.19298	link
2024-05-31	Benchmarking and Improving Detail Image Caption	Hongyuan Dong et.al.	2405.19092	link
2024-05-29	Topological Perspectives on Optimal Multimodal Embedding Spaces	Abdul Aziz A. B et.al.	2405.18867	null
2024-05-29	Exploring Exotic Decays of the Higgs Boson to Multi-Photons at the LHC via Multimodal Learning Approaches	A. Hammad et.al.	2405.18834	null
2024-05-28	The Evolution of Multimodal Model Architectures	Shakti N. Wadekar et.al.	2405.17927	null
2024-05-28	Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment	Xin Xiao et.al.	2405.17871	link
2024-05-28	Full-Stack Allreduce on Multi-Rail Networks	Enda Yu et.al.	2405.17870	null
2024-05-28	MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance	Yake Wei et.al.	2405.17730	link
2024-05-27	Matryoshka Multimodal Models	Mu Cai et.al.	2405.17430	null
2024-05-27	XFormParser: A Simple and Effective Multimodal Multilingual Semi-structured Form Parser	Xianfu Cheng et.al.	2405.17336	link
2024-05-28	LLM-Optic: Unveiling the Capabilities of Large Language Models for Universal Visual Grounding	Haoyu Zhao et.al.	2405.17104	null
2024-05-27	Mitigating Noisy Correspondence by Geometrical Structure Consistency Learning	Zihua Zhao et.al.	2405.16996	link
2024-05-27	Multilingual Diversity Improves Vision-Language Representations	Thao Nguyen et.al.	2405.16915	null
2024-05-26	Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs	Mustafa Shukor et.al.	2405.16700	link
2024-05-25	How Well Do Deep Learning Models Capture Human Concepts? The Case of the Typicality Effect	Siddhartha K. Vemuri et.al.	2405.16128	null
2024-05-24	ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models	Chunjiang Ge et.al.	2405.15738	link
2024-05-24	Chain-of-Thought Prompting for Demographic Inference with Large Multimodal Models	Yongsheng Yu et.al.	2405.15687	null
2024-05-24	M4U: Evaluating Multilingual Understanding and Reasoning for Large Multimodal Models	Hongyu Wang et.al.	2405.15638	link
2024-05-24	DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception	Run Luo et.al.	2405.15232	link
2024-05-24	Shopping Queries Image Dataset (SQID): An Image-Enriched ESCI Dataset for Exploring Multimodal Learning in Product Search	Marie Al Ghossein et.al.	2405.15190	link

Generative Weight Space Modeling

Publish Date	Title	Authors	PDF	Code
2025-07-22	laplax – Laplace Approximations with JAX	Tobias Weber et.al.	2507.17013	null
2025-07-22	$p$-th order generalized Fibonacci cubes and maximal cubes in Fibonacci $p$ -cubes	Michel Mollard et.al.	2507.16387	null
2025-07-21	Dark energy era with a resolution of Hubble tension in generalized entropic cosmology	Priyanka Adhikary et.al.	2507.15273	null
2025-07-18	Constraints on Rényi Entropy through Primordial Big-Bang Nucleosynthesis and Baryogenesis	Ahmad Sheykhi et.al.	2507.14250	null
2025-07-16	Learning Pixel-adaptive Multi-layer Perceptrons for Real-time Image Enhancement	Junyu Lou et.al.	2507.12135	null
2025-07-15	Jailbreak-Tuning: Models Efficiently Learn Jailbreak Susceptibility	Brendan Murphy et.al.	2507.11630	null
2025-07-14	Flows and Diffusions on the Neural Manifold	Daniel Saragih et.al.	2507.10623	null
2025-07-14	MF-GLaM: A multifidelity stochastic emulator using generalized lambda models	K. Giannoukou et.al.	2507.10303	null
2025-07-08	Traveling waves in a continuum model for schooling swimmers	Anand U. Oza et.al.	2507.06095	null
2025-07-01	Dualities of Gaudin models with irregular singularities for general linear Lie (super)algebras	Wan Keng Cheong et.al.	2507.00730	null
2025-06-30	Continual Adaptation: Environment-Conditional Parameter Generation for Object Detection in Dynamic Scenarios	Deng Li et.al.	2506.24063	null
2025-06-30	A linear topological invariant for weighted spaces of holomorphic functions	Andreas Debrouwere et.al.	2506.23695	null
2025-06-30	The multilinear fractional bounded mean oscillation operator theory I: sparse domination, sparse $T1$ theorem, off-diagonal extrapolation, quantitative weighted estimate – for generalized commutators	Xi Cen et.al.	2506.23486	null
2025-06-27	Weak solutions to incompressible heat-conducting motions with large flux	Joanna Rencławowicz et.al.	2506.22155	null
2025-06-26	Scalable Bayesian Low-Rank Adaptation of Large Language Models via Stochastic Variational Subspace Inference	Colin Samplawski et.al.	2506.21408	null
2025-06-25	Structural System Identification via Validation and Adaptation	Cristian López et.al.	2506.20799	null
2025-06-24	Geometric-Aware Variational Inference: Robust and Adaptive Regularization with Directional Weight Uncertainty	Carlos Stein Brito et.al.	2506.19726	null
2025-06-21	CultureMERT: Continual Pre-Training for Cross-Cultural Music Representation Learning	Angelos-Nikolaos Kanatas et.al.	2506.17818	null
2025-06-24	Scalable Machine Learning Algorithms using Path Signatures	Csaba Tóth et.al.	2506.17634	null
2025-06-21	LFR-PINO: A Layered Fourier Reduced Physics-Informed Neural Operator for Parametric PDEs	Jing Wang et.al.	2506.17582	null
2025-06-19	Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights	Zhiyuan Liang et.al.	2506.16406	null
2025-06-04	Tripartite Weight-Space Ensemble for Few-Shot Class-Incremental Learning	Juntae Lee et.al.	2506.15720	null
2025-07-09	LLM Agent for Hyper-Parameter Optimization	Wanzhe Wang et.al.	2506.15167	null
2025-06-13	Optimal trace norms for Helmholtz problems	Benedikt Gräßle et.al.	2506.11944	null
2025-06-13	Integral Operators on Generalized Weighted Central Morrey Spaces over Local Fields	Salman Ashraf et.al.	2506.11494	null
2025-06-13	A correlation-permutation approach for speech-music encoders model merging	Fabian Ritter-Gutierrez et.al.	2506.11403	null
2025-06-04	A phase transition in the Bakry-Émery gradient estimate for Dyson Brownian motion	Kohei Suzuki et.al.	2506.04424	null
2025-05-29	Zero-Shot Adaptation of Parameter-Efficient Fine-Tuning in Diffusion Models	Farzad Farhadzadeh et.al.	2506.04244	null
2025-06-01	Weight-Space Linear Recurrent Neural Networks	Roussel Desmond Nzoyem et.al.	2506.01153	null
2025-05-30	$\mathrm{SL}(2,\mathbb{R})$ families of Kerr black holes	Robert Penna et.al.	2506.00184	null
2025-05-29	Walking the Weight Manifold: a Topological Approach to Conditioning Inspired by Neuromodulation	Ari S. Benjamin et.al.	2505.22994	null
2025-05-28	From Dormant to Deleted: Tamper-Resistant Unlearning Through Weight-Space Regularization	Shoaib Ahmed Siddiqui et.al.	2505.22310	null
2025-05-27	Can Large Language Models Predict Audio Effects Parameters from Natural Language?	Seungheon Doh et.al.	2505.20770	null
2025-05-26	A duality of Bethe algebras for general linear Lie (super)algebras	Wan Keng Cheong et.al.	2505.19661	null
2025-05-26	Irreducible cuspidal $\mathfrak{sl}_{n+1}$-modules from finite-dimensional modules over the minimal nilpotent finite $W$ -algebra	Genqiang Liu et.al.	2505.19417	null
2025-05-25	Non-Hermitian effects on the quantum parameter estimation in pseudo-Hermitian systems	L. H. Wei et.al.	2505.19079	null
2025-05-23	Generalized upper and lower Legendre conjugates for Braun-Meise-Taylor weight functions	Gerhard Schindl et.al.	2505.17725	null
2025-05-21	Revealing Language Model Trajectories via Kullback-Leibler Divergence	Ryo Kishino et.al.	2505.15353	null
2025-05-20	Safety Subspaces are Not Distinct: A Fine-Tuning Case Study	Kaustubh Ponkshe et.al.	2505.14185	link
2025-05-20	Predicting Dynamical Systems across Environments via Diffusive Model Weight Generation	Ruikun Li et.al.	2505.13919	null
2025-05-19	Persistence of integrable wave dynamics in the Discrete Gross–Pitaevskii equation: the focusing case	G. Fotopoulos et.al.	2505.13139	null
2025-05-19	CURE: Concept Unlearning via Orthogonal Representation Editing in Diffusion Models	Shristi Das Biswas et.al.	2505.12677	null
2025-05-23	NeuroGen: Neural Network Parameter Generation via Large Language Models	Jiaqi Wang et.al.	2505.12470	null
2025-05-13	Differentiable Quantum Architecture Search in Quantum-Enhanced Neural Network Parameter Generation	Samuel Yen-Chi Chen et.al.	2505.09653	null
2025-05-12	Weights and characters over Borcherds-Kac-Moody algebras	Souvik Pal et.al.	2505.08102	null
2025-05-12	Generalized upper and lower Legendre conjugates for weight functions	Gerhard Schindl et.al.	2505.07497	null
2025-05-12	GMM with Many Weak Moment Conditions and Nuisance Parameters: General Theory and Applications to Causal Inference	Rui Wang et.al.	2505.07295	null
2025-05-10	Transverse linear stability of line solitons for 2D Toda	Tetsu Mizumachi et.al.	2505.06768	null
2025-04-28	FedAvgen: Metadata for Model Aggregation In Communication Systems	Anthony Kiggundu et.al.	2505.05486	null
2025-05-04	Information geometry and entanglement under phase-space deformation through nonsymplectic congruence transformation	Shilpa Nandi et.al.	2505.02269	null
2025-05-02	Fredholm properties of singular elliptic operators arising in the study of point defects	Gabriela Jaramillo et.al.	2505.01534	null
2025-04-30	Scrambling Dynamics with Imperfections in a Solvable Model	Nadie Yiluo LiTenn et.al.	2505.00070	null
2025-04-28	Weighted approximation By Max-product Kantrovich type Exponential Sampling Series	Satyaranjan Pradhan et.al.	2504.19668	null
2025-04-27	Composable and adaptive design of machine learning interatomic potentials guided by Fisher-information analysis	Weishi Wang et.al.	2504.19372	null
2025-04-27	Sharp bounds for the $\boldsymbol{p}$-adic $\boldsymbol{n}$-dimensional fractional Hardy operator and a class of integral operators on $\boldsymbol{p}$ -adic function spaces	Tianyang He et.al.	2504.19273	null
2025-04-26	On learning functions over biological sequence space: relating Gaussian process priors, regularization, and gauge fixing	Samantha Petti et.al.	2504.19034	null
2025-04-25	A Model Zoo on Phase Transitions in Neural Networks	Konstantin Schürholt et.al.	2504.18072	null
2025-04-24	An Inverse Source Problem for Semilinear Stochastic Hyperbolic Equations	Qi Lü et.al.	2504.17398	null
2025-04-23	Probing Bulk Band Topology from Time Boundary Effect in Synthetic Dimension	Huisheng Xu et.al.	2504.16390	null
2025-04-20	Analysis of stochastic fluid-rigid body dynamics: an approach by stochastic maximal regularity	Felix Brandt et.al.	2504.14676	null
2025-04-16	Advanced MST3 Encryption scheme based on generalized Suzuki 2-groups	Gennady Khalimov et.al.	2504.11804	null
2025-04-14	Weight-of-Thought Reasoning: Exploring Neural Network Weights for Enhanced LLM Reasoning	Saif Punjwani et.al.	2504.10646	link
2025-04-14	A Model Zoo of Vision Transformers	Damian Falk et.al.	2504.10231	link
2025-04-14	The Impact of Model Zoo Size and Composition on Weight Space Learning	Damian Falk et.al.	2504.10141	link
2025-04-10	Conformally weighted Einstein manifolds: the uniqueness problem	Miguel Brozos-Vázquez et.al.	2504.07860	null
2025-04-10	A Novel Deep Learning Approach for Emulating Computationally Expensive Postfire Debris Flows	Palak Patel et.al.	2504.07736	null
2025-04-08	QEMesh: Employing A Quadric Error Metrics-Based Representation for Mesh Generation	Jiaqi Li et.al.	2504.05720	null
2025-03-27	Geometric Flow Models over Neural Network Weights	Ege Erdogan et.al.	2504.03710	null
2025-04-04	The Torus Centralizing Subalgebra of $\text{Dist}(G_r)$	Paul Sobaje et.al.	2504.03121	null
2025-04-02	Instruction-Guided Autoregressive Neural Network Parameter Generation	Soro Bedionita et.al.	2504.02012	null
2025-04-01	MetaLoRA: Tensor-Enhanced Adaptive Low-Rank Fine-tuning	Maolin Wang et.al.	2504.00460	null
2025-04-08	ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion	Rana Muhammad Shahroz Khan et.al.	2503.24354	null
2025-03-26	Shape Generation via Weight Space Learning	Maximilian Plattner et.al.	2503.21830	null
2025-04-22	On Symmetries in Convolutional Weights	Bilal Alsallakh et.al.	2503.19215	null
2025-03-23	Spectral synthesis for exponentials in weighted $L^2$ -spaces	Andrei V. Semenov et.al.	2503.18131	null
2025-03-21	Adiabatic Fine-Tuning of Neural Quantum States Enables Detection of Phase Transitions in Weight Space	Vinicius Hernandes et.al.	2503.17140	null
2025-03-21	Structure Is Not Enough: Leveraging Behavior for Neural Network Weight Reconstruction	Léo Meynent et.al.	2503.17138	link
2025-03-18	Recursive Self-Similarity in Deep Weight Spaces of Neural Architectures: A Fractal and Coarse Geometry Perspective	Ambarish Moharil et.al.	2503.14298	null
2025-03-18	AI-Driven Diabetic Retinopathy Diagnosis Enhancement through Image Processing and Salp Swarm Algorithm-Optimized Ensemble Network	Saif Ur Rehman Khan et.al.	2503.14209	null
2025-03-18	GenPara: Enhancing the 3D Design Editing Process by Inferring Users’ Regions of Interest with Text-Conditional Shape Parameters	Jiin Choi et.al.	2503.14096	null
2025-03-17	Spacetime Structure of Regular Accelerating Black Hole Pair in General Relativity	M. M. Akbar et.al.	2503.13420	null
2025-03-17	ProDiF: Protecting Domain-Invariant Features to Secure Pre-Trained Models Against Extraction	Tong Zhou et.al.	2503.13224	null
2025-03-17	The VMC survey – LII. The internal kinematics of the LMC with new VISTA observations	S. Vijayasree et.al.	2503.13039	null
2025-03-16	Learning Privacy from Visual Entities	Alessio Xompero et.al.	2503.12464	null
2025-03-21	Interpolation of Matrix Weighted Spaces and Commutator Estimates	Félix Cabello Sánchez et.al.	2503.10577	null
2025-03-13	Architecture-Aware Minimization (A $^2$ M): How to Find Flat Minima in Neural Architecture Search	Matteo Gambella et.al.	2503.10404	link
2025-03-13	Rapid analysis of point-contact Andreev reflection spectra via machine learning with adaptive data augmentation	Dongik Lee et.al.	2503.10040	null
2025-03-12	On the Internal Representations of Graph Metanetworks	Taesun Yeom et.al.	2503.09120	null
2025-03-11	LightPlanner: Unleashing the Reasoning Capabilities of Lightweight Large Language Models in Task Planning	Weijie Zhou et.al.	2503.08508	link
2025-03-11	Sampling the space of solutions of an artificial neural network	Alessandro Zambon et.al.	2503.08266	null
2025-03-11	Threshold for the existence of scattering states for nonlinear Schrödinger equations without gauge invariance	Hayato Miyazaki et.al.	2503.07983	null
2025-03-10	You Only Debias Once: Towards Flexible Accuracy-Fairness Trade-offs at Inference Time	Xiaotian Han et.al.	2503.07066	link
2025-03-17	Secure On-Device Video OOD Detection Without Backpropagation	Shawn Li et.al.	2503.06166	link
2025-03-06	Can We Optimize Deep RL Policy Weights as Trajectory Modeling?	Hongyao Tang et.al.	2503.04074	null
2025-03-04	Bridging VLM and KMP: Enabling Fine-grained robotic manipulation via Semantic Keypoints Representation	Junjie Zhu et.al.	2503.02748	null
2025-02-28	Post-Hoc Uncertainty Quantification in Pre-Trained Neural Networks via Activation-Level Gaussian Processes	Richard Bergna et.al.	2502.20966	null
2025-02-23	Irreducible components of affine Lusztig varieties	Xuhua He et.al.	2502.16441	null
2025-02-21	Three-parameter generalizations of formulas due to Guillera	John M. Campbell et.al.	2502.15249	null
2025-02-21	Weighted BMO-BLO estimates for Littlewood–Paley square operators	Hua Wang et.al.	2502.15125	null
2025-02-20	Dynamic Concepts Personalization from Single Videos	Rameen Abdal et.al.	2502.14844	null
2025-02-17	Minimal Ranks, Maximum Confidence: Parameter-efficient Uncertainty Quantification for LoRA	Patryk Marszałek et.al.	2502.12122	link
2025-02-17	Massively Scaling Explicit Policy-conditioned Value Functions	Nico Bohlinger et.al.	2502.11949	null
2025-02-17	Solutions for a critical elliptic system with periodic boundary condition	Qingfang Wang et.al.	2502.11373	null
2025-02-11	Some new results about Fibonacci p-cubes	Michel Mollard et.al.	2502.07520	null
2025-02-11	Cost-Efficient Continual Learning with Sufficient Exemplar Memory	Dongkyu Cho et.al.	2502.07274	null
2025-03-04	SelaFD:Seamless Adaptation of Vision Transformer Fine-tuning for Radar-based Human Activity Recognition	Yijun Wang et.al.	2502.04740	link
2025-01-31	SAGRAD: A Program for Neural Network Training with Simulated Annealing and the Conjugate Gradient Method	Javier Bernal et.al.	2502.00112	null
2025-01-30	In-Context Meta LoRA Generation	Yihua Shao et.al.	2501.17635	null
2025-01-28	Function Spaces on Uniformly Regular and Singular Riemannian Manifolds	Herbert Amann et.al.	2501.16845	null
2025-01-25	On the decay of solutions for the negative fractional KdV equation	Alysson Cunha et.al.	2501.15306	null
2025-01-24	Domain Expansion: Parameter-Efficient Modules as Building Blocks for Composite Domains	Mann Patel et.al.	2501.14321	link
2025-01-24	Linear enhanced dissipation for the 2D Taylor-Couette flow in the exterior region: A supplementary example for Gearhart-Prüss type lemma	Te Li et.al.	2501.14187	null
2025-01-23	Solutions of differential equations in Freud-weighted Sobolev spaces	Maxime Breden et.al.	2501.13672	link
2025-01-23	Weighted theory of Toeplitz operators on the Fock spaces	Jiale Chen et.al.	2501.13571	null
2025-01-22	Waveguide arrays interaction to second neighbors: Exact solution	M. A. Tapia-Valerdi et.al.	2501.12550	null
2025-02-11	Recurrent Diffusion for Large-Scale Parameter Generation	Kai Wang et.al.	2501.11587	link
2025-01-12	Evaluating Sample Utility for Data Selection by Mimicking Model Weights	Tzu-Heng Huang et.al.	2501.06708	null
2025-01-10	Random Sparse Lifts: Construction, Analysis and Convergence of finite sparse networks	David A. R. Robin et.al.	2501.05930	null
2025-01-09	CallNavi: A Study and Challenge on Function Calling Routing and Invocation in Large Language Models	Yewei Song et.al.	2501.05255	null
2025-01-08	Towards Fair Class-wise Robustness: Class Optimal Distribution Adversarial Training	Hongxin Zhi et.al.	2501.04527	null
2025-01-08	Cartan-covariant Quantum Channels and the PPT $^{2}$ conjecture	Sean Prudhoe et.al.	2501.03959	null
2025-01-07	On the surjectivity of the Cauchy-Riemann and Laplace operators on weighted spaces of smooth functions	Andreas Debrouwere et.al.	2501.03751	null
2025-01-06	The 3D energy-critical inhomogeneous nonlinear Schrodinger equation with strong singularity	Yoonjung Lee et.al.	2501.02697	null
2025-01-01	Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model	Chenyang Liu et.al.	2501.00895	null
2024-12-15	ChipAlign: Instruction Alignment in Large Language Models for Chip Design via Geodesic Interpolation	Chenhui Deng et.al.	2412.19819	null
2024-12-27	Estimation of System Parameters Including Repeated Cross-Sectional Data through Emulator-Informed Deep Generative Model	Hyunwoo Cho et.al.	2412.19517	null
2024-12-26	Characterizing resources for multiparameter estimation of SU(2) and SU(1,1) unitaries	Shaowei Du et.al.	2412.19119	null
2024-12-25	Mixed Fourier norm spaces of analytic functions on the upper half-plane and Toeplitz operators	Zhirayr Avetisyan et.al.	2412.18954	null
2024-12-26	KunServe: Elastic and Efficient Large Language Model Serving with Parameter-centric Memory Management	Rongxin Cheng et.al.	2412.18169	null
2024-12-19	DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation	Wang Zhao et.al.	2412.15200	null
2024-12-18	On the principle of linearized stability for quasilinear evolution equations in time-weighted spaces	Bogdan-Vasile Matioc et.al.	2412.13940	null
2024-12-17	On the Bäcklund transform and the stability of the line soliton of the KP-II equation on $\mathbb R^2$	Lorenzo Pompili et.al.	2412.12530	null
2024-12-13	On the embedding of weighted Sobolev spaces with applications to a planar nonlinear Schrödinger equation	Antonio Azzolini et.al.	2412.10067	null
2024-12-12	Modified scattering for the cubic dispersion-managed NLS	Jason Murphy et.al.	2412.09762	null
2024-12-12	LoRACLR: Contrastive Adaptation for Customization of Diffusion Models	Enis Simsar et.al.	2412.09622	null
2024-12-11	Exploring superconformal Yang-Mills theories through matrix Bessel kernels	Zoltan Bajnok et.al.	2412.08732	null
2024-12-09	Bilinear singular integral operators with kernels in weighted spaces	Petr Honzík et.al.	2412.07014	null
2024-12-04	Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach	Lingchen Sun et.al.	2412.03017	link
2024-11-21	Strong localization blurs criticality of time series for spreading phenomena on networks	Juliane T. Moraes et.al.	2412.01842	null
2024-12-02	Geometric invariant theory and stretched Kostka quasi-polynomials	Marc Besson et.al.	2412.01651	null
2024-11-29	Origin-Destination Demand Prediction: An Urban Radiation and Attraction Perspective	Xuan Ma et.al.	2412.00167	null
2024-11-29	Rényi complexity in mean-field disordered systems	Nina Javerzat et.al.	2411.19817	null
2024-11-28	An Extensive Evaluation of Factual Consistency in Large Language Models for Data-to-Text Generation	Joy Mahapatra et.al.	2411.19203	null
2024-11-27	Task Arithmetic Through The Lens Of One-Shot Federated Learning	Zhixu Tao et.al.	2411.18607	null
2024-11-25	Spectral properties of Lévy Fokker–Planck equations	Hardy Chan et.al.	2411.16424	null
2024-11-20	Nonlinear orbital stability of stationary shock profiles for the Lax-Wendroff scheme	Jean-François Coulombel et.al.	2411.13094	null
2024-11-26	Enhancing generalization in high energy physics using white-box adversarial attacks	Franck Rothen et.al.	2411.09296	null
2024-11-11	Minimal nilpotent finite $W$-algebra and cuspidal module category of $\mathfrak{sp}_{2n}$	Genqiang Liu et.al.	2411.06768	null
2024-11-07	Well-Posedness and Regularity of the Heat Equation with Robin Boundary Conditions in the Two-Dimensional Wedge	Marco Bravin et.al.	2411.04651	null
2024-11-04	SALSA: Soup-based Alignment Learning for Stronger Adaptation in RLHF	Atoosa Chegini et.al.	2411.01798	null
2024-12-06	Modular Duality in Deep Learning	Jeremy Bernstein et.al.	2410.21265	null
2024-10-26	MarDini: Masked Autoregressive Diffusion for Video Generation at Scale	Haozhe Liu et.al.	2410.20280	null
2024-10-25	Four-parameter Mittag-Leffler functions and their associated coherent states	Dušan Popov et.al.	2410.19462	null
2024-10-24	Bielik 7B v0.1: A Polish Language Model – Development, Insights, and Evaluation	Krzysztof Ociepa et.al.	2410.18565	null
2024-10-21	Two dimensional delta Bose gas in a weighted space	Sudheesh Surendranath et.al.	2410.16550	null
2024-10-21	In Search of the Successful Interpolation: On the Role of Sharpness in CLIP Generalization	Alireza Abdollahpoorrostam et.al.	2410.16476	link
2024-10-23	Universal approximation results for neural networks with non-polynomial activation function over non-compact domains	Ariel Neufeld et.al.	2410.14759	null
2024-10-23	Harnessing Your DRAM and SSD for Sustainable and Accessible LLM Inference with Mixed-Precision and Multi-level Caching	Jie Peng et.al.	2410.14740	null
2024-10-16	Differential Shape Optimization with Image Representation for Photonic Design	Zhaocheng Liu et.al.	2410.13074	null
2024-10-15	Scaling Laws for Multilingual Language Models	Yifei He et.al.	2410.12883	null
2024-10-16	AutoSimTTF: A Fully Automatic Pipeline for Electric Field Simulation and Treatment Planning of Tumor Treating Fields	Minmin Wang et.al.	2410.12196	null
2024-10-15	Model Swarms: Collaborative Search to Adapt LLM Experts via Swarm Intelligence	Shangbin Feng et.al.	2410.11163	null
2024-10-14	Deep Linear Probe Generators for Weight Space Learning	Jonathan Kahana et.al.	2410.10811	null
2024-10-14	Generating Model Parameters for Controlling: Parameter Diffusion for Controllable Multi-Task Recommendation	Chenglei Shen et.al.	2410.10639	null
2024-10-14	MoTE: Reconciling Generalization with Specialization for Visual-Language to Video Knowledge Transfer	Minghao Zhu et.al.	2410.10589	link
2024-10-15	Regions of Level $\ell$ of Catalan/Semiorder-Type Arrangements	Yanru Chen et.al.	2410.10198	null
2024-10-13	A Quantum Circuit-Based Compression Perspective for Parameter-Efficient Learning	Chen-Yu Liu et.al.	2410.09846	null
2024-10-11	Meta-Transfer Learning Empowered Temporal Graph Networks for Cross-City Real Estate Appraisal	Weijia Zhang et.al.	2410.08947	null
2024-10-09	Efficient Weight-Space Laplace-Gaussian Filtering and Smoothing for Sequential Deep Learning	Joanna Sliwa et.al.	2410.06800	null
2024-10-09	Revisiting Multi-Permutation Equivariance through the Lens of Irreducible Representations	Yonatan Sverdlov et.al.	2410.06665	link
2024-10-08	Weighted Embeddings for Low-Dimensional Graph Representation	Thomas Bläsius et.al.	2410.06042	null
2024-10-05	Computing ground states of Bose-Einstein condensation by normalized deep neural network	Weizhu Bao et.al.	2410.05319	link
2024-10-07	Hyper-Representations: Learning from Populations of Neural Networks	Konstantin Schürholt et.al.	2410.05107	link
2024-10-06	Integrable Modules of Map full Toroidal Lie Algebras	Pradeep Bisht et.al.	2410.04495	null
2024-10-06	Global well-posedness for the defocusing 3D quadratic NLS in the sharp critical space	Jia Shen et.al.	2410.04337	null
2024-10-05	Equivariant Neural Functional Networks for Transformers	Viet-Hoang Tran et.al.	2410.04209	null
2024-10-15	Learning on LoRAs: GL-Equivariant Processing of Low-Rank Weight Spaces for Large Finetuned Models	Theo Putterman et.al.	2410.04207	null
2024-10-04	Measuring and Controlling Solution Degeneracy across Task-Trained Recurrent Neural Networks	Ann Huang et.al.	2410.03972	null
2024-10-04	Autoregressive Moving-average Attention Mechanism for Time Series Forecasting	Jiecheng Lu et.al.	2410.03159	link
2024-10-02	Composing Global Optimizers to Reasoning Tasks via Algebraic Objects in Neural Nets	Yuandong Tian et.al.	2410.01779	link
2024-10-01	SynCOM: A tool for simulating coronal outflows	Valmir Moraes Filho et.al.	2410.01004	null
2024-10-01	On the prime ideals of higher secant varieties of Veronese embeddings of small degrees	Katsuhisa Furukawa et.al.	2410.00652	null
2024-09-30	Old Optimizer, New Norm: An Anthology	Jeremy Bernstein et.al.	2409.20325	null
2024-09-27	Effects of Peierls phases in open linear chains	Anselmo M. Marques et.al.	2409.18780	null
2024-09-27	Density of states in neural networks: an in-depth exploration of learning in parameter space	Margherita Mele et.al.	2409.18683	null
2024-09-26	The time periodic problem for the Navier-Stokes equations in exterior domains in weighted spaces	Reinhard Farwig et.al.	2409.17590	null
2024-09-25	Scalable Ensemble Diversification for OOD Generalization and Detection	Alexander Rubinstein et.al.	2409.16797	null
2024-10-04	Lessons Learned from a Unifying Empirical Study of Parameter-Efficient Transfer Learning (PETL) in Visual Recognition	Zheda Mai et.al.	2409.16434	link
2024-09-24	VascX Models: Model Ensembles for Retinal Vascular Analysis from Color Fundus Images	Jose Vargas Quiros et.al.	2409.16016	link
2024-09-23	Efficient Large-Scale Quantum Optimization via Counterdiabatic Ansatz	Jie Liu et.al.	2409.15055	null
2024-09-24	Weighted Approximation By Max-Product Generalized Exponential Sampling Series	Satyaranjan Pradhan et.al.	2409.14884	null
2024-09-21	Weakly magnetized black holes in Einstein-ModMax theory	Haryanto M. Siahaan et.al.	2409.13967	null
2024-09-18	Monomial Matrix Group Equivariant Neural Functional Networks	Hoang V. Tran et.al.	2409.11697	link
2024-09-17	Existence of an extremal function of Sobolev critical embedding with an $α$ -homogeneous weight	Petr Gurka et.al.	2409.11193	null
2024-09-16	Inferring stellar parameters and their uncertainties from high-resolution spectroscopy using invertible neural networks	Nils Candebat et.al.	2409.10621	null
2024-09-13	Non-unitary Wightman CFTs and non-unitary vertex algebras	Sebastiano Carpi et.al.	2409.08454	null
2024-09-12	Global well-posedness and scattering in weighted space for nonlinear Schrödinger equations below the Strauss exponent without gauge-invariance	Masaki Kawamoto et.al.	2409.08432	null
2024-09-09	Fast gradient-free optimization of excitations in variational quantum eigensolvers	Jonas Jäger et.al.	2409.05939	link
2024-09-06	SCARF: Scalable Continual Learning Framework for Memory-efficient Multiple Neural Radiance Fields	Yuze Wang et.al.	2409.04482	null
2024-09-04	Federated Quantum-Train with Batched Parameter Generation	Chen-Yu Liu et.al.	2409.02763	null
2024-09-16	Regret Analysis for Randomized Gaussian Process Upper Confidence Bound	Shion Takeno et.al.	2409.00979	null
2024-08-30	Abstracted Gaussian Prototypes for One-Shot Concept Learning	Chelsea Zou et.al.	2408.17251	link
2024-08-23	Emergence of global receptive fields capturing multipartite quantum correlations	Oleg M. Sotnikov et.al.	2408.13033	null
2024-08-22	**Action of $\mathfrak{osp}(1	2n)$ on polynomials tensor $\mathbb{C}^{0	2n}$**	Dwight Anderson Williams II et.al.
2024-08-19	Unimodal sequences and mixed false theta functions	Kevin Allen et.al.	2408.09789	null
2024-08-16	Onsager-Machlup functional for stochastic lattice dynamical systems driven by time-varying noise	Xinze Zhang et.al.	2408.08465	null
2024-08-10	Variational Inference Failures Under Model Symmetries: Permutation Invariant Posteriors for Bayesian Neural Networks	Yoav Gelberg et.al.	2408.05496	null
2024-08-09	Quasilinear parabolic equations with superlinear nonlinearities in critical spaces	Bogdan-Vasile Matioc et.al.	2408.05067	null
2024-08-08	A framework for generalizing toric inequalities for holographic entanglement entropy	Ning Bao et.al.	2408.04741	null
2024-08-07	Counterfactuals and Uncertainty-Based Explainable Paradigm for the Automated Detection and Segmentation of Renal Cysts in Computed Tomography Images: A Multi-Center Study	Zohaib Salahuddin et.al.	2408.03789	null
2024-08-05	BOTS-LM: Training Large Language Models for Setswana	Nathan Brown et.al.	2408.02239	null
2024-08-02	Conditional LoRA Parameter Generation	Xiaolong Jin et.al.	2408.01415	null
2024-08-01	Reclaiming Residual Knowledge: A Novel Paradigm to Low-Bit Quantization	Róisín Luo et.al.	2408.00923	null
2024-07-31	Semantic Codebook Learning for Dynamic Recommendation Models	Zheqi Lv et.al.	2408.00123	null
2024-07-29	Tensor product weight modules over the affine-Virasoro algebra	Qiu-Fan Chen et.al.	2407.19844	null
2024-07-24	Generalized Hilbert operators acting on weighted spaces of holomorphic functions with sup-norms	María J. Beltrán-Meneu et.al.	2407.17646	null
2024-07-24	Generalized Ordinal Priority Approach for Multi-Attribute Decision-Making under Incomplete Preference Information	Renlong Wang et.al.	2407.17099	null
2024-07-22	WebRPG: Automatic Web Rendering Parameters Generation for Visual Presentation	Zirui Shao et.al.	2407.15502	link
2024-07-18	FSP-Laplace: Function-Space Priors for the Laplace Approximation in Bayesian Deep Learning	Tristan Cinquin et.al.	2407.13711	null
2024-07-19	Parameter Generation of Quantum Approximate Optimization Algorithm with Diffusion Model	Fanxu Meng et.al.	2407.12242	null
2024-07-24	Effect Heterogeneity with Earth Observation in Randomized Controlled Trials: Exploring the Role of Data, Model, and Evaluation Metric Choice	Connor T. Jerzak et.al.	2407.11674	link
2024-07-15	Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusion	Yongyuan Liang et.al.	2407.10973	null
2024-07-16	The well-posedness of generalized nonlinear wave equations on the lattice graph	Bobo Hua et.al.	2407.09815	null
2024-07-15	Enhancing Robustness of Vision-Language Models through Orthogonality Learning and Cross-Regularization	Jinlong Li et.al.	2407.08374	null
2024-07-09	Fine-Tuning Linear Layers Only Is a Simple yet Effective Way for Task Arithmetic	Ruochen Jin et.al.	2407.07089	link
2024-07-04	Recovering Initial States in Semilinear Parabolic Problems from Time-Averages	Lina Sophie Schmitz et.al.	2407.03829	null
2024-07-01	A quantum deformation of the ${\mathcal N}=2$ superconformal algebra	H. Awata et.al.	2407.00901	null
2024-06-24	WARP: On the Benefits of Weight Averaged Rewarded Policies	Alexandre Ramé et.al.	2406.16768	null
2024-06-24	Improving robustness to corruptions with multiplicative weight perturbations	Trung Trinh et.al.	2406.16540	link
2024-06-21	Determination of certain mod $p$ Galois representations using local constancy	Abhik Ganguli et.al.	2406.15600	null
2024-06-21	Elliptic analysis on collapsing gravitational instantons modelled using the Gibbons-Hawking ansatz	Willem Adriaan Salm et.al.	2406.15008	null
2024-06-20	MEAT: Median-Ensemble Adversarial Training for Improving Robustness and Generalization	Zhaozhe Hu et.al.	2406.14259	link
2024-06-18	From Instance Training to Instruction Learning: Task Adapters Generation from Instructions	Huanxuan Liao et.al.	2406.12382	link
2024-06-17	Kaniadakis entropy in extreme gravitational and cosmological environments: a review on the state-of-the-art and future prospects	Giuseppe Gaetano Luciano et.al.	2406.11373	null
2024-06-16	Analysis and approximation of elliptic problems with Uhlenbeck structure in convex polytopes	Tadele Mengesha et.al.	2406.10762	null
2024-06-14	Towards Scalable and Versatile Weight Space Learning	Konstantin Schürholt et.al.	2406.09997	link
2024-06-13	Interpreting the Weight Space of Customized Diffusion Models	Amil Dravid et.al.	2406.09413	link
2024-06-12	Diffusion Soup: Model Merging for Text-to-Image Diffusion Models	Benjamin Biggs et.al.	2406.08431	null
2024-06-24	Cartan monopoles	Andrei Smilga et.al.	2406.06042	null
2024-06-08	Regularized Training with Generated Datasets for Name-Only Transfer of Vision-Language Models	Minho Park et.al.	2406.05432	link
2024-06-06	Regularized KL-Divergence for Well-Defined Function-Space Variational Inference in Bayesian neural networks	Tristan Cinquin et.al.	2406.04317	null
2024-06-06	A characterization of $(μ,ν)$ -dichotomies via admissibility	Lucas Backes et.al.	2406.04126	null
2024-06-05	Reproducing Kernel Thesis of Hankel Operators on Weighted Hardy Spaces	Ana Čolović et.al.	2406.03106	null
2024-05-21	Backpropogation-Free Multi-modal On-Device Model Adaptation via Cloud-Device Collaboration	Wei Ji et.al.	2406.01601	null
2024-05-29	Thermodynamics of the most generalized form of Holographic Dark Energy and some particular cases with Corrected Entropies	Sanghati Saha et.al.	2405.20783	null
2024-06-20	The Empirical Impact of Neural Parameter Symmetries, or Lack Thereof	Derek Lim et.al.	2405.20231	link
2024-05-28	Universal and Extensible Language-Vision Models for Organ Segmentation and Tumor Detection from Abdominal Computed Tomography	Jie Liu et.al.	2405.18356	link
2024-05-28	$C^2M^3$ : Cycle-Consistent Multi-Model Merging	Donato Crisostomi et.al.	2405.17897	link
2024-05-27	Smoothing effects and extinction in finite time for fractional fast diffusions on Riemannian manifolds	Elvise Berchio et.al.	2405.17126	null
2024-05-31	FedSheafHN: Personalized Federated Learning on Graph-structured Data	Wenfei Liang et.al.	2405.16056	null
2024-05-27	HyperInterval: Hypernetwork approach to training weight interval regions in continual learning	Patryk Krukowski et.al.	2405.15444	link
2024-05-23	Scalable Optimization in the Modular Norm	Tim Large et.al.	2405.14813	link
2024-06-16	A refined Weyl character formula for comodules on $\operatorname{GL}_{2,A}$	Helge Øystein Maakestad et.al.	2405.09210	null
2024-05-13	Localizing Task Information for Improved Model Merging and Compression	Ke Wang et.al.	2405.07813	link
2024-05-13	$α$ VIL: Learning to Leverage Auxiliary Tasks for Multitask Learning	Rafael Kourdis et.al.	2405.07769	null
2024-05-12	Approximation by a new sequence of operators involving Laguerre polynomials	Kapil Kumar et.al.	2405.07228	null
2024-05-06	Swarm intelligence for full Stokes dynamic imaging reconstruction of interferometric data	Alejandro Mus et.al.	2405.03330	null
2024-05-04	Large Deviation Principles of Invariant Measures of Stochastic Reaction-Diffusion Lattice Systems	Bixiang Wang et.al.	2405.02720	null
2024-05-03	The Immersed Inextensible Interface Problem in 2D Stokes Flow	Eduardo García-Juárez et.al.	2405.02446	null
2024-05-02	Customizing Text-to-Image Models with a Single Image Pair	Maxwell Jones et.al.	2405.01536	null
2024-04-25	Robust Fine-tuning for Pre-trained 3D Point Cloud Models	Zhibo Zhang et.al.	2404.16422	null
2024-04-23	The Geometry of the Set of Equivalent Linear Neural Networks	Jonathan Richard Shewchuk et.al.	2404.14855	null
2024-04-24	Nonexistence of solutions to parabolic problems with a potential on weighted graphs	Dario D. Monticelli et.al.	2404.12058	null
2024-04-17	On the relaxation to equilibrium of a quantum oscillator interacting with a radiation field	Pierre-A. Vuillermot et.al.	2404.11329	null
2024-04-15	Higher-curvature gravity in AdS $_3$, holographic $c$ -theorems and black hole microstates	Mariano Chernicoff et.al.	2404.10128	null
2024-04-16	Asymptotic-preserving approximations for stochastic incompressible viscous fluids and SPDEs on graph	Jianbo Cui et.al.	2404.09168	null
2024-04-09	Perspective on Physical Interpretations of Rényi Entropy in Statistical Mechanics	Misaki Ozawa et.al.	2404.06436	null
2024-04-09	A gluing construction of singular solutions for a fully non-linear equation in conformal geometry	María Fernanda Espinal et.al.	2404.05965	null
2024-04-05	Dissipative Euler flows originating from circular vortex filaments	Francisco Gancedo et.al.	2404.04250	null
2024-04-05	Macdonald characters from a new formula for Macdonald polynomials	Houcine Ben Dali et.al.	2404.03904	null
2024-04-04	Fundamental inequalities for the iterated Fourier-cosine convolution with Gaussian weight and its application	Nguyen Thi Hong Phuong et.al.	2404.03609	null
2024-03-29	Embracing Unknown Step by Step: Towards Reliable Sparse Training in Real World	Bowen Lei et.al.	2403.20047	link
2024-03-28	Model Stock: All we need is just a few fine-tuned models	Dong-Hwan Jang et.al.	2403.19522	link
2024-03-26	A location Invariant Statistic-Based Consistent Estimation Method for Three-Parameter Generalized Exponential Distribution	Kiran Prajapat et.al.	2403.17609	null
2024-06-03	FissionFusion: Fast Geometric Generation and Hierarchical Souping for Medical Image Analysis	Santosh Sanjeev et.al.	2403.13341	link
2024-06-18	Learning Useful Representations of Recurrent Neural Network Weight Matrices	Vincent Herrmann et.al.	2403.11998	link
2024-03-16	Function-space Parameterization of Neural Networks for Sequential Learning	Aidan Scannell et.al.	2403.10929	link
2024-03-14	Imprints of Barrow-Tsallis Cosmology in Primordial Gravitational Waves	Petr Jizba et.al.	2403.09797	null
2024-03-14	Eigenvariety for partially classical Hilbert modular forms	Mladen Dimitrov et.al.	2403.09784	null
2024-03-12	The solenoidal Heisenberg Virasoro algebra and its simple weight modules	Boujemaa Agrebaoui et.al.	2403.07381	null
2024-03-10	FrameQuant: Flexible Low-Bit Quantization for Transformers	Harshavardhan Adepu et.al.	2403.06082	link
2024-03-06	The solenoidal Virasoro algebra and its simple weight modules	Boujemaa Agrebaoui et.al.	2403.03753	null
2024-03-05	Tensor Decomposition-based Time Varying Channel Estimation for mmWave MIMO-OFDM Systems	Ruizhe Wang et.al.	2403.02942	null
2024-03-05	Neural Redshift: Random Networks are not Random Functions	Damien Teney et.al.	2403.02241	null
2024-03-04	Tiny fluctuations of the averaging process around its degenerate steady state	Federico Sau et.al.	2403.02032	null
2024-03-15	Training-Free Pretrained Model Merging	Zhengqi Xu et.al.	2403.01753	link
2024-04-22	HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances	Supreeth Narasimhaswamy et.al.	2403.01693	null
2024-03-13	TOOLVERIFIER: Generalization to New Tools via Self-Verification	Dheeraj Mekala et.al.	2402.14158	link
2024-02-21	Computing Tangent Spaces to Eigenvarieties	James Rawson et.al.	2402.13799	null
2024-05-28	Neural Network Parameter Diffusion	Kai Wang et.al.	2402.13144	link
2024-02-19	Exponential attractors for a nonlocal delayed reaction-diffusion equation on an unbounded domain	Wenjie Hu et.al.	2402.11856	null
2024-02-18	Discrete Neural Algorithmic Reasoning	Gleb Rodionov et.al.	2402.11628	link
2024-02-17	Uncertainty Quantification of Graph Convolution Neural Network Models of Evolving Processes	Jeremiah Hauth et.al.	2402.11179	null
2024-06-06	Generalizability of Mixture of Domain-Specific Adapters from the Lens of Signed Weight Directions and its Application to Effective Model Pruning	Tuc Nguyen et.al.	2402.10639	null
2024-02-14	TAI-GAN: A Temporally and Anatomically Informed Generative Adversarial Network for early-to-late frame conversion in dynamic cardiac PET inter-frame motion correction	Xueqi Guo et.al.	2402.09567	null
2024-02-14	The cohomology of $p$ -adic Deligne-Luszitg schemes of Coxeter type	Alexander B. Ivanov et.al.	2402.09017	null
2024-02-09	The Asymptotic Structure of Cosmological Integrals	Paolo Benincasa et.al.	2402.06558	null
2024-02-07	Universal Neural Functionals	Allan Zhou et.al.	2402.05232	link
2024-02-06	Maximal regularity and optimal control for a non-local Cahn-Hilliard tumour growth model	Matteo Fornoni et.al.	2402.04204	null
2024-02-06	Improved Generalization of Weight Space Networks via Augmentations	Aviv Shamsian et.al.	2402.04081	link
2024-02-02	Training-time Neuron Alignment through Permutation Subspace for Improving Linear Mode Connectivity and Model Fusion	Zexi Li et.al.	2402.01342	null
2024-02-01	Understanding Neural Network Systems for Image Analysis using Vector Spaces and Inverse Maps	Rebecca Pattichis et.al.	2402.00261	link
2024-01-26	Do deep neural networks utilize the weight space efficiently?	Onur Can Koyun et.al.	2401.16438	null
2024-01-22	On strong growth conditions for weighted spaces of entire functions	Gerhard Schindl et.al.	2401.14330	null
2024-01-24	Task structure and nonlinearity jointly determine learned representational geometry	Matteo Alleman et.al.	2401.13558	null
2024-01-25	Sparse Domination of Singular Bilinear Forms on Non-Homogeneous spaces	Paco Villarroya et.al.	2401.13130	null
2024-01-22	WARM: On the Benefits of Weight Averaged Reward Models	Alexandre Ramé et.al.	2401.12187	null
2024-01-17	Cesàro operators associated with Borel measures acting on weighted spaces of holomorphic functions with sup-norm	Maria José Beltrán Meneu et.al.	2401.09406	null
2024-01-15	Singular fractal dimension at periodicity cascades in parameters spaces	Carlos E. P. Abreu et.al.	2401.07648	null
2024-01-17	Computing Fringe Presentations of Multigraded Persistence Modules	Fabian Lenzen et.al.	2401.06008	null
2024-01-10	Grimoire is All You Need for Enhancing Large Language Models	Ding Chen et.al.	2401.03385	link
2024-03-26	Artificial Intelligence for Operations Research: Revolutionizing the Operations Research Process	Zhenan Fan et.al.	2401.03244	null
2023-12-31	A Compact Representation for Bayesian Neural Networks By Removing Permutation Symmetry	Tim Z. Xiao et.al.	2401.00611	link
2023-12-28	Fractional non-homogeneous counting process	Nick Laskin et.al.	2312.17389	null
2023-12-28	Some unimodal sequences of Kronecker coefficients	Alimzhan Amanov et.al.	2312.17054	null
2023-12-24	The Vlasov-Maxwell-Boltzmann/Landau system with polynomial perturbation near Maxwellian	Chuqi Cao et.al.	2312.15510	null
2023-12-22	Emage: Non-Autoregressive Text-to-Image Generation	Zhangyin Feng et.al.	2312.14988	null
2023-12-21	Hypercyclic shifts on lattice graphs	Anton Baranov et.al.	2312.13934	null
2023-12-21	Scattering for 2d semi-relativistic Hartree equations with short range potential	Changhun Yang et.al.	2312.13606	null
2023-12-21	Entropic Inflation in Presence of Scalar Field	Sergei D. Odintsov et.al.	2312.13587	null
2023-12-30	Time is Encoded in the Weights of Finetuned Language Models	Kai Nylund et.al.	2312.13401	link
2023-12-14	Efficient momentum space approach to superconductivity in quasiperiodic systems	Mao Yoshii et.al.	2312.09124	null
2023-12-13	Best one-sided algebraic approximation by average modulus	Raheam A. Al-Saphory et.al.	2312.08407	null
2023-12-19	Well-Posedness of Quasilinear Parabolic Equations in Time-Weighted Spaces	Bogdan Matioc et.al.	2312.07974	null
2023-12-12	Rethinking Compression: Reduced Order Modelling of Latent Features in Large Language Models	Arnav Chavan et.al.	2312.07046	link
2023-12-11	Model Breadcrumbs: Scaling Multi-Task Model Merging with Sparse Masks	MohammadReza Davari et.al.	2312.06795	null
2023-12-08	Stoichiometry preservation and generalization of Bilger mixture fraction for non-premixed combustion with differential molecular diffusion	Haifeng Wang et.al.	2312.05204	null
2023-12-01	New polyconvolution product for Fourier-cosine and Laplace integral operators and their applications	Trinh Tuan et.al.	2312.00764	null
2023-11-30	Modelling Einstein cluster using Einasto profile	Ritwik Acharyya et.al.	2311.18622	null
2023-11-27	Extraction of the microscopic properties of quasi-particles using deep neural networks	Olga Soloveva et.al.	2311.15984	null
2024-01-24	Deep Latent Force Models: ODE-based Process Convolutions for Bayesian Deep Learning	Thomas Baldwin-McDonald et.al.	2311.14828	null

Data Distillation

Publish Date	Title	Authors	PDF	Code
2024-10-25	FLiP: Privacy-Preserving Federated Learning based on the Principle of Least Privileg	ShiMao Xu et.al.	2410.19548	null
2024-10-25	SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models	Jahyun Koo et.al.	2410.19503	null
2024-10-24	AlignCap: Aligning Speech Emotion Captioning to Human Preferences	Ziqi Liang et.al.	2410.19134	null
2024-10-24	High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling Laws	M. Emrullah Ildiz et.al.	2410.18837	null
2024-10-24	Knowledge Distillation Using Frontier Open-source LLMs: Generalizability and the Role of Synthetic Data	Anup Shirgaonkar et.al.	2410.18588	null
2024-10-24	SIKeD: Self-guided Iterative Knowledge Distillation for mathematical reasoning	Shivam Adarsh et.al.	2410.18574	link
2024-10-23	ELAICHI: Enhancing Low-resource TTS by Addressing Infrequent and Low-frequency Character Bigrams	Srija Anand et.al.	2410.17901	null
2024-10-23	Towards Active Participant-Centric Vertical Federated Learning: Some Representations May Be All You Need	Jon Irureta et.al.	2410.17648	null
2024-10-23	Towards Effective Data-Free Knowledge Distillation via Diverse Diffusion Augmentation	Muquan Li et.al.	2410.17606	link
2024-10-23	Physics-driven AI for Channel Estimation in Cellular Network	Xiaoqian Qi et.al.	2410.17525	null
2024-10-22	MiniPLM: Knowledge Distillation for Pre-Training Language Models	Yuxian Gu et.al.	2410.17215	link
2024-10-22	Emphasizing Discriminative Features for Dataset Distillation in Complex Scenarios	Kai Wang et.al.	2410.17193	link
2024-10-22	CK4Gen: A Knowledge Distillation Framework for Generating High-Utility Synthetic Survival Datasets in Healthcare	Nicholas I-Hsien Kuo et.al.	2410.16872	null
2024-10-22	AttriPrompter: Auto-Prompting with Attribute Semantics for Zero-shot Nuclei Detection via Visual-Language Pre-trained Models	Yongjian Wu et.al.	2410.16820	link
2024-10-22	SafetyAnalyst: Interpretable, transparent, and steerable LLM safety moderation	Jing-Jing Li et.al.	2410.16665	null
2024-10-21	Pre-training Distillation for Large Language Models: A Design Space Exploration	Hao Peng et.al.	2410.16215	null
2024-10-18	Interpreting Microbiome Relative Abundance Data Using Symbolic Regression	Swagatam Haldar et.al.	2410.16109	link
2024-10-21	Are Large-scale Soft Labels Necessary for Large-scale Dataset Distillation?	Lingao Xiao et.al.	2410.15919	link
2024-10-21	Model Mimic Attack: Knowledge Distillation for Provably Transferable Adversarial Examples	Kirill Lukyanov et.al.	2410.15889	null
2024-10-20	Hybrid Memory Replay: Blending Real and Distilled Data for Class Incremental Learning	Jiangtao Kong et.al.	2410.15372	null
2024-10-20	GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning	Haiwen Diao et.al.	2410.15266	link
2024-10-19	LLaVA-Ultra: Large Chinese Language and Vision Assistant for Ultrasound	Xuechen Guo et.al.	2410.15074	null
2024-10-19	Improving Pronunciation and Accent Conversion through Knowledge Distillation And Synthetic Ground-Truth from Native TTS	Tuan Nam Nguyen et.al.	2410.14997	null
2024-10-17	CAKD: A Correlation-Aware Knowledge Distillation Framework Based on Decoupling Kullback-Leibler Divergence	Zao Zhang et.al.	2410.14741	null
2024-10-18	Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation	Shuai Zhao et.al.	2410.14425	link
2024-10-18	Preview-based Category Contrastive Learning for Knowledge Distillation	Muhe Ding et.al.	2410.14143	null
2024-10-17	Leveraging Fine-Tuned Language Models for Efficient and Accurate Smart Contract Auditing	Zhiyuan Wei et.al.	2410.13918	link
2024-10-17	GDeR: Safeguarding Efficiency, Balancing, and Robustness via Prototypical Graph Pruning	Guibin Zhang et.al.	2410.13761	link
2024-10-17	An Active Learning Framework for Inclusive Generation by Large Language Models	Sabit Hassan et.al.	2410.13641	null
2024-10-18	Towards Satellite Non-IID Imagery: A Spectral Clustering-Assisted Federated Learning Approach	Luyao Zou et.al.	2410.13602	null
2024-10-17	Enhancing Dataset Distillation via Label Inconsistency Elimination and Learning Pattern Refinement	Chuhao Zhou et.al.	2410.13311	link
2024-10-18	Cyber Attacks Prevention Towards Prosumer-based EV Charging Stations: An Edge-assisted Federated Prototype Knowledge Distillation Approach	Luyao Zou et.al.	2410.13260	null
2024-10-16	TAS: Distilling Arbitrary Teacher and Student via a Hybrid Assistant	Guopeng Li et.al.	2410.12342	null
2024-10-16	Optimizing YOLOv5s Object Detection through Knowledge Distillation algorithm	Guanming Huang et.al.	2410.12259	null
2024-10-16	TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration	Yiwei Guo et.al.	2410.12183	link
2024-10-17	SAM-Guided Masked Token Prediction for 3D Scene Understanding	Zhimin Chen et.al.	2410.12158	null
2024-10-15	MoE-Pruner: Pruning Mixture-of-Experts Large Language Model using the Hints from Its Router	Yanyue Xie et.al.	2410.12013	null
2024-10-15	Breaking Modality Gap in RGBT Tracking: Coupled Knowledge Distillation	Andong Lu et.al.	2410.11586	link
2024-10-15	Learning from Imperfect Data: Towards Efficient Knowledge Distillation of Autoregressive Language Models for Text-to-SQL	Qihuang Zhong et.al.	2410.11371	null
2024-10-15	Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling	Wenda Xu et.al.	2410.11325	null
2024-10-14	BrainMVP: Multi-modal Vision Pre-training for Brain Image Analysis using Multi-parametric MRI	Shaohao Rui et.al.	2410.10604	null
2024-10-14	ROSAR: An Adversarial Re-Training Framework for Robust Side-Scan Sonar Object Detection	Martin Aubard et.al.	2410.10554	link
2024-10-14	Temperature-Centric Investigation of Speculative Decoding with Knowledge Distillation	Siru Ouyang et.al.	2410.10141	null
2024-10-14	REHRSeg: Unleashing the Power of Self-Supervised Super-Resolution for Resource-Efficient 3D MRI Segmentation	Zhiyun Song et.al.	2410.10097	null
2024-10-15	Self-Data Distillation for Recovering Quality in Pruned Large Language Models	Vithursan Thangarasa et.al.	2410.09982	null
2024-10-13	Generalized Group Data Attribution	Dan Ley et.al.	2410.09940	null
2024-10-12	Distilling Invariant Representations with Dual Augmentation	Nikolaos Giakoumoglou et.al.	2410.09474	null
2024-10-12	Declarative Knowledge Distillation from Large Language Models for Visual Question Answering Datasets	Thomas Eiter et.al.	2410.09428	link
2024-10-15	Transforming In-Vehicle Network Intrusion Detection: VAE-based Knowledge Distillation Meets Explainable AI	Muhammet Anil Yagiz et.al.	2410.09043	null
2024-10-11	Mentor-KD: Making Small Language Models Better Multi-step Reasoners	Hojae Lee et.al.	2410.09037	link
2024-10-11	Contrastive Knowledge Distillation for Robust Multimodal Sentiment Analysis	Zhongyi Sang et.al.	2410.08692	null
2024-10-11	DistDD: Distributed Data Distillation Aggregation through Gradient Matching	Peiran Wang et.al.	2410.08665	null
2024-10-11	GAI-Enabled Explainable Personalized Federated Semi-Supervised Learning	Yubo Peng et.al.	2410.08634	null
2024-10-11	Simultaneous Reward Distillation and Preference Learning: Get You a Language Model Who Can Do Both	Abhijnan Nath et.al.	2410.08458	null
2024-10-10	What is Left After Distillation? How Knowledge Transfer Impacts Fairness and Bias	Aida Mohammadshahi et.al.	2410.08407	null
2024-10-10	A Lightweight Target-Driven Network of Stereo Matching for Inland Waterways	Jing Su et.al.	2410.07915	null
2024-10-10	SNN-PAR: Energy Efficient Pedestrian Attribute Recognition via Spiking Neural Networks	Haiyang Wang et.al.	2410.07857	link
2024-10-12	Relational Diffusion Distillation for Efficient Image Generation	Weilun Feng et.al.	2410.07679	link
2024-10-10	Teddy: Efficient Large-Scale Dataset Distillation via Taylor-Approximated Matching	Ruonan Yu et.al.	2410.07579	null
2024-10-09	Unlocking Real-Time Fluorescence Lifetime Imaging: Multi-Pixel Parallelism for FPGA-Accelerated Processing	Ismail Erbas et.al.	2410.07364	null
2024-10-09	S2HPruner: Soft-to-Hard Distillation Bridges the Discretization Gap in Pruning	Weihao Lin et.al.	2410.07046	null
2024-10-09	Structure-Centric Robust Monocular Depth Estimation via Knowledge Distillation	Runze Chen et.al.	2410.06982	null
2024-10-09	Efficient and Robust Knowledge Distillation from A Stronger Teacher Based on Correlation Matching	Wenqi Niu et.al.	2410.06561	null
2024-10-10	KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server	Wenhao Wang et.al.	2410.05725	link
2024-10-07	Progressive distillation induces an implicit curriculum	Abhishek Panigrahi et.al.	2410.05464	null
2024-10-07	ReasoningRank: Teaching Student Models to Rank through Reasoning-Based Knowledge Distillation	Yuelyu Ji et.al.	2410.05168	null
2024-10-07	MetaDD: Boosting Dataset Distillation with Neural Network Architecture-Invariant Generalization	Yunlong Zhao et.al.	2410.05103	null
2024-10-06	CAPEEN: Image Captioning with Early Exits and Knowledge Distillation	Divya Jyoti Bajpai et.al.	2410.04433	link
2024-10-06	DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs	Divya Jyoti Bajpai et.al.	2410.04424	link
2024-10-10	Towards Understanding and Enhancing Security of Proof-of-Training for DNN Model Ownership Verification	Yijia Chang et.al.	2410.04397	null
2024-10-10	Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution	Jianze Li et.al.	2410.04224	link
2024-10-05	Accelerating Diffusion Models with One-to-Many Knowledge Distillation	Linfeng Zhang et.al.	2410.04191	null
2024-10-05	DiDOTS: Knowledge Distillation from Large-Language-Models for Dementia Obfuscation in Transcribed Speech	Dominika Woszczyk et.al.	2410.04188	null
2024-10-05	Gap Preserving Distillation by Building Bidirectional Mappings with A Dynamic Teacher	Yong Guo et.al.	2410.04140	null
2024-10-05	WiDistill: Distilling Large-scale Wi-Fi Datasets with Trajectory Matching	Tiantian Wang et.al.	2410.04073	link
2024-10-04	Enhance Reasoning by Learning from Mistakes: Peer-Review Knowledge Distillation from Multiple Large Language Models	Zhuochun Li et.al.	2410.03663	link
2024-10-04	DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models	Sungnyun Kim et.al.	2410.03061	null
2024-10-03	Dataset Distillation via Knowledge Distillation: Towards Efficient Self-Supervised Pre-Training of Deep Networks	Siddharth Joshi et.al.	2410.02116	link
2024-10-02	PHI-S: Distribution Balancing for Label-Free Multi-Teacher Distillation	Mike Ranzinger et.al.	2410.01680	null
2024-10-04	HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models	Seanie Lee et.al.	2410.01524	link
2024-10-02	Foldable SuperNets: Scalable Merging of Transformers with Different Initializations and Tasks	Edan Kinderman et.al.	2410.01483	link
2024-10-02	PairDistill: Pairwise Relevance Distillation for Dense Retrieval	Chao-Wei Huang et.al.	2410.01383	link
2024-10-02	“No Matter What You Do!”: Mitigating Backdoor Attacks in Graph Neural Networks	Jiale Zhang et.al.	2410.01272	link
2024-10-01	Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging	Ismail Erbas et.al.	2410.00948	null
2024-10-01	Local-to-Global Self-Supervised Representation Learning for Diabetic Retinopathy Grading	Mostafa Hajighasemloua et.al.	2410.00779	null
2024-10-01	Efficient Technical Term Translation: A Knowledge Distillation Approach for Parenthetical Terminology Translation	Jiyoon Myung et.al.	2410.00683	null
2024-10-01	AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code Generation	Ziyang Luo et.al.	2410.00558	link
2024-10-01	Self-Updatable Large Language Models with Parameter Integration	Yu Wang et.al.	2410.00487	null
2024-10-01	Advancing Medical Radiograph Representation Learning: A Hybrid Pre-training Paradigm with Multilevel Semantic Granularity	Hanqi Jiang et.al.	2410.00448	null
2024-09-30	Collaborative Knowledge Distillation via a Learning-by-Education Node Community	Anestis Kaimakamidis et.al.	2410.00074	null
2024-09-30	Enhancing Romanian Offensive Language Detection through Knowledge Distillation, Multi-Task Learning, and Data Augmentation	Vlad-Cristian Matei et.al.	2409.20498	null
2024-10-02	Linear Projections of Teacher Embeddings for Few-Class Distillation	Noel Loo et.al.	2409.20449	null
2024-09-30	Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies	Shalini Sarode et.al.	2409.20237	null
2024-10-01	HYDRA-FL: Hybrid Knowledge Distillation for Robust and Accurate Federated Learning	Momin Ahmad Khan et.al.	2409.19912	null
2024-09-29	Tailored Federated Learning: Leveraging Direction Regulation & Knowledge Distillation	Huidong Tang et.al.	2409.19741	null
2024-09-29	InfantCryNet: A Data-driven Framework for Intelligent Analysis of Infant Cries	Mengze Hong et.al.	2409.19689	null
2024-09-28	Mind the Gap: Promoting Missing Modality Brain Tumor Segmentation with Alignment	Tianyi Liu et.al.	2409.19366	null
2024-09-27	Semi-Supervised Bone Marrow Lesion Detection from Knee MRI Segmentation Using Mask Inpainting Models	Shihua Qin et.al.	2409.19185	null
2024-09-27	Multi-modal Cross-domain Self-supervised Pre-training for fMRI and EEG Fusion	Xinxu Wei et.al.	2409.19130	null
2024-10-01	Pruning then Reweighting: Towards Data-Efficient Training of Diffusion Models	Yize Li et.al.	2409.19128	link
2024-09-27	MiniVLN: Efficient Vision-and-Language Navigation by Progressive Knowledge Distillation	Junyou Zhu et.al.	2409.18800	null
2024-09-27	Student-Oriented Teacher Knowledge Refinement for Knowledge Distillation	Chaomin Shen et.al.	2409.18785	null
2024-09-27	Harmonizing knowledge Transfer in Neural Network with Unified Distillation	Yaomin Huang et.al.	2409.18565	null
2024-09-27	Towards Diverse Device Heterogeneous Federated Learning via Task Arithmetic Knowledge Integration	Mahdi Morafah et.al.	2409.18461	link
2024-10-01	Backdoor Attacks for LLMs with Weak-To-Strong Knowledge Distillation	Shuai Zhao et.al.	2409.17946	null
2024-09-26	Kendall’s $τ$ Coefficient for Logits Distillation	Yuchen Guan et.al.	2409.17823	null
2024-09-26	Diversity-Driven Synthesis: Enhancing Dataset Distillation through Directed Weight Adjustment	Jiawei Du et.al.	2409.17612	link
2024-09-26	Dataset Distillation-based Hybrid Federated Learning on Non-IID Data	Xiufang Shi et.al.	2409.17517	null
2024-09-26	Shape-intensity knowledge distillation for robust medical image segmentation	Wenhui Dong et.al.	2409.17503	link
2024-09-25	MT2KD: Towards A General-Purpose Encoder for Speech, Speaker, and Audio Events	Xiaoyu Yang et.al.	2409.17010	null
2024-09-25	Adverse Weather Optical Flow: Cumulative Homogeneous-Heterogeneous Adaptation	Hanyu Zhou et.al.	2409.17001	null
2024-09-25	A Novel Framework for Analyzing Structural Transformation in Data-Constrained Economies Using Bayesian Modeling and Machine Learning	Ronald Katende et.al.	2409.16738	null
2024-09-25	SelectiveKD: A semi-supervised framework for cancer detection in DBT through Knowledge Distillation and Pseudo-labeling	Laurent Dillard et.al.	2409.16581	null
2024-09-24	AIM 2024 Challenge on UHD Blind Photo Quality Assessment	Vlad Hosu et.al.	2409.16271	null
2024-09-24	Label-Augmented Dataset Distillation	Seoungyoon Kang et.al.	2409.16239	null
2024-09-25	Privacy Evaluation Benchmarks for NLP Models	Wei Huang et.al.	2409.15868	link
2024-09-24	Twin Network Augmentation: A Novel Training Strategy for Improved Spiking Neural Networks and Efficient Weight Quantization	Lucas Deckers et.al.	2409.15849	null
2024-09-23	TS-TCD: Triplet-Level Cross-Modal Distillation for Time-Series Forecasting Using Large Language Models	Pengfei Wang et.al.	2409.14978	null
2024-09-23	DSG-KD: Knowledge Distillation from Domain-Specific to General Language Models	Sangyeon Cho et.al.	2409.14904	link
2024-09-23	Pre-trained Language Model and Knowledge Distillation for Lightweight Sequential Recommendation	Li Li et.al.	2409.14810	null
2024-09-23	An Adverse Weather-Immune Scheme with Unfolded Regularization and Foundation Model Knowledge Distillation for Street Scene Understanding	Wei-Bin Kou et.al.	2409.14737	null
2024-09-22	EchoAtt: Attend, Copy, then Adjust for More Efficient Large Language Models	Hossein Rajabzadeh et.al.	2409.14595	null
2024-09-22	Prior Knowledge Distillation Network for Face Super-Resolution	Qiu Yang et.al.	2409.14385	null
2024-09-25	DilateQuant: Accurate and Efficient Diffusion Quantization via Weight Dilation	Xuewen Liu et.al.	2409.14307	null
2024-09-18	Applications of Knowledge Distillation in Remote Sensing: A Survey	Yassine Himeur et.al.	2409.12111	null
2024-09-18	Data Efficient Acoustic Scene Classification using Teacher-Informed Confusing Class Instruction	Jin Jie Sean Yeo et.al.	2409.11964	null
2024-09-18	Distillation-free Scaling of Large SSMs for Images and Videos	Hamid Suleman et.al.	2409.11867	null
2024-09-18	EFCM: Efficient Fine-tuning on Compressed Models for deployment of large models in medical image analysis	Shaojie Li et.al.	2409.11817	null
2024-09-18	Efficient Low-Resolution Face Recognition via Bridge Distillation	Shiming Ge et.al.	2409.11786	null
2024-09-18	RUIE: Retrieval-based Unified Information Extraction using Large Language Model	Xincheng Liao et.al.	2409.11673	link
2024-09-17	Time-Series Forecasting, Knowledge Distillation, and Refinement within a Multimodal PDE Foundation Model	Derek Jollie et.al.	2409.11609	link
2024-09-17	Unleashing the Potential of Mamba: Boosting a LiDAR 3D Sparse Detector by Using Cross-Model Knowledge Distillation	Rui Yu et.al.	2409.11018	null
2024-09-17	Single-stage TTS with Masked Audio Token Modeling and Semantic Knowledge Distillation	Gerard I. Gállego et.al.	2409.11003	null
2024-09-16	Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning	Amin Karimi Monsefi et.al.	2409.10362	link
2024-09-16	Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference	Huy-Dung Nguyen et.al.	2409.10095	null
2024-09-14	Effective Pre-Training of Audio Transformers for Sound Event Detection	Florian Schmid et.al.	2409.09546	link
2024-09-14	Integrated Multi-Level Knowledge Distillation for Enhanced Speaker Verification	Wenhao Yang et.al.	2409.09389	null
2024-09-14	Joint Semantic Knowledge Distillation and Masked Acoustic Modeling for Full-band Speech Restoration with Improved Intelligibility	Xiaoyu Liu et.al.	2409.09357	null
2024-09-13	Exploring System-Heterogeneous Federated Learning with Dynamic Model Selection	Dixi Yao et.al.	2409.08858	null
2024-09-13	AWF: Adaptive Weight Fusion for Enhanced Class Incremental Semantic Segmentation	Zechao Sun et.al.	2409.08516	null
2024-09-12	DiReDi: Distillation and Reverse Distillation for AIoT Applications	Chen Sun et.al.	2409.08308	null
2024-09-12	Ruri: Japanese General Text Embeddings	Hayato Tsukagoshi et.al.	2409.07737	link
2024-09-12	Learn from Balance: Rectifying Knowledge Transfer for Long-Tailed Scenarios	Xinlei Huang et.al.	2409.07694	null
2024-09-11	DS-ViT: Dual-Stream Vision Transformer for Cross-Task Distillation in Alzheimer’s Early Diagnosis	Ke Chen et.al.	2409.07584	null
2024-09-11	EchoDFKD: Data-Free Knowledge Distillation for Cardiac Ultrasound Segmentation using Synthetic Data	Grégoire Petit et.al.	2409.07566	link
2024-09-11	Enhancing CTC-Based Visual Speech Recognition	Hendrik Laux et.al.	2409.07210	null
2024-09-11	A Continual and Incremental Learning Approach for TinyML On-device Training Using Dataset Distillation and Model Size Adaption	Marcus Rüb et.al.	2409.07114	null
2024-09-16	Privacy-Preserving Federated Learning with Consistency via Knowledge Distillation Using Conditional Generator	Kangyang Luo et.al.	2409.06955	null
2024-09-10	Applied Federated Model Personalisation in the Industrial Domain: A Comparative Study	Ilias Siniosoglou et.al.	2409.06904	null
2024-09-10	EasyST: A Simple Framework for Spatio-Temporal Prediction	Jiabin Tang et.al.	2409.06748	link
2024-09-10	Knowledge Distillation via Query Selection for Detection Transformer	Yi Liu et.al.	2409.06443	null
2024-09-10	Distilling Generative-Discriminative Representations for Very Low-Resolution Face Recognition	Junzheng Zhang et.al.	2409.06371	null
2024-09-09	Joint Input and Output Coordination for Class-Incremental Learning	Shuai Wang et.al.	2409.05620	null
2024-09-09	LEROjD: Lidar Extended Radar-Only Object Detection	Patrick Palmer et.al.	2409.05564	link
2024-09-09	Look One and More: Distilling Hybrid Order Relational Knowledge for Cross-Resolution Image Recognition	Shiming Ge et.al.	2409.05384	null
2024-09-09	FedBrain-Distill: Communication-Efficient Federated Brain Tumor Classification Using Ensemble Knowledge Distillation on Non-IID Data	Rasoul Jafari Gohari et.al.	2409.05359	link
2024-09-07	LoCa: Logit Calibration for Knowledge Distillation	Runming Yang et.al.	2409.04778	null
2024-09-06	SCARF: Scalable Continual Learning Framework for Memory-efficient Multiple Neural Radiance Fields	Yuze Wang et.al.	2409.04482	null
2024-09-05	Experimentation in Content Moderation using RWKV	Umut Yildirim et.al.	2409.03939	null
2024-09-05	Data-Efficient Generation for Dataset Distillation	Zhe Li et.al.	2409.03929	null
2024-09-05	DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture	Qianlong Xiang et.al.	2409.03550	link
2024-09-05	Data-free Distillation with Degradation-prompt Diffusion for Multi-weather Image Restoration	Pei Wang et.al.	2409.03455	null
2024-09-05	Efficient Image Compression Using Advanced State Space Models	Bouzid Arezki et.al.	2409.02743	null
2024-09-04	CLDA: Collaborative Learning for Enhanced Unsupervised Domain Adaptation	Minhee Cho et.al.	2409.02699	null
2024-09-04	Low-Resolution Object Recognition with Cross-Resolution Relational Contrastive Distillation	Kangkai Zhang et.al.	2409.02555	null
2024-09-04	A design of magnetic tunnel junctions for the deployment of neuromorphic hardware for edge computing	Davi Rodrigues et.al.	2409.02528	null
2024-09-04	Non-target Divergence Hypothesis: Toward Understanding Domain Gaps in Cross-Modal Knowledge Distillation	Yilong Chen et.al.	2409.02438	null
2024-09-03	Low-Resolution Face Recognition via Adaptable Instance-Relation Distillation	Ruixin Shi et.al.	2409.02049	null
2024-09-03	Efficient Point Cloud Classification via Offline Distillation Framework and Negative-Weight Self-Distillation Technique	Qiang Zheng et.al.	2409.02020	null
2024-09-03	Contemporary Model Compression on Large Language Models Inference	Dong Liu et.al.	2409.01990	link
2024-09-05	Adaptive Explicit Knowledge Transfer for Knowledge Distillation	Hyungkeun Park et.al.	2409.01679	null
2024-09-03	Improving Apple Object Detection with Occlusion-Enhanced Distillation	Liang Geng et.al.	2409.01573	null
2024-09-02	Dataset Distillation from First Principles: Integrating Core Information Extraction and Purposeful Learning	Vyacheslav Kungurtsev et.al.	2409.01410	null
2024-09-02	MobileIQA: Exploiting Mobile-level Diverse Opinion Network For No-Reference Image Quality Assessment Using Knowledge Distillation	Zewen Chen et.al.	2409.01212	link
2024-09-04	Diffusion-Driven Data Replay: A Novel Approach to Combat Forgetting in Federated Class Continual Learning	Jinglin Liang et.al.	2409.01128	link
2024-09-02	Compressing VAE-Based Out-of-Distribution Detectors for Embedded Deployment	Aditya Bansal et.al.	2409.00880	null
2024-09-01	LanguaShrink: Reducing Token Overhead with Psycholinguistics	Xuechen Liang et.al.	2409.00855	null
2024-08-30	How Knowledge Distillation Mitigates the Synthetic Gap in Fair Face Recognition	Pedro C. Neto et.al.	2408.17399	link
2024-08-30	HiTSR: A Hierarchical Transformer for Reference-based Super-Resolution	Masoomeh Aslahishahri et.al.	2408.16959	link
2024-08-29	VLM-KD: Knowledge Distillation from VLM for Long-Tail Visual Recognition	Zaiwei Zhang et.al.	2408.16930	null
2024-08-29	Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling	Hritik Bansal et.al.	2408.16737	null
2024-08-29	MST-KD: Multiple Specialized Teachers Knowledge Distillation for Fair Face Recognition	Eduarda Caldeira et.al.	2408.16563	link
2024-08-29	UDD: Dataset Distillation via Mining Underutilized Regions	Shiguang Wang et.al.	2408.16268	null
2024-08-29	Neural Spectral Decomposition for Dataset Distillation	Shaolei Yang et.al.	2408.16236	null
2024-08-28	EMP: Enhance Memory in Data Pruning	Jinying Xiao et.al.	2408.16031	null
2024-08-28	LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation	Fangxun Shu et.al.	2408.15881	link
2024-08-28	ModalityMirror: Improving Audio Classification in Modality Heterogeneity Federated Learning with Multimodal Distillation	Tiantian Feng et.al.	2408.15803	null
2024-08-28	Online pre-training with long-form videos	Itsuki Kato et.al.	2408.15651	null
2024-08-28	Boosting Lossless Speculative Decoding via Feature Sampling and Partial Alignment Distillation	Lujun Gui et.al.	2408.15562	null
2024-08-27	Leveraging Self-supervised Audio Representations for Data-Efficient Acoustic Scene Classification	Yiqiang Cai et.al.	2408.14862	link
2024-08-26	Bridging the Gap: Unpacking the Hidden Challenges in Knowledge Distillation for Online Ranking Systems	Nikhil Khani et.al.	2408.14678	null
2024-08-26	TSAK: Two-Stage Semantic-Aware Knowledge Distillation for Efficient Wearable Modality and Model Optimization in Manufacturing Lines	Hymalai Bello et.al.	2408.14146	null

Schrodinger Bridge

Publish Date	Title	Authors	PDF	Code
2025-07-23	Yume: An Interactive World Generation Model	Xiaofeng Mao et.al.	2507.17744	null
2025-07-23	Flow Matching Meets Biology and Life Science: A Survey	Zihao Li et.al.	2507.17731	null
2025-07-23	Piecewise Control Barrier Functions for Stochastic Systems	Rayan Mazouz et.al.	2507.17703	null
2025-07-23	CNS-Bench: Benchmarking Image Classifier Robustness Under Continuous Nuisance Shifts	Olaf Dünkel et.al.	2507.17651	null
2025-07-23	Dual-branch Prompting for Multimodal Machine Translation	Jie Wang et.al.	2507.17588	null
2025-07-23	An h-space Based Adversarial Attack for Protection Against Few-shot Personalization	Xide Xu et.al.	2507.17554	null
2025-07-23	Federated Majorize-Minimization: Beyond Parameter Aggregation	Aymeric Dieuleveut et.al.	2507.17534	null
2025-07-23	Generalized Advantage Estimation for Distributional Policy Gradients	Shahil Shaik et.al.	2507.17530	null
2025-07-23	HOTA: Hamiltonian framework for Optimal Transport Advection	Nazar Buzun et.al.	2507.17513	null
2025-07-23	Accelerating Parallel Diffusion Model Serving with Residual Compression	Jiajun Luo et.al.	2507.17511	null
2025-07-23	Exact results for active particle models: from long-range interactions to first-passage properties	Léo Touzo et.al.	2507.17504	null
2025-07-23	Unsupervised anomaly detection using Bayesian flow networks: application to brain FDG PET in the context of Alzheimer’s disease	Hugues Roy et.al.	2507.17486	null
2025-07-23	Efficient and Robust Semantic Image Communication via Stable Cascade	Bilal Khalid et.al.	2507.17416	null
2025-07-23	An FDM-sFEM scheme on time-space manifolds and its superconvergence analysis	Chengrun Jiang et.al.	2507.17378	null
2025-07-23	PolarAnything: Diffusion-based Polarimetric Image Synthesis	Kailong Zhang et.al.	2507.17268	null
2025-07-22	Generative Diffusion Models for Wireless Networks: Fundamental, Architecture, and State-of-the-Art	Dayu Fan et.al.	2507.16733	null
2025-07-22	HarmonPaint: Harmonized Training-Free Diffusion Inpainting	Ying Li et.al.	2507.16732	null
2025-07-22	Pyramid Hierarchical Masked Diffusion Model for Imaging Synthesis	Xiaojiao Xiao et.al.	2507.16579	null
2025-07-22	Families of Optimal Transport Kernels for Cell Complexes	Rahul Khorana et.al.	2507.16569	null
2025-07-22	Robust Noisy Pseudo-label Learning for Semi-supervised Medical Image Segmentation Using Diffusion Model	Lin Xi et.al.	2507.16429	null
2025-07-22	Entropic approximations of the semigeostrophic shallow water equations	Jean-David Benamou et.al.	2507.16415	null
2025-07-22	Knowledge-aware Diffusion-Enhanced Multimedia Recommendation	Xian Mo et.al.	2507.16396	null
2025-07-22	Navigating Large-Pose Challenge for High-Fidelity Face Reenactment with Video Diffusion Model	Mingtao Guo et.al.	2507.16341	null
2025-07-22	Towards Resilient Safety-driven Unlearning for Diffusion Models against Downstream Fine-tuning	Boheng Li et.al.	2507.16302	null
2025-07-22	Pontryagin Maximum Principle for McKean-Vlasov Reaction-Diffusion Equations	Johan Benedikt Spille et.al.	2507.16288	null
2025-07-22	Spatial filtering of interlayer exciton ground state in WSe2/MoS2 heterobilayer	Disheng Chen et.al.	2507.16180	null
2025-07-22	LSSGen: Leveraging Latent Space Scaling in Flow and Diffusion for Efficient Text to Image Generation	Jyun-Ze Tang et.al.	2507.16154	null
2025-07-22	PUSA V1.0: Surpassing Wan-I2V with $500 Training Cost by Vectorized Timestep Adaptation	Yaofang Liu et.al.	2507.16116	null
2025-07-21	Improving Personalized Image Generation through Social Context Feedback	Parul Gupta et.al.	2507.16095	null
2025-07-21	Moment stability and large deviations for random dynamical systems on non-compact manifolds	Peter H Baxendale et.al.	2507.16092	null
2025-07-21	Diffusion Beats Autoregressive in Data-Constrained Settings	Mihir Prabhudesai et.al.	2507.15857	null
2025-07-21	Diffusion models for multivariate subsurface generation and efficient probabilistic inversion	Roberto Miele et.al.	2507.15809	null
2025-07-21	DiffuMeta: Algebraic Language Models for Inverse Design of Metamaterials via Diffusion Transformers	Li Zheng et.al.	2507.15753	null
2025-07-21	TokensGen: Harnessing Condensed Tokens for Long Video Generation	Wenqi Ouyang et.al.	2507.15728	null
2025-07-21	DiffPF: Differentiable Particle Filtering with Generative Sampling via Conditional Diffusion Models	Ziyu Wan et.al.	2507.15716	null
2025-07-21	SustainDiffusion: Optimising the Social and Environmental Sustainability of Stable Diffusion Models	Giordano d’Aloisio et.al.	2507.15663	null
2025-07-21	SegDT: A Diffusion Transformer-Based Segmentation Model for Medical Imaging	Salah Eddine Bekhouche et.al.	2507.15595	null
2025-07-21	Procedure Learning via Regularized Gromov-Wasserstein Optimal Transport	Syed Ahmed Mahmood et.al.	2507.15540	null
2025-07-21	Ultrafast Spatial Hole Burning Dynamics in Monolayer WS2: Insights from Time-resolved Photoluminescence Spectroscopy	Yichun Pan et.al.	2507.15538	null
2025-07-21	An Adaptive Random Fourier Features approach Applied to Learning Stochastic Differential Equations	Owen Douglas et.al.	2507.15442	null
2025-07-21	Blended Point Cloud Diffusion for Localized Text-guided Shape Editing	Etai Sella et.al.	2507.15399	null
2025-07-21	Latent Space Synergy: Text-Guided Data Augmentation for Direct Diffusion Biomedical Segmentation	Muhammad Aqeel et.al.	2507.15361	null
2025-07-21	RAD: Retrieval High-quality Demonstrations to Enhance Decision-making	Lu Guo et.al.	2507.15356	null
2025-07-21	RoadFusion: Latent Diffusion Model for Pavement Defect Detection	Muhammad Aqeel et.al.	2507.15346	null
2025-07-22	Exponential Runge-Kutta Galerkin finite element method for a reaction-diffusion system with nonsmooth initial data	Runjie Zhang et.al.	2507.15345	null
2025-07-18	Well posedness and propagation of chaos for multi-agent models with strategies and diffusive effects	Alessandro Baldi et.al.	2507.14058	null
2025-07-18	CSD-VAR: Content-Style Decomposition in Visual Autoregressive Models	Quang-Binh Nguyen et.al.	2507.13984	null
2025-07-18	Generalist Forecasting with Frozen Video Models via Latent Diffusion	Jacob C Walker et.al.	2507.13942	null
2025-07-18	DynFaceRestore: Balancing Fidelity and Quality in Diffusion-Guided Blind Face Restoration with Dynamic Blur-Level Mapping and Guidance	Huu-Phu Do et.al.	2507.13797	null
2025-07-18	Learning Spectral Diffusion Prior for Hyperspectral Image Reconstruction	Mingyang Yu et.al.	2507.13769	null
2025-07-18	Malliavin Calculus and Stochastic Differential Equations	Shizan Fang et.al.	2507.13747	null
2025-07-18	Can Synthetic Images Conquer Forgetting? Beyond Unexplored Doubts in Few-Shot Class-Incremental Learning	Junsu Kim et.al.	2507.13739	null
2025-07-18	PoemTale Diffusion: Minimising Information Loss in Poem to Image Generation with Multi-Stage Prompt Refinement	Sofia Jamil et.al.	2507.13708	null
2025-07-18	Efficient Burst Super-Resolution with One-step Diffusion	Kento Kawai et.al.	2507.13607	null
2025-07-18	Learning Deblurring Texture Prior from Unpaired Data with Diffusion Model	Chengxu Liu et.al.	2507.13599	null
2025-07-18	GIFT: Gradient-aware Immunization of diffusion models against malicious Fine-Tuning with safe concepts retention	Amro Abdalla et.al.	2507.13598	null
2025-07-17	LoRA-Loop: Closing the Synthetic Replay Cycle for Continual VLM Learning	Kaihong Wang et.al.	2507.13568	null
2025-07-17	Who With Whom? Learning Optimal Matching Policies	Yagan Hazard et.al.	2507.13567	null
2025-07-17	Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models	Yudong Jin et.al.	2507.13344	null
2025-07-17	FashionPose: Text to Pose to Relight Image Generation for Personalized Fashion Visualization	Chuancheng Shi et.al.	2507.13311	null
2025-07-17	DiffClean: Diffusion-based Makeup Removal for Accurate Age Estimation	Ekta Balkrishna Gavas et.al.	2507.13292	null
2025-07-17	BSDE Approach for $α$ -Potential Stochastic Differential Games	Xin Guo et.al.	2507.13256	null
2025-07-17	GradNetOT: Learning Optimal Transport Maps with GradNets	Shreyas Chaudhari et.al.	2507.13191	null
2025-07-17	McKean-Vlasov equations with rough common noise	Peter K. Friz et.al.	2507.13149	null
2025-07-17	fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting	Alicia Durrer et.al.	2507.13146	null
2025-07-17	Unsupervised Ground Metric Learning	Janis Auffenberg et.al.	2507.13094	null
2025-07-17	DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model	Han Zhang et.al.	2507.13087	null
2025-07-17	Label-Consistent Dataset Distillation with Detector-Guided Refinement	Yawen Zou et.al.	2507.13074	null
2025-07-17	Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities	Liuyi Wang et.al.	2507.13019	null
2025-07-17	From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation	Jinseo An et.al.	2507.12985	null
2025-07-17	Non-differentiable Reward Optimization for Diffusion-based Autonomous Motion Planning	Giwon Lee et.al.	2507.12977	null
2025-07-17	RGB Pre-Training Enhanced Unobservable Feature Latent Diffusion Model for Spectral Reconstruction	Keli Deng et.al.	2507.12967	null
2025-07-17	DMQ: Dissecting Outliers of Diffusion Models for Post-Training Quantization	Dongyeun Lee et.al.	2507.12933	null
2025-07-16	Unsupervised Monocular 3D Keypoint Discovery from Multi-View Diffusion Priors	Subin Jeon et.al.	2507.12336	null
2025-07-17	Compositional Discrete Latent Code for High Fidelity, Productive Diffusion Models	Samuel Lavoie et.al.	2507.12318	null
2025-07-16	FADE: Adversarial Concept Erasure in Flow Models	Zixuan Fu et.al.	2507.12283	null
2025-07-16	Designing Algorithms for Entropic Optimal Transport from an Optimisation Perspective	Vishwak Srinivasan et.al.	2507.12246	null
2025-07-16	Generate to Ground: Multimodal Text Conditioning Boosts Phrase Grounding in Medical Vision-Language Models	Felix Nützel et.al.	2507.12236	null
2025-07-16	Classification of entire and ancient solutions of the diffusive Hamilton-Jacobi equation	Loth Damagui Chabi et.al.	2507.12214	null
2025-07-16	RODS: Robust Optimization Inspired Diffusion Sampling for Detecting and Reducing Hallucination in Generative Models	Yiqi Tian et.al.	2507.12201	null
2025-07-16	RadioDiff-3D: A 3D $\times$ 3D Radio Map Dataset and Generative Diffusion Based Benchmark for 6G Environment-Aware Communication	Xiucheng Wang et.al.	2507.12166	null
2025-07-16	SmokeSVD: Smoke Reconstruction from A Single View via Progressive Novel View Synthesis and Refinement with Diffusion Models	Chen Li et.al.	2507.12156	null
2025-07-16	RiemannLoRA: A Unified Riemannian Framework for Ambiguity-Free LoRA Optimization	Vladimir Bogachev et.al.	2507.12142	null
2025-07-16	LidarPainter: One-Step Away From Any Lidar View To Novel Guidance	Yuzhou Ji et.al.	2507.12114	null
2025-07-16	Robust Planning for Autonomous Vehicles with Diffusion-Based Failure Samplers	Juanran Wang et.al.	2507.11991	null
2025-07-16	ID-EA: Identity-driven Text Enhancement and Adaptation with Textual Inversion for Personalized Text-to-Image Generation	Hyun-Jun Jin et.al.	2507.11990	null
2025-07-16	EC-Diff: Fast and High-Quality Edge-Cloud Collaborative Inference for Diffusion Models	Jiajian Xie et.al.	2507.11980	null
2025-07-16	A Review of Generative AI in Aquaculture: Foundations, Applications, and Future Directions for Smart and Sustainable Farming	Waseem Akram et.al.	2507.11974	null
2025-07-15	CATVis: Context-Aware Thought Visualization	Tariq Mehmood et.al.	2507.11522	null
2025-07-15	HUG-VAS: A Hierarchical NURBS-Based Generative Model for Aortic Geometry Synthesis and Controllable Editing	Pan Du et.al.	2507.11474	null
2025-07-15	Implementing Adaptations for Vision AutoRegressive Model	Kaif Shaikh et.al.	2507.11441	null
2025-07-15	Markov approximation for controlled Hawkes Jump-Diffusions with general kernels	Mahmoud Khabou et.al.	2507.11294	null
2025-07-15	Ocean Diviner: A Diffusion-Augmented Reinforcement Learning for AUV Robust Control in the Underwater Tasks	Weiyi Liu et.al.	2507.11283	null
2025-07-15	Latent Space Consistency for Sparse-View CT Reconstruction	Duoyou Chen et.al.	2507.11152	null
2025-07-15	Human-Guided Shade Artifact Suppression in CBCT-to-MDCT Translation via Schrödinger Bridge with Conditional Diffusion	Sung Ho Kang et.al.	2507.11025	null
2025-07-15	Robust ID-Specific Face Restoration via Alignment Learning	Yushun Fang et.al.	2507.10943	null
2025-07-15	Quantum algorithm for solving McKean-Vlasov stochastic differential equations	Koichi Miyamoto et.al.	2507.10926	null
2025-07-14	Visually grounded emotion regulation via diffusion models and user-driven reappraisal	Edoardo Pinzuti et.al.	2507.10861	null
2025-07-14	Offline Reinforcement Learning with Wasserstein Regularization via Optimal Transport Maps	Motoki Omura et.al.	2507.10843	null
2025-07-14	MP1: Mean Flow Tames Policy Learning in 1-step for Robotic Manipulation	Juyi Sheng et.al.	2507.10543	null
2025-07-16	Accurate generation of chemical reaction transition states by conditional flow matching	Ping Tuo et.al.	2507.10530	null
2025-07-14	Solving the compute crisis with physics-based ASICs	Maxwell Aifer et.al.	2507.10463	null
2025-07-14	Non-exchangeable Conformal Prediction with Optimal Transport: Tackling Distribution Shifts with Unlabeled Data	Alvaro H. C. Correia et.al.	2507.10425	null
2025-07-14	Parallel Sampling of Diffusion Models on $SO(3)$	Yan-Ting Chen et.al.	2507.10347	null
2025-07-15	Text Embedding Knows How to Quantize Text-Guided Diffusion Models	Hongjae Lee et.al.	2507.10340	null
2025-07-14	Lévy Langevin Monte Carlo for sampling from heavy-tailed target distributions	Anita Behme et.al.	2507.10320	null
2025-07-14	Mind the Gap: Aligning Vision Foundation Models to Image Feature Matching	Yuhan Liu et.al.	2507.10318	null
2025-07-14	Cross-Timeslot Optimization for Distributed GPU Inference Using Reinforcement Learning	Chengze Du et.al.	2507.10259	null
2025-07-14	Synthesizing Near-Boundary OOD Samples for Out-of-Distribution Detection	Jinglun Li et.al.	2507.10225	null
2025-07-14	From Wardrobe to Canvas: Wardrobe Polyptych LoRA for Part-level Controllable Human Image Generation	Jeongho Kim et.al.	2507.10217	null
2025-07-14	FIX-CLIP: Dual-Branch Hierarchical Contrastive Learning via Synthetic Captions for Better Understanding of Long Text	Bingchao Wang et.al.	2507.10095	null
2025-07-14	Frequency Regulation for Exposure Bias Mitigation in Diffusion Models	Meng Yu et.al.	2507.10072	null
2025-07-14	An Accurate Discretized Approach to Parameter Estimation in the CKLS Model via the CIR Framework	Sourojyoti Barick et.al.	2507.10041	null
2025-07-14	Memory-Efficient Personalization of Text-to-Image Diffusion Models via Selective Optimization Strategies	Seokeon Choi et.al.	2507.10029	null
2025-07-11	From One to More: Contextual Part Latents for 3D Generation	Shaocong Dong et.al.	2507.08772	null
2025-07-11	To Trade or Not to Trade: An Agentic Approach to Estimating Market Risk Improves Trading Decisions	Dimitrios Emmanoulopoulos et.al.	2507.08584	null
2025-07-11	Anisotropic Diffusion of $e^\pm$ in Pulsar Halos over Multiple Coherence of Magnetic Field	Kai Yan et.al.	2507.08526	null
2025-07-11	Upsample What Matters: Region-Adaptive Latent Sampling for Accelerated Diffusion Transformers	Wongi Jeong et.al.	2507.08422	null
2025-07-11	Subject-Consistent and Pose-Diverse Text-to-Image Generation	Zhanxin Gao et.al.	2507.08396	null
2025-07-11	Inference-Time Scaling of Diffusion Language Models with Particle Gibbs Sampling	Meihua Dang et.al.	2507.08390	null
2025-07-11	From Enhancement to Understanding: Build a Generalized Bridge for Low-light Vision via Semantically Consistent Unsupervised Fine-tuning	Sen Wang et.al.	2507.08380	null
2025-07-14	Token-based Audio Inpainting via Discrete Diffusion	Tali Dror et.al.	2507.08333	null
2025-07-10	Optimal transport, determinantal point processes and the Bergman kernel	William Driot et.al.	2507.08204	null
2025-07-10	Cracking Instance Jigsaw Puzzles: An Alternative to Multiple Instance Learning for Whole Slide Image Analysis	Xiwen Chen et.al.	2507.08178	null
2025-07-10	Adaptive Diffusion Denoised Smoothing : Certified Robustness via Randomized Smoothing with Differentially Private Guided Denoising Diffusion	Frederick Shpilevskiy et.al.	2507.08163	null
2025-07-10	RegGS: Unposed Sparse Views Gaussian Splatting with 3DGS Registration	Chong Cheng et.al.	2507.08136	null
2025-07-10	Predicting Flow Dynamics using Diffusion Models	Yannick Gachnang et.al.	2507.08106	null
2025-07-10	Fluctuations in Hill’s equation parameters and application to cosmic reheating	Leia Barrowes et.al.	2507.08075	null
2025-07-10	Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling	Haoyu Wu et.al.	2507.07982	null
2025-07-10	Low Resource Reconstruction Attacks Through Benign Prompts	Sol Yarkoni et.al.	2507.07947	null
2025-07-10	Convergence rates for regularized unbalanced optimal transport: the discrete case	Luca Nenna et.al.	2507.07917	null
2025-07-11	Single-Step Latent Diffusion for Underwater Image Restoration	Jiayi Wu et.al.	2507.07878	null
2025-07-10	Re-Bottleneck: Latent Re-Structuring for Neural Audio Autoencoders	Dimitrios Bralios et.al.	2507.07867	null
2025-07-10	Benchmarking Content-Based Puzzle Solvers on Corrupted Jigsaw Puzzles	Richard Dirauf et.al.	2507.07828	null
2025-07-10	Phase-Space Synchronization Driven by Moon-Magnetosphere Coupling in Gas Giants	Adnane Osmane et.al.	2507.07739	null
2025-07-10	Capture Stage Environments: A Guide to Better Matting	Hannah Dröge et.al.	2507.07623	null
2025-07-10	Stable-Hair v2: Real-World Hair Transfer via Multiple-View Diffusion Model	Kuiyuan Sun et.al.	2507.07591	null
2025-07-10	Functional Time Series Forecasting of Distributions: A Koopman-Wasserstein Approach	Ziyue Wang et.al.	2507.07570	null
2025-07-10	Learnable Retrieval Enhanced Visual-Text Alignment and Fusion for Radiology Report Generation	Qin Zhou et.al.	2507.07568	null
2025-07-10	Divergence Minimization Preference Optimization for Diffusion Model Alignment	Binxu Li et.al.	2507.07510	null
2025-07-10	Degradation-Agnostic Statistical Facial Feature Transformation for Blind Face Restoration in Adverse Weather Conditions	Chang-Hwan Son et.al.	2507.07464	null
2025-07-10	EscherNet++: Simultaneous Amodal Completion and Scalable View Synthesis through Masked Fine-Tuning and Enhanced Feed-Forward 3D Reconstruction	Xinan Zhang et.al.	2507.07410	null
2025-07-10	The LDP of McKean-Vlasov stochastic differential equations with Hölder continuous conditions and integrable conditions	Hao Wu et.al.	2507.07368	null
2025-07-09	Towards Multimodal Understanding via Stable Diffusion as a Task-Aware Feature Extractor	Vatsal Agarwal et.al.	2507.07106	null
2025-07-09	Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models	Tiezheng Zhang et.al.	2507.07104	null
2025-07-09	Exact Evaluation of the Accuracy of Diffusion Models for Inverse Problems with Gaussian Data Distributions	Emile Pierret et.al.	2507.07008	null
2025-07-09	General large deviations and functional iterated logarithm law for multivalued McKean-Vlasov stochastic differential equations	Lingyan Cheng et.al.	2507.07001	null
2025-07-09	DiffSpectra: Molecular Structure Elucidation from Spectra using Diffusion Models	Liang Wang et.al.	2507.06853	null
2025-07-10	HeLo: Heterogeneous Multi-Modal Fusion with Label Correlation for Emotion Distribution Learning	Chuhang Zheng et.al.	2507.06821	null
2025-07-09	Democratizing High-Fidelity Co-Speech Gesture Video Generation	Xu Yang et.al.	2507.06812	null
2025-07-09	MADPOT: Medical Anomaly Detection with CLIP Adaptation and Partial Optimal Transport	Mahshid Shiri et.al.	2507.06733	null
2025-07-09	Enhancing Diffusion Model Stability for Image Restoration via Gradient Management	Hongjie Wu et.al.	2507.06656	null
2025-07-09	Diff $^2$ I2P: Differentiable Image-to-Point Cloud Registration with Diffusion Prior	Juncheng Mu et.al.	2507.06651	null
2025-07-09	Denoising Multi-Beta VAE: Representation Learning for Disentanglement and Generation	Anshuk Uppal et.al.	2507.06613	null
2025-07-09	MOST: Motion Diffusion Model for Rare Text via Temporal Clip Banzhaf Interaction	Yin Wang et.al.	2507.06590	null
2025-07-09	Concept-TRAK: Understanding how diffusion models learn concepts through concept-level attribution	Yonghyun Park et.al.	2507.06547	null
2025-07-10	Concept Unlearning by Modeling Key Steps of Diffusion Process	Chaoshuo Zhang et.al.	2507.06526	null
2025-07-09	FedDifRC: Unlocking the Potential of Text-to-Image Diffusion Models in Heterogeneous Federated Learning	Huan Wang et.al.	2507.06482	null
2025-07-08	Modern Methods in Associative Memory	Dmitry Krotov et.al.	2507.06211	null
2025-07-08	CultureCLIP: Empowering CLIP with Cultural Awareness through Synthetic Images and Contextualized Captions	Yuchen Huang et.al.	2507.06210	null
2025-07-08	A Survey on Latent Reasoning	Rui-Jie Zhu et.al.	2507.06203	null
2025-07-08	Normalizing Diffusion Kernels with Optimal Transport	Nathan Kessler et.al.	2507.06161	null
2025-07-08	Prompt-Free Conditional Diffusion for Multi-object Image Augmentation	Haoyu Wang et.al.	2507.06146	null
2025-07-08	Bridging Sequential Deep Operator Network and Video Diffusion: Residual Refinement of Spatio-Temporal PDE Solutions	Jaewan Park et.al.	2507.06133	null
2025-07-08	Unconditional Diffusion for Generative Sequential Recommendation	Yimeng Bai et.al.	2507.06121	null
2025-07-08	Nonparametric Estimation in SDE Models Involving an Explanatory Process	Fabienne Comte et.al.	2507.06098	null
2025-07-08	ScoreAdv: Score-based Targeted Generation of Natural Adversarial Examples via Diffusion Models	Chihan Huang et.al.	2507.06078	null
2025-07-08	TextPixs: Glyph-Conditioned Diffusion with Character-Aware Attention and OCR-Guided Supervision	Syeda Anshrah Gillani et.al.	2507.06033	null
2025-07-08	T-LoRA: Single Image Diffusion Model Customization Without Overfitting	Vera Soboleva et.al.	2507.05964	null
2025-07-08	Rough SDEs and Robust Filtering for Jump-Diffusions	Andrew L. Allan et.al.	2507.05930	null
2025-07-08	Diffusion Dataset Condensation: Training Your Diffusion Model Faster with Less Data	Rui Huang et.al.	2507.05914	null
2025-07-08	A new approach to the study of the manifold of fixed rank covariance matrices	Leonardo Marconi et.al.	2507.05873	null
2025-07-08	USIGAN: Unbalanced Self-Information Feature Transport for Weakly Paired Image IHC Virtual Staining	Yue Peng et.al.	2507.05843	null
2025-07-07	EmbodieDreamer: Advancing Real2Sim2Real Transfer for Policy Training via Embodied World Modeling	Boyuan Wang et.al.	2507.05198	null
2025-07-07	SV-DRR: High-Fidelity Novel View X-Ray Synthesis Using Diffusion Model	Chun Xie et.al.	2507.05148	null
2025-07-07	VERITAS: Verification and Explanation of Realness in Images for Transparency in AI Systems	Aadi Srivastava et.al.	2507.05146	null
2025-07-07	Optimal Consumption-Investment for General Utility with a Drawdown Constraint over a Finite-Time Horizon	Chonghu Guan et.al.	2507.05115	null
2025-07-07	MoDiT: Learning Highly Consistent 3D Motion Coefficients with Diffusion Transformer for Talking Head Generation	Yucheng Wang et.al.	2507.05092	null
2025-07-07	AI-Driven Cytomorphology Image Synthesis for Medical Diagnostics	Jan Carreras Boada et.al.	2507.05063	null
2025-07-08	A COMPASS to Model Comparison and Simulation-Based Inference in Galactic Chemical Evolution	Berkay Gunes et.al.	2507.05060	null
2025-07-07	On a parabolic curvature lower bound generalizing Ricci flows	Marco Flaim et.al.	2507.05032	null
2025-07-07	A Generative Diffusion Model for Amorphous Materials	Kai Yang et.al.	2507.05024	null
2025-07-07	Robust Incomplete-Modality Alignment for Ophthalmic Disease Grading and Diagnosis via Labeled Optimal Transport	Qinkai Yu et.al.	2507.04999	null
2025-07-07	TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation	Zonglin Lyu et.al.	2507.04984	null
2025-07-07	A diffusion model for light scattering in ejecta	J. A. Don Jayamanne et.al.	2507.04972	null
2025-07-07	LAPS-Diff: A Diffusion-Based Framework for Singing Voice Synthesis With Language Aware Prosody-Style Guided Learning	Sandipan Dhar et.al.	2507.04966	null
2025-07-07	DC-AR: Efficient Masked Autoregressive Image Generation with Deep Compression Hybrid Tokenizer	Yecheng Wu et.al.	2507.04947	null
2025-07-07	Taming the Tri-Space Tension: ARC-Guided Hallucination Modeling and Control for Text-to-Image Generation	Jianjiang Yang et.al.	2507.04946	null
2025-07-03	Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching	Xin Zhou et.al.	2507.02860	null
2025-07-03	AnyI2V: Animating Any Conditional Image with Motion Control	Ziye Li et.al.	2507.02857	null
2025-07-03	USAD: An Unsupervised Data Augmentation Spatio-Temporal Attention Diffusion Network	Ying Yu et.al.	2507.02827	null
2025-07-03	LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion	Fangfu Liu et.al.	2507.02813	null
2025-07-03	RichControl: Structure- and Appearance-Rich Training-Free Spatial Control for Text-to-Image Generation	Liheng Zhang et.al.	2507.02792	null
2025-07-03	FairHuman: Boosting Hand and Face Quality in Human Image Generation with Minimum Potential Delay Fairness in Diffusion Models	Yuxuan Wang et.al.	2507.02714	null
2025-07-04	UniMC: Taming Diffusion Transformer for Unified Keypoint-Guided Multi-Class Image Generation	Qin Guo et.al.	2507.02713	null
2025-07-03	APT: Adaptive Personalized Training for Diffusion Models with Limited Data	JungWoo Chae et.al.	2507.02687	null
2025-07-03	Learning few-step posterior samplers by unfolding and distillation of diffusion models	Charlesquin Kemajou Mbakam et.al.	2507.02686	null
2025-07-03	Guided Generation for Developable Antibodies	Siqi Zhao et.al.	2507.02670	null
2025-07-03	Lost in Latent Space: An Empirical Study of Latent Diffusion Models for Physics Emulation	François Rozet et.al.	2507.02608	null
2025-07-03	AC-Refiner: Efficient Arithmetic Circuit Optimization Using Conditional Diffusion Models	Chenhao Xue et.al.	2507.02598	null
2025-07-03	Structure-aware Semantic Discrepancy and Consistency for 3D Medical Image Self-supervised Learning	Tan Pan et.al.	2507.02581	null
2025-07-03	Reconstructing Close Human Interaction with Appearance and Proxemics Reasoning	Buzhen Huang et.al.	2507.02565	null
2025-07-03	Random dynamical systems for McKean–Vlasov SDEs via rough path theory	Benjamin Gess et.al.	2507.02449	null
2025-07-02	FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model	Yukang Cao et.al.	2507.01953	null
2025-07-02	Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning	Qingdong He et.al.	2507.01908	null
2025-07-02	Frontiers of Generative AI for Network Optimization: Theories, Limits, and Visions	Bo Yang et.al.	2507.01773	null
2025-07-02	Mind the jumps: when 2BSDEs meet semi-martingales	Dylan Possamaï et.al.	2507.01767	null
2025-07-02	Entropic optimal transport beyond product reference couplings: the Gaussian case on Euclidean space	Paul Freulon et.al.	2507.01709	null
2025-07-02	Vision-Aided ISAC in Low-Altitude Economy Networks via De-Diffused Visual Priors	Yulan Gao et.al.	2507.01574	null
2025-07-02	Loss Functions in Diffusion Models: A Comparative Study	Dibyanshu Kumar et.al.	2507.01516	null
2025-07-02	ReFlex: Text-Guided Editing of Real Images in Rectified Flow via Mid-Step Feature Extraction and Attention Adaptation	Jimyeong Kim et.al.	2507.01496	null
2025-07-02	Representation Entanglement for Generation:Training Diffusion Transformers Is Much Easier Than You Think	Ge Wu et.al.	2507.01467	null
2025-07-02	QC-OT: Optimal Transport with Quasiconformal Mapping	Yuping Lv et.al.	2507.01456	null
2025-07-02	DiffMark: Diffusion-based Robust Watermark Against Deepfakes	Chen Sun et.al.	2507.01428	null
2025-07-02	DocShaDiffusion: Diffusion Model in Latent Space for Document Image Shadow Removal	Wenjie Liu et.al.	2507.01422	null
2025-07-02	Dynamic Programming Principle for Stochastic Control Problems on Riemannian Manifolds	Dingqian Gao et.al.	2507.01407	null
2025-07-03	Distributional Soft Actor-Critic with Diffusion Policy	Tong Liu et.al.	2507.01381	null
2025-07-02	Efficient Kilometer-Scale Precipitation Downscaling with Conditional Wavelet Diffusion	Chugang Yi et.al.	2507.01354	null
2025-06-30	Epona: Autoregressive Diffusion World Model for Autonomous Driving	Kaiwen Zhang et.al.	2506.24113	null
2025-06-30	Navigating with Annealing Guidance Scale in Diffusion Space	Shai Yehezkel et.al.	2506.24108	null
2025-06-30	Imagine for Me: Creative Conceptual Blending of Real Images and Text via Blended Attention	Wonwoong Cho et.al.	2506.24085	null
2025-06-30	Continual Adaptation: Environment-Conditional Parameter Generation for Object Detection in Dynamic Scenarios	Deng Li et.al.	2506.24063	null
2025-06-30	Faster Diffusion Models via Higher-Order Approximation	Gen Li et.al.	2506.24042	null
2025-06-30	Supervised Diffusion-Model-Based PET Image Reconstruction	George Webber et.al.	2506.24034	null
2025-06-30	Minimally dissipative multi-bit logical operations	Jérémie Klinger et.al.	2506.24021	null
2025-06-30	Full history recursive multilevel Picard approximations suffer from the curse of dimensionality for the Hamilton-Jacobi-Bellman equation of a stochastic control problem	Martin Hutzenthaler et.al.	2506.23969	null
2025-06-30	VMoBA: Mixture-of-Block Attention for Video Diffusion Models	Jianzong Wu et.al.	2506.23858	null
2025-06-30	Random Distributionally Robust Optimization under Phi-divergence	Guohui Guan et.al.	2506.23839	null
2025-06-30	Interpretable Zero-Shot Learning with Locally-Aligned Vision-Language Model	Shiming Chen et.al.	2506.23822	null
2025-06-30	Controllable Reference-Based Real-World Remote Sensing Image Super-Resolution with Generative Diffusion Priors	Ce Wang et.al.	2506.23801	null
2025-06-30	Radioactive Watermarks in Diffusion and Autoregressive Image Generative Models	Michel Meintz et.al.	2506.23731	null
2025-06-30	Proteus-ID: ID-Consistent and Motion-Coherent Video Customization	Guiyu Zhang et.al.	2506.23729	null
2025-06-30	MDPG: Multi-domain Diffusion Prior Guidance for MRI Reconstruction	Lingtong Zhang et.al.	2506.23701	null
2025-06-27	Shape-for-Motion: Precise and Consistent Video Editing with 3D Proxy	Yuhao Liu et.al.	2506.22432	null
2025-06-27	DiffSoundStream: Efficient Speech Tokenization via Diffusion Decoding	Yang Yang et.al.	2506.22362	null
2025-06-27	OutDreamer: Video Outpainting with a Diffusion Transformer	Linhao Zhong et.al.	2506.22298	null
2025-06-27	Hybrid Generative Modeling for Incomplete Physics: Deep Grey-Box Meets Optimal Transport	Gurjeet Sangra Singh et.al.	2506.22204	null
2025-06-27	A Reinforcement Learning Framework for Some Singular Stochastic Control Problems	Zongxia Liang et.al.	2506.22203	null
2025-06-27	Pinsker’s inequality for adapted total variation	Mathias Beiglböck et.al.	2506.22106	null
2025-06-27	Noise-Inspired Diffusion Model for Generalizable Low-Dose CT Reconstruction	Qi Gao et.al.	2506.22012	null
2025-06-27	RoboEnvision: A Long-Horizon Video Generation Model for Multi-Task Robot Manipulation	Liudi Yang et.al.	2506.22007	null
2025-06-27	StableCodec: Taming One-Step Diffusion for Extreme Image Compression	Tianyu Zhang et.al.	2506.21977	null
2025-06-27	Joint Task Offloading and Resource Allocation in Low-Altitude MEC via Graph Attention Diffusion	Yifan Xue et.al.	2506.21933	null
2025-06-27	TOAST: Task-Oriented Adaptive Semantic Transmission over Dynamic Wireless Environments	Sheng Yun et.al.	2506.21900	null
2025-06-26	TADA: Improved Diffusion Sampling with Training-free Augmented Dynamics	Tianrong Chen et.al.	2506.21757	null
2025-06-26	Inverse Design of Diffractive Metasurfaces Using Diffusion Models	Liav Hen et.al.	2506.21748	null
2025-06-26	Multi-to one-dimensional screening and semi-discrete optimal transport	Omar Abdul Halim et.al.	2506.21740	null
2025-06-26	Elucidating and Endowing the Diffusion Training Paradigm for General Image Restoration	Xin Lu et.al.	2506.21722	null
2025-06-26	SmoothSinger: A Conditional Diffusion Model for Singing Voice Synthesis with Multi-Resolution Architecture	Kehan Sui et.al.	2506.21478	null
2025-06-26	Rethinking Oversaturation in Classifier-Free Guidance via Low Frequency	Kaiyu Song et.al.	2506.21452	null
2025-06-26	Controllable 3D Placement of Objects with Scene-Aware Diffusion Models	Mohamed Omran et.al.	2506.21446	null
2025-06-26	FastRef:Fast Prototype Refinement for Few-Shot Industrial Anomaly Detection	Long Tian et.al.	2506.21398	null
2025-06-26	HieraSurg: Hierarchy-Aware Diffusion Model for Surgical Video Generation	Diego Biagini et.al.	2506.21287	null
2025-06-27	FairyGen: Storied Cartoon Video from a Single Child-Drawn Character	Jiayi Zheng et.al.	2506.21272	null
2025-06-26	Block Coordinate Descent Network Simplex for Optimal Transport	Lingrui Li et.al.	2506.21231	null
2025-06-27	Alternating Spintronics: Capacitive Behavior of Spin Valves and Resonator Applications	Yunwen Liu et.al.	2506.21176	null
2025-06-26	Compressed and Smooth Latent Space for Text Diffusion Modeling	Viacheslav Meshchaninov et.al.	2506.21170	null
2025-06-26	Geometry and Perception Guided Gaussians for Multiview-consistent 3D Generation from a Single Image	Pufan Li et.al.	2506.21152	null
2025-06-26	Learning to See in the Extremely Dark	Hai Jiang et.al.	2506.21132	null
2025-06-26	Unlasting: Unpaired Single-Cell Multi-Perturbation Estimation by Dual Conditional Diffusion Implicit Bridges	Changxi Chi et.al.	2506.21107	null
2025-06-26	Improving Diffusion-Based Image Editing Faithfulness via Guidance and Scheduling	Hansam Cho et.al.	2506.21045	null
2025-06-26	Boosting Domain Generalized and Adaptive Detection with Diffusion Models: Fitness, Generalization, and Transferability	Boyong He et.al.	2506.21042	null
2025-06-27	DidSee: Diffusion-Based Depth Completion for Material-Agnostic Robotic Perception and Manipulation	Wenzhou Lyu et.al.	2506.21034	null
2025-06-25	EditP23: 3D Editing via Propagation of Image Prompts to Multi-View	Roi Bar-On et.al.	2506.20652	null
2025-06-25	Telegrapher’s Generative Model via Kac Flows	Richard Duong et.al.	2506.20641	null
2025-06-26	DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation	Shansan Gong et.al.	2506.20639	null
2025-06-25	MC for Agriculture: A Framework for Nature-inspired Sustainable Pest Control	Fardad Vakilipoor et.al.	2506.20637	null
2025-06-25	Shape2Animal: Creative Animal Generation from Natural Silhouettes	Quoc-Duy Tran et.al.	2506.20616	null
2025-06-25	Pay Less Attention to Deceptive Artifacts: Robust Detection of Compressed Deepfakes on Online Social Networks	Manyi Li et.al.	2506.20548	null
2025-06-25	HiWave: Training-Free High-Resolution Image Generation via Wavelet-Based Diffusion Sampling	Tobias Vontobel et.al.	2506.20452	null
2025-06-25	TDiR: Transformer based Diffusion for Image Restoration Tasks	Abbas Anwar et.al.	2506.20302	null
2025-06-25	Ctrl-Z Sampling: Diffusion Sampling with Controlled Random Zigzag Explorations	Shunqi Mao et.al.	2506.20294	null
2025-06-25	Recognizing Surgical Phases Anywhere: Few-Shot Test-time Adaptation and Task-graph Guided Refinement	Kun Yuan et.al.	2506.20254	null
2025-06-25	Towards Efficient Exemplar Based Image Editing with Multimodal VLMs	Avadhoot Jadhav et.al.	2506.20155	null
2025-06-24	Robust Robotic Exploration and Mapping Using Generative Occupancy Map Synthesis	Lorin Achey et.al.	2506.20049	null
2025-06-24	Elucidated Rolling Diffusion Models for Probabilistic Weather Forecasting	Salva Rühling Cachay et.al.	2506.20024	null
2025-06-24	Approximating the order 2 quantum Wasserstein distance using the moment-SOS hierarchy	Saroj Prasad Chhatoi et.al.	2506.20006	null
2025-06-24	Any-Order GPT as Masked Diffusion Model: Decoupling Formulation and Architecture	Shuchen Xue et.al.	2506.19935	null
2025-06-24	Radial Attention: $O(n\log n)$ Sparse Attention with Energy Decay for Long Video Generation	Xingyang Li et.al.	2506.19852	null
2025-06-24	AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models	Zehuan Huang et.al.	2506.19851	null
2025-06-24	GenHSI: Controllable Generation of Human-Scene Interaction Videos	Zekun Li et.al.	2506.19840	null
2025-06-24	Improving Progressive Generation with Decomposable Flow Matching	Moayed Haji-Ali et.al.	2506.19839	null
2025-06-24	SimpleGVR: A Simple Baseline for Latent-Cascaded Video Super-Resolution	Liangbin Xie et.al.	2506.19838	null
2025-06-24	Machine Learning with Privacy for Protected Attributes	Saeed Mahloujifar et.al.	2506.19836	null
2025-06-24	ProxelGen: Generating Proteins as 3D Densities	Felix Faltings et.al.	2506.19820	null
2025-06-24	CoCo4D: Comprehensive and Complex 4D Scene Generation	Junwei Zhou et.al.	2506.19798	null
2025-06-24	Alleviating User-Sensitive bias with Fair Generative Sequential Recommendation Model	Yang Liu et.al.	2506.19777	null
2025-06-24	Noise Consistency Training: A Native Approach for One-Step Generator in Learning Additional Controls	Yihong Luo et.al.	2506.19741	null
2025-06-24	Guidance in the Frequency Domain Enables High-Fidelity Sampling at Low CFG Scales	Seyedmorteza Sadat et.al.	2506.19713	null
2025-06-24	SceneCrafter: Controllable Multi-View Driving Scene Editing	Zehao Zhu et.al.	2506.19488	null
2025-06-24	Stylized Structural Patterns for Improved Neural Network Pre-training	Farnood Salehi et.al.	2506.19465	null
2025-06-24	Angio-Diff: Learning a Self-Supervised Adversarial Diffusion Model for Angiographic Geometry Generation	Zhifeng Wang et.al.	2506.19455	null
2025-06-24	Generate the Forest before the Trees – A Hierarchical Diffusion model for Climate Downscaling	Declan J. Curran et.al.	2506.19391	null
2025-06-23	Audit & Repair: An Agentic Framework for Consistent Story Visualization in Text-to-Image Diffusion Models	Kiymet Akdemir et.al.	2506.18900	null
2025-06-23	MinD: Unified Visual Imagination and Control via Hierarchical World Models	Xiaowei Chi et.al.	2506.18897	null
2025-06-23	A comparison principle for variational problems : with an application to optimal transport	Flavien Léger et.al.	2506.18884	null
2025-06-23	Let Your Video Listen to Your Music!	Xinyu Zhang et.al.	2506.18881	null
2025-06-23	ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs	Michal Nazarczuk et.al.	2506.18792	null
2025-06-23	TCDiff++: An End-to-end Trajectory-Controllable Diffusion Model for Harmonious Music-Driven Group Choreography	Yuqin Dai et.al.	2506.18671	null
2025-06-23	Simulation-Free Differential Dynamics through Neural Conservation Laws	Mengjian Hua et.al.	2506.18604	null
2025-06-23	Optimization-Induced Dynamics of Lipschitz Continuity in Neural Networks	Róisín Luo et.al.	2506.18588	null
2025-06-23	Averaging principles for time-inhomogeneous multi-scale SDEs with partially dissipative coefficients	Xiaobin Sun et.al.	2506.18558	null
2025-06-23	GANs vs. Diffusion Models for virtual staining with the HER2match dataset	Pascal Klöckner et.al.	2506.18484	null
2025-06-23	DIP: Unsupervised Dense In-Context Post-training of Visual Representations	Sophia Sirko-Galouchenko et.al.	2506.18463	null
2025-06-23	CPAM: Context-Preserving Adaptive Manipulation for Zero-Shot Real Image Editing	Dinh-Khoi Vo et.al.	2506.18438	null
2025-06-23	How Robust is Model Editing after Fine-Tuning? An Empirical Study on Text-to-Image Diffusion Models	Feng He et.al.	2506.18428	null
2025-06-23	Generative Diffusion Receivers: Achieving Pilot-Efficient MIMO-OFDM Communications	Yuzhi Yang et.al.	2506.18419	null
2025-06-23	Large-Scale Training Data Attribution for Music Generative Models via Unlearning	Woosung Choi et.al.	2506.18312	null
2025-06-23	Emergent Temporal Correspondences from Video Diffusion Transformers	Jisu Nam et.al.	2506.17220	link
2025-06-20	DreamCube: 3D Panorama Generation via Multi-plane Synchronization	Yukun Huang et.al.	2506.17206	null
2025-06-20	Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres	Samuel Howard et.al.	2506.17197	null
2025-06-20	Deep generative models as the probability transformation functions	Vitalii Bondar et.al.	2506.17171	null
2025-06-20	MeDi: Metadata-Guided Diffusion Models for Mitigating Biases in Tumor Classification	David Jacob Drexlin et.al.	2506.17140	null
2025-06-20	Consistent Sampling and Simulation: Molecular Dynamics with Energy-Based Diffusion Models	Michael Plainer et.al.	2506.17139	link
2025-06-20	Assembler: Scalable 3D Part Assembly via Anchor Point Diffusion	Wang Zhao et.al.	2506.17074	null
2025-06-23	Generative Modeling of Full-Atom Protein Conformations using Latent Diffusion on Graph Embeddings	Aditya Sengar et.al.	2506.17064	link
2025-06-20	LSCD: Lomb-Scargle Conditioned Diffusion for Time series Imputation	Elizabeth Fons et.al.	2506.17039	null
2025-06-20	Inference for SDEs driven by Hermite processes	Petr Coupek et.al.	2506.16916	null
2025-06-20	Reward-Agnostic Prompt Optimization for Text-to-Image Diffusion Models	Semin Kim et.al.	2506.16853	link
2025-06-20	Beyond Blur: A Fluid Perspective on Generative Diffusion Models	Grzegorz Gruszczynski et.al.	2506.16827	null
2025-06-20	PQCAD-DM: Progressive Quantization and Calibration-Assisted Distillation for Extremely Efficient Diffusion Model	Beomseok Ko et.al.	2506.16776	null
2025-06-20	Noise-Informed Diffusion-Generated Image Detection with Anomaly Attention	Weinan Guan et.al.	2506.16743	link
2025-06-23	A Prior-Guided Joint Diffusion Model in Projection Domain for PET Tracer Conversion	Fang Chen et.al.	2506.16733	null
2025-06-18	Nabla-R2D3: Effective and Efficient 3D Diffusion Alignment with 2D Rewards	Qingming Liu et.al.	2506.15684	null
2025-06-18	Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model	Anirud Aggarwal et.al.	2506.15682	link
2025-06-18	UniRelight: Learning Joint Decomposition and Synthesis for Video Relighting	Kai He et.al.	2506.15673	null
2025-06-18	Pathwise convergence of a novel numerical scheme based on semi-implicit method for stochastic differential-algebraic equations with non-global Lipschitz coefficients	Guy Tsafack et.al.	2506.15627	null
2025-06-18	HOIDiNi: Human-Object Interaction through Diffusion Noise Optimization	Roey Ron et.al.	2506.15625	null
2025-06-18	One-Step Diffusion for Detail-Rich and Temporally Consistent Video Super-Resolution	Yujing Sun et.al.	2506.15591	link
2025-06-18	Control and Realism: Best of Both Worlds in Layout-to-Image without Training	Bonan Li et.al.	2506.15563	null
2025-06-18	Diff-TONE: Timestep Optimization for iNstrument Editing in Text-to-Music Diffusion Models	Teysir Baoueb et.al.	2506.15530	null
2025-06-18	GenHOI: Generalizing Text-driven 4D Human-Object Interaction Synthesis for Unseen Objects	Shujia Li et.al.	2506.15483	null
2025-06-18	A deep shotgun method for solving high-dimensional parabolic partial differential equations	Wenjun Xu et.al.	2506.15481	null
2025-06-18	Provable Maximum Entropy Manifold Exploration via Diffusion Models	Riccardo De Santi et.al.	2506.15385	null
2025-06-18	Global Ground Metric Learning with Applications to scRNA data	Damin Kühn et.al.	2506.15383	link
2025-06-18	When Model Knowledge meets Diffusion Model: Diffusion-assisted Data-free Image Synthesis with Alignment of Domain and Class	Yujin Kim et.al.	2506.15381	null
2025-06-18	Acoustic Waveform Inversion with Image-to-Image Schrödinger Bridges	A. S. Stankevich et.al.	2506.15346	link
2025-06-18	Superpositions for General Conditional Mckean-Vlasov Stochastic Differential Equations	Qi Feng et.al.	2506.15341	null
2025-06-17	CDP: Towards Robust Autoregressive Visuomotor Policy Learning via Causal Diffusion	Jiahua Ma et.al.	2506.14769	null
2025-06-17	Cost-Aware Routing for Efficient Text-To-Image Generation	Qinchan et.al.	2506.14753	null
2025-06-17	A Minkowski problem for $α$ -concave functions via optimal transport	Xiao Li et.al.	2506.14735	null
2025-06-17	Iterative Camera-LiDAR Extrinsic Optimization via Surrogate Diffusion	Ni Ou et.al.	2506.14706	null
2025-06-17	On Quantum BSDE Solver for High-Dimensional Parabolic PDEs	Howard Su et.al.	2506.14612	null
2025-06-17	Risk Estimation of Knee Osteoarthritis Progression via Predictive Multi-task Modelling from Efficient Diffusion Model using X-ray Images	David Butler et.al.	2506.14560	null
2025-06-17	DreamLight: Towards Harmonious and Consistent Image Relighting	Yong Liu et.al.	2506.14549	null
2025-06-17	Using BDF schemes in the temporal integration of POD-ROM methods	Bosco García-Archilla et.al.	2506.14543	null
2025-06-17	Reimagining Target-Aware Molecular Generation through Retrieval-Enhanced Aligned Diffusion	Dong Xu et.al.	2506.14488	null
2025-06-17	LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs	Xiaoran Liu et.al.	2506.14429	null
2025-06-17	Causally Steered Diffusion for Automated Video Counterfactual Generation	Nikos Spyrou et.al.	2506.14404	link
2025-06-17	Decoupled Classifier-Free Guidance for Counterfactual Diffusion Models	Tian Xia et.al.	2506.14399	null
2025-06-17	FRIDU: Functional Map Refinement with Guided Image Diffusion	Avigail Cohen Rimon et.al.	2506.14322	null
2025-06-17	Optimal Incentive for Regulated Production	Benhao Du et.al.	2506.14286	null
2025-06-17	CausalDiffTab: Mixed-Type Causal-Aware Diffusion for Tabular Data Generation	Jia-Chen Zhang et.al.	2506.14206	null
2025-06-16	Diagnosing and Improving Diffusion Models by Estimating the Optimal Loss Value	Yixian Xu et.al.	2506.13763	null
2025-06-17	VideoPDE: Unified Generative PDE Solving via Video Inpainting Diffusion Models	Edward Li et.al.	2506.13754	null
2025-06-16	OTFusion: Bridging Vision-only and Vision-Language Models via Optimal Transport for Transductive Zero-Shot Learning	Qiyu Xu et.al.	2506.13723	null
2025-06-16	MultiViT2: A Data-augmented Multimodal Neuroimaging Prediction Framework via Latent Diffusion Model	Bi Yuda et.al.	2506.13667	null
2025-06-16	Absolutely Continuous Curves of Stochastic Processes	Beatrice Acciaio et.al.	2506.13634	null
2025-06-16	Exploiting the Exact Denoising Posterior Score in Training-Free Guidance of Diffusion Models	Gregory Bellchambers et.al.	2506.13614	null
2025-06-16	Dive3D: Diverse Distillation-based Text-to-3D Generation via Score Implicit Matching	Weimin Bai et.al.	2506.13594	null
2025-06-16	Flexible-length Text Infilling for Discrete Diffusion Models	Andrew Zhang et.al.	2506.13579	null
2025-06-16	X-Scene: Large-Scale Driving Scene Generation with High Fidelity and Flexible Controllability	Yu Yang et.al.	2506.13558	null
2025-06-16	Seismic Acoustic Impedance Inversion Framework Based on Conditional Latent Generative Diffusion Model	Jie Chen et.al.	2506.13529	null
2025-06-16	Deep Diffusion Models and Unsupervised Hyperspectral Unmixing for Realistic Abundance Map Synthesis	Martina Pastorino et.al.	2506.13484	null
2025-06-16	PRO: Projection Domain Synthesis for CT Imaging	Kang Chen et.al.	2506.13443	null
2025-06-16	Zero-Shot Solving of Imaging Inverse Problems via Noise-Refined Likelihood Guided Diffusion Models	Zhen Wang et.al.	2506.13391	null
2025-06-16	LapDDPM: A Conditional Graph Diffusion Model for scRNA-seq Generation with Spectral Adversarial Perturbations	Lorenzo Bini et.al.	2506.13344	null
2025-06-16	Propagation of Galactic cosmic rays: the influence of anisotropic diffusion	Ala’a AL-Zetoun et.al.	2506.13314	null
2025-06-13	A Robust Local Fréchet Regression Using Unbalanced Neural Optimal Transport with Applications to Dynamic Single-cell Genomics Data	Binghao Yan et.al.	2506.11969	null
2025-06-13	Random rotational invariance of integration by parts formulas within a Bismut-type approach	Susanna Dehò et.al.	2506.11937	null
2025-06-13	Measurement-aligned Flow for Inverse Problem	Shaorong Zhang et.al.	2506.11893	null
2025-06-13	Learning to Integrate	Oliver G. Ernst et.al.	2506.11801	link
2025-06-13	CLIP Meets Diffusion: A Synergistic Approach to Anomaly Detection	Byeongchan Lee et.al.	2506.11772	null
2025-06-13	State constrained stochastic optimal control of a PV system with battery storage via Fokker-Planck and Hamilton-Jacobi-Bellman equations	Alfredo Bermúdez et.al.	2506.11765	null
2025-06-13	DiffFuSR: Super-Resolution of all Sentinel-2 Multispectral Bands using Diffusion Models	Muhammad Sarmad et.al.	2506.11764	link
2025-06-13	Simulating realistic radio continuum survey maps with diffusion models	Tobias Vičánek Martínez et.al.	2506.11715	link
2025-06-13	Fusion of multi-source precipitation records via coordinate-based generative model	Sencan Sun et.al.	2506.11698	null
2025-06-13	Robust Filtering – Novel Statistical Learning and Inference Algorithms with Applications	Aamir Hussain Chughtai et.al.	2506.11530	null
2025-06-13	Foundation Models in Autonomous Driving: A Survey on Scenario Generation and Scenario Analysis	Yuan Gao et.al.	2506.11526	link
2025-06-13	An infinite horizon sufficient stochastic maximum principle for regime switching diffusions and applications	Kai Ding et.al.	2506.11523	null
2025-06-13	Taming Stable Diffusion for Computed Tomography Blind Super-Resolution	Chunlei Li et.al.	2506.11496	null
2025-06-13	Preserving Clusters in Prompt Learning for Unsupervised Domain Adaptation	Tung-Long Vuong et.al.	2506.11493	null
2025-06-13	LiLAC: A Lightweight Latent ControlNet for Musical Audio Generation	Tom Baker et.al.	2506.11476	null
2025-06-12	SceneCompleter: Dense 3D Scene Completion for Generative Novel View Synthesis	Weiliang Chen et.al.	2506.10981	null
2025-06-12	Fine-Grained Perturbation Guidance via Attention Head Selection	Donghoon Ahn et.al.	2506.10978	null
2025-06-12	What Exactly Does Guidance Do in Masked Discrete Diffusion Models	He Ye et.al.	2506.10971	null
2025-06-13	MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning	Yuxuan Luo et.al.	2506.10963	null
2025-06-12	SpectralAR: Spectral Autoregressive Visual Generation	Yuanhui Huang et.al.	2506.10962	null
2025-06-12	ReGuidance: A Simple Diffusion Wrapper for Boosting Sample Quality on Hard Inverse Problems	Aayush Karan et.al.	2506.10955	null
2025-06-12	The Diffusion Duality	Subham Sekhar Sahoo et.al.	2506.10892	link
2025-06-12	ME: Trigger Element Combination Backdoor Attack on Copyright Infringement	Feiyu Yang et.al.	2506.10776	null
2025-06-13	PDESpectralRefiner: Achieving More Accurate Long Rollouts with Spectral Adjustment	Li Luo et.al.	2506.10711	null
2025-06-12	Unsourced Adversarial CAPTCHA: A Bi-Phase Adversarial CAPTCHA Framework	Xia Du et.al.	2506.10685	null
2025-06-12	Admitted symmetries of Backward Stochastic Differential Equations	Anas Ouknine et.al.	2506.10650	null
2025-06-12	GigaVideo-1: Advancing Video Generation via Automatic Feedback with 4 GPU-Hours Fine-Tuning	Xiaoyi Bao et.al.	2506.10639	null
2025-06-12	Anatomy-Grounded Weakly Supervised Prompt Tuning for Chest X-ray Latent Diffusion Models	Konstantinos Vilouras et.al.	2506.10633	null
2025-06-12	Hessian Geometry of Latent Space in Generative Models	Alexander Lobashev et.al.	2506.10632	link
2025-06-12	TexTailor: Customized Text-aligned Texturing via Effective Resampling	Suin Lee et.al.	2506.10612	link
2025-06-11	Text-Aware Image Restoration with Diffusion Models	Jaewon Min et.al.	2506.09993	null
2025-06-11	Constrained Denoising, Empirical Bayes, and Optimal Transport	Adam Quinn Jaffe et.al.	2506.09986	null
2025-06-11	Canonical Latent Representations in Conditional Diffusion Models	Yitao Xu et.al.	2506.09955	null
2025-06-11	HadaNorm: Diffusion Transformer Quantization through Mean-Centered Transformations	Marco Federici et.al.	2506.09932	null
2025-06-11	Wasserstein Distances on Quantum Structures: an Overview	Emily Beatty et.al.	2506.09794	null
2025-06-11	ELBO-T2IAlign: A Generic ELBO-Based Method for Calibrating Pixel-level Text-Image Alignment in Diffusion Models	Qin Zhou et.al.	2506.09740	null
2025-06-11	Non-Euclidean dual gradient ascent for entropically regularized linear and semidefinite programming	Yuhang Cai et.al.	2506.09711	null
2025-06-11	Training-Free Voice Conversion with Factorized Optimal Transport	Alexander Lobashev et.al.	2506.09709	link
2025-06-11	Wasserstein Hypergraph Neural Network	Iulia Duta et.al.	2506.09682	null
2025-06-11	Assessing the Quality of Denoising Diffusion Models in Wasserstein Distance: Noisy Score and Optimal Bounds	Vahan Arsenyan et.al.	2506.09681	null
2025-06-11	VideoMat: Extracting PBR Materials from Video Diffusion Models	Jacob Munkberg et.al.	2506.09665	null
2025-06-11	DGAE: Diffusion-Guided Autoencoder for Efficient Latent Representation Learning	Dongxu Liu et.al.	2506.09644	null
2025-06-11	AngleRoCL: Angle-Robust Concept Learning for Physically View-Invariant T2I Adversarial Patches	Wenjun Ji et.al.	2506.09538	null
2025-06-11	Gaussian Herding across Pens: An Optimal Transport Perspective on Global Gaussian Reduction for 3DGS	Tao Wang et.al.	2506.09534	null
2025-06-11	Fast Monte Carlo Tree Diffusion: 100x Speedup via Parallel Sparse Planning	Jaesik Yoon et.al.	2506.09498	null
2025-06-10	MagCache: Fast Video Generation with Magnitude-Aware Cache	Zehong Ma et.al.	2506.09045	link
2025-06-10	Diffuse and Disperse: Image Generation with Representation Regularization	Runqian Wang et.al.	2506.09027	null
2025-06-10	Branched Schrödinger Bridge Matching	Sophia Tang et.al.	2506.09007	null
2025-06-10	An Efficient Augmented Lagrangian Method for Dynamic Optimal Transport on Surfaces Based on Second-Order Cone Programming	Liang Chen et.al.	2506.08988	null
2025-06-10	Asymptotic error distribution for stochastic Runge–Kutta methods of strong order one	Diancong Jin et.al.	2506.08937	null
2025-06-10	HiSin: Efficient High-Resolution Sinogram Inpainting via Resolution-Guided Progressive Inference	Jiaze E et.al.	2506.08809	null
2025-06-10	Flow Diverse and Efficient: Learning Momentum Flow Matching via Stochastic Velocity Field Sampling	Zhiyuan Ma et.al.	2506.08796	null
2025-06-10	Normalized Radon Cumulative Distribution Transforms for Invariance and Robustness in Optimal Transport Based Image Classification	Matthias Beckmann et.al.	2506.08761	link
2025-06-10	MoSiC: Optimal-Transport Motion Trajectory for Dense Self-Supervised Learning	Mohammadreza Salehi et.al.	2506.08694	link
2025-06-10	Efficient Uncertainty Propagation with Guarantees in Wasserstein Distance	Eduardo Figueiredo et.al.	2506.08689	null
2025-06-10	MAMBO: High-Resolution Generative Approach for Mammography Images	Milica Škipina et.al.	2506.08677	null
2025-06-10	RoboSwap: A GAN-driven Video Diffusion Framework For Unsupervised Robot Arm Swapping	Yang Bai et.al.	2506.08632	null
2025-06-10	Diffusion model for analyzing quantum fingerprints in conductance fluctuation	Naoto Yokoi et.al.	2506.08617	null
2025-06-10	Flow Matching Meets PDEs: A Unified Framework for Physics-Constrained Generation	Giacomo Baldan et.al.	2506.08604	null
2025-06-10	LiftVSR: Lifting Image Diffusion to Video Super-Resolution via Hybrid Temporal Modeling with Only 4 $\times$ RTX 4090s	Xijun Wang et.al.	2506.08529	null
2025-06-09	StableMTL: Repurposing Latent Diffusion Models for Multi-Task Learning from Partially Annotated Synthetic Datasets	Anh-Quan Cao et.al.	2506.08013	link
2025-06-09	Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion	Xun Huang et.al.	2506.08009	null
2025-06-09	Dynamic View Synthesis as an Inverse Problem	Hidir Yesiltepe et.al.	2506.08004	null
2025-06-09	MADFormer: Mixed Autoregressive and Diffusion Transformers for Continuous Image Generation	Junhao Chen et.al.	2506.07999	null
2025-06-09	Generative Modeling of Weights: Generalization or Memorization?	Boya Zeng et.al.	2506.07998	link
2025-06-09	Stochastic portfolio theory with price impact	David Itkin et.al.	2506.07993	null
2025-06-09	Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers	Zhengyao Lv et.al.	2506.07986	link
2025-06-09	Gradients: When Markets Meet Fine-tuning – A Distributed Approach to Model Optimisation	Christopher Subia-Waud et.al.	2506.07940	null
2025-06-09	Efficient Seismic Data Interpolation via Sparse Attention Transformer and Diffusion Model	Xiaoli Wei et.al.	2506.07923	null
2025-06-09	Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces	Kevin Rojas et.al.	2506.07903	link
2025-06-09	FunDiff: Diffusion Models over Function Spaces for Physics-Informed Generative Modeling	Sifan Wang et.al.	2506.07902	link
2025-06-09	Video Unlearning via Low-Rank Refusal Vector	Simone Facchiano et.al.	2506.07891	null
2025-06-09	Diffusion Counterfactual Generation with Semantic Abduction	Rajat Rasal et.al.	2506.07883	link
2025-06-09	Stability of Mean-Field Variational Inference	Shunan Sheng et.al.	2506.07856	null
2025-06-09	Jarzynski Reweighting and Sampling Dynamics for Training Energy-Based Models: Theoretical Analysis of Different Transition Kernels	Davide Carbone et.al.	2506.07843	null
2025-06-06	STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis	Jiatao Gu et.al.	2506.06276	null
2025-06-06	Antithetic Noise in Diffusion Models	Jing Jia et.al.	2506.06185	null
2025-06-06	Feedback Guidance of Diffusion Models	Koulischer Felix et.al.	2506.06085	null
2025-06-06	Restereo: Diffusion stereo video generation and restoration	Xingchang Huang et.al.	2506.06023	null
2025-06-06	Optimization-Free Universal Watermark Forgery with Regenerative Diffusion Models	Chaoyi Zhu et.al.	2506.06018	link
2025-06-09	AQUATIC-Diff: Additive Quantization for Truly Tiny Compressed Diffusion Models	Adil Hasan et.al.	2506.05960	null
2025-06-06	FADE: Frequency-Aware Diffusion Model Factorization for Video Editing	Yixuan Zhu et.al.	2506.05934	link
2025-06-06	Convection Anisotropies of Cosmic Rays in Highly Magnetized Plasma	Yiran Zhang et.al.	2506.05923	null
2025-06-06	WhisQ: Cross-Modal Representation Learning for Text-to-Music MOS Prediction	Jakaria Islam Emon et.al.	2506.05899	null
2025-06-06	Stealix: Model Stealing via Prompt Evolution	Zhixiong Zhuang et.al.	2506.05867	null
2025-06-06	FontAdapter: Instant Font Adaptation in Visual Text Generation	Myungkyu Koo et.al.	2506.05843	null
2025-06-06	LLIA – Enabling Low-Latency Interactive Avatars: Real-Time Audio-Driven Portrait Video Generation with Diffusion Models	Haojie Yu et.al.	2506.05806	null
2025-06-06	BiTrajDiff: Bidirectional Trajectory Generation with Diffusion Models for Offline Reinforcement Learning	Yunpeng Qing et.al.	2506.05762	null
2025-06-06	Latent Diffusion Model Based Denoising Receiver for 6G Semantic Communication: From Stochastic Differential Theory to Application	Xiucheng Wang et.al.	2506.05710	null
2025-06-06	Learning Design-Score Manifold to Guide Diffusion Models for Offline Optimization	Tailin Zhou et.al.	2506.05680	null
2025-06-05	Contrastive Flow Matching	George Stoica et.al.	2506.05350	link
2025-06-05	Exploring Diffusion Transformer Designs via Grafting	Keshigeyan Chandrasegaran et.al.	2506.05340	link
2025-06-05	Progressive Tempering Sampler with Diffusion	Severi Rissanen et.al.	2506.05231	link
2025-06-05	Optimal-PhiBE: A PDE-based Model-free framework for Continuous-time Reinforcement Learning	Yuhua Zhu et.al.	2506.05208	link
2025-06-05	OGGSplat: Open Gaussian Growing for Generalizable Reconstruction with Expanded Field-of-View	Yanbo Wang et.al.	2506.05204	link
2025-06-05	Quantifying Cross-Modality Memorization in Vision-Language Models	Yuxin Wen et.al.	2506.05198	null
2025-06-05	Associative Memory and Generative Diffusion in the Zero-noise Limit	Joshua Hess et.al.	2506.05178	null
2025-06-05	Neural Jumps for Option Pricing	Duosi Zheng et.al.	2506.05137	null
2025-06-05	SeedEdit 3.0: Fast and High-Quality Generative Image Editing	Peng Wang et.al.	2506.05083	null
2025-06-05	UnHiPPO: Uncertainty-aware Initialization for State Space Models	Marten Lienen et.al.	2506.05065	null
2025-06-05	FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing	Guangzhao Li et.al.	2506.05046	null
2025-06-05	On the geometry of synthetic null hypersurfaces	Fabio Cavalletti et.al.	2506.04934	null
2025-06-05	Weak solutions of Stochastic Volterra Equations in convex domains with general kernels	Eduardo Abi Jaber et.al.	2506.04911	null
2025-06-05	Invisible Backdoor Triggers in Image Editing Model via Deep Watermarking	Yu-Feng Chen et.al.	2506.04879	link
2025-06-05	Sparse Autoencoders, Again?	Yin Lu et.al.	2506.04859	null
2025-06-04	Sounding that Object: Interactive Object-Aware Image to Audio Generation	Tingle Li et.al.	2506.04214	null
2025-06-04	Diffusion Domain Teacher: Diffusion Guided Domain Adaptive Object Detector	Boyong He et.al.	2506.04211	link
2025-06-04	Transportation cost and contraction coefficient for channels on von Neumann algebras	Roy Araiza et.al.	2506.04197	null
2025-06-04	Image Editing As Programs with Diffusion Models	Yujia Hu et.al.	2506.04158	null
2025-06-04	Global convergence rates in the relaxation limits for the compressible Euler and Euler-Maxwell systems in Sobolev spaces	Timothée Crin-Barat et.al.	2506.04103	null
2025-06-04	A Generative Adaptive Replay Continual Learning Model for Temporal Knowledge Graph Reasoning	Zhiyu Zhang et.al.	2506.04083	null
2025-06-04	Optimal Transport-based Domain Alignment as a Preprocessing Step for Federated Learning	Luiz Manella Pereira et.al.	2506.04071	null
2025-06-04	GORACS: Group-level Optimal Transport-guided Coreset Selection for LLM-based Recommender Systems	Tiehua Mei et.al.	2506.04015	null
2025-06-04	Large deviations for scaled families of Schrödinger bridges with reflection	Viktor Nilsson et.al.	2506.03999	null
2025-06-04	Beyond water limitation in vegetation-autotoxicity patterning: a cross-diffusion model	Francesco Giannino et.al.	2506.03981	null
2025-06-05	Solving Inverse Problems via Diffusion-Based Priors: An Approximation-Free Ensemble Sampling Approach	Haoxuan Chen et.al.	2506.03979	null
2025-06-04	Lower Ricci Curvature for Hypergraphs	Shiyi Yang et.al.	2506.03943	null
2025-06-04	DiffCAP: Diffusion-based Cumulative Adversarial Purification for Vision Language Models	Jia Fu et.al.	2506.03933	null
2025-06-04	Personalized MR-Informed Diffusion Models for 3D PET Image Reconstruction	George Webber et.al.	2506.03804	null
2025-06-04	OV-COAST: Cost Aggregation with Optimal Transport for Open-Vocabulary Semantic Segmentation	Aditya Gandhamal et.al.	2506.03706	null
2025-06-03	AnimeShooter: A Multi-Shot Animation Dataset for Reference-Guided Video Generation	Lu Qiu et.al.	2506.03126	null
2025-06-03	DCM: Dual-Expert Consistency Model for Efficient and High-Quality Video Generation	Zhengyao Lv et.al.	2506.03123	null
2025-06-03	Rectified Flows for Fast Multiscale Fluid Flow Modeling	Victor Armegioiu et.al.	2506.03111	null
2025-06-03	TalkingMachines: Real-Time Audio-Driven FaceTime-Style Video via Autoregressive Diffusion Models	Chetwin Low et.al.	2506.03099	null
2025-06-03	EDITOR: Effective and Interpretable Prompt Inversion for Text-to-Image Diffusion Models	Mingzhe Li et.al.	2506.03067	null
2025-06-03	PartComposer: Learning and Composing Part-Level Concepts from Single-Image Examples	Junyu Liu et.al.	2506.03004	null
2025-06-03	Astrophotography turbulence mitigation via generative models	Joonyeoup Kim et.al.	2506.02981	null
2025-06-03	Diffusion Buffer: Online Diffusion-based Speech Enhancement with Sub-Second Latency	Bunlong Lay et.al.	2506.02908	null
2025-06-03	DGMO: Training-Free Audio Source Separation through Diffusion-Guided Mask Optimization	Geonyoung Lee et.al.	2506.02858	null
2025-06-03	Optimal control of the Poisson equation with transport regularization: Properties of optimal transport plans and transport map	Christian Meyer et.al.	2506.02808	null
2025-06-03	Geometric Visual Servo Via Optimal Transport	Ethan Canzini et.al.	2506.02768	null
2025-06-03	Investigating Mask-aware Prototype Learning for Tabular Anomaly Detection	Ruiying Lu et.al.	2506.02757	null
2025-06-03	Theoretical Performance Guarantees for Partial Domain Adaptation via Partial Optimal Transport	Jayadev Naram et.al.	2506.02712	null
2025-06-03	Smoothed Preference Optimization via ReNoise Inversion for Aligning Diffusion Models with Varied Human Preferences	Yunhong Lu et.al.	2506.02698	null
2025-06-03	MotionRAG-Diff: A Retrieval-Augmented Diffusion Framework for Long-Term Music-to-Dance Generation	Mingyang Huang et.al.	2506.02661	null
2025-05-30	AdaHuman: Animatable Detailed 3D Human Generation with Compositional Multiview Diffusion	Yangyi Huang et.al.	2505.24877	null
2025-05-30	MiniMax-Remover: Taming Bad Noise Helps Video Object Removal	Bojia Zi et.al.	2505.24873	null
2025-05-30	Accelerated Sampling from Masked Diffusion Models via Entropy Bounded Unmasking	Heli Ben-Hamu et.al.	2505.24857	null
2025-05-30	RealDrive: Retrieval-Augmented Driving with Diffusion Models	Wenhao Ding et.al.	2505.24808	null
2025-05-30	Generalization Dynamics of Linear Diffusion Models	Claudia Merger et.al.	2505.24769	null
2025-05-30	Unsupervised Evolutionary Cell Type Matching via Entropy-Minimized Optimal Transport	Mu Qiao et.al.	2505.24759	link
2025-05-30	Conformal Prediction for Zero-Shot Models	Julio Silva-Rodríguez et.al.	2505.24693	link
2025-05-30	WILTing Trees: Interpreting the Distance Between MPNN Embeddings	Masahiro Negishi et.al.	2505.24642	null
2025-05-30	A Composite Predictive-Generative Approach to Monaural Universal Speech Enhancement	Jie Zhang et.al.	2505.24576	null
2025-05-30	UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation	Yang-Tian Sun et.al.	2505.24521	null
2025-05-30	EasyText: Controllable Diffusion Transformer for Multilingual Text Rendering	Runnan Lu et.al.	2505.24417	link
2025-05-30	IRBridge: Solving Image Restoration Bridge with Pre-trained Generative Diffusion Models	Hanting Wang et.al.	2505.24406	link
2025-05-30	Provably convergent stochastic fixed-point algorithm for free-support Wasserstein barycenter of continuous non-parametric measures	Zeyi Chen et.al.	2505.24384	link
2025-05-30	Neural Drift Estimation for Ergodic Diffusions: Non-parametric Analysis and Numerical Exploration	Simone Di Gregorio et.al.	2505.24383	null
2025-06-03	Interpreting Large Text-to-Image Diffusion Models with Dictionary Learning	Stepan Shabalin et.al.	2505.24360	link
2025-05-29	LoRAShop: Training-Free Multi-Concept Image Generation and Editing with Rectified Flow Transformers	Yusuf Dalva et.al.	2505.23758	null
2025-05-29	DarkDiff: Advancing Low-Light Raw Enhancement by Retasking Diffusion Models for Camera ISP	Amber Yijia Zheng et.al.	2505.23743	null
2025-05-29	LayerPeeler: Autoregressive Peeling for Layer-wise Image Vectorization	Ronghuan Wu et.al.	2505.23740	null
2025-05-29	How Animals Dance (When You’re Not Looking)	Xiaojuan Wang et.al.	2505.23738	null
2025-05-29	DiffER: Categorical Diffusion for Chemical Retrosynthesis	Sean Current et.al.	2505.23721	link
2025-05-29	ImmunoDiff: A Diffusion Model for Immunotherapy Response Prediction in Lung Cancer	Moinak Bhattacharya et.al.	2505.23675	null
2025-05-30	OpenUni: A Simple Baseline for Unified Multimodal Understanding and Generation	Size Wu et.al.	2505.23661	link
2025-05-29	VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models	Xiangdong Zhang et.al.	2505.23656	link
2025-05-29	Optimization-Free Diffusion Model – A Perturbation Theory Approach	Yuehaw Khoo et.al.	2505.23652	null
2025-05-29	ZeroSep: Separate Anything in Audio with Zero Training	Chao Huang et.al.	2505.23625	null
2025-05-29	Inference-time Scaling of Diffusion Models through Classical Search	Xiangcheng Zhang et.al.	2505.23614	null
2025-05-29	Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model	Qingyu Shi et.al.	2505.23606	link
2025-05-29	Subgraph Gaussian Embedding Contrast for Self-Supervised Graph Representation Learning	Shifeng Xie et.al.	2505.23529	link
2025-05-29	Normalizing Flows are Capable Models for RL	Raj Ghugare et.al.	2505.23527	link
2025-05-29	LAFR: Efficient Diffusion-based Blind Face Restoration via Latent Codebook Alignment Adapter	Runyi Li et.al.	2505.23462	null
2025-05-28	SPIRAL: Semantic-Aware Progressive LiDAR Scene Generation	Dekai Zhu et.al.	2505.22643	null
2025-05-28	Principled Out-of-Distribution Generalization via Simplicity	Jiawei Ge et.al.	2505.22622	null
2025-05-28	Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding	Chengyue Wu et.al.	2505.22618	null
2025-05-28	ImageReFL: Balancing Quality and Diversity in Human-Aligned Diffusion Models	Dmitrii Sorokin et.al.	2505.22569	null
2025-05-28	Test-Time Alignment of Discrete Diffusion Models with Sequential Monte Carlo	Chinmay Pani et.al.	2505.22524	null
2025-05-28	PrismLayers: Open Data for High-Quality Multi-Layer Transparent Image Generative Models	Junwen Chen et.al.	2505.22523	null
2025-05-28	Cascaded 3D Diffusion Models for Whole-body 3D 18-F FDG PET/CT synthesis from Demographics	Siyeop Yoon et.al.	2505.22489	null
2025-05-28	Self-Reflective Reinforcement Learning for Diffusion-based Image Reasoning Generation	Jiadong Pan et.al.	2505.22407	null
2025-05-28	Physics-Informed Distillation of Diffusion Models for PDE-Constrained Generation	Yi Zhang et.al.	2505.22391	null
2025-05-28	Computing Optimal Transport Maps and Wasserstein Barycenters Using Conditional Normalizing Flows	Gabriele Visentin et.al.	2505.22364	null
2025-05-28	A Closer Look on Memorization in Tabular Diffusion Model: A Data-Centric Perspective	Zhengyu Fang et.al.	2505.22322	null
2025-05-28	StateSpaceDiffuser: Bringing Long Context to Diffusion World Models	Nedko Savov et.al.	2505.22246	null
2025-05-28	Physics-inspired Generative AI models via real hardware-based noisy quantum diffusion	Marco Parigi et.al.	2505.22193	null
2025-05-28	Unifying Continuous and Discrete Text Diffusion with Non-simultaneous Diffusion Processes	Bocheng Li et.al.	2505.22165	null
2025-05-28	What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?	Jinhong Ni et.al.	2505.22129	null
2025-05-27	Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment	Xiaojun Jia et.al.	2505.21494	link
2025-05-27	Be Decisive: Noise-Induced Layouts for Multi-Subject Generation	Omer Dahary et.al.	2505.21488	null
2025-05-27	PropMolFlow: Property-guided Molecule Generation with Geometry-Complete Flow Matching	Cheng Zeng et.al.	2505.21469	null
2025-05-27	Accelerating Diffusion Language Model Inference via Efficient KV Caching and Guided Diffusion	Zhanqiu Hu et.al.	2505.21467	null
2025-05-27	An Integrated Time-Varying Ornstein-Uhlenbeck Process for Jointly Modeling Individual and Population-Level Dynamics of Golden Eagles	Michael L. Shull et.al.	2505.21453	null
2025-05-27	CoDA: Coordinated Diffusion Noise Optimization for Whole-Body Manipulation of Articulated Objects	Huaijin Pi et.al.	2505.21437	null
2025-05-27	Learning Individual Behavior in Agent-Based Models with Graph Diffusion Networks	Francesco Cozzi et.al.	2505.21426	link
2025-05-27	A Convergence Theory for Diffusion Language Models: An Information-Theoretic Perspective	Gen Li et.al.	2505.21400	null
2025-05-27	A transfer principle for computing the adapted Wasserstein distance between stochastic processes	Yifan Jiang et.al.	2505.21337	null
2025-05-28	MagicTryOn: Harnessing Diffusion Transformer for Garment-Preserving Video Virtual Try-on	Guangyuan Li et.al.	2505.21325	null
2025-05-27	Sample complexity of optimal transport barycenters with discrete support	Léo Portales et.al.	2505.21274	null
2025-05-27	Simulations of the churning mode: toroidally symmetric plasma convection and turbulence around the X-points in a snowflake divertor	D Power et.al.	2505.21223	null
2025-05-27	Input Convex Kolmogorov Arnold Networks	Thomas Deschatre et.al.	2505.21208	null
2025-05-27	Sci-Fi: Symmetric Constraint for Frame Inbetweening	Liuhan Chen et.al.	2505.21205	null
2025-05-27	Normalized Attention Guidance: Universal Negative Guidance for Diffusion Model	Dar-Yen Chen et.al.	2505.21179	null
2025-05-26	Long-Context State-Space Video World Models	Ryan Po et.al.	2505.20171	null
2025-05-26	MolEditRL: Structure-Preserving Molecular Editing via Discrete Diffusion and Reinforcement Learning	Yuanxin Zhuang et.al.	2505.20131	null
2025-05-26	Understanding Generalization in Diffusion Models via Probability Flow Distance	Huijie Zhang et.al.	2505.20123	null
2025-05-26	Refining Few-Step Text-to-Multiview Diffusion via Reinforcement Learning	Ziyi Zhang et.al.	2505.20107	link
2025-05-26	PAMD: Plausibility-Aware Motion Diffusion Model for Long Dance Generation	Hongsong Wang et.al.	2505.20056	null
2025-05-26	Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion	Zheqi Lv et.al.	2505.20053	link
2025-05-26	ICDM: Interference Cancellation Diffusion Models for Wireless Semantic Communications	Tong Wu et.al.	2505.19983	null
2025-05-26	UltraVSR: Achieving Ultra-Realistic Video Super-Resolution with Efficient One-Step Diffusion Space	Yong Liu et.al.	2505.19958	null
2025-05-26	Harnessing the Power of Training-Free Techniques in Text-to-2D Generation for Text-to-3D Generation via Score Distillation Sampling	Junhong Lee et.al.	2505.19868	null
2025-05-26	On a retarded stochastic system with discrete diffusion modeling life tables	Tomás Caraballo et.al.	2505.19835	null
2025-05-26	TeViR: Text-to-Video Reward with Diffusion Models for Efficient Reinforcement Learning	Yuhui Chen et.al.	2505.19769	null
2025-05-26	On some coupled local and nonlocal diffusion models	Juan Pablo Borthagaray et.al.	2505.19765	null
2025-05-27	SAIL: Self-supervised Albedo Estimation from Real Images with a Latent Diffusion Model	Hala Djeghim et.al.	2505.19751	null
2025-05-26	Extremum Flow Matching for Offline Goal Conditioned Reinforcement Learning	Quentin Rouxel et.al.	2505.19717	null
2025-05-26	On the Relation between Rectified Flows and Optimal Transport	Johannes Hertrich et.al.	2505.19712	null
2025-05-26	Knowledge-Aligned Counterfactual-Enhancement Diffusion Perception for Unsupervised Cross-Domain Visual Emotion Recognition	Wen Yin et.al.	2505.19694	null
2025-05-23	Generative Distribution Embeddings	Nic Fishman et.al.	2505.18150	link
2025-05-23	Stochastic agent-based Monte Carlo simulations for reaction-diffusion models, population dynamics, and epidemic spreading	Mohamed Swailem et.al.	2505.18145	null
2025-05-26	TokBench: Evaluating Your Visual Tokenizer before Visual Generation	Junfeng Wu et.al.	2505.18142	null
2025-05-23	Dynamic Dual Buffer with Divide-and-Conquer Strategy for Online Continual Learning	Congren Dai et.al.	2505.18101	null
2025-05-23	Towards more transferable adversarial attack in black-box manner	Chun Tong Lei et.al.	2505.18097	null
2025-05-23	RestoreVAR: Visual Autoregressive Generation for All-in-One Image Restoration	Sudarshan Rajagopalan et.al.	2505.18047	null
2025-05-26	Strictly Constrained Generative Modeling via Split Augmented Langevin Sampling	Matthieu Blanke et.al.	2505.18017	link
2025-05-23	Distances for Markov chains from sample streams	Sergio Calo et.al.	2505.18005	null
2025-05-23	Segment Anyword: Mask Prompt Inversion for Open-Set Grounded Segmentation	Zhihua Liu et.al.	2505.17994	null
2025-05-23	Diffusion Classifiers Understand Compositionality, but Conditions Apply	Yujin Jeong et.al.	2505.17955	link
2025-05-23	Multi-Person Interaction Generation from Two-Person Motion Priors	Wenning Xu et.al.	2505.17860	null
2025-05-23	Generative Data Augmentation for Object Point Cloud Segmentation	Dekai Zhu et.al.	2505.17783	null
2025-05-23	TextFlux: An OCR-Free DiT Model for High-Fidelity Multilingual Scene Text Synthesis	Yu Xie et.al.	2505.17778	null
2025-05-23	R-Genie: Reasoning-Guided Generative Image Editing	Dong Zhang et.al.	2505.17768	null
2025-05-23	SeaLion: Semantic Part-Aware Latent Point Diffusion Models for 3D Generation	Dekai Zhu et.al.	2505.17721	null
2025-05-22	When Are Concepts Erased From Diffusion Models?	Kevin Lu et.al.	2505.17013	link
2025-05-22	Guided Diffusion Sampling on Function Spaces with Applications to PDEs	Jiachen Yao et.al.	2505.17004	link
2025-05-22	Pursuing Temporal-Consistent Video Virtual Try-On via Dynamic Pose Interaction	Dong Li et.al.	2505.16980	null
2025-05-22	Incorporating Visual Correspondence into Diffusion Model for Virtual Try-On	Siqi Wan et.al.	2505.16977	link
2025-05-22	Creatively Upscaling Images with Global-Regional Priors	Yurui Qian et.al.	2505.16976	null
2025-05-22	Bigger Isn’t Always Memorizing: Early Stopping Overparameterized Diffusion Models	Alessandro Favero et.al.	2505.16959	null
2025-05-22	LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning	Zebin You et.al.	2505.16933	null
2025-05-22	T2I-ConBench: Text-to-Image Benchmark for Continual Post-training	Zhehao Huang et.al.	2505.16875	null
2025-05-22	Training-Free Efficient Video Generation via Dynamic Token Carving	Yuechen Zhang et.al.	2505.16864	link
2025-05-22	Conditional Panoramic Image Generation via Masked Autoregressive Modeling	Chaoyang Wang et.al.	2505.16862	null
2025-05-23	LaViDa: A Large Diffusion Language Model for Multimodal Understanding	Shufan Li et.al.	2505.16839	link
2025-05-22	From EduVisBench to EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization	Haonian Ji et.al.	2505.16832	link
2025-05-22	SEED: Speaker Embedding Enhancement Diffusion Model	KiHyun Nam et.al.	2505.16798	link
2025-05-22	Learning Flexible Forward Trajectories for Masked Molecular Diffusion	Hyunjin Seo et.al.	2505.16790	null
2025-05-22	A Riemannian Optimization Approach for Finding the Nearest Reversible Markov Chain	Fabio Durastante et.al.	2505.16762	link
2025-05-21	Leveraging the Powerful Attention of a Pre-trained Diffusion Model for Exemplar-based Image Colorization	Satoshi Kosugi et.al.	2505.15812	link
2025-05-21	Neural Conditional Transport Maps	Carlos Rodriguez-Pardo et.al.	2505.15808	null
2025-05-21	Interspatial Attention for Efficient 4D Human Video Generation	Ruizhi Shao et.al.	2505.15800	null
2025-05-21	VARD: Efficient and Dense Fine-Tuning for Diffusion Models with Value-based RL	Fengyuan Dai et.al.	2505.15791	null
2025-05-21	SwarmDiff: Swarm Robotic Trajectory Planning in Cluttered Environments via Diffusion Transformer	Kang Ding et.al.	2505.15679	null
2025-05-21	FragFake: A Dataset for Fine-Grained Detection of Edited Images with Vision Language Models	Zhen Sun et.al.	2505.15644	link
2025-05-21	Deep Learning for Continuous-time Stochastic Control with Jumps	Patrick Cheridito et.al.	2505.15602	null
2025-05-21	Beyond Classification: Evaluating Diffusion Denoised Smoothing for Security-Utility Trade off	Yury Belousov et.al.	2505.15594	null
2025-05-21	Milstein-type methods for strong approximation of systems of SDEs with a discontinuous drift coefficient	Christopher Rauhögger et.al.	2505.15509	null
2025-05-21	Comprehensive Evaluation and Analysis for NSFW Concept Erasure in Text-to-Image Diffusion Models	Die Chen et.al.	2505.15450	null
2025-05-21	Responsible Diffusion Models via Constraining Text Embeddings within Safe Regions	Zhiwen Li et.al.	2505.15427	link
2025-05-21	My Face Is Mine, Not Yours: Facial Protection Against Diffusion Model Face Swapping	Hon Ming Yam et.al.	2505.15336	null
2025-05-21	FaceCrafter: Identity-Conditional Diffusion with Disentangled Control over Facial Pose, Expression, and Emotion	Kazuaki Mishima et.al.	2505.15313	null
2025-05-21	Cascaded Diffusion Models for Neural Motion Planning	Mohit Sharma et.al.	2505.15157	null
2025-05-21	Sculpting Features from Noise: Reward-Guided Hierarchical Diffusion for Task-Optimal Feature Transformation	Nanxu Gong et.al.	2505.15152	link
2025-05-20	Training-Free Watermarking for Autoregressive Image Generation	Yu Tong et.al.	2505.14673	link
2025-05-20	Dynadiff: Single-stage Decoding of Images from Continuously Evolving fMRI	Marlène Careil et.al.	2505.14556	link
2025-05-21	Sparc3D: Sparse Representation and Construction for High-Resolution 3D Shapes Modeling	Zhihao Li et.al.	2505.14521	null
2025-05-20	Learning to Integrate Diffusion ODEs by Averaging the Derivatives	Wenze Liu et.al.	2505.14502	null
2025-05-20	CtrlDiff: Boosting Large Diffusion Language Models with Dynamic Block Prediction and Controllable Generation	Chihan Huang et.al.	2505.14455	null
2025-05-20	Compositional amortized inference for large-scale hierarchical Bayesian models	Jonas Arruda et.al.	2505.14429	null
2025-05-20	The Koopmanization of controlled nonlinear Itô stochastic differential systems and its comparison with the Carleman embedding: new results	Amruta Lambe et.al.	2505.14369	null
2025-05-20	Vid2World: Crafting Video Diffusion Models to Interactive World Models	Siqiao Huang et.al.	2505.14357	null
2025-05-20	Malliavin derivative and sensitivity for optimal liquidation	Alexandre Popier et.al.	2505.14287	null
2025-05-20	Instructing Text-to-Image Diffusion Models via Classifier-Guided Semantic Optimization	Yuanyuan Chang et.al.	2505.14254	link
2025-05-20	Challenges and Limitations in the Synthetic Generation of mHealth Sensor Data	Flavio Di Martino et.al.	2505.14206	null
2025-05-20	FlowQ: Energy-Guided Flow Policies for Offline Reinforcement Learning	Marvin Alles et.al.	2505.14139	null
2025-05-20	Adaptive Cyclic Diffusion for Inference Scaling	Gyubin Lee et.al.	2505.14036	null
2025-05-20	Combining Deterministic Enhanced Conditions with Dual-Streaming Encoding for Diffusion-Based Speech Enhancement	Hao Shi et.al.	2505.13983	link
2025-05-20	Predicting Dynamical Systems across Environments via Diffusive Model Weight Generation	Ruikun Li et.al.	2505.13919	null
2025-05-19	Joint Velocity-Growth Flow Matching for Single-Cell Dynamics Modeling	Dongyi Wang et.al.	2505.13413	null
2025-05-19	Faster Video Diffusion with Trainable Sparse Attention	Peiyuan Zhang et.al.	2505.13389	null
2025-05-19	Restoration Score Distillation: From Corrupted Diffusion Pretraining to One-Step High-Quality Generation	Yasi Zhang et.al.	2505.13377	null
2025-05-20	Minimum-Excess-Work Guidance	Christopher Kolloff et.al.	2505.13375	null
2025-05-20	One-Step Offline Distillation of Diffusion-based Models via Koopman Modeling	Nimrod Berman et.al.	2505.13358	link
2025-05-19	FlowPure: Continuous Normalizing Flows for Adversarial Purification	Elias Collaert et.al.	2505.13280	link
2025-05-19	Seeing the Unseen: How EMoE Unveils Bias in Text-to-Image Diffusion Models	Lucas Berry et.al.	2505.13273	null
2025-05-19	Diffusion Models with Double Guidance: Generate with aggregated datasets	Yanfeng Yang et.al.	2505.13213	null
2025-05-20	Filtering in a hazard rate change-point model with financial and life-insurance applications	Matteo Buttarazzi et.al.	2505.13185	null
2025-05-19	Higher fidelity perceptual image and video compression with a latent conditioned residual denoising diffusion model	Jonas Brenig et.al.	2505.13152	link
2025-05-19	Neurosymbolic Diffusion Models	Emile van Krieken et.al.	2505.13138	link
2025-05-19	Constraint-Aware Diffusion Guidance for Robotics: Real-Time Obstacle Avoidance for Autonomous Racing	Hao Ma et.al.	2505.13131	null
2025-05-19	Touch2Shape: Touch-Conditioned 3D Diffusion for Shape Exploration and Reconstruction	Yuanbo Wang et.al.	2505.13091	null
2025-05-19	Cross-modal Knowledge Transfer Learning as Graph Matching Based on Optimal Transport for ASR	Xugang Lu et.al.	2505.13079	null
2025-05-19	A continuous calibration of the ATLAS flavour-tagging classifiers via optimal transportation maps	ATLAS Collaboration et.al.	2505.13063	null
2025-05-16	QVGen: Pushing the Limit of Quantized Video Generative Models	Yushi Huang et.al.	2505.11497	null
2025-05-16	Unsupervised Detection of Distribution Shift in Inverse Problems using Diffusion Models	Shirin Shoushtari et.al.	2505.11482	null
2025-05-16	PSDiffusion: Harmonized Multi-Layer Image Generation via Layout and Appearance Alignment	Dingbang Huang et.al.	2505.11468	null
2025-05-16	Exploiting Radiance Fields for Grasp Generation on Novel Synthetic Views	Abhishek Kashyap et.al.	2505.11467	null
2025-05-16	A Generative Framework for Causal Estimation via Importance-Weighted Diffusion Distillation	Xinran Song et.al.	2505.11444	null
2025-05-16	Diff-Unfolding: A Model-Based Score Learning Framework for Inverse Problems	Yuanhao Wang et.al.	2505.11393	null
2025-05-16	LipDiffuser: Lip-to-Speech Generation with Conditional Diffusion Models	Danilo de Oliveira et.al.	2505.11391	null
2025-05-16	MARRS: Masked Autoregressive Unit-based Reaction Synthesis	Y. B. Wang et.al.	2505.11334	null
2025-05-16	Decomposing stimulus-specific sensory neural information via diffusion models	Steeve Laquitaine et.al.	2505.11309	null
2025-05-16	Effective Probabilistic Time Series Forecasting with Fourier Adaptive Noise-Separated Diffusion	Xinyan Wang et.al.	2505.11306	null
2025-05-16	A Fourier Space Perspective on Diffusion Models	Fabian Falck et.al.	2505.11278	null
2025-05-16	DRAGON: A Large-Scale Dataset of Realistic Images Generated by Diffusion Models	Giulia Bertazzini et.al.	2505.11257	null
2025-05-16	LD-Scene: LLM-Guided Diffusion for Controllable Generation of Adversarial Safety-Critical Driving Scenarios	Mingxing Peng et.al.	2505.11247	null
2025-05-16	Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generation of Diffusion Models	Fu-Yun Wang et.al.	2505.11245	link
2025-05-16	Formal Uncertainty Propagation for Stochastic Dynamical Systems with Additive Noise	Steven Adams et.al.	2505.11219	null
2025-05-15	3D-Fixup: Advancing Photo Editing with 3D Priors	Yen-Chi Cheng et.al.	2505.10566	null
2025-05-15	Style Customization of Text-to-Vector Generation with Image Diffusion Priors	Peiying Zhang et.al.	2505.10558	null
2025-05-15	Does Feasibility Matter? Understanding the Impact of Feasibility on Synthetic Training Data	Yiwen Liu et.al.	2505.10551	link
2025-05-15	Pharmacophore-Conditioned Diffusion Model for Ligand-Based De Novo Drug Design	Amira Alakhdar et.al.	2505.10545	null
2025-05-15	Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps	Ningyuan Yang et.al.	2505.10482	null
2025-05-15	Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models	Zemin Huang et.al.	2505.10446	null
2025-05-15	Score-based diffusion nowcasting of GOES imagery	Randy J. Chase et.al.	2505.10432	null
2025-05-16	Whitened Score Diffusion: A Structured Prior for Imaging Inverse Problems	Jeffrey Alido et.al.	2505.10311	link
2025-05-15	Strong and weak convergence rates for fully coupled multiscale stochastic differential equations driven by $α$ -stable processes	Kun Yin et.al.	2505.10229	null
2025-05-15	From Combinatorics to Partial Differential Equations	Francesco Mattesini et.al.	2505.10175	null
2025-05-15	FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipulation	Jun Guo et.al.	2505.10075	null
2025-05-15	ORL-LDM: Offline Reinforcement Learning Guided Latent Diffusion Model Super-Resolution Reconstruction	Shijie Lyu et.al.	2505.10027	null
2025-05-15	From Air to Wear: Personalized 3D Digital Fashion with AR/VR Immersive 3D Sketching	Ying Zang et.al.	2505.09998	null
2025-05-15	Ordered-subsets Multi-diffusion Model for Sparse-view CT Reconstruction	Pengfei Yu et.al.	2505.09985	null
2025-05-15	Improving the Euclidean Diffusion Generation of Manifold Data by Mitigating Score Function Singularity	Zichen Liu et.al.	2505.09922	null
2025-05-14	Robust Representation and Estimation of Barycenters and Modes of Probability Measures on Metric Spaces	Washington Mio et.al.	2505.09609	link
2025-05-14	LightLab: Controlling Light Sources in Images with Diffusion Models	Nadav Magar et.al.	2505.09608	null
2025-05-14	Don’t Forget your Inverse DDIM for Image Editing	Guillermo Gomez-Trenado et.al.	2505.09571	null
2025-05-14	BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset	Jiuhai Chen et.al.	2505.09568	link
2025-05-14	Primal-dual splitting methods for phase-field surfactant model with moving contact lines	Wei Wu et.al.	2505.09469	null
2025-05-14	Diffusion Recommender Models and the Illusion of Progress: A Concerning Study of Reproducibility and a Conceptual Mismatch	Michael Benigni et.al.	2505.09364	null
2025-05-14	Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis	Bingxin Ke et.al.	2505.09358	link
2025-05-14	TransDiffuser: End-to-end Trajectory Generation with Decorrelated Multi-modal Representation for Autonomous Driving	Xuefeng Jiang et.al.	2505.09315	null
2025-05-14	Stochastic Optimal Control for Systems with Drifts of Bounded Variation: A Maximum Principle Approach	Antoine Marie Bogso et.al.	2505.09309	null
2025-05-14	Generating Full-field Evolution of Physical Dynamics from Irregular Sparse Observations	Panqi Chen et.al.	2505.09284	null
2025-05-14	A Note on Semantic Diffusion	Alexander P. Ryjov et.al.	2505.09283	null
2025-05-14	Few-Shot Anomaly-Driven Generation for Anomaly Classification and Segmentation	Guan Gui et.al.	2505.09263	link
2025-05-14	Optimal Transport-Based Domain Adaptation for Rotated Linear Regression	Brian Britos et.al.	2505.09229	null
2025-05-15	Generating time-consistent dynamics with discriminator-guided image diffusion models	Philipp Hess et.al.	2505.09089	null
2025-05-14	Reflected stochastic recursive control problems with jumps: dynamic programming and stochastic verification theorems	Lu Liu et.al.	2505.09070	null
2025-05-13	Controllable Image Colorization with Instance-aware Texts and Masks	Yanru An et.al.	2505.08705	null
2025-05-13	Boosting Zero-shot Stereo Matching using Large-scale Mixed Images Sources in the Real World	Yuran Wang et.al.	2505.08607	null
2025-05-13	Diffusion-assisted Model Predictive Control Optimization for Power System Real-Time Operation	Linna Xu et.al.	2505.08535	null
2025-05-13	Building-Block Aware Generative Modeling for 3D Crystals of Metal Organic Frameworks	Chenru Duan et.al.	2505.08531	link
2025-05-14	Improving Data Fidelity via Diffusion Model-based Correction and Super-Resolution	Wuzhe Xu et.al.	2505.08526	null
2025-05-13	ConDiSim: Conditional Diffusion Models for Simulation Based Inference	Mayank Nautiyal et.al.	2505.08403	null
2025-05-13	Adaptive Diffusion Policy Optimization for Robotic Manipulation	Huiyun Jiang et.al.	2505.08376	null
2025-05-13	Stationary Mean-Field Games of Singular Control under Knightian Uncertainty	Giorgio Ferrari et.al.	2505.08317	null
2025-05-13	Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion	Anle Ke et.al.	2505.08281	link
2025-05-13	Skeleton-Guided Diffusion Model for Accurate Foot X-ray Synthesis in Hallux Valgus Diagnosis	Midi Wan et.al.	2505.08247	link
2025-05-13	Identifying Memorization of Diffusion Models through p-Laplace Analysis	Jonathan Brokman et.al.	2505.08246	link
2025-05-13	ACT-R: Adaptive Camera Trajectories for 3D Reconstruction from Single Image	Yizhi Wang et.al.	2505.08239	null
2025-05-13	EventDiff: A Unified and Efficient Diffusion Model Framework for Event-based Video Frame Interpolation	Hanle Zheng et.al.	2505.08235	null
2025-05-13	Removing Watermarks with Partial Regeneration using Semantic Information	Krti Tallam et.al.	2505.08234	link
2025-05-13	Object detection in adverse weather conditions for autonomous vehicles using Instruct Pix2Pix	Unai Gurbindo et.al.	2505.08228	null
2025-05-12	DanceGRPO: Unleashing GRPO on Visual Generation	Zeyue Xue et.al.	2505.07818	null
2025-05-12	Pixel Motion as Universal Representation for Robot Control	Kanchana Ranasinghe et.al.	2505.07817	null
2025-05-12	Singular Control in Inventory Management with Smooth Ambiguity	Arnon Archankul et.al.	2505.07761	null
2025-05-12	LAMM-ViT: AI Face Detection via Layer-Aware Modulation of Region-Guided Attention	Jiangling Zhang et.al.	2505.07734	null
2025-05-12	Zero-sum Stochastic Differential Games of Impulse Control with Random Intervention Costs	Magnus Perninge et.al.	2505.07666	null
2025-05-12	ShotAdapter: Text-to-Multi-Shot Video Generation with Diffusion Models	Ozgur Kara et.al.	2505.07652	null
2025-05-12	Langevin Diffusion Approximation to Same Marginal Schrödinger Bridge	Medha Agarwal et.al.	2505.07647	null
2025-05-12	Identifiability of SDEs for reaction networks	Louis Faul et.al.	2505.07638	link
2025-05-12	Diffused Responsibility: Analyzing the Energy Consumption of Generative Text-to-Audio Diffusion Models	Riccardo Passoni et.al.	2505.07615	null
2025-05-12	Noise Optimized Conditional Diffusion for Domain Adaptation	Lingkun Luo et.al.	2505.07548	null
2025-05-12	Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning	Bohan Wang et.al.	2505.07538	null
2025-05-12	Measuring Financial Resilience Using Backward Stochastic Differential Equations	Roger J. A. Laeven et.al.	2505.07502	null
2025-05-12	Addressing degeneracies in latent interpolation for diffusion models	Erik Landolsi et.al.	2505.07481	null
2025-05-12	You Only Look One Step: Accelerating Backpropagation in Diffusion Sampling with Gradient Shortcuts	Hongkun Dou et.al.	2505.07477	link
2025-05-12	DiffCrysGen: A Score-Based Diffusion Model for Design of Diverse Inorganic Crystalline Materials	Sourav Mal et.al.	2505.07442	null
2025-05-09	Long time behaviour of Mean Field Games with fractional diffusion	Olav Ersland et.al.	2505.06183	null
2025-05-09	DiffLocks: Generating 3D Hair from a Single Image using Diffusion Models	Radu Alexandru Rosu et.al.	2505.06166	null
2025-05-09	Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation	Kunpeng Qiu et.al.	2505.06068	link
2025-05-09	Universal Approximation Theorem for Deep Q-Learning via FBSDE System	Qian Qi et.al.	2505.06023	null
2025-05-09	A 3D pocket-aware and evolutionary conserved interaction guided diffusion model for molecular optimization	Anjie Qiao et.al.	2505.05874	null
2025-05-09	PICD: Versatile Perceptual Image Compression with Diffusion Rendering	Tongda Xu et.al.	2505.05853	null
2025-05-09	Accelerating Diffusion Transformer via Increment-Calibrated Caching with Channel-Aware Singular Value Decomposition	Zhiyuan Chen et.al.	2505.05829	link
2025-05-09	Demystifying Diffusion Policies: Action Memorization and Simple Lookup Table Alternatives	Chengyang He et.al.	2505.05787	null
2025-05-09	Insertion Language Models: Sequence Generation with Arbitrary-Position Insertions	Dhruvesh Patel et.al.	2505.05755	null
2025-05-09	Automated Learning of Semantic Embedding Representations for Diffusion Models	Limai Jiang et.al.	2505.05732	null
2025-05-09	Towards Secure Semantic Transmission In the Era of GenAI: A Diffusion-based Framework	Boxiang He et.al.	2505.05724	null
2025-05-09	Semantic-Space-Intervened Diffusive Alignment for Visual Classification	Zixuan Li et.al.	2505.05721	null
2025-05-08	An Efficient Transport-Based Dissimilarity Measure for Time Series Classification under Warping Distortions	Akram Aldroubi et.al.	2505.05676	null
2025-05-08	Unsupervised Blind Speech Separation with a Diffusion Prior	Zhongweiyang Xu et.al.	2505.05657	link
2025-05-08	ReactDance: Progressive-Granular Representation for Long-Term Coherent Reactive Dance Generation	Jingzhong Lin et.al.	2505.05589	null
2025-05-08	SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation	Yonwoo Choi et.al.	2505.05475	link
2025-05-08	3D Scene Generation: A Survey	Beichen Wen et.al.	2505.05474	link
2025-05-08	DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion	Qitao Zhao et.al.	2505.05473	null
2025-05-08	Mogao: An Omni Foundation Model for Interleaved Multi-Modal Generation	Chao Liao et.al.	2505.05472	null
2025-05-08	Flow-GRPO: Training Flow Matching Models via Online RL	Jie Liu et.al.	2505.05470	link
2025-05-08	An efficient second-order cone programming approach for dynamic optimal transport on staggered grid discretization	Liang Chen et.al.	2505.05424	link
2025-05-08	Denoising Diffusion Probabilistic Models for Coastal Inundation Forecasting	Kazi Ashik Islam et.al.	2505.05381	null
2025-05-08	The Ergodic Linear-Quadratic Optimal Control Problems for Stochastic Mean-Field Systems with Periodic Coefficients	Jiacheng Wu et.al.	2505.05296	null
2025-05-08	Diffusion Model Quantization: A Review	Qian Zeng et.al.	2505.05215	link
2025-05-08	EAM: Enhancing Anything with Diffusion Transformers for Blind Super-Resolution	Haizhen Xie et.al.	2505.05209	null
2025-05-08	Overcoming Dimensional Factorization Limits in Discrete Diffusion Models through Quantum Joint Distribution Learning	Chuangtao Chen et.al.	2505.05151	link
2025-05-08	Research on Anomaly Detection Methods Based on Diffusion Models	Yi Chen et.al.	2505.05137	null
2025-05-08	MDAA-Diff: CT-Guided Multi-Dose Adaptive Attention Diffusion Model for PET Denoising	Xiaolong Niu et.al.	2505.05112	null
2025-05-08	MDE-Edit: Masked Dual-Editing for Multi-Object Image Editing via Diffusion Models	Hongyang Zhu et.al.	2505.05101	null
2025-05-08	ItDPDM: Information-Theoretic Discrete Poisson Diffusion Model	Sagnik Bhattacharya et.al.	2505.05082	null
2025-05-07	Score Distillation Sampling for Audio: Source Separation, Synthesis, and Beyond	Jessie Richter-Powell et.al.	2505.04621	null
2025-05-07	Text2CT: Towards 3D CT Volume Generation from Free-text Descriptions Using Diffusion Model	Pengfei Guo et.al.	2505.04522	null
2025-05-07	Efficient Flow Matching using Latent Variables	Anirban Samaddar et.al.	2505.04486	null
2025-05-07	Localized Diffusion Models for High Dimensional Distributions Generation	Georg A. Gottwald et.al.	2505.04417	null
2025-05-07	Discrete Optimal Transport and Voice Conversion	Anton Selitskiy et.al.	2505.04382	null
2025-05-07	Large Deviations and the Peano Phenomenon in Stochastic Differential Equations with Homogeneous Drift	Paola Bermolen et.al.	2505.04377	null
2025-05-07	CountDiffusion: Text-to-Image Synthesis with Training-Free Counting-Guidance Diffusion	Yanyu Li et.al.	2505.04347	null
2025-05-07	Adjoint-based optimal control of jump-diffusion processes	Jan Bartsch et.al.	2505.04328	null
2025-05-07	Beyond entropic regularization: Debiased Gaussian estimators for discrete optimal transport and general linear programs	Shuyu Liu et.al.	2505.04312	null
2025-05-07	MoDE: Mixture of Diffusion Experts for Any Occluded Face Recognition	Qiannan Fan et.al.	2505.04306	null
2025-05-07	TS-Diff: Two-Stage Diffusion Model for Low-Light RAW Image Enhancement	Yi Li et.al.	2505.04281	link
2025-05-07	HDiffTG: A Lightweight Hybrid Diffusion-Transformer-GCN Architecture for 3D Human Pose Estimation	Yajie Fu et.al.	2505.04276	link
2025-05-07	Bridging Geometry-Coherent Text-to-3D Generation with Multi-View Diffusion Priors and Gaussian Splatting	Feng Yang et.al.	2505.04262	null
2025-05-07	Robust Speech Recognition with Schrödinger Bridge-Based Speech Enhancement	Rauf Nasretdinov et.al.	2505.04237	null
2025-05-07	Convergence rate of Euler-Maruyama scheme to the invariant probability measure under total variation distance	Yinna Ye et.al.	2505.04218	null
2025-05-06	Fill the Gap: Quantifying and Reducing the Modality Gap in Image-Text Representation Learning	François Role et.al.	2505.03703	null
2025-05-06	CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting	Huawei Sun et.al.	2505.03679	null
2025-05-06	Vector valued optimal transport: from dynamic to static formulations	Katy Craig et.al.	2505.03670	null
2025-05-06	Distribution-Conditional Generation: From Class Distribution to Creative Generation	Fu Feng et.al.	2505.03667	null
2025-05-06	Bounding Box-Guided Diffusion for Synthesizing Industrial Images and Segmentation Map	Alessandro Simoni et.al.	2505.03623	link
2025-05-07	PAHA: Parts-Aware Audio-Driven Human Animation with Diffusion Model	Y. B. Wang et.al.	2505.03603	null
2025-05-06	Maximum likelihood estimation for the $λ$ -exponential family	Xiwei Tian et.al.	2505.03582	null
2025-05-06	On the non-Markovian quantum stochastic network dynamics	Haijin Ding et.al.	2505.03578	null
2025-05-06	A Comprehensive Survey of Large AI Models for Future Communications: Foundations, Applications and Challenges	Feibo Jiang et.al.	2505.03556	link
2025-05-06	Wasserstein Convergence of Score-based Generative Models under Semiconvexity and Discontinuous Gradients	Stefano Bruno et.al.	2505.03432	null
2025-05-06	Phenotype-Guided Generative Model for High-Fidelity Cardiac MRI Synthesis: Advancing Pretraining and Clinical Applications	Ziyu Li et.al.	2505.03426	null
2025-05-06	Safer Prompts: Reducing IP Risk in Visual Generative AI	Lena Reissinger et.al.	2505.03338	null
2025-05-06	FLUX-Text: A Simple and Advanced Diffusion Transformer Baseline for Scene Text Editing	Rui Lan et.al.	2505.03329	link
2025-05-06	Mamba-Diffusion Model with Learnable Wavelet for Controllable Symbolic Music Generation	Jincheng Zhang et.al.	2505.03314	link
2025-05-06	A piston to counteract diffusion: The influence of an inward-shifting boundary on the heat equation in half-space	Samuel Tréton et.al.	2505.03304	null
2025-05-05	Towards Dataset Copyright Evasion Attack against Personalized Text-to-Image Diffusion Models	Kuofeng Gao et.al.	2505.02824	link
2025-05-05	Advancing Generalizable Tumor Segmentation with Anomaly-Aware Open-Vocabulary Attention Maps and Frozen Foundation Diffusion Models	Yankai Jiang et.al.	2505.02753	link
2025-05-06	MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation	Mingcheng Li et.al.	2505.02648	null
2025-05-06	Resolving Memorization in Empirical Diffusion Model for Manifold Data in High-Dimensional Spaces	Yang Lyu et.al.	2505.02508	null
2025-05-05	Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal Interaction	Biao Gong et.al.	2505.02471	link
2025-05-05	Predicting the Dynamics of Complex System via Multiscale Diffusion Autoencoder	Ruikun Li et.al.	2505.02450	null
2025-05-05	T2S: High-resolution Time Series Generation with Text-to-Series Diffusion Models	Yunfeng Ge et.al.	2505.02417	link
2025-05-04	Universal Approximation Theorem of Deep Q-Networks	Qian Qi et.al.	2505.02288	null
2025-05-04	Enhancing AI Face Realism: Cost-Efficient Quality Improvement in Distilled Diffusion Models with a Fully Synthetic Dataset	Jakub Wąsala et.al.	2505.02255	null
2025-05-04	Quantizing Diffusion Models from a Sampling-Aware Perspective	Qian Zeng et.al.	2505.02242	null
2025-05-06	Regression is all you need for medical image translation	Sebastian Rassmann et.al.	2505.02048	link
2025-05-03	OT-Talk: Animating 3D Talking Head with Optimal Transportation	Xinmu Wang et.al.	2505.01932	null
2025-05-03	Discrete Spatial Diffusion: Intensity-Preserving Diffusion Modeling	Javier E. Santos et.al.	2505.01917	null
2025-05-03	Rethinking Score Distilling Sampling for 3D Editing and Generation	Xingyu Miao et.al.	2505.01888	null
2025-05-03	DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic Fusion	Haoteng Li et.al.	2505.01857	null
2025-05-02	VIDSTAMP: A Temporally-Aware Watermark for Ownership and Integrity in Video Diffusion Models	Mohammadreza Teymoorianfard et.al.	2505.01406	link
2025-05-02	Provable Efficiency of Guidance in Diffusion Models for General Data Distribution	Gen Li et.al.	2505.01382	null
2025-05-02	FreeInsert: Disentangled Text-Guided Object Insertion in 3D Gaussian Scene without Spatial Priors	Chenxi Li et.al.	2505.01322	null
2025-05-02	Model See Model Do: Speech-Driven Facial Animation with Style Control	Yifang Pan et.al.	2505.01319	null
2025-05-05	Enabling Training-Free Semantic Communication Systems with Generative Diffusion Models	Shunpu Tang et.al.	2505.01209	null
2025-05-02	Fast Flow-based Visuomotor Policies via Conditional Optimal Transport Couplings	Andreas Sochopoulos et.al.	2505.01179	null
2025-05-02	FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Principal Component Analysis	Jiangtong Tan et.al.	2505.01172	link
2025-05-02	VSC: Visual Search Compositional Text-to-Image Diffusion Model	Do Huu Dat et.al.	2505.01104	null
2025-05-02	Integration Matters for Learning PDEs with Backwards SDEs	Sungje Park et.al.	2505.01078	link
2025-05-02	Multi-Step Consistency Models: Fast Generation with Theoretical Guarantees	Nishant Jain et.al.	2505.01049	null
2025-05-02	Where’s the liability in the Generative Era? Recovery-based Black-Box Detection of AI-Generated Content	Haoyue Bai et.al.	2505.01008	null
2025-05-02	Tree-Sliced Wasserstein Distance with Nonlinear Projection	Thanh Tran et.al.	2505.00968	null
2025-05-01	Controllable Weather Synthesis and Removal with Video Diffusion Models	Chih-Hao Lin et.al.	2505.00704	null
2025-05-01	GuideSR: Rethinking Guidance for One-Step High-Fidelity Diffusion-Based Super-Resolution	Aditya Arora et.al.	2505.00687	null
2025-05-01	The local coupling of noise technique and its application to lower error bounds for strong approximation of SDEs with irregular coefficients	Simon Ellinger et.al.	2505.00656	null
2025-05-01	Combining LLMs with Logic-Based Framework to Explain MCTS	Ziyan An et.al.	2505.00610	null
2025-05-01	ParkDiffusion: Heterogeneous Multi-Agent Multi-Modal Trajectory Prediction for Automated Parking using Diffusion Models	Jiarong Wei et.al.	2505.00586	null
2025-05-01	Safety-Critical Traffic Simulation with Guided Latent Diffusion Model	Mingxing Peng et.al.	2505.00515	null
2025-05-01	Robust Parameter Estimation in Dynamical Systems by Stochastic Differential Equations	Qingchuan Sun et.al.	2505.00491	null
2025-05-01	Lévy processes under level-dependent Poissonian switching	Noah Beelders et.al.	2505.00453	null
2025-05-01	Leveraging Pretrained Diffusion Models for Zero-Shot Part Assembly	Ruiyuan Zhang et.al.	2505.00426	null
2025-05-01	SOTA: Spike-Navigated Optimal TrAnsport Saliency Region Detection in Composite-bias Videos	Wenxuan Liu et.al.	2505.00394	null
2025-05-01	Denoising weak lensing mass maps with diffusion model: systematic comparison with generative adversarial network	Shohei D. Aoyama et.al.	2505.00345	null
2025-05-01	Quaternion Wavelet-Conditioned Diffusion Models for Image Super-Resolution	Luigi Sigillo et.al.	2505.00334	null
2025-05-01	Feature preserving data assimilation via feature alignment	Amit N. Subrahmanya et.al.	2505.00249	null
2025-05-01	Affine constraints in non-reversible diffusions with degenerate noise	Carsten Hartmann et.al.	2505.00243	null
2025-04-30	Bayesian Discrepancy Measure: Higher-order and Skewed approximations	Elena Bortolato et.al.	2505.00185	null
2025-04-30	ReVision: High-Quality, Low-Cost Video Generation with Explicit 3D Physics Modeling for Complex Motion and Interaction	Qihao Liu et.al.	2504.21855	null
2025-04-30	Reconciling Discrete-Time Mixed Policies and Continuous-Time Relaxed Controls in Reinforcement Learning and Stochastic Control	Rene Carmona et.al.	2504.21793	null
2025-04-30	HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation	Haiyang Zhou et.al.	2504.21650	link
2025-04-30	Diffusion-based Adversarial Identity Manipulation for Facial Privacy Protection	Liqin Wang et.al.	2504.21646	null
2025-04-30	ODE and PDE models for COVID-19, with reinfection and vaccination process for Cameroon and Germany	Hamadjam Abboubakar et.al.	2504.21613	null
2025-04-30	Latent Feature-Guided Conditional Diffusion for High-Fidelity Generative Image Semantic Communication	Zehao Chen et.al.	2504.21577	null
2025-04-30	Regularity properties of densities of SDEs using the Fourier analytic approach	Simon Ellinger et.al.	2504.21516	null
2025-04-30	MagicPortrait: Temporally Consistent Face Reenactment with 3D Geometric Guidance	Mengting Wei et.al.	2504.21497	link
2025-04-30	DGSolver: Diffusion Generalist Solver with Universal Posterior Sampling for Image Restoration	Hebaixu Wang et.al.	2504.21487	link
2025-04-30	Existence and non-existence of the CLT for a family of SDEs driven by stable process	Yingjun Mo et.al.	2504.21430	null
2025-04-30	Diff-Prompt: Diffusion-Driven Prompt Generator with Mask Supervision	Weicai Yan et.al.	2504.21423	null
2025-04-30	IDDM: Bridging Synthetic-to-Real Domain Gap from Physics-Guided Diffusion for Real-world Image Dehazing	Shijun Zhou et.al.	2504.21385	null
2025-04-30	Sparse-to-Sparse Training of Diffusion Models	Inês Cardoso Oliveira et.al.	2504.21380	null
2025-04-30	Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing	Hong Zhang et.al.	2504.21356	link
2025-04-30	Text-Conditioned Diffusion Model for High-Fidelity Korean Font Generation	Abdul Sami et.al.	2504.21325	null
2025-04-29	AI-GenBench: A New Ongoing Benchmark for AI-Generated Image Detection	Lorenzo Pellegrini et.al.	2504.20865	null
2025-04-29	Chaos Meets Attention: Transformers for Large-Scale Dynamical Prediction	Yi He et.al.	2504.20858	link
2025-04-29	SoccerDiffusion: Toward Learning End-to-End Humanoid Robot Soccer from Gameplay Recordings	Florian Vahl et.al.	2504.20808	null
2025-04-29	Semi-discrete optimal transport techniques for the compressible semi-geostrophic equations	David P. Bourne et.al.	2504.20807	null
2025-04-29	JTreeformer: Graph-Transformer via Latent-Diffusion Model for Molecular Generation	Ji Shi et.al.	2504.20770	null
2025-04-29	DDPS: Discrete Diffusion Posterior Sampling for Paths in Layered Graphs	Hao Luan et.al.	2504.20754	null
2025-04-29	On optimal error rates for strong approximation of SDEs with a Hölder continuous drift coefficient	Simon Ellinger et.al.	2504.20728	null
2025-04-29	LDPoly: Latent Diffusion for Polygonal Road Outline Extraction in Large-Scale Topographic Mapping	Weiqin Jiao et.al.	2504.20645	null
2025-04-29	DiffusionRIR: Room Impulse Response Interpolation using Diffusion Models	Sagi Della Torre et.al.	2504.20625	null
2025-04-29	TriniMark: A Robust Generative Speech Watermarking Method for Trinity-Level Attribution	Yue Li et.al.	2504.20532	null
2025-04-29	Dynamic Attention Analysis for Backdoor Detection in Text-to-Image Diffusion Models	Zhongqi Wang et.al.	2504.20518	link
2025-04-29	Reviving Any-Subset Autoregressive Models with Principled Parallel Sampling and Speculative Decoding	Gabe Guo et.al.	2504.20456	link
2025-04-29	ADiff4TPP: Asynchronous Diffusion Models for Temporal Point Processes	Amartya Mukherjee et.al.	2504.20411	null
2025-04-28	Image Interpolation with Score-based Riemannian Metrics of Diffusion Models	Shinnosuke Saito et.al.	2504.20288	null
2025-04-28	Generative Diffusion Models for Resource Allocation in Wireless Networks	Yigit Berkay Uslu et.al.	2504.20277	null
2025-04-28	DeeCLIP: A Robust and Generalizable Transformer-Based Framework for Detecting AI-Generated Images	Mamadou Keita et.al.	2504.19876	link
2025-04-28	CoherenDream: Boosting Holistic Text Coherence in 3D Generation via Multimodal Large Language Models Feedback	Chenhan Jiang et.al.	2504.19860	null
2025-04-28	A high-order recombination algorithm for weak approximation of stochastic differential equations	Syoiti Ninomiya et.al.	2504.19717	null
2025-04-28	Multimodal Conditioned Diffusive Time Series Forecasting	Chen Su et.al.	2504.19669	null
2025-04-28	Robot Motion Planning using One-Step Diffusion with Noise-Optimized Approximate Motions	Tomoharu Aizu et.al.	2504.19652	null
2025-04-28	AI Alignment in Medical Imaging: Unveiling Hidden Biases Through Counterfactual Analysis	Haroui Ma et.al.	2504.19621	link
2025-04-28	Image Generation Method Based on Heat Diffusion Models	Pengfei Zhang et.al.	2504.19600	null
2025-04-28	GenPTW: In-Generation Image Watermarking for Provenance Tracing and Tamper Localization	Zhenliang Gan et.al.	2504.19567	null
2025-04-28	SynergyAmodal: Deocclude Anything with Text Control	Xinyang Li et.al.	2504.19506	null
2025-04-28	Simultaneous Pick and Place Detection by Combining SE(3) Diffusion Models with Differential Kinematics	Tianyi Ko et.al.	2504.19502	null
2025-04-28	GTSD: Generative Text Steganography Based on Diffusion Model	Zhengxian Wu et.al.	2504.19433	null
2025-04-28	Boosting 3D Liver Shape Datasets with Diffusion Models and Implicit Neural Representations	Khoa Tuan Nguyen et.al.	2504.19402	null
2025-04-27	Metric Similarity and Manifold Learning of Circular Dichroism Spectra of Proteins	Gionni Marchetti et.al.	2504.19355	null
2025-04-27	Sketch2Anim: Towards Transferring Sketch Storyboards into 3D Animation	Lei Zhong et.al.	2504.19189	null
2025-04-27	Optimal dividends for a NatCat insurer in the presence of a climate tipping point	Hansjoerg Albrecher et.al.	2504.19151	null
2025-04-25	Revisiting Data Auditing in Large Vision-Language Models	Hongyu Zhu et.al.	2504.18349	null
2025-04-25	SSD-Poser: Avatar Pose Estimation with State Space Duality from Sparse Observations	Shuting Zhao et.al.	2504.18332	null
2025-04-25	STP4D: Spatio-Temporal-Prompt Consistent Modeling for Text-to-4D Gaussian Splatting	Yunze Deng et.al.	2504.18318	null
2025-04-25	Optimizing Multi-Round Enhanced Training in Diffusion Models for Improved Preference Understanding	Kun Li et.al.	2504.18204	null
2025-04-25	Generative AI for Physical-Layer Authentication	Rui Meng et.al.	2504.18175	null
2025-04-25	Disentangle Identity, Cooperate Emotion: Correlation-Aware Emotional Talking Portrait Generation	Weipeng Tan et.al.	2504.18087	null
2025-04-25	Enhancing Privacy-Utility Trade-offs to Mitigate Memorization in Diffusion Models	Chen Chen et.al.	2504.18032	null
2025-04-25	Diffusion-Driven Universal Model Inversion Attack for Face Recognition	Hanrui Wang et.al.	2504.18015	null
2025-04-24	Ergodic control of McKean-Vlasov systems on the Wasserstein space	Marco Fuhrman et.al.	2504.17958	null
2025-04-24	DCT-Shield: A Robust Frequency Domain Defense against Malicious Image Editing	Aniruddha Bala et.al.	2504.17894	null
2025-04-24	Flow Matching Ergodic Coverage	Max Muchen Sun et.al.	2504.17872	null
2025-04-24	LiDPM: Rethinking Point Diffusion for Lidar Scene Completion	Tetiana Martyniuk et.al.	2504.17791	null
2025-04-27	Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models	Xu Ma et.al.	2504.17789	null
2025-04-24	Embedding Empirical Distributions for Computing Optimal Transport Maps	Mingchen Jiang et.al.	2504.17740	null
2025-04-27	UNILoc: Unified Localization Combining Model-Based Geometry and Unsupervised Learning	Yuhao Zhang et.al.	2504.17676	null
2025-04-24	polyGen: A Learning Framework for Atomic-level Polymer Structure Generation	Ayush Jain et.al.	2504.17656	null
2025-04-24	Beyond Labels: Zero-Shot Diabetic Foot Ulcer Wound Segmentation with Self-attention Diffusion Models and the Potential for Text-Guided Customization	Abderrachid Hamrani et.al.	2504.17628	null
2025-04-24	TarDiff: Target-Oriented Diffusion Guidance for Synthetic Electronic Health Record Time Series Generation	Bowen Deng et.al.	2504.17613	null
2025-04-24	Convex order and increasing convex order for McKean-Vlasov processes with common noise	Armand Bernou et.al.	2504.17576	null
2025-04-24	ESDiff: Encoding Strategy-inspired Diffusion Model with Few-shot Learning for Color Image Inpainting	Junyan Zhang et.al.	2504.17524	null
2025-04-24	3DV-TON: Textured 3D-Guided Consistent Video Try-on via Diffusion Models	Min Wei et.al.	2504.17414	null
2025-04-24	DRC: Enhancing Personalized Image Generation via Disentangled Representation Composition	Yiyan Xu et.al.	2504.17349	null
2025-04-24	CKMDiff: A Generative Diffusion Model for CKM Construction via Inverse Problems with Learned Priors	Shen Fu et.al.	2504.17323	null
2025-04-24	Towards Generalized and Training-Free Text-Guided Semantic Manipulation	Yu Hong et.al.	2504.17269	null
2025-04-24	DIVE: Inverting Conditional Diffusion Models for Discriminative Tasks	Yinqi Li et.al.	2504.17253	link
2025-04-24	AUTHENTICATION: Identifying Rare Failure Modes in Autonomous Vehicle Perception Systems using Adversarially Guided Diffusion Models	Mohammad Zarei et.al.	2504.17179	null
2025-04-23	Practical approaches for crystal structure predictions with inpainting generation and universal interatomic potentials	Peichen Zhong et.al.	2504.16893	null
2025-04-23	Computing Optimal Transport Plans via Min-Max Gradient Flows	Lauren Conger et.al.	2504.16890	link
2025-04-23	Planning with Diffusion Models for Target-Oriented Dialogue Systems	Hanwen Du et.al.	2504.16858	null
2025-04-23	Physically Consistent Humanoid Loco-Manipulation using Latent Diffusion Models	Ilyass Taouil et.al.	2504.16843	null
2025-04-24	Simple Graph Contrastive Learning via Fractional-order Neural Diffusion Networks	Yanan Zhao et.al.	2504.16748	null
2025-04-23	MOSAIC: A Skill-Centric Algorithmic Framework for Long-Horizon Manipulation Planning	Itamar Mishani et.al.	2504.16738	null
2025-04-23	Revisiting Regret Benchmarks in Online Non-Stochastic Control	Vijeth Hebbar et.al.	2504.16581	null
2025-04-24	Hyper-Transforming Latent Diffusion Models	Ignacio Peis et.al.	2504.16580	null
2025-04-23	A Comprehensive Survey of Synthetic Tabular Data Generation	Ruxue Shi et.al.	2504.16506	link
2025-04-23	The Dance of Atoms-De Novo Protein Design with Diffusion Model	Yujie Qin et.al.	2504.16479	null
2025-04-23	Target Concrete Score Matching: A Holistic Framework for Discrete Diffusion	Ruixiang Zhang et.al.	2504.16431	null
2025-04-23	VideoMark: A Distortion-Free Robust Watermarking Framework for Video Diffusion Models	Xuming Hu et.al.	2504.16359	null
2025-04-22	SignX: The Foundation Model for Sign Recognition	Sen Fang et.al.	2504.16315	null
2025-04-22	Survey of Video Diffusion Models: Foundations, Implementations, and Applications	Yimu Wang et.al.	2504.16081	link
2025-04-22	From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning	Le Zhuo et.al.	2504.16080	null
2025-04-22	Intent-aware Diffusion with Contrastive Learning for Sequential Recommendation	Yuanpeng Qu et.al.	2504.16077	link
2025-04-22	Boosting Generative Image Modeling via Joint Image-Feature Synthesis	Theodoros Kouzelis et.al.	2504.16064	null
2025-04-22	Efficient Temporal Consistency in Diffusion-Based Video Editing with Adaptor Modules: A Theoretical Framework	Xinyuan Song et.al.	2504.16016	null
2025-04-22	Adversarial Observations in Weather Forecasting	Erik Imgrund et.al.	2504.15942	link
2025-04-22	The area of spheres in the Brownian plane	Jean-François Le Gall et.al.	2504.15860	null
2025-04-22	Text-based Animatable 3D Avatars with Morphable Model Alignment	Yiqian Wu et.al.	2504.15835	link
2025-04-22	Satellite to GroundScape – Large-scale Consistent Ground View Generation from Satellite Views	Ningli Xu et.al.	2504.15786	null
2025-04-22	Clifford Group Equivariant Diffusion Models for 3D Molecular Generation	Cong Liu et.al.	2504.15773	null
2025-04-22	Riemannian Neural Geodesic Interpolant	Jiawen Wu et.al.	2504.15736	null
2025-04-22	Structure-Preserving Zero-Shot Image Editing via Stage-Wise Latent Injection in Diffusion Models	Dasol Jeong et.al.	2504.15723	null
2025-04-22	RadioDiff- $k^2$ : Helmholtz Equation Informed Generative Diffusion Model for Multi-Path Aware Radio Map Construction	Xiucheng Wang et.al.	2504.15623	null
2025-04-22	InstaRevive: One-Step Image Enhancement via Dynamic Score Matching	Yixuan Zhu et.al.	2504.15513	null
2025-04-21	Emergence and Evolution of Interpretable Concepts in Diffusion Models	Berk Tinaz et.al.	2504.15473	null
2025-04-21	Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction	Vaishnavh Nagarajan et.al.	2504.15266	link
2025-04-21	Bringing Diversity from Diffusion Models to Semantic-Guided Face Asset Generation	Yunxuan Cai et.al.	2504.15259	null
2025-04-21	DRAGON: Distributional Rewards Optimize Diffusion Generative Models	Yatong Bai et.al.	2504.15217	null
2025-04-21	FaceCraft4D: Animated 3D Facial Avatar Generation from a Single Image	Fei Yin et.al.	2504.15179	null
2025-04-21	DSPO: Direct Semantic Preference Optimization for Real-World Image Super-Resolution	Miaomiao Cai et.al.	2504.15176	null
2025-04-21	Automatic Generation of Aerobatic Flight in Complex Environments via Diffusion Models	Yuhang Zhong et.al.	2504.15138	null
2025-04-21	$\mathbb{L}^p$-solutions $(1 <p< 2)$ for reflected BSDEs with general jumps and stochastic monotone generators	Badr Elmansouri et.al.	2504.15136	null
2025-04-22	VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation	Mingxia Zhan et.al.	2504.15095	null
2025-04-21	Generative Artificial Intelligence for Beamforming in Low-Altitude Economy	Geng Sun et.al.	2504.15079	null
2025-04-21	SOLIDO: A Robust Watermarking Method for Speech Synthesis via Low-Rank Adaptation	Yue Li et.al.	2504.15035	null
2025-04-21	Gaussian Shading++: Rethinking the Realistic Deployment Challenge of Performance-Lossless Image Watermark for Diffusion Models	Zijin Yang et.al.	2504.15026	null
2025-04-21	PIV-FlowDiffuser:Transfer-learning-based denoising diffusion models for PIV	Qianyu Zhu et.al.	2504.14952	link
2025-04-21	TWIG: Two-Step Image Generation using Segmentation Masks in Diffusion Models	Mazharul Islam Rakib et.al.	2504.14933	null
2025-04-21	Some Optimizers are More Equal: Understanding the Role of Optimizers in Group Fairness	Mojtaba Kolahdouzi et.al.	2504.14882	null
2025-04-21	Uncertainty quantification of neural network models of evolving processes via Langevin sampling	Cosmin Safta et.al.	2504.14854	null
2025-04-18	Collective Learning Mechanism based Optimal Transport Generative Adversarial Network for Non-parallel Voice Conversion	Sandipan Dhar et.al.	2504.13791	null
2025-04-18	Decoding Vision Transformers: the Diffusion Steering Lens	Ryota Takatsuki et.al.	2504.13763	link
2025-04-18	ESPLoRA: Enhanced Spatial Precision with Low-Rank Adaption in Text-to-Image Diffusion Models for High-Definition Synthesis	Andrea Rigo et.al.	2504.13745	null
2025-04-18	Simulating Before Planning: Constructing Intrinsic User World Model for User-Tailored Dialogue Policy Planning	Tao He et.al.	2504.13643	null
2025-04-18	SupResDiffGAN a new approach for the Super-Resolution task	Dawid Kopeć et.al.	2504.13622	null
2025-04-18	Entropic Time Schedulers for Generative Diffusion Models	Dejan Stancevic et.al.	2504.13612	null
2025-04-18	WeatherGen: A Unified Diverse Weather Generator for LiDAR Point Clouds via Spider Mamba Diffusion	Yang Wu et.al.	2504.13561	link
2025-04-18	Task Assignment and Exploration Optimization for Low Altitude UAV Rescue via Generative AI Enhanced Multi-agent Reinforcement Learning	Xin Tang et.al.	2504.13554	null
2025-04-18	Beyond One-Hot Labels: Semantic Mixing for Model Calibration	Haoyang Luo et.al.	2504.13548	link
2025-04-18	Continuous-time filtering in Lie groups: estimation via the Fr{é}chet mean of solutions to stochastic differential equations	Magalie Bénéfice et.al.	2504.13502	null
2025-04-18	U-Shape Mamba: State Space Model for faster diffusion	Alex Ergasti et.al.	2504.13499	link
2025-04-18	Open-Loop and Closed-Loop Strategies for Linear Quadratic Mean Field Games: The Direct Approach	Yong Liang et.al.	2504.13496	null
2025-04-18	Early Timestep Zero-Shot Candidate Selection for Instruction-Guided Image Editing	Joowon Kim et.al.	2504.13490	null
2025-04-17	SMPL-GPTexture: Dual-View 3D Human Texture Estimation using Text-to-Image Generation Models	Mingxiao Tu et.al.	2504.13378	null
2025-04-17	On the minimax optimality of Flow Matching through the connection to kernel density estimation	Lea Kunkel et.al.	2504.13336	null
2025-04-17	Personalized Text-to-Image Generation with Auto-Regressive Models	Kaiyue Sun et.al.	2504.13162	link
2025-04-17	UniEdit-Flow: Unleashing Inversion and Editing in the Era of Flow Models	Guanlong Jiao et.al.	2504.13109	null
2025-04-18	SkyReels-V2: Infinite-length Film Generative Model	Guibin Chen et.al.	2504.13074	link
2025-04-17	TTRD3: Texture Transfer Residual Denoising Dual Diffusion Model for Remote Sensing Image Super-Resolution	Yide Liu et.al.	2504.13026	link
2025-04-17	A uniform particle approximation to the Navier-Stokes-alpha models in three dimensions with advection noise	Filippo Giovagnini et.al.	2504.12960	null
2025-04-17	Sliced-Wasserstein Distance-based Data Selection	Julien Pallage et.al.	2504.12918	null
2025-04-17	Image-Editing Specialists: An RLAIF Approach for Diffusion Models	Elior Benarous et.al.	2504.12833	link
2025-04-17	Saliency-Aware Diffusion Reconstruction for Effective Invisible Watermark Removal	Inzamamul Alam et.al.	2504.12809	link
2025-04-17	Privacy Protection Against Personalized Text-to-Image Synthesis via Cross-image Consistency Constraints	Guanyu Wang et.al.	2504.12747	null
2025-04-17	Efficient Primal-dual Forward-backward Splitting Method for Wasserstein-like Gradient Flows with General Nonlinear Mobilities	Yunhong Deng et.al.	2504.12713	null
2025-04-17	Tangent Space Parametrization for Stochastic Differential Equations on SO(n)	Xi Wang et.al.	2504.12650	null
2025-04-17	A0: An Affordance-Aware Hierarchical Model for General Robotic Manipulation	Rongtao Xu et.al.	2504.12636	null
2025-04-17	Geometry-preserving Numerical Scheme for Riemannian Stochastic Differential Equations	Xi Wang et.al.	2504.12631	null
2025-04-17	Packing Input Frame Context in Next-Frame Prediction Models for Video Generation	Lvmin Zhang et.al.	2504.12626	link
2025-04-17	Prompt-Driven and Training-Free Forgetting Approach and Dataset for Large Language Models	Zhenyu Yu et.al.	2504.12574	null
2025-04-16	Cobra: Efficient Line Art COlorization with BRoAder References	Junhao Zhuang et.al.	2504.12240	null
2025-04-16	Coding-Prior Guided Diffusion Network for Video Deblurring	Yike Liu et.al.	2504.12222	null
2025-04-16	Anti-Aesthetics: Protecting Facial Privacy against Customized Text-to-Image Synthesis	Songping Wang et.al.	2504.12129	null
2025-04-16	A Diffusion-Based Framework for Terrain-Aware Remote Sensing Image Reconstruction	Zhenyu Yu et.al.	2504.12112	null
2025-04-16	Generalized Visual Relation Detection with Diffusion Models	Kaifeng Gao et.al.	2504.12100	null
2025-04-16	Modular-Cam: Modular Dynamic Camera-view Video Generation with LLM	Zirui Pan et.al.	2504.12048	null
2025-04-17	Understanding Attention Mechanism in Video Diffusion Models	Bingyan Liu et.al.	2504.12027	null
2025-04-17	Dual-Energy Cone-Beam CT Using Two Orthogonal Projection Views: A Phantom Study	Junbo Peng et.al.	2504.12010	null
2025-04-16	Generative Recommendation with Continuous-Token Diffusion	Haohao Qu et.al.	2504.12007	null
2025-04-16	R-Meshfusion: Reinforcement Learning Powered Sparse-View Mesh Reconstruction with Diffusion Priors	Haoyang Wang et.al.	2504.11946	null
2025-04-16	Exact noise and dissipation operators for quantum stochastic thermodynamics	Stefano Giordano et.al.	2504.11938	null
2025-04-16	SemDiff: Generating Natural Unrestricted Adversarial Examples via Semantic Attributes Optimization in Diffusion Models	Zeyu Dai et.al.	2504.11923	null
2025-04-16	A Bidirectional DeepParticle Method for Efficiently Solving Low-dimensional Transport Map Problems	Tan Zhang et.al.	2504.11851	null
2025-04-16	ACE: Attentional Concept Erasure in Diffusion Models	Finn Carter et.al.	2504.11850	null
2025-04-16	TextDiffSeg: Text-guided Latent Diffusion Model for 3d Medical Images Segmentation	Kangbo Ma et.al.	2504.11825	null
2025-04-15	Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception	Ziqi Pang et.al.	2504.11457	link
2025-04-16	Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion	An Zhao et.al.	2504.11447	link
2025-04-15	NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors	Yanrui Bin et.al.	2504.11427	null
2025-04-15	ADT: Tuning Diffusion Models with Adversarial Supervision	Dazhong Shen et.al.	2504.11423	null
2025-04-15	VideoPanda: Video Panoramic Diffusion with Multi-view Attention	Kevin Xie et.al.	2504.11389	null
2025-04-15	Autoregressive Distillation of Diffusion Transformers	Yeongmin Kim et.al.	2504.11295	link
2025-04-15	Simulation-based inference for stochastic nonlinear mixed-effects models with applications in systems biology	Henrik Häggström et.al.	2504.11279	link
2025-04-15	SAR-to-RGB Translation with Latent Diffusion for Earth Observation	Kaan Aydin et.al.	2504.11154	null
2025-04-15	Taming Consistency Distillation for Accelerated Human Image Animation	Xiang Wang et.al.	2504.11143	null
2025-04-15	A definition of the background state of the atmosphere using optimal transport	Charlie Egan et.al.	2504.11141	null
2025-04-15	Hessian stability and convergence rates for entropic and Sinkhorn potentials via semiconcavity	Giacomo Greco et.al.	2504.11133	null
2025-04-15	Defending Against Frequency-Based Attacks with Diffusion Models	Fatemeh Amerehi et.al.	2504.11034	null
2025-04-15	AnimeDL-2M: Million-Scale AI-Generated Anime Image Detection and Localization in Diffusion Era	Chenyang Zhu et.al.	2504.11015	null
2025-04-15	TMCIR: Token Merge Benefits Composed Image Retrieval	Chaoyang Wang et.al.	2504.10995	null
2025-04-15	ProtFlow: Fast Protein Sequence Design via Flow Matching on Compressed Protein Language Model Embeddings	Zitai Kong et.al.	2504.10983	null
2025-04-14	REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers	Xingjian Leng et.al.	2504.10483	null
2025-04-15	Maximum entropy modeling of Optimal Transport: the sub-optimality regime and the transition from dense to sparse networks	Lorenzo Buffa et.al.	2504.10444	null
2025-04-14	Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing	Taihang Hu et.al.	2504.10434	link
2025-04-14	MonoDiff9D: Monocular Category-Level 9D Object Pose Estimation via Diffusion Model	Jian Liu et.al.	2504.10433	link
2025-04-14	Improving diffusion modeling in all-solid-state lithium batteries: a novel approach for grain boundary effects	Lena Scholz et.al.	2504.10348	null
2025-04-14	DiffMOD: Progressive Diffusion Point Denoising for Moving Object Detection in Remote Sensing	Jinyue Zhang et.al.	2504.10278	null
2025-04-14	Efficient Generative Model Training via Embedded Representation Warmup	Deyuan Liu et.al.	2504.10188	link
2025-04-14	NaviDiffusor: Cost-Guided Diffusion Model for Visual Navigation	Yiming Zeng et.al.	2504.10003	null
2025-04-15	OctGPT: Octree-based Multiscale Autoregressive Models for 3D Shape Generation	Si-Tong Wei et.al.	2504.09975	link
2025-04-14	Semi-implicit-explicit Runge-Kutta method for nonlinear differential equations	Lingyun Ding et.al.	2504.09969	link
2025-04-14	Efficient Task-specific Conditional Diffusion Policies: Shortcut Model Acceleration and SO(3) Optimization	Haiyong Yu et.al.	2504.09927	null
2025-04-14	Separate to Collaborate: Dual-Stream Diffusion Model for Coordinated Piano Hand Motion Synthesis	Zihao Liu et.al.	2504.09885	null
2025-04-14	Density-based Object Detection in Crowded Scenes	Chenyang Zhao et.al.	2504.09819	null
2025-04-14	Optimizing disorder with machine learning to harness synchronization	Jun-Yin Huang et.al.	2504.09808	null
2025-04-14	EquiVDM: Equivariant Video Diffusion Models with Temporally Consistent Noise	Chao Liu et.al.	2504.09789	null
2025-04-11	Deriving the Gradients of Some Popular Optimal Transport Algorithms	Fangzhou Xie et.al.	2504.08722	null
2025-04-11	Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model	Team Seawead et.al.	2504.08685	null
2025-04-11	Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization	Jialu Li et.al.	2504.08641	null
2025-04-11	Discretization Error Analysis of a High Order Unfitted Space-Time Method for moving domain problems	Fabian Heimann et.al.	2504.08608	null
2025-04-11	Neural Fidelity Calibration for Informative Sim-to-Real Adaptation	Youwei Yu et.al.	2504.08604	null
2025-04-11	ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration	Yongsheng Yu et.al.	2504.08591	null
2025-04-11	COP-GEN-Beta: Unified Generative Modelling of COPernicus Imagery Thumbnails	Miguel Espinosa et.al.	2504.08548	null
2025-04-11	Slicing the Gaussian Mixture Wasserstein Distance	Moritz Piening et.al.	2504.08544	link
2025-04-11	Discriminator-Free Direct Preference Optimization for Video Diffusion	Haoran Cheng et.al.	2504.08542	null
2025-04-11	Controlled stochastic processes for simulated annealing	Vincent Molin et.al.	2504.08506	link
2025-04-11	Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data Generation	Bram Vanherle et.al.	2504.08473	link
2025-04-11	On the Design of Diffusion-based Neural Speech Codecs	Pietro Foti et.al.	2504.08470	null
2025-04-11	Muon-Accelerated Attention Distillation for Real-Time Edge Synthesis via Optimized Latent Diffusion	Weiye Chen et.al.	2504.08451	link
2025-04-11	Diffusion Models for Robotic Manipulation: A Survey	Rosa Wolf et.al.	2504.08438	null
2025-04-11	Single View Garment Reconstruction Using Diffusion Mapping Via Pattern Coordinates	Ren Li et.al.	2504.08353	link
2025-04-10	Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction	Zeren Jiang et.al.	2504.07961	link
2025-04-10	VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning	Zhong-Yu Li et.al.	2504.07960	null
2025-04-10	GenEAva: Generating Cartoon Avatars with Fine-Grained Facial Expressions from Realistic Diffusion-based Faces	Hao Yu et.al.	2504.07945	null
2025-04-10	Optimal Control For Anti-Abeta Treatment in Alzheimer’s Disease using a Reaction-Diffusion Model	Wenrui Hao et.al.	2504.07913	null
2025-04-10	Revisiting Likelihood-Based Out-of-Distribution Detection by Modeling Representations	Yifan Ding et.al.	2504.07793	link
2025-04-10	Virtual-mask Informed Prior for Sparse-view Dual-Energy CT Reconstruction	Zini Chen et.al.	2504.07753	null
2025-04-10	Merging Embedded Topics with Optimal Transport for Online Topic Modeling on Data Streams	Federica Granese et.al.	2504.07711	null
2025-04-10	PhaseGen: A Diffusion-Based Approach for Complex-Valued MRI Data Generation	Moritz Rempe et.al.	2504.07560	link
2025-04-10	STeP: A General and Scalable Framework for Solving Video Inverse Problems with Spatiotemporal Diffusion Priors	Bingliang Zhang et.al.	2504.07549	link
2025-04-10	A mass conserved reaction-diffusion system reveals switching between coexisting polar and oscillatory cell motility states	Jack M. Hughes et.al.	2504.07446	null
2025-04-10	Unifying and extending Diffusion Models through PDEs for solving Inverse Problems	Agnimitra Dasgupta et.al.	2504.07437	null
2025-04-10	Conditional Data Synthesis Augmentation	Xinyu Tian et.al.	2504.07426	null
2025-04-10	Routing to the Right Expertise: A Trustworthy Judge for Instruction-based Image Editing	Chenxi Sun et.al.	2504.07424	null
2025-04-10	ID-Booth: Identity-consistent Face Generation with Diffusion Models	Darian Tomašević et.al.	2504.07392	link
2025-04-10	Novel Diffusion Models for Multimodal 3D Hand Trajectory Prediction	Junyi Ma et.al.	2504.07375	link
2025-04-09	Identifying Unknown Stochastic Dynamics via Finite expression methods	Senwei Liang et.al.	2504.07085	null
2025-04-09	Latent Diffusion U-Net Representations Contain Positional Embeddings and Anomalies	Jonas Loos et.al.	2504.07008	link
2025-04-09	Tractable reformulations of DRO problems over structured optimal transport ambiguity sets	Lotfi M. Chaouach et.al.	2504.06966	null
2025-04-09	PathSegDiff: Pathology Segmentation using Diffusion model representations	Sachin Kumar Danisetty et.al.	2504.06950	null
2025-04-09	MedSegFactory: Text-Guided Generation of Medical Image-Mask Pairs	Jiawei Mao et.al.	2504.06897	null
2025-04-09	EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation	Diljeet Jagpal et.al.	2504.06861	null
2025-04-09	CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading	Mishan Aliev et.al.	2504.06856	null
2025-04-09	DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation	Wangbo Zhao et.al.	2504.06803	link
2025-04-09	DIMA: DIffusing Motion Artifacts for unsupervised correction in brain MRI images	Paolo Angella et.al.	2504.06767	null
2025-04-10	Compass Control: Multi Object Orientation Control for Text-to-Image Generation	Rishubh Parihar et.al.	2504.06752	null
2025-04-09	Optimal Execution and Macroscopic Market Making	Ivan Guo et.al.	2504.06717	null
2025-04-09	Probability Density Geodesics in Image Diffusion Latent Space	Qingtao Yu et.al.	2504.06675	null
2025-04-09	RAGME: Retrieval Augmented Video Generation for Enhanced Motion Realism	Elia Peruzzo et.al.	2504.06672	null
2025-04-09	Diffusion Factor Models: Generating High-Dimensional Returns with Factor Structure	Minshuo Chen et.al.	2504.06566	link
2025-04-09	DiffusionCom: Structure-Aware Multimodal Diffusion Model for Multimodal Knowledge Graph Completion	Wei Huang et.al.	2504.06543	null
2025-04-08	Transfer between Modalities with MetaQueries	Xichen Pan et.al.	2504.06256	null
2025-04-08	A Mean-Reverting Model of Exchange Rate Risk Premium Using Ornstein-Uhlenbeck Dynamics	SeungJae Hwang et.al.	2504.06028	null
2025-04-08	OSDM-MReg: Multimodal Image Registration based One Step Diffusion Model	Xiaochen Wei et.al.	2504.06027	null
2025-04-08	CamContextI2V: Context-aware Controllable Video Generation	Luis Denninger et.al.	2504.06022	link
2025-04-08	An Empirical Study of GPT-4o Image Generation Capabilities	Sixiang Chen et.al.	2504.05979	link
2025-04-08	Diffusion Based Ambiguous Image Segmentation	Jakob Lønborg Christensen et.al.	2504.05977	null
2025-04-08	Physics-aware generative models for turbulent fluid flows through energy-consistent stochastic interpolants	Nikolaj T. Mücke et.al.	2504.05852	link
2025-04-08	On the Importance of Conditioning for Privacy-Preserving Data Augmentation	Julian Lorenz et.al.	2504.05849	null
2025-04-08	Mind the Trojan Horse: Image Prompt Adapter Enabling Scalable and Deceptive Jailbreaking	Junxi Chen et.al.	2504.05838	link
2025-04-08	Parasite: A Steganography-based Backdoor Attack Framework for Diffusion Models	Jiahao Chen et.al.	2504.05815	null
2025-04-08	Storybooth: Training-free Multi-Subject Consistency for Improved Visual Storytelling	Jaskirat Singh et.al.	2504.05800	null
2025-04-08	QEMesh: Employing A Quadric Error Metrics-Based Representation for Mesh Generation	Jiaqi Li et.al.	2504.05720	null
2025-04-08	Reconstruction-Free Anomaly Detection with Diffusion Models via Direct Latent Likelihood Evaluation	Shunsuke Sakai et.al.	2504.05662	link
2025-04-08	Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model	Qi Mao et.al.	2504.05594	null
2025-04-07	Studying Image Diffusion Features for Zero-Shot Video Object Segmentation	Thanos Delatolas et.al.	2504.05468	null
2025-04-07	CREA: A Collaborative Multi-Agent Framework for Creative Content Generation with Diffusion Models	Kavana Venkatesh et.al.	2504.05306	null
2025-04-07	Gaussian Mixture Flow Matching Models	Hansheng Chen et.al.	2504.05304	link
2025-04-07	Dimension-Free Convergence of Diffusion Models for Approximate Gaussian Mixtures	Gen Li et.al.	2504.05300	null
2025-04-07	DA2Diff: Exploring Degradation-aware Adaptive Diffusion Priors for All-in-One Weather Restoration	Jiamei Xiong et.al.	2504.05135	null
2025-04-07	Safe and Efficient Coexistence of Autonomous Vehicles with Human-Driven Traffic at Signalized Intersections	Filippos N. Tzortzoglou et.al.	2504.05101	null
2025-04-07	Graph-based Diffusion Model for Collaborative Filtering	Xuan Zhang et.al.	2504.05029	null
2025-04-07	Solving the fully nonlinear Monge-Ampère equation using the Legendre-Kolmogorov-Arnold Network method	Bingcheng Hu et.al.	2504.05022	null
2025-04-08	REWIND: Real-Time Egocentric Whole-Body Motion Diffusion with Exemplar-Based Identity Conditioning	Jihyun Lee et.al.	2504.04956	null
2025-04-07	Stochastic differential equations driven by fractional Brownian motion: dependence on the Hurst parameter	Anna P. Kwossek et.al.	2504.04860	null
2025-04-07	Approach to optimal quantum transport via states over time	Matt Hoogsteder-Riera et.al.	2504.04856	null
2025-04-07	Topological Schrödinger Bridge Matching	Maosheng Yang et.al.	2504.04799	link
2025-04-08	TabRep: a Simple and Effective Continuous Representation for Training Tabular Diffusion Models	Jacob Si et.al.	2504.04798	link
2025-04-07	Disentangling Instruction Influence in Diffusion Transformers for Parallel Multi-Instruction-Guided Image Editing	Hui Liu et.al.	2504.04784	null
2025-04-07	Strong approximation and central limit theorems for multiscale stochastic gene networks	Baptiste Nicolas Huguet et.al.	2504.04768	null
2025-04-07	Continuous Locomotive Crowd Behavior Generation	Inhwan Bae et.al.	2504.04756	link
2025-04-04	Enhancing Causal Effect Estimation with Diffusion-Generated Data	Li Chen et.al.	2504.03630	null
2025-04-04	Quantifying the uncertainty of model-based synthetic image quality metrics	Ciaran Bench et.al.	2504.03623	null
2025-04-04	Multimodal Diffusion Bridge with Attention-Based SAR Fusion for Satellite Image Cloud Removal	Yuyang Hu et.al.	2504.03607	null
2025-04-04	Diffusion Active Learning: Towards Data-Driven Experimental Design in Computed Tomography	Luis Barba et.al.	2504.03491	null
2025-04-04	BUFF: Bayesian Uncertainty Guided Diffusion Probabilistic Model for Single Image Super-Resolution	Zihao He et.al.	2504.03490	null
2025-04-04	Dynamic Importance in Diffusion U-Net for Enhanced Image Synthesis	Xi Wang et.al.	2504.03471	link
2025-04-04	D-Garment: Physics-Conditioned Latent Diffusion for Dynamic Garment Deformations	Antoine Dumoulin et.al.	2504.03468	null
2025-04-04	Stochastic Control of Drawdowns via Reinsurance under Random Inspection	Kira Dudziak et.al.	2504.03319	null
2025-04-04	Nonlinear Dynamical Unbalanced Optimal Transport: Relaxation and Duality	Dongjun Wu et.al.	2504.03301	null
2025-04-04	FaR: Enhancing Multi-Concept Text-to-Image Diffusion via Concept Fusion and Localized Refinement	Gia-Nghia Tran et.al.	2504.03292	null
2025-04-04	Dynamic Optimal Transport with Optimal Preferential Paths	Marcello Carioni et.al.	2504.03285	null
2025-04-04	The Ground Cost for Optimal Transport of Angular Velocity	Karthik Elamvazhuthi et.al.	2504.03190	null
2025-04-04	Simultaneous Learning of Optimal Transports for Training All-to-All Flow-Based Condition Transfer Model	Kotaro Ikeda et.al.	2504.03188	null
2025-04-04	On the Connection Between Diffusion Models and Molecular Dynamics	Liam Harcombe et.al.	2504.03187	null
2025-04-04	Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models	Xuran Ma et.al.	2504.03140	link
2025-04-03	Concept Lancet: Image Editing with Compositional Representation Transplant	Jinqi Luo et.al.	2504.02828	null
2025-04-03	Convergence of the Markovian iteration for coupled FBSDEs via a differentiation approach	Zhipeng Huang et.al.	2504.02814	null
2025-04-03	F-ViTA: Foundation Model Guided Visible to Thermal Translation	Jay N. Paranjape et.al.	2504.02801	link
2025-04-03	Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model	Shengjun Zhang et.al.	2504.02764	null
2025-04-03	MD-ProjTex: Texturing 3D Shapes with Multi-Diffusion Projection	Ahmet Burak Yildirim et.al.	2504.02762	null
2025-04-04	RBT4DNN: Requirements-based Testing of Neural Networks	Nusrat Jahan Mozumder et.al.	2504.02737	link
2025-04-03	Critical Scaling of the Quantum Wasserstein Distance	Gonzalo Camacho et.al.	2504.02709	null
2025-04-03	Symplectic techniques for stochastic differential equations on reductive Lie groups with applications to Langevin diffusions	Erwin Luesink et.al.	2504.02707	null
2025-04-03	RoSMM: A Robust and Secure Multi-Modal Watermarking Framework for Diffusion Models	ZhongLi Fang et.al.	2504.02640	null
2025-04-03	Variational Online Mirror Descent for Robust Learning in Schrödinger Bridge	Dong-Sig Han et.al.	2504.02618	null
2025-04-03	Bridging the Gap between Gaussian Diffusion Models and Universal Quantization for Image Compression	Lucas Relic et.al.	2504.02579	null
2025-04-03	MAD: Makeup All-in-One with Cross-Domain Diffusion Model	Bo-Kai Ruan et.al.	2504.02545	null
2025-04-03	Translation of Fetal Brain Ultrasound Images into Pseudo-MRI Images using Artificial Intelligence	Naomi Silverstein et.al.	2504.02408	null
2025-04-03	Marine Saliency Segmenter: Object-Focused Conditional Diffusion with Region-Level Semantic Knowledge Distillation	Laibin Chang et.al.	2504.02391	null
2025-04-03	OmniCam: Unified Multimodal Video Generation via Camera Control	Xiaoda Yang et.al.	2504.02312	null
2025-04-02	Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis	Niluthpol Chowdhury Mithun et.al.	2504.01960	null
2025-04-03	VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step	Hanyang Wang et.al.	2504.01956	null
2025-04-02	A Unified Approach to Analysis and Design of Denoising Markov Models	Yinuo Ren et.al.	2504.01938	null
2025-04-03	ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement	Runhui Huang et.al.	2504.01934	null
2025-04-02	Multi-fidelity Parameter Estimation Using Conditional Diffusion Models	Caroline Tatsuoka et.al.	2504.01894	null
2025-04-02	A Diffusion-Based Framework for Occluded Object Movement	Zheng-Peng Duan et.al.	2504.01873	null
2025-04-02	Implicit Bias Injection Attacks against Text-to-Image Diffusion Models	Huayang Huang et.al.	2504.01819	link
2025-04-02	The protein escape process at the ribosomal exit tunnel has conserved mechanisms across the domains of life	Phuong Thuy Bui et.al.	2504.01731	null
2025-04-02	InvFussion: Bridging Supervised and Zero-shot Diffusion for Inverse Problems	Noam Elata et.al.	2504.01689	link
2025-04-02	On the performance of the Euler-Maruyama scheme for multidimensional SDEs with discontinuous drift coefficient	Thomas Müller-Gronbach et.al.	2504.01630	null
2025-04-02	Instance Migration Diffusion for Nuclear Instance Segmentation in Pathology	Lirui Qi et.al.	2504.01577	null
2025-04-02	Semi-Supervised Biomedical Image Segmentation via Diffusion Models and Teacher-Student Co-Training	Luca Ciampi et.al.	2504.01547	link
2025-04-02	Hyperbolic Diffusion Recommender Model	Meng Yuan et.al.	2504.01541	null
2025-04-02	Domain Guidance: A Simple Transfer Approach for a Pre-trained Diffusion Model	Jincheng Zhong et.al.	2504.01521	link
2025-04-02	Optimal Control of an Interconnected SDE -Parabolic PDE System	Gabriel Velho et.al.	2504.01475	null
2025-03-31	Style Quantization for Data-Efficient GAN Training	Jian Wang et.al.	2503.24282	null
2025-03-31	Enhancing Image Resolution of Solar Magnetograms: A Latent Diffusion Model Approach	Francesco Pio Ramunno et.al.	2503.24271	link
2025-04-01	Visual Acoustic Fields	Yuelei Li et.al.	2503.24270	null
2025-03-31	Many-to-Many Matching via Sparsity Controlled Optimal Transport	Weijie Liu et.al.	2503.24204	null
2025-03-31	Controlled Latent Diffusion Models for 3D Porous Media Reconstruction	Danilo Naiff et.al.	2503.24083	link
2025-03-31	DenseFormer: Learning Dense Depth Map from Sparse Depth and Image via Conditional Diffusion Model	Ming Yuan et.al.	2503.23993	null
2025-03-31	JointTuner: Appearance-Motion Adaptive Joint Training for Customized Video Generation	Fangda Chen et.al.	2503.23951	null
2025-03-31	DiffuSE: Cross-Layer Design Space Exploration of DNN Accelerator via Diffusion-Driven Optimization	Yi Ren et.al.	2503.23945	null
2025-03-31	Training-Free Text-Guided Image Editing with Visual Autoregressive Model	Yufei Wang et.al.	2503.23897	link
2025-03-31	DiffScale: Continuous Downscaling and Bias Correction of Subseasonal Wind Speed Forecasts using Diffusion Models	Maximilian Springenberg et.al.	2503.23893	null
2025-03-31	MuseFace: Text-driven Face Editing via Diffusion-based Mask Generation Approach	Xin Zhang et.al.	2503.23888	null
2025-03-31	ExScene: Free-View 3D Scene Reconstruction with Gaussian Splatting from a Single Image	Tianyi Gong et.al.	2503.23881	null
2025-03-31	Shannon-and Von Neumann-entropy regularizations of linear and semidefinite programs	Saroj Prasad Chhatoi et.al.	2503.23815	null
2025-03-31	Biologically Inspired Spiking Diffusion Model with Adaptive Lateral Selection Mechanism	Linghao Feng et.al.	2503.23767	null
2025-03-31	StrokeFusion: Vector Sketch Generation via Joint Stroke-UDF Encoding and Latent Sequence Diffusion	Jin Zhou et.al.	2503.23752	null
2025-03-28	DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness	Ruining Li et.al.	2503.22677	null
2025-03-28	On the effects of parameters on galaxy properties in CAMELS and the predictability of $Ω_{\rm m}$	Gabriella Contardo et.al.	2503.22654	null
2025-03-28	Zero4D: Training-Free 4D Video Generation From Single Video Using Off-the-Shelf Video Diffusion Model	Jangho Park et.al.	2503.22622	null
2025-03-28	Generative Latent Neural PDE Solver using Flow Matching	Zijie Li et.al.	2503.22600	null
2025-03-28	RELD: Regularization by Latent Diffusion Models for Image Restoration	Pasquale Cascarano et.al.	2503.22563	null
2025-03-28	Deterministic Medical Image Translation via High-fidelity Brownian Bridges	Qisheng He et.al.	2503.22531	null
2025-03-28	Scenario Dreamer: Vectorized Latent Diffusion for Generating Driving Simulation Environments	Luke Rowe et.al.	2503.22496	null
2025-03-28	Volumetric Material Decomposition Using Spectral Diffusion Posterior Sampling with a Compressed Polychromatic Forward Model	Xiao Jiang et.al.	2503.22392	null
2025-03-28	Meta-LoRA: Meta-Learning LoRA Components for Domain-Aware ID Personalization	Barış Batuhan Topal et.al.	2503.22352	null
2025-03-28	GCRayDiffusion: Pose-Free Surface Reconstruction via Geometric Consistent Ray Diffusion	Li-Heng Chen et.al.	2503.22349	null
2025-03-28	Semantix: An Energy Guided Sampler for Semantic Style Transfer	Huiang He et.al.	2503.22344	null
2025-03-28	Imperceptible but Forgeable: Practical Invisible Watermark Forgery via Diffusion Models	Ziping Dong et.al.	2503.22330	null
2025-03-28	Mono2Stereo: A Benchmark and Empirical Study for Stereo Conversion	Songsong Yu et.al.	2503.22262	null
2025-03-28	On the convergence of the Euler-Maruyama scheme for McKean-Vlasov SDEs	Noufel Frikha et.al.	2503.22226	null
2025-03-28	Follow Your Motion: A Generic Temporal Consistency Portrait Editing Framework with Trajectory Guidance	Haijie Yang et.al.	2503.22225	null
2025-03-27	VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models	Chi-Pin Huang et.al.	2503.21781	null
2025-03-27	StyleMotif: Multi-Modal Motion Stylization using Style-Content Cross Fusion	Ziyu Guo et.al.	2503.21775	null
2025-03-27	Optimal Stepsize for Diffusion Sampling	Jianning Pei et.al.	2503.21774	link
2025-03-27	Exploring the Evolution of Physics Cognition in Video Generation: A Survey	Minghui Lin et.al.	2503.21765	link
2025-03-27	A Unified Framework for Diffusion Bridge Problems: Flow Matching and Schrödinger Matching into One	Minyoung Kim et.al.	2503.21756	null
2025-03-27	Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data	Zhiyuan Ma et.al.	2503.21694	link
2025-03-27	Strong convergence and stability of stochastic theta method for time-changed stochastic differential equations with local Lipschitz coefficients	Jingwei Chen et.al.	2503.21653	null
2025-03-27	Audio-driven Gesture Generation via Deviation Feature in the Latent Space	Jiahui Chen et.al.	2503.21616	null
2025-03-27	Critical Iterative Denoising: A Discrete Generative Model Applied to Graphs	Yoann Boget et.al.	2503.21592	null
2025-03-27	AlignDiff: Learning Physically-Grounded Camera Alignment via Diffusion	Liuyue Xie et.al.	2503.21581	null
2025-03-27	Fusion of Graph Neural Networks via Optimal Transport	Weronika Ormaniec et.al.	2503.21579	null
2025-03-27	SyncSDE: A Probabilistic Framework for Diffusion Synchronization	Hyunjun Lee et.al.	2503.21555	null
2025-03-28	LOCATEdit: Graph Laplacian Optimized Cross Attention for Localized Text-Guided Image Editing	Achint Soni et.al.	2503.21541	link
2025-03-27	Nonlinear Stability of Large-Period Traveling Waves Bifurcating from the Heteroclinic Loop in the FitzHugh-Nagumo Equation	Ji Li et.al.	2503.21509	null
2025-03-27	Invert2Restore: Zero-Shot Degradation-Blind Image Restoration	Hamadi Chihaoui et.al.	2503.21486	null
2025-03-26	Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency	Tianqi Liu et.al.	2503.20785	link
2025-03-26	FB-4D: Spatial-Temporal Coherent Dynamic 3D Content Generation with Feature Banks	Jinwei Li et.al.	2503.20784	link
2025-03-26	RecTable: Fast Modeling Tabular Data with Rectified Flow	Masane Fuchi et.al.	2503.20731	link
2025-03-26	Dynamic Motion Blending for Versatile Motion Editing	Nan Jiang et.al.	2503.20724	null
2025-03-26	Decoherence time maximization and partial isolation for open quantum harmonic oscillator memory networks	Igor G. Vladimirov et.al.	2503.20675	null
2025-03-26	ARMO: Autoregressive Rigging for Multi-Category Objects	Mingze Sun et.al.	2503.20663	null
2025-03-26	MMGen: Unified Multi-modal Image Generation and Understanding in One Go	Jiepeng Wang et.al.	2503.20644	null
2025-03-26	Stochastic Transport Maps in Diffusion Models and Sampling	Xicheng Zhang et.al.	2503.20573	null
2025-03-26	Infinite Time Horizon Optimal Control of McKean-Vlasov SDEs	Silvia Rudà et.al.	2503.20572	null
2025-03-26	Exploring Robustness of Cortical Morphometry in the presence of white matter lesions, using Diffusion Models for Lesion Filling	Vinzenz Uhr et.al.	2503.20571	null
2025-03-26	TD-BFR: Truncated Diffusion Model for Efficient Blind Face Restoration	Ziying Zhang et.al.	2503.20537	null
2025-03-26	Contrastive Learning Guided Latent Diffusion Model for Image-to-Image Translation	Qi Si et.al.	2503.20484	null
2025-03-26	Dissecting and Mitigating Diffusion Bias via Mechanistic Interpretability	Yingdong Shi et.al.	2503.20483	null
2025-03-26	Latent Beam Diffusion Models for Decoding Image Sequences	Guilherme Fernandes et.al.	2503.20429	null
2025-03-26	ITA-MDT: Image-Timestep-Adaptive Masked Diffusion Transformer Framework for Image-Based Virtual Try-On	Ji Woo Hong et.al.	2503.20418	null
2025-03-25	Learning 3D Object Spatial Relationships from Pre-trained 2D Diffusion Models	Sangwon Beak et.al.	2503.19914	null
2025-03-25	PartRM: Modeling Part-Level Dynamics with Large Cross-State Reconstruction Model	Mingju Gao et.al.	2503.19913	null
2025-03-26	AvatarArtist: Open-Domain 4D Avatarization	Hongyu Liu et.al.	2503.19906	null
2025-03-25	ICE: Intrinsic Concept Extraction from a Single Image via Diffusion Models	Fernando Julio Cendra et.al.	2503.19902	null
2025-03-25	Scaling Down Text Encoders of Text-to-Image Diffusion Models	Lifu Wang et.al.	2503.19897	link
2025-03-25	FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model	Jun Zhou et.al.	2503.19839	null
2025-03-25	Unpaired Object-Level SAR-to-Optical Image Translation for Aircraft with Keypoints-Guided Diffusion Models	Ruixi You et.al.	2503.19798	null
2025-03-26	In the Blink of an Eye: Instant Game Map Editing using a Generative-AI Smart Brush	Vitaly Gnatyuk et.al.	2503.19793	null
2025-03-25	SITA: Structurally Imperceptible and Transferable Adversarial Attacks for Stylized Image Generation	Jingdan Kang et.al.	2503.19791	link
2025-03-25	Fine-Grained Erasure in Text-to-Image Diffusion-based Foundation Models	Kartik Thakral et.al.	2503.19783	null
2025-03-25	PCM : Picard Consistency Model for Fast Parallel Sampling of Diffusion Models	Junhyuk So et.al.	2503.19731	null
2025-03-25	CoSimGen: Controllable Diffusion Model for Simultaneous Image and Mask Generation	Rupak Bose et.al.	2503.19661	null
2025-03-25	OpenSDI: Spotting Diffusion-Generated Images in the Open World	Yabin Wang et.al.	2503.19653	link
2025-03-25	GIViC: Generative Implicit Video Compression	Ge Gao et.al.	2503.19604	null
2025-03-25	Variational conditional normalizing flows for computing second-order mean field control problems	Jiaxi Zhao et.al.	2503.19580	link
2025-03-24	Target-Aware Video Diffusion Models	Taeksoo Kim et.al.	2503.18950	null
2025-03-24	Training-free Diffusion Acceleration with Bottleneck Sampling	Ye Tian et.al.	2503.18940	null
2025-03-24	SyncVP: Joint Diffusion for Synchronous Multi-Modal Video Prediction	Enrico Pallotta et.al.	2503.18933	link
2025-03-24	Dual-domain Multi-path Self-supervised Diffusion Model for Accelerated MRI Reconstruction	Yuxuan Zhang et.al.	2503.18836	null
2025-03-24	An improved central limit theorem for the empirical sliced Wasserstein distance	David Rodríguez-Vítores et.al.	2503.18831	null
2025-03-24	Thermalizer: Stable autoregressive neural emulation of spatiotemporal chaos	Chris Pedersen et.al.	2503.18731	null
2025-03-24	Human Motion Unlearning	Edoardo De Matteis et.al.	2503.18674	null
2025-03-24	Dig2DIG: Dig into Diffusion Information Gains for Image Fusion	Bing Cao et.al.	2503.18627	null
2025-03-24	Generative Dataset Distillation using Min-Max Diffusion Model	Junqiao Fan et.al.	2503.18626	null
2025-03-24	Joint Spectrogram Separation and TDOA Estimation using Optimal Transport	Linda Fabiani et.al.	2503.18600	null
2025-03-24	Unified Uncertainty-Aware Diffusion for Multi-Agent Trajectory Modeling	Guillem Capellera et.al.	2503.18589	null
2025-03-24	Adapting Video Diffusion Models for Time-Lapse Microscopy	Alexander Holmberg et.al.	2503.18583	link
2025-03-25	AMD-Hummingbird: Towards an Efficient Text-to-Video Model	Takashi Isobe et.al.	2503.18559	link
2025-03-24	EvAnimate: Event-conditioned Image-to-Video Generation for Human Animation	Qiang Qu et.al.	2503.18552	null
2025-03-24	Discriminative protein sequence modelling with Latent Space Diffusion	Eoin Quinn et.al.	2503.18551	null
2025-03-21	Preference-Guided Diffusion for Multi-Objective Offline Optimization	Yashas Annadani et.al.	2503.17299	null
2025-03-21	Deep End-to-End Posterior ENergy (DEEPEN) for image recovery	Jyothi Rikhab Chand et.al.	2503.17244	null
2025-03-21	Leveraging Text-to-Image Generation for Handling Spurious Correlation	Aryan Yazdan Parast et.al.	2503.17226	null
2025-03-21	UniCon: Unidirectional Information Flow for Effective Control of Large-Scale Diffusion Models	Fanghua Yu et.al.	2503.17221	null
2025-03-21	FreeUV: Ground-Truth-Free Realistic Facial UV Texture Recovery via Cross-Assembly Inference Strategy	Xingchao Yang et.al.	2503.17197	null
2025-03-21	D2C: Unlocking the Potential of Continuous Autoregressive Image Generation with Discrete Tokens	Panpan Wang et.al.	2503.17155	null
2025-03-21	Martingale property and moment explosions in signature volatility models	Eduardo Abi Jaber et.al.	2503.17103	null
2025-03-21	R2LDM: An Efficient 4D Radar Super-Resolution Framework Leveraging Diffusion Model	Boyuan Zheng et.al.	2503.17097	null
2025-03-21	Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection	Gensheng Pei et.al.	2503.17080	null
2025-03-21	DIDiffGes: Decoupled Semi-Implicit Diffusion Models for Real-time Gesture Generation from Speech	Yongkang Cheng et.al.	2503.17059	null
2025-03-21	Enabling Versatile Controls for Video Diffusion Models	Xu Zhang et.al.	2503.16983	link
2025-03-21	Real-Time Diffusion Policies for Games: Enhancing Consistency Policies with Q-Ensembles	Ruoqi Zhang et.al.	2503.16978	null
2025-03-24	Re-HOLD: Video Hand Object Interaction Reenactment via adaptive Layout-instructed Diffusion Model	Yingying Fan et.al.	2503.16942	null
2025-03-21	When Preferences Diverge: Aligning Diffusion Models with Minority-Aware Adaptive DPO	Lingfan Zhang et.al.	2503.16921	null
2025-03-21	Malliavin-Bismut Score-based Diffusion Models	Ehsan Mirafzali et.al.	2503.16917	null
2025-03-20	DreamTexture: Shape from Virtual Texture with Analysis by Augmentation	Ananta R. Bhattarai et.al.	2503.16412	null
2025-03-20	VerbDiff: Text-Only Diffusion Models with Enhanced Interaction Awareness	SeungJu Cha et.al.	2503.16406	link
2025-03-20	ScalingNoise: Scaling Inference-Time Search for Generating Infinite Videos	Haolin Yang et.al.	2503.16400	null
2025-03-20	Scale-wise Distillation of Diffusion Models	Nikita Starodubcev et.al.	2503.16397	null
2025-03-21	SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation	Chun-Han Yao et.al.	2503.16396	null
2025-03-20	Do Visual Imaginations Improve Vision-and-Language Navigation Agents?	Akhil Perincherry et.al.	2503.16394	null
2025-03-20	LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images	Leyang Wang et.al.	2503.16376	null
2025-03-20	Heat transfer and mixing in initiated Chemical Vapor Deposition analyzed by in-situ gas composition sensing	Simon Shindler et.al.	2503.16373	null
2025-03-20	Ultra-Resolution Adaptation with Ease	Ruonan Yu et.al.	2503.16322	link
2025-03-20	Unleashing Vecset Diffusion Model for Fast Shape Generation	Zeqiang Lai et.al.	2503.16302	link
2025-03-20	Diffusion-augmented Graph Contrastive Learning for Collaborative Filter	Fan Huang et.al.	2503.16290	null
2025-03-20	SceneMI: Motion In-betweening for Modeling Human-Scene Interactions	Inwoo Hwang et.al.	2503.16289	null
2025-03-21	Uni-3DAR: Unified 3D Generation and Understanding via Autoregression on Compressed Spatial Tokens	Shuqi Lu et.al.	2503.16278	link
2025-03-20	Temporal Score Analysis for Understanding and Correcting Diffusion Artifacts	Yu Cao et.al.	2503.16218	null
2025-03-20	Improving Discriminator Guidance in Diffusion Models	Alexandre Verine et.al.	2503.16117	null
2025-03-19	FP4DiT: Towards Effective Floating Point Quantization for Diffusion Transformers	Ruichen Chen et.al.	2503.15465	link
2025-03-19	Di $\mathtt{[M]}$ O: Distilling Masked Diffusion Models into One-step Generator	Yuanzhi Zhu et.al.	2503.15457	null
2025-03-19	MotionStreamer: Streaming Motion Generation via Diffusion-based Autoregressive Model in Causal Latent Space	Lixing Xiao et.al.	2503.15451	null
2025-03-19	Visual Persona: Foundation Model for Full-Body Human Customization	Jisu Nam et.al.	2503.15406	null
2025-03-19	CCDP: Composition of Conditional Diffusion Policies with Guided Sampling	Amirreza Razmjoo et.al.	2503.15386	null
2025-03-19	Material Decomposition in Photon-Counting Computed Tomography with Diffusion Models: Comparative Study and Hybridization with Variational Regularizers	Corentin Vazia et.al.	2503.15383	null
2025-03-19	Recover and Match: Open-Vocabulary Multi-Label Recognition through Knowledge-Constrained Optimal Transport	Hao Tan et.al.	2503.15337	link
2025-03-19	Euclid Quick Data Release (Q1). Active galactic nuclei identification using diffusion-based inpainting of Euclid VIS images	Euclid Collaboration et.al.	2503.15321	null
2025-03-19	Detect-and-Guide: Self-regulation of Diffusion Models for Safe Text-to-Image Generation via Guideline Token Optimization	Feifei Li et.al.	2503.15197	null
2025-03-19	A proposal of smooth interpolation to optimal transport for restoring biased data for algorithmic fairness	Elena M. De Diego et.al.	2503.15119	null
2025-03-19	Control, Optimal Transport and Neural Differential Equations in Supervised Learning	Minh-Nhat Phung et.al.	2503.15105	null
2025-03-19	Proximal Gradient Dynamics and Feedback Control for Equality-Constrained Composite Optimization	Veronica Centorrino et.al.	2503.15093	null
2025-03-19	Single-Step Bidirectional Unpaired Image Translation Using Implicit Bridge Consistency Distillation	Suhyeon Lee et.al.	2503.15056	null
2025-03-19	Exploiting Diffusion Prior for Real-World Image Dehazing with Unpaired Training	Yunwei Lan et.al.	2503.15017	link
2025-03-19	Taming Flow Matching with Unbalanced Optimal Transport into Fast Pansharpening	Zihan Cao et.al.	2503.14975	null
2025-03-18	MusicInfuser: Making Video Diffusion Listen and Dance	Susung Hong et.al.	2503.14505	null
2025-03-18	The Power of Context: How Multimodality Improves Image Super-Resolution	Kangfu Mei et.al.	2503.14503	null
2025-03-18	Stable Virtual Camera: Generative View Synthesis with Diffusion Models	Jensen et.al.	2503.14489	null
2025-03-18	DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers	Minglei Shi et.al.	2503.14487	null
2025-03-18	Lux Post Facto: Learning Portrait Performance Relighting with Conditional Video Diffusion and a Hybrid Dataset	Yiqun Mei et.al.	2503.14485	null
2025-03-18	La Méthode du Gradient Proximé	Patrick L. Combettes et.al.	2503.14479	null
2025-03-18	SIR-DIFF: Sparse Image Sets Restoration with Multi-View Diffusion Model	Yucheng Mao et.al.	2503.14463	null
2025-03-18	Bolt3D: Generating 3D Scenes in Seconds	Stanislaw Szymanowicz et.al.	2503.14445	null
2025-03-18	MagicComp: Training-free Dual-Phase Refinement for Compositional Video Generation	Hongyu Zhang et.al.	2503.14428	null
2025-03-18	RFMI: Estimating Mutual Information on Rectified Flow for Text-to-Image Alignment	Chao Wang et.al.	2503.14358	null
2025-03-18	VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation	Shoubin Yu et.al.	2503.14350	null
2025-03-18	LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models	Yu Cheng et.al.	2503.14325	link
2025-03-18	Controllability concepts for mean-field dynamics with reduced-rank coefficients	Dan Goreac et.al.	2503.14278	null
2025-03-18	Free-Lunch Color-Texture Disentanglement for Stylized Image Generation	Jiang Qin et.al.	2503.14275	null
2025-03-19	CTSR: Controllable Fidelity-Realness Trade-off Distillation for Real-World Image Super Resolution	Runyi Li et.al.	2503.14272	null
2025-03-17	Mixtures of ensembles: System separation and identification via optimal transport	Filip Elvander et.al.	2503.13362	null
2025-03-17	One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation	Daniil Selikhanovych et.al.	2503.13358	null
2025-03-17	Generative Gaussian Splatting: Generating 3D Scenes with Video Diffusion Priors	Katja Schwarz et.al.	2503.13272	null
2025-03-17	FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis	Luxi Chen et.al.	2503.13265	null
2025-03-17	MedLoRD: A Medical Low-Resource Diffusion Model for High-Resolution 3D CT Image Synthesis	Marvin Seyfarth et.al.	2503.13211	null
2025-03-17	The deep multi-FBSDE method: a robust deep learning method for coupled FBSDEs	Kristoffer Andersson et.al.	2503.13193	null
2025-03-17	Evolution of a trait distributed over a large fragmented population: Propagation of chaos meets adaptive dynamics	Amaury Lambert et.al.	2503.13154	null
2025-03-17	Patient-specific radiomic feature selection with reconstructed healthy persona of knee MR images	Yaxi Chen et.al.	2503.13131	null
2025-03-17	DTGBrepGen: A Novel B-rep Generative Model through Decoupling Topology and Geometry	Jing Li et.al.	2503.13110	link
2025-03-17	Beyond Classical Diffusion: Fractional Derivatives in Transport and Stochastic Systems	Cypres Verbeeck et.al.	2503.13096	null
2025-03-17	Preserving invariant domains and strong approximation of stochastic differential equations	Utku Erdogan et.al.	2503.13094	null
2025-03-17	On reflected isotropic stable processes	Loïc Béthencourt et.al.	2503.13071	null
2025-03-17	TFDM: Time-Variant Frequency-Based Point Cloud Diffusion with Mamba	Jiaxu Liu et.al.	2503.13004	null
2025-03-17	Training Video Foundation Models with NVIDIA NeMo	Zeeshan Patel et.al.	2503.12964	null
2025-03-17	Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait	Chaolong Yang et.al.	2503.12963	link
2025-03-14	Pathology Image Compression with Pre-trained Autoencoders	Srikar Yellapragada et.al.	2503.11591	null
2025-03-14	Dynamics of a coupled nonlocal PDE-ODE system with spatial memory: well-posedness, stability, and bifurcation analysis	Yurij Salmaniw et.al.	2503.11550	null
2025-03-14	Quadratic BSDEs with Singular Generators and Unbounded Terminal Conditions: Theory and Applications	Wenbo Wang et.al.	2503.11443	null
2025-03-17	COIN: Confidence Score-Guided Distillation for Annotation-Free Cell Segmentation	Sanghyun Jo et.al.	2503.11439	null
2025-03-14	FlowKac: An Efficient Neural Fokker-Planck solver using Temporal Normalizing flows and the Feynman Kac-Formula	Naoufal El Bekri et.al.	2503.11427	link
2025-03-14	TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation	Hongxiang Zhao et.al.	2503.11423	null
2025-03-14	MTV-Inpaint: Multi-Task Long Video Inpainting	Shiyuan Yang et.al.	2503.11412	null
2025-03-14	Towards A Correct Usage of Cryptography in Semantic Watermarks for Diffusion Models	Jonas Thietke et.al.	2503.11404	null
2025-03-14	BEVDiffLoc: End-to-End LiDAR Global Localization in BEV View based on Diffusion Model	Ziyue Wang et.al.	2503.11372	link
2025-03-14	Integrating Dynamical Systems Modeling with Spatiotemporal scRNA-seq Data Analysis	Zhenyi Zhang et.al.	2503.11347	null
2025-03-14	Safe-VAR: Safe Visual Autoregressive Model for Text-to-Image Generative Watermarking	Ziyi Wang et.al.	2503.11324	null
2025-03-14	CyclePose – Leveraging Cycle-Consistency for Annotation-Free Nuclei Segmentation in Fluorescence Microscopy	Jonas Utz et.al.	2503.11266	null
2025-03-14	Noise Synthesis for Low-Light Image Denoising with Diffusion Models	Liying Lu et.al.	2503.11262	null
2025-03-14	Spherical Tree-Sliced Wasserstein Distance	Hoang V. Tran et.al.	2503.11249	link
2025-03-14	Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards	Zijing Hu et.al.	2503.11240	link
2025-03-13	GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing	Rongyao Fang et.al.	2503.10639	link
2025-03-13	Studying Classifier(-Free) Guidance From a Classifier-Centric Perspective	Xiaoming Zhao et.al.	2503.10638	null
2025-03-14	Distilling Diversity and Control in Diffusion Models	Rohit Gandikota et.al.	2503.10637	null
2025-03-14	The Curse of Conditions: Analyzing and Improving Optimal Transport for Conditional Flow-Based Generation	Ho Kei Cheng et.al.	2503.10636	link
2025-03-13	HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model	Jiaming Liu et.al.	2503.10631	null
2025-03-13	NIL: No-data Imitation Learning by Leveraging Pre-trained Video Diffusion Models	Mert Albaba et.al.	2503.10626	null
2025-03-13	DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation	Chen Chen et.al.	2503.10618	null
2025-03-13	MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction	Yingshuang Zou et.al.	2503.10604	null
2025-03-13	CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models	Hao He et.al.	2503.10592	null
2025-03-13	Long Context Tuning for Video Generation	Yuwei Guo et.al.	2503.10589	null
2025-03-13	Sample and Map from a Single Convex Potential: Generation using Conjugate Moment Measures	Nina Vesseron et.al.	2503.10576	null
2025-03-13	A large multi-agent system with noise both in position and control	Giuseppe D’Onofrio et.al.	2503.10543	null
2025-03-13	Streaming Generation of Co-Speech Gestures via Accelerated Rolling Diffusion	Evgeniia Vu et.al.	2503.10488	null
2025-03-13	CINEMA: Coherent Multi-Subject Video Generation via MLLM-Based Guidance	Yufan Deng et.al.	2503.10391	null
2025-03-13	Enhancing Facial Privacy Protection via Weakening Diffusion Purification	Ali Salar et.al.	2503.10350	link
2025-03-12	PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop	Chenyu Li et.al.	2503.09595	link
2025-03-12	Minimax Optimality of the Probability Flow ODE for Diffusion Models	Changxiao Cai et.al.	2503.09583	null
2025-03-12	Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models	Marianne Arriola et.al.	2503.09573	link
2025-03-12	TPDiff: Temporal Pyramid Video Diffusion Model	Lingmin Ran et.al.	2503.09566	null
2025-03-12	FCaS: Fine-grained Cardiac Image Synthesis based on 3D Template Conditional Diffusion Model	Jiahao Xia et.al.	2503.09560	null
2025-03-12	CM-Diff: A Single Generative Network for Bidirectional Cross-Modality Translation Diffusion Model Between Infrared and Visible Images	Bin Hu et.al.	2503.09514	null
2025-03-12	DAMM-Diffusion: Learning Divergence-Aware Multi-Modal Diffusion Model for Nanoparticles Distribution Prediction	Junjie Zhou et.al.	2503.09491	link
2025-03-12	Fast computation of the TGOSPA metric for multiple target tracking via unbalanced optimal transport	Viktor Nevelius Wernholm et.al.	2503.09449	null
2025-03-12	Sparse Autoencoder as a Zero-Shot Classifier for Concept Erasing in Text-to-Image Diffusion Models	Zhihua Tian et.al.	2503.09446	link
2025-03-12	SuperCarver: Texture-Consistent 3D Geometry Super-Resolution for High-Fidelity Surface Detail Generation	Qijian Zhang et.al.	2503.09439	null
2025-03-12	Alias-Free Latent Diffusion Models:Improving Fractional Shift Equivariance of Diffusion Latent Space	Yifan Zhou et.al.	2503.09419	link
2025-03-12	Diff-CL: A Novel Cross Pseudo-Supervision Method for Semi-supervised Medical Image Segmentation	Xiuzhen Guo et.al.	2503.09408	null
2025-03-12	Task Allocation for Multi-agent Systems via Unequal-dimensional Optimal Transport	Anqi Dong et.al.	2503.09369	null
2025-03-12	Schrödinger Bridges for Systems of Interacting Particles	Henri Orland et.al.	2503.09328	null
2025-03-12	UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer	Haoxuan Wang et.al.	2503.09277	null
2025-03-11	GarmentCrafter: Progressive Novel View Synthesis for Single-View 3D Garment Reconstruction and Editing	Yuanhao Wang et.al.	2503.08678	null
2025-03-11	Language-Depth Navigated Thermal and Visible Image Fusion	Jinchang Zhang et.al.	2503.08676	null
2025-03-11	Modeling Stock Return Distributions and Pricing Options	Xinxin Jiang et.al.	2503.08666	null
2025-03-11	REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder	Yitian Zhang et.al.	2503.08665	null
2025-03-11	MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention	Yuhan Wang et.al.	2503.08664	link
2025-03-11	MF-VITON: High-Fidelity Mask-Free Virtual Try-On with Minimal Input	Zhenchen Wan et.al.	2503.08650	null
2025-03-11	Rethinking Diffusion Model in High Dimension	Zhenxin Zheng et.al.	2503.08643	link
2025-03-11	Tuning-Free Multi-Event Long Video Generation via Synchronized Coupled Sampling	Subin Kim et.al.	2503.08605	null
2025-03-11	Modular Customization of Diffusion Models via Blockwise-Parameterized Low-Rank Adaptation	Mingkang Zhu et.al.	2503.08575	null
2025-03-11	Posterior-Mean Denoising Diffusion Model for Realistic PET Image Reconstruction	Yiran Sun et.al.	2503.08546	null
2025-03-11	SAS: Segment Any 3D Scene with Integrated 2D Priors	Zhuoyuan Li et.al.	2503.08512	null
2025-03-11	Learning to Match Unpaired Data with Minimum Entropy Coupling	Mustapha Bounoua et.al.	2503.08501	null
2025-03-11	Generalizable AI-Generated Image Detection Based on Fractal Self-Similarity in the Spectrum	Shengpeng Xiao et.al.	2503.08484	null
2025-03-11	NullFace: Training-Free Localized Face Anonymization	Han-Wei Kung et.al.	2503.08478	link
2025-03-11	Controlling Latent Diffusion Using Latent CLIP	Jason Becker et.al.	2503.08455	link
2025-03-10	DRESS: Diffusion Reasoning-based Reward Shaping Scheme For Intelligent Networks	Feiran You et.al.	2503.07433	link
2025-03-10	AR-Diffusion: Asynchronous Video Generation with Auto-Regressive Diffusion	Mingzhen Sun et.al.	2503.07418	null
2025-03-10	TimeStep Master: Asymmetrical Mixture of Timestep LoRA Experts for Versatile and Efficient Diffusion Models in Vision	Shaobin Zhuang et.al.	2503.07416	null
2025-03-10	SPEED: Scalable, Precise, and Efficient Concept Erasure for Diffusion Models	Ouxiang Li et.al.	2503.07392	link
2025-03-10	PersonaBooth: Personalized Text-to-Motion Generation	Boeun Kim et.al.	2503.07390	null
2025-03-10	TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models	Ruidong Chen et.al.	2503.07389	link
2025-03-10	AttenST: A Training-Free Attention-Driven Style Transfer Framework with Pre-Trained Diffusion Models	Bo Huang et.al.	2503.07307	link
2025-03-10	Efficient Distillation of Classifier-Free Guidance using Adapters	Cristian Perez Jensen et.al.	2503.07274	link
2025-03-11	AnomalyPainter: Vision-Language-Diffusion Synergy for Zero-Shot Realistic and Diverse Industrial Anomaly Synthesis	Zhangyu Lai et.al.	2503.07253	null
2025-03-10	Stochastic Epidemic Models with Partial Information	Florent Ouabo Kamkumo et.al.	2503.07251	null
2025-03-11	Boosting Diffusion-Based Text Image Super-Resolution Model Towards Generalized Real-World Scenarios	Chenglu Pan et.al.	2503.07232	null
2025-03-10	Synthetic Lung X-ray Generation through Cross-Attention and Affinity Transformation	Ruochen Pi et.al.	2503.07209	null
2025-03-10	Effective and Efficient Masked Image Generation Models	Zebin You et.al.	2503.07197	link
2025-03-11	Ideas in Inference-time Scaling can Benefit Generative Pre-training Algorithms	Jiaming Song et.al.	2503.07154	null
2025-03-10	Controllable 3D Outdoor Scene Generation via Scene Graphs	Yuheng Liu et.al.	2503.07152	link
2025-03-07	AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data	Zengqun Zhao et.al.	2503.05665	link
2025-03-07	TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models	Mark YU et.al.	2503.05638	null
2025-03-07	Multi-asset optimal trade execution with stochastic cross-effects: An Obizhaeva-Wang-type framework	Julia Ackermann et.al.	2503.05594	null
2025-03-07	Diffusion Models for Cayley Graphs	Michael R. Douglas et.al.	2503.05558	null
2025-03-10	*Accelerating db-A for Kinodynamic Motion Planning Using Diffusion**	Julius Franke et.al.	2503.05539	null
2025-03-07	Exploiting Inexact Computations in Multilevel Sampling Methods	Josef Martínek et.al.	2503.05533	link
2025-03-07	Noise-Robust Radio Frequency Fingerprint Identification Using Denoise Diffusion Model	Guolin Yin et.al.	2503.05514	null
2025-03-07	Riemannian Metric Learning: Closer to You than You Imagine	Samuel Gruffaz et.al.	2503.05321	null
2025-03-07	Entropic transfer operators for stochastic systems	Hancheng Bi et.al.	2503.05308	null
2025-03-07	Policy Constraint by Only Support Constraint for Offline Reinforcement Learning	Yunkai Gao et.al.	2503.05207	link
2025-03-07	Generative Trajectory Stitching through Diffusion Composition	Yunhao Luo et.al.	2503.05153	null
2025-03-07	Development and Enhancement of Text-to-Image Diffusion Models	Rajdeep Roshan Sahu et.al.	2503.05149	null
2025-03-07	Partial Distribution Alignment via Adaptive Optimal Transport	Pei Yang et.al.	2503.05087	null
2025-03-07	Taming Video Diffusion Prior with Scene-Grounding Guidance for 3D Gaussian Splatting from Sparse Inputs	Yingji Zhong et.al.	2503.05082	null
2025-03-06	Energy-Weighted Flow Matching for Offline Reinforcement Learning	Shiyuan Zhang et.al.	2503.04975	null
2025-03-06	Compositional World Knowledge leads to High Utility Synthetic data	Sachit Gaudi et.al.	2503.04687	null
2025-03-06	The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation	Aoxiong Yin et.al.	2503.04606	link
2025-03-06	Double metasurfaces and Optimal transport	Irem Altiner et.al.	2503.04536	null
2025-03-06	Guided smoothing and control for diffusion processes	Oskar Eklund et.al.	2503.04326	link
2025-03-06	Opinion Dynamics with Continuous Age Structure	Andrew Nugent et.al.	2503.04319	null
2025-03-06	How to Move Your Dragon: Text-to-Motion Synthesis for Large-Vocabulary Objects	Wonkwang Lee et.al.	2503.04257	null
2025-03-06	Synthetic Data is an Elegant GIFT for Continual Vision-Language Models	Bin Wu et.al.	2503.04229	null
2025-03-06	Energy-Guided Optimization for Personalized Image Editing with Pretrained Text-to-Image Diffusion Models	Rui Jiang et.al.	2503.04215	null
2025-03-06	CoFinDiff: Controllable Financial Diffusion Model for Time Series Generation	Yuki Tanaka et.al.	2503.04164	null
2025-03-07	Diff-Reg v2: Diffusion-Based Matching Matrix Estimation for Image Matching and 3D Registration	Qianliang Wu et.al.	2503.04127	null
2025-03-06	FREAK: Frequency-modulated High-fidelity and Real-time Audio-driven Talking Portrait Synthesis	Ziqi Ni et.al.	2503.04067	null
2025-03-06	RA-DP: Rapid Adaptive Diffusion Policy for Training-Free High-frequency Robotics Replanning	Xi Ye et.al.	2503.04051	null
2025-03-06	Underlying Semantic Diffusion for Effective and Efficient In-Context Learning	Zhong Ji et.al.	2503.04050	null
2025-03-06	Beyond Existance: Fulfill 3D Reconstructed Scenes with Pseudo Details	Yifei Gao et.al.	2503.04037	null
2025-03-06	TextDoctor: Unified Document Image Inpainting via Patch Pyramid Diffusion Models	Wanglong Lu et.al.	2503.04021	null
2025-03-05	Constrained Gaussian Wasserstein Optimal Transport with Commutative Covariance Matrices	Jun Chen et.al.	2503.03744	null
2025-03-05	Rethinking Video Tokenization: A Conditioned Diffusion-based Approach	Nianzu Yang et.al.	2503.03708	link
2025-03-05	DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance	Zhao Yang et.al.	2503.03689	link
2025-03-05	Gaussian-type density estimates for mixed SDEs driven by correlated fractional Brownian motions	Maximilian Buthenhoff et.al.	2503.03685	null
2025-03-05	Strong solutions for singular SDEs driven by long-range dependent fractional Brownian motion and other Volterra processes	Maximilian Buthenhoff et.al.	2503.03677	null
2025-03-05	Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias	Rui Lu et.al.	2503.03595	null
2025-03-05	Generative Artificial Intelligence in Robotic Manipulation: A Survey	Kun Zhang et.al.	2503.03464	null
2025-03-05	Top-K Maximum Intensity Projection Priors for 3D Liver Vessel Segmentation	Xiaotong Zhang et.al.	2503.03367	null
2025-03-05	Video Super-Resolution: All You Need is a Video Diffusion Model	Zhihao Zhan et.al.	2503.03355	null
2025-03-06	Optimizing for the Shortest Path in Denoising Diffusion Model	Ping Chen et.al.	2503.03265	link
2025-03-05	GenColor: Generative Color-Concept Association in Visual Design	Yihan Hou et.al.	2503.03236	null
2025-03-05	Mocap-2-to-3: Lifting 2D Diffusion-Based Pretrained Models for 3D Motion Capture	Zhumei Wang et.al.	2503.03222	null
2025-03-05	An Analytical Theory of Power Law Spectral Bias in the Learning Dynamics of Diffusion Models	Binxu Wang et.al.	2503.03206	null
2025-03-05	WarmFed: Federated Learning with Warm-Start for Globalization and Personalization Via Personalized Diffusion Models	Tao Feng et.al.	2503.03110	null
2025-03-05	$C$-existence families, $C$ -semigroups and their associated abstract Cauchy problems in complete random normed modules	Xia Zhang et.al.	2503.03096	null
2025-03-04	Generating Reliable Initial Velocity Models for Full-waveform Inversion with Well and Structural Constraints	Qingchen Zhang et.al.	2503.02815	null
2025-03-04	An optimal-transport finite-particle method for driven mass diffusion	Anna Pandolfi et.al.	2503.02813	null
2025-03-04	StageDesigner: Artistic Stage Generation for Scenography via Theater Scripts	Zhaoxing Gan et.al.	2503.02595	null
2025-03-04	TS-CGNet: Temporal-Spatial Fusion Meets Centerline-Guided Diffusion for BEV Mapping	Xinying Hong et.al.	2503.02578	link
2025-03-04	SPG: Improving Motion Diffusion by Smooth Perturbation Guidance	Boseong Jeon et.al.	2503.02577	null
2025-03-04	RectifiedHR: Enable Efficient High-Resolution Image Generation via Energy Rectification	Zhen Yang et.al.	2503.02537	null
2025-03-04	On the optimal stopping problem for diffusions and an approximation result for stopping times	Andrea Cosso et.al.	2503.02514	null
2025-03-05	BRIDGE: Bootstrapping Text to Control Time-Series Generation via Multi-Agent Iterative Optimization and Diffusion Modelling	Hao Li et.al.	2503.02445	null
2025-03-04	Monge-Kantorovich quantiles and ranks for image data	Gauthier Thurin et.al.	2503.02427	link
2025-03-04	Controllable Motion Generation via Diffusion Modal Coupling	Luobin Wang et.al.	2503.02353	link
2025-03-04	CQ CNN: A Hybrid Classical Quantum Convolutional Neural Network for Alzheimer’s Disease Detection Using Diffusion Generated and U Net Segmented 3D MRI	Mominul Islam et.al.	2503.02345	link
2025-03-04	Diffusion-Based mmWave Radar Point Cloud Enhancement Driven by Range Images	Ruixin Wu et.al.	2503.02300	null
2025-03-04	h-Edit: Effective and Flexible Diffusion-Based Editing via Doob’s h-Transform	Toan Nguyen et.al.	2503.02187	link
2025-03-03	HanDrawer: Leveraging Spatial Information to Render Realistic Hands Using a Conditional Diffusion Model in Single Stage	Qifan Fu et.al.	2503.02127	null
2025-03-03	Generalized Diffusion Detector: Mining Robust Features from Diffusion Models for Domain-Generalized Detection	Boyong He et.al.	2503.02101	link
2025-02-28	Raccoon: Multi-stage Diffusion Training with Coarse-to-Fine Curating Videos	Zhiyu Tan et.al.	2502.21314	null
2025-02-28	Does Generation Require Memorization? Creative Diffusion Models using Ambient Diffusion	Kulin Shah et.al.	2502.21278	null
2025-02-28	On a class of adversarial classification problems which admit a continuous solution	Guillaume Carlier et.al.	2502.21170	null
2025-02-28	A Review on Generative AI For Text-To-Image and Image-To-Image Generation and Implications To Scientific Images	Zineb Sordo et.al.	2502.21151	null
2025-02-28	Generative Uncertainty in Diffusion Models	Metod Jazbec et.al.	2502.20946	null
2025-02-28	GridOT – a discrete optimal transport solver on grids	Johannes Rauch et.al.	2502.20905	null
2025-02-28	DiffBrush:Just Painting the Art by Your Hands	Jiaming Chu et.al.	2502.20904	null
2025-02-28	Dimension-independent convergence rate of propagation of chaos and numerical analysis for McKean-Vlasov stochastic differential equations with coefficients nonlinearly dependent on measure	Yuhang Zhang et.al.	2502.20786	null
2025-02-28	CADDreamer: CAD object Generation from Single-view Images	Yuan Li et.al.	2502.20732	null
2025-02-28	Diffusion Restoration Adapter for Real-World Image Restoration	Hanbang Liang et.al.	2502.20679	null
2025-02-28	Wavelet-based density sketching with functional hierarchical tensor	Xun Tang et.al.	2502.20655	null
2025-02-28	Gungnir: Exploiting Stylistic Features in Images for Backdoor Attacks on Diffusion Models	Yu Pan et.al.	2502.20650	link
2025-02-28	T2ICount: Enhancing Cross-modal Understanding for Zero-Shot Counting	Yifei Qian et.al.	2502.20625	null
2025-02-27	Unifying Model Predictive Path Integral Control, Reinforcement Learning, and Diffusion Models for Optimal Control and Planning	Yankai Li et.al.	2502.20476	null
2025-02-27	Tight Inversion: Image-Conditioned Inversion for Real Image Editing	Edo Kadosh et.al.	2502.20376	null
2025-02-27	Constrained Generative Modeling with Manually Bridged Diffusion Models	Saeid Naderiparizi et.al.	2502.20371	null
2025-02-27	FlexVAR: Flexible Visual Autoregressive Modeling without Residual Prediction	Siyu Jiao et.al.	2502.20313	link
2025-02-27	Mobius: Text to Seamless Looping Video Generation via Latent Shift	Xiuli Bi et.al.	2502.20307	link
2025-02-27	Explainable, Multi-modal Wound Infection Classification from Images Augmented with Generated Captions	Palawat Busaranuvong et.al.	2502.20277	null
2025-02-27	How cancer emerges: Data-driven universal insights into tumorigenesis via hallmark networks	Jiahe Wang et.al.	2502.20275	link
2025-02-27	Exponential convergence of general iterative proportional fitting procedures	Stephan Eckstein et.al.	2502.20264	null
2025-02-27	Attention Distillation: A Unified Approach to Visual Characteristics Transfer	Yang Zhou et.al.	2502.20235	link
2025-02-27	Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think	Liang Chen et.al.	2502.20172	link
2025-02-28	Robust sensitivity control in digital pathology via tile score distribution matching	Arthur Pignet et.al.	2502.20144	null
2025-02-27	Your contrastive learning problem is secretly a distribution alignment problem	Zihao Chen et.al.	2502.20141	link
2025-02-27	Scalability of the second-order reliability method for stochastic differential equations with multiplicative noise	Timo Schorlepp et.al.	2502.20114	null
2025-02-27	Generative augmentations for improved cardiac ultrasound segmentation using diffusion models	Gilles Van De Vyver et.al.	2502.20100	link
2025-02-27	Image Referenced Sketch Colorization Based on Animation Creation Workflow	Dingkun Yan et.al.	2502.19937	link
2025-02-27	DiffCSS: Diverse and Expressive Conversational Speech Synthesis with Diffusion Models	Weihao wu et.al.	2502.19924	null
2025-02-26	Joint Optimal Transport and Embedding for Network Alignment	Qi Yu et.al.	2502.19334	link
2025-02-26	Polynomial McKean-Vlasov SDEs	Christa Cuchiero et.al.	2502.19203	null
2025-02-26	HDM: Hybrid Diffusion Model for Unified Image Anomaly Detection	Zekang Weng et.al.	2502.19200	null
2025-02-27	RetinaRegen: A Hybrid Model for Readability and Detail Restoration in Fundus Images	Yuhan Tang et.al.	2502.19153	null
2025-02-26	Modulation of the galactic cosmic ray spectrum in an anisotropic diffusion approach	V. D. Borisov et.al.	2502.19062	null
2025-02-26	Foundation Inference Models for Stochastic Differential Equations: A Transformer-based Approach for Zero-shot Function Estimation	Patrick Seifner et.al.	2502.19049	null
2025-02-26	A Dual-Purpose Framework for Backdoor Defense and Backdoor Amplification in Diffusion Models	Vu Tuan Truong Long et.al.	2502.19047	null
2025-02-26	DualSpec: Text-to-spatial-audio Generation via Dual-Spectrogram Guided Diffusion Model	Lei Zhao et.al.	2502.18952	null
2025-02-26	Physics-Aware Inverse Design for Nanowire Single-Photon Avalanche Detectors via Deep Learning	Boyang Zhang et.al.	2502.18857	null
2025-02-26	Optimal Stochastic Trace Estimation in Generative Modeling	Xinyang Liu et.al.	2502.18808	null
2025-02-26	Towards Optimal Multi-draft Speculative Decoding	Zhengmian Hu et.al.	2502.18779	null
2025-02-26	Ptychographic Image Reconstruction from Limited Data via Score-Based Diffusion Models with Physics-Guidance	Refik Mert Cam et.al.	2502.18767	null
2025-02-25	Adaptive conditional latent diffusion maps beam loss to 2D phase space projections	Alexander Scheinker et.al.	2502.18684	null
2025-02-25	Diffusion Models for conditional MRI generation	Miguel Herencia García del Castillo et.al.	2502.18620	null
2025-02-25	K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs	Ziheng Ouyang et.al.	2502.18461	null
2025-02-25	ToMCAT: Theory-of-Mind for Cooperative Agents in Teams via Multiagent Diffusion Policies	Pedro Sequeira et.al.	2502.18438	null
2025-02-25	$N$ -Player Stochastic Differential Games with Regime Switching and Mean Field Convergence	Mingrui Wang et.al.	2502.18333	null
2025-02-25	LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation	Pengzhi Li et.al.	2502.18302	null
2025-02-25	Synthesizing Consistent Novel Views via 3D Epipolar Attention without Re-Training	Botao Ye et.al.	2502.18219	null
2025-02-25	Training Consistency Models with Variational Noise Coupling	Gianluigi Silvestri et.al.	2502.18197	link
2025-02-25	Multi-Perspective Data Augmentation for Few-shot Object Detection	Anh-Khoa Nguyen Vu et.al.	2502.18195	link
2025-02-25	Determined Blind Source Separation with Sinkhorn Divergence-based Optimal Allocation of the Source Power	Jianyu Wang et.al.	2502.18182	null
2025-02-25	CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification	Mingkun Zhang et.al.	2502.18176	link
2025-02-25	Joint Reconstruction of Spatially-Coherent and Realistic Clothed Humans and Objects from a Single Image	Ayushi Dutta et.al.	2502.18150	null
2025-02-25	An approximate solution of a case of perturbed Fokker-Planck equation	Yan Luo et.al.	2502.18131	null
2025-02-25	PromptMID: Modal Invariant Descriptors Based on Diffusion and Vision Foundation Models for Optical-SAR Image Matching	Han Nie et.al.	2502.18104	link
2025-02-25	HEROS-GAN: Honed-Energy Regularized and Optimal Supervised GAN for Enhancing Accuracy and Range of Low-Cost Accelerometers	Yifeng Wang et.al.	2502.18064	null
2025-02-25	Robust Polyp Detection and Diagnosis through Compositional Prompt-Guided Diffusion Models	Jia Yu et.al.	2502.17951	link
2025-02-25	3D Anatomical Structure-guided Deep Learning for Accurate Diffusion Microstructure Imaging	Xinrui Ma et.al.	2502.17933	null
2025-02-24	GCC: Generative Color Constancy via Diffusing a Color Checker	Chen-Wei Chang et.al.	2502.17435	null
2025-02-24	S4S: Solving for a Diffusion Model Solver	Eric Frankel et.al.	2502.17423	null
2025-02-24	X-Dancer: Expressive Music to Human Dance Video Generation	Zeyuan Chen et.al.	2502.17414	null
2025-02-24	AnyTop: Character Animation Diffusion with Any Topology	Inbar Gat et.al.	2502.17327	link
2025-02-24	VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing	Xiangpeng Yang et.al.	2502.17258	null
2025-02-24	Dimitra: Audio-driven Diffusion model for Expressive Talking Head Generation	Baptiste Chopin et.al.	2502.17198	null
2025-02-24	Scaling Limits for Exponential Hedging in the Brownian Framework	Yan Dolinksy et.al.	2502.17186	null
2025-02-25	DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks	Canyu Zhao et.al.	2502.17157	link
2025-02-24	Strong convergence of the adaptive Milstein method for nonlinear stochastic differential equations with piecewise continuous arguments	Yuhang Zhang et.al.	2502.17126	null
2025-02-24	Diffusion Models for Tabular Data: Challenges, Current Progress, and Future Directions	Zhong Li et.al.	2502.17119	link
2025-02-24	SFLD: Reducing the content bias for AI-generated Image Detection	Seoyeon Gye et.al.	2502.17105	null
2025-02-25	Generative Models in Decision Making: A Survey	Yinchuan Li et.al.	2502.17100	null
2025-02-24	Conditional Diffusion-Flow models for generating 3D cosmic density fields: applications to f(R) cosmologies	Julieth Katherine Riveros et.al.	2502.17087	link
2025-02-24	Quasi-likelihood ratio test for jump-diffusion processes based on adaptive maximum likelihood inference	Hiromasa Nishikawa et.al.	2502.17058	null
2025-02-24	SpecDM: Hyperspectral Dataset Synthesis with Pixel-level Semantic Annotations	Wendi Liu et.al.	2502.17056	null
2025-02-21	One-step Diffusion Models with $f$ -Divergence Distribution Matching	Yilun Xu et.al.	2502.15681	null
2025-02-21	Kinetic Optimal Transport (OTIKIN) – Part 1: Second-Order Discrepancies Between Probability Measures	Giovanni Brigati et.al.	2502.15665	null
2025-02-21	Convergence rates for the vanishing viscosity approximation of Hamilton-Jacobi equations: the convex case	Marco Cirant et.al.	2502.15495	null
2025-02-21	Modeling Infectious Diseases: From SIR Models to Diffusion-Based Approaches and Numerical Solutions	Ayesha Baig et.al.	2502.15439	null
2025-02-21	Audio signal interpolation using optimal transportation of spectrograms	David Valdivia et.al.	2502.15430	null
2025-02-21	BundleFlow: Deep Menus for Combinatorial Auctions by Diffusion-Based Optimization	Tonghan Wang et.al.	2502.15283	null
2025-02-21	CopyJudge: Automated Copyright Infringement Identification and Mitigation in Text-to-Image Diffusion Models	Shunchang Liu et.al.	2502.15278	null
2025-02-21	Lung-DDPM: Semantic Layout-guided Diffusion Models for Thoracic CT Image Synthesis	Yifan Jiang et.al.	2502.15204	link
2025-02-21	Methods and Trends in Detecting Generated Images: A Comprehensive Review	Arpan Mahara et.al.	2502.15176	null
2025-02-20	Pseudoinverse Diffusion Models for Generative CT Image Reconstruction from Low Dose Data	Matthew Tivnan et.al.	2502.15064	null
2025-02-20	FIP: Endowing Robust Motion Capture on Daily Garment by Fusing Flex and Inertial Sensors	Jiawei Fang et.al.	2502.15058	null
2025-02-20	Generative Super-Resolution PET Imaging with Fourier Diffusion Models	Matthew Tivnan et.al.	2502.15055	null
2025-02-20	DDAT: Diffusion Policies Enforcing Dynamically Admissible Robot Trajectories	Jean-Baptiste Bouvier et.al.	2502.15043	null
2025-02-20	Improving the Diffusability of Autoencoders	Ivan Skorokhodov et.al.	2502.14831	null
2025-02-20	A Survey on Text-Driven 360-Degree Panorama Generation	Hai Wang et.al.	2502.14799	null
2025-02-20	DC-ControlNet: Decoupling Inter- and Intra-Element Conditions in Image Generation with Diffusion Models	Hongji Yang et.al.	2502.14779	null
2025-02-20	Multi-Layer Deep xVA: Structural Credit Models, Measure Changes and Convergence Analysis	Kristoffer Andersson et.al.	2502.14766	null
2025-02-20	PEARL: Towards Permutation-Resilient LLMs	Liang Chen et.al.	2502.14628	link
2025-02-20	Central Limit Theorem for Irregular Discretization Scheme of Multilevel Monte Carlo Method	Yi Guo et.al.	2502.14395	null
2025-02-20	A Similarity Paradigm Through Textual Regularization Without Forgetting	Fangming Cui et.al.	2502.14376	null
2025-02-20	Textured 3D Regenerative Morphing with 3D Diffusion Prior	Songlin Yang et.al.	2502.14316	null
2025-02-19	Population Dynamics Control with Partial Observations	Zhou Lu et.al.	2502.14079	null
2025-02-19	DiffExp: Efficient Exploration in Reward Fine-tuning for Text-to-Image Diffusion Models	Daewon Chae et.al.	2502.14070	null
2025-02-19	d-Sketch: Improving Visual Fidelity of Sketch-to-Image Translation with Pretrained Latent Diffusion Models without Retraining	Prasun Roy et.al.	2502.14007	link
2025-02-19	Im2SurfTex: Surface Texture Generation via Neural Backprojection of Multi-View Images	Yiangos Georgiou et.al.	2502.14006	null
2025-02-19	IP-Composer: Semantic Composition of Visual Concepts	Sara Dorfman et.al.	2502.13951	null
2025-02-19	TESS 2: A Large-Scale Generalist Diffusion Language Model	Jaesung Tae et.al.	2502.13917	link
2025-02-19	Reverse Markov Learning: Multi-Step Generative Models for Complex Distributions	Xinwei Shen et.al.	2502.13747	null
2025-02-19	RestoreGrad: Signal Restoration Using Conditional Denoising Diffusion Models with Jointly Learned Prior	Ching-Hua Lee et.al.	2502.13574	null
2025-02-19	Diffusion Model Agnostic Social Influence Maximization in Hyperbolic Space	Hongliang Qiao et.al.	2502.13571	null
2025-02-19	Kernel Mean Embedding Topology: Weak and Strong Forms for Stochastic Kernels and Implications for Model Learning	Naci Saldi et.al.	2502.13486	null
2025-02-19	Interleaved Gibbs Diffusion for Constrained Generation	Gautham Govind Anil et.al.	2502.13450	null
2025-02-18	Secure and Efficient Watermarking for Latent Diffusion Models in Model Distribution Scenarios	Liangqi Lei et.al.	2502.13345	null
2025-02-18	Geometry-Aware Diffusion Models for Multiview Scene Inpainting	Ahmad Salimi et.al.	2502.13335	null
2025-02-18	MotionMatcher: Motion Customization of Text-to-Video Diffusion Models via Motion Feature Matching	Yen-Siang Wu et.al.	2502.13234	null
2025-02-18	Is Noise Conditioning Necessary for Denoising Generative Models?	Qiao Sun et.al.	2502.13129	null
2025-02-18	Score Matching Riemannian Diffusion Means	Frederik Möbius Rygaard et.al.	2502.13106	null
2025-02-18	Personalized Image Generation with Deep Generative Models: A Decade Survey	Yuxiang Wei et.al.	2502.13081	link
2025-02-18	Does Training with Synthetic Data Truly Protect Privacy?	Yunpeng Zhao et.al.	2502.12976	link
2025-02-18	A measure-valued HJB perspective on Bayesian optimal adaptive control	Alexander M. G. Cox et.al.	2502.12957	null
2025-02-18	Guaranteed Conditional Diffusion: 3D Block-based Models for Scientific Data Compression	Jaemoon Lee et.al.	2502.12951	null
2025-02-18	Stochastic Parareal Algorithm for Stochastic Differential Equations	Huanxin Wang et.al.	2502.12909	null
2025-02-18	RAPID: Retrieval Augmented Training of Differentially Private Diffusion Models	Tanqiu Jiang et.al.	2502.12794	link
2025-02-18	Unsupervised Anomaly Detection through Mass Repulsing Optimal Transport	Eduardo Fernandes Montesuma et.al.	2502.12793	null
2025-02-18	Composition and Control with Distilled Energy Diffusion Models and Sequential Monte Carlo	James Thornton et.al.	2502.12786	null
2025-02-18	Approximation theorems for stochastic integrals and stochastic differential equations concerning weak solutions with the spatial variable involved	Xi Lin et.al.	2502.12765	null
2025-02-18	High-Fidelity Novel View Synthesis via Splatting-Guided Diffusion	Xiang Zhang et.al.	2502.12752	null
2025-02-18	3D Shape-to-Image Brownian Bridge Diffusion for Brain MRI Synthesis from Cortical Surfaces	Fabian Bongratz et.al.	2502.12742	null
2025-02-18	Using Sinkhorn in the JKO scheme adds linear diffusion	Aymeric Baradat et.al.	2502.12666	null
2025-02-18	NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation	Zhiyuan Liu et.al.	2502.12638	link
2025-02-17	Diffusion Models without Classifier-free Guidance	Zhicong Tang et.al.	2502.12154	link
2025-02-17	Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening	Ye Tian et.al.	2502.12146	link
2025-02-17	How compositional generalization and creativity improve as diffusion models are trained	Alessandro Favero et.al.	2502.12089	null
2025-02-17	HumanGif: Single-View Human Diffusion with Generative Prior	Shoukang Hu et.al.	2502.12080	link
2025-02-17	Fractional Sobolev paths on Wasserstein spaces and their energy-minimizing particle representations	Ehsan Abedi et.al.	2502.12068	null
2025-02-17	A Survey on Bridging EEG Signals and Generative AI: From Image and Text to Beyond	Shreya Shukla et.al.	2502.12048	null
2025-02-17	Unsupervised Structural-Counterfactual Generation under Domain Shift	Krishn Vishwas Kher et.al.	2502.12013	null
2025-02-17	Characterizing Photorealism and Artifacts in Diffusion Model-Generated Images	Negar Kamali et.al.	2502.11989	link
2025-02-17	Image Inversion: A Survey from GANs to Diffusion and Beyond	Yinan Chen et.al.	2502.11974	link
2025-02-17	Approximating a spatially-heterogeneously mass-emitting object by multiple point sources in a diffusion model	Qiyao Peng et.al.	2502.11908	null
2025-02-17	BackdoorDM: A Comprehensive Benchmark for Backdoor Learning in Diffusion Model	Weilin Lin et.al.	2502.11798	link
2025-02-17	MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow	Hanzhuo Huang et.al.	2502.11697	null
2025-02-17	GaussianMotion: End-to-End Learning of Animatable Gaussian Avatars with Pose Guidance from Text	Gyumin Shim et.al.	2502.11642	null
2025-02-17	Membership Inference Attacks for Face Images Against Fine-Tuned Latent Diffusion Models	Lauritz Christian Holme et.al.	2502.11619	null
2025-02-18	Maximum Entropy Reinforcement Learning with Diffusion Policy	Xiaoyi Dong et.al.	2502.11612	link
2025-02-14	Region-Adaptive Sampling for Diffusion Transformers	Ziming Liu et.al.	2502.10389	null
2025-02-14	On creating convexity in high dimensions	Samuel G. G. Johnston et.al.	2502.10382	null
2025-02-14	ReStyle3D: Scene-Level Appearance Transfer with Semantic Correspondences	Liyuan Zhu et.al.	2502.10377	null
2025-02-14	Dimension-free Score Matching and Time Bootstrapping for Diffusion Models	Syamantak Kumar et.al.	2502.10354	null
2025-02-14	DiOpt: Self-supervised Diffusion for Constrained Optimization	Shutong Ding et.al.	2502.10330	null
2025-02-14	Generalised Parallel Tempering: Flexible Replica Exchange via Flows and Diffusions	Leo Zhang et.al.	2502.10328	null
2025-02-14	Dark Matter Attenuation Effects: Sensitivity Ceilings for Spin-Dependent and Spin-Independent Interactions	QUEST-DMC Collaboration et.al.	2502.10251	null
2025-02-14	Shaping Inductive Bias in Diffusion Models through Frequency-Based Noise Control	Thomas Jiralerspong et.al.	2502.10236	null
2025-02-14	Agentic End-to-End De Novo Protein Design for Tailored Dynamics Using a Language Diffusion Model	Bo Ni et.al.	2502.10173	null
2025-02-14	IRS-assisted Edge Computing for Vehicular Networks: A Generative Diffusion Model-based Stackelberg Game Approach	Yixian Wang et.al.	2502.10149	null
2025-02-14	Diffusion Trajectory-guided Policy for Long-horizon Robot Manipulation	Shichao Fan et.al.	2502.10040	null
2025-02-14	Large Language Diffusion Models	Shen Nie et.al.	2502.09992	null
2025-02-14	Generating on Generated: An Approach Towards Self-Evolving Diffusion Models	Xulu Zhang et.al.	2502.09963	null
2025-02-14	Precise Parameter Localization for Textual Generation in Diffusion Models	Łukasz Staniszewski et.al.	2502.09935	null
2025-02-14	Fused Partial Gromov-Wasserstein for Structured Objects	Yikun Bai et.al.	2502.09934	null
2025-02-13	Theoretical Benefit and Limitation of Diffusion Language Model	Guhao Feng et.al.	2502.09622	null
2025-02-13	RigAnything: Template-Free Autoregressive Rigging for Diverse 3D Assets	Isabella Liu et.al.	2502.09615	null
2025-02-14	Score-of-Mixture Training: Training One-Step Generative Models Made Simple via Score Estimation of Mixture Distributions	Tejas Jayashankar et.al.	2502.09609	null
2025-02-13	Rolling Ahead Diffusion for Traffic Scene Simulation	Yunpeng Liu et.al.	2502.09587	null
2025-02-13	Memorization and Generalization in Generative Diffusion under the Manifold Hypothesis	Beatrice Achilli et.al.	2502.09578	null
2025-02-13	DiffMS: Diffusion Generation of Molecules Conditioned on Mass Spectra	Montgomery Bohde et.al.	2502.09571	link
2025-02-13	Statistical Equilibrium of Optimistic Beliefs	Yu Gui et.al.	2502.09569	null
2025-02-13	Diffusing DeBias: a Recipe for Turning a Bug into a Feature	Massimiliano Ciranni et.al.	2502.09564	null
2025-02-13	Long-Term TalkingFace Generation via Motion-Prior Conditional Diffusion Model	Fei Shen et.al.	2502.09533	null
2025-02-13	Diffusion Models for Molecules: A Survey of Methods and Tasks	Liang Wang et.al.	2502.09511	link
2025-02-13	Journey from the Wilson exact RG towards the Wegner-Morris Fokker-Planck RG and the Carosso field-coarsening via Langevin stochastic processes	Cecile Monthus et.al.	2502.09506	null
2025-02-13	A class of point-wise operating SPDE coefficients for HJM models	Nils Detering et.al.	2502.09486	null
2025-02-13	Redistribute Ensemble Training for Mitigating Memorization in Diffusion Models	Xiaoliu Guan et.al.	2502.09434	link
2025-02-13	ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation	Rotem Shalev-Arkushin et.al.	2502.09411	null
2025-02-13	Non-asymptotic Analysis of Diffusion Annealed Langevin Monte Carlo for Generative Modelling	Paula Cordero-Encinar et.al.	2502.09306	null
2025-02-12	SwiftSketch: A Diffusion Model for Image-to-Vector Sketch Generation	Ellie Arar et.al.	2502.08642	null
2025-02-12	CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation	Qinghe Wang et.al.	2502.08639	null
2025-02-12	Chasing Charge Carriers: Diffusion Dynamics in Mixed-n Quasi-Two-Dimensional Colloidal MAPbBr3 Perovskites	Ronja Maria Piehler et.al.	2502.08601	null
2025-02-12	Enhancing Diffusion Models Efficiency by Disentangling Total-Variance and Signal-to-Noise Ratio	Khaled Kahouli et.al.	2502.08598	link
2025-02-12	Light-A-Video: Training-free Video Relighting via Progressive Light Fusion	Yujie Zhou et.al.	2502.08590	link
2025-02-12	Ultrasound Image Generation using Latent Diffusion Models	Benoit Freiche et.al.	2502.08580	null
2025-02-12	Mapping the Landscape of Generative AI in Network Monitoring and Management	Giampaolo Bovenzi et.al.	2502.08576	null
2025-02-12	BCDDM: Branch-Corrected Denoising Diffusion Model for Black Hole Image Generation	Ao liu et.al.	2502.08528	null
2025-02-12	One-Shot Federated Learning with Classifier-Free Diffusion Models	Obaidullah Zaland et.al.	2502.08488	null
2025-02-12	Discontinuous stochastic forcing in Greenland ice core data	Keno Riechers et.al.	2502.08460	null
2025-02-12	A Survey on Pre-Trained Diffusion Model Distillations	Xuhui Fan et.al.	2502.08364	null
2025-02-12	A posteriori error control for a finite volume scheme for a cross-diffusion model of ion transport	Arne Berrens et.al.	2502.08306	null
2025-02-12	BEAM: Bridging Physically-based Rendering and Gaussian Modeling for Relightable Volumetric Video	Yu Hong et.al.	2502.08297	null
2025-02-12	FloVD: Optical Flow Meets Video Diffusion Model for Enhanced Camera-Controlled Video Synthesis	Wonjoon Jin et.al.	2502.08244	null
2025-02-12	DNNs May Determine Major Properties of Their Outputs Early, with Timing Possibly Driven by Bias	Song Park et.al.	2502.08167	null
2025-02-11	MatSwap: Light-aware material transfers in images	Ivan Lopes et.al.	2502.07784	null
2025-02-11	A note on the $\mathcal{W}_2$-convergence rate of the empirical measure of an ergodic $\mathbb{R}^d$ -valued diffusion	Jean-Francois Chassagneux et.al.	2502.07704	null
2025-02-11	Private Low-Rank Approximation for Covariance Matrices, Dyson Brownian Motion, and Eigenvalue-Gap Bounds for Gaussian Perturbations	Oren Mangoubi et.al.	2502.07657	null
2025-02-11	Consistency Training with Physical Constraints	Che-Chia Chang et.al.	2502.07636	null
2025-02-11	Understanding the Generalization Error of Markov algorithms through Poissonization	Benjamin Dupuis et.al.	2502.07584	null
2025-02-11	Generative Modeling with Bayesian Sample Inference	Marten Lienen et.al.	2502.07580	link
2025-02-11	Single-Step Consistent Diffusion Samplers	Pascal Jutras-Dubé et.al.	2502.07579	null
2025-02-11	The Devil is in the Prompts: De-Identification Traces Enhance Memorization Risks in Synthetic Chest X-Ray Generation	Raman Dutt et.al.	2502.07516	link
2025-02-11	Joint Metric Space Embedding by Unbalanced OT with Gromov-Wasserstein Marginal Penalization	Florian Beier et.al.	2502.07510	link
2025-02-11	Less is More: Masking Elements in Image Condition Features Avoids Content Leakages in Style Transfer Diffusion Models	Lin Zhu et.al.	2502.07466	link
2025-02-11	MGPATH: Vision-Language Model with Multi-Granular Prompt Learning for Few-Shot WSI Classification	Anh-Tien Nguyen et.al.	2502.07409	link
2025-02-11	Bandit Optimal Transport	Lorenzo Croissant et.al.	2502.07397	null
2025-02-11	Nonlinear Open-Loop Mean field Stackelberg Stochastic Differential Game	Jianhui Huang et.al.	2502.07390	null
2025-02-12	Spatial Degradation-Aware and Temporal Consistent Diffusion Model for Compressed Video Super-Resolution	Hongyu An et.al.	2502.07381	null
2025-02-11	Colloidal Model for Investigating Optimal Efficiency in Weakly Coupled Ratchet Motors	José Martín-Roca et.al.	2502.07362	null
2025-02-10	Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked Diffusions	Jaeyeon Kim et.al.	2502.06768	null
2025-02-10	History-Guided Video Diffusion	Kiwhan Song et.al.	2502.06764	null
2025-02-10	Rough Stochastic Pontryagin Maximum Principle and an Indirect Shooting Method	Thomas Lew et.al.	2502.06726	null
2025-02-10	Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene	Tai-Yu Pan et.al.	2502.06682	null
2025-02-11	Unleashing the Potential of Pre-Trained Diffusion Models for Generalizable Person Re-Identification	Jiachen Li et.al.	2502.06619	link
2025-02-10	MaterialFusion: High-Quality, Zero-Shot, and Controllable Material Transfer with Diffusion Models	Kamil Garifullin et.al.	2502.06606	null
2025-02-10	A Large-scale AI-generated Image Inpainting Benchmark	Paschalis Giakoumoglou et.al.	2502.06593	null
2025-02-10	Diffusion Models for Computational Neuroimaging: A Survey	Haokai Zhao et.al.	2502.06552	link
2025-02-10	Properties of Wasserstein Gradient Flows for the Sliced-Wasserstein Distance	Christophe Vauthier et.al.	2502.06525	null
2025-02-10	Boost-and-Skip: A Simple Guidance-Free Diffusion for Minority Generation	Soobin Um et.al.	2502.06516	link
2025-02-10	WyckoffDiff - A Generative Diffusion Model for Crystal Symmetry	Filip Ekström Kelvinius et.al.	2502.06485	link
2025-02-10	Habitizing Diffusion Planning for Efficient and Effective Decision Making	Haofei Lu et.al.	2502.06401	link
2025-02-10	TANGLED: Generating 3D Hair Strands from Images with Arbitrary Styles and Viewpoints	Pengyu Long et.al.	2502.06392	null
2025-02-10	Solving Linear-Gaussian Bayesian Inverse Problems with Decoupled Diffusion Sequential Monte Carlo	Filip Ekström Kelvinius et.al.	2502.06379	null
2025-02-10	The randomization method in stochastic optimal control	Marco Fuhrman et.al.	2502.06356	null
2025-02-07	FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation	Shilong Zhang et.al.	2502.05179	link
2025-02-07	Hummingbird: High Fidelity Image Generation via Multimodal Context Alignment	Minh-Quan Le et.al.	2502.05153	null
2025-02-07	Beautiful Images, Toxic Words: Understanding and Addressing Offensive Text in Generated Images	Aditya Kumar et.al.	2502.05066	link
2025-02-07	Stochastic neutral fractions and the effective population size	Raphaël Forien et.al.	2502.05062	null
2025-02-07	On modified Euler methods for McKean-Vlasov stochastic differential equations with super-linear coefficients	Jiamin Jian et.al.	2502.05057	null
2025-02-07	Robust Graph Learning Against Adversarial Evasion Attacks via Prior-Free Diffusion-Based Structure Purification	Jiayi Luo et.al.	2502.05000	link
2025-02-07	C2GM: Cascading Conditional Generation of Multi-scale Maps from Remote Sensing Images Constrained by Geographic Features	Chenxing Sun et.al.	2502.04991	null
2025-02-07	Scalable and consistent embedding of probability measures into Hilbert spaces via measure quantization	Erell Gachon et.al.	2502.04907	null
2025-02-07	Advancing Wasserstein Convergence Analysis of Score-Based Models: Insights from Discretization and Second-Order Acceleration	Yifeng Yu et.al.	2502.04849	null
2025-02-07	Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning	Chen-Xiao Gao et.al.	2502.04778	null
2025-02-07	Can Diffusion Models Learn Hidden Inter-Feature Rules Behind Images?	Yujin Han et.al.	2502.04725	null
2025-02-07	G2PDiffusion: Genotype-to-Phenotype Prediction with Diffusion Models	Mengdi Liu et.al.	2502.04684	null
2025-02-07	CCS: Controllable and Constrained Sampling with Diffusion Models via Initial Noise Perturbation	Bowen Song et.al.	2502.04670	null
2025-02-07	A Comprehensive Review on Noise Control of Diffusion Model	Zhehao Guo et.al.	2502.04669	null
2025-02-07	Overcoming Fake Solutions in Semi-Dual Neural Optimal Transport: A Smoothing Approach for Learning the Optimal Transport Plan	Jaemoo Choi et.al.	2502.04583	null
2025-02-06	HOG-Diff: Higher-Order Guided Diffusion for Graph Generation	Yiming Huang et.al.	2502.04308	link
2025-02-06	MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation	Jinbo Xing et.al.	2502.04299	null
2025-02-06	Diffusion-based mass map reconstruction from weak lensing data	Supranta S. Boruah et.al.	2502.04158	null
2025-02-06	Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis	Zhen Ye et.al.	2502.04128	link
2025-02-06	Generative Adversarial Networks Bridging Art and Machine Intelligence	Junhao Song et.al.	2502.04116	null
2025-02-06	TQ-DiT: Efficient Time-Aware Quantization for Diffusion Transformers	Younghye Hwang et.al.	2502.04056	null
2025-02-06	PartEdit: Fine-Grained Image Editing using Pre-Trained Diffusion Models	Aleksandar Cvejic et.al.	2502.04050	null
2025-02-06	Hierarchical Entropic Diffusion for Ransomware Detection: A Probabilistic Approach to Behavioral Anomaly Isolation	Vasili Iskorohodov et.al.	2502.03882	null
2025-02-06	Interactions between resource dependent branching processes and equilibria	F. Thomas Bruss et.al.	2502.03872	null
2025-02-06	DeblurDiff: Real-World Image Deblurring with Generative Diffusion Models	Lingshun Kong et.al.	2502.03810	null
2025-02-06	PINS: Proximal Iterations with Sparse Newton and Sinkhorn for Optimal Transport	Di Wu et.al.	2502.03749	null
2025-02-06	DICE: Distilling Classifier-Free Guidance into Text Embeddings	Zhenyu Zhou et.al.	2502.03726	null
2025-02-06	SDEs with subcritical Lebesgue–Hölder drifts and driven by $α$ -stable processes	Rongrong Tian et.al.	2502.03712	null
2025-02-06	Conditional Diffusion Models are Medical Image Classifiers that Provide Explainability and Uncertainty for Free	Gian Mario Favero et.al.	2502.03687	null
2025-02-06	Variational Control for Guidance in Diffusion Models	Kushagra Pandey et.al.	2502.03686	link
2025-02-05	Dress-1-to-3: Single Image to Simulation-Ready 3D Outfit with Diffusion Prior and Differentiable Physics	Xuan Li et.al.	2502.03449	null
2025-02-05	Masked Autoencoders Are Effective Tokenizers for Diffusion Models	Hao Chen et.al.	2502.03444	null
2025-02-05	Linearized Optimal Transport pyLOT Library: A Toolkit for Machine Learning on Point Clouds	Jun Linwu et.al.	2502.03439	null
2025-02-05	TruePose: Human-Parsing-guided Attention Diffusion for Full-ID Preserving Pose Transfer	Zhihong Xu et.al.	2502.03426	null
2025-02-05	A Mixture-Based Framework for Guiding Diffusion Models	Yazid Janati et.al.	2502.03332	link
2025-02-05	An efficient end-to-end computational framework for the generation of ECG calibrated volumetric models of human atrial electrophysiology	Elena Zappon et.al.	2502.03322	null
2025-02-05	MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent	Xinyao Liao et.al.	2502.03207	null
2025-02-05	Multilevel Picard approximations for McKean-Vlasov stochastic differential equations with nonconstant diffusion	Ariel Neufeld et.al.	2502.03205	link
2025-02-05	Poisson Flow Joint Model for Multiphase contrast-enhanced CT	Rongjun Ge et.al.	2502.03079	null
2025-02-05	Direct Distributional Optimization for Provable Alignment of Diffusion Models	Ryotaro Kawata et.al.	2502.02954	null
2025-02-05	Fast T2T: Optimization Consistency Speeds Up Diffusion-Based Training-to-Testing Solving for Combinatorial Optimization	Yang Li et.al.	2502.02941	null
2025-02-05	Data denoising with self consistency, variance maximization, and the Kantorovich dominance	Joshua Zoen-Git Hiew et.al.	2502.02925	null
2025-02-05	Elucidating the Preconditioning in Consistency Distillation	Kaiwen Zheng et.al.	2502.02922	null
2025-02-04	When are Diffusion Priors Helpful in Sparse Reconstruction? A Study with Sparse-view CT	Matt Y. Cheung et.al.	2502.02771	null
2025-02-04	Multimarginal Schrödinger Barycenter	Pengtao Li et.al.	2502.02726	null
2025-02-04	Calibrated Multi-Preference Optimization for Aligning Diffusion Models	Kyungmin Lee et.al.	2502.02588	null
2025-02-04	Open Materials Generation with Stochastic Interpolants	Philipp Hoellmer et.al.	2502.02582	null
2025-02-04	Diff9D: Diffusion-Based Domain-Generalized Category-Level 9-DoF Object Pose Estimation	Jian Liu et.al.	2502.02525	link
2025-02-04	Privacy Attacks on Image AutoRegressive Models	Antoni Kowalczuk et.al.	2502.02514	link
2025-02-04	Generative Modeling on Lie Groups via Euclidean Generalized Score Matching	Marco Bertolini et.al.	2502.02513	null
2025-02-04	Do Graph Diffusion Models Accurately Capture and Generate Substructure Distributions?	Xiyuan Wang et.al.	2502.02488	null
2025-02-04	Distributional Diffusion Models with Scoring Rules	Valentin De Bortoli et.al.	2502.02483	null
2025-02-04	SDE Matching: Scalable and Simulation-Free Training of Latent Stochastic Differential Equations	Grigory Bartosh et.al.	2502.02472	null
2025-02-04	Towards Consistent and Controllable Image Synthesis for Face Editing	Mengting Wei et.al.	2502.02465	null
2025-02-04	Sparse Data Generation Using Diffusion Models	Phil Ostheimer et.al.	2502.02448	null
2025-02-04	Towards Fast Graph Generation via Autoregressive Noisy Filtration Modeling	Markus Krimmel et.al.	2502.02415	link
2025-02-04	DIME:Diffusion-Based Maximum Entropy Reinforcement Learning	Onur Celik et.al.	2502.02316	null
2025-02-05	A User’s Guide to Sampling Strategies for Sliced Optimal Transport	Keanu Sisouk et.al.	2502.02275	null
2025-02-04	One-sided measure theoretic elliptic operators and applications to SDEs driven by Gaussian white noise with atomic intensity	Alexandre B. Simas et.al.	2502.02264	null
2025-02-04	Exploring the latent space of diffusion models directly through singular value decomposition	Li Wang et.al.	2502.02225	null
2025-01-31	Beyond Fixed Horizons: A Theoretical Framework for Adaptive Denoising Diffusions	Sören Christensen et.al.	2501.19373	null
2025-01-31	Optimal transportation and pressure at zero temperature	Jairo K. Mengue et.al.	2501.19369	null
2025-01-31	Pathological MRI Segmentation by Synthetic Pathological Data Generation in Fetuses and Neonates	Misha P. T Kaandorp et.al.	2501.19338	null
2025-01-31	Medical Semantic Segmentation with Diffusion Pretrain	David Li et.al.	2501.19265	null
2025-01-31	Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search	Yuta Oshima et.al.	2501.19252	null
2025-01-31	Convergence of the micro-macro Parareal Method for a Linear Scale-Separated Ornstein-Uhlenbeck SDE: extended version	Ignace Bossuyt et.al.	2501.19210	null
2025-01-31	Strong uniform Wong–Zakai approximations of Lévy-driven Marcus SDEs	Ilya Pavlyukevich et.al.	2501.19175	null
2025-01-31	PSyDUCK: Training-Free Steganography for Latent Diffusion	Georgia Channing et.al.	2501.19172	null
2025-01-31	RMDM: Radio Map Diffusion Model with Physics Informed	Haozhe Jia et.al.	2501.19160	link
2025-01-31	Ambient Denoising Diffusion Generative Adversarial Networks for Establishing Stochastic Object Models from Noisy Image Data	Xichen Xu et.al.	2501.19094	null
2025-01-31	MotionPCM: Real-Time Motion Synthesis with Phased Consistency Model	Lei Jiang et.al.	2501.19083	null
2025-01-31	New perspectives on the d’Alembertian from general relativity. An invitation	Mathias Braun et.al.	2501.19071	null
2025-01-31	Concept Steerers: Leveraging K-Sparse Autoencoders for Controllable Generations	Dahye Kim et.al.	2501.19066	link
2025-01-31	Collaborative Diffusion Model for Recommender System	Gyuseok Lee et.al.	2501.18997	null
2025-01-31	Optimal Transport-based Conformal Prediction	Gauthier Thurin et.al.	2501.18991	null
2025-01-30	DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models	Ruofan Liang et.al.	2501.18590	null
2025-01-30	Log-Gaussian Cox Processes on General Metric Graphs	David Bolin et.al.	2501.18558	null
2025-01-30	Free-T2M: Frequency Enhanced Text-to-Motion Diffusion Model With Consistency Loss	Wenshuo Chen et.al.	2501.18232	link
2025-01-30	Inverse source problem of sub-diffusion of variable exponent	Zhiyuan Li et.al.	2501.18228	null
2025-01-30	Dual-Bounded Nonlinear Optimal Transport for Size Constrained Min Cut Clustering	Fangyuan Xie et.al.	2501.18143	null
2025-01-29	Stochastic scattering control of spider diffusion governed by an optimal diffraction probability measure selected from its own local-time	Isaac Ohavi et.al.	2501.18057	null
2025-01-29	SAeUron: Interpretable Concept Unlearning in Diffusion Models with Sparse Autoencoders	Bartosz Cywiński et.al.	2501.18052	link
2025-01-29	An optimal level of Stubbornness to win a soccer match	Paramahansa Pramanik et.al.	2501.18050	null
2025-01-29	Conformal Dimensions On Causal Random Geometry	Ryan Barouki et.al.	2501.17930	null
2025-01-31	Consensus Based Stochastic Control	Liyao Lyu et.al.	2501.17801	link
2025-01-29	BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation – Challenges and Insights	Chan-Jan Hsu et.al.	2501.17790	null
2025-01-29	VICCA: Visual Interpretation and Comprehension of Chest X-ray Anomalies in Generated Report Without Human Feedback	Sayeh Gholipour Picha et.al.	2501.17726	link
2025-01-29	Distinguished Quantized Guidance for Diffusion-based Sequence Recommendation	Wenyu Mao et.al.	2501.17670	null
2025-01-29	On the Singular Control of a Diffusion and Its Running Infimum or Supremum	Giorgio Ferrari et.al.	2501.17577	null
2025-01-29	Solving Inverse Problems using Diffusion with Fast Iterative Renoising	Matt C. Bendel et.al.	2501.17468	null
2025-01-29	NF-MKV Net: A Constraint-Preserving Neural Network Approach to Solving Mean-Field Games Equilibrium	Jinwei Liu et.al.	2501.17450	null
2025-01-28	MDDM: A Molecular Dynamics Diffusion Model to Predict Particle Self-Assembly	Kevin Ferguson et.al.	2501.17319	null
2025-01-28	Deterministic Optimal Transport-based Gaussian Mixture Particle Filtering for Verifiable Applications	Andrey A Popov et.al.	2501.17302	null
2025-01-28	CubeDiff: Repurposing Diffusion-Based Image Models for Panorama Generation	Nikolai Kalischek et.al.	2501.17162	null
2025-01-28	IC-Portrait: In-Context Matching for View-Consistent Personalized Portrait	Han Yang et.al.	2501.17159	null
2025-01-28	Generative diffusion models from a PDE perspective	Fei Cao et.al.	2501.17054	null
2025-01-28	Adversarial Masked Autoencoder Purifier with Defense Transferability	Yuan-Chih Chen et.al.	2501.16904	null
2025-01-28	DIRIGENt: End-To-End Robotic Imitation of Human Demonstrations Based on a Diffusion Model	Josua Spisak et.al.	2501.16800	null
2025-01-28	A Stochastic Dynamical Theory of LLM Self-Adversariality: Modeling Severity Drift as a Critical Process	Jack David Carson et.al.	2501.16783	null
2025-01-28	FlexMotion: Lightweight, Physics-Aware, and Controllable Human Motion Generation	Arvin Tashakori et.al.	2501.16778	null
2025-01-28	DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation	Chenguo Lin et.al.	2501.16764	null
2025-01-28	ITVTON:Virtual Try-On Diffusion Transformer Model Based on Integrated Image and Text	Haifeng Ni et.al.	2501.16757	null
2025-01-28	Consistency Diffusion Models for Single-Image 3D Reconstruction with Priors	Chenru Jiang et.al.	2501.16737	null
2025-01-28	Separate Motion from Appearance: Customizing Motion via Customizing Text-to-Video Diffusion Models	Huijie Liu et.al.	2501.16714	null
2025-01-28	CascadeV: An Implementation of Wurstchen Architecture for Video Generation	Wenfeng Lin et.al.	2501.16612	link
2025-01-27	PackDiT: Joint Human Motion and Text Generation via Mutual Prompting	Zhongyu Jiang et.al.	2501.16551	null
2025-01-27	PhysAnimator: Physics-Guided Generative Cartoon Animation	Tianyi Xie et.al.	2501.16550	null
2025-01-27	Decrypting the temperature field in flow boiling with latent diffusion models	UngJin Na et.al.	2501.16510	null
2025-01-27	RelightVid: Temporal-Consistent Diffusion Model for Video Relighting	Ye Fang et.al.	2501.16330	null
2025-01-27	The Fundamental Theorem of Weak Optimal Transport	Mathias Beiglböck et.al.	2501.16316	null
2025-01-27	Congested Crossing Pedestrian Traffic Flow : Dispersion vs. Transport in Crowded Areas	Mariam Al Khatib et.al.	2501.16275	null
2025-01-27	UDBE: Unsupervised Diffusion-based Brightness Enhancement in Underwater Images	Tatiana Taís Schein et.al.	2501.16211	link
2025-01-27	Multi-front dynamics in spatially inhomogeneous Allen-Cahn equations	Robbin Bastiaansen et.al.	2501.16195	null
2025-01-27	BAG: Body-Aligned 3D Wearable Asset Generation	Zhongjin Luo et.al.	2501.16177	null
2025-01-27	Efficient Portrait Matte Creation With Layer Diffusion and Connectivity Priors	Zhiyuan Lu et.al.	2501.16147	null
2025-01-27	Using Generative Models to Produce Realistic Populations of UK Windstorms	Yee Chun Tsoi et.al.	2501.16110	null
2025-01-27	Improving Tropical Cyclone Forecasting With Video Diffusion Models	Zhibo Ren et.al.	2501.16003	link
2025-01-27	MatCLIP: Light- and Shape-Insensitive Assignment of PBR Material Models	Michael Birsak et.al.	2501.15981	null
2025-01-27	EDSep: An Effective Diffusion-Based Method for Speech Source Separation	Jinwei Dong et.al.	2501.15965	null
2025-01-27	Minimax rates of convergence for the nonparametric estimation of the diffusion coefficient from time-homogeneous SDE paths	Eddy Michel Ella Mintsa et.al.	2501.15933	null
2025-01-27	Generative AI for Lyapunov Optimization Theory in UAV-based Low-Altitude Economy Networking	Zhang Liu et.al.	2501.15928	null
2025-01-27	Minimax rates of convergence of a binary classification procedure for time-homogeneous S.D.E paths	Eddy Michel Ella Mintsa et.al.	2501.15926	null
2025-01-28	Slot-Guided Adaptation of Pre-trained Diffusion Models for Object-Centric Learning and Compositional Generation	Adil Kaan Akan et.al.	2501.15878	null
2025-01-24	Diffusion based Text-to-Music Generationwith Global and Local Text based Conditioning	Jisi Zhang et.al.	2501.14680	null
2025-01-24	Optimal Transport Barycenter via Nonconvex-Concave Minimax Optimization	Kaheon Kim et.al.	2501.14635	null
2025-01-24	Diffusive transport on the real line: semi-contractive gradient flows and their discretization	Daniel Matthes et.al.	2501.14527	null
2025-01-24	Training-Free Style and Content Transfer by Leveraging U-Net Skip Connections in Stable Diffusion 2.*	Ludovica Schaerf et.al.	2501.14524	null
2025-01-24	Advancing data-driven broadband seismic wavefield simulation with multi-conditional diffusion model	Zhengfa Bi et.al.	2501.14348	null
2025-01-24	Stochastic Method for Delayed Neutron Precursors Transport in Liquid Fuel	Mathis Caprais et.al.	2501.14332	null
2025-01-24	Cosmic ray transport and acceleration in an evolving shock landscape	Sophie Aerdker et.al.	2501.14331	null
2025-01-24	CDI: Blind Image Restoration Fidelity Evaluation based on Consistency with Degraded Image	Xiaojun Tang et.al.	2501.14264	null
2025-01-24	TFG-Flow: Training-free Guidance in Multimodal Generative Flow	Haowei Lin et.al.	2501.14216	link
2025-01-24	Fully Guided Neural Schrödinger bridge for Brain MR image synthesis	Hanyeol Yang et.al.	2501.14171	null
2025-01-23	INDIGO+: A Unified INN-Guided Probabilistic Diffusion Algorithm for Blind and Non-Blind Image Restoration	Di You et.al.	2501.14014	null
2025-01-23	IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models	Jiayi Lei et.al.	2501.13920	null
2025-01-23	Improving Video Generation with Human Feedback	Jie Liu et.al.	2501.13918	null
2025-01-23	Unveiling the Power of Noise Priors: Enhancing Diffusion Models for Mobile Traffic Prediction	Zhi Sheng et.al.	2501.13794	null
2025-01-23	An Efficient Diffusion-based Non-Autoregressive Solver for Traveling Salesman Problem	Mingzhao Wang et.al.	2501.13767	link
2025-01-23	A dimensionality reduction technique based on the Gromov-Wasserstein distance	Rafael P. Eufrazio et.al.	2501.13732	null
2025-01-23	Training-Free Consistency Pipeline for Fashion Repose	Potito Aghilar et.al.	2501.13692	null
2025-01-23	Linearization of ergodic McKean SDEs and applications	Grigorios A. Pavliotis et.al.	2501.13655	null
2025-01-23	Duality Theorems and Vector Measures in Optimal Transportation Theory	Shlomi Gover et.al.	2501.13557	null
2025-01-24	One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt	Tao Liu et.al.	2501.13554	link
2025-01-23	Diffusion-based Perceptual Neural Video Compression with Temporal Diffusion Information Reuse	Wenzhuo Ma et.al.	2501.13528	null
2025-01-23	LDR-Net: A Novel Framework for AI-generated Image Detection via Localized Discrepancy Representation	JiaXin Chen et.al.	2501.13475	null
2025-01-23	Zero-Shot Trajectory Planning for Signal Temporal Logic Tasks	Ruijia Liu et.al.	2501.13457	null
2025-01-23	Bridging The Multi-Modality Gaps of Audio, Visual and Linguistic for Speech Enhancement	Meng-Ping Lin et.al.	2501.13375	null
2025-01-23	MSF: Efficient Diffusion Model Via Multi-Scale Latent Factorize	Haohang Xu et.al.	2501.13349	null
2025-01-23	One Fits All: General Mobility Trajectory Modeling via Masked Conditional Diffusion	Qingyue Long et.al.	2501.13347	null
2025-01-22	Accelerate High-Quality Diffusion Models with Inner Loop Feedback	Matthew Gwilliam et.al.	2501.13107	null
2025-01-22	Robust Representation Consistency Model via Contrastive Denoising	Jiachen Lei et.al.	2501.13094	link
2025-01-22	Orchid: Image Latent Diffusion for Joint Appearance and Geometry Generation	Akshay Krishnan et.al.	2501.13087	null
2025-01-22	Robust Body Composition Analysis by Generating 3D CT Volumes from Limited 2D Slices	Lianrui Zuo et.al.	2501.13071	null
2025-01-22	Beyond the Lungs: Extending the Field of View in Chest CT with Latent Diffusion Models	Lianrui Zuo et.al.	2501.13068	null
2025-01-22	Low-dimensional adaptation of diffusion models: Convergence in total variation	Jiadong Liang et.al.	2501.12982	null
2025-01-22	LiT: Delving into a Simplified Linear Diffusion Transformer for Image Generation	Jiahao Wang et.al.	2501.12976	null
2025-01-22	3D Object Manipulation in a Single Image using Generative Models	Ruisi Zhao et.al.	2501.12935	null
2025-01-22	CrossDiff: Diffusion Probabilistic Model With Cross-conditional Encoder-Decoder for Crack Segmentation	Xianglong Shi et.al.	2501.12860	null
2025-01-22	AMM-Diff: Adaptive Multi-Modality Diffusion Network for Missing Modality Imputation	Aghiles Kebaili et.al.	2501.12840	null
2025-01-22	Certified Guidance for Planning with Deep Generative Models	Francesco Giacomarra et.al.	2501.12815	null
2025-01-22	Explicit Eigenvalue Regularization Improves Sharpness-Aware Minimization	Haocheng Luo et.al.	2501.12666	link
2025-01-22	T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation	Lijun Li et.al.	2501.12612	link
2025-01-22	Image Motion Blur Removal in the Temporal Dimension with Video Diffusion Models	Wang Pang et.al.	2501.12604	null
2025-01-22	Improved Deep Learning Methods for Large-Scale Dynamic Portfolio Choice	Jeonggyu Huh et.al.	2501.12600	null
2025-01-21	Towards Affordance-Aware Articulation Synthesis for Rigged Objects	Yu-Chu Yu et.al.	2501.12393	null
2025-01-22	GPS as a Control Signal for Image Generation	Chao Feng et.al.	2501.12390	null
2025-01-21	Audio Texture Manipulation by Exemplar-Based Analogy	Kan Jen Cheng et.al.	2501.12385	null
2025-01-21	DiffDoctor: Diagnosing Image Diffusion Models Before Treating	Yiyang Wang et.al.	2501.12382	null
2025-01-21	RALAD: Bridging the Real-to-Sim Domain Gap in Autonomous Driving with Retrieval-Augmented Learning	Jiacheng Zuo et.al.	2501.12296	link
2025-01-21	VipDiff: Towards Coherent and Diverse Video Inpainting via Training-free Denoising Diffusion Models	Chaohao Xie et.al.	2501.12267	null
2025-01-21	Joint Reconstruction and Motion Estimation in Sparse-View 4DCT Using Diffusion Models within a Blind Inverse Problem Framework	Antoine De Paepe et.al.	2501.12249	null
2025-01-21	TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space	Daniel Garibi et.al.	2501.12224	null
2025-01-22	Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation	Zibo Zhao et.al.	2501.12202	link
2025-01-21	An Optimal Transport approach to arbitrage correction: Application to volatility Stress-Tests	Marius Chevallier et.al.	2501.12195	null
2025-01-21	ComposeAnyone: Controllable Layout-to-Human Generation with Decoupled Multimodal Conditions	Shiyue Zhang et.al.	2501.12173	link
2025-01-21	A quadratic BSDE approach to normalization for the finite volume 2D sine-Gordon model in the finite ultraviolet regime	Shanjian Tang et.al.	2501.12172	null
2025-01-21	A note on the relations between mixture models, maximum-likelihood and entropic optimal transport	Titouan Vayer et.al.	2501.12005	null
2025-01-21	Growth model with externalities for energetic transition via MFG with common external variable	Pierre Lavigne et.al.	2501.11988	null
2025-01-21	A trajectorial approach to the gradient flow of McKean-Vlasov SDEs with mobility	Zhenxin Liu et.al.	2501.11913	null
2025-01-17	Principled model selection for stochastic dynamics	Andonis Gerardos et.al.	2501.10339	null
2025-01-17	DiffStereo: High-Frequency Aware Diffusion Model for Stereo Image Restoration	Huiyun Cao et.al.	2501.10325	null
2025-01-17	An optimal transport based embedding to quantify the distance between playing styles in collective sports	Ali Baouan et.al.	2501.10299	null
2025-01-17	Convex Physics Informed Neural Networks for the Monge-Ampère Optimal Transport Problem	Alexandre Caboussat et.al.	2501.10162	null
2025-01-17	DiffVSR: Enhancing Real-World Video Super-Resolution with Diffusion Models for Advanced Visual Quality and Temporal Consistency	Xiaohui Li et.al.	2501.10110	null
2025-01-17	Conditional Latent Diffusion-Based Speech Enhancement Via Dual Context Learning	Shengkui Zhao et.al.	2501.10052	link
2025-01-17	On Carathéodory approximate scheme for a class of one-dimensional doubly perturbed diffusion processes	R. Belfadli et.al.	2501.10036	null
2025-01-17	DiffuEraser: A Diffusion Model for Video Inpainting	Xiaowen Li et.al.	2501.10018	link
2025-01-17	Enhancing Crash Frequency Modeling Based on Augmented Multi-Type Data by Hybrid VAE-Diffusion-Based Generative Neural Networks	Junlan Chen et.al.	2501.10017	null
2025-01-17	Physics-informed DeepCT: Sinogram Wavelet Decomposition Meets Masked Diffusion	Zekun Zhou et.al.	2501.09935	link
2025-01-16	Geometry-Preserving Encoder/Decoder in Latent Generative Models	Wonjun Lee et.al.	2501.09876	null
2025-01-16	CrossModalityDiffusion: Multi-Modal Novel View Synthesis with Unified Intermediate Representation	Alex Berian et.al.	2501.09838	link
2025-01-16	PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery	Shristi Das Biswas et.al.	2501.09826	link
2025-01-16	Lossy Compression with Pretrained Diffusion Models	Jeremy Vonderfecht et.al.	2501.09815	link
2025-01-16	SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces	Sumit Chaturvedi et.al.	2501.09756	null
2025-01-16	Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps	Nanye Ma et.al.	2501.09732	null
2025-01-16	Reward-Guided Controlled Generation for Inference-Time Alignment in Diffusion Models: Tutorial and Review	Masatoshi Uehara et.al.	2501.09685	null
2025-01-16	Pruning for Sparse Diffusion Models based on Gradient Flow	Ben Wan et.al.	2501.09464	null
2025-01-16	CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation	Hwan Heo et.al.	2501.09433	link
2025-01-16	Contract-Inspired Contest Theory for Controllable Image Generation in Mobile Edge Metaverse	Guangyuan Liu et.al.	2501.09391	null
2025-01-16	UVRM: A Scalable 3D Reconstruction Model from Unposed Videos	Shiu-hong Kao et.al.	2501.09347	null
2025-01-16	Domain-conditioned and Temporal-guided Diffusion Modeling for Accelerated Dynamic MRI Reconstruction	Liping Zhang et.al.	2501.09305	null
2025-01-16	LAVCap: LLM-based Audio-Visual Captioning using Optimal Transport	Kyeongha Rho et.al.	2501.09291	link
2025-01-16	Text Semantics to Flexible Design: A Residential Layout Generation Method Based on Stable Diffusion Model	Zijin Qiu et.al.	2501.09279	null
2025-01-16	PATCHEDSERVE: A Patch Management Framework for SLO-Optimized Hybrid Resolution Diffusion Serving	Desen Sun et.al.	2501.09253	null
2025-01-15	Grounding Text-To-Image Diffusion Models For Controlled High-Quality Image Generation	Ahmad Süleyman et.al.	2501.09194	null
2025-01-15	Existence of Periodic and Stationary Solutions to Distribution-Dependent SDEs	Wei Sun et.al.	2501.09176	null
2025-01-15	Generative diffusion model with inverse renormalization group flows	Kanta Masuki et.al.	2501.09064	link
2025-01-15	SimGen: A Diffusion-Based Framework for Simultaneous Surgical Image and Segmentation Mask Generation	Aditya Bhat et.al.	2501.09008	null
2025-01-15	RepVideo: Rethinking Cross-Layer Representation for Video Generation	Chenyang Si et.al.	2501.08994	null
2025-01-15	Boosting Diffusion Guidance via Learning Degradation-Aware Models for Blind Super Resolution	Shao-Hao Lu et.al.	2501.08819	link
2025-01-15	Transformed Low-rank Adaptation via Tensor Decomposition and Its Applications to Text-to-image Models	Zerui Tao et.al.	2501.08727	null
2025-01-15	FlexiClip: Locality-Preserving Free-Form Character Animation	Anant Khandelwal et.al.	2501.08676	null
2025-01-15	TimeFlow: Longitudinal Brain Image Registration and Aging Progression Analysis	Bailiang Jian et.al.	2501.08667	null
2025-01-15	Product of Gaussian Mixture Diffusion Model for non-linear MRI Inversion	Laurenz Nagler et.al.	2501.08662	null
2025-01-15	Joint Learning of Depth and Appearance for Portrait Image Animation	Xinya Ji et.al.	2501.08649	null
2025-01-15	Watermarking in Diffusion Model: Gaussian Shading with Exact Diffusion Inversion via Coupled Transformations (EDICT)	Krishna Panthi et.al.	2501.08604	null
2025-01-15	Confinement-Driven Acceleration of First-Passage Rates	Won Kyu Kim et.al.	2501.08571	null
2025-01-15	DynamicFace: High-Quality and Consistent Video Face Swapping using Composable 3D Facial Priors	Runqi Wang et.al.	2501.08553	null
2025-01-14	Bayesian Sphere-on-Sphere Regression with Optimal Transport Maps	Tin Lok James Ng et.al.	2501.08492	null
2025-01-14	Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models	Weichen Fan et.al.	2501.08453	null
2025-01-14	DAViD: Modeling Dynamic Affordance of 3D Objects using Pre-trained Video Diffusion Models	Hyeonwoo Kim et.al.	2501.08333	null
2025-01-14	MangaNinja: Line Art Colorization with Precise Reference Following	Zhiheng Liu et.al.	2501.08332	null
2025-01-14	Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise	Ryan Burgert et.al.	2501.08331	link
2025-01-14	GameFactory: Creating New Games with Generative Interactive Videos	Jiwen Yu et.al.	2501.08325	null
2025-01-14	Diffusion Adversarial Post-Training for One-Step Video Generation	Shanchuan Lin et.al.	2501.08316	null
2025-01-14	LayerAnimate: Layer-specific Control for Animation	Yuxue Yang et.al.	2501.08295	null
2025-01-14	Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints	Jonathan Nöther et.al.	2501.08246	null
2025-01-14	FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors	Yabo Zhang et.al.	2501.08225	link
2025-01-14	D $^2$ -DPM: Dual Denoising for Quantized Diffusion Probabilistic Models	Qian Zeng et.al.	2501.08180	link
2025-01-14	Wasserstein distances and divergences of order $p$ by quantum channels	Gergely Bunth et.al.	2501.08066	null
2025-01-14	Decision Transformers for RIS-Assisted Systems with Diffusion Model-Based Channel Acquisition	Jie Zhang et.al.	2501.08007	null
2025-01-14	GDiffRetro: Retrosynthesis Prediction with Dual Graph Enhanced Molecular Representation and Diffusion Generation	Shengyin Sun et.al.	2501.08001	link
2025-01-14	VENOM: Text-driven Unrestricted Adversarial Example Generation with Diffusion Models	Hui Kuurila-Zhang et.al.	2501.07922	link
2025-01-14	Bridge-SR: Schrödinger Bridge for Efficient SR	Chang Li et.al.	2501.07897	null
2025-01-14	Strong existence, pathwise uniqueness and chains of collisions in infinite Brownian particle systems	Sayan Banerjee et.al.	2501.07840	null
2025-01-13	Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss	Xinyu Zhang et.al.	2501.07563	null
2025-01-13	Uniform large-scale $\varepsilon$ -regularity for entropic optimal transport	Rishabh S. Gvalani et.al.	2501.07539	null
2025-01-13	Confident Pseudo-labeled Diffusion Augmentation for Canine Cardiomegaly Detection	Shiman Zhang et.al.	2501.07533	link
2025-01-13	IP-FaceDiff: Identity-Preserving Facial Video Editing with Diffusion	Tharun Anand et.al.	2501.07530	null
2025-01-13	PrecipDiff: Leveraging image diffusion models to enhance satellite-based precipitation observations	Ting-Yu Dai et.al.	2501.07447	null
2025-01-14	Synthesis and Analysis of Data as Probability Measures with Entropy-Regularized Optimal Transport	Brendan Mallery et.al.	2501.07446	link
2025-01-13	Diff-Ensembler: Learning to Ensemble 2D Diffusion Models for Volume-to-Volume Medical Image Translation	Xiyue Zhu et.al.	2501.07430	null
2025-01-13	OCORD: Open-Campus Object Removal Dataset	Shuo Zhang et.al.	2501.07397	null
2025-01-13	Bigger Isn’t Always Better: Towards a General Prior for Medical Image Reconstruction	Lukas Glaszner et.al.	2501.07376	link
2025-01-13	Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion	Li Liang et.al.	2501.07260	link
2025-01-13	Synthetic notions of Ricci flow for metric measure spaces	Matthias Erbar et.al.	2501.07175	null
2025-01-13	D3MES: Diffusion Transformer with multihead equivariant self-attention for 3D molecule generation	Zhejun Zhang et.al.	2501.07077	link
2025-01-13	Relativistic model of spontaneous wave-function localization induced by nonHermitian colored noise	Pei Wang et.al.	2501.07050	null
2025-01-13	Erasing Noise in Signal Detection with Diffusion Model: From Theory to Application	Xiucheng Wang et.al.	2501.07030	null
2025-01-13	A Multi-Modal Deep Learning Framework for Pan-Cancer Prognosis	Binyu Zhang et.al.	2501.07016	link
2025-01-10	From discrete-time policies to continuous-time diffusion samplers: Asymptotic equivalences and faster training	Julius Berner et.al.	2501.06148	link
2025-01-10	Averaged Adam accelerates stochastic optimization in the training of deep neural network approximations for partial differential equation and optimal control problems	Steffen Dereich et.al.	2501.06081	link
2025-01-10	Nonisotropic Gaussian Diffusion for Realistic 3D Human Motion Prediction	Cecilia Curreli et.al.	2501.06035	null
2025-01-10	CamCtrl3D: Single-Image Scene Exploration with Precise 3D Camera Control	Stefan Popov et.al.	2501.06006	null
2025-01-10	Estimation and Restoration of Unknown Nonlinear Distortion using Diffusion	Michal Švento et.al.	2501.05959	link
2025-01-10	Symmetry Analysis of Semi-Linear Partial Differential Equations and Forward Backward Stochastic Differential Equations	Anas Ouknine et.al.	2501.05947	null
2025-01-10	Beyond Flat Text: Dual Self-inherited Guidance for Visual Text Generation	Minxing Luo et.al.	2501.05892	null
2025-01-10	Poetry in Pixels: Prompt Tuning for Poem Image Generation via Diffusion Models	Sofia Jamil et.al.	2501.05839	link
2025-01-10	Diffusion Models for Smarter UAVs: Decision-Making and Modeling	Yousef Emami et.al.	2501.05819	null
2025-01-10	Alignment without Over-optimization: Training-Free Solution for Diffusion Models	Sunwoo Kim et.al.	2501.05803	link
2025-01-10	Conditional Diffusion Model for Electrical Impedance Tomography	Duanpeng Shi et.al.	2501.05769	null
2025-01-10	StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation	Shangjin Zhai et.al.	2501.05763	null
2025-01-10	UAV Swarm-enabled Collaborative Post-disaster Communications in Low Altitude Economy via a Two-stage Optimization Approach	Xiaoya Zheng et.al.	2501.05742	null
2025-01-10	EXION: Exploiting Inter- and Intra-Iteration Output Sparsity for Diffusion Models	Jaehoon Heo et.al.	2501.05680	null
2025-01-10	Network Diffuser for Placing-Scheduling Service Function Chains with Inverse Demonstration	Zuyuan Zhang et.al.	2501.05673	null
2025-01-09	Decentralized Diffusion Models	David McAllister et.al.	2501.05450	null
2025-01-09	Progressive Growing of Video Tokenizers for Highly Compressed Latent Spaces	Aniruddha Mahapatra et.al.	2501.05442	null
2025-01-09	The GAN is dead; long live the GAN! A Modern GAN Baseline	Yiwen Huang et.al.	2501.05441	link
2025-01-09	Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation	Xuyi Meng et.al.	2501.05427	null
2025-01-09	TimeDP: Learning to Generate Multi-Domain Time Series with Domain Prompts	Yu-Hao Huang et.al.	2501.05403	link
2025-01-09	Accelerated Diffusion Models via Speculative Sampling	Valentin De Bortoli et.al.	2501.05370	null
2025-01-09	CROPS: Model-Agnostic Training-Free Framework for Safe Image Synthesis with Latent Diffusion Models	Junha Park et.al.	2501.05359	null
2025-01-09	Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes	Ludwic Leonard et.al.	2501.05226	link
2025-01-10	FaceMe: Robust Blind Face Restoration with Personal Identification	Siyu Liu et.al.	2501.05177	null
2025-01-09	DiffAttack: Diffusion-based Timbre-reserved Adversarial Attack in Speaker Identification	Qing Wang et.al.	2501.05127	null
2025-01-09	EquiBoost: An Equivariant Boosting Approach to Molecular Conformation Generation	Yixuan Yang et.al.	2501.05109	link
2025-01-09	Recovery of activation propagation and self-sustained oscillation abilities in stroke brain networks	Yingpeng Liu et.al.	2501.05099	null
2025-01-10	ResPanDiff: Diffusion Model for Pansharpening by Inferring Residual Inference	Shiqi Cao et.al.	2501.05091	null
2025-01-09	D3RM: A Discrete Denoising Diffusion Refinement Model for Piano Transcription	Hounsu Kim et.al.	2501.05068	link
2025-01-09	On a reaction-diffusion virus model with general boundary conditions in heterogeneous environments	Mingxin Wang et.al.	2501.04992	null
2025-01-08	EditAR: Unified Conditional Generation with Autoregressive Models	Jiteng Mu et.al.	2501.04699	null
2025-01-08	ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning	Yuzhou Huang et.al.	2501.04698	null
2025-01-08	SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images	Zixuan Huang et.al.	2501.04689	null
2025-01-08	Quadratic-form Optimal Transport	Ruodu Wang et.al.	2501.04658	null
2025-01-08	A Statistical Theory of Contrastive Pre-training and Multimodal Generative AI	Kazusato Oko et.al.	2501.04641	link
2025-01-08	Disentangled Clothed Avatar Generation with Layered Representation	Weitian Zhang et.al.	2501.04631	null
2025-01-09	MedCoDi-M: A Multi-Prompt Foundation Model for Multimodal Medical Data Generation	Daniele Molino et.al.	2501.04614	null
2025-01-08	Enhancing Low-Cost Video Editing with Lightweight Adaptors and Temporal-Aware Inversion	Yangfan He et.al.	2501.04606	link
2025-01-08	ZSVC: Zero-shot Style Voice Conversion with Disentangled Latent Diffusion Models and Adversarial Training	Xinfa Zhu et.al.	2501.04416	null
2025-01-08	Edit as You See: Image-guided Video Editing via Masked Motion Modeling	Zhi-Lin Huang et.al.	2501.04325	null
2025-01-08	Local convergence near equilibria for distribution dependent SDEs	Shao-Qin Zhang et.al.	2501.04313	null
2025-01-08	DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models	Hyogon Ryu et.al.	2501.04304	link
2025-01-09	ContextMRI: Enhancing Compressed Sensing MRI through Metadata Conditioning	Hyungjin Chung et.al.	2501.04284	link
2025-01-08	DrawSpeech: Expressive Speech Synthesis Using Prosodic Sketches as Control Conditions	Weidong Chen et.al.	2501.04256	null
2025-01-07	NeuralSVG: An Implicit Representation for Text-to-Vector Generation	Sagi Polaczek et.al.	2501.03992	null
2025-01-07	Stabilising effect of generic anomalous diffusion independent of the Rayleigh number	Antonio Barletta et.al.	2501.03990	null
2025-01-07	A precise asymptotic analysis of learning diffusion models: theory and insights	Hugo Cui et.al.	2501.03937	link
2025-01-07	Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers	Yuechen Zhang et.al.	2501.03931	link
2025-01-07	A regularized transportation cost stemming from entropic approximation	Camilla Brizzi et.al.	2501.03906	null
2025-01-07	Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control	Zekai Gu et.al.	2501.03847	link
2025-01-07	Impact of diffusion mechanisms on persistence and spreading	Nathanaël Boutillon et.al.	2501.03816	null
2025-01-07	Mixing by Internal Gravity Waves in Stars: Assessing Numerical Simulations Against Theory	Jack Morton et.al.	2501.03796	null
2025-01-07	Stochastic volatility model with long memory for water quantity-quality dynamics	Hidekazu Yoshioka et.al.	2501.03725	null
2025-01-07	Exploring Molecule Generation Using Latent Space Graph Diffusion	Prashanth Pombala et.al.	2501.03696	link
2025-01-07	MC-VTON: Minimal Control Virtual Try-On Diffusion Transformer	Junsheng Luan et.al.	2501.03630	null
2025-01-07	FgC2F-UDiff: Frequency-guided and Coarse-to-fine Unified Diffusion Model for Multi-modality Missing MRI Synthesis	Xiaojiao Xiao et.al.	2501.03526	link
2025-01-07	Modeling Cell Type Developmental Trajectory using Multinomial Unbalanced Optimal Transport	Junhao Zhu et.al.	2501.03501	null
2025-01-07	SceneBooth: Diffusion-based Framework for Subject-preserved Text-to-Image Generation	Shang Chai et.al.	2501.03490	null
2025-01-06	A Self-supervised Diffusion Bridge for MRI Reconstruction	Harry Gao et.al.	2501.03430	null
2025-01-06	MObI: Multimodal Object Inpainting Using Diffusion Models	Alexandru Buburuzan et.al.	2501.03173	null
2025-01-06	Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches	Alhassan Mumuni et.al.	2501.03151	null
2025-01-06	DDRM-PR: Fourier Phase Retrieval using Denoising Diffusion Restoration Models	Mehmet Onurcan Kaya et.al.	2501.03030	link
2025-01-06	Convexity in ReLU Neural Networks: beyond ICNNs?	Anne Gagneux et.al.	2501.03017	null
2025-01-06	STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution	Rui Xie et.al.	2501.02976	null
2025-01-07	SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild	Jiawei Liu et.al.	2501.02962	null
2025-01-06	Deep Generative Model-Aided Power System Dynamic State Estimation and Reconstruction with Unknown Control Inputs or Data Distributions	Jianhua Pei et.al.	2501.02928	null
2025-01-06	Pointmap-Conditioned Diffusion for Consistent Novel View Synthesis	Thang-Anh-Quan Nguyen et.al.	2501.02913	null
2025-01-06	Conditional Mutual Information Based Diffusion Posterior Sampling for Solving Inverse Problems	Shayan Mohajer Hamidi et.al.	2501.02880	null
2025-01-06	Towards HRTF Personalization using Denoising Diffusion Models	Juan Camilo Albarracín Sánchez et.al.	2501.02871	null
2025-01-07	Diff-Lung: Diffusion-Based Texture Synthesis for Enhanced Pathological Tissue Segmentation in Lung CT Scans	Rezkellah Noureddine Khiati et.al.	2501.02867	null
2025-01-06	InpDiffusion: Image Inpainting Localization via Conditional Diffusion Models	Kai Wang et.al.	2501.02816	null
2025-01-06	Fairness Through Matching	Kunwoong Kim et.al.	2501.02793	link
2025-01-06	Brick-Diffusion: Generating Long Videos with Brick-to-Wall Denoising	Yunlong Yuan et.al.	2501.02741	null
2025-01-06	Multilevel Semantic-Aware Model for AI-Generated Video Quality Assessment	Jiaze Li et.al.	2501.02706	null
2025-01-03	Bridging Classification and Segmentation in Osteosarcoma Assessment via Foundation and Discrete Diffusion Models	Manh Duong Nguyen et.al.	2501.01932	link
2025-01-03	Nonparametric estimation of a factorizable density using diffusion models	Hyeok Kyu Kwon et.al.	2501.01783	null
2025-01-03	Adverse Weather Conditions Augmentation of LiDAR Scenes with Latent Diffusion Models	Andrea Matteazzi et.al.	2501.01761	null
2025-01-03	ACE: Anti-Editing Concept Erasure in Text-to-Image Models	Zihao Wang et.al.	2501.01633	link
2025-01-03	Multivariate Time Series Anomaly Detection using DiffGAN Model	Guangqiang Wu et.al.	2501.01591	link
2025-01-02	Denoising Diffused Embeddings: a Generative Approach for Hypergraphs	Shihao Wu et.al.	2501.01541	null
2025-01-02	Object-level Visual Prompts for Compositional Image Generation	Gaurav Parmar et.al.	2501.01424	null
2025-01-02	Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models	Jingfeng Yao et.al.	2501.01423	link
2025-01-02	Test-time Controllable Image Generation by Explicit Spatial Constraint Enforcement	Z. Zhang et.al.	2501.01368	null
2025-01-03	Conditional Consistency Guided Image Translation and Enhancement	Amil Bhagat et.al.	2501.01223	link
2025-01-02	Semantics-Guided Diffusion for Deep Joint Source-Channel Coding in Wireless Image Transmission	Maojun Zhang et.al.	2501.01138	link
2025-01-02	EliGen: Entity-Level Controlled Image Generation with Regional Attention	Hong Zhang et.al.	2501.01097	link
2025-01-02	DiffCL: A Diffusion-Based Contrastive Learning Framework with Semantic Alignment for Multimodal Recommendations	Qiya Song et.al.	2501.01066	null
2025-01-02	Optimizing Noise Schedules of Generative Models in High Dimensionss	Santiago Aranguri et.al.	2501.00988	null
2025-01-01	Linear-Quadratic Optimal Control for Mean-Field Stochastic Differential Equations in Infinite-Horizon with Regime Switching	Hongwei Mei et.al.	2501.00981	null
2025-01-01	Cached Adaptive Token Merging: Dynamic Token Reduction and Redundant Computation Elimination in Diffusion Model	Omid Saghatchian et.al.	2501.00946	link
2025-01-01	Diffusion Prism: Enhancing Diversity and Morphology Consistency in Mask-to-Image Diffusion	Hao Wang et.al.	2501.00944	null
2025-01-01	A Novel Diffusion Model for Pairwise Geoscience Data Generation with Unbalanced Training Dataset	Junhuan Yang et.al.	2501.00941	null
2025-01-01	Hierarchical Vision-Language Alignment for Text-to-Image Generation via Diffusion Models	Emily Johnson et.al.	2501.00917	null
2025-01-01	Diffusion Policies for Generative Modeling of Spacecraft Trajectories	Julia Briden et.al.	2501.00915	null
2025-01-01	Population Aware Diffusion for Time Series Generation	Yang Li et.al.	2501.00910	link
2024-12-30	Well-posedness of quadratic RBSDEs and BSDEs with one-sided growth restrictions	Shiqiu Zheng et.al.	2412.21172	null
2025-01-02	Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation	Yuanbo Yang et.al.	2412.21117	null
2024-12-30	Quantum Diffusion Model for Quark and Gluon Jet Generation	Mariia Baidachna et.al.	2412.21082	link
2025-01-02	Edicho: Consistent Image Editing in the Wild	Qingyan Bai et.al.	2412.21079	link
2024-12-30	Varformer: Adapting VAR’s Generative Prior for Image Restoration	Siyang Wang et.al.	2412.21063	link
2024-12-30	E2EDiff: Direct Mapping from Noise to Data for Enhanced Diffusion Models	Zhiyu Tan et.al.	2412.21044	null
2024-12-30	Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration	Wanglong Lu et.al.	2412.21042	link
2024-12-30	A positivity-preserving truncated Euler–Maruyama method for stochastic differential equations with positive solutions: multi-dimensional case	Xingwei Hu et.al.	2412.20988	null
2024-12-30	AlignAb: Pareto-Optimal Energy Alignment for Designing Nature-Like Antibodies	Yibo Wen et.al.	2412.20984	null
2024-12-30	EdSr: A Novel End-to-End Approach for State-Space Sampling in Molecular Dynamics Simulation	Hai-Ming Cao et.al.	2412.20978	link
2024-12-30	Influence Maximization in Temporal Networks with Persistent and Reactive Behaviors	Aaqib Zahoor et.al.	2412.20936	null
2024-12-30	Optimal Diffusion Processes	Saber Jafarizadeh et.al.	2412.20934	null
2024-12-30	DDIM sampling for Generative AIBIM, a faster intelligent structural design framework	Zhili He et.al.	2412.20899	null
2024-12-30	VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control	Shaojin Wu et.al.	2412.20800	link
2024-12-30	A randomisation method for mean-field control problems with common noise	Robert Denkert et.al.	2412.20782	null
2024-12-27	VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models	Tao Wu et.al.	2412.19645	null
2024-12-27	StyleRWKV: High-Quality and High-Efficiency Style Transfer with RWKV-like Architecture	Miaomiao Dai et.al.	2412.19535	null
2024-12-27	Lévy Score Function and Score-Based Particle Algorithm for Nonlinear Lévy–Fokker–Planck Equations	Yuanfei Huang et.al.	2412.19520	link
2024-12-27	RobotDiffuse: Motion Planning for Redundant Manipulator based on Diffusion Model	Xiaohan Zhang et.al.	2412.19500	link
2024-12-27	RAIN: Real-time Animation of Infinite Video Stream	Zhilei Shu et.al.	2412.19489	null
2024-12-27	DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes	Yiyuan Liang et.al.	2412.19458	link
2024-12-27	Multi-scale Latent Point Consistency Models for 3D Shape Generation	Bi’an Du et.al.	2412.19413	null
2024-12-27	A Generalized Einstein Relation for Markovian Friction Coefficients from Molecular Trajectories	J. M. Hall et.al.	2412.19398	null
2024-12-26	6Diffusion: IPv6 Target Generation Using a Diffusion Model with Global-Local Attention Mechanisms for Internet-wide IPv6 Scanning	Nabo He et.al.	2412.19243	null
2024-12-26	Mask Approximation Net: Merging Feature Extraction and Distribution Learning for Remote Sensing Change Captioning	Dongwei Sun et.al.	2412.19179	null
2024-12-26	Convergence rate of Euler-Maruyama scheme for McKean-Vlasov SDEs with density-dependent drift	Anh-Dung Le et.al.	2412.19121	null
2024-12-26	Discrete vs. Continuous Trade-offs for Generative Models	Jathin Korrapati et.al.	2412.19114	null
2024-12-26	Improving Generative Pre-Training: An In-depth Study of Masked Image Modeling and Denoising Models	Hyesong Choi et.al.	2412.19104	null
2024-12-26	Constrained stochastic linear quadratic control under regime switching with controlled jump size	Xiaomin Shi et.al.	2412.19100	null
2024-12-26	Mask Factory: Towards High-quality Synthetic Data Generation for Dichotomous Image Segmentation	Haotian Qian et.al.	2412.19080	null
2024-12-24	PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models	Minghao Chen et.al.	2412.18608	null
2024-12-24	DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers	Yuntao Chen et.al.	2412.18607	null
2024-12-24	Explaining in Diffusion: Explaining a Classifier Through Hierarchical Semantics with Text-to-Image Diffusion Models	Tahira Kazimi et.al.	2412.18604	null
2024-12-24	DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation	Minghong Cai et.al.	2412.18597	link
2024-12-24	LatentCRF: Continuous CRF for Efficient Latent Diffusion	Kanchana Ranasinghe et.al.	2412.18596	null
2024-12-24	Resolution-Robust 3D MRI Reconstruction with 2D Diffusion Priors: Diverse-Resolution Training Outperforms Interpolation	Anselm Krainovic et.al.	2412.18584	null
2024-12-24	3DEnhancer: Consistent Multi-View Diffusion for 3D Enhancement	Yihang Luo et.al.	2412.18565	null
2024-12-24	Dynamic Mean-Variance Asset Allocation in General Incomplete Markets A Nonlocal BSDE-based Feedback Control Approach	Qian Lei et.al.	2412.18498	null
2024-12-24	Gaussian entropic optimal transport: Schrödinger bridges and the Sinkhorn algorithm	O. Deniz Akyildiz et.al.	2412.18432	null
2024-12-24	Fashionability-Enhancing Outfit Image Editing with Conditional Diffusion Models	Qice Qin et.al.	2412.18421	null
2024-12-24	Discovery of 2D Materials via Symmetry-Constrained Diffusion Model	Shihang Xu et.al.	2412.18414	null
2024-12-24	FameBias: Embedding Manipulation Bias Attack in Text-to-Image Models	Jaechul Roh et.al.	2412.18302	null
2024-12-24	GDM4MMIMO: Generative Diffusion Models for Massive MIMO Communications	Zhenzhou Jin et.al.	2412.18281	null
2024-12-24	Schödinger Bridge Type Diffusion Models as an Extension of Variational Autoencoders	Kentaro Kaba et.al.	2412.18237	null
2024-12-24	Expand VSR Benchmark for VLLM to Expertize in Spatial Rules	Peijin Xie et.al.	2412.18224	link
2024-12-23	FaceLift: Single Image to 3D Head with View Generation and GS-LRM	Weijie Lyu et.al.	2412.17812	null
2024-12-23	PepTune: De Novo Generation of Therapeutic Peptides with Multi-Objective-Guided Discrete Diffusion	Sophia Tang et.al.	2412.17780	null
2024-12-23	Ergodic Network Stochastic Differential Equations	Francesco Iafrate et.al.	2412.17779	null
2024-12-23	The Superposition of Diffusion Models Using the Itô Density Estimator	Marta Skreta et.al.	2412.17762	null
2024-12-23	Broker-Trader Partial Information Nash Equilibria	Xuchen Wu et.al.	2412.17712	null
2024-12-23	A Bias-Free Training Paradigm for More General AI-generated Image Detection	Fabrizio Guillaro et.al.	2412.17671	null
2024-12-23	Benchmarking Generative AI Models for Deep Learning Test Input Generation	Maryam et.al.	2412.17652	link
2024-12-23	DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder	Ente Lin et.al.	2412.17644	null
2024-12-23	Retention Score: Quantifying Jailbreak Risks for Vision Language Models	Zaitang Li et.al.	2412.17544	null
2024-12-23	DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak	Hao Wang et.al.	2412.17522	null
2024-12-23	Improving the Noise Estimation of Latent Neural Stochastic Differential Equations	Linus Heck et.al.	2412.17499	null
2024-12-23	Heterogeneous carrying capacities and global extinction in metapopulations	Jakub Hesoun et.al.	2412.17461	null
2024-12-23	AeroDiT: Diffusion Transformers for Reynolds-Averaged Navier-Stokes Simulations of Airfoil Flows	Hui Xiang et.al.	2412.17394	null
2024-12-23	Applications of optimal transport to Dyson Brownian Motions and beyond	Xuan Wu et.al.	2412.17389	null
2024-12-24	Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement	Hyeonjin Kim et.al.	2412.17387	link
2024-12-20	Stochastic Analysis of Entanglement-assisted Quantum Communication Channels	Karim S. Elsayed et.al.	2412.16157	null
2024-12-20	Personalized Representation from Personalized Generation	Shobhita Sundaram et.al.	2412.16156	link
2024-12-20	Predicting human cooperation: sensitizing drift-diffusion model to interaction and external stimuli	Lucila G. Alvarez-Zuzek et.al.	2412.16121	null
2024-12-20	Differentially Private Federated Learning of Diffusion Models for Synthetic Tabular Data Generation	Timur Sattarov et.al.	2412.16083	null
2024-12-20	Examining Entropic Unbalanced Optimal Transport and Sinkhorn Divergences for Spatial Forecast Verification	Jacob J. M. Francis et.al.	2412.16063	null
2024-12-20	Label-Efficient Data Augmentation with Video Diffusion Models for Guidewire Segmentation in Cardiac Fluoroscopy	Shaoyan Pan et.al.	2412.16050	null
2024-12-20	SafeCFG: Redirecting Harmful Classifier-Free Guidance for Safe Generation	Jiadong Pan et.al.	2412.16039	null
2024-12-20	Sensitivity of functionals of McKean-Vlasov SDE’s with respect to the initial distribution	Filippo de Feo et.al.	2412.15906	null
2024-12-20	On Robust Cross Domain Alignment	Anish Chakrabarty et.al.	2412.15861	null
2024-12-20	Semi-Supervised Adaptation of Diffusion Models for Handwritten Text Generation	Kai Brandenbusch et.al.	2412.15853	null
2024-12-20	Electromagnetic particle-in-cell modeling of an electron cyclotron resonance plasma discharge in hydrogen	D. Eremin et.al.	2412.15802	null
2024-12-20	Diffusion-Based Conditional Image Editing through Optimized Inference with Guidance	Hyunsoo Lee et.al.	2412.15798	null
2024-12-20	Learning Group Interactions and Semantic Intentions for Multi-Object Trajectory Prediction	Mengshi Qi et.al.	2412.15673	link
2024-12-20	BS-LDM: Effective Bone Suppression in High-Resolution Chest X-Ray Images with Conditional Latent Diffusion Models	Yifei Sun et.al.	2412.15670	link
2024-12-20	SCENIC: Scene-aware Semantic Navigation with Instruction-guided Control	Xiaohan Zhang et.al.	2412.15664	null
2024-12-19	LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis	Hanlin Wang et.al.	2412.15214	link
2024-12-19	Flowing from Words to Pixels: A Framework for Cross-Modality Evolution	Qihao Liu et.al.	2412.15213	null
2024-12-19	Generative Multiview Relighting for 3D Reconstruction under Extreme Illumination Variation	Hadi Alzayer et.al.	2412.15211	null
2024-12-19	Preventing Local Pitfalls in Vector Quantization via Optimal Transport	Borui Zhang et.al.	2412.15195	link
2024-12-19	AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation	Moayed Haji-Ali et.al.	2412.15191	null
2024-12-19	Tiled Diffusion	Or Madar et.al.	2412.15185	null
2024-12-19	OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization	Jiacheng Zhang et.al.	2412.15159	null
2024-12-19	Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM	Yatai Ji et.al.	2412.15156	link
2024-12-19	Jet: A Modern Transformer-Based Normalizing Flow	Alexander Kolesnikov et.al.	2412.15129	null
2024-12-19	Uni-Renderer: Unifying Rendering and Inverse Rendering Via Dual Stream Diffusion	Zhifei Chen et.al.	2412.15050	null
2024-12-19	DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space	Mang Ning et.al.	2412.15032	link
2024-12-19	Stable-V2A: Synthesis of Synchronized Sound Effects with Temporal and Semantic Controls	Riccardo Fosco Gramaccioni et.al.	2412.15023	null
2024-12-19	MagicNaming: Consistent Identity Generation by Finding a “Name Space” in T2I Diffusion Models	Jing Zhao et.al.	2412.14902	null
2024-12-19	Diffusion priors for Bayesian 3D reconstruction from incomplete measurements	Julian L. Möbius et.al.	2412.14897	null
2024-12-20	Quantum Algorithms for Stochastic Differential Equations: A Schrödingerisation Approach	Shi Jin et.al.	2412.14868	null
2024-12-18	AniDoc: Animation Creation Made Easier	Yihao Meng et.al.	2412.14173	null
2024-12-19	E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling	Zhihang Yuan et.al.	2412.14170	null
2024-12-18	Autoregressive Video Generation without Vector Quantization	Haoge Deng et.al.	2412.14169	link
2024-12-18	VideoDPO: Omni-Preference Alignment for Video Diffusion Generation	Runtao Liu et.al.	2412.14167	null
2024-12-18	MCMat: Multiview-Consistent and Physically Accurate PBR Material Generation	Shenhao Zhu et.al.	2412.14148	null
2024-12-18	SurgSora: Decoupled RGBD-Flow Diffusion Model for Controllable Surgical Video Generation	Tong Chen et.al.	2412.14018	null
2024-12-18	Comparative Analysis of Machine Learning-Based Imputation Techniques for Air Quality Datasets with High Missing Data Rates	Sen Yan et.al.	2412.13966	null
2024-12-18	IDEQ: an improved diffusion model for the TSP	Mickael Basson et.al.	2412.13858	null
2024-12-18	Object Style Diffusion for Generalized Object Detection in Urban Scene	Hao Li et.al.	2412.13815	null
2024-12-18	Text2Relight: Creative Portrait Relighting with Text Guidance	Junuk Cha et.al.	2412.13734	null
2024-12-18	Diffusion models and stochastic quantisation in lattice field theory	Gert Aarts et.al.	2412.13704	null
2024-12-18	MMO-IG: Multi-Class and Multi-Scale Object Image Generation for Remote Sensing	Chuang Yang et.al.	2412.13684	null
2024-12-18	VIIS: Visible and Infrared Information Synthesis for Severe Low-light Image Enhancement	Chen Zhao et.al.	2412.13655	link
2024-12-18	TAUDiff: Improving statistical downscaling for extreme weather events using generative diffusion models	Rahul Sundar et.al.	2412.13627	null
2024-12-18	PASCO (PArallel Structured COarsening): an overlay to speed up graph clustering algorithms	Etienne Lasalle et.al.	2412.13592	link
2024-12-17	CoMPaSS: Enhancing Spatial Understanding in Text-to-Image Diffusion Models	Gaoyang Zhang et.al.	2412.13195	link
2024-12-17	StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models	Yunzhi Yan et.al.	2412.13188	null
2024-12-17	Move-in-2D: 2D-Conditioned Human Motion Generation	Hsin-Ping Huang et.al.	2412.13185	null
2024-12-17	A Pontryagin-Guided Neural Policy Optimization Framework for Merton’s Portfolio Problem	Jeonggyu Huh et.al.	2412.13101	null
2024-12-17	Prompt Augmentation for Self-supervised Text-guided Image Manipulation	Rumeysa Bodur et.al.	2412.13081	null
2024-12-17	3D MedDiffusion: A 3D Medical Diffusion Model for Controllable and High-quality Medical Image Generation	Haoshen Wang et.al.	2412.13059	null
2024-12-18	Attentive Eraser: Unleashing Diffusion Model’s Object Removal Potential via Self-Attention Redirection Guidance	Wenhao Sun et.al.	2412.12974	link
2024-12-17	ArchesWeather & ArchesWeatherGen: a deterministic and generative model for efficient ML weather forecasting	Guillaume Couairon et.al.	2412.12971	link
2024-12-17	Generation of cosmic ray trajectories by a Diffusion Model trained on test particles in 3D magnetohydrodynamic turbulence	Johannes Martin et.al.	2412.12923	null
2024-12-17	Unsupervised Region-Based Image Editing of Denoising Diffusion Models	Zixiang Li et.al.	2412.12912	null
2024-12-17	Design of Restricted Normalizing Flow towards Arbitrary Stochastic Policy with Computational Efficiency	Taisuke Kobayashi et.al.	2412.12894	null
2024-12-18	ArtAug: Enhancing Text-to-Image Generation through Synthesis-Understanding Interaction	Zhongjie Duan et.al.	2412.12888	link
2024-12-17	Rethinking Diffusion-Based Image Generators for Fundus Fluorescein Angiography Synthesis on Limited Data	Chengzhou Yu et.al.	2412.12778	null
2024-12-17	Guided and Variance-Corrected Fusion with One-shot Style Alignment for Large-Content Image Generation	Shoukun Sun et.al.	2412.12771	link
2024-12-17	Towards a Training Free Approach for 3D Scene Editing	Vivek Madhavaram et.al.	2412.12766	null
2024-12-16	Causal Diffusion Transformers for Generative Modeling	Chaorui Deng et.al.	2412.12095	link
2024-12-16	CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models	Felix Taubner et.al.	2412.12093	null
2024-12-16	Wonderland: Navigating 3D Scenes from a Single Image	Hanwen Liang et.al.	2412.12091	null
2024-12-16	A LoRA is Worth a Thousand Pictures	Chenxi Liu et.al.	2412.12048	null
2024-12-16	The entropic optimal (self-)transport problem: Limit distributions for decreasing regularization with application to score function estimation	Gilles Mordant et.al.	2412.12007	null
2024-12-16	Controllable Shadow Generation with Single-Step Diffusion Models from Synthetic Data	Onur Tasar et.al.	2412.11972	null
2024-12-16	ColorFlow: Retrieval-Augmented Image Sequence Colorization	Junhao Zhuang et.al.	2412.11815	null
2024-12-16	InterDyn: Controllable Interactive Dynamics with Video Diffusion Models	Rick Akkerman et.al.	2412.11785	null
2024-12-16	Joint Reconstruction of the Activity and the Attenuation in PET by Diffusion Posterior Sampling: a Feasibility Study	Clémentine Phung-Ngoc et.al.	2412.11776	null
2024-12-17	No More Adam: Learning Rate Scaling at Initialization is All You Need	Minghao Xu et.al.	2412.11768	link
2024-12-16	Conditional Diffusion Models Based Conditional Independence Testing	Yanfeng Yang et.al.	2412.11744	link
2024-12-16	Re-Attentional Controllable Video Diffusion Editing	Yuanzhi Wang et.al.	2412.11710	link
2024-12-16	VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting	Muhammet Furkan Ilaslan et.al.	2412.11621	link
2024-12-16	3D $^2$ -Actor: Learning Pose-Conditioned 3D-Aware Denoiser for Realistic Gaussian Avatar Modeling	Zichen Tang et.al.	2412.11599	link
2024-12-16	StrandHead: Text to Strand-Disentangled 3D Head Avatars Using Hair Geometric Priors	Xiaokun Sun et.al.	2412.11586	link
2024-12-13	Towards a foundation model for heavy-ion collision experiments through point cloud diffusion	Manjunath Omana Kuttan et.al.	2412.10352	null
2024-12-13	BrushEdit: All-In-One Image Inpainting and Editing	Yaowei Li et.al.	2412.10316	null
2024-12-13	Coherent 3D Scene Diffusion From a Single RGB Image	Manuel Dahnert et.al.	2412.10294	null
2024-12-13	GAF: Gaussian Avatar Reconstruction from Monocular Videos via Multi-view Diffusion	Jiapeng Tang et.al.	2412.10209	null
2024-12-13	Efficient Generative Modeling with Residual Vector Quantization-Based Tokens	Jaehyeon Kim et.al.	2412.10208	null
2024-12-13	Simple Guidance Mechanisms for Discrete Diffusion Models	Yair Schiff et.al.	2412.10193	link
2024-12-13	SwiftTry: Fast and Consistent Video Virtual Try-On with Diffusion Models	Hung Nguyen et.al.	2412.10178	null
2024-12-13	The Art of Deception: Color Visual Illusions and Diffusion Models	Alex Gomez-Villa et.al.	2412.10122	null
2024-12-13	SuperMark: Robust and Training-free Image Watermarking via Diffusion-based Super-Resolution	Runyi Hu et.al.	2412.10049	null
2024-12-13	Emergence of complexity in opinion propagation: A reaction-diffusion model	Romain Ducasse et.al.	2412.10000	null
2024-12-13	Cycle-Consistent Bridge Diffusion Model for Accelerated MRI Reconstruction	Tao Song et.al.	2412.09998	null
2024-12-13	EP-CFG: Energy-Preserving Classifier-Free Guidance	Kai Zhang et.al.	2412.09966	null
2024-12-13	Generating 3D Pseudo-Healthy Knee MR Images to Support Trochleoplasty Planning	Michael Wehrli et.al.	2412.09962	link
2024-12-13	Efficient Dataset Distillation via Diffusion-Driven Patch Selection for Improved Generalization	Xinhao Zhong et.al.	2412.09959	null
2024-12-13	Latent feedback control of distributed systems in multiple scenarios through deep learning-based reduced order models	Matteo Tomasetto et.al.	2412.09942	null
2024-12-12	FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion	Haonan Qiu et.al.	2412.09626	null
2024-12-12	Illusion3D: 3D Multiview Illusion with 2D Diffusion Priors	Yue Feng et.al.	2412.09625	null
2024-12-12	OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation	Weiqi Li et.al.	2412.09623	null
2024-12-12	LoRACLR: Contrastive Adaptation for Customization of Diffusion Models	Enis Simsar et.al.	2412.09622	null
2024-12-12	SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training	Dongting Hu et.al.	2412.09619	null
2024-12-12	EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM	Zhuofan Zong et.al.	2412.09618	null
2024-12-12	Context Canvas: Enhancing Text-to-Image Diffusion Models with Knowledge Graph-Based RAG	Kavana Venkatesh et.al.	2412.09614	null
2024-12-12	LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors	Yabo Chen et.al.	2412.09597	null
2024-12-12	Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion	Zexin He et.al.	2412.09593	null
2024-12-12	SimAvatar: Simulation-Ready Avatars with Layered Hair and Clothing	Xueting Li et.al.	2412.09545	null
2024-12-12	Learned Compression for Compressed Learning	Dan Jacobellis et.al.	2412.09405	link
2024-12-12	Diffusion Model with Representation Alignment for Protein Inverse Folding	Chenglin Wang et.al.	2412.09380	null
2024-12-12	Diffusion Predictive Control with Constraints	Ralf Römer et.al.	2412.09342	link
2024-12-12	Auto-Regressive Moving Diffusion Models for Time Series Forecasting	Jiaxin Gao et.al.	2412.09328	link
2024-12-13	Are Conditional Latent Diffusion Models Effective for Image Restoration?	Yunchen Yuan et.al.	2412.09324	null
2024-12-11	Generative Semantic Communication: Architectures, Technologies, and Applications	Jinke Ren et.al.	2412.08642	null
2024-12-11	DMin: Scalable Training Data Influence Estimation for Diffusion Models	Huawei Lin et.al.	2412.08637	link
2024-12-11	TryOffAnyone: Tiled Cloth Generation from a Dressed Person	Ioannis Xarchakos et.al.	2412.08573	link
2024-12-11	A numerical method to simulate the stochastic linear-quadratic optimal control problem with control constraint in higher dimensions	Abhishek Chaudhary et.al.	2412.08553	null
2024-12-11	Learning Flow Fields in Attention for Controllable Person Image Generation	Zijian Zhou et.al.	2412.08486	link
2024-12-11	InvDiff: Invariant Guidance for Bias Mitigation in Diffusion Models	Min Hou et.al.	2412.08480	link
2024-12-11	CC-Diff: Enhancing Contextual Coherence in Remote Sensing Image Synthesis	Mu Zhang et.al.	2412.08464	null
2024-12-11	Reliable Uncertainty Quantification for Fiber Orientation in Composite Molding Processes using Multilevel Polynomial Surrogates	Stjepan Salatovic et.al.	2412.08459	null
2024-12-11	Generalized free energy and excess entropy production for active systems	Artemy Kolchinsky et.al.	2412.08432	null
2024-12-12	Pragmatist: Multiview Conditional Diffusion Models for High-Fidelity 3D Reconstruction from Unposed Sparse Views	Songchun Zhang et.al.	2412.08412	null
2024-12-11	Grasp Diffusion Network: Learning Grasp Generators from Partial Point Clouds with Diffusion Models in SO(3)xR3	Joao Carvalho et.al.	2412.08398	null
2024-12-11	Digging into Intrinsic Contextual Information for High-fidelity 3D Point Cloud Completion	Jisheng Chu et.al.	2412.08326	link
2024-12-11	GDSG: Graph Diffusion-based Solution Generation for Optimization Problems in MEC Networks	Ruihuai Liang et.al.	2412.08296	link
2024-12-11	Self-Refining Diffusion Samplers: Enabling Parallelization via Parareal Iterations	Nikil Roashan Selvam et.al.	2412.08292	link
2024-12-11	Toward Near-Globally Optimal Nonlinear Model Predictive Control via Diffusion Models	Tzu-Yuan Huang et.al.	2412.08278	null
2024-12-10	Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets	Zhen Liu et.al.	2412.07775	null
2024-12-10	From Slow Bidirectional to Fast Causal Video Generators	Tianwei Yin et.al.	2412.07772	null
2024-12-10	Make-A-Texture: Fast Shape-Aware Texture Generation in 3 Seconds	Xiaoyu Xiang et.al.	2412.07766	null
2024-12-10	Repurposing Pre-trained Video Diffusion Models for Event-based Video Interpolation	Jingxi Chen et.al.	2412.07761	null
2024-12-10	SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints	Jianhong Bai et.al.	2412.07760	link
2024-12-10	Multi-Shot Character Consistency for Text-to-Video Generation	Yuval Atzmon et.al.	2412.07750	null
2024-12-10	FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models	Tong Wu et.al.	2412.07674	null
2024-12-10	TraSCE: Trajectory Steering for Concept Erasure	Anubhav Jain et.al.	2412.07658	link
2024-12-11	Motion Artifact Removal in Pixel-Frequency Domain via Alternate Masks and Diffusion Model	Jiahua Xu et.al.	2412.07590	link
2024-12-10	DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation	Jianzong Wu et.al.	2412.07589	null
2024-12-10	Mobile Video Diffusion	Haitam Ben Yahia et.al.	2412.07583	null
2024-12-10	Parallel simulation for sampling under isoperimetry and score-based diffusion models	Huanjian Zhou et.al.	2412.07435	null
2024-12-10	Non-Progressive Influence Maximization in Dynamic Social Networks	Yunming Hui et.al.	2412.07402	null
2024-12-10	Fusion Embedding for Pose-Guided Person Image Synthesis with Diffusion Model	Donghwna Lee et.al.	2412.07333	null
2024-12-10	AppGen: Mobility-aware App Usage Behavior Generation for Mobile Users	Zihan Huang et.al.	2412.07267	null
2024-12-10	[MASK] is All You Need	Vincent Tao Hu et.al.	2412.06787	link
2024-12-09	Tactile DreamFusion: Exploiting Tactile Sensing for 3D Generation	Ruihan Gao et.al.	2412.06785	link
2024-12-09	Diverse Score Distillation	Yanbo Xu et.al.	2412.06780	null
2024-12-09	Visual Lexicon: Rich Image Features in Language Space	XuDong Wang et.al.	2412.06774	null
2024-12-09	InstantRestore: Single-Step Personalized Face Restoration with Shared-Image Attention	Howard Zhang et.al.	2412.06753	null
2024-12-10	ContRail: A Framework for Realistic Railway Image Synthesis using ControlNet	Andrei-Robert Alexandrescu et.al.	2412.06742	null
2024-12-09	Partially Observed Optimal Stochastic Control: Regularity, Optimality, Approximations, and Learning	Ali Devran Kara et.al.	2412.06735	null
2024-12-09	Take Fake as Real: Realistic-like Robust Black-box Adversarial Attack to Evade AIGC Detection	Caiyun Xie et.al.	2412.06727	link
2024-12-09	You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale	Baorui Ma et.al.	2412.06699	link
2024-12-09	Gen-3Diffusion: Realistic Image-to-3D Generation via 2D & 3D Diffusion Synergy	Yuxuan Xue et.al.	2412.06698	null
2024-12-09	Diff5T: Benchmarking Human Brain Diffusion MRI with an Extensive 5.0 Tesla K-Space and Spatial Dataset	Shanshan Wang et.al.	2412.06666	null
2024-12-09	Efficiency Meets Fidelity: A Novel Quantization Framework for Stable Diffusion	Shuaiting Li et.al.	2412.06661	null
2024-12-09	MVReward: Better Aligning and Evaluating Multi-View Diffusion Models with Human Preferences	Weitao Wang et.al.	2412.06614	null
2024-12-09	On the problem of optimal fair exchange	Alexander Kolesnikov et.al.	2412.06522	null
2024-12-09	Generative Lines Matching Models	Ori Matityahu et.al.	2412.06403	null
2024-12-06	Perturb-and-Revise: Flexible 3D Editing with Generative Trajectories	Susung Hong et.al.	2412.05279	null
2024-12-06	Birth and Death of a Rose	Chen Geng et.al.	2412.05278	null
2024-12-06	MotionFlow: Attention-Driven Motion Transfer in Video Diffusion Models	Tuna Han Salih Meral et.al.	2412.05275	null
2024-12-06	Go-or-Grow Models in Biology: a Monster on a Leash	R. Thiessen et.al.	2412.05191	null
2024-12-06	On Mean Field Monotonicity Conditions from Control Theoretical Perspective	Alain Bensoussan et.al.	2412.05189	null
2024-12-06	DNF: Unconditional 4D Generation with Dictionary-based Neural Fields	Xinyi Zhang et.al.	2412.05161	null
2024-12-06	Probabilistic Galaxy Field Generation with Diffusion Models	Tanner Sether et.al.	2412.05131	null
2024-12-06	The Silent Prompt: Initial Noise as Implicit Guidance for Goal-Driven Image Generation	Ruoyu Wang et.al.	2412.05101	null
2024-12-06	ReF-LDM: A Latent Diffusion Model for Reference-based Face Image Restoration	Chi-Wei Hsiao et.al.	2412.05043	null
2024-12-06	Noise Matters: Diffusion Model-based Urban Mobility Generation with Collaborative Noise Priors	Yuheng Zhang et.al.	2412.05000	null
2024-12-06	Continuous Video Process: Modeling Videos as Continuous Multi-Dimensional Processes for Video Prediction	Gaurav Shrivastava et.al.	2412.04929	null
2024-12-06	SleeperMark: Towards Robust Watermark against Fine-Tuning Text-to-image Diffusion Models	Zilan Wang et.al.	2412.04852	link
2024-12-06	Wavelet Diffusion Neural Operator	Peiyan Hu et.al.	2412.04833	link
2024-12-06	DAWN-SI: Data-Aware and Noise-Informed Stochastic Interpolation for Solving Inverse Problems	Shadab Ahamed et.al.	2412.04766	null
2024-12-06	Diff4Steer: Steerable Diffusion Prior for Generative Music Retrieval with Semantic Guidance	Xuchan Bao et.al.	2412.04746	null
2024-12-05	PaintScene4D: Consistent 4D Scene Generation from Text Prompts	Vinayak Gupta et.al.	2412.04471	null
2024-12-05	LayerFusion: Harmonized Multi-Layer Text-to-Image Generation with Generative Priors	Yusuf Dalva et.al.	2412.04460	null
2024-12-05	Four-Plane Factorized Video Autoencoders	Mohammed Suhail et.al.	2412.04452	null
2024-12-05	MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation	Longtao Zheng et.al.	2412.04448	null
2024-12-05	DiCoDe: Diffusion-Compressed Deep Tokens for Autoregressive Video Generation with Language Models	Yizhuo Li et.al.	2412.04446	null
2024-12-05	Learning Artistic Signatures: Symmetry Discovery and Style Transfer	Emma Finn et.al.	2412.04441	null
2024-12-05	Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation	Yuying Ge et.al.	2412.04432	link
2024-12-05	Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis	Jian Han et.al.	2412.04431	link
2024-12-05	Reversible molecular simulation for training classical and machine learning force fields	Joe G Greener et.al.	2412.04374	link
2024-12-05	ActFusion: a Unified Diffusion Model for Action Segmentation and Anticipation	Dayoung Gong et.al.	2412.04353	null
2024-12-05	RMD: A Simple Baseline for More General Human Motion Generation via Training-free Retrieval-Augmented Motion Diffuse	Zhouyingcheng Liao et.al.	2412.04343	null
2024-12-05	Likelihood-Scheduled Score-Based Generative Modeling for Fully 3D PET Image Reconstruction	George Webber et.al.	2412.04339	null
2024-12-05	Multi-Subject Image Synthesis as a Generative Prior for Single-Subject PET Image Reconstruction	George Webber et.al.	2412.04324	null
2024-12-05	Structure-Aware Stylized Image Synthesis for Robust Medical Image Segmentation	Jie Bao et.al.	2412.04296	link
2024-12-05	Alpha shapes and optimal transport on the sphere	Erik Carlsson et.al.	2412.04286	link
2024-12-04	MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation	Zehuan Huang et.al.	2412.03558	null
2024-12-04	NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images	Lingen Li et.al.	2412.03517	null
2024-12-04	Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion	Shengyuan Zhang et.al.	2412.03515	link
2024-12-04	Self-test loss functions for learning weak-form operators and gradient flows	Yuan Gao et.al.	2412.03506	null
2024-12-04	Solving Monge problem by Hilbert space embeddings of probability measures	Takafumi Saito et.al.	2412.03478	null
2024-12-04	CleanDIFT: Diffusion Features without Noise	Nick Stracke et.al.	2412.03439	link
2024-12-04	SINGER: Vivid Audio-driven Singing Video Generation with Multi-scale Spectral Diffusion Model	Yan Li et.al.	2412.03430	null
2024-12-04	Skel3D: Skeleton Guided Novel View Synthesis	Aron Fóthi et.al.	2412.03407	null
2024-12-04	Deep Operator BSDE: a Numerical Scheme to Approximate the Solution Operators	Giulia Di Nunno et.al.	2412.03405	null
2024-12-04	Identifiability implies consistency of MLE in partially observed diffusions on a torus	Ibrahim Ekren et.al.	2412.03380	null
2024-12-04	TASR: Timestep-Aware Diffusion Model for Image Super-Resolution	Qinwei Lin et.al.	2412.03355	link
2024-12-04	DIVE: Taming DINO for Subject-Driven Video Editing	Yi Huang et.al.	2412.03347	null
2024-12-04	Geometry-guided Cross-view Diffusion for One-to-many Cross-view Image Synthesis	Tao Jun Lin et.al.	2412.03315	null
2024-12-04	Schrodinger Bridge over Averaged Systems	Daniel Owusu Adu et.al.	2412.03294	null
2024-12-04	Diffusion-VLA: Scaling Robot Foundation Models via Unified Diffusion and Autoregression	Junjie Wen et.al.	2412.03293	null
2024-12-03	Diffusion-based Visual Anagram as Multi-task Learning	Zhiyuan Xu et.al.	2412.02693	link
2024-12-03	FoundHand: Large-Scale Domain-Specific Learning for Controllable Hand Image Generation	Kefan Chen et.al.	2412.02690	null
2024-12-04	SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance	Viet Nguyen et.al.	2412.02687	null
2024-12-03	Sharp-It: A Multi-view to Multi-view Diffusion Model for 3D Synthesis and Manipulation	Yiftach Edelstein et.al.	2412.02631	null
2024-12-03	Unveiling Concept Attribution in Diffusion Models	Quang H. Nguyen et.al.	2412.02542	link
2024-12-03	It Takes Two: Real-time Co-Speech Two-person’s Interaction Generation via Reactive Auto-regressive Diffusion Model	Mingyi Shi et.al.	2412.02419	null
2024-12-03	GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing	Khawar Islam et.al.	2412.02366	null
2024-12-03	LoRA Diffusion: Zero-Shot LoRA Synthesis for Diffusion Model Personalization	Ethan Smith et.al.	2412.02352	null
2024-12-03	SimuScope: Realistic Endoscopic Synthetic Dataset Generation through Surgical Simulation and Diffusion Models	Sabina Martyniak et.al.	2412.02332	link
2024-12-03	Controlling the Latent Diffusion Model for Generative Image Shadow Removal via Residual Generation	Xinjie Li et.al.	2412.02322	null
2024-12-03	Viewpoint Consistency in 3D Generation via Attention and CLIP Guidance	Qing Zhang et.al.	2412.02287	null
2024-12-03	Fast LiDAR Data Generation with Rectified Flows	Kazuto Nakashima et.al.	2412.02241	link
2024-12-03	Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models	Jungwon Park et.al.	2412.02237	link
2024-12-03	How to Use Diffusion Priors under Sparse Views?	Qisen Wang et.al.	2412.02225	link
2024-12-03	GIST: Towards Photorealistic Style Transfer via Multiscale Geometric Representations	Renan A. Rojas-Gomez et.al.	2412.02214	null
2024-11-29	Gaussian multi-target filtering with target dynamics driven by a stochastic differential equation	Ángel F. García-Fernández et.al.	2411.19814	link
2024-11-29	MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks	Yiming Wu et.al.	2411.19786	null
2024-11-29	Riemannian Denoising Score Matching for Molecular Structure Optimization with Accurate Energy	Jeheon Woo et.al.	2411.19769	null
2024-11-29	TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting	Bojun Xiong et.al.	2411.19654	link
2024-11-29	Uniform Attention Maps: Boosting Image Fidelity in Reconstruction and Editing	Wenyi Mo et.al.	2411.19652	link
2024-11-29	Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook	Florinel-Alin Croitoru et.al.	2411.19537	link
2024-11-29	Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis	Tianqi Li et.al.	2411.19509	link
2024-11-29	Diffusion Models Meet Network Management: Improving Traffic Matrix Analysis with Diffusion-based Approach	Xinyu Yuan et.al.	2411.19493	link
2024-11-28	DreamBlend: Advancing Personalized Fine-tuning of Text-to-Image Diffusion Models	Shwetha Ram et.al.	2411.19390	null
2024-11-28	Enhancing Sketch Animation: Text-to-Video Diffusion Models with Temporal Consistency and Rigidity Constraints	Gaurav Rai et.al.	2411.19381	null
2024-11-28	Towards a Mechanistic Explanation of Diffusion Model Generalization	Matthew Niedoba et.al.	2411.19339	null
2024-11-28	Trajectory Attention for Fine-grained Video Motion Control	Zeqi Xiao et.al.	2411.19324	null
2024-11-28	Improving Multi-Subject Consistency in Open-Domain Image Generation with Isolation and Reposition Attention	Huiguo He et.al.	2411.19261	null
2024-11-28	Gaussians-to-Life: Text-Driven Animation of 3D Gaussian Splatting Scenes	Thomas Wimmer et.al.	2411.19233	link
2024-11-28	Z-STAR+: A Zero-shot Style Transfer Method via Adjusting Style Distribution	Yingying Deng et.al.	2411.19231	null
2024-11-27	GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data	Wentao Wang et.al.	2411.18624	null
2024-11-27	Diffusion Self-Distillation for Zero-Shot Customized Image Generation	Shengqu Cai et.al.	2411.18616	null
2024-11-27	CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models	Rundi Wu et.al.	2411.18613	null
2024-11-27	Evaluating and Improving the Effectiveness of Synthetic Chest X-Rays for Medical Image Analysis	Eva Prakash et.al.	2411.18602	null
2024-11-27	FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion	Haosen Yang et.al.	2411.18552	null
2024-11-28	Enhancing weed detection performance by means of GenAI-based image augmentation	Sourav Modak et.al.	2411.18513	null
2024-11-27	Learning the Evolution of Physical Structure of Galaxies via Diffusion Models	Andrew Lizarraga et.al.	2411.18440	link
2024-11-27	De-baryonifying halos via optimal transport	Leander Thiele et.al.	2411.18399	null
2024-11-27	Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models	Yiming Wu et.al.	2411.18375	null
2024-11-28	Large systems of symmetrized trapped Brownian Bridges and Schrodinger processes	Stefan Adams et.al.	2411.18359	null
2024-11-27	TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models	Riza Velioglu et.al.	2411.18350	link
2024-11-27	HiFiVFS: High Fidelity Video Face Swapping	Xu Chen et.al.	2411.18293	null
2024-11-27	TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution	Linwei Dong et.al.	2411.18263	link
2024-11-27	Dependency-Aware CAV Task Scheduling via Diffusion-Based Reinforcement Learning	Xiang Cheng et.al.	2411.18230	null
2024-11-27	Uniqueness and regularity of weak solutions of a drift-diffusion system for perovskite solar cells	Annegret Glitzky et.al.	2411.18223	null
2024-11-27	StableAnimator: High-Quality Identity-Preserving Human Image Animation	Shuyuan Tu et.al.	2411.17697	link
2024-11-26	ScribbleLight: Single Image Indoor Relighting with Scribbles	Jun Myeong Choi et.al.	2411.17696	null
2024-11-26	GenDeg: Diffusion-Based Degradation Synthesis for Generalizable All-in-One Image Restoration	Sudarshan Rajagopalan et.al.	2411.17687	null
2024-11-26	Accelerating Vision Diffusion Transformers with Skip Branches	Guanjie Chen et.al.	2411.17616	link
2024-11-26	VideoDirector: Precise Video Editing via Text-to-Video Models	Yukun Wang et.al.	2411.17592	null
2024-11-26	FTMoMamba: Motion Generation with Frequency and Text State Space Models	Chengjian Li et.al.	2411.17532	null
2024-11-26	WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model	Zongjian Li et.al.	2411.17459	link
2024-11-26	Image Generation with Multimodule Semantic Feature-Aided Selection for Semantic Communications	Chengyang Liang et.al.	2411.17428	null
2024-11-26	Reward Incremental Learning in Text-to-Image Generation	Maorong Wang et.al.	2411.17310	null
2024-11-26	APT: Architectural Planning and Text-to-Blueprint Construction Using Large Language Models for Open-World Agents	Jun Yu Chen et.al.	2411.17255	link
2024-11-26	DiffSLT: Enhancing Diversity in Sign Language Translation via Diffusion Model	JiHwan Moon et.al.	2411.17248	null
2024-11-26	Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration	Junyuan Deng et.al.	2411.17240	link
2024-11-26	From Graph Diffusion to Graph Classification	Jia Jun Cheng Xian et.al.	2411.17236	null
2024-11-26	DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting	Yicheng Yang et.al.	2411.17223	link
2024-11-26	Large deviations of the empirical measures of a strong-Feller Markov process inside a subset and quasi-ergodic distribution	Arnaud Guillin et.al.	2411.17216	null
2024-11-25	Generative Omnimatte: Learning to Decompose Video into Layers	Yao-Chih Lee et.al.	2411.16683	null
2024-11-25	Diffusion Features for Zero-Shot 6DoF Object Pose Estimation	Bernd Von Gimborn et.al.	2411.16668	null
2024-11-25	On a problem of optimal mixing	Kirill Sokolov et.al.	2411.16651	null
2024-11-25	LegoPET: Hierarchical Feature Guided Conditional Diffusion for PET Image Reconstruction	Yiran Sun et.al.	2411.16629	link
2024-11-25	Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models	Ronghuan Wu et.al.	2411.16602	null
2024-11-25	Unlocking The Potential of Adaptive Attacks on Diffusion-Based Purification	Andre Kassis et.al.	2411.16598	link
2024-11-25	Rethinking Diffusion for Text-Driven Human Motion Generation	Zichong Meng et.al.	2411.16575	null
2024-11-25	Representation Collapsing Problems in Vector Quantization	Wenhao Zhao et.al.	2411.16550	null
2024-11-25	ADOBI: Adaptive Diffusion Bridge For Blind Inverse Problems with Application to MRI Reconstruction	Yuyang Hu et.al.	2411.16535	null
2024-11-25	Noise Diffusion for Enhancing Semantic Faithfulness in Text-to-Image Synthesis	Boming Miao et.al.	2411.16503	null
2024-11-25	On approximations of stochastic optimal control problems with an application to climate equations	Franco Flandoli et.al.	2411.16491	null
2024-11-25	Model-based reinforcement corrosion prediction: Continuous calibration with Bayesian optimization and corrosion wire sensor data	A. Potnis et.al.	2411.16447	null
2024-11-25	Privacy Protection in Personalized Diffusion Models via Targeted Cross-Attention Adversarial Attack	Xide Xu et.al.	2411.16437	null
2024-11-25	Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing	Kaifeng Gao et.al.	2411.16375	link
2024-11-25	One Diffusion to Generate Them All	Duong H. Le et.al.	2411.16318	link
2024-11-22	DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving	Bencheng Liao et.al.	2411.15139	link
2024-11-22	Material Anything: Generating Materials for Any 3D Object via Diffusion	Xin Huang et.al.	2411.15138	null
2024-11-22	VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement	Daeun Lee et.al.	2411.15115	null
2024-11-22	Leapfrog Latent Consistency Model (LLCM) for Medical Images Generation	Lakshmikar R. Polamreddy et.al.	2411.15084	link
2024-11-22	The 1D nonlocal Fisher-KPP equation with a top hat kernel. Part 3. The effect of perturbations in the kernel	David John Needham et.al.	2411.15054	null
2024-11-22	FloAt: Flow Warping of Self-Attention for Clothing Animation Generation	Swasti Shreya Mishra et.al.	2411.15028	null
2024-11-22	Enhancing Exploration with Diffusion Policies in Hybrid Off-Policy RL: Application to Non-Prehensile Manipulation	Huy Le et.al.	2411.14913	null
2024-11-22	Prioritize Denoising Steps on Diffusion Model Preference Alignment via Explicit Denoised Distribution Estimation	Dingyuan Shi et.al.	2411.14871	null
2024-11-22	Latent Schrodinger Bridge: Prompting Latent Diffusion for Fast Unpaired Image-to-Image Translation	Jeongsol Kim et.al.	2411.14863	null
2024-11-22	Style-Friendly SNR Sampler for Style-Driven Generation	Jooyoung Choi et.al.	2411.14793	null
2024-11-22	FastGrasp: Efficient Grasp Synthesis with Diffusion	Xiaofei Wu et.al.	2411.14786	link
2024-11-22	Kolmogorov Modes and Linear Response of Jump-Diffusion Models: Applications to Stochastic Excitation of the ENSO Recharge Oscillator	Mickaël D. Chekroun et.al.	2411.14769	null
2024-11-22	Measurement of the dynamic charge susceptibility near the charge density wave transition in ErTe $_3$	Dipanjan Chaudhuri et.al.	2411.14746	null
2024-11-22	TEXGen: a Generative Diffusion Model for Mesh Textures	Xin Yu et.al.	2411.14740	link
2024-11-22	AI Tailoring: Evaluating Influence of Image Features on Fashion Product Popularity	Xiaomin Li et.al.	2411.14737	null
2024-11-21	Stable Flow: Vital Layers for Training-Free Image Editing	Omri Avrahami et.al.	2411.14430	link
2024-11-21	Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation	Yuanhao Cai et.al.	2411.14384	null
2024-11-21	CoNFiLD-inlet: Synthetic Turbulence Inflow Using Generative Latent Diffusion Models with Neural Fields	Xin-Yang Liu et.al.	2411.14378	null
2024-11-21	Enhancing Medical Image Segmentation with Deep Learning and Diffusion Models	Houze Liu et.al.	2411.14353	null
2024-11-21	Continuous nonlinear adaptive experimental design with gradient flow	Ruhui Jin et.al.	2411.14332	null
2024-11-21	StereoCrafter-Zero: Zero-Shot Stereo Video Generation with Noisy Restart	Jian Shi et.al.	2411.14295	link
2024-11-21	Stochastic interventions, sensitivity analysis, and optimal transport	Alexander W. Levis et.al.	2411.14285	null
2024-11-21	Guided MRI Reconstruction via Schrödinger Bridge	Yue Wang et.al.	2411.14269	null
2024-11-21	TaQ-DiT: Time-aware Quantization for Diffusion Transformers	Xinyan Liu et.al.	2411.14172	null
2024-11-21	RestorerID: Towards Tuning-Free Face Restoration with ID Preservation	Jiacheng Ying et.al.	2411.14125	link
2024-11-21	Point Cloud Resampling with Learnable Heat Diffusion	Wenqiang Xu et.al.	2411.14120	null
2024-11-21	Transforming Static Images Using Generative Models for Video Salient Object Detection	Suhwan Cho et.al.	2411.13975	link
2024-11-21	Continuum of coupled Wasserstein gradient flows	Clément Cancès et.al.	2411.13969	null
2024-11-21	Decoupled Sparse Priors Guided Diffusion Compression Model for Point Clouds	Xiaoge Zhang et.al.	2411.13860	null
2024-11-21	Detecting Human Artifacts from Text-to-Image Models	Kaihong Wang et.al.	2411.13842	link
2024-11-20	REDUCIO! Generating 1024 $\times$ 1024 Video within 16 Seconds using Extremely Compressed Motion Latents	Rui Tian et.al.	2411.13552	link
2024-11-20	Identity Preserving 3D Head Stylization with Multiview Score Distillation	Bahri Batuhan Bilecen et.al.	2411.13536	null
2024-11-20	Heuristically Adaptive Diffusion-Model Evolutionary Strategy	Benedikt Hartl et.al.	2411.13420	null
2024-11-20	ripALM: A Relative-Type Inexact Proximal Augmented Lagrangian Method with Applications to Quadratically Regularized Optimal Transport	Jiayi Zhu et.al.	2411.13267	null
2024-11-20	A new maximal regularity for parabolic equations and an application	Jinlong Wei et.al.	2411.13266	null
2024-11-20	XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation	Ziyi Wang et.al.	2411.13243	link
2024-11-20	Backward Stochastic Control System with Entropy Regularization	Ziyue Chen et.al.	2411.13219	null
2024-11-20	A computational framework for integrating Predictive processes with evidence Accumulation Models (PAM)	Antonino Visalli et.al.	2411.13203	link
2024-11-20	RAW-Diffusion: RGB-Guided Diffusion Models for High-Fidelity RAW Image Generation	Christoph Reinders et.al.	2411.13150	link
2024-11-20	CopyrightMeter: Revisiting Copyright Protection in Text-to-image Models	Naen Xu et.al.	2411.13144	null
2024-11-20	Virtual Staining of Label-Free Tissue in Imaging Mass Spectrometry	Yijie Zhang et.al.	2411.13120	null
2024-11-20	Distribution-free Measures of Association based on Optimal Transport	Nabarun Deb et.al.	2411.13080	null
2024-11-19	Breaking the wire: the impact of critical length on melting pathways in silver nanowires	Kannan M Ridings et.al.	2411.12891	null
2024-11-19	From Text to Pose to Image: Improving Diffusion Model Control and Quality	Clément Bonnett et.al.	2411.12872	link
2024-11-19	CDI: Copyrighted Data Identification in Diffusion Models	Jan Dubiński et.al.	2411.12858	link
2024-11-19	PoM: Efficient Image and Video Generation with the Polynomial Mixer	David Picard et.al.	2411.12663	link
2024-11-19	Improving Controllability and Editability for Pretrained Text-to-Music Generation Models	Yixiao Zhang et.al.	2411.12641	null
2024-11-19	Data Pruning in Generative Diffusion Models	Rania Briq et.al.	2411.12523	link
2024-11-19	Itô, Stratonovich, and zoom-in schemes in stochastic inflation	Eemeli Tomberg et.al.	2411.12465	null
2024-11-19	Frequency-Aware Guidance for Blind Image Restoration via Diffusion Models	Jun Xiao et.al.	2411.12450	null
2024-11-19	Combinational Backdoor Attack against Customized Text-to-Image Models	Wenbo Jiang et.al.	2411.12389	null
2024-11-19	Scalable and Effective Negative Sample Generation for Hyperedge Prediction	Shilin Qu et.al.	2411.12354	null
2024-11-19	Diffusion Product Quantization	Jie Shao et.al.	2411.12306	null
2024-11-19	SSEditor: Controllable Mask-to-Scene Generation with Diffusion Model	Haowen Zheng et.al.	2411.12290	link
2024-11-20	HouseLLM: LLM-Assisted Two-Phase Text-to-Floorplan Generation	Ziyang Zong et.al.	2411.12279	null
2024-11-19	On sensitivities regarding shape and topology optimization as derivatives on Wasserstein spaces	Fumiya Okazaki et.al.	2411.12234	null
2024-11-19	Wavespeed selection of travelling wave solutions of a two-component reaction-diffusion model of cell invasion	Yuhui Chen et.al.	2411.12232	null
2024-11-19	Constant Rate Schedule: Constant-Rate Distributional Change for Efficient Training and Sampling in Diffusion Models	Shuntaro Okada et.al.	2411.12188	null
2024-11-19	Diffusion-Inspired Cold Start with Sufficient Prior in Computerized Adaptive Testing	Haiping Ma et.al.	2411.12182	link
2024-11-19	Enhancing Low Dose Computed Tomography Images Using Consistency Training Techniques	Mahmut S. Gokmen et.al.	2411.12181	null
2024-11-18	Milstein-type schemes for McKean-Vlasov SDEs driven by Brownian motion and Poisson random measure (with super-linear coefficients)	Sani Biswas et.al.	2411.11759	null
2024-11-18	Aligning Few-Step Diffusion Models with Dense Reward Difference Learning	Ziyi Zhang et.al.	2411.11727	link
2024-11-18	Robust Reinforcement Learning under Diffusion Models for Data with Jumps	Chenyang Jiang et.al.	2411.11697	null
2024-11-18	Conceptwm: A Diffusion Model Watermark for Concept Protection	Liangqi Lei et.al.	2411.11688	null
2024-11-19	Cascaded Diffusion Models for 2D and 3D Microscopy Image Synthesis to Enhance Cell Segmentation	Rüveyda Yilmaz et.al.	2411.11515	link
2024-11-18	MVLight: Relightable Text-to-3D Generation via Light-conditioned Multi-View Diffusion	Dongseok Shim et.al.	2411.11475	null
2024-11-18	CLUE-MARK: Watermarking Diffusion Models using CLWE	Kareem Shehata et.al.	2411.11434	null
2024-11-18	Teaching Video Diffusion Model with Latent Physical Phenomenon Knowledge	Qinglong Cao et.al.	2411.11343	null
2024-11-18	Stochastic quantization and diffusion models	Kenji Fukushima et.al.	2411.11297	null
2024-11-18	Unbiased Approximations for Stationary Distributions of McKean-Vlasov SDEs	Elsiddig Awadelkarim et.al.	2411.11270	null
2024-11-17	Stealing Training Graphs from Graph Neural Networks	Minhua Lin et.al.	2411.11197	null
2024-11-17	DeepSPV: An Interpretable Deep Learning Pipeline for 3D Spleen Volume Estimation from 2D Ultrasound Images	Zhen Yuan et.al.	2411.11190	null
2024-11-17	Strong Stability Preservation for Stochastic Partial Differential Equations	James Woodfield et.al.	2411.11172	null
2024-11-17	Integrated Ising Model with global inhibition for decision making	Olga Tapinova et.al.	2411.11143	null
2024-11-17	Oscillation Inversion: Understand the structure of Large Flow Model through the Lens of Inversion Method	Yan Zheng et.al.	2411.11135	null
2024-11-15	M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation	Sucheng Ren et.al.	2411.10433	link
2024-11-15	Mitigating Parameter Degeneracy using Joint Conditional Diffusion Model for WECC Composite Load Model in Power Systems	Feiqin Zhu et.al.	2411.10431	null
2024-11-15	Towards High-Fidelity 3D Portrait Generation with Rich Details by Cross-View Prior-Aware Diffusion	Haoran Wei et.al.	2411.10369	null
2024-11-15	Probabilistic Prior Driven Attention Mechanism Based on Diffusion Model for Imaging Through Atmospheric Turbulence	Guodong Sun et.al.	2411.10321	null
2024-11-15	Modification Takes Courage: Seamless Image Stitching via Reference-Driven Inpainting	Ziqi Xie et.al.	2411.10309	link
2024-11-15	The Unreasonable Effectiveness of Guidance for Diffusion Models	Tim Kaiser et.al.	2411.10257	null
2024-11-15	Smooth transport map via diffusion process	Arthur Stéphanovitch et.al.	2411.10235	null
2024-11-15	ColorEdit: Training-free Image-Guided Color editing with diffusion model	Xingxi Yin et.al.	2411.10232	null
2024-11-15	Fused Gromov-Wasserstein Variance Decomposition with Linear Optimal Transport	Michael Wilson et.al.	2411.10204	null
2024-11-15	Evaluating Text-to-Image Diffusion Models for Texturing Synthetic Data	Thomas Lips et.al.	2411.10164	link
2024-11-15	Towards Multi-View Consistent Style Transfer with One-Step Diffusion via Vision Conditioning	Yushen Zuo et.al.	2411.10130	null
2024-11-15	SPLIT: SE(3)-diffusion via Local Geometry-based Score Prediction for 3D Scene-to-Pose-Set Matching Problems	Kanghyun Kim et.al.	2411.10049	null
2024-11-15	EyeDiff: text-to-image diffusion model improves rare eye disease diagnosis	Ruoyu Chen et.al.	2411.10004	null
2024-11-15	Adaptive Non-Uniform Timestep Sampling for Diffusion Model Training	Myunsoo Kim et.al.	2411.09998	null
2024-11-15	Instruction-Guided Editing Controls for Images and Multimedia: A Survey in LLM era	Thanh Tam Nguyen et.al.	2411.09955	link
2024-11-14	How to implement the Bayes’ formula in the age of ML?	Amirhossein Taghvaei et.al.	2411.09653	null
2024-11-14	Golden Noise for Diffusion Models: A Learning Framework	Zikai Zhou et.al.	2411.09502	link
2024-11-14	DiffRoad: Realistic and Diverse Road Scenario Generation for Autonomous Vehicle Testing	Junjie Zhou et.al.	2411.09451	null
2024-11-14	Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models	Chutian Meng et.al.	2411.09449	null
2024-11-14	A survey of probabilistic generative frameworks for molecular simulations	Richard John et.al.	2411.09388	link
2024-11-14	EEG-Based Speech Decoding: A Novel Approach Using Multi-Kernel Ensemble Diffusion Models	Soowon Kim et.al.	2411.09302	null
2024-11-14	Advancing Diffusion Models: Alias-Free Resampling and Enhanced Rotational Equivariance	Md Fahim Anjum et.al.	2411.09174	null
2024-11-14	VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation	Youpeng Wen et.al.	2411.09153	null
2024-11-14	General linear threshold models with application to influence maximization	Alexander Kagan et.al.	2411.09100	link
2024-11-13	Microfoundation Inference for Strategic Prediction	Daniele Bracale et.al.	2411.08998	null
2024-11-15	Inconsistencies In Consistency Models: Better ODE Solving Does Not Imply Better Samples	Noël Vouitsis et.al.	2411.08954	link
2024-11-13	4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization	Mijeong Kim et.al.	2411.08879	null
2024-11-13	Offline Adaptation of Quadruped Locomotion using Diffusion Models	Reece O’Mahoney et.al.	2411.08832	link
2024-11-13	Optimal Transport-Based Displacement Interpolation with Data Augmentation for Reduced Order Modeling of Nonlinear Dynamical Systems	Moaad Khamlich et.al.	2411.08750	null
2024-11-13	Berry-Esseen bounds for large-time asymptotics of one-dimensional diffusion processes via Malliavin-Stein method	Seiichiro Kusuoka et.al.	2411.08725	null
2024-11-13	A Machine Learning Algorithm for Finite-Horizon Stochastic Control Problems in Economics	Xianhua Peng et.al.	2411.08668	null
2024-11-13	Towards More Accurate Fake Detection on Images Generated from Advanced Generative and Neural Rendering Models	Chengdong Dong et.al.	2411.08642	null
2024-11-13	Neural Topic Modeling with Large Language Models in the Loop	Xiaohao Yang et.al.	2411.08534	null
2024-11-13	V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising Diffusion	Xun Huang et.al.	2411.08402	link
2024-11-13	Physics Informed Distillation for Diffusion Models	Joshua Tian Jin Tee et.al.	2411.08378	link
2024-11-13	Multiscale Graph Construction Using Non-local Cluster Features	Reina Kaneko et.al.	2411.08371	null
2024-11-13	Generative AI for Data Augmentation in Wireless Networks: Analysis, Applications, and Case Study	Jinbo Wen et.al.	2411.08341	null
2024-11-13	Motion Control for Enhanced Complex Action Video Generation	Qiang Zhou et.al.	2411.08328	null
2024-11-13	Conditional Variable Flow Matching: Transforming Conditional Densities with Amortized Conditional Optimal Transport	Adam P. Generale et.al.	2411.08314	link
2024-11-13	DNN Task Assignment in UAV Networks: A Generative AI Enhanced Multi-Agent Reinforcement Learning Approach	Xin Tang et.al.	2411.08299	null
2024-11-12	Joint Diffusion models in Continual Learning	Paweł Skierś et.al.	2411.08224	null
2024-11-12	Scaling Properties of Diffusion Models for Perceptual Tasks	Rahul Ravishankar et.al.	2411.08034	null
2024-11-12	GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation	Yushi Lan et.al.	2411.08033	null
2024-11-12	Approximation rates of entropic maps in semidiscrete optimal transport	Ritwik Sadhu et.al.	2411.07947	null
2024-11-12	Stochastic MPC for Finite Gaussian Mixture Disturbances with Guarantees	Maico H. W. Engelaar et.al.	2411.07887	null
2024-11-12	Diverse capability and scaling of diffusion and auto-regressive models when learning abstract rules	Binxu Wang et.al.	2411.07873	null
2024-11-12	Federated Learning for Discrete Optimal Transport with Large Population under Incomplete Information	Navpreet Kaur et.al.	2411.07841	null
2024-11-12	Novel View Synthesis with Pixel-Space Diffusion Models	Noam Elata et.al.	2411.07765	null
2024-11-12	Nanosecond nanothermometry in an electron microscope	Florian Castioni et.al.	2411.07764	null
2024-11-12	Leveraging Previous Steps: A Training-free Fast Solver for Flow Diffusion	Kaiyu Song et.al.	2411.07627	null
2024-11-12	Unraveling the Connections between Flow Matching and Diffusion Probabilistic Models in Training-free Conditional Generation	Kaiyu Song et.al.	2411.07625	null
2024-11-12	Harmonizing Pixels and Melodies: Maestro-Guided Film Score Generation and Composition Style Transfer	F. Qi et.al.	2411.07539	null
2024-11-12	FM-TS: Flow Matching for Time Series Generation	Yang Hu et.al.	2411.07506	link
2024-11-12	Semi-Truths: A Large-Scale Dataset of AI-Augmented Images for Evaluating Robustness of AI-Generated Image detectors	Anisha Pal et.al.	2411.07472	link
2024-11-12	Tracing the Roots: Leveraging Temporal Dynamics in Diffusion Trajectories for Origin Attribution	Andreas Floros et.al.	2411.07449	null
2024-11-12	All-in-one Weather-degraded Image Restoration via Adaptive Degradation-aware Self-prompting Model	Yuanbo Wen et.al.	2411.07445	null
2024-11-11	Score-based generative diffusion with “active” correlated noise sources	Alexandra Lamtyugina et.al.	2411.07233	null
2024-11-12	Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models	Yoad Tewel et.al.	2411.07232	null
2024-11-11	DLCR: A Generative Data Expansion Framework via Diffusion for Clothes-Changing Person Re-ID	Nyle Siddiqui et.al.	2411.07205	link
2024-11-11	Crossover from inhomogeneous to homogeneous response of a resonantly driven hBN quantum emitter	Domitille Gérard et.al.	2411.07202	null
2024-11-11	OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision	Cong Wei et.al.	2411.07199	null
2024-11-11	More Expressive Attention with Negative Weights	Ang Lv et.al.	2411.07176	link
2024-11-11	Rough differential equations in the flow approach	Ajay Chandra et.al.	2411.07157	null
2024-11-11	Conditional simulation via entropic optimal transport: Toward non-parametric estimation of conditional Brenier maps	Ricardo Baptista et.al.	2411.07154	null
2024-11-11	Variational Graph Contrastive Learning	Shifeng Xie et.al.	2411.07150	link
2024-11-11	Edify 3D: Scalable High-Quality 3D Asset Generation	NVIDIA et.al.	2411.07135	null
2024-11-11	Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models	NVIDIA et.al.	2411.07126	null
2024-11-12	Distribution dependent SDEs with multiplicative fractional noise	Xiliang Fan et.al.	2411.06974	null
2024-11-11	Nonparametric estimation of trend for stochastic differential equations driven by multiplicative stochastic volatility	B. L. S. Prakasa Rao et.al.	2411.06865	null
2024-11-11	The Exponential Lie Series and a Chen-Strichartz Formula for Levy Processes	Kurusch Ebrahimi-Fard et.al.	2411.06827	null
2024-11-11	White-Box Diffusion Transformer for single-cell RNA-seq generation	Zhuorui Cui et.al.	2411.06785	link
2024-11-08	StdGEN: Semantic-Decomposed 3D Character Generation from Single Images	Yuze He et.al.	2411.05738	null
2024-11-08	Image2Text2Image: A Novel Framework for Label-Free Evaluation of Image-to-Text Generation with Text-to-Image Diffusion Models	Jia-Hong Huang et.al.	2411.05706	null
2024-11-08	Relative Optimal Transport	Peter Bubenik et.al.	2411.05678	null
2024-11-08	Improving Molecular Graph Generation with Flow Matching and Optimal Transport	Xiaoyang Hou et.al.	2411.05676	null
2024-11-08	Rigidly breaking potential flows and a countable Alexandrov theorem for polytopes	Jian-Guo Liu et.al.	2411.05606	null
2024-11-08	Towards Lifelong Few-Shot Customization of Text-to-Image Diffusion	Nan Song et.al.	2411.05544	null
2024-11-08	Improving image synthesis with diffusion-negative sampling	Alakh Desai et.al.	2411.05473	null
2024-11-08	Bridging the Gap between Learning and Inference for Diffusion-Based Molecule Generation	Peidong Liu et.al.	2411.05472	link
2024-11-08	Generalization, Expressivity, and Universality of Graph Neural Networks on Attributed Graphs	Levi Rauchwerger et.al.	2411.05464	null
2024-11-08	Sticky diffusions on star graphs : characterization and It{ô} formula	Jules Berry et.al.	2411.05441	null
2024-11-08	Stochastic games of parental vaccination decision making and bounded rationality	Andras Balogh et.al.	2411.05369	null
2024-11-08	RED: Residual Estimation Diffusion for Low-Dose PET Sinogram Reconstruction	Xingyu Ai et.al.	2411.05354	link
2024-11-08	Electro-diffusive modeling and the role of spine geometry on action potential propagation in neurons	Rahul Gulati et.al.	2411.05329	null
2024-11-08	Adaptive Whole-Body PET Image Denoising Using 3D Diffusion Models with ControlNet	Boxiao Yu et.al.	2411.05302	null
2024-11-08	SpecHub: Provable Acceleration to Multi-Draft Speculative Decoding	Ryan Sun et.al.	2411.05289	link
2024-11-07	SVDQunat: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models	Muyang Li et.al.	2411.05007	link
2024-11-07	ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing	Jun-Kun Chen et.al.	2411.05006	null
2024-11-07	Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models	Shuhong Zheng et.al.	2411.05005	null
2024-11-07	ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning	David Junhao Zhang et.al.	2411.05003	null
2024-11-07	SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation	Koichi Namekata et.al.	2411.04989	null
2024-11-07	Uncovering Hidden Subspaces in Video Diffusion Models Using Re-Identification	Mischa Dombrowski et.al.	2411.04956	null
2024-11-07	DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion	Wenqiang Sun et.al.	2411.04928	null
2024-11-07	Stem-OB: Generalizable Visual Imitation Learning with Stem-Like Convergent Observation through Diffusion Inversion	Kaizhe Hu et.al.	2411.04919	link
2024-11-07	Gluing methods for quantitative stability of optimal transport maps	Cyril Letrouit et.al.	2411.04908	null
2024-11-07	Coupling between Brownian motion and random walks on the infinite percolation cluster	Chenlin Gu et.al.	2411.04778	null
2024-11-07	Controlling Human Shape and Pose in Text-to-Image Diffusion Models via Domain Adaptation	Benito Buchheim et.al.	2411.04724	null
2024-11-07	DanceFusion: A Spatio-Temporal Skeleton Diffusion Transformer for Audio-Driven Dance Motion Reconstruction	Li Zhao et.al.	2411.04646	null
2024-11-07	Brain Tumour Removing and Missing Modality Generation using 3D WDM	André Ferreira et.al.	2411.04630	link
2024-11-07	Social EgoMesh Estimation	Luca Scofano et.al.	2411.04598	link
2024-11-07	Series-to-Series Diffusion Bridge Model	Hao Yang et.al.	2411.04491	null
2024-11-06	Community Forensics: Using Thousands of Generators to Train Fake Image Detectors	Jeongsoo Park et.al.	2411.04125	link
2024-11-06	A Multi-level Monte Carlo simulation for invariant distribution of Markovian switching Lévy-driven SDEs with super-linearly growth coefficients	Hoang-Viet Nguyen et.al.	2411.04081	null
2024-11-06	Synomaly Noise and Multi-Stage Diffusion: A Novel Approach for Unsupervised Anomaly Detection in Ultrasound Imaging	Yuan Bi et.al.	2411.04004	link
2024-11-06	ET-SEED: Efficient Trajectory-Level SE(3) Equivariant Diffusion Policy	Chenrui Tie et.al.	2411.03990	null
2024-11-06	ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models	Ashutosh Srivastava et.al.	2411.03982	null
2024-11-06	ROBIN: Robust and Invisible Watermarks for Diffusion Models with Adversarial Optimization	Huayang Huang et.al.	2411.03862	link
2024-11-06	Sub-DM:Subspace Diffusion Model with Orthogonal Decomposition for MRI Reconstruction	Yu Guan et.al.	2411.03758	link
2024-11-06	Zero-shot Dynamic MRI Reconstruction with Global-to-local Diffusion Model	Yu Guan et.al.	2411.03723	link
2024-11-06	Asymptotic analysis of estimators of ergodic stochastic differential equations	Arnab Ganguly et.al.	2411.03623	null
2024-11-06	Investigating Conceptual Blending of a Diffusion Model for Improving Nonword-to-Image Generation	Chihaya Matsuhira et.al.	2411.03595	null
2024-11-05	Estimating Ego-Body Pose from Doubly Sparse Egocentric Video Data	Seunggeun Chi et.al.	2411.03561	null
2024-11-05	Ergodicity and Mixing of Sublinear Expectation System and Applications	Wen Huang et.al.	2411.03512	null
2024-11-05	SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture	Andrew Heschl et.al.	2411.03505	link
2024-11-05	Chance-Constrained Convex MPC for Robust Quadruped Locomotion Under Parametric and Additive Uncertainties	Ananya Trivedi et.al.	2411.03481	link
2024-11-05	Exo-Daisy World: Revisiting Gaia Theory through an Informational Architecture Perspective	Damian R Sowinski et.al.	2411.03421	null
2024-11-05	Information geometry of diffeomorphism groups	Boris Khesin et.al.	2411.03265	null
2024-11-05	DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models	Ying Zhou et.al.	2411.03250	null
2024-11-05	On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models	Tariq Berrada Ifriqi et.al.	2411.03177	null
2024-11-05	Unleashing the power of novel conditional generative approaches for new materials discovery	Lev Novitskiy et.al.	2411.03156	link
2024-11-05	Gradient-Guided Conditional Diffusion Models for Private Image Reconstruction: Analyzing Adversarial Impacts of Differential Privacy and Denoising	Tao Huang et.al.	2411.03053	null
2024-11-05	GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details	Zhongjin Luo et.al.	2411.03047	null
2024-11-05	IMUDiffusion: A Diffusion Model for Multivariate Time Series Synthetisation for Inertial Motion Capturing Systems	Heiko Oppel et.al.	2411.02954	null
2024-11-05	LDPM: Towards undersampled MRI reconstruction with MR-VAE and Latent Diffusion Prior	Xingjian Tang et.al.	2411.02951	null
2024-11-05	Theoretically Guaranteed Distribution Adaptable Learning	Chao Xu et.al.	2411.02921	null
2024-11-05	How much is a noisy image worth? Data Scaling Laws for Ambient Diffusion	Giannis Daras et.al.	2411.02780	link
2024-11-04	Modelling Alzheimer’s Protein Dynamics: A Data-Driven Integration of Stochastic Methods, Machine Learning and Connectome Insights	Alec MacIver et.al.	2411.02644	null
2024-11-04	Training-free Regional Prompting for Diffusion Transformers	Anthony Chen et.al.	2411.02395	link
2024-11-04	Diffusion-based Generative Multicasting with Intent-aware Semantic Decomposition	Xinkai Liu et.al.	2411.02334	null
2024-11-04	LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation	Mufei Li et.al.	2411.02322	link
2024-11-05	Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation	Xianghui Yang et.al.	2411.02293	null
2024-11-04	FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training	Ruihong Yin et.al.	2411.02229	null
2024-11-04	Metric properties of partial and robust Gromov-Wasserstein distances	Jannatul Chhoa et.al.	2411.02198	null
2024-11-04	CleAR: Robust Context-Guided Generative Lighting Estimation for Mobile Augmented Reality	Yiqin Zhao et.al.	2411.02179	null
2024-11-04	Model Integrity when Unlearning with T2I Diffusion Models	Andrea Schioppa et.al.	2411.02068	null
2024-11-04	Learning Controlled Stochastic Differential Equations	Luc Brogat-Motte et.al.	2411.01982	null
2024-11-04	A tamed-adaptive Milstein scheme for stochastic differential equations with low regularity coefficients	Thi-Huong Vu et.al.	2411.01849	null
2024-11-04	DiffuMask-Editor: A Novel Paradigm of Integration Between the Segmentation Diffusion Model and Image Editing to Improve Segmentation Ability	Bo Gao et.al.	2411.01819	null
2024-11-04	MoMu-Diffusion: On Learning Long-Term Motion-Music Synchronization and Correspondence	Fuming You et.al.	2411.01805	null
2024-11-04	A Regressor-Guided Graph Diffusion Model for Predicting Enzyme Mutations to Enhance Turnover Number	Xiaozhu Yu et.al.	2411.01745	link
2024-11-04	xDiT: an Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism	Jiarui Fang et.al.	2411.01738	link
2024-11-04	LaGDif: Latent Graph Diffusion Model for Efficient Protein Inverse Folding with Self-Ensemble	Taoyu Wu et.al.	2411.01737	link
2024-10-31	DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion	Weicai Ye et.al.	2410.24203	link
2024-10-31	Redefining in Dictionary: Towards a Enhanced Semantic Understanding of Creative Generation	Fu Feng et.al.	2410.24160	null
2024-10-31	Scaling Concept With Text-Guided Diffusion Models	Chao Huang et.al.	2410.24151	null
2024-10-31	Understanding Generalizability of Diffusion Models Requires Rethinking the Hidden Gaussian Structure	Xiang Li et.al.	2410.24060	link
2024-10-31	TPC: Test-time Procrustes Calibration for Diffusion-based Human Image Animation	Sunjae Yoon et.al.	2410.24037	null
2024-10-31	DiffPAD: Denoising Diffusion-based Adversarial Patch Decontamination	Jia Fu et.al.	2410.24006	link
2024-11-01	Breaking Determinism: Fuzzy Modeling of Sequential Recommendation Using Discrete State Space Diffusion Model	Wenjia Xie et.al.	2410.23994	null
2024-10-31	Stochastic Reconstruction of Gappy Lagrangian Turbulent Signals by Conditional Diffusion Models	Tianyi Li et.al.	2410.23971	link
2024-10-31	Image Synthesis with Class-Aware Semantic Diffusion Models for Surgical Scene Segmentation	Yihang Zhou et.al.	2410.23962	null
2024-10-31	A dynamic programming principle for multiperiod control problems with bicausal constraints	Ruslan Mirmominov et.al.	2410.23927	null
2024-10-31	Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model	Hao Zhang et.al.	2410.23905	link
2024-10-31	DiffBatt: A Diffusion Model for Battery Degradation Prediction and Synthesis	Hamidreza Eivazi et.al.	2410.23893	link
2024-10-31	Denoising Diffusion Models for Anomaly Localization in Medical Images	Cosmin I. Bercea et.al.	2410.23834	null
2024-10-31	Disentangling Disentangled Representations: Towards Improved Latent Units via Diffusion Models	Youngjun Jun et.al.	2410.23820	null
2024-10-31	EDT: An Efficient Diffusion Transformer Framework Inspired by Human-like Sketching	Xinwang Chen et.al.	2410.23788	link
2024-10-30	ReferEverything: Towards Segmenting Everything We Can Speak of in Videos	Anurag Bagchi et.al.	2410.23287	null
2024-10-30	Provable acceleration for diffusion models under minimal assumptions	Gen Li et.al.	2410.23285	null
2024-10-30	RelationBooth: Towards Relation-Aware Customized Object Generation	Qingyu Shi et.al.	2410.23280	null
2024-10-30	SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation	Yining Hong et.al.	2410.23277	null
2024-10-30	Multi-student Diffusion Distillation for Better One-step Generators	Yanke Song et.al.	2410.23274	null
2024-10-30	A uniform point vortex approximation for the solution of the two-dimensional Navier Stokes equation with transport noise	Filippo Giovagnini et.al.	2410.23163	null
2024-10-30	Identifiability of the Optimal Transport Cost on Finite Spaces	Alberto González-Sanz et.al.	2410.23146	null
2024-10-30	CausalDiff: Causality-Inspired Disentanglement via Diffusion Model for Adversarial Defense	Mingkun Zhang et.al.	2410.23091	link
2024-10-30	Controlling Language and Diffusion Models by Transporting Activations	Pau Rodriguez et.al.	2410.23054	link
2024-10-30	Improving Musical Accompaniment Co-creation via Diffusion Transformers	Javier Nistal et.al.	2410.23005	null
2024-10-30	DexGraspNet 2.0: Learning Generative Dexterous Grasping in Large-scale Synthetic Cluttered Scenes	Jialiang Zhang et.al.	2410.23004	null
2024-10-30	LumiSculpt: A Consistency Lighting Control Network for Video Generation	Yuxin Zhang et.al.	2410.22979	null
2024-10-30	Private Synthetic Text Generation with Diffusion Models	Sebastian Ochs et.al.	2410.22971	link
2024-10-31	DiffLight: A Partial Rewards Conditioned Diffusion Model for Traffic Signal Control with Missing Data	Hanyang Chen et.al.	2410.22938	link
2024-10-30	HelloMeme: Integrating Spatial Knitting Attentions to Embed High-Level and Fidelity-Rich Conditions in Diffusion Models	Shengkai Zhang et.al.	2410.22901	link
2024-10-29	Capacity Control is an Effective Memorization Mitigation Mechanism in Text-Conditional Diffusion Models	Raman Dutt et.al.	2410.22149	link
2024-10-29	Averaging principle for multiscale controlled jump diffusions and associated nonlocal HJB equations	Qi Zhang et.al.	2410.22141	null
2024-10-29	Variational inference for pile-up removal at hadron colliders with diffusion models	Malte Algren et.al.	2410.22074	null
2024-10-29	Self-normalized Cramér-type Moderate Deviation of Stochastic Gradient Langevin Dynamics	Hongsheng Dai et.al.	2410.22047	null
2024-10-29	Dual Conditional Diffusion Models for Sequential Recommendation	Hongtao Huang et.al.	2410.21967	null
2024-10-29	PrefPaint: Aligning Image Inpainting Diffusion Model with Human Preference	Kendong Liu et.al.	2410.21966	null
2024-10-29	CT to PET Translation: A Large-scale Dataset and Domain-Knowledge-Guided Diffusion Approach	Dac Thai Nguyen et.al.	2410.21932	link
2024-10-29	Guided Diffusion-based Counterfactual Augmentation for Robust Session-based Recommendation	Muskan Gupta et.al.	2410.21892	null
2024-10-29	On invariance of observability for BSDEs and its applications to stochastic control systems	Bao-Zhu Guo et.al.	2410.21863	null
2024-10-29	Diffusion as Reasoning: Enhancing Object Goal Navigation with LLM-Biased Diffusion Model	Yiming Ji et.al.	2410.21842	null
2024-10-29	Volumetric Conditioning Module to Control Pretrained Diffusion Models for 3D Medical Images	Suhyun Ahn et.al.	2410.21826	link
2024-10-29	Robot Policy Learning with Temporal Optimal Transport Reward	Yuwei Fu et.al.	2410.21795	link
2024-10-29	HairDiffusion: Vivid Multi-Colored Hair Editing via Latent Diffusion	Yu Zeng et.al.	2410.21789	null
2024-10-29	DiffusionVel: Multi-Information Integrated Velocity Inversion Using Generative Diffusion Models	Hao Zhang et.al.	2410.21776	null
2024-10-30	IntLoRA: Integral Low-rank Adaptation of Quantized Diffusion Models	Hang Guo et.al.	2410.21759	link
2024-10-28	On Inductive Biases That Enable Generalization of Diffusion Transformers	Jie An et.al.	2410.21273	link
2024-10-28	One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation	Zhendong Wang et.al.	2410.21257	null
2024-10-28	$\texttt{skwdro}$ : a library for Wasserstein distributionally robust machine learning	Florian Vincent et.al.	2410.21231	link
2024-10-28	On learning higher-order cumulants in diffusion models	Gert Aarts et.al.	2410.21212	null
2024-10-28	Trajectory Flow Matching with Applications to Clinical Time Series Modeling	Xi Zhang et.al.	2410.21154	link
2024-10-28	Extrapolating Prospective Glaucoma Fundus Images through Diffusion Model in Irregular Longitudinal Sequences	Zhihao Zhao et.al.	2410.21130	null
2024-10-28	Shallow Diffuse: Robust and Invisible Watermarking through Low-Dimensional Subspaces in Diffusion Models	Wenda Li et.al.	2410.21088	link
2024-10-28	Federated Time Series Generation on Feature and Temporally Misaligned Data	Chenrui Fan et.al.	2410.21072	null
2024-10-28	Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative Framework	Vladimir Arkhipkin et.al.	2410.21061	link
2024-10-28	Beyond Autoregression: Fast LLMs via Self-Distillation Through Time	Justin Deschenaux et.al.	2410.21035	link
2024-10-28	Reference-Free Formula Drift with Reinforcement Learning: From Driving Data to Tire Energy-Inspired, Real-World Policies	Franck Djeumou et.al.	2410.20990	null
2024-10-29	EEG-Driven 3D Object Reconstruction with Color Consistency and Diffusion Prior	Xin Xiang et.al.	2410.20981	null
2024-10-28	Attention Overlap Is Responsible for The Entity Missing Problem in Text-to-image Diffusion Models!	Arash Marioriyad et.al.	2410.20972	null
2024-10-28	*Diff-Instruct: Towards Human-Preferred One-step Text-to-image Generative Models**	Weijian Luo et.al.	2410.20898	link
2024-10-28	Novel Object Synthesis via Adaptive Text-Image Harmony	Zeren Xiong et.al.	2410.20823	null
2024-10-25	Adversarial Environment Design via Regret-Guided Diffusion Models	Hojun Chung et.al.	2410.19715	null
2024-10-25	DiffGS: Functional Gaussian Splatting Diffusion	Junsheng Zhou et.al.	2410.19657	null
2024-10-25	Diffusion models for lattice gauge field simulations	Qianteng Zhu et.al.	2410.19602	null
2024-10-25	On the robustness of semi-discrete optimal transport	Davy Paindaveine et.al.	2410.19596	null
2024-10-25	Utilizing Image Transforms and Diffusion Models for Generative Modeling of Short and Long Time Series	Ilan Naiman et.al.	2410.19538	null
2024-10-25	Ensemble Data Assimilation for Particle-based Methods	Marius Duvillard et.al.	2410.19525	null
2024-10-28	NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction	Zixuan Gong et.al.	2410.19452	link
2024-10-25	Learned Reference-based Diffusion Sampling for multi-modal distributions	Maxence Noble et.al.	2410.19449	null
2024-10-25	Generative Diffusion Models for Sequential Recommendations	Sharare Zolghadr et.al.	2410.19429	null
2024-10-25	FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality	Zhengyao Lv et.al.	2410.19355	null
2024-10-25	High Resolution Seismic Waveform Generation using Denoising Diffusion	Andreas Bergmeister et.al.	2410.19343	null
2024-10-25	Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusion	Emiel Hoogeboom et.al.	2410.19324	null
2024-10-25	A prescriptive theory for brain-like inference	Hadi Vafaii et.al.	2410.19315	null
2024-10-25	TEARS: Textual Representations for Scrutable Recommendations	Emiliano Penaloza et.al.	2410.19302	null
2024-10-25	A Flow-based Truncated Denoising Diffusion Model for Super-resolution Magnetic Resonance Spectroscopic Imaging	Siyuan Dong et.al.	2410.19288	null
2024-10-24	MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms	Ling-Hao Chen et.al.	2410.18977	null
2024-10-24	3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation	Hansheng Chen et.al.	2410.18974	link
2024-10-24	On the Crucial Role of Initialization for Matrix Factorization	Bingcong Li et.al.	2410.18965	null
2024-10-24	Stable Consistency Tuning: Understanding and Improving Consistency Models	Fu-Yun Wang et.al.	2410.18958	link
2024-10-24	Generation of synthetic financial time series by diffusion models	Tomonori Takahashi et.al.	2410.18897	null
2024-10-24	The Cat and Mouse Game: The Ongoing Arms Race Between Diffusion Models and Detection Methods	Linda Laurier et.al.	2410.18866	null
2024-10-24	Multi-Scale Diffusion: Enhancing Spatial Layout in High-Resolution Panoramic Image Generation	Xiaoyu Zhang et.al.	2410.18830	null
2024-10-24	Fast constrained sampling in pre-trained diffusion models	Alexandros Graikos et.al.	2410.18804	null
2024-10-24	Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances	Shilin Lu et.al.	2410.18775	link
2024-10-25	Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing	Haonan Lin et.al.	2410.18756	null
2024-10-24	Rectified Diffusion Guidance for Conditional Generation	Mengfei Xia et.al.	2410.18737	null
2024-10-24	Retrieval-Augmented Diffusion Models for Time Series Forecasting	Jingwei Liu et.al.	2410.18712	link
2024-10-24	Ali-AUG: Innovative Approaches to Labeled Data Augmentation using One-Step Diffusion Model	Ali Hamza et.al.	2410.18678	null
2024-10-24	DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation	Yuang Ai et.al.	2410.18666	link
2024-10-25	Diffusion Attribution Score: Evaluating Training Data Influence in Diffusion Model	Jinxu Lin et.al.	2410.18639	null
2024-10-23	DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes	Hengwei Bian et.al.	2410.18084	null
2024-10-23	Prioritized Generative Replay	Renhao Wang et.al.	2410.18082	null
2024-10-23	Optical Generative Models	Shiqi Chen et.al.	2410.17970	null
2024-10-23	A Wavelet Diffusion GAN for Image Super-Resolution	Lorenzo Aloisi et.al.	2410.17966	null
2024-10-23	Addressing Asynchronicity in Clinical Multimodal Fusion via Individualized Chest X-ray Generation	Wenfang Yao et.al.	2410.17918	link
2024-10-23	Scaling Diffusion Language Models via Adaptation from Autoregressive Models	Shansan Gong et.al.	2410.17891	link
2024-10-23	Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech	Danilo de Oliveira et.al.	2410.17834	null
2024-10-23	PGDiffSeg: Prior-Guided Denoising Diffusion Model with Parameter-Shared Attention for Breast Cancer Segmentation	Feiyan Feng et.al.	2410.17812	null
2024-10-23	AdaDiffSR: Adaptive Region-aware Dynamic Acceleration Diffusion Model for Real-World Image Super-Resolution	Yuanting Fan et.al.	2410.17752	null
2024-10-23	VISAGE: Video Synthesis using Action Graphs for Surgery	Yousef Yeganeh et.al.	2410.17751	null
2024-10-23	Optimal Impulse Control for Cyber Risk Management	Caroline Hillairet et.al.	2410.17706	null
2024-10-23	Deep Generative Models for 3D Medical Image Synthesis	Paul Friedrich et.al.	2410.17664	null
2024-10-23	Towards Effective Data-Free Knowledge Distillation via Diverse Diffusion Augmentation	Muquan Li et.al.	2410.17606	link
2024-10-23	How to Continually Adapt Text-to-Image Diffusion Models for Flexible Customization?	Jiahua Dong et.al.	2410.17594	link
2024-10-23	GDDA: Semantic OOD Detection on Graphs under Covariate Shift via Score-Based Diffusion Models	Zhixia He et.al.	2410.17526	null
2024-10-22	Reinforcement learning on structure-conditioned categorical diffusion for protein inverse folding	Yasha Ektefaie et.al.	2410.17173	link
2024-10-22	CLAP: Concave Linear APproximation for Quadratic Graph Matching	Yongqing Liang et.al.	2410.17101	link
2024-10-22	DiP-GO: A Diffusion Pruner via Few-step Gradient Optimization	Haowei Zhu et.al.	2410.16942	null
2024-10-22	Hierarchical Clustering for Conditional Diffusion in Image Generation	Jorge da Silva Goncalves et.al.	2410.16910	link
2024-10-22	VistaDream: Sampling multiview consistent images for single-view scene reconstruction	Haiping Wang et.al.	2410.16892	null
2024-10-22	MPDS: A Movie Posters Dataset for Image Generation with Diffusion Model	Meng Xu et.al.	2410.16840	null
2024-10-22	Evaluating the Effectiveness of Attack-Agnostic Features for Morphing Attack Detection	Laurent Colbois et.al.	2410.16802	link
2024-10-22	One-Step Diffusion Distillation through Score Implicit Matching	Weijian Luo et.al.	2410.16794	link
2024-10-22	LLM-Assisted Red Teaming of Diffusion Models through “Failures Are Fated, But Can Be Faded”	Som Sagar et.al.	2410.16738	null
2024-10-22	Polyp-E: Benchmarking the Robustness of Deep Segmentation Models via Polyp Editing	Runpu Wei et.al.	2410.16732	null
2024-10-22	DiffusionSeeder: Seeding Motion Optimization with Diffusion for Rapid Motion Planning	Huang Huang et.al.	2410.16727	null
2024-10-22	Progressive Compositionality In Text-to-Image Generative Models	Xu Han et.al.	2410.16719	link
2024-10-22	Governing equation discovery of a complex system from snapshots	Qunxi Zhu et.al.	2410.16694	null
2024-10-22	DARE: Diffusion Policy for Autonomous Robot Exploration	Yuhong Cao et.al.	2410.16687	null
2024-10-22	NucleiMix: Realistic Data Augmentation for Nuclei Instance Segmentation	Jiamu Wang et.al.	2410.16671	null
2024-10-21	MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors	Honghua Chen et.al.	2410.16272	null
2024-10-21	A Framework for Evaluating Predictive Models Using Synthetic Image Covariates and Longitudinal Data	Simon Deltadahl et.al.	2410.16177	null
2024-10-22	Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models	Giannis Daras et.al.	2410.16152	null
2024-10-21	SeaDAG: Semi-autoregressive Diffusion for Conditional Directed Acyclic Graph Generation	Xinyi Zhou et.al.	2410.16119	null
2024-10-21	Continuous Speech Synthesis using per-token Latent Diffusion	Arnon Turetzky et.al.	2410.16048	null
2024-10-22	CamI2V: Camera-Controlled Image-to-Video Diffusion Model	Guangcong Zheng et.al.	2410.15957	link
2024-10-21	Global existence and mean-field limit for a stochastic interacting particle system of signed Coulomb charges	Patrick van Meurs et.al.	2410.15855	null
2024-10-21	Learning signals defined on graphs with optimal transport and Gaussian process regression	Raphaël Carpintero Perez et.al.	2410.15721	null
2024-10-21	Quantiles and Quantile Regression on Riemannian Manifolds: a measure-transportation-based approach	Marc Hallin et.al.	2410.15711	null
2024-10-21	Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces	Jifeng Hu et.al.	2410.15698	null
2024-10-21	Erasing Undesirable Concepts in Diffusion Models with Adversarial Preservation	Anh Bui et.al.	2410.15618	link
2024-10-20	Data Augmentation via Diffusion Model to Enhance AI Fairness	Christina Hastings Blow et.al.	2410.15470	null
2024-10-20	MedDiff-FM: A Diffusion-based Foundation Model for Versatile Medical Image Applications	Yongrui Yu et.al.	2410.15432	null
2024-10-20	ConSinger: Efficient High-Fidelity Singing Voice Generation with Minimal Steps	Yulin Song et.al.	2410.15342	null
2024-10-20	Diffusion-PINN Sampler	Zhekun Shi et.al.	2410.15336	null
2024-10-18	A Lipschitz spaces view of infinitely wide shallow neural networks	Francesca Bartolucci et.al.	2410.14591	null
2024-10-18	Neuro-Symbolic Traders: Assessing the Wisdom of AI Crowds in Markets	Namid R. Stillman et.al.	2410.14587	null
2024-10-18	Multi-modal Pose Diffuser: A Multimodal Generative Conditional Pose Prior	Calvin-Khang Ta et.al.	2410.14540	null
2024-10-18	LEAD: Latent Realignment for Human Motion Diffusion	Nefeli Andreou et.al.	2410.14508	null
2024-10-18	Reinforcement Learning in Non-Markov Market-Making	Luca Lalor et.al.	2410.14504	null
2024-10-18	ANT: Adaptive Noise Schedule for Time Series Diffusion Models	Seunghan Lee et.al.	2410.14488	link
2024-10-18	DRL Optimization Trajectory Generation via Wireless Network Intent-Guided Diffusion Models for Optimizing Resource Allocation	Junjie Wu et.al.	2410.14481	null
2024-10-18	FashionR2R: Texture-preserving Rendered-to-Real Image Translation with Diffusion Models	Rui Hu et.al.	2410.14429	null
2024-10-18	Dynamic Negative Guidance of Diffusion Models	Felix Koulischer et.al.	2410.14398	link
2024-10-18	Unscrambling disease progression at scale: fast inference of event permutations with optimal transport	Peter A. Wijeratne et.al.	2410.14388	link
2024-10-18	HiCo: Hierarchical Controllable Diffusion Model for Layout-to-image Generation	Bo Cheng et.al.	2410.14324	link
2024-10-18	A class of kernel-based scalable algorithms for data science	Philippe G. LeFloch et.al.	2410.14323	null
2024-10-18	ClearSR: Latent Low-Resolution Image Embeddings Help Diffusion-Based Real-World Super Resolution Models See Clearer	Yuhao Wan et.al.	2410.14279	link
2024-10-18	HYPNOS : Highly Precise Foreground-focused Diffusion Finetuning for Inanimate Objects	Oliverio Theophilus Nathanael et.al.	2410.14265	null
2024-10-18	ERDDCI: Exact Reversible Diffusion via Dual-Chain Inversion for High-Quality Image Editing	Jimin Dai et.al.	2410.14247	null
2024-10-17	Diffusing States and Matching Scores: A New Framework for Imitation Learning	Runzhe Wu et.al.	2410.13855	link
2024-10-17	Influence Functions for Scalable Data Attribution in Diffusion Models	Bruno Mlodozeniec et.al.	2410.13850	null
2024-10-17	Deep Generative Models Unveil Patterns in Medical Images Through Vision-Language Conditioning	Xiaodan Xing et.al.	2410.13823	link
2024-10-17	ConsisSR: Delving Deep into Consistency in Diffusion-based Image Super-Resolution	Junhao Gu et.al.	2410.13807	null
2024-10-17	Probing the Latent Hierarchical Structure of Data via Diffusion Models	Antonio Sclocchi et.al.	2410.13770	null
2024-10-17	Theory on Score-Mismatched Diffusion Models and Zero-Shot Conditional Samplers	Yuchen Liang et.al.	2410.13746	null
2024-10-17	Improved Convergence Rate for Diffusion Probabilistic Models	Gen Li et.al.	2410.13738	null
2024-10-18	DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation	Hanbo Cheng et.al.	2410.13726	link
2024-10-18	Diffusion Curriculum: Synthetic-to-Real Generative Curriculum Learning via Image-Guided Diffusion	Yijun Liang et.al.	2410.13674	link
2024-10-17	Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design	Chenyu Wang et.al.	2410.13643	link
2024-10-17	Preference Aligned Diffusion Planner for Quadrupedal Locomotion Control	Xinyi Yuan et.al.	2410.13586	null
2024-10-17	Can Medical Vision-Language Pre-training Succeed with Purely Synthetic Data?	Che Liu et.al.	2410.13523	null
2024-10-17	Solving Prior Distribution Mismatch in Diffusion Models via Optimal Transport	Zhanpeng Wang et.al.	2410.13431	null
2024-10-17	MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models	Donghao Zhou et.al.	2410.13370	null
2024-10-17	DiffImp: Efficient Diffusion Model for Probabilistic Time Series Imputation with Bidirectional Mamba Backbone	Hongfan Gao et.al.	2410.13338	null
2024-10-16	Meta-Unlearning on Diffusion Models: Preventing Relearning Unlearned Concepts	Hongcheng Gao et.al.	2410.12777	link
2024-10-16	SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation	Jaehong Yoon et.al.	2410.12761	null
2024-10-16	Geometry and Duality of Alternating Markov Chains	Deven Mithal et.al.	2410.12721	null
2024-10-16	Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization	Xingqi Wang et.al.	2410.12700	link
2024-10-16	AdaptiveDrag: Semantic-Driven Dragging on Diffusion-Based Image Editing	DuoSheng Chen et.al.	2410.12696	link
2024-10-16	One Step Diffusion via Shortcut Models	Kevin Frans et.al.	2410.12557	link
2024-10-16	Disentangling data distribution for Federated Learning	Xinyuan Zhao et.al.	2410.12530	null
2024-10-16	Shaping a Stabilized Video by Mitigating Unintended Changes for Concept-Augmented Video Editing	Mingce Guo et.al.	2410.12526	null
2024-10-16	Price impact and long-term profitability of energy storage	Roxana Dumitrescu et.al.	2410.12495	null
2024-10-16	Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective	Yongxin Zhu et.al.	2410.12490	link
2024-10-16	A Class of Degenerate Mean Field Games, Associated FBSDEs and Master Equations	Alain Bensoussan et.al.	2410.12404	null
2024-10-16	DaDiff: Domain-aware Diffusion Model for Nighttime UAV Tracking	Haobo Zuo et.al.	2410.12270	link
2024-10-16	FlashAudio: Rectified Flows for Fast and High-Fidelity Text-to-Audio Generation	Huadai Liu et.al.	2410.12266	null
2024-10-17	Expected Sliced Transport Plans	Xinran Liu et.al.	2410.12176	null
2024-10-16	Preference Optimization with Multi-Sample Comparisons	Chaoqi Wang et.al.	2410.12138	null
2024-10-15	High-Resolution Frame Interpolation with Patch-based Cascaded Diffusion	Junhwa Hur et.al.	2410.11838	null
2024-10-15	On the Effectiveness of Dataset Alignment for Fake Image Detection	Anirudh Sundara Rajan et.al.	2410.11835	null
2024-10-15	Bayesian Experimental Design via Contrastive Diffusions	Jacopo Iollo et.al.	2410.11826	link
2024-10-15	Improving Long-Text Alignment for Text-to-Image Diffusion Models	Luping Liu et.al.	2410.11817	link
2024-10-15	SGEdit: Bridging LLM with Text2Image Generative Model for Scene Graph-based Image Editing	Zhiyuan Zhang et.al.	2410.11815	null
2024-10-16	Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices	Zhiyuan Ma et.al.	2410.11795	null
2024-10-15	Probabilistic Principles for Biophysics and Neuroscience: Entropy Production, Bayesian Mechanics & the Free-Energy Principle	Lancelot Da Costa et.al.	2410.11735	null
2024-10-15	Patch-Based Diffusion Models Beat Whole-Image Models for Mismatched Distribution Inverse Problems	Jason Hu et.al.	2410.11730	null
2024-10-15	On the potential of Optimal Transport in Geospatial Data Science	Nina Wiedemann et.al.	2410.11709	link
2024-10-15	Optimal Finite-time Maxwell’s Demons in Langevin Systems	Takuya Kamijima et.al.	2410.11603	null
2024-10-15	DeformPAM: Data-Efficient Learning for Long-horizon Deformable Object Manipulation via Preference-based Action Alignment	Wendi Chen et.al.	2410.11584	link
2024-10-15	Bayesian inference of mixed Gaussian phylogenetic models	Bayu Brahmantio et.al.	2410.11548	link
2024-10-15	Riemann-Liouville fractional Brownian motion with random Hurst exponent	Hubert Woszczek et.al.	2410.11546	null
2024-10-15	InvSeg: Test-Time Prompt Inversion for Semantic Segmentation	Jiayi Lin et.al.	2410.11473	null
2024-10-15	A Simple Approach to Unifying Diffusion-based Conditional Generation	Xirui Li et.al.	2410.11439	null
2024-10-14	Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models	Jingzhi Bao et.al.	2410.10821	link
2024-10-14	Depth Any Video with Scalable Synthetic Data	Honghui Yang et.al.	2410.10815	link
2024-10-14	HART: Efficient Visual Generation with Hybrid Autoregressive Transformer	Haotian Tang et.al.	2410.10812	link
2024-10-14	TrajDiffuse: A Conditional Diffusion Model for Environment-Aware Trajectory Prediction	Qingze et.al.	2410.10804	link
2024-10-14	Boosting Camera Motion Control for Video Diffusion Transformers	Soon Yau Cheong et.al.	2410.10802	null
2024-10-14	Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations	Litu Rout et.al.	2410.10792	null
2024-10-14	ControlMM: Controllable Masked Motion Generation	Ekkasit Pinyoanuntapong et.al.	2410.10780	null
2024-10-14	Adaptive Diffusion Terrain Generator for Autonomous Uneven Terrain Navigation	Youwei Yu et.al.	2410.10766	link
2024-10-14	DragEntity: Trajectory Guided Video Generation using Entity and Positional Relationships	Zhang Wan et.al.	2410.10751	null
2024-10-14	FlexGen: Flexible Multi-View Generation from Text and Image Inputs	Xinli Xu et.al.	2410.10745	null
2024-10-14	Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models	Junyu Chen et.al.	2410.10733	link
2024-10-14	TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model	Jiazhi Guan et.al.	2410.10696	null
2024-10-14	Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation	Peiwen Sun et.al.	2410.10676	null
2024-10-14	Generating Model Parameters for Controlling: Parameter Diffusion for Controllable Multi-Task Recommendation	Chenglei Shen et.al.	2410.10639	null
2024-10-15	SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers	Enze Xie et.al.	2410.10629	null
2024-10-11	SceneCraft: Layout-Guided 3D Scene Generation	Xiuyu Yang et.al.	2410.09049	link
2024-10-11	Linear Convergence of Diffusion Models Under the Manifold Hypothesis	Peter Potaptchik et.al.	2410.09046	null
2024-10-11	Semantic Score Distillation Sampling for Compositional Text-to-3D Generation	Ling Yang et.al.	2410.09009	link
2024-10-11	WaveDiffusion: Exploring Full Waveform Inversion via Joint Diffusion in the Latent Space	Hanchen Wang et.al.	2410.09002	null
2024-10-11	Gradient-adjusted underdamped Langevin dynamics for sampling	Xinzhe Zuo et.al.	2410.08987	null
2024-10-11	DiffPO: A causal diffusion model for learning distributions of potential outcomes	Yuchen Ma et.al.	2410.08924	null
2024-10-11	Lifelong Event Detection via Optimal Transport	Viet Dao et.al.	2410.08905	null
2024-10-11	Domain decomposition for entropic unbalanced optimal transport	Ismael Medina et.al.	2410.08859	link
2024-10-11	Zero-Shot Offline Imitation Learning via Optimal Transport	Thomas Rupf et.al.	2410.08751	link
2024-10-11	Multi-dimensional non-Markovian backward stochastic differential equations of interactively quadratic generators	Shengjun Fan et.al.	2410.08748	null
2024-10-11	Distillation of Discrete Diffusion through Dimensional Correlations	Satoshi Hayakawa et.al.	2410.08709	link
2024-10-14	Gait Sequence Upsampling using Diffusion Models for Single LiDAR Sensors	Jeongho Ahn et.al.	2410.08680	null
2024-10-11	E-Motion: Future Motion Simulation via Event Sequence Diffusion	Song Wu et.al.	2410.08649	link
2024-10-11	Synth-SONAR: Sonar Image Synthesis with Enhanced Diversity and Realism via Dual Diffusion Models and GPT Prompting	Purushothaman Natarajan et.al.	2410.08612	link
2024-10-11	Context-Aware Full Body Anonymization using Text-to-Image Diffusion Models	Pascl Zwick et.al.	2410.08551	link
2024-10-10	DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models	Xiaoxiao He et.al.	2410.08207	null
2024-10-10	HybridBooth: Hybrid Prompt Inversion for Efficient Subject-Driven Generation	Shanyan Guan et.al.	2410.08192	null
2024-10-10	DifFRelight: Diffusion-Based Facial Performance Relighting	Mingming He et.al.	2410.08188	null
2024-10-10	ZeroComp: Zero-shot Object Compositing from Image Intrinsics via Diffusion	Zitian Zhang et.al.	2410.08168	link
2024-10-10	DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation	Jiatao Gu et.al.	2410.08159	null
2024-10-10	Progressive Autoregressive Video Diffusion Models	Desai Xie et.al.	2410.08151	link
2024-10-10	Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction	Jarrid Rector-Brooks et.al.	2410.08134	null
2024-10-10	On Barycenter Computation: Semi-Unbalanced Optimal Transport-based Method on Gaussians	Ngoc-Hai Nguyen et.al.	2410.08117	null
2024-10-10	CrackSegDiff: Diffusion Probability Model-based Multi-modal Crack Segmentation	Xiaoyan Jiang et.al.	2410.08100	link
2024-10-10	Unstable Unlearning: The Hidden Risk of Concept Resurgence in Diffusion Models	Vinith M. Suriyakumar et.al.	2410.08074	null
2024-10-10	Optimal Transportation by Orthogonal Coupling Dynamics	Mohsen Sadr et.al.	2410.08060	null
2024-10-10	LADIMO: Face Morph Generation through Biometric Template Inversion with Latent Diffusion	Marcel Grimmer et.al.	2410.07988	link
2024-10-10	Convex comparison of Gaussian mixtures	Benjamin Jourdain et.al.	2410.07958	null
2024-10-10	AI Surrogate Model for Distributed Computing Workloads	David K. Park et.al.	2410.07940	null
2024-10-10	Congestion and Penalization in Optimal Transport	Marcelo Gallardo et.al.	2410.07363	null
2024-10-09	IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation	Xinchen Zhang et.al.	2410.07171	link
2024-10-09	AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation	Yukang Cao et.al.	2410.07164	null
2024-10-09	InstructG2I: Synthesizing Images from Multimodal Attributed Graphs	Bowen Jin et.al.	2410.07157	link
2024-10-09	Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis	Bohan Zeng et.al.	2410.07155	link
2024-10-09	Through the Looking Glass: Mirror Schrödinger Bridges	Leticia Mattos Da Silva et.al.	2410.07003	null
2024-10-09	Diffusion Density Estimators	Akhil Premkumar et.al.	2410.06986	null
2024-10-09	Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control	Shimon Vainer et.al.	2410.06985	null
2024-10-09	Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think	Sihyun Yu et.al.	2410.06940	link
2024-10-09	Boosting Few-Shot Detection with Large Language Models and Layout-to-Image Synthesis	Ahmed Abdullah et.al.	2410.06841	null
2024-10-09	Diffuse or Confuse: A Diffusion Deepfake Speech Dataset	Anton Firc et.al.	2410.06796	link
2024-10-09	Diff-FMT: Diffusion Models for Fluorescence Molecular Tomography	Qianqian Xue et.al.	2410.06757	null
2024-10-10	Suppress Content Shift: Better Diffusion Features via Off-the-Shelf Generation Techniques	Benyuan Meng et.al.	2410.06719	link
2024-10-09	Decouple-Then-Merge: Towards Better Training for Diffusion Models	Qianli Ma et.al.	2410.06664	null
2024-10-09	WardropNet: Traffic Flow Predictions via Equilibrium-Augmented Learning	Kai Jungel et.al.	2410.06656	link
2024-10-10	DeepMuon: Accelerating Cosmic-Ray Muon Simulation Based on Optimal Transport	Ao-Bo Wang et.al.	2410.06539	link
2024-10-07	DART: A Diffusion-Based Autoregressive Motion Model for Real-Time Text-Driven Motion Control	Kaifeng Zhao et.al.	2410.05260	null
2024-10-07	GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting	Yukang Cao et.al.	2410.05259	null
2024-10-07	SePPO: Semi-Policy Preference Optimization for Diffusion Alignment	Daoan Zhang et.al.	2410.05255	link
2024-10-07	DiffuseReg: Denoising Diffusion Model for Obtaining Deformation Fields in Unsupervised Deformable Image Registration	Yongtai Zhuo et.al.	2410.05234	link
2024-10-07	Presto! Distilling Steps and Layers for Accelerating Music Generation	Zachary Novack et.al.	2410.05167	null
2024-10-08	A Simulation-Free Deep Learning Approach to Stochastic Optimal Control	Mengjian Hua et.al.	2410.05163	null
2024-10-07	Leveraging Multimodal Diffusion Models to Accelerate Imaging with Side Information	Timofey Efimov et.al.	2410.05143	null
2024-10-07	Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning	Ayano Hiranaka et.al.	2410.05116	null
2024-10-07	DreamSat: Towards a General 3D Model for Novel View Synthesis of Space Objects	Nidhi Mathihalli et.al.	2410.05097	link
2024-10-07	A nodally bound-preserving discontinuous Galerkin method for the drift-diffusion equation	Gabriel R. Barrenechea et.al.	2410.05040	null
2024-10-07	Revealing Directions for Text-guided 3D Face Editing	Zhuo Chen et.al.	2410.04965	null
2024-10-07	Low-Rank Continual Personalization of Diffusion Models	Łukasz Staniszewski et.al.	2410.04891	link
2024-10-07	Patch is Enough: Naturalistic Adversarial Patch against Vision-Language Pre-training Models	Dehong Kong et.al.	2410.04884	null
2024-10-07	Artificial Barriers for stochastic differential equations and for construction of Boundary-preserving schemes	Johan Ulander et.al.	2410.04850	null
2024-10-07	Real-time cardiac cine MRI – A comparison of a diffusion probabilistic model with alternative state-of-the-art image reconstruction techniques for undersampled spiral acquisitions	Oliver Schad et.al.	2410.04843	link
2024-10-04	Estimating Body and Hand Motion in an Ego-sensed World	Brent Yi et.al.	2410.03665	null
2024-10-04	Real-World Benchmarks Make Membership Inference Attacks Fail on Diffusion Models	Chumeng Liang et.al.	2410.03640	link
2024-10-04	How Discrete and Continuous Diffusion Meet: Comprehensive Analysis of Discrete Diffusion Models via a Stochastic Integral Framework	Yinuo Ren et.al.	2410.03601	null
2024-10-04	Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features	Benyuan Meng et.al.	2410.03558	link
2024-10-04	Diffusion State-Guided Projected Gradient for Inverse Problems	Rayhan Zirvi et.al.	2410.03463	link
2024-10-04	Generative Semantic Communication for Text-to-Speech Synthesis	Jiahao Zheng et.al.	2410.03459	null
2024-10-04	Dynamic Diffusion Transformer	Wangbo Zhao et.al.	2410.03456	link
2024-10-04	CLoSD: Closing the Loop between Simulation and Diffusion for multi-task character control	Guy Tevet et.al.	2410.03441	link
2024-10-04	Sparsity of Quadratically Regularized Optimal Transport: Bounds on concentration and bias	Johannes Wiesel et.al.	2410.03425	null
2024-10-04	One2set + Large Language Model: Best Partners for Keyphrase Generation	Liangying Shao et.al.	2410.03421	link
2024-10-04	The scaling behaviour of localised and extended states in one-dimensional tight-binding models with disorder	Luca Schaefer et.al.	2410.03405	null
2024-10-04	Latent Abstractions in Generative Diffusion Models	Giulio Franzese et.al.	2410.03368	null
2024-10-04	LANTERN: Accelerating Visual Autoregressive Models with Relaxed Speculative Decoding	Doohyuk Jang et.al.	2410.03355	null
2024-10-04	Sparsity of Quadratically Regularized Optimal Transport: Scalar Case	Alberto González-Sanz et.al.	2410.03353	null
2024-10-04	Optimal Transport for $ε$ -Contaminated Credal Sets	Michele Caprio et.al.	2410.03267	null
2024-10-03	Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models	Zhengfeng Lai et.al.	2410.02740	null
2024-10-03	NETS: A Non-Equilibrium Transport Sampler	Michael S. Albergo et.al.	2410.02711	null
2024-10-03	SteerDiff: Steering towards Safe Text-to-Image Diffusion Models	Hongxiang Zhang et.al.	2410.02710	null
2024-10-03	ControlAR: Controllable Image Generation with Autoregressive Models	Zongming Li et.al.	2410.02705	link
2024-10-03	Unsupervised Point Cloud Completion through Unbalanced Optimal Transport	Taekyung Lee et.al.	2410.02671	null
2024-10-03	GUD: Generation with Unified Diffusion	Mathis Gerdes et.al.	2410.02667	null
2024-10-03	Scalable Simulation-free Entropic Unbalanced Optimal Transport	Jaemoo Choi et.al.	2410.02656	null
2024-10-03	Efficient calibration of the shifted square-root diffusion model to credit default swap spreads using asymptotic approximations	Ankush Agarwal et.al.	2410.02645	null
2024-10-03	Inverse Entropic Optimal Transport Solves Semi-supervised Learning via Data Likelihood Maximization	Mikhail Persiianov et.al.	2410.02628	null
2024-10-03	Diffusion & Adversarial Schrödinger Bridges via Iterative Proportional Markovian Fitting	Sergei Kholkin et.al.	2410.02601	null
2024-10-04	Diffusion Models are Evolutionary Algorithms	Yanbo Zhang et.al.	2410.02543	link
2024-10-03	Lightweight Diffusion Models for Resource-Constrained Semantic Communication	Giovanni Pignata et.al.	2410.02491	link
2024-10-03	Towards a Theoretical Understanding of Memorization in Diffusion Models	Yunhao Chen et.al.	2410.02467	null
2024-10-03	Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models	Seyedmorteza Sadat et.al.	2410.02416	null
2024-10-03	Diffusion Meets Options: Hierarchical Generative Skill Composition for Temporally-Extended Tasks	Zeyu Feng et.al.	2410.02389	null
2024-10-02	FabricDiffusion: High-Fidelity Texture Transfer for 3D Garments Generation from In-The-Wild Clothing Images	Cheng Zhang et.al.	2410.01801	null
2024-10-02	Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space	Yangming Li et.al.	2410.01796	null
2024-10-02	Learning To Solve Differential Equation Constrained Optimization Problems	Vincenzo Di Vito et.al.	2410.01786	null
2024-10-02	Dynamical-generative downscaling of climate model ensembles	Ignacio Lopez-Gomez et.al.	2410.01776	null
2024-10-02	ImageFolder: Autoregressive Image Generation with Folded Tokens	Xiang Li et.al.	2410.01756	link
2024-10-02	VitaGlyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion Models	Kailai Feng et.al.	2410.01738	link
2024-10-02	HarmoniCa: Harmonizing Training and Inference for Better Feature Cache in Diffusion Transformer Acceleration	Yushi Huang et.al.	2410.01723	link
2024-10-02	KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Models	Pouyan Navard et.al.	2410.01595	link
2024-10-02	MM-LDM: Multi-Modal Latent Diffusion Model for Sounding Video Generation	Mingzhen Sun et.al.	2410.01594	link
2024-10-02	HRTF Estimation using a Score-based Prior	Etienne Thuillier et.al.	2410.01562	null
2024-10-02	Weighted $L^p~(p\geq1)$ solutions of random time horizon BSDEs with stochastic monotonicity generators	Xinying Li et.al.	2410.01543	null
2024-10-02	Edge-preserving noise for diffusion models	Jente Vandersanden et.al.	2410.01540	null
2024-10-02	Discrete Diffusion Schrödinger Bridge Matching for Graph Transformation	Jun Hyeong Kim et.al.	2410.01500	null
2024-10-02	Modeling Cosmic-Ray Transport: A CRPropa based stochastic differential equation solver	Lukas Merten et.al.	2410.01472	null
2024-10-02	Information-Theoretical Principled Trade-off between Jailbreakability and Stealthiness on Vision Language Models	Ching-Chia Kao et.al.	2410.01438	null
2024-09-30	COLLAGE: Collaborative Human-Agent Interaction Generation using Hierarchical Latent Diffusion and Language Models	Divyanshu Daiya et.al.	2409.20502	null
2024-09-30	FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing	Lingling Cai et.al.	2409.20500	null
2024-09-30	A mean field Jacobi process for modeling sustainable tourism	Hidekazu Yoshioka et.al.	2409.20347	null
2024-09-30	Ensemble Kalman Diffusion Guidance: A Derivative-free Method for Inverse Problems	Hongkai Zheng et.al.	2409.20175	null
2024-09-30	Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model	Fulong Ma et.al.	2409.20164	null
2024-09-30	Conditional Diffusion Models are Minimax-Optimal and Manifold-Adaptive for Conditional Distribution Estimation	Rong Tang et.al.	2409.20124	null
2024-09-30	Reaction-diffusion model for a population structured in phenotype and space I – Criterion for persistence	Nathanaël Boutillon et.al.	2409.20118	null
2024-09-30	RoCoTex: A Robust Method for Consistent Texture Synthesis with Diffusion Models	Jangyeong Kim et.al.	2409.19989	null
2024-09-30	Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function	Chenyi Zhuang et.al.	2409.19967	link
2024-10-02	Image Copy Detection for Diffusion Models	Wenhao Wang et.al.	2409.19952	null
2024-09-30	Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner	Chenyou Fan et.al.	2409.19949	null
2024-09-30	Replace Anyone in Videos	Xiang Wang et.al.	2409.19911	link
2024-09-30	The only admissible way of merging e-values	Ruodu Wang et.al.	2409.19888	null
2024-09-30	Partial Stochastic Dominance via Optimal Transport	Takashi Kamihigashi et.al.	2409.19876	null
2024-09-30	GameLabel-10K: Collecting Image Preference Data Through Mobile Game Crowdsourcing	Jonathan Zhou et.al.	2409.19830	null
2024-09-27	$O(d/T)$ Convergence Theory for Diffusion Probabilistic Models under Minimal Assumptions	Gen Li et.al.	2409.18959	null
2024-09-27	ReviveDiff: A Universal Diffusion Model for Restoring Images in Adverse Weather Conditions	Wenfeng Huang et.al.	2409.18932	null
2024-09-27	Unsupervised Low-light Image Enhancement with Lookup Tables and Diffusion Priors	Yunlong Lin et.al.	2409.18899	null
2024-09-27	Detecting Dataset Abuse in Fine-Tuning Stable Diffusion Models for Text-to-Image Synthesis	Songrui Wang et.al.	2409.18897	null
2024-09-27	Explainable Artifacts for Synthetic Western Blot Source Attribution	João Phillipe Cardenuto et.al.	2409.18881	link
2024-09-27	Emu3: Next-Token Prediction is All You Need	Xinlong Wang et.al.	2409.18869	null
2024-09-27	Convergence of Diffusion Models Under the Manifold Hypothesis in High-Dimensions	Iskander Azangulov et.al.	2409.18804	null
2024-09-27	Unsupervised Fingerphoto Presentation Attack Detection With Diffusion Models	Hailin Li et.al.	2409.18636	null
2024-09-27	Treating Brain-inspired Memories as Priors for Diffusion Model to Forecast Multivariate Time Series	Muyao Wang et.al.	2409.18491	null
2024-09-27	Gradient-free Decoder Inversion in Latent Diffusion Models	Seongmin Hong et.al.	2409.18442	null
2024-09-27	GenesisTex2: Stable, Consistent and High-Quality Text-to-Texture Generation	Jiawei Lu et.al.	2409.18401	null
2024-09-27	Multi-hypotheses Conditioned Point Cloud Diffusion for 3D Human Reconstruction from Occluded Images	Donghwan Kim et.al.	2409.18364	link
2024-09-27	Generative AI for fast and accurate Statistical Computation of Fluids	Roberto Molinaro et.al.	2409.18359	link
2024-09-26	Harnessing Wavelet Transformations for Generalizable Deepfake Forgery Detection	Lalith Bharadwaj Baru et.al.	2409.18301	link
2024-09-26	Synthesizing beta-amyloid PET images from T1-weighted Structural MRI: A Preliminary Study	Qing Lyu et.al.	2409.18282	null
2024-09-26	FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner	Wenliang Zhao et.al.	2409.18128	link
2024-09-26	Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction	Jing He et.al.	2409.18124	null
2024-09-26	EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation	Jiaxiang Tang et.al.	2409.18114	null
2024-09-26	Nonnegative cross-curvature in infinite dimensions: synthetic definition and spaces of measures	Flavien Léger et.al.	2409.18112	null
2024-09-26	StackGen: Generating Stable Structures from Silhouettes via Diffusion	Luzhe Sun et.al.	2409.18098	null
2024-09-26	DiffSSC: Semantic LiDAR Scan Completion using Denoising Diffusion Probabilistic Models	Helin Cao et.al.	2409.18092	null
2024-09-26	Stable Video Portraits	Mirela Ostrek et.al.	2409.18083	null
2024-09-26	PhoCoLens: Photorealistic and Consistent Reconstruction in Lensless Imaging	Xin Cai et.al.	2409.17996	null
2024-09-26	Joint Localization and Planning using Diffusion	L. Lao Beyer et.al.	2409.17995	null
2024-09-26	CNCA: Toward Customizable and Natural Generation of Adversarial Camouflage for Vehicle Detectors	Linye Lyu et.al.	2409.17963	link
2024-09-26	Relativistic diffusion model for hadron production in p-Pb collisions at the LHC	Philipp Schulz et.al.	2409.17960	null
2024-09-26	Pioneering Reliable Assessment in Text-to-Image Knowledge Editing: Leveraging a Fine-Grained Dataset and an Innovative Criterion	Hengrui Gu et.al.	2409.17928	link
2024-09-26	Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation	Qihan Huang et.al.	2409.17920	link
2024-09-26	Physics-aligned Schrödinger bridge	Zeyu Li et.al.	2409.17825	null
2024-09-26	Continual learning with task specialist	Indu Solomon et.al.	2409.17806	null
2024-09-25	DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion	Yukun Huang et.al.	2409.17145	link
2024-09-25	Strong solutions to degenerate SDEs and uniqueness for degenerate Fokker-Planck equations	Sebastian Grube et.al.	2409.17135	null
2024-09-25	Language-oriented Semantic Communication for Image Transmission with Fine-Tuned Diffusion Model	Xinfeng Wei et.al.	2409.17104	null
2024-09-25	Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors	Aiping Zhang et.al.	2409.17058	link
2024-09-25	ControlCity: A Multimodal Diffusion Model Based Approach for Accurate Geospatial Data Generation and Urban Morphology Analysis	Fangshuo Zhou et.al.	2409.17049	link
2024-09-25	Dynamic Obstacle Avoidance through Uncertainty-Based Adaptive Planning with Diffusion	Vineet Punyamoorty et.al.	2409.16950	null
2024-09-25	DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling	Kyuheon Jung et.al.	2409.16949	link
2024-09-25	Generative Object Insertion in Gaussian Splatting with a Multi-View Diffusion Model	Hongliang Zhong et.al.	2409.16938	link
2024-09-25	Weak Closed-loop Solvability of Linear Quadratic Stochastic Optimal Control Problems with Partial Information	Xun Li et.al.	2409.16924	null
2024-09-25	Automating Traffic Model Enhancement with AI Research Agent	Xusen Guo et.al.	2409.16876	link
2024-09-25	A Versatile and Differentiable Hand-Object Interaction Representation	Théo Morales et.al.	2409.16855	null
2024-09-25	Analytical assessment of workers’ safety concerning direct and indirect ways of getting infected by dangerous pathogen	Krzysztof Domino et.al.	2409.16809	null
2024-09-25	Layout-Corrector: Alleviating Layout Sticking Phenomenon in Discrete Diffusion Model	Shoma Iwai et.al.	2409.16689	null
2024-09-25	CasFT: Future Trend Modeling for Information Popularity Prediction with Dynamic Cues-Driven Diffusion Models	Xin Jing et.al.	2409.16619	null
2024-09-25	BSDEs driven by G-Brownian motion with time-varying uniformly continuous generators	Bingru Zhao et.al.	2409.16574	null
2024-09-18	Massively Multi-Person 3D Human Motion Forecasting with Scene Context	Felix B Mueller et.al.	2409.12189	link
2024-09-18	MoRAG – Multi-Fusion Retrieval Augmented Generation for Human Motion	Kalakonda Sai Shashank et.al.	2409.12140	link
2024-09-18	Cyclicity Analysis of the Ornstein-Uhlenbeck Process	Vivek Kaushik et.al.	2409.12102	null
2024-09-18	Brain-Streams: fMRI-to-Image Reconstruction with Multi-modal Guidance	Jaehoon Joo et.al.	2409.12099	null
2024-09-18	Denoising diffusion models for high-resolution microscopy image restoration	Pamela Osuna-Vargas et.al.	2409.12078	null
2024-09-18	SFDA-rPPG: Source-Free Domain Adaptive Remote Physiological Measurement with Spatio-Temporal Consistency	Yiping Xie et.al.	2409.12040	null
2024-09-18	LEMON: Localized Editing with Mesh Optimization and Neural Shaders	Furkan Mert Algan et.al.	2409.12024	null
2024-09-18	Generation of Complex 3D Human Motion by Temporal and Spatial Composition of Diffusion Models	Lorenzo Mandelli et.al.	2409.11920	null
2024-09-18	DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech	Xin Qi et.al.	2409.11835	null
2024-09-18	RaggeDi: Diffusion-based State Estimation of Disordered Rags, Sheets, Towels and Blankets	Jikai Ye et.al.	2409.11831	null
2024-09-18	InverseMeetInsert: Robust Real Image Editing via Geometric Accumulation Inversion in Guided Diffusion Models	Yan Zheng et.al.	2409.11734	null
2024-09-18	GUNet: A Graph Convolutional Network United Diffusion Model for Stable and Diversity Pose Generation	Shuowen Liang et.al.	2409.11689	link
2024-09-18	Recurrent Interpolants for Probabilistic Time Series Prediction	Yu Chen et.al.	2409.11684	null
2024-09-18	SRIF: Semantic Shape Registration Empowered by Diffusion-based Image Morphing and Flow Estimation	Mingze Sun et.al.	2409.11682	link
2024-09-18	Electromagnetic Property Sensing and Channel Reconstruction Based on Diffusion Schrödinger Bridge in ISAC	Yuhua Jiang et.al.	2409.11651	null
2024-09-17	Ultrasound Image Enhancement with the Variance of Diffusion Models	Yuxin Zhang et.al.	2409.11380	link
2024-09-17	OSV: One Step is Enough for High-Quality Image to Video Generation	Xiaofeng Mao et.al.	2409.11367	null
2024-09-17	Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think	Gonzalo Martin Garcia et.al.	2409.11355	link
2024-09-17	OmniGen: Unified Image Generation	Shitao Xiao et.al.	2409.11340	link
2024-09-17	Parameter dependent rough SDEs with applications to rough PDEs	Fabio Bugini et.al.	2409.11330	null
2024-09-17	fMRI-3D: A Comprehensive Dataset for Enhancing fMRI-based 3D Reconstruction	Jianxiong Gao et.al.	2409.11315	null
2024-09-17	DroneDiffusion: Robust Quadrotor Dynamics Learning with Diffusion Models	Avirup Das et.al.	2409.11292	null
2024-09-17	Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models	Tianqi Chen et.al.	2409.11219	null
2024-09-17	High-Resolution Speech Restoration with Latent Diffusion Model	Tushar Dhyani et.al.	2409.11145	link
2024-09-17	In-situ measurements of light diffusion in an optically dense atomic ensemble	Antoine Glicenstein et.al.	2409.11117	null
2024-09-17	TacDiffusion: Force-domain Diffusion Policy for Precise Tactile Manipulation	Yansong Wu et.al.	2409.11047	null
2024-09-17	Enhanced segmentation of femoral bone metastasis in CT scans of patients using synthetic data generation with 3D diffusion models	Emile Saillard et.al.	2409.11011	null
2024-09-17	Local discontinuous Galerkin method for nonlinear BSPDEs of Neumann boundary conditions with deep backward dynamic programming time-marching	Yixiang Dai et.al.	2409.11004	null
2024-09-17	Edge-based Denoising Image Compression	Ryugo Morita et.al.	2409.10978	null
2024-09-17	CUNSB-RFIE: Context-aware Unpaired Neural Schrödinger Bridge in Retinal Fundus Image Enhancement	Xuanzhao Dong et.al.	2409.10966	link
2024-09-16	Incorporating Classifier-Free Guidance in Diffusion Model-Based Recommendation	Noah Buchanan et.al.	2409.10494	null
2024-09-16	SimInversion: A Simple Framework for Inversion-Based Text-to-Image Editing	Qi Qian et.al.	2409.10476	null
2024-09-16	MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion	Lehong Wu et.al.	2409.10473	null
2024-09-16	Mamba-ST: State Space Model for Efficient Style Transfer	Filippo Botti et.al.	2409.10385	link
2024-09-16	Stochastic Control of UAVs: An Optimal Tradeoff between Performance, Flight Smoothness and Control Effort	George Rapakoulias et.al.	2409.10369	null
2024-09-16	Taming Diffusion Models for Image Restoration: A Review	Ziwei Luo et.al.	2409.10353	null
2024-09-16	Fairness, not Emotion, Drives Socioeconomic Decision Making	Rudra Mukhopadhyay et.al.	2409.10322	null
2024-09-16	DreamHead: Learning Spatial-Temporal Correspondence via Hierarchical Diffusion for Audio-driven Talking Head Synthesis	Fa-Ting Hong et.al.	2409.10281	null
2024-09-16	RealDiff: Real-world 3D Shape Completion using Self-Supervised Diffusion Models	Başak Melis Öcal et.al.	2409.10180	null
2024-09-16	PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion	Peng Li et.al.	2409.10141	null
2024-09-16	Approximating the signature of Brownian motion for high order SDE simulation	James Foster et.al.	2409.10118	link
2024-09-16	DDoS: Diffusion Distribution Similarity for Out-of-Distribution Detection	Kun Fang et.al.	2409.10094	null
2024-09-16	MotionCom: Automatic and Motion-Aware Image Composition with LLM and Video Diffusion Prior	Weijing Tao et.al.	2409.10090	link
2024-09-16	Cross-modality image synthesis from TOF-MRA to CTA using diffusion-based models	Alexander Koch et.al.	2409.10089	null
2024-09-16	A Riemannian Approach to Ground Metric Learning for Optimal Transport	Pratik Jawanpuria et.al.	2409.10085	null
2024-09-13	Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation	Qingwen Bu et.al.	2409.09016	link
2024-09-13	A Diffusion Approach to Radiance Field Relighting using Multi-Illumination Synthesis	Yohan Poirier-Ginter et.al.	2409.08947	null
2024-09-13	Latent Space Score-based Diffusion Model for Probabilistic Multivariate Time Series Imputation	Guojun Liang et.al.	2409.08917	link
2024-09-13	Gaussian is All You Need: A Unified Framework for Solving Inverse Problems via Diffusion Posterior Sampling	Nebiyou Yismaw et.al.	2409.08906	link
2024-09-13	Adjoint Matching: Fine-tuning Flow and Diffusion Generative Models with Memoryless Stochastic Optimal Control	Carles Domingo-Enrich et.al.	2409.08861	null
2024-09-13	InstantDrag: Improving Interactivity in Drag-based Image Editing	Joonghyuk Shin et.al.	2409.08857	null
2024-09-13	DX2CT: Diffusion Model for 3D CT Reconstruction from Bi or Mono-planar 2D X-ray(s)	Yun Su Jeong et.al.	2409.08850	null
2024-09-13	Measure-Theoretic Time-Delay Embedding	Jonah Botvinick-Greenhouse et.al.	2409.08768	link
2024-09-13	DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset	Jiawei Du et.al.	2409.08731	link
2024-09-13	Asymptotics for Random Quadratic Transportation Costs	Martin Huesmann et.al.	2409.08612	null
2024-09-13	Finite-time thermodynamic bounds and tradeoff relations for information processing	Takuya Kamijima et.al.	2409.08606	null
2024-09-13	STA-V2A: Video-to-Audio Generation with Semantic and Temporal Alignment	Yong Ren et.al.	2409.08601	null
2024-09-13	LHQ-SVC: Lightweight and High Quality Singing Voice Conversion Modeling	Yubo Huang et.al.	2409.08583	null
2024-09-13	DiffFAS: Face Anti-Spoofing via Generative Diffusion Models	Xinxu Ge et.al.	2409.08572	link
2024-09-13	Think Twice Before You Act: Improving Inverse Problem Solving With MCMC	Yaxuan Zhu et.al.	2409.08551	null
2024-09-12	DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors	Thomas Hanwen Zhu et.al.	2409.08278	null
2024-09-12	DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer	Runjia Li et.al.	2409.08271	null
2024-09-12	Touch2Touch: Cross-Modal Tactile Generation for Object Manipulation	Samanta Rodriguez et.al.	2409.08269	null
2024-09-12	Improving Text-guided Object Inpainting with Semantic Pre-inpainting	Yifu Chen et.al.	2409.08260	link
2024-09-12	Improving Virtual Try-On with Garment-focused Diffusion Models	Siqi Wan et.al.	2409.08258	link
2024-09-12	LoRID: Low-Rank Iterative Diffusion for Adversarial Purification	Geigh Zollicoffer et.al.	2409.08255	null
2024-09-12	Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding	Hongyu Li et.al.	2409.08251	null
2024-09-12	IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation	Yinwei Wu et.al.	2409.08240	null
2024-09-12	How can the tragedy of the commons be prevented?: Introducing Linear Quadratic Mixed Mean Field Games	Gokce Dayanikli et.al.	2409.08235	null
2024-09-12	LT3SD: Latent Trees for 3D Scene Diffusion	Quan Meng et.al.	2409.08215	null
2024-09-12	VI3DRM:Towards meticulous 3D Reconstruction from Sparse Views via Photo-Realistic Novel View Synthesis	Hao Chen et.al.	2409.08207	null
2024-09-12	MagicStyle: Portrait Stylization Based on Reference Image	Zhaoli Deng et.al.	2409.08156	null
2024-09-12	EZIGen: Enhancing zero-shot subject-driven image generation with precise subject encoding and decoupled guidance	Zicheng Duan et.al.	2409.08091	link
2024-09-12	Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation	Junsung Lee et.al.	2409.08077	null
2024-09-12	AI-accelerated discovery of high critical temperature superconductors	Xiao-Qi Han et.al.	2409.08065	link
2024-09-11	DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation	Haibo Yang et.al.	2409.07454	null
2024-09-11	Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models	Haibo Yang et.al.	2409.07452	link
2024-09-11	FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process	Yang Luo et.al.	2409.07451	null
2024-09-11	Efficient One-Step Diffusion Refinement for Snapshot Compressive Imaging	Yunzhen Wang et.al.	2409.07417	null
2024-09-11	Training-Free Guidance for Discrete Diffusion Models for Molecular Generation	Thomas J. Kerby et.al.	2409.07359	null
2024-09-11	Learning Robotic Manipulation Policies from Point Clouds with Conditional Flow Matching	Eugenio Chisari et.al.	2409.07343	null
2024-09-11	Efficient and Unbiased Sampling of Boltzmann Distributions via Consistency Models	Fengzhe Zhang et.al.	2409.07323	null
2024-09-11	Exploring User-level Gradient Inversion with a Diffusion Prior	Zhuohang Li et.al.	2409.07291	null
2024-09-11	CCFExp: Facial Image Synthesis with Cycle Cross-Fusion Diffusion Model for Facial Paralysis Individuals	Weixiang Gao et.al.	2409.07271	link
2024-09-11	Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models	Sanoojan Baliah et.al.	2409.07269	link
2024-09-11	EMOdiffhead: Continuously Emotional Control in Talking Head Generation via Diffusion	Jian Zhang et.al.	2409.07255	link
2024-09-12	Alignment of Diffusion Models: Fundamentals, Challenges, and Future	Buhua Liu et.al.	2409.07253	link
2024-09-11	Diff-VPS: Video Polyp Segmentation via a Multi-task Diffusion Network with Adversarial Temporal Reasoning	Yingling Lu et.al.	2409.07238	link
2024-09-11	Phy124: Fast Physics-Driven 4D Content Generation from a Single Image	Jiajing Lin et.al.	2409.07179	null
2024-09-11	Mamba Policy: Towards Efficient 3D Diffusion Policy with Hybrid Selective State Models	Jiahang Cao et.al.	2409.07163	null
2024-09-10	SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation	Teng Hu et.al.	2409.06633	null
2024-09-10	One-Shot Imitation under Mismatched Execution	Kushal Kedia et.al.	2409.06615	null
2024-09-10	Modelling Global Trade with Optimal Transport	Thomas Gaskin et.al.	2409.06554	link
2024-09-10	Robust financial calibration: a Bayesian approach for neural SDEs	Christa Cuchiero et.al.	2409.06551	link
2024-09-10	Enhancing Emotional Text-to-Speech Controllability with Natural Language Guidance through Contrastive Learning and Diffusion Models	Xin Jing et.al.	2409.06451	null
2024-09-10	Robust semi-parametric signal detection in particle physics with classifiers decorrelated via optimal transport	Purvasha Chakravarti et.al.	2409.06399	link
2024-09-10	Distilling Generative-Discriminative Representations for Very Low-Resolution Face Recognition	Junzheng Zhang et.al.	2409.06371	null
2024-09-10	What happens to diffusion model likelihood when your model is conditional?	Mattias Cross et.al.	2409.06364	null
2024-09-10	DiffQRCoder: Diffusion-based Aesthetic QR Code Generation with Scanning Robustness Guided Iterative Refinement	Jia-Wei Liao et.al.	2409.06355	null
2024-09-10	Geometry of the Space of Partitioned Networks: A Unified Theoretical and Computational Framework	Stephen Y Zhang et.al.	2409.06302	link
2024-09-10	Multi-Source Music Generation with Latent Diffusion	Zhongweiyang Xu et.al.	2409.06190	link
2024-09-10	MyGo: Consistent and Controllable Multi-View Driving Video Generation with Camera Control	Yining Yao et.al.	2409.06189	null
2024-09-10	EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation	Nischal Khanal et.al.	2409.06183	link
2024-09-09	Latent Diffusion Bridges for Unsupervised Musical Audio Timbre Transfer	Michele Mancusi et.al.	2409.06096	null
2024-09-09	SVS-GAN: Leveraging GANs for Semantic Video Synthesis	Khaled M. Seyam et.al.	2409.06074	null
2024-09-09	Enhancing Preference-based Linear Bandits via Human Response Time	Shen Li et.al.	2409.05798	null
2024-09-09	Vector Quantized Diffusion Model Based Speech Bandwidth Extension	Yuan Fang et.al.	2409.05784	null
2024-09-09	AS-Speech: Adaptive Style For Speech Synthesis	Zhipeng Li et.al.	2409.05730	null
2024-09-09	Distributionally Robust Stochastic Data-Driven Predictive Control with Optimized Feedback Gain	Ruiqi Li et.al.	2409.05727	null
2024-09-09	Quantitative approximation of stochastic kinetic equations: from discrete to continuum	Zimo Hao et.al.	2409.05706	null
2024-09-09	pFedGPA: Diffusion-based Generative Parameter Aggregation for Personalized Federated Learning	Jiahao Lai et.al.	2409.05701	null
2024-09-09	Unlearning or Concealment? A Critical Analysis and Evaluation Metrics for Unlearning in Diffusion Models	Aakash Sen Sharma et.al.	2409.05668	null
2024-09-09	Forward KL Regularized Preference Optimization for Aligning Diffusion Policies	Zhao Shan et.al.	2409.05622	null
2024-09-09	CipherDM: Secure Three-Party Inference for Diffusion Model Sampling	Xin Zhao et.al.	2409.05414	null
2024-09-09	Sequential Posterior Sampling with Diffusion Models	Tristan S. W. Stevens et.al.	2409.05399	null
2024-09-09	TERD: A Unified Framework for Safeguarding Diffusion Models Against Backdoors	Yichuan Mo et.al.	2409.05294	link
2024-09-08	The Stochastic Gause predator-prey model: noise-induced extinctions and invariance	Leon Alexander Valencia et.al.	2409.05237	null
2024-09-08	Nuclear transparencies with a two step process of the $A(e,e’π^+)$ reactions	Tae Keun Choi et.al.	2409.05129	null
2024-09-08	Diffusion-based Speech Enhancement with Schrödinger Bridge and Symmetric Noise Schedule	Siyi Wang et.al.	2409.05116	null
2024-09-08	A Survey on Diffusion Models for Recommender Systems	Jianghao Lin et.al.	2409.05033	link
2024-09-06	VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation	Yecheng Wu et.al.	2409.04429	link
2024-09-06	Exploring Foundation Models for Synthetic Medical Imaging: A Study on Chest X-Rays and Fine-Tuning Techniques	Davide Clode da Silva et.al.	2409.04424	null
2024-09-06	How Fair is Your Diffusion Recommender Model?	Daniele Malitesta et.al.	2409.04339	null
2024-09-06	Random effects estimation in a fractional diffusion model based on continuous observations	Nesrine Chebli et.al.	2409.04331	null
2024-09-06	Probabilistic Representation for Viscosity Solutions to Double-Obstacle Quasi-Variational Inequalities	Magnus Perninge et.al.	2409.04207	null
2024-09-06	Breaking the Brownian Barrier: Models and Manifestations of Molecular Diffusion in Complex Fluids	Harish Srinivasan et.al.	2409.04199	null
2024-09-06	GST: Precise 3D Human Body from a Single Image with Gaussian Splatting Transformers	Lorenza Prospero et.al.	2409.04196	link
2024-09-06	D4: Text-guided diffusion model-based domain adaptive data augmentation for vineyard shoot detection	Kentaro Hirahara et.al.	2409.04060	null
2024-09-06	A policy iteration algorithm for non-Markovian control problems	Dylan Possamaï et.al.	2409.04037	null
2024-09-06	One-Shot Diffusion Mimicker for Handwritten Text Generation	Gang Dai et.al.	2409.04004	link
2024-09-06	DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes	Jianbiao Mei et.al.	2409.04003	link
2024-09-05	Data-Efficient Generation for Dataset Distillation	Zhe Li et.al.	2409.03929	null
2024-09-05	Generating High Dimensional User-Specific Wireless Channels using Diffusion Models	Taekyun Lee et.al.	2409.03924	null
2024-09-05	Neural Entropy	Akhil Premkumar et.al.	2409.03817	null
2024-09-05	Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding	Yunze Man et.al.	2409.03757	link
2024-09-05	ArtiFade: Learning to Generate High-quality Subject from Blemished Images	Shuya Yang et.al.	2409.03745	null
2024-09-05	Quantum optimal transport with convex regularization	Emanuele Caputo et.al.	2409.03698	null
2024-09-05	RealisHuman: A Two-Stage Approach for Refining Malformed Human Parts in Generated Images	Benzhi Wang et.al.	2409.03644	link
2024-09-05	DiffEVC: Any-to-Any Emotion Voice Conversion with Expressive Guidance	Hsing-Hang Chou et.al.	2409.03636	null
2024-09-05	TCDiff: Triple Condition Diffusion Model with 3D Constraints for Stylizing Synthetic Faces	Bernardo Biesseck et.al.	2409.03600	link
2024-09-05	DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture	Qianlong Xiang et.al.	2409.03550	link
2024-09-05	On the mean field limit of consensus based methods	Marvin Koß et.al.	2409.03518	null
2024-09-05	Blended Latent Diffusion under Attention Control for Real-World Video Editing	Deyin Liu et.al.	2409.03514	null
2024-09-05	Data-free Distillation with Degradation-prompt Diffusion for Multi-weather Image Restoration	Pei Wang et.al.	2409.03455	null
2024-09-05	Recursive Quantization for $\mathcal{L}_2$ Stabilization of a Finite Capacity Stochastic Control Loop with Intermittent State Observations	Shrija Karmakar et.al.	2409.03398	null
2024-09-05	Enhancing User-Centric Privacy Protection: An Interactive Framework through Diffusion Models and Machine Unlearning	Huaxi Huang et.al.	2409.03326	null
2024-09-05	SVP: Style-Enhanced Vivid Portrait Talking Head Diffusion Model	Weipeng Tan et.al.	2409.03270	null
2024-09-05	RoomDiffusion: A Specialized Diffusion Model in the Interior Design Industry	Zhaowei Wang et.al.	2409.03198	null
2024-09-04	Spatial Diffusion for Cell Layout Generation	Chen Li et.al.	2409.03106	link
2024-09-04	HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts	Xinyu Liu et.al.	2409.02919	link
2024-09-04	Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling	Kaiwen Zheng et.al.	2409.02908	null
2024-09-04	Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models	Zhibin Liu et.al.	2409.02851	link
2024-09-04	Multi-Track MusicLDM: Towards Versatile Music Generation with Latent Diffusion Model	Tornike Karchkhadze et.al.	2409.02845	null
2024-09-04	Skip-and-Play: Depth-Driven Pose-Preserved Image Generation for Any Objects	Kyungmin Jo et.al.	2409.02653	null
2024-09-04	MADiff: Motion-Aware Mamba Diffusion Models for Hand Trajectory Prediction on Egocentric Videos	Junyi Ma et.al.	2409.02638	null
2024-09-04	Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency	Jianwen Jiang et.al.	2409.02634	null
2024-09-04	Rate-Adaptive Generative Semantic Communication Using Conditional Diffusion Models	Pujing Yang et.al.	2409.02597	null
2024-09-04	Solving Video Inverse Problems Using Image Diffusion Models	Taesung Kwon et.al.	2409.02574	null
2024-09-04	StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models	Wen Li et.al.	2409.02543	link
2024-09-04	Sample what you cant compress	Vighnesh Birodkar et.al.	2409.02529	null
2024-09-04	Continual Diffuser (CoD): Mastering Continual Offline Reinforcement Learning with Experience Rehearsal	Jifeng Hu et.al.	2409.02512	link
2024-09-04	Demographic parity in regression and classification within the unawareness framework	Vincent Divol et.al.	2409.02471	null
2024-09-04	Training-free Color-Style Disentanglement for Constrained Text-to-Image Synthesis	Aishwarya Agarwal et.al.	2409.02429	null
2024-09-04	Diffusion Models Learn Low-Dimensional Distributions via Subspace Clustering	Peng Wang et.al.	2409.02426	link
2024-08-30	Subspace Diffusion Posterior Sampling for Travel-Time Tomography	Xiang Cao et.al.	2408.17333	null
2024-08-30	Likelihood estimation for stochastic differential equations with mixed effects	Fernando Baltazar-Larios et.al.	2408.17257	null
2024-08-30	The random periodic solutions for McKean-Vlasov stochastic differential equations	Jianhai Bao et.al.	2408.17242	null
2024-08-30	A methodological framework for Resilience as a Service (RaaS) in multimodal urban transportation networks	Sara Jaber et.al.	2408.17233	null
2024-09-02	RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance	Avideep Mukherjee et.al.	2408.17095	null
2024-09-02	Instant Adversarial Purification with Adversarial Consistency Distillation	Chun Tong Lei et.al.	2408.17064	null
2024-08-30	Text-to-Image Generation Via Energy-Based CLIP	Roy Ganz et.al.	2408.17046	null
2024-08-30	High-fidelity holographic beam shaping with optimal transport and phase diversity	Hunter Swan et.al.	2408.17025	null
2024-08-30	Contrastive Learning with Synthetic Positives	Dewen Zeng et.al.	2408.16965	link
2024-09-02	Enabling Local Editing in Diffusion Models by Joint and Individual Component Analysis	Theodoros Kouzelis et.al.	2408.16845	null
2024-08-29	ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model	Fangfu Liu et.al.	2408.16767	null
2024-09-04	CSGO: Content-Style Composition in Text-to-Image Generation	Peng Xing et.al.	2408.16766	null
2024-08-29	DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving	Yongjie Fu et.al.	2408.16647	null
2024-09-02	RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model	Zhuan Shi et.al.	2408.16634	null
2024-08-29	A Score-based Generative Solver for PDE-constrained Inverse Problems with Complex Priors	Yankun Hong et.al.	2408.16626	null

Dataset Distillation

Publish Date	Title	Authors	PDF	Code
2025-07-23	CodeReasoner: Enhancing the Code Reasoning Ability with Reinforcement Learning	Lingxiao Tang et.al.	2507.17548	null
2025-07-23	MaskedCLIP: Bridging the Masked and CLIP Space for Semi-Supervised Medical Vision-Language Pre-training	Lei Zhu et.al.	2507.17239	null
2025-07-23	Dataset Distillation as Data Compression: A Rate-Utility Perspective	Youneng Bao et.al.	2507.17221	null
2025-07-22	Sensor Drift Compensation in Electronic-Nose-Based Gas Recognition Using Knowledge Distillation	Juntao Lin et.al.	2507.17071	null
2025-07-22	Task-Specific Zero-shot Quantization-Aware Training for Object Detection	Changhao Li et.al.	2507.16782	null
2025-07-22	Cross-Modal Distillation For Widely Differing Modalities	Cairong Zhao et.al.	2507.16296	null
2025-07-21	Local Dense Logit Relations for Enhanced Knowledge Distillation	Liuchi Xu et.al.	2507.15911	null
2025-07-21	Efficient Face Image Quality Assessment via Self-training and Knowledge Distillation	Wei Sun et.al.	2507.15709	null
2025-07-23	Visual-Language Model Knowledge Distillation Method for Image Quality Assessment	Yongkang Hou et.al.	2507.15680	null
2025-07-21	Optimal Transceiver Design in Over-the-Air Federated Distillation	Zihao Hu et.al.	2507.15256	null
2025-07-19	Generative Distribution Distillation	Jiequan Cui et.al.	2507.14503	null
2025-07-18	Influence Functions for Preference Dataset Pruning	Daniel Fein et.al.	2507.14344	null
2025-07-18	TGIF: Talker Group-Informed Familiarization of Target Speaker Extraction	Tsun-An Hsieh et.al.	2507.14044	null
2025-07-18	Efficient Burst Super-Resolution with One-step Diffusion	Kento Kawai et.al.	2507.13607	null
2025-07-17	Uncertainty-Aware Cross-Modal Knowledge Distillation with Prototype Learning for Multimodal Brain-Computer Interfaces	Hyo-Jeong Jang et.al.	2507.13092	null
2025-07-17	Label-Consistent Dataset Distillation with Detector-Guided Refinement	Yawen Zou et.al.	2507.13074	null
2025-07-17	Multimodal-Guided Dynamic Dataset Pruning for Robust and Efficient Data-Centric Learning	Suorong Yang et.al.	2507.12750	null
2025-07-18	DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition	Hayat Ullah et.al.	2507.12426	null
2025-07-16	Improving Lightweight Weed Detection via Knowledge Distillation	Ahmet Oğuz Saltık et.al.	2507.12344	null
2025-07-16	Fine-Grained Image Recognition from Scratch with Teacher-Guided Data Augmentation	Edwin Arkel Rios et.al.	2507.12157	null
2025-07-15	HanjaBridge: Resolving Semantic Ambiguity in Korean LLMs via Hanja-Augmented Pre-Training	Seungho Choi et.al.	2507.10920	null
2025-07-14	The Power of Certainty: How Confident Models Lead to Better Segmentation	Tugberk Erol et.al.	2507.10490	null
2025-07-15	Energy Efficiency in AI for 5G and Beyond: A DeepRx Case Study	Amine Lbath et.al.	2507.10409	null
2025-07-14	Feature Distillation is the Better Choice for Model-Heterogeneous Federated Learning	Yichen Li et.al.	2507.10348	null
2025-07-14	Task-Based Flexible Feature Distillation for LLMs	Khouloud Saadi et.al.	2507.10155	null
2025-07-13	Leveraging Distribution Matching to Make Approximate Machine Unlearning Faster	Junaid Iqbal Khan et.al.	2507.09786	null
2025-07-13	HMID-Net: An Exploration of Masked Image Modeling and Knowledge Distillation in Hyperbolic Space	Changli Wang et.al.	2507.09487	null
2025-07-12	Cross Knowledge Distillation between Artificial and Spiking Neural Networks	Shuhan Ye et.al.	2507.09269	null
2025-07-11	Forget Me Not: Fighting Local Overfitting with Knowledge Fusion and Distillation	Uri Stern et.al.	2507.08686	null
2025-07-11	Towards Collaborative Fairness in Federated Learning Under Imbalanced Covariate Shift	Tianrun Yu et.al.	2507.08617	null
2025-07-11	Occlusion-Guided Feature Purification Learning via Reinforced Knowledge Distillation for Occluded Person Re-Identification	Yufei Zheng et.al.	2507.08520	null
2025-07-11	SFedKD: Sequential Federated Learning with Discrepancy-Aware Multi-Teacher Knowledge Distillation	Haotian Xu et.al.	2507.08508	null
2025-07-11	Distillation versus Contrastive Learning: How to Train Your Rerankers	Zhichao Xu et.al.	2507.08336	null
2025-07-15	KAT-V1: Kwai-AutoThink Technical Report	Zizheng Zhan et.al.	2507.08297	null
2025-07-11	LISTEN: Lightweight Industrial Sound-representable Transformer for Edge Notification	Changheon Han et.al.	2507.07879	null
2025-07-10	Exploring the Limits of Model Compression in LLMs: A Knowledge Distillation Study on QA Tasks	Joyeeta Datta et.al.	2507.07630	null
2025-07-10	Diffusion-Guided Knowledge Distillation for Weakly-Supervised Low-Light Semantic Segmentation	Chunyan Wang et.al.	2507.07578	null
2025-07-11	Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models	Tiezheng Zhang et.al.	2507.07104	null
2025-07-09	MST-Distill: Mixture of Specialized Teachers for Cross-Modal Knowledge Distillation	Hui Li et.al.	2507.07015	null
2025-07-12	Diffusion Dataset Condensation: Training Your Diffusion Model Faster with Less Data	Rui Huang et.al.	2507.05914	null
2025-07-08	Flipping Knowledge Distillation: Leveraging Small Models’ Expertise to Enhance LLMs in Text Matching	Mingzhe Li et.al.	2507.05617	null
2025-07-07	Information-Guided Diffusion Sampling for Dataset Distillation	Linfeng Ye et.al.	2507.04619	null
2025-07-06	Tractable Representation Learning with Probabilistic Circuits	Steven Braun et.al.	2507.04385	null
2025-07-06	MLLM-Fabric: Multimodal Large Language Model-Driven Robotic Framework for Fabric Sorting and Selection	Liman Wang et.al.	2507.04351	null
2025-07-05	When Data-Free Knowledge Distillation Meets Non-Transferable Teacher: Escaping Out-of-Distribution Trap is All You Need	Ziming Hong et.al.	2507.04119	null
2025-07-04	Task-Specific Generative Dataset Distillation with Difficulty-Guided Sampling	Mingzhuo Li et.al.	2507.03331	null
2025-07-04	Dual-frequency Selected Knowledge Distillation with Statistical-based Sample Rectification for PolSAR Image Classification	Xinyue Xin et.al.	2507.03268	null
2025-07-01	We Need Knowledge Distillation for Solving Math Word Problems	Zhenquan Shen et.al.	2507.02982	null
2025-07-10	A Large Language Model for Chemistry and Retrosynthesis Predictions	Yueqing Zhang et.al.	2507.01444	null
2025-07-02	Can Large Language Models Develop Strategic Reasoning? Post-training Insights from Learning Chess	Dongyoon Hwang et.al.	2507.00726	null
2025-07-01	Simulation-Efficient Cosmological Inference with Multi-Fidelity SBI	Leander Thiele et.al.	2507.00514	null
2025-06-30	FADRM: Fast and Accurate Data Residual Matching for Dataset Distillation	Jiacheng Cui et.al.	2506.24125	null
2025-07-08	The Trilemma of Truth in Large Language Models	Germans Savcisens et.al.	2506.23921	null
2025-06-30	Efficient Interleaved Speech Modeling through Knowledge Distillation	Mohammadmahdi Nouriborji et.al.	2506.23670	null
2025-06-30	Dataset Distillation via Vision-Language Category Prototype	Yawen Zou et.al.	2506.23580	null
2025-06-30	When Test-Time Adaptation Meets Self-Supervised Models	Jisu Han et.al.	2506.23529	null
2025-06-29	Competitive Distillation: A Simple Learning Strategy for Improving Visual Classification	Daqian Shi et.al.	2506.23285	null
2025-06-29	ReMem: Mutual Information-Aware Fine-tuning of Pretrained Vision Transformers for Effective Knowledge Distillation	Chengyu Dong et.al.	2506.23041	null
2025-06-28	ReasonBridge: Efficient Reasoning Transfer from Closed to Open-Source Language Models	Ziqi Zhong et.al.	2506.22865	null
2025-06-28	LightBSR: Towards Lightweight Blind Super-Resolution via Discriminative Implicit Degradation Representation Learning	Jiang Yuan et.al.	2506.22710	null
2025-06-27	Layer Importance for Mathematical Reasoning is Forged in Pre-Training and Invariant after Post-Training	Aadim Nepal et.al.	2506.22638	null
2025-06-27	CaO $_2$ : Rectifying Inconsistencies in Diffusion-Based Dataset Distillation	Haoxuan Wang et.al.	2506.22637	null
2025-06-27	Unifying Biomedical Vision-Language Expertise: Towards a Generalist Foundation Model via Multi-CLIP Knowledge Distillation	Shansong Wang et.al.	2506.22567	null
2025-06-26	A Survey on Model Extraction Attacks and Defenses for Large Language Models	Kaixiang Zhao et.al.	2506.22521	null
2025-06-27	Seismic resolution enhancement via deep Learning with Knowledge Distillation and Domain Adaptation	Hanpeng Cai et.al.	2506.22018	null
2025-06-28	G $^{2}$ D: Boosting Multimodal Learning with Gradient-Guided Distillation	Mohammed Rakib et.al.	2506.21514	null
2025-06-26	Continual Self-Supervised Learning with Masked Autoencoders in Remote Sensing	Lars Möllenbrok et.al.	2506.21312	null
2025-06-26	Distilling Normalizing Flows	Steven Walton et.al.	2506.21003	null
2025-06-26	A Multi-Stage Framework for Multimodal Controllable Speech Synthesis	Rui Niu et.al.	2506.20945	null
2025-06-25	Building Lightweight Semantic Segmentation Models for Aerial Images Using Dual Relation Distillation	Minglong Li et.al.	2506.20688	null
2025-06-25	Tackling Data Heterogeneity in Federated Learning through Knowledge Distillation with Inequitable Aggregation	Xing Ma et.al.	2506.20431	null
2025-06-25	Client Clustering Meets Knowledge Sharing: Enhancing Privacy and Robustness in Personalized Peer-to-Peer Learning	Mohammad Mahdi Maheri et.al.	2506.20413	null
2025-06-25	FedBKD: Distilled Federated Learning to Embrace Gerneralization and Personalization on Non-IID Data	Yushan Zhao et.al.	2506.20245	null
2025-06-26	Towards Scalable and Generalizable Earth Observation Data Mining via Foundation Model Composition	Man Duc Chuc et.al.	2506.20174	null
2025-06-24	GNN’s Uncertainty Quantification using Self-Distillation	Hirad Daneshvar et.al.	2506.20046	null
2025-06-24	Distillation-Enabled Knowledge Alignment for Generative Semantic Communications in AIGC Provisioning Tasks	Jingzhi Hu et.al.	2506.19893	null
2025-06-24	Recalling The Forgotten Class Memberships: Unlearned Models Can Be Noisy Labelers to Leak Privacy	Zhihao Sui et.al.	2506.19486	null
2025-06-23	HAWAII: Hierarchical Visual Knowledge Transfer for Efficient Vision-Language Models	Yimu Wang et.al.	2506.19072	null
2025-06-23	Diffusion Transformer-to-Mamba Distillation for High-Resolution Image Generation	Yuan Yao et.al.	2506.18999	null
2025-06-24	PicoSAM2: Low-Latency Segmentation In-Sensor for Edge Vision Applications	Pietro Bonazzi et.al.	2506.18807	null
2025-06-23	Multi-modal Anchor Gated Transformer with Knowledge Distillation for Emotion Recognition in Conversation	Jie Li et.al.	2506.18716	link
2025-06-23	Efficient and Generalizable Speaker Diarization via Structured Pruning of Self-Supervised Models	Jiangyu Han et.al.	2506.18623	null
2025-06-23	Biased Teacher, Balanced Student	Seonghak Kim et.al.	2506.18496	null
2025-06-23	Dual-Forward Path Teacher Knowledge Distillation: Bridging the Capacity Gap Between Teacher and Student	Tong Li et.al.	2506.18244	null
2025-06-23	Cross-Architecture Knowledge Distillation (KD) for Retinal Fundus Image Anomaly Detection on NVIDIA Jetson Nano	Berk Yilmaz et.al.	2506.18220	null
2025-06-24	Multimodal Fusion SLAM with Fourier Attention	Youjie Zhou et.al.	2506.18204	null
2025-06-21	Enhancing Few-shot Keyword Spotting Performance through Pre-Trained Self-supervised Speech Models	Alican Gok et.al.	2506.17686	null
2025-06-19	Fine-grained Image Retrieval via Dual-Vision Adaptation	Xin Jiang et.al.	2506.16273	null
2025-06-19	Efficient and Privacy-Preserving Soft Prompt Transfer for LLMs	Xun Wang et.al.	2506.16196	null
2025-06-19	From Teacher to Student: Tracking Memorization Through Model Distillation	Simardeep Singh et.al.	2506.16170	null
2025-06-18	Factorized RVQ-GAN For Disentangled Speech Tokenization	Sameer Khurana et.al.	2506.15456	null
2025-06-18	FedWSIDD: Federated Whole Slide Image Classification via Dataset Distillation	Haolong Jin et.al.	2506.15365	link
2025-06-20	Knowledge Distillation Framework for Accelerating High-Accuracy Neural Network-Based Molecular Dynamics Simulations	Naoki Matsumura et.al.	2506.15337	null
2025-06-18	CKD-EHR:Clinical Knowledge Distillation for Electronic Health Records	Junke Wang et.al.	2506.15118	null
2025-06-17	AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes	Jiahao Qiu et.al.	2506.14728	null
2025-06-17	Dataset distillation for memorized data: Soft labels can leak held-out teacher knowledge	Freya Behrens et.al.	2506.14457	link
2025-06-17	Model compression using knowledge distillation with integrated gradients	David E. Hernandez et.al.	2506.14440	null
2025-06-17	KDMOS:Knowledge Distillation for Motion Segmentation	Chunyu Cao et.al.	2506.14130	link
2025-06-20	A Technical Study into 0.5B Reasoning Language Models	Xialie Zhuang et.al.	2506.13404	null
2025-06-17	SeqPE: Transformer with Sequential Position Encoding	Huayang Li et.al.	2506.13277	link
2025-06-16	Lightweight Task-Oriented Semantic Communication Empowered by Large-Scale AI Models	Chuanhong Liu et.al.	2506.13243	null
2025-06-16	I $^2$ S-TFCKD: Intra-Inter Set Knowledge Distillation with Time-Frequency Calibration for Speech Enhancement	Jiaming Cheng et.al.	2506.13127	link
2025-06-17	HKD4VLM: A Progressive Hybrid Knowledge Distillation Framework for Robust Multimodal Hallucination and Factuality Detection in VLMs	Zijian Zhang et.al.	2506.13038	null
2025-06-18	PLD: A Choice-Theoretic List-Wise Knowledge Distillation	Ejafa Bassam et.al.	2506.12542	null
2025-06-14	Merlin: Multi-View Representation Learning for Robust Multivariate Time Series Forecasting with Unfixed Missing Rates	Chengqing Yu et.al.	2506.12459	null
2025-06-13	Brewing Knowledge in Context: Distillation Perspectives on In-Context Learning	Chengye Li et.al.	2506.11516	null
2025-06-12	Ground Reaction Force Estimation via Time-aware Knowledge Distillation	Eun Som Jeon et.al.	2506.10265	null
2025-06-11	FedMLAC: Mutual Learning Driven Heterogeneous Federated Audio Classification	Jun Bai et.al.	2506.10207	null
2025-06-11	A Novel Lightweight Transformer with Edge-Aware Fusion for Remote Sensing Image Captioning	Swadhin Das et.al.	2506.09429	null
2025-06-08	ReStNet: A Reusable & Stitchable Network for Dynamic Adaptation on IoT Devices	Maoyu Wang et.al.	2506.09066	null
2025-06-10	SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning	Xiao Liang et.al.	2506.08989	link
2025-06-10	Multi-Teacher Language-Aware Knowledge Distillation for Multilingual Speech Emotion Recognition	Mehedi Hasan Bijoy et.al.	2506.08717	link
2025-06-10	Towards Class-wise Fair Adversarial Training via Anti-Bias Soft Label Distillation	Shiji Zhao et.al.	2506.08611	link
2025-06-09	Flowing Datasets with Wasserstein over Wasserstein Gradient Flows	Clément Bonet et.al.	2506.07534	link
2025-06-09	DPFormer: Dynamic Prompt Transformer for Continual Learning	Sheng-Kai Huang et.al.	2506.07414	null
2025-06-08	A Layered Self-Supervised Knowledge Distillation Framework for Efficient Multimodal Learning on the Edge	Tarique Dahri et.al.	2506.07055	null
2025-06-07	DivScore: Zero-Shot Detection of LLM-Generated Text in Specialized Domains	Zhihui Chen et.al.	2506.06705	null
2025-06-07	Training-Free Tokenizer Transplantation via Orthogonal Matching Pursuit	Charles Goddard et.al.	2506.06607	null
2025-06-06	Label-Context-Dependent Internal Language Model Estimation for CTC	Zijian Yang et.al.	2506.06096	null
2025-06-06	Being Strong Progressively! Enhancing Knowledge Distillation of Large Language Models through a Curriculum Learning Framework	Lingyuan Liu et.al.	2506.05695	link
2025-06-04	QA-HFL: Quality-Aware Hierarchical Federated Learning for Resource-Constrained Mobile Devices with Heterogeneous Image Quality	Sajid Hussain et.al.	2506.05411	null
2025-06-05	Static Word Embeddings for Sentence Semantic Representation	Takashi Wada et.al.	2506.04624	null
2025-06-05	StatsMerging: Statistics-Guided Model Merging via Task-Specific Teacher Distillation	Ranjith Merugu et.al.	2506.04567	link
2025-06-05	hdl2v: A Code Translation Dataset for Enhanced LLM Verilog Generation	Charles Hong et.al.	2506.04544	null
2025-06-04	Building a Few-Shot Cross-Domain Multilingual NLU Model for Customer Care	Saurabh Kumar et.al.	2506.04389	null
2025-06-04	Analyzing Transformer Models and Knowledge Distillation Approaches for Image Captioning on Edge AI	Wing Man Casca Kwok et.al.	2506.03607	null
2025-06-04	Debate, Reflect, and Distill: Multi-Agent Feedback with Tree-Structured Preference Optimization for Efficient Language Model Enhancement	Xiaofeng Zhou et.al.	2506.03541	null
2025-06-03	Pre-trained Vision-Language Models Assisted Noisy Partial Label Learning	Qian-Wei Wang et.al.	2506.03229	null
2025-06-03	Targeted Forgetting of Image Subgroups in CLIP Models	Zeliang Zhang et.al.	2506.03117	null
2025-06-03	TalkingMachines: Real-Time Audio-Driven FaceTime-Style Video via Autoregressive Diffusion Models	Chetwin Low et.al.	2506.03099	null
2025-06-03	MTL-KD: Multi-Task Learning Via Knowledge Distillation for Generalizable Neural Vehicle Routing Solver	Yuepeng Zheng et.al.	2506.02935	null
2025-06-03	Large-scale Self-supervised Video Foundation Model for Intelligent Surgery	Shu Yang et.al.	2506.02692	null
2025-06-03	One-Step Diffusion-based Real-World Image Super-Resolution with Visual Perception Distillation	Xue Wu et.al.	2506.02605	null
2025-06-03	Towards Better De-raining Generalization via Rainy Characteristics Memorization and Replay	Kunyu Wang et.al.	2506.02477	null
2025-06-04	Improving Knowledge Distillation Under Unknown Covariate Shift Through Confidence-Guided Data Augmentation	Niclas Popp et.al.	2506.02294	null
2025-06-02	VLCD: Vision-Language Contrastive Distillation for Accurate and Efficient Automatic Placenta Analysis	Manas Mehta et.al.	2506.02229	null
2025-06-02	KDRL: Post-Training Reasoning LLMs via Unified Knowledge Distillation and Reinforcement Learning	Hongling Xu et.al.	2506.02208	null
2025-06-02	MLLMs Need 3D-Aware Representation Supervision for Scene Understanding	Xiaohu Huang et.al.	2506.01946	null
2025-06-02	OD3: Optimization-free Dataset Distillation for Object Detection	Salwa K. Al Khatib et.al.	2506.01942	link
2025-06-02	Frugal Machine Learning for Energy-efficient, and Resource-aware Artificial Intelligence	John Violos et.al.	2506.01869	null
2025-06-02	Multi-Modal Dataset Distillation in the Wild	Zhuohang Dang et.al.	2506.01586	null
2025-06-02	Analyzing the Importance of Blank for CTC-Based Knowledge Distillation	Benedikt Hilmes et.al.	2506.01503	null
2025-06-03	Unlearning’s Blind Spots: Over-Unlearning and Prototypical Relearning Attack	SeungBum Ha et.al.	2506.01318	null
2025-05-30	CL-LoRA: Continual Low-Rank Adaptation for Rehearsal-Free Class-Incremental Learning	Jiangpeng He et.al.	2505.24816	link
2025-05-30	A Simple Linear Patch Revives Layer-Pruned Large Language Models	Xinrui Chen et.al.	2505.24680	null
2025-05-30	Hyperbolic Dataset Distillation	Wenyuan Li et.al.	2505.24623	link
2025-05-30	CREFT: Sequential Multi-Agent LLM for Character Relation Extraction	Ye Eun Chun et.al.	2505.24553	null
2025-05-30	Revisiting Cross-Modal Knowledge Distillation: A Disentanglement Approach for RGBD Semantic Segmentation	Roger Ferrod et.al.	2505.24361	link
2025-05-30	Progressive Class-level Distillation	Jiayan Li et.al.	2505.24310	null
2025-05-30	Proactive Guidance of Multi-Turn Conversation in Industrial Search	Xiaoyu Li et.al.	2505.24251	null
2025-05-30	Fine-tune Before Structured Pruning: Towards Compact and Accurate Self-Supervised Models for Speaker Diarization	Jiangyu Han et.al.	2505.24111	null
2025-05-29	Sketch Down the FLOPs: Towards Efficient Networks for Human Sketch	Aneeshan Sain et.al.	2505.23763	null
2025-05-29	Knowledge Distillation for Reservoir-based Classifier: Human Activity Recognition	Masaharu Kagiyama et.al.	2505.22985	null
2025-05-28	PRISM: Video Dataset Condensation with Progressive Refinement and Insertion for Sparse Motion	Jaehyun Choi et.al.	2505.22564	null
2025-05-28	Multi-MLLM Knowledge Distillation for Out-of-Context News Detection	Yimeng Gu et.al.	2505.22517	null
2025-05-28	A Closer Look at Multimodal Representation Collapse	Abhra Chaudhuri et.al.	2505.22483	null
2025-05-28	DAM: Domain-Aware Module for Multi-Domain Dataset Condensation	Jaehyun Choi et.al.	2505.22387	null
2025-05-28	RAD: Redundancy-Aware Distillation for Hybrid Models via Self-Speculative Decoding	Yuichiro Hoshino et.al.	2505.22135	null
2025-05-28	Delayed-KD: Delayed Knowledge Distillation based CTC for Low-Latency Streaming ASR	Longhao Li et.al.	2505.22069	null
2025-05-28	Improving Respiratory Sound Classification with Architecture-Agnostic Knowledge Distillation from Ensembles	Miika Toikkanen et.al.	2505.22027	link
2025-05-29	CAST: Contrastive Adaptation and Distillation for Semi-Supervised Instance Segmentation	Pardis Taghavi et.al.	2505.21904	null
2025-05-27	TuneComp: Joint Fine-tuning and Compression for Large Foundation Models	Xiangyu Chen et.al.	2505.21835	null
2025-05-27	Knowledge Distillation Approach for SOS Fusion Staging: Towards Fully Automated Skeletal Maturity Assessment	Omid Halimi Milani et.al.	2505.21561	null
2025-05-27	A Cross Modal Knowledge Distillation & Data Augmentation Recipe for Improving Transcriptomics Representations through Morphological Features	Ihab Bendidi et.al.	2505.21317	null
2025-05-27	Instance Data Condensation for Image Super-Resolution	Tianhao Peng et.al.	2505.21099	null
2025-05-27	EasyDistill: A Comprehensive Toolkit for Effective Knowledge Distillation of Large Language Models	Chengyu Wang et.al.	2505.20888	null
2025-05-27	Temporal Saliency-Guided Distillation: A Scalable Framework for Distilling Video Datasets	Xulin Gu et.al.	2505.20694	null
2025-05-26	Efficient Speech Translation through Model Compression and Knowledge Distillation	Yasmin Moslem et.al.	2505.20237	link
2025-05-26	Model Stitching by Functional Latent Alignment	Ioannis Athanasiadis et.al.	2505.20142	null
2025-05-28	Data-Distill-Net: A Data Distillation Approach Tailored for Reply-based Continual Learning	Wenyang Liao et.al.	2505.20135	null
2025-05-26	From Data to Modeling: Fully Open-vocabulary Scene Graph Generation	Zuyao Chen et.al.	2505.20106	null
2025-05-26	Optimizing edge AI models on HPC systems with the edge in the loop	Marcel Aach et.al.	2505.19995	link
2025-05-26	ESLM: Risk-Averse Selective Language Modeling for Efficient Pretraining	Melis Ilayda Bal et.al.	2505.19893	null
2025-05-26	Light distillation for Incremental Graph Convolution Collaborative Filtering	X Fan et.al.	2505.19810	null
2025-05-26	Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed Environments	Junming Liu et.al.	2505.19699	null
2025-05-26	DOGe: Defensive Output Generation for LLM Protection Against Knowledge Distillation	Pingzhi Li et.al.	2505.19504	link
2025-05-26	Diversity-Driven Generative Dataset Distillation Based on Diffusion Model with Self-Adaptive Memory	Mingzhuo Li et.al.	2505.19469	null
2025-05-25	Holistic White-light Polyp Classification via Alignment-free Dense Distillation of Auxiliary Optical Chromoendoscopy	Qiang Hu et.al.	2505.19319	link
2025-05-25	Remote Sensing Image Classification with Decoupled Knowledge Distillation	Yaping He et.al.	2505.19111	null
2025-05-25	Tokenizing Electron Cloud in Protein-Ligand Interaction Learning	Haitao Lin et.al.	2505.19014	null
2025-05-25	MGD $^3$ : Mode-Guided Dataset Distillation using Diffusion Models	Jeffrey A. Chan-Santiago et.al.	2505.18963	null
2025-05-25	Online Knowledge Distillation with Reward Guidance	Chen Jia et.al.	2505.18952	null
2025-05-24	C3R: Channel Conditioned Cell Representations for unified evaluation in microscopy imaging	Umar Marikkar et.al.	2505.18745	null
2025-05-23	Bidirectional Knowledge Distillation for Enhancing Sequential Recommendation with Large Language Models	Jiongran Wu et.al.	2505.18120	null
2025-05-23	Towards Heterogeneous Continual Graph Learning via Meta-knowledge Distillation	Guiquan Sun et.al.	2505.17458	null
2025-05-23	Reflectance Prediction-based Knowledge Distillation for Robust 3D Object Detection in Compressed Point Clouds	Hao Jing et.al.	2505.17442	null
2025-05-22	Extending Dataset Pruning to Object Detection: A Variance-based Approach	Ryota Yagi et.al.	2505.17245	null
2025-05-22	On Multilingual Encoder Language Model Compression for Low-Resource Languages	Daniil Gurgurov et.al.	2505.16956	null
2025-05-22	SEDD-PCC: A Single Encoder-Dual Decoder Framework For End-To-End Learned Point Cloud Compression	Kai Hsiang Hsieh et.al.	2505.16709	null
2025-05-22	ToDi: Token-wise Distillation via Fine-Grained Divergence Control	Seongryong Jung et.al.	2505.16297	null
2025-05-21	UWSAM: Segment Anything Model Guided Underwater Instance Segmentation and A Large-scale Benchmark Dataset	Hua Li et.al.	2505.15581	link
2025-05-21	On the Generalization vs Fidelity Paradox in Knowledge Distillation	Suhas Kamasetty Ramesh et.al.	2505.15442	link
2025-05-22	Contrastive Learning-Enhanced Trajectory Matching for Small-Scale Dataset Distillation	Wenmin Li et.al.	2505.15267	null
2025-05-22	MentalMAC: Enhancing Large Language Models for Detecting Mental Manipulation via Multi-Task Anti-Curriculum Distillation	Yuansheng Gao et.al.	2505.15255	null
2025-05-21	An Efficient Private GPT Never Autoregressively Decodes	Zhengyi Li et.al.	2505.15252	null
2025-05-21	Deliberation on Priors: Trustworthy Reasoning of Large Language Models on Knowledge Graphs	Jie Ma et.al.	2505.15210	link
2025-05-26	Exploring Generalized Gait Recognition: Reducing Redundancy and Noise within Indoor and Outdoor Datasets	Qian Zhou et.al.	2505.15176	null
2025-05-21	DeepKD: A Deeply Decoupled and Denoised Knowledge Distillation Trainer	Haiduo Huang et.al.	2505.15133	link
2025-05-20	Bridge the Gap between Past and Future: Siamese Model Optimization for Context-Aware Document Ranking	Songhao Wu et.al.	2505.14180	null
2025-05-20	Intra-class Patch Swap for Self-Distillation	Hongjun Choi et.al.	2505.14124	link
2025-05-20	Improved Methods for Model Pruning and Knowledge Distillation	Wei Jiang et.al.	2505.14052	null
2025-05-20	Interpretable Traces, Unexpected Outcomes: Investigating the Disconnect in Trace-Based Knowledge Distillation	Siddhant Bhambri et.al.	2505.13792	null
2025-05-20	Ground-V: Teaching VLMs to Ground Complex Instructions in Pixels	Yongshuo Zong et.al.	2505.13788	null
2025-05-19	SMOTExT: SMOTE meets Large Language Models	Mateusz Bystroński et.al.	2505.13434	null
2025-05-21	DD-Ranking: Rethinking the Evaluation of Dataset Distillation	Zekai Li et.al.	2505.13300	link
2025-05-19	Distilling a speech and music encoder with task arithmetic	Fabian Ritter-Gutierrez et.al.	2505.13270	null
2025-05-19	Quantum Knowledge Distillation for Large Language Models	Lingxiao Li et.al.	2505.13205	null
2025-05-19	Why Knowledge Distillation Works in Generative Models: A Minimal Working Explanation	Sungmin Cha et.al.	2505.13111	null
2025-05-19	Uniformity First: Uniformity-aware Test-time Adaptation of Vision-language Models against Image Corruption	Kazuki Adachi et.al.	2505.12912	link
2025-05-19	Towards Low-Latency Event Stream-based Visual Object Tracking: A Slow-Fast Approach	Shiao Wang et.al.	2505.12903	link
2025-05-19	ORQA: A Benchmark and Foundation Model for Holistic Operating Room Modeling	Ege Özsoy et.al.	2505.12890	null
2025-05-19	Robust Multimodal Segmentation with Representation Regularization and Hybrid Prototype Distillation	Jiaqi Tan et.al.	2505.12861	link
2025-05-19	A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone	Jitai Hao et.al.	2505.12781	link
2025-05-19	Bridging the Modality Gap: Enhancing Channel Prediction with Semantically Aligned LLMs and Knowledge Distillation	Zhaoyang Li et.al.	2505.12729	null
2025-05-18	SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning	Yang Liu et.al.	2505.12448	null
2025-05-18	LAMeTA: Intent-Aware Agentic Network Optimization via a Large AI Model-Empowered Two-Stage Approach	Yinqiu Liu et.al.	2505.12247	null
2025-05-18	Always Clear Depth: Robust Monocular Depth Estimation under Adverse Weather	Kui Jiang et.al.	2505.12199	link
2025-05-17	Denoising Mutual Knowledge Distillation in Bi-Directional Multiple Instance Learning	Chen Shu et.al.	2505.12074	null
2025-05-16	Bidirectional Distillation: A Mixed-Play Framework for Multi-Agent Generalizable Behaviors	Lang Feng et.al.	2505.11100	null
2025-05-16	Distilled Circuits: A Mechanistic Study of Internal Restructuring in Knowledge Distillation	Reilly Haskins et.al.	2505.10822	link
2025-05-15	Advancing Multiple Instance Learning with Continual Learning for Whole Slide Imaging	Xianrui Li et.al.	2505.10649	null
2025-05-14	DCSNet: A Lightweight Knowledge Distillation-Based Model with Explainable AI for Lung Cancer Diagnosis from Histopathological Images	Sadman Sakib Alif et.al.	2505.09334	null
2025-05-12	An Extra RMSNorm is All You Need for Fine Tuning to 1.58 Bits	Cody Steinmetz et.al.	2505.08823	null
2025-05-15	Leveraging Multi-Modal Information to Enhance Dataset Distillation	Zhe Li et.al.	2505.08605	null
2025-05-13	Scalable UAV Multi-Hop Networking via Multi-Agent Reinforcement Learning with Large Language Models	Yanggang Xu et.al.	2505.08448	null
2025-05-13	Low-Complexity Inference in Continual Learning via Compressed Knowledge Transfer	Zhenrong Liu et.al.	2505.08327	null
2025-05-13	MoKD: Multi-Task Optimization for Knowledge Distillation	Zeeshan Hayder et.al.	2505.08170	null
2025-05-14	Fusing Bidirectional Chains of Thought and Reward Mechanisms A Method for Enhancing Question-Answering Capabilities of Large Language Models for Chinese Intangible Cultural Heritage	Ruilin Liu et.al.	2505.08167	null
2025-05-13	Foundation Models Knowledge Distillation For Battery Capacity Degradation Forecast	Joey Chan et.al.	2505.08151	link
2025-05-12	Topology-Guided Knowledge Distillation for Efficient Point Cloud Processing	Luu Tung Hai et.al.	2505.08101	link
2025-05-12	Channel Fingerprint Construction for Massive MIMO: A Deep Conditional Generative Approach	Zhenzhou Jin et.al.	2505.07893	null
2025-05-12	Simple Semi-supervised Knowledge Distillation from Vision-Language Models via $\mathbf{\texttt{D}}$ual-$\mathbf{\texttt{H}}$ead $\mathbf{\texttt{O}}$ ptimization	Seongjae Kang et.al.	2505.07675	link
2025-05-12	Ranking-aware Continual Learning for LiDAR Place Recognition	Xufei Wang et.al.	2505.07198	null
2025-05-12	EmoVLM-KD: Fusing Distilled Expertise with Vision-Language Models for Visual Emotion Analysis	SangEun Lee et.al.	2505.07164	link
2025-05-12	KDH-MLTC: Knowledge Distillation for Healthcare Multi-Label Text Classification	Hajar Sakai et.al.	2505.07162	null
2025-05-11	Knowledge Distillation for Enhancing Walmart E-commerce Search Relevance Using Large Language Models	Hongwei Shang et.al.	2505.07105	null
2025-05-10	Video Dataset Condensation with Diffusion Models	Zhe Li et.al.	2505.06670	null
2025-05-10	Dataset Distillation with Probabilistic Latent Features	Zhe Li et.al.	2505.06647	null
2025-05-09	Robust & Precise Knowledge Distillation-based Novel Context-Aware Predictor for Disease Detection in Brain and Gastrointestinal	Saif Ur Rehman Khan et.al.	2505.06381	null
2025-05-09	Human in the Latent Loop (HILL): Interactively Guiding Model Training Through Human Intuition	Daniel Geissler et.al.	2505.06325	null
2025-05-08	Biomed-DPT: Dual Modality Prompt Tuning for Biomedical Vision-Language Models	Wei Peng et.al.	2505.05189	link
2025-05-11	Federated Deconfounding and Debiasing Learning for Out-of-Distribution Generalization	Zhuang Qi et.al.	2505.04979	null
2025-05-07	ABKD: Pursuing a Proper Allocation of the Probability Mass in Knowledge Distillation via $α$-$β$ -Divergence	Guanghui Wang et.al.	2505.04560	link
2025-05-07	Theoretical Guarantees for LT-TTD: A Unified Transformer-based Architecture for Two-Level Ranking Systems	Ayoub Abraich et.al.	2505.04434	null
2025-05-06	Knowledge Distillation Inspired Variational Quantum Eigensolver with Virtual Annealing	Junxu Li et.al.	2505.03998	null
2025-05-06	Action Spotting and Precise Event Detection in Sports: Datasets, Methods, and Challenges	Hao Xu et.al.	2505.03991	null
2025-05-06	Knowledge Distillation for Speech Denoising by Latent Representation Alignment with Cosine Distance	Diep Luong et.al.	2505.03442	null
2025-05-06	Artificial Behavior Intelligence: Technology, Challenges, and Future Directions	Kanghyun Jo et.al.	2505.03315	null
2025-05-06	SepALM: Audio Language Models Are Error Correctors for Robust Speech Separation	Zhaoxi Mu et.al.	2505.03273	null
2025-05-11	Image Recognition with Online Lightweight Vision Transformer: A Survey	Zherui Zhang et.al.	2505.03113	null
2025-05-05	FedSDAF: Leveraging Source Domain Awareness for Enhanced Federated Domain Generalization	Hongze Li et.al.	2505.02515	link
2025-05-08	Optimizing LLMs for Resource-Constrained Environments: A Survey of Model Compression Techniques	Sanjay Surendranath Girija et.al.	2505.02309	null
2025-05-06	Efficient Multivariate Time Series Forecasting via Calibrated Language Models with Privileged Knowledge Distillation	Chenxi Liu et.al.	2505.02138	link
2025-05-04	Segment Any RGB-Thermal Model with Language-aided Distillation	Dong Xing et.al.	2505.01950	null
2025-05-03	High-Fidelity Pseudo-label Generation by Large Language Models for Training Robust Radiology Report Classifiers	Brian Wong et.al.	2505.01693	null
2025-05-02	Improving Group Fairness in Knowledge Distillation via Laplace Approximation of Early Exits	Edvin Fasth et.al.	2505.01070	null
2025-05-02	Toward Data-centric Directed Graph Learning: An Entropy-driven Approach	Xunkai Li et.al.	2505.00983	null
2025-05-05	Llama-Nemotron: Efficient Reasoning Models	Akhiad Bercovich et.al.	2505.00949	null
2025-05-01	Uncertainty-Aware Multi-Expert Knowledge Distillation for Imbalanced Disease Grading	Shuo Tong et.al.	2505.00592	null
2025-04-30	Early Exit and Multi Stage Knowledge Distillation in VLMs for Video Summarization	Anas Anwarul Haq Khan et.al.	2504.21831	null
2025-04-30	CAE-DFKD: Bridging the Transferability Gap in Data-Free Knowledge Distillation	Zherui Zhang et.al.	2504.21478	null
2025-04-30	Enhancing New-item Fairness in Dynamic Recommender Systems	Huizhong Guo et.al.	2504.21362	link
2025-04-30	How to Backdoor the Knowledge Distillation	Chen Wu et.al.	2504.21323	null
2025-04-29	Federated One-Shot Learning with Data Privacy and Objective-Hiding	Maximilian Egger et.al.	2504.21182	null
2025-04-29	A Brief Review for Compression and Transfer Learning Techniques in DeepFake Detection	Andreas Karathanasis et.al.	2504.21066	null
2025-04-30	DS_FusionNet: Dynamic Dual-Stream Fusion with Bidirectional Knowledge Distillation for Plant Disease Recognition	Yanghui Song et.al.	2504.20948	link
2025-04-30	Trace-of-Thought Prompting: Investigating Prompt-Based Knowledge Distillation Through Question Decomposition	Tyler McDonald et.al.	2504.20946	null
2025-04-29	SAM-Guided Robust Representation Learning for One-Shot 3D Medical Image Segmentation	Jia Wang et.al.	2504.20501	null
2025-04-29	UniDetox: Universal Detoxification of Large Language Models via Dataset Distillation	Huimin Lu et.al.	2504.20500	link
2025-04-29	Group Relative Knowledge Distillation: Learning from Teacher’s Relational Inductive Bias	Chao Li et.al.	2504.20482	null
2025-04-29	The Estimation of Continual Causal Effect for Dataset Shifting Streams	Baining Chen et.al.	2504.20471	null
2025-04-29	Head-Tail-Aware KL Divergence in Knowledge Distillation for Spiking Neural Networks	Tianqing Zhang et.al.	2504.20445	null
2025-04-27	Swapped Logit Distillation via Bi-level Teacher Alignment	Stephen Ekaputra Limantoro et.al.	2504.20108	link
2025-04-25	DNAD: Differentiable Neural Architecture Distillation	Xuan Rao et.al.	2504.20080	null
2025-04-28	Mitigating Catastrophic Forgetting in the Incremental Learning of Medical Images	Sara Yavari et.al.	2504.20033	null
2025-04-28	Knowledge Distillation of Domain-adapted LLMs for Question-Answering in Telecom	Rishika Sen et.al.	2504.20000	null
2025-04-28	Federated Out-of-Distribution Generalization: A Causal Augmentation View	Runhui Zhang et.al.	2504.19882	null
2025-04-28	m-KAILIN: Knowledge-Driven Agentic Scientific Corpus Distillation Framework for Biomedical Large Language Models Training	Meng Xiao et.al.	2504.19565	null
2025-04-27	Privacy-Preserving Federated Embedding Learning for Localized Retrieval-Augmented Generation	Qianren Mao et.al.	2504.19101	null
2025-04-26	KETCHUP: K-Step Return Estimation for Sequential Knowledge Distillation	Jiabin Fan et.al.	2504.19024	null
2025-04-25	Intelligent Attacks and Defense Methods in Federated Learning-enabled Energy-Efficient Wireless Networks	Han Zhang et.al.	2504.18519	null
2025-04-24	Unified Attacks to Large Language Model Watermarks: Spoofing and Scrubbing in Unauthorized Knowledge Distillation	Xin Yi et.al.	2504.17480	null
2025-04-24	Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs	Tiancheng Gu et.al.	2504.17432	null
2025-04-24	Does Knowledge Distillation Matter for Large Language Model based Bundle Generation?	Kaidong Feng et.al.	2504.17220	null
2025-04-25	Latent Video Dataset Distillation	Ning Li et.al.	2504.17132	null
2025-04-23	Emo Pillars: Knowledge Distillation to Support Fine-Grained Context-Aware and Context-Less Emotion Classification	Alexander Shvets et.al.	2504.16856	null
2025-04-23	Revisiting Radar Camera Alignment by Contrastive Learning for 3D Object Detection	Linhua Kong et.al.	2504.16368	null
2025-04-21	Hybrid Knowledge Transfer through Attention and Logit Distillation for On-Device Vision Systems in Agricultural IoT	Stanley Mugisha et.al.	2504.16128	null
2025-04-21	MonoTher-Depth: Enhancing Thermal Depth Estimation via Confidence-Aware Distillation	Xingxing Zuo et.al.	2504.16127	null
2025-04-22	Honey, I Shrunk the Language Model: Impact of Knowledge Distillation Methods on Performance and Explainability	Daniel Hendriks et.al.	2504.16056	null
2025-04-21	Linear Item-Item Model with Neural Knowledge for Session-based Recommendation	Minjin Choi et.al.	2504.15057	null
2025-04-22	Distribution-aware Forgetting Compensation for Exemplar-Free Lifelong Person Re-identification	Shiben Liu et.al.	2504.15041	link
2025-04-21	Distribution-aware Dataset Distillation for Efficient Image Restoration	Zhuoran Zheng et.al.	2504.14826	null
2025-04-21	DONOD: Robust and Generalizable Instruction Fine-Tuning for LLMs via Model-Intrinsic Dataset Pruning	Jucheng Hu et.al.	2504.14810	null
2025-04-20	Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions	Luyang Fang et.al.	2504.14772	null
2025-04-20	Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis	Jingjing Ren et.al.	2504.14470	null
2025-04-19	Empirical Evaluation of Knowledge Distillation from Transformers to Subquadratic Language Models	Patrick Haller et.al.	2504.14366	null
2025-04-19	Learning from Stochastic Teacher Representations Using Student-Guided Knowledge Distillation	Muhammad Haseeb Aslam et.al.	2504.14307	null
2025-04-19	A Knowledge-Informed Deep Learning Paradigm for Generalizable and Stability-Optimized Car-Following Models	Chengming Wang et.al.	2504.14241	null
2025-04-19	Teach Me How to Denoise: A Universal Framework for Denoising Multi-modal Recommender Systems via Guided Calibration	Hongji Li et.al.	2504.14214	link
2025-04-18	Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models	Junjie Yang et.al.	2504.13825	null
2025-04-18	From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs	Jiliang Ni et.al.	2504.13471	null
2025-04-17	Scaling Laws for Data-Efficient Visual Transfer Learning	Wenxuan Yang et.al.	2504.13219	null
2025-04-13	Optimizing Multi-Gateway LoRaWAN via Cloud-Edge Collaboration and Knowledge Distillation	Hong Yang et.al.	2504.13194	null
2025-04-16	Transferable Deployment of Semantic Edge Inference Systems via Unsupervised Domain Adaption	Weiqiang Jiao et.al.	2504.11873	null
2025-04-15	A Dual-Space Framework for General Knowledge Distillation of Large Language Models	Xue Zhang et.al.	2504.11426	null
2025-04-15	Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning	Ali Taghibakhshi et.al.	2504.11409	null
2025-04-15	Distillation-Supervised Convolutional Low-Rank Adaptation for Efficient Image Super-Resolution	Xinning Chai et.al.	2504.11271	link
2025-04-15	Efficient Reasoning Models: A Survey	Sicheng Feng et.al.	2504.10903	link
2025-04-14	Optimising Intrusion Detection Systems in Cloud-Edge Continuum with Knowledge Distillation for Privacy-Preserving and Efficient Communication	Soad Almabdy et.al.	2504.10698	null
2025-04-14	Better Estimation of the KL Divergence Between Language Models	Afra Amini et.al.	2504.10637	link
2025-04-14	Digital Staining with Knowledge Distillation: A Unified Framework for Unpaired and Paired-But-Misaligned Data	Ziwang Xu et.al.	2504.09899	link
2025-04-14	DUDA: Distilled Unsupervised Domain Adaptation for Lightweight Semantic Segmentation	Beomseok Kang et.al.	2504.09814	null
2025-04-13	Can LLMs Revolutionize the Design of Explainable and Efficient TinyML Models?	Christophe El Zeinaty et.al.	2504.09685	null
2025-04-12	Learning Occlusion-Robust Vision Transformers for Real-Time UAV Tracking	You Wu et.al.	2504.09228	link
2025-04-12	Langformers: Unified NLP Pipelines for Language Models	Rabindra Lamsal et.al.	2504.09170	null
2025-04-12	Sculpting Memory: Multi-Concept Forgetting in Diffusion Models via Dynamic Mask and Concept-Aware Optimization	Gen Li et.al.	2504.09039	null
2025-04-11	Knowledge Distillation for Multimodal Egocentric Action Recognition Robust to Missing Modalities	Maria Santos-Villafranca et.al.	2504.08578	null
2025-04-11	Proxy-Anchor and EVT-Driven Continual Learning Method for Generalized Category Discovery	Alireza Fathalizadeh et.al.	2504.08550	link
2025-04-11	Knowledge Distillation for Underwater Feature Extraction and Matching via GAN-synthesized Images	Jinghe Yang et.al.	2504.08253	null
2025-04-10	Towards Unconstrained 2D Pose Estimation of the Human Spine	Muhammad Saif Ullah Khan et.al.	2504.08110	null
2025-04-10	SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement	Xiyao Wang et.al.	2504.07934	link
2025-04-10	Distilling Knowledge from Heterogeneous Architectures for Semantic Segmentation	Yanglin Huang et.al.	2504.07691	null
2025-04-10	ThermoStereoRT: Thermal Stereo Matching in Real Time via Knowledge Distillation and Attention-based Refinement	Anning Hu et.al.	2504.07418	null
2025-04-10	WK-Pnet: FM-Based Positioning via Wavelet Packet Decomposition and Knowledge Distillation	Shilian Zheng et.al.	2504.07399	null
2025-04-09	Teaching pathology foundation models to accurately predict gene expression with parameter efficient knowledge transfer	Shi Pan et.al.	2504.07061	null
2025-04-09	Efficient Self-Supervised Learning for Earth Observation via Dynamic Dataset Curation	Thomas Kerdreux et.al.	2504.06962	null
2025-04-08	Multi-Sense Embeddings for Language Models and Knowledge Distillation	Qitong Wang et.al.	2504.06036	null
2025-04-07	Learning Activity View-invariance Under Extreme Viewpoint Changes via Curriculum Knowledge Distillation	Arjun Somayazulu et.al.	2504.05451	null
2025-04-07	Reinforced Multi-teacher Knowledge Distillation for Efficient General Image Forgery Detection and Localization	Zeqin Yu et.al.	2504.05224	null
2025-04-07	Resource-Efficient Beam Prediction in mmWave Communications with Multimodal Realistic Simulation Framework	Yu Min Park et.al.	2504.05187	null
2025-04-07	GOTHAM: Graph Class Incremental Learning Framework under Weak Supervision	Aditya Hemant Shahane et.al.	2504.04954	link
2025-04-07	T1: Tool-integrated Self-verification for Test-time Compute Scaling in Small Language Models	Minki Kang et.al.	2504.04718	null
2025-04-06	A Novel Algorithm for Personalized Federated Learning: Knowledge Distillation with Weighted Combination Loss	Hengrui Hu et.al.	2504.04642	null
2025-04-08	Your Image Generator Is Your New Private Dataset	Nicolo Resmini et.al.	2504.04582	null
2025-04-05	CoMBO: Conflict Mitigation via Branched Optimization for Class Incremental Segmentation	Kai Fang et.al.	2504.04156	null
2025-04-09	Corrected with the Latest Version: Make Robust Asynchronous Federated Learning Possible	Chaoyi Lu et.al.	2504.04081	null
2025-04-04	Distillation and Refinement of Reasoning in Small Language Models for Document Re-ranking	Chris Samarinas et.al.	2504.03947	link
2025-04-03	UNDO: Understanding Distillation as Optimization	Kushal Jain et.al.	2504.02521	null
2025-04-03	Marine Saliency Segmenter: Object-Focused Conditional Diffusion with Region-Level Semantic Knowledge Distillation	Laibin Chang et.al.	2504.02391	null
2025-04-03	Agglomerating Large Vision Encoders via Distillation for VFSS Segmentation	Chengxi Zeng et.al.	2504.02351	null
2025-04-03	Causal Self-supervised Pretrained Frontend with Predictive Code for Speech Separation	Wupeng Wang et.al.	2504.02302	null
2025-04-03	Beyond Conventional Transformers: The Medical X-ray Attention (MXA) Block for Improved Multi-Label Diagnosis Using Knowledge Distillation	Amit Rand et.al.	2504.02277	link
2025-04-02	FlowDistill: Scalable Traffic Flow Prediction via Distillation from LLMs	Chenyang Yu et.al.	2504.02094	link
2025-04-02	Random Conditioning with Distillation for Data-Efficient Diffusion Model Compression	Dohyun Kim et.al.	2504.02011	null
2025-04-01	OccludeNeRF: Geometric-aware 3D Scene Inpainting with Collaborative Score Distillation in NeRF	Jingyu Shi et.al.	2504.02007	null
2025-04-02	A Novel Approach To Implementing Knowledge Distillation In Tsetlin Machines	Calvin Kinateder et.al.	2504.01798	null
2025-04-02	KD $^{2}$ M: An unifying framework for feature knowledge distillation	Eduardo Fernandes Montesuma et.al.	2504.01757	null
2025-04-02	Style over Substance: Distilled Language Models Reason Via Stylistic Replication	Philip Lippmann et.al.	2504.01738	null
2025-04-01	Data-free Knowledge Distillation with Diffusion Models	Xiaohua Qi et.al.	2504.00870	null
2025-04-01	Global Intervention and Distillation for Federated Out-of-Distribution Generalization	Zhuang Qi et.al.	2504.00850	null
2025-04-01	Sample-level Adaptive Knowledge Distillation for Action Recognition	Ping Li et.al.	2504.00606	null
2025-04-02	Adversarial Curriculum Graph-Free Knowledge Distillation for Graph Neural Networks	Yuang Jia et.al.	2504.00540	null
2025-03-31	Is LLM the Silver Bullet to Low-Resource Languages Machine Translation?	Yewei Song et.al.	2503.24102	null
2025-03-31	A Plasticity-Aware Method for Continual Self-Supervised Learning in Remote Sensing	Lars Möllenbrok et.al.	2503.24088	null
2025-03-31	Crossmodal Knowledge Distillation with WordNet-Relaxed Text Embeddings for Robust Image Classification	Chenqi Guo et.al.	2503.24017	null
2025-03-31	Unimodal-driven Distillation in Multimodal Emotion Recognition with Dynamic Fusion	Jiagen Li et.al.	2503.23721	null
2025-03-28	Efficient Verified Machine Unlearning For Distillation	Yijun Quan et.al.	2503.22539	null
2025-03-28	Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces	Wonhyeok Choi et.al.	2503.22209	null
2025-03-28	Multi-modal Knowledge Distillation-based Human Trajectory Forecasting	Jaewoo Jeong et.al.	2503.22201	link
2025-03-28	Permutation-Invariant and Orientation-Aware Dataset Distillation for 3D Point Clouds	Jae-Young Yim et.al.	2503.22154	null
2025-03-27	As easy as PIE: understanding when pruning causes language models to disagree	Pietro Tropeano et.al.	2503.21714	link
2025-03-27	DuckSegmentation: A segmentation model based on the AnYue Hemp Duck Dataset	Ling Feng et.al.	2503.21323	null
2025-03-27	Delving Deep into Semantic Relation Distillation	Zhaoyi Yan et.al.	2503.21269	null
2025-03-27	Alleviating LLM-based Generative Retrieval Hallucination in Alipay Search	Yedan Shen et.al.	2503.21098	null
2025-03-26	Small Object Detection: A Comprehensive Survey on Challenges, Techniques and Real-World Applications	Mahya Nikouei et.al.	2503.20516	null
2025-03-26	MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation	Rongyu Zhang et.al.	2503.20384	null
2025-03-26	Modality-Independent Brain Lesion Segmentation with Privacy-aware Continual Learning	Yousef Sadegheih et.al.	2503.20326	link
2025-03-25	Scaling Down Text Encoders of Text-to-Image Diffusion Models	Lifu Wang et.al.	2503.19897	link
2025-03-23	FedSKD: Aggregation-free Model-heterogeneous Federated Learning using Multi-dimensional Similarity Knowledge Distillation	Ziqiao Weng et.al.	2503.18981	null
2025-03-24	DINO in the Room: Leveraging 2D Foundation Models for 3D Segmentation	Karim Abou Zeid et.al.	2503.18944	link
2025-03-24	Curriculum Coarse-to-Fine Selection for High-IPC Dataset Distillation	Yanda Chen et.al.	2503.18872	link
2025-03-24	Generative Dataset Distillation using Min-Max Diffusion Model	Junqiao Fan et.al.	2503.18626	null
2025-03-24	Distilling Stereo Networks for Performant and Efficient Leaner Networks	Rafia Rahim et.al.	2503.18544	link
2025-03-24	Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding	Xiangrui Liu et.al.	2503.18478	null
2025-03-24	Plug-and-Play Interpretable Responsible Text-to-Image Generation via Dual-Space Multi-facet Concept Control	Basim Azam et.al.	2503.18324	null
2025-03-24	Enhancing Dataset Distillation via Non-Critical Region Refinement	Minh-Tuan Tran et.al.	2503.18267	null
2025-03-23	CustomKD: Customizing Large Vision Foundation for Edge Model Improvement via Knowledge Distillation	Jungsoo Lee et.al.	2503.18244	null
2025-03-25	Dataset Distillation for Quantum Neural Networks	Koustubh Phalak et.al.	2503.17935	null
2025-03-23	Finding Stable Subnetworks at Initialization with Dataset Distillation	Luke McDermott et.al.	2503.17905	null
2025-03-22	OmniScience: A Domain-Specialized LLM for Scientific Reasoning and Discovery	Vignesh Prabhakar et.al.	2503.17604	null
2025-03-21	Efficient Knowledge Distillation via Curriculum Extraction	Shivam Gupta et.al.	2503.17494	null
2025-03-21	Efficient Intent-Based Filtering for Multi-Party Conversations Using Knowledge Distillation from LLMs	Reem Gody et.al.	2503.17336	null
2025-03-21	Distilling Monocular Foundation Model for Fine-grained Depth Completion	Yingping Liang et.al.	2503.16970	null
2025-03-21	Sparse Logit Sampling: Accelerating Knowledge Distillation in LLMs	Anshumann et.al.	2503.16870	null
2025-03-21	City2Scene: Improving Acoustic Scene Classification with City Features	Yiqiang Cai et.al.	2503.16862	null
2025-03-20	Bezier Distillation	Ling Feng et.al.	2503.16562	null
2025-03-21	Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning	Zhaowei Liu et.al.	2503.16252	link
2025-03-20	InhibiDistilbert: Knowledge Distillation for a ReLU and Addition-based Transformer	Tony Zhang et.al.	2503.15983	null
2025-03-19	KoGNER: A Novel Framework for Knowledge Graph Distillation on Biomedical Named Entity Recognition	Heming Zhang et.al.	2503.15737	null
2025-03-19	Technical Report for the 5th CLVision Challenge at CVPR: Addressing the Class-Incremental with Repetition using Unlabeled Data – 4th Place Solution	Panagiota Moraiti et.al.	2503.15697	link
2025-03-19	High Temporal Consistency through Semantic Similarity Propagation in Semi-Supervised Video Semantic Segmentation for Autonomous Flight	Cédric Vincent et.al.	2503.15676	link
2025-03-19	DCA: Dividing and Conquering Amnesia in Incremental Object Detection	Aoting Zhang et.al.	2503.15295	link
2025-03-20	Distilling 3D distinctive local descriptors for 6D pose estimation	Amir Hamza et.al.	2503.15106	null
2025-03-19	Taming Flow Matching with Unbalanced Optimal Transport into Fast Pansharpening	Zihan Cao et.al.	2503.14975	null
2025-03-19	Ensemble Knowledge Distillation for Machine Learning Interatomic Potentials	Sakib Matin et.al.	2503.14293	null
2025-03-18	SCJD: Sparse Correlation and Joint Distillation for Efficient 3D Human Pose Estimation	Weihong Chen et.al.	2503.14097	null
2025-03-18	SCORE: Soft Label Compression-Centric Dataset Condensation via Coding Rate Optimization	Bowen Yuan et.al.	2503.13935	null
2025-03-18	Scale-Aware Contrastive Reverse Distillation for Unsupervised Medical Anomaly Detection	Chunlei Li et.al.	2503.13828	link
2025-03-17	DynSTG-Mamba: Dynamic Spatio-Temporal Graph Mamba with Cross-Graph Knowledge Distillation for Gait Disorders Recognition	Zakariae Zrimek et.al.	2503.13156	null
2025-03-17	Historic Scripts to Modern Vision: A Novel Dataset and A VLM Framework for Transliteration of Modi Script to Devanagari	Harshal Kausadikar et.al.	2503.13060	null
2025-03-17	Uncertainty-Aware Knowledge Distillation for Compact and Efficient 6DoF Pose Estimation	Nassim Ali Ousalah et.al.	2503.13053	null
2025-03-17	Knowledge Distillation: Enhancing Neural Network Compression with Integrated Gradients	David E. Hernandez et.al.	2503.13008	null
2025-03-17	Hydra-MDP++: Advancing End-to-End Driving via Expert-Guided Hydra-Distillation	Kailin Li et.al.	2503.12820	null
2025-03-16	Real-Time Cell Sorting with Scalable In Situ FPGA-Accelerated Deep Learning	Khayrul Islam et.al.	2503.12622	link
2025-03-16	UniBERTs: Adversarial Training for Language-Universal Representations	Andrei-Marius Avram et.al.	2503.12608	null
2025-03-15	Universal Speech Token Learning via Low-Bitrate Neural Codec and Pretrained Representations	Xue Jiang et.al.	2503.12115	null
2025-03-15	Robust Dataset Distillation by Matching Adversarial Trajectories	Wei Lai et.al.	2503.12069	null
2025-03-15	A Comprehensive Survey on Knowledge Distillation	Amir M. Mansourian et.al.	2503.12067	link
2025-03-14	Exploring Performance-Complexity Trade-Offs in Sound Event Detection	Tobias Morocutti et.al.	2503.11373	link
2025-03-14	Creating a Good Teacher for Knowledge Distillation in Acoustic Scene Classification	Tobias Morocutti et.al.	2503.11363	null
2025-03-14	Enabling Weak Client Participation via On-device Knowledge Distillation in Heterogenous Federated Learning	Jihyun Lim et.al.	2503.11151	null
2025-03-12	CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation	Hariprasath Govindarajan et.al.	2503.09878	null
2025-03-12	Vi-LAD: Vision-Language Attention Distillation for Socially-Aware Robot Navigation in Dynamic Environments	Mohamed Elnoor et.al.	2503.09820	null
2025-03-16	xVLM2Vec: Adapting LVLM-based embedding models to multilinguality using Self-Knowledge Distillation	Elio Musacchio et.al.	2503.09313	null
2025-03-12	Adaptive Temperature Based on Logits Correlation in Knowledge Distillation	Kazuhiro Matsuyama et.al.	2503.09030	link
2025-03-12	Unified Locomotion Transformer with Simultaneous Sim-to-Real Transfer for Quadrupeds	Dikai Liu et.al.	2503.08997	null
2025-03-11	LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization	Xianfeng Wu et.al.	2503.08619	link
2025-03-16	Structural and Statistical Texture Knowledge Distillation and Learning for Segmentation	Deyi Ji et.al.	2503.08043	null
2025-03-11	Generalized Kullback-Leibler Divergence Loss	Jiequan Cui et.al.	2503.08038	null
2025-03-11	Efficient Dataset Distillation through Low-Rank Space Sampling	Hangyang Kong et.al.	2503.07998	null
2025-03-10	Training Domain Draft Models for Speculative Decoding: Best Practices and Insights	Fenglu Hong et.al.	2503.07807	null
2025-03-10	ADROIT: A Self-Supervised Framework for Learning Robust Representations for Active Learning	Soumya Banerjee et.al.	2503.07506	null
2025-03-10	Distilling Knowledge into Quantum Vision Transformers for Biomedical Image Classification	Thomas Boucher et.al.	2503.07294	null
2025-03-10	CoT-Drive: Efficient Motion Forecasting for Autonomous Driving with LLMs and Chain-of-Thought Prompting	Haicheng Liao et.al.	2503.07234	null
2025-03-10	PTMs-TSCIL Pre-Trained Models Based Class-Incremental Learning	Yuanlong Wu et.al.	2503.07153	null
2025-03-10	Task-Specific Knowledge Distillation from the Vision Foundation Model for Enhanced Medical Image Segmentation	Pengchen Liang et.al.	2503.06976	null
2025-03-09	Asymmetric Decision-Making in Online Knowledge Distillation:Unifying Consensus and Divergence	Zhaowei Chen et.al.	2503.06685	null
2025-03-09	HFedCKD: Toward Robust Heterogeneous Federated Learning via Data-free Knowledge Distillation and Two-way Contrast	Yiting Zheng et.al.	2503.06511	null
2025-03-09	Causality Enhanced Origin-Destination Flow Prediction in Data-Scarce Cities	Tao Feng et.al.	2503.06398	null
2025-03-08	ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation	Qizhen Lan et.al.	2503.06307	null
2025-03-08	Improving SAM for Camouflaged Object Detection via Dual Stream Adapters	Jiaming Liu et.al.	2503.06042	null
2025-03-07	Semantic Shift Estimation via Dual-Projection and Classifier Reconstruction for Exemplar-Free Class-Incremental Learning	Run He et.al.	2503.05423	link
2025-03-07	Spatial Distillation based Distribution Alignment (SDDA) for Cross-Headset EEG Classification	Dingkun Liu et.al.	2503.05349	link
2025-03-07	Similarity-Based Domain Adaptation with LLMs	Jie He et.al.	2503.05281	null
2025-03-05	ZAugNet for Z-Slice Augmentation in Bio-Imaging	Alessandro Pasqui et.al.	2503.04843	link
2025-03-05	Distilling Dataset into Neural Field	Donghyeok Shin et.al.	2503.04835	link
2025-03-10	RD Efficient FPGA Deployment of Learned Image Compression: Knowledge Distillation and Hybrid Quantization	Alaa Mazouz et.al.	2503.04832	null
2025-03-07	No Forgetting Learning: Memory-free Continual Learning	Mohammad Ali Vahedifar et.al.	2503.04638	null
2025-03-06	scDD: Latent Codes Based scRNA-seq Dataset Distillation with Foundation Model Knowledge	Zhen Yu et.al.	2503.04357	null
2025-03-05	KLiNQ: Knowledge Distillation-Assisted Lightweight Neural Network for Qubit Readout on FPGA	Xiaorang Guo et.al.	2503.03544	null
2025-03-05	Temporal Separation with Entropy Regularization for Knowledge Distillation in Spiking Neural Networks	Kairong Yu et.al.	2503.03144	null
2025-03-04	It Helps to Take a Second Opinion: Teaching Smaller LLMs to Deliberate Mutually via Selective Rationale Optimisation	Sohan Patnaik et.al.	2503.02463	null
2025-03-04	Semantic Prior Distillation with Vision Foundation Model for Enhanced Rapid Bone Scintigraphy Image Restoration	Pengchen Liang et.al.	2503.02321	null
2025-02-27	Enhancing Transformer with GNN Structural Knowledge via Distillation: A Novel Approach	Zhihua Duan et.al.	2503.01888	null
2025-03-03	Mamba base PKD for efficient knowledge compression	José Medina et.al.	2503.01727	null
2025-03-03	DILEMMA: Joint LLM Quantization and Distributed LLM Inference Over Edge Computing Systems	Minoo Hosseinzadeh et.al.	2503.01704	null
2025-03-03	Understanding Dataset Distillation via Spectral Filtering	Deyu Bo et.al.	2503.01212	null
2025-03-02	Training-Free Dataset Pruning for Instance Segmentation	Yalun Dai et.al.	2503.00828	link
2025-03-01	SGC-Net: Stratified Granular Comparison Network for Open-Vocabulary HOI Detection	Xin Lin et.al.	2503.00414	link
2025-02-28	Real-Time Aerial Fire Detection on Resource-Constrained Devices Using Knowledge Distillation	Sabina Jangirova et.al.	2502.20979	null
2025-02-28	VRM: Knowledge Distillation via Virtual Relation Matching	Weijia Zhang et.al.	2502.20760	null
2025-02-28	Dataset Distillation with Neural Characteristic Function: A Minmax Perspective	Shaobo Wang et.al.	2502.20653	null
2025-02-27	SEKI: Self-Evolution and Knowledge Inspiration based Neural Architecture Search via Large Language Models	Zicheng Cai et.al.	2502.20422	null
2025-02-27	KEDRec-LM: A Knowledge-distilled Explainable Drug Recommendation Large Language Model	Kai Zhang et.al.	2502.20350	null
2025-02-27	Granite Embedding Models	Parul Awasthy et.al.	2502.20204	null
2025-02-28	Behind the Tip of Efficiency: Uncovering the Submerged Threats of Jailbreak Attacks in Small Language Models	Sibo Yi et.al.	2502.19883	null
2025-02-28	Lightweight Contrastive Distilled Hashing for Online Cross-modal Retrieval	Jiaxing Li et.al.	2502.19751	null
2025-02-27	XCOMPS: A Multilingual Benchmark of Conceptual Minimal Pairs	Linyang He et.al.	2502.19737	null
2025-02-26	Winning Big with Small Models: Knowledge Distillation vs. Self-Training for Reducing Hallucination in QA Agents	Ashley Lewis et.al.	2502.19545	null
2025-02-26	Knowledge Distillation for Semantic Segmentation: A Label Space Unification Approach	Anton Backhaus et.al.	2502.19177	null
2025-02-22	Multi-Teacher Knowledge Distillation with Reinforcement Learning for Visual Recognition	Chuanguang Yang et.al.	2502.18510	null
2025-02-25	AfroXLMR-Comet: Multilingual Knowledge Distillation with Attention Matching for Low-Resource languages	Joshua Sakthivel Raju et.al.	2502.18020	null
2025-02-25	Advantage-Guided Distillation for Preference Alignment in Small Language Models	Shiping Gao et.al.	2502.17927	link
2025-02-25	From underwater to aerial: a novel multi-scale knowledge distillation approach for coral reef monitoring	Matteo Contini et.al.	2502.17883	link
2025-02-24	Knowledge Distillation with Training Wheels	Guanlin Liu et.al.	2502.17717	null
2025-02-24	CLIMB-3D: Continual Learning for Imbalanced 3D Instance Segmentation	Vishal Thengane et.al.	2502.17429	link
2025-02-24	Implicit Word Reordering with Knowledge Distillation for Cross-Lingual Dependency Parsing	Zhuoran Li et.al.	2502.17308	null
2025-02-24	Improving the Transferability of Adversarial Examples by Inverse Knowledge Distillation	Wenyuan Wu et.al.	2502.17003	null
2025-02-24	PQDAST: Depth-Aware Arbitrary Style Transfer for Games via Perceptual Quality-Guided Distillation	Eleftherios Ioannou et.al.	2502.16996	null
2025-02-25	CoT2Align: Cross-Chain of Thought Distillation via Optimal Transport Alignment for Language Models with Different Tokenizers	Anh Duc Le et.al.	2502.16806	null
2025-02-24	A Transformer-in-Transformer Network Utilizing Knowledge Distillation for Image Recognition	Dewan Tauhid Rahman et.al.	2502.16762	null
2025-02-23	EDocNet: Efficient Datasheet Layout Analysis Based on Focus and Global Knowledge Distillation	Hong Cai Chen et.al.	2502.16541	null
2025-02-21	A Knowledge Distillation-Based Approach to Enhance Transparency of Classifier Models	Yuchen Jiang et.al.	2502.15959	link
2025-02-21	PPC-GPT: Federated Task-Specific Compression of Large Language Models via Pruning and Chain-of-Thought Distillation	Tao Fan et.al.	2502.15857	null
2025-02-21	Scaling Sparse and Dense Retrieval in Decoder-Only LLMs	Hansi Zeng et.al.	2502.15526	link
2025-02-20	Modifying Final Splits of Classification Tree for Fine-tuning Subpopulation Target in Policy Making	Lei Bill Wang et.al.	2502.15072	null
2025-02-20	TimeDistill: Efficient Long-Term Time Series Forecasting with MLP via Cross-Architecture Distillation	Juntong Ni et.al.	2502.15016	null
2025-02-20	Synergistic Fusion of Multi-Source Knowledge via Evidence Theory for High-Entropy Alloy Discovery	Minh-Quyet Ha et.al.	2502.14631	null
2025-02-21	Vision Foundation Models in Medical Image Analysis: Advances and Challenges	Pengchen Liang et.al.	2502.14584	null
2025-02-20	Self-supervised Monocular Depth Estimation Robust to Reflective Surface Leveraged by Triplet Mining	Wonhyeok Choi et.al.	2502.14573	null
2025-02-20	Efficient AI in Practice: Training and Deployment of Efficient LLMs for Industry Applications	Kayhan Behdin et.al.	2502.14305	null
2025-02-20	Designing Parameter and Compute Efficient Diffusion Transformers using Distillation	Vignesh Sundaresha et.al.	2502.14226	null
2025-02-19	MambaLiteSR: Image Super-Resolution with Low-Rank Mamba using Knowledge Distillation	Romina Aalishah et.al.	2502.14090	null
2025-02-19	Towards Vector Optimization on Low-Dimensional Vector Symbolic Architecture	Shijin Duan et.al.	2502.14075	null
2025-02-19	Dynamic Activation with Knowledge Distillation for Energy-Efficient Spiking NN Ensembles	Orestis Konstantaropoulos et.al.	2502.14023	null
2025-02-19	Capturing Rich Behavior Representations: A Dynamic Action Semantic-Aware Graph Transformer for Video Captioning	Caihua Liu et.al.	2502.13754	null
2025-02-19	Secure Federated Data Distillation	Marco Arazzi et.al.	2502.13728	null
2025-02-19	JL1-CD: A New Benchmark for Remote Sensing Change Detection and a Robust Multi-Teacher Knowledge Distillation Framework	Ziyuan Liu et.al.	2502.13407	link
2025-02-21	NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions	Weizhe Yuan et.al.	2502.13124	null
2025-02-18	Does Training with Synthetic Data Truly Protect Privacy?	Yunpeng Zhao et.al.	2502.12976	link
2025-02-18	Every Expert Matters: Towards Effective Knowledge Distillation for Mixture-of-Experts Language Models	Gyeongman Kim et.al.	2502.12947	null
2025-02-18	Integrating Arithmetic Learning Improves Mathematical Reasoning in Smaller Models	Neeraj Gangwar et.al.	2502.12855	null
2025-02-18	Generalized Kernel Inducing Points by Duality Gap for Dataset Distillation	Tatsuya Aoyama et.al.	2502.12607	null
2025-02-17	Warmup-Distill: Bridge the Distribution Mismatch between Teacher and Student before Knowledge Distillation	Zengkui Sun et.al.	2502.11766	link
2025-02-17	Can LLM Watermarks Robustly Prevent Unauthorized Knowledge Distillation?	Leyi Pan et.al.	2502.11598	link
2025-02-17	Leave No One Behind: Enhancing Diversity While Maintaining Accuracy in Social Recommendation	Lei Li et.al.	2502.11374	link
2025-02-16	Smoothing Out Hallucinations: Mitigating LLM Hallucination with Smoothed Knowledge Distillation	Hieu Nguyen et.al.	2502.11306	null
2025-02-16	Leveraging Conditional Mutual Information to Improve Large Language Model Fine-Tuning For Classification	Thanushon Sivakaran et.al.	2502.11258	null
2025-02-16	DAViMNet: SSMs-Based Domain Adaptive Object Detection	A. Enes Doruk et.al.	2502.11178	link
2025-02-16	Enhancing Cross-Tokenizer Knowledge Distillation with Contextual Dynamical Mapping	Yijie Chen et.al.	2502.11104	link
2025-02-15	LLM-driven Knowledge Distillation for Dynamic Text-Attributed Graphs	Amit Roy et.al.	2502.10914	null
2025-02-15	CLoCKDistill: Consistent Location-and-Context-aware Knowledge Distillation for DETRs	Qizhen Lan et.al.	2502.10683	null
2025-02-14	EVODMs: variational learning of PDEs for stochastic systems via diffusion models with quantified epistemic uncertainty	Zequn He et.al.	2502.10588	null
2025-02-13	AIDE: Agentically Improve Visual Language Model with Domain Experts	Ming-Chang Chiu et.al.	2502.09051	null
2025-02-12	LLM Pretraining with Continuous Concepts	Jihoon Tack et.al.	2502.08524	null
2025-02-11	Vision-Language Models for Edge Networks: A Comprehensive Survey	Ahmed Sharshar et.al.	2502.07855	null
2025-02-11	Optimizing Knowledge Distillation in Transformers: Enabling Multi-Head Attention without Alignment Barriers	Zhaodong Bing et.al.	2502.07436	null
2025-02-11	OpenGrok: Enhancing SNS Data Processing with Distilled Knowledge and Mask-like Mechanisms	Lumen AI et.al.	2502.07312	link
2025-02-11	Life-Code: Central Dogma Modeling with Multi-Omics Sequence Unification	Zicheng Liu et.al.	2502.07299	null
2025-02-10	DROP: Poison Dilution via Knowledge Distillation for Federated Learning	Georgios Syros et.al.	2502.07011	link
2025-02-13	A Simple yet Effective DDG Predictor is An Unsupervised Antibody Optimizer and Explainer	Lirong Wu et.al.	2502.06913	link
2025-02-10	Lightweight Dataset Pruning without Full Training via Example Difficulty and Prediction Uncertainty	Yeseul Cho et.al.	2502.06905	link
2025-02-13	Rationalization Models for Text-to-SQL	Gaetano Rossiello et.al.	2502.06759	null
2025-02-10	Rethinking Large-scale Dataset Compression: Shifting Focus From Labels to Images	Lingao Xiao et.al.	2502.06434	null
2025-02-10	Progressive Collaborative and Semantic Knowledge Fusion for Generative Recommendation	Longtao Xiao et.al.	2502.06269	null
2025-02-10	Right Time to Learn:Promoting Generalization via Bio-inspired Spacing Effect in Knowledge Distillation	Guanglong Sun et.al.	2502.06192	link
2025-02-10	Multi-Level Decoupled Relational Distillation for Heterogeneous Architectures	Yaoxin Yang et.al.	2502.06189	null
2025-02-12	A Novel Multi-Teacher Knowledge Distillation for Real-Time Object Detection using 4D Radar	Seung-Hyun Song et.al.	2502.06114	null
2025-02-09	ClinKD: Cross-Modal Clinic Knowledge Distiller For Multi-Task Medical Images	Hongyu Ge et.al.	2502.05928	link
2025-02-09	Learning Accurate, Efficient, and Interpretable MLPs on Multiplex Graphs via Node-wise Multi-View Ensemble Distillation	Yunhui Liu et.al.	2502.05864	null
2025-02-09	Synergistic Effects of Knowledge Distillation and Structured Pruning for Self-Supervised Speech Models	Shiva Kumar C et.al.	2502.05837	null
2025-02-09	Contrastive Representation Distillation via Multi-Scale Feature Decoupling	Cuipeng Wang et.al.	2502.05835	null
2025-02-09	Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models	Jing-Xuan Zhang et.al.	2502.05766	link
2025-02-08	The Evolution of Dataset Distillation: Toward Scalable and Generalizable Solutions	Ping Liu et.al.	2502.05673	null
2025-02-08	Event Stream-based Visual Object Tracking: HDETrack V2 and A High-Definition Benchmark	Shiao Wang et.al.	2502.05574	link
2025-02-08	Demystifying Catastrophic Forgetting in Two-Stage Incremental Object Detector	Qirui Wu et.al.	2502.05540	null
2025-02-07	Trust-Aware Diversion for Data-Effective Distillation	Zhuojie Wu et.al.	2502.05027	null
2025-02-07	Dynamic Frequency-Adaptive Knowledge Distillation for Speech Enhancement	Xihao Yuan et.al.	2502.04711	null
2025-02-06	Multilingual Non-Autoregressive Machine Translation without Knowledge Distillation	Chenyang Huang et.al.	2502.04537	link
2025-02-06	Revisiting Intermediate-Layer Matching in Knowledge Distillation: Layer-Selection Strategy Doesn’t Matter (Much)	Zony Yu et.al.	2502.04499	null
2025-02-06	Dark Distillation: Backdooring Distilled Datasets without Accessing Raw Data	Ziyuan Yang et.al.	2502.04229	null
2025-02-06	PGB: One-Shot Pruning for BERT via Weight Grouping and Permutation	Hyemin Lim et.al.	2502.03984	null
2025-02-06	Towards Unified Music Emotion Recognition across Dimensional and Categorical Models	Jaeyong Kang et.al.	2502.03979	link
2025-02-06	BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation	Bo Pang et.al.	2502.03860	null
2025-02-06	Taking A Closer Look at Interacting Objects: Interaction-Aware Open Vocabulary Scene Graph Generation	Lin Li et.al.	2502.03856	null
2025-02-05	A Study in Dataset Distillation for Image Super-Resolution	Tobias Dietz et.al.	2502.03656	null
2025-02-05	Knowledge Distillation from Large Language Models for Household Energy Modeling	Mohannad Takrouri et.al.	2502.03034	null
2025-02-05	Training an LLM-as-a-Judge Model: Pipeline, Insights, and Practical Lessons	Renjun Hu et.al.	2502.02988	null
2025-02-06	TD3: Tucker Decomposition Based Dataset Distillation Method for Sequential Recommendation	Jiaqing Zhang et.al.	2502.02854	link
2025-02-04	On Teacher Hacking in Language Model Distillation	Daniil Tiapkin et.al.	2502.02671	null
2025-02-03	Enhancing Generalization via Sharpness-Aware Trajectory Matching for Dataset Condensation	Boyan Gao et.al.	2502.01865	null
2025-02-03	Memorization Inheritance in Sequence-Level Knowledge Distillation for Neural Machine Translation	Verna Dankers et.al.	2502.01491	null
2025-02-03	CleanPose: Category-Level Object Pose Estimation via Causal Learning and Knowledge Distillation	Xiao Lin et.al.	2502.01312	null
2025-02-03	A Framework for Double-Blind Federated Adaptation of Foundation Models	Nurbek Tastan et.al.	2502.01289	null
2025-02-03	MIND: Modality-Informed Knowledge Distillation Framework for Multimodal Clinical Prediction Tasks	Alejandro Guerra-Manzanares et.al.	2502.01158	null
2025-02-02	FedHPD: Heterogeneous Federated Reinforcement Learning via Policy Distillation	Wenzheng Jiang et.al.	2502.00870	link
2025-02-02	VLM-Assisted Continual learning for Visual Question Answering in Self-Driving	Yuxin Lin et.al.	2502.00843	null
2025-02-02	A method for estimating forest carbon storage distribution density via artificial intelligence generated content model	Zhenyu Yu et.al.	2502.00783	null
2025-02-02	Role of Mixup in Topological Persistence Based Knowledge Distillation for Wearable Sensor Data	Eun Som Jeon et.al.	2502.00779	null
2025-02-02	VIKSER: Visual Knowledge-Driven Self-Reinforcing Reasoning Framework	Chunbai Zhang et.al.	2502.00711	null
2025-02-01	Robust Knowledge Distillation in Federated Learning: Counteracting Backdoor Attacks	Ebtisaam Alharbi et.al.	2502.00587	link
2025-01-31	Imagine with the Teacher: Complete Shape in a Multi-View Distillation Way	Zhanpeng Luo et.al.	2501.19270	null
2025-01-30	Rethinking the Upsampling Layer in Hyperspectral Image Super Resolution	Haohan Shi et.al.	2501.18664	null
2025-01-30	Mini-ResEmoteNet: Leveraging Knowledge Distillation for Human-Centered Design	Amna Murtada et.al.	2501.18538	null
2025-01-29	RL-based Query Rewriting with Distilled LLM for online E-Commerce Systems	Duy A. Nguyen et.al.	2501.18056	null
2025-01-29	Distilling Knowledge for Designing Computational Imaging Systems	Leon Suarez-Rodriguez et.al.	2501.17898	link
2025-01-29	Tapor: 3D Hand Pose Reconstruction with Fully Passive Thermal Sensing for Around-device Interactions	Xie Zhang et.al.	2501.17585	link
2025-01-28	A Contrastive Teacher-Student Framework for Novelty Detection under Style Shifts	Hossein Mirzaei et.al.	2501.17289	null
2025-01-28	FedEFM: Federated Endovascular Foundation Model with Unseen Data	Tuong Do et.al.	2501.16992	null
2025-01-28	Heterogeneity-aware Personalized Federated Learning via Adaptive Dual-Agent Reinforcement Learning	Xi Chen et.al.	2501.16966	null
2025-01-29	TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models	Makoto Shing et.al.	2501.16937	null
2025-01-28	Target-driven Self-Distillation for Partial Observed Trajectories Forecasting	Pengfei Zhu et.al.	2501.16767	null
2025-01-28	Efficient Knowledge Distillation of SAM for Medical Image Segmentation	Kunal Dasharath Patil et.al.	2501.16740	null
2025-01-30	Return of the Encoder: Maximizing Parameter Efficiency for SLMs	Mohamed Elfeki et.al.	2501.16273	link
2025-01-27	PISCO: Pretty Simple Compression for Retrieval-Augmented Generation	Maxime Louis et.al.	2501.16075	null
2025-01-26	MimicGait: A Model Agnostic approach for Occluded Gait Recognition using Correlational Knowledge Distillation	Ayush Gupta et.al.	2501.15666	link
2025-01-26	Scaling Large Vision-Language Models for Enhanced Multimodal Comprehension In Biomedical Image Analysis	Robinson Umeike et.al.	2501.15370	null
2025-01-25	Pre-trained Model Guided Mixture Knowledge Distillation for Adversarial Federated Learning	Yu Qiao et.al.	2501.15257	null
2025-01-25	Knowledge Hierarchy Guided Biological-Medical Dataset Distillation for Domain LLM Training	Xunxin Cai et.al.	2501.15108	null
2025-01-25	Graph-Based Cross-Domain Knowledge Distillation for Cross-Dataset Text-to-Image Person Retrieval	Bingjun Luo et.al.	2501.15052	null
2025-01-28	On Accelerating Edge AI: Optimizing Resource-Constrained Environments	Jacob Sander et.al.	2501.15014	null
2025-01-24	Remining Hard Negatives for Generative Pseudo Labeled Domain Adaptation	Goksenin Yuksel et.al.	2501.14434	null
2025-01-24	Multimodal Prescriptive Deep Learning	Dimitris Bertsimas et.al.	2501.14152	null
2025-01-23	On Learning Representations for Tabular Data Distillation	Inwon Kang et.al.	2501.13905	null
2025-01-23	Unlearning Clients, Features and Samples in Vertical Federated Learning	Ayush K. Varshney et.al.	2501.13683	null
2025-01-28	Multi-aspect Knowledge Distillation with Large Language Model	Taegyeong Lee et.al.	2501.13341	link
2025-01-22	LiT: Delving into a Simplified Linear Diffusion Transformer for Image Generation	Jiahao Wang et.al.	2501.12976	null
2025-01-24	EchoLM: Accelerating LLM Serving with Real-time Knowledge Distillation	Yifan Yu et.al.	2501.12689	null
2025-01-22	Extracting General-use Transformers for Low-resource Languages via Knowledge Distillation	Jan Christian Blaise Cruz et.al.	2501.12660	null
2025-01-22	Toward Model-centric Heterogeneous Federated Graph Learning: A Knowledge-driven Approach	Huilin lai et.al.	2501.12624	null
2025-01-21	Efficient Lung Ultrasound Severity Scoring Using Dedicated Feature Extractor	Jiaqi Guo et.al.	2501.12524	link
2025-01-19	AI Based Font Pair Suggestion Modelling For Graphic Design	Aryan Singh et.al.	2501.10969	null
2025-01-18	Learning to reconstruct signals with inexact sensing operator via knowledge distillation	Roman Jacome et.al.	2501.10794	null
2025-01-18	Class-Imbalanced-Aware Adaptive Dataset Distillation for Scalable Pretrained Model on Credit Scoring	Xia Li et.al.	2501.10677	null
2025-01-18	DNA 1.0 Technical Report	Jungyup Lee et.al.	2501.10648	null
2025-01-15	Efficient Traffic Prediction Through Spatio-Temporal Distillation	Qianru Zhang et.al.	2501.10459	link
2025-01-16	Enhancing Generalization in Chain of Thought Reasoning for Smaller Models	Maxwell J. Yin et.al.	2501.09804	null
2025-01-19	Class Incremental Fault Diagnosis under Limited Fault Data via Supervised Contrastive Knowledge Distillation	Hanrong Zhang et.al.	2501.09525	link
2025-01-16	Soft Knowledge Distillation with Multi-Dimensional Cross-Net Attention for Image Restoration Models Compression	Yongheng Zhang et.al.	2501.09321	null
2025-01-16	Knowledge Distillation for Image Restoration : Simultaneous Learning from Degraded and Clean Images	Yongheng Zhang et.al.	2501.09268	null
2025-01-15	Towards Fast, Specialized Machine Learning Force Fields: Distilling Foundation Models via Energy Hessians	Ishan Amin et.al.	2501.09009	link
2025-01-17	VECT-GAN: A variationally encoded generative model for overcoming data scarcity in pharmaceutical science	Youssef Abdalla et.al.	2501.08995	link
2025-01-15	Feature-based One-For-All: A Universal Framework for Heterogeneous Knowledge Distillation	Jhe-Hao Lin et.al.	2501.08885	null
2025-01-14	Self-Attentive Spatio-Temporal Calibration for Precise Intermediate Layer Matching in ANN-to-SNN Distillation	Di Hong et.al.	2501.08049	link
2025-01-14	Balance Divergence for Knowledge Distillation	Yafei Qi et.al.	2501.07804	null
2025-01-13	Dataset Distillation as Pushforward Optimal Quantization	Hong Ye Tan et.al.	2501.07681	null
2025-01-13	Dataset Distillation via Committee Voting	Jiacheng Cui et.al.	2501.07575	link
2025-01-13	Knowledge Distillation and Enhanced Subdomain Adaptation Using Graph Convolutional Network for Resource-Constrained Bearing Fault Diagnosis	Mohammadreza Kavianpour et.al.	2501.07173	null
2025-01-13	Dual Scale-aware Adaptive Masked Knowledge Distillation for Object Detection	ZhouRui Zhang et.al.	2501.07101	null
2025-01-13	Research on the Online Update Method for Retrieval-Augmented Generation (RAG) Model with Incremental Learning	Yuxin Fan et.al.	2501.07063	null
2025-01-13	Rethinking Knowledge in Distillation: An In-context Sample Retrieval Perspective	Jinjing Zhu et.al.	2501.07040	null
2025-01-12	Application of Vision-Language Model to Pedestrians Behavior and Scene Understanding in Autonomous Driving	Haoxiang Gao et.al.	2501.06680	null
2025-01-11	FocusDD: Real-World Scene Infusion for Robust Dataset Distillation	Youbing Hu et.al.	2501.06405	null
2025-01-10	Overcoming Language Priors for Visual Question Answering Based on Knowledge Distillation	Daowan Peng et.al.	2501.05690	null
2025-01-09	LLMQuoter: Enhancing RAG Capabilities Through Efficient Quote Extraction From Large Contexts	Yuri Facanha Bezerra et.al.	2501.05554	link
2025-01-08	Boosting Salient Object Detection with Knowledge Distillated from Large Foundation Models	Miaoyang He et.al.	2501.04582	null
2025-01-08	Federated Fine-Tuning of LLMs: Framework Comparison and Research Directions	Na Yan et.al.	2501.04436	null
2025-01-08	Enhancing Scene Classification in Cloudy Image Scenarios: A Collaborative Transfer Method with Information Regulation Mechanism using Optical Cloud-Covered and SAR Remote Sensing Images	Yuze Wang et.al.	2501.04283	null
2025-01-08	Generative Dataset Distillation Based on Self-knowledge Distillation	Longzhen Li et.al.	2501.04202	null
2025-01-07	FedKD-hybrid: Federated Hybrid Knowledge Distillation for Lithography Hotspot Detection	Yuqi Li et.al.	2501.04066	link
2025-01-07	A Diversity-Enhanced Knowledge Distillation Model for Practical Math Word Problem Solving	Yi Zhang et.al.	2501.03670	link
2025-01-07	ConcealGS: Concealing Invisible Copyright Information in 3D Gaussian Splatting	Yifeng Yang et.al.	2501.03605	link
2025-01-05	Strategic Fusion Optimizes Transformer Compression	Md Shoaibur Rahman et.al.	2501.03273	null
2025-01-04	Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies	Xubin Wang et.al.	2501.03265	link
2025-01-07	LightGNN: Simple Graph Neural Network for Recommendation	Guoxuan Chen et.al.	2501.03228	link
2025-01-06	Comprehensive Pathological Image Segmentation via Teacher Aggregation for Tumor Microenvironment Analysis	Daisuke Komura et.al.	2501.02909	null
2025-01-06	Knowledge Distillation with Adapted Weight	Sirong Wu et.al.	2501.02705	null
2025-01-05	Swift Cross-Dataset Pruning: Enhancing Fine-Tuning Efficiency in Natural Language Understanding	Binh-Nguyen Nguyen et.al.	2501.02432	link
2025-01-04	Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison	Tsz Kin Lam et.al.	2501.02370	null
2025-01-04	V2X-DGPE: Addressing Domain Gaps and Pose Errors for Robust Collaborative 3D Object Detection	Sichao Wang et.al.	2501.02363	link
2025-01-09	KD-MSLRT: Lightweight Sign Language Recognition Model Based on Mediapipe and 3D to 1D Knowledge Distillation	Yulong Li et.al.	2501.02321	null
2025-01-04	Distillation-Enhanced Physical Adversarial Attacks	Wei Liu et.al.	2501.02232	null
2025-01-03	Structural and Statistical Audio Texture Knowledge Distillation (SSATKD) for Passive Sonar Classification	Jarin Ritu et.al.	2501.01921	link
2025-01-03	MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders	Jiajun Cao et.al.	2501.01709	null
2025-01-02	DiagrammaticLearning: A Graphical Language for Compositional Training Regimes	Mason Lary et.al.	2501.01515	null
2024-12-31	Pan-infection Foundation Framework Enables Multiple Pathogen Prediction	Lingrui Zhang et.al.	2501.01462	null
2025-01-01	LENS-XAI: Redefining Lightweight and Explainable Network Security through Knowledge Distillation and Variational Autoencoders for Scalable Intrusion Detection in Cybersecurity	Muhammet Anil Yagiz et.al.	2501.00790	null
2024-12-30	Temporal reasoning for timeline summarisation in social media	Jiayu Song et.al.	2501.00152	null
2024-12-30	A Large-Scale Study on Video Action Dataset Condensation	Yang Chen et.al.	2412.21197	link
2024-12-30	Improving Acoustic Scene Classification in Low-Resource Conditions	Zhi Chen et.al.	2412.20722	null
2025-01-05	Distilling Desired Comments for Enhanced Code Review with Large Language Models	Yongda Yu et.al.	2412.20340	null
2024-12-28	Injecting Explainability and Lightweight Design into Weakly Supervised Video Anomaly Detection Systems	Wen-Dong Jiang et.al.	2412.20201	null
2024-12-28	SimLTD: Simple Supervised and Semi-Supervised Long-Tailed Object Detection	Phi Vu Tran et.al.	2412.20047	link
2024-12-28	Invariant debiasing learning for recommendation via biased imputation	Ting Bai et.al.	2412.20036	link
2024-12-28	Learning Adaptive and View-Invariant Vision Transformer with Multi-Teacher Knowledge Distillation for Real-Time UAV Tracking	You Wu et.al.	2412.20002	link
2024-12-27	Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis	Jiaqi Wang et.al.	2412.19654	link
2024-12-27	Feature Alignment-Based Knowledge Distillation for Efficient Compression of Large Language Models	Shuo Wang et.al.	2412.19449	null
2024-12-26	SpectralKD: Understanding and Optimizing Vision Transformer Distillation through Spectral Analysis	Huiyuan Tian et.al.	2412.19055	link
2024-12-24	HTR-JAND: Handwritten Text Recognition with Joint Attention Network and Knowledge Distillation	Mohammed Hamdan et.al.	2412.18524	null
2024-12-23	Bi-Directional Multi-Scale Graph Dataset Condensation via Information Bottleneck	Xingcheng Fu et.al.	2412.17355	link
2024-12-23	Better Knowledge Enhancement for Privacy-Preserving Cross-Project Defect Prediction	Yuying Wang et.al.	2412.17317	null
2024-12-23	LMD-PGN: Cross-Modal Knowledge Distillation from First-Person-View Images to Third-Person-View BEV Maps for Universal Point Goal Navigation	Riku Uemura et.al.	2412.17282	null
2024-12-22	Adaptive Dataset Quantization	Muquan Li et.al.	2412.16895	link
2024-12-21	CyberSentinel: Efficient Anomaly Detection in Programmable Switch using Knowledge Distillation	Sankalp Mittal et.al.	2412.16693	null
2024-12-21	STKDRec: Spatial-Temporal Knowledge Distillation for Takeaway Recommendation	Shuyuan Zhao et.al.	2412.16502	null
2024-12-21	Cross-View Consistency Regularisation for Knowledge Distillation	Weijia Zhang et.al.	2412.16493	link
2024-12-21	CBNN: 3-Party Secure Framework for Customized Binary Neural Networks Inference	Benchang Dong et.al.	2412.16449	null
2024-12-27	LiRCDepth: Lightweight Radar-Camera Depth Estimation via Knowledge Distillation and Uncertainty Guidance	Huawei Sun et.al.	2412.16380	link
2024-12-20	BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language Models	Patrick Haller et.al.	2412.15978	null
2024-12-20	A New Method to Capturing Compositional Knowledge in Linguistic Space	Jiahe Wan et.al.	2412.15632	null
2024-12-19	Uncertainty-Guided Cross Attention Ensemble Mean Teacher for Semi-supervised Medical Image Segmentation	Meghana Karri et.al.	2412.15380	null
2024-12-19	Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models	Reza Shirkavand et.al.	2412.15341	link
2024-12-19	Self-Evolution Knowledge Distillation for LLM-based Machine Translation	Yuncheng Song et.al.	2412.15303	null
2024-12-19	SCKD: Semi-Supervised Cross-Modality Knowledge Distillation for 4D Radar Object Detection	Ruoyu Xu et.al.	2412.14571	null
2024-12-19	Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models	Xiao Cui et.al.	2412.14528	link
2024-12-19	Knowledge Distillation in RNN-Attention Models for Early Prediction of Student Performance	Sukrit Leelaluk et.al.	2412.14526	link
2024-12-18	A Survey on Inference Optimization Techniques for Mixture of Experts Models	Jiacheng Liu et.al.	2412.14219	link
2024-12-18	Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective	Zhiyuan Zeng et.al.	2412.14135	null
2024-12-18	On Explaining Knowledge Distillation: Measuring and Visualising the Knowledge Transfer Process	Gereziher Adhane et.al.	2412.13943	null
2024-12-18	Learnable Prompting SAM-induced Knowledge Distillation for Semi-supervised Medical Image Segmentation	Kaiwen Huang et.al.	2412.13742	link
2024-12-18	On the Compression of Language Models for Code: An Empirical Study on CodeBERT	Giordano d’Aloisio et.al.	2412.13737	null
2024-12-18	Hybrid Data-Free Knowledge Distillation	Jialiang Tang et.al.	2412.13525	link
2024-12-17	In-Context Learning Distillation for Efficient Few-Shot Fine-Tuning	Yifei Duan et.al.	2412.13243	null
2024-12-17	Modality-Inconsistent Continual Learning of Multimodal Large Language Models	Weiguo Pian et.al.	2412.13050	null
2024-12-17	Efficient Speech Command Recognition Leveraging Spiking Neural Network and Curriculum Learning-based Knowledge Distillation	Jiaqi Wang et.al.	2412.12858	null
2024-12-17	PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts	Kun Guo et.al.	2412.12460	null
2024-12-16	Neural Collapse Inspired Knowledge Distillation	Shuoxi Zhang et.al.	2412.11788	null
2024-12-16	Relation-Guided Adversarial Learning for Data-free Knowledge Transfer	Yingping Liang et.al.	2412.11380	link
2024-12-16	BiM-VFI: directional Motion Field-Guided Frame Interpolation for Video with Non-uniform Motions	Wonyong Seo et.al.	2412.11365	null
2024-12-15	Wearable Accelerometer Foundation Models for Health via Knowledge Distillation	Salar Abbaspourazad et.al.	2412.11276	null
2024-12-15	ProFe: Communication-Efficient Decentralized Federated Learning via Distillation and Prototypes	Pedro Miguel Sánchez Sánchez et.al.	2412.11207	null
2024-12-15	Leveraging Large Language Models for Active Merchant Non-player Characters	Byungjun Kim et.al.	2412.11189	link
2024-12-15	Knowledge Migration Framework for Smart Contract Vulnerability Detection	Luqi Wang et.al.	2412.11175	null
2024-12-15	Redefining Normal: A Novel Object-Level Approach for Multi-Object Novelty Detection	Mohammadreza Salehi et.al.	2412.11148	link
2024-12-17	On Distilling the Displacement Knowledge for Few-Shot Class-Incremental Learning	Pengfei Fang et.al.	2412.11017	null
2024-12-13	Efficient Dataset Distillation via Diffusion-Driven Patch Selection for Improved Generalization	Xinhao Zhong et.al.	2412.09959	null
2024-12-13	Going Beyond Feature Similarity: Effective Dataset distillation based on Class-aware Conditional Mutual Information	Xinhao Zhong et.al.	2412.09945	link
2024-12-13	Can Students Beyond The Teacher? Distilling Knowledge from Teacher’s Bias	Jianhua Zhang et.al.	2412.09874	null
2024-12-13	ScaleOT: Privacy-utility-scalable Offsite-tuning with Dynamic LayerReplace and Selective Rank Compression	Kai Yao et.al.	2412.09812	null
2024-12-13	LLM Distillation for Efficient Few-Shot Multiple Choice Question Answering	Patrick Sutanto et.al.	2412.09807	null
2024-12-12	SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training	Dongting Hu et.al.	2412.09619	null
2024-12-12	A Theoretical Analysis of Soft-Label vs Hard-Label Training in Neural Networks	Saptarshi Mandal et.al.	2412.09579	null
2024-12-12	All You Need in Knowledge Distillation Is a Tailored Coordinate System	Junjie Zhou et.al.	2412.09388	null
2024-12-12	Optimising TinyML with Quantization and Distillation of Transformer and Mamba Models for Indoor Localisation on Edge Devices	Thanaphon Suwannaphong et.al.	2412.09289	null
2024-12-12	DASK: Distribution Rehearsing via Adaptive Style Kernel Learning for Exemplar-Free Lifelong Person Re-Identification	Kunlun Xu et.al.	2412.09224	link
2024-12-12	Multimodal Industrial Anomaly Detection by Crossmodal Reverse Distillation	Xinyue Liu et.al.	2412.08949	link
2024-12-12	Dynamic Contrastive Knowledge Distillation for Efficient Image Restoration	Yunshuai Zhou et.al.	2412.08939	link
2024-12-11	Efficient Gravitational Wave Parameter Estimation via Knowledge Distillation: A ResNet1D-IAF Approach	Xihua Zhu et.al.	2412.08672	null
2024-12-11	Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation	Jiaming Lv et.al.	2412.08139	null
2024-12-11	DAKD: Data Augmentation and Knowledge Distillation using Diffusion Models for SAR Oil Spill Segmentation	Jaeho Moon et.al.	2412.08116	null
2024-12-10	Unlocking the Potential of Reverse Distillation for Anomaly Detection	Xinyue Liu et.al.	2412.07579	link
2024-12-10	TT-MPD: Test Time Model Pruning and Distillation	Haihang Wu et.al.	2412.07114	null
2024-12-09	FM2DS: Few-Shot Multimodal Multihop Data Synthesis with Knowledge Distillation for Question Answering	Amirhossein Abaskohi et.al.	2412.07030	link
2024-12-09	U-Know-DiffPAN: An Uncertainty-aware Knowledge Distillation Diffusion Framework with Details Enhancement for PAN-Sharpening	Sungpyo Kim et.al.	2412.06243	null
2024-12-08	Enhancing Content Representation for AR Image Quality Assessment Using Knowledge Distillation	Aymen Sekhri et.al.	2412.06003	null
2024-12-07	Neighborhood Commonality-aware Evolution Network for Continuous Generalized Category Discovery	Ye Wang et.al.	2412.05573	null
2024-12-06	BEExformer: A Fast Inferencing Transformer Architecture via Binarization with Multiple Early Exits	Wazib Ansar et.al.	2412.05225	null
2024-12-06	One-shot Federated Learning via Synthetic Distiller-Distillate Communication	Junyuan Zhang et.al.	2412.05186	link
2024-12-06	CCS: Continuous Learning for Customized Incremental Wireless Sensing Services	Qunhang Fu et.al.	2412.04821	null
2024-12-06	Decomposed Distribution Matching in Dataset Condensation	Sahar Rahimi Malakshan et.al.	2412.04748	link
2024-12-05	Diffusion-Augmented Coreset Expansion for Scalable Dataset Distillation	Ali Abbasi et.al.	2412.04668	null
2024-12-05	FedDW: Distilling Weights through Consistency Optimization in Heterogeneous Federated Learning	Jiayu Liu et.al.	2412.04521	link
2024-12-05	Expanding Deep Learning-based Sensing Systems with Multi-Source Knowledge Transfer	Gaole Dai et.al.	2412.04060	null
2024-12-07	Enhancing CLIP Conceptual Embedding through Knowledge Distillation	Kuei-Chun Kao et.al.	2412.03513	null
2024-12-04	Distillation of Diffusion Features for Semantic Correspondence	Frank Fundel et.al.	2412.03512	null
2024-12-02	Mutli-View 3D Reconstruction using Knowledge Distillation	Aditya Dutt et.al.	2412.02039	link
2024-12-02	Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Model	Qianhan Feng et.al.	2412.01282	link
2024-12-01	QABISAR: Query-Article Bipartite Interactions for Statutory Article Retrieval	T. Y. S. S. Santosh et.al.	2412.00934	null
2024-12-01	Local vs. Global: Local Land-Use and Land-Cover Models Deliver Higher Quality Maps	Girmaw Abebe Tadesse et.al.	2412.00777	null
2024-11-30	Continuous Concepts Removal in Text-to-image Diffusion Models	Tingxu Han et.al.	2412.00580	null
2024-11-30	Toward Fair Graph Neural Networks Via Dual-Teacher Knowledge Distillation	Chengyu Li et.al.	2412.00382	null
2024-11-28	PP-SSL : Priority-Perception Self-Supervised Learning for Fine-Grained Recognition	ShuaiHeng Li et.al.	2412.00134	null
2024-11-28	Video Set Distillation: Information Diversification and Temporal Densification	Yinjie Zhao et.al.	2412.00111	null
2024-11-29	DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation	Zhiqiang Shen et.al.	2411.19946	link
2024-11-29	Reverse Thinking Makes LLMs Stronger Reasoners	Justin Chih-Yao Chen et.al.	2411.19865	null
2024-11-29	FairDD: Fair Dataset Distillation via Synchronized Matching	Qihang Zhou et.al.	2411.19623	null
2024-11-28	Pre-Training Graph Contrastive Masked Autoencoders are Strong Distillers for EEG	Xinxu Wei et.al.	2411.19230	null
2024-12-03	Puzzle: Distillation-Based NAS for Inference-Optimized LLMs	Akhiad Bercovich et.al.	2411.19146	null
2024-11-28	Headache to Overstock? Promoting Long-tail Items through Debiased Product Bundling	Shuo Xu et.al.	2411.19107	null
2024-11-28	Zero-shot Slot Filling in the Age of LLMs for Dialogue Systems	Mansi Rana et.al.	2411.18980	null
2024-11-27	Active Data Curation Effectively Distills Large-Scale Multimodal Models	Vishaal Udandarao et.al.	2411.18674	null
2024-11-27	Vision Mamba Distillation for Low-resolution Fine-grained Image Classification	Yao Chen et.al.	2411.17980	link
2024-11-27	Improved implicit diffusion model with knowledge distillation to estimate the spatial distribution density of carbon stock in remote sensing imagery	Zhenyu Yu et.al.	2411.17973	null
2024-11-26	Large-Scale Data-Free Knowledge Distillation for ImageNet via Multi-Resolution Data Generation	Minh-Tuan Tran et.al.	2411.17046	null
2024-11-26	Words Matter: Leveraging Individual Text Embeddings for Code Generation in CLIP Test-Time Adaptation	Shambhavi Mishra et.al.	2411.17002	link
2024-11-25	Dynamic Self-Distillation via Previous Mini-batches for Fine-tuning Small Language Models	Yao Fu et.al.	2411.16991	null
2024-11-25	Leveraging Foundation Models To learn the shape of semi-fluid deformable objects	Omar El Assal et.al.	2411.16802	null
2024-11-25	O1 Replication Journey – Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?	Zhen Huang et.al.	2411.16489	link
2024-11-25	When Babies Teach Babies: Can student knowledge sharing outperform Teacher-Guided Distillation on small datasets?	Srikrishna Iyer et.al.	2411.16487	link
2024-11-25	Learn from Foundation Model: Fruit Detection Model without Manual Annotation	Yanan Wang et.al.	2411.16196	link
2024-11-25	Beyond Task Vectors: Selective Task Arithmetic Based on Importance Metrics	Tian Bowen et.al.	2411.16139	null
2024-11-25	Ensemble Learning via Knowledge Transfer for CTR Prediction	Honghao Li et.al.	2411.16122	link
2024-11-24	Data Lineage Inference: Uncovering Privacy Vulnerabilities of Dataset Pruning	Qi Li et.al.	2411.15796	null
2024-11-23	Botfip-LLM: An Enhanced Multimodal Scientific Computing Framework Leveraging Knowledge Distillation from Large Language Models	Tianhao Chen et.al.	2411.15525	null
2024-11-23	Efficient Ternary Weight Embedding Model: Bridging Scalability and Performance	Jiayi Chen et.al.	2411.15438	link
2024-11-23	Partial Knowledge Distillation for Alleviating the Inherent Inter-Class Discrepancy in Federated Learning	Xiaoyu Gan et.al.	2411.15403	null
2024-11-22	BanglaEmbed: Efficient Sentence Embedding Models for a Low-Resource Language Using Cross-Lingual Distillation Techniques	Muhammad Rafsan Kabir et.al.	2411.15270	null
2024-11-22	RankByGene: Gene-Guided Histopathology Representation Learning Through Cross-Modal Ranking Consistency	Wentao Huang et.al.	2411.15076	null
2024-11-22	Adaptive Group Robust Ensemble Knowledge Distillation	Patrik Kenfack et.al.	2411.14984	null
2024-11-25	Information Extraction from Heterogeneous Documents without Ground Truth Labels using Synthetic Label Generation and Knowledge Distillation	Aniket Bhattacharyya et.al.	2411.14957	null
2024-11-22	Simplifying CLIP: Unleashing the Power of Large-Scale Models on Consumer-level Computers	Hongbo Liu et.al.	2411.14789	null
2024-11-22	Improving Mathematical Reasoning Capabilities of Small Language Models via Feedback-Driven Distillation	Xunyu Zhu et.al.	2411.14698	null
2024-11-21	Teaching MLPs to Master Heterogeneous Graph-Structured Knowledge for Efficient and Accurate Inference	Yunhui Liu et.al.	2411.14035	link
2024-11-21	CLFace: A Scalable and Resource-Efficient Continual Learning Framework for Lifelong Face Recognition	Md Mahedi Hasan et.al.	2411.13886	null
2024-11-20	RTSR: A Real-Time Super-Resolution Model for AV1 Compressed Content	Yuxuan Jiang et.al.	2411.13362	null
2024-11-20	Explainable LLM-driven Multi-dimensional Distillation for E-Commerce Relevance Learning	Gang Zhao et.al.	2411.13045	null
2024-11-19	Reward Modeling with Ordinal Feedback: Wisdom of the Crowd	Shang Liu et.al.	2411.12843	null
2024-11-19	Data-to-Model Distillation: Data-Efficient Learning Framework	Ahmad Sajedi et.al.	2411.12841	link
2024-11-19	What Makes a Good Dataset for Knowledge Distillation?	Logan Frank et.al.	2411.12817	null
2024-11-19	KDC-MAE: Knowledge Distilled Contrastive Mask Auto-Encoder	Maheswar Bora et.al.	2411.12270	null
2024-11-19	Just KIDDIN: Knowledge Infusion and Distillation for Detection of INdecent Memes	Rahul Garg et.al.	2411.12174	null
2024-11-18	Distill the Best, Ignore the Rest: Improving Dataset Distillation with Loss-Value-Based Pruning	Brian B. Moser et.al.	2411.12115	link
2024-11-18	Dataset Distillers Are Good Label Denoisers In the Wild	Lechao Cheng et.al.	2411.11924	link
2024-11-18	Federated Incremental Named Entity Recognition	Duzhen Zhang et.al.	2411.11623	link
2024-11-18	Color-Oriented Redundancy Reduction in Dataset Distillation	Bowen Yuan et.al.	2411.11329	link
2024-11-17	Map-Free Trajectory Prediction with Map Distillation and Hierarchical Encoding	Xiaodong Liu et.al.	2411.10961	null
2024-11-16	Hybrid Attention Model Using Feature Decomposition and Knowledge Distillation for Glucose Forecasting	Ebrahim Farahmand et.al.	2411.10703	link
2024-11-16	Multi-perspective Contrastive Logit Distillation	Qi Wang et.al.	2411.10693	null
2024-11-16	Exploring Feature-based Knowledge Distillation For Recommender System: A Frequency Perspective	Zhangchi Zhu et.al.	2411.10676	link
2024-11-15	Evidential Federated Learning for Skin Lesion Image Classification	Rutger Hendrix et.al.	2411.10071	null
2024-11-14	VPBSD:Vessel-Pattern-Based Semi-Supervised Distillation for Efficient 3D Microscopic Cerebrovascular Segmentation	Xi Lin et.al.	2411.09567	null
2024-11-14	BEARD: Benchmarking the Adversarial Robustness for Dataset Distillation	Zheng Zhou et.al.	2411.09265	link
2024-11-14	Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching	Yuran Wang et.al.	2411.09151	null
2024-11-14	Toward Democratized Generative AI in Next-Generation Mobile Edge Networks	Ruichen Zhang et.al.	2411.09148	null
2024-11-14	SCAN: Bootstrapping Contrastive Pre-training for Data Efficiency	Yangyang Guo et.al.	2411.09126	link
2024-11-13	Dual-Head Knowledge Distillation: Enhancing Logits Utilization with an Auxiliary Head	Penghui Yang et.al.	2411.08937	null
2024-11-13	UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation	Chengyuan Zhang et.al.	2411.08569	null
2024-11-13	Federated Graph Learning with Graphless Clients	Xingbo Fu et.al.	2411.08374	null
2024-11-12	Joint Diffusion models in Continual Learning	Paweł Skierś et.al.	2411.08224	null
2024-11-12	Learning with Less: Knowledge Distillation from Large Language Models via Unlabeled Data	Juanhui Li et.al.	2411.08028	null
2024-11-13	Query Optimization for Parametric Knowledge Refinement in Retrieval-Augmented Large Language Models	Youan Cong et.al.	2411.07820	null
2024-11-12	Robust Offline Reinforcement Learning for Non-Markovian Decision Processes	Ruiquan Huang et.al.	2411.07514	null
2024-11-13	Feature Interaction Fusion Self-Distillation Network For CTR Prediction	Lei Sang et.al.	2411.07508	null
2024-11-12	Quantifying Knowledge Distillation Using Partial Information Decomposition	Pasan Dissanayake et.al.	2411.07483	null
2024-11-08	Multi-Document Financial Question Answering using LLMs	Shalin Shah et.al.	2411.07264	null
2024-11-11	SAMPart3D: Segment Any Part in 3D Objects	Yunhan Yang et.al.	2411.07184	link
2024-11-11	LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models	Runming Yang et.al.	2411.06839	null
2024-11-11	ScaleKD: Strong Vision Transformers Could Be Excellent Teachers	Jiawei Fan et.al.	2411.06786	link
2024-11-11	An Efficient Memory Module for Graph Few-Shot Class-Incremental Learning	Dong Li et.al.	2411.06659	link
2024-11-10	CULL-MT: Compression Using Language and Layer pruning for Machine Translation	Pedram Rostami et.al.	2411.06506	null
2024-11-10	Over-parameterized Student Model via Tensor Decomposition Boosted Knowledge Distillation	Yu-Liang Zhan et.al.	2411.06448	link
2024-11-09	Dynamic Textual Prompt For Rehearsal-free Lifelong Person Re-identification	Hongyu Chen et.al.	2411.06023	null
2024-11-09	Multi-hop RIS-aided Learning Model Sharing for Urban Air Mobility	Kai Xiong et.al.	2411.06015	null
2024-11-08	Mitigating Hallucination with ZeroG: An Advanced Knowledge Management Engine	Anantha Sharma et.al.	2411.05936	null
2024-11-08	*Asterisk: Keep it Simple**	Andrew Semenov et.al.	2411.05691	null
2024-11-08	Knowledge Distillation Neural Network for Predicting Car-following Behaviour of Human-driven and Autonomous Vehicles	Ayobami Adewale et.al.	2411.05618	null
2024-11-08	Towards Lifelong Few-Shot Customization of Text-to-Image Diffusion	Nan Song et.al.	2411.05544	null
2024-11-07	Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale	Flavio Di Palo et.al.	2411.05045	null
2024-11-07	Towards Competitive Search Relevance For Inference-Free Learned Sparse Retrievers	Zhichao Geng et.al.	2411.04403	null
2024-11-07	GazeGen: Gaze-Driven User Interaction for Visual Content Generation	He-Yen Hsieh et.al.	2411.04335	null
2024-11-06	Towards Personalized Federated Learning via Comprehensive Knowledge Distillation	Pengju Wang et.al.	2411.03569	null
2024-11-05	Transformer-Based Fault-Tolerant Control for Fixed-Wing UAVs Using Knowledge Distillation and In-Context Adaptation	Francisco Giral et.al.	2411.02975	null
2024-11-05	Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery	Bowei Du et.al.	2411.02861	null
2024-11-05	Brewing Vodka: Distilling Pure Knowledge for Lightweight Threat Detection in Audit Logs	Weiheng Wu et.al.	2411.02775	null
2024-11-05	Multimodal Commonsense Knowledge Distillation for Visual Question Answering	Shuo Yang et.al.	2411.02722	null
2024-11-04	Training on the Test Model: Contamination in Ranking Distillation	Vishakha Suresh Kalal et.al.	2411.02284	link
2024-11-03	Decoupling Dark Knowledge via Block-wise Logit Distillation for Feature-level Alignment	Chengting Yu et.al.	2411.01547	null
2024-11-01	On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance	Jaskirat Singh et.al.	2411.00907	null
2024-10-30	The Graph’s Apprentice: Teaching an LLM Low Level Knowledge for Circuit Quality Estimation	Reza Moravej et.al.	2411.00843	null
2024-10-29	Unsupervised Training of a Dynamic Context-Aware Deep Denoising Framework for Low-Dose Fluoroscopic Imaging	Sun-Young Jeon et.al.	2411.00830	link
2024-11-01	Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation	Bohan Lyu et.al.	2411.00412	null
2024-11-01	Towards Building Secure UAV Navigation with FHE-aware Knowledge Distillation	Arjun Ramesh Kaushik et.al.	2411.00403	null
2024-10-31	Semantic Knowledge Distillation for Onboard Satellite Earth Observation Image Classification	Thanh-Dung Le et.al.	2411.00209	link
2024-10-30	Larger models yield better results? Streamlined severity classification of ADHD-related concerns using BERT-based knowledge distillation	Ahmed Akib Jawad Karim et.al.	2411.00052	null
2024-10-30	IP-MOT: Instance Prompt Learning for Cross-Domain Multi-Object Tracking	Run Luo et.al.	2410.23907	null
2024-10-28	Unveiling Context-Aware Criteria in Self-Assessing LLMs	Taneesh Gupta et.al.	2410.21545	null
2024-10-28	Knowledge Distillation for Real-Time Classification of Early Media in Voice Communications	Kemal Altwlkany et.al.	2410.21478	null
2024-10-28	Less is More: Efficient Time Series Dataset Condensation via Two-fold Modal Matching–Extended Version	Hao Miao et.al.	2410.20905	null
2024-10-28	Deep Learning for Medical Text Processing: BERT Model Fine-Tuning and Comparative Study	Jiacheng Hu et.al.	2410.20792	null
2024-10-28	KD-LoRA: A Hybrid Approach to Efficient Fine-Tuning with LoRA and Knowledge Distillation	Rambod Azimi et.al.	2410.20777	link
2024-10-28	Data-Efficient Low-Complexity Acoustic Scene Classification via Distilling and Progressive Pruning	Bing Han et.al.	2410.20775	null
2024-10-28	Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA	Sangmin Bae et.al.	2410.20672	null
2024-10-28	FLiP: Privacy-Preserving Federated Learning based on the Principle of Least Privileg	ShiMao Xu et.al.	2410.19548	null
2024-10-25	SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models	Jahyun Koo et.al.	2410.19503	null
2024-10-24	AlignCap: Aligning Speech Emotion Captioning to Human Preferences	Ziqi Liang et.al.	2410.19134	null
2024-10-24	High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling Laws	M. Emrullah Ildiz et.al.	2410.18837	null
2024-10-24	Knowledge Distillation Using Frontier Open-source LLMs: Generalizability and the Role of Synthetic Data	Anup Shirgaonkar et.al.	2410.18588	null
2024-10-24	SIKeD: Self-guided Iterative Knowledge Distillation for mathematical reasoning	Shivam Adarsh et.al.	2410.18574	link
2024-10-23	ELAICHI: Enhancing Low-resource TTS by Addressing Infrequent and Low-frequency Character Bigrams	Srija Anand et.al.	2410.17901	null
2024-10-23	Towards Active Participant-Centric Vertical Federated Learning: Some Representations May Be All You Need	Jon Irureta et.al.	2410.17648	null
2024-10-23	Towards Effective Data-Free Knowledge Distillation via Diverse Diffusion Augmentation	Muquan Li et.al.	2410.17606	link
2024-10-23	Physics-driven AI for Channel Estimation in Cellular Network	Xiaoqian Qi et.al.	2410.17525	null
2024-10-22	MiniPLM: Knowledge Distillation for Pre-Training Language Models	Yuxian Gu et.al.	2410.17215	link
2024-10-22	Emphasizing Discriminative Features for Dataset Distillation in Complex Scenarios	Kai Wang et.al.	2410.17193	link
2024-10-22	CK4Gen: A Knowledge Distillation Framework for Generating High-Utility Synthetic Survival Datasets in Healthcare	Nicholas I-Hsien Kuo et.al.	2410.16872	null
2024-10-22	AttriPrompter: Auto-Prompting with Attribute Semantics for Zero-shot Nuclei Detection via Visual-Language Pre-trained Models	Yongjian Wu et.al.	2410.16820	link
2024-10-22	SafetyAnalyst: Interpretable, transparent, and steerable LLM safety moderation	Jing-Jing Li et.al.	2410.16665	null

Synthetic Data Generation

Publish Date	Title	Authors	PDF	Code
2025-07-23	On the Interaction of Compressibility and Adversarial Robustness	Melih Barsbey et.al.	2507.17725	null
2025-07-23	Generalized Dual Discriminator GANs	Penukonda Naga Chandana et.al.	2507.17684	null
2025-07-23	Synthetic Voice Data for Automatic Speech Recognition in African Languages	Brian DeRenzi et.al.	2507.17578	null
2025-07-23	RoadBench: A Vision-Language Foundation Model and Benchmark for Road Damage Understanding	Xi Xiao et.al.	2507.17353	null
2025-07-23	Dataset Distillation as Data Compression: A Rate-Utility Perspective	Youneng Bao et.al.	2507.17221	null
2025-07-22	Risk In Context: Benchmarking Privacy Leakage of Foundation Models in Synthetic Tabular Data Generation	Jessup Byun et.al.	2507.17066	null
2025-07-22	Bringing Balance to Hand Shape Classification: Mitigating Data Imbalance Through Generative Models	Gaston Gustavo Rios et.al.	2507.17008	null
2025-07-22	Leveraging Synthetic Data for Question Answering with Multilingual LLMs in the Agricultural Domain	Rishemjit Kaur et.al.	2507.16974	null
2025-07-22	A linear PDF model for Bayesian inference	Mark N. Costantini et.al.	2507.16913	null
2025-07-23	Agentar-Fin-R1: Enhancing Financial Intelligence through Domain Expertise, Training Efficiency, and Advanced Reasoning	Yanjun Zheng et.al.	2507.16802	null
2025-07-22	Enhancing Domain Diversity in Synthetic Data Face Recognition with Dataset Fusion	Anjith George et.al.	2507.16790	null
2025-07-22	Task-Specific Zero-shot Quantization-Aware Training for Object Detection	Changhao Li et.al.	2507.16782	null
2025-07-22	Denoising-While-Completing Network (DWCNet): Robust Point Cloud Completion Under Corruption	Keneni W. Tesema et.al.	2507.16743	null
2025-07-22	Synthetic Data Matters: Re-training with Geo-typical Synthetic Labels for Building Detection	Shuang Song et.al.	2507.16657	null
2025-07-22	ACT: Bridging the Gap in Code Translation through Synthetic Data Generation & Adaptive Training	Shreya Saxena et.al.	2507.16478	null
2025-07-22	Improving Predictions on Highly Unbalanced Data Using Open Source Synthetic Data Upsampling	Ivona Krchova et.al.	2507.16419	null
2025-07-22	Towards Railway Domain Adaptation for LiDAR-based 3D Detection: Road-to-Rail and Sim-to-Real via SynDRA-BBox	Xavier Diaz et.al.	2507.16413	null
2025-07-22	STAR: A Benchmark for Astronomical Star Fields Super-Resolution	Kuo-Cheng Wu et.al.	2507.16385	null
2025-07-22	Robust Bioacoustic Detection via Richly Labelled Synthetic Soundscape Augmentation	Kaspar Soltero et.al.	2507.16235	null
2025-07-22	LENS-DF: Deepfake Detection and Temporal Localization for Long-Form Noisy Speech	Xuechen Liu et.al.	2507.16220	null
2025-07-21	FASTGEN: Fast and Cost-Effective Synthetic Tabular Data Generation with LLMs	Anh Nguyen et.al.	2507.15839	null
2025-07-21	The Win Ratio at the Design Stage of Clinical Trials	David Kronthaler et.al.	2507.15685	null
2025-07-21	Missing value imputation with adversarial random forests – MissARF	Pegah Golchian et.al.	2507.15681	null
2025-07-21	Accelerating HEC-RAS: A Recurrent Neural Operator for Rapid River Forecasting	Edward Holmberg et.al.	2507.15614	null
2025-07-21	Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos	Hao Luo et.al.	2507.15597	null
2025-07-21	Chart-R1: Chain-of-Thought Supervision and Reinforcement for Advanced Chart Reasoner	Lei Chen et.al.	2507.15509	null
2025-07-21	ASPERA: A Simulated Environment to Evaluate Planning for Complex Action Execution	Alexandru Coca et.al.	2507.15501	null
2025-07-21	Bayesian Surface Wave Inversion for 3D Shear Wave Velocity Structure Beneath the British Isles: comparing Direct-3D Variational Inversion to Two-step (2D+1D) Inversion Methods	Xuebin Zhao et.al.	2507.15390	null
2025-07-21	Learning to Gridize: Segment Physical World by Wireless Communication Channel	Juntao Wang et.al.	2507.15386	null
2025-07-21	DAViD: Data-efficient and Accurate Vision Models from Synthetic Data	Fatemeh Saleh et.al.	2507.15365	null
2025-07-21	Latent Space Synergy: Text-Guided Data Augmentation for Direct Diffusion Biomedical Segmentation	Muhammad Aqeel et.al.	2507.15361	null
2025-07-21	RAD: Retrieval High-quality Demonstrations to Enhance Decision-making	Lu Guo et.al.	2507.15356	null
2025-07-21	BEAM-Net: A Deep Learning Framework with Bone Enhancement Attention Mechanism for High Resolution High Frame Rate Ultrasound Beamforming	Midhila Madhusoodanan et.al.	2507.15306	null
2025-07-21	Hierarchical Part-based Generative Model for Realistic 3D Blood Vessel	Siqi Chen et.al.	2507.15223	null
2025-07-21	Asymptotic Optimality in Data-driven Decision Making	Radek Salač et.al.	2507.15215	null
2025-07-18	Cross-modal Causal Intervention for Alzheimer’s Disease Prediction	Yutao Jin et.al.	2507.13956	null
2025-07-18	Novel techniques of imaging interferometry analysis to study gas and plasma density for laser-plasma experiments	F. Filippi et.al.	2507.13907	null
2025-07-18	Off-Policy Evaluation and Learning for Matching Markets	Yudai Hayashi et.al.	2507.13608	null
2025-07-17	Apple Intelligence Foundation Language Models: Tech Report 2025	Hanzhi Zhou et.al.	2507.13575	null
2025-07-17	Provable Low-Frequency Bias of In-Context Learning of Representations	Yongyi Yang et.al.	2507.13540	null
2025-07-17	Addressing the ML Domain Adaptation Problem for Networking: Realistic and Controllable Training Data Generation with NetReplica	Jaber Daneshamooz et.al.	2507.13476	null
2025-07-17	The Imitation Game: Turing Machine Imitator is Length Generalizable Reasoner	Zhouqi Hua et.al.	2507.13332	null
2025-07-17	Optimal Empirical Risk Minimization under Temporal Distribution Shifts	Yujin Jeong et.al.	2507.13287	null
2025-07-17	Trace Reconstruction with Language Models	Franziska Weindel et.al.	2507.12927	null
2025-07-16	Infherno: End-to-end Agent-based FHIR Resource Synthesis from Free-form Clinical Notes	Johann Frei et.al.	2507.12261	null
2025-07-16	Translationese-index: Using Likelihood Ratios for Graded and Generalizable Measurement of Translationese	Yikang Liu et.al.	2507.12260	null
2025-07-16	CosmoFlow: Scale-Aware Representation Learning for Cosmology with Flow Matching	Sidharth Kannan et.al.	2507.11842	null
2025-07-15	Fragment size density estimator for shrinkage-induced fracture based on a physics-informed neural network	Shin-ichi Ito et.al.	2507.11799	null
2025-07-15	Enforcing Latent Euclidean Geometry in Single-Cell VAEs for Manifold Interpolation	Alessandro Palma et.al.	2507.11789	null
2025-07-15	Fiducial Matching: Differentially Private Inference for Categorical Data	Ogonnaya Michael Romanus et.al.	2507.11762	null
2025-07-15	Auto-Formulating Dynamic Programming Problems with Large Language Models	Chenyu Zhou et.al.	2507.11737	null
2025-07-15	Synthetic Tabular Data Generation: A Comparative Survey for Modern Techniques	Raju Challagundla et.al.	2507.11590	null
2025-07-15	3C-FBI: A Combinatorial method using Convolutions for Circle Fitting in Blurry Images	Esteban Román Catafau et.al.	2507.11476	null
2025-07-15	Implementing Adaptations for Vision AutoRegressive Model	Kaif Shaikh et.al.	2507.11441	null
2025-07-15	The miniJPAS survey quasar selection V: combined algorithm	Ignasi Pérez-Ràfols et.al.	2507.11380	null
2025-07-15	Attributes Shape the Embedding Space of Face Recognition Models	Pierrick Leroy et.al.	2507.11372	null
2025-07-15	A Review of Privacy Metrics for Privacy-Preserving Synthetic Data Generation	Frederik Marinus Trudslev et.al.	2507.11324	null
2025-07-15	An Explainable AI-Enhanced Machine Learning Approach for Cardiovascular Disease Detection and Risk Assessment	Md. Emon Akter Sourov et.al.	2507.11185	null
2025-07-15	Standards-Compliant DM-RS Allocation via Temporal Channel Prediction for Massive MIMO Systems	Sehyun Ryu et.al.	2507.11064	null
2025-07-15	Scalable Variational Inference for Multinomial Probit Models under Large Choice Sets and Sample Sizes	Gyeongjun Kim et.al.	2507.10945	null
2025-07-15	Trexplorer Super: Topologically Correct Centerline Tree Tracking of Tubular Objects in CT Volumes	Roman Naeem et.al.	2507.10881	null
2025-07-14	First-of-its-kind AI model for bioacoustic detection using a lightweight associative memory Hopfield neural network	Andrew Gascoyne et.al.	2507.10642	null
2025-07-14	SynthGuard: Redefining Synthetic Data Generation with a Scalable and Privacy-Preserving Workflow Framework	Eduardo Brito et.al.	2507.10489	null
2025-07-14	Straighten Viscous Rectified Flow via Noise Optimization	Jimin Dai et.al.	2507.10218	null
2025-07-14	Minimizing the Pretraining Gap: Domain-aligned Text-Based Person Retrieval	Shuyu Yang et.al.	2507.10195	null
2025-07-14	Cyclic Multichannel Wiener Filter for Acoustic Beamforming	Giovanni Bologni et.al.	2507.10159	null
2025-07-14	Simulating Biases for Interpretable Fairness in Offline and Online Classifiers	Ricardo Inácio et.al.	2507.10154	null
2025-07-14	Observational mapping of the mass discrepancy in eclipsing binaries. A new self-contained framework for concurrent analysis of photometric and spectroscopic time series	Nadya Serebriakova et.al.	2507.10096	null
2025-07-14	Towards High Supervised Learning Utility Training Data Generation: Data Pruning and Column Reordering	Tung Sum Thomas Kwok et.al.	2507.10088	null
2025-07-14	Iceberg: Enhancing HLS Modeling with Synthetic Data	Zijian Ding et.al.	2507.09948	null
2025-07-14	The Rosetta Stone Project. II. The correlation between SFE and L/M indicator for the evolutionary stages of star-forming clumps in post-processed RMHD simulations	Ngo-Duy Tung et.al.	2507.09936	null
2025-07-12	GreenCrossingAI: A Camera Trap/Computer Vision Pipeline for Environmental Science Research Groups	Bernie Boscoe et.al.	2507.09410	null
2025-07-12	Fair CCA for Fair Representation Learning: An ADNI Study	Bojian Hou et.al.	2507.09382	null
2025-07-12	A deep learning approach to multi-marginal optimal transport via Hilbert space embeddings of probability measures	Yumiharu Nakano et.al.	2507.09206	null
2025-07-12	Learning from Synthetic Labs: Language Models as Auction Participants	Anand Shah et.al.	2507.09083	null
2025-07-11	A monotone single index model for spatially-referenced multistate current status data	Snigdha Das et.al.	2507.09057	null
2025-07-11	Learning Diffusion Models with Flexible Representation Guidance	Chenyu Wang et.al.	2507.08980	null
2025-07-11	DatasetAgent: A Novel Multi-Agent System for Auto-Constructing Datasets from Real-World Images	Haoran Sun et.al.	2507.08648	null
2025-07-11	Nonparametric predictive inference for discrete data via Metropolis-adjusted Dirichlet sequences	Davide Agnoletto et.al.	2507.08629	null
2025-07-11	Space filling positionality and the Spiroformer	M. Maurin et.al.	2507.08456	null
2025-07-11	Lightweight Safety Guardrails via Synthetic Data and RL-guided Adversarial Training	Aleksei Ilin et.al.	2507.08284	null
2025-07-11	Data Generation without Function Estimation	Hadi Daneshmand et.al.	2507.08239	null
2025-07-10	Can AI-predicted complexes teach machine learning to compute drug binding affinity?	Wei-Tse Hsu et.al.	2507.07882	null
2025-07-11	Single-Step Latent Diffusion for Underwater Image Restoration	Jiayi Wu et.al.	2507.07878	null
2025-07-10	A Unified Empirical Risk Minimization Framework for Flexible N-Tuples Weak Supervision	Shuying Huang et.al.	2507.07771	null
2025-07-10	Beyond Connectivity: Higher-Order Network Framework for Capturing Memory-Driven Mobility Dynamics	Chen Zhang et.al.	2507.07727	null
2025-07-11	Learning Pole Structures of Hadronic States using Predictive Uncertainty Estimation	Felix Frohnert et.al.	2507.07668	null
2025-07-13	Scalable Signed Exponential Random Graph Models under Local Dependence	Marc Schalberger et.al.	2507.07660	null
2025-07-10	Stable-Hair v2: Real-World Hair Transfer via Multiple-View Diffusion Model	Kuiyuan Sun et.al.	2507.07591	null
2025-07-10	PacGDC: Label-Efficient Generalizable Depth Completion with Projection Ambiguity and Consistency	Haotian Wang et.al.	2507.07374	null
2025-07-10	Learning from positive and unlabeled examples -Finite size sample bounds	Farnam Mansouri et.al.	2507.07354	null
2025-07-09	SynthTextEval: Synthetic Text Data Generation and Evaluation for High-Stakes Domains	Krithika Ramesh et.al.	2507.07229	null
2025-07-09	Reading a Ruler in the Wild	Yimu Pan et.al.	2507.07077	null
2025-07-09	Scaling Towards the Information Boundary of Instruction Set: InfinityInstruct-Subject Technical Report	Li Du et.al.	2507.06968	null
2025-07-09	Dataset and Benchmark for Enhancing Critical Retained Foreign Object Detection	Yuli Wang et.al.	2507.06937	null
2025-07-11	Squeeze the Soaked Sponge: Efficient Off-policy Reinforcement Finetuning for Large Language Model	Jing Liang et.al.	2507.06892	null
2025-07-09	Horizontal and Vertical Federated Causal Structure Learning via Higher-order Cumulants	Wei Chen et.al.	2507.06888	null
2025-07-09	Finetuning Vision-Language Models as OCR Systems for Low-Resource Languages: A Case Study of Manchu	Yan Hon Michael Chung et.al.	2507.06761	null
2025-07-09	Mathematical artificial data for operator learning	Heng Wu et.al.	2507.06752	null
2025-07-09	MADPOT: Medical Anomaly Detection with CLIP Adaptation and Partial Optimal Transport	Mahshid Shiri et.al.	2507.06733	null
2025-07-09	Generalization in Reinforcement Learning for Radio Access Networks	Burak Demirel et.al.	2507.06602	null
2025-07-09	3D-Generalist: Self-Improving Vision-Language-Action Models for Crafting 3D Worlds	Fan-Yun Sun et.al.	2507.06484	null
2025-07-09	Generative Lagrangian data assimilation for ocean dynamics under extreme sparsity	Niloofar Asefi et.al.	2507.06479	null
2025-07-08	SImpHAR: Advancing impedance-based human activity recognition using 3D simulation and text-to-motion models	Lala Shakti Swarup Ray et.al.	2507.06405	null
2025-07-08	Mitigating Multi-Sequence 3D Prostate MRI Data Scarcity through Domain Adaptation using Locally-Trained Latent Diffusion Models for Prostate Cancer Detection	Emerson P. Grabke et.al.	2507.06384	null
2025-07-08	CultureCLIP: Empowering CLIP with Cultural Awareness through Synthetic Images and Contextualized Captions	Yuchen Huang et.al.	2507.06210	null
2025-07-08	The Delta Learning Hypothesis: Preference Tuning on Weak Data can Yield Strong Gains	Scott Geng et.al.	2507.06187	null
2025-07-08	Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis	Xintong Hu et.al.	2507.06116	null
2025-07-08	Taming Data Challenges in ML-based Security Tasks: Lessons from Integrating Generative AI	Shravya Kanchi et.al.	2507.06092	null
2025-07-08	DocIE@XLLM25: In-Context Learning for Information Extraction using Fully Synthetic Demonstrations	Nicholas Popovič et.al.	2507.05997	null
2025-07-09	Empowering Bridge Digital Twins by Bridging the Data Gap with a Unified Synthesis Framework	Wang Wang et.al.	2507.05814	null
2025-07-08	Self-Review Framework for Enhancing Instruction Following Capability of LLM	Sihyun Park et.al.	2507.05598	null
2025-07-07	W2W: A Simulated Exploration of IMU Placement Across the Human Body for Designing Smarter Wearable	Lala Shakti Swarup Ray et.al.	2507.05532	null
2025-07-07	Statistical inference of anomalous thermal transport with uncertainty quantification for interpretive 2-D SOL models	Yichen Fu et.al.	2507.05413	null
2025-07-07	$\varphi$ -Adapt: A Physics-Informed Adaptation Learning Approach to 2D Quantum Material Discovery	Hoang-Quan Nguyen et.al.	2507.05184	null
2025-07-07	Deep Learning to Automate Parameter Extraction and Model Fitting of Two-Dimensional Transistors	Robert K. A. Bennett et.al.	2507.05134	null
2025-07-07	Estimating Object Physical Properties from RGB-D Vision and Depth Robot Sensors Using Deep Learning	Ricardo Cardoso et.al.	2507.05029	null
2025-07-07	A Generative Diffusion Model for Amorphous Materials	Kai Yang et.al.	2507.05024	null
2025-07-07	Computing Largest Subsets of Points Whose Convex Hulls have Bounded Area and Diameter	Gianmarco Picarella et.al.	2507.04933	null
2025-07-07	Building Open-Retrieval Conversational Question Answering Systems by Generating Synthetic Data and Decontextualizing User Questions	Christos Vlachos et.al.	2507.04884	null
2025-07-07	Efficacy of Image Similarity as a Metric for Augmenting Small Dataset Retinal Image Segmentation	Thomas Wallace et.al.	2507.04862	null
2025-07-07	Efficient SAR Vessel Detection for FPGA-Based On-Satellite Sensing	Colin Laganier et.al.	2507.04842	null
2025-07-07	TeethGenerator: A two-stage framework for paired pre- and post-orthodontic 3D dental data generation	Changsong Lei et.al.	2507.04685	null
2025-07-06	Grounded Gesture Generation: Language, Motion, and Space	Anna Deichler et.al.	2507.04522	null
2025-07-05	T-SYNTH: A Knowledge-Based Dataset of Synthetic Breast Images	Christopher Wiedeman et.al.	2507.04038	null
2025-07-05	LLMThinkBench: Towards Basic Math Reasoning and Overthinking in Large Language Models	Gaurav Srivastava et.al.	2507.04023	null
2025-07-05	Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents	Ziyang Miao et.al.	2507.04009	null
2025-07-05	Exploring a Gamified Personality Assessment Method through Interaction with Multi-Personality LLM Agents	Baiqiao Zhang et.al.	2507.04005	null
2025-07-05	NRSeg: Noise-Resilient Learning for BEV Semantic Segmentation via Driving World Models	Siyu Li et.al.	2507.04002	null
2025-07-04	Optimizing Start Locations in Ergodic Search for Disaster Response	Ananya Rao et.al.	2507.02708	null
2025-07-07	AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal Large Language Models	Ziyin Zhou et.al.	2507.02664	null
2025-07-06	Alleviating Attack Data Scarcity: SCANIA’s Experience Towards Enhancing In-Vehicle Cyber Security Measures	Frida Sundfeldt et.al.	2507.02607	null
2025-07-03	Scaling LLM Planning: NL2FLOW for Parametric Problem Generation and Rigorous Evaluation	Jungkoo Kang et.al.	2507.02253	null
2025-07-03	Understanding Trade offs When Conditioning Synthetic Data	Brandon Trabucco et.al.	2507.02217	null
2025-07-03	Hybrid least squares for learning functions from highly noisy data	Ben Adcock et.al.	2507.02215	null
2025-07-02	Do Role-Playing Agents Practice What They Preach? Belief-Behavior Consistency in LLM-Based Simulations of Human Trust	Amogh Mannekote et.al.	2507.02197	null
2025-07-02	Data Diversification Methods In Alignment Enhance Math Performance In LLMs	Berkan Dokmeci et.al.	2507.02173	null
2025-07-02	Underwater Monocular Metric Depth Estimation: Real-World Benchmarks and Synthetic Fine-Tuning	Zijie Cai et.al.	2507.02148	null
2025-07-02	AI-Empowered Channel Generation for IoV Semantic Communications in Dynamic Conditions	Hao Liu et.al.	2507.02013	null
2025-07-02	IC-Custom: Diverse Image Customization via In-Context Learning	Yaowei Li et.al.	2507.01926	null
2025-07-02	Measurement-based Evaluation of CNN-based Detection and Estimation for ISAC Systems	Steffen Schieler et.al.	2507.01799	null
2025-07-02	Chargax: A JAX Accelerated EV Charging Simulator	Koen Ponse et.al.	2507.01522	null
2025-07-02	Symbolic identification of tensor equations in multidimensional physical fields	Tianyi Chen et.al.	2507.01466	null
2025-07-01	The hunt for research data: Development of an open-source workflow for tracking institutionally-affiliated research data publications	Bryan M. Gee et.al.	2507.01228	null
2025-07-01	Emerging Activity Temporal Hypergraph (EATH), a model for generating realistic time-varying hypergraphs	Marco Mancastroppa et.al.	2507.01124	null
2025-07-01	VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers	Yating Wang et.al.	2507.01016	null
2025-07-01	HumanoidGen: Data Generation for Bimanual Dexterous Manipulation via LLM Reasoning	Zhi Jing et.al.	2507.00833	null
2025-07-01	Instant Particle Size Distribution Measurement Using CNNs Trained on Synthetic Data	Yasser El Jarida et.al.	2507.00822	null
2025-07-01	Comparing Misspecified Models with Big Data: A Variational Bayesian Perspective	Yong Li et.al.	2507.00763	null
2025-07-01	Decentralized Pliable Index Coding For Federated Learning In Intelligent Transportation Systems	Sadina Kadakkottiri et.al.	2507.00643	null
2025-07-01	TeamCMU at Touché: Adversarial Co-Evolution for Advertisement Integration and Detection in Conversational Search	To Eun Kim et.al.	2507.00509	null
2025-07-01	Learning Dense Feature Matching via Lifting Single 2D Image to 3D Space	Yingping Liang et.al.	2507.00392	null
2025-06-30	Towards 3D Semantic Image Synthesis for Medical Imaging	Wenwu Tang et.al.	2507.00206	null
2025-06-30	FADRM: Fast and Accurate Data Residual Matching for Dataset Distillation	Jiacheng Cui et.al.	2506.24125	null
2025-06-30	Development of Hybrid Artificial Intelligence Training on Real and Synthetic Data: Benchmark on Two Mixed Training Strategies	Paul Wachter et.al.	2506.24093	null
2025-06-30	TaP: A Taxonomy-Guided Framework for Automated and Scalable Preference Data Generation	Renren Jin et.al.	2506.23979	null
2025-06-30	Learning Constraints Directly from Network Data	Hongyu Hè et.al.	2506.23964	null
2025-06-30	Predicting Instabilities in Transient Landforms and Interconnected Ecosystems	Taylor Smith et.al.	2506.23946	null
2025-06-30	Large Language Models for Statistical Inference: Context Augmentation with Applications to the Two-Sample Problem and Regression	Marc Ratkovic et.al.	2506.23862	null
2025-06-30	Differentially Private Synthetic Data Release for Topics API Outputs	Travis Dick et.al.	2506.23855	null
2025-06-30	MadCLIP: Few-shot Medical Anomaly Detection with CLIP	Mahshid Shiri et.al.	2506.23810	null
2025-06-30	Adaptive Out-of-Control Point Pattern Detection in Sequential Random Finite Set Observations	Konstantinos Bourazas et.al.	2506.23802	null
2025-06-30	Advancing Learnable Multi-Agent Pathfinding Solvers with Active Fine-Tuning	Anton Andreychuk et.al.	2506.23793	null
2025-06-30	Spatio-Temporal Representation Decoupling and Enhancement for Federated Instrument Segmentation in Surgical Videos	Zheng Fang et.al.	2506.23759	null
2025-06-30	Can We Challenge Open-Vocabulary Object Detectors with Generated Content in Street Scenes?	Annika Mütze et.al.	2506.23751	null
2025-06-30	A New Perspective On AI Safety Through Control Theory Methodologies	Lars Ullrich et.al.	2506.23703	null
2025-06-30	Diffusion Model-based Data Augmentation Method for Fetal Head Ultrasound Segmentation	Fangyijie Wang et.al.	2506.23664	null
2025-06-30	Modelling effective electrical resistance in particle reinforced composites using Generative Adversarial Network	Vinit Vijay Deshpande et.al.	2506.23655	null
2025-06-27	Optimal Estimation of Watermark Proportions in Hybrid AI-Human Texts	Xiang Li et.al.	2506.22343	null
2025-06-27	A Deep Learning framework for building damage assessment using VHR SAR and geospatial data: demonstration on the 2023 Turkiye Earthquake	Luigi Russo et.al.	2506.22338	null
2025-06-27	Evaluating Scoring Bias in LLM-as-a-Judge	Qingquan Li et.al.	2506.22316	null
2025-06-27	Hybrid Generative Modeling for Incomplete Physics: Deep Grey-Box Meets Optimal Transport	Gurjeet Sangra Singh et.al.	2506.22204	null
2025-06-27	Binned semiparametric Bayesian networks	Rafael Sojo et.al.	2506.21997	null
2025-06-27	Physics-informed network paradigm with data generation and background noise removal for diverse distributed acoustic sensing applications	Yangyang Wan et.al.	2506.21952	null
2025-06-26	Inverse scattering without phase: Carleman convexification and phase retrieval via the Wentzel–Kramers–Brillouin approximation	Thuy T. Le et.al.	2506.21699	null
2025-06-26	Bayesian Modeling for Aggregated Relational Data: A Unified Perspective	Owen G. Ward et.al.	2506.21353	null
2025-06-26	Real-time Terrain Analysis for Off-road Autonomous Vehicles	Edwina Lewis et.al.	2506.21347	null
2025-06-27	ReME: A Data-Centric Framework for Training-Free Open-Vocabulary Segmentation	Xiwei Xuan et.al.	2506.21233	null
2025-06-26	How Good Are Synthetic Requirements ? Evaluating LLM-Generated Datasets for AI4RE	Abdelkarim El-Hajjami et.al.	2506.21138	null
2025-06-26	Learning to See in the Extremely Dark	Hai Jiang et.al.	2506.21132	null
2025-06-26	Enhancing LLM Tool Use with High-quality Instruction Data from Knowledge Graph	Jingwei Wang et.al.	2506.21071	null
2025-06-26	Bridging Video Quality Scoring and Justification via Large Multimodal Models	Qizhi Xie et.al.	2506.21011	null
2025-06-26	Inverse Scene Text Removal	Takumi Yoshimatsu et.al.	2506.21002	null
2025-06-26	Active Learning for Manifold Gaussian Process Regression	Yuanxing Cheng et.al.	2506.20928	null
2025-06-25	Analytic inference with two-way clustering	Laurent Davezies et.al.	2506.20749	null
2025-06-25	Disentangled representations of microscopy images	Jacopo Dapueto et.al.	2506.20649	null
2025-06-25	Lost in Retraining: Roaming the Parameter Space of Exponential Families Under Closed-Loop Learning	Fariba Jangjoo et.al.	2506.20623	null
2025-06-25	Causal Representation Learning with Observational Grouping for CXR Classification	Rajat Rasal et.al.	2506.20582	null
2025-06-26	Causal Inference for Latent Outcomes Learned with Factor Models	Jenna M. Landy et.al.	2506.20549	null
2025-06-25	Fast ground penetrating radar dual-parameter full waveform inversion method accelerated by hybrid compilation of CUDA kernel function and PyTorch	Lei Liu et.al.	2506.20513	null
2025-06-25	Generative AI for Vulnerability Detection in 6G Wireless Networks: Advances, Case Study, and Future Directions	Shuo Yang et.al.	2506.20488	null
2025-06-25	POLAR: A Pessimistic Model-based Policy Learning Algorithm for Dynamic Treatment Regimes	Ruijia Zhang et.al.	2506.20406	null
2025-06-25	Time-series surrogates from energy consumers generated by machine learning approaches for long-term forecasting scenarios	Ben Gerhards et.al.	2506.20253	null
2025-06-25	FedBKD: Distilled Federated Learning to Embrace Gerneralization and Personalization on Non-IID Data	Yushan Zhao et.al.	2506.20245	null
2025-06-25	Time and covariance smoothing for restoration of bivariate signals	Yusuf Yigit Pilavci et.al.	2506.20237	null
2025-06-25	Progressive Alignment Degradation Learning for Pansharpening	Enzhe Zhao et.al.	2506.20179	null
2025-06-24	Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study	Yuqi Zhu et.al.	2506.19794	null
2025-06-24	Uncovering Conceptual Blindspots in Generative Image Models Using Sparse Autoencoders	Matyas Bohacek et.al.	2506.19708	null
2025-06-24	Extreme Learning Machines for Exoplanet Simulations: A Faster, Lightweight Alternative to Deep Learning	Tara P. A. Tahseen et.al.	2506.19679	null
2025-06-24	Unsupervised Data Generation for Offline Reinforcement Learning: A Perspective from Model	Shuncheng He et.al.	2506.19643	null
2025-06-24	Stylized Structural Patterns for Improved Neural Network Pre-training	Farnood Salehi et.al.	2506.19465	null
2025-06-24	SoK: Can Synthetic Images Replace Real Data? A Survey of Utility and Privacy of Synthetic Image Generation	Yunsung Chung et.al.	2506.19360	null
2025-06-24	In-Context Occam’s Razor: How Transformers Prefer Simpler Hypotheses on the Fly	Puneesh Deora et.al.	2506.19351	null
2025-06-24	Progressive Modality Cooperation for Multi-Modality Domain Adaptation	Weichen Zhang et.al.	2506.19316	null
2025-06-25	What Matters in LLM-generated Data: Diversity and Its Effect on Model Fine-Tuning	Yuchang Zhu et.al.	2506.19262	null
2025-06-24	Automated Image Recognition Framework	Quang-Binh Nguyen et.al.	2506.19261	null
2025-06-24	MSR-Align: Policy-Grounded Multimodal Alignment for Safety-Aware Reasoning in Vision-Language Models	Yinan Xia et.al.	2506.19257	null
2025-06-23	FairCauseSyn: Towards Causally Fair LLM-Augmented Synthetic Data Generation	Nitish Nagesh et.al.	2506.19082	null
2025-06-23	LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning	Yuhao Wu et.al.	2506.18841	null
2025-06-23	A Structural Causal Model for Electronic Device Reliability: From Effects to Counterfactuals	Federico Mattia Stefanini et.al.	2506.18663	null
2025-06-23	One-sample survival tests for non-proportional hazards in oncology clinical trial	Chloé Szurewsky et.al.	2506.18608	null
2025-06-23	PuckTrick: A Library for Making Synthetic Data More Realistic	Alessandra Agostini et.al.	2506.18499	null
2025-06-23	Edge Association Strategies for Synthetic Data Empowered Hierarchical Federated Learning with Non-IID Data	Jer Shyuan Ng et.al.	2506.18259	null
2025-06-22	Coherent Track-Before-Detect	Mingchao Liang et.al.	2506.18177	null
2025-06-22	Measuring Fractal Dimension using Discrete Global Grid Systems	Pramit Ghosh et.al.	2506.18175	null
2025-06-22	RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic Manipulation	Tianxing Chen et.al.	2506.18088	null
2025-06-22	PP-DocBee2: Improved Baselines with Efficient Data for Multimodal Document Understanding	Kui Huang et.al.	2506.18023	null
2025-06-22	A Survey of Quantum Generative Adversarial Networks: Architectures, Use Cases, and Real-World Implementations	Mujahidul Islam et.al.	2506.18002	null
2025-06-22	Enabling PSO-Secure Synthetic Data Sharing Using Diversity-Aware Diffusion Models	Mischa Dombrowski et.al.	2506.17975	null
2025-06-22	Leveraging Large Language Model for Intelligent Log Processing and Autonomous Debugging in Cloud AI Platforms	Cheng Ji et.al.	2506.17900	null
2025-06-22	BeltCrack: the First Sequential-image Industrial Conveyor Belt Crack Detection Dataset and Its Baseline with Triple-domain Feature Learning	Jianghong Huang et.al.	2506.17892	null
2025-06-21	A Comparative Study of Open-Source Libraries for Synthetic Tabular Data Generation: SDV vs. SynthCity	Cristian Del Gobbo et.al.	2506.17847	null
2025-06-21	Maximum-likelihood reprojections for reliable Koopman-based predictions and bifurcation analysis of parametric dynamical systems	Pieter van Goor et.al.	2506.17817	null
2025-06-20	Proportional Sensitivity in Generative Adversarial Network (GAN)-Augmented Brain Tumor Classification Using Convolutional Neural Network	Mahin Montasir Afif et.al.	2506.17165	null
2025-06-20	The fundamental problem of risk prediction for individuals: health AI, uncertainty, and personalized medicine	Lasai Barreñada et.al.	2506.17141	null
2025-06-20	MeDi: Metadata-Guided Diffusion Models for Mitigating Biases in Tumor Classification	David Jacob Drexlin et.al.	2506.17140	null
2025-06-20	Unsupervised Image Super-Resolution Reconstruction Based on Real-World Degradation Patterns	Yiyang Tie et.al.	2506.17027	null
2025-06-20	Instituto de Telecomunicações at IWSLT 2025: Aligning Small-Scale Speech and Language Models for Speech-to-Text Learning	Giuseppe Attanasio et.al.	2506.17019	link
2025-06-20	Modeling and Visualization Reasoning for Stakeholders in Education and Industry Integration Systems: Research on Structured Synthetic Dialogue Data Generation Based on NIST Standards	Wei Meng et.al.	2506.16952	null
2025-06-20	Skewness-Kurtosis: small samples and power-law behavior	Carlo De Michele et.al.	2506.16906	null
2025-06-20	Multi-Objective Recommendation in the Era of Generative AI: A Survey of Recent Progress and Future Prospects	Zihan Hong et.al.	2506.16893	null
2025-06-20	Private Training & Data Generation by Clustering Embeddings	Felix Zhou et.al.	2506.16661	null
2025-06-19	Latent Noise Injection for Private and Statistically Aligned Synthetic Data Generation	Rex Shen et.al.	2506.16636	null
2025-06-19	A Scoping Review of Synthetic Data Generation for Biomedical Research and Applications	Hanshu Rao et.al.	2506.16594	null
2025-06-19	How Hard Is Snow? A Paired Domain Adaptation Dataset for Clear and Snowy Weather: CADC+	Mei Qi Tang et.al.	2506.16531	null
2025-06-19	Synthetic ALS-EEG Data Augmentation for ALS Diagnosis Using Conditional WGAN with Weight Clipping	Abdulvahap Mutlu et.al.	2506.16243	link
2025-06-19	Regularized Learning for Fractional Brownian Motion via Path Signatures	Ali Mohaddes et.al.	2506.16156	null
2025-06-19	DIGMAPPER: A Modular System for Automated Geologic Map Digitization	Weiwei Duan et.al.	2506.16006	null
2025-06-18	LiteGD: Lightweight and dynamic GPU Dispatching for Large-scale Heterogeneous Clusters	Kunming Zhang et.al.	2506.15595	null
2025-06-18	When Model Knowledge meets Diffusion Model: Diffusion-assisted Data-free Image Synthesis with Alignment of Domain and Class	Yujin Kim et.al.	2506.15381	null
2025-06-18	Human Motion Capture from Loose and Sparse Inertial Sensors with Garment-aware Diffusion Models	Andela Ilic et.al.	2506.15290	null
2025-06-18	Unlocking Post-hoc Dataset Inference with Synthetic Data	Bihe Zhao et.al.	2506.15271	null
2025-06-18	Efficient space reduction techniques by optimized majority rules for the Kemeny aggregation problem	Xuan Kien Phung et.al.	2506.15097	null
2025-06-17	Understanding multi-fidelity training of machine-learned force-fields	John L. A. Gardner et.al.	2506.14963	null
2025-06-18	Accurate and scalable exchange-correlation with deep learning	Giulia Luise et.al.	2506.14665	null
2025-06-17	Synthetic Data Augmentation for Table Detection: Re-evaluating TableNet’s Performance with Automatically Generated Document Images	Krishna Sahukara et.al.	2506.14583	null
2025-06-17	Object-Centric Neuro-Argumentative Learning	Abdul Rahman Jacob et.al.	2506.14577	link
2025-06-17	A Logic For Fresh Labelled Transition Systems	Mohamed H Bandukara et.al.	2506.14538	null
2025-06-17	SIRI-Bench: Challenging VLMs’ Spatial Intelligence through Complex Reasoning Tasks	Zijian Song et.al.	2506.14512	null
2025-06-17	An ELIXIR scoping review on domain-specific evaluation metrics for synthetic data in life sciences	Styliani-Christina Fragkouli et.al.	2506.14508	null
2025-06-17	orGAN: A Synthetic Data Augmentation Pipeline for Simultaneous Generation of Surgical Images and Ground Truth Labels	Niran Nataraj et.al.	2506.14303	null
2025-06-17	CausalDiffTab: Mixed-Type Causal-Aware Diffusion for Tabular Data Generation	Jia-Chen Zhang et.al.	2506.14206	null
2025-06-16	Estimation of Treatment Effects in Extreme and Unobserved Data	Jiyuan Tan et.al.	2506.14051	null
2025-06-16	Meta Optimality for Demographic Parity Constrained Regression via Post-Processing	Kazuto Fukuchi et.al.	2506.13947	null
2025-06-16	Rademacher learning rates for iterated random functions	Nikola Sandrić et.al.	2506.13946	null
2025-06-16	Deep learning inference with the Event Horizon Telescope III. Zingularity results from the 2017 observations and predictions for future array expansions	M. Janssen et.al.	2506.13877	null
2025-06-16	Deep learning inference with the Event Horizon Telescope II. The Zingularity framework for Bayesian artificial neural networks	M. Janssen et.al.	2506.13875	null
2025-06-16	Deep learning inference with the Event Horizon Telescope I. Calibration improvements and a comprehensive synthetic data library	M. Janssen et.al.	2506.13873	null
2025-06-16	*A Determination of $α_s(m_Z)$ at aN$^3$LO${\bf QCD}\otimes {\bf NLO}{\bf QED}$ Accuracy from a Global PDF Analysis*	The NNPDF Collaboration et.al.	2506.13871	null
2025-06-16	How Real is CARLAs Dynamic Vision Sensor? A Study on the Sim-to-Real Gap in Traffic Object Detection	Kaiyuan Tan et.al.	2506.13722	null
2025-06-17	Graph-Convolutional-Beta-VAE for Synthetic Abdominal Aorta Aneurysm Generation	Francesco Fabbri et.al.	2506.13628	null
2025-06-16	From Data-Driven to Purpose-Driven Artificial Intelligence: Systems Thinking for Data-Analytic Automation of Patient Care	Daniel Anadria et.al.	2506.13584	null
2025-06-16	X-Scene: Large-Scale Driving Scene Generation with High Fidelity and Flexible Controllability	Yu Yang et.al.	2506.13558	null
2025-06-16	What Matters in Learning from Large-Scale Datasets for Robot Manipulation	Vaibhav Saxena et.al.	2506.13536	null
2025-06-16	K/DA: Automated Data Generation Pipeline for Detoxifying Implicitly Offensive Language in Korean	Minkyeong Jeon et.al.	2506.13513	null
2025-06-16	Enhancing Omics Cohort Discovery for Research on Neurodegeneration through Ontology-Augmented Embedding Models	José A. Pardo et.al.	2506.13467	null
2025-06-16	PRO: Projection Domain Synthesis for CT Imaging	Kang Chen et.al.	2506.13443	null
2025-06-16	LapDDPM: A Conditional Graph Diffusion Model for scRNA-seq Generation with Spectral Adversarial Perturbations	Lorenzo Bini et.al.	2506.13344	null
2025-06-16	COME: Adding Scene-Centric Forecasting Control to Occupancy World Model	Yining Shi et.al.	2506.13260	link
2025-06-16	SPOT: Bridging Natural Language and Geospatial Search for Investigative Journalists	Lynn Khellaf et.al.	2506.13188	null
2025-06-15	Transforming Chatbot Text: A Sequence-to-Sequence Approach	Natesh Reddy et.al.	2506.12843	null
2025-06-14	Quantitative quasi-invariance of Gaussian measures below the energy level for the 1D generalized nonlinear Schrödinger equation and application to global well-posedness	Alexis Knezevitch et.al.	2506.12582	null
2025-06-14	Path-specific effects for pulse-oximetry guided decisions in critical care	Kevin Zhang et.al.	2506.12371	null
2025-06-13	Evaluation of machine-learning models to measure individualized treatment effects from randomized clinical trial data with time-to-event outcomes	Elvire Roblin et.al.	2506.12277	null
2025-06-13	Affogato: Learning Open-Vocabulary Affordance Grounding with Automated Data Generation at Scale	Junha Lee et.al.	2506.12009	null
2025-06-13	Confidence-Based Self-Training for EMG-to-Speech: Leveraging Synthetic EMG for Robust Modeling	Xiaodan Chen et.al.	2506.11862	null
2025-06-13	MindGrab for BrainChop: Fast and Accurate Skull Stripping for Command Line and Browser	Armina Fani et.al.	2506.11860	null
2025-06-13	Teleoperated Driving: a New Challenge for 3D Object Detection in Compressed Point Clouds	Filippo Bragato et.al.	2506.11804	null
2025-06-13	Why Do Class-Dependent Evaluation Effects Occur with Time Series Feature Attributions? A Synthetic Data Investigation	Gregor Baer et.al.	2506.11790	null
2025-06-16	AgentSense: Virtual Sensor Data Generation Using LLM Agents in Simulated Home Environments	Zikang Leng et.al.	2506.11773	null
2025-06-13	Causal Effect Identification in Heterogeneous Environments from Higher-Order Moments	Yaroslav Kivva et.al.	2506.11756	null
2025-06-13	Exploring the Effectiveness of Deep Features from Domain-Specific Foundation Models in Retinal Image Synthesis	Zuzanna Skorniewska et.al.	2506.11753	null
2025-06-13	Generalised Rate Control Approach For Stream Processing Applications	Ziren Xiao et.al.	2506.11710	null
2025-06-13	Configurable Preference Tuning with Rubric-Guided Synthetic Data	Víctor Gallego et.al.	2506.11702	null
2025-06-13	Let the Tree Decide: FABART A Non-Parametric Factor Model	Sofia Velasco et.al.	2506.11551	null
2025-06-12	Variance estimation after matching or re-weighting	Xiang Meng et.al.	2506.11317	link
2025-06-12	Joint Denoising of Cryo-EM Projection Images using Polar Transformers	Joakim Andén et.al.	2506.11283	null
2025-06-12	Domain-Constrained Diffusion Models to Synthesize Tabular Data: A Case Study in Power Systems	Milad Hoseinpour et.al.	2506.11281	null
2025-06-12	Invocable APIs derived from NL2SQL datasets for LLM Tool-Calling Evaluation	Benjamin Elder et.al.	2506.11266	null
2025-06-12	Distillation of atomistic foundation models across architectures and chemical domains	John L. A. Gardner et.al.	2506.10956	link
2025-06-12	Foundation Models for Causal Inference via Prior-Data Fitted Networks	Yuchen Ma et.al.	2506.10914	null
2025-06-12	A Study on Individual Spatiotemporal Activity Generation Method Using MCP-Enhanced Chain-of-Thought Large Language Models	Yu Zhang et.al.	2506.10853	link
2025-06-12	Generalist Models in Medical Image Segmentation: A Survey and Performance Comparison with Task-Specific Approaches	Andrea Moglia et.al.	2506.10825	null
2025-06-13	Detecting High-Stakes Interactions with Activation Probes	Alex McKenzie et.al.	2506.10805	null
2025-06-12	Deep Learning-based Multi Project InP Wafer Simulation for Unsupervised Surface Defect Detection	Emílio Dolgener Cantú et.al.	2506.10713	null
2025-06-12	ConTextTab: A Semantics-Aware Tabular In-Context Learner	Marco Spinaci et.al.	2506.10707	link
2025-06-12	SDialog: A Python Toolkit for Synthetic Dialogue Generation and Analysis	Sergio Burdisso et.al.	2506.10622	link
2025-06-12	Efficient nanophotonic devices optimization using deep neural network trained with physics-based transfer learning (PBTL) methodology	Gibaek Kim et.al.	2506.10418	null
2025-06-12	Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts	Zaijing Li et.al.	2506.10357	null
2025-06-12	Towards Understanding Bias in Synthetic Data for Evaluation	Hossein A. Rahmani et.al.	2506.10301	link
2025-06-12	Learning-Based Stable Optimal Control for Infinite-Time Nonlinear Regulation Problems	Han Wang et.al.	2506.10291	link
2025-06-11	ToxSyn-PT: A Large-Scale Synthetic Dataset for Hate Speech Detection in Portuguese	Iago Alves Brito et.al.	2506.10245	null
2025-06-11	ChartReasoner: Code-Driven Modality Bridging for Long-Chain Reasoning in Chart Question Answering	Caijun Jia et.al.	2506.10116	null
2025-06-11	Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing	Junfei Wu et.al.	2506.09965	link
2025-06-11	Error-Guided Pose Augmentation: Enhancing Rehabilitation Exercise Assessment through Targeted Data Generation	Omar Sherif et.al.	2506.09833	null
2025-06-11	Accurate and efficient zero-shot 6D pose estimation with frozen foundation models	Andrea Caraffa et.al.	2506.09784	null
2025-06-11	CINeMA: Conditional Implicit Neural Multi-Modal Atlas for a Spatio-Temporal Representation of the Perinatal Brain	Maik Dannecker et.al.	2506.09668	link
2025-06-11	Recognizing Every Voice: Towards Inclusive ASR for Rural Bhojpuri Women	Sakshi Joshi et.al.	2506.09653	link
2025-06-11	In-Context Bias Propagation in LLM-Based Tabular Data Generation	Pol G. Recasens et.al.	2506.09630	null
2025-06-11	TinySplat: Feedforward Approach for Generating Compact 3D Scene Representation	Zetian Song et.al.	2506.09479	null
2025-06-11	Synthetic Human Action Video Data Generation with Pose Transfer	Vaclav Knapp et.al.	2506.09411	null
2025-06-11	Reasoning as a Resource: Optimizing Fast and Slow Thinking in Code Generation Models	Zongjie Li et.al.	2506.09396	null
2025-06-11	CheckManual: A New Challenge and Benchmark for Manual-based Appliance Manipulation	Yuxing Long et.al.	2506.09343	null
2025-06-11	Towards Efficient and Effective Alignment of Large Language Models	Yuxin Jiang et.al.	2506.09329	null
2025-06-11	Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models	Xuanchi Ren et.al.	2506.09042	link
2025-06-11	SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner	Lei Zhang et.al.	2506.09003	link
2025-06-10	IMAGIC-500: IMputation benchmark on A Generative Imaginary Country (500k samples)	Siyi Sun et.al.	2506.08844	link
2025-06-10	Unlocking the Potential of Large Language Models in the Nuclear Industry with Synthetic Data	Muhammad Anwar et.al.	2506.08750	null
2025-06-10	Urban Incident Prediction with Graph Neural Networks: Integrating Government Ratings and Crowdsourced Reports	Sidhika Balachandar et.al.	2506.08740	link
2025-06-10	ArrowPose: Segmentation, Detection, and 5 DoF Pose Estimation Network for Colorless Point Clouds	Frederik Hagelskjaer et.al.	2506.08699	null
2025-06-10	TableDreamer: Progressive and Weakness-guided Data Synthesis from Scratch for Table Instruction Tuning	Mingyu Zheng et.al.	2506.08646	link
2025-06-10	Dense Retrievers Can Fail on Simple Queries: Revealing The Granularity Dilemma of Embeddings	Liyan Xu et.al.	2506.08592	link
2025-06-10	Mic-hackathon 2024: Hackathon on Machine Learning for Electron and Scanning Probe Microscopy	Utkarsh Pratiush et.al.	2506.08423	link
2025-06-10	Spatiotemporal deep learning models for detection of rapid intensification in cyclones	Vamshika Sutar et.al.	2506.08397	null
2025-06-10	Private Evolution Converges	Tomás González et.al.	2506.08312	null
2025-06-09	Massive parallelization of projection-based depths	Leonardo Leone et.al.	2506.08262	null
2025-06-09	An In-situ Solid Fuel Ramjet Thrust Monitoring and Regulation Framework Using Neural Networks and Adaptive Control	Ryan DeBoskey et.al.	2506.08157	null
2025-06-09	SOP-Bench: Complex Industrial SOPs for Evaluating LLM Agents	Subhrangshu Nandi et.al.	2506.08119	null
2025-06-09	GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior	Penghao Wu et.al.	2506.08012	null
2025-06-09	Squeeze3D: Your 3D Generation Model is Secretly an Extreme Neural Compressor	Rishit Dagli et.al.	2506.07932	null
2025-06-09	CausalPFN: Amortized Causal Effect Estimation via In-Context Learning	Vahid Balazadeh et.al.	2506.07918	link
2025-06-09	Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces	Kevin Rojas et.al.	2506.07903	link
2025-06-09	HAIBU-ReMUD: Reasoning Multimodal Ultrasound Dataset and Model Bridging to General Specific Domains	Shijie Wang et.al.	2506.07837	link
2025-06-09	Augmenting LLMs’ Reasoning by Reinforcing Abstract Thinking	Silin Gao et.al.	2506.07751	null
2025-06-09	Flow-Anything: Learning Real-World Optical Flow Estimation from Large-Scale Single-view Images	Yingping Liang et.al.	2506.07740	null
2025-06-09	Synthesis by Design: Controlled Data Generation via Structural Guidance	Lei Xu et.al.	2506.07664	null
2025-06-09	The Universality Lens: Why Even Highly Over-Parametrized Models Learn Well	Meir Feder et.al.	2506.07661	null
2025-06-09	Scaling Human Activity Recognition: A Comparative Evaluation of Synthetic Data Generation and Augmentation Techniques	Zikang Leng et.al.	2506.07612	null
2025-06-09	Scalable Spatiotemporal Modeling for Bicycle Count Prediction	Rishikesh Yadav et.al.	2506.07582	null
2025-06-09	LLM-driven Indoor Scene Layout Generation via Scaled Human-aligned Data Synthesis and Multi-Stage Preference Optimization	Yixuan Yang et.al.	2506.07570	null
2025-06-09	Domain Randomization for Object Detection in Manufacturing Applications using Synthetic Data: A Comprehensive Study	Xiaomeng Zhu et.al.	2506.07539	link
2025-06-09	Addressing Correlated Latent Exogenous Variables in Debiased Recommender Systems	Shuqiang Zhang et.al.	2506.07517	link
2025-06-08	Pre-trained Large Language Models Learn Hidden Markov Models In-context	Yijia Dai et.al.	2506.07298	null
2025-06-06	3DFlowAction: Learning Cross-Embodiment Manipulation from 3D Flow World Model	Hongyan Zhi et.al.	2506.06199	link
2025-06-06	A comprehensive Darcy-type law for viscoplastic fluids: I. Framework	Emad Chaparian et.al.	2506.06184	null
2025-06-06	Synthetic Tabular Data: Methods, Attacks and Defenses	Graham Cormode et.al.	2506.06108	null
2025-06-06	Do-PFN: In-Context Learning for Causal Effect Estimation	Jake Robertson et.al.	2506.06039	null
2025-06-06	Optimization-Free Universal Watermark Forgery with Regenerative Diffusion Models	Chaoyi Zhu et.al.	2506.06018	link
2025-06-06	Bootstrapping World Models from Dynamics Models in Multimodal Foundation Models	Yifu Qiu et.al.	2506.06006	link
2025-06-06	Additive decomposition of one-dimensional signals using Transformers	Samuele Salti et.al.	2506.05942	null
2025-06-06	Exponential Family Variational Flow Matching for Tabular Data Generation	Andrés Guzmán-Cordero et.al.	2506.05940	null
2025-06-06	Stealix: Model Stealing via Prompt Evolution	Zhixiong Zhuang et.al.	2506.05867	null
2025-06-06	dots.llm1 Technical Report	Bi Huo et.al.	2506.05767	null
2025-06-06	A cautious user’s guide in applying HMMs to physical systems	Max Schweiger et.al.	2506.05707	null
2025-06-05	A Fictional Q&A Dataset for Studying Memorization and Knowledge Acquisition	John Kirchenbauer et.al.	2506.05639	null
2025-06-05	Online Conformal Model Selection for Nonstationary Time Series	Shibo Li et.al.	2506.05544	null
2025-06-05	Winner-takes-all for Multivariate Probabilistic Time Series Forecasting	Adrien Cortés et.al.	2506.05515	null
2025-06-05	MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning	Xinyan Chen et.al.	2506.05331	link
2025-06-05	Reduction of Outflow Boundary Influence on Aerodynamic Performance using Neural Networks	Mario Christopher Bedrunka et.al.	2506.05293	null
2025-06-05	Privacy Amplification Through Synthetic Data: Insights from Linear Regression	Clément Pierquin et.al.	2506.05101	null
2025-06-05	Synthetic Dataset Generation for Autonomous Mobile Robots Using 3D Gaussian Splatting for Vision Training	Aneesh Deogan et.al.	2506.05092	null
2025-06-05	Identifying and Understanding Cross-Class Features in Adversarial Training	Zeming Wei et.al.	2506.05032	null
2025-06-05	Physical Annotation for Automated Optical Inspection: A Concept for In-Situ, Pointer-Based Trainingdata Generation	Oliver Krumpek et.al.	2506.05026	null
2025-06-05	Point Cloud Segmentation of Agricultural Vehicles using 3D Gaussian Splatting	Alfred T. Christiansen et.al.	2506.05009	null
2025-06-05	Learning Joint Interventional Effects from Single-Variable Interventions in Additive Models	Armin Kekić et.al.	2506.04945	null
2025-06-05	Recycling the Web: A Method to Enhance Pre-training Data Quality and Quantity for Language Models	Thao Nguyen et.al.	2506.04689	null
2025-06-05	Gen-n-Val: Agentic Image Data Generation and Validation	Jing-En Huang et.al.	2506.04676	null
2025-06-05	Clustering and Median Aggregation Improve Differentially Private Inference	Kareem Amin et.al.	2506.04566	null
2025-06-05	PUB: An LLM-Enhanced Personality-Driven User Behaviour Simulator for Recommender System Evaluation	Chenglong Ma et.al.	2506.04551	null
2025-06-05	NOBLE – Neural Operator with Biologically-informed Latent Embeddings to Capture Experimental Variability in Biological Neuron Models	Luca Ghafourpour et.al.	2506.04536	null
2025-06-04	BEAR: BGP Event Analysis and Reporting	Hanqing Li et.al.	2506.04514	link
2025-06-04	Diffusion Transformer-based Universal Dose Denoising for Pencil Beam Scanning Proton Therapy	Yuzhen Ding et.al.	2506.04467	null
2025-06-04	OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis	Junting Chen et.al.	2506.04217	link
2025-06-04	Understanding challenges to the interpretation of disaggregated evaluations of algorithmic fairness	Stephen R. Pfohl et.al.	2506.04193	link
2025-06-05	OpenThoughts: Data Recipes for Reasoning Models	Etash Guha et.al.	2506.04178	link
2025-06-04	Does Prompt Design Impact Quality of Data Imputation by LLMs?	Shreenidhi Srinivasan et.al.	2506.04172	null
2025-06-04	Adaptive tuning of Hamiltonian Monte Carlo methods	Elena Akhmatskaya et.al.	2506.04082	null
2025-06-04	HSSBench: Benchmarking Humanities and Social Sciences Ability for Multimodal Large Language Models	Zhaolu Kang et.al.	2506.03922	link
2025-06-04	EuroGEST: Investigating gender stereotypes in multilingual language models	Jacqueline Rowe et.al.	2506.03867	null
2025-06-04	Personalized MR-Informed Diffusion Models for 3D PET Image Reconstruction	George Webber et.al.	2506.03804	null
2025-06-04	YOND: Practical Blind Raw Image Denoising Free from Camera-Specific Data Dependency	Hansen Feng et.al.	2506.03645	null
2025-06-04	Is linguistically-motivated data augmentation worth it?	Ray Groshan et.al.	2506.03593	null
2025-06-04	ConsistentChat: Building Skeleton-Guided Consistent Dialogues for Large Language Models from Scratch	Jiawei Chen et.al.	2506.03558	null
2025-06-03	RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model on Referring Expressions	Bimsara Pathiraja et.al.	2506.03448	null
2025-06-03	Trajectory Prediction Meets Large Language Models: A Survey	Yi Xu et.al.	2506.03408	link
2025-06-03	Fast Machine Learning for Quantum Control of Microwave Qudits on Edge Hardware	Flor Sanders et.al.	2506.03323	null
2025-06-03	ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions	Di Chang et.al.	2506.03107	null
2025-06-03	Corrigibility as a Singular Target: A Vision for Inherently Reliable Foundation Models	Ram Potham et.al.	2506.03056	null
2025-06-03	PartComposer: Learning and Composing Part-Level Concepts from Single-Image Examples	Junyu Liu et.al.	2506.03004	null
2025-06-03	Towards a Japanese Full-duplex Spoken Dialogue System	Atsumoto Ohashi et.al.	2506.02979	null
2025-06-03	Interaction Field Matching: Overcoming Limitations of Electrostatic Models	Stepan I. Manukhov et.al.	2506.02950	null
2025-06-03	INESC-ID @ eRisk 2025: Exploring Fine-Tuned, Similarity-Based, and Prompt-Based Approaches to Depression Symptom Identification	Diogo A. P. Nunes et.al.	2506.02924	null
2025-06-03	Enhancing Abnormality Identification: Robust Out-of-Distribution Strategies for Deepfake Detection	Luca Maiano et.al.	2506.02857	null
2025-06-03	CART-based Synthetic Tabular Data Generation for Imbalanced Regression	António Pedro Pinheiro et.al.	2506.02811	null
2025-06-03	Decentralized COVID-19 Health System Leveraging Blockchain	Lingsheng Chen et.al.	2506.02674	null
2025-06-03	Hyperspectral Image Generation with Unmixing Guided Diffusion Model	Shiyu Shen et.al.	2506.02601	null
2025-06-03	KARE-RAG: Knowledge-Aware Refinement and Enhancement for RAG	Yongjian Li et.al.	2506.02503	null
2025-06-03	Generative AI for Predicting 2D and 3D Wildfire Spread: Beyond Physics-Based Models and Traditional Deep Learning	Haowen Xu et.al.	2506.02485	null
2025-06-03	IP-Dialog: Evaluating Implicit Personalization in Dialogue Systems with Synthetic Data	Bo Peng et.al.	2506.02449	null
2025-06-03	The Devil is in the Darkness: Diffusion-Based Nighttime Dehazing Anchored in Brightness Perception	Xiaofeng Cong et.al.	2506.02395	null
2025-06-03	NextQuill: Causal Preference Modeling for Enhancing LLM Personalization	Xiaoyan Zhao et.al.	2506.02368	null
2025-05-30	Consistent line clustering using geometric hypergraphs	Kalle Alaluusua et.al.	2505.24868	null
2025-06-02	How much do language models memorize?	John X. Morris et.al.	2505.24832	null
2025-05-30	REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards	Zafir Stojanovski et.al.	2505.24760	link
2025-05-30	Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning	Shelly Bensal et.al.	2505.24726	null
2025-05-30	Multi-Domain ABSA Conversation Dataset Generation via LLMs for Real-World Evaluation and Model Comparison	Tejul Pandit et.al.	2505.24701	null
2025-05-30	TumorGen: Boundary-Aware Tumor-Mask Synthesis with Rectified Flow Matching	Shengyuan Liu et.al.	2505.24687	null
2025-05-30	TRIDENT: Enhancing Large Language Model Safety with Tri-Dimensional Diversified Red-Teaming Data Synthesis	Xiaorui Wu et.al.	2505.24672	link
2025-05-30	Category-Level 6D Object Pose Estimation in Agricultural Settings Using a Lattice-Deformation Framework and Diffusion-Augmented Synthetic Data	Marios Glytsos et.al.	2505.24636	null
2025-05-30	VietMix: A Naturally Occurring Vietnamese-English Code-Mixed Corpus with Iterative Augmentation for Machine Translation	Hieu Tran et.al.	2505.24472	null
2025-05-30	Anomaly Detection and Improvement of Clusters using Enhanced K-Means Algorithm	Vardhan Shorewala et.al.	2505.24365	null
2025-05-30	Factorization method for near-field inverse scattering problems in elastodynamics	Chun Liu et.al.	2505.24288	null
2025-05-30	Provably Improving Generalization of Few-Shot Models with Synthetic Data	Lan-Cuong Nguyen et.al.	2505.24190	null
2025-05-30	CodeV-R1: Reasoning-Enhanced Verilog Generation	Yaoyu Zhu et.al.	2505.24183	null
2025-05-30	Tag-Evol: Achieving Efficient Instruction Evolving via Tag Injection	Yixuan Wang et.al.	2505.24165	link
2025-05-30	Estimating dynamic transmission rates with a Black-Karasinski process in stochastic SIHR models using particle MCMC	Avery Drennan et.al.	2505.24127	null
2025-05-29	GeNRe: A French Gender-Neutral Rewriting System Using Collective Nouns	Enzo Doyen et.al.	2505.23630	link
2025-05-29	Going from a Representative Agent to Counterfactuals in Combinatorial Choice	Yanqiu Ruan et.al.	2505.23546	null
2025-05-29	TimePoint: Accelerated Time Series Alignment via Self-Supervised Keypoint and Descriptor Learning	Ron Shapira Weber et.al.	2505.23475	link
2025-05-29	To Measure What Isn’t There – Visual Exploration of Missingness Structures Using Quality Metrics	Sara Johansson Fernstad et.al.	2505.23447	null
2025-05-29	CryoCCD: Conditional Cycle-consistent Diffusion with Biophysical Modeling for Cryo-EM Synthesis	Runmin Jiang et.al.	2505.23444	null
2025-05-29	Synthetic Generation and Latent Projection Denoising of Rim Lesions in Multiple Sclerosis	Alexandra G. Roberts et.al.	2505.23353	link
2025-05-29	Calibrated Bayesian inference for random fields on large irregular domains using the debiased spatial Whittle likelihood	Thomas Goodwin et.al.	2505.23330	null
2025-05-29	EmoBench-UA: A Benchmark Dataset for Emotion Detection in Ukrainian	Daryna Dementieva et.al.	2505.23297	null
2025-05-29	Group zero-norm regularized robust loss minimization: proximal MM method and statistical error bound	Ling Liang et.al.	2505.23294	null
2025-05-29	Infinite-Instruct: Synthesizing Scaling Code instruction Data with Bidirectional Synthesis and Static Verification	Wenjing Xing et.al.	2505.23177	null
2025-05-29	RoboTransfer: Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer	Liu Liu et.al.	2505.23171	null
2025-05-29	AgentAlign: Navigating Safety Alignment in the Shift from Informative to Agentic Large Language Models	Jinchuan Zhang et.al.	2505.23020	link
2025-05-28	Leveraging Diffusion Models for Synthetic Data Augmentation in Protein Subcellular Localization Classification	Sylvey Lin et.al.	2505.22926	null
2025-05-28	What Has Been Lost with Synthetic Evaluation?	Alexander Gill et.al.	2505.22830	null
2025-05-28	PGLearn – An Open-Source Learning Toolkit for Optimal Power Flow	Michael Klamkin et.al.	2505.22825	null
2025-05-28	SPIRAL: Semantic-Aware Progressive LiDAR Scene Generation	Dekai Zhu et.al.	2505.22643	null
2025-05-28	Spatial Knowledge Graph-Guided Multimodal Synthesis	Yida Xue et.al.	2505.22633	null
2025-05-28	DocReRank: Single-Page Hard Negative Query Generation for Training Multi-Modal RAG Rerankers	Navve Wasserman et.al.	2505.22584	null
2025-05-28	TabularQGAN: A Quantum Generative Model for Tabular Data	Pallavi Bhardwaj et.al.	2505.22533	null
2025-05-28	Symplectic Generative Networks (SGNs): A Hamiltonian Framework for Invertible Deep Generative Modeling	Agnideep Aich et.al.	2505.22527	null
2025-05-28	Modeling and estimating skewed and heavy-tailed populations via unsupervised mixture models	Marco Bee et.al.	2505.22507	null
2025-05-28	Articulatory modeling of the S-shaped F2 trajectories observed in Öhman’s spectrographic analysis of VCV syllables	Frédéric Berthommier et.al.	2505.22455	null
2025-05-28	Position: All Current Generative Fidelity and Diversity Metrics are Flawed	Ossi Räisä et.al.	2505.22450	null
2025-05-28	Individualised Counterfactual Examples Using Conformal Prediction Intervals	James M. Adams et.al.	2505.22326	null
2025-05-28	Transformers Pretrained on Procedural Data Contain Modular Structures for Algorithmic Reasoning	Zachary Shinnick et.al.	2505.22308	null
2025-05-28	Neural Restoration of Greening Defects in Historical Autochrome Photographs Based on Purely Synthetic Data	Saptarshi Neil Sinha et.al.	2505.22291	null
2025-05-28	Analysis and Evaluation of Synthetic Data Generation in Speech Dysfluency Detection	Jinming Zhang et.al.	2505.22029	link
2025-05-28	Almost Linear Convergence under Minimal Score Assumptions: Quantized Transition Diffusion	Xunpeng Huang et.al.	2505.21892	null
2025-05-27	Towards Safety Reasoning in LLMs: AI-agentic Deliberation for Policy-embedded CoT Data Creation	Tharindu Kumarage et.al.	2505.21784	null
2025-05-27	Assessing EV Charging Impacts on Power Distribution Systems: A Unified Co-Simulation Framework	Mohammadreza Iranpour et.al.	2505.21773	null
2025-05-27	Inverse Virtual Try-On: Generating Multi-Category Product-Style Images from Clothed Individuals	Davide Lobba et.al.	2505.21062	link
2025-05-27	EasyDistill: A Comprehensive Toolkit for Effective Knowledge Distillation of Large Language Models	Chengyu Wang et.al.	2505.20888	null
2025-05-27	Debiased Ill-Posed Regression	AmirEmad Ghassami et.al.	2505.20787	null
2025-05-27	ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval	Eric Xing et.al.	2505.20764	link
2025-05-27	Phir Hera Fairy: An English Fairytaler is a Strong Faker of Fluent Speech in Low-Resource Indian Languages	Praveen Srinivasa Varadhan et.al.	2505.20693	null
2025-05-26	Learning with Expected Signatures: Theory and Applications	Lorenzo Lucchese et.al.	2505.20465	null
2025-05-28	PreP-OCR: A Complete Pipeline for Document Image Restoration and Enhanced OCR Accuracy	Shuhao Guan et.al.	2505.20429	null
2025-05-26	GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation	Zihong Chen et.al.	2505.20416	link
2025-05-26	Multimodal Federated Learning With Missing Modalities through Feature Imputation Network	Pranav Poudel et.al.	2505.20232	null
2025-05-28	Reasoning Is Not All You Need: Examining LLMs for Multi-Turn Mental Health Conversations	Mohit Chandra et.al.	2505.20201	null
2025-05-26	From Alignment to Advancement: Bootstrapping Audio-Language Alignment with Synthetic Data	Chun-Yi Kuan et.al.	2505.20166	null
2025-05-26	Prismatic Synthesis: Gradient-based Data Diversification Boosts Generalization in LLM Reasoning	Jaehun Jung et.al.	2505.20161	null
2025-05-26	MEBench: A Novel Benchmark for Understanding Mutual Exclusivity Bias in Vision-Language Models	Anh Thai et.al.	2505.20122	null
2025-05-26	TabPFN: One Model to Rule Them All?	Qiong Zhang et.al.	2505.20003	link
2025-05-26	Learning Optimal Multimodal Information Bottleneck Representations	Qilong Wu et.al.	2505.19996	null
2025-05-26	REA-RL: Reflection-Aware Online Reinforcement Learning for Efficient Large Reasoning Models	Hexuan Deng et.al.	2505.19862	link
2025-05-26	A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking	Zixiang Zhao et.al.	2505.19858	null
2025-05-26	SGM: A Framework for Building Specification-Guided Moderation Filters	Masoomali Fatehkia et.al.	2505.19766	null
2025-05-27	SAIL: Self-supervised Albedo Estimation from Real Images with a Latent Diffusion Model	Hala Djeghim et.al.	2505.19751	null
2025-05-26	Improving Heart Rejection Detection in XPCI Images Using Synthetic Data Augmentation	Jakov Samardžija et.al.	2505.19746	null
2025-05-26	Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed Environments	Junming Liu et.al.	2505.19699	null
2025-05-26	KIT’s Low-resource Speech Translation Systems for IWSLT2025: System Enhancement with Synthetic Data and Model Regularization	Zhaolin Li et.al.	2505.19679	null
2025-05-27	SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond	Junteng Liu et.al.	2505.19641	link
2025-05-26	Model Agnostic Differentially Private Causal Inference	Christiant Lebeda et.al.	2505.19589	null
2025-05-26	Situationally-Aware Dynamics Learning	Alejandro Murillo-Gonzalez et.al.	2505.19574	null
2025-05-26	SIPDO: Closed-Loop Prompt Optimization via Synthetic Data Feedback	Yaoning Yu et.al.	2505.19514	null
2025-05-26	CulFiT: A Fine-grained Cultural-aware LLM Training Paradigm via Multilingual Critique Data Synthesis	Ruixiang Feng et.al.	2505.19484	link
2025-05-23	Extended Inductive Reasoning for Personalized Preference Inference from Behavioral Signals	Jia-Nan Li et.al.	2505.18071	null
2025-05-23	Learning with Restricted Boltzmann Machines: Asymptotics of AMP and GD in High Dimensions	Yizhou Xu et.al.	2505.18046	link
2025-05-23	TRACE for Tracking the Emergence of Semantic Representations in Transformers	Nura Aljaafari et.al.	2505.17998	null
2025-05-23	ADLGen: Synthesizing Symbolic, Event-Triggered Sensor Sequences for Human Activity Modeling	Weihang You et.al.	2505.17987	null
2025-05-23	Mind the Domain Gap: Measuring the Domain Gap Between Real-World and Synthetic Point Clouds for Automated Driving Development	Nguyen Duc et.al.	2505.17959	null
2025-05-23	The Nuclear Route: Sharp Asymptotics of ERM in Overparameterized Quadratic Networks	Vittorio Erba et.al.	2505.17958	link
2025-05-23	Optimal Online Change Detection via Random Fourier Features	Florian Kalinke et.al.	2505.17789	null
2025-05-23	Automating Safety Enhancement for LLM-based Agents with Synthetic Risk Scenarios	Xueyang Zhou et.al.	2505.17735	null
2025-05-23	SynRES: Towards Referring Expression Segmentation in the Wild via Synthetic Data	Dong-Hee Kim et.al.	2505.17695	null
2025-05-23	4D-CTA Image and geometry dataset for kinematic analysis of abdominal aortic aneurysms	Mostafa Jamshidian et.al.	2505.17647	null
2025-05-23	Large language model as user daily behavior data generator: balancing population diversity and individual personality	Haoxin Li et.al.	2505.17615	null
2025-05-23	Offline Constrained Reinforcement Learning under Partial Data Coverage	Kihyuk Hong et.al.	2505.17506	null
2025-05-23	LLM-based Generative Error Correction for Rare Words with Synthetic Data and Phonetic Context	Natsuo Yamashita et.al.	2505.17410	link
2025-05-23	Measuring diversity of synthetic prompts and data generated with fine-grained persona prompting	Gauri Kambhatla et.al.	2505.17390	null
2025-05-22	ExeSQL: Self-Taught Text-to-SQL Models with Execution-Driven Bootstrapping for SQL Dialects	Jipeng Zhang et.al.	2505.17231	null
2025-05-23	VeriFastScore: Speeding up long-form factuality evaluation	Rishanth Rajendhran et.al.	2505.16973	link
2025-05-22	From Tens of Hours to Tens of Thousands: Scaling Back-Translation for Speech Recognition	Tianduo Wang et.al.	2505.16972	link
2025-05-22	Does Synthetic Data Help Named Entity Recognition for Low-Resource Languages?	Gaurav Kamath et.al.	2505.16814	null
2025-05-22	Learning Beyond Limits: Multitask Learning and Synthetic Data for Low-Resource Canonical Morpheme Segmentation	Changbing Yang et.al.	2505.16800	null
2025-05-22	V2V: Scaling Event-Based Vision through Efficient Video-to-Voxel Simulation	Hanyue Lou et.al.	2505.16797	link
2025-05-22	Data-Driven Breakthroughs and Future Directions in AI Infrastructure: A Comprehensive Review	Beyazit Bestami Yuksel et.al.	2505.16771	null
2025-05-22	Forward-only Diffusion Probabilistic Models	Ziwei Luo et.al.	2505.16733	link
2025-05-22	On the Out-of-Distribution Generalization of Self-Supervised Learning	Wenwen Qiang et.al.	2505.16675	link
2025-05-22	CausalDynamics: A large-scale benchmark for structural discovery of dynamical causal models	Benjamin Herdeanu et.al.	2505.16620	link
2025-05-22	Constrained Non-negative Matrix Factorization for Guided Topic Modeling of Minority Topics	Seyedeh Fatemeh Ebrahimi et.al.	2505.16493	link
2025-05-22	Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning	Guanting Dong et.al.	2505.16410	link
2025-05-22	Style Transfer with Diffusion Models for Synthetic-to-Real Domain Adaptation	Estelle Chigot et.al.	2505.16360	link
2025-05-23	EduBench: A Comprehensive Benchmarking Dataset for Evaluating Large Language Models in Diverse Educational Scenarios	Bin Xu et.al.	2505.16160	link
2025-05-21	Aug2Search: Enhancing Facebook Marketplace Search with LLM-Generated Synthetic Data Augmentation	Ruijie Xi et.al.	2505.16065	null
2025-05-21	Towards Identifiability of Interventional Stochastic Differential Equations	Aaron Zweig et.al.	2505.15987	null
2025-05-21	Beyond Empathy: Integrating Diagnostic and Therapeutic Reasoning with Large Language Models for Mental Health Counseling	He Hu et.al.	2505.15715	null
2025-05-21	Graph Conditional Flow Matching for Relational Data Generation	Davide Scassola et.al.	2505.15668	link
2025-05-21	FragFake: A Dataset for Fine-Grained Detection of Edited Images with Vision Language Models	Zhen Sun et.al.	2505.15644	link
2025-05-21	AI-empowered Real-Time Line-of-Sight Identification via Network Digital Twins	Michele Zhu et.al.	2505.15478	null
2025-05-21	Synthetic Enclosed Echoes: A New Dataset to Mitigate the Gap Between Simulated and Real-World Sonar Data	Guilherme de Oliveira et.al.	2505.15465	link
2025-05-21	GS2E: Gaussian Splatting is an Effective Data Generator for Event Stream Generation	Yuchen Li et.al.	2505.15287	null
2025-05-21	Contrastive Learning-Enhanced Trajectory Matching for Small-Scale Dataset Distillation	Wenmin Li et.al.	2505.15267	null
2025-05-21	Recognition of Unseen Combined Motions via Convex Combination-based EMG Pattern Synthesis for Myoelectric Control	Itsuki Yazawa et.al.	2505.15218	null
2025-05-21	Co-optimize condenser water temperature and cooling tower fan using high-fidelity synthetic data	Gulai Shen et.al.	2505.15041	null
2025-05-20	Optimizing Age-of-Information in Piggyback Networks with Recurrent Data Generation	Ching-Chi Lin et.al.	2505.14968	null
2025-05-20	This Time is Different: An Observability Perspective on Time Series Foundation Models	Ben Cohen et.al.	2505.14766	link
2025-05-20	TransMedSeg: A Transferable Semantic Framework for Semi-Supervised Medical Image Segmentation	Mengzhu Wang et.al.	2505.14753	null
2025-05-20	CSTS: A Benchmark for the Discovery of Correlation Structures in Time Series Clustering	Isabella Degen et.al.	2505.14596	link
2025-05-21	PlanGPT-VL: Enhancing Urban Planning with Domain-Specific Vision-Language Models	He Zhu et.al.	2505.14481	null
2025-05-20	Scaling Low-Resource MT via Synthetic Data Generation with LLMs	Ona de Gibert et.al.	2505.14423	null
2025-05-21	Improving the Γ-functions Method for Vortex Identification	Quan Xie et.al.	2505.14384	link
2025-05-20	Challenges and Limitations in the Synthetic Generation of mHealth Sensor Data	Flavio Di Martino et.al.	2505.14206	null
2025-05-20	From stability of Langevin diffusion to convergence of proximal MCMC for non-log-concave sampling	Marien Renaud et.al.	2505.14177	null
2025-05-20	AUTOLAW: Enhancing Legal Compliance in Large Language Models via Case Law Generation and Jury-Inspired Deliberation	Tai D. Nguyen et.al.	2505.14015	null
2025-05-20	A Probabilistic Perspective on Model Collapse	Shirong Xu et.al.	2505.13947	null
2025-05-20	Code2Logic: Game-Code-Driven Data Synthesis for Enhancing VLMs General Reasoning	Jingqi Tong et.al.	2505.13886	link
2025-05-20	Graphon Mixtures	Sevvandi Kandanaarachchi et.al.	2505.13864	null
2025-05-20	Context-Free Synthetic Data Mitigates Forgetting	Parikshit Bansal et.al.	2505.13811	null
2025-05-20	LLMs Capture Urban Science but Oversimplify Complexity	Yecheng Zhang et.al.	2505.13803	null
2025-05-19	Finding Distributions that Differ, with False Discovery Rate Control	Yonghoon Lee et.al.	2505.13769	null
2025-05-19	Synthetic Non-stationary Data Streams for Recognition of the Unknown	Joanna Komorniczak et.al.	2505.13745	link
2025-05-19	Improving Compositional Generation with Diffusion Models Using Lift Scores	Chenning Yu et.al.	2505.13740	link
2025-05-19	GraspMolmo: Generalizable Task-Oriented Grasping via Large-Scale Synthetic Data Generation	Abhay Deshpande et.al.	2505.13441	null
2025-05-19	Synthetic-Powered Predictive Inference	Meshi Bashari et.al.	2505.13432	link
2025-05-19	MR. Judge: Multimodal Reasoner as a Judge	Renjie Pi et.al.	2505.13403	null
2025-05-19	eStonefish-scenes: A synthetically generated dataset for underwater event-based optical flow prediction tasks	Jad Mansour et.al.	2505.13309	null
2025-05-19	Calibration-free single-frame super-resolution fluorescence microscopy	Anežka Dostálová et.al.	2505.13293	link
2025-05-19	Model Selection for Gaussian-gated Gaussian Mixture of Experts Using Dendrograms of Mixing Measures	Tuan Thai et.al.	2505.13052	null
2025-05-19	MA-COIR: Leveraging Semantic Search Index and Generative Models for Ontology-Driven Biomedical Concept Recognition	Shanshan Liu et.al.	2505.12964	link
2025-05-19	RetinaLogos: Fine-Grained Synthesis of High-Resolution Retinal Images Through Captions	Junzhi Ning et.al.	2505.12887	null
2025-05-19	Causality-Inspired Robustness for Nonlinear Models via Representation Learning	Marin Šola et.al.	2505.12868	null
2025-05-20	Practical Equivalence Testing and Its Application in Synthetic Pre-Crash Scenario Validation	Jian Wu et.al.	2505.12827	null
2025-05-19	EAVIT: Efficient and Accurate Human Value Identification from Text data via LLMs	Wenhao Zhu et.al.	2505.12792	null
2025-05-19	What is Stigma Attributed to? A Theory-Grounded, Expert-Annotated Interview Corpus for Demystifying Mental-Health Stigma	Han Meng et.al.	2505.12727	link
2025-05-19	DreamGen: Unlocking Generalization in Robot Learning through Neural Trajectories	Joel Jang et.al.	2505.12705	link
2025-05-19	Towards A Generalist Code Embedding Model Based On Massive Data Synthesis	Chaofan Li et.al.	2505.12697	link
2025-05-18	Real-time surrogate modeling of nonlinear pulse evolution in multimode fibers	Bora Çarpınlıoğlu et.al.	2505.12517	null
2025-05-16	Reinforcement Learning Closures for Underresolved Partial Differential Equations using Synthetic Data	Lothar Heimbach et.al.	2505.11308	null
2025-05-16	From Intent Discovery to Recognition with Topic Modeling and Synthetic Data	Aaron Rodrigues et.al.	2505.11176	null
2025-05-16	Diffusion Model in Hyperspectral Image Processing and Analysis: A Review	Xing Hu et.al.	2505.11158	null
2025-05-19	What’s Inside Your Diffusion Model? A Score-Based Riemannian Metric to Explore the Data Manifold	Simone Azeglio et.al.	2505.11128	null
2025-05-16	ShiQ: Bringing back Bellman to LLMs	Pierre Clavier et.al.	2505.11081	null
2025-05-16	BLEUBERI: BLEU is a surprisingly effective reward for instruction following	Yapei Chang et.al.	2505.11080	link
2025-05-16	Supervised Models Can Generalize Also When Trained on Random Label	Oskar Allerbo et.al.	2505.11006	link
2025-05-16	Generative Models in Computational Pathology: A Comprehensive Survey on Methods, Applications, and Challenges	Yuan Zhang et.al.	2505.10993	null
2025-05-16	RAGSynth: Synthetic Data for Robust and Faithful RAG Component Optimization	Haiyang Shen et.al.	2505.10989	link
2025-05-16	On DeepSeekMoE: Statistical Benefits of Shared Experts and Normalized Sigmoid Gating	Huy Nguyen et.al.	2505.10860	null
2025-05-16	Quantum data generation in a denoising model with multiscale entanglement renormalization network	Wei-Wei Zhang et.al.	2505.10796	null
2025-05-15	A Modular Approach for Clinical SLMs Driven by Synthetic Data with Pre-Instruction Tuning, Model Merging, and Clinical-Tasks Alignment	Jean-Philippe Corbeil et.al.	2505.10717	null
2025-05-15	3D-Fixup: Advancing Photo Editing with 3D Priors	Yen-Chi Cheng et.al.	2505.10566	null
2025-05-15	Does Feasibility Matter? Understanding the Impact of Feasibility on Synthetic Training Data	Yiwen Liu et.al.	2505.10551	link
2025-05-15	On the Foundations of the Design-Based Approach	P. M. Aronow et.al.	2505.10519	null
2025-05-15	Learning Nonlinear Dynamics in Physical Modelling Synthesis using Neural Ordinary Differential Equations	Victor Zheleznov et.al.	2505.10511	link
2025-05-15	RouteNator: A Router-Based Multi-Modal Architecture for Generating Synthetic Training Data for Function Calling LLMs	Vibha Belavadi et.al.	2505.10495	null
2025-05-15	Causal discovery on vector-valued variables and consistency-guided aggregation	Urmi Ninad et.al.	2505.10476	null
2025-05-15	Uncovering Magnetic Phases with Synthetic Data and Physics-Informed Training	Agustin Medina et.al.	2505.10393	null
2025-05-15	Defect Detection in Photolithographic Patterns Using Deep Learning Models Trained on Synthetic Data	Prashant P. Shinde et.al.	2505.10192	null
2025-05-15	Mining Hidden Thoughts from Texts: Evaluating Continual Pretraining with Synthetic Data for LLM Reasoning	Yoichi Ishibashi et.al.	2505.10182	null
2025-05-15	Data-driven discovery of the equations of turbulent convection	Christopher J. Wareing et.al.	2505.10109	null
2025-05-15	ChronoSteer: Bridging Large Language Model and Time Series Foundation Model via Synthetic Data	Chengsen Wang et.al.	2505.10083	null
2025-05-15	Sybil-based Virtual Data Poisoning Attacks in Federated Learning	Changxun Zhu et.al.	2505.09983	null
2025-05-14	ZENN: A Thermodynamics-Inspired Computational Framework for Heterogeneous Data-Driven Modeling	Shun Wang et.al.	2505.09851	null
2025-05-14	Self-Consuming Generative Models with Adversarially Curated Data	Xiukun Wei et.al.	2505.09768	null
2025-05-14	Robust Federated Learning with Confidence-Weighted Filtering and GAN-Based Completion under Noisy and Incomplete Data	Alpaslan Gokcen et.al.	2505.09733	null
2025-05-14	Real2Render2Real: Scaling Robot Data Without Dynamics Simulation or Robot Hardware	Justin Yu et.al.	2505.09601	null
2025-05-14	Online Isolation Forest	Filippo Leveni et.al.	2505.09593	link
2025-05-14	DESI DR1 Lyα 1D power spectrum: The Fast Fourier Transform estimator measurement	Corentin Ravoux et.al.	2505.09493	null
2025-05-14	Test-Time Augmentation for Pose-invariant Face Recognition	Jaemin Jung et.al.	2505.09256	null
2025-05-14	Zero-shot Quantization: A Comprehensive Survey	Minjun Kim et.al.	2505.09188	null
2025-05-13	Fully Dynamic Euclidean Bi-Chromatic Matching in Sublinear Update Time	Gramoz Goranci et.al.	2505.09010	null
2025-05-13	Predictive Digital Twins with Quantified Uncertainty for Patient-Specific Decision Making in Oncology	Graham Pash et.al.	2505.08927	link
2025-05-13	Grounding Synthetic Data Evaluations of Language Models in Unsupervised Document Corpora	Michael Majurski et.al.	2505.08905	link
2025-05-13	Optimized Couplings for Watermarking Large Language Models	Dor Tsur et.al.	2505.08878	link
2025-05-13	Generative AI for Autonomous Driving: Frontiers and Opportunities	Yuping Wang et.al.	2505.08854	link
2025-05-13	Modelling the impact of quasar redshift errors on the full-shape analysis of correlations in the Lyman- $α$ forest	Calum Gordon et.al.	2505.08789	null
2025-05-13	The Open Molecules 2025 (OMol25) Dataset, Evaluations, and Models	Daniel S. Levine et.al.	2505.08762	null
2025-05-13	Big Data and the Computational Social Science of Entrepreneurship and Innovation	Ningzi Li et.al.	2505.08706	null
2025-05-13	Boosting Zero-shot Stereo Matching using Large-scale Mixed Images Sources in the Real World	Yuran Wang et.al.	2505.08607	null
2025-05-15	Leveraging Multi-Modal Information to Enhance Dataset Distillation	Zhe Li et.al.	2505.08605	null
2025-05-13	Achieving Scalable Robot Autonomy via neurosymbolic planning using lightweight local LLM	Nicholas Attolino et.al.	2505.08492	link
2025-05-13	An adaptive sampling algorithm for data-generation to build a data-manifold for physical problem surrogate modeling	Chetra Mang et.al.	2505.08487	null
2025-05-13	Localization of Impacts on Thin-Walled Structures by Recurrent Neural Networks: End-to-end Learning from Real-World Data	Alexander Humer et.al.	2505.08362	null
2025-05-13	Community Detection on Noisy Stochastic Block Models	Washieu Anan et.al.	2505.08251	link
2025-05-13	Privacy-Preserving Analytics for Smart Meter (AMI) Data: A Hybrid Approach to Comply with CPUC Privacy Regulations	Benjamin Westrich et.al.	2505.08237	null
2025-05-12	Fréchet Power-Scenario Distance: A Metric for Evaluating Generative AI Models across Multiple Time-Scales in Smart Grids	Yuting Cai et.al.	2505.08082	null
2025-05-12	Robust Kidney Abnormality Segmentation: A Validation Study of an AI-Based Framework	Sarah de Boer et.al.	2505.07573	null
2025-05-12	SynID: Passport Synthetic Dataset for Presentation Attack Detection	Juan E. Tapia et.al.	2505.07540	null
2025-05-12	ToolACE-DEV: Self-Improving Tool Learning via Decomposition and EVolution	Xu Huang et.al.	2505.07512	null
2025-05-12	Characterizing 3D Magnetic Fields and Turbulence in H I Clouds	Yue Hu et.al.	2505.07422	null
2025-05-12	Synthetic Code Surgery: Repairing Bugs and Vulnerabilities with LLMs and Synthetic Data	David de-Fitero-Dominguez et.al.	2505.07372	null
2025-05-12	GAN-based synthetic FDG PET images from T1 brain MRI can serve to improve performance of deep unsupervised anomaly detection models	Daria Zotova et.al.	2505.07364	null
2025-05-12	Synthetic Similarity Search in Automotive Production	Christoph Huber et.al.	2505.07256	null
2025-05-12	Spatial Confounding in Multivariate Areal Data Analysis	Kyle Lin Wu et.al.	2505.07232	link
2025-05-12	Measuring General Intelligence with Generated Games	Vivek Verma et.al.	2505.07215	link
2025-05-12	Structural Entropy Guided Agent for Detecting and Repairing Knowledge Deficiencies in LLMs	Yifan Wei et.al.	2505.07184	link
2025-05-12	Causal View of Time Series Imputation: Some Identification Results on Missing Mechanism	Ruichu Cai et.al.	2505.07180	null
2025-05-12	Skull stripping with purely synthetic data	Jong Sung Park et.al.	2505.07159	link
2025-05-11	Bi-directional Self-Registration for Misaligned Infrared-Visible Image Fusion	Timing Li et.al.	2505.06920	null
2025-05-11	Uni-AIMS: AI-Powered Microscopy Image Analysis	Yanhui Hong et.al.	2505.06918	null
2025-05-11	Building a Human-Verified Clinical Reasoning Dataset via a Human LLM Hybrid Pipeline for Trustworthy Medical AI	Chao Ding et.al.	2505.06912	null
2025-05-09	DiffLocks: Generating 3D Hair from a Single Image using Diffusion Models	Radu Alexandru Rosu et.al.	2505.06166	null
2025-05-09	Towards Better Cephalometric Landmark Detection with Diffusion Data Generation	Dongqian Guo et.al.	2505.06055	null
2025-05-09	Leveraging Vision-Language Models for Visual Grounding and Analysis of Automotive UI	Benjamin Raphael Ernhofer et.al.	2505.05895	link
2025-05-08	Synthetic Training and Representation Bridging in Reconstruction Domains	Wonyong Chung et.al.	2505.05664	null
2025-05-08	Guidance for Intra-cardiac Echocardiography Manipulation to Maintain Continuous Therapy Device Tip Visibility	Jaeyoung Huh et.al.	2505.05518	null
2025-05-08	SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation	Yonwoo Choi et.al.	2505.05475	link
2025-05-08	Hide & Seek: Transformer Symmetries Obscure Sharpness & Riemannian Geometry Finds It	Marvin F. da Silva et.al.	2505.05409	null
2025-05-08	DispBench: Benchmarking Disparity Estimation to Synthetic Corruptions	Shashank Agnihotri et.al.	2505.05091	link
2025-05-08	Generating Reliable Synthetic Clinical Trial Data: The Role of Hyperparameter Optimization and Domain Constraints	Waldemar Hahn et.al.	2505.05019	null
2025-05-08	Boosting Statistic Learning with Synthetic Data from Pretrained Large Models	Jialong Jiang et.al.	2505.04992	null
2025-05-08	Canny2Palm: Realistic and Controllable Palmprint Generation for Large-scale Pre-training	Xingzeng Lan et.al.	2505.04922	null
2025-05-07	Are Synthetic Corruptions A Reliable Proxy For Real-World Corruptions?	Shashank Agnihotri et.al.	2505.04835	link
2025-05-09	Replay to Remember (R2R): An Efficient Uncertainty-driven Unsupervised Continual Learning Framework Using Generative Replay	Sriram Mandalika et.al.	2505.04787	null
2025-05-07	REVEAL: Multi-turn Evaluation of Image-Input Harms for Vision LLM	Madhur Jindal et.al.	2505.04673	link
2025-05-07	AI-Generated Fall Data: Assessing LLMs and Diffusion Model for Wearable Fall Detection	Sana Alamgeer et.al.	2505.04660	null
2025-05-07	RAFT: Robust Augmentation of FeaTures for Image Segmentation	Edward Humes et.al.	2505.04529	null
2025-05-07	Efficient Flow Matching using Latent Variables	Anirban Samaddar et.al.	2505.04486	null
2025-05-07	Estimating Causal Effects in Networks with Cluster-Based Bandits	Ahmed Sayeed Faruk et.al.	2505.04200	null
2025-05-07	Advancing and Benchmarking Personalized Tool Invocation for LLMs	Xu Huang et.al.	2505.04072	link
2025-05-07	Tensor robust principal component analysis via the tensor nuclear over Frobenius norm	Huiwen Zheng et.al.	2505.04063	null
2025-05-06	PARC: Physics-based Augmentation with Reinforcement Learning for Character Controllers	Michael Xu et.al.	2505.04002	null
2025-05-06	Improving Failure Prediction in Aircraft Fastener Assembly Using Synthetic Data in Imbalanced Datasets	Gustavo J. G. Lahr et.al.	2505.03917	null
2025-05-06	Decentralized Nonconvex Optimization under Heavy-Tailed Noise: Normalization and Optimal Convergence	Shuhua Yu et.al.	2505.03736	null
2025-05-06	Bounding Box-Guided Diffusion for Synthesizing Industrial Images and Segmentation Map	Alessandro Simoni et.al.	2505.03623	link
2025-05-06	Decision Making under Model Misspecification: DRO with Robust Bayesian Ambiguity Sets	Charita Dellaporta et.al.	2505.03585	null
2025-05-06	Information-theoretic reduction of deep neural networks to linear models in the overparametrized proportional regime	Francesco Camilli et.al.	2505.03577	null
2025-05-06	Generating Synthetic Data via Augmentations for Improved Facial Resemblance in DreamBooth and InstantID	Koray Ulusan et.al.	2505.03557	null
2025-05-06	Improving Omics-Based Classification: The Role of Feature Selection and Synthetic Data Generation	Diego Perazzolo et.al.	2505.03387	null
2025-05-06	Synthline: A Product Line Approach for Synthetic Requirements Engineering Data Generation using Large Language Models	Abdelkarim El-Hajjami et.al.	2505.03265	link
2025-05-06	GraspVLA: a Grasping Foundation Model Pre-trained on Billion-scale Synthetic Action Data	Shengliang Deng et.al.	2505.03233	null
2025-05-06	Convergence Of Consistency Model With Multistep Sampling Under General Data Assumptions	Yiding Chen et.al.	2505.03194	null
2025-05-05	Improving Model Alignment Through Collective Intelligence of Open-Source LLMS	Junlin Wang et.al.	2505.03059	null
2025-05-05	A Typology of Synthetic Datasets for Dialogue Processing in Clinical Contexts	Steven Bedrick et.al.	2505.03025	null
2025-05-05	A robust neural determination of the source-count distribution of the Fermi-LAT sky at high latitudes	Christopher Eckner et.al.	2505.02906	link
2025-05-05	Cooperative Bayesian and variance networks disentangle aleatoric and epistemic uncertainties	Jiaxiang Yi et.al.	2505.02743	null
2025-05-06	A Note on Statistically Accurate Tabular Data Generation Using Large Language Models	Andrey Sidorenko et.al.	2505.02659	link
2025-05-05	Sim2Real in endoscopy segmentation with a novel structure aware image translation	Clara Tomasini et.al.	2505.02654	link
2025-05-05	Bemba Speech Translation: Exploring a Low-Resource African Language	Muhammad Hazim Al Farouq et.al.	2505.02518	null
2025-05-05	Data Augmentation With Back translation for Low Resource languages: A case of English and Luganda	Richard Kimera et.al.	2505.02463	null
2025-05-05	T2S: High-resolution Time Series Generation with Text-to-Series Diffusion Models	Yunfeng Ge et.al.	2505.02417	link
2025-05-04	Improving Physical Object State Representation in Text-to-Image Generative Systems	Tianle Chen et.al.	2505.02236	link
2025-05-04	Heterogeneous Trader Responses to Macroeconomic Surprises: Simulating Order Flow Dynamics	Haochuan Wang et.al.	2505.01962	null
2025-05-03	BOOM: Benchmarking Out-Of-distribution Molecular Property Predictions of Machine Learning Models	Evan R. Antoniuk et.al.	2505.01912	null
2025-05-03	PhytoSynth: Leveraging Multi-modal Generative Models for Crop Disease Data Generation with Novel Benchmarking and Prompt Engineering Approach	Nitin Rai et.al.	2505.01823	null
2025-05-03	$\textit{New News}$ : System-2 Fine-tuning for Robust Integration of New Knowledge	Core Francisco Park et.al.	2505.01812	null
2025-05-03	SimAug: Enhancing Recommendation with Pretrained Language Models for Dense and Balanced Data Augmentation	Yuying Zhao et.al.	2505.01695	link
2025-05-02	Always Tell Me The Odds: Fine-grained Conditional Probability Estimation	Liaoyaqi Wang et.al.	2505.01595	null
2025-05-02	The DCR Delusion: Measuring the Privacy Risk of Synthetic Data	Zexi Yao et.al.	2505.01524	null
2025-05-02	Any-to-Any Vision-Language Model for Multimodal X-ray Imaging and Radiological Report Generation	Daniele Molino et.al.	2505.01091	null
2025-05-02	Synthesize-on-Graph: Knowledgeable Synthetic Data Generation for Continue Pre-training of Large Language Models	Xuhui Jiang et.al.	2505.00979	null
2025-05-01	NeMo-Inspector: A Visualization Tool for LLM Generation Analysis	Daria Gitman et.al.	2505.00903	link
2025-05-01	The Comparability of Model Fusion to Measured Data in Confuser Rejection	Conor Flynn et.al.	2505.00836	null
2025-05-01	Open-Source LLM-Driven Federated Transformer for Predictive IoV Management	Yazan Otoum et.al.	2505.00651	null
2025-05-01	Synthesizing and Identifying Noise Levels in Autonomous Vehicle Camera Radar Datasets	Mathis Morales et.al.	2505.00584	null
2025-05-01	Fast Azimuthally Anisotropic 3D Radon Transform by Generalized Fourier Slice Theorem	Ahmadreza Mokhtari et.al.	2505.00387	null
2025-05-01	KoACD: The First Korean Adolescent Dataset for Cognitive Distortion Analysis	JunSeo Kim et.al.	2505.00367	null
2025-05-01	SacFL: Self-Adaptive Federated Continual Learning for Resource-Constrained End Devices	Zhengyi Zhong et.al.	2505.00365	link
2025-05-01	Conformal changepoint localization	Sanjit Dandapanthula et.al.	2505.00292	link
2025-05-01	Policy Learning with $α$ -Expected Welfare	Yanqin Fan et.al.	2505.00256	null
2025-04-30	Generative Machine Learning in Adaptive Control of Dynamic Manufacturing Processes: A Review	Suk Ki Lee et.al.	2505.00210	null
2025-04-30	Direct Motion Models for Assessing Generated Videos	Kelsey Allen et.al.	2505.00209	null
2025-04-30	Polka-dotted Stars: a Hierarchical Model for Mapping Stellar Surfaces Using Occultation Light Curves and the Case of TOI-3884	Sabina Sagynbayeva et.al.	2504.21852	null
2025-04-30	Parameter Inference of Black Hole Images using Deep Learning in Visibility Space	Franc O et.al.	2504.21840	null
2025-05-01	How Real Are Synthetic Therapy Conversations? Evaluating Fidelity in Prolonged Exposure Dialogues	Suhas BN et.al.	2504.21800	null
2025-04-30	On the Robustness of Mixture Models in the Presence of Hidden Markov Regimes with Covariate-Dependent Transition Probabilities	Demian Pouzo et.al.	2504.21669	null
2025-04-30	Quantitative Auditing of AI Fairness with Differentially Private Synthetic Data	Chih-Cheng Rex Yuan et.al.	2504.21634	null
2025-04-30	Improving Informally Romanized Language Identification	Adrian Benton et.al.	2504.21540	null
2025-04-30	IDDM: Bridging Synthetic-to-Real Domain Gap from Physics-Guided Diffusion for Real-world Image Dehazing	Shijun Zhou et.al.	2504.21385	null
2025-04-30	CMD: Constraining Multimodal Distribution for Domain Adaptation in Stereo Matching	Zhelun Shen et.al.	2504.21302	null
2025-04-30	Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math	Haoran Xu et.al.	2504.21233	null
2025-04-29	Artificial Intelligence for Personalized Prediction of Alzheimer’s Disease Progression: A Survey of Methods, Data Challenges, and Future Directions	Gulsah Hancerliogullari Koksalmis et.al.	2504.21189	null
2025-04-29	SMOGAN: Synthetic Minority Oversampling with GAN Refinement for Imbalanced Regression	Shayan Alahyari et.al.	2504.21152	null
2025-04-29	Leveraging Generative AI Through Prompt Engineering and Rigorous Validation to Create Comprehensive Synthetic Datasets for AI Training in Healthcare	Polycarp Nalela et.al.	2504.20921	null
2025-04-29	Evaluating Generative Models for Tabular Data: Novel Metrics and Benchmarking	Dayananda Herurkar et.al.	2504.20900	null
2025-04-29	DP-SMOTE: Integrating Differential Privacy and Oversampling Technique to Preserve Privacy in Smart Homes	Amr Tarek Elsayed et.al.	2504.20827	null
2025-04-29	Influence network reconstruction from discrete time-series of count data modelled by multidimensional Hawkes processes	Naratip Santitissadeekorn et.al.	2504.20758	null
2025-04-29	Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers	Roman Abramov et.al.	2504.20752	null
2025-04-29	What’s Wrong with Your Synthetic Tabular Data? Using Explainable AI to Evaluate Generative Models	Jan Kapar et.al.	2504.20687	link
2025-04-29	OG-HFYOLO :Orientation gradient guidance and heterogeneous feature fusion for deformation table cell instance segmentation	Long Liu et.al.	2504.20682	link
2025-04-29	SpaRE: Enhancing Spatial Reasoning in Vision-Language Models with Synthetic Data	Michael Ogezi et.al.	2504.20648	null
2025-04-29	Bridging the Generalisation Gap: Synthetic Data Generation for Multi-Site Clinical Model Validation	Bradley Segal et.al.	2504.20635	link
2025-04-29	ReasonIR: Training Retrievers for Reasoning Tasks	Rulin Shao et.al.	2504.20595	link
2025-04-29	Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object Perception	Yuanchen Wu et.al.	2504.20468	link
2025-04-29	RV-Syn: Rational and Verifiable Mathematical Reasoning Data Synthesis based on Structured Function Library	Jiapeng Wang et.al.	2504.20426	null
2025-04-29	Enhancing Leakage Attacks on Searchable Symmetric Encryption Using LLM-Based Synthetic Data Generation	Joshua Chiu et.al.	2504.20414	link
2025-04-29	GarmentX: Autoregressive Parametric Representations for High-Fidelity 3D Garment Generation	Jingfeng Guo et.al.	2504.20409	null
2025-04-29	FiLA-Video: Spatio-Temporal Compression for Fine-Grained Long Video Understanding	Yanan Guo et.al.	2504.20384	null
2025-04-28	Towards Ball Spin and Trajectory Analysis in Table Tennis Broadcast Videos via Physically Grounded Synthetic-to-Real Transfer	Daniel Kienzle et.al.	2504.19863	link
2025-04-28	Pixels2Points: Fusing 2D and 3D Features for Facial Skin Segmentation	Victoria Yue Chen et.al.	2504.19718	null
2025-04-28	From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review	Mohamed Amine Ferrag et.al.	2504.19678	null
2025-04-28	Annif at SemEval-2025 Task 5: Traditional XMTC augmented by LLMs	Osma Suominen et.al.	2504.19675	link
2025-04-28	Transformation & Translation Occupancy Grid Mapping: 2-Dimensional Deep Learning Refined SLAM	Leon Davies et.al.	2504.19654	null
2025-04-28	Topological derivative for a fast identification of short, linear perfectly conducting cracks with inaccurate background information	Won-Kwang Park et.al.	2504.19485	null
2025-04-29	Unified Multi-Task Learning & Model Fusion for Efficient Language Model Guardrailing	James O’ Neill et.al.	2504.19333	null
2025-04-27	Efficient Serverless Cold Start: Reducing Library Loading Overhead by Profile-guided Optimization	Syed Salauddin Mohammad Tariq et.al.	2504.19283	null
2025-04-27	Anyprefer: An Agentic Framework for Preference Data Synthesis	Yiyang Zhou et.al.	2504.19276	null
2025-04-27	A tissue-informed deep learning-based method for positron range correction in preclinical 68Ga PET imaging	Nerea Encina-Baranda et.al.	2504.19175	null
2025-04-26	RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning	Haoran Geng et.al.	2504.18904	null
2025-04-26	Nonconvex Linear System Identification with Minimal State Representation	Uday Kiran Reddy Tadipatri et.al.	2504.18791	null
2025-04-26	SynLexLM: Scaling Legal LLMs with Synthetic Data and Curriculum Learning	Ojasw Upadhyay et.al.	2504.18762	null
2025-04-25	A Unified MDL-based Binning and Tensor Factorization Framework for PDF Estimation	Mustafa Musab et.al.	2504.18686	null
2025-04-25	Weighing neutrinos with 21cm Intensity Mapping at the SKAO	Gabriele Autieri et.al.	2504.18625	null
2025-04-25	DeSIA: Attribute Inference Attacks Against Limited Fixed Aggregate Statistics	Yifeng Mao et.al.	2504.18497	null
2025-04-25	Enhancing Strawberry Yield Forecasting with Backcasted IoT Sensor Data and Machine Learning	Tewodros Alemu Ayall et.al.	2504.18451	null
2025-04-25	LaRI: Layered Ray Intersections for Single-view 3D Geometric Reasoning	Rui Li et.al.	2504.18424	null
2025-04-25	Unsupervised Visual Chain-of-Thought Reasoning via Preference Optimization	Kesen Zhao et.al.	2504.18397	link
2025-04-24	Fast Autoregressive Models for Continuous Latent Generation	Tiankai Hang et.al.	2504.18391	null
2025-04-25	Enhancing Long-Term Re-Identification Robustness Using Synthetic Data: A Comparative Analysis	Christian Pionzewski et.al.	2504.18286	null
2025-04-25	What is the Added Value of UDA in the VFM Era?	Brunó B. Englert et.al.	2504.18190	null
2025-04-25	Think, Prune, Train, Improve: Scaling Reasoning without Scaling Models	Caia Costello et.al.	2504.18116	null
2025-04-25	RL-Driven Data Generation for Robust Vision-Based Dexterous Grasping	Atsushi Kanehira et.al.	2504.18084	null
2025-04-25	HOSVD-SR: A Physics-Based Deep Learning Framework for Super-Resolution in Fluid Dynamics	Guillermo Barragán et.al.	2504.17994	null
2025-04-25	From Mapping to Composing: A Two-Stage Framework for Zero-shot Composed Image Retrieval	Yabing Wang et.al.	2504.17990	null
2025-04-24	Bernstein Polynomial Processes for Continuous Time Change Detection	Dan Cunha et.al.	2504.17876	null
2025-04-28	Step1X-Edit: A Practical Framework for General Image Editing	Shiyu Liu et.al.	2504.17761	link
2025-04-24	Effortless, Simulation-Efficient Bayesian Inference using Tabular Foundation Models	Julius Vetter et.al.	2504.17660	null
2025-04-24	TarDiff: Target-Oriented Diffusion Guidance for Synthetic Electronic Health Record Time Series Generation	Bowen Deng et.al.	2504.17613	null
2025-04-24	When Does Metadata Conditioning (NOT) Work for Language Model Pre-Training? A Study with Context-Free Grammars	Rei Higuchi et.al.	2504.17562	null
2025-04-24	A decision support system for optimised industrial water management	Stavros Vatikiotis et.al.	2504.17469	null
2025-04-24	Doubly Adaptive Social Learning	Marco Carpentiero et.al.	2504.17370	null
2025-04-24	Synthetic Power Flow Data Generation Using Physics-Informed Denoising Diffusion Probabilistic Models	Junfei Wang et.al.	2504.17210	null
2025-04-24	High-Fidelity And Complex Test Data Generation For Real-World SQL Code Generation Services	Shivasankari Kannan et.al.	2504.17203	null
2025-04-23	Statistical Guarantees in Synthetic Data through Conformal Adversarial Generation	Rahul Vishwakarma et.al.	2504.17058	null
2025-04-23	High-Quality Cloud-Free Optical Image Synthesis Using Multi-Temporal SAR and Contaminated Optical Data	Chenxi Duan et.al.	2504.16870	null
2025-04-23	Emo Pillars: Knowledge Distillation to Support Fine-Grained Context-Aware and Context-Less Emotion Classification	Alexander Shvets et.al.	2504.16856	null
2025-04-23	Gaussian Splatting is an Effective Data Generator for 3D Object Detection	Farhad G. Zanjani et.al.	2504.16740	null
2025-04-24	V $^2$ R-Bench: Holistically Evaluating LVLM Robustness to Fundamental Visual Variations	Zhiyuan Fan et.al.	2504.16727	null
2025-04-23	Case Study: Fine-tuning Small Language Models for Accurate and Private CWE Detection in Python Code	Md. Azizul Hakim Bappy et.al.	2504.16584	null
2025-04-23	Unified Molecule Generation and Property Prediction	Adam Izdebski et.al.	2504.16559	null
2025-04-23	A Comprehensive Survey of Synthetic Tabular Data Generation	Ruxue Shi et.al.	2504.16506	link
2025-04-23	Private Federated Learning using Preference-Optimized Synthetic Data	Charlie Hou et.al.	2504.16438	link
2025-04-23	Towards a fast and robust deep hedging approach	Fabienne Schmid et.al.	2504.16436	null
2025-04-23	Advancing Radar Hand Gesture Recognition: A Hybrid Spectrum Synthetic Framework Merging Simulation with Neural Networks	Jiaqi Tang et.al.	2504.16423	null
2025-04-23	Evaluating Multi-Hop Reasoning in Large Language Models: A Chemistry-Centric Case Study	Mohammad Khodadad et.al.	2504.16414	null
2025-04-23	Covariate-dependent Graphical Model Estimation via Neural Networks with Statistical Guarantees	Jiahe Lin et.al.	2504.16356	null
2025-04-23	ClarifyCoder: Clarification-Aware Fine-Tuning for Programmatic Problem Solving	Jie JW Wu et.al.	2504.16331	null
2025-04-22	Accounting for spillover when using the augmented synthetic control method: estimating the effect of localized COVID-19 lockdowns in Chile	Taylor Krajewski et.al.	2504.16244	null
2025-04-22	Explainable Unsupervised Anomaly Detection with Random Forest	Joshua S. Harvey et.al.	2504.16075	null
2025-04-22	Honey, I Shrunk the Language Model: Impact of Knowledge Distillation Methods on Performance and Explainability	Daniel Hendriks et.al.	2504.16056	null
2025-04-22	Modeling and Forecasting Realized Volatility with Multivariate Fractional Brownian Motion	Markus Bibinger et.al.	2504.15985	null
2025-04-22	Quantum machine learning advantages beyond hardness of evaluation	Riccardo Molteni et.al.	2504.15964	null
2025-04-22	Consistent Causal Inference of Group Effects in Non-Targeted Trials with Finitely Many Effect Levels	Georgios Mavroudeas et.al.	2504.15854	null
2025-04-22	Monte Carlo simulation of GRB data to test Lorentz-invariance violation	Hanlin Song et.al.	2504.15685	null
2025-04-22	A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment	Kun Wang et.al.	2504.15585	null
2025-04-22	Instruction-Tuning Data Synthesis from Scratch via Web Reconstruction	Yuxin Jiang et.al.	2504.15573	link
2025-04-21	A dual-stage constitutive modeling framework based on finite strain data-driven identification and physics-augmented neural networks	Lennart Linden et.al.	2504.15492	null
2025-04-21	From Reviews to Dialogues: Active Synthesis for Zero-Shot LLM-based Conversational Recommender System	Rohan Surana et.al.	2504.15476	null
2025-04-21	Feeding LLM Annotations to BERT Classifiers at Your Own Risk	Yucheng Lu et.al.	2504.15432	null
2025-04-21	IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs	David Ma et.al.	2504.15415	link
2025-04-21	On optimality and bounds for internal solutions generated from boundary data-driven Gramians	V. Druskin et.al.	2504.15407	null
2025-04-21	MirrorVerse: Pushing Diffusion Models to Realistically Reflect the World	Ankit Dhiman et.al.	2504.15397	null
2025-04-21	FLARE: Feature-based Lightweight Aggregation for Robust Evaluation of IoT Intrusion Detection	Bradley Boswell et.al.	2504.15375	null
2025-04-21	Diffusion Bridge Models for 3D Medical Image Translation	Shaorong Zhang et.al.	2504.15267	null
2025-04-21	Bringing Diversity from Diffusion Models to Semantic-Guided Face Asset Generation	Yunxuan Cai et.al.	2504.15259	null
2025-04-21	MR. Guard: Multilingual Reasoning Guardrail using Curriculum Learning	Yahan Yang et.al.	2504.15241	null
2025-04-21	Predicting Methane Adsorption in Metal-Substituted MOFs: A Comparative Study between Density Functional Theory and Machine Learning	Karim Aljamal et.al.	2504.15034	null
2025-04-22	clusttraj: A Solvent-Informed Clustering Tool for Molecular Modeling	Rafael Bicudo Ribeiro et.al.	2504.14978	link
2025-04-21	SuFIA-BC: Generating High Quality Demonstration Data for Visuomotor Policy Learning in Surgical Subtasks	Masoud Moghani et.al.	2504.14857	null
2025-04-21	Transparentize the Internal and External Knowledge Utilization in LLMs with Trustworthy Citation	Jiajun Shen et.al.	2504.14856	null
2025-04-21	Aligning Beam with Imbalanced Multi-modality: A Generative Federated Learning Approach	Jiahui Liang et.al.	2504.14835	null
2025-04-21	Novel Concept-Oriented Synthetic Data approach for Training Generative AI-Driven Crystal Grain Analysis Using Diffusion Model	Ahmed Sobhi Saleh et.al.	2504.14782	null
2025-04-20	A Case Study Exploring the Current Landscape of Synthetic Medical Record Generation with Commercial LLMs	Yihan Lin et.al.	2504.14657	null
2025-04-20	AlphaZero-Edu: Making AlphaZero Accessible to Everyone	Binjie Guo et.al.	2504.14636	link
2025-04-20	Learning from Reasoning Failures via Synthetic Data Generation	Gabriela Ben Melech Stan et.al.	2504.14523	null
2025-04-20	Less is More: Adaptive Coverage for Synthetic Training Data	Sasan Tavakkol et.al.	2504.14508	null
2025-04-20	DialogueAgents: A Hybrid Agent-Based Speech Synthesis Framework for Multi-Party Dialogue	Xiang Li et.al.	2504.14482	link
2025-04-20	Causal Disentanglement for Robust Long-tail Medical Image Generation	Weizhi Nie et.al.	2504.14450	null
2025-04-18	Leveraging Automatic CAD Annotations for Supervised Learning in 3D Scene Understanding	Yuchen Rao et.al.	2504.13580	link
2025-04-18	Using Machine Learning and Neural Networks to Analyze and Predict Chaos in Multi-Pendulum and Chaotic Systems	Vasista Ramachandruni et.al.	2504.13453	null
2025-04-17	PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding	Jang Hyun Cho et.al.	2504.13180	link
2025-04-17	AerialMegaDepth: Learning Aerial-Ground Reconstruction and View Synthesis	Khiem Vuong et.al.	2504.13157	null
2025-04-17	$\texttt{Complex-Edit}$ : CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark	Siwei Yang et.al.	2504.13143	null
2025-04-17	Effective Dual-Region Augmentation for Reduced Reliance on Large Amounts of Labeled Data	Prasanna Reddy Pulakurthi et.al.	2504.13077	link
2025-04-17	Imaging for All-Day Wearable Smart Glasses	Michael Goesele et.al.	2504.13060	null
2025-04-17	ChatEXAONEPath: An Expert-level Multimodal Large Language Model for Histopathology Using Whole Slide Images	Sangwook Kim et.al.	2504.13023	null
2025-04-17	MAIN: Mutual Alignment Is Necessary for instruction tuning	Fanyi Yang et.al.	2504.12913	null
2025-04-17	Hardware Implementation of Tunable Fractional-Order Capacitors by Morphogenesis of Conducting Polymer Dendrites	Antoine Baron et.al.	2504.12861	null
2025-04-17	When do Random Forests work?	C. Revelas et.al.	2504.12860	null
2025-04-17	Enhancing the Geometric Problem-Solving Ability of Multimodal LLMs via Symbolic-Neural Integration	Yicheng Pan et.al.	2504.12773	link
2025-04-17	Incorporating a Deep Neural Network into Moving Horizon Estimation for Embedded Thermal Torque Derating of an Electric Machine	Alexander Winkler et.al.	2504.12736	null
2025-04-17	Data-efficient LLM Fine-tuning for Code Generation	Weijie Lv et.al.	2504.12687	link
2025-04-17	Scaling Instruction-Tuned LLMs to Million-Token Contexts via Hierarchical Synthetic Data Generation	Linda He et.al.	2504.12637	null
2025-04-17	MetaSynth: Meta-Prompting-Driven Agentic Scaffolds for Diverse Synthetic Data Generation	Haris Riaz et.al.	2504.12563	null
2025-04-17	ELAB: Extensive LLM Alignment Benchmark in Persian Language	Zahra Pourbahman et.al.	2504.12553	null
2025-04-16	Towards Realistic Low-Light Image Enhancement via ISP Driven Data Modeling	Zhihua Wang et.al.	2504.12204	link
2025-04-16	Towards a General-Purpose Zero-Shot Synthetic Low-Light Image and Video Pipeline	Joanne Lin et.al.	2504.12169	null
2025-04-16	Deep Generative Models for Bayesian Inference on High-Rate Sensor Data: Applications in Automotive Radar and Medical Imaging	Tristan S. W. Stevens et.al.	2504.12154	null
2025-04-16	Towards LLM Agents for Earth Observation	Chia Hsiang Kao et.al.	2504.12110	null
2025-04-17	Securing the Skies: A Comprehensive Survey on Anti-UAV Methods, Benchmarking, and Future Directions	Yifei Dong et.al.	2504.11967	null
2025-04-16	Rethinking the Generation of High-Quality CoT Data from the Perspective of LLM-Adaptive Question Difficulty Grading	Qianjin Yu et.al.	2504.11919	null
2025-04-16	Synthetic Data for Blood Vessel Network Extraction	Joël Mathys et.al.	2504.11858	null
2025-04-16	A cautionary note for plasmode simulation studies in the setting of causal inference	Pamela A Shaw et.al.	2504.11740	null
2025-04-16	Learning What NOT to Count	Adriano D’Alessandro et.al.	2504.11705	null
2025-04-15	Probabilistic causal graphs as categorical data synthesizers: Do they do better than Gaussian Copulas and Conditional Tabular GANs?	Olha Shaposhnyk et.al.	2504.11547	null
2025-04-17	REAL: Benchmarking Autonomous Agents on Deterministic Simulations of Real Websites	Divyansh Garg et.al.	2504.11543	link
2025-04-17	ReTool: Reinforcement Learning for Strategic Tool Use in LLMs	Jiazhan Feng et.al.	2504.11536	link
2025-04-15	Posterior Consistency in Parametric Models via a Tighter Notion of Identifiability	Nicola Bariletto et.al.	2504.11360	null
2025-04-15	Looking beyond the next token	Abitha Thankaraj et.al.	2504.11336	null
2025-04-17	UI-E2I-Synth: Advancing GUI Grounding with Large-Scale Instruction Synthesis	Xinyi Liu et.al.	2504.11257	null
2025-04-15	Divergence of Empirical Neural Tangent Kernel in Classification Problems	Zixiong Yu et.al.	2504.11130	null
2025-04-15	$R$ -matrix type parametrization of the Jost function for extracting the resonance parameters from scattering data	P. Vaandrager et.al.	2504.11129	null
2025-04-15	Leveraging Vertical Public-Private Split for Improved Synthetic Data Generation	Samuel Maddock et.al.	2504.10987	null
2025-04-15	Safe-Construct: Redefining Construction Safety Violation Recognition as 3D Multi-View Engagement Task	Aviral Chharia et.al.	2504.10880	null
2025-04-15	ZeroGrasp: Zero-Shot Shape Reconstruction Enabled Robotic Grasping	Shun Iwase et.al.	2504.10857	null
2025-04-15	E2E Parking Dataset: An Open Benchmark for End-to-End Autonomous Parking	Kejia Gao et.al.	2504.10812	null
2025-04-14	PlantD: Performance, Latency ANalysis, and Testing for Data Pipelines – An Open Source Measurement, Testing, and Simulation Framework	Christopher Bogart et.al.	2504.10692	null
2025-04-14	H-MoRe: Learning Human-centric Motion Representation for Action Analysis	Zhanbo Huang et.al.	2504.10676	link
2025-04-14	Cross terms and monochromatic gravitational wave sources in our Galactic Centre	Pau Amaro Seoane et.al.	2504.10594	null
2025-04-14	Decoupled Diffusion Sparks Adaptive Scene Generation	Yunsong Zhou et.al.	2504.10485	null
2025-04-14	Quantum Liouvillian Tomography	Diogo Aguiar et.al.	2504.10393	null
2025-04-16	Heimdall: test-time scaling on the generative verification	Wenlei Shi et.al.	2504.10337	null
2025-04-14	Deep Reasoning Translation via Reinforcement Learning	Jiaan Wang et.al.	2504.10187	link
2025-04-14	Negate or Embrace: On How Misalignment Shapes Multimodal Representation Learning	Yichao Cai et.al.	2504.10143	link
2025-04-14	FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding	Zheng Liu et.al.	2504.09925	link
2025-04-14	Executable Functional Abstractions: Inferring Generative Programs for Advanced Math Problems	Zaid Khan et.al.	2504.09763	null
2025-04-13	Replacing ARDL? Introducing the NSB-ARDL Model for Structural and Asymmetric Forecasting	Tuhin G M Al Mamun et.al.	2504.09646	null
2025-04-13	Understanding LLM Behaviors via Compression: Data Generation, Knowledge Acquisition and Scaling Laws	Zhixuan Pan et.al.	2504.09597	null
2025-04-13	Causal integration of chemical structures improves representations of microscopy images for morphological profiling	Yemin Yu et.al.	2504.09544	link
2025-04-13	Adaptive Cluster-Based Synthetic Minority Oversampling Technique for Traffic Mode Choice Prediction with Imbalanced Dataset	Guang An Ooi et.al.	2504.09486	null
2025-04-13	Evaluation Under Imperfect Benchmarks and Ratings: A Case Study in Text Simplification	Joseph Liu et.al.	2504.09394	null
2025-04-12	Text To 3D Object Generation For Scalable Room Assembly	Sonia Laguna et.al.	2504.09328	null
2025-04-12	MatWheel: Addressing Data Scarcity in Materials Science Through Synthetic Data	Wentao Li et.al.	2504.09152	null
2025-04-12	NWP-based deep learning for tropical cyclone intensity prediction	Chanh Kieu et.al.	2504.09143	null
2025-04-11	Reverberation-based Features for Sound Event Localization and Detection with Distance Estimation	Davide Berghi et.al.	2504.08644	link
2025-04-11	Banana Ripeness Level Classification using a Simple CNN Model Trained with Real and Synthetic Datasets	Luis Chuquimarca et.al.	2504.08568	null
2025-04-11	A dependent and censored first hitting-time model with compound Poisson processes	Mikael Escobar-Bach et.al.	2504.08483	null
2025-04-11	Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data Generation	Bram Vanherle et.al.	2504.08473	link
2025-04-11	Beetroots: spatially-regularized Bayesian inference of physical parameter maps – Application to Orion	Pierre Palud et.al.	2504.08387	null
2025-04-11	DreamFuse: Adaptive Image Fusion with Diffusion Transformer	Junjia Huang et.al.	2504.08291	null
2025-04-14	Understanding the Impact of Data Domain Extraction on Synthetic Data Privacy	Georgi Ganev et.al.	2504.08254	null
2025-04-11	InSPE: Rapid Evaluation of Heterogeneous Multi-Modal Infrastructure Sensor Placement	Zhaoliang Zheng et.al.	2504.08240	null
2025-04-11	SynthFM: Training Modality-agnostic Foundation Models for Medical Image Segmentation without Real Medical Data	Sourya Sengupta et.al.	2504.08177	null
2025-04-10	Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction	Zeren Jiang et.al.	2504.07961	link
2025-04-10	Zero-Shot Cross-Domain Code Search without Fine-Tuning	Keyu Liang et.al.	2504.07740	link
2025-04-10	Diffusion Transformers for Tabular Data Time Series Generation	Fabrizio Garuti et.al.	2504.07566	link
2025-04-10	PhaseGen: A Diffusion-Based Approach for Complex-Valued MRI Data Generation	Moritz Rempe et.al.	2504.07560	link
2025-04-10	Adversarial Subspace Generation for Outlier Detection in High-Dimensional Data	Jose Cribeiro-Ramallo et.al.	2504.07522	link
2025-04-10	Conditional Data Synthesis Augmentation	Xinyu Tian et.al.	2504.07426	null
2025-04-10	ID-Booth: Identity-consistent Face Generation with Diffusion Models	Darian Tomašević et.al.	2504.07392	link
2025-04-11	SemEval-2025 Task 5: LLMs4Subjects – LLM-based Automated Subject Tagging for a National Technical Library’s Open-Access Catalog	Jennifer D’Souza et.al.	2504.07199	link
2025-04-09	The Effects of Binary Reference Stars on JWST NIRCam Coronagraphy	Klaus Subbotina Stephenson et.al.	2504.07190	null
2025-04-09	R2E-Gym: Procedural Environments and Hybrid Verifiers for Scaling Open-Weights SWE Agents	Naman Jain et.al.	2504.07164	null
2025-04-09	Restoring the Forecasting Power of Google Trends with Statistical Preprocessing	Candice Djorno et.al.	2504.07032	null
2025-04-09	Cerebral blood flow monitoring using a deep learning implementation of the two-layer DCS analytical model with a 512 512 SPAD array	Mingliang Pan et.al.	2504.06997	null
2025-04-09	Enhancing Metabolic Syndrome Prediction with Hybrid Data Balancing and Counterfactuals	Sanyam Paresh Shah et.al.	2504.06987	link
2025-04-09	SIGMAN:Scaling 3D Human Gaussian Generation with Millions of Assets	Yuhang Yang et.al.	2504.06982	null
2025-04-09	The Importance of Being Discrete: Measuring the Impact of Discretization in End-to-End Differentially Private Synthetic Data	Georgi Ganev et.al.	2504.06923	null
2025-04-09	MedSegFactory: Text-Guided Generation of Medical Image-Mask Pairs	Jiawei Mao et.al.	2504.06897	null
2025-04-10	MonoPlace3D: Learning 3D-Aware Object Placement for 3D Monocular Detection	Rishubh Parihar et.al.	2504.06801	null
2025-04-09	NeedleInATable: Exploring Long-Context Capability of Large Language Models towards Long-Structured Tables	Lanrui Wang et.al.	2504.06560	null
2025-04-08	Diagrammatic expansion for the mutual-information rate in the realm of limited statistics	Tobias Kühn et.al.	2504.06255	null
2025-04-08	A Self-Supervised Framework for Space Object Behaviour Characterisation	Ian Groves et.al.	2504.06176	null
2025-04-08	QGen Studio: An Adaptive Question-Answer Generation, Training and Evaluation Platform	Movina Moses et.al.	2504.06136	null
2025-04-08	Explainable AI for building energy retrofitting under data scarcity	Panagiota Rempi et.al.	2504.06055	null
2025-04-08	Trust-Region Twisted Policy Improvement	Joery A. de Vries et.al.	2504.06048	link
2025-04-08	Leveraging Robust Optimization for LLM Alignment under Distribution Shifts	Mingye Zhu et.al.	2504.05831	null
2025-04-08	Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought	Yi Peng et.al.	2504.05599	null
2025-04-08	Knowledge-Instruct: Effective Continual Pre-training from Limited Data using Instructions	Oded Ovadia et.al.	2504.05571	null
2025-04-07	A Generalized Tangent Approximation Framework for Strongly Super-Gaussian Likelihoods	Somjit Roy et.al.	2504.05431	null
2025-04-07	GARF: Learning Generalizable 3D Reassembly for Real-World Fractures	Sihang Li et.al.	2504.05400	null
2025-04-07	From Sparse Signal to Smooth Motion: Real-Time Motion Generation with Rolling Prediction Models	German Barquero et.al.	2504.05265	null
2025-04-07	Bayesian estimation of causal effects from observational categorical data	Vera Kvisgaard et.al.	2504.05198	null
2025-04-07	Resource-Efficient Beam Prediction in mmWave Communications with Multimodal Realistic Simulation Framework	Yu Min Park et.al.	2504.05187	null
2025-04-07	BRIDGES: Bridging Graph Modality and Large Language Models within EDA Tasks	Wei Li et.al.	2504.05180	null
2025-04-07	Modeling Micro-Doppler Signature of Multi-Propeller Drones in Distributed ISAC	Heraldo Cesar Alves Costa et.al.	2504.05168	null
2025-04-07	CARE: Aligning Language Models for Regional Cultural Awareness	Geyang Guo et.al.	2504.05154	link
2025-04-07	InstructionBench: An Instructional Video Understanding Benchmark	Haiwan Wei et.al.	2504.05040	null
2025-04-07	Probabilistic Position-Aided Beam Selection for mmWave MIMO Systems	Joseph K. Chege et.al.	2504.05035	null
2025-04-07	Mixture-of-Personas Language Models for Population Simulation	Ngoc Bui et.al.	2504.05019	null
2025-04-07	Towards Visual Text Grounding of Multimodal Large Language Model	Ming Li et.al.	2504.04974	null
2025-04-07	SoK: LLM-based Log Parsing	Viktor Beck et.al.	2504.04877	link
2025-04-07	Statistical parametric simulation studies based on real data	Christina Sauer et.al.	2504.04864	link
2025-04-08	TabRep: a Simple and Effective Continuous Representation for Training Tabular Diffusion Models	Jacob Si et.al.	2504.04798	link
2025-04-07	Enhancing Compositional Reasoning in Vision-Language Models with Synthetic Preference Data	Samarth Mishra et.al.	2504.04740	link
2025-04-07	Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use	Anna Goldie et.al.	2504.04736	null
2025-04-04	Enhancing Causal Effect Estimation with Diffusion-Generated Data	Li Chen et.al.	2504.03630	null
2025-04-04	A New Statistical Approach to Calibration-Free Localization Using Unlabeled Crowdsourced Data	Haozhou Hu et.al.	2504.03619	null
2025-04-04	APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay	Akshara Prabhakar et.al.	2504.03601	null
2025-04-04	Discovering Partially Known Ordinary Differential Equations: a Case Study on the Chemical Kinetics of Cellulose Degradation	Federica Bragone et.al.	2504.03484	link
2025-04-04	D-Garment: Physics-Conditioned Latent Diffusion for Dynamic Garment Deformations	Antoine Dumoulin et.al.	2504.03468	null
2025-04-04	Data Augmentation of Time-Series Data in Human Movement Biomechanics: A Scoping Review	Christina Halmich et.al.	2504.03334	null
2025-04-04	Mind the Prompt: Prompting Strategies in Audio Generations for Improving Sound Classification	Francesca Ronchini et.al.	2504.03329	null
2025-04-04	A model-free feature extraction procedure for interval-valued time series prediction	Wan Tian et.al.	2504.03310	null
2025-04-03	DiSRT-In-Bed: Diffusion-Based Sim-to-Real Transfer Framework for In-Bed Human Mesh Recovery	Jing Gao et.al.	2504.03006	null
2025-04-03	Online Learning for Nonlinear Dynamical Systems without the I.I.D. Condition	Lantian Zhang et.al.	2504.02995	null
2025-04-03	Generating Diverse Audio-Visual 360 Soundscapes for Sound Event Localization and Detection	Adrian S. Roman et.al.	2504.02988	link
2025-04-03	MegaMath: Pushing the Limits of Open Math Corpora	Fan Zhou et.al.	2504.02807	link
2025-04-03	Multi-Mission Tool Bench: Assessing the Robustness of LLM based Agents through Related and Dynamic Missions	PeiJie Yu et.al.	2504.02623	link
2025-04-03	Regulating Spatial Fairness in a Tripartite Micromobility Sharing System via Reinforcement Learning	Matteo Cederle et.al.	2504.02597	null
2025-04-03	Quantitative assessment of biological dynamics with aggregate data	Stephen McCoy et.al.	2504.02581	null
2025-04-03	Noise Calibration and Spatial-Frequency Interactive Network for STEM Image Enhancement	Hesong Li et.al.	2504.02555	link
2025-04-03	We Need Improved Data Curation and Attribution in AI for Scientific Discovery	Mara Graziani et.al.	2504.02486	null
2025-04-03	Scaling Analysis of Interleaved Speech-Text Language Models	Gallil Maimon et.al.	2504.02398	link
2025-04-03	State-of-the-Art Translation of Text-to-Gloss using mBART : A case study of Bangla	Sharif Md. Abdullah et.al.	2504.02293	null
2025-04-03	Advancing Semantic Caching for LLMs with Domain-Specific Embeddings and Synthetic Data	Waris Gill et.al.	2504.02268	null
2025-04-03	Traffic Flow Data Completion and Anomaly Diagnosis via Sparse and Low-Rank Tensor Optimization	Junxi Man et.al.	2504.02245	link
2025-04-02	Less-to-More Generalization: Unlocking More Controllability by In-Context Generation	Shaojin Wu et.al.	2504.02160	link
2025-04-02	Multivariate Temporal Regression at Scale: A Three-Pillar Framework Combining ML, XAI, and NLP	Jiztom Kavalakkatt Francis et.al.	2504.02151	null
2025-04-03	Bridging the Linguistic Divide: A Survey on Leveraging Large Language Models for Machine Translation	Baban Gain et.al.	2504.01919	null
2025-04-02	Benchmarking Synthetic Tabular Data: A Multi-Dimensional Evaluation Framework	Andrey Sidorenko et.al.	2504.01908	link
2025-04-02	GMAI-VL-R1: Harnessing Reinforcement Learning for Multimodal Medical Reasoning	Yanzhou Su et.al.	2504.01886	null
2025-04-02	A New Approach to the Nonparametric Behrens-Fisher Problem with Compatible Confidence Intervals	Stephen Schüürhuis et.al.	2504.01796	null
2025-04-02	ToolACE-R: Tool Learning with Adaptive Self-Refinement	Xingshan Zeng et.al.	2504.01400	null
2025-04-02	On Data Synthesis and Post-training for Visual Abstract Reasoning	Ke Zhu et.al.	2504.01324	null
2025-04-02	SOLAR: Scalable Distributed Spatial Joins through Learning-based Optimization	Yongyi Liu et.al.	2504.01292	link
2025-04-01	Performative Drift Resistant Classification Using Generative Domain Adversarial Networks	Maciej Makowski et.al.	2504.01135	null
2025-04-01	Confidence Bands for Multiparameter Persistence Landscapes	Inés García-Redondo et.al.	2504.01113	link
2025-04-01	ShieldGemma 2: Robust and Tractable Image Content Moderation	Wenjun Zeng et.al.	2504.01081	null
2025-04-01	Enhancing 3T BOLD fMRI SNR using Unpaired 7T Data with Schrödinger Bridge Diffusion	Yujian Xiong et.al.	2504.01004	null
2025-04-01	Zero-shot Benchmarking: A Framework for Flexible and Scalable Automatic Evaluation of Language Models	José Pombal et.al.	2504.01001	null
2025-04-01	Personalized Federated Training of Diffusion Models with Privacy Guarantees	Kumar Kshitij Patel et.al.	2504.00952	null
2025-04-01	Data-free Knowledge Distillation with Diffusion Models	Xiaohua Qi et.al.	2504.00870	null
2025-04-01	TAMIS: Tailored Membership Inference Attacks on Synthetic Data	Paul Andrey et.al.	2504.00758	null
2025-04-02	Sim-and-Real Co-Training: A Simple Recipe for Vision-Based Robotic Manipulation	Abhiram Maddukuri et.al.	2503.24361	null
2025-03-31	InstructRestore: Region-Customized Image Restoration with Human Instructions	Shuaizheng Liu et.al.	2503.24357	link
2025-03-31	Learning Velocity and Acceleration: Self-Supervised Motion Consistency for Pedestrian Trajectory Prediction	Yizhou Huang et.al.	2503.24272	null
2025-03-31	Beyond a Single Mode: GAN Ensembles for Diverse Medical Data Generation	Lorenzo Tronchin et.al.	2503.24258	link
2025-03-31	Pre-training with 3D Synthetic Data: Learning 3D Point Cloud Instance Segmentation from 3D Synthetic Scenes	Daichi Otsuka et.al.	2503.24229	null
2025-03-31	Synthetic News Generation for Fake News Classification	Abdul Sittar et.al.	2503.24206	null
2025-04-02	TeleAntiFraud-28k: An Audio-Text Slow-Thinking Dataset for Telecom Fraud Detection	Zhiming Ma et.al.	2503.24115	link
2025-03-31	A low cost singular value decomposition based data assimilation technique for analysis of heterogeneous combustion data	Prajith Pillai et.al.	2503.24064	null
2025-03-31	Artificial Conversations, Real Results: Fostering Language Detection with Synthetic Data	Fatemeh Mohammadi et.al.	2503.24062	null
2025-04-02	Detecting Localized Density Anomalies in Multivariate Data via Coin-Flip Statistics	Sebastian Springer et.al.	2503.23927	link
2025-03-31	Feature learning from non-Gaussian inputs: the case of Independent Component Analysis in high dimensions	Fabiola Ricci et.al.	2503.23896	null
2025-03-31	Every Painting Awakened: A Training-free Framework for Painting-to-Animation Generation	Lingyu Liu et.al.	2503.23736	null
2025-03-31	Finite sample valid confidence sets of mode	Manit Paul et.al.	2503.23711	null
2025-03-31	WHERE and WHICH: Iterative Debate for Biomedical Synthetic Data Augmentation	Zhengyi Zhao et.al.	2503.23673	null
2025-03-30	Partial Transportability for Domain Generalization	Kasra Jalaldoust et.al.	2503.23605	null
2025-03-28	Unicorn: Text-Only Data Synthesis for Vision Language Model Training	Xiaomin Yu et.al.	2503.22655	link
2025-03-28	Empirical Analysis of Sim-and-Real Cotraining Of Diffusion Policies For Planar Pushing from Pixels	Adam Wei et.al.	2503.22634	null
2025-03-28	Comparing Methods for Bias Mitigation in Graph Neural Networks	Barbara Hoffmann et.al.	2503.22569	null
2025-03-28	Endo-TTAP: Robust Endoscopic Tissue Tracking via Multi-Facet Guided Attention and Hybrid Flow-point Supervision	Rulin Zhou et.al.	2503.22394	null
2025-03-28	One Look is Enough: A Novel Seamless Patchwise Refinement for Zero-Shot Monocular Depth Estimation Models on High-Resolution Images	Byeongjun Kwon et.al.	2503.22351	null
2025-03-31	Hi3DGen: High-fidelity 3D Geometry Generation from Images via Normal Bridging	Chongjie Ye et.al.	2503.22236	null
2025-03-28	An Empirical Study of Validating Synthetic Data for Text-Based Person Retrieval	Min Cao et.al.	2503.22171	link
2025-03-27	GLM Inference with AI-Generated Synthetic Data Using Misspecified Linear Regression	Nir Keret et.al.	2503.21968	null
2025-03-27	Parametric Shadow Control for Portrait Generationin Text-to-Image Diffusion Models	Haoming Cai et.al.	2503.21943	null
2025-03-27	LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis	Shitian Zhao et.al.	2503.21749	null
2025-03-27	A Powerful Bootstrap Test of Independence in High Dimensions	Mauricio Olivares et.al.	2503.21715	null
2025-03-27	COMI-LINGUA: Expert Annotated Large-Scale Dataset for Multitask NLP in Hindi-English Code-Mixing	Rajvee Sheth et.al.	2503.21670	null
2025-03-27	Advancing CAN Network Security through RBM-Based Synthetic Attack Data Generation for Intrusion Detection Systems	Huacheng Li et.al.	2503.21496	link
2025-03-27	Towards Generating Realistic 3D Semantic Training Data for Autonomous Driving	Lucas Nunes et.al.	2503.21449	link
2025-03-27	From Deep Learning to LLMs: A survey of AI in Quantitative Investment	Bokai Cao et.al.	2503.21422	null
2025-03-27	Simulation-based assessment of a Bayesian survival model with flexible baseline hazard and time-dependent effects	Iain R. Timmins et.al.	2503.21388	null
2025-03-27	Interactive Databases for the Life Sciences	Rosalia Moreddu et.al.	2503.21274	null
2025-03-26	Eyes Tell the Truth: GazeVal Highlights Shortcomings of Generative AI in Medical Imaging	David Wong et.al.	2503.20967	null
2025-03-26	Toward Sustainable Polymer Design: A Molecular Dynamics-Informed Machine Learning Approach for Vitrimers	Yiwen Zheng et.al.	2503.20956	link
2025-03-26	Assessing Generative Models for Structured Data	Reilly Cannon et.al.	2503.20903	null
2025-03-26	Robust Federated Learning Against Poisoning Attacks: A GAN-Based Defense Framework	Usama Zafar et.al.	2503.20884	link
2025-03-26	The Data Sharing Paradox of Synthetic Data in Healthcare	Jim Achterberg et.al.	2503.20847	null
2025-03-27	Inferring Treatment Effects in Large Panels by Uncovering Latent Similarities	Ben Deaner et.al.	2503.20769	null
2025-03-26	From Annotation to Adaptation: Metrics, Synthetic Data, and Aspect Extraction for Aspect-Based Sentiment Analysis with Large Language Models	Nikita Neveditsin et.al.	2503.20715	null
2025-03-26	Diffusion Counterfactuals for Image Regressors	Trung Duc Ha et.al.	2503.20595	link
2025-03-26	Synthetic Data Augmentation for Cross-domain Implicit Discourse Relation Recognition	Frances Yung et.al.	2503.20588	null
2025-03-26	A Deep Learning Pipeline for Large Earthquake Analysis using High-Rate Global Navigation Satellite System Data	Claudia Quinteros-Cartaya et.al.	2503.20584	null
2025-03-26	Small Object Detection: A Comprehensive Survey on Challenges, Techniques and Real-World Applications	Mahya Nikouei et.al.	2503.20516	null
2025-03-26	Active Data Sampling and Generation for Bias Remediation	Antonio Maratea et.al.	2503.20414	null
2025-03-26	SpikeDerain: Unveiling Clear Videos from Rainy Sequences Using Color Spike Streams	Hanwen Liang et.al.	2503.20315	null
2025-03-26	A Multilingual, Culture-First Approach to Addressing Misgendering in LLM Applications	Sunayana Sitaram et.al.	2503.20302	link
2025-03-26	AIGC-assisted Federated Learning for Edge Intelligence: Architecture Design, Research Challenges and Future Directions	Xianke Qiang et.al.	2503.20166	link
2025-03-25	Low-resource Machine Translation for Code-switched Kazakh-Russian Language Pair	Maksim Borisov et.al.	2503.20007	null
2025-03-25	Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals	Stefan Stojanov et.al.	2503.19953	null
2025-03-25	SemEval-2025 Task 9: The Food Hazard Detection Challenge	Korbinian Randl et.al.	2503.19800	null
2025-03-25	AIGC-assisted Federated Learning for Vehicular Edge Intelligence: Vehicle Selection, Resource Allocation and Model Augmentation	Xianke Qiang et.al.	2503.19676	null
2025-03-26	Scaling Laws of Synthetic Data for Language Models	Zeyu Qin et.al.	2503.19551	null
2025-03-25	Analyzing the Synthetic-to-Real Domain Gap in 3D Hand Pose Estimation	Zhuoran Zhao et.al.	2503.19307	link
2025-03-24	Deep learning in the abyss: a stratified Physics Informed Neural Network for data assimilation	Vadim Limousin et.al.	2503.19160	null
2025-03-24	Reasoning to Learn from Latent Thoughts	Yangjun Ruan et.al.	2503.18866	null
2025-03-24	Unsupervised Acquisition of Discrete Grammatical Categories	David Ph. Shakouri et.al.	2503.18702	null
2025-03-24	GranQ: Granular Zero-Shot Quantization with Unified Layer-Channel Awareness	Inpyo Hong et.al.	2503.18339	null
2025-03-24	Enhancing Dataset Distillation via Non-Critical Region Refinement	Minh-Tuan Tran et.al.	2503.18267	null
2025-03-25	PAD: Towards Efficient Data Generation for Transfer Learning Using Phrase Alignment	Jong Myoung Kim et.al.	2503.18250	null
2025-03-23	Extended Visibility of Autonomous Vehicles via Optimized Cooperative Perception under Imperfect Communication	Ahmad Sarlak et.al.	2503.18192	null
2025-03-23	SNRAware: Improved Deep Learning MRI Denoising with SNR Unit Training and G-factor Map Augmentation	Hui Xue et.al.	2503.18162	null
2025-03-23	GeoBenchX: Benchmarking LLMs for Multistep Geospatial Tasks	Varvara Krechetova et.al.	2503.18129	link
2025-03-23	Unraveling the Effects of Synthetic Data on End-to-End Autonomous Driving	Junhao Ge et.al.	2503.18108	link
2025-03-23	Model-Guardian: Protecting against Data-Free Model Stealing Using Gradient Representations and Deceptive Predictions	Yunfei Yang et.al.	2503.18081	null
2025-03-22	Bandwidth Reservation for Time-Critical Vehicular Applications: A Multi-Operator Environment	Abdullah Al-Khatib et.al.	2503.17756	null
2025-03-22	Enhancing Arabic Automated Essay Scoring with Synthetic Data and Error Injection	Chatrine Qwaider et.al.	2503.17739	null
2025-03-21	Follow-up Question Generation For Enhanced Patient-Provider Conversations	Joseph Gatto et.al.	2503.17509	null
2025-03-21	ConvoGen: Enhancing Conversational AI with Synthetic Data: A Multi-Agent Approach	Reem Gody et.al.	2503.17460	null
2025-03-21	CausalRivers – Scaling up benchmarking of causal discovery for real-world time-series	Gideon Stein et.al.	2503.17452	null
2025-03-21	Beyond Semantics: Rediscovering Spatial Awareness in Vision-Language Models	Jianing Qi et.al.	2503.17349	null
2025-03-21	Neuro-Symbolic Scene Graph Conditioning for Synthetic Image Dataset Generation	Giacomo Savazzi et.al.	2503.17224	null
2025-03-21	TreeSynth: Synthesizing Diverse Data from Scratch via Tree-Guided Subspace Partitioning	Sheng Wang et.al.	2503.17195	null
2025-03-21	Unitless Unrestricted Markov-Consistent SCM Generation: Better Benchmark Datasets for Causal Discovery	Rebecca J. Herman et.al.	2503.17037	null
2025-03-21	A categorization of performance measures for estimated non-linear associations between an outcome and continuous predictors	Theresa Ullmann et.al.	2503.16981	link
2025-03-21	When Words Outperform Vision: VLMs Can Self-Improve Via Text-Only Training For Human-Centered Decision Making	Zhe Hu et.al.	2503.16965	null
2025-03-21	TEMPO: Temporal Preference Optimization of Video LLMs via Difficulty Scheduling and Pre-SFT Alignment	Shicheng Li et.al.	2503.16929	link
2025-03-20	EarlyStopping: Implicit Regularization for Iterative Learning Procedures in Python	Eric Ziebell et.al.	2503.16753	link
2025-03-20	Digitally Prototype Your Eye Tracker: Simulating Hardware Performance using 3D Synthetic Data	Esther Y. H. Lin et.al.	2503.16742	null
2025-03-20	Ultra-Resolution Adaptation with Ease	Ruonan Yu et.al.	2503.16322	link
2025-03-20	Chain of Functions: A Programmatic Pipeline for Fine-Grained Chart Reasoning Data	Zijian Li et.al.	2503.16260	null
2025-03-20	VP-NTK: Exploring the Benefits of Visual Prompting in Differentially Private Data Synthesis	Chia-Yi Hsu et.al.	2503.16195	null
2025-03-20	MarkushGrapher: Joint Visual and Textual Recognition of Markush Structures	Lucas Morin et.al.	2503.16096	link
2025-03-20	Tuning LLMs by RAG Principles: Towards LLM-native Memory	Jiale Wei et.al.	2503.16071	link
2025-03-20	Bokehlicious: Photorealistic Bokeh Rendering with Controllable Apertures	Tim Seizinger et.al.	2503.16067	link
2025-03-20	Closer to Ground Truth: Realistic Shape and Appearance Labeled Data Generation for Unsupervised Underwater Image Segmentation	Andrei Jelea et.al.	2503.16051	null
2025-03-20	Outcome-Informed Weighting for Robust ATE Estimation	Linying Yang et.al.	2503.15989	link
2025-03-20	TVineSynth: A Truncated C-Vine Copula Generator of Synthetic Tabular Data to Balance Privacy and Utility	Elisabeth Griesbauer et.al.	2503.15972	link
2025-03-20	Integrative Analysis of High-dimensional RCT and RWD Subject to Censoring and Hidden Confounding	Xin Ye et.al.	2503.15967	null
2025-03-20	An Evaluation Framework for the FAIR Assessment tools in Open Science	Payel Patra et.al.	2503.15929	null
2025-03-20	TruthLens: Explainable DeepFake Detection for Face Manipulated and Fully Synthetic Data	Rohit Kundu et.al.	2503.15867	null
2025-03-20	Controlling Avatar Diffusion with Learnable Gaussian Embedding	Xuan Gao et.al.	2503.15809	null
2025-03-19	The Change You Want To Detect: Semantic Change Detection In Earth Observation With Hybrid Data Generation	Benidir Yanis et.al.	2503.15683	null
2025-03-19	Neural Lyapunov Function Approximation with Self-Supervised Reinforcement Learning	Luc McCutcheon et.al.	2503.15629	link
2025-03-19	Improved Lattice QCD $B_c\to J/ψ$ Vector, Axial-Vector, and Tensor Form Factors	Judd Harrison et.al.	2503.15090	null
2025-03-19	Box-constrained L0 Bregman-relaxations	Mhamed Essafri et.al.	2503.15083	null
2025-03-19	ELTEX: A Framework for Domain-Driven Synthetic Data Generation	Arina Razmyslovich et.al.	2503.15055	link
2025-03-19	Benchmarking Brain Connectivity Graph Inference: A Novel Validation Approach	Alice Chevaux et.al.	2503.15012	null
2025-03-19	MASS: Mathematical Data Selection via Skill Graphs for Pretraining Large Language Models	Jiazheng Li et.al.	2503.14917	null
2025-03-19	Spot the Fake: Large Multimodal Model-Based Synthetic Image Detection with Artifact Explanation	Siwei Wen et.al.	2503.14905	null
2025-03-19	Synthesizing Grid Data with Cyber Resilience and Privacy Guarantees	Shengyang Wu et.al.	2503.14877	null
2025-03-19	Project Jenkins: Turning Monkey Neural Data into Robotic Arm Movement, and Back	Andrii Zahorodnii et.al.	2503.14847	null
2025-03-18	PSInference: A Package to Draw Inference for Released Plug-in Sampling Single Synthetic Dataset	Ricardo Moura et.al.	2503.14711	null
2025-03-18	Command R7B Arabic: A Small, Enterprise Focused, Multilingual, and Culturally Aware Arabic LLM	Yazeed Alnumay et.al.	2503.14603	null
2025-03-18	Graph-CNNs for RF Imaging: Learning the Electric Field Integral Equations	Kyriakos Stylianopoulos et.al.	2503.14439	null
2025-03-18	QSTToolkit: A Python Library for Deep Learning Powered Quantum State Tomography	George FitzGerald et.al.	2503.14422	link
2025-03-18	Optimizing High-Dimensional Oblique Splits	Chien-Ming Chi et.al.	2503.14381	null
2025-03-19	VEGGIE: Instructional Editing and Reasoning of Video Concepts with Grounded Generation	Shoubin Yu et.al.	2503.14350	null
2025-03-18	Four checks for low-fidelity synthetic data: recommendations for disclosure control and quality evaluation	Gillian M Raab et.al.	2503.14211	null
2025-03-18	Synthetic Data Generation Using Large Language Models: Advances in Text and Code	Mihai Nadas et.al.	2503.14023	null
2025-03-18	The KoLMogorov Test: Compression by Code Generation	Ori Yoran et.al.	2503.13992	null
2025-03-18	Empowering LLMs in Decision Games through Algorithmic Data Synthesis	Haolin Wang et.al.	2503.13980	null
2025-03-18	SoccerSynth Field: enhancing field detection with synthetic data from virtual soccer simulator	HaoBin Qin et.al.	2503.13969	null
2025-03-18	SimWorld: A Unified Benchmark for Simulator-Conditioned Scene Generation via World Model	Xinqing Li et.al.	2503.13952	link
2025-03-18	Reconstructing Cell Lineage Trees from Phenotypic Features with Metric Learning	Da Kuang et.al.	2503.13925	null
2025-03-18	VARP: Reinforcement Learning from Vision-Language Model Feedback with Agent Regularized Preferences	Anukriti Singh et.al.	2503.13817	null
2025-03-18	LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation	Yang Zhou et.al.	2503.13794	null
2025-03-17	Let Synthetic Data Shine: Domain Reassembly and Soft-Fusion for Single Domain Generalization	Hao Li et.al.	2503.13617	null
2025-03-17	Amodal3R: Amodal 3D Reconstruction from Occluded 2D Images	Tianhao Wu et.al.	2503.13439	null
2025-03-17	Uncovering Utility Functions from Observed Outcomes	Marta Grzeskiewicz et.al.	2503.13432	null
2025-03-17	Infinite Mobility: Scalable High-Fidelity Synthesis of Articulated Objects via Procedural Generation	Xinyu Lian et.al.	2503.13424	null
2025-03-17	Pairwise vs Higher-order interactions: Can we identify the interaction type in coupled oscillators from time series?	Weiwei Su et.al.	2503.13244	null
2025-03-17	MedLoRD: A Medical Low-Resource Diffusion Model for High-Resolution 3D CT Image Synthesis	Marvin Seyfarth et.al.	2503.13211	null
2025-03-17	A representational framework for learning and encoding structurally enriched trajectories in complex agent environments	Corina Catarau-Cotutiu et.al.	2503.13194	null
2025-03-17	HybridGen: VLM-Guided Hybrid Planning for Scalable Data Generation of Imitation Learning	Wensheng Wang et.al.	2503.13171	null
2025-03-17	Code-Driven Inductive Synthesis: Enhancing Reasoning Abilities of Large Language Models with Sequences	Kedi Chen et.al.	2503.13109	null
2025-03-17	PoseSyn: Synthesizing Diverse 3D Pose Data from In-the-Wild 2D Data	ChangHee Yang et.al.	2503.13025	null
2025-03-17	Concept-as-Tree: Synthetic Data is All You Need for VLM Personalization	Ruichuan An et.al.	2503.12999	null
2025-03-18	DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding	Xinyu Ma et.al.	2503.12797	link
2025-03-17	A Brain-Computer Interface Data Persistence System for Multi-Scenario and Multi-Modal Data: NeuroStore	Yang Chen et.al.	2503.12705	null
2025-03-16	Dynamic Angle Selection in X-Ray CT: A Reinforcement Learning Approach to Optimal Stopping	Tianyuan Wang et.al.	2503.12688	null
2025-03-16	Plausibility Vaccine: Injecting LLM Knowledge for Event Plausibility	Jacob Chmura et.al.	2503.12667	null
2025-03-16	Point Cloud Based Scene Segmentation: A Survey	Dan Halperin et.al.	2503.12595	null
2025-03-14	Seeing and Seeing Through the Glass: Real and Synthetic Data for Multi-Layer Depth Estimation	Hongyu Wen et.al.	2503.11633	null
2025-03-14	AugGen: Synthetic Augmentation Can Improve Discriminative Models	Parsa Rahimi et.al.	2503.11544	null
2025-03-14	FLASHμ: Fast Localizing And Sizing of Holographic Microparticles	Ayush Paliwal et.al.	2503.11538	null
2025-03-14	Empowering Time Series Analysis with Synthetic Data: A Survey and Outlook in the Era of Foundation Models	Xu Liu et.al.	2503.11411	null
2025-03-14	When Do Transformers Outperform Feedforward and Recurrent Networks? A Statistical Perspective	Alireza Mousavi-Hosseini et.al.	2503.11272	link
2025-03-14	CyclePose – Leveraging Cycle-Consistency for Annotation-Free Nuclei Segmentation in Fluorescence Microscopy	Jonas Utz et.al.	2503.11266	null
2025-03-14	NF-SLAM: Effective, Normalizing Flow-supported Neural Field representations for object-level visual SLAM in automotive applications	Li Cui et.al.	2503.11199	null
2025-03-14	Physics-constrained DeepONet for Surrogate CFD models: a curved backward-facing step case	Anas Jnini et.al.	2503.11196	null
2025-03-14	DeskVision: Large Scale Desktop Region Captioning for Advanced GUI Agents	Yibin Xu et.al.	2503.11170	null
2025-03-13	OuroMamba: A Data-Free Quantization Framework for Vision Mamba Models	Akshat Ramachandran et.al.	2503.10959	null
2025-03-13	Mamba time series forecasting with uncertainty propagation	Pedro Pessoa et.al.	2503.10873	link
2025-03-13	On the Identifiability of Causal Abstractions	Xiusi Li et.al.	2503.10834	null
2025-03-13	NIL: No-data Imitation Learning by Leveraging Pre-trained Video Diffusion Models	Mert Albaba et.al.	2503.10626	null
2025-03-13	PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models	Zilu Guo et.al.	2503.10529	null
2025-03-12	Evaluating the Impact of Synthetic Data on Object Detection Tasks in Autonomous Driving	Enes Özeren et.al.	2503.09803	null
2025-03-12	A PyTorch-Enabled Tool for Synthetic Event Camera Data Generation and Algorithm Development	Joseph L. Greene et.al.	2503.09754	null
2025-03-12	Local Look-Ahead Guidance via Verifier-in-the-Loop for Automated Theorem Proving	Sara Rajaee et.al.	2503.09730	null
2025-03-12	Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks	Lutfi Eren Erdogan et.al.	2503.09572	null
2025-03-12	Neural Network-Based Change Point Detection for Large-Scale Time-Evolving Data	Jialiang Geng et.al.	2503.09541	null
2025-03-12	Materials Discovery With Quantum-Enhanced Machine Learning Algorithms	Ignacio F. Graña et.al.	2503.09517	null
2025-03-12	How Well Does Your Tabular Generator Learn the Structure of Tabular Data?	Xiangjian Jiang et.al.	2503.09453	link
2025-03-12	Florenz: Scaling Laws for Systematic Generalization in Vision-Language Models	Julian Spravil et.al.	2503.09443	null
2025-03-12	Monte Carlo Diffusion for Generalizable Learning-Based RANSAC	Jiale Wang et.al.	2503.09410	null
2025-03-12	Close-up-GS: Enhancing Close-Up View Synthesis in 3D Gaussian Splatting with Progressive Self-Training	Jiatong Xia et.al.	2503.09396	null
2025-03-12	RetSTA: An LLM-Based Approach for Standardizing Clinical Fundus Image Reports	Jiushen Cai et.al.	2503.09358	null
2025-03-12	Fully-Synthetic Training for Visual Quality Inspection in Automotive Production	Christoph Huber et.al.	2503.09354	null
2025-03-12	Adaptive political surveys and GPT-4: Tackling the cold start problem with simulated user interactions	Fynn Bachmann et.al.	2503.09311	link
2025-03-12	Neural Normalized Cut: A Differential and Generalizable Approach for Spectral Clustering	Wei He et.al.	2503.09260	link
2025-03-12	Active Learning Inspired ControlNet Guidance for Augmenting Semantic Segmentation Datasets	Hannah Kniesel et.al.	2503.09221	null
2025-03-12	Addressing pitfalls in implicit unobserved confounding synthesis using explicit block hierarchical ancestral sampling	Xudong Sun et.al.	2503.09194	null
2025-03-12	Training Data Provenance Verification: Did Your Model Use Synthetic Data from My Generative Model for Training?	Yuechen Xie et.al.	2503.09122	link
2025-03-12	Tacchi 2.0: A Low Computational Cost and Comprehensive Dynamic Contact Simulator for Vision-based Tactile Sensors	Yuhao Sun et.al.	2503.09100	null
2025-03-11	Generating Robot Constitutions & Benchmarks for Semantic Safety	Pierre Sermanet et.al.	2503.08663	null
2025-03-11	Rethinking Diffusion Model in High Dimension	Zhenxin Zheng et.al.	2503.08643	link
2025-03-11	LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization	Xianfeng Wu et.al.	2503.08619	link
2025-03-11	Mellow: a small audio language model for reasoning	Soham Deshmukh et.al.	2503.08540	link
2025-03-11	Clustered Flexible Calibration Plots For Binary Outcomes Using Random Effects Modeling	Lasai Barreñada et.al.	2503.08389	null
2025-03-11	Convergence Dynamics and Stabilization Strategies of Co-Evolving Generative Models	Weiguo Gao et.al.	2503.08117	null
2025-03-11	A General Framework to Evaluate Methods for Assessing Dimensions of Lexical Semantic Change Using LLM-Generated Synthetic Data	Naomi Baes et.al.	2503.08042	null
2025-03-11	ObjectMover: Generative Object Movement with Video Prior	Xin Yu et.al.	2503.08037	null
2025-03-11	Group Preference Alignment: Customized LLM Response Generation from In-Situ Conversations	Ishani Mondal et.al.	2503.08035	null
2025-03-11	Efficient Dataset Distillation through Low-Rank Space Sampling	Hangyang Kong et.al.	2503.07998	null
2025-03-10	A Landmark-Aided Navigation Approach Using Side-Scan Sonar	Ellen Davenport et.al.	2503.07900	null
2025-03-10	Magnet: Multi-turn Tool-use Data Synthesis and Distillation via Graph Translation	Fan Yin et.al.	2503.07826	null
2025-03-10	Training Domain Draft Models for Speculative Decoding: Best Practices and Insights	Fenglu Hong et.al.	2503.07807	null
2025-03-10	SANDRO: a Robust Solver with a Splitting Strategy for Point Cloud Registration	Michael Adlerstein et.al.	2503.07743	link
2025-03-10	Learning Physics-Based Full-Body Human Reaching and Grasping from Brief Walking References	Yitang Li et.al.	2503.07481	null
2025-03-10	Skelite: Compact Neural Networks for Efficient Iterative Skeletonization	Luis D. Reyes Vargas et.al.	2503.07369	link
2025-03-10	Score-informed Music Source Separation: Improving Synthetic-to-real Generalization in Classical Music	Eetu Tunturi et.al.	2503.07352	link
2025-03-10	Decision-Dependent Stochastic Optimization: The Role of Distribution Dynamics	Zhiyu He et.al.	2503.07324	link
2025-03-10	Synthetic Lung X-ray Generation through Cross-Attention and Affinity Transformation	Ruochen Pi et.al.	2503.07209	null
2025-03-11	PoseLess: Depth-Free Vision-to-Joint Control via Direct Image Mapping with VLM	Alan Dao et.al.	2503.07111	null
2025-03-10	RS2V-L: Vehicle-Mounted LiDAR Data Generation from Roadside Sensor Observations	Ruidan Xing et.al.	2503.07085	null
2025-03-10	Task-Specific Knowledge Distillation from the Vision Foundation Model for Enhanced Medical Image Segmentation	Pengchen Liang et.al.	2503.06976	null
2025-03-10	CAFusion: Controllable Anatomical Synthesis of Perirectal Lymph Nodes via SDF-guided Diffusion	Weidong Guo et.al.	2503.06919	null
2025-03-09	UniGenX: Unified Generation of Sequence and Structure with Autoregressive Diffusion	Gongbo Zhang et.al.	2503.06687	null
2025-03-09	Reinforcement Learning with Verifiable Rewards: GRPO’s Effective Loss, Dynamics, and Success Amplification	Youssef Mroueh et.al.	2503.06639	null
2025-03-09	Synthetic Data Generation for Minimum-Exposure Navigation in a Time-Varying Environment using Generative AI Models	Nachiket U. Bapat et.al.	2503.06619	null
2025-03-09	Inverse Reinforcement Learning for Minimum-Exposure Paths in Spatiotemporally Varying Scalar Fields	Alexandra E. Ballentine et.al.	2503.06611	null
2025-03-09	Extremes of structural causal models	Sebastian Engelke et.al.	2503.06536	link
2025-03-09	UAV-Assisted Coverage Hole Detection Using Reinforcement Learning in Urban Cellular Networks	Mushfiqur Rahman et.al.	2503.06494	null
2025-03-07	Algorithmic Data Minimization for Machine Learning over Internet-of-Things Data Streams	Ted Shaowang et.al.	2503.05675	null
2025-03-07	AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data	Zengqun Zhao et.al.	2503.05665	link
2025-03-07	VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control	Yuxuan Bian et.al.	2503.05639	link
2025-03-07	Joint 3D Point Cloud Segmentation using Real-Sim Loop: From Panels to Trees and Branches	Tian Qiu et.al.	2503.05630	null
2025-03-07	Novel Object 6D Pose Estimation with a Single Reference View	Jian Liu et.al.	2503.05578	link
2025-03-07	Statistical Deficiency for Task Inclusion Estimation	Loïc Fosse et.al.	2503.05491	null
2025-03-07	DecoupledGaussian: Object-Scene Decoupling for Physics-Based Interaction	Miaowei Wang et.al.	2503.05484	null
2025-03-07	Semi-Supervised Learning for Dose Prediction in Targeted Radionuclide: A Synthetic Data Study	Jing Zhang et.al.	2503.05367	null
2025-03-07	Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs	Ling Team et.al.	2503.05139	null
2025-03-06	GRIP: A General Robotic Incremental Potential Contact Simulation Dataset for Unified Deformable-Rigid Coupled Grasping	Siyu Ma et.al.	2503.05020	null
2025-03-06	Seismic inversion using hybrid quantum neural networks	Divakar Vashisth et.al.	2503.05009	null
2025-03-06	A Consensus Privacy Metrics Framework for Synthetic Data	Lisa Pilgram et.al.	2503.04980	null
2025-03-06	HILGEN: Hierarchically-Informed Data Generation for Biomedical NER Using Knowledgebases and Large Language Models	Yao Ge et.al.	2503.04930	null
2025-03-06	Compositional World Knowledge leads to High Utility Synthetic data	Sachit Gaudi et.al.	2503.04687	null
2025-03-06	Assessing the performance of compartmental and renewal models for learning $R_{t}$ using spatially heterogeneous epidemic simulations on real geographies	Matthew Ghosh et.al.	2503.04648	null
2025-03-06	PathoPainter: Augmenting Histopathology Segmentation via Tumor-aware Inpainting	Hong Liu et.al.	2503.04634	null
2025-03-06	Synthetic Data is an Elegant GIFT for Continual Vision-Language Models	Bin Wu et.al.	2503.04229	null
2025-03-06	CoFinDiff: Controllable Financial Diffusion Model for Time Series Generation	Yuki Tanaka et.al.	2503.04164	null
2025-03-06	Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contamination	Simin Chen et.al.	2503.04149	null
2025-03-07	Chart-HQA: A Benchmark for Hypothetical Question Answering in Charts	Xiangnan Chen et.al.	2503.04095	null
2025-03-06	PP-DocBee: Improving Multimodal Document Understanding Through a Bag of Tricks	Feng Ni et.al.	2503.04065	link
2025-03-06	Subgraph Federated Learning for Local Generalization	Sungwon Kim et.al.	2503.03995	link
2025-03-05	Improving the Temporal Resolution of SOHO/MDI Magnetograms of Solar Active Regions Using a Deep Generative Model	Jialiang Li et.al.	2503.03959	null
2025-03-05	Enhancing Autonomous Driving Safety with Collision Scenario Integration	Zi Wang et.al.	2503.03957	null
2025-03-05	Neural Descriptors: Self-Supervised Learning of Robust Local Surface Descriptors Using Polynomial Patches	Gal Yona et.al.	2503.03907	link
2025-03-05	Handling Uncertainty in Health Data using Generative Algorithms	Mahdi Arab Loodaricheh et.al.	2503.03715	null
2025-03-05	Robust Learning of Diverse Code Edits	Tushar Aggarwal et.al.	2503.03656	null
2025-03-05	4D Radar Ground Truth Augmentation with LiDAR-to-4D Radar Data Synthesis	Woo-Jin Jung et.al.	2503.03637	link
2025-03-05	Semiparametric Growth-Curve Modeling in Hierarchical, Longitudinal Studies	Rajesh Selukar et.al.	2503.03550	null
2025-03-05	Rethinking Synthetic Data definitions: A privacy driven approach	Vibeke Binz Vallevik et.al.	2503.03506	null
2025-03-05	Bridging Synthetic-to-Real Gaps: Frequency-Aware Perturbation and Selection for Single-shot Multi-Parametric Mapping Reconstruction	Linyu Fan et.al.	2503.03475	link
2025-03-05	Generative Artificial Intelligence in Robotic Manipulation: A Survey	Kun Zhang et.al.	2503.03464	null
2025-03-05	Simplicial SMOTE: Oversampling Solution to the Imbalanced Learning Problem	Oleg Kachan et.al.	2503.03418	null
2025-03-05	Evolutionary Prediction Games	Eden Saig et.al.	2503.03401	link
2025-03-05	Video Super-Resolution: All You Need is a Video Diffusion Model	Zhihao Zhan et.al.	2503.03355	null
2025-03-06	Optimizing for the Shortest Path in Denoising Diffusion Model	Ping Chen et.al.	2503.03265	link
2025-03-05	Can Frontier LLMs Replace Annotators in Biomedical Text Mining? Analyzing Challenges and Exploring Solutions	Yichong Zhao et.al.	2503.03261	link
2025-03-05	Online Bidding under RoS Constraints without Knowing the Value	Sushant Vijayan et.al.	2503.03195	null
2025-03-05	Distributed Certifiably Correct Range-Aided SLAM	Alexander Thoms et.al.	2503.03192	link
2025-03-05	SpinML: Customized Synthetic Data Generation for Private Training of Specialized ML Models	Jiang Zhang et.al.	2503.03160	null
2025-03-04	Deep Learning-Enhanced Visual Monitoring in Hazardous Underwater Environments with a Swarm of Micro-Robots	Shuang Chen et.al.	2503.02752	link
2025-03-05	ArcPro: Architectural Programs for Structured 3D Abstraction of Sparse Points	Qirui Huang et.al.	2503.02745	null
2025-03-04	The Effectiveness of Large Language Models in Transforming Unstructured Text to Standardized Formats	William Brach et.al.	2503.02650	link
2025-03-04	YARE-GAN: Yet Another Resting State EEG-GAN	Yeganeh Farahzadi et.al.	2503.02636	link
2025-03-04	It Helps to Take a Second Opinion: Teaching Smaller LLMs to Deliberate Mutually via Selective Rationale Optimisation	Sohan Patnaik et.al.	2503.02463	null
2025-03-04	ReSo: A Reward-driven Self-organizing LLM-based Multi-Agent System for Reasoning Tasks	Heng Zhou et.al.	2503.02390	link
2025-03-04	Confidence HNC: A Network Flow Technique for Binary Classification with Noisy Labels	Dorit Hochbaum et.al.	2503.02352	null
2025-03-04	Algebraic Reconstruction of Piecewise-Smooth Functions of Two Variables from Fourier Data	Michael Levinov et.al.	2503.02254	link
2025-03-04	OmniSQL: Synthesizing High-quality Text-to-SQL Data at Scale	Haoyang Li et.al.	2503.02240	link
2025-03-04	LLM-TabFlow: Synthetic Tabular Data Generation with Inter-column Logical Relationship Preservation	Yunbo Long et.al.	2503.02161	null
2025-03-04	Tabby: Tabular Data Synthesis with Language Models	Sonia Cromp et.al.	2503.02152	null
2025-03-03	Hebbian learning the local structure of language	P. Myles Eugenio et.al.	2503.02057	null
2025-03-03	Photon Interval Statistics Measure Rapid Variability	J. I. Katz et.al.	2503.02045	null
2025-03-03	Projection-angle effects when “observing” a turbulent magnetized collapsing molecular cloud. II. Magnetic field	A. Tritsis et.al.	2503.01971	null
2025-03-03	SHADE-AD: An LLM-Based Framework for Synthesizing Activity Data of Alzheimer’s Patients	Heming Fu et.al.	2503.01768	null
2025-02-28	Synthesizing Tabular Data Using Selectivity Enhanced Generative Adversarial Networks	Youran Zhou et.al.	2502.21034	null
2025-02-28	Amortized Conditional Independence Testing	Bao Duong et.al.	2502.20925	null
2025-02-28	MFSR-GAN: Multi-Frame Super-Resolution with Handheld Motion Modeling	Fadeel Sher Khan et.al.	2502.20824	null
2025-02-28	Towards Ultimate NMR Resolution with Deep Learning	Amir Jahangiri et.al.	2502.20793	null
2025-02-28	Generating Clinically Realistic EHR Data via a Hierarchy- and Semantics-Guided Transformer	Guanglin Zhou et.al.	2502.20719	null
2025-02-28	EndoPBR: Material and Lighting Estimation for Photorealistic Surgical Simulations via Physically-based Rendering	John J. Han et.al.	2502.20669	null
2025-02-28	Dataset Distillation with Neural Characteristic Function: A Minmax Perspective	Shaobo Wang et.al.	2502.20653	null
2025-02-28	PersonaBench: Evaluating AI Models on Understanding Personal Information through Accessing (Synthetic) Private User Data	Juntao Tan et.al.	2502.20616	null
2025-02-27	TripCraft: A Benchmark for Spatio-Temporally Fine Grained Travel Planning	Soumyabrata Chaudhuri et.al.	2502.20508	null
2025-02-27	Physics-Driven Data Generation for Contact-Rich Manipulation via Trajectory Optimization	Lujie Yang et.al.	2502.20382	null
2025-02-27	Sanity Checking Causal Representation Learning on a Simple Real-World System	Juan L. Gamella et.al.	2502.20099	link
2025-02-27	Quantum generative classification with mixed states	Diego H. Useche et.al.	2502.19970	null
2025-02-28	Alleviating Distribution Shift in Synthetic Data for Machine Translation Quality Estimation	Xiang Geng et.al.	2502.19941	null
2025-02-27	Shifting the Paradigm: A Diffeomorphism Between Time Series Data Manifolds for Achieving Shift-Invariancy in Deep Learning	Berken Utku Demirel et.al.	2502.19921	link
2025-02-27	UIFace: Unleashing Inherent Model Capabilities to Enhance Intra-Class Diversity in Synthetic Face Recognition	Xiao Lin et.al.	2502.19803	link
2025-02-27	Developmental Support Approach to AI’s Autonomous Growth: Toward the Realization of a Mutually Beneficial Stage Through Experiential Learning	Taichiro Endo et.al.	2502.19798	null
2025-02-27	In-Context Learning with Hypothesis-Class Guidance	Ziqian Lin et.al.	2502.19787	link
2025-02-27	Few-Shot Multilingual Open-Domain QA from 5 Examples	Fan Jiang et.al.	2502.19722	link
2025-02-27	Training Robust Graph Neural Networks by Modeling Noise Dependencies	Yeonjun In et.al.	2502.19670	link
2025-02-26	Learning Ensembles of Interpretable Simple Structure	Gaurav Arwade et.al.	2502.19602	null
2025-02-26	Trustworthy Answers, Messier Data: Bridging the Gap in Low-Resource Retrieval-Augmented Generation for Domain Expert Systems	Nayoung Choi et.al.	2502.19596	null
2025-02-26	Winning Big with Small Models: Knowledge Distillation vs. Self-Training for Reducing Hallucination in QA Agents	Ashley Lewis et.al.	2502.19545	null
2025-02-26	FSPO: Few-Shot Preference Optimization of Synthetic Preference Data in LLMs Elicits Effective Personalization to Real Users	Anikait Singh et.al.	2502.19312	null
2025-02-26	AI-Powered Bayesian Inference	Veronika Ročková et.al.	2502.19231	null
2025-02-26	Increasing the Task Flexibility of Heavy-Duty Manipulators Using Visual 6D Pose Estimation of Objects	Petri Mäkinen et.al.	2502.19169	null
2025-02-27	A Survey on Foundation-Model-Based Industrial Defect Detection	Tianle Yang et.al.	2502.19106	null
2025-02-26	FungalZSL: Zero-Shot Fungal Classification with Image Captioning Using a Synthetic Data Approach	Anju Rani et.al.	2502.19038	null
2025-02-26	A Multifacet Hierarchical Sentiment-Topic Model with Application to Multi-Brand Online Review Analysis	Qiao Liang et.al.	2502.18927	link
2025-02-26	Dynamic Classification: Leveraging Self-Supervised Classification to Enhance Prediction Performance	Ziyuan Zhong et.al.	2502.18891	null
2025-02-26	A Theoretical Perspective: How to Prevent Model Collapse in Self-consuming Training Loops	Shi Fu et.al.	2502.18865	null
2025-02-25	Interpretable Data-Driven Ship Dynamics Model: Enhancing Physics-Based Motion Prediction with Parameter Optimization	Papandreou Christos et.al.	2502.18696	null
2025-02-25	Quantum Machine Learning in Precision Medicine and Drug Discovery – A Game Changer for Tailored Treatments?	Markus Bertl et.al.	2502.18639	null
2025-02-25	FRIDA to the Rescue! Analyzing Synthetic Data Effectiveness in Object-Based Common Sense Reasoning for Disaster Response	Mollie Shichman et.al.	2502.18452	null
2025-02-25	CRESSim-MPM: A Material Point Method Library for Surgical Soft Body Simulation with Cutting and Suturing	Yafei Ou et.al.	2502.18437	null
2025-02-25	Self-Supervised Data Generation for Precision Agriculture: Blending Simulated Environments with Real Imagery	Leonardo Saraceni et.al.	2502.18320	null
2025-02-25	Beyond the convexity assumption: Realistic tabular data generation under quantifier-free real linear constraints	Mihaela Cătălina Stoian et.al.	2502.18237	link
2025-02-25	Differentially private synthesis of Spatial Point Processes	Dangchan Kim et.al.	2502.18198	null
2025-02-25	Sharper Concentration Inequalities for Multi-Graph Dependent Variables	Xiao Shao et.al.	2502.18167	null
2025-02-25	Golden Ratio Mixing of Real and Synthetic Data for Stabilizing Generative Model Training	Hengzhi He et.al.	2502.18049	null
2025-02-25	On Synthetic Data Strategies for Domain-Specific Generative Retrieval	Haoyang Wen et.al.	2502.17957	null
2025-02-25	FRT Regulation in China	Jyh-An Lee et.al.	2502.17877	null
2025-02-25	SYNTHEMPATHY: A Scalable Empathy Corpus Generated Using LLMs Without Any Crowdsourcing	Run Chen et.al.	2502.17857	null
2025-02-25	Uncertainty Quantification for LLM-Based Survey Simulations	Chengpiao Huang et.al.	2502.17773	null
2025-02-25	Learning Density Evolution from Snapshot Data	Rentian Yao et.al.	2502.17738	null
2025-02-24	Experimentally Informed Decoding of Stabilizer Codes Based on Syndrome Correlations	Ants Remm et.al.	2502.17722	null
2025-02-24	Aligning Compound AI Systems via System-level DPO	Xiangwen Wang et.al.	2502.17721	null
2025-02-24	Contrastive Visual Data Augmentation	Yu Zhou et.al.	2502.17709	null
2025-02-24	Towards Hierarchical Rectified Flow	Yichi Zhang et.al.	2502.17436	link
2025-02-24	FIG: Forward-Inverse Generation for Low-Resource Domain-specific Event Detection	Tanmay Parekh et.al.	2502.17394	null
2025-02-24	Mutual Reinforcement of LLM Dialogue Synthesis and Summarization Capabilities for Few-Shot Dialogue Summarization	Yen-Ju Lu et.al.	2502.17328	null
2025-02-24	Statistical machine learning tools for probabilistic closures of turbulence models	Julia Domingues Lemos et.al.	2502.17316	null
2025-02-24	From High-Entropy Alloys to Alloys with High Entropy: A New Paradigm in Materials Science and Engineering for Advancing Sustainable Metallurgy	Jose Manuel Torralba et.al.	2502.17279	null
2025-02-24	A Two-step Linear Mixing Model for Unmixing under Hyperspectral Variability	Xander Haijen et.al.	2502.17212	null
2025-02-24	A Pragmatic Note on Evaluating Generative Models with Fréchet Inception Distance for Retinal Image Synthesis	Yuli Wu et.al.	2502.17160	null
2025-02-24	Improved Diffusion-based Generative Model with Better Adversarial Robustness	Zekun Wang et.al.	2502.17099	link
2025-02-24	WildFrame: Comparing Framing in Humans and LLMs on Naturally Occurring Texts	Gili Lior et.al.	2502.17091	link
2025-02-24	PrivaCI-Bench: Evaluating Privacy with Contextual Integrity and Legal Compliance	Haoran Li et.al.	2502.17041	link
2025-02-24	Predicting Liquidity-Aware Bond Yields using Causal GANs and Deep Reinforcement Learning with LLM Evaluation	Jaskaran Singh Walia et.al.	2502.17011	null
2025-02-24	Potential-Based Greedy Matching for Dynamic Delivery Pooling	Hongyao Ma et.al.	2502.16862	null
2025-02-24	Finding the Sweet Spot: Preference Data Construction for Scaling Preference Optimization	Yao Xiao et.al.	2502.16825	null
2025-02-23	Automated Keypoint Estimation for Self-Piercing Rivet Joints Using micro-CT Imaging and Transfer Learning	Wei Qin Chuah et.al.	2502.16752	null
2025-02-23	WildLong: Synthesizing Realistic Long-Context Instruction Data at Scale	Jiaxi Li et.al.	2502.16684	null
2025-02-21	Machine-generated text detection prevents language model collapse	George Drayson et.al.	2502.15654	link
2025-02-21	A Population Sampling Framework for Claim Reserving in General Insurance	Sebastian Calcetero Vanegas et.al.	2502.15598	null
2025-02-21	Generalizing From Short to Long: Effective Data Synthesis for Long-Context Instruction Tuning	Wenhao Zhu et.al.	2502.15592	link
2025-02-21	Improving the Scaling Laws of Synthetic Data with Deliberate Practice	Reyhane Askari-Hemmat et.al.	2502.15588	null
2025-02-21	Context-Aware Doubly-Robust Semi-Supervised Learning	Clement Ruah et.al.	2502.15577	null
2025-02-21	Mitigating Data Scarcity in Time Series Analysis: A Foundation Model with Series-Symbol Data Generation	Wenxuan Wang et.al.	2502.15466	null
2025-02-21	MVIP – A Dataset and Methods for Application Oriented Multi-View and Multi-Modal Industrial Part Recognition	Paul Koch et.al.	2502.15448	null
2025-02-21	Beyond Translation: LLM-Based Data Generation for Multilingual Fact-Checking	Yi-Ling Chung et.al.	2502.15419	link
2025-02-21	Ultrasound Phase Aberrated Point Spread Function Estimation with Convolutional Neural Network: Simulation Study	Wei-Hsiang Shen et.al.	2502.15298	null
2025-02-21	Hierarchical Bayesian estimation of population-level torque law parameters from anomalous pulsar braking indices	Andrés F. Vargas et.al.	2502.15211	null
2025-02-21	Methods and Trends in Detecting Generated Images: A Comprehensive Review	Arpan Mahara et.al.	2502.15176	null
2025-02-21	mStyleDistance: Multilingual Style Embeddings and their Evaluation	Justin Qiu et.al.	2502.15168	null
2025-02-20	Synth It Like KITTI: Synthetic Data Generation for Object Detection in Driving Scenarios	Richard Marcus et.al.	2502.15076	link
2025-02-20	The ETKidney simulator: a discrete event simulator to assess the impact of alternative kidney allocation rules in Eurotransplant	H. C. de Ferrante et.al.	2502.15001	null
2025-02-20	CLIPPER: Compression enables long-context synthetic data generation	Chau Minh Pham et.al.	2502.14854	link
2025-02-20	Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation	Yue Yang et.al.	2502.14846	null
2025-02-20	PREM: Privately Answering Statistical Queries with Relative Error	Badih Ghazi et.al.	2502.14809	null
2025-02-20	Cross Validation for Correlated Data in Regression and Classification Models, with Applications to Deep Learning	Oren Yuval et.al.	2502.14808	link
2025-02-20	Data-Constrained Synthesis of Training Data for De-Identification	Thomas Vakili et.al.	2502.14677	null
2025-02-20	Generative adversarial networks vs large language models: a comparative study on synthetic tabular data generation	Austin A. Barr et.al.	2502.14523	link
2025-02-20	MLGym: A New Framework and Benchmark for Advancing AI Research Agents	Deepak Nathani et.al.	2502.14499	null
2025-02-20	LEIT-motifs: Scalable Motif Mining in Multidimensional Time Series	Matteo Ceccarello et.al.	2502.14446	link
2025-02-20	Unstructured Evidence Attribution for Long Context Query Focused Summarization	Dustin Wright et.al.	2502.14409	link
2025-02-19	Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data	Yucheng Shi et.al.	2502.14044	link
2025-02-19	DiffSampling: Enhancing Diversity and Accuracy in Neural Text Generation	Giorgio Franceschelli et.al.	2502.14037	null
2025-02-19	Contrastive Learning-Based privacy metrics in Tabular Synthetic Datasets	Milton Nicolás Plasencia Palacios et.al.	2502.13833	link
2025-02-19	Benchmarking of Different YOLO Models for CAPTCHAs Detection and Classification	Mikołaj Wysocki et.al.	2502.13740	null
2025-02-19	Cross-Comparison of Sampling Algorithms for Pulse Profile Modeling of PSR J0740+6620	Mariska Hoogkamer et.al.	2502.13682	null
2025-02-19	Instruction Tuning on Public Government and Cultural Data for Low-Resource Language: a Case Study in Kazakh	Nurkhan Laiyk et.al.	2502.13647	null
2025-02-19	Integrating Inverse and Forward Modeling for Sparse Temporal Data from Sensor Networks	Julian Vexler et.al.	2502.13638	null
2025-02-19	The Self-Improvement Paradox: Can Language Models Bootstrap Reasoning Capabilities without External Scaffolding?	Yutao Sun et.al.	2502.13441	null
2025-02-19	MM-Verify: Enhancing Multimodal Reasoning with Chain-of-Thought Verification	Linzhuang Sun et.al.	2502.13383	link
2025-02-18	Synthetic generation of 2D data records based on Autoencoders	Darius Couchard et.al.	2502.13183	null
2025-02-18	Theorem Prover as a Judge for Synthetic Data Generation	Joshua Ong Jun Leang et.al.	2502.13137	null
2025-02-18	Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning	Jingyang Lin et.al.	2502.13127	null
2025-02-19	STEER-ME: Assessing the Microeconomic Reasoning of Large Language Models	Narun Raman et.al.	2502.13119	null
2025-02-18	Statistically Significant $k$ NNAD by Selective Inference	Mizuki Niihori et.al.	2502.12978	null
2025-02-18	Does Training with Synthetic Data Truly Protect Privacy?	Yunpeng Zhao et.al.	2502.12976	link
2025-02-18	Fake It Till You Make It: Using Synthetic Data and Domain Knowledge for Improved Text-Based Learning for LGE Detection	Athira J Jacob et.al.	2502.12948	null
2025-02-18	Synthetic Data Generation for Culturally Nuanced Commonsense Reasoning in Low-Resource Languages	Salsabila Zahirah Pranida et.al.	2502.12932	null
2025-02-18	CausalMan: A physics-based simulator for large-scale causality	Nicholas Tagliapietra et.al.	2502.12707	null
2025-02-18	Disentangling Long-Short Term State Under Unknown Interventions for Online Time Series Forecasting	Ruichu Cai et.al.	2502.12603	link
2025-02-18	LongFaith: Enhancing Long-Context Reasoning in LLMs with Faithful Synthetic Data	Cehao Yang et.al.	2502.12583	link
2025-02-18	MomentSeeker: A Comprehensive Benchmark and A Strong Baseline For Moment Retrieval Within Long Videos	Huaying Yuan et.al.	2502.12558	null
2025-02-17	From Gaming to Research: GTA V for Synthetic Data Generation for Robotics and Navigations	Matteo Scucchia et.al.	2502.12303	link
2025-02-19	Data-Efficient Limited-Angle CT Using Deep Priors and Regularization	Ilmari Vahteristo et.al.	2502.12293	link
2025-02-17	Integrating Expert Knowledge into Logical Programs via LLMs	Franciszek Górski et.al.	2502.12275	link
2025-02-17	Idiosyncrasies in Large Language Models	Mingjie Sun et.al.	2502.12150	link
2025-02-17	Meta-Statistical Learning: Supervised Learning of Statistical Inference	Maxime Peyrard et.al.	2502.12088	null
2025-02-17	Building A Proof-Oriented Programmer That Is 64% Better Than GPT-4o Under Data Scarsity	Dylan Zhang et.al.	2502.11901	null
2025-02-17	Steering the LoCoMotif: Using Domain Knowledge in Time Series Motif Discovery	Aras Yurtman et.al.	2502.11850	link
2025-02-17	Text Classification in the LLM Era - Where do we stand?	Sowmya Vajjala et.al.	2502.11830	null
2025-02-17	Efficient Response Generation Method Selection for Fine-Tuning Large Language Models	Xuan Ren et.al.	2502.11779	null
2025-02-17	HintsOfTruth: A Multimodal Checkworthiness Detection Dataset with Real and Synthetic Claims	Michiel van der Meer et.al.	2502.11753	null
2025-02-17	Improve LLM-as-a-Judge Ability as a General Ability	Jiachen Yu et.al.	2502.11689	null
2025-02-17	InTec: integrated things-edge computing: a framework for distributing machine learning pipelines in edge AI systems	Habib Larian et.al.	2502.11644	link
2025-02-17	User-Centric Data Management in Decentralized Internet of Behaviors System	Shiqi Zhang et.al.	2502.11616	null
2025-02-17	A population synthesis study of the Gaia 100 pc unresolved white dwarf-main sequence binary population	Alejandro Santos-García et.al.	2502.11593	null
2025-02-17	UniGO: A Unified Graph Neural Network for Modeling Opinion Dynamics on Graphs	Hao Li et.al.	2502.11519	null
2025-02-17	FastMCTS: A Simple Sampling Strategy for Data Synthesis	Peiji Li et.al.	2502.11476	null
2025-02-17	GiFT: Gibbs Fine-Tuning for Code Generation	Haochen Li et.al.	2502.11466	link
2025-02-17	UnitCoder: Scalable Iterative Code Synthesis with Unit Test Guidance	Yichuan Ma et.al.	2502.11460	null
2025-02-14	Dimension-free Score Matching and Time Bootstrapping for Diffusion Models	Syamantak Kumar et.al.	2502.10354	null
2025-02-14	Ocular Disease Classification Using CNN with Deep Convolutional Generative Adversarial Network	Arun Kunwar et.al.	2502.10334	null
2025-02-14	Large Language Models and Synthetic Data for Monitoring Dataset Mentions in Research Papers	Aivin V. Solatorio et.al.	2502.10263	link
2025-02-14	VisCon-100K: Leveraging Contextual Web Data for Fine-tuning Vision Language Models	Gokul Karthik Kumar et.al.	2502.10250	null
2025-02-14	Near-Field Localization with Physics-Compliant Electromagnetic Model: Algorithms and Model Mismatch Analysis	Alexandr M. Kuzminskiy et.al.	2502.10102	null
2025-02-14	A novel approach to data generation in generative model	JaeHong Kim et.al.	2502.10092	null
2025-02-14	Generating on Generated: An Approach Towards Self-Evolving Diffusion Models	Xulu Zhang et.al.	2502.09963	null
2025-02-14	Artificial Intelligence in Spectroscopy: Advancing Chemistry from Prediction to Generation and Beyond	Kehan Guo et.al.	2502.09897	null
2025-02-14	Efficient Multitask Learning in Small Language Models Through Upside-Down Reinforcement Learning	Yu-Chen Lin et.al.	2502.09854	null
2025-02-14	Solving Empirical Bayes via Transformers	Anzo Teh et.al.	2502.09844	null
2025-02-13	Flexible Empirical Bayesian Approaches to Pharmacovigilance for Simultaneous Signal Detection and Signal Strength Estimation in Spontaneous Reporting Systems Data	Yihao Tan et.al.	2502.09816	null
2025-02-13	Variational Rectified Flow Matching	Pengsheng Guo et.al.	2502.09616	null
2025-02-13	Zero-shot generation of synthetic neurosurgical data with large language models	Austin A. Barr et.al.	2502.09566	link
2025-02-13	Correlation based modeling of the ionospheric magnetic field	K. Ferrat et.al.	2502.09492	null
2025-02-13	DiffRenderGAN: Addressing Training Data Scarcity in Deep Segmentation Networks for Quantitative Nanomaterial Analysis through Differentiable Rendering and Generative Modelling	Dennis Possart et.al.	2502.09477	null
2025-02-13	When do neural networks learn world models?	Tianren Zhang et.al.	2502.09297	null
2025-02-13	Typhoon T1: An Open Thai Reasoning Model	Pittawat Taveekitworachai et.al.	2502.09042	null
2025-02-13	Escaping Collapse: The Strength of Weak Data for Large Language Model Training	Kareem Amin et.al.	2502.08924	null
2025-02-13	A Systematic Evaluation of Generative Models on Tabular Transportation Data	Chengen Wang et.al.	2502.08856	link
2025-02-12	HistoSmith: Single-Stage Histology Image-Label Generation via Conditional Latent Diffusion for Enhanced Cell Segmentation and Classification	Valentina Vadori et.al.	2502.08754	link
2025-02-12	Checkerboard Target Measurement in Unordered Point Clouds with Coloured ICP	June Moh Goo et.al.	2502.08525	null
2025-02-12	FedMHO: Heterogeneous One-Shot Federated Learning Towards Resource-Constrained Edge Devices	Dezhong Yao et.al.	2502.08518	link
2025-02-12	The Paradox of Stochasticity: Limited Creativity and Computational Decoupling in Temperature-Varied LLM Outputs of Structured Fictional Data	Evgenii Evstafev et.al.	2502.08515	null
2025-02-12	One-Shot Federated Learning with Classifier-Free Diffusion Models	Obaidullah Zaland et.al.	2502.08488	null
2025-02-12	mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data	Haonan Chen et.al.	2502.08468	link
2025-02-12	CRISP: A Framework for Cryo-EM Image Segmentation and Processing with Conditional Random Field	Szu-Chi Chung et.al.	2502.08287	link
2025-02-12	ChemZIP: Accelerated Modeling of Complex Aerothermochemical Interactions in Novel Turbomachines for Sustainable High-Temperature Chemical Processes	Dylan Rubini et.al.	2502.08232	null
2025-02-12	DGSense: A Domain Generalization Framework for Wireless Sensing	Rui Zhou et.al.	2502.08155	null
2025-02-11	Simulating Longitudinal Data from Marginal Structural Models	Xi Lin et.al.	2502.07991	null
2025-02-11	Symbiotic Cooperation for Web Agents: Harnessing Complementary Strengths of Large and Small LLMs	Ruichen Zhang et.al.	2502.07942	null
2025-02-11	Discrete Markov Probabilistic Models	Le-Tuyet-Nhi Pham et.al.	2502.07939	null
2025-02-11	Measuring the Distances to Asteroids from One Observatory in One Night with Upcoming All-Sky Telescopes	Maryann Benny Fernandes et.al.	2502.07881	link
2025-02-11	Methodology for Identifying Social Groups within a Transactional Graph	Maxence Morin et.al.	2502.07694	null
2025-02-12	Beyond Prompting: Time2Lang – Bridging Time-Series Foundation Models and Large Language Models for Health Sensing	Arvind Pillai et.al.	2502.07608	link
2025-02-12	O1 Embedder: Let Retrievers Think Before Action	Ruiran Yan et.al.	2502.07555	null
2025-02-11	Scaling relations for the uncertainty in neutron star radius inferred from pulse profile modelling: the effect of spin rate	Erik Bootsma et.al.	2502.07471	null
2025-02-11	Semantic to Structure: Learning Structural Representations for Infringement Detection	Chuanwei Huang et.al.	2502.07323	null
2025-02-11	Linear Transformers as VAR Models: Aligning Autoregressive Attention Mechanisms with Autoregressive Forecasting	Jiecheng Lu et.al.	2502.07244	link
2025-02-12	Space-Aware Instruction Tuning: Dataset and Benchmark for Guide Dog Robots Assisting the Visually Impaired	ByungOk Han et.al.	2502.07183	link
2025-02-11	Does Training on Synthetic Data Make Models Less Robust?	Lingze Zhang et.al.	2502.07164	null
2025-02-11	Conditional Distribution Quantization in Machine Learning	Blaise Delattre et.al.	2502.07151	null
2025-02-10	Generative Distribution Prediction: A Unified Approach to Multimodal Learning	Xinyu Tian et.al.	2502.07090	null
2025-02-10	CLaRe: Compact near-lossless Latent Representations of High-Dimensional Object Data	Emma Zohner et.al.	2502.07084	null
2025-02-10	Scalable and Ethical Insider Threat Detection through Data Synthesis and Analysis by LLMs	Haywood Gelman et.al.	2502.07045	null
2025-02-10	Enhanced Renewable Energy Forecasting and Operations through Probabilistic Forecast Aggregation	Alireza Moradi et.al.	2502.07010	null
2025-02-10	Dual Conic Proxy for Semidefinite Relaxation of AC Optimal Power Flow	Guancheng Qiu et.al.	2502.06978	null
2025-02-12	Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT	Dongyang Liu et.al.	2502.06782	null
2025-02-10	Towards Internet-Scale Training For Agents	Brandon Trabucco et.al.	2502.06776	null
2025-02-10	VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data	Thomas Zeng et.al.	2502.06737	null
2025-02-10	Do we really have to filter out random noise in pre-training data for language models?	Jinghan Ru et.al.	2502.06604	null
2025-02-10	LawGPT: Knowledge-Guided Data Generation and Its Application to Legal LLM	Zhi Zhou et.al.	2502.06572	link
2025-02-10	Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluation	Chengwen Qi et.al.	2502.06563	link
2025-02-10	Is API Access to LLMs Useful for Generating Private Synthetic Tabular Data?	Marika Swanberg et.al.	2502.06555	null
2025-02-10	Solving Linear-Gaussian Bayesian Inverse Problems with Decoupled Diffusion Sequential Monte Carlo	Filip Ekström Kelvinius et.al.	2502.06379	null
2025-02-10	Simulation as Reality? The Effectiveness of LLM-Generated Data in Open-ended Question Assessment	Long Zhang et.al.	2502.06371	null
2025-02-10	UniDemoiré: Towards Universal Image Demoiréing with Data Generation and Synthesis	Zemin Yang et.al.	2502.06324	null
2025-02-10	Examining False Positives under Inference Scaling for Mathematical Reasoning	Yu Wang et.al.	2502.06217	null
2025-02-10	A Data-Efficient Pan-Tumor Foundation Model for Oncology CT Interpretation	Wenhui Lei et.al.	2502.06171	null
2025-02-09	A Conditional Tabular GAN-Enhanced Intrusion Detection System for Rare Attacks in IoT Networks	Safaa Menssouri et.al.	2502.06031	null
2025-02-09	ARISE: Iterative Rule Induction and Synthetic Data Generation for Text Classification	Yashwanth M. et.al.	2502.05923	null
2025-02-09	GOLD: Graph Out-of-Distribution Detection via Implicit Adversarial Latent Generation	Danny Wang et.al.	2502.05780	null
2025-02-07	Multitwine: Multi-Object Compositing with Text and Layout Control	Gemma Canet Tarrés et.al.	2502.05165	null
2025-02-07	DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails	Yihe Deng et.al.	2502.05163	link
2025-02-07	Hummingbird: High Fidelity Image Generation via Multimodal Context Alignment	Minh-Quan Le et.al.	2502.05153	null
2025-02-07	Preference-aware compensation policies for crowdsourced on-demand services	Georgina Nouli et.al.	2502.05060	null
2025-02-07	Stability and performance guarantees for misspecified multivariate score-driven filters	Simon Donker van Heel et.al.	2502.05021	null
2025-02-07	Gradient-based Explanations for Deep Learning Survival Models	Sophie Hanna Langbein et.al.	2502.04970	null
2025-02-07	SeDi-Instruct: Enhancing Alignment of Language Models through Self-Directed Instruction Generation	Jungwoo Kim et.al.	2502.04774	null
2025-02-07	Can Diffusion Models Learn Hidden Inter-Feature Rules Behind Images?	Yujin Han et.al.	2502.04725	null
2025-02-07	Early Stopping for Regression Trees	Ratmir Miftachov et.al.	2502.04709	null
2025-02-07	LLM Query Scheduling with Prefix Reuse and Latency Constraints	Gregory Dexter et.al.	2502.04677	null
2025-02-07	${\rm P{\small ROOF}W{\small ALA}}$ : Multilingual Proof Data Synthesis and Theorem-Proving	Amitayush Thakur et.al.	2502.04671	link
2025-02-06	Zero-shot Meta-learning for Tabular Prediction Tasks with Adversarially Pre-trained Transformer	Yulun Wu et.al.	2502.04573	null
2025-02-06	AnyPlace: Learning Generalized Object Placement for Robot Manipulation	Yuchi Zhao et.al.	2502.04531	null
2025-02-06	Beyond Sample-Level Feedback: Using Reference-Level Feedback to Guide Data Synthesis	Shuhaib Mehri et.al.	2502.04511	link
2025-02-06	Augmented Conditioning Is Enough For Effective Training Image Generation	Jiahui Chen et.al.	2502.04475	null
2025-02-06	Consistency of augmentation graph and network approximability in contrastive learning	Chenghui Li et.al.	2502.04312	link
2025-02-06	Targeted Learning for Data Fairness	Alexander Asemota et.al.	2502.04309	null
2025-02-06	Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression	Lirui Wang et.al.	2502.04296	null
2025-02-06	Cohomology of symmetric stacks	Chenjing Bu et.al.	2502.04253	null
2025-02-07	Are the Majority of Public Computational Notebooks Pathologically Non-Executable?	Tien Nguyen et.al.	2502.04184	link
2025-02-06	A data-driven two-microphone method for in-situ sound absorption measurements	Leon Emmerich et.al.	2502.04143	null
2025-02-06	Market-based insurance ratemaking: application to pet insurance	Pierre-Olivier Goffard et.al.	2502.04082	link
2025-02-06	Evaluating Inter-Column Logical Relationships in Synthetic Tabular Data Generation	Yunbo Long et.al.	2502.04055	null
2025-02-06	CAD-Editor: A Locate-then-Infill Framework with Automated Training Data Synthesis for Text-Based CAD Editing	Yu Yuan et.al.	2502.03997	null
2025-02-06	Tight Bounds on Jensen’s Gap: Novel Approach with Applications in Generative Modeling	Marcin Mazur et.al.	2502.03988	null
2025-02-06	MultiFloodSynth: Multi-Annotated Flood Synthetic Dataset Generation	YoonJe Kang et.al.	2502.03966	null
2025-02-07	A retake on the analysis of scores truncated by terminal events	Klaus Kähler Holst et.al.	2502.03942	null
2025-02-06	Thermal Model Calibration of a Squirrel-Cage Induction Machine	Leon Blumrich et.al.	2502.03935	null
2025-02-06	Synthetic Poisoning Attacks: The Impact of Poisoned MRI Image on U-Net Brain Tumor Segmentation	Tianhao Li et.al.	2502.03825	null
2025-02-06	Syntriever: How to Train Your Retriever with Synthetic Data from LLMs	Minsang Kim et.al.	2502.03824	link
2025-02-05	Linearized Optimal Transport pyLOT Library: A Toolkit for Machine Learning on Point Clouds	Jun Linwu et.al.	2502.03439	null
2025-02-05	On Fairness of Unified Multimodal Large Language Model for Image Generation	Ming Liu et.al.	2502.03429	null
2025-02-05	Can Text-to-Image Generative Models Accurately Depict Age? A Comparative Study on Synthetic Portrait Generation and Age Estimation	Alexey A. Novikov et.al.	2502.03420	null
2025-02-05	High-Fidelity Simultaneous Speech-To-Speech Translation	Tom Labiausse et.al.	2502.03382	link
2025-02-05	Learning from Active Human Involvement through Proxy Value Propagation	Zhenghao Peng et.al.	2502.03369	null
2025-02-05	Optimal Task Order for Continual Learning of Multiple Tasks	Ziyan Li et.al.	2502.03350	null
2025-02-05	Out-of-Distribution Detection using Synthetic Data Generation	Momin Abbas et.al.	2502.03323	null
2025-02-05	Posterior SBC: Simulation-Based Calibration Checking Conditional on Data	Teemu Säilynoja et.al.	2502.03279	link
2025-02-05	ZISVFM: Zero-Shot Object Instance Segmentation in Indoor Robotic Environments with Vision Foundation Models	Ying Zhang et.al.	2502.03266	link
2025-02-05	SpaceGNN: Multi-Space Graph Neural Network for Node Anomaly Detection with Extremely Limited Labels	Xiangyu Dong et.al.	2502.03201	link
2025-02-05	Automatic Prompt Optimization Techniques: Exploring the Potential for Synthetic Data Generation	Nina Freise et.al.	2502.03078	null
2025-02-05	Panel Data Estimation and Inference: Homogeneity versus Heterogeneity	Jiti Gao et.al.	2502.03019	null
2025-02-05	DANDI: Diffusion as Normative Distribution for Deep Neural Network Input	Somin Kim et.al.	2502.02910	null
2025-02-05	OmniRL: In-Context Reinforcement Learning by Large-Scale Meta-Training in Randomized Worlds	Fan Wang et.al.	2502.02869	null
2025-02-05	On Trimming Tensor-structured Measurements and Efficient Low-rank Tensor Recovery	Shambhavi Suryanarayanan et.al.	2502.02843	link
2025-02-04	Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation	Junha Lee et.al.	2502.02548	null
2025-02-04	Diff9D: Diffusion-Based Domain-Generalized Category-Level 9-DoF Object Pose Estimation	Jian Liu et.al.	2502.02525	link
2025-02-04	Distributional Diffusion Models with Scoring Rules	Valentin De Bortoli et.al.	2502.02483	null
2025-02-04	Sparse Data Generation Using Diffusion Models	Phil Ostheimer et.al.	2502.02448	null
2025-02-04	LLMER: Crafting Interactive Extended Reality Worlds with JSON Data Generated by Large Language Models	Jiangong Chen et.al.	2502.02441	link
2025-02-04	TransformDAS: Mapping Φ-OTDR Signals to Riemannian Manifold for Robust Classification	Jiaju Kang et.al.	2502.02428	null
2025-02-04	Synthetic Random Environmental Time Series Generation with Similarity Control, Preserving Original Signal’s Statistical Characteristics	Ofek Aloni et.al.	2502.02392	link
2025-02-04	STAIR: Improving Safety Alignment with Introspective Reasoning	Yichi Zhang et.al.	2502.02384	link
2025-02-04	Position Paper: Building Trust in Synthetic Data for Clinical AI	Krishan Agyakari Raja Babu et.al.	2502.02076	null
2025-02-04	Generative Data Mining with Longtail-Guided Diffusion	David S. Hayden et.al.	2502.01980	null
2025-02-04	SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and Dataset	Goodarz Mehr et.al.	2502.01894	link
2025-02-03	Generating Multi-Image Synthetic Data for Text-to-Image Customization	Nupur Kumari et.al.	2502.01720	null
2025-02-03	Preference Leakage: A Contamination Problem in LLM-as-a-judge	Dawei Li et.al.	2502.01534	link
2025-02-03	BD-Diff: Generative Diffusion Model for Image Deblurring on Unknown Domains with Blur-Decoupled Learning	Junhao Cheng et.al.	2502.01522	null
2025-02-03	Explaining Context Length Scaling and Bounds for Language Models	Jingzhe Shi et.al.	2502.01481	link
2025-01-31	Pathological MRI Segmentation by Synthetic Pathological Data Generation in Fetuses and Neonates	Misha P. T Kaandorp et.al.	2501.19338	null
2025-01-31	Synthetic User Behavior Sequence Generation with Large Language Models for Smart Homes	Zhiyao Xu et.al.	2501.19298	null
2025-01-31	Application of Generative Adversarial Network (GAN) for Synthetic Training Data Creation to improve performance of ANN Classifier for extracting Built-Up pixels from Landsat Satellite Imagery	Amritendu Mukherjee et.al.	2501.19283	null
2025-01-31	XRF V2: A Dataset for Action Summarization with Wi-Fi Signals, and IMUs in Phones, Watches, Earbuds, and Glasses	Bo Lan et.al.	2501.19034	link
2025-01-31	Quantum SMOTE with Angular Outliers: Redefining Minority Class Handling	Nishikanta Mohanty et.al.	2501.19001	null
2025-01-31	Permutation-Based Rank Test in the Presence of Discretization and Application in Causal Discovery with Mixed Data	Xinshuai Dong et.al.	2501.18990	null
2025-01-31	Spend Wisely: Maximizing Post-Training Gains in Iterative Synthetic Data Boostrapping	Pu Yang et.al.	2501.18962	link
2025-01-31	RLS3: RL-Based Synthetic Sample Selection to Enhance Spatial Reasoning in Vision-Language Models for Indoor Autonomous Perception	Joshua R. Waite et.al.	2501.18880	null
2025-01-31	Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming	Mrinank Sharma et.al.	2501.18837	null
2025-01-30	Synthetic Data Generation for Augmenting Small Samples	Dan Liu et.al.	2501.18741	null
2025-01-30	WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training	Benjamin Feuer et.al.	2501.18511	link
2025-01-30	Examining the Expanding Role of Synthetic Data Throughout the AI Development Pipeline	Shivani Kapania et.al.	2501.18493	null
2025-01-30	MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding	Yuxin Zuo et.al.	2501.18362	null
2025-01-31	Leveraging Sparsity for Sample-Efficient Preference Learning: A Theoretical Perspective	Yunzhen Yao et.al.	2501.18282	null
2025-01-30	Diverse Preference Optimization	Jack Lanchantin et.al.	2501.18101	null
2025-01-29	Explainable Machine Learning: An Illustration of Kolmogorov-Arnold Network Model for Airfoil Lift Prediction	Sudhanva Kulkarni et.al.	2501.17896	null
2025-01-29	Generative Unordered Flow for Set-Structured Data Generation	Yangming Li et.al.	2501.17770	null
2025-01-29	A Framework for Generating Realistic Synthetic Tabular Data in a Randomized Controlled Trial Setting	Niki Z. Petrakos et.al.	2501.17719	null
2025-01-29	Drivetrain simulation using variational autoencoders	Pallavi Sharma et.al.	2501.17653	null
2025-01-29	RegionGCN: Spatial-Heterogeneity-Aware Graph Convolutional Networks	Hao Guo et.al.	2501.17599	null
2025-01-29	Closing the Gap Between Synthetic and Ground Truth Time Series Distributions via Neural Mapping	Daesoo Lee et.al.	2501.17553	link
2025-01-29	SemML: Enhancing Automata-Theoretic LTL Synthesis with Machine Learning	Jan Kretinsky et.al.	2501.17496	null
2025-01-28	A Guaranteed-Stable Neural Network Approach for Optimal Control of Nonlinear Systems	Anran Li et.al.	2501.17333	null
2025-01-28	CardiCat: a Variational Autoencoder for High-Cardinality Tabular Data	Lee Carlin et.al.	2501.17324	null
2025-01-28	Floodgates up to contain the DeePC and limit extrapolation	Mohammad Ramadan et.al.	2501.17318	link
2025-01-28	FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data	Deren Lei et.al.	2501.17144	link
2025-01-28	What Really Matters for Learning-based LiDAR-Camera Calibration	Shujuan Huang et.al.	2501.16969	null
2025-01-28	DBSCAN in domains with periodic boundary conditions	Xander M. de Wit et.al.	2501.16894	link
2025-01-28	Exponential Family Attention	Kevin Christian Wibisono et.al.	2501.16790	link
2025-01-28	Meta-Federated Learning: A Novel Approach for Real-Time Traffic Flow Management	Bob Johnson et.al.	2501.16758	null
2025-01-28	More Efficient Sybil Detection Mechanisms Leveraging Resistance of Users to Attack Requests	Ali Safarpoor Dehkordi et.al.	2501.16624	link
2025-01-27	DialUp! Modeling the Language Continuum by Adapting Models to Dialects and Dialects to Models	Niyati Bafna et.al.	2501.16581	null
2025-01-27	LoRA-X: Bridging Foundation Models with Training-Free Cross-Model Adaptation	Farzad Farhadzadeh et.al.	2501.16559	null
2025-01-27	Programming by Examples Meets Historical Linguistics: A Large Language Model Based Approach to Sound Law Induction	Atharva Naik et.al.	2501.16524	null
2025-01-27	SampleLLM: Optimizing Tabular Data Synthesis in Recommendations	Jingtong Gao et.al.	2501.16125	null
2025-01-27	Any2AnyTryon: Leveraging Adaptive Position Embeddings for Versatile Virtual Clothing Tasks	Hailong Guo et.al.	2501.15891	null
2025-01-26	Approximate Message Passing for Bayesian Neural Networks	Romeo Sommerfeld et.al.	2501.15573	link
2025-01-26	Amortized Safe Active Learning for Real-Time Decision-Making: Pretrained Neural Policies from Simulated Nonparametric Functions	Cen-You Li et.al.	2501.15458	null
2025-01-26	OpenCharacter: Training Customizable Role-Playing LLMs with Large-Scale Synthetic Personas	Xiaoyang Wang et.al.	2501.15427	null
2025-01-26	Qwen2.5-1M Technical Report	An Yang et.al.	2501.15383	null
2025-01-26	Federated Class-Incremental Learning: A Hybrid Approach Using Latent Exemplars and Data-Free Techniques to Address Local and Global Forgetting	Milad Khademi Nori et.al.	2501.15356	null
2025-01-25	Processing the 2D and 3D Fresnel experimental databases via topological derivative methods	A. Carpio et.al.	2501.15327	null
2025-01-25	Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data	Jiajie Li et.al.	2501.15326	null
2025-01-25	Explainable YOLO-Based Dyslexia Detection in Synthetic Handwriting Data	Nora Fink et.al.	2501.15263	null
2025-01-25	Enhancing Fetal Plane Classification Accuracy with Data Augmentation Using Diffusion Models	Yueying Tian et.al.	2501.15248	null
2025-01-25	End-to-end localized deep learning for Cryo-ET	Vinith Kishore et.al.	2501.15246	link
2025-01-25	A Training-free Synthetic Data Selection Method for Semantic Segmentation	Hao Tang et.al.	2501.15201	link
2025-01-25	Using Large Language Models for education managements in Vietnamese with low resources	Duc Do Minh et.al.	2501.15022	null
2025-01-24	Stroke classification using Virtual Hybrid Edge Detection from in silico electrical impedance tomography data	Juan Pablo Agnelli et.al.	2501.14704	null
2025-01-24	Deep-BrownConrady: Prediction of Camera Calibration and Distortion Parameters Using Deep Learning and Synthetic Data	Faiz Muhammad Chaudhry et.al.	2501.14510	null
2025-01-24	Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video	Xiaohao Xu et.al.	2501.14319	link
2025-01-23	Advancing Math Reasoning in Language Models: The Impact of Problem-Solving Data, Data Synthesis Methods, and Training Stages	Zui Chen et.al.	2501.14002	null
2025-01-23	GeoPixel: Pixel Grounding Large Multimodal Model in Remote Sensing	Akashah Shabbir et.al.	2501.13925	link
2025-01-23	Federated Granger Causality Learning for Interdependent Clients with State Space Representation	Ayush Mohanty et.al.	2501.13890	null
2025-01-23	Fast Iterative and Task-Specific Imputation with Online Learning	Rahul Bordoloi et.al.	2501.13786	null
2025-01-23	A Mutual Information Perspective on Multiple Latent Variable Generative Models for Positive View Generation	Dario Serez et.al.	2501.13718	null
2025-01-23	Exploring the interplay between small and large scales movements in a neotropical small mammal	E. Brigatti et.al.	2501.13688	null
2025-01-23	Robust Amortized Bayesian Inference with Self-Consistency Losses on Unlabeled Data	Aayush Mishra et.al.	2501.13483	null
2025-01-23	VulnBot: Autonomous Penetration Testing for A Multi-Agent Collaborative Framework	He Kong et.al.	2501.13411	link
2025-01-23	Beyond Task Diversity: Provable Representation Transfer for Sequential Multi-Task Linear Bandits	Thang Duong et.al.	2501.13390	link
2025-01-23	Generative Data Augmentation Challenge: Zero-Shot Speech Synthesis for Personalized Speech Enhancement	Jae-Sung Bae et.al.	2501.13372	null
2025-01-22	Learning accurate rigid registration for longitudinal brain MRI from synthetic data	Jingru Fu et.al.	2501.13010	null
2025-01-22	Deep learning enhanced initial model prediction in elastic FWI: application to marine streamer data	Pavel Plotnitskii et.al.	2501.12992	link
2025-01-22	Implicit Causality-biases in humans and LLMs as a tool for benchmarking LLM discourse capabilities	Florian Kankowski et.al.	2501.12980	null
2025-01-22	Generating Diverse Q&A Benchmarks for RAG Evaluation with DataMorgana	Simone Filice et.al.	2501.12789	null
2025-01-22	REX: Causal Discovery based on Machine Learning and Explainability techniques	Jesus Renero et.al.	2501.12706	link
2025-01-22	Approximate Puzzlepiece Compositing	Xuan Huang et.al.	2501.12581	null
2025-01-21	A Domain Adaptation Framework for Speech Recognition Systems with Only Synthetic data	Minh Tran et.al.	2501.12501	null
2025-01-21	BlanketGen2-Fit3D: Synthetic Blanket Augmentation Towards Improving Real-World In-Bed Blanket Occluded Human Pose Estimation	Tamás Karácsony et.al.	2501.12318	null
2025-01-21	Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement	Maosong Cao et.al.	2501.12273	link
2025-01-21	Exploring Temporally-Aware Features for Point Tracking	Inès Hyeonsu Kim et.al.	2501.12218	link
2025-01-21	Foreign object segmentation in chest x-rays through anatomy-guided shape insertion	Constantin Seibold et.al.	2501.12022	null
2025-01-21	TabularARGN: A Flexible and Efficient Auto-Regressive Framework for Generating High-Fidelity Synthetic Data	Paul Tiwald et.al.	2501.12012	link
2025-01-21	Diffeomorphic ICP Registration for Single and Multiple Point Sets	Adrien Wohrer et.al.	2501.11986	link
2025-01-21	Leveraging Graph Structures and Large Language Models for End-to-End Synthetic Task-Oriented Dialogues	Maya Medjad et.al.	2501.11977	link
2025-01-21	Progressive Cross Attention Network for Flood Segmentation using Multispectral Satellite Imagery	Vicky Feliren et.al.	2501.11923	null
2025-01-21	Finding the nearest bounded-real port-Hamiltonian system	Karim Cherifi et.al.	2501.11903	link
2025-01-21	*Enhanced imaging of M87: Simulations with the EHT and extended-KVN**	Ilje Cho et.al.	2501.11822	null
2025-01-20	Synthetic Data Can Mislead Evaluations: Membership Inference as Machine Text Detection	Ali Naseh et.al.	2501.11786	null
2025-01-20	Efficient Bearing Sensor Data Compression via an Asymmetrical Autoencoder with a Lifting Wavelet Transform Layer	Xin Zhu et.al.	2501.11737	null
2025-01-20	KKL Observer Synthesis for Nonlinear Systems via Physics-Informed Learning	M. Umar B. Niazi et.al.	2501.11655	null
2025-01-20	A Multidimensional Elasticity Framework for Adaptive Data Analytics Management in the Computing Continuum	Sergio Laso et.al.	2501.11369	null
2025-01-20	A Survey of World Models for Autonomous Driving	Tuo Feng et.al.	2501.11260	link
2025-01-17	Using Technology in Digital Humanities for Learning and Knowledge Dissemination	Armanda Rodrigues et.al.	2501.10275	null
2025-01-17	PaSa: An LLM Agent for Comprehensive Academic Paper Search	Yichen He et.al.	2501.10120	link
2025-01-17	Spatiotemporal Prediction of Secondary Crashes by Rebalancing Dynamic and Static Data with Generative Adversarial Networks	Junlan Chen et.al.	2501.10041	null
2025-01-17	Enhancing Crash Frequency Modeling Based on Augmented Multi-Type Data by Hybrid VAE-Diffusion-Based Generative Neural Networks	Junlan Chen et.al.	2501.10017	null
2025-01-17	ComptoNet: An End-to-End Deep Learning Framework for Scatter Estimation in Multi-Source Stationary CT	Yingxian Xia et.al.	2501.09986	null
2025-01-17	GenSC-6G: A Prototype Testbed for Integrated Generative AI, Quantum, and Semantic Communication	Brian E. Arfeto et.al.	2501.09918	link
2025-01-17	Decoding Patterns of Data Generation Teams for Clinical and Scientific Success: Insights from the Bridge2AI Talent Knowledge Graph	Jiawei Xu et.al.	2501.09897	null
2025-01-16	Improving Automated Feedback Systems for Tutor Training in Low-Resource Scenarios through Data Augmentation	Chentianye Xu et.al.	2501.09824	null
2025-01-16	Metrics for Inter-Dataset Similarity with Example Applications in Synthetic Data and Feature Selection Evaluation – Extended Version	Muhammad Rajabinasab et.al.	2501.09591	link
2025-01-16	Sequential PatchCore: Anomaly Detection for Surface Inspection using Synthetic Impurities	Runzhou Mao et.al.	2501.09579	null
2025-01-16	Joint Transmission and Deblurring: A Semantic Communication Approach Using Events	Pujing Yang et.al.	2501.09396	null
2025-01-16	Identifying Information from Observations with Uncertainty and Novelty	Derek S. Prijatelj et.al.	2501.09331	null
2025-01-17	Text-guided Synthetic Geometric Augmentation for Zero-shot 3D Understanding	Kohei Torimi et.al.	2501.09278	null
2025-01-15	Generative diffusion model with inverse renormalization group flows	Kanta Masuki et.al.	2501.09064	link
2025-01-15	UNIR-Net: A Novel Approach for Restoring Underwater Images with Non-Uniform Illumination Using Synthetic Data	Ezequiel Perez-Zarate et.al.	2501.09053	link
2025-01-15	Generating Realistic Synthetic Head Rotation Data for Extended Reality using Deep Learning	Jakob Struye et.al.	2501.09050	null
2025-01-15	Aegis2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails	Shaona Ghosh et.al.	2501.09004	null
2025-01-17	VECT-GAN: A variationally encoded generative model for overcoming data scarcity in pharmaceutical science	Youssef Abdalla et.al.	2501.08995	link
2025-01-15	SAIF: A Comprehensive Framework for Evaluating the Risks of Generative AI in the Public Sector	Kyeongryul Lee et.al.	2501.08814	null
2025-01-15	Enhanced Large Language Models for Effective Screening of Depression and Anxiety	June M. Liu et.al.	2501.08769	null
2025-01-15	Computerized Assessment of Motor Imitation for Distinguishing Autism in Video (CAMI-2DNet)	Kaleab A. Kinfu et.al.	2501.08609	null
2025-01-14	CodecFake-Omni: A Large-Scale Codec-based Deepfake Speech Dataset	Jiawei Du et.al.	2501.08238	null
2025-01-14	Audio-visual Deepfake Detection With Local Temporal Inconsistencies	Marcella Astrid et.al.	2501.08137	null
2025-01-14	Smooth Handovers via Smoothed Online Learning	Michail Kalntis et.al.	2501.08099	null
2025-01-14	Bridge-SR: Schrödinger Bridge for Efficient SR	Chang Li et.al.	2501.07897	null
2025-01-14	HgPCN: A Heterogeneous Architecture for E2E Embedded Point Cloud Inference	Yiming Gao et.al.	2501.07767	null
2025-01-13	Active Learning Enhanced Surrogate Modeling of Jet Engines in JuliaSim	Anas Abdelrehim et.al.	2501.07701	null
2025-01-13	Dataset Distillation as Pushforward Optimal Quantization	Hong Ye Tan et.al.	2501.07681	null
2025-01-13	CDS: Data Synthesis Method Guided by Cognitive Diagnosis Theory	Haokun Zhao et.al.	2501.07674	null
2025-01-13	Finite Sample Identification of Partially Observed Bilinear Dynamical Systems	Yahya Sattar et.al.	2501.07652	null
2025-01-13	Dataset Distillation via Committee Voting	Jiacheng Cui et.al.	2501.07575	link
2025-01-13	OCORD: Open-Campus Object Removal Dataset	Shuo Zhang et.al.	2501.07397	null
2025-01-13	Ultrasonic Medical Tissue Imaging Using Probabilistic Inversion: Leveraging Variational Inference for Speed Reconstruction and Uncertainty Quantification	Qiang Li et.al.	2501.07348	null
2025-01-13	The Lessons of Developing Process Reward Models in Mathematical Reasoning	Zhenru Zhang et.al.	2501.07301	null
2025-01-13	A data-driven approach to discover and quantify systemic lupus erythematosus etiological heterogeneity from electronic health records	Marco Barbero Mota et.al.	2501.07206	null
2025-01-15	Adaptive Noise-Tolerant Network for Image Segmentation	Weizhi Li et.al.	2501.07163	null
2025-01-13	State-space algorithm for detecting the nanohertz gravitational wave background	Tom Kimpson et.al.	2501.06990	null
2025-01-12	Synthetic Prior for Few-Shot Drivable Head Avatar Inversion	Wojciech Zielonka et.al.	2501.06903	null
2025-01-12	DRDT3: Diffusion-Refined Decision Test-Time Training Model	Xingshuai Huang et.al.	2501.06718	null
2025-01-11	Study Self-lensing/Eclipsing Signals in Edge-on Double White-Dwarf Systems	Sedighe Sajadian et.al.	2501.06498	link
2025-01-11	A Correlated Data-Driven Collaborative Beamforming Approach for Energy-efficient IoT Data Transmission	Yangning Li et.al.	2501.06464	null
2025-01-10	Diffusion Models for Smarter UAVs: Decision-Making and Modeling	Yousef Emami et.al.	2501.05819	null
2025-01-10	Enabling Scalable Oversight via Self-Evolving Critic	Zhengyang Tang et.al.	2501.05727	null
2025-01-10	Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains	Vighnesh Subramaniam et.al.	2501.05707	null
2025-01-10	Cascaded Self-Evaluation Augmented Training for Efficient Multimodal Large Language Models	Zheqi Lv et.al.	2501.05662	null
2025-01-09	Introducing the generalized gamma distribution: a flexible distribution for index standardization	Jillian C. Dunic et.al.	2501.05618	null
2025-01-09	Bit-depth color recovery via off-the-shelf super-resolution models	Xuanshuo Fu et.al.	2501.05611	null
2025-01-09	Generative Flow Networks: Theory and Applications to Structure Learning	Tristan Deleu et.al.	2501.05498	null
2025-01-09	Arc2Avatar: Generating Expressive 3D Avatars from a Single Image via ID Guidance	Dimitrios Gerogiannis et.al.	2501.05379	null
2025-01-09	KabaddiPy: A package to enable access to Professional Kabaddi Data	Bhaskar Lalwani et.al.	2501.05168	link
2025-01-09	Biomedical Relation Extraction via Adaptive Document-Relation Cross-Mapping and Concept Unique Identifier	Yufei Shang et.al.	2501.05155	null
2025-01-09	Constrained Optimization of Charged Particle Tracking with Multi-Agent Reinforcement Learning	Tobias Kortus et.al.	2501.05113	null
2025-01-09	MORDA: A Synthetic Dataset to Facilitate Adaptation of Object Detectors to Unseen Real-target Domain While Preserving Performance on Real-source Domain	Hojun Lim et.al.	2501.04950	null
2025-01-09	Towards understanding the bias in decision trees	Nathan Phelps et.al.	2501.04903	null
2025-01-08	URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics	Ruilin Luo et.al.	2501.04686	link
2025-01-08	Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought	Violet Xiang et.al.	2501.04682	null
2025-01-08	Enhancing Virtual Try-On with Synthetic Pairs and Error-Aware Noise Scheduling	Nannan Li et.al.	2501.04666	null
2025-01-09	MedCoDi-M: A Multi-Prompt Foundation Model for Multimodal Medical Data Generation	Daniele Molino et.al.	2501.04614	null
2025-01-09	MobileH2R: Learning Generalizable Human to Mobile Robot Handover Exclusively from Scalable and Diverse Synthetic Data	Zifan Wang et.al.	2501.04595	null
2025-01-08	Inferring resource competition in microbial communities from time series	Xiaowen Chen et.al.	2501.04520	null
2025-01-08	User Simulation in the Era of Generative AI: User Modeling, Synthetic Data Generation, and System Evaluation	Krisztian Balog et.al.	2501.04410	null
2025-01-07	MM-GEN: Enhancing Task Performance Through Targeted Multimodal Data Curation	Siddharth Joshi et.al.	2501.04155	link
2025-01-07	Scalable Discovery of Fundamental Physical Laws: Learning Magnetohydrodynamics from 3D Turbulence Data	Matthew Golden et.al.	2501.04094	null
2025-01-07	Synthetic Data for Portfolios: A Throw of the Dice Will Never Abolish Chance	Adil Rengim Cetingoz et.al.	2501.03993	null
2025-01-07	Synthetic Data Privacy Metrics	Amy Steier et.al.	2501.03941	null
2025-01-07	A precise asymptotic analysis of learning diffusion models: theory and insights	Hugo Cui et.al.	2501.03937	link
2025-01-07	OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints	Mingjie Pan et.al.	2501.03841	null
2025-01-07	Exploring Molecule Generation Using Latent Space Graph Diffusion	Prashanth Pombala et.al.	2501.03696	link
2025-01-07	SMIR: Efficient Synthetic Data Pipeline To Improve Multi-Image Reasoning	Andrew Li et.al.	2501.03675	link
2025-01-07	Imitation Learning of MPC with Neural Networks: Error Guarantees and Sparsification	Hendrik Alsmeier et.al.	2501.03671	null
2025-01-07	Advancing the Understanding of Fine-Grained 3D Forest Structures using Digital Cousins and Simulation-to-Reality: Methods and Datasets	Jing Liu et.al.	2501.03637	null
2025-01-07	Reading with Intent – Neutralizing Intent	Benjamin Reichman et.al.	2501.03475	null
2025-01-07	MTRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation Systems	Yannis Katsis et.al.	2501.03468	link
2025-01-06	License Plate Images Generation with Diffusion Models	Mariia Shpir et.al.	2501.03374	null
2025-01-06	MObI: Multimodal Object Inpainting Using Diffusion Models	Alexandru Buburuzan et.al.	2501.03173	null
2025-01-06	The Scaling Law for LoRA Base on Mutual Information Upper Bound	Jing Zhang et.al.	2501.03152	null
2025-01-06	Learning DAGs and Root Causes from Time-Series Data	Panagiotis Misiakos et.al.	2501.03130	null
2025-01-06	Probably Correct Optimal Stable Matching for Two-Sided Markets Under Uncertainty	Andreas Athanasopoulos et.al.	2501.03018	link
2025-01-05	A Statistical Hypothesis Testing Framework for Data Misappropriation Detection in Large Language Models	Yinpeng Cai et.al.	2501.02441	null
2025-01-04	Reweighting Improves Conditional Risk Bounds	Yikai Zhang et.al.	2501.02353	null
2025-01-04	Diffusion Model-Based Data Synthesis Aided Federated Semi-Supervised Learning	Zhongwei Wang et.al.	2501.02219	null
2025-01-04	Phase Retrieval by Quaternionic Reweighted Amplitude Flow on Image Reconstruction	Ren Hu et.al.	2501.02180	null
2025-01-03	DreamMask: Boosting Open-vocabulary Panoptic Segmentation with Synthetic Data	Yuanpeng Tu et.al.	2501.02048	null
2025-01-03	Detecting Music Performance Errors with Transformers	Benjamin Shiue-Hal Chou et.al.	2501.02030	link
2025-01-03	Learning from Ambiguous Data with Hard Labels	Zeke Xie et.al.	2501.01844	null
2025-01-03	Time Series Language Model for Descriptive Caption Generation	Mohamed Trabelsi et.al.	2501.01832	null
2025-01-03	Creating Artificial Students that Never Existed: Leveraging Large Language Models and CTGANs for Synthetic Data Generation	Mohammad Khalil et.al.	2501.01793	link
2025-01-03	Can Synthetic Data be Fair and Private? A Comparative Study of Synthetic Data Generation and Fairness Algorithms	Qinyi Liu et.al.	2501.01785	null
2025-01-03	CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis	Bohan Zhang et.al.	2501.01668	link
2025-01-02	OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios	Xize Cheng et.al.	2501.01384	null
2025-01-03	TabTreeFormer: Tabular Data Generation Using Hybrid Tree-Transformer	Jiayu Li et.al.	2501.01216	null
2025-01-02	Vulnerability-Aware Spatio-Temporal Learning for Generalizable and Interpretable Deepfake Video Detection	Dat Nguyen et.al.	2501.01184	null
2025-01-02	Ultrasound Lung Aeration Map via Physics-Aware Neural Operators	Jiayun Wang et.al.	2501.01157	null
2025-01-03	KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model	Xinshuo Hu et.al.	2501.01028	link
2025-01-01	Enhancing Early Diabetic Retinopathy Detection through Synthetic DR1 Image Generation: A StyleGAN3 Approach	Sagarnil Das et.al.	2501.00954	null
2025-01-01	A Novel Diffusion Model for Pairwise Geoscience Data Generation with Unbalanced Training Dataset	Junhuan Yang et.al.	2501.00941	null
2025-01-01	Population Aware Diffusion for Time Series Generation	Yang Li et.al.	2501.00910	link
2025-01-01	VoiceRestore: Flow-Matching Transformers for Speech Recording Quality Restoration	Stanislav Kirdey et.al.	2501.00794	link
2025-01-01	Beyond Static Datasets: A Behavior-Driven Entity-Specific Simulation to Overcome Data Scarcity and Train Effective Crypto Anti-Money Laundering Models	Dinesh Srivasthav P et.al.	2501.00757	null
2025-01-01	RORem: Training a Robust Object Remover with Human-in-the-Loop	Ruibin Li et.al.	2501.00740	link
2025-01-01	Cost and Reward Infused Metric Elicitation	Chethan Bhateja et.al.	2501.00696	link
2024-12-31	Compositional Covariate Importance Testing via Partial Conjunction of Bivariate Hypotheses	Ritwik Bhaduri et.al.	2501.00566	link
2024-12-31	Tensor Topic Modeling Via HOSVD	Yating Liu et.al.	2501.00535	null
2024-12-31	Addressing Challenges in Data Quality and Model Generalization for Malaria Detection	Kiswendsida Kisito Kabore et.al.	2501.00464	null
2024-12-30	A flexible parametric approach to synthetic patients generation using health data	Marta Cipriani et.al.	2412.21056	link
2024-12-30	Rethinking Aleatoric and Epistemic Uncertainty	Freddie Bickford Smith et.al.	2412.20892	null
2024-12-31	HUNYUANPROVER: A Scalable Data Synthesis Framework and Guided Tree Search for Automated Theorem Proving	Yang Li et.al.	2412.20735	null
2024-12-30	SafeSynthDP: Leveraging Large Language Models for Privacy-Preserving Synthetic Data Generation Using Differential Privacy	Md Mahadi Hasan Nahid et.al.	2412.20641	null
2024-12-29	LEARNER: A Transfer Learning Method for Low-Rank Matrix Estimation	Sean McGrath et.al.	2412.20605	link
2024-12-29	Testing and Improving the Robustness of Amortized Bayesian Inference for Cognitive Models	Yufei Wu et.al.	2412.20586	link
2024-12-29	Sub-optimal Learning in Meta-Classifier Attacks: A Study of Membership Inference on Differentially Private Location Aggregates	Yuhan Liu et.al.	2412.20456	null
2024-12-29	Image Augmentation Agent for Weakly Supervised Semantic Segmentation	Wangyu Wu et.al.	2412.20439	null
2024-12-29	LLM2: Let Large Language Models Harness System 2 Reasoning	Cheng Yang et.al.	2412.20372	link
2024-12-28	Machine-Learning Enabled Multidimensional Data Utilization in Multi-resonance Biosensors: A Pathway to Enhanced Accuracy	Majid Aalizadeh et.al.	2412.20245	null
2024-12-27	Data-driven tool wear prediction in milling, based on a process-integrated single-sensor approach	Eric Hirsch et.al.	2412.19950	null
2024-12-27	Direct estimates of irreversibility from time series	Trevor GrandPre et.al.	2412.19772	null
2024-12-27	Generative Video Propagation	Shaoteng Liu et.al.	2412.19761	null
2024-12-27	OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis	Qiushi Sun et.al.	2412.19723	null
2024-12-27	TARGA: Targeted Synthetic Data Generation for Practical Reasoning over Structured Data	Xiang Huang et.al.	2412.19544	link
2024-12-27	Learning Radiance Fields from a Single Snapshot Compressive Image	Yunhao Li et.al.	2412.19483	null
2024-12-27	NijiGAN: Transform What You See into Anime with Contrastive Semi-Supervised Learning and Neural Ordinary Differential Equations	Kevin Putra Santoso et.al.	2412.19455	null
2024-12-26	Adaptive Conformal Inference by Betting	Aleksandr Podkopaev et.al.	2412.19318	null
2024-12-26	Advanced Knowledge Transfer: Refined Feature Distillation for Zero-Shot Quantization in Edge Computing	Inpyo Hong et.al.	2412.19125	link
2024-12-26	Discrete vs. Continuous Trade-offs for Generative Models	Jathin Korrapati et.al.	2412.19114	null
2024-12-26	Mask Factory: Towards High-quality Synthetic Data Generation for Dichotomous Image Segmentation	Haotian Qian et.al.	2412.19080	null
2024-12-25	Bootstrap Your Own Context Length	Liang Wang et.al.	2412.18860	null
2024-12-24	HTR-JAND: Handwritten Text Recognition with Joint Attention Network and Knowledge Distillation	Mohammed Hamdan et.al.	2412.18524	null
2024-12-24	Subsampling, aligning, and averaging to find circular coordinates in recurrent time series	Andrew J. Blumberg et.al.	2412.18515	null
2024-12-24	GUI Testing Arena: A Unified Benchmark for Advancing Autonomous GUI Testing Agent	Kangjia Zhao et.al.	2412.18426	null
2024-12-24	Compact Binary Coalescence Gravitational Wave Signals Counting and Separation Using UnMixFormer	Tianyu Zhao et.al.	2412.18259	null
2024-12-24	Efficient Long Context Language Model Retrieval with Compression	Minju Seo et.al.	2412.18232	null
2024-12-24	PCM Selector: Penalized Covariate-Mediator Selection Operator for Evaluating Linear Causal Effects	Hisayoshi Nanmo et.al.	2412.18180	null
2024-12-24	AIGT: AI Generative Table Based on Prompt	Mingming Zhang et.al.	2412.18111	null
2024-12-24	Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner	Aizierjiang Aiersilan et.al.	2412.18086	link
2024-12-23	AA-SGAN: Adversarially Augmented Social GAN with Synthetic Data	Mirko Zaffaroni et.al.	2412.18038	link
2024-12-23	FaceLift: Single Image to 3D Head with View Generation and GS-LRM	Weijie Lyu et.al.	2412.17812	null
2024-12-23	Generating Completions for Fragmented Broca’s Aphasic Sentences Using Large Language Models	Sijbren van Vaals et.al.	2412.17669	link
2024-12-23	Rate of Model Collapse in Recursive Training	Ananda Theertha Suresh et.al.	2412.17646	link
2024-12-23	HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data	Ting Zhou et.al.	2412.17574	link
2024-12-23	ERUPD – English to Roman Urdu Parallel Dataset	Mohammed Furqan et.al.	2412.17562	null
2024-12-23	Multimodal Preference Data Synthetic Alignment with Reward Model	Robert Wijaya et.al.	2412.17417	link
2024-12-23	Dynamics of Collective Information Processing for Risk Encoding in Social Networks during Crises	Chao Fan et.al.	2412.17342	null
2024-12-23	MatchMiner-AI: An Open-Source Solution for Cancer Clinical Trial Matching	Ethan Cerami et.al.	2412.17228	null
2024-12-22	DreamOmni: Unified Image Generation and Editing	Bin Xia et.al.	2412.17098	null
2024-12-22	Multi-Agent Sampling: Scaling Inference Compute for Data Synthesis with Tree Search-Based Agentic Collaboration	Hai Ye et.al.	2412.17061	link
2024-12-22	Generate to Discriminate: Expert Routing for Continual Learning	Yewon Byun et.al.	2412.17009	null
2024-12-22	Diffusion-Based Approaches in Medical Image Generation and Analysis	Abdullah al Nomaan Nafi et.al.	2412.16860	null
2024-12-22	GME: Improving Universal Multimodal Retrieval by Multimodal LLMs	Xin Zhang et.al.	2412.16855	null
2024-12-21	A Comprehensive Guide to Item Recovery Using the Multidimensional Graded Response Model in R	Yesim Beril Soguksu et.al.	2412.16657	null
2024-12-21	A Systems Thinking Approach to Algorithmic Fairness	Chris Lam et.al.	2412.16641	null
2024-12-20	Personalized Representation from Personalized Generation	Shobhita Sundaram et.al.	2412.16156	link
2024-12-20	Differentially Private Federated Learning of Diffusion Models for Synthetic Tabular Data Generation	Timur Sattarov et.al.	2412.16083	null
2024-12-20	Legommenders: A Comprehensive Content-Based Recommendation Library with LLM Support	Qijiong Liu et.al.	2412.15973	link
2024-12-20	Fine-tuning Whisper on Low-Resource Languages for Real-World Applications	Vincenzo Timmel et.al.	2412.15726	link
2024-12-20	Synthetic Tabular Data Generation for Imbalanced Classification: The Surprising Effectiveness of an Overlap Class	Annie D’souza et.al.	2412.15657	link
2024-12-20	Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage	Zhi Gao et.al.	2412.15606	null
2024-12-20	ChangeDiff: A Multi-Temporal Change Detection Data Generator with Flexible Text Prompts via Diffusion Model	Qi Zang et.al.	2412.15541	link
2024-12-20	GCA-3D: Towards Generalized and Consistent Domain Adaptation of 3D Generators	Hengjia Li et.al.	2412.15491	null
2024-12-20	Toward Appearance-based Autonomous Landing Site Identification for Multirotor Drones in Unstructured Environments	Joshua Springer et.al.	2412.15486	null
2024-12-19	Tree-of-Code: A Tree-Structured Exploring Framework for End-to-End Code Generation and Execution in Complex Task Handling	Ziyi Ni et.al.	2412.15305	null
2024-12-19	OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization	Jiacheng Zhang et.al.	2412.15159	null
2024-12-19	Language Models as Continuous Self-Evolving Data Engineers	Peidong Wang et.al.	2412.15151	null
2024-12-19	Assessing treatment effects in observational data with missing confounders: A comparative study of practical doubly-robust and traditional missing data methods	Brian D. Williamson et.al.	2412.15012	link
2024-12-19	DS $^2$ -ABSA: Dual-Stream Data Synthesis with Label Refinement for Few-Shot Aspect-Based Sentiment Analysis	Hongling Xu et.al.	2412.14849	link
2024-12-20	ResoFilter: Fine-grained Synthetic Data Filtering for Large Language Models through Data-Parameter Resonance Analysis	Zeao Tu et.al.	2412.14809	link
2024-12-19	ALKAFI-LLAMA3: Fine-Tuning LLMs for Precise Legal Understanding in Palestine	Rabee Qasem et.al.	2412.14771	null
2024-12-19	How to Synthesize Text Data without Model Collapse?	Xuekai Zhu et.al.	2412.14689	null
2024-12-19	Bel Esprit: Multi-Agent Framework for Building AI Model Pipelines	Yunsu Kim et.al.	2412.14684	null
2024-12-19	Drive-1-to-3: Enriching Diffusion Priors for Novel View Synthesis of Real Vehicles	Chuang Lin et.al.	2412.14494	null
2024-12-19	MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval	Junjie Zhou et.al.	2412.14475	null
2024-12-18	GREGoR: Accelerating Genomics for Rare Diseases	Moez Dawood et.al.	2412.14338	null
2024-12-18	MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data	Hanwen Jiang et.al.	2412.14166	null
2024-12-18	Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective	Zhiyuan Zeng et.al.	2412.14135	null
2024-12-18	Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation	Haotong Lin et.al.	2412.14015	link
2024-12-18	Domain-adaptative Continual Learning for Low-resource Tasks: Evaluation on Nepali	Sharad Duwal et.al.	2412.13860	null
2024-12-18	RadField3D: A Data Generator and Data Format for Deep Learning in Radiation-Protection Dosimetry for Medical Applications	Felix Lehner et.al.	2412.13852	link
2024-12-18	Object Style Diffusion for Generalized Object Detection in Urban Scene	Hao Li et.al.	2412.13815	null
2024-12-18	Text2Relight: Creative Portrait Relighting with Text Guidance	Junuk Cha et.al.	2412.13734	null
2024-12-18	NPC: Neural Predictive Control for Fuel-Efficient Autonomous Trucks	Jiaping Ren et.al.	2412.13618	null
2024-12-18	Single-cell spatial (scs) omics: Recent developments in data analysis	José Camacho et.al.	2412.13591	null
2024-12-18	Hybrid Data-Free Knowledge Distillation	Jialiang Tang et.al.	2412.13525	link
2024-12-18	Learning Causal Transition Matrix for Instance-dependent Label Noise	Jiahui Li et.al.	2412.13516	null
2024-12-18	AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark	Jianlyu Chen et.al.	2412.13102	link
2024-12-17	Are Data Experts Buying into Differentially Private Synthetic Data? Gathering Community Perspectives	Lucas Rosenblatt et.al.	2412.13030	null
2024-12-17	OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain	Shuting Wang et.al.	2412.13018	link
2024-12-17	Synthetic Data Generation for Anomaly Detection on Table Grapes	Ionut Marian Motoi et.al.	2412.12949	link
2024-12-17	SynthCypher: A Fully Synthetic Data Generation Framework for Text-to-Cypher Querying in Knowledge Graphs	Aman Tiwari et.al.	2412.12612	null
2024-12-17	Libri2Vox Dataset: Target Speaker Extraction with Diverse Speaker Conditions and Synthetic Data	Yun Liu et.al.	2412.12512	null
2024-12-17	Persona-SQ: A Personalized Suggested Question Generation Framework For Real-world Documents	Zihao Lin et.al.	2412.12445	link
2024-12-17	On the Number of Vertices in a Hyperplane Section of a Polytope	Jesús A. De Loera et.al.	2412.12419	null
2024-12-16	LLM-RG4: Flexible and Factual Radiology Report Generation across Diverse Input Contexts	Zhuhao Wang et.al.	2412.12001	link
2024-12-16	Controllable Shadow Generation with Single-Step Diffusion Models from Synthetic Data	Onur Tasar et.al.	2412.11972	null
2024-12-16	Scalable Data Transmission Framework for Earth Observation Satellites with Channel Adaptation	Van-Phuc Bui et.al.	2412.11857	null
2024-12-16	Beyond Dataset Creation: Critical View of Annotation Variation and Bias Probing of a Dataset for Online Radical Content Detection	Arij Riabi et.al.	2412.11745	null
2024-12-18	Conditional Diffusion Models Based Conditional Independence Testing	Yanfeng Yang et.al.	2412.11744	link
2024-12-16	Generalized Bayesian deep reinforcement learning	Shreya Sinha Roy et.al.	2412.11743	null
2024-12-16	PSGraph: Differentially Private Streaming Graph Synthesis by Considering Temporal Dynamics	Quan Yuan et.al.	2412.11369	null
2024-12-17	Learning Set Functions with Implicit Differentiation	Gözde Özcan et.al.	2412.11239	null
2024-12-15	Drawing the Line: Enhancing Trustworthiness of MLLMs Through the Power of Refusal	Yuhao Wang et.al.	2412.11196	null
2024-12-15	OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation	Bohan Li et.al.	2412.11183	null
2024-12-15	AD-LLM: Benchmarking Large Language Models for Anomaly Detection	Tiankai Yang et.al.	2412.11142	link
2024-12-15	Empowering LLMs to Understand and Generate Complex Vector Graphics	Ximing Xing et.al.	2412.11102	null
2024-12-15	Understanding and Mitigating Memorization in Diffusion Models for Tabular Data	Zhengyu Fang et.al.	2412.11044	null
2024-12-13	Differentially Private Multi-Sampling from Distributions	Albert Cheu et.al.	2412.10512	null
2024-12-13	Uncertainties in Signal Recovery from Heterogeneous and Convoluted Time Series with Principal Component Analysis	Mariia Legenkaia et.al.	2412.10175	null
2024-12-13	Research Integrity and GenAI: A Systematic Analysis of Ethical Challenges Across Research Phases	Sonja Bjelobaba et.al.	2412.10134	null
2024-12-13	AMUSE: Adaptive Model Updating using a Simulated Environment	Louis Chislett et.al.	2412.10119	null
2024-12-13	Quaffure: Real-Time Quasi-Static Neural Hair Simulation	Tuur Stuyck et.al.	2412.10061	null
2024-12-13	Are you doing better than random guessing? A call for using negative controls when evaluating causal discovery algorithms	Anne Helby Petersen et.al.	2412.10039	null
2024-12-13	Latent feedback control of distributed systems in multiple scenarios through deep learning-based reduced order models	Matteo Tomasetto et.al.	2412.09942	null
2024-12-13	Financial Sentiment Analysis: Leveraging Actual and Synthetic Data for Supervised Fine-tuning	Abraham Atsiwo et.al.	2412.09859	link
2024-12-13	Leveraging Programmatically Generated Synthetic Data for Differentially Private Diffusion Training	Yujin Choi et.al.	2412.09842	link
2024-12-13	LLM Distillation for Efficient Few-Shot Multiple Choice Question Answering	Patrick Sutanto et.al.	2412.09807	null
2024-12-12	Private Synthetic Data Generation in Small Memory	Rayne Holland et.al.	2412.09756	null
2024-12-12	Should We Learn Contact-Rich Manipulation Policies from Sampling-Based Planners?	Huaijiang Zhu et.al.	2412.09743	null
2024-12-12	AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials	Yiheng Xu et.al.	2412.09605	null
2024-12-12	A Plug-and-Play Algorithm for 3D Video Super-Resolution of Single-Photon LiDAR data	Alice Ruget et.al.	2412.09427	null
2024-12-12	MaskTerial: A Foundation Model for Automated 2D Material Flake Detection	Jan-Lucas Uslu et.al.	2412.09333	null
2024-12-13	First Train to Generate, then Generate to Train: UnitedSynT5 for Few-Shot NLI	Sourav Banerjee et.al.	2412.09263	null
2024-12-12	VLMs meet UDA: Boosting Transferability of Open Vocabulary Segmentation with Unsupervised Domain Adaptation	Roberto Alcover-Couso et.al.	2412.09240	null
2024-12-12	eCARLA-scenes: A synthetically generated dataset for event-based optical flow prediction	Jad Mansour et.al.	2412.09209	link
2024-12-12	Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method	Xinshuai Song et.al.	2412.09082	null
2024-12-12	Phi-4 Technical Report	Marah Abdin et.al.	2412.08905	null
2024-12-12	A Graph-Based Synthetic Data Pipeline for Scaling High-Quality Reasoning Instructions	Jiankang Wang et.al.	2412.08864	null
2024-12-12	Exploring Large Language Models on Cross-Cultural Values in Connection with Training Methodology	Minsang Kim et.al.	2412.08846	null
2024-12-11	Efficient Dynamic Attributed Graph Generation	Fan Li et.al.	2412.08810	null
2024-12-11	Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions	Jiarui Zhang et.al.	2412.08737	null
2024-12-11	Coherent3D: Coherent 3D Portrait Video Reconstruction via Triplane Fusion	Shengze Wang et.al.	2412.08684	null
2024-12-11	A 1% accurate method to include baryonic effects in galaxy-galaxy lensing models	Matteo Zennaro et.al.	2412.08623	null
2024-12-11	Can We Generate Visual Programs Without Prompting LLMs?	Michal Shlapentokh-Rothman et.al.	2412.08564	null
2024-12-11	Federated Learning for Traffic Flow Prediction with Synthetic Data Augmentation	Fermin Orozco et.al.	2412.08460	null
2024-12-11	Generate Any Scene: Evaluating and Improving Text-to-Vision Generation with Scene Graph Programming	Ziqi Gao et.al.	2412.08221	link
2024-12-11	Analyzing and Improving Model Collapse in Rectified Flow Models	Huminhao Zhu et.al.	2412.08175	null
2024-12-11	DiffRaman: A Conditional Latent Denoising Diffusion Probabilistic Model for Bacterial Raman Spectroscopy Identification Under Limited Data Conditions	Haiming Yao et.al.	2412.08131	null
2024-12-11	Progressive Multi-granular Alignments for Grounded Reasoning in Large Vision-Language Models	Quang-Hung Le et.al.	2412.08125	link
2024-12-11	Generative Zoo	Tomasz Niewiadomski et.al.	2412.08101	null
2024-12-11	THUD++: Large-Scale Dynamic Indoor Scene Dataset and Benchmark for Mobile Robots	Zeshun Li et.al.	2412.08096	null
2024-12-11	DialogAgent: An Auto-engagement Agent for Code Question Answering Data Production	Xiaoyun Liang et.al.	2412.08069	null
2024-12-10	Mitigating exponential concentration in covariant quantum kernels for subspace and real-world data	Gabriele Agliardi et.al.	2412.07915	null
2024-12-10	Spectral Differential Network Analysis for High-Dimensional Time Series	Michael Hellstern et.al.	2412.07905	null
2024-12-10	GASP: Gaussian Avatars with Synthetic Priors	Jack Saunders et.al.	2412.07739	null
2024-12-10	Granite Guardian	Inkit Padhi et.al.	2412.07724	link
2024-12-10	SimVS: Simulating World Inconsistencies for Robust View Synthesis	Alex Trevithick et.al.	2412.07696	null
2024-12-10	Bayesian Data Augmentation and Training for Perception DNN in Autonomous Aerial Vehicles	Ashik E Rasul et.al.	2412.07655	link
2024-12-10	SurvBETA: Ensemble-Based Survival Models Using Beran Estimators and Several Attention Mechanisms	Lev V. Utkin et.al.	2412.07638	link
2024-12-10	Causal World Representation in the GPT Model	Raanan Y. Rohekar et.al.	2412.07446	null
2024-12-10	AppGen: Mobility-aware App Usage Behavior Generation for Mobile Users	Zihan Huang et.al.	2412.07267	null
2024-12-10	Epidemiological Model Calibration via Graybox Bayesian Optimization	Puhua Niu et.al.	2412.07193	null
2024-12-11	Rate-In: Information-Driven Adaptive Dropout Rates for Improved Inference-Time Uncertainty Estimation	Tal Zeevi et.al.	2412.07169	link
2024-12-10	Enhancing radioisotope identification in gamma spectra with transfer learning	Peter Lalor et.al.	2412.07069	null
2024-12-09	Data Augmentation with Variational Autoencoder for Imbalanced Dataset	Samuel Stocksieker et.al.	2412.07039	link
2024-12-09	FM2DS: Few-Shot Multimodal Multihop Data Synthesis with Knowledge Distillation for Question Answering	Amirhossein Abaskohi et.al.	2412.07030	link
2024-12-09	ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language Models	Jieyu Zhang et.al.	2412.07012	link
2024-12-09	JAPAGEN: Efficient Few/Zero-shot Learning via Japanese Training Dataset Generation with LLM	Takuro Fujii et.al.	2412.06738	link
2024-12-11	Numerical Estimation of Spatial Distributions under Differential Privacy	Leilei Du et.al.	2412.06541	null
2024-12-09	Improving text-conditioned latent diffusion for cancer pathology	Aakash Madhav Rao et.al.	2412.06487	link
2024-12-09	World-Consistent Data Generation for Vision-and-Language Navigation	Yu Zhong et.al.	2412.06413	null
2024-12-09	Exploring the Impact of Synthetic Data on Human Gesture Recognition Tasks Using GANs	George Kontogiannis et.al.	2412.06389	null
2024-12-09	Rendering-Refined Stable Diffusion for Privacy Compliant Synthetic Data	Kartik Patwari et.al.	2412.06248	null
2024-12-09	AIDE: Task-Specific Fine Tuning with Attribute Guided Multi-Hop Data Expansion	Jiayu Li et.al.	2412.06136	null
2024-12-08	Implicit Delta Learning of High Fidelity Neural Network Potentials	Stephan Thaler et.al.	2412.06064	null
2024-12-08	Concerning the Use of Turbulent Flow Data for Machine Learning	Mohammed Sardar et.al.	2412.06050	null
2024-12-08	Accelerating Video Diffusion Models via Distribution Matching	Yuanzhi Zhu et.al.	2412.05899	null
2024-12-08	XKV: Personalized KV Cache Memory Reduction for Long-Context LLM Inference	Weizhuo Li et.al.	2412.05896	null
2024-12-08	Towards Modeling Data Quality and Machine Learning Model Performance	Usman Anjum et.al.	2412.05882	link
2024-12-08	Laser Ultrasonic Imaging via the Time Domain Linear Sampling Method	Jian Song et.al.	2412.05803	null
2024-12-08	Prism: Semi-Supervised Multi-View Stereo with Monocular Structure Priors	Alex Rich et.al.	2412.05771	null
2024-12-07	A new basic air shower observable sensitive to the cosmic-ray elemental mass	Animesh Basak et.al.	2412.05727	null
2024-12-06	One-shot Federated Learning via Synthetic Distiller-Distillate Communication	Junyuan Zhang et.al.	2412.05186	link
2024-12-06	A text-to-tabular approach to generate synthetic patient data using LLMs	Margaux Tornqvist et.al.	2412.05153	link
2024-12-06	Noise Matters: Diffusion Model-based Urban Mobility Generation with Collaborative Noise Priors	Yuheng Zhang et.al.	2412.05000	null
2024-12-06	Neuro-Symbolic Data Generation for Math Reasoning	Zenan Li et.al.	2412.04857	null
2024-12-06	DrIFT: Autonomous Drone Dataset with Integrated Real and Synthetic Data, Flexible Views, and Transformed Domains	Fardad Dadboud et.al.	2412.04789	link
2024-12-06	Differentially Private Random Feature Model	Chunyang Liao et.al.	2412.04785	link
2024-12-06	SpasticMyoElbow: Physical Human-Robot Interaction Simulation Framework for Modelling Elbow Spasticity	Hao Yu et.al.	2412.04700	null
2024-12-05	Give me Some Hard Questions: Synthetic Data Generation for Clinical QA	Fan Bai et.al.	2412.04573	link
2024-12-05	DualPM: Dual Posed-Canonical Point Maps for 3D Shape and Pose Reconstruction	Ben Kaye et.al.	2412.04464	null
2024-12-05	Monocular Dynamic Gaussian Splatting is Fast and Brittle but Smooth Motion Helps	Yiqing Liang et.al.	2412.04457	null
2024-12-05	BhashaVerse : Translation Ecosystem for Indian Subcontinent Languages	Vandan Mujadia et.al.	2412.04351	null
2024-12-05	ALMA: Alignment with Minimal Annotation	Michihiro Yasunaga et.al.	2412.04305	null
2024-12-05	Methodology for Online Estimation of Rheological Parameters in Polymer Melts Using Deep Learning and Microfluidics	Juan Sandubete-López et.al.	2412.04142	null
2024-12-05	AI-based Attacker Models for Enhancing Multi-Stage Cyberattack Simulations in Smart Grids Using Co-Simulation Environments	Omer Sen et.al.	2412.03979	null
2024-12-05	Learning Speed-Adaptive Walking Agent Using Imitation Learning with Physics-Informed Simulation	Yi-Hung Chiu et.al.	2412.03949	link
2024-12-05	Towards Data Governance of Frontier AI Models	Jason Hausenloy et.al.	2412.03824	null
2024-12-04	Diffusion in Zero-Shot Learning for Environmental Audio	Ysobel Sims et.al.	2412.03771	link
2024-12-04	End to End Collaborative Synthetic Data Generation	Sikha Pentyala et.al.	2412.03766	null
2024-12-04	Evaluating Language Models as Synthetic Data Generators	Seungone Kim et.al.	2412.03679	link
2024-12-04	Interpreting Transformers for Jet Tagging	Aaron Wang et.al.	2412.03673	link
2024-12-04	DiffuPT: Class Imbalance Mitigation for Glaucoma Detection via Diffusion Based Generation and Model Pretraining	Youssof Nawar et.al.	2412.03629	null
2024-12-04	MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation	Zehuan Huang et.al.	2412.03558	null
2024-12-04	Microwave Remote Sensing of Soil Moisture, Above Ground Biomass and Freeze-Thaw Dynamic: Modeling and Empirical Approaches	Laura Angeloni et.al.	2412.03523	null
2024-12-04	Domain-Agnostic Stroke Lesion Segmentation Using Physics-Constrained Synthetic Data	Liam Chalcroft et.al.	2412.03318	link
2024-12-04	GERD: Geometric event response data generation	Jens Egholm Pedersen et.al.	2412.03259	link
2024-12-04	Semi-Supervised Transfer Boosting (SS-TrBoosting)	Lingfei Deng et.al.	2412.03212	null
2024-12-04	ChatTS: Aligning Time Series with LLMs via Synthetic Data for Enhanced Understanding and Reasoning	Zhe Xie et.al.	2412.03104	link
2024-12-04	Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models	Alex Havrilla et.al.	2412.02980	null
2024-12-03	MACAW: A Causal Generative Model for Medical Imaging	Vibujithan Vigneshwaran et.al.	2412.02900	link
2024-12-03	Learning constitutive relations from experiments: 1. PDE constrained optimization	Andrew Akerson et.al.	2412.02864	null
2024-12-03	Unpaired Modality Translation for Pseudo Labeling of Histology Images	Arthur Boschet et.al.	2412.02858	link
2024-12-03	Nemotron-CC: Transforming Common Crawl into a Refined Long-Horizon Pretraining Dataset	Dan Su et.al.	2412.02595	null
2024-12-03	Active learning of neural population dynamics using two-photon holographic optogenetics	Andrew Wagenmaker et.al.	2412.02529	null
2024-12-03	DP-2Stage: Adapting Language Models as Differentially Private Tabular Data Generators	Tejumade Afonja et.al.	2412.02467	link
2024-12-03	3D Face Reconstruction From Radar Images	Valentin Braeutigam et.al.	2412.02403	null
2024-12-03	Probing jet dynamics and collimation in radio galaxies. Application to NGC 1052	Ainara Saiz-Pérez et.al.	2412.02358	null
2024-12-03	SimuScope: Realistic Endoscopic Synthetic Dataset Generation through Surgical Simulation and Diffusion Models	Sabina Martyniak et.al.	2412.02332	link
2024-12-03	Initial Study On Improving Segmentation By Combining Preoperative CT And Intraoperative CBCT Using Synthetic Data	Maximilian E. Tschuchnig et.al.	2412.02294	null
2024-12-03	Connecting Large Language Models with Blockchain: Advancing the Evolution of Smart Contracts from Automation to Intelligence	Youquan Xian et.al.	2412.02263	null
2024-12-03	Fast LiDAR Data Generation with Rectified Flows	Kazuto Nakashima et.al.	2412.02241	link
2024-12-03	FaaSRCA: Full Lifecycle Root Cause Analysis for Serverless Applications	Jin Huang et.al.	2412.02239	null
2024-12-03	Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs	Zixuan Hu et.al.	2412.02220	null
2024-12-03	Thallus: An RDMA-based Columnar Data Transport Protocol	Jayjeet Chakraborty et.al.	2412.02192	null
2024-12-02	Who’s Gaming the System? A Causally-Motivated Approach for Detecting Strategic Adaptation	Trenton Chang et.al.	2412.02000	link
2024-12-02	MALT: Improving Reasoning with Multi-Agent LLM Training	Sumeet Ramesh Motwani et.al.	2412.01928	null
2024-12-02	VideoLights: Feature Refinement and Cross-Task Alignment Transformer for Joint Video Highlight Detection and Moment Retrieval	Dhiman Paul et.al.	2412.01558	link
2024-11-29	On Domain-Specific Post-Training for Multimodal Large Language Models	Daixuan Cheng et.al.	2411.19930	null
2024-11-29	Linear methods for non-linear inverse problems	Geerten Koers et.al.	2411.19797	null
2024-11-29	Know Your RAG: Dataset Taxonomy and Generation Strategies for Evaluating RAG Systems	Rafael Teixeira de Lima et.al.	2411.19710	null
2024-11-29	MIMDE: Exploring the Use of Synthetic vs Human Data for Evaluating Multi-Insight Multi-Document Extraction Tasks	John Francis et.al.	2411.19689	null
2024-11-29	Diorama: Unleashing Zero-shot Single-view 3D Scene Modeling	Qirui Wu et.al.	2411.19492	null
2024-11-28	UrbanCAD: Towards Highly Controllable and Photorealistic 3D Vehicles for Urban Scene Simulation	Yichong Lu et.al.	2411.19292	null
2024-11-28	Parallel and Mini-Batch Stable Matching for Large-Scale Reciprocal Recommender Systems	Kento Nakada et.al.	2411.19214	null
2024-11-27	Reconstructing Animals and the Wild	Peter Kulits et.al.	2411.18807	null
2024-11-27	Evaluating and Improving the Effectiveness of Synthetic Chest X-Rays for Medical Image Analysis	Eva Prakash et.al.	2411.18602	null
2024-11-28	Enhancing weed detection performance by means of GenAI-based image augmentation	Sourav Modak et.al.	2411.18513	null
2024-11-27	Synthetic ECG Generation for Data Augmentation and Transfer Learning in Arrhythmia Classification	José Fernando Núñez et.al.	2411.18456	null
2024-11-27	The more, the better? Evaluating the role of EEG preprocessing for deep learning applications	Federico Del Pup et.al.	2411.18392	link
2024-11-27	Two-Timescale Digital Twin Assisted Model Interference and Retraining over Wireless Network	Jiayi Cong et.al.	2411.18329	null
2024-11-27	Dependency-Aware CAV Task Scheduling via Diffusion-Based Reinforcement Learning	Xiang Cheng et.al.	2411.18230	null
2024-11-27	SharpDepth: Sharpening Metric Depth Predictions Using Diffusion Distillation	Duc-Hai Pham et.al.	2411.18229	null
2024-11-27	Training Data Synthesis with Difficulty Controlled Diffusion Model	Zerun Wang et.al.	2411.18109	null
2024-11-27	Training and Evaluating Language Models with Template-based Data Generation	Yifan Zhang et.al.	2411.18104	link
2024-11-26	CrypQ: A Database Benchmark Based on Dynamic, Ever-Evolving Ethereum Data	Vincent Capol et.al.	2411.17913	null
2024-11-26	Repeated sampling of different individuals but the same clusters to improve precision of difference-in-differences estimators: the DISC design	Jordan Downey et.al.	2411.17905	null
2024-11-26	RealSeal: Revolutionizing Media Authentication with Real-Time Realism Scoring	Bhaktipriya Radharapu et.al.	2411.17684	null
2024-11-26	Synthetic Data Generation with LLM for Improved Depression Prediction	Andrea Kang et.al.	2411.17672	null
2024-11-26	Pre-training for Action Recognition with Automatically Generated Fractal Datasets	Davyd Svyezhentsev et.al.	2411.17584	link
2024-11-26	Evolving Markov Chains: Unsupervised Mode Discovery and Recognition from Data Streams	Kutalmış Coşkun et.al.	2411.17528	null
2024-11-26	A Method for Fabricating CMOS Back-End-of-Line-Compatible Solid-State Nanopore Devices	Mohamed Yassine Bouhamidi et.al.	2411.17416	null
2024-11-26	vesselFM: A Foundation Model for Universal 3D Blood Vessel Segmentation	Bastian Wittmann et.al.	2411.17386	link
2024-11-27	RealTraj: Towards Real-World Pedestrian Trajectory Forecasting	Ryo Fujii et.al.	2411.17376	null
2024-11-26	On the Generalization of Handwritten Text Recognition Models	Carlos Garrido-Munoz et.al.	2411.17332	null
2024-11-26	ER2Score: LLM-based Explainable and Customizable Metric for Assessing Radiology Reports with Reward-Control Loss	Yunyi Liu et.al.	2411.17301	null
2024-11-26	LHPF: Look back the History and Plan for the Future in Autonomous Driving	Sheng Wang et.al.	2411.17253	null
2024-11-26	DOGE: Towards Versatile Visual Document Grounding and Referring	Yinan Zhou et.al.	2411.17125	null
2024-11-26	Average X-ray properties of galaxy groups. From Milky Way-like halos to massive clusters	P. Popesso et.al.	2411.17120	null
2024-11-26	Large-Scale Data-Free Knowledge Distillation for ImageNet via Multi-Resolution Data Generation	Minh-Tuan Tran et.al.	2411.17046	null
2024-11-25	Decision Making under the Exponential Family: Distributionally Robust Optimisation with Bayesian Ambiguity Sets	Charita Dellaporta et.al.	2411.16829	null
2024-11-25	A Study on Unsupervised Domain Adaptation for Semantic Segmentation in the Era of Vision-Language Models	Manuel Schwonberg et.al.	2411.16407	null
2024-11-25	Video-Text Dataset Construction from Multi-AI Feedback: Promoting Weak-to-Strong Preference Learning for Video Large Language Models	Hao Yi et.al.	2411.16201	null
2024-11-25	On the Robustness of the Successive Projection Algorithm	Giovanni Barbarino et.al.	2411.16195	link
2024-11-25	Image Generation Diversity Issues and How to Tame Them	Mischa Dombrowski et.al.	2411.16171	link
2024-11-25	DP-CDA: An Algorithm for Enhanced Privacy Preservation in Dataset Synthesis Through Randomized Mixing	Utsab Saha et.al.	2411.16121	null
2024-11-25	Boosting 3D Object Generation through PBR Materials	Yitong Wang et.al.	2411.16080	null
2024-11-24	PINNs4Drops: Convolutional feature-enhanced physics-informed neural networks for reconstructing two-phase flows	Maximilian Dreisbach et.al.	2411.15949	null
2024-11-24	Generative Context Distillation	Haebin Shin et.al.	2411.15927	link
2024-11-24	Beyond Data Scarcity: A Frequency-Driven Framework for Zero-Shot Forecasting	Liran Nochumsohn et.al.	2411.15743	null
2024-11-24	Comparative Analysis of Diffusion Generative Models in Computational Pathology	Denisha Thakkar et.al.	2411.15719	link
2024-11-24	Tackling Data Heterogeneity in Federated Time Series Forecasting	Wei Yuan et.al.	2411.15716	null
2024-11-24	ROOT: VLM based System for Indoor Scene Understanding and Beyond	Yonghui Wang et.al.	2411.15714	link
2024-11-26	GraphGrad: Efficient Estimation of Sparse Polynomial Representations for General State-Space Models	Benjamin Cox et.al.	2411.15637	null
2024-11-23	Enhancing Object Detection Accuracy in Autonomous Vehicles Using Synthetic Data	Sergei Voronin et.al.	2411.15602	null
2024-11-23	Boosting Semi-Supervised Scene Text Recognition via Viewing and Summarizing	Yadong Qu et.al.	2411.15585	link
2024-11-22	OminiControl: Minimal and Universal Control for Diffusion Transformer	Zhenxiong Tan et.al.	2411.15098	link
2024-11-22	The EE-Classifier: A classification method for functional data based on extremality indexes	Catalina Lesmes et.al.	2411.14999	null
2024-11-22	Open-Amp: Synthetic Data Framework for Audio Effect Foundation Models	Alec Wright et.al.	2411.14972	link
2024-11-22	LLM for Barcodes: Generating Diverse Synthetic Data for Identity Documents	Hitesh Laxmichand Patel et.al.	2411.14962	null
2024-11-22	Morph: A Motion-free Physics Optimization Framework for Human Motion Generation	Zhuo Li et.al.	2411.14951	null
2024-11-22	The NANOGrav 15 year Data Set: Removing pulsars one by one from the pulsar timing array	Gabriella Agazie et.al.	2411.14846	null
2024-11-22	Harlequin: Color-driven Generation of Synthetic Data for Referring Expression Comprehension	Luca Parolari et.al.	2411.14807	null
2024-11-22	Aim My Robot: Precision Local Navigation to Any Object	Xiangyun Meng et.al.	2411.14770	null
2024-11-22	Double Machine Learning for Adaptive Causal Representation in High-Dimensional Data	Lynda Aouar et.al.	2411.14665	null
2024-11-21	The importance of the clustering model to detect new types of intrusion in data traffic	Noor Saud Abd et.al.	2411.14550	null
2024-11-21	Learning Fair Robustness via Domain Mixup	Meiyu Zhong et.al.	2411.14424	null
2024-11-21	Intent-Aware Dialogue Generation and Multi-Task Contrastive Learning for Multi-Turn Intent Classification	Junhua Liu et.al.	2411.14252	null
2024-11-21	Learning from “Silly” Questions Improves Large Language Models, But Only Slightly	Tingyuan Zhu et.al.	2411.14121	null
2024-11-21	Generative Intervention Models for Causal Perturbation Modeling	Nora Schneider et.al.	2411.14003	null
2024-11-21	iHQGAN: A Lightweight Invertible Hybrid Quantum-Classical Generative Adversarial Network for Unsupervised Image-to-Image Translation	Xue Yang et.al.	2411.13920	link
2024-11-21	Towards Full Delegation: Designing Ideal Agentic Behaviors for Travel Planning	Song Jiang et.al.	2411.13904	null
2024-11-21	PIORS: Personalized Intelligent Outpatient Reception based on Large Language Model with Multi-Agents Medical Scenario Simulation	Zhijie Bao et.al.	2411.13902	link
2024-11-21	Robust Detection of Watermarks for Large Language Models Under Human Edits	Xiang Li et.al.	2411.13868	link
2024-11-21	Dealing with Synthetic Data Contamination in Online Continual Learning	Maorong Wang et.al.	2411.13852	link
2024-11-21	GalaxyEdit: Large-Scale Image Editing Dataset with Enhanced Diffusion Adapter	Aniruddha Bala et.al.	2411.13794	null
2024-11-21	Adaptable Embeddings Network (AEN)	Stan Loosmore et.al.	2411.13786	null
2024-11-22	Utilizing Large Language Models to Synthesize Product Desirability Datasets	John D. Hastings et.al.	2411.13485	null
2024-11-20	Heuristically Adaptive Diffusion-Model Evolutionary Strategy	Benedikt Hartl et.al.	2411.13420	null
2024-11-20	Enhanced Gas Source Localization Using Distributed IoT Sensors and Bayesian Inference	Leonardo Balocchi et.al.	2411.13268	null
2024-11-20	BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation	Umamaheswaran Raman Kumar et.al.	2411.13251	null
2024-11-20	SONNET: Enhancing Time Delay Estimation by Leveraging Simulated Audio	Erik Tegler et.al.	2411.13179	null
2024-11-20	Writing Style Matters: An Examination of Bias and Fairness in Information Retrieval Systems	Hongliu Cao et.al.	2411.13173	null
2024-11-20	Data driven learning to enhance a kinetic model of distressed crowd dynamics	Daewa Kim et.al.	2411.12974	null
2024-11-20	Machine learned reconstruction of tsunami dynamics from sparse observations	Edward McDugald et.al.	2411.12948	null
2024-11-20	Improving Low-Fidelity Models of Li-ion Batteries via Hybrid Sparse Identification of Nonlinear Dynamics	Samuel Filgueira da Silva et.al.	2411.12935	null
2024-11-19	Data-to-Model Distillation: Data-Efficient Learning Framework	Ahmad Sajedi et.al.	2411.12841	link
2024-11-19	Regular-pattern-sensitive CRFs for Distant Label Interactions	Sean Papay et.al.	2411.12484	null
2024-11-19	Empirical Privacy Evaluations of Generative and Predictive Machine Learning Models – A review and challenges for practice	Flavio Hafner et.al.	2411.12451	null
2024-11-19	Could Humans Outshine AI in Visual Data Analysis?	Ratanond Koonchanok et.al.	2411.12299	null
2024-11-18	SpatialDreamer: Self-supervised Stereo Video Synthesis from Monocular Input	Zhen Lv et.al.	2411.11934	null
2024-11-18	RoboGSim: A Real2Sim2Real Robotic Gaussian Splatting Simulator	Xinhai Li et.al.	2411.11839	null
2024-11-18	Theoretical Foundations of Conformal Prediction	Anastasios N. Angelopoulos et.al.	2411.11824	null
2024-11-18	Parallelly Tempered Generative Adversarial Networks	Jinwon Sohn et.al.	2411.11786	null
2024-11-18	Open Catalyst Experiments 2024 (OCx24): Bridging Experiments and Computational Models	Jehad Abed et.al.	2411.11783	null
2024-11-18	Few-shot Model Extraction Attacks against Sequential Recommender Systems	Hui Zhang et.al.	2411.11677	null
2024-11-18	Real-Time Fitness Exercise Classification and Counting from Video Frames	Riccardo Riccio et.al.	2411.11548	link
2024-11-18	A Pre-Trained Graph-Based Model for Adaptive Sequencing of Educational Documents	Jean Vassoyan et.al.	2411.11520	link
2024-11-19	Cascaded Diffusion Models for 2D and 3D Microscopy Image Synthesis to Enhance Cell Segmentation	Rüveyda Yilmaz et.al.	2411.11515	link
2024-11-18	Lorentz: Learned SKU Recommendation Using Profile Data	Nicholas Glaze et.al.	2411.11325	null
2024-11-18	Subgroup analysis in multi level hierarchical cluster randomized trials	Shubhadeep Chakraborty et.al.	2411.11301	null
2024-11-17	MolParser: End-to-end Visual Recognition of Molecule Structures in the Wild	Xi Fang et.al.	2411.11098	null
2024-11-17	SRA-MCTS: Self-driven Reasoning Aurmentation with Monte Carlo Tree Search for Enhanced Code Generation	Bin Xu et.al.	2411.11053	link
2024-11-17	Towards a framework on tabular synthetic data generation: a minimalist approach: theory, use cases, and limitations	Agus Sudjianto et.al.	2411.10982	null
2024-11-16	Efficient, Low-Regret, Online Reinforcement Learning for Linear MDPs	Philips George John et.al.	2411.10906	null
2024-11-16	Watermarking Generative Categorical Data	Bochao Gu et.al.	2411.10898	null
2024-11-15	Dynamic Causal Effects in a Nonlinear World: the Good, the Bad, and the Ugly	Michal Kolesár et.al.	2411.10415	link
2024-11-15	How to Build a Quantum Supercomputer: Scaling Challenges and Opportunities	Masoud Mohseni et.al.	2411.10406	null
2024-11-15	Generation of synthetic gait data: application to multiple sclerosis patients’ gait patterns	Klervi Le Gall et.al.	2411.10377	null
2024-11-15	Multidimensional Byte Pair Encoding: Shortened Sequences for Improved Visual Data Generation	Tim Elsner et.al.	2411.10281	link
2024-11-15	Evaluating Text-to-Image Diffusion Models for Texturing Synthetic Data	Thomas Lips et.al.	2411.10164	link
2024-11-15	Mitigating Sycophancy in Decoder-Only Transformer Architectures: Synthetic Data Intervention	Libo Wang et.al.	2411.10156	link
2024-11-15	Adaptive Physics-Guided Neural Network	David Shulman et.al.	2411.10064	null
2024-11-14	Cross-Matched Interval Prevalence of High Dimensional Point Clouds	Jonathan M. Mousley et.al.	2411.09797	null
2024-11-14	Advancing Fine-Grained Visual Understanding with Multi-Scale Alignment in Multi-Modal Models	Wei Wang et.al.	2411.09691	null
2024-11-16	SAFES: Sequential Privacy and Fairness Enhancing Data Synthesis for Responsible AI	Spencer Giddens et.al.	2411.09178	link
2024-11-14	Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching	Yuran Wang et.al.	2411.09151	null
2024-11-13	Drone Detection using Deep Neural Networks Trained on Pure Synthetic Data	Mariusz Wisniewski et.al.	2411.09077	link
2024-11-13	Evaluating cosmological simulations of galaxy formation with spectral variance in the optical window	Z. Sharbaf et.al.	2411.08945	null
2024-11-13	A probabilistic reduced-order modeling framework for patient-specific cardio-mechanical analysis	Robin Willems et.al.	2411.08822	null
2024-11-13	Towards More Accurate Fake Detection on Images Generated from Advanced Generative and Neural Rendering Models	Chengdong Dong et.al.	2411.08642	null
2024-11-13	Generalized Pose Space Embeddings for Training In-the-Wild using Anaylis-by-Synthesis	Dominik Borer et.al.	2411.08603	null
2024-11-13	Space-local memory in generalized master equations: Reaching the thermodynamic limit for the cost of a small lattice simulation	Srijan Bhattacharyya et.al.	2411.08598	null
2024-11-13	CorrSynth – A Correlated Sampling Method for Diverse Dataset Generation from LLMs	Suhas S Kowshik et.al.	2411.08553	null
2024-11-13	A dark energy parameterization independent constraint of the spatial curvature $Ω_K$	Zhennan Li et.al.	2411.08498	null
2024-11-13	Generative AI for Data Augmentation in Wireless Networks: Analysis, Applications, and Case Study	Jinbo Wen et.al.	2411.08341	null
2024-11-13	DNN Task Assignment in UAV Networks: A Generative AI Enhanced Multi-Agent Reinforcement Learning Approach	Xin Tang et.al.	2411.08299	null
2024-11-13	Dynamic Thresholding Algorithm with Memory for Linear Inverse Problems	Zhong-Feng Sun et.al.	2411.08284	null
2024-11-12	SynapsNet: Enhancing Neuronal Population Dynamics Modeling via Learning Functional Connectivity	Parsa Delavari et.al.	2411.08221	null
2024-11-12	Design optimization of semiconductor manufacturing equipment using a novel multi-fidelity surrogate modeling approach	Bingran Wang et.al.	2411.08149	null
2024-11-12	Large Language Models Can Self-Improve in Long-context Reasoning	Siheng Li et.al.	2411.08147	link
2024-11-12	Language Models as Causal Effect Generators	Lucius E. J. Bynum et.al.	2411.08019	link
2024-11-12	Scalable piecewise smoothing with BART	Ryan Yee et.al.	2411.07984	null
2024-11-12	Maritime Search and Rescue Missions with Aerial Images: A Survey	Juan P. Martinez-Esteso et.al.	2411.07649	null
2024-11-11	Music Discovery Dialogue Generation Using Human Intent Analysis and Large Language Models	SeungHeon Doh et.al.	2411.07439	link
2024-11-11	Feature-Space Semantic Invariance: Enhanced OOD Detection for Open-Set Domain Generalization	Haoliang Wang et.al.	2411.07392	null
2024-11-11	SynRL: Aligning Synthetic Clinical Trial Data with Human-preferred Clinical Endpoints Using Reinforcement Learning	Trisha Das et.al.	2411.07317	null
2024-11-11	DLCR: A Generative Data Expansion Framework via Diffusion for Clothes-Changing Person Re-ID	Nyle Siddiqui et.al.	2411.07205	link
2024-11-11	Data-Driven Predictive Control of Nonholonomic Robots Based on a Bilinear Koopman Realization: Data Does Not Replace Geometry	Mario Rosenfelder et.al.	2411.07192	null
2024-11-11	Hierarchical Conditional Tabular GAN for Multi-Tabular Synthetic Data Generation	Wilhelm Ågren et.al.	2411.07009	null
2024-11-11	Maximizing domain generalization in fetal brain tissue segmentation: the role of synthetic data generation, intensity clustering and real image fine-tuning	Vladyslav Zalevskyi et.al.	2411.06842	null
2024-11-11	Synthesize, Partition, then Adapt: Eliciting Diverse Samples from Foundation Models	Yeming Wen et.al.	2411.06722	null
2024-11-11	DiffSR: Learning Radar Reflectivity Synthesis via Diffusion Model from Satellite Observations	Xuming He et.al.	2411.06714	null
2024-11-11	What Should Baby Models Read? Exploring Sample-Efficient Data Composition on Model Performance	Hong Meng Yam et.al.	2411.06672	null
2024-11-10	In-Context Learning for Preserving Patient Privacy: A Framework for Synthesizing Realistic Patient Portal Messages	Joseph Gatto et.al.	2411.06549	link
2024-11-10	CRTRE: Causal Rule Generation with Target Trial Emulation Framework	Junda Wang et.al.	2411.06338	null
2024-11-09	Clustering Algorithms and RAG Enhancing Semi-Supervised Text Classification with Large LLMs	Shan Zhong et.al.	2411.06175	null
2024-11-09	Behavior-Aware Efficient Detection of Malicious EVs in V2G Systems	Ruixiang Wu et.al.	2411.06113	null
2024-11-09	A novel study on the MUSIC-type imaging of small electromagnetic inhomogeneities in the limited-aperture inverse scattering problem	Won-Kwang Park et.al.	2411.06030	null
2024-11-08	DNAMite: Interpretable Calibrated Survival Analysis with Discretized Additive Models	Mike Van Ness et.al.	2411.05923	link
2024-11-08	Differential Privacy Under Class Imbalance: Methods and Empirical Insights	Lucas Rosenblatt et.al.	2411.05733	null
2024-11-08	Evaluating Large Language Model Capability in Vietnamese Fact-Checking Data Generation	Long Truong To et.al.	2411.05641	null
2024-11-08	SynDroneVision: A Synthetic Dataset for Image-Based Drone Detection	Tamara R. Lenhard et.al.	2411.05633	null
2024-11-08	DeepArUco++: Improved detection of square fiducial markers in challenging lighting conditions	Rafael Berral-Soler et.al.	2411.05552	link
2024-11-08	A Quality-Centric Framework for Generic Deepfake Detection	Wentang Song et.al.	2411.05335	null
2024-11-08	Discovering Latent Structural Causal Models from Spatio-Temporal Data	Kun Wang et.al.	2411.05331	null
2024-11-08	Cancer-Net SCa-Synth: An Open Access Synthetically Generated 2D Skin Lesion Dataset for Skin Cancer Classification	Chi-en Amy Tai et.al.	2411.05269	link
2024-11-07	Precision or Recall? An Analysis of Image Captions for Training Text-to-Image Generation Model	Sheng Cheng et.al.	2411.05079	link
2024-11-07	Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models	Shuhong Zheng et.al.	2411.05005	null
2024-11-07	Uncovering Hidden Subspaces in Video Diffusion Models Using Re-Identification	Mischa Dombrowski et.al.	2411.04956	null
2024-11-09	OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models	Siming Huang et.al.	2411.04905	null
2024-11-07	Controlling Human Shape and Pose in Text-to-Image Diffusion Models via Domain Adaptation	Benito Buchheim et.al.	2411.04724	null
2024-11-08	BhasaAnuvaad: A Speech Translation Dataset for 13 Indian Languages	Sparsh Jain et.al.	2411.04699	link
2024-11-07	Improved Multi-Task Brain Tumour Segmentation with Synthetic Data Augmentation	André Ferreira et.al.	2411.04632	link
2024-11-07	Enhancing Bronchoscopy Depth Estimation through Synthetic-to-Real Domain Adaptation	Qingyao Tian et.al.	2411.04404	null
2024-11-06	Generating Synthetic Electronic Health Record (EHR) Data: A Review with Benchmarking	Xingran Chen et.al.	2411.04281	link
2024-11-06	Debiasing Synthetic Data Generated by Deep Generative Models	Alexander Decruyenaere et.al.	2411.04216	link
2024-11-06	Topology Bench: Systematic Graph Based Benchmarking for Core Optical Networks	Robin Matzner et.al.	2411.04160	null
2024-11-06	GUIDE-VAE: Advancing Data Generation with User Information and Pattern Dictionaries	Kutay Bölat et.al.	2411.03936	link
2024-11-06	VQA $^2$ :Visual Question Answering for Video Quality Assessment	Ziheng Jia et.al.	2411.03795	link
2024-11-06	Content-Style Learning from Unaligned Domains: Identifiability under Unknown Latent Dimensions	Sagar Shrestha et.al.	2411.03755	null
2024-11-06	Where Do We Stand with Implicit Neural Representations? A Technical and Performance Survey	Amer Essakine et.al.	2411.03688	null
2024-11-06	Open-Source High-Speed Flight Surrogate Modeling Framework	Tyler E. Korenyi-Both et.al.	2411.03598	null
2024-11-05	Forecasting Outside the Box: Application-Driven Optimal Pointwise Forecasts for Stochastic Optimization	Tito Homem-de-Mello et.al.	2411.03520	null
2024-11-04	Enhancing Table Representations with LLM-powered Synthetic Data Generation	Dayu Yang et.al.	2411.03356	null
2024-11-05	DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models	Ying Zhou et.al.	2411.03250	null
2024-11-05	A data-driven study on Implicit LES using a spectral difference method	Nicola Clinco et.al.	2411.03211	null
2024-11-05	Local Lesion Generation is Effective for Capsule Endoscopy Image Data Augmentation in a Limited Data Setting	Adrian B. Chłopowiec et.al.	2411.03098	null
2024-11-05	Speech Separation with Pretrained Frontend to Minimize Domain Mismatch	Wupeng Wang et.al.	2411.03085	link
2024-11-05	Controlling for Unobserved Confounding with Large Language Model Classification of Patient Smoking Status	Samuel Lee et.al.	2411.03004	null
2024-11-05	IMUDiffusion: A Diffusion Model for Multivariate Time Series Synthetisation for Inertial Motion Capturing Systems	Heiko Oppel et.al.	2411.02954	null
2024-11-05	SpiDR: A Reconfigurable Digital Compute-in-Memory Spiking Neural Network Accelerator for Event-based Perception	Deepika Sharma et.al.	2411.02854	null
2024-11-05	On the Comparison between Multi-modal and Single-modal Contrastive Learning	Wei Huang et.al.	2411.02837	null
2024-11-04	Combining Induction and Transduction for Abstract Reasoning	Wen-Ding Li et.al.	2411.02272	link
2024-11-06	Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent	Xingwu Sun et.al.	2411.02265	link
2024-11-06	Digi2Real: Bridging the Realism Gap in Synthetic Data Face Recognition via Foundation Models	Anjith George et.al.	2411.02188	null
2024-11-04	Generating the Traces You Need: A Conditional Generative Model for Process Mining Data	Riccardo Graziosi et.al.	2411.02131	link
2024-11-04	GDP nowcasting with large-scale inter-industry payment data in real time – A network approach	Anastasia Mantziou et.al.	2411.02029	null
2024-11-04	Learning Where to Edit Vision Transformers	Yunqiao Yang et.al.	2411.01948	link
2024-11-04	Exploring the Landscape for Generative Sequence Models for Specialized Data Synthesis	Mohammad Zbeeb et.al.	2411.01929	link
2024-11-04	ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation	Hengkai Tan et.al.	2411.01850	null
2024-11-04	DiffuMask-Editor: A Novel Paradigm of Integration Between the Segmentation Diffusion Model and Image Editing to Improve Segmentation Ability	Bo Gao et.al.	2411.01819	null
2024-11-03	Enhancing Forecasts Using Real-Time Data Flow and Hierarchical Forecast Reconciliation, with Applications to the Energy Sector	Lukas Neubauer et.al.	2411.01528	link
2024-11-03	Privacy-Preserving Customer Churn Prediction Model in the Context of Telecommunication Industry	Joydeb Kumar Sana et.al.	2411.01447	null
2024-11-02	Network Causal Effect Estimation In Graphical Models Of Contagion And Latent Confounding	Yufeng Wu et.al.	2411.01371	null
2024-11-02	Guided Synthesis of Labeled Brain MRI Data Using Latent Diffusion Models for Segmentation of Enlarged Ventricles	Tim Ruschke et.al.	2411.01351	null
2024-11-02	Marginal Causal Flows for Validation and Inference	Daniel de Vassimon Manela et.al.	2411.01295	link
2024-11-02	Efficient Collaborative Navigation through Perception Fusion for Multi-Robots in Unknown Environments	Qingquan Lin et.al.	2411.01274	null
2024-11-01	SelfCodeAlign: Self-Alignment for Code Generation	Yuxiang Wei et.al.	2410.24198	link
2024-10-31	DexMimicGen: Automated Data Generation for Bimanual Dexterous Manipulation via Imitation Learning	Zhenyu Jiang et.al.	2410.24185	null
2024-10-31	Constraint Back-translation Improves Complex Instruction Following of Large Language Models	Yunjia Qi et.al.	2410.24175	link
2024-11-02	$π_0$ : A Vision-Language-Action Flow Model for General Robot Control	Kevin Black et.al.	2410.24164	null
2024-10-31	Understanding Generalizability of Diffusion Models Requires Rethinking the Hidden Gaussian Structure	Xiang Li et.al.	2410.24060	link
2024-10-31	Unveiling Synthetic Faces: How Synthetic Datasets Can Expose Real Identities	Hatef Otroshi Shahreza et.al.	2410.24015	null
2024-10-31	Towards Fast Algorithms for the Preference Consistency Problem Based on Hierarchical Models	Anne-Marie George et.al.	2410.23934	null
2024-10-31	Bayesian Hierarchical Model for Synthesizing Registry and Survey Data on Female Breast Cancer Prevalence	Qiao Wang et.al.	2410.23580	null
2024-10-30	Neural spell-checker: Beyond words with synthetic data generation	Matej Klemen et.al.	2410.23514	link
2024-10-30	Development and Comparative Analysis of Machine Learning Models for Hypoxemia Severity Triage in CBRNE Emergency Scenarios Using Physiological and Demographic Data from Medical-Grade Devices	Santino Nanini et.al.	2410.23503	null
2024-10-30	PACER: Preference-conditioned All-terrain Costmap Generation	Luisa Mao et.al.	2410.23488	null
2024-10-30	Multilingual Vision-Language Pre-training for the Remote Sensing Domain	João Daniel Silva et.al.	2410.23370	link
2024-10-30	Strategic communication of narratives	Gerrit Bauch et.al.	2410.23259	null
2024-10-31	Enhancing Autonomous Driving Safety Analysis with Generative AI: A Comparative Study on Automated Hazard and Risk Assessment	Alireza Abbaspour et.al.	2410.23207	null
2024-10-30	Directional anomaly detection	Oliver Urs Lenz et.al.	2410.23158	null
2024-10-30	Federated Learning under Periodic Client Participation and Heterogeneous Data: A New Communication-Efficient Algorithm and Analysis	Michael Crawshaw et.al.	2410.23131	link
2024-10-30	Automated Image-Based Identification and Consistent Classification of Fire Patterns with Quantitative Shape Analysis and Spatial Location Identification	Pengkun Liu et.al.	2410.23105	null
2024-10-30	CausalDiff: Causality-Inspired Disentanglement via Diffusion Model for Adversarial Defense	Mingkun Zhang et.al.	2410.23091	link
2024-10-30	Private Synthetic Text Generation with Diffusion Models	Sebastian Ochs et.al.	2410.22971	link
2024-10-30	Augmenting Polish Automatic Speech Recognition System With Synthetic Data	Łukasz Bondaruk et.al.	2410.22903	null
2024-10-30	Universality of the $π^2/6$ Pathway in Avoiding Model Collapse	Apratim Dey et.al.	2410.22812	link
2024-10-30	Analysis of Classifier Training on Synthetic Data for Cross-Domain Datasets	Andoni Cortés et.al.	2410.22748	null
2024-10-29	Unpicking Data at the Seams: VAEs, Disentanglement and Independent Components	Carl Allen et.al.	2410.22559	null
2024-10-29	Evaluating utility in synthetic banking microdata applications	Hugo E. Caceres et.al.	2410.22519	null
2024-10-30	Nanoscale Connectomics Annotation Standards Framework	Nicole K. Guittari et.al.	2410.22320	null
2024-10-29	Understanding Synthetic Context Extension via Retrieval Heads	Xinyu Zhao et.al.	2410.22316	null
2024-10-29	Model-free Estimation of Latent Structure via Multiscale Nonparametric Maximum Likelihood	Bryon Aragam et.al.	2410.22248	null
2024-10-29	Synthetic Data Generation with Large Language Models for Personalized Community Question Answering	Marco Braga et.al.	2410.22182	link
2024-10-29	Data Generation for Hardware-Friendly Post-Training Quantization	Lior Dikstein et.al.	2410.22110	link
2024-10-29	Cross-Entropy Is All You Need To Invert the Data Generating Process	Patrik Reizinger et.al.	2410.21869	null
2024-10-29	Generating Realistic Tabular Data with Large Language Models	Dang Nguyen et.al.	2410.21717	null
2024-10-28	Identifying Selections for Unsupervised Subtask Discovery	Yiwen Qiu et.al.	2410.21616	null
2024-10-28	Approximate Bayesian Computation with Statistical Distances for Model Selection	Clara Grazian et.al.	2410.21603	link
2024-10-28	Unveiling Context-Aware Criteria in Self-Assessing LLMs	Taneesh Gupta et.al.	2410.21545	null
2024-10-28	Not All LLM-Generated Data Are Equal: Rethinking Data Weighting in Text Classification	Hsun-Yu Kuo et.al.	2410.21526	null
2024-10-28	LLM-Forest for Health Tabular Data Imputation	Xinrui He et.al.	2410.21520	null
2024-10-28	Inferring the Morphology of the Galactic Center Excess with Gaussian Processes	Edward D. Ramirez et.al.	2410.21367	link
2024-10-28	Reconstructing dynamics from sparse observations with no training on target system	Zheng-Meng Zhai et.al.	2410.21222	link
2024-10-29	Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction	Qintong Zhang et.al.	2410.21169	null
2024-10-28	Synthetica: Large Scale Synthetic Data for Robot Perception	Ritvik Singh et.al.	2410.21153	null
2024-10-28	Topological Identification of Agent Status in Information Contagions: Application to Financial Markets	Anubha Goel et.al.	2410.21104	link
2024-10-28	Shallow Diffuse: Robust and Invisible Watermarking through Low-Dimensional Subspaces in Diffusion Models	Wenda Li et.al.	2410.21088	link
2024-10-28	Federated Time Series Generation on Feature and Temporally Misaligned Data	Chenrui Fan et.al.	2410.21072	null
2024-10-28	Push-Forward Signed Distance Functions enable interpretable and robust continuous shape quantification	Roua Rouatbi et.al.	2410.21004	null
2024-10-29	Valid Bootstraps for Networks with Applications to Network Visualisation	Emerald Dilworth et.al.	2410.20895	link
2024-10-28	Super-resolution with dynamics in the loss	Jacob Page et.al.	2410.20884	null
2024-10-29	zGAN: An Outlier-focused Generative Adversarial Network For Realistic Synthetic Data Generation	Azizjon Azimi et.al.	2410.20808	link
2024-10-28	Rephrasing natural text data with different languages and quality levels for Large Language Model pre-training	Michael Pieler et.al.	2410.20796	null
2024-10-28	Scaling-based Data Augmentation for Generative Models and its Theoretical Extension	Yoshitaka Koike et.al.	2410.20780	null
2024-10-28	Plan $\times$ RAG: Planning-guided Retrieval Augmented Generation	Prakhar Verma et.al.	2410.20753	null
2024-10-28	General Causal Imputation via Synthetic Interventions	Marco Jiralerspong et.al.	2410.20647	null
2024-10-29	TabDiff: a Multi-Modal Diffusion Model for Tabular Data Generation	Juntong Shi et.al.	2410.20626	link
2024-10-25	Considerations for Distribution Shift Robustness of Diagnostic Models in Healthcare	Arno Blaas et.al.	2410.19575	null
2024-10-25	EDGE: Enhanced Grounded GUI Understanding with Enriched Multi-Granularity Synthetic Data	Xuetian Chen et.al.	2410.19461	null
2024-10-25	Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning	Yujian Liu et.al.	2410.19290	link
2024-10-25	In-Simulation Testing of Deep Learning Vision Models in Autonomous Robotic Manipulators	Dmytro Humeniuk et.al.	2410.19277	null
2024-10-24	Equitable Federated Learning with Activation Clustering	Antesh Upadhyay et.al.	2410.19207	null
2024-10-24	Heterogeneous Random Forest	Ye-eun Kim et.al.	2410.19022	link
2024-10-24	Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms	Zhangheng Li et.al.	2410.18967	null
2024-10-24	SkillMimicGen: Automated Demonstration Generation for Efficient Skill Learning and Deployment	Caelan Garrett et.al.	2410.18907	null
2024-10-24	Distill Visual Chart Reasoning Ability from LLMs to MLLMs	Wei He et.al.	2410.18798	link
2024-10-24	Learning Geodesics of Geometric Shape Deformations From Images	Nian Wu et.al.	2410.18797	null
2024-10-24	Unleashing Reasoning Capability of LLMs via Scalable Question Synthesis from Scratch	Yuyang Ding et.al.	2410.18693	link
2024-10-24	DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation	Yuang Ai et.al.	2410.18666	link
2024-10-24	Little Giants: Synthesizing High-Quality Embedding Data at Scale	Haonan Chen et.al.	2410.18634	link
2024-10-24	Knowledge Distillation Using Frontier Open-source LLMs: Generalizability and the Role of Synthetic Data	Anup Shirgaonkar et.al.	2410.18588	null
2024-10-24	Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data	Shuhao Gu et.al.	2410.18558	null