


default search action
CAIN 2025: Ottawa, ON, Canada
- 4th IEEE/ACM International Conference on AI Engineering - Software Engineering for AI, CAIN 2025, Ottawa, ON, Canada, April 27-28, 2025. IEEE 2025, ISBN 979-8-3315-0219-5
- Evangelos Ntentos, Stephen John Warnett, Uwe Zdun:
Rule-Based Assessment of Reinforcement Learning Practices Using Large Language Models. 1-11 - Mateus Devino, Evaline Ju, Paulo Marques Caldeira Junior:
Designing and Implementing LLM Guardrails Components in Production Environments. 12-17 - Alina Mailach, Sebastian Simon, Johannes Dorn, Norbert Siegmund:
Themes of Building LLM-Based Applications for Production: A Practitioner's View. 18-30 - Maryam Ekhlasi, Anurag Prakash, Maxime Lamothe, Michel R. Dagenais:
InsightAI: Root Cause Analysis in Large Log Files with Private Data Using Large Language Model. 31-41 - Hui Song, Arda Goknil, Xiaojun Jiang, Espen Melum, Hyunwhan Joe, Caterina Gazzotti, Valerio Frascolla, Adela Nedisan Videsjorden, Phu Nguyen:
Developing Multi-Agent LLM Applications Through Continuous Human-LLM Co-Programming. 42-47 - Shreyas Kumar Parida, Ilias Gerostathopoulos
, Justus Bogner
:
How Do Model Export Formats Impact the Development of ML-Enabled Systems? A Case Study on Model Integration. 48-59 - Shangeetha Sivasothy, Scott Barnett, Stefanus Kurniawan, Zafaryab Rasool, Rajesh Vasa:
RAGProbe: Breaking RAG Pipelines with Evaluation Scenarios. 60-71 - Vince Nguyen, Vidya Dhopate, Hieu Trung Huynh, Hiba Bouhlal, Anusha Annengala, Gian Luca Scoccia, Matias Martinez, Vincenzo Stoico, Ivano Malavolta
:
On-Device or Remote? On the Energy Efficiency of Fetching LLM-Generated Content. 72-82 - Kritin Maddireddy, Santhosh Kotekal Methukula, Chandrasekar Sridhar, Karthik Vaidhyanathan:
LoCoML: A Framework for Real-World ML Inference Pipelines. 83-88 - Keerthiga Rajenthiram, Milad Abdullah, Ilias Gerostathopoulos
, Petr Hnetynka
, Tomás Bures, Gerard Pons, Besim Bilalli, Anna Queralt:
Towards Continuous Experiment-Driven MLOps. 89-94 - Sergio Morales
, Robert Clarisó
, Jordi Cabot
:
ImageBiTe: A Framework for Evaluating Representational Harms in Text-to-Image Models. 95-106 - Chandrasekar Sridhar, Vyakhya Gupta, Prakhar Jain, Karthik Vaidhyanathan:
Approach Towards Semi-Automated Certification of Low Criticality ML-Enabled Airborne Applications. 107-112 - Yining Hong, Christopher Steven Timperley, Christian Kästner:
From Hazard Identification to Controller Design: Proactive and LLM-Supported Safety Engineering for ML-Powered Systems. 113-118 - Niels With Mikkelsen, Lasse Steen Pedersen, Mansoor Hussain
, Victor Foged, Ekkart Kindler
:
Bringing Machine Learning Models Beyond the Experimental Stage with Explainable AI. 119-129 - Richard Nordsieck, Jan-Philipp Steghöfer, Manish Bhandari:
Challenges in AI Projects for Machinery and Plant Engineering. 130-131 - Razan Abualsaud:
Towards a Domain-Specific Modeling Language for Streamlined Change Management in AI Systems Development. 132-137 - Hamed Barzamini, Fatemeh Nazaritiji, Annalise Brockmann, Hasan Ferdowsi, Mona Rahimi:
An AI-driven Requirements Engineering Framework Tailored for Evaluating AI-Based Software. 138-149 - Karthik Shivashankar, Antonio Martini:
MLScent: A Tool for Anti-Pattern Detection in ML Projects. 150-160 - Boyue Caroline Hu, Divya Gopinath, Corina S. Pasareanu, Nina Narodytska, Ravi Mangal, Susmit Jha:
Debugging and Runtime Analysis of Neural Networks with VLMs (A Case Study). 161-172 - Rodrigo Ximenes, Antonio Pedro Santos Alves, Tatiana Escovedo, Rodrigo O. Spínola, Marcos Kalinowski:
Investigating Issues That Lead to Code Technical Debt in Machine Learning Systems. 173-183 - Santiago del Rey, Adrià Medina, Xavier Franch, Silverio Martínez-Fernández:
Addressing Quality Challenges in Deep Learning: The Role of MLOps and Domain Knowledge. 184-189 - Jinyang Li, Sangwon Hyun, M. Ali Babar:
DDPT: Diffusion-Driven Prompt Tuning for Large Language Model Code Generation. 190-200 - Kannan Parthasarathy, Karthik Vaidhyanathan, Rudra Dhar, Venkat Krishnamachari, Adyansh Kakran, Sreemaee Akshathala, Shrikara Arun, Amey Karan, Basil Muhammed, Sumant Dubey, Mohan Veerubhotla:
Engineering LLM Powered Multi-Agent Framework for Autonomous CloudOps. 201-211 - Lynn Vonderhaar, Timothy Elvira, Omar Ochoa:
Generating and Verifying Synthetic Datasets with Requirements Engineering. 212-221 - Oluwafemi Odu, Alvine Boaye Belle, Song Wang:
LLM-Based Safety Case Generation for Baidu Apollo: Are We there Yet? 222-233 - Dan Liyanage, Mahshid Moha, Sandy Suresh:
SqPal - Text to SQL GenAI Tool for PayPal. 234-235 - Keerthiga Rajenthiram:
Optimizing Data Analytics Workflows Through User-Driven Experimentation: Progress and Updates. 236-240 - Aidin Azamnouri
:
CoCo Challenges in ML Engineering Teams: How to Collaboratively Build ML-Enabled Systems. 241-243 - Yorick Sens
:
Towards a Privacy-by-Design Framework for ML-Enabled Systems. 244-246 - Marvin Muñoz Barón:
Towards an Adoption Framework to Foster Trust in AI-Assisted Software Engineering. 247-249 - Joshua Owotogbe:
Assessing and Enhancing the Robustness of LLM-Based Multi-Agent Systems Through Chaos Engineering. 250-252 - Rumbidzai Chitakunye:
Designing ML-Enabled Software Systems with ML Model Composition: A Green AI Perspective. 253-255 - Renato Cordeiro Ferreira
:
A Metrics-Oriented Architectural Model to Characterize Complexity on Machine Learning-Enabled Systems. 256-260 - Shamse Tasnim Cynthia:
Identification and Optimization of Redundant Code Using Large Language Models. 261-263 - Hasan Kaplan:
Systematic Testing of Security-Related Vulnerabilities in LLM-Based Applications. 264-266 - Hadiza Umar Yusuf:
Model-Based Verification for AI-Enabled Cyber-Physical Systems Through Guided Falsification of Temporal Logic Properties. 267-269 - Merel Veracx:
A Holistic Framework for Evolving AI-Based Systems. 270-272 - Benjamin Weigel, Fabian Stieler, Bernhard Bauer:
All You Need is an AI Platform: A Proposal for a Complete Reference Architecture. 273-274 - Hala Abdelkader, Mohamed Almorsy Abdelrazek, Sankhya Singh, Irini Logothetis, Priya Rani, Rajesh Vasa, Jean-Guy Schneider:
Safeguarding LLM-Applications: Specify or Train? 275-276 - Hadiza Umar Yusuf, Khouloud Gaaloul:
Navigating the Shift: Architectural Transformations and Emerging Verification Demands in AI-Enabled Cyber-Physical Systems. 277-278 - Orlando Marquez Ayala:
Task Decomposition and RAG as Design Patterns for LLM-Based Systems. 279-280 - Aftab Hussain, Md. Rafiqul Islam Rabin, Toufique Ahmed, Mohammad Amin Alipour, Bowen Xu, Stephen Huang:
Finding Trojan Triggers in Code LLMs: An Occlusion-Based Human-in-the-Loop Approach. 281-282 - Katherine R. Dearstyne, Pedro Alarcon Granadeno, Theodore Chambers, Jane Cleland-Huang:
Evaluating Reinforcement Learning Safety and Trustworthiness in Cyber-Physical Systems. 283-284 - Qiulu Peng, Chi Zhang, Ravi Mangal, Corina S. Pasareanu, Limin Jia
:
Random Perturbation Attack on LLMs for Code Generation. 285-287

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.