International Scientific Journal of Engineering and Management

An International Scholarly || Multidisciplinary || Open Access || Indexing in all major Database & Metadata
The journal follows the UGC Guidelines and is evaluated for inclusion in the Web of Science
ISSN: 2583-6129

Impact Factor: 8.072

Data Automation Pipeline: A Kernel-Centric Neuro-Symbolic Architecture for Autonomous Data Science

  • Create Date 21 March 2026
  • Last Updated 21 March 2026

Kajjam Hariprasad
Department of CSE, Jyothishmathi Institute of Technology and Science, Karimnagar, Telangana, India
prasadkajjam5@gmail.com

Shivanath Hanumakonda
Department of CSE, Jyothishmathi Institute of Technology and Science, Karimnagar, Telangana, India
222.6A7shivanath@gmail.com

Pravalika Daivala
Department of CSE, Jyothishmathi Institute of Technology and Science, Karimnagar, Telangana, India
22.688pravalikadaivala@gmail.com

Vamshi Kadavergula
Department of CSE, Jyothishmathi Institute of Technology and Science, Karimnagar, Telangana, India
22271a66b7vamshi@gmail.com

Abstract: The proliferation of Large Language Models (LLMs) has catalyzed a paradigm shift from static code completion toward autonomous agentic execution. Despite demonstrable proficiency in generating syntactically valid Python, contemporary Co-Pilot systems remain fundamentally decoupled from the runtime state of the environments they serve, producing generation errors that neither static analysis nor model scaling can eliminate. This paper introduces DataCursor, a hybrid neuro-symbolic architecture that resolves this limitation through a Kernel-Centric Architecture (KCA) in which a persistent, stateful Jupyter Kernel functions as the authoritative symbolic oracle for all neural reasoning steps. A Context Extraction Pipeline (CEP) continuously harvests live kernel state (variable bindings, DataFrame schemas, and cell output traces) and packages the results into a structured context object that prefixes every LLM generation request. A Dual-Loop Control System (DLCS) pairs a deterministic symbolic execution loop with an adaptive neural recovery loop; whenever execution raises an exception, the outer loop re-conditions the generator on the error trace and produces a revised artifact. External tool integration is governed by the Model Context Protocol (MCP), providing process-isolated, hot-swappable satellite capabilities. A formal control-theoretic characterization of the DLCS is presented alongside a design validation and architectural robustness analysis. These results position runtime state injection as a practically deployable and theoretically grounded foundation for the next generation of autonomous data science tooling.

Index Terms: Autonomous Data Science, Neuro-Symbolic AI, Large Language Models, Jupyter Kernel, Model Context Protocol, Agentic Systems, Dual-Loop Control, Runtime Context Injection.
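
The interplay between context extraction and the dual-loop recovery described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: a plain dictionary executed with `exec` stands in for the persistent Jupyter Kernel, a hand-written stub stands in for the LLM, and every name (`extract_context`, `run_with_recovery`, `stub_generator`) is hypothetical.

```python
import traceback
from typing import Any, Callable, Dict, Optional

# Assumption: a generator takes (task, context, last_error) and returns code.
Generator = Callable[[str, Dict[str, str], Optional[str]], str]


def extract_context(namespace: Dict[str, Any]) -> Dict[str, str]:
    """Summarise live bindings as name -> type name (a toy stand-in for the CEP)."""
    return {
        name: type(value).__name__
        for name, value in namespace.items()
        if not name.startswith("_")  # skip private names and __builtins__
    }


def run_with_recovery(
    task: str,
    generate: Generator,
    namespace: Dict[str, Any],
    max_attempts: int = 3,
) -> Dict[str, Any]:
    """Outer loop: regenerate code from the error trace until execution succeeds."""
    error: Optional[str] = None
    for attempt in range(1, max_attempts + 1):
        context = extract_context(namespace)  # refreshed before every request
        code = generate(task, context, error)
        try:
            exec(code, namespace)  # inner loop: deterministic execution
            return {"ok": True, "attempts": attempt, "code": code}
        except Exception:
            error = traceback.format_exc()  # fed back to the generator
    return {"ok": False, "attempts": max_attempts, "error": error}


def stub_generator(task: str, context: Dict[str, str], error: Optional[str]) -> str:
    """Hypothetical stand-in for an LLM: guesses a name first, then uses the context."""
    if error is None:
        return "total = sum(value)"  # wrong variable name, raises NameError
    list_name = next(name for name, kind in context.items() if kind == "list")
    return f"total = sum({list_name})"


namespace: Dict[str, Any] = {"values": [1, 2, 3]}
result = run_with_recovery("sum the list", stub_generator, namespace)
print(result["ok"], result["attempts"], namespace["total"])
```

In this toy run the first artifact fails because it references an unbound name, and the second attempt succeeds once the generator consults the extracted bindings. A real system in the spirit of the abstract would presumably replace `exec` with requests to a kernel and the stub with a model call; those parts are left out here because the abstract does not specify them.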

