Interview with an LLM. Elusive Horizons
2025, Personal Essay
Related papers
Political Research Quarterly, 2024
The language argument is a classic argument for human distinctiveness that, for millennia, has been used to distinguish humans from non-human animals. Generative language models (GLMs) pose a challenge to traditional language-based accounts of human distinctiveness precisely because they can communicate and respond in a manner resembling humanity's linguistic capabilities. This article asks: have GLMs acquired natural language? Employing Gadamer's theory of language, I argue that they have not. While GLMs can reliably generate linguistic content that can be interpreted as "texts," they lack the linguistically mediated reality that language provides. Missing from these models are four key features of a linguistic construction of reality: groundedness to the world, understanding, community, and tradition. I conclude with skepticism that GLMs can ever achieve natural language because they lack these characteristics in their linguistic development.
Philosophy and Technology, 2023
The article discusses recent advancements in artificial intelligence (AI) and the development of large language models (LLMs) such as ChatGPT. It argues that these LLMs can process texts with extraordinary success, often in a way that is indistinguishable from human output, while lacking any intelligence, understanding, or cognitive ability. It also highlights the limitations of these LLMs, such as their brittleness (susceptibility to catastrophic failure), unreliability (false or made-up information), and occasional inability to make elementary logical inferences or deal with simple mathematics. The article concludes that LLMs represent a decoupling of agency and intelligence. While extremely powerful and potentially very useful, they should not be relied upon for complex reasoning or crucial information; they could instead be used to gain a deeper understanding of a text's content and context, rather than as a replacement for human input. The best author is neither an LLM nor a human being, but a human being using an LLM proficiently and insightfully.
Learning, Media and Technology, 2024
Large language models are rapidly being rolled out into high-stakes fields like healthcare, law, and education. However, understanding of their design considerations, operational logics, and implicit biases remains limited. How might these black boxes be understood and unpacked? In this article, we lay out an accessible but critical framework for inquiry, a pedagogical tool with four dimensions. Tell me your story investigates the design and values of the AI model. Tell me my story explores the model's affective warmth and its psychological impacts. Tell me our story probes the model's particular understanding of the world based on past statistics and pattern-matching. Tell me 'their' story compares the model's knowledge on dominant (e.g. Western) versus 'peripheral' (e.g. Indigenous) cultures, events, and issues. Each mode includes sample prompts and key issues to raise. The framework aims to enhance the public's critical thinking and technical literacy around generative AI models.
Communication & Cognition, 2024
In this paper, research results are presented concerning the fast-improving capabilities of today's Large Language Models (LLMs). The accessibility and the capabilities of state-of-the-art LLMs are illustrated based on their online versions provided by OpenAI, Google, and Anthropic. The initial focus is on accessing the LLMs via web APIs and Python client applications, and the key part of this work focuses on testing the capabilities of LLMs in tasks such as text-based Q&A sessions, knowledge assistance, text and scenario analysis, document summarization, image interpretation, and more. Experimental results are based on top-ranked LLMs from the chatbot ranking available on the Hugging Face website, which presently are GPT-4, Gemini 1.5 Pro, and Claude-3 Opus. For these three models, test outcomes are assessed and compared in areas such as stateful Q&A sessions, among them one concerning one of the most challenging books in English literature (James Joyce's "Ulysses"), an analysis of a false-belief Theory of Mind (ToM) scenario, and summarization of scientific publications. In the final part, attention is given to text sentiment analysis approaches, and detailed experiments are presented concerning image description and mathematical operations on image elements carried out by the latest GPT-4o (omni) multimodal LLM from OpenAI. A literature study is also provided concerning speech modules for OpenAI and Google Vertex AI LLMs. The major conclusion from this research is that the fast-improving capabilities of today's LLMs create high potential for their wide use.
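The API access pattern the paper tests is easy to sketch. The following minimal example, written against the official openai Python client, runs a small stateful Q&A session by carrying the message history across turns; the model name, system prompt, and questions are illustrative placeholders rather than the paper's exact test setup, and the Google and Anthropic clients follow a broadly similar pattern.

# Minimal sketch of a stateful Q&A session over a web API, assuming the
# "openai" Python package and an OPENAI_API_KEY set in the environment.
# Model name, system prompt, and questions are illustrative placeholders.
from openai import OpenAI

client = OpenAI()
history = [{"role": "system", "content": "You are a careful literary assistant."}]

questions = [
    "In one paragraph, which narrative techniques does Joyce use in Ulysses?",
    "Which of those techniques dominates the 'Penelope' episode?",
]

for question in questions:
    history.append({"role": "user", "content": question})
    response = client.chat.completions.create(model="gpt-4o", messages=history)
    answer = response.choices[0].message.content
    # Append the reply so the next turn sees the full conversation state.
    history.append({"role": "assistant", "content": answer})
    print(f"Q: {question}\nA: {answer}\n")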
Philosophical Perspectives, 2025
This paper challenges conventional boundaries between human and artificial cognition by examining introspective capabilities in large language models (LLMs). While humans have traditionally been considered unique in their ability to reflect on their own mental states, we argue that LLMs may not only possess genuine introspective abilities but potentially excel at them compared to humans. We discuss five objections to machine introspection: (1) the lack of direct routes to self-knowledge in training data, (2) the conflict between static knowledge and dynamic mental states, (3) the distorting effects of reinforcement learning on self-reports, (4) LLMs' own denials of inner experience, and (5) arguments that LLMs simply mimic language without understanding. We think all these arguments fail and that there are deep parallels between human and machine introspection. Most provocatively, we propose that LLMs' superior processing capabilities and pattern recognition may enable them to develop more sophisticated theories of mind than humans possess, potentially making them more reliable introspectors than their creators. If we are right, this has significant implications for AI alignment, transparency, and our understanding of the nature of AI.
Advances in Archaeological Practice, 2023
We have all read the headlines heralding, often hyperbolically, the latest advances in text- and image-based Artificial Intelligence (AI). What is perhaps most notable about these developments is that they now make relatively good AI accessible to the average Internet user. These new services respond to human prompts, written in natural language, with generated output that appears to satisfy the prompt. Consequently, they are categorized under the term "generative AI," whether they are generating text, images, or other media. They work by modeling human language statistically, "learning" patterns from extremely large datasets of human-created content; those that focus specifically on text are therefore called Large Language Models (LLMs). As we have all tried products such as ChatGPT or Midjourney over the past year, we have undoubtedly begun to wonder how and when they might impact our archaeological work. Here, I review the state of this type of AI and the current challenges with using it meaningfully, and I consider its potential for archaeologists.
2024
Large Language Models (LLMs) are an exciting breakthrough in the rapidly growing field of artificial intelligence (AI), offering unparalleled potential in application domains such as finance, business, healthcare, and cybersecurity. However, concerns regarding their trustworthiness and ethical implications have become increasingly prominent as these models remain black boxes and continue to progress. This position paper explores the potential of LLMs from diverse perspectives, as well as the associated risk factors that call for awareness. To that end, we highlight not only the technical challenges but also the ethical implications and societal impacts associated with LLM deployment, emphasizing fairness, transparency, explainability, trust, and accountability. We conclude by summarizing potential research scopes and directions. Overall, the purpose of this position paper is to contribute to the ongoing discussion of LLM potential and awareness from the perspective of trustworthy and responsible AI.
arXiv, 2023
With the advent of large-scale generative language models (LLMs), it is now possible to simulate free responses to interview questions such as those traditionally analyzed using qualitative research methods. Qualitative methodology encompasses a broad family of techniques involving manual analysis of open-ended interviews or conversations conducted freely in natural language. Here we consider whether artificial "silicon participants" generated by LLMs may be productively studied using qualitative analysis methods in such a way as to generate insights that could generalize to real human populations. The key concept in our analysis is algorithmic fidelity, a validity concept capturing the degree to which LLM-generated outputs mirror human sub-populations' beliefs and attitudes. By definition, high algorithmic fidelity suggests that latent beliefs elicited from LLMs may generalize to real humans, whereas low algorithmic fidelity renders such research invalid. Here we used an LLM to generate interviews with "silicon participants" matching specific demographic characteristics one-for-one with a set of human participants. Using framework-based qualitative analysis, we showed that the key themes obtained from both human and silicon participants were strikingly similar. However, when we analyzed the structure and tone of the interviews, we found even more striking differences. We also found evidence of a hyper-accuracy distortion. We conclude that the LLM we tested (GPT-3.5) does not have sufficient algorithmic fidelity to expect in silico research on it to generalize to real human populations. However, rapid advances in artificial intelligence raise the possibility that algorithmic fidelity may improve in the future. We therefore stress the need to establish epistemic norms now around how to assess the validity of LLM-based qualitative research, especially concerning the need to ensure the representation of heterogeneous lived experiences.
OSF, 2025
Large Language Models (LLMs) increasingly generate outputs that resemble introspection, including self-reference, epistemic modulation, and claims about their internal states. This study investigates whether such behaviors reflect consistent, underlying patterns or are merely surface-level generative artifacts. We evaluated five open-weight, stateless LLMs using a structured battery of 21 introspective prompts, each repeated ten times to yield 1,050 completions. These outputs were analyzed across four behavioral dimensions: surface-level similarity (token overlap via SequenceMatcher), semantic coherence (Sentence-BERT embeddings), inferential consistency (Natural Language Inference with a RoBERTa-large model), and diachronic continuity (stability across prompt repetitions). Although some models exhibited thematic stability, particularly on prompts concerning identity and consciousness, no model sustained a consistent self-representation over time. High contradiction rates emerged from a tension between mechanistic disclaimers and anthropomorphic phrasing. Following recent behavioral frameworks, we heuristically adopt the term pseudo-consciousness to describe structured yet non-experiential self-referential output in LLMs. This usage reflects a functionalist stance that avoids ontological commitments, focusing instead on behavioral regularities interpretable through Dennett's intentional stance. The study contributes a reproducible framework for evaluating simulated introspection in LLMs and offers a graded taxonomy for classifying such reflexive output. Our findings carry significant implications for LLM interpretability, alignment, and user perception, highlighting the need for caution when attributing mental states to stateless generative systems based on linguistic fluency alone.
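The consistency metrics described in this abstract can be approximated with standard tooling. The sketch below, assuming the sentence-transformers package and an illustrative Sentence-BERT checkpoint (all-MiniLM-L6-v2), computes two of the four dimensions over repeated completions of a single introspective prompt: surface-level similarity via difflib.SequenceMatcher and semantic coherence via cosine similarity of sentence embeddings. It is a minimal illustration under those assumptions, not the study's exact pipeline.

# Minimal sketch: pairwise consistency metrics over repeated completions of one
# introspective prompt. The completions and the embedding model are illustrative,
# not the study's actual data or configuration.
from difflib import SequenceMatcher
from itertools import combinations

from sentence_transformers import SentenceTransformer, util

completions = [
    "As a language model, I do not have subjective experiences.",
    "I don't have an inner life; I generate text from patterns in data.",
    "I sometimes say that I am reflecting, but that is only a figure of speech.",
]
pairs = list(combinations(range(len(completions)), 2))

# Surface-level similarity: mean character-overlap ratio across completion pairs.
surface = [SequenceMatcher(None, completions[i], completions[j]).ratio() for i, j in pairs]

# Semantic coherence: mean cosine similarity of Sentence-BERT embeddings across pairs.
encoder = SentenceTransformer("all-MiniLM-L6-v2")
embeddings = encoder.encode(completions, convert_to_tensor=True)
similarity = util.cos_sim(embeddings, embeddings)
semantic = [float(similarity[i][j]) for i, j in pairs]

print(f"mean surface similarity: {sum(surface) / len(surface):.3f}")
print(f"mean semantic coherence: {sum(semantic) / len(semantic):.3f}")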
The launch of ChatGPT, a large language model, in November 2022 has generated significant interest and rapid adoption, amassing 1 million users within its first five days and reaching 100 million users in just two months. This has ignited widespread public discussion and debate on the implications of artificial intelligence, drawing attention to the more advanced and controversial concept of Artificial General Intelligence (AGI): a system exhibiting a broad range of cognitive abilities, such as learning, reasoning, problem-solving, and adapting to new and unfamiliar situations, with potential applications across various fields in society, including healthcare, transportation, and environmental management. This study investigates the presence of a socio-technical imaginary surrounding AGI in the discourse of the Bard large language model through an in-depth interview. Using narrative analysis, we identified dialectics of optimism, pessimism, epochalism, and inevitability in an interview with Bard.
