GPT-4.1 is the latest advancement in OpenAI’s large language models, providing enhanced accuracy and response quality compared to its predecessor, GPT-4o. It builds on the strengths of earlier models while delivering more consistent performance across tasks such as reading comprehension, content generation, and interactive learning tools like quizzes.

GPT-4.1 vs. GPT-4o: Key Differences and Comparison

When evaluating the transition from GPT-4o to GPT-4.1, several key criterias should be considered to determine which model best fits your use case:

Context Window: The context window defines the maximum number of tokens (words, code, or symbols) the model can process in a single request, including both input and output. GPT-4.1 offers a much larger context window (up to 1,047,576 tokens) compared to GPT-4o (128,000 tokens), making it ideal for processing entire books or large documents without splitting them into smaller chunks.
Token Limits and Handling: Exceeding a model’s token limit can result in errors, incomplete outputs, or truncated responses. It’s important to design prompts and workflows that respect these boundaries. GPT-4.1’s expanded token capacity allows for more comprehensive analysis and richer outputs, while GPT-4o’s smaller window may require more aggressive chunking or summarization.
Model Capabilities and Use Case Alignment: GPT-4.1 is not specifically a “reasoning” model, but it excels at large-scale, structured text processing tasks, such as extracting topics from books or generating detailed quizzes. It also supports multimodality (text, image, etc.), though GPT-4o is more optimized for real-time multimodal interactions (text, image, audio). GPT-4.1’s strength lies in maintaining context and producing consistent, structured outputs over long documents, while GPT-4o offers faster and more versatile responses but with a smaller context window.
Performance and Speed: GPT-4o is designed for lower latency and higher throughput, making it suitable for interactive applications or scenarios where response time is critical. GPT-4.1 may have longer processing times due to its larger context window and more extensive outputs.
Output Structure and Quality: For tasks requiring highly structured outputs (such as JSON, quizzes, or topic lists), GPT-4.1’s consistency and ability to retain context across large documents provide a significant advantage. GPT-4o is better suited for tasks that benefit from multimodal understanding or require rapid, conversational responses.
Cost Considerations: Larger context windows and more complex outputs can increase API usage costs. Evaluate the trade-offs between model capabilities, required output quality, and budget constraints.

GPT-4.1 vs. GPT-4o on Azure (as of May 29, 2025)

Feature	GPT-4.1 (Azure)	GPT-4o (Azure)
Max Context Window	Up to 1,047,576 tokens	Up to 128,000 tokens (input)
Max Output Tokens	Up to 32,768 tokens	Up to 16,384 tokens
Best Use Cases	Large-scale text processing, in-depth analysis, educational tools (e.g., quizzes). Multimodality supported	Multimodal tasks (text, image, audio), real-time interactions

For more detailed information, refer to the official Azure OpenAI documentation: https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models

Step-by-Step Workflow

The book I used for this demostration is Beginning C# Object-Oriented Programming 2nd edition - Dan Clark, with total 373 pages. This book contains rich, structured technical content that benefits from hierarchical topic extraction.

📚 Input: Books

The pipeline begins with digital or scanned books as the raw input material.

📄 Document Intelligence

A Document Intelligence service (such as Azure AI Document Intelligence or OCR) processes the books to extract structured content. The output is transformed into clean Markdown format for easier parsing and semantic analysis.

📝 Markdown

The structured Markdown acts as an intermediary layer, organizing content in a way that’s readable and machine-processable. This format facilitates the next stages of topic extraction and quiz generation.

The output markdown from the book is 767,408 characters ≈ ~153,000 tokens (based on 5 characters/token average). GPT-4.1 on Azure supports up to 1 million context window, allowing processing entire books or large chunks without splitting them into artificial sections.

Why This Matters:

No need for aggressive chunking or summarizing before topic extraction.
Retains full context of terminology, code samples, and explanations.
Allows hierarchical topic modeling: sections → chapters → concepts.

I you cannot OCR via Document Intelligence or other similar tool, you can use my result in this gist: https://gist.github.com/zhenyuan0502/4ee1d7df86fae62d513bf196c0bbc47b, let’s say you will save this as markdown.md

✨ GPT-4.1: Topic Extraction

The first GPT-4.1 model analyzes the Markdown content to identify key topics or concepts from the book. These topics help frame the knowledge scope and guide focused quiz generation.

My developer prompt, prompt_get_topic.md

# Topic Extraction Prompt

## Task Description
You will analyze the provided book content and extract key topics and concepts. 
Your task is to identify major topics, provide concise descriptions, list related subtopics, and note related categories.

## Output Requirements
Generate a structured list of topics following provided pydantic model

## Guidelines
- Identify 10-15 major topics from the book content
- Each topic should be significant and self-contained
- Provide accurate, concise descriptions (max 2 sentences)
- Include 3-5 relevant subtopics for each main topic
- List 1-3 related categories or domains that connect to the topic
- Ensure topics represent the book's core content and structure
- Avoid overly general topics or extremely specific details

## Example (for a programming book)

{
    "topic_name": "Object-Oriented Programming Fundamentals",
    "topic_description": "Core concepts and principles of object-oriented programming paradigm. Includes encapsulation, inheritance, and polymorphism as central tenets.",
    "subtopics": ["Classes and Objects", "Inheritance and Composition", "Polymorphism", "Encapsulation", "Abstraction"],
    "related_categories": ["Programming Paradigms", "Software Design", "Computer Science Fundamentals"]
}

As processing through json would be easier, I organize the Structured Ouput with Pydantic model and include them in the Responses API - the vNext of OpenAI SDK in python

model.py

from pydantic import BaseModel, Field
from typing import List

class Answer(BaseModel):
    text: str = Field(..., description="The text content of the answer option")
    explanation: str = Field(..., description="Explanation of why this answer is correct or incorrect")
    is_correct: bool = Field(..., description="Boolean flag indicating if this is the correct answer")

class MultipleChoiceQuestion(BaseModel):
    question: str = Field(..., description="The question text being asked")
    category: str = Field(..., description="The subject category the question belongs to")
    difficulty: str = Field(..., description="The difficulty level of the question (e.g., easy, medium, hard)")
    answers: List[Answer] = Field(..., description="List of possible answers (should contain exactly 4 answers)")

    def get_correct_answer(self) -> Answer:
        for answer in self.answers:
            if answer.is_correct:
                return answer
        raise ValueError("No correct answer found.")

    def get_incorrect_answers(self) -> List[Answer]:
        return [answer for answer in self.answers if not answer.is_correct]

class ListQuestion(BaseModel):
    responses: List[MultipleChoiceQuestion] = Field(..., description="List of response strings or questions")
    topic: str = Field(..., description="The topic or subject these questions relate to")

class Topic(BaseModel):
    name: str = Field(..., description="The name or title of the topic")
    description: str = Field(..., description="A brief description of the topic")
    questions: List[MultipleChoiceQuestion] = Field(..., description="List of multiple choice questions related to this topic")
    related_topics: List[str] = Field(..., description="List of related topics or subtopics")

class ListTopic(BaseModel):
    topics: List[Topic] = Field(..., description="List of topics extracted from the content")

Here is my get_aoai_instance_v2() function, you can replace and short down credential to gpt-4.1 only:


def get_aoai_instance_v2(model="o4-mini"):
    api_key = os.environ.get("AZURE_OPENAI_API_KEY_V2")
    endpoint = os.environ.get("AZURE_OPENAI_ENDPOINT_V2")
    if model == "o4-mini":
        deployment = os.environ.get("AZURE_OPENAI_DEPLOYMENT_V2_O4MINI")
    elif model == "o3":
        deployment = os.environ.get("AZURE_OPENAI_DEPLOYMENT_V2_O3")
    elif model == "gpt-4o":
        deployment = os.environ.get("AZURE_OPENAI_DEPLOYMENT_V2_GPT_4O")
    elif model == "gpt-4.1":
        deployment = os.environ.get("AZURE_OPENAI_DEPLOYMENT_V2_GPT_41")
    else:
        raise ValueError("Invalid model name. Use 'o4-mini' or 'o3'.")
    api_version = os.environ.get("AZURE_OPENAI_API_VERSION_V2")

    client = AzureOpenAI(
        azure_endpoint=endpoint,
        api_key=api_key,
        api_version=api_version
    )
    return client, deployment

Function get_topics(), with markdown_content is read from markdown.md

markdown_content = open('markdown.md'), 'r', encoding='utf-8').read()

def get_topics(markdown_content):
    client, deployment = get_aoai_instance_v2(model="gpt-4.1")
    prompt_get_topics = open(os.path.join(os.path.dirname(__file__), 'prompt_get_topic.md'), 'r', encoding='utf-8').read()

    response = client.responses.parse(
            model=deployment,
            input=[
                {"role": "developer", "content": prompt_get_topics},
                {"role": "user", "content": f"""
                 
                For the book content provided below, extract key topics following these guidelines and format:
                 
                 
                {markdown_content}
                
                 
                 """},
            ],
            text_format=model.ListTopic
        )

    text = response.output_parsed.model_dump_json(indent=4)
    
    return text

I saved the result in the topic.json, it looks like:

{
    "topics": [
        {
            "name": "Object-Oriented Programming Fundamentals",
            "description": "Explores the foundational principles of object-oriented programming (OOP), its history, importance, and basic characteristics like abstraction, encapsulation, inheritance, and polymorphism.",
            "questions": [],
            "related_topics": [
                "Classes and Objects",
                "OOP Language Features",
                "Programming Paradigms"
            ]
        },
        {
            "name": "Unified Modeling Language (UML) and OOP Design",
            "description": "Presents the role and elements of UML in designing object-oriented solutions, including use cases, class diagrams, sequence diagrams, and activity diagrams.",
            "questions": [],
            "related_topics": [
                "Software Requirements Specification (SRS)",
                "Use Cases",
                "Class Diagrams"
            ]
        },
        {
            "name": "Designing Object-Oriented Solutions",
            "description": "Focuses on practical software design using OOP, including requirements analysis, identifying classes, modeling object interactions, and documenting relationships.",
            "questions": [],
            "related_topics": [
                "System Analysis",
                "Modeling Object Relationships",
                "Case Studies"
            ]
        },
        {
            "name": "C# Language and .NET Framework Basics",
            "description": "Introduces the evolution of the C# language, its syntax, the .NET ecosystem, and the goals and components of the .NET Framework such as CLR, assemblies, and libraries.",
            "questions": [],
            "related_topics": [
                "Visual Studio IDE",
                "Managed Code",
                "Assemblies and Namespaces"
            ]
        },
        {
            "name": "Working with Classes in C#",
            "description": "Details how to define, create, and use classes and objects in C#, including encapsulation, properties, methods, constructors, and method overloading.",
            "questions": [],
            "related_topics": [
                "Class Constructors",
                "Method Overloading",
                "Encapsulation"
            ]
        },
        {
            "name": "Class Hierarchies, Inheritance, and Polymorphism",
            "description": "Explains how to construct class hierarchies using inheritance, implement and override methods, use access modifiers for visibility, and achieve polymorphism via inheritance and interfaces.",
            "questions": [],
            "related_topics": [
                "Abstract and Sealed Classes",
                "Interfaces",
                "Access Modifiers"
            ]
        },
        {
            "name": "Object Collaboration, Messaging, and Event-Driven Programming",
            "description": "Explores how objects communicate through method calls and events, handle parameters, use delegates, and respond to or raise events in C# applications.",
            "questions": [],
            "related_topics": [
                "Delegates",
                "Event Handlers",
                "Exception Handling"
            ]
        },
        {
            "name": "Collections, Arrays, and Generics",
            "description": "Covers the use of collections such as arrays, lists, stacks, queues, and generic collections in .NET, along with iteration, sorting, and extending collection types.",
            "questions": [],
            "related_topics": [
                "ArrayList",
                "Generic Collections",
                "Stacks and Queues"
            ]
        },
        {
            "name": "Data Access with ADO.NET and Entity Framework",
            "description": "Introduces accessing relational databases using ADO.NET, managing connections and commands, working with DataSets and DataTables, and leveraging Entity Framework for ORM and LINQ queries.",
            "questions": [],
            "related_topics": [
                "SQL Connections",
                "DataTables/DataSets",
                "LINQ to Entities"
            ]
        },
        {
            "name": "Developing Windows Applications with WPF",
            "description": "Explains building user interfaces using Windows Presentation Foundation (WPF), including XAML markup, layout and display controls, event handling, data binding, and using templates.",
            "questions": [],
            "related_topics": [
                "XAML",
                "Data Binding",
                "WPF Controls"
            ]
        },
        {
            "name": "Developing Web Applications with ASP.NET",
            "description": "Covers the creation of web-based user interfaces using ASP.NET Web Forms, managing server controls, page life cycle, state management, and data binding in web contexts.",
            "questions": [],
            "related_topics": [
                "Page Events",
                "Web Server Controls",
                "Model Binding"
            ]
        },
        {
            "name": "Developing Windows Store Apps",
            "description": "Discusses Windows Store application development leveraging XAML for UIs, page navigation, touch events, style sheets, and binding data to controls in C#.",
            "questions": [],
            "related_topics": [
                "XAML in Windows Store Apps",
                "Page Navigation",
                "Touch Events"
            ]
        },
        {
            "name": "Web Services with WCF and ASP.NET Web API",
            "description": "Describes how to create, deploy, and consume web services using Windows Communication Foundation (WCF) and ASP.NET Web API, focusing on service contracts, endpoints, data contracts, and RESTful design.",
            "questions": [],
            "related_topics": [
                "SOAP and REST",
                "Service Consumption",
                "Serialization"
            ]
        },
        {
            "name": "Multi-Tier Application Architecture and Case Study",
            "description": "Demonstrates designing and building multi-layered applications, such as the Office Supply Ordering (OSO) system, dividing responsibilities into data access, business logic, and presentation layers.",
            "questions": [],
            "related_topics": [
                "3-Tier Architecture",
                "UI/Business/Data Layers",
                "Case Studies"
            ]
        },
        {
            "name": "Fundamental Programming Concepts in C#",
            "description": "Provides foundational knowledge of variables, data types, type conversions, operators, control flow structures, loops, and method declaration and usage in C#.",
            "questions": [],
            "related_topics": [
                "Data Types",
                "Operators",
                "Control Structures"
            ]
        }
    ]
}

✨ GPT-4.1: Quiz Generation

A second call to GPT-4.1 is made, this time to generate multiple-choice quizzes based on the Markdown content and extracted topics - actually I used random 3 of 10 topics above to make it more quizzes generated per topic. This ensures the quizzes are relevant, accurate, and aligned with the core material.

My developer prompt, prompt_create_qna.md

# Custom Question Answering Prompting Guide

## Objective
Create a custom question-answering system using input from a markdown file. The system should generate multiple-choice questions based on the content.

## Notes
- Ensure questions span different cognitive levels (recall, understanding, application).
- Balance question difficulty based on the complexity of the source material.
- Use the markdown structure to identify key sections, topics, and relationships.
- For technical content, include code-based questions where appropriate.
- Consider including true/false or fill-in-the-blank questions for variety.

Function generate_quiz(), with json_topic_text is purely text in topic.json and same markdown_content

markdown_content = open('markdown.md'), 'r', encoding='utf-8').read()
json_topic_text = open('topic.json'), 'r', encoding='utf-8').read()

def generate_quiz(markdown_content, num_questions, json_topic_text):
    # read json_topics as model.ListTopic
    json_topics = model.ListTopic.model_validate_json(json_topic_text)

    random_topics = random.sample(json_topics.topics, 3)

    random_topics_dict = [topic.model_dump() for topic in random_topics]

    client, deployment = get_aoai_instance_v2(model="gpt-4.1")
    prompt_create_qna = open(os.path.join(os.path.dirname(__file__), 'prompt_create_qna.md'), 'r', encoding='utf-8').read()

    response = client.responses.parse(
            model=deployment,
            input=[
                {"role": "developer", "content": prompt_create_qna},
                {"role": "user", "content": f"""
                
                Covered topics:
               
                    {json.dumps(random_topics_dict, indent=4)}
                
                 
                Please create {num_questions} questions based on the following markdown content with above topics.
                
                    {markdown_content}
                
                 
                 """},
            ],
            text_format=model.ListQuestion
        )
    list_questions = response.output_parsed.model_dump_json(indent=4)
    
    return list_questions, json.dumps(random_topics_dict, indent=4)

✅ Output: Quiz

The final output is a set of multiple-choice quizzes that can be used for self-assessment, educational apps, or classroom activities.

{
    "responses": [
        {
            "question": "Which of the following best describes the purpose of encapsulation in object-oriented programming?",
            "category": "Class Structures and Hierarchies",
            "difficulty": "easy",
            "answers": [
                {
                    "text": "It hides the internal data of an object and allows access only through methods or properties.",
                    "explanation": "Encapsulation is about restricting direct access to the internal state of an object, providing controlled access through public methods and properties.",
                    "is_correct": true
                },
                {
                    "text": "It enables objects to inherit properties and methods from a base class.",
                    "explanation": "This describes inheritance, not encapsulation.",
                    "is_correct": false
                },
                {
                    "text": "It allows multiple objects to be composed together.",
                    "explanation": "This describes aggregation, not encapsulation.",
                    "is_correct": false
                },
                {
                    "text": "It allows a method to have multiple forms.",
                    "explanation": "This describes polymorphism, not encapsulation.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "Which key benefit did the .NET Framework bring compared to previous Microsoft platforms?",
            "category": ".NET Framework and Visual Studio",
            "difficulty": "easy",
            "answers": [
                {
                    "text": "Automatic memory management through garbage collection.",
                    "explanation": "The .NET Framework introduced automatic garbage collection, simplifying memory management for developers.",
                    "is_correct": true
                },
                {
                    "text": "Code is always interpreted, never compiled.",
                    "explanation": "This is incorrect. .NET code is compiled to MSIL and then JIT compiled to native code.",
                    "is_correct": false
                },
                {
                    "text": "All applications must run in Internet Explorer.",
                    "explanation": "This is incorrect. .NET enables development for Windows and web applications.",
                    "is_correct": false
                },
                {
                    "text": "Only supports Visual Basic language.",
                    "explanation": "This is incorrect. .NET supports multiple languages, including C#, VB.NET, F#, and others.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "Which UML diagram is primarily used to model the static structure of classes in a system?",
            "category": "Class Structures and Hierarchies",
            "difficulty": "medium",
            "answers": [
                {
                    "text": "Class Diagram",
                    "explanation": "Class diagrams model the static structure, showing classes, their attributes, and relationships.",
                    "is_correct": true
                },
                {
                    "text": "Sequence Diagram",
                    "explanation": "Sequence diagrams model interactions over time, not static structures.",
                    "is_correct": false
                },
                {
                    "text": "Activity Diagram",
                    "explanation": "Activity diagrams model workflows/processes, not class structures.",
                    "is_correct": false
                },
                {
                    "text": "Use Case Diagram",
                    "explanation": "Use case diagrams model system functionality from the user's perspective, not class structures.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "What is the purpose of the DataSet object in ADO.NET?",
            "category": "Data Access in C# with ADO.NET and Entity Framework",
            "difficulty": "medium",
            "answers": [
                {
                    "text": "To hold a disconnected, in-memory cache of data tables retrieved from a database.",
                    "explanation": "DataSet stores data tables locally in memory, allowing disconnected manipulation and later updating to the database.",
                    "is_correct": true
                },
                {
                    "text": "To manage direct, live connections to the database.",
                    "explanation": "This is handled by the Connection object, not DataSet.",
                    "is_correct": false
                },
                {
                    "text": "To execute SQL queries directly on the database.",
                    "explanation": "This is the responsibility of the Command object.",
                    "is_correct": false
                },
                {
                    "text": "To persistently store all changes to the database immediately.",
                    "explanation": "DataSet works with local, in-memory data; updates to the database are made explicitly when needed.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "Which of the following is NOT a characteristic of object-oriented programming?",
            "category": "Class Structures and Hierarchies",
            "difficulty": "easy",
            "answers": [
                {
                    "text": "Execution of code from top to bottom in a single linear stream.",
                    "explanation": "This describes procedural programming, not a characteristic of OOP.",
                    "is_correct": true
                },
                {
                    "text": "Abstraction",
                    "explanation": "Abstraction is a core characteristic of OOP.",
                    "is_correct": false
                },
                {
                    "text": "Inheritance",
                    "explanation": "Inheritance is a core characteristic of OOP.",
                    "is_correct": false
                },
                {
                    "text": "Polymorphism",
                    "explanation": "Polymorphism is a core characteristic of OOP.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "True or False: Inheritance allows a child class to reuse and extend the behavior and attributes of a base class.",
            "category": "Class Structures and Hierarchies",
            "difficulty": "easy",
            "answers": [
                {
                    "text": "True",
                    "explanation": "Inheritance enables code reuse by allowing child classes to inherit and extend base class members.",
                    "is_correct": true
                },
                {
                    "text": "False",
                    "explanation": "This statement is correct regarding inheritance.",
                    "is_correct": false
                },
                {
                    "text": "Not Sure",
                    "explanation": "Review the concept of inheritance in OOP.",
                    "is_correct": false
                },
                {
                    "text": "Only applies to interfaces",
                    "explanation": "Inheritance applies to classes; interfaces define contracts.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "In the context of the Entity Framework, what is the main advantage of LINQ?",
            "category": "Data Access in C# with ADO.NET and Entity Framework",
            "difficulty": "medium",
            "answers": [
                {
                    "text": "It allows developers to write database queries using native C# syntax, which are then translated to the underlying data provider's query language.",
                    "explanation": "LINQ provides a consistent, strongly-typed query syntax in C#, making data retrieval easier and less error-prone.",
                    "is_correct": true
                },
                {
                    "text": "It compiles all .NET applications to machine code.",
                    "explanation": "Compilation is not the main concern of LINQ.",
                    "is_correct": false
                },
                {
                    "text": "It provides direct editing of database schemas in Visual Studio.",
                    "explanation": "LINQ does not edit database schemas; it queries data.",
                    "is_correct": false
                },
                {
                    "text": "It allows database queries only in string format.",
                    "explanation": "LINQ uses C# language constructs, not raw strings.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "An abstract class in C# is used when:",
            "category": "Class Structures and Hierarchies",
            "difficulty": "medium",
            "answers": [
                {
                    "text": "You want to define a base class that cannot be instantiated, but can provide shared members and define abstract methods for derived classes.",
                    "explanation": "Abstract classes are used to provide common functionality and enforce implementation of certain methods in derived classes.",
                    "is_correct": true
                },
                {
                    "text": "You need to prevent any further inheritance from this class.",
                    "explanation": "This describes a sealed class, not an abstract class.",
                    "is_correct": false
                },
                {
                    "text": "You want to directly create objects from this class.",
                    "explanation": "Abstract classes cannot be instantiated directly.",
                    "is_correct": false
                },
                {
                    "text": "You need to define only static methods.",
                    "explanation": "Static methods can be in any class; abstract classes are for code sharing and contracts.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "What type of relationship does aggregation express between classes?",
            "category": "Class Structures and Hierarchies",
            "difficulty": "medium",
            "answers": [
                {
                    "text": "A 'whole-part' relationship where one class is composed of other classes.",
                    "explanation": "Aggregation represents a class (whole) that contains other classes (parts), which is a 'has-a' relationship.",
                    "is_correct": true
                },
                {
                    "text": "A parent-child relationship where one class inherits from another.",
                    "explanation": "That is inheritance, not aggregation.",
                    "is_correct": false
                },
                {
                    "text": "A relationship where two classes can use each other's methods interchangeably.",
                    "explanation": "That describes an association, not aggregation.",
                    "is_correct": false
                },
                {
                    "text": "An enforced one-to-one relationship in a database.",
                    "explanation": "Aggregation is a class composition concept, not a database constraint.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "When would you use a sealed class in C#?",
            "category": "Class Structures and Hierarchies",
            "difficulty": "medium",
            "answers": [
                {
                    "text": "When you want to prevent other classes from inheriting from it.",
                    "explanation": "The sealed keyword prevents further inheritance from the class.",
                    "is_correct": true
                },
                {
                    "text": "When you want to make the class abstract.",
                    "explanation": "Abstract classes enforce inheritance, not prevent it.",
                    "is_correct": false
                },
                {
                    "text": "When you need the class to only have static methods.",
                    "explanation": "Sealed does not relate to static methods.",
                    "is_correct": false
                },
                {
                    "text": "When you want to allow partial implementation.",
                    "explanation": "Partial classes are different from sealed classes.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "Which of the following statements is TRUE regarding DataTables and DataSets in ADO.NET?",
            "category": "Data Access in C# with ADO.NET and Entity Framework",
            "difficulty": "medium",
            "answers": [
                {
                    "text": "A DataSet can contain multiple DataTables and relationships between them.",
                    "explanation": "A DataSet is a collection of DataTables and can manage their relationships, acting like an in-memory database.",
                    "is_correct": true
                },
                {
                    "text": "A DataTable is a collection of DataSets.",
                    "explanation": "Correct relationship is reverse: DataSet contains DataTables.",
                    "is_correct": false
                },
                {
                    "text": "DataSets are used only for executing SQL commands.",
                    "explanation": "Execution of SQL is done by Command objects, not DataSets.",
                    "is_correct": false
                },
                {
                    "text": "DataTables cannot represent relational data.",
                    "explanation": "DataTables represent table structure, and DataSet can relate them.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "Which Visual Studio feature allows developers to debug code by pausing execution at a specific line?",
            "category": ".NET Framework and Visual Studio",
            "difficulty": "easy",
            "answers": [
                {
                    "text": "Breakpoint",
                    "explanation": "Breakpoints let you pause program execution to inspect or step through code.",
                    "is_correct": true
                },
                {
                    "text": "IntelliSense",
                    "explanation": "IntelliSense provides code completion and suggestions, not debugging.",
                    "is_correct": false
                },
                {
                    "text": "Solution Explorer",
                    "explanation": "Solution Explorer helps manage files but doesn't pause code execution.",
                    "is_correct": false
                },
                {
                    "text": "Reference Manager",
                    "explanation": "Reference Manager is for managing dependencies.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "When using the Entity Framework, what file extension is used for the model's XML-based definition?",
            "category": "Data Access in C# with ADO.NET and Entity Framework",
            "difficulty": "easy",
            "answers": [
                {
                    "text": ".edmx",
                    "explanation": "Entity Data Model files are named with .edmx extension.",
                    "is_correct": true
                },
                {
                    "text": ".dll",
                    "explanation": "Assemblies are .dll, but .edmx is for Entity Framework models.",
                    "is_correct": false
                },
                {
                    "text": ".config",
                    "explanation": "Config files store settings, not model schemas.",
                    "is_correct": false
                },
                {
                    "text": ".cs",
                    "explanation": ".cs are code files.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "Why did Microsoft create the Common Language Runtime (CLR) as a core part of the .NET Framework?",
            "category": ".NET Framework and Visual Studio",
            "difficulty": "medium",
            "answers": [
                {
                    "text": "To abstract the code execution from the operating system, managing memory, type safety, security, and more.",
                    "explanation": "CLR provides a managed runtime environment with many services.",
                    "is_correct": true
                },
                {
                    "text": "To enable C# code to run only on Windows 98.",
                    "explanation": "CLR is for code management and is not specific to Windows 98.",
                    "is_correct": false
                },
                {
                    "text": "To require all applications to be written in C++.",
                    "explanation": "The CLR is language-agnostic.",
                    "is_correct": false
                },
                {
                    "text": "To expose direct hardware access to application code.",
                    "explanation": "CLR abstracts hardware details, providing safety instead.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "Fill in the blank: In ADO.NET, the ______ object is responsible for issuing SQL commands and retrieving results from a database.",
            "category": "Data Access in C# with ADO.NET and Entity Framework",
            "difficulty": "easy",
            "answers": [
                {
                    "text": "Command",
                    "explanation": "The Command object issues SQL statements and obtains results.",
                    "is_correct": true
                },
                {
                    "text": "Connection",
                    "explanation": "The Connection object manages the connection, not issuing queries.",
                    "is_correct": false
                },
                {
                    "text": "DataSet",
                    "explanation": "DataSet stores results; commands are issued by Command objects.",
                    "is_correct": false
                },
                {
                    "text": "Adapter",
                    "explanation": "The DataAdapter mediates between the DataSet and the database, but Command objects execute the SQL.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "A collection type in .NET that always returns elements in the last-in, first-out (LIFO) order is called a...",
            "category": ".NET Framework and Visual Studio",
            "difficulty": "medium",
            "answers": [
                {
                    "text": "Stack",
                    "explanation": "Stack collections provide LIFO behavior.",
                    "is_correct": true
                },
                {
                    "text": "Queue",
                    "explanation": "Queue collections are first-in, first-out (FIFO).",
                    "is_correct": false
                },
                {
                    "text": "ArrayList",
                    "explanation": "ArrayLists have no enforced order for adding/removing.",
                    "is_correct": false
                },
                {
                    "text": "Dictionary",
                    "explanation": "Dictionaries map keys to values, not defined by order.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "Which technology in .NET is designed to bridge the gap between the object-oriented structure of a programming language and the relational data structure of a database?",
            "category": "Data Access in C# with ADO.NET and Entity Framework",
            "difficulty": "medium",
            "answers": [
                {
                    "text": "Entity Framework",
                    "explanation": "Entity Framework is an ORM (object-relational mapper) that facilitates this mapping.",
                    "is_correct": true
                },
                {
                    "text": "ADO.NET DataReader",
                    "explanation": "DataReader is for fast, forward-only reading of data.",
                    "is_correct": false
                },
                {
                    "text": "DataAdapter",
                    "explanation": "DataAdapter bridges DataSet and Command objects, but does not deal with ORM.",
                    "is_correct": false
                },
                {
                    "text": "LINQ to Objects",
                    "explanation": "LINQ to Objects is for querying in-memory objects.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "Which access modifier in C# ensures that a class member is accessible only within the class itself?",
            "category": "Class Structures and Hierarchies",
            "difficulty": "easy",
            "answers": [
                {
                    "text": "private",
                    "explanation": "private restricts access to the same class only.",
                    "is_correct": true
                },
                {
                    "text": "protected",
                    "explanation": "protected allows access in the class and derived classes.",
                    "is_correct": false
                },
                {
                    "text": "internal",
                    "explanation": "internal allows access anywhere within the same assembly.",
                    "is_correct": false
                },
                {
                    "text": "public",
                    "explanation": "public allows access from anywhere.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "Which of the following is NOT a core characteristic of Object-Oriented Programming (OOP)?",
            "category": "Object-Oriented Programming Fundamentals",
            "difficulty": "easy",
            "answers": [
                {
                    "text": "Abstraction",
                    "explanation": "Abstraction is a core OOP characteristic, allowing you to focus on relevant data.",
                    "is_correct": false
                },
                {
                    "text": "Encapsulation",
                    "explanation": "Encapsulation is a key OOP characteristic, involving hiding data within objects.",
                    "is_correct": false
                },
                {
                    "text": "Inheritance",
                    "explanation": "Inheritance is an OOP characteristic, allowing creation of new classes based on existing ones.",
                    "is_correct": false
                },
                {
                    "text": "Linear execution",
                    "explanation": "Linear execution is a characteristic of procedural programming, not OOP.",
                    "is_correct": true
                }
            ]
        },
        {
            "question": "Which programming paradigm organizes programs in a linear sequence of steps?",
            "category": "Object-Oriented Programming Fundamentals",
            "difficulty": "medium",
            "answers": [
                {
                    "text": "Object-Oriented Programming",
                    "explanation": "OOP centers on object interaction, not linear steps.",
                    "is_correct": false
                },
                {
                    "text": "Procedural Programming",
                    "explanation": "Procedural programming organizes code in a series of sequential steps.",
                    "is_correct": true
                },
                {
                    "text": "Functional Programming",
                    "explanation": "Functional programming focuses on functions and immutability.",
                    "is_correct": false
                },
                {
                    "text": "Declarative Programming",
                    "explanation": "Declarative programming expresses logic without explicit control flow.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "What is encapsulation in OOP?",
            "category": "Object-Oriented Programming Fundamentals",
            "difficulty": "easy",
            "answers": [
                {
                    "text": "Hiding the implementation details of data and exposing only necessary functionalities",
                    "explanation": "Encapsulation restricts access to object data and only allows interaction through methods/properties.",
                    "is_correct": true
                },
                {
                    "text": "Dividing code into repeated steps",
                    "explanation": "This is related to procedural decomposition, not encapsulation.",
                    "is_correct": false
                },
                {
                    "text": "Combining multiple classes into a single class",
                    "explanation": "This misrepresents encapsulation; aggregation/composition would be about combining classes.",
                    "is_correct": false
                },
                {
                    "text": "Allowing multiple meanings for a single function name",
                    "explanation": "This is polymorphism.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "Which of the following best describes polymorphism in OOP?",
            "category": "Object-Oriented Programming Fundamentals",
            "difficulty": "medium",
            "answers": [
                {
                    "text": "Enabling objects to respond to the same message in unique ways",
                    "explanation": "Polymorphism means different objects can execute different behaviors in response to the same method call.",
                    "is_correct": true
                },
                {
                    "text": "Protecting object data from outside access",
                    "explanation": "That is encapsulation.",
                    "is_correct": false
                },
                {
                    "text": "Relating classes through inheritance only",
                    "explanation": "Inheritance is just the relationship, not polymorphism.",
                    "is_correct": false
                },
                {
                    "text": "Dividing objects into smaller pieces",
                    "explanation": "That would be more related to decomposition.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "Which was the first programming language to introduce OOP concepts in the 1960s?",
            "category": "Object-Oriented Programming Fundamentals",
            "difficulty": "medium",
            "answers": [
                {
                    "text": "Simula",
                    "explanation": "Simula, developed in the 1960s, introduced the first OOP concepts.",
                    "is_correct": true
                },
                {
                    "text": "Smalltalk",
                    "explanation": "Smalltalk popularized OOP later, but was not the first.",
                    "is_correct": false
                },
                {
                    "text": "C++",
                    "explanation": "C++ came after OOP concepts were established.",
                    "is_correct": false
                },
                {
                    "text": "Java",
                    "explanation": "Java was designed in the 1990s.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "In the evolution of C# and .NET, which feature was introduced to help eliminate impedance mismatch between programming and database languages?",
            "category": "Fundamental Programming Concepts in C#",
            "difficulty": "hard",
            "answers": [
                {
                    "text": ".NET Remoting",
                    "explanation": "Remoting is about distributed objects, not data access.",
                    "is_correct": false
                },
                {
                    "text": "Generics",
                    "explanation": "While generics are useful for type safety, they don't directly solve language-database mismatch.",
                    "is_correct": false
                },
                {
                    "text": "LINQ",
                    "explanation": "LINQ (Language Integrated Query) was introduced to seamlessly work with data, bridging programming and database query languages.",
                    "is_correct": true
                },
                {
                    "text": "Reflection",
                    "explanation": "Reflection is used to inspect metadata at runtime.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "Which relationship is modeled in UML when a class includes instances of other classes as its parts?",
            "category": "Unified Modeling Language (UML) and OOP Design",
            "difficulty": "medium",
            "answers": [
                {
                    "text": "Inheritance",
                    "explanation": "Inheritance involves a parent and child class, not parts.",
                    "is_correct": false
                },
                {
                    "text": "Aggregation",
                    "explanation": "Aggregation models a whole-part relationship where one class contains or is composed of other classes.",
                    "is_correct": true
                },
                {
                    "text": "Association",
                    "explanation": "Association is a broader relationship type, not specifically whole-part.",
                    "is_correct": false
                },
                {
                    "text": "Polymorphism",
                    "explanation": "Polymorphism is not a structural relationship.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "Which diagram is primarily used in UML to show the dynamic interaction between objects over time?",
            "category": "Unified Modeling Language (UML) and OOP Design",
            "difficulty": "medium",
            "answers": [
                {
                    "text": "Class diagram",
                    "explanation": "Class diagrams show static structure.",
                    "is_correct": false
                },
                {
                    "text": "Sequence diagram",
                    "explanation": "Sequence diagrams illustrate object interactions over time.",
                    "is_correct": true
                },
                {
                    "text": "Use case diagram",
                    "explanation": "Use case diagrams show system functionality from a user's perspective.",
                    "is_correct": false
                },
                {
                    "text": "Activity diagram",
                    "explanation": "Activity diagrams show the flow of actions or activities.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "What does the 'extends' relationship indicate between use cases in UML?",
            "category": "Unified Modeling Language (UML) and OOP Design",
            "difficulty": "medium",
            "answers": [
                {
                    "text": "The base use case is always called before the extended use case.",
                    "explanation": "That's not correct; 'include' is always called before. 'Extends' means the base use case can be extended by condition.",
                    "is_correct": false
                },
                {
                    "text": "A use case may add behavior to another use case under certain conditions.",
                    "explanation": "The 'extends' relationship means the additional behavior is only triggered given some condition.",
                    "is_correct": true
                },
                {
                    "text": "Two use cases are always executed together.",
                    "explanation": "This describes 'include', not 'extend'.",
                    "is_correct": false
                },
                {
                    "text": "There is no direct relationship, it's just documentation.",
                    "explanation": "'Extends' has semantic meaning in UML use cases.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "Which of the following best describes the purpose of a Software Requirements Specification (SRS) in object-oriented design?",
            "category": "Unified Modeling Language (UML) and OOP Design",
            "difficulty": "easy",
            "answers": [
                {
                    "text": "Describes the technical solution of the system",
                    "explanation": "SRS focuses on functional requirements and business logic, not the technical solution.",
                    "is_correct": false
                },
                {
                    "text": "Defines the scope, users, and functional requirements of the system",
                    "explanation": "SRS clearly defines the functional requirements, users, interactions, and scope.",
                    "is_correct": true
                },
                {
                    "text": "Shows the interactions between objects at runtime",
                    "explanation": "That is the purpose of a sequence diagram.",
                    "is_correct": false
                },
                {
                    "text": "Lists the names of the database tables",
                    "explanation": "That's the job of data modeling documents, not the SRS.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "When modeling an association in a UML class diagram, what does 'multiplicity' mean?",
            "category": "Unified Modeling Language (UML) and OOP Design",
            "difficulty": "medium",
            "answers": [
                {
                    "text": "The number of classes participating in inheritance",
                    "explanation": "Multiplicity refers to the number of instances related through an association.",
                    "is_correct": false
                },
                {
                    "text": "The number of operations in a class",
                    "explanation": "Operations are not related to association multiplicity.",
                    "is_correct": false
                },
                {
                    "text": "The number of instances of one class that can be associated with another class",
                    "explanation": "Correct. Multiplicity indicates, for example, 1..n or 0..1 relationships.",
                    "is_correct": true
                },
                {
                    "text": "The number of diagrams in the UML model",
                    "explanation": "This is not a UML association concept.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "Fill in the blank: In C#, the keyword used to prevent a class from being inherited by other classes is __________.",
            "category": "Fundamental Programming Concepts in C#",
            "difficulty": "easy",
            "answers": [
                {
                    "text": "static",
                    "explanation": "static is for static members, not for preventing inheritance.",
                    "is_correct": false
                },
                {
                    "text": "private",
                    "explanation": "private restricts access, not inheritance.",
                    "is_correct": false
                },
                {
                    "text": "sealed",
                    "explanation": "The sealed keyword is used to prevent further inheritance.",
                    "is_correct": true
                },
                {
                    "text": "abstract",
                    "explanation": "abstract allows for base class functionality but requires implementation by derivation.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "What UML diagram is most useful for visualizing workflows, branching, and parallel processing in a system?",
            "category": "Unified Modeling Language (UML) and OOP Design",
            "difficulty": "medium",
            "answers": [
                {
                    "text": "Sequence diagram",
                    "explanation": "Sequence diagrams show interactions over time, but do not represent branching/parallel workflows well.",
                    "is_correct": false
                },
                {
                    "text": "Activity diagram",
                    "explanation": "Activity diagrams are designed for process flows with branches and parallel paths.",
                    "is_correct": true
                },
                {
                    "text": "Class diagram",
                    "explanation": "Class diagrams show static structure.",
                    "is_correct": false
                },
                {
                    "text": "Collaboration diagram",
                    "explanation": "Collaboration diagrams focus on communications rather than control flow.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "Which of these would NOT typically be considered a candidate 'class' when analyzing a system in OOP design?",
            "category": "Unified Modeling Language (UML) and OOP Design",
            "difficulty": "medium",
            "answers": [
                {
                    "text": "Book",
                    "explanation": "Book is a classic example of a class.",
                    "is_correct": false
                },
                {
                    "text": "Fine",
                    "explanation": "Fine tracks money owed and could encapsulate state and behavior.",
                    "is_correct": false
                },
                {
                    "text": "Manager",
                    "explanation": "Manager represents a type of object interacting with the system.",
                    "is_correct": false
                },
                {
                    "text": "Number",
                    "explanation": "Number is usually just a value/attribute, not a business class.",
                    "is_correct": true
                }
            ]
        },
        {
            "question": "In object-oriented programming, what is the primary benefit of using inheritance?",
            "category": "Object-Oriented Programming Fundamentals",
            "difficulty": "easy",
            "answers": [
                {
                    "text": "Code duplication",
                    "explanation": "Inheritance helps reduce code duplication by reusing base class code.",
                    "is_correct": false
                },
                {
                    "text": "Reusing and extending functionality in a hierarchical manner",
                    "explanation": "Inheritance enables reuse and extension of base class functionality by derived classes.",
                    "is_correct": true
                },
                {
                    "text": "Hiding data from users",
                    "explanation": "That's encapsulation.",
                    "is_correct": false
                },
                {
                    "text": "Allowing unrelated objects to interact",
                    "explanation": "That's association/aggregation.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "What is the primary purpose of a CASE tool like UMLet in the software design process?",
            "category": "Unified Modeling Language (UML) and OOP Design",
            "difficulty": "easy",
            "answers": [
                {
                    "text": "Writing production code",
                    "explanation": "CASE tools are for design and modeling, not actual coding.",
                    "is_correct": false
                },
                {
                    "text": "Drafting and sharing professional-quality design diagrams",
                    "explanation": "CASE tools allow for rapid creation and easy sharing of diagrams, improving design documentation and communication.",
                    "is_correct": true
                },
                {
                    "text": "Managing databases",
                    "explanation": "Database management is not the focus of CASE tools like UMLet.",
                    "is_correct": false
                },
                {
                    "text": "Running code tests",
                    "explanation": "Testing is not the primary purpose of CASE tools.",
                    "is_correct": false
                }
            ]
        },
        {
            "question": "In the class diagram below, which relationship is represented by an open arrow from 'RetailCustomer' to 'Customer'?\n\nCustomer\n^ (open arrow)\nRetailCustomer",
            "category": "Unified Modeling Language (UML) and OOP Design",
            "difficulty": "medium",
            "answers": [
                {
                    "text": "Association",
                    "explanation": "Association is a direct link but not hierarchy.",
                    "is_correct": false
                },
                {
                    "text": "Inheritance",
                    "explanation": "An open arrow in UML class diagrams indicates that the child (RetailCustomer) inherits from the parent (Customer).",
                    "is_correct": true
                },
                {
                    "text": "Aggregation",
                    "explanation": "Aggregation is represented by a diamond, not an arrow.",
                    "is_correct": false
                },
                {
                    "text": "Dependency",
                    "explanation": "Dependency is typically a dashed arrow.",
                    "is_correct": false
                }
            ]
        }
    ],
    "topic": "Beginning C# Object-Oriented Programming, .NET Framework, Visual Studio, and Data Access with ADO.NET/EF"
}

How many token actually used?

To track the actual token usage during API calls, you can extract token counts from the response object using the following Python function::


def get_token_usage(response) -> dict:
    return {
        "input_tokens": response.usage.input_tokens,
        "output_tokens": response.usage.output_tokens,
        "total_tokens": response.usage.total_tokens,
    }

Topic Extraction

Used tokens: {'input_tokens': 174265, 'output_tokens': 1294, 'total_tokens': 175559}

Quiz Generation

Used tokens: {'input_tokens': 174283, 'output_tokens': 3663, 'total_tokens': 177946}

These numbers confirm that GPT-4.1 can effectively handle large inputs—such as an entire technical book—within a single context window, while still producing structured and detailed outputs. Why GPT-4o won’t work here? Because its maximum context window is limited to 128,000 tokens. In contrast, GPT-4.1 supports up to 1 million tokens, making it uniquely suited for tasks like:

Processing entire books in one pass
Preserving full context for topic extraction
Generating high-quality, context-aware quizzes

For large-scale, text-heavy applications, GPT-4.1 remains the better choice due to its extended memory and deeper language understanding.

What improvement could be made?

There are several ways it could be further improved for greater accuracy usability, and educational value. Even though models like GPT-4.1 are significantly more accurate and capable than previous versions, they are still not immune to hallucination — the generation of plausible-sounding but factually incorrect or misleading content. And yes, double-checking is still essential.

Source Citations for Each Question

Include citations or references to the original book content (e.g., chapter or section number) for each quiz question. This helps learners:

Trace concepts back to the source
Deepen understanding through contextual reading

Validate Quiz Content

This acts as a peer-review stage and improves overall reliability, it reviews the generated quizzes for:

Accuracy and consistency with the source material
Avoidance of trick questions or ambiguous phrasing

By linking each quiz question to a specific section or chapter of the book, you can scope down the validation process. This makes it easier to check the integrity of the question without re-reading the entire book:

Does this question reference content from Chapter X of the book? Prompt: Given the following quiz question and the corresponding source excerpt from the book, determine if the question is factually supported by the source. Explain your reasoning.
Is the correct answer explicitly or implicitly stated in the cited section? Prompt: Is the correct answer to this quiz question explicitly or implicitly mentioned in the source material below? If not, flag it as unsupported.
Are there any misleading or unsupported distractors? Prompt: Review the following multiple-choice question and its options. Identify any distractors (incorrect options) that might be misleading, technically inaccurate, or too ambiguous given the source context.
Clarity and Bias Check. Prompt: Evaluate this quiz question for clarity and neutrality. Is the wording unambiguous? Could it confuse or mislead learners? Suggest a clearer version if needed.
Chapter Reference Validation: Prompt: Does the content of this quiz question appear in Chapter X of the book Beginning C# Object-Oriented Programming (2nd Edition)? Justify your answer briefly.
Learning Objective Alignment: Prompt: Does this question align with the learning objectives of the chapter it’s based on? If not, explain why and suggest a more relevant question.

Difficulty Grading and Bloom’s Taxonomy Alignment

Tag questions with difficulty levels such as:

Basic (Recall)
Intermediate (Understanding)
Advanced (Application, Analysis)

Or go further by aligning them with Bloom’s Taxonomy levels: Remember → Understand → Apply → Analyze → Evaluate → Create

Cross-Referencing with External Knowledge

Enrich the quiz by:

Validating content against documentation (e.g., Microsoft Docs for C#)
Adding real-world examples or use cases to make questions more practical

Author Nguyen Luu
Link https://nguyenluu.dev/post/2025/05/29/Automating-Quiz-Generation-from-Books-with-GPT-4-1
License by-nc-sa

Comments

Home

Automating Quiz Generation from Books with GPT-4.1