Paul Deitel - Deitel & Associates, Inc.

Building OpenAI API-Based Java GenAI Applications—A Guide to the Deitel Videos on the O’Reilly Online Learning Subscription Site

by Paul Deitel | Dec 15, 2025 | AI, GenAI, General, Java, OpenAI | 0 comments

Image for the OpenAI APIs we demonstrate

[Estimated reading time for this document: 20 minutes. Estimated time to watch the linked videos and run the Java code: 5 hours. Please share this guide with your friends and colleagues who might find it helpful.]

This comprehensive guide overviews Lesson 19, Building OpenAI API-Based Java Generative AI Applications, from my Java Fundamentals video course on O’Reilly Online Learning. The lesson focuses on building Java apps using OpenAI’s generative AI (genAI) APIs and the official openai-java library. This document guides you through my hands-on code examples and provides “Try It” exercises for experimenting with the APIs. You’ll leverage the OpenAI APIs to create intelligent, multimodal apps that understand, generate and manipulate text, code, images, audio and video content.

This guide links you to 34 videos totaling about 4.75 hours in Lesson 19 of our Java Fundamentals video course, in which Paul Deitel presents fully coded Java genAI apps that use the OpenAI APIs to

summarize documents
determine text’s sentiment (positive, neutral or negative)
use vision capabilities to generate accessible image descriptions
translate text among spoken languages
generate and manipulate Java code
extract from text named entities, such as people, places, organizations, dates, times, events, products, …
transcribe speech to text
synthesize speech from text, using one of OpenAI’s 11 voices and prompts that control style and tone
create original images
transfer art styles to images via text prompts
transfer styles between images
generate video closed captions
filter inappropriate content
generate and remix videos (under development at the time of this writing—uses OpenAI’s recently released Sora 2 API)
build agentic AI apps (under development at the time of this writing—uses OpenAI’s recently released AgentKit)

The remaining videos overview concepts and present genAI prompt and coding exercises you can use to dig deeper into the covered topics.

Videos: Building API-Based Java GenAI Applications: Overview (7m 29s)
Videos: This Lesson Is Under Development (2m 42s)—Discusses my plans for enhancing this lesson with new examples as I continually create them.

How We Formed This Guide

We converted this document from our corresponding Python version. The initial Python draft was created using five genAIs—OpenAI’s ChatGPT, Google’s Gemini, Anthropic’s Claude, Microsoft’s Copilot and Perplexity. We provided each with

a detailed prompt and
a Chapter 18 draft from our forthcoming Python for Programmers, 2/e product suite.

We asked Claude to summarize the results, and tuned the summary to create the Python version of this guide, then updated it for the Java version of the videos discussed in this document.

Contacting Me with Questions

The OpenAI APIs are evolving rapidly. If you run into problems while working through the examples or find that something has changed, check the Deitel blog or email paul@deitel.com.

Downloading the Code

Go to the Java Fundamentals, 3/e GitHub Repository to get the source code that accompanies the videos referenced in this guide. The OpenAI API examples are located in the examples/ch19 folder.

Suggested Learning Workflow

If you watch the videos, you’ll get a code-example-rich intro to programming with the OpenAI APIs. To learn how to work with various aspects of the OpenAI APIs, I suggest that you:

Watch the video for each example.
Run the provided Java code.
Complete the “Try It” coding challenges.
Experiment by creatively combining APIs (e.g., transcribe audio then translate, or generate images with accessibility descriptions).

Key Takeaways

This comprehensive guide and the corresponding videos present practical skills for harnessing the power of OpenAI’s genAI APIs. You’ll:

Master OpenAI APIs in Java and perform creative prompt engineering.
Build complete, functional, multimodal apps that create and manipulate text, code, images, audio and video.
Implement responsible accessibility and content moderation practices.

Caution: GenAIs make mistakes and even “hallucinate.” You should always verify their outputs.

Introduction

This video overviews the required official openai-java library, OpenAI’s fee-based API model, and monitoring and managing API usage costs.

Video: Introduction (9m 45s)

OpenAI APIs

This video overviews the OpenAI APIs and models I’ll demo in this lesson.

Video: OpenAI APIs (1m 36s)
OpenAI Documentation: API Reference
Try It: Browse the OpenAI API documentation and review the API subcategories.
Try It: Prompt genAIs for an overview of responsible AI practices.

OpenAI Developer Account and API Key

Here, you’ll learn how to create your OpenAI developer account, generate an API key and securely store it in an environment variable. This required setup step will enable your apps to authenticate with OpenAI so they can make API calls. You’ll understand best practices for securing your API key. The OpenAI API is a paid service. If, for the moment, you do not want to code with paid APIs, reading this document, watching the videos and reading the code is still valuable.

Video: OpenAI Developer Account and API Key (8m 2s)
OpenAI Documentation: Account Setup, API Keys
Try It: Create your OpenAI developer account, generate your first API key and store it securely using an environment variable.

Text Generation Via the Responses API

My text-generation examples introduce the Responses API, OpenAI’s primary text-generation interface. I show how to structure prompts, configure parameters, invoke the API and interpret responses. This API enables sophisticated conversational AI applications and is the foundation for many text-based genAI tasks.

Video: Text Generation Via the Responses API: Overview (6m 8s)
OpenAI Documentation: Text Generation Guide

Text Summarization

In this lengthy video, I provide the foundation you’ll need to work with openai-java in the subsequent examples. I use OpenAI’s natural language understanding capabilities to condense documents into concise summaries. I discuss crafting summarization prompts to control summary size and style. Text summarization is invaluable for efficiently processing large documents, articles and reports.

Video: Text Summarization (37m 13s)
Video: Text Summarization: GenAI Prompting and Coding Exercises (3m 12s)
Video: Mapping openai-java Classes and Methods to OpenAI API Documentation (3m 53s)
Try It: Create a summarization tool that takes a long article and generates brief, moderate and detailed summaries.

Sentiment Analysis

This example uses OpenAI’s natural language understanding capabilities to analyze a text’s emotional tone and sentiment. It classifies text as positive, neutral or negative and asks the model to explain how it came to that conclusion.

Video: Sentiment Analysis (6m 48s)
Try It: Build a sentiment analyzer that classifies the sentiment of customer reviews and asks the genAI model to provide a confidence score from 0.0 to 1.0 for each, indicating the likelihood that the classification is correct. Confidence scores closer to 1.0 are more likely to be correct.

Vision: Accessible Image Descriptions

Here, I use OpenAI’s vision capabilities to generate brief and detailed image descriptions, making images accessible to users who are blind or have low vision.

Video: Vision: Accessible Image Descriptions (23m 48s)
Video: Accessible Image Descriptions: GenAI Prompting and Coding Exercises
(1m 3s)
OpenAI Documentation: Images and Vision Guide

Try It: Create an application that takes URLs for various images and generates both brief and comprehensive accessible descriptions suitable for screen readers.

Language Detection and Translation

In this example, I use OpenAI’s multilingual capabilities to auto–detect the language text is written in and translate text to other spoken languages.

Video: Language Detection and Translation (7m 51s)
Video: Language Detection and Translation: GenAI Prompting Exercises (54s)
Try It: Build a translation tool that detects the input language and translates to a target language, preserving tone and context.

Code Generation

Discover how AI can generate, explain, and debug code across multiple programming languages. The first video covers code generation, understanding AI-generated code quality, and using AI as a coding assistant. In the second video, I discuss how genAIs can assist you with coding, including code generation, testing, debugging, documenting, refactoring, performance tuning, security and more.

Video: Code Generation (11m 9s)
Video: Code Generation: Other AI Code Capabilities (3m 35s)
Video: Code Generation: GenAI Prompting Exercises (3m 7s)
Try It: In a text prompt, describe the requirements for a method you need and submit a request to the Responses API to generate that method and provide test cases to show it works correctly. If not, call the Responses API again with the generated code and a prompt to refine the code.

Named Entity Recognition (NER) and Structured Outputs

In this example, I use OpenAI’s natural language understanding capabilities and named entity recognition to extract structured information from unstructured text, identifying entities such as people, places, organizations, dates, times, events, products, and more. The example shows that OpenAI’s APIs can return outputs as formatted, human-and-computer readable JSON (JavaScript Object Notation). NER is essential for building applications that process and organize information from documents and text sources.

Video: Named Entity Recognition (NER) and Structured Outputs (19m 50s)
Video: NER and Structured Outputs: Code and Prompt Exercises (5m 22s)
OpenAI Documentation: Structured Model Outputs Guide
Try It: Modify the NER example to perform parts-of-speech (POS) tagging—identifying each word’s part of speech (e.g., noun, verb, adjective, etc.) in a sentence. Use genAIs to research the commonly used tag sets for POS tagging. Prompt the model to return a structured JSON response with the parts of speech for the words in the supplied text. Display each word with its part of speech. Use these record classes:
```
public record PartOfSpeech(String text, String part) {}
public record PartsOfSpeech(List parts) {}
```

Try ItModify the NER example to translate text into multiple languages and display the results Prompt the model to translate the text it receives to the specified languages and to return only JSON-structured data in the following format:

{
   "original_text": original_text_string,
   "original_language": original_text_language_code,
   "translations": [
      {
         "language": translated_text_language_code,
         "translation": translated_text_string
      }
   ]
}

Try It: Create an NER tool that extracts and displays key entities from news articles.

Speech Recognition and Speech Synthesis

This video introduces speech–to–text transcription and text–to–speech conversion (speech synthesis) concepts for working with audio input and output in your apps. You’ll understand the models used in the transcription and synthesis examples, and explore the speech voices via OpenAI’s voice demo site—https://openai.fm.

Video: Speech Recognition and Speech Synthesis: Overview (8m 20s)
OpenAI Documentation: Speech to Text Guide
OpenAI Documentation: Text to Speech Guide
Try It: Try all the voices at https://openai.fm. Which do you prefer?

English Speech-to-Text (STT) for Audio Transcription

In this example, I convert spoken audio to text. Speech-to-text technology enables applications like automated transcription services, voice commands, and accessibility features.

Video: English Speech-to-Text for Audio Transcription (10m 24s)
Video: English Speech-to-Text for Audio Transcription: Generative AI Prompt Exercises (3m 5s)
OpenAI Documentation: Speech to Text Guide
Try It: Build a transcription tool that converts .mp3 and .m4a audio files to text.

Text-To-Speech (TTS)

In this example, I convert written text into natural-sounding speech with one of OpenAI’s 11 voice options. I discuss selecting voice options, specifying speech style and tone, and generating audio files. Text-to-speech technology is crucial for creating voice assistants, audiobook generation, and accessibility applications.

Video: Text-To-Speech (14m 25s)
Video: Text-To-Speech: Generative AI Prompting and Coding Exercises (1m 45s)
OpenAI Documentation: Text to Speech Guide
Try It: Create an app that converts documents to audio files with selectable voices.

Image Generation

Here, I create original images from text descriptions using OpenAI’s latest image-generation model. Image generation opens possibilities for creative content, design mockups, and visual storytelling.

Video: Image Generation: Overview (6m 47s)
Video: Image Generation (9m 26s)
Video: Image Generation — Generative AI Prompting Exercises (1m 15s)
OpenAI Documentation: Images and Vision Guide
Try It: Build an image-generation tool that creates variations based on text prompts.

Image Style Transfer

In two examples, I apply artistic styles to existing images using the Images API’s edit capability with style-transfer prompts and the Responses API’s image generation tool to transfer the style of one image to another.

Video: Image Style Transfer: Overview (2m 37s)
Video: Style Transfer via the Images API’s Edit Capability and a Style-Transfer Prompt (15m 42s)
Video: Style Transfer Via the Responses API’s Image Generation Tool (21m 9s)
Video: Image Style Transfer: Generative AI Prompting Exercises (56s)
OpenAI Documentation: Images and Vision Guide
Try It: Create a style transfer application that transforms user photos into different artistic styles, such as Vincent van Gogh, Leonardo da Vinci and others.

Generating Closed Captions from a Video’s Audio Track

In this example, I generate closed captions from a video file’s audio track using OpenAI’s audio transcription capabilities. Closed captions enhance video accessibility and improve content searchability. This example covers caption formatting standards, audio extraction techniques and using the OpenAI Whisper-1 model, which supports generating captions with timestamps. I then use the cross-platform VLC Media Player to overlay the closed captions on the corresponding video.

Video: Generating Closed Captions from a Video’s Audio Track (11m 18s)
OpenAI Documentation: Speech to Text Guide
Try It: Build a caption generator that programmatically extracts audio from videos and creates properly formatted subtitle files. Investigate the moviepy module for conveniently extracting a video’s audio track in Java.

Content Moderation

Here, I use OpenAI’s Moderation APIs to detect and filter inappropriate or harmful text and images—essential techniques for platforms hosting user-generated content. Paul presents moderation categories and severity levels, demonstrates the Moderation API with text inputs and discusses image moderation.

Video: Content Moderation (13m 12s)
Video: Content Moderation: Generative AI Prompting Exercise (1m 7s)
OpenAI Documentation: Moderation Guide
Try It: Create a content moderation system that screens user submissions and flags potentially problematic content.

Sora 2 Video Generation

Coming soon: This video introduces OpenAI’s recently Video API. I use the Sora 2 model in prompt-to-video, image-to-video and video remixing examples. I will add these videos based to the lesson as soon as I complete them.

Video: Coming soon.
OpenAI Documentation: Video Generation with Sora Guide
OpenAI Documentation: Videos API
Try It: Experiment with text-to-video prompts and explore the creative possibilities of AI video generation.

Closing Note

As I develop additional OpenAI API-based apps, I will add new videos to this Building API-Based Java GenAI Applications Java Fundamentals lesson. Some new example possibilities include:

Generating and remixing videos with OpenAI’s Sora 2 API.
Using OpenAI’s Realtime Audio APIs for speech-to-speech apps.
Building AI agents with OpenAI’s AgentKit.
Single-tool AI agents.
Multi-tool AI agents.
Single-agent applications.
Multi-agent applications.
Managing AI conversations that maintain state between Responses API calls.

Try It: Review the course materials and start planning your own GenAI applications using the techniques learned. Enjoy!

Additional Resources

OpenAI Platform Documentation: https://platform.openai.com
OpenAI Community Forum: https://community.openai.com
Official OpenAI Java Library: https://github.com/openai/openai-java

Building OpenAI API-Based Python GenAI Applications—A Guide to the Deitel Videos on the O’Reilly Online Learning Subscription Site

by Paul Deitel | Oct 24, 2025 | AI, GenAI, OpenAI, Python | 0 comments

[Estimated reading time for this document: 20 minutes. Estimated time to watch the linked videos and run the Python code: 4.5 hours. Please share this guide with your friends and colleagues who might find it helpful.]

This comprehensive guide overviews Lesson 18, Building OpenAI API-Based Python Generative AI Applications, from my Python Fundamentals video course on O’Reilly Online Learning. The lesson focuses on building Python apps using OpenAI’s generative AI (genAI) APIs. This document guides you through my hands-on code examples and provides Try It exercises so you can experiment with the APIs. You’ll leverage the OpenAI APIs to create intelligent, multimodal apps that understand, generate and manipulate text, code, images, audio and video content.

This guide links you to 31 videos totaling about 3.5 hours in my Python Fundamentals video course in which I present fully coded Python genAI apps that use the OpenAI APIs to

summarize documents
determine text’s sentiment (positive, neutral or negative)
use vision capabilities to generate accessible image descriptions
translate text among spoken languages
generate and manipulate Python code
extract from text named entities, such as people, places, organizations, dates, times, events, products, …
transcribe speech to text
synthesize speech from text, using one of OpenAI’s 11 voices and prompts that control style and tone
create original images
transfer art styles to images via text prompts
transfer styles between images
generate video closed captions
filter inappropriate content
generate and remix videos (under development at the time of this writing—uses OpenAI’s recently released Sora 2 API)
build agentic AI apps (under development at the time of this writing—uses OpenAI’s recently released AgentKit)

The remaining videos overview concepts and present genAI prompt and coding exercises you can use to dig deeper into the covered topics.

Videos:

Building API-Based Python GenAI Applications: Overview (8m 1s)
This Lesson Is Under Development (2m 25s)—Discusses my plans for enhancing this lesson with new examples as I continually create them.

How I Formed This Guide

I created the initial draft of this guide using five genAIs—OpenAI’s ChatGPT, Google’s Gemini, Anthropic’s Claude, Microsoft’s Copilot and Perplexity. I provided each with

a detailed prompt,
a Chapter 18 draft from our forthcoming Python for Programmers, 2/e product suite and
a list of the video titles and links you’ll find in this guide.

I then asked Claude to summarize the results, and tuned the summary to create this blog post.

Contacting Me with Questions

The OpenAI APIs are evolving rapidly. If you run into problems while working through the examples or find that something has changed, check the Deitel blog or send an email to paul@deitel.com.

Downloading the Code

Go to the Python Fundamentals, 2/e GitHub Repository to get the source code that accompanies the videos referenced in this guide. The OpenAI API examples are located in the examples/18 folder. Book chapter numbers and corresponding video lesson numbers are subject to change while the second edition of our Python product suite is under development.

Suggested Learning Workflow

If you watch the videos, you’ll get a code-example-rich intro to programming with the OpenAI APIs. To learn how to work with various aspects of the OpenAI APIs, I suggest that you:

Watch the video for each example.
Run the provided Python code.
Complete the “Try It” coding challenges.
Experiment by creatively combining APIs (e.g., transcribe audio then translate, or generate images with accessibility descriptions).

Key Takeaways

This comprehensive guide and the corresponding videos present practical skills for harnessing the power of OpenAI’s genAI APIs. You’ll:

Master OpenAI APIs in Python and perform creative prompt engineering.
Build complete, functional, multimodal apps that create and manipulate text, code, images, audio and video.
Implement responsible accessibility and content moderation practices.

Caution: GenAIs make mistakes and even “hallucinate.” You should always verify their outputs.

Introduction

In this video, I discuss the required official openai Python module, OpenAI’s fee-based API model, and monitoring and managing API usage costs.

Video: Introduction (6m)

OpenAI APIs

Here, I overview the OpenAI APIs and models I’ll demo in this lesson.

Video: OpenAI APIs (2m 29s)

OpenAI Documentation: API Reference

Try It: Browse the OpenAI API documentation and review the API subcategories.

Try It: Prompt genAIs for an overview of responsible AI practices.

OpenAI Developer Account and API Key

Here, you’ll learn how to create your OpenAI developer account, generate an API key and securely store it in an environment variable. This required setup step will enable your apps to authenticate with OpenAI so they can make API calls. You’ll understand best practices for securing your API key. The OpenAI API is a paid service. If, for the moment, you do not want to code with paid APIs, reading this document, watching the videos and reading the code is still valuable.

Video: OpenAI Developer Account and API Key (8m 45s)

OpenAI Documentation: Account Setup, API Keys

Try It: Create your OpenAI developer account, generate your first API key and store it securely using an environment variable.

Text Generation Via the Responses API

My text-generation examples introduce the Responses API, OpenAI’s primary text-generation interface. I show how to structure prompts, configure parameters, invoke the API and interpret responses. This API enables sophisticated conversational AI applications and is the foundation for many text-based genAI tasks.

Video: Text Generation Via the Responses API: Overview (4m 58s)

OpenAI Documentation: Text Generation Guide

Text Summarization

Here, I use OpenAI’s natural language understanding capabilities to condense lengthy text into concise summaries. This example covers crafting summarization prompts and controlling summary length and style. Text summarization is invaluable for efficiently processing large documents, articles and reports.

Videos:

Text Summarization (13m 16s)
Text Summarization: GenAI Prompting Exercises (3m 12s)

Try It: Create a summarization tool that takes a long article and generates brief, moderate and detailed summaries.

Sentiment Analysis

This example uses OpenAI’s natural language understanding capabilities to analyze text’s emotional tone and sentiment. It classifies text as positive, negative or neutral, and has model explain how it came to that conclusion.

Video: Sentiment Analysis (4m 18s)

Try It: Build a sentiment analyzer that classifies the sentiment of customer reviews and asks the genAI model to provide a confidence score from 0.0 to 1.0 for each, indicating the likelihood that the classification is correct. Confidence scores closer to 1.0 are more likely to be correct.

Vision: Accessible Image Descriptions

In this example, I show OpenAI’s vision capabilities for analyzing images and use them to generate detailed, contextual descriptions, making them accessible to users with visual impairments. You’ll understand how to optimize prompts for description styles and detail levels.

Video: Vision: Accessible Image Descriptions (18m 42s)

OpenAI Documentation: Images and Vision Guide

Try It: Create an application that takes URLs for various images and generates both brief and comprehensive accessibility descriptions suitable for screen readers.

Language Detection and Translation

In this example, I use OpenAI’s multilingual capabilities to auto-detect what language text is written in and translate text to other spoken languages.

Videos:

Language Detection and Translation (4m 16s)
Language Detection and Translation: GenAI Prompting Exercises (1m 2s)

Try It: Build a translation tool that detects the input language and translates to a target language, preserving tone and context.

Code Generation

Videos:

Code Generation (10m 18s)
Code Generation: Other AI Code Capabilities (2m 42s)
Code Generation: GenAI Prompting Exercises (2m 37s)

Try It: In a text prompt, describe the requirements for a function you need and submit a request to the Responses API to generate that function and provide test cases to show it works correctly. If not, call the Responses API again with the generated code and a prompt to refine the code.

Named Entity Recognition (NER) and Structured Outputs

In this example, I use OpenAI’s natural language understanding capabilities and named entity recognition to extract structured information from unstructured text, identifying entities such as people, places, organizations, dates, times, events, products, and more. The example shows that OpenAI’s APIs can return outputs as formatted, human-and-computer readable JSON (JavaScript Object Notation). NER is essential for building applications that process and organize information from documents and text sources.

Videos:

Named Entity Recognition (NER) and Structured Outputs (10m 26s)
NER and Structured Outputs: Code and Prompt Exercises (3m 10s)

OpenAI Documentation: Structured Model Outputs Guide

Try It: Modify the NER example to perform parts-of-speech (POS) tagging—identifying each word’s part of speech (e.g., noun, verb, adjective, etc.) in a sentence. Use genAIs to research the commonly used tag sets for POS tagging, then prompt the model to return a structured JSON response with the parts of speech for the words in the supplied text and display each word with its part of speech. Each JSON object should contain key-value pairs for the keys “word” and “tag”.

Try It: Modify the NER example to translate text into multiple languages. Prompt the model to translate the text it receives to the specified languages and to return only JSON-structured data in the following format, then display the results:

{
   "original\_text": original\_text\_string,
   "original\_language": original\_text\_language\_code,
   "translations": \[
   _ {
       "language": translated\_text\_language\_code,
       "translation": translated\_text\_string
   _ }
   \]
}

Try It: Create a tool that extracts key entities from news articles and outputs them in a structured JSON format.

Speech Recognition and Speech Synthesis

In this video, I introduce speech-to-text transcription and text-to-speech conversion (speech synthesis) concepts that are the foundation for working with audio input and output in your AI applications. You’ll understand the models used in the transcription and synthesis examples, and explore the speech voices via OpenAI’s voice demo site—https://openai.fm.

Video: Speech Recognition and Speech Synthesis: Overview (5m 27s)

OpenAI Documentation:

Try It: Try all the voices at https://openai.fm. Which do you prefer? Why?

English Speech-to-Text (STT) for Audio Transcription

Here, I convert spoken audio to text. Speech-to-text technology enables applications like automated transcription services, voice commands, and accessibility features.

Videos:

OpenAI Documentation: Speech to Text Guide

Try It: Build a transcription tool that converts .mp3 and .m4a audio files to text.

Text-To-Speech (TTS)

Here, I convert written text into natural-sounding speech with one of OpenAI’s 11 voice options. I discuss selecting voice options, specifying speech style and tone, and generating audio files. Text-to-speech technology is crucial for creating voice assistants, audiobook generation, and accessibility applications.

Videos:

Text-To-Speech (11m 15s)
Text-To-Speech: Generative AI Prompting and Coding Exercises (2m 53s)

OpenAI Documentation: Text to Speech Guide

Try It: Create an app that converts documents to audio files with selectable voices.

Image Generation

Here, I create original images from text descriptions using OpenAI’s latest image-generation model. Image generation opens possibilities for creative content, design mockups, and visual storytelling.

Videos:

Image Generation: Overview (6m 32s)
Image Generation (7m 48s)
Image Generation — Generative AI Prompting Exercises (1m 30s)

OpenAI Documentation: Images and Vision Guide

Try It: Build an image-generation tool that creates variations based on text prompts.

Image Style Transfer

In two examples, I apply artistic styles to existing images using the Images API’s edit capability with style-transfer prompts and the Responses API’s image generation tool to transfer the style of one image to another.

Videos:

OpenAI Documentation: Images and Vision Guide

Try It: Create a style transfer application that transforms user photos into different artistic styles, such as Vincent van Gogh, Leonardo da Vinci and others.

Generating Closed Captions from a Video’s Audio Track

In this example, I generate closed captions from a video file’s audio track using OpenAI’s audio transcription capabilities. Closed captions enhance video accessibility and improve content searchability. This example covers caption formatting standards, audio extraction techniques and using the OpenAI Whisper model, which supports generating captions with timestamps. I then use the open-source VLC Media Player to overlay the closed captions on the corresponding video.

Video: Generating Closed Captions from a Video’s Audio Track (9m 7s)

OpenAI Documentation: Speech to Text Guide

Try It: Build a caption generator that programmatically extracts audio from videos and creates properly formatted subtitle files. Investigate the moviepy module for conveniently extracting a video’s audio track in Python.

Content Moderation

Videos:

Content Moderation (15m 30s)
Content Moderation: Generative AI Prompting Exercise (26s)

OpenAI Documentation: Moderation Guide

Try It: Create a content moderation system that screens user submissions and flags potentially problematic content.

Sora 2 Video Generation

This video introduces OpenAI Sora’s video-generation capabilities. I present prompt-to-video and image-to-video demos. Coming soon: I am developing API-based video-generation and video-remixing code examples using OpenAI’s recently released Sora 2 APIs and will add videos based on these code examples when I complete them.

Video: Sora Video Generation (10m 58s)

OpenAI Documentation: Video Generation with Sora Guide

Try It: Experiment with text-to-video prompts and explore the creative possibilities of AI video generation.

Closing Note

As I develop additional OpenAI API-based apps, I will add new videos to this Python Fundamentals lesson on Building API-Based Python GenAI Applications. Some new example possibilities include:

Generating and remixing videos with OpenAI’s Sora 2 API.
Using OpenAI’s Realtime Audio APIs for speech-to-speech apps.
Building AI agents with OpenAI’s AgentKit.
Single-tool AI agents.
Multi-tool AI agents.
Single-agent applications.
Multi-agent applications.
Managing AI conversations that maintain state between Responses API calls.

Try It: Review the course materials and start planning your own GenAI applications using the techniques learned. Enjoy!

Additional Resources

OpenAI Platform Documentation: https://platform.openai.com
OpenAI Community Forum: https://community.openai.com
Official OpenAI Python Library: https://github.com/openai/openai-python

My Full Throttle, One-Day, Code-Intensive Live-Training Courses on O’Reilly Online Learning

My Video Courses on O’Reilly Online Learning

Python Fundamentals, 2/e, includes Data Science and AI fundamentals (55 hours)
Java Fundamentals, 3/e (21.5 hours on fundamentals, including including new treatment of object-oriented programming; will be 50+ hours when I complete the high-end recordings in Q1 2026)
C++20 Fundamentals (54 hours; with an Intro to C++23)
C Fundamentals (under development)

Getting an OpenAI API Key

by Paul Deitel | Sep 7, 2025 | General | 0 comments

Updated October 28, 2025

This guide provides instructions for setting up your OpenAI Developer Account and securely storing your API Key. This is essential for using our book and video products that interact with the OpenAI APIs.

🔑 Step 1: Get an OpenAI Developer Account

Signup: You’ll need to sign up for an account. Use the following link: https://platform.openai.com/signup.

Additional relevant info:

Pricing: Information on pricing for the API can be found here: https://openai.com/api/pricing/.
Rate Limits: You may also want to review the rate limits documentation: https://platform.openai.com/docs/guides/rate-limits/rate-limits.

Step 2: Get an OpenAI Developer API Key

Sign into your account at https://platform.openai.com/docs/overview.
In the upper-right corner, press the settings icon.
In the left column under the Project heading, select API keys.
Press + Create new secret key.
Optionally, specify an API key Name, then press Create secret key button.
Press the Copy button to copy the alphanumeric key to the clipboard.

Step 3: Storing the API Key as an Environment Variable

For security, it’s best practice to store your API key as an environment variable rather than directly in your code. We’ll use the variable name OPENAI_API_KEY.

Storing the API Key in macOS

Open Terminal.
Open the configuration file ~/.zshrc using a text editor (e.g., nano ~/.zshrc).
- nano ~/.zshrc
Scroll to the end of the file and add the following line, replacing YourAPIKey with the lengthy alphanumeric key you copied previously:
- export OPENAI_API_KEY="YourAPIKey"
Save and close the file.
Run the following command in the Terminal to apply the changes:
- source ~/.zshrc

Storing the API Key on Windows

In the taskbar’s Search field, enter SystemPropertiesAdvanced, and press Enter.
In the System Properties dialog, press the Environment Variables… button.
Under User variables, press New….
Enter the Variable name as OPENAI_API_KEY.
For the Variable value, paste in the lengthy alphanumeric API key you copied previously.
Press OK to save the environment variable, then press OK in the System Properties dialog to close it.
Restart your command line before launching iPython or Jupyter Lab to ensure the new variable is loaded.

You are now ready to use the OpenAI APIs with your Deitel products!

Twitter v2 Update for Our Python Books and Videos

by Paul Deitel | Aug 23, 2022 | General | 0 comments

Intro to Python for Computer Science and Data Science: Learning to Program with AI, Big Data and the Cloud

Updated September 7, 2023—We’re leaving this post up for anyone who might still have access to the Twitter APIs. The Twitter API’s free tier is now so limited that most of what we demonstrate in our Twitter chapter/lesson is longer available. Higher levels of paid access are too expensive for average users and students. The first paid tier ($100/month) provides basic capabilities and no streaming access (the free tier used to allow access to 1% of the daily live stream). The second paid tier gives more access and some streaming capability, but costs $5000/month and caps the total number of tweets at 1,000,000. Significant access to the live stream of tweets costs tens of thousands of dollars per month. There has been some discussion of an academic/research tier, but as of now, we have not seen any indication of when or if this will be available.

Attention users of the following Python products:

Intro to Python for Computer Science and Data Science: Learning to Program with AI, Big Data and the Cloud
Python for Programmers
Python Fundamentals LiveLessons

On August 18, 2022, we discovered that new Twitter developer accounts cannot access the Twitter v1.1 APIs on which we based Intro to Python‘s Chapter 13, Data Mining Twitter, and two case studies in Chapter 17, Big Data: Hadoop, Spark, NoSQL and IoT. Chapters 13 and 17 correspond to Chapters/Lessons 12 and 16 in our Python for Programmers book and Python Fundamentals LiveLessons videos.

Twitter users who already had Twitter developer accounts can still access the Twitter v1.1 APIs, but most of our Python content users will not fall into this category.

We’ve updated all our Twitter examples to the Twitter v2 APIs now. In addition, for the Intro to Python textbook, we need to update the instructor’s manual solutions and test-item file.

Updated chapters from our books are now available:

Updated version of the Intro to Python textbook’s Chapter 13 https://deitel.com/wp-content/uploads/2022/09/intro-to-python-chapter-13-data-mining-twitter-v2.pdf
Updated version of the Python for Programmers Chapter 12 https://deitel.com/wp-content/uploads/2022/09/python-for-programmers-chapter-12-data-mining-twitter-v2.pdf
O’Reilly Online Learning users can also access the Python for Programmers chapter at: https://learning.oreilly.com/library/view/python-for-programmers/9780135231364/ch12.xhtml

Updated instructor slides for Chapter 13 of the textbook should be available now in the Pearson Instructor Resource Center (IRC). Other updated instructor supplements will be updated there as we complete them.

Updated source-code files are available in the books’ IntroToPython and PythonForProgrammers GitHub repositories at https://github.com/pdeitel.

I’ll be re-recording the Python Fundamentals LiveLessons videos’ Lesson 12 soon.

If you have any questions, please email paul@deitel.com.

C How to Program, 9/e Errata

by Paul Deitel | Mar 30, 2022 | C, General | 1 comment

This post contains the C How to Program, 9/e errata list. We’ll keep this up-to-date as we become aware of additional errata items. Please Contact Us with any you find.

Note: After publication, we discovered a bug in our authoring software that deleted some items in single quotes, like ‘A’, from our code tables. The source-code files were not affected, but occasionally a single-quoted item is missing from a code table in the text.

Last updated January 15, 2023

Chapter 2 — Intro to C Programming

Page 76, in Section 2.5: “+, / and %” should be “*, / and %“.

Chapter 4 — Program Control

Page 149, “Notes on Integral Types”:
–32767 should be –32768
–2147483647 should be –2147483648
–127 should be –128

Chapter 5 — Pointers

Page 214, Fig. 5.9: The example should produce factorial values through 20, not 21. The value displayed for factorial(21) in the program output is incorrect because unsigned long long is not capable of representing that value.

Chapter 7 — Pointers

Page 320, line 19 of Fig. 7.6 should be:
while (*sPtr != '\0') {
Page 321, line 22 of Fig. 7.7, should be
for (; *sPtr != '\0'; ++sPtr) {

Chapter 10 — Structures, Unions, Bit Manipulation and Enumerations

Page 496, Fig. 10.4, line 24 should be:
putchar(value & displayMask ? '1' : '0');
Page 496, Fig. 10.4, line 28 should be:
putchar(' ');
Page 496, Fig. 10.4, line 32 should be:
putchar('\n');
Page 497, seventh text line on the page should be:
putchar(value & displayMask ? '1' : '0');
Page 499, Fig. 10.5, line 53 should be:
putchar(value & displayMask ? '1' : '0');
Page 499, Fig. 10.5, line 57 should be:
putchar(' ');
Page 499, Fig. 10.5, line 61 should be:
putchar('\n');
Page 502, Fig. 10.6, line 32 should be:
putchar(value & displayMask ? '1' : '0')
Page 502, Fig. 10.6, line 36 should be:
putchar(' ');
Page 502, Fig. 10.6 line 40 should be:
putchar('\n');

Questions? Contact us!

« Older Entries

Building OpenAI API-Based Java GenAI Applications—A Guide to the Deitel Videos on the O’Reilly Online Learning Subscription Site

How We Formed This Guide

Contacting Me with Questions

Downloading the Code

Suggested Learning Workflow

Key Takeaways

Introduction

OpenAI APIs

OpenAI Developer Account and API Key

Text Generation Via the Responses API

Text Summarization

Sentiment Analysis

Vision: Accessible Image Descriptions

Language Detection and Translation

Code Generation

Named Entity Recognition (NER) and Structured Outputs

Speech Recognition and Speech Synthesis

English Speech-to-Text (STT) for Audio Transcription

Text-To-Speech (TTS)

Image Generation

Image Style Transfer

Generating Closed Captions from a Video’s Audio Track

Content Moderation

Sora 2 Video Generation

Closing Note

Additional Resources

Building OpenAI API-Based Python GenAI Applications—A Guide to the Deitel Videos on the O’Reilly Online Learning Subscription Site

How I Formed This Guide

Contacting Me with Questions

Downloading the Code

Suggested Learning Workflow

Key Takeaways

Introduction

OpenAI APIs

OpenAI Developer Account and API Key

Text Generation Via the Responses API

Text Summarization

Sentiment Analysis

Vision: Accessible Image Descriptions

Language Detection and Translation

Code Generation

Named Entity Recognition (NER) and Structured Outputs

Speech Recognition and Speech Synthesis

English Speech-to-Text (STT) for Audio Transcription

Text-To-Speech (TTS)

Image Generation

Image Style Transfer

Generating Closed Captions from a Video’s Audio Track

Content Moderation

Sora 2 Video Generation

Closing Note

Additional Resources

My Full Throttle, One-Day, Code-Intensive Live-Training Courses on O’Reilly Online Learning

My Video Courses on O’Reilly Online Learning

Getting an OpenAI API Key

🔑 Step 1: Get an OpenAI Developer Account

Signup: You’ll need to sign up for an account. Use the following link: https://platform.openai.com/signup.

Step 2: Get an OpenAI Developer API Key

Step 3: Storing the API Key as an Environment Variable

Storing the API Key in macOS

Storing the API Key on Windows

Twitter v2 Update for Our Python Books and Videos

C How to Program, 9/e Errata

Chapter 2 — Intro to C Programming

Chapter 4 — Program Control

Chapter 5 — Pointers

Chapter 7 — Pointers

Chapter 10 — Structures, Unions, Bit Manipulation and Enumerations

Follow Us

Recent Posts

Archives

Categories

Pin It on Pinterest