> For the complete documentation index, see [llms.txt](https://bythen-ai.gitbook.io/bythen-ai/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://bythen-ai.gitbook.io/bythen-ai/technical-docs/technology-stack/foundational-technologies.md).

# Foundational Technologies

1. **Large Language Models (LLMs):** \
   LLMs enable natural, contextually relevant, and emotionally resonant dialogues for AI assistants and content creation. LLMs generate human-like responses, enabling 3D characters to engage meaningfully with users.
2. **Text-to-Speech (TTS) and Voice Cloning:** \
   TTS models generate high-quality, natural-sounding speech by converting textual input into a spectrogram and transform it into emotion aware realistic human speech. Voice Cloning allows for the creation and personalization of unique voice profiles for each 3D character, enhancing their individuality and personal connection with users.
3. **Text & Speech to Video and Pose Detection:** \
   LipSync, Generative Adversarial Networks, Stable Diffusion, and detection models unify text, voice, and 3D characters for professional content creation. These technologies blend narrative, emotions, and visual elements to produce engaging 3D videos, where characters interact dynamically and authentically, offering users a deeply immersive and customized experience.
4. **AR/VR Technologies:** \
   ARplay a crucial role in digital representation and content creation by providing immersive, interactive experiences that blend digital avatars with real and digital world environments. ARCore, ARKit and Unity enable realistic placement and interaction of 3D characters in real-world environments, while their Face tracking and augmented faces capture capabilities help replicate user expressions and movements in real-time. A blend of Virtual camera and AR Plugins allows real-time blend of 3D characters and user’s physical environment in live streams, making sessions dynamic and interactive.


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://bythen-ai.gitbook.io/bythen-ai/technical-docs/technology-stack/foundational-technologies.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
