#Asynchronous programming

Explore tagged Tumblr posts

Visit Tumblr Blog

Explore Tumblr blogs with no restrictions, modern design and the best experience.

Last Seen Tumblr Blogs

anonim0-irreal

Opositando Sin Control

38K posts

tattoohug

TATTOO HUG

187 posts

chachasabrina

ChachaSabrina

26 posts

artofnana

Oodles of Doodles and Whatnot

247 posts

coconatra

3K posts

Fun Fact

If you dial 1-866-584-6757, you can leave an audio post for your followers.

la-principessa-nuova · 3 months ago

Text

Product Owner: We just found out the client is still using Internet Explorer, so we’ll need to add ES5 support. Can you get that done by the end of the day?

Developer: Well I can’t make any promises, but I’ll give you a callback this afternoon and let you know how far along it is.

#javascript #ecmascript #asynchronous programming #web development #software engineer

2 notes · View notes

jcmarchi · 4 months ago

Text

Asynchronous LLM API Calls in Python: A Comprehensive Guide

New Post has been published on https://thedigitalinsider.com/asynchronous-llm-api-calls-in-python-a-comprehensive-guide/

Asynchronous LLM API Calls in Python: A Comprehensive Guide

As developers and dta scientists, we often find ourselves needing to interact with these powerful models through APIs. However, as our applications grow in complexity and scale, the need for efficient and performant API interactions becomes crucial. This is where asynchronous programming shines, allowing us to maximize throughput and minimize latency when working with LLM APIs.

In this comprehensive guide, we’ll explore the world of asynchronous LLM API calls in Python. We’ll cover everything from the basics of asynchronous programming to advanced techniques for handling complex workflows. By the end of this article, you’ll have a solid understanding of how to leverage asynchronous programming to supercharge your LLM-powered applications.

Before we dive into the specifics of async LLM API calls, let’s establish a solid foundation in asynchronous programming concepts.

Asynchronous programming allows multiple operations to be executed concurrently without blocking the main thread of execution. In Python, this is primarily achieved through the asyncio module, which provides a framework for writing concurrent code using coroutines, event loops, and futures.

Key concepts:

Coroutines: Functions defined with async def that can be paused and resumed.

Event Loop: The central execution mechanism that manages and runs asynchronous tasks.

Awaitables: Objects that can be used with the await keyword (coroutines, tasks, futures).

Here’s a simple example to illustrate these concepts:

import asyncio async def greet(name): await asyncio.sleep(1) # Simulate an I/O operation print(f"Hello, name!") async def main(): await asyncio.gather( greet("Alice"), greet("Bob"), greet("Charlie") ) asyncio.run(main())

In this example, we define an asynchronous function greet that simulates an I/O operation with asyncio.sleep(). The main function uses asyncio.gather() to run multiple greetings concurrently. Despite the sleep delay, all three greetings will be printed after approximately 1 second, demonstrating the power of asynchronous execution.

The Need for Async in LLM API Calls

When working with LLM APIs, we often encounter scenarios where we need to make multiple API calls, either in sequence or parallel. Traditional synchronous code can lead to significant performance bottlenecks, especially when dealing with high-latency operations like network requests to LLM services.

Consider a scenario where we need to generate summaries for 100 different articles using an LLM API. With a synchronous approach, each API call would block until it receives a response, potentially taking several minutes to complete all requests. An asynchronous approach, on the other hand, allows us to initiate multiple API calls concurrently, dramatically reducing the overall execution time.

Setting Up Your Environment

To get started with async LLM API calls, you’ll need to set up your Python environment with the necessary libraries. Here’s what you’ll need:

Python 3.7 or higher (for native asyncio support)

aiohttp: An asynchronous HTTP client library

openai: The official OpenAI Python client (if you’re using OpenAI’s GPT models)

langchain: A framework for building applications with LLMs (optional, but recommended for complex workflows)

You can install these dependencies using pip:

pip install aiohttp openai langchain <div class="relative flex flex-col rounded-lg">

Basic Async LLM API Calls with asyncio and aiohttp

Let’s start by making a simple asynchronous call to an LLM API using aiohttp. We’ll use OpenAI’s GPT-3.5 API as an example, but the concepts apply to other LLM APIs as well.

import asyncio import aiohttp from openai import AsyncOpenAI async def generate_text(prompt, client): response = await client.chat.completions.create( model="gpt-3.5-turbo", messages=["role": "user", "content": prompt] ) return response.choices[0].message.content async def main(): prompts = [ "Explain quantum computing in simple terms.", "Write a haiku about artificial intelligence.", "Describe the process of photosynthesis." ] async with AsyncOpenAI() as client: tasks = [generate_text(prompt, client) for prompt in prompts] results = await asyncio.gather(*tasks) for prompt, result in zip(prompts, results): print(f"Prompt: promptnResponse: resultn") asyncio.run(main())

In this example, we define an asynchronous function generate_text that makes a call to the OpenAI API using the AsyncOpenAI client. The main function creates multiple tasks for different prompts and uses asyncio.gather() to run them concurrently.

This approach allows us to send multiple requests to the LLM API simultaneously, significantly reducing the total time required to process all prompts.

Advanced Techniques: Batching and Concurrency Control

While the previous example demonstrates the basics of async LLM API calls, real-world applications often require more sophisticated approaches. Let’s explore two important techniques: batching requests and controlling concurrency.

Batching Requests: When dealing with a large number of prompts, it’s often more efficient to batch them into groups rather than sending individual requests for each prompt. This reduces the overhead of multiple API calls and can lead to better performance.

import asyncio from openai import AsyncOpenAI async def process_batch(batch, client): responses = await asyncio.gather(*[ client.chat.completions.create( model="gpt-3.5-turbo", messages=["role": "user", "content": prompt] ) for prompt in batch ]) return [response.choices[0].message.content for response in responses] async def main(): prompts = [f"Tell me a fact about number i" for i in range(100)] batch_size = 10 async with AsyncOpenAI() as client: results = [] for i in range(0, len(prompts), batch_size): batch = prompts[i:i+batch_size] batch_results = await process_batch(batch, client) results.extend(batch_results) for prompt, result in zip(prompts, results): print(f"Prompt: promptnResponse: resultn") asyncio.run(main())

Concurrency Control: While asynchronous programming allows for concurrent execution, it’s important to control the level of concurrency to avoid overwhelming the API server or exceeding rate limits. We can use asyncio.Semaphore for this purpose.

import asyncio from openai import AsyncOpenAI async def generate_text(prompt, client, semaphore): async with semaphore: response = await client.chat.completions.create( model="gpt-3.5-turbo", messages=["role": "user", "content": prompt] ) return response.choices[0].message.content async def main(): prompts = [f"Tell me a fact about number i" for i in range(100)] max_concurrent_requests = 5 semaphore = asyncio.Semaphore(max_concurrent_requests) async with AsyncOpenAI() as client: tasks = [generate_text(prompt, client, semaphore) for prompt in prompts] results = await asyncio.gather(*tasks) for prompt, result in zip(prompts, results): print(f"Prompt: promptnResponse: resultn") asyncio.run(main())

In this example, we use a semaphore to limit the number of concurrent requests to 5, ensuring we don’t overwhelm the API server.

Error Handling and Retries in Async LLM Calls

When working with external APIs, it’s crucial to implement robust error handling and retry mechanisms. Let’s enhance our code to handle common errors and implement exponential backoff for retries.

import asyncio import random from openai import AsyncOpenAI from tenacity import retry, stop_after_attempt, wait_exponential class APIError(Exception): pass @retry(stop=stop_after_attempt(3), wait=wait_exponential(multiplier=1, min=4, max=10)) async def generate_text_with_retry(prompt, client): try: response = await client.chat.completions.create( model="gpt-3.5-turbo", messages=["role": "user", "content": prompt] ) return response.choices[0].message.content except Exception as e: print(f"Error occurred: e") raise APIError("Failed to generate text") async def process_prompt(prompt, client, semaphore): async with semaphore: try: result = await generate_text_with_retry(prompt, client) return prompt, result except APIError: return prompt, "Failed to generate response after multiple attempts." async def main(): prompts = [f"Tell me a fact about number i" for i in range(20)] max_concurrent_requests = 5 semaphore = asyncio.Semaphore(max_concurrent_requests) async with AsyncOpenAI() as client: tasks = [process_prompt(prompt, client, semaphore) for prompt in prompts] results = await asyncio.gather(*tasks) for prompt, result in results: print(f"Prompt: promptnResponse: resultn") asyncio.run(main())

This enhanced version includes:

A custom APIError exception for API-related errors.

A generate_text_with_retry function decorated with @retry from the tenacity library, implementing exponential backoff.

Error handling in the process_prompt function to catch and report failures.

Optimizing Performance: Streaming Responses

For long-form content generation, streaming responses can significantly improve the perceived performance of your application. Instead of waiting for the entire response, you can process and display chunks of text as they become available.

import asyncio from openai import AsyncOpenAI async def stream_text(prompt, client): stream = await client.chat.completions.create( model="gpt-3.5-turbo", messages=["role": "user", "content": prompt], stream=True ) full_response = "" async for chunk in stream: if chunk.choices[0].delta.content is not None: content = chunk.choices[0].delta.content full_response += content print(content, end='', flush=True) print("n") return full_response async def main(): prompt = "Write a short story about a time-traveling scientist." async with AsyncOpenAI() as client: result = await stream_text(prompt, client) print(f"Full response:nresult") asyncio.run(main())

This example demonstrates how to stream the response from the API, printing each chunk as it arrives. This approach is particularly useful for chat applications or any scenario where you want to provide real-time feedback to the user.

Building Async Workflows with LangChain

For more complex LLM-powered applications, the LangChain framework provides a high-level abstraction that simplifies the process of chaining multiple LLM calls and integrating other tools. Let’s look at an example of using LangChain with async capabilities:

This example shows how LangChain can be used to create more complex workflows with streaming and asynchronous execution. The AsyncCallbackManager and StreamingStdOutCallbackHandler enable real-time streaming of the generated content.

import asyncio from langchain.llms import OpenAI from langchain.prompts import PromptTemplate from langchain.chains import LLMChain from langchain.callbacks.manager import AsyncCallbackManager from langchain.callbacks.streaming_stdout import StreamingStdOutCallbackHandler async def generate_story(topic): llm = OpenAI(temperature=0.7, streaming=True, callback_manager=AsyncCallbackManager([StreamingStdOutCallbackHandler()])) prompt = PromptTemplate( input_variables=["topic"], template="Write a short story about topic." ) chain = LLMChain(llm=llm, prompt=prompt) return await chain.arun(topic=topic) async def main(): topics = ["a magical forest", "a futuristic city", "an underwater civilization"] tasks = [generate_story(topic) for topic in topics] stories = await asyncio.gather(*tasks) for topic, story in zip(topics, stories): print(f"nTopic: topicnStory: storyn'='*50n") asyncio.run(main())

Serving Async LLM Applications with FastAPI

To make your async LLM application available as a web service, FastAPI is an great choice due to its native support for asynchronous operations. Here’s an example of how to create a simple API endpoint for text generation:

from fastapi import FastAPI, BackgroundTasks from pydantic import BaseModel from openai import AsyncOpenAI app = FastAPI() client = AsyncOpenAI() class GenerationRequest(BaseModel): prompt: str class GenerationResponse(BaseModel): generated_text: str @app.post("/generate", response_model=GenerationResponse) async def generate_text(request: GenerationRequest, background_tasks: BackgroundTasks): response = await client.chat.completions.create( model="gpt-3.5-turbo", messages=["role": "user", "content": request.prompt] ) generated_text = response.choices[0].message.content # Simulate some post-processing in the background background_tasks.add_task(log_generation, request.prompt, generated_text) return GenerationResponse(generated_text=generated_text) async def log_generation(prompt: str, generated_text: str): # Simulate logging or additional processing await asyncio.sleep(2) print(f"Logged: Prompt 'prompt' generated text of length len(generated_text)") if __name__ == "__main__": import uvicorn uvicorn.run(app, host="0.0.0.0", port=8000)

This FastAPI application creates an endpoint /generate that accepts a prompt and returns generated text. It also demonstrates how to use background tasks for additional processing without blocking the response.

Best Practices and Common Pitfalls

As you work with async LLM APIs, keep these best practices in mind:

Use connection pooling: When making multiple requests, reuse connections to reduce overhead.

Implement proper error handling: Always account for network issues, API errors, and unexpected responses.

Respect rate limits: Use semaphores or other concurrency control mechanisms to avoid overwhelming the API.

Monitor and log: Implement comprehensive logging to track performance and identify issues.

Use streaming for long-form content: It improves user experience and allows for early processing of partial results.

0 notes

robomad · 4 months ago

Text

Building Real-Time Notifications with Django Channels: A Step-by-Step Guide

Introduction:In today’s dynamic web applications, real-time features such as notifications, chat, and live updates have become increasingly important to enhance user engagement and experience. Django Channels extends Django to handle WebSockets, allowing you to build real-time applications. By integrating Django Channels into your project, you can implement real-time notifications that alert…

#asynchronous programming #Django #Django Channels #Python #Real-Time Notifications #web development #WebSockets

0 notes

justnshalom · 1 year ago

Text

JavaScript: Find Intersection

JavaScript: Find Intersection In JavaScript, finding the intersection of two arrays is a common programming task. The intersection is the set of elements that are present in both arrays. There are several ways to accomplish this in JavaScript, depending on the specific requirements and constraints of your project. Method 1: Using a loop One of the simplest approaches to find the intersection is…

View On WordPress

#array #Asynchronous Programming #intersection #JavaScript

0 notes

codewithnazam · 2 years ago

Text

Javascript Async & Await with Examples

#javascript #javascripts #javascriptlibraries #javascript30 #javascriptengineer #javascripting #javascript3 #javascriptdeveloper #javascriptiskillingmysoul #javascriptbasics #javascriptdevelopers #javascriptlover

Javascript is a high-level, dynamic, and interpreted programming language that is widely used for developing web applications and creating interactive effects on the web. Async and Await are two powerful features of Javascript that make it possible to write asynchronous code in a way that is much more readable and manageable. In this article, we will discuss what Async and Await are, how they…

View On WordPress

#async #async & await #async and await #Asynchronous programming #await #fetch api #Javascript #json #Readable code #Synchronous code

1 note · View note

nerdyqueerr · 7 months ago

Text

Outsourcing academic decisions to tumblr dot com

Further context for each course:

"Weird Fiction" is an English course that will have me reading and annotating tons of short stories. Reading reviews + syllabus online has led me to believe that the prof is strict as hell and not very generous. However if I do ok on the midterm I get to write a short story instead of a final exam. Online synchronous.

"Religon and the body" is a new religion course which is about. Well. Religion and the body, whether that's death rituals, food purity codes, sexuality, etc. Reviews of the prof seem to say she's flexible but the course has also never been taught before so who knows if the content and evaluation format will work for me. Online asynchronous.

#i would LOVE to do the short story stuff if it didnt seem so fucking stressful lol #religion and the body was lowkey my favorite part of my fun religion course last fall #but if the class ends up sucking im gonna be there like.... damn..... i could be reading + writing short stories rn #theyre both online which is fine cause all my other courses are in person that term so it's already lots #asynchronous online is kinda fun cause i just pick a time to sit in the hums lounge and do them #synchronous has more structure #its really a toss up #trying to see if the cool theythem in my program wants to take one or the other with me #my shit

6 notes · View notes

techcronus · 2 months ago

Text

Node.js Development: Everything You Need to Know in 2025

In 2025, Node.js development continues to be a powerful tool for building efficient, scalable, and real-time applications. This backend JavaScript framework has become a go-to technology for backend development, favoured by developers for its speed, flexibility, and vast ecosystem. Here’s everything you need to know about Node.js development trends, advantages, and key considerations in 2025.

Why Node.js Remains Popular in 2025

Node.js has gained a strong foothold in web and app development due to its high performance and ability to handle large volumes of simultaneous requests, making it ideal for data-intensive applications. Its non-blocking, event-driven architecture allows developers to build scalable web applications that can easily support thousands of concurrent users.

Key Node.js Trends to Watch in 2025

Serverless Architecture: Serverless is growing in popularity, and Node.js serverless applications fit perfectly with this trend. In a serverless environment, developers don’t need to manage server infrastructure; they focus instead on writing code. This approach can reduce development costs and improve scalability, making Node.js a key player in the serverless computing market.

Edge Computing: As demand for faster data processing rises, Node.js for edge computing is becoming crucial. By enabling data processing closer to the data source, Node.js helps reduce latency and improve application performance, particularly in real-time applications.

Microservices Architecture: Microservices are essential for large-scale, modular applications. Node.js, with its lightweight nature, is perfect for Node.js microservices architecture, allowing developers to build small, independent services that can be deployed and scaled individually.

Artificial Intelligence (AI) and Machine Learning (ML) Integration: In 2025, integrating AI and ML models into applications is a significant trend. Node.js with AI and ML is compatible with powerful machine-learning libraries, making it an attractive choice for developers looking to create intelligent applications.

Benefits of Using Node.js in 2025

High Performance: Node.js uses the V8 engine, offering impressive speed and efficient execution of JavaScript. This makes it suitable for applications requiring fast response times, such as real-time applications, chat applications, and IoT devices.

Rich Ecosystem: The Node.js ecosystem, including npm (Node Package Manager), gives developers access to a wide range of reusable modules and libraries. This Node.js ecosystem reduces development time and helps accelerate project timelines.

Cross-Platform Compatibility: Node.js Development cross-platform applications work well across different platforms, making it easier for developers to build applications that run seamlessly on various operating systems.

Scalability: The non-blocking, asynchronous architecture of Node.js for scalable applications makes it easy to scale horizontally, supporting increased workloads as businesses grow.

Best Practices for Node.js Development in 2025

Leverage TypeScript: Using TypeScript with Node.js enhances code quality and reduces bugs, making it a valuable addition to any development project.

Prioritize Security: Security is a primary concern for developers, particularly in 2025, as cyber threats grow more sophisticated. Implementing Node.js security best practices, like input validation and rate limiting, is essential for protecting applications.

Adopt CI/CD Pipelines: Continuous integration and continuous deployment (CI/CD) pipelines streamline development and ensure faster, more reliable Node.js deployments.

Conclusion

Node.js continues to be a versatile and high-performance choice for backend development in 2025. Its adaptability to trends like serverless architecture, microservices, and AI integration makes it a prime technology for building future-ready applications. By leveraging the power of Node.js developers, businesses can develop scalable, efficient, and intelligent solutions to stay ahead in the digital landscape.

0 notes

mosh2viss · 4 months ago

Text

https://www.futureelectronics.com/p/semiconductors--memory--RAM--static-ram--asynchronous/cy62157ev18ll-55bvxi-infineon-3728565

SRAM memory cell, Types of SRAM, data Memory, memory card reader

CY62157EV18 Series 8 Mb (512 K x 16) 1.65 - 2.25 V 55 ns Static RAM - VFBGA-48

#RAM #Static RAM #Asynchronous SRAM #CY62157EV18LL-55BVXI #Infineon #memory cell #Types of SRAM #data #card reader #programming #Random Access Memory #SRAM chip #flash memory card reader #Nv SRAM #SRAM memory #CMOS Static RAM

1 note · View note

abr2tello · 7 months ago

Text

https://www.futureelectronics.com/p/semiconductors--memory--RAM--static-ram--asynchronous/cy62167ev30ll-45bvxi-infineon-6042923

Non-Volatile SRAM memory, Non-Volatile SRAM, Non volatile memory

CY62167EV30 Series 16 Mb (1M x 16/2M x 8) 2.2 - 3.6 V 45 ns Static RAM -TSOP-48

#RAM #Static RAM #Asynchronous SRAM #CY62167EV30LL-45ZXIT #Infineon #Non-Volatile SRAM memory #Non-Volatile SRAM #Surface Mount Flash Memory #Memory chips #Random-Access Memory #Volatile Memory #Programming System Devices

1 note · View note

bbob2reaa · 7 months ago

Text

https://www.futureelectronics.com/p/semiconductors--memory--RAM--static-ram--asynchronous/cy62167ev30ll-45bvxit-infineon-1068579

Non Volatile SRAM memory, What is SRAM, SRAM manufacturers, SRAM chip

CY62167EV30 Series 16 Mb (1M x 16 / 2 M x 8) 3 V 45 ns Static RAM - FBGA-48

#RAM #Static RAM Asynchronous #SRAM #CY62167EV30LL-45BVXIT #Infineon #Non Volatile SRAM memory #What is SRAM #manufacturers #SRAM ram chip #Static random access memory #SRAM memories #Memory Density #Programming System Devices

1 note · View note

robomad · 4 months ago

Text

Mastering Advanced Error Handling Techniques in Node.js Applications

Introduction:Error handling is a critical aspect of building robust Node.js applications. While Node.js provides basic error handling mechanisms, complex applications require more sophisticated techniques to ensure errors are caught, logged, and handled gracefully. Effective error handling not only improves the reliability of your application but also enhances the user experience by providing…

#asynchronous programming #backend development #Error Handling #Exception Handling #JavaScript #logging #Node.js

0 notes

arg0t · 1 year ago

Text

i hate asynchronous classes so much jesus christ i cannot do another discussion post

#this whole program is 100% asynchronous and it just #melts my brain

1 note · View note

xbsoftware · 2 years ago

Link

The Difference Between Asynchronous and Synchronous Programming

Multi-threading has enabled wide possibilities and, among other things, brought to life two essential development models. Asynchronous and synchronous programming are two terms that may sound confusing, but understanding their differences helps to spend effort more efficiently and build applications providing outstanding user experience. Today, we’ll learn the major pros and cons of both approaches.

0 notes