Next Generation AI Inference for the Open Web
Paralon AI delivers next-generation AI inference built for the open web. Our OpenAI-compatible API gives developers a powerful, accessible way to integrate large language models into their applications without vendor lock-in.
Our platform offers drop-in compatibility with existing OpenAI SDK integrations: point your client at our base URL and start using our distributed GPU network immediately. No code changes are required beyond the endpoint configuration.
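As a sketch of what "endpoint configuration only" means in practice, the request below builds a standard OpenAI-style chat completion call using only the Python standard library. The base URL (`https://api.paralon.ai/v1`) and model name are illustrative assumptions; substitute the values from your own dashboard.

```python
# Minimal sketch of an OpenAI-compatible chat request.
# ASSUMPTIONS: the base URL and model name below are hypothetical
# placeholders, not documented Paralon values.
import json
import urllib.request

BASE_URL = "https://api.paralon.ai/v1"  # assumed endpoint
API_KEY = "YOUR_API_KEY"

payload = {
    "model": "llama-3.1-8b-instruct",  # hypothetical model name
    "messages": [{"role": "user", "content": "Hello!"}],
}

# Only the URL differs from a stock OpenAI request; the payload
# shape and auth header are identical.
req = urllib.request.Request(
    f"{BASE_URL}/chat/completions",
    data=json.dumps(payload).encode("utf-8"),
    headers={
        "Authorization": f"Bearer {API_KEY}",
        "Content-Type": "application/json",
    },
    method="POST",
)
# urllib.request.urlopen(req) would send it; omitted here.
print(req.full_url)  # → https://api.paralon.ai/v1/chat/completions
```

If you already use an OpenAI client library, the same idea applies: pass the new base URL at client construction and leave the rest of your code untouched.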
Our infrastructure powers AI inference at scale with low-latency responses, automatic failover, and intelligent load balancing. The open web deserves open AI infrastructure, and that's exactly what we provide.
Key features include secure API key authentication, comprehensive usage analytics, zero data retention on prompts, and generous free-tier rate limits. Enterprise customers can access private deployments, white-label solutions, and SLA guarantees.
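When working within free-tier rate limits, a common client-side pattern is to retry rate-limited (HTTP 429) requests with exponential backoff and jitter. The schedule below is an illustrative sketch, not a documented Paralon retry policy; the parameter values are assumptions you should tune to your own limits.

```python
# Full-jitter exponential backoff: attempt n waits a random time in
# [0, min(cap, base * 2**n)]. The defaults here are illustrative.
import random

def backoff_delays(retries: int = 5, base: float = 0.5, cap: float = 30.0):
    """Yield one randomized delay (in seconds) per retry attempt."""
    for attempt in range(retries):
        yield random.uniform(0.0, min(cap, base * 2 ** attempt))

delays = list(backoff_delays())
```

A calling loop would sleep for each delay after receiving a 429, then resend the request; jitter spreads retries out so many clients don't hammer the API in lockstep.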
Whether you're building chatbots, RAG systems, AI agents, code generation tools, or content creation applications, Paralon AI provides the inference backbone you need. Join thousands of developers building the next generation of AI applications for the open web.