[New feature] Native support for API streaming

Could you increase the API timeout limit to 4 minutes so that requests to reasoning models, which can take a long time to respond, don't time out? It should be a very simple change and would be a huge enabler.