You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
ingestfromURI can ingest from S3, but does not support public buckets.
ingestFromFile requires to pass the file in the payload. This means the client must have the file locally. Often this means the client has to download the file first, then pass it in the payload. This is pretty inefficient.
Proposal
It'd be nice to be able to ingest from a public remote file.
I suggest to add a new parameter to the /ingestFromFile endpoint: remoteFileUrl .
If this parameter is set, instead of getting the file from the payload, Pinot will download the file directly.
This is a feature request.
Problem
As of today, it is possible to ingest small datasets to try Pinot quickly with the endpoints
/ingestfromURI
and/ingestFromFile
.See https://docs.pinot.apache.org/basics/data-import/batch-ingestion#ingestfromuri and
https://docs.pinot.apache.org/basics/data-import/batch-ingestion#ingestfromfile.
The problems are the following:
ingestfromURI
can ingest from S3, but does not support public buckets.ingestFromFile
requires to pass the file in the payload. This means the client must have the file locally. Often this means the client has to download the file first, then pass it in the payload. This is pretty inefficient.Proposal
It'd be nice to be able to ingest from a public remote file.
I suggest to add a new parameter to the
/ingestFromFile
endpoint:remoteFileUrl
.If this parameter is set, instead of getting the file from the payload, Pinot will download the file directly.
Around this line:
https://github.com/apache/pinot/blob/6eddacfc32055e959ad72634684b904cf4098e20/pinot-controller/src/main/java/org/apache/pinot/controller/api/resources/PinotIngestionRestletResource.java#L154C81-L154C92
Pseudo code will look like this:
The text was updated successfully, but these errors were encountered: