GCS Pipeline does not read the data

I gave all the permissions to the service account from google cloud storage but I’m still getting
Cannot extract data for pipeline. **Object actual_object_name.csv.gz has a nil content length

It reads the file name because it’s in the log, but anything else. The files are complete

CREATE PIPELINE pipeline_name
AS LOAD DATA GCS ‘bucket_name’
CREDENTIALS ‘{“access_id”: “XXXXXXX”, “secret_key”: “XXXXXXX”}’
BATCH_INTERVAL 200
SKIP ALL ERRORS
INTO TABLE XXXXXX
FIELDS TERMINATED BY ‘,’ ENCLOSED BY ‘"’ ESCAPED BY ‘\’
LINES TERMINATED BY ‘\n’ STARTING BY ‘’
IGNORE 1 LINES

The same, with the same source of data works fine for S3. Any help?

UPDATE: It works if I pick a file and manually remove from GCS from the metadata Content-Encondig=gzip and I left it blank

thanks in advance.
Rodrigo

1 Like