Skip to main content
Filter by
Sorted by
Tagged with
Advice
0 votes
1 replies
34 views

Is there a way to query data from a S3 bucket through Athena without creating a table? Something like this: with table1 as (select * from real_table), table2 as (create table from s3 location)...
Pedro Henrique's user avatar
0 votes
0 answers
38 views

I have an Athena table in my AWS account and I am using an S3 buckets as a database for which i have defined the schema in AWS glue. I currently have Hive style partitioning(manual partitionig) of ...
code_it's user avatar
  • 127
1 vote
0 answers
49 views

In AWS, I have a DynamoDB table with one of the fields as a map (my_map). This table is exposed to Athena, so data can be queried there (and eventually in Grafana). The Athena table shows a schema ...
Nisarg's user avatar
  • 336
1 vote
4 answers
137 views

I have some data like this: Id timestamp 100 2025-01-27 10:00:00 100 2025-01-26 10:00:00 100 2025-01-25 10:00:00 100 2024-04-20 10:00:00 100 2024-03-25 10:00:00 100 2023-05-05 10:00:00 100 2022-08-01 ...
nut's user avatar
  • 61
0 votes
1 answer
94 views

Description: I'm trying to configure AWS Athena to write query results to an S3 bucket that is in a different AWS account AND different region, but I'm getting the "Unable to verify/create output ...
Mitsukk's user avatar
  • 321
0 votes
0 answers
107 views

I am currently running into an issue with using Athena's iceberg "merge" where it ends up scanning the entire source + target table. For example, I have a source table and a target table. ...
cpstone's user avatar
0 votes
0 answers
76 views

I'm trying to work with AWS' client API in python using boto3. I've been trying get_data_catalog() but it throws an error: File ".../.venv/lib/python3.13/site-packages/botocore/client.py", ...
Sriram Rao's user avatar
0 votes
0 answers
82 views

I am looking to compute the total size of my files per year inside a given S3 bucket. I have been trying multiple methods: I tried to use a goto3 script but I constantly ran into issues or ...
Snows's user avatar
  • 1
1 vote
1 answer
55 views

When creating a table in AWS - Athena with a CTAS statement, I'm trying to understand how the table is stored in S3 and how I can convert the output to CSV. create table1 as select col1, col2 from ...
user1783504's user avatar
1 vote
1 answer
84 views

We have an AWS Glue pipeline where: A crawler populates a raw database table from partitioned JSON files in S3. S3 structure: raw/ ├── org=21/ │ └── 221.json └── org=23/ └── 654.json ...
Max Manitskov's user avatar
1 vote
0 answers
439 views

I am currently working with an Iceberg table in Athena. However, I am facing an issue where some of the data files have become corrupted or deleted due to incorrect lifecycle policies in s3. When I ...
Javier Esteban's user avatar
1 vote
1 answer
390 views

I'm setting up AWS Athena to query data stored in an S3 bucket that is: in a different AWS account in a different region and accessible via a VPC endpoint (PrivateLink) to S3 here's what I've done ...
jipot's user avatar
  • 146
0 votes
1 answer
124 views

I have a glue table that is fed by partitioned data in s3. The issue at hand is in Athena that if the partition projection is turned off, and I run MSCK REPAIR TABLE <my table>; and SELECT * ...
Raisin's user avatar
  • 21
0 votes
1 answer
71 views

The SQL in question: WITH first_cte AS ( SELECT s.two_id, s.d_month, COUNT(*) AS total, COUNT(DISTINCT one_id) AS unique_one_id_count FROM ...
Nir's user avatar
  • 2,677
1 vote
0 answers
24 views

I have a data lake implemented using AWS s3. Bronze and silver layer are implemented, with data in silver layer exposed for access via API and also via jdbc/odbc based sql client. We have a ...
bphya's user avatar
  • 51
0 votes
1 answer
75 views

I have a file in S3 with the following contents: {"foo-bar": {"name":"Mercury","distanceFromSun":0.39,"orbitalPeriod":0.24,"dayLength":58.65}...
Samer A.'s user avatar
1 vote
2 answers
112 views

I am trying to write a query to return ALL rows for an ID where a condition is met and a condition is not met for each ID on the Order table. The conditions I want are to return all rows where the ID'...
CWZY's user avatar
  • 37
1 vote
1 answer
288 views

We are planning to use Iceberg tables instead of PostgreSQL. For PostgreSQL, we were using Flyway for database migrations. So, I wonder if it is possible to do it for Iceberg tables, as well. ChatGPT ...
melih zenciroglu's user avatar
0 votes
2 answers
86 views

I am facing issue while filtering data with array I have columns userid,event_name,attributes,ti Attributes column have value like this {"bool_sample":true,"array_int":[10,20,25,38]...
Nishant Dixit's user avatar
0 votes
0 answers
50 views

I get the following error Failed to start Athena query: An error occurred (InvalidRequestException) when calling the StartQueryExecution operation: line 1:748: mismatched input '.'. Expecting: '=' ...
aroe's user avatar
  • 171
0 votes
1 answer
200 views

I'm trying to query data through Redshift Spectrum using an external schema from the Glue catalog but encountering an issue with a column that has a timestamp data type. When I run the query SELECT * ...
Jeff A's user avatar
  • 619
1 vote
0 answers
26 views

CREATE EXTERNAL TABLE testpart (id bigint, eventday bigint, eventhour bigint PARTITIONED BY (eventday smallint,eventhour bigint) ROW FORMAT SERDE'org.apache.hadoop.hive.ql.io.parquet.serde....
atish kulkarni's user avatar
0 votes
2 answers
99 views

I have two tables an "Enrolled Table" and a "Customer Table". I only want to show all records on the Customer Table where the enrolled_no on the Customer table matches the ...
CWZY's user avatar
  • 37
0 votes
0 answers
109 views

I am trying to connect to athena database using SQLAlchemy create_engine api. The datasource name is Athena-xxxx, the database name is amazon_security_lake_glue_db_ca_central_1 and the primary ...
dossani's user avatar
  • 1,958
2 votes
0 answers
206 views

We have been trying Firehose for Iceberg Tables. The source is Kinesis stream attached to DynamoDB tables with some Lambda processing in between. Table has been successfully filled by Firehose, but ...
Martin Macak's user avatar
  • 3,842

1
2 3 4 5
73