database – DAILY NEWS

TECH & AI

PostgreSQL 22034 Error: Causes and Solutions Complete Guide

jackminion Jun 17, 2026

PostgreSQL Error 22034: more than one sql json item

PostgreSQL error code 22034 (more_than_one_sql_json_item) occurs when a SQL/JSON function such as JSON_VALUE() or JSON_QUERY() evaluates a JSON path expression that returns more than one item while the function expects exactly one. The error became more visible once the SQL-standard query functions JSON_VALUE(), JSON_QUERY(), and JSON_TABLE() shipped in PostgreSQL 17. These functions default to NULL ON ERROR, so the error is raised only when ERROR ON ERROR is specified; otherwise the same condition silently yields NULL.

Top 3 Causes

1. Wildcard path in JSON_VALUE() returning multiple results

JSON_VALUE() strictly requires a single scalar return value. Using a wildcard such as [*] across an array matches multiple elements and triggers error 22034.

-- Triggers 22034 (ERROR ON ERROR surfaces it; the default is NULL ON ERROR)
SELECT JSON_VALUE('{"fruits": ["apple", "banana", "cherry"]}', '$.fruits[*]' ERROR ON ERROR);

-- Fix: specify an explicit index
SELECT JSON_VALUE('{"fruits": ["apple", "banana", "cherry"]}', '$.fruits[0]');
-- Result: apple

-- Fix: suppress the error gracefully
SELECT JSON_VALUE(
    '{"fruits": ["apple", "banana", "cherry"]}',
    '$.fruits[*]'
    NULL ON ERROR
);
-- Result: NULL
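
The cardinality rule behind this error can be modeled in a few lines of Python. This is only a toy sketch, not PostgreSQL's jsonpath engine: `match_path`, `WILDCARD`, and the list-of-steps path format are hypothetical names invented for illustration. It shows that a wildcard step fans out to every array element, while an explicit index selects at most one.

```python
import json

WILDCARD = object()  # hypothetical marker standing in for the [*] path step


def match_path(doc, steps):
    """Return every item a simplified path selects from a parsed JSON document.

    Each step is an object key (str), an array index (int), or WILDCARD.
    """
    items = [doc]
    for step in steps:
        next_items = []
        for item in items:
            if step is WILDCARD:
                if isinstance(item, list):
                    next_items.extend(item)  # wildcard fans out to every element
            elif isinstance(step, int):
                if isinstance(item, list) and 0 <= step < len(item):
                    next_items.append(item[step])  # explicit index: at most one item
            elif isinstance(item, dict) and step in item:
                next_items.append(item[step])  # object member lookup
        items = next_items
    return items


doc = json.loads('{"fruits": ["apple", "banana", "cherry"]}')

many = match_path(doc, ["fruits", WILDCARD])
one = match_path(doc, ["fruits", 0])

print(len(many))  # 3 items: too many for a scalar function
print(one)        # ['apple']: exactly one item
```

A scalar function can only succeed when the matched list has exactly one element, which is why the explicit index works and the wildcard does not.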

2. JSON_QUERY() without WITH ARRAY WRAPPER on multi-value paths

JSON_QUERY() also fails when a path resolves to multiple independent values and no wrapper option is provided to consolidate them into a single JSON array.

-- Triggers 22034 (ERROR ON ERROR surfaces it; the default is NULL ON ERROR)
SELECT JSON_QUERY('{"scores": [95, 87, 76]}', '$.scores[*]' ERROR ON ERROR);

-- Fix: wrap results into a JSON array
SELECT JSON_QUERY(
    '{"scores": [95, 87, 76]}',
    '$.scores[*]'
    WITH ARRAY WRAPPER
);
-- Result: [95, 87, 76]

3. Navigating nested array structures with simple path expressions

Deeply nested JSON arrays compound the cardinality problem at every path step. Using JSON_VALUE() or JSON_QUERY() on paths that traverse multiple array levels without index constraints will almost always produce multiple results.

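-- Triggers 22034 with ERROR ON ERROR (lax mode unwraps the array, so multiple ids are returned)
-- WITH doc AS (SELECT '{"orders": [{"id":1}, {"id":2}, {"id":3}]}'::jsonb AS data)
-- SELECT JSON_VALUE(data, '$.orders.id' ERROR ON ERROR) FROM doc;

-- Fix: use jsonb_path_query() to return a set of rows
WITH doc AS (
    SELECT '{"orders": [{"id":1}, {"id":2}, {"id":3}]}'::jsonb AS data
)
SELECT jsonb_path_query(data, '$.orders.id')
FROM doc;

-- Fix: use jsonb_array_elements() for row-by-row processing
-- (a CTE is scoped to one statement, so it is repeated here)
WITH doc AS (
    SELECT '{"orders": [{"id":1}, {"id":2}, {"id":3}]}'::jsonb AS data
)
SELECT elem->>'id' AS order_id
FROM doc, jsonb_array_elements(data->'orders') AS elem;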

Quick Fix Solutions

Scenario | Recommended Fix
Need only the first value | Use an explicit index such as $.array[0]
Need all values as JSON array | JSON_QUERY(… WITH ARRAY WRAPPER)
Need all values as rows | jsonb_path_query() or jsonb_array_elements()
Want to avoid query failure | Add a NULL ON ERROR clause
Complex nested structures | Use JSON_TABLE() (PostgreSQL 17+)

-- JSON_TABLE() for structured unnesting (PostgreSQL 17+)
SELECT *
FROM JSON_TABLE(
    '{"orders": [{"id":1,"amt":100},{"id":2,"amt":250}]}'::jsonb,
    '$.orders[*]'
    COLUMNS (
        order_id INT PATH '$.id',
        amount INT PATH '$.amt'
    )
) AS jt;

Prevention Tips

Always verify path cardinality before using scalar JSON functions. Before deploying queries with JSON path expressions into production, use jsonb_path_query_array() to check how many items a path returns. If the count exceeds one, switch to a set-returning function or add WITH ARRAY WRAPPER.

-- Pre-flight cardinality check
SELECT jsonb_array_length(
    jsonb_path_query_array(your_column, '$.some.path')
)
FROM your_table
LIMIT 10;

Always declare explicit error and empty behavior clauses. Never rely on default behavior for SQL/JSON functions. Explicitly specifying NULL ON ERROR and NULL ON EMPTY keeps a single malformed or unexpectedly multi-valued JSON document from failing an entire query batch, which is especially critical when handling externally sourced JSON data.

SELECT JSON_VALUE(
    payload::json,
    '$.event.type'
    NULL ON EMPTY
    NULL ON ERROR
)
FROM event_log;
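
To see why the error behavior matters for a whole batch, here is a toy Python model. It is only a sketch under stated assumptions: `pick_single`, `MoreThanOneItem`, and the sample `batch` are hypothetical, and the model ignores type coercion and how PostgreSQL combines the ON EMPTY and ON ERROR clauses. It contrasts a batch that degrades to NULLs with one that aborts on the first multi-valued document.

```python
class MoreThanOneItem(Exception):
    """Stands in for SQLSTATE 22034 in this toy model."""


def pick_single(items, on_error="null"):
    """Reduce the items a path matched to one value, as a scalar function must.

    items    -- list of values the path matched for one document
    on_error -- "null" returns None for a multi-item match, "error" raises
    An empty match always returns None here, mirroring NULL ON EMPTY.
    """
    if not items:
        return None
    if len(items) > 1:
        if on_error == "error":
            raise MoreThanOneItem("more than one SQL/JSON item")
        return None
    return items[0]


# Items matched per document: one clean, one empty, one unexpectedly multi-valued.
batch = [["login"], [], ["login", "logout"]]

tolerant = [pick_single(items) for items in batch]
print(tolerant)  # ['login', None, None]

try:
    [pick_single(items, on_error="error") for items in batch]
except MoreThanOneItem as exc:
    print("batch aborted:", exc)
```

With the tolerant setting every row produces a value or NULL; with the strict setting one bad document stops the whole list from being built.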

Related Errors

22033 – invalid sql json subscript: bad array index in path expression

22032 – invalid json text: malformed JSON, often encountered before 22034

22035 – no sql json item: the opposite of 22034; path matches nothing

2203F – sql json scalar required: path returns an object/array where a scalar is expected

📖 Want a more detailed guide? Check out the full in-depth version (Korean) on oraerror.com, which includes detailed analysis, additional SQL examples, and prevention tips.

Delete record using JooqTemplate

jackminion Jun 10, 2026

1. delete

Delete records matching the given condition.

// Single condition
public int delete(String table, Condition condition)
public int delete(Table table, Condition condition)
// List of conditions
public int delete(String table, List<Condition> conditions)
public int delete(Table table, List<Condition> conditions)

Returns: int, the number of affected rows.

Example:

jt.delete("user_table", F("id").eq(100));
jt.delete("user_table", Arrays.asList(F("id").eq(100)));

2. deletev (varargs conditions)

Specify delete conditions via varargs.

public int deletev(String table, Object... conditions)
public int deletev(Table table, Object... conditions)

Returns: int, the number of affected rows.

Example:

// Delete by equality
jt.deletev("user_table", "id", 100);

// Multiple conditions: WHERE name = ? AND birthday >= ?
jt.deletev("user_table", "name", "John", "birthday>=", beginDate);
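
The varargs convention above (a column name with an optional operator suffix, followed by its value) can be sketched in Python. How JooqTemplate actually parses these arguments is not shown here, so `build_where`, the supported operator list, and the regular expression are all hypothetical; the sketch only illustrates how such pairs could be folded into a parameterized WHERE clause.

```python
import re

# Column name with an optional comparison operator suffix; no suffix means "=".
_KEY = re.compile(r"^(?P<col>\w+)\s*(?P<op>>=|<=|<>|!=|>|<|=)?$")


def build_where(*conditions):
    """Fold (column[op], value, column[op], value, ...) into a WHERE clause."""
    if len(conditions) % 2 != 0:
        raise ValueError("conditions must be column/value pairs")
    clauses, params = [], []
    for key, value in zip(conditions[0::2], conditions[1::2]):
        match = _KEY.match(key)
        if match is None:
            raise ValueError(f"unsupported condition key: {key!r}")
        operator = match["op"] or "="  # default to equality
        clauses.append(f"{match['col']} {operator} ?")
        params.append(value)
    return " AND ".join(clauses), params


where, params = build_where("name", "John", "birthday>=", "2000-01-01")
print(where)   # name = ? AND birthday >= ?
print(params)  # ['John', '2000-01-01']
```

Keeping values in a separate parameter list, rather than formatting them into the SQL string, is what lets the statement stay parameterized.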

Anatomy of DuckDB for Python Developers

jackminion May 17, 2026

Introduction – SQL without a Server

Pandas is widely used for data analysis; most data analysts and data engineers rely on its table-like data structure, the DataFrame, for quick analysis. The drawback is that it struggles once the data grows beyond a few GB, and spinning up Postgres or Redshift is overkill for a quick analysis. DuckDB fills this gap with zero-setup columnar SQL.

Getting Started – zero config, instant power

DuckDB is an open-source OLAP database management system designed for analytics and for running within the same process as the application. It is lightweight and can work directly with data files such as CSV and Parquet without needing a server.

Installation and first query

pip install duckdb

No ports to open, no configuration, and no daemon.

In-Memory and Persistent Database – Two Operating Modes

In-Memory

When a DuckDB connection is created without specifying a file, the database lives entirely in RAM.

import duckdb
con = duckdb.connect()  # or duckdb.connect(':memory:')

- All data is stored in RAM and no files are written to disk.
- Reads and writes are extremely fast since there is no disk I/O overhead.
- Data is completely lost when the connection closes.
- There are no file locking or concurrency concerns.

Persistent Mode

When the user provides a file path, DuckDB persists the database to disk in a single file (here with a .duckdb extension).

con = duckdb.connect('my_database.duckdb')

- Tables, schemas, and indexes are persisted.
- Uses a columnar storage format with compression and buffered I/O.
- Only one process can write to the file at a time; multiple processes can open it concurrently only in read-only mode.
- Supports WAL (write-ahead logging) for crash recovery.

Powerful Pattern

DuckDB lets you mix both modes: you can start in memory and attach a persistent database, or use COPY/EXPORT to snapshot an in-memory result to disk.

import duckdb

con = duckdb.connect()

# Query a CSV, aggregate it, and save the result to a Parquet file
con.execute("""
    COPY (
        SELECT region, SUM(sales) AS total
        FROM read_csv('data.csv')
        GROUP BY region
    )
    TO 'results.parquet' (FORMAT PARQUET)
""")

Users get the speed of in-memory processing, which accelerates the pipeline, with the option to persist results.

Reading files directly – CSV, Parquet, JSON, Arrow

Query CSV without loading into memory

SELECT * FROM read_csv('data.csv', auto_detect=true);

- Auto-detects delimiter, compression, and data types
- Handles malformed rows gracefully
- Can read multiple CSVs at once: read_csv('data/*.csv')

Parquet

SELECT * FROM read_parquet('data.parquet');
-- even from S3 directly (requires the httpfs extension)
SELECT * FROM read_parquet('s3://bucket/data/*.parquet');

- Exploits column pruning: it only reads the columns you need
- Leverages row group skipping using Parquet's built-in min/max statistics
- Native support for nested types (structs, lists, maps)

JSON/NDJSON

SELECT * FROM read_json('events.ndjson', auto_detect=true);

- Auto-infers the schema from the data
- NDJSON (newline-delimited JSON) streams efficiently, line by line
- Can unnest deeply nested JSON fields using DuckDB's json_extract, UNNEST, or -> operators

Apache Arrow

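import duckdb
import pandas as pd
import pyarrow as pa
df = pd.DataFrame({"x": [1, 2, 3]})  # sample data for illustration
arrow_table = pa.Table.from_pandas(df)
duckdb.query("SELECT * FROM arrow_table")  # DuckDB finds the local variable by name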

- Zero-copy integration: DuckDB reads from Arrow memory without serialization
- Ideal for pipelines where data never needs to touch disk

SQL Beyond Select

DuckDB is not just a query engine. It supports rich SQL that covers data transformation, creation, and some genuinely unique syntax extensions not available in most databases.

Full Suite of WINDOW Functions

SELECT
    customer,
    ordered_at,
    amount,

    -- Running total
    SUM(amount) OVER (PARTITION BY customer ORDER BY ordered_at) AS running_tot,

    -- Lag/lead comparisons
    LAG(amount) OVER (PARTITION BY customer ORDER BY ordered_at) AS prev_amt,

    -- Percentile rank
    PERCENT_RANK() OVER (ORDER BY amount) AS pct_rank,

    -- Named window reuse
    FIRST_VALUE(amount) OVER w AS first_order
FROM orders
WINDOW w AS (PARTITION BY customer ORDER BY ordered_at);
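
To make the output of these window functions concrete, here is a small runnable sketch. It deliberately swaps in Python's built-in sqlite3 module instead of DuckDB so it needs no extra packages (assuming a bundled SQLite new enough for window functions); the running total, LAG, FIRST_VALUE, and named window used here are standard SQL. The table contents are invented sample rows.

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE orders (customer TEXT, ordered_at TEXT, amount INTEGER)")
con.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [
        ("alice", "2026-01-01", 100),
        ("alice", "2026-01-05", 50),
        ("bob", "2026-01-02", 70),
    ],
)

rows = con.execute("""
    SELECT customer, ordered_at, amount,
           SUM(amount) OVER w AS running_tot,         -- running total per customer
           LAG(amount) OVER w AS prev_amt,            -- previous order, NULL for the first
           FIRST_VALUE(amount) OVER w AS first_order  -- first order in the partition
    FROM orders
    WINDOW w AS (PARTITION BY customer ORDER BY ordered_at)
    ORDER BY customer, ordered_at
""").fetchall()

for row in rows:
    print(row)
# ('alice', '2026-01-01', 100, 100, None, 100)
# ('alice', '2026-01-05', 50, 150, 100, 100)
# ('bob', '2026-01-02', 70, 70, None, 70)
```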

DuckDB also supports the QUALIFY clause, which filters on a window function result without a subquery.

SELECT * FROM orders
QUALIFY ROW_NUMBER() OVER (PARTITION BY customer ORDER BY amount DESC) = 1;
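
For comparison, this is the subquery form that QUALIFY replaces. The sketch runs on Python's built-in sqlite3 module rather than DuckDB, since SQLite has no QUALIFY clause and therefore forces the longer spelling; the sample rows are invented.

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE orders (customer TEXT, amount INTEGER)")
con.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [("alice", 100), ("alice", 250), ("bob", 70), ("bob", 40)],
)

# Without QUALIFY, the window result must be computed in an inner query
# and filtered in an outer one.
top_orders = con.execute("""
    SELECT customer, amount
    FROM (
        SELECT customer, amount,
               ROW_NUMBER() OVER (PARTITION BY customer ORDER BY amount DESC) AS rn
        FROM orders
    )
    WHERE rn = 1
    ORDER BY customer
""").fetchall()

print(top_orders)  # [('alice', 250), ('bob', 70)]
```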

PIVOT and UNPIVOT

Most databases make you write CASE WHEN expressions manually to pivot data. DuckDB does it natively.

-- PIVOT: rows to columns
PIVOT orders ON region USING SUM(amount) GROUP BY year;

-- UNPIVOT: columns to rows
UNPIVOT sales_wide ON q1, q2, q3, q4 INTO NAME quarter VALUE revenue;
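
The manual CASE WHEN pivot that the native syntax saves you from looks roughly like this. The sketch uses Python's built-in sqlite3 module instead of DuckDB, because SQLite has no PIVOT statement and so shows the hand-written form; the table and rows are invented, and the region names have to be hard-coded, which is exactly the maintenance burden a native PIVOT removes.

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE orders (year INTEGER, region TEXT, amount INTEGER)")
con.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(2025, "east", 10), (2025, "west", 20), (2026, "east", 5), (2026, "east", 7)],
)

# One conditional aggregate per output column, written out by hand.
pivoted = con.execute("""
    SELECT year,
           SUM(CASE WHEN region = 'east' THEN amount END) AS east,
           SUM(CASE WHEN region = 'west' THEN amount END) AS west
    FROM orders
    GROUP BY year
    ORDER BY year
""").fetchall()

print(pivoted)  # [(2025, 10, 20), (2026, 12, None)]
```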

MULTI DATABASE SQL

-- Attach another DuckDB file
ATTACH 'archive.duckdb' AS archive;

-- Cross-database join
SELECT a.*, b.region
FROM main.orders a
JOIN archive.customers b ON a.customer_id = b.id;

-- Attach a PostgreSQL database (requires the postgres extension)
ATTACH 'postgres://user:pass@host/db' AS pg (TYPE POSTGRES);
SELECT * FROM pg.public.users LIMIT 10;

DuckDB + pandas + Polars – Choosing your stack

DuckDB does not replace pandas or Polars; it fills a specific niche. A common pattern is to use DuckDB for SQL-shaped operations and pandas or Polars for row-level Python logic.

Where DuckDB shines

- Feature engineering for ML: window functions and GROUP BY aggregations for feature computation are often faster and more readable in DuckDB than in pandas, before handing the result over to scikit-learn or PyTorch.
- Unit testing dbt models locally: DuckDB lets you run a complete dbt project locally without a cloud warehouse, giving data engineers a fast feedback loop.
- Lightweight ETL pipelines: you can read raw Parquet from S3, transform it with SQL, and write the cleaned output back without a Spark cluster or Airflow jobs.

Conclusion

DuckDB lets you think in SQL for analytical tasks without worrying about infrastructure setup. Anyone using Python can use DuckDB to analyze larger files that would be a headache in plain pandas. Given these advantages, it is equally important to know where DuckDB should not be used: concurrent writes, OLTP workloads, and long-running multi-user services.

Reference: https://duckdb.org/docs/current/data/overview
