I need to create a function that validates incoming json data and returns a python dict. It should check if all necessary fields are present in a json file and also validate the data types of those fields. I need to use try-catch. Could you provide some kind of snippets or examples that give me answers?
4 Answers
If you haven't check jsonschema library, it can be useful to validate data. JSON Schema is a way to describe the content of JSON. The library just uses the format to make validations based on the given schema.
I made a simple example from basic usage.
import json
from jsonschema import validate
# Describe what kind of json you expect.
schema = {
"type" : "object",
"properties" : {
"description" : {"type" : "string"},
"status" : {"type" : "boolean"},
"value_a" : {"type" : "number"},
"value_b" : {"type" : "number"},
},
}
# Convert json to python object.
my_json = json.loads('{"description": "Hello world!", "status": true, "value_a": 1, "value_b": 3.14}')
# Validate will raise exception if given json is not
# what is described in schema.
validate(instance=my_json, schema=schema)
# print for debug
print(my_json)
Comments
As you're using a JSON file, you can use this example:
import json
def validate(filename):
with open(filename) as file:
try:
data = json.load(file) # put JSON-data to a variable
print("Valid JSON") # in case json is valid
return data
except json.decoder.JSONDecodeError:
print("Invalid JSON") # in case json is invalid
4 Comments
I had the same problem and was dissatisfied with the existing solution of using jsonschema. It gives terrible error messages that are not at all user-friendly.
I wrote my own library to define schemas, which has a lot of additional functionality over jsonschema:
https://github.com/FlorianDietz/syntaxTrees
It gives very precise error messages, allows you to write code to customize the validation process, lets you define functions that operate on the validated JSON, and even creates HTML documentation for the schemas you define.
Schemas are defined as classes, similar to how Django defines models:
class MyExampleNode(syntaxTreesBasics.Node):
field_1 = fields.Float(default=0)
field_2 = fields.String()
field_3 = fields.Value('my_example_node', null=True, default=None)
class Meta:
name = 'my_example_node'
1 Comment
While Jsonschema module is good but documentation is missing complex examples. And library not reporting any errors for invalid schema just ignoring!
This is example:
from jsonschema import validate
set_tl_schema = {
"type" : "object",
"properties" : {
"level": {
"value": {"type" : "number"},
"updatedAt": {"type" : "number"}
}
}
}
x = {'level': {'updatedAt': '1970-01-01T00:00:00.000Z', 'value': 1}, }
try:
validate(instance=x, schema=set_tl_schema)
except jsonschema.exceptions.ValidationError as ex:
print(ex)
Mistake was that level also need properties field. But validator never report you that.
I found very efficient and easy to use module:
https://pypi.org/project/json-checker/
>>> from json_checker import Checker
>>> current_data = {'first_key': 1, 'second_key': '2'}
>>> expected_schema = {'first_key': int, 'second_key': str}
>>> checker = Checker(expected_schema)
>>> result = checker.validate(current_data)
2 Comments
set_tl_schema is valid. Try something like jsonschema.Draft202012Validator.check_schema(set_tl_schema) and it reports nothing wrong. The default metaschema just checks that you have an object or bool. e.g. this fails Draft202012Validator.check_schema("hello") but this succeeds Draft202012Validator.check_schema(True). This other lib you recommend looks unmaintained and unpopular though.{} or bool. See json-schema.org/learn/getting-started-step-by-step Your schema is valid, yet doesn't constrain the way you would want it to, until you add that "properties" field (and ideally the "type": "object" as well). Without that, the parser stops at the point where it evaluates "level" as being an object or bool, and then ignores the content inside "level".