46 questions
0
votes
1
answer
237
views
Tensorflow-data-validation installing issue
I'm using Python 3.12, and I'm trying to install the latest version (1.15.1) of TensorFlow Data Validation. However, while installing, I encountered the following error:
ERROR: Could not find a ...
0
votes
1
answer
248
views
How to define features presence in a TensorFlow Data Validation schema?
I want to create a new TensorFlow Data Validation schema from scratch with fixed features name, type and presence.
import tensorflow_data_validation as tfdv
from tensorflow_metadata.proto.v0 import ...
0
votes
1
answer
400
views
Installing tensorflow_data_validation is causing error #tensorflow_data_validation
I'm trying to install tensorflow_data_validation but it shows the below error.
it says that i need Python version 2.7 or 3.4+ but I'm using the version 3.10.9.
Also I have the 2.11.0 version of ...
0
votes
1
answer
100
views
tfx.components.StatisticsGen display train and eval in two different figures, is it possible to have them in a single figure as tfdv does?
a superimposed display for train/val splits using StatisticsGen
Hi,
I'm currently using tfx pipeline inside kubeflow. I struggle to have StatisticsGen showing a single graph with train and validation ...
0
votes
2
answers
124
views
relaxing the anomaly detection constraints in MLOps
I am stucked in getting this block of statement.In my serving sets there are some anomalies.
I am not getting what this below code is doing in removing the anomalies.
payer_code = tfdv.get_feature(...
1
vote
1
answer
807
views
create a data validation using JOI module
I am new to Joi. I try to put some rules to signup users. But every new user could signup and "if" command is always executed.
const Joi = require ('joi')
const authSchema = Joi.object()....
1
vote
1
answer
600
views
apache beam rows to tfrecord in order to GenerateStatistics
I have built a pipeline that read some data, does some manipulations and create some apache beam Row objects (Steps 1 and 2 in the code below). I then would like to generate statistic and write them ...
4
votes
1
answer
609
views
Infer multivalent features with tfdv from pandas dataframe
I want to infer a schema with tensorflow data validation (tfdv) based on a pandas dataframe of the training data. The dataframe contains a column with a multivalent feature, where multiple values (or ...
0
votes
0
answers
42
views
Installing TensorFow Data Validation on Jetson AGX (AArch64)
Has anyone been able to install TFDV on a Jetson AGX or similar device with AArch64 architecture? I have tried installing with PyPI, building with Docker, and building from source with no luck. I am ...
0
votes
1
answer
513
views
Dimension does not match when using `keras.Model.fit` in `BERT` of tensorflow
I follow the instruction of Fine-tuning BERT to build a model with my own dataset(It is kind of large, and greater than 20G), then take steps to re-cdoe my data and load them from tf_record files.
...
5
votes
1
answer
782
views
Is it possible to run tensorflow-data-validation on MacOS with M1 chip?
Question: Is it possible to run tensorflow-data-validation on MacOS with M1 chip?
Steps taken: I have created a conda environment (tfdv38) in which I have installed the Mac-optimized TensorFlow.
I ...
1
vote
1
answer
417
views
TensorFlow Extended: Specifying Valency Of Features In Schema
I am presently trying to feed a dataset with a few multivalent feature columns through a TensorFlow Extended (TFX) pipeline. Here is a row from my sample data:
user_id 29601
...
2
votes
1
answer
355
views
Why isn't SchemaGen supported in tfdv.display_schema()?
Regarding TFX' tensorflow-data-validation, I'm trying to understand when I should use *Gen components vs. using TFDV provided methods.
Specifically, what's confusing me is that I have this as my ...
2
votes
1
answer
876
views
What is the difference between implementation of 'drift_comparator' and 'skew_comparator' available in Tensorflow Data Validation?
Need help some in understanding the difference between the underlying implementation of skew and drift comparator in TFDV.
0
votes
0
answers
521
views
Unable to run Tensorflow Data Validation on Google Cloud Platform (Dataflow)
I have been trying to run TensorFlow Data Validation following Google Documents
Followed same steps as https://www.tensorflow.org/tfx/data_validation/install:
>pip install tensorflow-data-...
0
votes
4
answers
872
views
How do I save a TFDV stats in the correct format for them to be loaded back in?
It is puzzling to me that there is a tfdv.load_statistics() function, but no corresponding tfdv.write_statistics() function. How do I go about saving the statistics, and then loading them again?
e.g.
...
1
vote
2
answers
2k
views
Understanding L-infinity norm which is used in TFDV
I was trying to implement TensorFlow Data Validation to check drift/skew in a dataset. They are using the L-infinity norm as a metric. I didn't understand the concept. Can anyone explain how it is ...
1
vote
1
answer
324
views
How to see all the possible options for schema metadata in tensorflow?
I am using tensorflow data validation and I am trying to build schemas around my datasets. I've built the initial schemas and I can see/edit them in notepad, but I'm having a hard time actually ...
2
votes
2
answers
483
views
How to enable numerical feature drift?
I want to enable numerical feature drift without setting a domain. Ex: I am scoring customers based on age and in my training set I had a uniform distribution of the variable. Now, in my serving data, ...
1
vote
2
answers
431
views
Tensorflow data validation for tensorflow 2.0
Is there any version of Tensorflow data validation (tfdv) that can work with tensorflow 2.0 alpha ?
0
votes
1
answer
200
views
TFX / Tensorflow Data Validation (TFDV) throws overflow encountered in float_scalars error
Following basic tutorial, and trying to generate stats for a TFRecord, I get a overflow error:
import tensorflow_data_validation as tfdv
stats = tfdv.generate_statistics_from_tfrecord(data_location='/...
1
vote
1
answer
711
views
Tensorflow TFDV does not work with images
I'm trying to get TFDV working with RGB images as feature inputs, reading from a TFRecords file. I can read/write the image data to TFRecord files fine. Here's the relevant code snippets for writing, ...
5
votes
3
answers
2k
views
TFDV Tensorflow Data Validation: how can I save/load the protobuf schema to/from a file
TFDV generates schema as a Schema protocol buffer. However it seems that there is no helper function to write/read schema to/from a file.
schema = tfdv.infer_schema(stats)
How can I save it/load it ?