Materialization Quick Reference

Design classes/records for CSV, JSON, YAML, Snowflake, and Spark

When using DataFlow readers like Read.Csv<T>(), Read.Json<T>(), or Read.Yaml<T>(), you must define a target type (T) to receive the data. The reader automatically maps source fields to your type's properties or constructor parameters. This guide covers the rules each reader expects.

✅ What Works

Pattern	CSV	JSON	YAML	Snowflake	Spark
Mutable class `{ get; set; }`	✅	✅	✅	✅	✅
Positional record `(int Id, string Name)`	✅	✅	✅	✅	✅
Init-only `{ get; init; }`	✅	✅	✅	✅	✅
Private setter `{ get; private set; }`	✅	❌	❌	✅	✅
Public fields	✅	❌	❌	✅	✅

Important

CSV uses custom parser that fully uses ObjectMaterializer capacities. JSON uses System.Text.Json (supports records natively via constructor deserialization). YAML uses YamlDotNet + ObjectMaterializer bridge (flat records supported; nested objects require mutable classes). Snowflake uses ObjectMaterializer (most flexible). Spark uses ObjectMaterializer (most flexible).

❌ What Fails

Pattern	Result
Read-only properties `{ get; }`	Properties stay at default
No parameterless constructor (without matching ctor)	Exception

🎯 Recommended Patterns

Best: Mutable Class

public class Person
{
    public int Id { get; set; }
    public string Name { get; set; } = "";
    public decimal Salary { get; set; }
}

Works with all readers (CSV, JSON, YAML, Snowflake, Spark).

Good: Positional Record

public record Order(int Id, string Product, decimal Amount);

Works with all readers (CSV, JSON, YAML, Snowflake, Spark).

Note

YAML record support uses a Dictionary→ObjectMaterializer bridge. Flat records work perfectly; nested object records require mutable classes. See ObjectMaterializer-Limitations.md.

Good: Record with Properties

public record Employee
{
    public int Id { get; set; }
    public string Name { get; set; } = "";
}

Works with all readers.

🔗 Column Matching Rules

SchemaMemberResolver uses a 5-pass resolution pipeline (first match wins):

Pass	Strategy	Example
1	Exact (case-sensitive)	`Name` → `Name` ✅
2	Case-insensitive	`name` → `Name` ✅
3	Normalized (snake_case, camelCase, no-spaces, lowercase)	`first_name` → `FirstName` ✅
4	Resemblance (prefix/suffix/contains)	`CustomerName` → `Name` ✅
5	Levenshtein (≤2 edits)	`Nmae` → `Name` ✅

Note

Case-variant properties (v1.2.1): If your type has properties that differ only by case (e.g., Name, name, NAME), the materializer auto-detects this and uses case-sensitive mapping to preserve each property's identity.

Extra columns → Ignored
Missing columns → Default value

💡 Quick Examples

CSV

// ✅ Works
public class Row { public int Id { get; set; } public string Value { get; set; } = ""; }
public record Row(int Id, string Value);

// ❌ Fails (read-only)
public class Row { public int Id { get; } }

JSON

// ✅ Works - both patterns
public class Doc { public string Name { get; set; } = ""; }
public record Doc(string Name);

YAML

// ✅ Works - mutable class
public class Config { public string DbUrl { get; set; } = ""; }

// ✅ Works - positional record (flat properties)
public record Config(string DbUrl);

// ⚠️ Nested objects require mutable classes
public record Address(string City);          // ← flat record OK
public class Person { public Address Addr { get; set; } }  // ← nested needs class

Snowflake / Spark

// ✅ Works - snake_case columns match PascalCase properties
public class Order { public int OrderId { get; set; } public decimal TotalAmount { get; set; } }
// Matches: order_id, total_amount

// Spark only: explicit [Column] attribute
[Column("custom_col")]
public string PropertyName { get; set; }

📋 Checklist

Before using Read.Csv<T>(), Read.Json<T>(), etc.:

Has parameterless constructor (or matching primary constructor)
Properties have setters ({ get; set; } or { get; init; } for CSV/Snowflake/Spark)
For YAML: positional records work for flat types; use mutable classes for nested objects
For JSON: use public setters or positional records (no private setters or fields)
Property names roughly match column/key names

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Materialization Quick Reference

✅ What Works

❌ What Fails

🎯 Recommended Patterns

Best: Mutable Class

Good: Positional Record

Good: Record with Properties

🔗 Column Matching Rules

💡 Quick Examples

CSV

JSON

YAML

Snowflake / Spark

📋 Checklist

See Also

FilesExpand file tree

Materialization-Quick-Reference.md

Latest commit

History

Materialization-Quick-Reference.md

File metadata and controls

Materialization Quick Reference

✅ What Works

❌ What Fails

🎯 Recommended Patterns

Best: Mutable Class

Good: Positional Record

Good: Record with Properties

🔗 Column Matching Rules

💡 Quick Examples

CSV

JSON

YAML

Snowflake / Spark

📋 Checklist

See Also