Releases
v0.42.0
📦 Uncategorized
Syrmia/new sweeps
Update test sweeps for the system memory input buffer
#4181 : Add bfloat8_b dtype fix for tests that should support bfloat8_b
#4343 : Add new op sweeps for GS and WH
#0: (MINOR) Update to v0.42.0
#4311 : Automate determining and scheduling RC generation
Jedi main
#0: Remove path appends from test files
#4003 : Adding padding for whisper
#4632 : Add dprint server support for eth cores
#4003 : added ttnn.group_norm
#4003 : added ttnn.silu
#3999 : move fallback_ops.silu -> tt_lib.tensor.silu
#4683 : Support tracing
#0: Patch for bad state reached when enqueuing trace
Nshanker/remove pow of 2 req for channels size
#4003 : added ttnn.pad
#4730 : Adding ttnn.concat as fallback
#4003 : added ttnn.split
Syrmia/ttnn sweeps
#4347 : Move VGG tensors to L1
#4670 : Add end to end demo for functional roberta model
#4431 : mnist gs_demo benchmark
#4623 : lenet gs demo benchmarking [Pending CI]
#4720 : Improve folder structure of broken sweep tests
Adding interface to assign dispatch kernels to dispatch functionality and adding kernel to service remote command queue
#4003 : Fixing whisper pcc in last layer
#4003 : updated ttnn unit tests to assert using higher PCC thresholds
#4761 : Adding fallback for repeat_interleave
#4003 : simplified the logic in to_layout
#4003 : added ttnn.log
#4003 : updated ttnn.to_layout and ttnn.pad to do the right thing with padded shape
#0: Fix reference to Python integration test in README
#0: As a quick fix for now, source /etc/rc.local to re-insert number of hugepages back in after starting weka service in perf pipelines
#4003 : updated model names
#4617 : Matmul went to 0.9998887677925289 with float comparison to torch
#0: Fix bad access to memconfig/device when input tensors are on host
#4503 : Demo for functional bloom
#4611 : Add end to end test for ViT model with ImageNet data
#4506 : SSD gs demo benchmarking
#4504 : Add end to end demo for functional t5 model
#4557 : Uplift swin model to resolve errors in tests & Add test_perf_accuracy...
#4556 : Roberta gs demo benchmarking
#3974 : nanogpt uplift and move weights to weka path
#4610 : EfficientNet gs demo benchmark
#4003 : added more sweeps
#4231 : Fine-tune the unary ops for add, sub, div, mul binops with one scalar constant arg
#516 : Sanity check tracy artifact generation
#4003 : fixed crashing sweep tests
#0: Update get_semaphore to return 16B aligned semaphore addresses
#0: Add tracy dependencies to github actions runner workflows
#4730 : Add sweep test for ttnn.concat
Update ops for sharding used in falcon 40b
#4833 : Create initial ttnn sweeps with csv artifact upload
#4003 : debugging whisper
#4003 : Setting all = [] to block whild card imports
TTNN Sharded tensor support
#3662 : Impl moreh_clip_grad_norm
#4609 : Deit gs demo benchmarking
#4741 : Add sum op to tt_dnn
#4622 : Yolov3 GS demo Benchmarking
#0: Add weka mount + force hugepage mount with /etc/rc.local in frequent pipelines
#0: Reduce timeout of multi queue single device FD post commit
#4003 : Make ttnn sweep tests available from pytest
Add MaxPool2d to ttnn
Ttnn 4761 add sweep for repeat interleave
#0: Remove checkout secret
#4847 : Error out when there are insufficient num hugepages
simpler hugepage check
Revert "#4839 : simpler hugepage check"
#4862 : Disable test_moreh_clip_grad_norm_with_error_if_nonfinite
#4374 : Benchmarking for bloom TT model
#4505 : Add end to end demo for functional bert model
#4003 : updated documentation
#4003 : updated concat operation to raise an exception if the dimension is out of range
#0: Losen models perf tolerance for GS
#0: Add more instructions on syseng assets installation + direct users to additional hugepages setup if needed for cloud VMs
#4815 : New restart command which safely resets a command queue into a starting state
Revert "#4815 : New restart command which safely resets a command queue into a starting state"
You can’t perform that action at this time.