Release v0.42.0 · tenstorrent/tt-metal

📦 Uncategorized

Syrmia/new sweeps
- PR: #4390
Update test sweeps for the system memory input buffer
- PR: #4245
#4181: Add bfloat8_b dtype fix for tests that should support bfloat8_b
- PR: #4207
#4343: Add new op sweeps for GS and WH
- PR: #4408
#0: (MINOR) Update to v0.42.0
- PR: #4714
#4311: Automate determining and scheduling RC generation
- PR: #4713
Jedi main
- PR: #4690
#0: Remove path appends from test files
- PR: #4715
#4003: Adding padding for whisper
- PR: #4578
#4632: Add dprint server support for eth cores
- PR: #4709
#4003: added ttnn.group_norm
- PR: #4727
#4003: added ttnn.silu
- PR: #4731
#3999: move fallback_ops.silu -> tt_lib.tensor.silu
- PR: #4728
#4683: Support tracing
- PR: #4656
#0: Patch for bad state reached when enqueuing trace
- PR: #4735
Nshanker/remove pow of 2 req for channels size
- PR: #4693
#4003: added ttnn.pad
- PR: #4733
#4730: Adding ttnn.concat as fallback
- PR: #4738
#4003: added ttnn.split
- PR: #4737
Syrmia/ttnn sweeps
- PR: #4579
#4347: Move VGG tensors to L1
- PR: #4498
#4670: Add end to end demo for functional roberta model
- PR: #4718
#4431: mnist gs_demo benchmark
- PR: #4502
#4623: lenet gs demo benchmarking [Pending CI]
- PR: #4634
#4720: Improve folder structure of broken sweep tests
- PR: #4721
Adding interface to assign dispatch kernels to dispatch functionality and adding kernel to service remote command queue
- PR: #4615
#4003: Fixing whisper pcc in last layer
- PR: #4753
#4003: updated ttnn unit tests to assert using higher PCC thresholds
- PR: #4762
#4761: Adding fallback for repeat_interleave
- PR: #4767
#4003: simplified the logic in to_layout
- PR: #4766
#4003: added ttnn.log
- PR: #4769
#4003: updated ttnn.to_layout and ttnn.pad to do the right thing with padded shape
- PR: #4770
#0: Fix reference to Python integration test in README
- PR: #4784
#0: As a quick fix for now, source /etc/rc.local to re-insert number of hugepages back in after starting weka service in perf pipelines
- PR: #4807
#4003: updated model names
- PR: #4771
#4617: Matmul went to 0.9998887677925289 with float comparison to torch
- PR: #4812
#0: Fix bad access to memconfig/device when input tensors are on host
- PR: #4716
#4503: Demo for functional bloom
- PR: #4554
#4611: Add end to end test for ViT model with ImageNet data
- PR: #4749
#4506: SSD gs demo benchmarking
- PR: #4585
#4504: Add end to end demo for functional t5 model
- PR: #4649
#4557: Uplift swin model to resolve errors in tests & Add test_perf_accuracy...
- PR: #4774
#4556: Roberta gs demo benchmarking
- PR: #4627
#3974: nanogpt uplift and move weights to weka path
- PR: #4221
#4610: EfficientNet gs demo benchmark
- PR: #4633
#4003: added more sweeps
- PR: #4813
#4231: Fine-tune the unary ops for add, sub, div, mul binops with one scalar constant arg
- PR: #4768
#516: Sanity check tracy artifact generation
- PR: #4545
#4003: fixed crashing sweep tests
- PR: #4829
#0: Update get_semaphore to return 16B aligned semaphore addresses
- PR: #4820
#0: Add tracy dependencies to github actions runner workflows
- PR: #4835
#4730: Add sweep test for ttnn.concat
- PR: #4830
Update ops for sharding used in falcon 40b
- PR: #4806
#4833: Create initial ttnn sweeps with csv artifact upload
- PR: #4834
#4003: debugging whisper
- PR: #4746
#4003: Setting all = [] to block whild card imports
- PR: #4832
TTNN Sharded tensor support
- PR: #4597
#3662: Impl moreh_clip_grad_norm
- PR: #4743
#4609: Deit gs demo benchmarking
- PR: #4628
#4741: Add sum op to tt_dnn
- PR: #4744
#4622: Yolov3 GS demo Benchmarking
- PR: #4719
#0: Add weka mount + force hugepage mount with /etc/rc.local in frequent pipelines
- PR: #4827
#0: Reduce timeout of multi queue single device FD post commit
- PR: #4850
#4003: Make ttnn sweep tests available from pytest
- PR: #4819
Add MaxPool2d to ttnn
- PR: #4831
Ttnn 4761 add sweep for repeat interleave
- PR: #4841
#0: Remove checkout secret
- PR: #4856
#4847: Error out when there are insufficient num hugepages
- PR: #4860
simpler hugepage check
- PR: #4839
Revert "#4839: simpler hugepage check"
- PR: #4865
#4862: Disable test_moreh_clip_grad_norm_with_error_if_nonfinite
- PR: #4867
#4374: Benchmarking for bloom TT model
- PR: #4772
#4505: Add end to end demo for functional bert model
- PR: #4582
#4003: updated documentation
- PR: #4876
#4003: updated concat operation to raise an exception if the dimension is out of range
- PR: #4853
#0: Losen models perf tolerance for GS
- PR: #4879
#0: Add more instructions on syseng assets installation + direct users to additional hugepages setup if needed for cloud VMs
- PR: #4884
#4815: New restart command which safely resets a command queue into a starting state
- PR: #4816
Revert "#4815: New restart command which safely resets a command queue into a starting state"
- PR: #4887

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.42.0

📦 Uncategorized