352a460 | Yong He | 13 April 2023, 16:04:18 UTC | Fix stack overflow in lookupWitness lowering pass. (#2798) Co-authored-by: Yong He <yhe@nvidia.com> | 13 April 2023, 16:04:18 UTC |
59a6035 | Ellie Hermaszewska | 13 April 2023, 16:00:56 UTC | Set the executable bit on Executable artifact files (#2796) * Set the executable bit on Executable artifact files * Don't zero out other permission bits in makeExecutable | 13 April 2023, 16:00:56 UTC |
c7e5601 | Ellie Hermaszewska | 13 April 2023, 15:49:00 UTC | Matrix swizzle writes (#2713) * Add a bunch of builder emit wrappers for constant indices To avoid cluttering any calling code with int instruction construction * Matrix swizzle stores Closes https://github.com/shader-slang/slang/issues/2512 * Matrix swizzle store tests * Squash vs warnings * Select scalar for singular swizzles * Test singular swizzle materialization * Use IRIntegerValue over UInt for IR wrappers * Correct size of swizzle vector type * Remove variable shadowing | 13 April 2023, 15:49:00 UTC |
6fbd892 | Ellie Hermaszewska | 13 April 2023, 13:53:44 UTC | Correct http header construction (#2795) Previously we were always setting mimeType as "mimeType" | 13 April 2023, 13:53:44 UTC |
ca7bf79 | Yong He | 13 April 2023, 05:58:22 UTC | Combine lookupWitness lowering with specialization. (#2794) | 13 April 2023, 05:58:22 UTC |
12ec9b8 | Ellie Hermaszewska | 12 April 2023, 21:01:06 UTC | Downgrade github runner to 20.04 (#2792) * Downgrade github runner to 20.04 Specify gcc10 also * Select gcc with update-alternatives * Ignore persistent cache test on linux. --------- Co-authored-by: Yong He <yonghe@outlook.com> | 12 April 2023, 21:01:06 UTC |
947a78d | Yong He | 12 April 2023, 20:30:03 UTC | Update 02-conventional-features.md | 12 April 2023, 20:30:03 UTC |
d631ef9 | jsmall-nvidia | 12 April 2023, 20:24:08 UTC | Reading artifact hierarchy from file system interface (#2787) * #include an absolute path didn't work - because paths were taken to always be relative. * WIP simplifying artifact interface. * Use ContainedKind. * Remove LazyCastableList. Use ContainedKind for find. * Remove ICastableList. * Remove need for ICastableList. * Remove IArtifactContainer. * Small fixes. * Small improvements around Artifact. * Make explicit find is for *representations* that can cast. Fix bug in handling casting in lookup. * Made associated items artifacts too. * Small fixes. * Small improvements around writing a container. * WIP artifact container format. * Make the root a special case. * Special case if the artifact doesn't have children/associated. * First pass handling of interpretting a file system into artifact hierarchy. * Explain the final structure. Make the file system available. * Remove addArtifact from IArtifact interface - means will be compatible with previous version. * Rename function to get compile result as a filesytem. | 12 April 2023, 20:24:08 UTC |
2ce42a2 | jsmall-nvidia | 12 April 2023, 17:22:53 UTC | Fix for crashes around FileCheck (#2793) * #include an absolute path didn't work - because paths were taken to always be relative. * Fix scoping issue for filecheck lib. * Small fix for file check issue. | 12 April 2023, 17:22:53 UTC |
eda9dd3 | jsmall-nvidia | 12 April 2023, 16:06:41 UTC | Artifact Container (#2783) * #include an absolute path didn't work - because paths were taken to always be relative. * WIP simplifying artifact interface. * Use ContainedKind. * Remove LazyCastableList. Use ContainedKind for find. * Remove ICastableList. * Remove need for ICastableList. * Remove IArtifactContainer. * Small fixes. * Small improvements around Artifact. * Make explicit find is for *representations* that can cast. Fix bug in handling casting in lookup. * Made associated items artifacts too. * Small fixes. * Small improvements around writing a container. * WIP artifact container format. * Make the root a special case. * Special case if the artifact doesn't have children/associated. | 12 April 2023, 16:06:41 UTC |
4e9ca28 | Yong He | 12 April 2023, 01:24:43 UTC | Update documentation for `TorchTensor.alloc`. | 12 April 2023, 01:24:43 UTC |
195af97 | Yong He | 12 April 2023, 01:14:49 UTC | Fix missing `f` suffix for float lits in CUDA backend. (#2791) Co-authored-by: Yong He <yhe@nvidia.com> | 12 April 2023, 01:14:49 UTC |
7c3a40c | Yong He | 11 April 2023, 22:11:45 UTC | Small fixes to TorchTensor. (#2790) Co-authored-by: Yong He <yhe@nvidia.com> | 11 April 2023, 22:11:45 UTC |
54f112f | jsmall-nvidia | 11 April 2023, 20:56:59 UTC | WIP Hack to work around IFileCheck release (#2789) * #include an absolute path didn't work - because paths were taken to always be relative. * Upgrade to slang-llvm v13.x-38 * Hack to not release IFileCheck on windows debug. * Update to slang-llvm build with cast. * Disable IFileCheck * Disable IFileCheck. | 11 April 2023, 20:56:59 UTC |
1370b3b | jsmall-nvidia | 11 April 2023, 17:20:59 UTC | Upgrade to slang-llvm v13.x 38 (#2788) * #include an absolute path didn't work - because paths were taken to always be relative. * Upgrade to slang-llvm v13.x-38 | 11 April 2023, 17:20:59 UTC |
d6dd38f | Ellie Hermaszewska | 11 April 2023, 15:28:58 UTC | Implement FileCheck tests for several test commands (#2747) * Add missing expected.txt for test * Diagnostics -> StdWriters in render test * Allow specifying several test prefixes to run `slang-test -- tests/foo tests/bar` * Squash warnings in some tests * Enable gfx debug layer in gfx test util Makes this issue present consistently: https://github.com/shader-slang/slang/issues/2766 * Allow DebugDevice to return interfaces instantiated by the debugged object * Check that we actaully have a shader cache for shader cache tests * Implement FileCheck tests for several test commands - SIMPLE, SIMPLE_EX - SIMPLE_LINE - REFLECTION, CPU_REFLECTION - CROSS_COMPILE It does not currently support the render tests or the COMPARE_COMPUTE commands It is invoked by adding `(filecheck=MY_FILECHECK_PREFIX)` to the test command, for example TEST:CROSS_COMPILE(filecheck=SPIRV): -target spirv-assembly * Move LLVM FileCheck interface to slang-llvm * Neaten slang-test tests * Refine handling of expected output in slang-test * Add example FileCheck buffer test * Add cuda-kernel-export tests Which were waiting on FileCheck * Bump vs project files * Make createLLVMFileCheck_V1 return a void* rather than specifically an IFileCheck * Remove use of CharSlice from filecheck interface * Bump slang-llvm version --------- Co-authored-by: jsmall-nvidia <jsmall@nvidia.com> | 11 April 2023, 15:28:58 UTC |
20ea33f | jsmall-nvidia | 11 April 2023, 14:18:18 UTC | Artifact simplification (#2781) * #include an absolute path didn't work - because paths were taken to always be relative. * WIP simplifying artifact interface. * Use ContainedKind. * Remove LazyCastableList. Use ContainedKind for find. * Remove ICastableList. * Remove need for ICastableList. * Remove IArtifactContainer. * Small fixes. * Small improvements around Artifact. * Make explicit find is for *representations* that can cast. Fix bug in handling casting in lookup. * Made associated items artifacts too. * Small fixes. * Small improvements around writing a container. | 11 April 2023, 14:18:18 UTC |
d934bbc | Yong He | 10 April 2023, 21:36:39 UTC | Fix inlining. (#2786) Co-authored-by: Yong He <yhe@nvidia.com> | 10 April 2023, 21:36:39 UTC |
d82992e | Yong He | 10 April 2023, 20:43:18 UTC | Cleaner impl of unary stdlib derivative functions. (#2785) * Cleaner impl of unary stdlib derivative functions. * fixup * Fix. --------- Co-authored-by: Yong He <yhe@nvidia.com> | 10 April 2023, 20:43:18 UTC |
ea15647 | Yong He | 07 April 2023, 17:12:00 UTC | Diagnose on attempt to specialize with interface type. (#2780) * Diagnose on attempt to specialize with interface type. Fixes ##1445. * Enable fixed test. * Fix test. * Fix. --------- Co-authored-by: Yong He <yhe@nvidia.com> | 07 April 2023, 17:12:00 UTC |
0468cd0 | Ellie Hermaszewska | 07 April 2023, 17:11:27 UTC | Add SLANG_IID_PPV_ARGS (#2784) * Add SLANG_IID_PPV_ARGS To mirror IID_PPV_ARGS from the COM Coding Practices: https://learn.microsoft.com/en-us/windows/win32/LearnWin32/com-coding-practices#the-iid_ppv_args-macro * Make getTypeGuid constexpr --------- Co-authored-by: Yong He <yonghe@outlook.com> | 07 April 2023, 17:11:27 UTC |
7bbe7b4 | Sai Praveen Bangaru | 07 April 2023, 13:57:39 UTC | Fix crash on overloaded custom derivative function (#2782) * Fix issue with resolving overloaded custom forward derivative methods. * Add test | 07 April 2023, 13:57:39 UTC |
c9eb594 | Yong He | 05 April 2023, 20:37:22 UTC | Warn on dangling comparison operator. (#2779) Fixes #1685 Co-authored-by: Yong He <yhe@nvidia.com> | 05 April 2023, 20:37:22 UTC |
dc45802 | Yong He | 05 April 2023, 18:19:00 UTC | Fix linking issue in slangpy + no mask param for kernels. (#2778) * Fix linking issue in slangpy + no mask param for kernels. * add cuda header changes * fix * More correct change of active mask insertion. --------- Co-authored-by: Yong He <yhe@nvidia.com> | 05 April 2023, 18:19:00 UTC |
5d0fe48 | Sai Praveen Bangaru | 05 April 2023, 18:08:04 UTC | Register `IDifferentiable` witnesses for the differential associated types of registered types. (#2777) * Create high-order-forward-diff-struct.slang * Add a small pass to register differentials for differentials of registered types * Remove type reg pass and apply logic in differentiable type loading instead * Update slang-ir-autodiff.cpp * Fix comments. --------- Co-authored-by: Yong He <yonghe@outlook.com> | 05 April 2023, 18:08:04 UTC |
492c6f2 | Yong He | 04 April 2023, 22:29:36 UTC | Diagnose on using assignment as predicate expr. (#2774) Co-authored-by: Yong He <yhe@nvidia.com> | 04 April 2023, 22:29:36 UTC |
68c7d5c | jsmall-nvidia | 04 April 2023, 22:00:16 UTC | Preliminary support for realtime clock (#2772) * #include an absolute path didn't work - because paths were taken to always be relative. * Initial support for realtime clock. * Add realtime-clock render feature where seems appropriate. * Fixes to make NVAPI compile properly. Change realtime-clock.slang check to use maths that can't overflow. | 04 April 2023, 22:00:16 UTC |
7bb2de1 | jsmall-nvidia | 04 April 2023, 21:20:05 UTC | Simplification around ArtifactRepresentation and unique identity (#2771) * #include an absolute path didn't work - because paths were taken to always be relative. * WIP in handling artifacts/SourceFiles. * Add getUniqueIdentity to IPathArtifactRepresentation * Simplification because around using the representation to provide the uniqueIdentity. | 04 April 2023, 21:20:05 UTC |
34a2fd5 | Sai Praveen Bangaru | 04 April 2023, 19:21:23 UTC | Fix crash on encountering undefined identifier when checking derivative attributes (#2773) * Fix compiler crash if referencing undefined method in the deriavative decorator * Delete sample.comp --------- Co-authored-by: Yong He <yonghe@outlook.com> | 04 April 2023, 19:21:23 UTC |
206b546 | Yong He | 04 April 2023, 19:18:55 UTC | Update a1-02-slangpy.md | 04 April 2023, 19:18:55 UTC |
5e5c42f | Yong He | 03 April 2023, 23:56:06 UTC | Update a1-02-slangpy.md | 03 April 2023, 23:56:06 UTC |
b68516e | Yong He | 03 April 2023, 22:54:16 UTC | Emit simpler vector element access code. (#2770) * Emit simpler vector element access code * Fix. --------- Co-authored-by: Yong He <yhe@nvidia.com> | 03 April 2023, 22:54:16 UTC |
7a346b2 | Ellie Hermaszewska | 03 April 2023, 03:47:10 UTC | Squash some warnings (#2765) Co-authored-by: Yong He <yonghe@outlook.com> | 03 April 2023, 03:47:10 UTC |
271ae71 | Ellie Hermaszewska | 03 April 2023, 03:46:03 UTC | Fix several silently failing tests (#2767) * Add missing expected.txt for test * Diagnostics -> StdWriters in render test * Allow specifying several test prefixes to run `slang-test -- tests/foo tests/bar` * Squash warnings in some tests * Enable gfx debug layer in gfx test util Makes this issue present consistently: https://github.com/shader-slang/slang/issues/2766 * Allow DebugDevice to return interfaces instantiated by the debugged object * Check that we actaully have a shader cache for shader cache tests --------- Co-authored-by: Yong He <yonghe@outlook.com> | 03 April 2023, 03:46:03 UTC |
d7ba60c | Yong He | 03 April 2023, 03:43:09 UTC | Fix type legalization pass. (#2768) | 03 April 2023, 03:43:09 UTC |
539f17a | Yong He | 31 March 2023, 23:21:33 UTC | Update a1-02-slangpy.md | 31 March 2023, 23:21:33 UTC |
d1a49ed | Yong He | 31 March 2023, 21:26:20 UTC | Update TOC (#2764) Co-authored-by: Yong He <yhe@nvidia.com> | 31 March 2023, 21:26:20 UTC |
321e8ee | Yong He | 31 March 2023, 21:20:23 UTC | Add a complex example. to slangpy documentation. (#2763) Co-authored-by: Yong He <yhe@nvidia.com> | 31 March 2023, 21:20:23 UTC |
3ce11c3 | Yong He | 31 March 2023, 15:56:03 UTC | Fix typos in user-guide. (#2762) Co-authored-by: Yong He <yhe@nvidia.com> | 31 March 2023, 15:56:03 UTC |
6a2bf87 | Ellie Hermaszewska | 31 March 2023, 00:57:31 UTC | Detect when calls to execv fail (#2751) * Detect when calls to execv fail * Use pipe+fcntl instead of pipe2 pipe2 is not available on macOS * remove outdated comment * Use _exit in child if execv fails * Prevent Process::create leaks on some failures, and more robust in setting std streams for child --------- Co-authored-by: Yong He <yonghe@outlook.com> | 31 March 2023, 00:57:31 UTC |
efeda20 | Yong He | 30 March 2023, 21:34:54 UTC | Fix stdlib definitions for tensor interlocked methods. (#2761) Co-authored-by: Yong He <yhe@nvidia.com> | 30 March 2023, 21:34:54 UTC |
917416f | Yong He | 30 March 2023, 19:50:02 UTC | More builtin library support in torch backend. (#2760) Co-authored-by: Yong He <yhe@nvidia.com> | 30 March 2023, 19:50:02 UTC |
e3b701c | Ellie Hermaszewska | 30 March 2023, 19:48:02 UTC | Enable CUDA render api on unix (#2757) * Remove extra qualification in cuda device impl Only MSVC accepts this illegal code * Enable CUDA render api on unix --------- Co-authored-by: Yong He <yonghe@outlook.com> | 30 March 2023, 19:48:02 UTC |
72f7928 | Ellie Hermaszewska | 30 March 2023, 19:46:27 UTC | Fix mismatched malloc/delete[] in toWString (#2758) Co-authored-by: Yong He <yonghe@outlook.com> | 30 March 2023, 19:46:27 UTC |
d01e28a | Yong He | 30 March 2023, 18:41:54 UTC | Fix autodiff pass duplicates exported functions. (#2759) Co-authored-by: Yong He <yhe@nvidia.com> | 30 March 2023, 18:41:54 UTC |
37594df | Yong He | 30 March 2023, 04:06:23 UTC | Update TOC (#2756) Co-authored-by: Yong He <yhe@nvidia.com> | 30 March 2023, 04:06:23 UTC |
6fa4edb | Yong He | 30 March 2023, 01:23:21 UTC | Convert tensor types in `make_tensor_view`. (#2755) Co-authored-by: Yong He <yhe@nvidia.com> | 30 March 2023, 01:23:21 UTC |
af062bf | Yong He | 30 March 2023, 01:00:34 UTC | Fix IRArrayType emit logic. (#2754) * Fix IRArrayType emit logic. * Fix test. * Fix ast constant folding. --------- Co-authored-by: Yong He <yhe@nvidia.com> | 30 March 2023, 01:00:34 UTC |
082c48d | Yong He | 30 March 2023, 00:05:07 UTC | Update checkpoint policy to make obvious recompute decisions. (#2753) * Update checkpoint policy to make obvious recompute decisions. Also adds an optimization to fold updateElement chains on the same array or struct into a single makeArray or makeStruct. * Bug fixes around array types with different int typed count. * change test. * Fix. --------- Co-authored-by: Yong He <yhe@nvidia.com> | 30 March 2023, 00:05:07 UTC |
a862f5b | Yong He | 29 March 2023, 17:00:48 UTC | Update a1-02-slangpy.md | 29 March 2023, 17:00:48 UTC |
bbf12c2 | Ellie Hermaszewska | 29 March 2023, 16:25:15 UTC | Add missing initializer in json source map decoding (#2752) | 29 March 2023, 16:25:15 UTC |
64fe75c | Yong He | 29 March 2023, 01:01:42 UTC | Update a1-02-slangpy.md | 29 March 2023, 01:01:42 UTC |
87d331c | Yong He | 29 March 2023, 00:28:38 UTC | Update a1-02-slangpy.md | 29 March 2023, 00:28:38 UTC |
102702c | Yong He | 29 March 2023, 00:20:54 UTC | Update a1-02-slangpy.md | 29 March 2023, 00:20:54 UTC |
cce9094 | Yong He | 29 March 2023, 00:20:36 UTC | Update a1-02-slangpy.md | 29 March 2023, 00:20:36 UTC |
3ca504f | Yong He | 29 March 2023, 00:16:07 UTC | Update a1-02-slangpy.md | 29 March 2023, 00:16:07 UTC |
0ea68f2 | Yong He | 28 March 2023, 23:55:34 UTC | Update a1-02-slangpy.md | 28 March 2023, 23:55:34 UTC |
1f9f604 | Yong He | 28 March 2023, 23:51:56 UTC | Update a1-02-slangpy.md | 28 March 2023, 23:51:56 UTC |
240a29b | Yong He | 28 March 2023, 23:33:50 UTC | Update a1-02-slangpy.md | 28 March 2023, 23:33:50 UTC |
dba963d | Yong He | 28 March 2023, 23:27:45 UTC | Update toc (#2749) Co-authored-by: Yong He <yhe@nvidia.com> | 28 March 2023, 23:27:45 UTC |
92ba905 | Yong He | 28 March 2023, 23:26:42 UTC | Update a1-02-slangpy.md | 28 March 2023, 23:26:42 UTC |
a61f089 | Yong He | 28 March 2023, 22:19:03 UTC | Add slangpy doc, fix cuda prelude. (#2748) * Add slangpy doc, fix cuda prelude. * more bug fix. * fix. * fix. * More fix. * fix. * f * fix prelude. * update prelude. * update doc * Update prelude. * add zeros_like * update doc. --------- Co-authored-by: Yong He <yhe@nvidia.com> | 28 March 2023, 22:19:03 UTC |
8f03af5 | Sai Praveen Bangaru | 28 March 2023, 19:39:52 UTC | AD: Warped-Area-Sampling test works now. (#2742) * Create render.slang * Added higher-order differentiability decorators for built-ins + preliminary tests * Update diff.meta.slang * Copy over conformance synthesis code to `DifferentiableTypeConformanceContext` * Update render.slang * Fixed 1D warped-area sampling test * Update warped-sampling-1d.slang * Remove commented line. * Change WAS test to use fixed point * Replaced InterlockedCmpExchange with InterlockedAdd * Increase fixed point precision * Reduce floating-point precision by 2 digits to avoid platform-specific problems * Dropped another digit (just to be safe) --------- Co-authored-by: Yong He <yonghe@outlook.com> | 28 March 2023, 19:39:52 UTC |
e22b4db | Yong He | 28 March 2023, 18:12:40 UTC | Small fixes and cleanups on CUDA/CPP codegen. (#2746) * Small fixes and cleanups on CUDA/CPP codegen. * Disable `legalizeEmptyTypes` for now. --------- Co-authored-by: Yong He <yhe@nvidia.com> | 28 March 2023, 18:12:40 UTC |
0a69260 | Yong He | 28 March 2023, 06:00:42 UTC | Translate all composed types into tuple types in pyBind. (#2744) * Translate all composed types into tuple types in pyBind. * Delete temp file. * Fix get tuple element code emit logic. --------- Co-authored-by: Yong He <yhe@nvidia.com> | 28 March 2023, 06:00:42 UTC |
d120fec | jsmall-nvidia | 28 March 2023, 02:52:48 UTC | Upgrade `slang-llvm` (#2741) * #include an absolute path didn't work - because paths were taken to always be relative. * Fix SlangCompileTarget to keep ordering. * Add test. Remove V2 version of interface to access IDownstreamCompiler Update to slang-llvm which has _chkstk support. * Update slang.h Co-authored-by: Ellie Hermaszewska <github@sub.monoid.al> --------- Co-authored-by: Yong He <yonghe@outlook.com> Co-authored-by: Ellie Hermaszewska <github@sub.monoid.al> | 28 March 2023, 02:52:48 UTC |
579870b | Yong He | 28 March 2023, 02:51:21 UTC | Apply IR simplifcation immediately after specialization to avoid duplicates. (#2739) * Apply IR simplifcation immediately after specialization to avoid duplicates. * Update source/slang/slang-ir-specialize.cpp Co-authored-by: Ellie Hermaszewska <github@sub.monoid.al> --------- Co-authored-by: Yong He <yhe@nvidia.com> Co-authored-by: Ellie Hermaszewska <github@sub.monoid.al> | 28 March 2023, 02:51:21 UTC |
84dc318 | Yong He | 28 March 2023, 01:39:44 UTC | Don't touch output file if content did not change. (#2738) Co-authored-by: Yong He <yhe@nvidia.com> | 28 March 2023, 01:39:44 UTC |
ca1f93a | jsmall-nvidia | 27 March 2023, 17:57:42 UTC | Using SourceMap for location output (#2736) * #include an absolute path didn't work - because paths were taken to always be relative. * WIP using SourceMap with SourceManager. * Add a test to check obfuscation map is working. --------- Co-authored-by: Yong He <yonghe@outlook.com> | 27 March 2023, 17:57:42 UTC |
2179480 | Yong He | 27 March 2023, 17:05:07 UTC | Fix lowering crash in [BackwardDerivativeOf]. (#2737) Co-authored-by: Yong He <yhe@nvidia.com> | 27 March 2023, 17:05:07 UTC |
333968a | Ellie Hermaszewska | 27 March 2023, 09:48:46 UTC | Use c++17 standard (#2686) * Use c++17 standard * regenerate VS projects * Switch to using std::invoke | 27 March 2023, 09:48:46 UTC |
ffa7d05 | Yong He | 26 March 2023, 23:39:51 UTC | Update slang-llvm (#2735) | 26 March 2023, 23:39:51 UTC |
d64ee86 | Yong He | 26 March 2023, 20:59:11 UTC | Add PyTorch C++ binding generation. (#2734) * Add PyTorch C++ binding generation. * fix --------- Co-authored-by: Yong He <yhe@nvidia.com> | 26 March 2023, 20:59:11 UTC |
666af09 | Yong He | 25 March 2023, 03:17:46 UTC | Switch to short circuiting semantics for scalar `?:` operator. (#2733) | 25 March 2023, 03:17:46 UTC |
7292edb | Sai Praveen Bangaru | 24 March 2023, 23:50:51 UTC | Added `[BackwardDifferentiable]` tags for intrinsic + builtin methods (#2732) * Added higher-order differentiability decorators for built-ins + preliminary tests * Update diff.meta.slang | 24 March 2023, 23:50:51 UTC |
e794de0 | jsmall-nvidia | 24 March 2023, 13:56:59 UTC | Obfuscated source map writing (#2727) * #include an absolute path didn't work - because paths were taken to always be relative. * WIP produce obfuscated source map and write when container is specified. * Make the sourcemap generated name stable. | 24 March 2023, 13:56:59 UTC |
03c1083 | Yong He | 24 March 2023, 11:33:51 UTC | Fix nested bwdContextType lowering. (#2731) Co-authored-by: Yong He <yhe@nvidia.com> | 24 March 2023, 11:33:51 UTC |
56a84a0 | Yong He | 24 March 2023, 05:42:59 UTC | Fix various autodiff crashes related to interface usage. (#2730) * Fix crash. * Fix `[ForwradDerivative]` on member functions. * Update comments. * Fix crash when [BackwardDerivative] is provided but not [ForwardDerivative]. * Allow calling dynamic dispatched generic method from differentiable func. * Fix. --------- Co-authored-by: Yong He <yhe@nvidia.com> | 24 March 2023, 05:42:59 UTC |
e004511 | Sai Praveen Bangaru | 24 March 2023, 02:27:30 UTC | AD: Fix type checking for higher-order custom derivatives definitions (#2729) * Fixed type coercion issue with higher-order user defined methods * Placed associated type lookup method in a loop * Update high-order-user-defined-derivative.slang * Revert changes to associated type lookup method | 24 March 2023, 02:27:30 UTC |
6e4eae1 | Yong He | 24 March 2023, 00:16:32 UTC | Hack handling of primal insts that has a function type. (#2728) * Update diff-bwd material test * Minor update * Hack handling of primal insts that has a function type. --------- Co-authored-by: winmad <winmad.wlf@gmail.com> Co-authored-by: Yong He <yhe@nvidia.com> | 24 March 2023, 00:16:32 UTC |
50e7d97 | Yong He | 23 March 2023, 23:59:02 UTC | Fix optimization pass not converging. (#2725) * Fix optimization pass not converging. * Fix. * Fix tests. --------- Co-authored-by: Yong He <yhe@nvidia.com> | 23 March 2023, 23:59:02 UTC |
85f0058 | Yong He | 23 March 2023, 18:37:29 UTC | Fix scope fixing for address insts. (#2724) Co-authored-by: Yong He <yhe@nvidia.com> | 23 March 2023, 18:37:29 UTC |
34acec2 | Yong He | 23 March 2023, 06:59:22 UTC | Fix generic lowering regression due to IR deduplication. (#2723) * Fix generic lowering. * Fix generic lowering regression due to IR deduplication. --------- Co-authored-by: Yong He <yhe@nvidia.com> | 23 March 2023, 06:59:22 UTC |
44ab095 | winmad | 23 March 2023, 05:04:28 UTC | Create a simplified Falcor material system in slang test (#2721) * Duplicate an example of differentiating Falcor's material system * Add a second BSDF * WIP: update test; grad didn't pass through * Finalize test --------- Co-authored-by: Lifan Wu <lifanw@nvidia.com> | 23 March 2023, 05:04:28 UTC |
259a015 | Yong He | 23 March 2023, 04:16:35 UTC | Type legalization and autodiff bug fixes. (#2722) * Bug fixes. * Fix. * Only perform autodiff for functions whose derivative is actually used. * Fix loop optimize bug. * Fix high order diff. * Fix trivial diff func generation. * Fixes. * Cleanup. --------- Co-authored-by: Yong He <yhe@nvidia.com> | 23 March 2023, 04:16:35 UTC |
d4f99c8 | jsmall-nvidia | 22 March 2023, 16:04:33 UTC | Source map obfuscation (#2717) * #include an absolute path didn't work - because paths were taken to always be relative. * WIP source map. * Split out handling of RttiTypeFuncs to a map type. * Make RttiTypeFuncsMap hold default impls. * Slightly more sophisticated RttiTypeFuncsMap * Source map decoding. * Fix tabs. * Fix asserts due to negative values. * Use less obscure mechanisms in SourceMap. * Source map decoding. Simplifying SourceMap usage. * First attempt at ouputting a source map as part of emit. * Added support for -source-map option. SourceMap is added to the artifact. * Small improvements around column calculation in SourceWriter. * Source Loc obuscation WIP. * Fix some issues around SourceMap obfuscation. * Split out obfuscation into its own file. * Keep obfuscated SourceMap even through serialization bottleneck. | 22 March 2023, 16:04:33 UTC |
d8a40ab | Yong He | 22 March 2023, 04:29:13 UTC | [TreatAsDifferentiable] functions. (#2720) | 22 March 2023, 04:29:13 UTC |
8387673 | Yong He | 22 March 2023, 03:03:12 UTC | Fix associated type resolution bug. (#2719) * Fix associated type resolution bug. * Fix. * Fix language server hinting messed up by breadcrumb nodes. --------- Co-authored-by: Yong He <yhe@nvidia.com> | 22 March 2023, 03:03:12 UTC |
5d2c5dd | Yong He | 21 March 2023, 22:44:43 UTC | Fix IntVal type check. (#2718) * Fix IntVal type check. * add test case result. --------- Co-authored-by: Yong He <yhe@nvidia.com> | 21 March 2023, 22:44:43 UTC |
96caba7 | Yong He | 21 March 2023, 22:44:21 UTC | Add texture tri-linear autodiff example. (#2715) * Add quad texture example. * delete output image * remove irrelavent files * update project files * fix * Update example. * Fix. * remove out-texture --------- Co-authored-by: Yong He <yhe@nvidia.com> | 21 March 2023, 22:44:21 UTC |
7f11f88 | Yong He | 17 March 2023, 22:57:22 UTC | Add support for emitting cuda kernel and host functions. (#2712) * Add support for emitting cuda kernel and host functions. * Update test. * Fix cuda preamble emit. --------- Co-authored-by: Yong He <yhe@nvidia.com> | 17 March 2023, 22:57:22 UTC |
4b55bf6 | Yong He | 17 March 2023, 22:14:44 UTC | Rework differentiability dataflow check. (#2711) Co-authored-by: Yong He <yhe@nvidia.com> | 17 March 2023, 22:14:44 UTC |
29abe39 | Yong He | 17 March 2023, 17:53:48 UTC | Update 07-autodiff.md | 17 March 2023, 17:53:48 UTC |
8a61b9d | jsmall-nvidia | 17 March 2023, 15:05:15 UTC | Support for producing SourceMap on emit (#2707) * #include an absolute path didn't work - because paths were taken to always be relative. * WIP source map. * Split out handling of RttiTypeFuncs to a map type. * Make RttiTypeFuncsMap hold default impls. * Slightly more sophisticated RttiTypeFuncsMap * Source map decoding. * Fix tabs. * Fix asserts due to negative values. * Use less obscure mechanisms in SourceMap. * Source map decoding. Simplifying SourceMap usage. * First attempt at ouputting a source map as part of emit. * Added support for -source-map option. SourceMap is added to the artifact. | 17 March 2023, 15:05:15 UTC |
9476d45 | Yong He | 17 March 2023, 06:46:14 UTC | Fix Phi simplification bug. (#2710) * Fix Phi simplification bug. * Fix up. * Fix. * Fix. * Fix. * Fix. * Fix. * Fix test. * Fix test. --------- Co-authored-by: Yong He <yhe@nvidia.com> | 17 March 2023, 06:46:14 UTC |
77d3630 | Yong He | 17 March 2023, 05:17:51 UTC | Fix name mangling of `FuncCallIntVal` in func signatures. (#2709) * Fix name mangling of `FuncCallIntVal` in func signatures. * Enhance the test. * Fix. * Fix. --------- Co-authored-by: Yong He <yhe@nvidia.com> | 17 March 2023, 05:17:51 UTC |
fc9cba5 | Yong He | 17 March 2023, 05:17:34 UTC | Add `[CudaDeviceExport]` to allow exporting CUDA device functions. (#2708) * Add `[CudaDeviceExport]` to allow exporting CUDA device functions. * Fix. --------- Co-authored-by: Yong He <yhe@nvidia.com> | 17 March 2023, 05:17:34 UTC |
2fd1ac6 | jsmall-nvidia | 17 March 2023, 05:13:59 UTC | SourceMap encoding (#2706) * #include an absolute path didn't work - because paths were taken to always be relative. * WIP source map. * Split out handling of RttiTypeFuncs to a map type. * Make RttiTypeFuncsMap hold default impls. * Slightly more sophisticated RttiTypeFuncsMap * Source map decoding. * Fix tabs. * Fix asserts due to negative values. * Use less obscure mechanisms in SourceMap. * Source map decoding. Simplifying SourceMap usage. | 17 March 2023, 05:13:59 UTC |
4cb899f | jsmall-nvidia | 16 March 2023, 19:19:20 UTC | Preliminary SourceMap support (#2701) * #include an absolute path didn't work - because paths were taken to always be relative. * WIP source map. * Split out handling of RttiTypeFuncs to a map type. * Make RttiTypeFuncsMap hold default impls. * Slightly more sophisticated RttiTypeFuncsMap * Source map decoding. * Fix tabs. * Fix asserts due to negative values. * Use less obscure mechanisms in SourceMap. | 16 March 2023, 19:19:20 UTC |
1036d1a | Yong He | 16 March 2023, 18:06:01 UTC | Add test case for `makeStruct` transcription. (#2705) * Add test case for `makeStruct` transcription. * Update documentation on `no_diff`. * Update toc --------- Co-authored-by: Yong He <yhe@nvidia.com> | 16 March 2023, 18:06:01 UTC |