swh:1:snp:a72e953ecd624a7df6e6196bbdd05851996c5e40

sort by:
Revision Author Date Message Commit Date
50faa84 wip 12 February 2022, 22:55:29 UTC
071ae18 doc: add note about `reduce([hv]cat, x)` in `[hv]cat` (#44049) 12 February 2022, 18:17:56 UTC
7826ac8 Generalize storage of factorizations. (#42594) Introduce additional type parameters to remove hard-coded arrays from structures. 12 February 2022, 17:46:05 UTC
4c101a1 fix asyncmap docs typo (#43571) 12 February 2022, 17:09:55 UTC
e422590 clarify mkpath docstring (#43987) Close #43964 12 February 2022, 17:07:42 UTC
3326cc3 Revert "use specialized code when compiling opaque closure expressions (#43320)" (#44149) This reverts commit 5cb0f14a1beb65fe562ef24d27741e9ec92d1feb due to failing on CI (analyzegc). 12 February 2022, 17:04:24 UTC
e7f2307 Fix docstring for keepat! (#44128) `keepat!` also works with `BitVector` 12 February 2022, 17:02:45 UTC
aad5c1c doc: improve docstring of eachsplit (#43745) 12 February 2022, 14:41:50 UTC
f6aab71 RFC: Add description to load a local module and using it in example codes. (#41398) * Add description to load a local module and using it in example codes. * make example codes to doctests. Co-authored-by: Viral B. Shah <ViralBShah@users.noreply.github.com> 12 February 2022, 13:47:51 UTC
0a91e71 Docs: clarify `JULIA_NUM_THREADS=auto` compat 1.7 (#43842) 12 February 2022, 03:00:46 UTC
b1d152d improve Channel special constructor docstring (#43979) 12 February 2022, 02:57:05 UTC
e376fcc Add a `:dynamic` scheduling option for `Threads.@threads` (#43919) Co-authored-by: Julian Samaroo <jpsamaroo@jpsamaroo.me> Co-authored-by: Takafumi Arakaki <29282+tkf@users.noreply.github.com> Co-authored-by: Valentin Churavy <vchuravy@users.noreply.github.com> 12 February 2022, 02:17:45 UTC
4409a1b remove tuple indexing by non-integer (#43760) Co-authored-by: Fredrik Ekre <ekrefredrik@gmail.com> Co-authored-by: Simeon Schaub <schaub@mit.edu> 11 February 2022, 21:00:58 UTC
a557536 Add a few missing effects taints (#44125) Should fix Downloads test (it decided Downloads' easy_hook didn't do anything and accidentally deleted it). 11 February 2022, 17:51:10 UTC
5cb0f14 use specialized code when compiling opaque closure expressions (#43320) invoke specialization when an OC is created at run time 11 February 2022, 16:47:35 UTC
98e60ff Add `allequal` (#43354) * Add `allequal`. 11 February 2022, 14:41:52 UTC
32a026a Test: document and export `TestLogger` and `LogRecord`. (#44080) 10 February 2022, 19:24:53 UTC
85c83a7 some improvements to GC stat printing (#44098) - line up `elapsed time` - print most numbers unconditionally - print minor/full collections instead of all/full collections - count `free` calls for arrays, since we count `malloc`s for them 10 February 2022, 19:19:03 UTC
0bf6ce3 Fix doctests for CodeUnits indexing. (#44105) 10 February 2022, 15:07:51 UTC
94bbfd4 [Docs] Add missing code block (#44094) * [Docs] Add missing code block * Fix block order 10 February 2022, 14:02:00 UTC
aae68a5 alloc profiler: add compat note to docstring (#44100) 10 February 2022, 08:04:01 UTC
c360a58 Fixed inference on String and Symbol causing invalidations in show.jl (#44091) 10 February 2022, 07:16:20 UTC
fed4544 inference: follow up #43852 (#44101) This commit consists of minor follow up tweaks for #43852: - inlining: use `ConstResult` if available - refactor tests - simplify `CodeInstance` constructor signature - tweak `concrete_eval_const_proven_total_or_error` signature for JET integration 10 February 2022, 05:58:41 UTC
708873a actually add the atributes (#44097) Co-authored-by: oscarddssmith <oscar.smith@juliacomputing.com> 10 February 2022, 05:03:03 UTC
93a5f69 Test.TestLogger: add `respect_maxlog` option (#44085) 09 February 2022, 22:59:56 UTC
f090992 Take purity modeling seriously (#43852) * Implement new effect system * TLDR Before: ``` julia> let b = Expr(:block, (:(y += sin($x)) for x in randn(1000))...) @eval function f_sin_perf() y = 0.0 $b y end end f_sin_perf (generic function with 1 method) julia> @time @code_typed f_sin_perf() 15.707267 seconds (25.95 M allocations: 1.491 GiB, 3.30% gc time) [lots of junk] ``` After: ``` julia> @time @code_typed f_sin_perf() 0.016818 seconds (187.35 k allocations: 7.901 MiB, 99.73% compilation time) CodeInfo( 1 ─ return 27.639138714768546 ) => Float64 ``` so roughly a 1000x improvement in compile time performance for const-prop heavy functions. There are also run time improvements for functions that have patterns like: ``` function some_function_to_big_to_be_inlined_but_pure(x) .... end function foo(x) some_function_to_big_to_be_inlined_but_pure(x) return x end ``` The inliner will now be able to see that some_function_to_big_to_be_inlined_but_pure is effect free, even without inlining it and just delete it, improving runtime performance (if some_function_to_big_to_be_inlined_but_pure is small enough to be inlined, there is a small compile time throughput win, by being able to delete it without inlining, but that's a smaller gain than the compile time gain above). * Motivation / Overview There are two motivations for this work. The first is the above mentioned improvement in compiler performance for const-prop heavy functions. This comes up a fair bit in various Modeling & Simulation codes we have where Julia code is often auto-generated from some combination of parameterized model codes and data. This ends up creating enormous functions with significant need for constant propagation (~50k statements with ~20k constant calls are not uncommon). Our current compiler was designed for people occasionally throwing a `sqrt(2)` or something in a function, not 20k of them, so performance is quite bad. The second motivation is to have finer grained control over our purity modeling. We have `@Base.pure`, but that has somewhat nebulous semantics and is quite a big hammer that is not appropriate in most situations. These may seem like orthogonal concerns at first, but they are not. The compile time issues fundamentally stem from us running constant propagation in inference's abstract interpreter. However, for simple, pure functions, that is entirely unnecessary, because we have a super-fast, JIT compiler version of that function just laying around in general. The issue is that we currently, we generally do not know when it is legal to run the JIT-compiled version of the function and when we need to abstractly interpret it. However, if the compiler were able to figure out an appropriate notion of purity, it could start doing that (which is what it does now for `@Base.pure` functions). This PR adds that kind of notion of purity, converges it along with type information during inference and then makes use of it to speed up evaluation of constant propagation (where it is legal to do so), as well as improving the inliner. * The new purity notions The new purity model consists of four different kinds flags per code instance. For builtins and intrinsics the existing effect free and nothrow models are re-used. There is also a new macro `@Base.assume_effects` available, which can set the purity base case for methods or `:foreigncall`s. Here is the docstring for that macro, which also explains the semantics of the new purity flags: ``` @assume_effects setting... ex @assume_effects(setting..., ex) `@assume_effects` overrides the compiler's effect modeling for the given method. `ex` must be a method definition. WARNING: Improper use of this macro causes undefined behavior (including crashes, incorrect answers, or other hard to track bugs). Use with care an only if absolutely required. In general, each `setting` value makes an assertion about the behavior of the function, without requiring the compiler to prove that this behavior is indeed true. These assertions are made for all world ages. It is thus advisable to limit the use of generic functions that may later be extended to invalidate the assumption (which would cause undefined behavior). The following `settings` are supported. ** `:idempotent` The `:idempotent` setting asserts that for egal inputs: - The manner of termination (return value, exception, non-termination) will always be the same. - If the method returns, the results will always be egal. Note: This in particular implies that the return value of the method must be immutable. Multiple allocations of mutable objects (even with identical contents) are not egal. Note: The idempotency assertion is made world-arge wise. More formally, write fₐ for the evaluation of `f` in world-age `a`, then we require: ∀ a, x, y: x === y → fₐ(x) === fₐ(y) However, for two world ages `a, b` s.t. `a != b`, we may have `fₐ(x) !== fₐ(y)`` Note: A further implication is that idempontent functions may not make their return value dependent on the state of the heap or any other global state that is not constant for a given world age. Note: The idempontency includes all legal rewrites performed by the optimizizer. For example, floating-point fastmath operations are not considered idempotent, because the optimizer may rewrite them causing the output to not be idempotent, even for the same world age (e.g. because one ran in the interpreter, while the other was optimized). ** `:effect_free` The `:effect_free` setting asserts that the method is free of externally semantically visible side effects. The following is an incomplete list of externally semantically visible side effects: - Changing the value of a global variable. - Mutating the heap (e.g. an array or mutable value), except as noted below - Changing the method table (e.g. through calls to eval) - File/Network/etc. I/O - Task switching However, the following are explicitly not semantically visible, even if they may be observable: - Memory allocations (both mutable and immutable) - Elapsed time - Garbage collection - Heap mutations of objects whose lifetime does not exceed the method (i.e. were allocated in the method and do not escape). - The returned value (which is externally visible, but not a side effect) The rule of thumb here is that an externally visible side effect is anything that would affect the execution of the remainder of the program if the function were not executed. Note: The effect free assertion is made both for the method itself and any code that is executed by the method. Keep in mind that the assertion must be valid for all world ages and limit use of this assertion accordingly. ** `:nothrow` The `:nothrow` settings asserts that this method does not terminate abnormally (i.e. will either always return a value or never return). Note: It is permissible for :nothrow annotated methods to make use of exception handling internally as long as the exception is not rethrown out of the method itself. Note: MethodErrors and similar exceptions count as abnormal termination. ** `:terminates_globally` The `:terminates_globally` settings asserts that this method will eventually terminate (either normally or abnormally), i.e. does not infinite loop. Note: The compiler will consider this a strong indication that the method will terminate relatively *quickly* and may (if otherwise legal), call this method at compile time. I.e. it is a bad idea to annotate this setting on a method that *technically*, but not *practically*, terminates. Note: The `terminates_globally` assertion, covers any other methods called by the annotated method. ** `:terminates_locally` The `:terminates_locally` setting is like `:terminates_globally`, except that it only applies to syntactic control flow *within* the annotated method. It is this a much weaker (and thus safer) assertion that allows for the possibility of non-termination if the method calls some other method that does not terminate. Note: `terminates_globally` implies `terminates_locally`. * `:total` The `setting` combines the following other assertions: - `:idempotent` - `:effect_free` - `:nothrow` - `:terminates_globally` and is a convenient shortcut. Note: `@assume_effects :total` is similar to `@Base.pure` with the primary distinction that the idempotency requirement applies world-age wise rather than globally as described above. However, in particular, a method annotated `@Base.pure` is always total. ``` * Changes to data structures - Each CodeInstance gains two sets of four flags corresponding to the notions above (except terminates_locally, which is just a type inference flag). One set of flags tracks IPO-valid information (as determined by inference), the other set of flags tracks optimizer-valid information (as determined after optimization). Otherwise they have identical semantics. - Method and CodeInfo each gain 5 bit flags corresponding 1:1 to the purity notions defined above. No separate distinction is made between IPO valid and optimizer valid flags here. We might in the future want such a distinction, but I'm hoping to get away without it for now, since the IPO-vs-optimizer distinction is a bit subtle and I don't really want to expose that to the user. - `:foreigncall` gains an extra argument (after `cconv`) to describe the effects of the call. * Algorithm Relatively straightforward. - Every call or builtin accumulates its effect information into the current frame. - Finding an effect (throw/global side effect/non-idempotenct, etc.) taints the entire frame. Idempotency is technically a dataflow property, but that is not modeled here and any non-idempotent intrinsic will taint the idempotency flag, even if it does not contribute to the return value. I don't think that's a huge problem in practice, because currently we only use idempotency if effect-free is also set and in effect-free functions you'd generally expect every statement to contribute to the return value. - Any backedge taints the termination effect, as does any recursion - Unknown statements (generic calls, things I haven't gotten around to) taint all effects * Make INV_2PI a tuple Without this, the compiler cannot assume that the range reduction is idempotent to make use of the new fast constprop code path. In the future this could potentially be an ImmutableArray, but since this is relatively small, a tuple is probably fine. * Evalute :total function in the proper world * Finish effects implementation for ccall * Add missing `esc` * Actually make use of terminates_locally override * Mark ^(x::Float64, n::Integer) as locally terminating * Shove effects into calling convention field * Make inbounds taint consistency Inbounds and `--check-bounds=no` basically make the assertion: If this is dynamically reached during exceution then the index will be inbounds. However, effects on a function are a stronger statement. In particular, for *any* input values (not just the dynamically reached ones), the effects need to hold. This is in particular true, because inference can run functions that are dynamically dead, e.g. ``` if unknown_bool_return() # false at runtime, but inference doesn't know x = sin(1.0) end ``` Inference will (and we want it to) run the `sin(1.0)` even though it is not dynamically reached. For the moment, make any use of `--check-bounds=no` or `@inbounds` taint the consistency effect, which is semantically meaningful and prevents inference from running the function. In the future, we may want more precise tracking of inbounds that would let us recover some precision here. * Allow constprop to refine effects * Properly taint unknown call in apply * Add NEWS and doc anchor * Correct effect modeling for arraysize * Address Shuhei's review * Fix regression on inference time benchmark The issue wasn't actually the changes here, they just added additional error paths which bridged inference into the Base printing code, which as usual takes a fairly long time to infer. Add some judicious barriers and nospecialize statements to bring inference time back down. * refine docstrings of `@assume_effects` This commit tries to render the docstring of `@assume_effects` within Documenter.jl-generated HTML: - render bullet points - codify the names of settings - use math syntax - use note admonitions * improve effect analysis on allocation Improves `:nothrow` assertion for mutable allocations. Also adds missing `IR_FLAG_EFFECT_FREE` flagging for non-inlined callees in `handle_single_case!` so that we can do more dead code elimination. * address some reviews * Address Jameson's review feedback * Fix tests - address rebase issues Co-authored-by: Shuhei Kadowaki <aviatesk@gmail.com> 09 February 2022, 22:24:00 UTC
76fa182 alloc profiler: adjust warnings in docs; remove logged warning from implementation (#44077) * docs: update warning in the manual; add to reference * remove missed allocs warning from `fetch` since we think we've gotten all the allocs * remove tests of warning message, since it's no longer there 09 February 2022, 22:23:13 UTC
6b16eba doc: remind users that versioninfo may include secrets (#43839) Add a warning to the docstring of `InteractiveUtils.versioninfo()` reminding users to remove sensitive information before sharing the output text. 09 February 2022, 21:37:22 UTC
a84a058 Merge pull request #43953 from JuliaLang/jn/43636 impose stricter and earlier validation on ccall/cfunction types 09 February 2022, 19:50:41 UTC
dc0aa61 Manual for `JULIA_EDITOR`: note `code.cmd` for vscode on Windows (#44083) 09 February 2022, 16:53:16 UTC
2118a08 Add an option to not emit unionsplit fallback blocks (#43923) When union splitting, we currently emit a fallback block that prints `fatal error in type inference (type bound)`. This has always been an oddity, because there are plenty of other places in the compiler that we rely on the correctness of inference, but it has caught a number of issues over the years and we do still have a few issues (e.g. #43064) that show this error. Nevertheless, the occurrence of issues like this has become less frequent, so it might be time to turn it off soon. At the same time, we have downstream users of the compiler infrastructure that get confused by having extra `throw` calls inserted into functions that (post-#43852) were inferred as `:nothrow`. Here we add an optimization param (defaulted to on) to determine whether or not to insert the unionsplit fallback block. Because of the conservative default, we can decide later what the correct default is (maybe turn it on in `debug` mode?), while letting downstream consumers play with the setting for now to see if any issues crop up. 09 February 2022, 11:19:23 UTC
0bdf24f Move `reinterpret`-based optimization for complex matrix * real vec/mat to lower level. (#44052) 09 February 2022, 09:11:36 UTC
33a71b7 Allow negative `stride(A,2)` in `gemv!` (#42054) 09 February 2022, 09:00:55 UTC
a947fc7 Add a donotdelete builtin (#44036) In #43852 we noticed that the compiler is getting good enough to completely DCE a number of our benchmarks. We need to add some sort of mechanism to prevent the compiler from doing so. This adds just such an intrinsic. The intrinsic itself doesn't do anything, but it is considered effectful by our optimizer, preventing it from being DCE'd. At the LLVM level, it turns into call to an external varargs function. The docs for the new intrinsic are as follows: ``` donotdelete(args...) This function prevents dead-code elimination (DCE) of itself and any arguments passed to it, but is otherwise the lightest barrier possible. In particular, it is not a GC safepoint, does model an observable heap effect, does not expand to any code itself and may be re-ordered with respect to other side effects (though the total number of executions may not change). A useful model for this function is that it hashes all memory `reachable` from args and escapes this information through some observable side-channel that does not otherwise impact program behavior. Of course that's just a model. The function does nothing and returns `nothing`. This is intended for use in benchmarks that want to guarantee that `args` are actually computed. (Otherwise DCE may see that the result of the benchmark is unused and delete the entire benchmark code). **Note**: `donotdelete` does not affect constant foloding. For example, in `donotdelete(1+1)`, no add instruction needs to be executed at runtime and the code is semantically equivalent to `donotdelete(2).` *# Examples function loop() for i = 1:1000 # The complier must guarantee that there are 1000 program points (in the correct # order) at which the value of `i` is in a register, but has otherwise # total control over the program. donotdelete(i) end end ``` 09 February 2022, 06:36:07 UTC
2e2b1d2 Fixed inference on String causing invalidations in binaryplatform (#44084) 09 February 2022, 03:46:26 UTC
7ccc83e Add USDTs for the task runtime (#43453) Co-authored-by: Takafumi Arakaki <aka.tkf@gmail.com> 09 February 2022, 01:36:22 UTC
0639d0d 🤖 Bump the Pkg stdlib from 89517355 to c991ce0a (#44075) Co-authored-by: Dilum Aluthge <dilum@aluthge.com> 09 February 2022, 00:07:44 UTC
d7fd3b7 `===` Optimization: Defer recursion into pointees (#43658) * jl_egal Optimization: Defer recursion into pointees Immutable struct comparisons with `===` can be arbitrarily expensive for deeply recursive but (almost) equal objects. Whenever possible, it's valuable to defer the potentially expensive recursion by first comparing the struct fields for bitwise equality. Before this commit, two structs are compare elementwise, in the order of the struct definition, recursing when pointer fields are encountered. This commit defers the recursion into pointed-to fields until after all other non-pointer fields of the struct are compared. This has two advantages: 1. It defers the expensive part of === comparison as long as possible, in the hopes that we can exit early from dissimilarities discovered elsewhere in the struct instances. 2. It improves cache-locality by scanning the whole struct before jumping into any new places in memory (and reducing comparisons needed on the current cache line after returning from the recursive call). The drawback is that you'll have to scan the pointer fields again, which means potentially more cache misses if the struct is very large. The best way to tell if this is helpful or harmful is benchmarking. Here is the motivating benchmark, which indeed improves by 10x with this commit, compared to master: ```julia julia> using BenchmarkTools julia> struct VN val::Float32 next::Union{VN, Array} # put a mutable thing at the end, to prevent runtime from sharing instances end julia> struct NV next::Union{NV, Array} # put a mutable thing at the end, to prevent runtime from sharing instances val::Float32 end julia> function make_chain_vn(n, sentinal) head = VN(1, sentinal) for i in 2:n head = VN(rand(Int), head) end return head end make_chain_vn (generic function with 1 method) julia> function make_chain_nv(n, sentinal) head = NV(sentinal, 1) for i in 2:n head = NV(head, rand(Int)) end return head end make_chain_nv (generic function with 1 method) julia> vn1, vn2 = make_chain_vn(10000, []), make_chain_vn(10000, []); julia> nv1, nv2 = make_chain_nv(10000, []), make_chain_nv(10000, []); ``` Master: ``` julia> @btime $vn1 === $vn2 7.562 ns (0 allocations: 0 bytes) false julia> @btime $nv1 === $nv2 # slower, since it recurses pointers unnecessarily 76.952 μs (0 allocations: 0 bytes) false ``` After this commit: ``` julia> @btime $vn1 === $vn2 8.597 ns (0 allocations: 0 bytes) false julia> @btime $nv1 === $nv2 # We get to skip the recursion and exit early. :) 10.280 ns (0 allocations: 0 bytes) false ``` However, I think that there are probably other benchmarks where it harms performance, so we'll have to see... For example, here's one: In the exact opposite case as above, if the two objects _are_ (almost) equal, necessitating checking every object, the `NV` comparisons could have exited after all the recursive pointer checks, and never compare the fields, whereas now the fields are checked first, so this gets slower. I'm not exactly sure why the `VN` comparisons get somewhat slower too, but it's maybe because of the second scan mentioned above. ```julia julia> function make_chain_nv(n, sentinal) head = NV(sentinal, 1) for i in 2:n head = NV(head, i) end return head end make_chain_nv (generic function with 1 method) julia> function make_chain_vn(n, sentinal) head = VN(1, sentinal) for i in 2:n head = VN(i, head) end return head end make_chain_vn (generic function with 1 method) julia> vn1, vn2 = make_chain_vn(10000, []), make_chain_vn(10000, []); julia> nv1, nv2 = make_chain_nv(10000, []), make_chain_nv(10000, []); ``` Master: ``` julia> @btime $vn1 === $vn2 95.996 μs (0 allocations: 0 bytes) false julia> @btime $nv1 === $nv2 82.192 μs (0 allocations: 0 bytes) false ``` This commit: ``` julia> @btime $vn1 === $vn2 127.512 μs (0 allocations: 0 bytes) false julia> @btime $nv1 === $nv2 126.837 μs (0 allocations: 0 bytes) ``` * Ignore pointer fields completely in the first pass of === Delay the nullptr checks until all non-ptr fields have been compared, since we have to go back to those anyways to follow the pointers. Co-authored-by:Jeff Bezanson <jeff.bezanson@gmail.com> 08 February 2022, 21:18:35 UTC
60f414e Merge pull request #44081 from oscardssmith/fix-linux32-ci fix #43978 (missing parens) 08 February 2022, 20:38:50 UTC
9c9bb86 Make `to_indices` infered better for CartesianIndex/Indices (#42055) 08 February 2022, 20:22:30 UTC
f6def1a Optimize real matrix * complex matrix by using real matmul (#44074) 08 February 2022, 20:17:24 UTC
7b1e454 support type annotations on global variables (#43671) add type declarations to global bindings replaces #43455 closes #964 Co-authored-by: Miguel Raz Guzmán Macedo <miguelraz@gmail.com> 08 February 2022, 20:08:15 UTC
0f6c9d3 oops 08 February 2022, 19:47:06 UTC
942697f make sure Julia doesn't use x87 math. (#43978) * use fpmath=sse on 32 bit x86 to prevent 80 bit floats causing double rounding Co-authored-by: Oscar Smith <oscardssmith@gmail.com> 08 February 2022, 18:11:00 UTC
d30f403 Detect the number of performance cores in the M1 macs via the new macos12 API (#44072) * Use the new macos12 api to query perf cores 08 February 2022, 16:40:49 UTC
24d96ab Instrument jl_gc_big_alloc() when called from generated code for Allocations Profiler (#44043) * Instrument jl_gc_big_alloc() when called from generated code for Allocations Profiler Add _maybe_record_alloc_to_profile() call into jl_gc_big_alloc() to allow us to instrument it in the Allocations Profiler. Followup to #43868 Finishes fixing #43688 (gets the stacks, though not the types, of big allocs). ----- - Rename jl_gc_big_alloc to jl_gc_big_alloc_inner, make it inlined - add jl_gc_pool_alloc_noinline that calls jl_gc_big_alloc_inner - replace all internal calls to jl_gc_big_alloc to call this instead - add a new jl_gc_pool_alloc that calls jl_gc_big_alloc_inner - This one is instrumented, and is meant to be called by generated code. Co-authored-by: Pete Vilter <7341+vilterp@users.noreply.github.com> 08 February 2022, 14:53:52 UTC
e865d33 Optimize real matrix * complex vector by reintrepreting as real (#44011) 08 February 2022, 11:33:10 UTC
6bad936 Improve performance of `svd` and `eigen` of `Diagonal`s (#43856) 08 February 2022, 09:17:54 UTC
c84896c Fix `Diagonal \ StaticArray` (#44018) 08 February 2022, 09:09:21 UTC
418fbb3 impose stricter and earlier validation on ccall/cfunction types Closes #43636 07 February 2022, 19:59:58 UTC
9f3b3f5 ccall: fix jl_value_ptr definition 07 February 2022, 19:59:58 UTC
546a774 fix #43960, evaluation order of splat inside ref (#44024) 07 February 2022, 18:49:32 UTC
48b7d49 allocation profiler: get stacks for pool allocs via wrapper function (#43868) 07 February 2022, 17:16:48 UTC
70e2432 improve eigvals & co docstrings (#43904) 07 February 2022, 14:58:21 UTC
e0a4b77 Generalize `strides` for `ReinterpretArray` and `ReshapedArray` (#44027) 06 February 2022, 12:55:09 UTC
5181e36 document behavior of keywords in dot calls (#44033) 05 February 2022, 21:53:34 UTC
e623275 Try/finally to stop allocation profiler even on error (#44045) 05 February 2022, 04:09:18 UTC
46032c5 Inferrability: eliminate more Core.Box (#44029) These stem from julia#15276, i.e., https://docs.julialang.org/en/v1/manual/performance-tips/#man-performance-captured They were identified by scanning all compiled MethodInstances with hasbox in the newly-released MethodAnalysis 0.4.5. Core.Box often causes "follow-on" inference problems, but for these cases there were relatively few, which may be why these didn't show up earlier during the Great Invalidation Hunt. Still, there doesn't seem to be any particular reason not to fix them. This doesn't eliminate all Core.Box cases from Base, but only a handful remain. The most common remaining case stems from inner functions calling themselves. It also prevents a couple of Test invalidations Co-authored-by: Jameson Nash <vtjnash@gmail.com> 04 February 2022, 20:16:27 UTC
bd9c510 Remove incomplete manifest package load fallback. Add instantiate suggestion to error (#42822) 04 February 2022, 13:35:15 UTC
b486f0d docs: reference `error` in docstring for `throw`, closes #44028 (#44032) 04 February 2022, 08:56:37 UTC
9c5d7ff 🤖 Bump the Pkg stdlib from 66482586 to 89517355 (#44023) 04 February 2022, 02:29:53 UTC
9c4b75d [libblastrampoline] Bump to v5.0.1 (#44017) This should allow us to use CBLAS symbols in MKL v2022. Verified that this fixes https://github.com/JuliaLinearAlgebra/libblastrampoline/issues/56 03 February 2022, 23:50:01 UTC
a80afe5 Merge pull request #44030 from KronosTheLate/change_order_of_differences_R Change order in "Noteworthy differences from R" 03 February 2022, 15:39:05 UTC
fa6084f Change order of differences from R I came to "Noteworthy differences from R" because I know julia well, but am starting some R now. When I came to the following section: " * In Julia, not all data structures support logical indexing. Furthermore, logical indexing in Julia is supported only with vectors of length equal to the object being indexed. For example: * In R, `c(1, 2, 3, 4)[c(TRUE, FALSE)]` is equivalent to `c(1, 3)`. * In R, `c(1, 2, 3, 4)[c(TRUE, FALSE, TRUE, FALSE)]` is equivalent to `c(1, 3)`. * In Julia, `[1, 2, 3, 4][[true, false]]` throws a [`BoundsError`](@ref). * In Julia, `[1, 2, 3, 4][[true, false, true, false]]` produces `[1, 3]`. " I did not know the `c(1, 2, 3)` syntax, so encountering this point did not help me. In this PR I have moved the line " * Julia constructs vectors using brackets. Julia's `[1, 2, 3]` is the equivalent of R's `c(1, 2, 3)`. " to immediately before the point I mentioned first. IMO this makes the section more readable. 03 February 2022, 10:51:00 UTC
c10dac1 Dates parsing: remove `throw InexactError` from `tryparsenext`, fixes #44003 (#44004) 03 February 2022, 09:35:39 UTC
1edafa0 Fix detection of LIBGFORTRAN_VERSION (#44026) 03 February 2022, 04:49:32 UTC
60a811c fix #44013: aliasing in property destructuring (#44020) 03 February 2022, 04:21:15 UTC
e6fa3ec Profile: multithread getdict! (redo of #43805) (#43816) 03 February 2022, 01:52:20 UTC
cf8338a fix typo in NEWS.md (#44015) I was reading the NEWS.md on `master` and found this typo. 02 February 2022, 15:13:37 UTC
e3b681c inference: always use const-prop'ed result (#44001) Previously, for `invoke`/opaque closure callsite, we use constant-prop'ed method body only when the inferred return type gets strictly improved by const-prop. But since the inliner is now able to inline the method body from `InferenceResult`, I believe it is always better to use method body shaped up by const-prop' no matter if the return type is improved (as we already do for ordinal call sites). > use constant prop' result even when the return type doesn't get refined ```julia const Gx = Ref{Any}() Base.@constprop :aggressive function conditional_escape!(cnd, x) if cnd Gx[] = x end return nothing end @test fully_eliminated((String,)) do x Base.@invoke conditional_escape!(false::Any, x::Any) end ``` 02 February 2022, 01:39:19 UTC
b405562 fix macro expansion of `(::typeof(x))() = ...` (#43993) 01 February 2022, 21:21:35 UTC
0cbacf2 docs: fix heading level of Manual > Modules > init & precomp (#43996) 01 February 2022, 15:23:20 UTC
a0093d2 adce: Slight tweak to phinode elimination (#43948) Just a slight tweak to #43922 to also allow the implicit narrowing from other phi users. It's not a particularly common occurrence, but can happen particularly when running the adce pass twice, (because that narrows the types of some phis by dropping edges). 01 February 2022, 00:11:22 UTC
a7beb93 Update mailmap (#43991) * Update mailmap 31 January 2022, 00:10:54 UTC
76a3156 Add private functions for getting the locations of the global and local startup files (#43807) Co-authored-by: Jameson Nash <vtjnash@gmail.com> 30 January 2022, 20:39:47 UTC
4abf26e Add relocatable root compression (#43881) Currently we can't cache "external" CodeInstances, i.e., those generated by compiling other modules' methods with externally-defined types. For example, consider `push!([], MyPkg.MyType())`: Base owns the method `push!(::Vector{Any}, ::Any)` but doesn't know about `MyType`. While there are several obstacles to caching exteral CodeInstances, the primary one is that in compressed IR, method roots are referenced from a list by index, and the index is defined by order of insertion. This order might change depending on package-loading sequence or other history-dependent factors. If the order isn't consistent, our current serialization techniques would result in corrupted code upon decompression, and that would generally trigger catastrophic failure. To avoid this problem, we simply avoid caching such CodeInstances. This enables roots to be referenced with respect to a `(key, index)` pair, where `key` identifies the module and `index` numbers just those roots with the same `key`. Roots with `key = 0` are considered to be of unknown origin, and CodeInstances referencing such roots will remain unserializable unless all such roots were added at the time of system image creation. To track this additional data, this adds two fields to core types: - to methods, it adds a `nroots_sysimg` field to count the number of roots defined at the time of writing the system image (such occur first in the list of `roots`) - to CodeInstances, it adds a flag `relocatability` having value 1 if every root is "safe," meaning it was either added at sysimg creation or is tagged with a non-zero `key`. Even a single unsafe root will cause this to have value 0. 30 January 2022, 20:04:58 UTC
11bac43 Update PCRE download location in SPDX file (#43985) * Update PCRE download location in SPDX file * Make it the actual git URL 30 January 2022, 14:16:13 UTC
e8f469a Merge pull request #43973 from JuliaLang/avi/infresult AbstractInterpreter: pass `caller::InferenceResult` to the optimizer 29 January 2022, 19:25:49 UTC
c3235cd Fix typo in triangular ldiv! (#43962) 29 January 2022, 02:45:02 UTC
9a3cfd9 cross-ref bunch-kaufman to ldlt docs (#43968) 28 January 2022, 22:43:40 UTC
e88bb22 fix incorrect quotes in SVD docs (#43969) 28 January 2022, 20:24:41 UTC
1aac07e codegen: encapsulate complex global types in jl_codectx_t (#43789) Move global complex types to context type cache in LLVM codegen. 28 January 2022, 20:10:51 UTC
4789979 use iszero for generic numbers (#43970) 28 January 2022, 20:06:30 UTC
eb43734 optimizer: delay `atype` computation in inlining (#43974) Makes `Signature` data structure slightly more efficient. 28 January 2022, 17:27:10 UTC
cd458a9 AbstractInterpreter: pass `caller::InferenceResult` to the optimizer `caller::InferenceResult` will be used as a key for caching interprocedural information analyzed at optimization phase (as we will do with the escape analysis). 28 January 2022, 15:26:51 UTC
ad2b8fc Fix the download URLs for the libunwind source code (#43958) 28 January 2022, 12:57:03 UTC
2ba3a22 doc: add note about Timer needing yield, closes #43954 (#43956) 28 January 2022, 09:17:39 UTC
3dbd31b Fix typo in 3-arg "rdiv!" (#43951) 28 January 2022, 08:31:46 UTC
9400c91 a bit simplify inlining analysis code (#43959) 28 January 2022, 04:22:26 UTC
a25f128 Don't `invokelatest` in `@threads` (#43957) 28 January 2022, 01:28:43 UTC
78b7d76 [REPLCompletions] follow up #43865, keep input tuple type in `MethodCompletion` (#43946) `MethodCompletion.tt` is being used by an external consumer like Juno or VSCode's runtime completion features to get additional information like return type, etc. 28 January 2022, 00:44:34 UTC
ef35804 Update libunwind to v1.5.0 (#42782) This updates the nongnu libunwind, not LLVM libunwind, so this only affects Linux and FreeBSD. 27 January 2022, 16:29:32 UTC
8f13fea relax `withenv` signature from Function to Any (#43584) 27 January 2022, 16:08:48 UTC
2983327 Teach cfg_simplify to drop GotoIfNots that go to the same destination (#43906) An example is in the test. The primary reason to do this is not for the GotoIfNot itself, but rather to drop the use of the GotoIfNot condition, which may be rooting a number of other statements. 27 January 2022, 12:13:27 UTC
5b893dd deleteat!: Handle unassigned array elements (#43941) This function was erroring on unassigned array elements. This is the minimal fix to resolve that issue. I tried something more generic, but it turns out all of our Array code basically assumes that all elements are assigned. Even basic things like `==`, `hash`, etc. error for arrays with unassigned elements. I think we may want to strongly consider removing the ability to unassign elements in arrays in favor of Union{Nothing, T} arrays, which our existing code handles fine and should be able to be made to have equivalent performance. 27 January 2022, 12:10:44 UTC
4650cff Forward negation of a Tridiagonal matrix to diagonals (#43942) 27 January 2022, 09:50:46 UTC
9868411 optimizer: performance tuning (#43928) 27 January 2022, 04:11:41 UTC
05e0cb9 Merge pull request #43418 from JuliaLang/improve-threads Improve thread scheduler memory ordering 27 January 2022, 01:42:26 UTC
fcd34ae Refine `isunordered` docstring (#43828) 26 January 2022, 20:45:03 UTC
d261458 Remove fast_unwind_on_malloc and malloc_context_size restrictions in ASAN CI (#42515) No need to set ASAN_OPTIONS any more either: these options are already set in `loader_exe.c` with `__asan_default_options`. 26 January 2022, 20:43:33 UTC
back to top