| 2021-10-26 Martin Sebor <msebor@redhat.com> |
| |
| PR tree-optimization/102238 |
| PR tree-optimization/102919 |
| * gimple-ssa-sprintf.c (get_string_length): Add an argument. |
| (array_elt_at_offset): Move to pointer-query. |
| (set_aggregate_size_and_offset): New function. |
| (field_at_offset): Move to pointer-query. |
| (get_origin_and_offset): Rename... |
| (get_origin_and_offset_r): this. Add an argument. Make aggregate |
| handling more robust. |
| (get_origin_and_offset): New. |
| (alias_offset): Add an argument. |
| (format_string): Use subobject size determined by get_origin_and_offset. |
| * pointer-query.cc (field_at_offset): Move from gimple-ssa-sprintf.c. |
| Improve/correct handling of aggregates. |
| (array_elt_at_offset): Same. |
| * pointer-query.h (field_at_offset): Declare. |
| (array_elt_at_offset): Declare. |
| |
| 2021-10-26 Martin Sebor <msebor@redhat.com> |
| |
| * builtins.c (check_strncat_sizes): Pass access_data ctor additional |
| arguments. |
| (expand_builtin_memcmp): Move code to gimple-ssa-warn-access.cc. |
| (expand_builtin_fork_or_exec): Same. |
| * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Pass |
| compute_objsize additional arguments. |
| (inbounds_memaccess_p): Same. |
| (array_bounds_checker::check_array_bounds): Add an assert. Stash |
| statement in a member. |
| (check_array_bounds_dom_walker::before_dom_children): Same. |
| * gimple-array-bounds.h (array_bounds_checker::m_stmt): New member. |
| * gimple-ssa-sprintf.c (get_destination_size): Add an argument. |
| (handle_printf_call): Pass a new argument. |
| * gimple-ssa-warn-access.cc (get_size_range): Add an argument. |
| (check_access): Add an argument and pass it along to callees. |
| (check_read_access): Make a member function. |
| (pass_waccess::check_strcat): Pass access_data ctor additional |
| arguments. |
| (pass_waccess::check_strncat): Same. |
| (pass_waccess::check_stxcpy): Same. |
| (pass_waccess::check_stxncpy): Same. |
| (pass_waccess::check_strncmp): Same. |
| (pass_waccess::check_read_access): Same. |
| (pass_waccess::check_builtin): Same. |
| (pass_waccess::maybe_check_access_sizes): Same. |
| (pass_waccess::maybe_check_dealloc_call): Same. |
| * gimple-ssa-warn-access.h (check_read_access): Declare a new |
| member function. |
| * pointer-query.cc (compute_objsize_r): Add an argument. |
| (gimple_call_return_array): Same. |
| (gimple_call_alloc_size): Same. |
| (access_ref::access_ref): Same. |
| (access_ref::get_ref): Same. |
| (pointer_query::get_ref): Same. |
| (handle_min_max_size): Pass an arguments to callees. |
| (handle_array_ref): Add an argument. |
| (handle_mem_ref): Same. |
| (compute_objsize): Same. |
| * pointer-query.h (struct access_ref): Adjust signatures. |
| (struct access_data): Same. |
| (gimple_call_alloc_size): Add an argument. |
| (gimple_parm_array_size): Same. |
| (compute_objsize): Same. |
| * tree-ssa-strlen.c (strlen_pass::adjust_last_stmt): Pass an additional |
| argument to compute_objsize. |
| (strlen_pass::maybe_warn_overflow): Same. |
| (maybe_diag_stxncpy_trunc): Same. |
| |
| 2021-10-26 Martin Sebor <msebor@redhat.com> |
| |
| PR middle-end/102453 |
| * gimple-ssa-warn-access.cc (pass_waccess::check_atomic_builtin): New. |
| (pass_waccess::check_atomic_builtin): Call it. |
| |
| 2021-10-26 Vladimir N. Makarov <vmakarov@redhat.com> |
| |
| PR rtl-optimization/102842 |
| * lra-constraints.c (match_reload): Ignore out in checking values |
| of outs. |
| (curr_insn_transform): Collect outputs before doing reloads of operands. |
| |
| 2021-10-26 Paul A. Clarke <pc@us.ibm.com> |
| |
| PR target/102719 |
| * config/rs6000/x86intrin.h: Move some included headers to new |
| headers. Include new immintrin.h instead of those headers. |
| * config/rs6000/immintrin.h: New. |
| * config/rs6000/x86gprintrin.h: New. |
| * config.gcc (powerpc*-*-*): Add new headers to extra_headers. |
| |
| 2021-10-26 Richard Biener <rguenther@suse.de> |
| |
| * tree-vectorizer.h (vect_create_addr_base_for_vector_ref): |
| Remove byte_offset parameter. |
| (vect_create_data_ref_ptr): Likewise. |
| * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref): |
| Likewise. |
| (vect_create_data_ref_ptr): Likewise. |
| * tree-vect-stmts.c (vectorizable_store): Adjust. |
| (vectorizable_load): Likewise. |
| |
| 2021-10-26 Richard Biener <rguenther@suse.de> |
| |
| PR tree-optimization/96109 |
| * tree-vectorizer.h (dr_misalignment): Add optional offset |
| parameter. |
| * tree-vect-data-refs.c (dr_misalignment): Likewise. Remove |
| offset applied for negative stride accesses. |
| (vect_enhance_data_refs_alignment): Compute negative stride |
| access offset and pass it to dr_misalignment. |
| * tree-vect-stmts.c (get_negative_load_store_type): Pass |
| negative offset to dr_misalignment. |
| (get_group_load_store_type): Likewise. |
| (get_load_store_type): Likewise. |
| (vectorizable_store): Remove asserts about alignment. |
| (vectorizable_load): Likewise. |
| |
| 2021-10-26 Kewen Lin <linkw@linux.ibm.com> |
| |
| PR tree-optimization/102897 |
| * tree-ssa-forwprop.c (simplify_permutation): Remove a wrong assertion. |
| |
| 2021-10-26 Richard Biener <rguenther@suse.de> |
| |
| * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref): |
| Take offset in bytes. |
| (vect_create_data_ref_ptr): Likewise. |
| * tree-vect-loop-manip.c (get_misalign_in_elems): Multiply |
| offset by element size. |
| (vect_create_cond_for_align_checks): Likewise. |
| * tree-vect-stmts.c (get_negative_load_store_type): Likewise. |
| (vectorizable_load): Remove duplicate leftover from merge |
| conflict. |
| |
| 2021-10-26 Roger Sayle <roger@nextmovesoftware.com> |
| |
| * config/i386/i386-expand.c (ix86_expand_v1ti_shift): New helper |
| function to expand V1TI mode logical shifts by integer constants. |
| (ix86_expand_v1ti_rotate): New helper function to expand V1TI |
| mode rotations by integer constants. |
| * config/i386/i386-protos.h (ix86_expand_v1ti_shift, |
| ix86_expand_v1ti_rotate): Prototype new functions here. |
| * config/i386/sse.md (ashlv1ti3, lshrv1ti3, rotlv1ti3, rotrv1ti3): |
| New TARGET_SSE2 expanders to implement V1TI shifts and rotations. |
| |
| 2021-10-26 Aldy Hernandez <aldyh@redhat.com> |
| |
| * tree-ssa-threadbackward.c (back_threader::maybe_register_path): |
| Avoid threading circular paths. |
| |
| 2021-10-26 Aldy Hernandez <aldyh@redhat.com> |
| |
| * tree-ssa-threadbackward.c (back_threader::resolve_phi): |
| Attempt to resolve all incoming paths to a PHI. |
| (back_threader::resolve_def): Always return true for PHIs. |
| |
| 2021-10-26 Aldy Hernandez <aldyh@redhat.com> |
| |
| * tree-ssa-threadbackward.c (back_threader::find_paths_to_names): |
| Always try to resolve path without looking back. |
| * tree-ssa-threadupdate.c (dump_jump_thread): Indidicate whether |
| edge is a back edge. |
| |
| 2021-10-26 Kewen Lin <linkw@linux.ibm.com> |
| |
| PR tree-optimization/102789 |
| * tree-vect-loop-manip.c (vect_update_inits_of_drs): Do not |
| update inits of simd_lane_access. |
| |
| 2021-10-25 Andrew MacLeod <amacleod@redhat.com> |
| |
| * tree-vrp.c (vrp_simplify_cond_using_ranges): Add return type and |
| move to vr-values.c. |
| (simplify_casted_conds): Move to vrp_folder class. |
| (execute_vrp): Call via vrp_folder now. |
| * vr-values.c (simplify_cond_using_ranges_1): Call simplify_casted_cond. |
| (simplify_using_ranges::simplify_casted_cond): Relocate from tree-vrp.c. |
| * vr-values.h (simplify_casted_cond): Add prototype. |
| |
| 2021-10-25 Andrew MacLeod <amacleod@redhat.com> |
| |
| * tree-vrp.c (rvrp_folder::fold_stmt): If simplification fails, try |
| to fold anyway. |
| |
| 2021-10-25 Paul A. Clarke <pc@us.ibm.com> |
| |
| * config/rs6000/smmintrin.h (_mm_testz_si128): Add "extern" to |
| function signature. |
| (_mm_testc_si128): Likewise. |
| (_mm_testnzc_si128): Likewise. |
| (_mm_blend_ps): Likewise. |
| (_mm_blendv_ps): Likewise. |
| (_mm_blend_pd): Likewise. |
| (_mm_blendv_pd): Likewise. |
| (_mm_ceil_pd): Likewise. |
| (_mm_ceil_sd): Likewise. |
| (_mm_ceil_ps): Likewise. |
| (_mm_ceil_ss): Likewise. |
| (_mm_floor_pd): Likewise. |
| (_mm_floor_sd): Likewise. |
| (_mm_floor_ps): Likewise. |
| (_mm_floor_ss): Likewise. |
| (_mm_minpos_epu16): Likewise. |
| (_mm_mul_epi32): Likewise. |
| (_mm_cvtepi8_epi16): Likewise. |
| (_mm_packus_epi32): Likewise. |
| (_mm_cmpgt_epi64): Likewise. |
| |
| 2021-10-25 Roger Sayle <roger@nextmovesoftware.com> |
| |
| * simplify-rtx.c (simplify_binary_operation_1) [SS_ASHIFT]: Simplify |
| shifts of the mode's smin_value and smax_value when the bit count |
| operand doesn't have side-effects. |
| [US_ASHIFT]: Likewise, simplify shifts of the mode's umax_value |
| when the bit count operand doesn't have side-effects. |
| (simplify_const_binary_operation) [SS_ASHIFT, US_ASHIFT]: Perform |
| compile-time evaluation of saturating left shifts with constant |
| arguments. |
| |
| 2021-10-25 Andrew MacLeod <amacleod@redhat.com> |
| |
| * gimple-range.cc (gimple_ranger::export_global_ranges): Remove check |
| for TDF_DETAILS. |
| |
| 2021-10-25 Andrew MacLeod <amacleod@redhat.com> |
| |
| * flag-types.h (enum ranger_debug): Adjust values. |
| * params.opt (ranger_debug): Ditto. |
| |
| 2021-10-25 Martin Jambor <mjambor@suse.cz> |
| |
| PR tree-optimization/102886 |
| * tree-sra.c (totally_scalarize_subtree): Fix the out of |
| access-condition. |
| |
| 2021-10-25 Andrew Pinski <apinski@marvell.com> |
| |
| * tree-ssa-dce.c (simple_dce_from_worklist): |
| Check stmt_unremovable_because_of_non_call_eh_p also |
| before removing the statement. |
| |
| 2021-10-25 Richard Biener <rguenther@suse.de> |
| |
| PR tree-optimization/102905 |
| * tree-vect-data-refs.c (vect_enhance_data_refs_alignment): |
| Use vect_supportable_dr_alignment again to determine whether |
| an access is supported when not aligned. |
| |
| 2021-10-25 Kito Cheng <kito.cheng@sifive.com> |
| |
| * config/riscv/riscv.c (riscv_rtx_costs): Handle cost model |
| for zbs extension. |
| |
| 2021-10-25 Jim Wilson <jimw@sifive.com> |
| Kito Cheng <kito.cheng@sifive.com> |
| Jia-Wei Chen <jiawei@iscas.ac.cn> |
| Shi-Hua Liao <shihua@iscas.ac.cn> |
| |
| * config/riscv/bitmanip.md (shiftm1): New. |
| (*bset<mode>): Ditto. |
| (*bset<mode>_mask): Ditto. |
| (*bset<mode>_1): Ditto. |
| (*bset<mode>_1_mask): Ditto. |
| (*bseti<mode>): Ditto. |
| (*bclr<mode>): Ditto. |
| (*bclri<mode>): Ditto. |
| (*binv<mode>): Ditto. |
| (*binvi<mode>): Ditto. |
| (*bext<mode>): Ditto. |
| (*bexti): Ditto. |
| * config/riscv/predicates.md (splittable_const_int_operand): |
| Handle bseti. |
| (single_bit_mask_operand): New. |
| (not_single_bit_mask_operand): Ditto. |
| (const31_operand): Ditto. |
| (const63_operand): Ditto. |
| * config/riscv/riscv.c (riscv_build_integer_1): Handle bseti. |
| (riscv_output_move): Ditto. |
| (riscv_print_operand): Handle new operand type: T and S. |
| * config/riscv/riscv.h (SINGLE_BIT_MASK_OPERAND): New. |
| |
| 2021-10-25 Jim Wilson <jimw@sifive.com> |
| |
| * config/riscv/riscv.c (riscv_build_integer_1): Build integer |
| with rotate. |
| |
| 2021-10-25 Kito Cheng <kito.cheng@sifive.com> |
| |
| * config/riscv/riscv.c (riscv_extend_cost): Handle cost model |
| for zbb extension. |
| (riscv_rtx_costs): Ditto. |
| |
| 2021-10-25 Jim Wilson <jimw@sifive.com> |
| Kito Cheng <kito.cheng@sifive.com> |
| Jia-Wei Chen <jiawei@iscas.ac.cn> |
| |
| * config/riscv/bitmanip.md (bitmanip_bitwise): New. |
| (bitmanip_minmax): New. |
| (clz_ctz_pcnt): New. |
| (bitmanip_optab): New. |
| (bitmanip_insn): New. |
| (*<optab>_not<mode>): New. |
| (*xor_not<mode>): New. |
| (<bitmanip_optab>si2): New. |
| (*<bitmanip_optab>disi2): New. |
| (<bitmanip_optab>di2): New. |
| (*zero_extendhi<GPR:mode>2_bitmanip): New. |
| (*extend<SHORT:mode><SUPERQI:mode>2_zbb): New. |
| (*zero_extendhi<GPR:mode>2_zbb): New. |
| (rotrsi3): New. |
| (rotrdi3): New. |
| (rotrsi3_sext): New. |
| (rotlsi3): New. |
| (rotldi3): New. |
| (rotlsi3_sext): New. |
| (bswap<mode>2): New. |
| (<bitmanip_optab><mode>3): New. |
| * config/riscv/riscv.md (type): Add rotate. |
| (zero_extendhi<GPR:mode>2): Change to define_expand pattern. |
| (*zero_extendhi<GPR:mode>2): New. |
| (extend<SHORT:mode><SUPERQI:mode>2): Change to define_expand pattern. |
| (*extend<SHORT:mode><SUPERQI:mode>2): New. |
| |
| 2021-10-25 Kito Cheng <kito.cheng@sifive.com> |
| |
| * config/riscv/riscv.c (riscv_extend_cost): Handle cost model |
| for zba extension. |
| (riscv_rtx_costs): Ditto. |
| |
| 2021-10-25 Jim Wilson <jimw@sifive.com> |
| Kito Cheng <kito.cheng@sifive.com> |
| Jia-Wei Chen <jiawei@iscas.ac.cn> |
| |
| * config/riscv/bitmanip.md (*zero_extendsidi2_bitmanip): New. |
| (*shNadd): Ditto. |
| (*shNadduw): Ditto. |
| (*add.uw): Ditto. |
| (*slliuw): Ditto. |
| (riscv_rtx_costs): Ditto. |
| * config/riscv/riscv.md: Include bitmanip.md |
| (type): Add bitmanip bype. |
| (zero_extendsidi2): Change to define_expand pattern. |
| (*zero_extendsidi2_internal): New. |
| (zero_extendsidi2_shifted): Disable for ZBA. |
| |
| 2021-10-25 Kito Cheng <kito.cheng@sifive.com> |
| |
| * common/config/riscv/riscv-common.c (riscv_ext_version_table): |
| Add zba, zbb, zbc and zbs. |
| (riscv_ext_flag_table): Ditto. |
| * config/riscv/riscv-opts.h (MASK_ZBA): New. |
| (MASK_ZBB): Ditto. |
| (MASK_ZBC): Ditto. |
| (MASK_ZBS): Ditto. |
| (TARGET_ZBA): Ditto. |
| (TARGET_ZBB): Ditto. |
| (TARGET_ZBC): Ditto. |
| (TARGET_ZBS): Ditto. |
| * config/riscv/riscv.opt (riscv_zb_subext): New. |
| |
| 2021-10-25 liuhongt <hongtao.liu@intel.com> |
| |
| PR target/102464 |
| * match.pd: Simplify (_Float16) sqrtf((float) a) to .SQRT(a) |
| when direct_internal_fn_supported_p, similar for sqrt/sqrtl. |
| |
| 2021-10-25 Richard Biener <rguenther@suse.de> |
| |
| PR tree-optimization/102920 |
| * tree-ssa-sccvn.h (expressions_equal_p): Add argument |
| controlling VN_TOP matching behavior. |
| * tree-ssa-sccvn.c (expressions_equal_p): Likewise. |
| (vn_phi_eq): Do not optimistically match VN_TOP. |
| |
| 2021-10-25 konglin1 <lingling.kong@intel.com> |
| |
| * config/i386/sse.md (fma_<mode>_fadd_fmul): Add new |
| define_insn_and_split. |
| (fma_<mode>_fadd_fcmul):Likewise |
| (fma_<complexopname>_<mode>_fma_zero):Likewise |
| |
| 2021-10-24 John David Anglin <danglin@gcc.gnu.org> |
| |
| * config/pa/pa-d.c (pa_d_handle_target_float_abi): Don't check |
| TARGET_DISABLE_FPREGS. |
| * config/pa/pa.c (fix_range): Use MASK_SOFT_FLOAT instead of |
| MASK_DISABLE_FPREGS. |
| (hppa_rtx_costs): Don't check TARGET_DISABLE_FPREGS. Adjust |
| cost of hardware integer multiplication. |
| (pa_conditional_register_usage): Don't check TARGET_DISABLE_FPREGS. |
| * config/pa/pa.h (INT14_OK_STRICT): Likewise. |
| * config/pa/pa.md: Don't check TARGET_DISABLE_FPREGS. Check |
| TARGET_SOFT_FLOAT in patterns that use xmpyu instruction. |
| * config/pa/pa.opt (mdisable-fpregs): Change target mask to |
| SOFT_FLOAT. Revise comment. |
| (msoft-float): New option. |
| |
| 2021-10-24 John David Anglin <danglin@gcc.gnu.org> |
| |
| * config/pa/pa.md: Don't use 'G' constraint in integer move patterns. |
| |
| 2021-10-24 Gerald Pfeifer <gerald@pfeifer.com> |
| |
| * doc/install.texi (Specific): Remove obsolete details |
| around GNU/Linux on Itanium. |
| (Specific): Remove reference to Windows for Itanium. |
| |
| 2021-10-23 Bernhard Reutner-Fischer <aldot@gcc.gnu.org> |
| |
| * config/i386/x86-tune-sched-bd.c (dispatch_group): Commentary |
| typo fix. |
| |
| 2021-10-23 Jan Hubicka <hubicka@ucw.cz> |
| |
| * tree-ssa-structalias.c (compute_points_to_sets): Cleanup. |
| |
| 2021-10-23 Roger Sayle <roger@nextmovesoftware.com> |
| |
| * config/i386/sse.md (<any_logic>v1ti3): New define_insn to |
| implement V1TImode AND, IOR and XOR on TARGET_SSE2 (and above). |
| (one_cmplv1ti2): New define expand. |
| |
| 2021-10-22 Eric Gallager <egallager@gcc.gnu.org> |
| |
| PR other/102663 |
| * Makefile.in: Handle dvidir and install-dvi target. |
| * configure: Regenerate. |
| * configure.ac: Add install-dvi to target_list. |
| |
| 2021-10-22 Gerald Pfeifer <gerald@pfeifer.com> |
| |
| * doc/install.texi (Binaries): Convert mingw-w64.org to https. |
| (Specific): Ditto. |
| |
| 2021-10-22 Richard Biener <rguenther@suse.de> |
| |
| PR tree-optimization/102893 |
| * tree-ssa-dce.c (find_obviously_necessary_stmts): Fix the |
| test for an exit edge. |
| |
| 2021-10-22 Aldy Hernandez <aldyh@redhat.com> |
| Andrew MacLeod <amacleod@redhat.com> |
| |
| * gimple-range-path.cc (path_range_query::compute_phi_relations): |
| Kill any global relations we may know before registering a new |
| one. |
| * value-relation.cc (path_oracle::killing_def): New. |
| * value-relation.h (path_oracle::killing_def): New. |
| |
| 2021-10-22 Richard Biener <rguenther@suse.de> |
| |
| PR bootstrap/102681 |
| * tree-ssa-sccvn.c (vn_phi_insert): For undefined SSA args |
| record VN_TOP. |
| (vn_phi_lookup): Likewise. |
| |
| 2021-10-21 H.J. Lu <hjl.tools@gmail.com> |
| |
| PR target/98667 |
| * doc/invoke.texi: Document -fcf-protection requires i686 or |
| new. |
| |
| 2021-10-21 Eric Botcazou <ebotcazou@adacore.com> |
| |
| PR middle-end/102764 |
| * cfgexpand.c (expand_gimple_basic_block): Robustify latest change. |
| |
| 2021-10-21 Jonathan Wright <jonathan.wright@arm.com> |
| |
| * config/aarch64/arm_neon.h (__STRUCTN): Delete function |
| macro and all invocations. |
| |
| 2021-10-21 Andrew MacLeod <amacleod@redhat.com> |
| |
| * doc/invoke.texi (ranger-debug): Document. |
| * flag-types.h (enum ranger_debug): New. |
| (enum evrp_mode): Remove debug values. |
| * gimple-range-cache.cc (DEBUG_RANGE_CACHE): Use new debug flag. |
| * gimple-range-gori.cc (gori_compute::gori_compute): Ditto. |
| * gimple-range.cc (gimple_ranger::gimple_ranger): Ditto. |
| * gimple-ssa-evrp.c (hybrid_folder::choose_value): Ditto. |
| (execute_early_vrp): Use evrp-mode directly. |
| * params.opt (enum evrp_mode): Remove debug values. |
| (ranger-debug): New. |
| (ranger-logical-depth): Relocate to be in alphabetical order. |
| |
| 2021-10-21 Andrew MacLeod <amacleod@redhat.com> |
| |
| * doc/invoke.texi: (vrp1-mode, vrp2-mode): Document. |
| * flag-types.h: (enum vrp_mode): New. |
| * params.opt: (vrp1-mode, vrp2-mode): New. |
| * tree-vrp.c (vrp_pass_num): New. |
| (pass_vrp::pass_vrp): Set pass number. |
| (pass_vrp::execute): Choose which VRP mode to execute. |
| |
| 2021-10-21 Andrew MacLeod <amacleod@redhat.com> |
| |
| * gimple-ssa-evrp.c (class rvrp_folder): Move to tree-vrp.c. |
| (execute_early_vrp): For ranger only mode, invoke ranger_vrp. |
| * tree-vrp.c (class rvrp_folder): Relocate here. |
| (execute_ranger_vrp): New. |
| * tree-vrp.h (execute_ranger_vrp): Export. |
| |
| 2021-10-21 Martin Liska <mliska@suse.cz> |
| |
| PR debug/102585 |
| PR bootstrap/102766 |
| * opts.c (finish_options): Process flag_var_tracking* options |
| here as they can be adjusted by optimize attribute. |
| Process also flag_syntax_only and flag_gtoggle. |
| * toplev.c (process_options): Remove it here. |
| * common.opt: Make debug_nonbind_markers_p as PerFunction |
| attribute as it depends on optimization level. |
| |
| 2021-10-21 Martin Jambor <mjambor@suse.cz> |
| |
| PR tree-optimization/102505 |
| * tree-sra.c (totally_scalarize_subtree): Check that the |
| encountered field fits within the acces we would like to put it |
| in. |
| |
| 2021-10-21 Aldy Hernandez <aldyh@redhat.com> |
| |
| * tree-ssa-threadbackward.c |
| (back_threader::maybe_register_path): Remove circular paths check. |
| |
| 2021-10-21 Richard Biener <rguenther@suse.de> |
| |
| * toplev.c (process_options): Move the initial debug_hooks |
| setting ... |
| (toplev::main): ... before the call of the post_options |
| langhook. |
| |
| 2021-10-21 Richard Biener <rguenther@suse.de> |
| |
| PR tree-optimization/102847 |
| * tree-vect-stmts.c (vect_model_load_cost): Add the scalar |
| load cost in the prologue for VMAT_INVARIANT. |
| |
| 2021-10-21 Richard Biener <rguenther@suse.de> |
| |
| PR tree-optimization/102847 |
| * tree-vect-stmts.c (vect_model_load_cost): Explicitely |
| handle VMAT_INVARIANT as a splat in the prologue. |
| |
| 2021-10-21 Hongyu Wang <hongyu.wang@intel.com> |
| |
| PR target/102812 |
| * config/i386/i386.c (ix86_get_ssemov): Adjust HFmode vector |
| move to use the same logic as HImode. |
| |
| 2021-10-21 Richard Biener <rguenther@suse.de> |
| |
| * tree-vect-slp.c (vect_build_slp_tree_1): Remove |
| superfluous gimple_call_nothrow_p check. |
| |
| 2021-10-21 Andrew Pinski <apinski@marvell.com> |
| |
| * tree-cfg.c (maybe_remove_writeonly_store): Add dce_ssa_names argument. |
| Mark the ssa-name of the rhs as one to be removed. |
| (execute_fixup_cfg): Update call to maybe_remove_writeonly_store. |
| Call simple_dce_from_worklist at the end to a simple dce. |
| |
| 2021-10-21 Andrew Pinski <apinski@marvell.com> |
| |
| * tree-cfg.c (maybe_remove_writeonly_store): New function |
| factored out from ... |
| (execute_fixup_cfg): Here. Call maybe_remove_writeonly_store. |
| |
| 2021-10-21 Andrew Pinski <apinski@marvell.com> |
| |
| * tree-cfg.c (execute_fixup_cfg): Remove comment |
| about standalone pass. |
| |
| 2021-10-21 Andrew Pinski <apinski@marvell.com> |
| |
| * tree-cfg.c (execute_fixup_cfg): Output when the statement |
| is removed when it is a write only var. |
| |
| 2021-10-21 Aldy Hernandez <aldyh@redhat.com> |
| |
| * tree-ssa-threadbackward.c (back_threader::maybe_register_path): |
| Avoid threading circular paths. |
| |
| 2021-10-20 Alex Coplan <alex.coplan@arm.com> |
| |
| * calls.c (initialize_argument_information): Remove some dead |
| code, remove handling for function_arg returning const_int. |
| * doc/tm.texi: Delete documentation for unused target hooks. |
| * doc/tm.texi.in: Likewise. |
| * target.def (load_bounds_for_arg): Delete. |
| (store_bounds_for_arg): Delete. |
| (load_returned_bounds): Delete. |
| (store_returned_bounds): Delete. |
| * targhooks.c (default_load_bounds_for_arg): Delete. |
| (default_store_bounds_for_arg): Delete. |
| (default_load_returned_bounds): Delete. |
| (default_store_returned_bounds): Delete. |
| * targhooks.h (default_load_bounds_for_arg): Delete. |
| (default_store_bounds_for_arg): Delete. |
| (default_load_returned_bounds): Delete. |
| (default_store_returned_bounds): Delete. |
| |
| 2021-10-20 Jonathan Wakely <jwakely@redhat.com> |
| |
| * doc/extend.texi (Basic Asm): Clarify that asm is not an |
| extension in C++. |
| * doc/invoke.texi (-fno-asm): Fix description for C++. |
| |
| 2021-10-20 Jonathan Wakely <jwakely@redhat.com> |
| |
| * doc/install.texi: Remove link to old.html |
| |
| 2021-10-20 Tamar Christina <tamar.christina@arm.com> |
| |
| * config/aarch64/aarch64-simd.md (*aarch64_cmtst_same_<mode>): New. |
| |
| 2021-10-20 Tamar Christina <tamar.christina@arm.com> |
| |
| * config/aarch64/aarch64-simd.md (*aarch64_narrow_trunc<mode>): New. |
| |
| 2021-10-20 Tamar Christina <tamar.christina@arm.com> |
| |
| * config/aarch64/aarch64-simd.md (aarch64_simd_ashr<mode>): Add case cmp |
| case. |
| * config/aarch64/constraints.md (D1): New. |
| |
| 2021-10-20 Tamar Christina <tamar.christina@arm.com> |
| |
| * config/aarch64/aarch64-simd.md |
| (*aarch64_<srn_op>topbits_shuffle<mode>_le): New. |
| (*aarch64_topbits_shuffle<mode>_le): New. |
| (*aarch64_<srn_op>topbits_shuffle<mode>_be): New. |
| (*aarch64_topbits_shuffle<mode>_be): New. |
| * config/aarch64/predicates.md |
| (aarch64_simd_shift_imm_vec_exact_top): New. |
| |
| 2021-10-20 Tamar Christina <tamar.christina@arm.com> |
| |
| * config/aarch64/aarch64-simd.md (*aarch64_<srn_op>shrn<mode>_vect, |
| *aarch64_<srn_op>shrn<mode>2_vect_le, |
| *aarch64_<srn_op>shrn<mode>2_vect_be): New. |
| * config/aarch64/iterators.md (srn_op): New. |
| |
| 2021-10-20 Chung-Lin Tang <cltang@codesourcery.com> |
| |
| * omp-low.c (omp_copy_decl_2): For !ctx, use record_vars to add new copy |
| as local variable. |
| (scan_sharing_clauses): Place copy of OMP_CLAUSE_IN_REDUCTION decl in |
| ctx->outer instead of ctx. |
| |
| 2021-10-20 Martin Liska <mliska@suse.cz> |
| |
| Revert: |
| 2021-10-19 Martin Liska <mliska@suse.cz> |
| |
| PR target/102374 |
| * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p): Strip whitespaces. |
| * system.h (strip_whilespaces): New function. |
| |
| 2021-10-20 Martin Liska <mliska@suse.cz> |
| |
| Revert: |
| 2021-10-19 Martin Liska <mliska@suse.cz> |
| |
| PR target/102375 |
| * config/aarch64/aarch64.c (aarch64_process_one_target_attr): |
| Strip whitespaces. |
| |
| 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com> |
| |
| * config/aarch64/aarch64.c (aarch64_case_values_threshold): |
| Change to 8 with -Os, 11 otherwise. |
| |
| 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com> |
| |
| * config/aarch64/aarch64.c (neoversev1_tunings): |
| Enable AARCH64_EXTRA_TUNE_CHEAP_SHIFT_EXTEND. |
| (neoversen2_tunings): Likewise. |
| |
| 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com> |
| |
| PR target/100966 |
| * config/aarch64/aarch64.md (frint_pattern): Update comment. |
| * config/aarch64/aarch64-simd-builtins.def: Change frintn to roundeven. |
| * config/aarch64/arm_fp16.h: Change frintn to roundeven. |
| * config/aarch64/arm_neon.h: Likewise. |
| * config/aarch64/iterators.md (frint_pattern): Use roundeven for FRINTN. |
| |
| 2021-10-20 Martin Liska <mliska@suse.cz> |
| |
| * config/arm/arm.c (arm_unwind_emit_sequence): Do not declare |
| already declared global variable. |
| (arm_unwind_emit_set): Use out_file as function argument. |
| (arm_unwind_emit): Likewise. |
| * config/darwin.c (machopic_output_data_section_indirection): Likewise. |
| (machopic_output_stub_indirection): Likewise. |
| (machopic_output_indirection): Likewise. |
| (machopic_finish): Likewise. |
| * config/i386/i386.c (ix86_asm_output_function_label): Likewise. |
| * config/i386/winnt.c (i386_pe_seh_unwind_emit): Likewise. |
| * config/ia64/ia64.c (process_epilogue): Likewise. |
| (process_cfa_adjust_cfa): Likewise. |
| (process_cfa_register): Likewise. |
| (process_cfa_offset): Likewise. |
| (ia64_asm_unwind_emit): Likewise. |
| * config/s390/s390.c (s390_asm_output_function_label): Likewise. |
| |
| 2021-10-20 Andre Simoes Dias Vieira <andre.simoesdiasvieira@arm.com> |
| |
| * config/aarch64/aarch64-builtins.c |
| (aarch64_general_gimple_fold_builtin): |
| lower vld1 and vst1 variants of the neon builtins |
| * config/aarch64/aarch64-protos.h: |
| (aarch64_general_gimple_fold_builtin): Add gsi parameter. |
| * config/aarch64/aarch64.c (aarch64_general_gimple_fold_builtin): |
| Likwise. |
| |
| 2021-10-20 Andre Simoes Dias Vieira <andre.simoesdiasvieira@arm.com> |
| |
| * match.pd: Generate IFN_TRUNC. |
| |
| 2021-10-20 Richard Biener <rguenther@suse.de> |
| |
| PR tree-optimization/102853 |
| * tree-data-ref.c (split_constant_offset_1): Bail out |
| immediately if the expression traps on overflow. |
| |
| 2021-10-20 Aldy Hernandez <aldyh@redhat.com> |
| |
| * tree-ssa-threadbackward.c (back_threader::~back_threader): Remove. |
| |
| 2021-10-20 Aldy Hernandez <aldyh@redhat.com> |
| |
| * tree-ssa-threadupdate.c (back_jt_path_registry::adjust_paths_after_duplication): |
| Remove superflous debugging message. |
| (back_jt_path_registry::duplicate_thread_path): Same. |
| |
| 2021-10-20 Aldy Hernandez <aldyh@redhat.com> |
| |
| * tree-ssa-threadbackward.c (back_threader_registry::back_threader_registry): |
| Remove. |
| (back_threader_registry::register_path): Remove m_threaded_paths. |
| |
| 2021-10-20 Aldy Hernandez <aldyh@redhat.com> |
| |
| PR tree-optimization/102814 |
| * doc/invoke.texi: Document --param=max-fsm-thread-length. |
| * params.opt: Add --param=max-fsm-thread-length. |
| * tree-ssa-threadbackward.c |
| (back_threader_profitability::profitable_path_p): Fail on paths |
| longer than max-fsm-thread-length. |
| |
| 2021-10-20 Eric Botcazou <ebotcazou@adacore.com> |
| |
| PR middle-end/102764 |
| * cfgexpand.c (expand_gimple_basic_block): Disregard a final debug |
| statement to reset the current location for the outgoing edges. |
| |
| 2021-10-20 Aldy Hernandez <aldyh@redhat.com> |
| Richard Biener <rguenther@suse.de> |
| |
| * tree-ssa-threadupdate.c (cancel_thread): Dump threading reason |
| on the same line as the threading cancellation. |
| (jt_path_registry::cancel_invalid_paths): Avoid rotating loops. |
| Avoid threading through loop headers where the path remains in the |
| loop. |
| |
| 2021-10-20 Siddhesh Poyarekar <siddhesh@gotplt.org> |
| |
| * tree-object-size.c (unknown): Make into a function. Adjust |
| all uses. |
| (unknown_object_size): Simplify implementation. |
| |
| 2021-10-20 Hongtao Liu <hongtao.liu@intel.com> |
| Kewen Lin <linkw@linux.ibm.com> |
| |
| * doc/sourcebuild.texi (Effective-Target Keywords): Document |
| vect_slp_v2qi_store, vect_slp_v4qi_store, vect_slp_v8qi_store, |
| vect_slp_v16qi_store, vect_slp_v2hi_store, |
| vect_slp_v4hi_store, vect_slp_v2si_store, vect_slp_v4si_store. |
| |
| 2021-10-19 Jonathan Wakely <jwakely@redhat.com> |
| |
| * doc/extend.texi (Basic PowerPC Built-in Functions): Fix typo. |
| |
| 2021-10-19 Paul A. Clarke <pc@us.ibm.com> |
| |
| PR target/101893 |
| PR target/102719 |
| * config/rs6000/emmintrin.h: Guard POWER8 intrinsics. |
| * config/rs6000/pmmintrin.h: Same. |
| * config/rs6000/smmintrin.h: Same. |
| * config/rs6000/tmmintrin.h: Same. |
| |
| 2021-10-19 Paul A. Clarke <pc@us.ibm.com> |
| |
| * config.gcc (extra_headers): Add nmmintrin.h. |
| |
| 2021-10-19 Richard Biener <rguenther@suse.de> |
| |
| * tree-vectorizer.h (vect_supportable_dr_alignment): Add |
| misalignment parameter. |
| * tree-vect-data-refs.c (vect_get_peeling_costs_all_drs): |
| Do not change DR_MISALIGNMENT in place, instead pass the |
| adjusted misalignment to vect_supportable_dr_alignment. |
| (vect_peeling_supportable): Likewise. |
| (vect_peeling_hash_get_lowest_cost): Adjust. |
| (vect_enhance_data_refs_alignment): Likewise. |
| (vect_vfa_access_size): Likewise. |
| (vect_supportable_dr_alignment): Add misalignment |
| parameter and simplify. |
| * tree-vect-stmts.c (get_negative_load_store_type): Adjust. |
| (get_group_load_store_type): Likewise. |
| (get_load_store_type): Likewise. |
| |
| 2021-10-19 Clément Chigot <clement.chigot@atos.net> |
| |
| * config/rs6000/rs6000.c (rs6000_xcoff_file_end): Move |
| __tls_get_addr reference to .text csect. |
| |
| 2021-10-19 Martin Liska <mliska@suse.cz> |
| |
| PR target/102375 |
| * config/aarch64/aarch64.c (aarch64_process_one_target_attr): |
| Strip whitespaces. |
| |
| 2021-10-19 Richard Biener <rguenther@suse.de> |
| |
| * tree-vectorizer.h (vect_get_store_cost): Adjust signature. |
| (vect_get_load_cost): Likewise. |
| * tree-vect-data-refs.c (vect_get_data_access_cost): Get |
| alignment support scheme and misalignment as arguments |
| and pass them down. |
| (vect_get_peeling_costs_all_drs): Compute that info here |
| and note that we shouldn't need to. |
| * tree-vect-stmts.c (vect_model_store_cost): Get |
| alignment support scheme and misalignment as arguments. |
| (vect_get_store_cost): Likewise. |
| (vect_model_load_cost): Likewise. |
| (vect_get_load_cost): Likewise. |
| (vectorizable_store): Pass down alignment support scheme |
| and misalignment to costing. |
| (vectorizable_load): Likewise. |
| |
| 2021-10-19 Richard Biener <rguenther@suse.de> |
| |
| * tree-vect-stmts.c (get_negative_load_store_type): Add |
| offset output parameter and initialize it. |
| (get_group_load_store_type): Likewise. |
| (get_load_store_type): Likewise. |
| (vectorizable_store): Use offset as computed by |
| get_load_store_type. |
| (vectorizable_load): Likewise. |
| |
| 2021-10-19 Richard Biener <rguenther@suse.de> |
| |
| PR tree-optimization/102827 |
| * tree-if-conv.c (predicate_statements): Add pe parameter |
| and use that edge to insert invariant stmts on. |
| (combine_blocks): Pass through pe. |
| (tree_if_conversion): Compute the edge to insert invariant |
| stmts on and pass it along. |
| |
| 2021-10-19 Roger Sayle <roger@nextmovesoftware.com> |
| |
| PR target/102785 |
| * config/bfin/bfin.md (addsubv2hi3, subaddv2hi3, ssaddsubv2hi3, |
| sssubaddv2hi3): Swap the order of operators in vec_concat. |
| |
| 2021-10-19 Xionghu Luo <luoxhu@linux.ibm.com> |
| |
| * config/rs6000/altivec.md (*altivec_vmrghb_internal): Delete. |
| (altivec_vmrghb_direct): New. |
| (*altivec_vmrghh_internal): Delete. |
| (altivec_vmrghh_direct): New. |
| (*altivec_vmrghw_internal): Delete. |
| (altivec_vmrghw_direct_<mode>): New. |
| (altivec_vmrghw_direct): Delete. |
| (*altivec_vmrglb_internal): Delete. |
| (altivec_vmrglb_direct): New. |
| (*altivec_vmrglh_internal): Delete. |
| (altivec_vmrglh_direct): New. |
| (*altivec_vmrglw_internal): Delete. |
| (altivec_vmrglw_direct_<mode>): New. |
| (altivec_vmrglw_direct): Delete. |
| * config/rs6000/rs6000-p8swap.c (rtx_is_swappable_p): Adjust. |
| * config/rs6000/rs6000.c (altivec_expand_vec_perm_const): |
| Adjust. |
| * config/rs6000/vsx.md (vsx_xxmrghw_<mode>): Adjust. |
| (vsx_xxmrglw_<mode>): Adjust. |
| |
| 2021-10-19 Aldy Hernandez <aldyh@redhat.com> |
| |
| * passes.def: Change threading comment before pass_ccp pass. |
| |
| 2021-10-19 Haochen Gui <guihaoc@gcc.gnu.org> |
| |
| * config/rs6000/rs6000-call.c (altivec_expand_lxvr_builtin): |
| Modify the expansion for sign extension. All extensions are done |
| within VSX registers. |
| |
| 2021-10-19 Richard Biener <rguenther@suse.de> |
| |
| * tree-vect-stmts.c (get_group_load_store_type): Add |
| misalignment output parameter and initialize it. |
| (get_group_load_store_type): Likewise. |
| (vectorizable_store): Remove now redundant queries. |
| (vectorizable_load): Likewise. |
| |
| 2021-10-19 Richard Biener <rguenther@suse.de> |
| |
| * tree-vectorizer.h (vect_supportable_dr_alignment): Remove |
| check_aligned argument. |
| * tree-vect-data-refs.c (vect_supportable_dr_alignment): |
| Likewise. |
| (vect_peeling_hash_insert): Add supportable_if_not_aligned |
| argument and do not call vect_supportable_dr_alignment here. |
| (vect_peeling_supportable): Adjust. |
| (vect_enhance_data_refs_alignment): Compute whether the |
| access is supported with different alignment here and |
| pass that down to vect_peeling_hash_insert. |
| (vect_vfa_access_size): Adjust. |
| * tree-vect-stmts.c (vect_get_store_cost): Likewise. |
| (vect_get_load_cost): Likewise. |
| (get_negative_load_store_type): Likewise. |
| (get_group_load_store_type): Likewise. |
| (get_load_store_type): Likewise. |
| |
| 2021-10-19 Martin Liska <mliska@suse.cz> |
| |
| PR target/102374 |
| * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p): Strip whitespaces. |
| * system.h (strip_whilespaces): New function. |
| |
| 2021-10-19 dianhong xu <dianhong.xu@intel.com> |
| |
| * config/i386/avx512fp16intrin.h: |
| (_mm512_set1_pch): New intrinsic. |
| * config/i386/avx512fp16vlintrin.h: |
| (_mm256_set1_pch): New intrinsic. |
| (_mm_set1_pch): Ditto. |
| |
| 2021-10-18 Andrew MacLeod <amacleod@redhat.com> |
| |
| PR tree-optimization/102796 |
| * gimple-range.cc (gimple_ranger::range_on_edge): Process EH edges |
| normally. Return get_tree_range for non gimple_range_ssa_p names. |
| (gimple_ranger::range_of_stmt): Use get_tree_range for non |
| gimple_range_ssa_p names. |
| |
| 2021-10-18 Uroš Bizjak <ubizjak@gmail.com> |
| |
| PR target/102761 |
| * config/i386/i386.c (ix86_print_operand_address): |
| Error out for non-address_operand asm operands. |
| |
| 2021-10-18 Richard Biener <rguenther@suse.de> |
| |
| * tree-vect-data-refs.c (vect_peeling_hash_insert): Do |
| not auto-convert dr_alignment_support to bool. |
| (vect_peeling_supportable): Likewise. |
| (vect_enhance_data_refs_alignment): Likewise. |
| (vect_supportable_dr_alignment): Commonize read/write case. |
| * tree-vect-stmts.c (vect_get_store_cost): Use |
| dr_alignment_support, not int, for the vect_supportable_dr_alignment |
| result. |
| (vect_get_load_cost): Likewise. |
| |
| 2021-10-18 Siddhesh Poyarekar <siddhesh@gotplt.org> |
| |
| * tree-object-size.c (object_sizes_execute): Consolidate LHS |
| null check and do it early. |
| |
| 2021-10-18 Richard Biener <rguenther@suse.de> |
| |
| * tree-vect-stmts.c (vectorizable_store): Use the |
| computed alignment scheme instead of querying |
| aligned_access_p. |
| |
| 2021-10-18 Richard Biener <rguenther@suse.de> |
| |
| * tree-vect-stmts.c (vectorizable_store): Do not recompute |
| alignment scheme already determined by get_load_store_type. |
| |
| 2021-10-18 Aldy Hernandez <aldyh@redhat.com> |
| |
| * tree-ssa-threadbackward.c (class pass_thread_jumps_full): |
| Clone corresponding pass. |
| |
| 2021-10-18 Roger Sayle <roger@nextmovesoftware.com> |
| |
| * combine.c (recog_for_combine): For an unrecognized move/set of |
| a constant, try force_const_mem to place it in the constant pool. |
| * cse.c (constant_pool_entries_cost, constant_pool_entries_regcost): |
| Delete global variables (that are no longer assigned a cost value). |
| (cse_insn): Simplify logic for deciding whether to place a folded |
| constant in the constant pool using force_const_mem. |
| (cse_main): Remove zero initialization of constant_pool_entries_cost |
| and constant_pool_entries_regcost. |
| * config/i386/i386.c (ix86_rtx_costs): Make memory accesses |
| fractionally more expensive, when optimizing for speed. |
| |
| 2021-10-18 Martin Liska <mliska@suse.cz> |
| |
| PR gcov-profile/102746 |
| PR gcov-profile/102747 |
| * gcov.c (main): Return return_code. |
| (output_gcov_file): Mark return_code when error happens. |
| (generate_results): Likewise. |
| (read_graph_file): Likewise. |
| (read_count_file): Likewise. |
| |
| 2021-10-18 Roger Sayle <roger@nextmovesoftware.com> |
| |
| * config/bfin/bfin.md (define_constants): Remove UNSPEC_ONES. |
| (define_insn "ones"): Replace UNSPEC_ONES with a truncate of |
| a popcount, allowing compile-time evaluation/simplification. |
| (popcountsi2, popcounthi2): New expanders using a "ones" insn. |
| |
| 2021-10-18 Richard Biener <rguenther@suse.de> |
| |
| PR tree-optimization/102788 |
| * tree-vect-patterns.c (vect_init_pattern_stmt): Allow |
| a NULL vectype. |
| (vect_pattern_recog_1): Likewise. |
| (vect_recog_bool_pattern): Continue matching the pattern |
| even if we do not have a vector type for a conversion |
| result. |
| |
| 2021-10-18 Roger Sayle <roger@nextmovesoftware.com> |
| |
| * simplify-rtx.c (simplify_const_unary_operation) [SS_NEG, SS_ABS]: |
| Evalute SS_NEG and SS_ABS of a constant argument. |
| |
| 2021-10-18 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org> |
| |
| PR target/93183 |
| * gimple-match-head.c (try_conditional_simplification): Add case for single operand. |
| * internal-fn.def: Add entry for COND_NEG internal function. |
| * internal-fn.c (FOR_EACH_CODE_MAPPING): Add entry for |
| NEGATE_EXPR, COND_NEG mapping. |
| * optabs.def: Add entry for cond_neg_optab. |
| * match.pd (UNCOND_UNARY, COND_UNARY): New operator lists. |
| (vec_cond COND (foo A) B) -> (IFN_COND_FOO COND A B): New pattern. |
| (vec_cond COND B (foo A)) -> (IFN_COND_FOO ~COND A B): Likewise. |
| |
| 2021-10-18 Aldy Hernandez <aldyh@redhat.com> |
| |
| * tree-ssa-strlen.c (class strlen_pass): Rename from |
| strlen_dom_walker. |
| (handle_builtin_stxncpy_strncat): Move to strlen_pass. |
| (handle_assign): Same. |
| (adjust_last_stmt): Same. |
| (maybe_warn_overflow): Same. |
| (handle_builtin_strlen): Same. |
| (handle_builtin_strchr): Same. |
| (handle_builtin_strcpy): Same. |
| (handle_builtin_strncat): Same. |
| (handle_builtin_stxncpy_strncat): Same. |
| (handle_builtin_memcpy): Same. |
| (handle_builtin_strcat): Same. |
| (handle_alloc_call): Same. |
| (handle_builtin_memset): Same. |
| (handle_builtin_memcmp): Same. |
| (get_len_or_size): Same. |
| (strxcmp_eqz_result): Same. |
| (handle_builtin_string_cmp): Same. |
| (handle_pointer_plus): Same. |
| (count_nonzero_bytes_addr): Same. |
| (count_nonzero_bytes): Same. |
| (handle_store): Same. |
| (strlen_check_and_optimize_call): Same. |
| (handle_integral_assign): Same. |
| (check_and_optimize_stmt): Same. |
| (printf_strlen_execute): Rename strlen_dom_walker to strlen_pass. |
| |
| 2021-10-18 Richard Biener <rguenther@suse.de> |
| |
| PR tree-optimization/102798 |
| * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref): |
| Only copy points-to info to newly generated SSA names. |
| |
| 2021-10-18 Martin Liska <mliska@suse.cz> |
| |
| * dbgcnt.c (dbg_cnt_process_opt): Remove unused but set variable. |
| * gcov.c (get_cycles_count): Likewise. |
| * lto-compress.c (lto_compression_zlib): Likewise. |
| (lto_uncompression_zlib): Likewise. |
| * targhooks.c (default_pch_valid_p): Likewise. |
| |
| 2021-10-17 Aldy Hernandez <aldyh@redhat.com> |
| |
| * tree-pass.h (make_pass_thread_jumps_full): New. |
| * tree-ssa-threadbackward.c (pass_thread_jumps::gate): Inline. |
| (try_thread_blocks): Add resolve and speed arguments. |
| (pass_thread_jumps::execute): Inline. |
| (do_early_thread_jumps): New. |
| (do_thread_jumps): New. |
| (make_pass_thread_jumps): Move. |
| (pass_early_thread_jumps::gate): Inline. |
| (pass_early_thread_jumps::execute): Inline. |
| (class pass_thread_jumps_full): New. |
| |
| 2021-10-16 Piotr Kubaj <pkubaj@FreeBSD.org> |
| |
| * configure.ac: Treat powerpc64*-*-freebsd* the same as |
| powerpc64-*-freebsd*. |
| * configure: Regenerate. |
| |
| 2021-10-16 H.J. Lu <hjl.tools@gmail.com> |
| |
| * value-query.cc (get_ssa_name_ptr_info_nonnull): Change |
| set_ptr_nonull to set_ptr_nonnull in comments. |
| |
| 2021-10-16 Jan Hubicka <hubicka@ucw.cz> |
| |
| PR tree-optimization/102720 |
| * tree-ssa-structalias.c (compute_points_to_sets): Fix producing |
| of call used and clobbered sets. |
| |
| 2021-10-15 Iain Sandoe <iain@sandoe.co.uk> |
| |
| * config/darwin.h (LINK_COMMAND_SPEC_A): Update 'r' handling to |
| skip gomp and itm when r or nodefaultlibs is given. |
| (DSYMUTIL_SPEC): Do not call dsymutil for '-r' link lines. |
| Update ordering of exclusions, remove duplicate 'v' addition |
| (collect2 will add this from the main command line). |
| |
| 2021-10-15 Iain Sandoe <iain@sandoe.co.uk> |
| |
| * config/darwin-driver.c (darwin_driver_init): Revise comments, handle |
| filelist and framework options in specs instead of code. |
| * config/darwin.h (SUBTARGET_DRIVER_SELF_SPECS): Update to handle link |
| specs that are really driver ones. |
| (DARWIN_CC1_SPEC): Likewise. |
| (CPP_SPEC): Likewise. |
| (SYSROOT_SPEC): Append space. |
| (LINK_SYSROOT_SPEC): Remove most driver link specs. |
| (STANDARD_STARTFILE_PREFIX_2): Update link-related specs. |
| (STARTFILE_SPEC): Likewise. |
| (ASM_MMACOSX_VERSION_MIN_SPEC): Fix line wrap. |
| (ASM_SPEC): Update driver-related specs. |
| (ASM_FINAL_SPEC): Likewise. |
| * config/darwin.opt: Remove now unused option aliases. |
| * config/i386/darwin.h (EXTRA_ASM_OPTS): Ensure space after opt. |
| (ASM_SPEC): Update driver-related specs. |
| |
| 2021-10-15 Roger Sayle <roger@nextmovesoftware.com> |
| |
| * config/i386/i386.c (ix86_hardreg_mov_ok): For vector modes, |
| allow standard_sse_constant_p immediate constants. |
| |
| 2021-10-15 Iain Sandoe <iain@sandoe.co.uk> |
| |
| * config.gcc: Add tm-dwarf2.h to tm_d-file. |
| |
| 2021-10-15 Andrew MacLeod <amacleod@redhat.com> |
| |
| * gimple-range-fold.h (gimple_range_ssa_p): Don't process names |
| that occur in abnormal phis. |
| * gimple-range.cc (gimple_ranger::range_on_edge): Return false for |
| abnormal and EH edges. |
| * gimple-ssa-evrp.c (rvrp_folder::value_of_expr): Ditto. |
| (rvrp_folder::value_on_edge): Ditto. |
| (rvrp_folder::value_of_stmt): Ditto. |
| (hybrid_folder::value_of_expr): Ditto for ranger queries. |
| (hybrid_folder::value_on_edge): Ditto. |
| (hybrid_folder::value_of_stmt): Ditto. |
| * value-query.cc (gimple_range_global): Always return a range if |
| the type is supported. |
| |
| 2021-10-15 John David Anglin <danglin@gcc.gnu.org> |
| |
| * config/pa/pa.md: Consistently use "rG" constraint for copy |
| instruction in move patterns. |
| |
| 2021-10-15 Andrew Stubbs <ams@codesourcery.com> |
| |
| * config/gcn/gcn-hsa.h (S_FIJI): Set unconditionally. |
| (S_900): Likewise. |
| (S_906): Likewise. |
| * config/gcn/gcn.c: Hard code SRAM ECC settings for old architectures. |
| * config/gcn/mkoffload.c (ELFABIVERSION_AMDGPU_HSA): Rename to ... |
| (ELFABIVERSION_AMDGPU_HSA_V3): ... this. |
| (ELFABIVERSION_AMDGPU_HSA_V4): New. |
| (SET_SRAM_ECC_UNSUPPORTED): New. |
| (copy_early_debug_info): Create elf flags to match the other objects. |
| (main): Just let the attribute flags pass through. |
| |
| 2021-10-15 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com> |
| |
| * tree-loop-distribution.c (reduction_var_overflows_first): |
| Pass the type of reduction_var as first argument as it is also |
| done for the load type. |
| (loop_distribution::transform_reduction_loop): Add missing |
| TREE_TYPE while determining precission of reduction_var. |
| |
| 2021-10-15 Richard Biener <rguenther@suse.de> |
| |
| * defaults.h (PREFERRED_DEBUGGING_TYPE): Choose DWARF2_DEBUG |
| when not set. |
| * toplev.c (process_options): Warn when STABS debugging is |
| enabled but not the preferred format. |
| * config/pa/som.h (PREFERRED_DEBUGGING_TYPE): Define to |
| DBX_DEBUG. |
| * config/pdp11/pdp11.h (PREFERRED_DEBUGGING_TYPE): Likewise. |
| |
| 2021-10-15 Richard Biener <rguenther@suse.de> |
| |
| PR ipa/102762 |
| * tree-inline.c (copy_bb): Avoid underflowing nargs. |
| |
| 2021-10-15 Hongyu Wang <hongyu.wang@intel.com> |
| |
| * config/i386/i386-expand.c (ix86_expand_vec_perm): Convert |
| HFmode input operand to HImode. |
| (ix86_vectorize_vec_perm_const): Likewise. |
| * config/i386/sse.md (*avx512bw_permvar_truncv16siv16hi_1_hf): |
| New define_insn. |
| (*avx512f_permvar_truncv8siv8hi_1_hf): |
| Likewise. |
| |
| 2021-10-15 Richard Biener <rguenther@suse.de> |
| |
| PR middle-end/102682 |
| * expmed.c (store_bit_field_1): Ensure a LHS subreg would |
| not create a paradoxical subreg. |
| |
| 2021-10-15 Hongyu Wang <hongyu.wang@intel.com> |
| |
| * config/i386/i386-expand.c (ix86_expand_vector_init): |
| For half_vector concat for HFmode, handle them like HImode. |
| |
| 2021-10-14 Aldy Hernandez <aldyh@redhat.com> |
| |
| * tree-ssa-threadbackward.c (class back_threader): Add m_resolve. |
| (back_threader::back_threader): Same. |
| (back_threader::resolve_phi): Try to solve without looking back if |
| possible. |
| (back_threader::find_paths_to_names): Same. |
| (try_thread_blocks): Pass resolve argument to back threader. |
| (pass_early_thread_jumps::execute): Same. |
| |
| 2021-10-14 Aldy Hernandez <aldyh@redhat.com> |
| |
| * doc/invoke.texi: Remove max-fsm-thread-length, |
| max-fsm-thread-paths, and fsm-maximum-phi-arguments. |
| * params.opt: Same. |
| * tree-ssa-threadbackward.c (back_threader::back_threader): Remove |
| argument. |
| (back_threader_registry::back_threader_registry): Same. |
| (back_threader_profitability::profitable_path_p): Remove |
| param_max_fsm_thread-length. |
| (back_threader_registry::register_path): Remove |
| m_max_allowable_paths. |
| |
| 2021-10-14 Aldy Hernandez <aldyh@redhat.com> |
| |
| * tree-ssa-threadbackward.c (class back_threader): Make m_imports |
| an auto_bitmap. |
| (back_threader::~back_threader): Do not release m_path. |
| |
| 2021-10-14 Andrew MacLeod <amacleod@redhat.com> |
| |
| PR tree-optimization/102738 |
| * vr-values.c (simplify_using_ranges::simplify): Handle RSHIFT_EXPR. |
| |
| 2021-10-14 Kwok Cheung Yeung <kcy@codesourcery.com> |
| |
| * omp-general.c (omp_check_context_selector): Move from c-omp.c. |
| (omp_mark_declare_variant): Move from c-omp.c. |
| (omp_context_name_list_prop): Update for Fortran strings. |
| * omp-general.h (omp_check_context_selector): New prototype. |
| (omp_mark_declare_variant): New prototype. |
| |
| 2021-10-14 Richard Sandiford <richard.sandiford@arm.com> |
| |
| * config/rs6000/rs6000.c (rs6000_density_test): Move early |
| exit test further up the function. |
| |
| 2021-10-14 Richard Sandiford <richard.sandiford@arm.com> |
| |
| * config/arm/arm.c (arm_add_stmt_cost): Delete. |
| (TARGET_VECTORIZE_ADD_STMT_COST): Delete. |
| |
| 2021-10-14 Martin Jambor <mjambor@suse.cz> |
| |
| * doc/invoke.texi (Optimize Options): Add entry for |
| ipa-cp-recursive-freq-factor. |
| |
| 2021-10-14 Tamar Christina <tamar.christina@arm.com> |
| |
| * match.pd: New rule. |
| |
| 2021-10-14 Jan Hubicka <hubicka@ucw.cz> |
| |
| PR ipa/102557 |
| * ipa-modref-tree.h (modref_access_node::update2): |
| Also check that parm_offset is unchanged. |
| (modref_ref_node::insert_access): Fix updating of |
| parameter. |
| |
| 2021-10-14 Aldy Hernandez <aldyh@redhat.com> |
| |
| * tree-ssa-threadbackward.c (back_threader::resolve_phi): Add |
| FIXME note. |
| |
| 2021-10-14 Richard Biener <rguenther@suse.de> |
| |
| PR tree-optimization/102659 |
| * tree-if-conv.c (if_convertible_gimple_assign_stmt_p): Also |
| rewrite pointer typed undefined overflow operations. |
| (predicate_statements): Likewise. Make sure to emit invariant |
| conversions in the preheader. |
| * tree-vectorizer.c (vect_loop_vectorized_call): Look through |
| non-empty preheaders. |
| * tree-data-ref.c (dr_analyze_indices): Strip useless |
| conversions to the MEM_REF base type. |
| |
| 2021-10-14 Martin Liska <mliska@suse.cz> |
| |
| * common.opt: Stop using AUTODETECT_VALUE |
| and use EnabledBy where possible. |
| * opts.c: Enable OPT_fvar_tracking with optimize >= 1. |
| * toplev.c (AUTODETECT_VALUE): Remove macro. |
| (process_options): Simplify by using EnabledBy and |
| OPT_fvar_tracking. Use OPTION_SET_P macro instead of |
| AUTODETECT_VALUE. |
| |
| 2021-10-14 Jonathan Wright <jonathan.wright@arm.com> |
| |
| * config/aarch64/arm_neon.h (vld1_s8_x3): Use signed type for |
| pointer parameter. |
| (vld1_s32_x3): Likewise. |
| |
| 2021-10-14 Aldy Hernandez <aldyh@redhat.com> |
| |
| PR tree-optimization/102736 |
| PR tree-optimization/102736 |
| * gimple-range-path.cc (path_range_query::range_on_path_entry): |
| Assert that the requested range is defined outside the path. |
| (path_range_query::ssa_range_in_phi): Do not call |
| range_on_path_entry for SSA names that are defined within the |
| path. |
| |
| 2021-10-14 Iain Sandoe <iain@sandoe.co.uk> |
| |
| * config/darwin-driver.c (darwin_find_version_from_kernel): |
| Quote internal identifiers and avoid contractions in |
| warnings. |
| (darwin_default_min_version): Likewise. |
| (darwin_driver_init): Likewise. |
| |
| 2021-10-14 Martin Jambor <mjambor@suse.cz> |
| |
| * params.opt (ipa-cp-recursive-freq-factor): New. |
| * ipa-cp.c (ipcp_value): Switch to inline initialization. New members |
| scc_no, self_recursion_generated_level, same_scc and |
| self_recursion_generated_p. |
| (ipcp_lattice::add_value): Replaced parameter unlimited with |
| same_lat_gen_level, usit it determine limit of values and store it to |
| the value. |
| (ipcp_lattice<valtype>::print): Dump the new fileds. |
| (allocate_and_init_ipcp_value): Take same_lat_gen_level as a new |
| parameter and store it to the new value. |
| (self_recursively_generated_p): Removed. |
| (propagate_vals_across_arith_jfunc): Use self_recursion_generated_p |
| instead of self_recursively_generated_p, store self generation level |
| to such values. |
| (value_topo_info<valtype>::add_val): Set scc_no. |
| (value_topo_info<valtype>::propagate_effects): Multiply frequencies of |
| recursively feeding values and self generated values by appropriate |
| new factors. |
| |
| 2021-10-14 Richard Sandiford <richard.sandiford@arm.com> |
| |
| * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Remove |
| redundant test for flag_vect_cost_model. |
| |
| 2021-10-14 Aldy Hernandez <aldyh@redhat.com> |
| |
| * bitmap.c (debug): New overloaded function for auto_bitmaps. |
| * bitmap.h (debug): Same. |
| |
| 2021-10-14 Hongyu Wang <hongyu.wang@intel.com> |
| |
| * config/i386/avx512fp16intrin.h (_mm512_mask_fcmadd_pch): |
| Adjust builtin call. |
| (_mm512_mask3_fcmadd_pch): Likewise. |
| (_mm512_mask_fmadd_pch): Likewise |
| (_mm512_mask3_fmadd_pch): Likewise |
| (_mm512_mask_fcmadd_round_pch): Likewise |
| (_mm512_mask3_fcmadd_round_pch): Likewise |
| (_mm512_mask_fmadd_round_pch): Likewise |
| (_mm512_mask3_fmadd_round_pch): Likewise |
| (_mm_mask_fcmadd_sch): Likewise |
| (_mm_mask3_fcmadd_sch): Likewise |
| (_mm_mask_fmadd_sch): Likewise |
| (_mm_mask3_fmadd_sch): Likewise |
| (_mm_mask_fcmadd_round_sch): Likewise |
| (_mm_mask3_fcmadd_round_sch): Likewise |
| (_mm_mask_fmadd_round_sch): Likewise |
| (_mm_mask3_fmadd_round_sch): Likewise |
| (_mm_fcmadd_round_sch): Likewise |
| * config/i386/avx512fp16vlintrin.h (_mm_mask_fmadd_pch): |
| Adjust builtin call. |
| (_mm_mask3_fmadd_pch): Likewise |
| (_mm256_mask_fmadd_pch): Likewise |
| (_mm256_mask3_fmadd_pch): Likewise |
| (_mm_mask_fcmadd_pch): Likewise |
| (_mm_mask3_fcmadd_pch): Likewise |
| (_mm256_mask_fcmadd_pch): Likewise |
| (_mm256_mask3_fcmadd_pch): Likewise |
| * config/i386/i386-builtin.def: Add mask3 builtin for complex |
| fma, and adjust mask_builtin to corresponding expander. |
| * config/i386/i386-expand.c (ix86_expand_round_builtin): |
| Skip eraseing embedded rounding for expanders that emits |
| multiple insns. |
| * config/i386/sse.md (complexmove): New mode_attr. |
| (<avx512>_fmaddc_<mode>_mask1<round_expand_name>): New expander. |
| (<avx512>_fcmaddc_<mode>_mask1<round_expand_name>): Likewise. |
| (avx512fp16_fmaddcsh_v8hf_mask1<round_expand_name>): Likewise. |
| (avx512fp16_fcmaddcsh_v8hf_mask1<round_expand_name>): Likewise. |
| (avx512fp16_fcmaddcsh_v8hf_mask3<round_expand_name>): Likewise. |
| (avx512fp16_fmaddcsh_v8hf_mask3<round_expand_name>): Likewise. |
| * config/i386/subst.md (round_embedded_complex): New subst. |
| |
| 2021-10-14 John David Anglin <danglin@gcc.gnu.org> |
| |
| * config/pa/pa.md (cbranchsf4): Disable if TARGET_SOFT_FLOAT. |
| (cbranchdf4): Likewise. |
| Add missing move patterns for TARGET_SOFT_FLOAT. |
| |
| 2021-10-13 Roger Sayle <roger@nextmovesoftware.com> |
| |
| * config/i386/i386-expand.c (ix86_expand_vector_move): Use a |
| pseudo intermediate when moving a SUBREG into a hard register, |
| by checking ix86_hardreg_mov_ok. |
| (ix86_expand_vector_extract): Store zero-extended SImode |
| intermediate in a pseudo, then set target using a SUBREG_PROMOTED |
| annotated subreg. |
| * config/i386/sse.md (mov<VMOVE>_internal): Prevent CSE creating |
| complex (SUBREG) sets of (vector) hard registers before reload, by |
| checking ix86_hardreg_mov_ok. |
| |
| 2021-10-13 Indu Bhagat <indu.bhagat@oracle.com> |
| |
| * ctfc.h (enum ctf_dtu_d_union_enum): Remove redundant comma. |
| |
| 2021-10-13 Indu Bhagat <indu.bhagat@oracle.com> |
| |
| * dwarf2ctf.c (gen_ctf_array_type): Fix typo in comment. |
| |
| 2021-10-13 Martin Sebor <msebor@redhat.com> |
| |
| PR middle-end/102630 |
| * pointer-query.cc (compute_objsize_r): Handle named address spaces. |
| |
| 2021-10-13 Iain Sandoe <iain@sandoe.co.uk> |
| |
| * collect2.c (is_lto_object_file): Release simple-object |
| resources, close files. |
| |
| 2021-10-13 Andre Vieira <andre.simoesdiasvieira@arm.com> |
| |
| * config/arm/arm.c (thumb2_legitimate_address_p): Use VALID_MVE_MODE |
| when checking mve addressing modes. |
| (mve_vector_mem_operand): Fix the way we handle pre, post and offset |
| addressing modes. |
| (arm_print_operand): Fix printing of POST_ and PRE_MODIFY. |
| * config/arm/mve.md: Use mve_memory_operand predicate everywhere where |
| there is a single Ux constraint. |
| |
| 2021-10-13 John David Anglin <danglin@gcc.gnu.org> |
| |
| * config/pa/pa.md (muldi3): Add support for inlining 64-bit |
| multiplication on 32-bit PA 1.1 and 2.0 targets. |
| |
| 2021-10-13 Martin Liska <mliska@suse.cz> |
| |
| PR gcov-profile/90364 |
| * coverage.c (build_info): Emit checksum to the global variable. |
| (build_info_type): Add new field for checksum. |
| (coverage_obj_finish): Pass object_checksum. |
| (coverage_init): Use 0 as checksum for .gcno files. |
| * gcov-dump.c (dump_gcov_file): Dump also new checksum field. |
| * gcov.c (read_graph_file): Read also checksum. |
| * doc/invoke.texi: Document the behaviour change. |
| |
| 2021-10-13 Richard Biener <rguenther@suse.de> |
| |
| * gimple-iterator.h (gsi_iterator_update): Add GSI_LAST_NEW_STMT, |
| start at integer value 2. |
| * gimple-iterator.c (gsi_insert_seq_nodes_before): Update |
| the iterator for GSI_LAST_NEW_STMT. |
| (gsi_insert_seq_nodes_after): Likewise. |
| * tree-if-conv.c (predicate_statements): Use GSI_LAST_NEW_STMT. |
| * tree-ssa.c (execute_update_addresses_taken): Correct bogus |
| arguments to gsi_replace. |
| |
| 2021-10-13 Martin Liska <mliska@suse.cz> |
| |
| PR target/102688 |
| * common.opt: Use EnabledBy instead of detection in |
| finish_options and process_options. |
| * opts.c (finish_options): Remove handling of |
| x_flag_unroll_all_loops. |
| * toplev.c (process_options): Likewise for flag_web and |
| flag_rename_registers. |
| |
| 2021-10-13 Richard Biener <rguenther@suse.de> |
| |
| PR tree-optimization/102659 |
| * tree-if-conv.c (need_to_rewrite_undefined): New flag. |
| (if_convertible_gimple_assign_stmt_p): Mark the loop for |
| rewrite when stmts with undefined behavior on integer |
| overflow appear. |
| (combine_blocks): Predicate also when we need to rewrite stmts. |
| (predicate_statements): Rewrite affected stmts to something |
| with well-defined behavior on overflow. |
| (tree_if_conversion): Initialize need_to_rewrite_undefined. |
| |
| 2021-10-13 Richard Biener <rguenther@suse.de> |
| |
| PR ipa/102714 |
| * ipa-sra.c (ptr_parm_has_nonarg_uses): Fix volatileness |
| check. |
| |
| 2021-10-13 Jose E. Marchesi <jose.marchesi@oracle.com> |
| |
| * dwarf2ctf.c: Fix typo in comment. |
| |
| 2021-10-12 Bill Schmidt <wschmidt@linux.ibm.com> |
| |
| PR target/101985 |
| * config/rs6000/altivec.h (vec_cpsgn): Swap operand order. |
| * config/rs6000/rs6000-overload.def (VEC_COPYSIGN): Use SKIP to |
| avoid generating an automatic #define of vec_cpsgn. Use the |
| correct built-in for V4SFmode that doesn't depend on VSX. |
| |
| 2021-10-12 Uroš Bizjak <ubizjak@gmail.com> |
| |
| PR target/85730 |
| PR target/82524 |
| * config/i386/i386.md (*add<mode>_1_slp): Rewrite as |
| define_insn_and_split pattern. Add alternative 1 and split it |
| post reload to insert operand 1 into the low part of operand 0. |
| (*sub<mode>_1_slp): Ditto. |
| (*and<mode>_1_slp): Ditto. |
| (*<any_or:code><mode>_1_slp): Ditto. |
| (*ashl<mode>3_1_slp): Ditto. |
| (*<any_shiftrt:insn><mode>3_1_slp): Ditto. |
| (*<any_rotate:insn><mode>3_1_slp): Ditto. |
| (*neg<mode>_1_slp): New insn_and_split pattern. |
| (*one_cmpl<mode>_1_slp): Ditto. |
| |
| 2021-10-12 David Edelsohn <dje.gcc@gmail.com> |
| |
| * doc/install.texi: Update MinGW and mingw-64 Binaries |
| download links. |
| |
| 2021-10-12 Daniel Le Duc Khoi Nguyen <greenrecyclebin@gmail.com> |
| |
| * doc/extend.texi (Common Variable Attributes): Fix typos in |
| alloc_size documentation. |
| |
| 2021-10-12 Richard Biener <rguenther@suse.de> |
| |
| PR tree-optimization/102696 |
| * tree-vect-slp.c (vect_build_slp_tree_2): Properly mark |
| the tree fatally failed when we reject a BIT_FIELD_REF. |
| |
| 2021-10-12 Richard Biener <rguenther@suse.de> |
| |
| PR tree-optimization/102572 |
| * tree-vect-stmts.c (vect_build_gather_load_calls): When |
| gathering the vectorized defs for the mask pass in the |
| desired mask vector type so invariants will be handled |
| correctly. |
| |
| 2021-10-12 Tamar Christina <tamar.christina@arm.com> |
| |
| * config/aarch64/aarch64-sve.md (*fcm<cmp_op><mode>_bic_combine, |
| *fcm<cmp_op><mode>_nor_combine, *fcmuo<mode>_bic_combine, |
| *fcmuo<mode>_nor_combine): New. |
| |
| 2021-10-12 Eric Botcazou <ebotcazou@adacore.com> |
| |
| PR target/102588 |
| * config/sparc/sparc-modes.def (OI): New integer mode. |
| |
| 2021-10-12 Jakub Jelinek <jakub@redhat.com> |
| |
| * gimple-fold.h (clear_padding_type_may_have_padding_p): Declare. |
| * gimple-fold.c (clear_padding_type_may_have_padding_p): No longer |
| static. |
| |
| 2021-10-12 Jakub Jelinek <jakub@redhat.com> |
| |
| * tree-vectorizer.h (loop_cost_model): New function. |
| (unlimited_cost_model): Use it. |
| * tree-vect-loop.c (vect_analyze_loop_costing): Use loop_cost_model |
| call instead of flag_vect_cost_model. |
| * tree-vect-data-refs.c (vect_enhance_data_refs_alignment): Likewise. |
| (vect_prune_runtime_alias_test_list): Likewise. Also use it instead |
| of flag_simd_cost_model. |
| |
| 2021-10-12 liuhongt <hongtao.liu@intel.com> |
| |
| PR target/102483 |
| * config/i386/i386-expand.c (emit_reduc_half): Handle |
| V4QImode. |
| * config/i386/mmx.md (reduc_<code>_scal_v4qi): New expander. |
| (reduc_plus_scal_v4qi): Ditto. |
| |
| 2021-10-12 Paul A. Clarke <pc@us.ibm.com> |
| |
| * config/rs6000/smmintrin.h (_mm_cmpeq_epi64, _mm_cmpgt_epi64, |
| _mm_mullo_epi32, _mm_mul_epi32, _mm_packus_epi32): New. |
| * config/rs6000/nmmintrin.h: Copy from i386, tweak to suit. |
| |
| 2021-10-12 Paul A. Clarke <pc@us.ibm.com> |
| |
| * config/rs6000/smmintrin.h (_mm_cvtepi8_epi16, _mm_cvtepi8_epi32, |
| _mm_cvtepi8_epi64, _mm_cvtepi16_epi32, _mm_cvtepi16_epi64, |
| _mm_cvtepi32_epi64, _mm_cvtepu8_epi16, _mm_cvtepu8_epi32, |
| _mm_cvtepu8_epi64, _mm_cvtepu16_epi32, _mm_cvtepu16_epi64, |
| _mm_cvtepu32_epi64): New. |
| |
| 2021-10-12 Paul A. Clarke <pc@us.ibm.com> |
| |
| * config/rs6000/smmintrin.h (_mm_test_all_zeros, |
| _mm_test_all_ones, _mm_test_mix_ones_zeros): Rewrite as macro. |
| |
| 2021-10-12 Paul A. Clarke <pc@us.ibm.com> |
| |
| * config/rs6000/smmintrin.h (_mm_min_epi8, _mm_min_epu16, |
| _mm_min_epi32, _mm_min_epu32, _mm_max_epi8, _mm_max_epu16, |
| _mm_max_epi32, _mm_max_epu32): New. |
| |
| 2021-10-11 Jan Hubicka <hubicka@ucw.cz> |
| |
| * ipa-modref-tree.h (struct modref_access_node): Revert |
| accidental change. |
| (struct modref_ref_node): Likewise. |
| |
| 2021-10-11 Jan Hubicka <hubicka@ucw.cz> |
| |
| * ipa-modref-tree.h (modref_tree::global_access_p): New member |
| function. |
| * ipa-modref.c: |
| (implicint_const_eaf_flags,implicit_pure_eaf_flags, |
| ignore_stores_eaf_flags): Move to ipa-modref.h |
| (remove_useless_eaf_flags): Remove early exit on NOCLOBBER. |
| (modref_summary::global_memory_read_p): New member function. |
| (modref_summary::global_memory_written_p): New member function. |
| * ipa-modref.h (modref_summary::global_memory_read_p, |
| modref_summary::global_memory_written_p): Declare. |
| (implicint_const_eaf_flags,implicit_pure_eaf_flags, |
| ignore_stores_eaf_flags): move here. |
| * tree-ssa-structalias.c: Include ipa-modref-tree.h, ipa-modref.h |
| and attr-fnspec.h. |
| (handle_rhs_call): Rewrite. |
| (handle_call_arg): New function. |
| (determine_global_memory_access): New function. |
| (handle_const_call): Remove |
| (handle_pure_call): Remove |
| (find_func_aliases_for_call): Update use of handle_rhs_call. |
| (compute_points_to_sets): Handle global memory acccesses |
| selectively |
| |
| 2021-10-11 Diane Meirowitz <diane.meirowitz@oracle.com> |
| |
| * doc/invoke.texi: Add link to UndefinedBehaviorSanitizer |
| documentation, mention UBSAN_OPTIONS, similar to what is done |
| for AddressSanitizer. |
| |
| 2021-10-11 Richard Biener <rguenther@suse.de> |
| |
| PR middle-end/102683 |
| * internal-fn.c (expand_DEFERRED_INIT): Check for mode |
| availability before building an integer type for storage |
| purposes. |
| |
| 2021-10-11 Richard Biener <rguenther@suse.de> |
| |
| PR middle-end/101480 |
| * gimple.c (gimple_call_fnspec): Do not mark operator new/delete |
| as const. |
| |
| 2021-10-11 Martin Liska <mliska@suse.cz> |
| |
| * common.opt: Remove Init(2) for some options. |
| * toplev.c (process_options): Do not use AUTODETECT_VALUE, but |
| use rather OPTION_SET_P. |
| |
| 2021-10-11 Martin Liska <mliska@suse.cz> |
| |
| * common.opt: Remove usage of IRA_REGION_AUTODETECT. |
| * flag-types.h (enum ira_region): Likewise. |
| * toplev.c (process_options): Use OPTION_SET_P instead of |
| IRA_REGION_AUTODETECT. |
| |
| 2021-10-11 Jakub Jelinek <jakub@redhat.com> |
| |
| * omp-low.c (omp_runtime_api_call): Handle omp_get_max_teams, |
| omp_[sg]et_teams_thread_limit and omp_set_num_teams. |
| |
| 2021-10-11 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com> |
| |
| * config/s390/s390-protos.h (s390_rawmemchr): Add prototype. |
| * config/s390/s390.c (s390_rawmemchr): New function. |
| * config/s390/s390.md (rawmemchr<SINT:mode>): New expander. |
| * config/s390/vector.md (@vec_vfees<mode>): Basically a copy of |
| the pattern vfees<mode> from vx-builtins.md. |
| * config/s390/vx-builtins.md (*vfees<mode>): Remove. |
| |
| 2021-10-11 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com> |
| |
| * builtins.c (get_memory_rtx): Change to external linkage. |
| * builtins.h (get_memory_rtx): Add function prototype. |
| * doc/md.texi (rawmemchr<mode>): Document. |
| * internal-fn.c (expand_RAWMEMCHR): Define. |
| * internal-fn.def (RAWMEMCHR): Add. |
| * optabs.def (rawmemchr_optab): Add. |
| * tree-loop-distribution.c (find_single_drs): Change return code |
| behaviour by also returning true if no single store was found |
| but a single load. |
| (loop_distribution::classify_partition): Respect the new return |
| code behaviour of function find_single_drs. |
| (loop_distribution::execute): Call new function |
| transform_reduction_loop in order to replace rawmemchr or strlen |
| like loops by calls into builtins. |
| (generate_reduction_builtin_1): New function. |
| (generate_rawmemchr_builtin): New function. |
| (generate_strlen_builtin_1): New function. |
| (generate_strlen_builtin): New function. |
| (generate_strlen_builtin_using_rawmemchr): New function. |
| (reduction_var_overflows_first): New function. |
| (determine_reduction_stmt_1): New function. |
| (determine_reduction_stmt): New function. |
| (loop_distribution::transform_reduction_loop): New function. |
| |
| 2021-10-11 Martin Liska <mliska@suse.cz> |
| |
| * tree.c (cl_option_hasher::hash): Use cl_optimization_hash |
| and remove legacy hashing code. |
| |
| 2021-10-11 Kito Cheng <kito.cheng@sifive.com> |
| |
| PR target/100316 |
| * builtins.c (maybe_emit_call_builtin___clear_cache): Allow |
| CONST_INT for BEGIN and END, and use gcc_assert rather than |
| error. |
| |
| 2021-10-10 Jakub Jelinek <jakub@redhat.com> |
| |
| PR debug/102441 |
| * var-tracking.c (add_stores): For cselib_sp_derived_value_p values |
| use MO_VAL_SET if loc is not sp. |
| |
| 2021-10-10 Andrew Pinski <apinski@marvell.com> |
| |
| PR tree-optimization/102622 |
| * match.pd: Swap the order of a?pow2cst:0 and a?-1:0 transformations. |
| Swap the order of a?0:pow2cst and a?0:-1 transformations. |
| |
| 2021-10-09 liuhongt <hongtao.liu@intel.com> |
| |
| PR target/102639 |
| * config/i386/i386-expand.c (ix86_valid_mask_cmp_mode): Handle |
| HFmode. |
| (ix86_use_mask_cmp_p): Ditto. |
| (ix86_expand_sse_movcc): Ditto. |
| * config/i386/i386.md (setcc_hf_mask): New define_insn. |
| (movhf_mask): Ditto. |
| (UNSPEC_MOVCC_MASK): New unspec. |
| * config/i386/sse.md (UNSPEC_PCMP): Move to i386.md. |
| |
| 2021-10-08 Vladimir N. Makarov <vmakarov@redhat.com> |
| |
| PR rtl-optimization/102627 |
| * lra-constraints.c (split_reg): Use at least natural mode of hard reg. |
| |
| 2021-10-08 Aldy Hernandez <aldyh@redhat.com> |
| |
| * gimple-range-cache.cc (non_null_ref::non_null_deref_p): Grow |
| bitmap if needed. |
| |
| 2021-10-08 Aldy Hernandez <aldyh@redhat.com> |
| |
| * value-range.cc (irange::debug): New. |
| * value-range.h (irange::debug): New. |
| |
| 2021-10-08 Richard Sandiford <richard.sandiford@arm.com> |
| |
| PR tree-optimization/102385 |
| * predict.h (change_edge_frequency): Declare. |
| * predict.c (change_edge_frequency): New function. |
| * tree-ssa-loop-manip.h (tree_transform_and_unroll_loop): Remove |
| edge argument. |
| (tree_unroll_loop): Likewise. |
| * gimple-loop-jam.c (tree_loop_unroll_and_jam): Update accordingly. |
| * tree-predcom.c (pcom_worker::tree_predictive_commoning_loop): |
| Likewise. |
| * tree-ssa-loop-prefetch.c (loop_prefetch_arrays): Likewise. |
| * tree-ssa-loop-manip.c (tree_unroll_loop): Likewise. |
| (tree_transform_and_unroll_loop): Likewise. Use single_dom_exit |
| to retrieve the exit edges. Make all the old profile update code |
| conditional on !single_loop_p -- the case it was written for -- |
| and use a different approach for the single-loop case. |
| |
| 2021-10-08 Martin Liska <mliska@suse.cz> |
| |
| * config/alpha/alpha.c (alpha_option_override): Use new macro |
| OPTION_SET_P. |
| * config/arc/arc.c (arc_override_options): Likewise. |
| * config/arm/arm.c (arm_option_override): Likewise. |
| * config/bfin/bfin.c (bfin_load_pic_reg): Likewise. |
| * config/c6x/c6x.c (c6x_option_override): Likewise. |
| * config/csky/csky.c: Likewise. |
| * config/darwin.c (darwin_override_options): Likewise. |
| * config/frv/frv.c (frv_option_override): Likewise. |
| * config/i386/djgpp.h: Likewise. |
| * config/i386/i386.c (ix86_stack_protect_guard): Likewise. |
| (ix86_max_noce_ifcvt_seq_cost): Likewise. |
| * config/ia64/ia64.c (ia64_option_override): Likewise. |
| (ia64_override_options_after_change): Likewise. |
| * config/m32c/m32c.c (m32c_option_override): Likewise. |
| * config/m32r/m32r.c (m32r_init): Likewise. |
| * config/m68k/m68k.c (m68k_option_override): Likewise. |
| * config/microblaze/microblaze.c (microblaze_option_override): Likewise. |
| * config/mips/mips.c (mips_option_override): Likewise. |
| * config/nios2/nios2.c (nios2_option_override): Likewise. |
| * config/nvptx/nvptx.c (nvptx_option_override): Likewise. |
| * config/pa/pa.c (pa_option_override): Likewise. |
| * config/riscv/riscv.c (riscv_option_override): Likewise. |
| * config/rs6000/aix71.h: Likewise. |
| * config/rs6000/aix72.h: Likewise. |
| * config/rs6000/aix73.h: Likewise. |
| * config/rs6000/rs6000.c (darwin_rs6000_override_options): Likewise. |
| (rs6000_override_options_after_change): Likewise. |
| (rs6000_linux64_override_options): Likewise. |
| (glibc_supports_ieee_128bit): Likewise. |
| (rs6000_option_override_internal): Likewise. |
| (rs6000_file_start): Likewise. |
| (rs6000_darwin_file_start): Likewise. |
| * config/rs6000/rtems.h: Likewise. |
| * config/rs6000/sysv4.h: Likewise. |
| * config/rs6000/vxworks.h (SUB3TARGET_OVERRIDE_OPTIONS): Likewise. |
| * config/s390/s390.c (s390_option_override): Likewise. |
| * config/sh/linux.h: Likewise. |
| * config/sh/netbsd-elf.h (while): Likewise. |
| * config/sh/sh.c (sh_option_override): Likewise. |
| * config/sol2.c (solaris_override_options): Likewise. |
| * config/sparc/sparc.c (sparc_option_override): Likewise. |
| * config/tilegx/tilegx.c (tilegx_option_override): Likewise. |
| * config/visium/visium.c (visium_option_override): Likewise. |
| * config/vxworks.c (vxworks_override_options): Likewise. |
| * lto-opts.c (lto_write_options): Likewise. |
| * omp-expand.c (expand_omp_simd): Likewise. |
| * omp-general.c (omp_max_vf): Likewise. |
| * omp-offload.c (oacc_xform_loop): Likewise. |
| * opts.h (OPTION_SET_P): Likewise. |
| * targhooks.c (default_max_noce_ifcvt_seq_cost): Likewise. |
| * toplev.c (process_options): Likewise. |
| * tree-predcom.c: Likewise. |
| * tree-sra.c (analyze_all_variable_accesses): Likewise. |
| |
| 2021-10-08 liuhongt <hongtao.liu@intel.com> |
| |
| PR target/102464 |
| * config/i386/i386.c (ix86_optab_supported_p): |
| Return true for HFmode. |
| * match.pd: Simplify (_Float16) ceil ((double) x) to |
| __builtin_ceilf16 (a) when a is _Float16 type and |
| direct_internal_fn_supported_p. |
| |
| 2021-10-08 liuhongt <hongtao.liu@intel.com> |
| |
| PR target/102494 |
| * config/i386/i386-expand.c (emit_reduc_half): Hanlde V4HImode. |
| * config/i386/mmx.md (reduc_plus_scal_v4hi): New. |
| (reduc_<code>_scal_v4hi): New. |
| |
| 2021-10-08 liuhongt <hongtao.liu@intel.com> |
| |
| * common.opt (ftree-vectorize): Add Var(flag_tree_vectorize). |
| * doc/invoke.texi (Options That Control Optimization): Update |
| documents. |
| * opts.c (default_options_table): Enable auto-vectorization at |
| O2 with very-cheap cost model. |
| (finish_options): Use cheap cost model for |
| explicit -ftree{,-loop}-vectorize. |
| |
| 2021-10-07 Indu Bhagat <indu.bhagat@oracle.com> |
| |
| * ctfc.c (ctfc_delete_container): Free hash table contents. |
| |
| 2021-10-07 Indu Bhagat <indu.bhagat@oracle.com> |
| |
| * toplev.c (process_options): Do not warn for GNU GIMPLE. |
| |
| 2021-10-07 Siddhesh Poyarekar <siddhesh@gotplt.org> |
| |
| * tree-object-size.c (addr_object_size, |
| compute_builtin_object_size): Drop PDECL and POFF arguments. |
| (addr_object_size): Adjust calls. |
| * tree-object-size.h (compute_builtin_object_size): Drop PDECL |
| and POFF arguments. |
| |
| 2021-10-07 Roger Sayle <roger@nextmovesoftware.com> |
| |
| * rtl.def (SMUL_HIGHPART, UMUL_HIGHPART): New RTX codes for |
| representing signed and unsigned high-part multiplication resp. |
| * simplify-rtx.c (simplify_binary_operation_1) [SMUL_HIGHPART, |
| UMUL_HIGHPART]: Simplify high-part multiplications by zero. |
| [SS_PLUS, US_PLUS, SS_MINUS, US_MINUS, SS_MULT, US_MULT, |
| SS_DIV, US_DIV]: Similar simplifications for saturating |
| arithmetic. |
| (simplify_const_binary_operation) [SS_PLUS, US_PLUS, SS_MINUS, |
| US_MINUS, SS_MULT, US_MULT, SMUL_HIGHPART, UMUL_HIGHPART]: |
| Implement compile-time evaluation for constant operands. |
| * dwarf2out.c (mem_loc_descriptor): Skip SMUL_HIGHPART and |
| UMUL_HIGHPART. |
| * doc/rtl.texi (smul_highpart, umul_highpart): Document RTX codes. |
| * doc/md.texi (smul@var{m}3_highpart, umul@var{m3}_highpart): |
| Mention the new smul_highpart and umul_highpart RTX codes. |
| * doc/invoke.texi: Silence @xref "compilation" warnings. |
| |
| 2021-10-07 Martin Jambor <mjambor@suse.cz> |
| |
| PR ipa/102388 |
| * ipa-prop.c (ipa_edge_args_sum_t::duplicate): Also handle the |
| case when the source reference description corresponds to a |
| referance taken in a function src->caller is inlined to. |
| |
| 2021-10-07 Jan Hubicka <hubicka@ucw.cz> |
| |
| PR ipa/102581 |
| * ipa-modref-tree.h (modref_access_node::contains_p): Handle offsets |
| better. |
| (modref_access_node::try_merge_with): Add sanity check that there |
| are no redundant entries in the list. |
| |
| 2021-10-07 Richard Biener <rguenther@suse.de> |
| |
| PR tree-optimization/102608 |
| * tree-ssa-sccvn.c (visit_stmt): Drop .DEFERRED_INIT to |
| varying. |
| |
| 2021-10-07 Martin Liska <mliska@suse.cz> |
| |
| * toplev.c (toplev::main): Make |
| save_opt_decoded_options a pointer type |
| * toplev.h: Likewise. |
| |
| 2021-10-07 Andrew Stubbs <ams@codesourcery.com> |
| |
| * config/gcn/gcn-valu.md (gather<mode>_insn_2offsets<exec>): Apply |
| HAVE_GCN_ASM_GLOBAL_LOAD_FIXED. |
| (scatter<mode>_insn_2offsets<exec_scatter>): Likewise. |
| |
| 2021-10-07 Andrew Stubbs <ams@codesourcery.com> |
| |
| * config/gcn/gcn-hsa.h (SRAMOPT): Include the whole option string. |
| Adjust for new -msram-ecc=any behaviour. |
| (ASM_SPEC): Adjust -mxnack and -msram-ecc usage. |
| * config/gcn/gcn.c (output_file_start): Implement -msram-ecc=any. |
| * config/gcn/mkoffload.c (EF_AMDGPU_XNACK): Rename to ... |
| (EF_AMDGPU_XNACK_V3): ... this. |
| (EF_AMDGPU_SRAM_ECC): Rename to ... |
| (EF_AMDGPU_SRAM_ECC_V3): ... this. |
| (EF_AMDGPU_FEATURE_XNACK_V4): New. |
| (EF_AMDGPU_FEATURE_XNACK_UNSUPPORTED_V4): New. |
| (EF_AMDGPU_FEATURE_XNACK_ANY_V4): New. |
| (EF_AMDGPU_FEATURE_XNACK_OFF_V4): New. |
| (EF_AMDGPU_FEATURE_XNACK_ON_V4): New. |
| (EF_AMDGPU_FEATURE_SRAMECC_V4): New. |
| (EF_AMDGPU_FEATURE_SRAMECC_UNSUPPORTED_V4): New. |
| (EF_AMDGPU_FEATURE_SRAMECC_ANY_V4): New. |
| (EF_AMDGPU_FEATURE_SRAMECC_OFF_V4): New. |
| (EF_AMDGPU_FEATURE_SRAMECC_ON_V4): New. |
| (SET_XNACK_ON): New. |
| (SET_XNACK_OFF): New. |
| (TEST_XNACK): New. |
| (SET_SRAM_ECC_ON): New. |
| (SET_SRAM_ECC_ANY): New. |
| (SET_SRAM_ECC_OFF): New. |
| (TEST_SRAM_ECC_ANY): New. |
| (TEST_SRAM_ECC_ON): New. |
| (main): Implement HSACOv4 and -msram-ecc=any. |
| |
| 2021-10-07 Andrew Stubbs <ams@codesourcery.com> |
| |
| * config.in: Regenerate. |
| * config/gcn/gcn-hsa.h (X_FIJI): New macro. |
| (X_900): New macro. |
| (X_906): New macro. |
| (X_908): New macro. |
| (A_FIJI): Rename to ... |
| (S_FIJI): ... this. |
| (A_900): Rename to ... |
| (S_900): ... this. |
| (A_906): Rename to ... |
| (S_906): ... this. |
| (A_908): Rename to ... |
| (S_908): ... this. |
| (SRAMOPT): New macro. |
| (ASM_SPEC): Adjust xnack option usage. |
| * config/gcn/gcn.c (output_file_start): Adjust amdgcn_target usage. |
| * configure: Regenerate. |
| * configure.ac: Detect LLVM assembler dialect. |
| |
| 2021-10-07 Richard Biener <rguenther@suse.de> |
| |
| * tree-pretty-print.c (dump_generic_node): Do not elide |
| printing '&' when dumping with -gimple. |
| |
| 2021-10-06 Andrew MacLeod <amacleod@redhat.com> |
| |
| * gimple-range-cache.cc (non_null_ref::adjust_range): Call new |
| intersect routine. |
| * gimple-range-fold.cc (adjust_pointer_diff_expr): Ditto. |
| (adjust_imagpart_expr): Ditto. |
| * value-range.cc (irange::irange_intersect): Call new routine if |
| RHS is a single pair. |
| (irange::intersect): New wide_int version. |
| * value-range.h (class irange): New prototype. |
| |
| 2021-10-06 Andrew MacLeod <amacleod@redhat.com> |
| |
| * gimple-range-edge.cc (gimple_outgoing_range::gimple_outgoing_range): |
| Add parameter to limit size when recognizing switches. |
| (gimple_outgoing_range::edge_range_p): Check size limit. |
| * gimple-range-edge.h (gimple_outgoing_range): Add size field. |
| * gimple-range-gori.cc (gori_map::calculate_gori): Ignore switches |
| that exceed the size limit. |
| (gori_compute::gori_compute): Add initializer. |
| * params.opt (evrp-switch-limit): New. |
| * doc/invoke.texi: Update docs. |
| |
| 2021-10-06 Andrew MacLeod <amacleod@redhat.com> |
| |
| * value-range.h (irange::set_varying): Use TYPE_MIN_VALUE and |
| TYPE_MAX_VALUE instead of creating new trees when possible. |
| |
| 2021-10-06 Andrew MacLeod <amacleod@redhat.com> |
| |
| * gimple-range-cache.cc (non_null_ref::adjust_range): Check for |
| zero and non-zero more efficently. |
| |
| 2021-10-06 Richard Biener <rguenther@suse.de> |
| |
| PR c/102605 |
| * dumpfile.h (TDF_GIMPLE_VAL): New. |
| (dump_flag): Re-order and adjust TDF_* flags. Make |
| the enum uint32_t. Use std::underlying_type in the |
| operator overloads. |
| (optgroup_flag): Likewise for the operator overloads. |
| * tree-pretty-print.c (dump_generic_node): Wrap ADDR_EXPR |
| in _Literal if TDF_GIMPLE_VAL. |
| * gimple-pretty-print.c (dump_gimple_assign): Add |
| TDF_GIMPLE_VAL to flags when dumping operands where only |
| is_gimple_val are allowed. |
| (dump_gimple_cond): Likewise. |
| |
| 2021-10-06 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org> |
| |
| * gimple-isel.cc (gimple_expand_vec_cond_expr): Remove redundant if |
| condition. |
| |
| 2021-10-05 qing zhao <qing.zhao@oracle.com> |
| |
| PR middle-end/102359 |
| * gimplify.c (gimplify_decl_expr): Not add initialization for an |
| auto variable when it has been initialized by frontend. |
| |
| 2021-10-05 Aldy Hernandez <aldyh@redhat.com> |
| |
| * tree-ssa-threadupdate.c (jt_path_registry::cancel_invalid_paths): |
| Loosen restrictions |
| |
| 2021-10-05 Jan-Benedict Glaw <jbglaw@lug-owl.de> |
| |
| * common/config/avr/avr-common.c (avr_handle_option): Mark |
| argument as ATTRIBUTE_UNUSED. |
| |
| 2021-10-05 Jan-Benedict Glaw <jbglaw@lug-owl.de> |
| |
| * config/lm32/uclinux-elf.h (LINK_GCC_C_SEQUENCE_SPEC): |
| Undefine before redefinition. |
| |
| 2021-10-05 Richard Biener <rguenther@suse.de> |
| |
| * toplev.c (no_backend): Remove global var. |
| (process_options): Pass in no_backend, move post_options |
| langhook call to toplev::main. |
| (do_compile): Pass in no_backend, move process_options call |
| to toplev::main. |
| (toplev::run_self_tests): Check no_backend at the caller. |
| (toplev::main): Call post_options and process_options |
| split out from do_compile, do self-tests only if |
| no_backend is initialized. |
| |
| 2021-10-05 Richard Biener <rguenther@suse.de> |
| |
| * tree-cfg.c (dump_function_to_file): Dump the UID of the |
| function as part of the name when requested. |
| * tree-pretty-print.c (dump_function_name): Dump the UID when |
| requested and the langhook produced the actual name. |
| |
| 2021-10-05 Richard Biener <rguenther@suse.de> |
| |
| PR middle-end/102587 |
| PR middle-end/102285 |
| * internal-fn.c (expand_DEFERRED_INIT): Fall back to |
| zero-initialization as last resort, use the constant |
| size as given by the DEFERRED_INIT argument to build |
| the initializer. |
| |
| 2021-10-04 Marek Polacek <polacek@redhat.com> |
| |
| PR c++/97573 |
| * doc/invoke.texi: Document -Warray-compare. |
| |
| 2021-10-04 Richard Biener <rguenther@suse.de> |
| |
| * gimplify.c (is_var_need_auto_init): DECL_HARD_REGISTER |
| variables are not to be initialized. |
| |
| 2021-10-04 Richard Biener <rguenther@suse.de> |
| |
| * expr.h (non_mem_decl_p): Declare. |
| (mem_ref_refers_to_non_mem_p): Likewise. |
| * expr.c (non_mem_decl_p): Export. |
| (mem_ref_refers_to_non_mem_p): Likewise. |
| * internal-fn.c (expand_DEFERRED_INIT): Do not expand the LHS |
| but check the base with mem_ref_refers_to_non_mem_p |
| and non_mem_decl_p. |
| |
| 2021-10-04 Richard Biener <rguenther@suse.de> |
| |
| PR tree-optimization/102570 |
| * tree-ssa-sccvn.h (vn_reference_op_struct): Document |
| we are using clique for the internal function code. |
| * tree-ssa-sccvn.c (vn_reference_op_eq): Compare the |
| internal function code. |
| (print_vn_reference_ops): Print the internal function code. |
| (vn_reference_op_compute_hash): Hash it. |
| (copy_reference_ops_from_call): Record it. |
| (visit_stmt): Remove the restriction around internal function |
| calls. |
| (fully_constant_vn_reference_p): Use fold_const_call and handle |
| internal functions. |
| (vn_reference_eq): Compare call return types. |
| * tree-ssa-pre.c (create_expression_by_pieces): Handle |
| generating calls to internal functions. |
| (compute_avail): Remove the restriction around internal function |
| calls. |
| |
| 2021-10-04 Aldy Hernandez <aldyh@redhat.com> |
| |
| PR tree-optimization/102560 |
| * gimple-ssa-warn-alloca.c (alloca_call_type): Remove static |
| marker for invalid_range. |
| |
| 2021-10-04 Richard Biener <rguenther@suse.de> |
| |
| PR middle-end/102587 |
| * internal-fn.c (expand_DEFERRED_INIT): Guard register |
| initialization path an avoid initializing VLA registers |
| with it. |
| |
| 2021-10-04 Eric Botcazou <ebotcazou@adacore.com> |
| |
| * config/rs6000/vxworks.h (TARGET_INIT_LIBFUNCS): Delete. |
| |
| 2021-10-03 Martin Liska <mliska@suse.cz> |
| |
| * toplev.c (toplev::main): Check opt_index if it is a part |
| of cl_options. |
| |
| 2021-10-02 Aldy Hernandez <aldyh@redhat.com> |
| |
| PR tree-optimization/102563 |
| * range-op.cc (operator_lshift::op1_range): Do not clobber |
| range. |
| |
| 2021-10-02 Martin Liska <mliska@suse.cz> |
| |
| * toplev.c (toplev::main): save_decoded_options[0] is program |
| name and so it should be skipped. |
| |
| 2021-10-01 Aldy Hernandez <aldyh@redhat.com> |
| |
| PR tree-optimization/102546 |
| * range-op.cc (operator_lshift::op1_range): Teach range-ops that |
| X << Y is non-zero implies X is also non-zero. |
| |
| 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com> |
| |
| * config/aarch64/aarch64-cores.def (AARCH64_CORE): New |
| Cortex-X2 core. |
| * config/aarch64/aarch64-tune.md: Regenerate. |
| * doc/invoke.texi: Update docs. |
| |
| 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com> |
| |
| * config/aarch64/aarch64-cores.def (AARCH64_CORE): New |
| Cortex-A710 core. |
| * config/aarch64/aarch64-tune.md: Regenerate. |
| * doc/invoke.texi: Update docs. |
| |
| 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com> |
| |
| * config/aarch64/aarch64-cores.def (AARCH64_CORE): New |
| Cortex-A510 core. |
| * config/aarch64/aarch64-tune.md: Regenerate. |
| * doc/invoke.texi: Update docs. |
| |
| 2021-10-01 Martin Sebor <msebor@redhat.com> |
| |
| PR c/102103 |
| * doc/invoke.texi (-Waddress): Update. |
| * gengtype.c (write_types): Avoid -Waddress. |
| * poly-int.h (POLY_SET_COEFF): Avoid using null. |
| |
| 2021-10-01 John David Anglin <danglin@gcc.gnu.org> |
| |
| PR debug/102373 |
| * config/pa/pa.c (pa_option_override): Default to dwarf version 4 |
| on hppa64-hpux. |
| |
| 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com> |
| |
| * config/aarch64/aarch64.h (AARCH64_FL_V9): Update value. |
| |
| 2021-10-01 Aldy Hernandez <aldyh@redhat.com> |
| |
| * gimple-range-path.cc (path_range_query::compute_ranges): Use |
| get_path_oracle. |
| * gimple-range-path.h (class path_range_query): Remove shadowed |
| m_oracle field. |
| (path_range_query::get_path_oracle): New. |
| |
| 2021-10-01 Jakub Jelinek <jakub@redhat.com> |
| Richard Biener <rguenther@suse.de> |
| |
| PR sanitizer/102515 |
| * doc/invoke.texi (-fsanitize=integer-divide-by-zero): Remove |
| INT_MIN / -1 division detection from here ... |
| (-fsanitize=signed-integer-overflow): ... and add it here. |
| |
| 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com> |
| |
| * config/aarch64/aarch64-arches.def (AARCH64_ARCH): Added |
| armv9-a. |
| * config/aarch64/aarch64.h (AARCH64_FL_V9): New. |
| (AARCH64_FL_FOR_ARCH9): New flags for Armv9-A. |
| (AARCH64_ISA_V9): New ISA flag. |
| * doc/invoke.texi: Update docs. |
| |
| 2021-10-01 Martin Liska <mliska@suse.cz> |
| |
| * toplev.c (toplev::main): Save decoded optimization options. |
| * toplev.h (save_opt_decoded_options): New. |
| * doc/extend.texi: Be more clear about optimize and target |
| attributes. |
| |
| 2021-10-01 Eric Botcazou <ebotcazou@adacore.com> |
| |
| * explow.c: Include langhooks.h. |
| (set_stack_check_libfunc): Build a proper function type. |
| |
| 2021-10-01 Eric Botcazou <ebotcazou@adacore.com> |
| |
| PR c++/64697 |
| * config/i386/i386.c (legitimate_pic_address_disp_p): For PE-COFF do |
| not return true for external weak function symbols in medium model. |
| |
| 2021-10-01 Jakub Jelinek <jakub@redhat.com> |
| |
| * tree.h (OMP_CLAUSE_ORDER_REPRODUCIBLE): Define. |
| * tree-pretty-print.c (dump_omp_clause) <case OMP_CLAUSE_ORDER>: Print |
| reproducible: for OMP_CLAUSE_ORDER_REPRODUCIBLE. |
| * omp-general.c (omp_extract_for_data): If OMP_CLAUSE_ORDER is seen |
| without OMP_CLAUSE_ORDER_UNCONSTRAINED, overwrite sched_kind to |
| OMP_CLAUSE_SCHEDULE_STATIC. |
| |
| 2021-10-01 Richard Biener <rguenther@suse.de> |
| |
| PR middle-end/102518 |
| * tree-inline.c (setup_one_parameter): Avoid substituting |
| an invariant into contexts where a GIMPLE register is not valid. |
| |
| 2021-09-30 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com> |
| |
| * config/arm/arm-cpus.in: Add Cortex-R52+ CPU. |
| * config/arm/arm-tables.opt: Regenerate. |
| * config/arm/arm-tune.md: Regenerate. |
| * doc/invoke.texi: Update docs. |
| |
| 2021-09-30 Uroš Bizjak <ubizjak@gmail.com> |
| |
| PR target/89954 |
| * config/i386/i386.md |
| (sign_extend:WIDE (any_logic:NARROW (memory, immediate)) splitters): |
| New splitters. |
| |
| 2021-09-30 Tobias Burnus <tobias@codesourcery.com> |
| |
| * omp-low.c (omp_runtime_api_call): Add omp_aligned_{,c}alloc and |
| omp_{c,re}alloc, fix omp_alloc/omp_free. |
| |
| 2021-09-30 Martin Liska <mliska@suse.cz> |
| |
| * defaults.h (ASM_OUTPUT_ASCII): Do not hide global variable |
| asm_out_file and stream directly to MYFILE. |
| |
| 2021-09-30 Richard Biener <rguenther@suse.de> |
| |
| * tree-vect-data-refs.c (vect_update_misalignment_for_peel): |
| Restore and fix condition under which we apply npeel to |
| the DRs misalignment value. |
| |
| 2021-09-30 Richard Biener <rguenther@suse.de> |
| |
| * tree-vect-data-refs.c (vect_update_misalignment_for_peel): |
| Fix npeel check for variable amount of peeling. |
| |
| 2021-09-30 Aldy Hernandez <aldyh@redhat.com> |
| |
| * lto-wrapper.c (run_gcc): Plug snprintf overflow. |
| |
| 2021-09-30 Aldy Hernandez <aldyh@redhat.com> |
| |
| * gimple-range.cc (gimple_ranger::debug): New. |
| * gimple-range.h (class gimple_ranger): Add debug. |
| |
| 2021-09-30 Aldy Hernandez <aldyh@redhat.com> |
| |
| PR middle-end/102519 |
| * tree-vrp.c (hybrid_threader::~hybrid_threader): Free m_query. |
| |
| 2021-09-29 Indu Bhagat <indu.bhagat@oracle.com> |
| |
| PR debug/102507 |
| * btfout.c (GTY): Add GTY (()) albeit for cosmetic only purpose. |
| (btf_finalize): Empty the hash_map btf_var_ids. |
| |
| 2021-09-29 Aldy Hernandez <aldyh@redhat.com> |
| |
| * tree-vrp.c (thread_through_all_blocks): Return bool. |
| (execute_vrp_threader): Return TODO_* flags. |
| (pass_data_vrp_threader): Set todo_flags_finish to 0. |
| |
| 2021-09-29 Aldy Hernandez <aldyh@redhat.com> |
| |
| * timevar.def (TV_TREE_VRP_THREADER): New. |
| * tree-vrp.c: Use TV_TREE_VRP_THREADER for VRP threader pass. |
| |
| 2021-09-29 David Faust <david.faust@oracle.com> |
| |
| * config.gcc (bpf-*-*): Do not overwrite extra_headers. |
| |
| 2021-09-29 Jonathan Wright <jonathan.wright@arm.com> |
| |
| * config/aarch64/aarch64-builtins.c (TYPES_BINOP_PPU): Define |
| new type qualifier enum. |
| (TYPES_TERNOP_SSSU): Likewise. |
| (TYPES_TERNOP_PPPU): Likewise. |
| * config/aarch64/aarch64-simd-builtins.def: Define PPU, SSU, |
| PPPU and SSSU builtin generator macros for qtbl1 and qtbx1 |
| Neon builtins. |
| * config/aarch64/arm_neon.h (vqtbl1_p8): Use type-qualified |
| builtin and remove casts. |
| (vqtbl1_s8): Likewise. |
| (vqtbl1q_p8): Likewise. |
| (vqtbl1q_s8): Likewise. |
| (vqtbx1_s8): Likewise. |
| (vqtbx1_p8): Likewise. |
| (vqtbx1q_s8): Likewise. |
| (vqtbx1q_p8): Likewise. |
| (vtbl1_p8): Likewise. |
| (vtbl2_p8): Likewise. |
| (vtbx2_p8): Likewise. |
| |
| 2021-09-29 Richard Biener <rguenther@suse.de> |
| |
| * tree-vect-data-refs.c (vect_dr_misalign_for_aligned_access): |
| New helper. |
| (vect_update_misalignment_for_peel): Use it to update |
| misaligned to the value necessary for an aligned access. |
| (vect_get_peeling_costs_all_drs): Likewise. |
| (vect_enhance_data_refs_alignment): Likewise. |
| |
| 2021-09-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com> |
| |
| * config/aarch64/aarch64.c (aarch64_expand_cpymem): Count number of |
| emitted operations and adjust heuristic for code size. |
| |
| 2021-09-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com> |
| |
| * config/aarch64/aarch64.c (aarch64_expand_setmem): Count number of |
| emitted operations and adjust heuristic for code size. |
| |
| 2021-09-29 Jakub Jelinek <jakub@redhat.com> |
| |
| PR middle-end/102504 |
| * gimplify.c (gimplify_scan_omp_clauses): Use omp_check_private even |
| in OMP_SCOPE clauses, not just on worksharing construct clauses. |
| |
| 2021-09-28 Geng Qi <gengqi@linux.alibaba.com> |
| |
| * config/riscv/riscv.md (mulv<mode>4): Call gen_smul<mode>3_highpart. |
| (<u>mulditi3): Call <su>muldi3_highpart. |
| (<u>muldi3_highpart): Rename to <su>muldi3_highpart. |
| (<u>mulsidi3): Call <su>mulsi3_highpart. |
| (<u>mulsi3_highpart): Rename to <su>mulsi3_highpart. |
| |
| 2021-09-28 Iain Sandoe <iain@sandoe.co.uk> |
| |
| * config/darwin.h (DSYMUTIL_SPEC): Recognize D sources. |
| |
| 2021-09-28 Iain Sandoe <iain@sandoe.co.uk> |
| |
| * config/rs6000/darwin.h (FIXED_R13): Add for PPC64. |
| (FIRST_SAVED_GP_REGNO): Save from R13 even when it is one |
| of the fixed regs. |
| |
| 2021-09-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com> |
| |
| * config/aarch64/aarch64.h (AARCH64_FL_LS64): Define |
| (AARCH64_FL_V8_7): Likewise. |
| (AARCH64_FL_FOR_ARCH8_7): Likewise. |
| * config/aarch64/aarch64-arches.def (armv8.7-a): Define. |
| * config/aarch64/aarch64-option-extensions.def (ls64): Define. |
| * doc/invoke.texi: Document the above. |
| |
| 2021-09-28 Aldy Hernandez <aldyh@redhat.com> |
| |
| * dbgcnt.c (dbg_cnt_counter): New. |
| * dbgcnt.h (dbg_cnt_counter): New. |
| * dumpfile.c (dump_options): Add entry for TDF_THREADING. |
| * dumpfile.h (enum dump_flag): Add TDF_THREADING. |
| * gimple-range-path.cc (DEBUG_SOLVER): Use TDF_THREADING. |
| * tree-ssa-threadupdate.c (dump_jump_thread_path): Dump out |
| debug counter. |
| |
| 2021-09-28 Aldy Hernandez <aldyh@redhat.com> |
| |
| * cfgcleanup.c (pass_jump::execute): Check |
| flag_expensive_optimizations. |
| (pass_jump_after_combine::gate): Same. |
| * doc/invoke.texi (-fthread-jumps): Enable for -O1. |
| * opts.c (default_options_table): Enable -fthread-jumps at -O1. |
| * tree-ssa-threadupdate.c |
| (fwd_jt_path_registry::remove_jump_threads_including): Bail unless |
| flag_thread_jumps. |
| |
| 2021-09-28 Ilya Leoshkevich <iii@linux.ibm.com> |
| |
| * tree-ssa-reassoc.c (biased_names): New global. |
| (propagate_bias_p): New function. |
| (loop_carried_phi): Remove. |
| (propagate_rank): Propagate bias along single uses. |
| (get_rank): Update biased_names when needed. |
| |
| 2021-09-28 Ilya Leoshkevich <iii@linux.ibm.com> |
| |
| * passes.def (pass_reassoc): Rename parameter to early_p. |
| * tree-ssa-reassoc.c (reassoc_bias_loop_carried_phi_ranks_p): |
| New variable. |
| (phi_rank): Don't bias loop-carried phi ranks |
| before vectorization pass. |
| (execute_reassoc): Add bias_loop_carried_phi_ranks_p parameter. |
| (pass_reassoc::pass_reassoc): Add bias_loop_carried_phi_ranks_p |
| initializer. |
| (pass_reassoc::set_param): Set bias_loop_carried_phi_ranks_p |
| value. |
| (pass_reassoc::execute): Pass bias_loop_carried_phi_ranks_p to |
| execute_reassoc. |
| (pass_reassoc::bias_loop_carried_phi_ranks_p): New member. |
| |
| 2021-09-28 Jakub Jelinek <jakub@redhat.com> |
| |
| PR target/102498 |
| * config/i386/i386.c (standard_80387_constant_p): Don't recognize |
| special 80387 instruction XFmode constants if flag_rounding_math. |
| |
| 2021-09-28 Richard Biener <rguenther@suse.de> |
| |
| PR tree-optimization/100112 |
| * tree-ssa-sccvn.c (visit_reference_op_load): Record the |
| referece into the hashtable twice in case last_vuse is |
| different from the original vuse on the stmt. |
| |
| 2021-09-28 Jakub Jelinek <jakub@redhat.com> |
| |
| PR middle-end/102492 |
| * gimplify.c (gimplify_adjust_omp_clauses_1): Don't call the |
| omp_finish_clause langhook on implicitly added OMP_CLAUSE_PRIVATE |
| clauses on SIMD constructs. |
| |
| 2021-09-28 Aldy Hernandez <aldyh@redhat.com> |
| |
| PR tree-optimization/102511 |
| * gimple-range-path.cc (path_range_query::range_on_path_entry): |
| Return VARYING when nothing found. |
| |
| 2021-09-28 Hongyu Wang <hongyu.wang@intel.com> |
| |
| PR target/102230 |
| * config/i386/i386.h (VALID_AVX512FP16_REG_MODE): Add |
| V2HF mode check. |
| (VALID_SSE2_REG_VHF_MODE): Add V4HFmode and V2HFmode. |
| (VALID_MMX_REG_MODE): Add V4HFmode. |
| (SSE_REG_MODE_P): Replace VALID_AVX512FP16_REG_MODE with |
| vector mode condition. |
| * config/i386/i386.c (classify_argument): Parse V4HF/V2HF |
| via sse regs. |
| (function_arg_32): Add V4HFmode. |
| (function_arg_advance_32): Likewise. |
| * config/i386/i386.md (mode): Add V4HF/V2HF. |
| (MODE_SIZE): Likewise. |
| * config/i386/mmx.md (MMXMODE): Add V4HF mode. |
| (V_32): Add V2HF mode. |
| (VHF_32_64): New mode iterator. |
| (*mov<mode>_internal): Adjust sse alternatives to support |
| V4HF mode move. |
| (*mov<mode>_internal): Adjust sse alternatives to support |
| V2HF mode move. |
| (<insn><mode>3): New define_insn for add/sub/mul/div. |
| |
| 2021-09-28 Aldy Hernandez <aldyh@redhat.com> |
| |
| * tree-ssa-threadbackward.c (pass_thread_jumps::gate): Check |
| flag_thread_jumps. |
| (pass_early_thread_jumps::gate): Same. |
| * tree-ssa-threadedge.c (jump_threader::thread_outgoing_edges): |
| Return if !flag_thread_jumps. |
| * tree-ssa-threadupdate.c |
| (jt_path_registry::register_jump_thread): Assert that |
| flag_thread_jumps is true. |
| |
| 2021-09-28 liuhongt <hongtao.liu@intel.com> |
| |
| * simplify-rtx.c |
| (simplify_context::simplify_binary_operation_1): Relax |
| condition of simplifying (vec_concat:M (vec_select op0 |
| index0)(vec_select op1 index1)) to allow different modes |
| between op0 and M, but have same inner mode. |
| |
| 2021-09-28 liuhongt <hongtao.liu@intel.com> |
| |
| * config/i386/i386-expand.c (emit_reduc_half): Handle |
| V8HF/V16HF/V32HFmode. |
| * config/i386/sse.md (REDUC_SSE_PLUS_MODE): Add V8HF. |
| (REDUC_SSE_SMINMAX_MODE): Ditto. |
| (REDUC_PLUS_MODE): Add V16HF and V32HF. |
| (REDUC_SMINMAX_MODE): Ditto. |
| |
| 2021-09-27 Aldy Hernandez <aldyh@redhat.com> |
| |
| * gimple-range-path.cc |
| (path_range_query::precompute_ranges_in_block): Rename to... |
| (path_range_query::compute_ranges_in_block): ...this. |
| (path_range_query::precompute_ranges): Rename to... |
| (path_range_query::compute_ranges): ...this. |
| (path_range_query::precompute_relations): Rename to... |
| (path_range_query::compute_relations): ...this. |
| (path_range_query::precompute_phi_relations): Rename to... |
| (path_range_query::compute_phi_relations): ...this. |
| * gimple-range-path.h: Rename precompute* to compute*. |
| * tree-ssa-threadbackward.c |
| (back_threader::find_taken_edge_switch): Same. |
| (back_threader::find_taken_edge_cond): Same. |
| * tree-ssa-threadedge.c |
| (hybrid_jt_simplifier::compute_ranges_from_state): Same. |
| (hybrid_jt_state::register_equivs_stmt): Inline... |
| * tree-ssa-threadedge.h: ...here. |
| |
| 2021-09-27 Aldy Hernandez <aldyh@redhat.com> |
| |
| * tree-vrp.c (lhs_of_dominating_assert): Remove. |
| (class vrp_jt_state): Remove. |
| (class vrp_jt_simplifier): Remove. |
| (vrp_jt_simplifier::simplify): Remove. |
| (class vrp_jump_threader): Remove. |
| (vrp_jump_threader::vrp_jump_threader): Remove. |
| (vrp_jump_threader::~vrp_jump_threader): Remove. |
| (vrp_jump_threader::before_dom_children): Remove. |
| (vrp_jump_threader::after_dom_children): Remove. |
| |
| 2021-09-27 Aldy Hernandez <aldyh@redhat.com> |
| |
| * passes.def (pass_vrp_threader): New. |
| * tree-pass.h (make_pass_vrp_threader): Add make_pass_vrp_threader. |
| * tree-ssa-threadedge.c (hybrid_jt_state::register_equivs_stmt): New. |
| (hybrid_jt_simplifier::hybrid_jt_simplifier): New. |
| (hybrid_jt_simplifier::simplify): New. |
| (hybrid_jt_simplifier::compute_ranges_from_state): New. |
| * tree-ssa-threadedge.h (class hybrid_jt_state): New. |
| (class hybrid_jt_simplifier): New. |
| * tree-vrp.c (execute_vrp): Remove ASSERT_EXPR based jump |
| threader. |
| (class hybrid_threader): New. |
| (hybrid_threader::hybrid_threader): New. |
| (hybrid_threader::~hybrid_threader): New. |
| (hybrid_threader::before_dom_children): New. |
| (hybrid_threader::after_dom_children): New. |
| (execute_vrp_threader): New. |
| (class pass_vrp_threader): New. |
| (make_pass_vrp_threader): New. |
| |
| 2021-09-27 Martin Liska <mliska@suse.cz> |
| |
| * output.h (enum section_flag): New. |
| (SECTION_FORGET): Remove. |
| (SECTION_ENTSIZE): Make it (1UL << 8) - 1. |
| (SECTION_STYLE_MASK): Define it based on other enum |
| values. |
| * varasm.c (switch_to_section): Remove unused handling of |
| SECTION_FORGET. |
| |
| 2021-09-27 Martin Liska <mliska@suse.cz> |
| |
| * common.opt: Add new variable flag_default_complex_method. |
| * opts.c (finish_options): Handle flags related to |
| x_flag_complex_method. |
| * toplev.c (process_options): Remove option handling related |
| to flag_complex_method. |
| |
| 2021-09-27 Richard Biener <rguenther@suse.de> |
| |
| PR middle-end/102450 |
| * gimple-fold.c (gimple_fold_builtin_memory_op): Avoid using |
| type_for_size, instead use int_mode_for_size. |
| |
| 2021-09-27 Andrew Pinski <apinski@marvell.com> |
| |
| PR c/94726 |
| * gimplify.c (gimplify_save_expr): Return early |
| if the type of val is error_mark_node. |
| |
| 2021-09-27 Aldy Hernandez <aldyh@redhat.com> |
| |
| * tree-ssanames.c (ssa_name_has_boolean_range): Use |
| get_range_query. |
| |
| 2021-09-27 Aldy Hernandez <aldyh@redhat.com> |
| |
| * gimple-ssa-evrp-analyze.h (class evrp_range_analyzer): Remove |
| vrp_visit_cond_stmt. |
| * tree-ssa-dom.c (cprop_operand): Convert to range_query API. |
| (cprop_into_stmt): Same. |
| (dom_opt_dom_walker::optimize_stmt): Same. |
| |
| 2021-09-27 Richard Biener <rguenther@suse.de> |
| |
| PR tree-optimization/97351 |
| PR tree-optimization/97352 |
| PR tree-optimization/82426 |
| * tree-vectorizer.h (dr_misalignment): Add vector type |
| argument. |
| (aligned_access_p): Likewise. |
| (known_alignment_for_access_p): Likewise. |
| (vect_supportable_dr_alignment): Likewise. |
| (vect_known_alignment_in_bytes): Likewise. Refactor. |
| (DR_MISALIGNMENT): Remove. |
| (vect_update_shared_vectype): Likewise. |
| * tree-vect-data-refs.c (dr_misalignment): Refactor, handle |
| a vector type with larger alignment requirement and apply |
| the negative step adjustment here. |
| (vect_calculate_target_alignment): Remove. |
| (vect_compute_data_ref_alignment): Get explicit vector type |
| argument, do not apply a negative step alignment adjustment |
| here. |
| (vect_slp_analyze_node_alignment): Re-analyze alignment |
| when we re-visit the DR with a bigger desired alignment but |
| keep more precise results from smaller alignments. |
| * tree-vect-slp.c (vect_update_shared_vectype): Remove. |
| (vect_slp_analyze_node_operations_1): Do not update the |
| shared vector type on stmts. |
| * tree-vect-stmts.c (vect_analyze_stmt): Push/pop the |
| vector type of an SLP node to the representative stmt-info. |
| (vect_transform_stmt): Likewise. |
| |
| 2021-09-27 liuhongt <hongtao.liu@intel.com> |
| |
| Revert: |
| 2021-09-09 liuhongt <hongtao.liu@intel.com> |
| |
| PR target/101059 |
| * config/i386/sse.md (reduc_plus_scal_<mode>): Split to .. |
| (reduc_plus_scal_v4sf): .. this, New define_expand. |
| (reduc_plus_scal_v2df): .. and this, New define_expand. |
| |
| 2021-09-26 liuhongt <hongtao.liu@intel.com> |
| |
| * doc/extend.texi (Half-Precision): Remove storage only |
| description for _Float16 w/o avx512fp16. |
| |
| 2021-09-25 Dimitar Dimitrov <dimitar@dinux.eu> |
| |
| * config/pru/constraints.md (Rrio): New constraint. |
| * config/pru/predicates.md (regio_operand): New predicate. |
| * config/pru/pru-pragma.c (pru_register_pragmas): Register |
| the __regio_symbol address space. |
| * config/pru/pru-protos.h (pru_symref2ioregno): Declaration. |
| * config/pru/pru.c (pru_symref2ioregno): New helper function. |
| (pru_legitimate_address_p): Remove. |
| (pru_addr_space_legitimate_address_p): Use the address space |
| aware hook variant. |
| (pru_nongeneric_pointer_addrspace): New helper function. |
| (pru_insert_attributes): New function to validate __regio_symbol |
| usage. |
| (TARGET_INSERT_ATTRIBUTES): New macro. |
| (TARGET_LEGITIMATE_ADDRESS_P): Remove. |
| (TARGET_ADDR_SPACE_LEGITIMATE_ADDRESS_P): New macro. |
| * config/pru/pru.h (enum reg_class): Add REGIO_REGS class. |
| * config/pru/pru.md (*regio_readsi): New pattern to read I/O |
| registers. |
| (*regio_nozext_writesi): New pattern to write to I/O registers. |
| (*regio_zext_write_r30<EQS0:mode>): Ditto. |
| * doc/extend.texi: Document the new PRU Named Address Space. |
| |
| 2021-09-24 Patrick Palka <ppalka@redhat.com> |
| |
| PR c++/98216 |
| PR c++/91292 |
| * real.c (encode_ieee_double): Avoid unwanted sign extension. |
| (encode_ieee_quad): Likewise. |
| |
| 2021-09-24 Vladimir Makarov <vmakarov@redhat.com> |
| |
| PR rtl-optimization/102147 |
| * ira-build.c (ira_conflict_vector_profitable_p): Make |
| profitability calculation independent of host compiler pointer and |
| IRA_INT_BITS sizes. |
| |
| 2021-09-24 Aldy Hernandez <aldyh@redhat.com> |
| |
| * gimple-range-path.cc (path_range_query::path_range_query): |
| Move debugging header... |
| (path_range_query::precompute_ranges): ...here. |
| (path_range_query::internal_range_of_expr): Do not call |
| range_on_path_entry if NAME is defined in the current block. |
| |
| 2021-09-24 Richard Biener <rguenther@suse.de> |
| |
| * cfghooks.c (verify_flow_info): Verify unallocated BB and |
| edge flags are not set. |
| |
| 2021-09-24 Aldy Hernandez <aldyh@redhat.com> |
| |
| * tree-ssa-threadupdate.c (jt_path_registry::cancel_invalid_paths): |
| New. |
| (jt_path_registry::register_jump_thread): Call |
| cancel_invalid_paths. |
| * tree-ssa-threadupdate.h (class jt_path_registry): Add |
| cancel_invalid_paths. |
| |
| 2021-09-24 Feng Xue <fxue@os.amperecomputing.com> |
| |
| PR tree-optimization/102400 |
| * tree-ssa-sccvn.c (vn_reference_insert_pieces): Initialize |
| result_vdef to zero value. |
| |
| 2021-09-24 Feng Xue <fxue@os.amperecomputing.com> |
| |
| PR tree-optimization/102451 |
| * tree-ssa-dse.c (delete_dead_or_redundant_call): Record bb of stmt |
| before removal. |
| |
| 2021-09-24 Hongyu Wang <hongyu.wang@intel.com> |
| |
| * config/i386/sse.md (cond_<insn><mode>): Extend to support |
| vector HFmodes. |
| (cond_mul<mode>): Likewise. |
| (cond_div<mode>): Likewise. |
| (cond_<code><mode>): Likewise. |
| (cond_fma<mode>): Likewise. |
| (cond_fms<mode>): Likewise. |
| (cond_fnma<mode>): Likewise. |
| (cond_fnms<mode>): Likewise. |
| |
| 2021-09-23 Andrew MacLeod <amacleod@redhat.com> |
| |
| PR tree-optimization/102463 |
| * gimple-range-fold.cc (fold_using_range::relation_fold_and_or): If |
| there is no range-ops handler, don't look for a relation. |
| |
| 2021-09-23 Andrew MacLeod <amacleod@redhat.com> |
| |
| * gimple-range-cache.cc (ranger_cache::ranger_cache): Take |
| non-executable_edge flag as parameter. |
| * gimple-range-cache.h (ranger_cache): Adjust prototype. |
| * gimple-range-gori.cc (gori_compute::gori_compute): Take |
| non-executable_edge flag as parameter. |
| (gori_compute::outgoing_edge_range_p): Check new flag. |
| * gimple-range-gori.h (gori_compute): Adjust prototype. |
| * gimple-range.cc (gimple_ranger::gimple_ranger): Create new flag. |
| (gimple_ranger::range_on_edge): Check new flag. |
| * gimple-range.h (gimple_ranger::non_executable_edge_flag): New. |
| * gimple-ssa-evrp.c (rvrp_folder): Pass ranger flag to simplifer. |
| (hybrid_folder::hybrid_folder): Set ranger non-executable flag value. |
| (hybrid_folder::fold_stmt): Set flag value in the simplifer. |
| * vr-values.c (simplify_using_ranges::set_and_propagate_unexecutable): |
| Use not_executable flag if provided inmstead of EDGE_EXECUTABLE. |
| (simplify_using_ranges::simplify_switch_using_ranges): Clear |
| EDGE_EXECUTABLE like it originally did. |
| (simplify_using_ranges::cleanup_edges_and_switches): Clear any |
| NON_EXECUTABLE flags. |
| (simplify_using_ranges::simplify_using_ranges): Adjust. |
| * vr-values.h (class simplify_using_ranges): Adjust. |
| (simplify_using_ranges::set_range_query): Add non-executable flag param. |
| |
| 2021-09-23 Bill Schmidt <wschmidt@linux.ibm.com> |
| |
| PR target/102024 |
| * config/rs6000/rs6000-call.c (rs6000_aggregate_candidate): Detect |
| zero-width bit fields and return indicator. |
| (rs6000_discover_homogeneous_aggregate): Diagnose when the |
| presence of a zero-width bit field changes parameter passing in |
| GCC 12. |
| |
| 2021-09-23 Aldy Hernandez <aldyh@redhat.com> |
| |
| * gimple-range-fold.cc (fold_using_range::range_of_phi): |
| Remove dominator check. |
| |
| 2021-09-23 Aldy Hernandez <aldyh@redhat.com> |
| |
| * gimple-range-path.cc (path_range_query::precompute_relations): |
| Hoist edge calculations before using EDGE_SUCC. |
| |
| 2021-09-23 Jonathan Wakely <jwakely@redhat.com> |
| |
| * configure.ac: Fix --with-multilib-list description. |
| * configure: Regenerate. |
| |
| 2021-09-23 Richard Biener <rguenther@suse.de> |
| |
| PR tree-optimization/102448 |
| * tree-vect-data-refs.c (vect_duplicate_ssa_name_ptr_info): |
| Clear alignment info copied from DR_PTR_INFO. |
| |
| 2021-09-23 Hongyu Wang <hongyu.wang@intel.com> |
| |
| * config/i386/i386-expand.c (ix86_use_mask_cmp_p): Enable |
| HFmode mask_cmp. |
| * config/i386/sse.md (sseintvecmodelower): Add HF vector modes. |
| (<avx512>_store<mode>_mask): Extend to support HF vector modes. |
| (vec_cmp<mode><avx512fmaskmodelower>): Likewise. |
| (vcond_mask_<mode><avx512fmaskmodelower>): Likewise. |
| (vcond<mode><mode>): New expander. |
| (vcond<mode><sseintvecmodelower>): Likewise. |
| (vcond<sseintvecmodelower><mode>): Likewise. |
| (vcondu<mode><sseintvecmodelower>): Likewise. |
| |
| 2021-09-23 Hongyu Wang <hongyu.wang@intel.com> |
| |
| * config/i386/sse.md (extend<ssePHmodelower><mode>2): |
| New expander. |
| (extendv4hf<mode>2): Likewise. |
| (extendv2hfv2df2): Likewise. |
| (trunc<mode><ssePHmodelower>2): Likewise. |
| (avx512fp16_vcvt<castmode>2ph_<mode>): Rename to ... |
| (trunc<mode>v4hf2): ... this, and drop constraints. |
| (avx512fp16_vcvtpd2ph_v2df): Rename to ... |
| (truncv2dfv2hf2): ... this, and likewise. |
| |
| 2021-09-23 Hongyu Wang <hongyu.wang@intel.com> |
| |
| * config/i386/sse.md (float<floatunssuffix><mode><ssePHmodelower>2): |
| New expander. |
| (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>): |
| Rename to ... |
| (float<floatunssuffix><mode>v4hf2): ... this, and drop constraints. |
| (avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Rename to ... |
| (float<floatunssuffix>v2div2hf2): ... this, and likewise. |
| |
| 2021-09-23 Hongyu Wang <hongyu.wang@intel.com> |
| |
| * config/i386/i386.md (fix<fixunssuffix>_trunchf<mode>2): New expander. |
| (fixuns_trunchfhi2): Likewise. |
| (*fixuns_trunchfsi2zext): New define_insn. |
| * config/i386/sse.md (ssePHmodelower): New mode_attr. |
| (fix<fixunssuffix>_trunc<ssePHmodelower><mode>2): |
| New expander for same element vector fix_truncate. |
| (fix<fixunssuffix>_trunc<ssePHmodelower><mode>2): |
| Likewise for V4HF to V4SI/V4DI fix_truncate. |
| (fix<fixunssuffix>_truncv2hfv2di2): |
| Likeise for V2HF to V2DI fix_truncate. |
| |
| 2021-09-23 Hongyu Wang <hongyu.wang@intel.com> |
| |
| * config/i386/i386.md (<code>hf3): New expander. |
| |
| 2021-09-23 liuhongt <hongtao.liu@intel.com> |
| |
| * config/i386/sse.md (FMAMODEM): extend to handle FP16. |
| (VFH_SF_AVX512VL): Extend to handle HFmode. |
| (VF_SF_AVX512VL): Deleted. |
| |
| 2021-09-23 liuhongt <hongtao.liu@intel.com> |
| |
| * config/i386/i386.md (rinthf2): New expander. |
| (nearbyinthf2): New expander. |
| |
| 2021-09-23 Aldy Hernandez <aldyh@redhat.com> |
| |
| * tree-ssa-dom.c (class dom_jump_threader_simplifier): Rename... |
| (class dom_jt_state): ...this and provide virtual overrides. |
| (dom_jt_state::register_equiv): New. |
| (class dom_jt_simplifier): Rename from |
| dom_jump_threader_simplifier. |
| (dom_jump_threader_simplifier::simplify): Rename... |
| (dom_jt_simplifier::simplify): ...to this. |
| (pass_dominator::execute): Use dom_jt_simplifier and |
| dom_jt_state. |
| * tree-ssa-threadedge.c (jump_threader::jump_threader): |
| Clean-up. |
| (jt_state::register_equivs_stmt): Abstract out... |
| (jump_threader::record_temporary_equivalences_from_stmts_at_dest): |
| ...from here. |
| (jump_threader::thread_around_empty_blocks): Update state. |
| (jump_threader::thread_through_normal_block): Same. |
| (jt_state::jt_state): Remove. |
| (jt_state::push): Remove pass specific bits. Keep block vector |
| updated. |
| (jt_state::append_path): New. |
| (jt_state::pop): Remove pass specific bits. |
| (jt_state::register_equiv): Same. |
| (jt_state::record_ranges_from_stmt): Same. |
| (jt_state::register_equivs_on_edge): Same. Rename... |
| (jt_state::register_equivs_edge): ...to this. |
| (jt_state::dump): New. |
| (jt_state::debug): New. |
| (jump_threader_simplifier::simplify): Remove. |
| (jt_state::get_path): New. |
| * tree-ssa-threadedge.h (class jt_simplifier): Make into a base |
| class. Expose common functionality as virtual methods. |
| (class jump_threader_simplifier): Same. Rename... |
| (class jt_simplifier): ...to this. |
| * tree-vrp.c (class vrp_jump_threader_simplifier): Rename... |
| (class vrp_jt_simplifier): ...to this. Provide pass specific |
| overrides. |
| (class vrp_jt_state): New. |
| (vrp_jump_threader_simplifier::simplify): Rename... |
| (vrp_jt_simplifier::simplify): ...to this. Inline code from |
| what used to be the base class. |
| (vrp_jump_threader::vrp_jump_threader): Use vrp_jt_state and |
| vrp_jt_simplifier. |
| |
| 2021-09-22 Tobias Burnus <tobias@codesourcery.com> |
| |
| PR fortran/55534 |
| * doc/invoke.texi (-Wno-missing-include-dirs.): Document Fortran |
| behavior. |
| |
| 2021-09-22 Roger Sayle <roger@nextmovesoftware.com> |
| Richard Biener <rguenther@suse.de> |
| |
| * match.pd (negation simplifications): Implement some negation |
| folding transformations from fold-const.c's fold_negate_expr. |
| * tree-ssa-sccvn.c (vn_nary_build_or_lookup_1): Add a SIMPLIFY |
| argument, to control whether the op should be simplified prior |
| to looking up/assigning a value number. |
| (vn_nary_build_or_lookup): Update call to vn_nary_build_or_lookup_1. |
| (vn_nary_simplify): Likewise. |
| (visit_nary_op): Likewise, but when constructing a NEGATE_EXPR |
| now call vn_nary_build_or_lookup_1 disabling simplification. |
| |
| 2021-09-22 Jiufu Guo <guojiufu@linux.ibm.com> |
| |
| PR tree-optimization/102087 |
| * tree-ssa-loop-niter.c (number_of_iterations_until_wrap): |
| Update bound/cmp/control for niter. |
| |
| 2021-09-22 Aldy Hernandez <aldyh@redhat.com> |
| |
| * gimple-range-fold.cc (fold_using_range::range_of_range_op): |
| Move check for non-empty BB here. |
| (fur_source::register_outgoing_edges): ...from here. |
| |
| 2021-09-22 Aldy Hernandez <aldyh@redhat.com> |
| |
| * gimple-range-path.cc (path_range_query::internal_range_of_expr): |
| Remove call to improve_range_with_equivs. |
| (path_range_query::improve_range_with_equivs): Remove |
| * gimple-range-path.h: Remove improve_range_with_equivs. |
| |
| 2021-09-22 dianhong xu <dianhong.xu@intel.com> |
| |
| * config/i386/avx512fp16intrin.h: |
| (_mm512_mask_blend_ph): New intrinsic. |
| (_mm512_permutex2var_ph): Ditto. |
| (_mm512_permutexvar_ph): Ditto. |
| * config/i386/avx512fp16vlintrin.h: |
| (_mm256_mask_blend_ph): New intrinsic. |
| (_mm256_permutex2var_ph): Ditto. |
| (_mm256_permutexvar_ph): Ditto. |
| (_mm_mask_blend_ph): Ditto. |
| (_mm_permutex2var_ph): Ditto. |
| (_mm_permutexvar_ph): Ditto. |
| |
| 2021-09-22 dianhong xu <dianhong.xu@intel.com> |
| |
| * config/i386/avx512fp16intrin.h: Add new intrinsics. |
| (_mm512_conj_pch): New intrinsic. |
| (_mm512_mask_conj_pch): Ditto. |
| (_mm512_maskz_conj_pch): Ditto. |
| * config/i386/avx512fp16vlintrin.h: Add new intrinsics. |
| (_mm256_conj_pch): New intrinsic. |
| (_mm256_mask_conj_pch): Ditto. |
| (_mm256_maskz_conj_pch): Ditto. |
| (_mm_conj_pch): Ditto. |
| (_mm_mask_conj_pch): Ditto. |
| (_mm_maskz_conj_pch): Ditto. |
| |
| 2021-09-22 dianhong xu <dianhong.xu@intel.com> |
| |
| * config/i386/avx512fp16intrin.h (_MM512_REDUCE_OP): New macro |
| (_mm512_reduce_add_ph): New intrinsic. |
| (_mm512_reduce_mul_ph): Ditto. |
| (_mm512_reduce_min_ph): Ditto. |
| (_mm512_reduce_max_ph): Ditto. |
| * config/i386/avx512fp16vlintrin.h |
| (_MM256_REDUCE_OP/_MM_REDUCE_OP): New macro. |
| (_mm256_reduce_add_ph): New intrinsic. |
| (_mm256_reduce_mul_ph): Ditto. |
| (_mm256_reduce_min_ph): Ditto. |
| (_mm256_reduce_max_ph): Ditto. |
| (_mm_reduce_add_ph): Ditto. |
| (_mm_reduce_mul_ph): Ditto. |
| (_mm_reduce_min_ph): Ditto. |
| (_mm_reduce_max_ph): Ditto. |
| |
| 2021-09-22 dianhong xu <dianhong.xu@intel.com> |
| |
| * config/i386/avx512fp16intrin.h (__m512h_u, __m256h_u, |
| __m128h_u): New typedef. |
| (_mm512_load_ph): New intrinsic. |
| (_mm256_load_ph): Ditto. |
| (_mm_load_ph): Ditto. |
| (_mm512_loadu_ph): Ditto. |
| (_mm256_loadu_ph): Ditto. |
| (_mm_loadu_ph): Ditto. |
| (_mm512_store_ph): Ditto. |
| (_mm256_store_ph): Ditto. |
| (_mm_store_ph): Ditto. |
| (_mm512_storeu_ph): Ditto. |
| (_mm256_storeu_ph): Ditto. |
| (_mm_storeu_ph): Ditto. |
| (_mm512_abs_ph): Ditto. |
| * config/i386/avx512fp16vlintrin.h |
| (_mm_abs_ph): Ditto. |
| (_mm256_abs_ph): Ditto. |
| |
| 2021-09-22 Andreas Krebbel <krebbel@linux.ibm.com> |
| |
| * config/s390/tpf.md (prologue_tpf, epilogue_tpf): Add cc clobber. |
| |
| 2021-09-22 Andreas Krebbel <krebbel@linux.ibm.com> |
| |
| PR target/102222 |
| * config/s390/s390.c (s390_expand_insv): Emit a normal move if it |
| is actually a full copy of the source operand into the target. |
| Don't emit a strict low part move if source and target mode match. |
| |
| 2021-09-22 Jakub Jelinek <jakub@redhat.com> |
| |
| PR middle-end/102415 |
| * omp-expand.c (expand_omp_single): If region->exit is NULL, |
| assert region->entry is GIMPLE_OMP_SCOPE region and return. |
| |
| 2021-09-22 Jakub Jelinek <jakub@redhat.com> |
| |
| * tree.h (OMP_CLAUSE_ALLOCATE_ALIGN): Define. |
| * tree.c (omp_clause_num_ops): Change number of OMP_CLAUSE_ALLOCATE |
| arguments from 2 to 3. |
| * tree-pretty-print.c (dump_omp_clause): Print allocator() around |
| allocate clause allocator and print align if present. |
| * omp-low.c (scan_sharing_clauses): Force allocate_map entry even |
| for omp_default_mem_alloc if align modifier is present. If align |
| modifier is present, use TREE_LIST to encode both allocator and |
| align. |
| (lower_private_allocate, lower_rec_input_clauses, create_task_copyfn): |
| Handle align modifier on allocator clause if present. |
| |
| 2021-09-22 liuhongt <hongtao.liu@intel.com> |
| |
| * config/i386/i386.md (define_attr "isa"): Add |
| fma_or_avx512vl. |
| (define_attr "enabled"): Correspond fma_or_avx512vl to |
| TARGET_FMA || TARGET_AVX512VL. |
| * config/i386/mmx.md (fmav2sf4): Extend to AVX512 fma. |
| (fmsv2sf4): Ditto. |
| (fnmav2sf4): Ditto. |
| (fnmsv2sf4): Ditto. |
| |
| 2021-09-22 liuhongt <hongtao.liu@intel.com> |
| |
| * config/i386/i386.md (cstorehf3): New define_expand. |
| |
| 2021-09-22 liuhongt <hongtao.liu@intel.com> |
| |
| * config/i386/i386.md (<rounding_insn>hf2): New expander. |
| (sse4_1_round<mode>2): Extend from MODEF to MODEFH. |
| * config/i386/sse.md (*sse4_1_round<ssescalarmodesuffix>): |
| Extend from VF_128 to VFH_128. |
| |
| 2021-09-22 liuhongt <hongtao.liu@intel.com> |
| |
| * config/i386/i386-features.c (i386-features.c): Handle |
| E_HFmode. |
| * config/i386/i386.md (sqrthf2): New expander. |
| (*sqrthf2): New define_insn. |
| * config/i386/sse.md |
| (*<sse>_vmsqrt<mode>2<mask_scalar_name><round_scalar_name>): |
| Extend to VFH_128. |
| |
| 2021-09-22 liuhongt <hongtao.liu@intel.com> |
| |
| * config/i386/avx512fp16intrin.h (_mm_mask_fcmadd_sch): |
| New intrinsic. |
| (_mm_mask3_fcmadd_sch): Likewise. |
| (_mm_maskz_fcmadd_sch): Likewise. |
| (_mm_fcmadd_sch): Likewise. |
| (_mm_mask_fmadd_sch): Likewise. |
| (_mm_mask3_fmadd_sch): Likewise. |
| (_mm_maskz_fmadd_sch): Likewise. |
| (_mm_fmadd_sch): Likewise. |
| (_mm_mask_fcmadd_round_sch): Likewise. |
| (_mm_mask3_fcmadd_round_sch): Likewise. |
| (_mm_maskz_fcmadd_round_sch): Likewise. |
| (_mm_fcmadd_round_sch): Likewise. |
| (_mm_mask_fmadd_round_sch): Likewise. |
| (_mm_mask3_fmadd_round_sch): Likewise. |
| (_mm_maskz_fmadd_round_sch): Likewise. |
| (_mm_fmadd_round_sch): Likewise. |
| (_mm_fcmul_sch): Likewise. |
| (_mm_mask_fcmul_sch): Likewise. |
| (_mm_maskz_fcmul_sch): Likewise. |
| (_mm_fmul_sch): Likewise. |
| (_mm_mask_fmul_sch): Likewise. |
| (_mm_maskz_fmul_sch): Likewise. |
| (_mm_fcmul_round_sch): Likewise. |
| (_mm_mask_fcmul_round_sch): Likewise. |
| (_mm_maskz_fcmul_round_sch): Likewise. |
| (_mm_fmul_round_sch): Likewise. |
| (_mm_mask_fmul_round_sch): Likewise. |
| (_mm_maskz_fmul_round_sch): Likewise. |
| * config/i386/i386-builtin.def: Add corresponding new builtins. |
| * config/i386/sse.md |
| (avx512fp16_fmaddcsh_v8hf_maskz<round_expand_name>): New expander. |
| (avx512fp16_fcmaddcsh_v8hf_maskz<round_expand_name>): Ditto. |
| (avx512fp16_fma_<complexopname>sh_v8hf<mask_scalarcz_name><round_scalarcz_name>): |
| New define insn. |
| (avx512fp16_<complexopname>sh_v8hf_mask<round_name>): Ditto. |
| (avx512fp16_<complexopname>sh_v8hf<mask_scalarc_name><round_scalarcz_name>): |
| Ditto. |
| * config/i386/subst.md (mask_scalarcz_name): New. |
| (mask_scalarc_name): Ditto. |
| (mask_scalarc_operand3): Ditto. |
| (mask_scalarcz_operand4): Ditto. |
| (round_scalarcz_name): Ditto. |
| (round_scalarc_mask_operand3): Ditto. |
| (round_scalarcz_mask_operand4): Ditto. |
| (round_scalarc_mask_op3): Ditto. |
| (round_scalarcz_mask_op4): Ditto. |
| (round_scalarcz_constraint): Ditto. |
| (round_scalarcz_nimm_predicate): Ditto. |
| (mask_scalarcz): Ditto. |
| (mask_scalarc): Ditto. |
| (round_scalarcz): Ditto. |
| |
| 2021-09-22 liuhongt <hongtao.liu@intel.com> |
| |
| * config/i386/avx512fp16intrin.h (_mm512_fcmadd_pch): |
| New intrinsic. |
| (_mm512_mask_fcmadd_pch): Likewise. |
| (_mm512_mask3_fcmadd_pch): Likewise. |
| (_mm512_maskz_fcmadd_pch): Likewise. |
| (_mm512_fmadd_pch): Likewise. |
| (_mm512_mask_fmadd_pch): Likewise. |
| (_mm512_mask3_fmadd_pch): Likewise. |
| (_mm512_maskz_fmadd_pch): Likewise. |
| (_mm512_fcmadd_round_pch): Likewise. |
| (_mm512_mask_fcmadd_round_pch): Likewise. |
| (_mm512_mask3_fcmadd_round_pch): Likewise. |
| (_mm512_maskz_fcmadd_round_pch): Likewise. |
| (_mm512_fmadd_round_pch): Likewise. |
| (_mm512_mask_fmadd_round_pch): Likewise. |
| (_mm512_mask3_fmadd_round_pch): Likewise. |
| (_mm512_maskz_fmadd_round_pch): Likewise. |
| (_mm512_fcmul_pch): Likewise. |
| (_mm512_mask_fcmul_pch): Likewise. |
| (_mm512_maskz_fcmul_pch): Likewise. |
| (_mm512_fmul_pch): Likewise. |
| (_mm512_mask_fmul_pch): Likewise. |
| (_mm512_maskz_fmul_pch): Likewise. |
| (_mm512_fcmul_round_pch): Likewise. |
| (_mm512_mask_fcmul_round_pch): Likewise. |
| (_mm512_maskz_fcmul_round_pch): Likewise. |
| (_mm512_fmul_round_pch): Likewise. |
| (_mm512_mask_fmul_round_pch): Likewise. |
| (_mm512_maskz_fmul_round_pch): Likewise. |
| * config/i386/avx512fp16vlintrin.h (_mm_fmadd_pch): |
| New intrinsic. |
| (_mm_mask_fmadd_pch): Likewise. |
| (_mm_mask3_fmadd_pch): Likewise. |
| (_mm_maskz_fmadd_pch): Likewise. |
| (_mm256_fmadd_pch): Likewise. |
| (_mm256_mask_fmadd_pch): Likewise. |
| (_mm256_mask3_fmadd_pch): Likewise. |
| (_mm256_maskz_fmadd_pch): Likewise. |
| (_mm_fcmadd_pch): Likewise. |
| (_mm_mask_fcmadd_pch): Likewise. |
| (_mm_mask3_fcmadd_pch): Likewise. |
| (_mm_maskz_fcmadd_pch): Likewise. |
| (_mm256_fcmadd_pch): Likewise. |
| (_mm256_mask_fcmadd_pch): Likewise. |
| (_mm256_mask3_fcmadd_pch): Likewise. |
| (_mm256_maskz_fcmadd_pch): Likewise. |
| (_mm_fmul_pch): Likewise. |
| (_mm_mask_fmul_pch): Likewise. |
| (_mm_maskz_fmul_pch): Likewise. |
| (_mm256_fmul_pch): Likewise. |
| (_mm256_mask_fmul_pch): Likewise. |
| (_mm256_maskz_fmul_pch): Likewise. |
| (_mm_fcmul_pch): Likewise. |
| (_mm_mask_fcmul_pch): Likewise. |
| (_mm_maskz_fcmul_pch): Likewise. |
| (_mm256_fcmul_pch): Likewise. |
| (_mm256_mask_fcmul_pch): Likewise. |
| (_mm256_maskz_fcmul_pch): Likewise. |
| * config/i386/i386-builtin-types.def (V8HF_FTYPE_V8HF_V8HF_V8HF, |
| V8HF_FTYPE_V16HF_V16HF_V16HF, V16HF_FTYPE_V16HF_V16HF_V16HF_UQI, |
| V32HF_FTYPE_V32HF_V32HF_V32HF_INT, |
| V32HF_FTYPE_V32HF_V32HF_V32HF_UHI_INT): Add new builtin types. |
| * config/i386/i386-builtin.def: Add new builtins. |
| * config/i386/i386-expand.c: Handle new builtin types. |
| * config/i386/subst.md (SUBST_CV): New. |
| (maskc_name): Ditto. |
| (maskc_operand3): Ditto. |
| (maskc): Ditto. |
| (sdc_maskz_name): Ditto. |
| (sdc_mask_op4): Ditto. |
| (sdc_mask_op5): Ditto. |
| (sdc_mask_mode512bit_condition): Ditto. |
| (sdc): Ditto. |
| (round_maskc_operand3): Ditto. |
| (round_sdc_mask_operand4): Ditto. |
| (round_maskc_op3): Ditto. |
| (round_sdc_mask_op4): Ditto. |
| (round_saeonly_sdc_mask_operand5): Ditto. |
| * config/i386/sse.md (unspec): Add complex fma unspecs. |
| (avx512fmaskcmode): New. |
| (UNSPEC_COMPLEX_F_C_MA): Ditto. |
| (UNSPEC_COMPLEX_F_C_MUL): Ditto. |
| (complexopname): Ditto. |
| (<avx512>_fmaddc_<mode>_maskz<round_expand_name>): New expander. |
| (<avx512>_fcmaddc_<mode>_maskz<round_expand_name>): Ditto. |
| (fma_<complexopname>_<mode><sdc_maskz_name><round_name>): New |
| define insn. |
| (<avx512>_<complexopname>_<mode>_mask<round_name>): Ditto. |
| (<avx512>_<complexopname>_<mode><maskc_name><round_name>): Ditto. |
| |
| 2021-09-22 Kewen Lin <linkw@linux.ibm.com> |
| |
| * config/rs6000/rs6000.opt (rs6000-density-pct-threshold, |
| rs6000-density-size-threshold, rs6000-density-penalty, |
| rs6000-density-load-pct-threshold, |
| rs6000-density-load-num-threshold): New parameter. |
| * config/rs6000/rs6000.c (rs6000_density_test): Adjust with |
| corresponding parameters. |
| |
| 2021-09-21 Aldy Hernandez <aldyh@redhat.com> |
| |
| * gimple-range-path.cc (path_range_query::defined_outside_path): |
| New. |
| (path_range_query::range_on_path_entry): New. |
| (path_range_query::internal_range_of_expr): Resolve unknowns |
| with ranger. |
| (path_range_query::improve_range_with_equivs): New. |
| (path_range_query::ssa_range_in_phi): Resolve unknowns with |
| ranger. |
| * gimple-range-path.h (class path_range_query): Add |
| defined_outside_path, range_on_path_entry, and |
| improve_range_with_equivs. |
| |
| 2021-09-21 Aldy Hernandez <aldyh@redhat.com> |
| |
| * gimple-range-path.cc (path_range_query::add_to_imports): New. |
| (path_range_query::add_copies_to_imports): New. |
| (path_range_query::precompute_ranges): Call |
| add_copies_to_imports. |
| * gimple-range-path.h (class path_range_query): Add prototypes |
| for add_copies_to_imports and add_to_imports. |
| |
| 2021-09-21 Aldy Hernandez <aldyh@redhat.com> |
| |
| * gimple-range-path.cc (path_range_query::range_defined_in_block): |
| Remove useless code. |
| |
| 2021-09-21 Aldy Hernandez <aldyh@redhat.com> |
| |
| * gimple-range-fold.h (class fur_source): Make oracle protected. |
| * gimple-range-path.cc (path_range_query::path_range_query): Add |
| resolve argument. Initialize oracle. |
| (path_range_query::~path_range_query): Delete oracle. |
| (path_range_query::range_of_stmt): Adapt to use relations. |
| (path_range_query::precompute_ranges): Pre-compute relations. |
| (class jt_fur_source): New |
| (jt_fur_source::jt_fur_source): New. |
| (jt_fur_source::register_relation): New. |
| (jt_fur_source::query_relation): New. |
| (path_range_query::precompute_relations): New. |
| (path_range_query::precompute_phi_relations): New. |
| * gimple-range-path.h (path_range_query): Add resolve argument. |
| Add oracle, precompute_relations, precompute_phi_relations. |
| * tree-ssa-threadbackward.c (back_threader::back_threader): Pass |
| resolve argument to solver. |
| |
| 2021-09-21 Aldy Hernandez <aldyh@redhat.com> |
| |
| * gimple-range-fold.cc (fold_using_range::range_of_range_op): |
| Rename postfold_gcond_edges to register_outgoing_edges and |
| adapt. |
| (fold_using_range::postfold_gcond_edges): Rename... |
| (fur_source::register_outgoing_edges): ...to this. |
| * gimple-range-fold.h (postfold_gcond_edges): Rename to |
| register_outgoing_edges and move to fur_source. |
| |
| 2021-09-21 Aldy Hernandez <aldyh@redhat.com> |
| |
| * gimple-range-fold.cc (fold_using_range::range_of_phi): Check |
| dom_info_available_p. |
| |
| 2021-09-21 Aldy Hernandez <aldyh@redhat.com> |
| |
| * gimple-range-cache.cc (non_null_ref::non_null_ref): Use create |
| and quick_grow_cleared instead of safe_grow_cleared. |
| |
| 2021-09-21 Thomas Schwinge <thomas@codesourcery.com> |
| |
| PR other/102408 |
| * omp-oacc-neuter-broadcast.cc (oacc_do_neutering): Evaluate |
| 'random ()' to '0'. |
| |
| 2021-09-21 Richard Earnshaw <rearnsha@arm.com> |
| |
| * configure.ac: Detect when the assembler supports new-style |
| architecture extensions. |
| * common/config/arm/arm-common.c (arm_rewrite_mcpu): Return |
| the full CPU string if the assembler can grok it. |
| (arm_rewrite_march): Likewise but for the architecture. |
| * config.in: Regenerate. |
| * configure: Regenerate. |
| |
| 2021-09-21 Richard Biener <rguenther@suse.de> |
| |
| PR tree-optimization/102421 |
| * tree-vect-loop.c (vect_dissolve_slp_only_groups): Copy and |
| adjust alignment info. |
| |
| 2021-09-21 Kewen Lin <linkw@linux.ibm.com> |
| |
| * ipa-fnsummary.c (ipa_fn_summary_write): Remove inconsistent |
| bitfield stream out. |
| |
| 2021-09-20 Andrew MacLeod <amacleod@redhat.com> |
| |
| * gimple-range-fold.cc (fold_using_range::range_of_phi): Ignore |
| undefined edges, apply an equivalence if appropriate. |
| * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Return |
| UNDEFINED if EDGE_EXECUTABLE is not set. |
| * gimple-range.cc (gimple_ranger::gimple_ranger): Set all edges |
| as EXECUTABLE upon startup. |
| (gimple_ranger::range_on_edge): Return UNDEFINED for edges without |
| EDGE_EXECUTABLE set. |
| * vr-values.c (set_and_propagate_unexecutable): New. |
| (simplify_using_ranges::fold_cond): Call set_and_propagate. |
| (simplify_using_ranges::simplify_switch_using_ranges): Ditto. |
| * vr-values.h: Add prototype. |
| |
| 2021-09-20 Andrew MacLeod <amacleod@redhat.com> |
| |
| * value-relation.cc (equiv_oracle::register_initial_def): New. |
| (equiv_oracle::register_relation): Call register_initial_def. |
| (equiv_oracle::add_equiv_to_block): New. Split register_relation. |
| (relation_oracle::register_stmt): Check def block of PHI arguments. |
| * value-relation.h (equiv_oracle): Add new prototypes. |
| |
| 2021-09-20 Matthias Kretz <m.kretz@gsi.de> |
| |
| * cppbuiltin.c (define_builtin_macros_for_compilation_flags): |
| Define __RECIPROCAL_MATH__, __NO_SIGNED_ZEROS__, |
| __NO_TRAPPING_MATH__, __ASSOCIATIVE_MATH__, and |
| __ROUNDING_MATH__ according to their corresponding flags. |
| * doc/cpp.texi: Document __RECIPROCAL_MATH__, |
| __NO_SIGNED_ZEROS__, __NO_TRAPPING_MATH__, __ASSOCIATIVE_MATH__, |
| and __ROUNDING_MATH__. |
| |
| 2021-09-20 Richard Biener <rguenther@suse.de> |
| |
| * tree-vect-stmts.c (vectorizable_load): Use the vectype |
| from the SLP node. |
| |
| 2021-09-20 Richard Biener <rguenther@suse.de> |
| |
| * tree-vect-data-refs.c (vect_duplicate_ssa_name_ptr_info): |
| Do not compute alignment of the vectorized access here. |
| |
| 2021-09-20 Richard Biener <rguenther@suse.de> |
| |
| * tree-vect-data-refs.c (vect_enhance_data_refs_alignment): |
| Store -1 for runtime alias peeling iterations. |
| |
| 2021-09-20 Richard Biener <rguenther@suse.de> |
| |
| * config.gcc: Obsolete hppa[12]*-*-hpux10* and hppa[12]*-*-hpux11*. |
| |
| 2021-09-20 Thomas Schwinge <thomas@codesourcery.com> |
| |
| * input.c (string_concat_db::record_string_concatenation) |
| (string_concat_db::get_string_concatenation): Skip for |
| 'RESERVED_LOCATION_P'. |
| |
| 2021-09-20 Richard Biener <rguenther@suse.de> |
| |
| PR tree-optimization/65206 |
| * tree-data-ref.h (struct data_reference): Add alt_indices, |
| order it last. |
| * tree-data-ref.c (free_data_ref): Release alt_indices. |
| (dr_analyze_indices): Work on struct indices and get DR_REF as tree. |
| (create_data_ref): Adjust. |
| (initialize_data_dependence_relation): Split into head |
| and tail. When the base objects fail to match up try |
| again with pointer-based analysis of indices. |
| * tree-vectorizer.c (vec_info_shared::check_datarefs): Do |
| not compare the lazily computed alternate set of indices. |
| |
| 2021-09-20 Iain Sandoe <iain@sandoe.co.uk> |
| |
| * gcc.c: Test for execute OK when we find the |
| programs for assembler linker and dsymutil and those |
| were specified at configure-time. |
| |
| 2021-09-19 Martin Sebor <msebor@redhat.com> |
| |
| PR middle-end/102403 |
| * gimple-predicate-analysis.cc (predicate::init_from_control_deps): |
| Correct a function pre/postcondition. |
| |
| 2021-09-19 Martin Sebor <msebor@redhat.com> |
| |
| PR middle-end/102243 |
| * tree-ssa-strlen.c (get_range): Handle null cfun. |
| |
| 2021-09-19 Iain Sandoe <iain@sandoe.co.uk> |
| |
| * config/darwin.h (LINK_COMMAND_SPEC_A): Use Darwin10 |
| unwinder shim as a convenience library. |
| |
| 2021-09-19 Andrew Pinski <apinski@marvell.com> |
| |
| * doc/install.texi: Add note about |
| binutils 2.35 is required for LTO usage. |
| |
| 2021-09-19 Aldy Hernandez <aldyh@redhat.com> |
| |
| * tree-ssa-threadbackward.c |
| (back_threader_registry::register_path): Use push_edge. |
| * tree-ssa-threadedge.c |
| (jump_threader::thread_around_empty_blocks): Same. |
| (jump_threader::thread_through_normal_block): Same. |
| (jump_threader::thread_across_edge): Same. Also, use auto_bitmap. |
| Tidy up code. |
| * tree-ssa-threadupdate.c |
| (jt_path_registry::allocate_thread_edge): Remove. |
| (jt_path_registry::push_edge): New. |
| (dump_jump_thread_path): Make static. |
| * tree-ssa-threadupdate.h (allocate_thread_edge): Remove. |
| (push_edge): New. |
| |
| 2021-09-19 Aldy Hernandez <aldyh@redhat.com> |
| |
| * gimple-range-path.cc (path_range_query::path_range_query): Add |
| header. |
| (path_range_query::dump): Remove extern declaration of dump_ranger. |
| * gimple-range-trace.cc (dump_ranger): Add DEBUG_FUNCTION marker. |
| * gimple-range-trace.h (dump_ranger): Add prototype. |
| |
| 2021-09-19 John Ericson <git@JohnEricson.me> |
| |
| * gcc.c (find_a_program): New function, factored out of... |
| (find_a_file): Here. |
| (execute): Use find_a_program when looking for programs rather |
| than find_a_file. |
| |
| 2021-09-19 Matwey V. Kornilov <matwey.kornilov@gmail.com> |
| |
| * config/avr/avr-mcus.def: Add atmega324pb. |
| * doc/avr-mmcu.texi: Corresponding changes. |
| |
| 2021-09-19 Roger Sayle <roger@nextmovesoftware.com> |
| |
| PR middle-end/88173 |
| * match.pd (cmp @0 REAL_CST@1): When @0 is also REAL_CST, apply |
| the same transformations as to @1. For comparisons against NaN, |
| don't check HONOR_SNANS but confirm that neither operand is a |
| signaling NaN. |
| |
| 2021-09-19 Benjamin Peterson <benjamin@locrian.net> |
| |
| * attribs.c (make_unique_name): Delete. |
| * attribs.h (make_unique_name): Delete. |
| |
| 2021-09-19 Andrew Pinski <apinski@marvell.com> |
| |
| * lra-constraints.c (check_and_process_move): Assert |
| that dclass and sclass are greater than or equal to NO_REGS. |
| |
| 2021-09-18 Jakub Jelinek <jakub@redhat.com> |
| |
| * tree.h (OMP_CLAUSE_ORDER_UNCONSTRAINED): Define. |
| * tree-pretty-print.c (dump_omp_clause): Print unconstrained: |
| for OMP_CLAUSE_ORDER_UNCONSTRAINED. |
| |
| 2021-09-18 liuhongt <hongtao.liu@intel.com> |
| |
| * config/i386/i386-features.c (remove_partial_avx_dependency): |
| Restrict TARGET_USE_VECTOR_FP_CONVERTS and |
| TARGET_USE_VECTOR_CONVERTS to conversion instructions only. |
| |
| 2021-09-18 Jakub Jelinek <jakub@redhat.com> |
| |
| * gimplify.c (omp_default_clause): For C/C++ default({,first}private), |
| if file/namespace scope variable doesn't have predetermined sharing, |
| treat it as if there was default(none). |
| |
| 2021-09-18 liuhongt <hongtao.liu@intel.com> |
| |
| * config/i386/avx512fp16intrin.h (_mm_fmadd_sh): |
| New intrinsic. |
| (_mm_mask_fmadd_sh): Likewise. |
| (_mm_mask3_fmadd_sh): Likewise. |
| (_mm_maskz_fmadd_sh): Likewise. |
| (_mm_fmadd_round_sh): Likewise. |
| (_mm_mask_fmadd_round_sh): Likewise. |
| (_mm_mask3_fmadd_round_sh): Likewise. |
| (_mm_maskz_fmadd_round_sh): Likewise. |
| (_mm_fnmadd_sh): Likewise. |
| (_mm_mask_fnmadd_sh): Likewise. |
| (_mm_mask3_fnmadd_sh): Likewise. |
| (_mm_maskz_fnmadd_sh): Likewise. |
| (_mm_fnmadd_round_sh): Likewise. |
| (_mm_mask_fnmadd_round_sh): Likewise. |
| (_mm_mask3_fnmadd_round_sh): Likewise. |
| (_mm_maskz_fnmadd_round_sh): Likewise. |
| (_mm_fmsub_sh): Likewise. |
| (_mm_mask_fmsub_sh): Likewise. |
| (_mm_mask3_fmsub_sh): Likewise. |
| (_mm_maskz_fmsub_sh): Likewise. |
| (_mm_fmsub_round_sh): Likewise. |
| (_mm_mask_fmsub_round_sh): Likewise. |
| (_mm_mask3_fmsub_round_sh): Likewise. |
| (_mm_maskz_fmsub_round_sh): Likewise. |
| (_mm_fnmsub_sh): Likewise. |
| (_mm_mask_fnmsub_sh): Likewise. |
| (_mm_mask3_fnmsub_sh): Likewise. |
| (_mm_maskz_fnmsub_sh): Likewise. |
| (_mm_fnmsub_round_sh): Likewise. |
| (_mm_mask_fnmsub_round_sh): Likewise. |
| (_mm_mask3_fnmsub_round_sh): Likewise. |
| (_mm_maskz_fnmsub_round_sh): Likewise. |
| * config/i386/i386-builtin-types.def |
| (V8HF_FTYPE_V8HF_V8HF_V8HF_UQI_INT): New builtin type. |
| * config/i386/i386-builtin.def: Add new builtins. |
| * config/i386/i386-expand.c: Handle new builtin type. |
| * config/i386/sse.md (fmai_vmfmadd_<mode><round_name>): |
| Ajdust to support FP16. |
| (fmai_vmfmsub_<mode><round_name>): Ditto. |
| (fmai_vmfnmadd_<mode><round_name>): Ditto. |
| (fmai_vmfnmsub_<mode><round_name>): Ditto. |
| (*fmai_fmadd_<mode>): Ditto. |
| (*fmai_fmsub_<mode>): Ditto. |
| (*fmai_fnmadd_<mode><round_name>): Ditto. |
| (*fmai_fnmsub_<mode><round_name>): Ditto. |
| (avx512f_vmfmadd_<mode>_mask<round_name>): Ditto. |
| (avx512f_vmfmadd_<mode>_mask3<round_name>): Ditto. |
| (avx512f_vmfmadd_<mode>_maskz<round_expand_name>): Ditto. |
| (avx512f_vmfmadd_<mode>_maskz_1<round_name>): Ditto. |
| (*avx512f_vmfmsub_<mode>_mask<round_name>): Ditto. |
| (avx512f_vmfmsub_<mode>_mask3<round_name>): Ditto. |
| (*avx512f_vmfmsub_<mode>_maskz_1<round_name>): Ditto. |
| (*avx512f_vmfnmsub_<mode>_mask<round_name>): Ditto. |
| (*avx512f_vmfnmsub_<mode>_mask3<round_name>): Ditto. |
| (*avx512f_vmfnmsub_<mode>_mask<round_name>): Ditto. |
| (*avx512f_vmfnmadd_<mode>_mask<round_name>): Renamed to ... |
| (avx512f_vmfnmadd_<mode>_mask<round_name>) ... this, and |
| adjust to support FP16. |
| (avx512f_vmfnmadd_<mode>_mask3<round_name>): Ditto. |
| (avx512f_vmfnmadd_<mode>_maskz_1<round_name>): Ditto. |
| (avx512f_vmfnmadd_<mode>_maskz<round_expand_name>): New |
| expander. |
| |
| 2021-09-18 H.J. Lu <hjl.tools@gmail.com> |
| |
| * config/i386/sse.md (avx512fmaskmodelower): Extend to support |
| HF modes. |
| (maskload<mode><avx512fmaskmodelower>): Ditto. |
| (maskstore<mode><avx512fmaskmodelower>): Ditto. |
| |
| 2021-09-18 H.J. Lu <hjl.tools@gmail.com> |
| |
| * config/i386/i386-expand.c (ix86_expand_fp_absneg_operator): |
| Handle HFmode. |
| (ix86_expand_copysign): Ditto. |
| (ix86_expand_xorsign): Ditto. |
| * config/i386/i386.c (ix86_build_const_vector): Handle HF vector |
| modes. |
| (ix86_build_signbit_mask): Ditto. |
| (ix86_can_change_mode_class): Ditto. |
| * config/i386/i386.md |
| (SSEMODEF): Add HFmode. |
| (ssevecmodef): Ditto. |
| (<code>hf2): New define_expand. |
| (*<code>hf2_1): New define_insn_and_split. |
| (copysign<mode>): Extend to support HFmode under AVX512FP16. |
| (xorsign<mode>): Ditto. |
| * config/i386/sse.md (VFB): New mode iterator. |
| (VFB_128_256): Ditto. |
| (VFB_512): Ditto. |
| (sseintvecmode2): Support HF vector mode. |
| (<code><mode>2): Use new mode iterator. |
| (*<code><mode>2): Ditto. |
| (copysign<mode>3): Ditto. |
| (xorsign<mode>3): Ditto. |
| (<code><mode>3<mask_name>): Ditto. |
| (<code><mode>3<mask_name>): Ditto. |
| (<sse>_andnot<mode>3<mask_name>): Adjust for HF vector mode. |
| (<sse>_andnot<mode>3<mask_name>): Ditto. |
| (*<code><mode>3<mask_name>): Ditto. |
| (*<code><mode>3<mask_name>): Ditto. |
| |
| 2021-09-18 liuhongt <hongtao.liu@intel.com> |
| |
| * config/i386/avx512fp16intrin.h (_mm512_mask_fmadd_ph): |
| New intrinsic. |
| (_mm512_mask3_fmadd_ph): Likewise. |
| (_mm512_maskz_fmadd_ph): Likewise. |
| (_mm512_fmadd_round_ph): Likewise. |
| (_mm512_mask_fmadd_round_ph): Likewise. |
| (_mm512_mask3_fmadd_round_ph): Likewise. |
| (_mm512_maskz_fmadd_round_ph): Likewise. |
| (_mm512_fnmadd_ph): Likewise. |
| (_mm512_mask_fnmadd_ph): Likewise. |
| (_mm512_mask3_fnmadd_ph): Likewise. |
| (_mm512_maskz_fnmadd_ph): Likewise. |
| (_mm512_fnmadd_round_ph): Likewise. |
| (_mm512_mask_fnmadd_round_ph): Likewise. |
| (_mm512_mask3_fnmadd_round_ph): Likewise. |
| (_mm512_maskz_fnmadd_round_ph): Likewise. |
| (_mm512_fmsub_ph): Likewise. |
| (_mm512_mask_fmsub_ph): Likewise. |
| (_mm512_mask3_fmsub_ph): Likewise. |
| (_mm512_maskz_fmsub_ph): Likewise. |
| (_mm512_fmsub_round_ph): Likewise. |
| (_mm512_mask_fmsub_round_ph): Likewise. |
| (_mm512_mask3_fmsub_round_ph): Likewise. |
| (_mm512_maskz_fmsub_round_ph): Likewise. |
| (_mm512_fnmsub_ph): Likewise. |
| (_mm512_mask_fnmsub_ph): Likewise. |
| (_mm512_mask3_fnmsub_ph): Likewise. |
| (_mm512_maskz_fnmsub_ph): Likewise. |
| (_mm512_fnmsub_round_ph): Likewise. |
| (_mm512_mask_fnmsub_round_ph): Likewise. |
| (_mm512_mask3_fnmsub_round_ph): Likewise. |
| (_mm512_maskz_fnmsub_round_ph): Likewise. |
| * config/i386/avx512fp16vlintrin.h (_mm256_fmadd_ph): |
| New intrinsic. |
| (_mm256_mask_fmadd_ph): Likewise. |
| (_mm256_mask3_fmadd_ph): Likewise. |
| (_mm256_maskz_fmadd_ph): Likewise. |
| (_mm_fmadd_ph): Likewise. |
| (_mm_mask_fmadd_ph): Likewise. |
| (_mm_mask3_fmadd_ph): Likewise. |
| (_mm_maskz_fmadd_ph): Likewise. |
| (_mm256_fnmadd_ph): Likewise. |
| (_mm256_mask_fnmadd_ph): Likewise. |
| (_mm256_mask3_fnmadd_ph): Likewise. |
| (_mm256_maskz_fnmadd_ph): Likewise. |
| (_mm_fnmadd_ph): Likewise. |
| (_mm_mask_fnmadd_ph): Likewise. |
| (_mm_mask3_fnmadd_ph): Likewise. |
| (_mm_maskz_fnmadd_ph): Likewise. |
| (_mm256_fmsub_ph): Likewise. |
| (_mm256_mask_fmsub_ph): Likewise. |
| (_mm256_mask3_fmsub_ph): Likewise. |
| (_mm256_maskz_fmsub_ph): Likewise. |
| (_mm_fmsub_ph): Likewise. |
| (_mm_mask_fmsub_ph): Likewise. |
| (_mm_mask3_fmsub_ph): Likewise. |
| (_mm_maskz_fmsub_ph): Likewise. |
| (_mm256_fnmsub_ph): Likewise. |
| (_mm256_mask_fnmsub_ph): Likewise. |
| (_mm256_mask3_fnmsub_ph): Likewise. |
| (_mm256_maskz_fnmsub_ph): Likewise. |
| (_mm_fnmsub_ph): Likewise. |
| (_mm_mask_fnmsub_ph): Likewise. |
| (_mm_mask3_fnmsub_ph): Likewise. |
| (_mm_maskz_fnmsub_ph): Likewise. |
| * config/i386/i386-builtin.def: Add corresponding new builtins. |
| * config/i386/sse.md |
| (<avx512>_fmadd_<mode>_maskz<round_expand_name>): Adjust to |
| support HF vector modes. |
| (<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name><round_name>): |
| Ditto. |
| (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_1): Ditto. |
| (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_2): Ditto. |
| (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_3): Ditto. |
| (<avx512>_fmadd_<mode>_mask<round_name>): Ditto. |
| (<avx512>_fmadd_<mode>_mask3<round_name>): Ditto. |
| (<avx512>_fmsub_<mode>_maskz<round_expand_name>): Ditto. |
| (<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name><round_name>): |
| Ditto. |
| (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_1): Ditto. |
| (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_2): Ditto. |
| (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_3): Ditto. |
| (<avx512>_fmsub_<mode>_mask<round_name>): Ditto. |
| (<avx512>_fmsub_<mode>_mask3<round_name>): Ditto. |
| (<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name><round_name>): |
| Ditto. |
| (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_1): Ditto. |
| (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_2): Ditto. |
| (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_3): Ditto. |
| (<avx512>_fnmadd_<mode>_mask<round_name>): Ditto. |
| (<avx512>_fnmadd_<mode>_mask3<round_name>): Ditto. |
| (<avx512>_fnmsub_<mode>_maskz<round_expand_name>): Ditto. |
| (<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name><round_name>): |
| Ditto. |
| (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_1): Ditto. |
| (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_2): Ditto. |
| (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_3): Ditto. |
| (<avx512>_fnmsub_<mode>_mask<round_name>): Ditto. |
| (<avx512>_fnmsub_<mode>_mask3<round_name>): Ditto. |
| |
| 2021-09-18 liuhongt <hongtao.liu@intel.com> |
| |
| * config/i386/avx512fp16intrin.h (_mm512_fmaddsub_ph): |
| New intrinsic. |
| (_mm512_mask_fmaddsub_ph): Likewise. |
| (_mm512_mask3_fmaddsub_ph): Likewise. |
| (_mm512_maskz_fmaddsub_ph): Likewise. |
| (_mm512_fmaddsub_round_ph): Likewise. |
| (_mm512_mask_fmaddsub_round_ph): Likewise. |
| (_mm512_mask3_fmaddsub_round_ph): Likewise. |
| (_mm512_maskz_fmaddsub_round_ph): Likewise. |
| (_mm512_mask_fmsubadd_ph): Likewise. |
| (_mm512_mask3_fmsubadd_ph): Likewise. |
| (_mm512_maskz_fmsubadd_ph): Likewise. |
| (_mm512_fmsubadd_round_ph): Likewise. |
| (_mm512_mask_fmsubadd_round_ph): Likewise. |
| (_mm512_mask3_fmsubadd_round_ph): Likewise. |
| (_mm512_maskz_fmsubadd_round_ph): Likewise. |
| * config/i386/avx512fp16vlintrin.h (_mm256_fmaddsub_ph): |
| New intrinsic. |
| (_mm256_mask_fmaddsub_ph): Likewise. |
| (_mm256_mask3_fmaddsub_ph): Likewise. |
| (_mm256_maskz_fmaddsub_ph): Likewise. |
| (_mm_fmaddsub_ph): Likewise. |
| (_mm_mask_fmaddsub_ph): Likewise. |
| (_mm_mask3_fmaddsub_ph): Likewise. |
| (_mm_maskz_fmaddsub_ph): Likewise. |
| (_mm256_fmsubadd_ph): Likewise. |
| (_mm256_mask_fmsubadd_ph): Likewise. |
| (_mm256_mask3_fmsubadd_ph): Likewise. |
| (_mm256_maskz_fmsubadd_ph): Likewise. |
| (_mm_fmsubadd_ph): Likewise. |
| (_mm_mask_fmsubadd_ph): Likewise. |
| (_mm_mask3_fmsubadd_ph): Likewise. |
| (_mm_maskz_fmsubadd_ph): Likewise. |
| * config/i386/i386-builtin.def: Add corresponding new builtins. |
| * config/i386/sse.md (VFH_SF_AVX512VL): New mode iterator. |
| * (<avx512>_fmsubadd_<mode>_maskz<round_expand_name>): New expander. |
| * (<avx512>_fmaddsub_<mode>_maskz<round_expand_name>): Use |
| VFH_SF_AVX512VL. |
| * (<sd_mask_codefor>fma_fmaddsub_<mode><sd_maskz_name><round_name>): |
| Ditto. |
| * (<avx512>_fmaddsub_<mode>_mask<round_name>): Ditto. |
| * (<avx512>_fmaddsub_<mode>_mask3<round_name>): Ditto. |
| * (<sd_mask_codefor>fma_fmsubadd_<mode><sd_maskz_name><round_name>): |
| Ditto. |
| * (<avx512>_fmsubadd_<mode>_mask<round_name>): Ditto. |
| * (<avx512>_fmsubadd_<mode>_mask3<round_name>): Ditto. |
| |
| 2021-09-18 liuhongt <hongtao.liu@intel.com> |
| |
| PR target/87767 |
| * config/i386/i386.c (ix86_print_operand): Handle |
| V8HF/V16HF/V32HFmode. |
| * config/i386/i386.h (VALID_BCST_MODE_P): Add HFmode. |
| * config/i386/sse.md (avx512bcst): Remove. |
| |
| 2021-09-17 Martin Sebor <msebor@redhat.com> |
| |
| * Makefile.in (OBJS): Add gimple-predicate-analysis.o. |
| * tree-ssa-uninit.c (max_phi_args): Move to gimple-predicate-analysis. |
| (MASK_SET_BIT, MASK_TEST_BIT, MASK_EMPTY): Same. |
| (check_defs): Add comment. |
| (can_skip_redundant_opnd): Update comment. |
| (compute_uninit_opnds_pos): Adjust to namespace change. |
| (find_pdom): Move to gimple-predicate-analysis.cc. |
| (find_dom): Same. |
| (struct uninit_undef_val_t): New. |
| (is_non_loop_exit_postdominating): Move to gimple-predicate-analysis.cc. |
| (find_control_equiv_block): Same. |
| (MAX_NUM_CHAINS, MAX_CHAIN_LEN, MAX_POSTDOM_CHECK): Same. |
| (MAX_SWITCH_CASES): Same. |
| (compute_control_dep_chain): Same. |
| (find_uninit_use): Use predicate analyzer. |
| (struct pred_info): Move to gimple-predicate-analysis. |
| (convert_control_dep_chain_into_preds): Same. |
| (find_predicates): Same. |
| (collect_phi_def_edges): Same. |
| (warn_uninitialized_phi): Use predicate analyzer. |
| (find_def_preds): Move to gimple-predicate-analysis. |
| (dump_pred_info): Same. |
| (dump_pred_chain): Same. |
| (dump_predicates): Same. |
| (destroy_predicate_vecs): Remove. |
| (execute_late_warn_uninitialized): New. |
| (get_cmp_code): Move to gimple-predicate-analysis. |
| (is_value_included_in): Same. |
| (value_sat_pred_p): Same. |
| (find_matching_predicate_in_rest_chains): Same. |
| (is_use_properly_guarded): Same. |
| (prune_uninit_phi_opnds): Same. |
| (find_var_cmp_const): Same. |
| (use_pred_not_overlap_with_undef_path_pred): Same. |
| (pred_equal_p): Same. |
| (is_neq_relop_p): Same. |
| (is_neq_zero_form_p): Same. |
| (pred_expr_equal_p): Same. |
| (is_pred_expr_subset_of): Same. |
| (is_pred_chain_subset_of): Same. |
| (is_included_in): Same. |
| (is_superset_of): Same. |
| (pred_neg_p): Same. |
| (simplify_pred): Same. |
| (simplify_preds_2): Same. |
| (simplify_preds_3): Same. |
| (simplify_preds_4): Same. |
| (simplify_preds): Same. |
| (push_pred): Same. |
| (push_to_worklist): Same. |
| (get_pred_info_from_cmp): Same. |
| (is_degenerated_phi): Same. |
| (normalize_one_pred_1): Same. |
| (normalize_one_pred): Same. |
| (normalize_one_pred_chain): Same. |
| (normalize_preds): Same. |
| (can_one_predicate_be_invalidated_p): Same. |
| (can_chain_union_be_invalidated_p): Same. |
| (uninit_uses_cannot_happen): Same. |
| (pass_late_warn_uninitialized::execute): Define. |
| * gimple-predicate-analysis.cc: New file. |
| * gimple-predicate-analysis.h: New file. |
| |
| 2021-09-17 Julian Brown <julian@codesourcery.com> |
| |
| * config/gcn/gcn.c (gimple.h): Include. |
| (gcn_fork_join): Emit barrier for worker-level joins. |
| * omp-oacc-neuter-broadcast.cc (find_local_vars_to_propagate): Add |
| writes_gang_private bitmap parameter. Set bit for blocks |
| containing gang-private variable writes. |
| (worker_single_simple): Don't emit barrier after predicated block. |
| (worker_single_copy): Don't emit barrier if we're not broadcasting |
| anything and the block contains no gang-private writes. |
| (neuter_worker_single): Don't predicate blocks that only contain |
| NOPs or internal marker functions. Pass has_gang_private_write |
| argument to worker_single_copy. |
| (oacc_do_neutering): Add writes_gang_private bitmap handling. |
| |
| 2021-09-17 Julian Brown <julian@codesourcery.com> |
| |
| * config/gcn/gcn-protos.h |
| (gcn_goacc_create_worker_broadcast_record): Update prototype. |
| * config/gcn/gcn-tree.c (gcn_goacc_get_worker_red_decl): Use |
| preallocated block of LDS memory. Do not cache/share decls for |
| reduction temporaries between invocations. |
| (gcn_goacc_reduction_teardown): Unshare VAR on second use. |
| (gcn_goacc_create_worker_broadcast_record): Add OFFSET parameter |
| and return temporary LDS space at that offset. Return pointer in |
| "sender" case. |
| * config/gcn/gcn.c (acc_lds_size, gang_private_hwm, lds_allocs): |
| New global vars. |
| (ACC_LDS_SIZE): Define as acc_lds_size. |
| (gcn_init_machine_status): Don't initialise lds_allocated, |
| lds_allocs, reduc_decls fields of machine function struct. |
| (gcn_option_override): Handle default size for gang-private |
| variables and -mgang-private-size option. |
| (gcn_expand_prologue): Use LDS_SIZE instead of LDS_SIZE-1 when |
| initialising M0_REG. |
| (gcn_shared_mem_layout): New function. |
| (gcn_print_lds_decl): Update comment. Use global lds_allocs map and |
| gang_private_hwm variable. |
| (TARGET_GOACC_SHARED_MEM_LAYOUT): Define target hook. |
| * config/gcn/gcn.h (machine_function): Remove lds_allocated, |
| lds_allocs, reduc_decls. Add reduction_base, reduction_limit. |
| * config/gcn/gcn.opt (gang_private_size_opt): New global. |
| (mgang-private-size=): New option. |
| * doc/tm.texi.in (TARGET_GOACC_SHARED_MEM_LAYOUT): Place |
| documentation hook. |
| * doc/tm.texi: Regenerate. |
| * omp-oacc-neuter-broadcast.cc (targhooks.h, diagnostic-core.h): |
| Add includes. |
| (build_sender_ref): Handle sender_decl being pointer. |
| (worker_single_copy): Add PLACEMENT and ISOLATE_BROADCASTS |
| parameters. Pass placement argument to |
| create_worker_broadcast_record hook invocations. Handle |
| sender_decl being pointer and isolate_broadcasts inserting extra |
| barriers. |
| (blk_offset_map_t): Add typedef. |
| (neuter_worker_single): Add BLK_OFFSET_MAP parameter. Pass |
| preallocated range to worker_single_copy call. |
| (dfs_broadcast_reachable_1): New function. |
| (idx_decl_pair_t, used_range_vec_t): New typedefs. |
| (sort_size_descending): New function. |
| (addr_range): New class. |
| (splay_tree_compare_addr_range, splay_tree_free_key) |
| (first_fit_range, merge_ranges_1, merge_ranges): New functions. |
| (execute_omp_oacc_neuter_broadcast): Rename to... |
| (oacc_do_neutering): ... this. Add BOUNDS_LO, BOUNDS_HI |
| parameters. Arrange layout of shared memory for broadcast |
| operations. |
| (execute_omp_oacc_neuter_broadcast): New function. |
| (pass_omp_oacc_neuter_broadcast::gate): Remove num_workers==1 |
| handling from here. Enable pass for all OpenACC routines in order |
| to call shared memory-layout hook. |
| * target.def (create_worker_broadcast_record): Add OFFSET |
| parameter. |
| (shared_mem_layout): New hook. |
| |
| 2021-09-17 Julian Brown <julian@codesourcery.com> |
| Thomas Schwinge <thomas@codesourcery.com> |
| |
| * omp-oacc-neuter-broadcast.cc |
| (pass_omp_oacc_neuter_broadcast::gate): Disable if num_workers is |
| 1. |
| (execute_omp_oacc_neuter_broadcast): Adjust. |
| |
| 2021-09-17 Andrew MacLeod <amacleod@redhat.com> |
| |
| * value-relation.cc (class equiv_chain): Move to header file. |
| (path_oracle::path_oracle): New. |
| (path_oracle::~path_oracle): New. |
| (path_oracle::register_relation): New. |
| (path_oracle::query_relation): New. |
| (path_oracle::reset_path): New. |
| (path_oracle::dump): New. |
| * value-relation.h (class equiv_chain): Move to here. |
| (class path_oracle): New. |
| |
| 2021-09-17 Andrew MacLeod <amacleod@redhat.com> |
| |
| * gimple-range-cache.cc (ranger_cache::ranger_cache): Create a DOM |
| based oracle. |
| * gimple-range-fold.cc (fur_depend::register_relation): Use |
| register_stmt/edge routines. |
| * value-relation.cc (equiv_chain::find): Relocate from equiv_oracle. |
| (equiv_oracle::equiv_oracle): Create self equivalence cache. |
| (equiv_oracle::~equiv_oracle): Release same. |
| (equiv_oracle::equiv_set): Return entry from self equiv cache if there |
| are no equivalences. |
| (equiv_oracle::find_equiv_block): Move list find to equiv_chain. |
| (equiv_oracle::register_relation): Rename from register_equiv. |
| (relation_chain_head::find_relation): Relocate from dom_oracle. |
| (relation_oracle::register_stmt): New. |
| (relation_oracle::register_edge): New. |
| (dom_oracle::*): Rename from relation_oracle. |
| (dom_oracle::register_relation): Adjust to call equiv_oracle. |
| (dom_oracle::set_one_relation): Split from register_relation. |
| (dom_oracle::register_transitives): Consolidate 2 methods. |
| (dom_oracle::find_relation_block): Move core to relation_chain. |
| (dom_oracle::query_relation): Rename from find_relation_dom and adjust. |
| * value-relation.h (class relation_oracle): New pure virtual base. |
| (class equiv_oracle): Inherit from relation_oracle and adjust. |
| (class dom_oracle): Rename from old relation_oracle and adjust. |
| |
| 2021-09-17 Martin Sebor <msebor@redhat.com> |
| |
| PR middle-end/102200 |
| * pointer-query.cc (access_ref::inform_access): Handle MIN/MAX_EXPR. |
| (handle_min_max_size): Change argument. Store original SSA_NAME for |
| operands to potentially distinct (sub)objects. |
| (compute_objsize_r): Adjust call to the above. |
| |
| 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com> |
| |
| * config/rs6000/rs6000.c (rs6000-builtins.h): New include. |
| (rs6000_new_builtin_vectorized_function): New function. |
| (rs6000_new_builtin_md_vectorized_function): Likewise. |
| (rs6000_builtin_vectorized_function): Call |
| rs6000_new_builtin_vectorized_function. |
| (rs6000_builtin_md_vectorized_function): Call |
| rs6000_new_builtin_md_vectorized_function. |
| |
| 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com> |
| |
| * config/rs6000/rs6000-builtin-new.def (ASSEMBLE_ACC): Add mmaint flag. |
| (ASSEMBLE_PAIR): Likewise. |
| (BUILD_ACC): Likewise. |
| (DISASSEMBLE_ACC): Likewise. |
| (DISASSEMBLE_PAIR): Likewise. |
| (PMXVBF16GER2): Likewise. |
| (PMXVBF16GER2NN): Likewise. |
| (PMXVBF16GER2NP): Likewise. |
| (PMXVBF16GER2PN): Likewise. |
| (PMXVBF16GER2PP): Likewise. |
| (PMXVF16GER2): Likewise. |
| (PMXVF16GER2NN): Likewise. |
| (PMXVF16GER2NP): Likewise. |
| (PMXVF16GER2PN): Likewise. |
| (PMXVF16GER2PP): Likewise. |
| (PMXVF32GER): Likewise. |
| (PMXVF32GERNN): Likewise. |
| (PMXVF32GERNP): Likewise. |
| (PMXVF32GERPN): Likewise. |
| (PMXVF32GERPP): Likewise. |
| (PMXVF64GER): Likewise. |
| (PMXVF64GERNN): Likewise. |
| (PMXVF64GERNP): Likewise. |
| (PMXVF64GERPN): Likewise. |
| (PMXVF64GERPP): Likewise. |
| (PMXVI16GER2): Likewise. |
| (PMXVI16GER2PP): Likewise. |
| (PMXVI16GER2S): Likewise. |
| (PMXVI16GER2SPP): Likewise. |
| (PMXVI4GER8): Likewise. |
| (PMXVI4GER8PP): Likewise. |
| (PMXVI8GER4): Likewise. |
| (PMXVI8GER4PP): Likewise. |
| (PMXVI8GER4SPP): Likewise. |
| (XVBF16GER2): Likewise. |
| (XVBF16GER2NN): Likewise. |
| (XVBF16GER2NP): Likewise. |
| (XVBF16GER2PN): Likewise. |
| (XVBF16GER2PP): Likewise. |
| (XVF16GER2): Likewise. |
| (XVF16GER2NN): Likewise. |
| (XVF16GER2NP): Likewise. |
| (XVF16GER2PN): Likewise. |
| (XVF16GER2PP): Likewise. |
| (XVF32GER): Likewise. |
| (XVF32GERNN): Likewise. |
| (XVF32GERNP): Likewise. |
| (XVF32GERPN): Likewise. |
| (XVF32GERPP): Likewise. |
| (XVF64GER): Likewise. |
| (XVF64GERNN): Likewise. |
| (XVF64GERNP): Likewise. |
| (XVF64GERPN): Likewise. |
| (XVF64GERPP): Likewise. |
| (XVI16GER2): Likewise. |
| (XVI16GER2PP): Likewise. |
| (XVI16GER2S): Likewise. |
| (XVI16GER2SPP): Likewise. |
| (XVI4GER8): Likewise. |
| (XVI4GER8PP): Likewise. |
| (XVI8GER4): Likewise. |
| (XVI8GER4PP): Likewise. |
| (XVI8GER4SPP): Likewise. |
| (XXMFACC): Likewise. |
| (XXMTACC): Likewise. |
| (XXSETACCZ): Likewise. |
| (ASSEMBLE_PAIR_V): Likewise. |
| (BUILD_PAIR): Likewise. |
| (DISASSEMBLE_PAIR_V): Likewise. |
| (LXVP): New. |
| (STXVP): New. |
| * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_mma_builtin): |
| Handle RS6000_BIF_LXVP and RS6000_BIF_STXVP. |
| * config/rs6000/rs6000-gen-builtins.c (attrinfo): Add ismmaint. |
| (parse_bif_attrs): Handle ismmaint. |
| (write_decls): Add bif_mmaint_bit and bif_is_mmaint. |
| (write_bif_static_init): Handle ismmaint. |
| |
| 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com> |
| |
| * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_builtin): New |
| forward decl. |
| (rs6000_gimple_fold_builtin): Call rs6000_gimple_fold_new_builtin. |
| (rs6000_new_builtin_valid_without_lhs): New function. |
| (rs6000_gimple_fold_new_mma_builtin): Likewise. |
| (rs6000_gimple_fold_new_builtin): Likewise. |
| |
| 2021-09-17 Thomas Schwinge <thomas@codesourcery.com> |
| |
| * hash-table.h (hash_table<Descriptor, Lazy, Allocator>::expand): |
| Destruct stale Value objects. |
| * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor_expand): |
| Update. |
| |
| 2021-09-17 Roger Sayle <roger@nextmovesoftware.com> |
| |
| PR c/102245 |
| * match.pd (shift optimizations): Disable recent sign-changing |
| optimization for shifts by zero, these will be folded later. |
| |
| 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com> |
| |
| * config/rs6000/rs6000-builtin-new.def (__builtin_mffsl): Move from |
| [power9] to [always]. |
| |
| 2021-09-17 Richard Biener <rguenther@suse.de> |
| |
| * tree-vect-stmts.c (vectorizable_load): Do not frob |
| stmt_info for SLP. |
| |
| 2021-09-17 H.J. Lu <hjl.tools@gmail.com> |
| |
| * config/i386/i386-features.c (remove_partial_avx_dependency): |
| Also check TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY and |
| and TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY before generating |
| vxorps. |
| * config/i386/i386.h (TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY): |
| New. |
| (TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise. |
| * config/i386/i386.md (SSE FP to FP splitters): Replace |
| TARGET_SSE_PARTIAL_REG_DEPENDENCY with |
| TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY. |
| (SSE INT to FP splitter): Replace TARGET_SSE_PARTIAL_REG_DEPENDENCY |
| with TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY. |
| * config/i386/x86-tune.def |
| (X86_TUNE_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY): New. |
| (X86_TUNE_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise. |
| |
| 2021-09-17 H.J. Lu <hjl.tools@gmail.com> |
| |
| PR target/101900 |
| * config/i386/i386-features.c (remove_partial_avx_dependency): |
| Check TARGET_USE_VECTOR_FP_CONVERTS and TARGET_USE_VECTOR_CONVERTS |
| before generating vxorps. |
| |
| 2021-09-17 H.J. Lu <hjl.tools@gmail.com> |
| |
| * config/i386/i386-options.c (processor_cost_table): Use |
| tremont_cost for Tremont. |
| * config/i386/x86-tune-costs.h (tremont_memcpy): New. |
| (tremont_memset): Likewise. |
| (tremont_cost): Likewise. |
| * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB): |
| Enable for Tremont. |
| |
| 2021-09-17 H.J. Lu <hjl.tools@gmail.com> |
| |
| * common/config/i386/i386-common.c: Use Haswell scheduling model |
| for Tremont. |
| * config/i386/i386.c (ix86_sched_init_global): Prepare for Tremont |
| scheduling pass. |
| * config/i386/x86-tune-sched.c (ix86_issue_rate): Change Tremont |
| issue rate to 4. |
| (ix86_adjust_cost): Handle Tremont. |
| * config/i386/x86-tune.def (X86_TUNE_SSE_PARTIAL_REG_DEPENDENCY): |
| Enable for Tremont. |
| (X86_TUNE_USE_LEAVE): Likewise. |
| (X86_TUNE_PUSH_MEMORY): Likewise. |
| (X86_TUNE_MISALIGNED_MOVE_STRING_PRO_EPILOGUES): Likewise. |
| (X86_TUNE_USE_CLTD): Likewise. |
| (X86_TUNE_AVOID_FALSE_DEP_FOR_BMI): Likewise. |
| (X86_TUNE_AVOID_MFENCE): Likewise. |
| (X86_TUNE_SSE_TYPELESS_STORES): Likewise. |
| (X86_TUNE_SSE_LOAD0_BY_PXOR): Likewise. |
| (X86_TUNE_ACCUMULATE_OUTGOING_ARGS): Disable for Tremont. |
| (X86_TUNE_FOUR_JUMP_LIMIT): Likewise. |
| (X86_TUNE_OPT_AGU): Likewise. |
| (X86_TUNE_AVOID_LEA_FOR_ADDR): Likewise. |
| (X86_TUNE_AVOID_MEM_OPND_FOR_CMOVE): Likewise. |
| (X86_TUNE_EXPAND_ABS): Likewise. |
| (X86_TUNE_SPLIT_MEM_OPND_FOR_FP_CONVERTS): Likewise. |
| (X86_TUNE_SLOW_PSHUFB): Likewise. |
| |
| 2021-09-17 Eric Botcazou <ebotcazou@adacore.com> |
| |
| PR rtl-optimization/102306 |
| * combine.c (try_combine): Abort the combination if we are about to |
| duplicate volatile references. |
| |
| 2021-09-17 liuhongt <hongtao.liu@intel.com> |
| |
| * config/i386/avx512fp16intrin.h (_mm_undefined_ph): |
| New intrinsic. |
| (_mm256_undefined_ph): Likewise. |
| (_mm512_undefined_ph): Likewise. |
| (_mm_cvtsh_h): Likewise. |
| (_mm256_cvtsh_h): Likewise. |
| (_mm512_cvtsh_h): Likewise. |
| (_mm512_castph_ps): Likewise. |
| (_mm512_castph_pd): Likewise. |
| (_mm512_castph_si512): Likewise. |
| (_mm512_castph512_ph128): Likewise. |
| (_mm512_castph512_ph256): Likewise. |
| (_mm512_castph128_ph512): Likewise. |
| (_mm512_castph256_ph512): Likewise. |
| (_mm512_zextph128_ph512): Likewise. |
| (_mm512_zextph256_ph512): Likewise. |
| (_mm512_castps_ph): Likewise. |
| (_mm512_castpd_ph): Likewise. |
| (_mm512_castsi512_ph): Likewise. |
| * config/i386/avx512fp16vlintrin.h (_mm_castph_ps): |
| New intrinsic. |
| (_mm256_castph_ps): Likewise. |
| (_mm_castph_pd): Likewise. |
| (_mm256_castph_pd): Likewise. |
| (_mm_castph_si128): Likewise. |
| (_mm256_castph_si256): Likewise. |
| (_mm_castps_ph): Likewise. |
| (_mm256_castps_ph): Likewise. |
| (_mm_castpd_ph): Likewise. |
| (_mm256_castpd_ph): Likewise. |
| (_mm_castsi128_ph): Likewise. |
| (_mm256_castsi256_ph): Likewise. |
| (_mm256_castph256_ph128): Likewise. |
| (_mm256_castph128_ph256): Likewise. |
| (_mm256_zextph128_ph256): Likewise. |
| |
| 2021-09-17 liuhongt <hongtao.liu@intel.com> |
| |
| * config/i386/avx512fp16intrin.h (_mm_cvtsh_ss): |
| New intrinsic. |
| (_mm_mask_cvtsh_ss): Likewise. |
| (_mm_maskz_cvtsh_ss): Likewise. |
| (_mm_cvtsh_sd): Likewise. |
| (_mm_mask_cvtsh_sd): Likewise. |
| (_mm_maskz_cvtsh_sd): Likewise. |
| (_mm_cvt_roundsh_ss): Likewise. |
| (_mm_mask_cvt_roundsh_ss): Likewise. |
| (_mm_maskz_cvt_roundsh_ss): Likewise. |
| (_mm_cvt_roundsh_sd): Likewise. |
| (_mm_mask_cvt_roundsh_sd): Likewise. |
| (_mm_maskz_cvt_roundsh_sd): Likewise. |
| (_mm_cvtss_sh): Likewise. |
| (_mm_mask_cvtss_sh): Likewise. |
| (_mm_maskz_cvtss_sh): Likewise. |
| (_mm_cvtsd_sh): Likewise. |
| (_mm_mask_cvtsd_sh): Likewise. |
| (_mm_maskz_cvtsd_sh): Likewise. |
| (_mm_cvt_roundss_sh): Likewise. |
| (_mm_mask_cvt_roundss_sh): Likewise. |
| (_mm_maskz_cvt_roundss_sh): Likewise. |
| (_mm_cvt_roundsd_sh): Likewise. |
| (_mm_mask_cvt_roundsd_sh): Likewise. |
| (_mm_maskz_cvt_roundsd_sh): Likewise. |
| * config/i386/i386-builtin-types.def |
| (V8HF_FTYPE_V2DF_V8HF_V8HF_UQI_INT, |
| V8HF_FTYPE_V4SF_V8HF_V8HF_UQI_INT, |
| V2DF_FTYPE_V8HF_V2DF_V2DF_UQI_INT, |
| V4SF_FTYPE_V8HF_V4SF_V4SF_UQI_INT): Add new builtin types. |
| * config/i386/i386-builtin.def: Add corrresponding new builtins. |
| * config/i386/i386-expand.c: Handle new builtin types. |
| * config/i386/sse.md (VF48_128): New mode iterator. |
| (avx512fp16_vcvtsh2<ssescalarmodesuffix><mask_scalar_name><round_saeonly_scalar_name>): |
| New. |
| (avx512fp16_vcvt<ssescalarmodesuffix>2sh<mask_scalar_name><round_scalar_name>): |
| Ditto. |
| |
| 2021-09-17 liuhongt <hongtao.liu@intel.com> |
| |
| * config/i386/avx512fp16intrin.h (_mm512_cvtph_pd): |
| New intrinsic. |
| (_mm512_mask_cvtph_pd): Likewise. |
| (_mm512_maskz_cvtph_pd): Likewise. |
| (_mm512_cvt_roundph_pd): Likewise.
|