c9x.me/qbe.git - QBE - Backend Compiler

Age	Commit message (Collapse)	Author
13 days	If-conversion RFC 4 - x86 only (for now), use cmovXX	Roland Paterson-Jones
	Replacement of tiny conditional jump graphlets with conditional move instructions. Currently enabled only for x86. Arm64 support using cselXX will be essentially identical. Adds (internal) frontend sel0/sel1 ops with flag-specific backend xselXX following jnz implementation pattern. Testing: standard QBE, cproc, harec, hare, roland
2025-03-15	Makefile: add explicit target to test the x86_64 backend	Antonio Terceiro
	When developing on an arm64 machine, it's useful to be able to test the x86_64 target.
2025-03-14	Get rid of movins() infra.	Roland Paterson-Jones

2025-03-14	Global Value Numbering / Global Code Motion	Roland Paterson-Jones
	More or less as proposed in its ninth iteration with the addition of a gcmmove() functionality to restore coherent local schedules. Changes since RFC 8: Features: - generalization of phi 1/0 detection - collapse linear jmp chains before GVN; simplifies if-graph detection used in 0/non-0 value inference and if-elim... - infer 0/non-0 values from dominating blk jnz; eliminates redundant cmp eq/ne 0 and associated jnz/blocks, for example redundant null pointer checks (hare codebase likes this) - remove (emergent) empty if-then-else graphlets between GVN and GCM; improves GCM instruction placement, particularly cmps. - merge %addr =l add %addr1, N sequences - reduces tmp count, register pressure. - squash consecutive associative ops with constant args, e.g. t1 = add t, N ... t2 = add t2, M -> t2 = add t, N+M Bug Fixes: - remove "cmp eq/ne of non-identical RCon's " in copyref(). RCon's are not guaranteed to be dedup'ed, and symbols can alias. Codebase: - moved some stuff into cfg.c including blkmerge() - some refactoring in gvn.c - simplification of reassoc.c - always reassoc all cmp ops and Kl add %t, N. Better on coremark, smaller codebase. - minor simplification of movins() - use vins Testing - standard QBE, cproc, hare, harec, coremark [still have Rust build issues with latest roland] Benchmark - coremark is ~15%+ faster than master - hare "HARETEST_INCLUDE='slow' make check" ~8% faster (crypto::sha1::sha1_1gb is biggest obvious win - ~25% faster) Changes since RFC 7: Bug fixes: - remove isbad4gcm() in GVN/GCM - it is unsound due to different state at GVN vs GCM time; replace with "reassociation" pass after GCM - fix intra-blk use-before-def after GCM - prevent GVN from deduping trapping instructions cos GCM will not move them - remove cmp eq/ne identical arg copy detection for floating point, it is not valid for NaN - fix cges/cged flagged as commutative in ops.h instead of cnes/cned respectively; just a typo Minor features: - copy detection handles cmp le/lt/ge/gt with identical args - treat (integer) div/rem by non-zero constant as non-trapping - eliminate add N/sub N pairs in copy detection - maintain accurate tmp use in GVN; not strictly necessary but enables interim global state sanity checking - "reassociation" of trivial constant offset load/store addresses, and cmp ops with point-of-use in pass after GCM - normalise commutative op arg order - e.g. op con, tmp -> op tmp, con to simplify copy detection and GVN instruction dedup Codebase: - split out core copy detection and constant folding (back) out into copy.c, fold.c respectively; gvn.c was getting monolithic - generic support for instruction moving in ins.c - used by GCM and reassoc - new reassociation pass in reassoc.c - other minor clean-up/refactor Changes since RFC 6: - More ext elimination in GVN by examination of def and use bit width - elimination of redundant and mask by bit width examination - Incorporation of Song's patch Changes since RFC 5: - avoidance of "bad" candidates for GVN/GCM - trivial address offset calculations, and comparisons - more copy detection mostly around boolean values - allow elimination of unused load, alloc, trapping instructions - detection of trivial boolean v ? 1 : 0 phi patterns - bug fix for (removal of) "chg" optimisation in ins recreation - it was missing removal of unused instructions in some cases ifelim() between GVN and GCM; deeper nopunused()
2024-04-12	add "make wc"	Quentin Carbonneaux

2024-03-07	declare check-rv64 as a phony make target	Nguyễn Gia Phong
	References: 4e93eeaa3b63b ("add rv64 backend")
2023-12-30	hard-code $(CC) to equal "cc"	Lorenz (xha)
	this fixes the build on FreeBSD, where the value of $(CC) recently changed from "cc" (13.2) to "c99" (14). Signed-off-by: Lorenz (xha) <[email protected]>
2023-09-25	Fix architecture detection on OpenBSD/arm64	Josiah Frentsos

2023-03-16	silence format warning more reliably	Quentin Carbonneaux

2023-03-15	silence some warnings	Quentin Carbonneaux

2022-12-14	new blit instruction	Quentin Carbonneaux

2022-10-03	new arm64_apple target	Quentin Carbonneaux
	Should make qbe work on apple arm-based hardware.
2022-10-03	add new target-specific abi0 pass	Quentin Carbonneaux
	The general idea is to give abis a chance to talk before we've done all the optimizations. Currently, all targets eliminate {par,arg,ret}{sb,ub,...} during this pass. The forthcoming arm64_apple will, however, insert proper extensions during abi0. Moving forward abis can, for example, lower small-aggregates passing there so that memory optimizations can interact better with function calls.
2022-08-31	drop -G flag and add target amd64_apple	Quentin Carbonneaux
	apple support is more than assembly syntax in case of arm64 machines, and apple syntax is currently useless in all cases but amd64; rather than having a -G option that only makes sense with amd64, we add a new target amd64_apple
2022-07-01	Makefile: Avoid double macro expansion in targets	Roberto E. Vargas Caballero
	POSIX specification stays: string1 = [string2] ... Macro expansions in string1 of macro definition lines shall be evaluated when read. Macro expansions in string2 of macro definition lines shall be performed when the macro identified by string1 is expanded in a rule or command. It means that recursive macro expansion is not guaranteed to work in a portable Make. Also, as make is a declarative language makes more sense to declare your targets as a primary concern instead of derivating them from a informational macro like SRC that is only used in a rule command.
2022-06-16	install with install -m755v1.0	Quentin Carbonneaux

2022-06-14	Makefile: POSIXify	illiliti
	Makefile now compatible with gmake, bmake, smake and pdpmake.
2022-05-12	install in /usr/local by default	Quentin Carbonneaux

2022-02-17	add rv64 backend	Michael Forney
	It is mostly complete, but still has a few ABI bugs when passing floats in structs, or when structs are passed partly in register, and partly on stack.
2019-04-26	Fix config.h dependency when OBJDIR != obj	Michael Forney

2019-04-24	Fix default config.h for arm64	Michael Forney

2017-04-08	add handy src target to the Makefile	Quentin Carbonneaux
	Intended usage: c_count `make src` (or similar).
2017-04-08	enable arm64 tests	Quentin Carbonneaux
	The vararg tests had to be changed because va_list is 32-bit wide on arm. The astute reader will notice that the way we pass va_list values is wrong, we should be using the ':valist' type as defined below instead of 'l'. But eh, that works for now, because of the ABI. type :valist = align 8 { 32 }
2017-04-08	add cross testing for arm64	Quentin Carbonneaux

2017-04-08	new arm64 backend, yeepee	Quentin Carbonneaux

2017-04-08	prepare for multi-target	Quentin Carbonneaux
	This big diff does multiple changes to allow the addition of new targets to qbe. The changes are listed below in decreasing order of impact. 1. Add a new Target structure. To add support for a given target, one has to implement all the members of the Target structure. All the source files where changed to use this interface where needed. 2. Single out amd64-specific code. In this commit, the amd64 target T_amd64_sysv is the only target available, it is implemented in the amd64/ directory. All the non-static items in this directory are prefixed with either amd64_ or amd64_sysv (for items that are specific to the System V ABI). 3. Centralize Ops information. There is now a file 'ops.h' that must be used to store all the available operations together with their metadata. The various targets will only select what they need; but it is beneficial that there is only one place to change to add a new instruction. One good side effect of this change is that any operation 'xyz' in the IL now as a corresponding 'Oxyz' in the code. 4. Misc fixes. One notable change is that instruction selection now generates generic comparison operations and the lowering to the target's comparisons is done in the emitter. GAS directives for data are the same for many targets, so data emission was extracted in a file 'gas.c'. 5. Modularize the Makefile. The Makefile now has a list of C files that are target-independent (SRC), and one list of C files per target. Each target can also use its own 'all.h' header (for example to define registers).
2017-02-27	make install/uninstall phonies	Quentin Carbonneaux

2017-02-27	add install and uninstall targets to Makefile	Quentin Rameau

2017-02-25	do sign/zero extensions removal in copy.c	Quentin Carbonneaux

2017-02-24	start a new simplification pass	Quentin Carbonneaux

2016-12-12	implement a simple load elimination pass	Quentin Carbonneaux

2016-12-12	implement a simple alias analysis	Quentin Carbonneaux

2016-12-12	create cfg.c for cfg-related functions	Quentin Carbonneaux

2016-04-13	add funny 80 target	Quentin Carbonneaux

2016-04-09	enable constant folding	Quentin Carbonneaux

2016-04-05	default to gnu gas format	Quentin Carbonneaux

2016-04-05	yay, support freebsd	Quentin Carbonneaux

2016-03-31	respect the order of the passes	Quentin Carbonneaux

2016-03-31	move abi code in a new file	Quentin Carbonneaux

2016-03-29	do not echo compilation commands if verbose	Quentin Carbonneaux

2016-03-29	new layout, put LICENSE in root	Quentin Carbonneaux

2016-03-27	move check rule into src/	Quentin Carbonneaux

2016-03-27	move paper synchronization to top makefile	Quentin Carbonneaux

2016-03-27	add centralized all and clean targets	Quentin Carbonneaux

2016-03-27	use make variable for extensibility	Quentin Carbonneaux

2016-03-27	extract tests out of src	Quentin Carbonneaux

2015-09-15	move ml prototype in a subdir	Quentin Carbonneaux

2015-09-15	do not remove t subdir while cleaning	Quentin Carbonneaux

2015-09-15	change name of test file	Quentin Carbonneaux

2015-09-15	more cleaning	Quentin Carbonneaux