-
Notifications
You must be signed in to change notification settings - Fork 12.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[X86][GlobalISel] Support G_FCMP for scalar cases #123598
Open
mahesh-attarde
wants to merge
1
commit into
llvm:main
Choose a base branch
from
mahesh-attarde:gisel_fcmp
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
+1,072
−5
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
@llvm/pr-subscribers-backend-x86 Author: Mahesh-Attarde (mahesh-attarde) ChangesWith this patch, we are enabling comparisons for Long double (80) types on X87 stack. It lowers G_FCMP. Patch is 37.93 KiB, truncated to 20.00 KiB below, full version: https://github.com/llvm/llvm-project/pull/123598.diff 4 Files Affected:
diff --git a/llvm/lib/Target/X86/GISel/X86InstructionSelector.cpp b/llvm/lib/Target/X86/GISel/X86InstructionSelector.cpp
index ee456a11d58441..9c35b1c06b187a 100644
--- a/llvm/lib/Target/X86/GISel/X86InstructionSelector.cpp
+++ b/llvm/lib/Target/X86/GISel/X86InstructionSelector.cpp
@@ -1048,6 +1048,13 @@ bool X86InstructionSelector::selectFCmp(MachineInstr &I,
break;
}
+ assert((LhsReg.isVirtual() && RhsReg.isVirtual()) &&
+ "Both arguments of FCMP need to be virtual!");
+ auto *LhsBank = RBI.getRegBank(LhsReg, MRI, TRI);
+ auto *RhsBank = RBI.getRegBank(RhsReg, MRI, TRI);
+ assert((LhsBank == RhsBank) &&
+ "Both banks assigned to FCMP arguments need to be same!");
+
// Compute the opcode for the CMP instruction.
unsigned OpCmp;
LLT Ty = MRI.getType(LhsReg);
@@ -1055,10 +1062,15 @@ bool X86InstructionSelector::selectFCmp(MachineInstr &I,
default:
return false;
case 32:
- OpCmp = X86::UCOMISSrr;
+ OpCmp = LhsBank->getID() == X86::PSRRegBankID ? X86::UCOM_FpIr32
+ : X86::UCOMISSrr;
break;
case 64:
- OpCmp = X86::UCOMISDrr;
+ OpCmp = LhsBank->getID() == X86::PSRRegBankID ? X86::UCOM_FpIr64
+ : X86::UCOMISDrr;
+ break;
+ case 80:
+ OpCmp = X86::UCOM_FpIr80;
break;
}
diff --git a/llvm/lib/Target/X86/GISel/X86LegalizerInfo.cpp b/llvm/lib/Target/X86/GISel/X86LegalizerInfo.cpp
index bab7fe9d25e441..1136438ee742d1 100644
--- a/llvm/lib/Target/X86/GISel/X86LegalizerInfo.cpp
+++ b/llvm/lib/Target/X86/GISel/X86LegalizerInfo.cpp
@@ -450,7 +450,8 @@ X86LegalizerInfo::X86LegalizerInfo(const X86Subtarget &STI,
getActionDefinitionsBuilder(G_FCMP)
.legalIf([=](const LegalityQuery &Query) {
return (HasSSE1 && typePairInSet(0, 1, {{s8, s32}})(Query)) ||
- (HasSSE2 && typePairInSet(0, 1, {{s8, s64}})(Query));
+ (HasSSE2 && typePairInSet(0, 1, {{s8, s64}})(Query)) ||
+ (UseX87 && typePairInSet(0, 1, {{s8, s80}})(Query));
})
.clampScalar(0, s8, s8)
.clampScalar(1, s32, HasSSE2 ? s64 : s32)
diff --git a/llvm/lib/Target/X86/GISel/X86RegisterBankInfo.cpp b/llvm/lib/Target/X86/GISel/X86RegisterBankInfo.cpp
index 43c0145ec8e2ad..42faf4299c6d5d 100644
--- a/llvm/lib/Target/X86/GISel/X86RegisterBankInfo.cpp
+++ b/llvm/lib/Target/X86/GISel/X86RegisterBankInfo.cpp
@@ -321,8 +321,8 @@ X86RegisterBankInfo::getInstrMapping(const MachineInstr &MI) const {
unsigned Size = Ty1.getSizeInBits();
(void)Size;
- assert((Size == 32 || Size == 64) && "Unsupported size for G_FCMP");
-
+ assert((Size == 32 || Size == 64 || Size == 80) &&
+ "Unsupported size for G_FCMP");
auto FpRegBank = getPartialMappingIdx(MI, Ty1, /* isFP= */ true);
OpRegBankIdx = {PMI_GPR8,
/* Predicate */ PMI_None, FpRegBank, FpRegBank};
diff --git a/llvm/test/CodeGen/X86/isel-fcmp-x87.ll b/llvm/test/CodeGen/X86/isel-fcmp-x87.ll
new file mode 100644
index 00000000000000..c46b354c9c410e
--- /dev/null
+++ b/llvm/test/CodeGen/X86/isel-fcmp-x87.ll
@@ -0,0 +1,1054 @@
+; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
+; RUN: llc < %s -mtriple=x86_64-apple-darwin10 -mattr=+x87,-sse,-sse2 -verify-machineinstrs | FileCheck %s --check-prefix=SDAG-X64
+; RUN: llc < %s -fast-isel -fast-isel-abort=1 -mtriple=x86_64-apple-darwin10 -mattr=+x87,-sse,-sse2 -verify-machineinstrs | FileCheck %s --check-prefix=FAST-X64
+; RUN: llc < %s -global-isel -global-isel-abort=1 -mtriple=x86_64-apple-darwin10 -mattr=+x87,-sse,-sse2 -verify-machineinstrs | FileCheck %s --check-prefixes=GISEL-X64
+; RUN: llc < %s -mtriple=i686-apple-darwin10 -mattr=+x87,-sse,-sse2 -verify-machineinstrs | FileCheck %s --check-prefixes=SDAG-X86
+; Allow fast-isel to fallback to selection dag on x86
+; RUN: llc < %s -fast-isel -mtriple=i686-apple-darwin10 -mattr=+x87,-sse,-sse2 -verify-machineinstrs | FileCheck %s --check-prefixes=FAST-X86
+; RUN: llc < %s -global-isel -global-isel-abort=1 -mtriple=i686-apple-darwin10 -mattr=+x87,-sse,-sse2 -verify-machineinstrs | FileCheck %s --check-prefixes=GISEL-X86
+
+ define i1 @fcmp_x86_fp80_oeq(x86_fp80 %x, x86_fp80 %y) {
+; SDAG-X64-LABEL: fcmp_x86_fp80_oeq:
+; SDAG-X64: ## %bb.0:
+; SDAG-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; SDAG-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; SDAG-X64-NEXT: fucompi %st(1), %st
+; SDAG-X64-NEXT: fstp %st(0)
+; SDAG-X64-NEXT: setnp %cl
+; SDAG-X64-NEXT: sete %al
+; SDAG-X64-NEXT: andb %cl, %al
+; SDAG-X64-NEXT: retq
+;
+; FAST-X64-LABEL: fcmp_x86_fp80_oeq:
+; FAST-X64: ## %bb.0:
+; FAST-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; FAST-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; FAST-X64-NEXT: fucompi %st(1), %st
+; FAST-X64-NEXT: fstp %st(0)
+; FAST-X64-NEXT: setnp %cl
+; FAST-X64-NEXT: sete %al
+; FAST-X64-NEXT: andb %cl, %al
+; FAST-X64-NEXT: retq
+;
+; GISEL-X64-LABEL: fcmp_x86_fp80_oeq:
+; GISEL-X64: ## %bb.0:
+; GISEL-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; GISEL-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; GISEL-X64-NEXT: fxch %st(1)
+; GISEL-X64-NEXT: fucompi %st(1), %st
+; GISEL-X64-NEXT: fstp %st(0)
+; GISEL-X64-NEXT: sete %cl
+; GISEL-X64-NEXT: setnp %al
+; GISEL-X64-NEXT: andb %cl, %al
+; GISEL-X64-NEXT: retq
+;
+; SDAG-X86-LABEL: fcmp_x86_fp80_oeq:
+; SDAG-X86: ## %bb.0:
+; SDAG-X86-NEXT: subl $12, %esp
+; SDAG-X86-NEXT: .cfi_def_cfa_offset 16
+; SDAG-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; SDAG-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; SDAG-X86-NEXT: fucompp
+; SDAG-X86-NEXT: fnstsw %ax
+; SDAG-X86-NEXT: ## kill: def $ah killed $ah killed $ax
+; SDAG-X86-NEXT: sahf
+; SDAG-X86-NEXT: setnp %cl
+; SDAG-X86-NEXT: sete %al
+; SDAG-X86-NEXT: andb %cl, %al
+; SDAG-X86-NEXT: addl $12, %esp
+; SDAG-X86-NEXT: retl
+;
+; FAST-X86-LABEL: fcmp_x86_fp80_oeq:
+; FAST-X86: ## %bb.0:
+; FAST-X86-NEXT: subl $12, %esp
+; FAST-X86-NEXT: .cfi_def_cfa_offset 16
+; FAST-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; FAST-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; FAST-X86-NEXT: fucompp
+; FAST-X86-NEXT: fnstsw %ax
+; FAST-X86-NEXT: ## kill: def $ah killed $ah killed $ax
+; FAST-X86-NEXT: sahf
+; FAST-X86-NEXT: setnp %cl
+; FAST-X86-NEXT: sete %al
+; FAST-X86-NEXT: andb %cl, %al
+; FAST-X86-NEXT: addl $12, %esp
+; FAST-X86-NEXT: retl
+;
+; GISEL-X86-LABEL: fcmp_x86_fp80_oeq:
+; GISEL-X86: ## %bb.0:
+; GISEL-X86-NEXT: subl $12, %esp
+; GISEL-X86-NEXT: .cfi_def_cfa_offset 16
+; GISEL-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; GISEL-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; GISEL-X86-NEXT: fxch %st(1)
+; GISEL-X86-NEXT: fucompi %st(1), %st
+; GISEL-X86-NEXT: fstp %st(0)
+; GISEL-X86-NEXT: sete %cl
+; GISEL-X86-NEXT: setnp %al
+; GISEL-X86-NEXT: andb %cl, %al
+; GISEL-X86-NEXT: addl $12, %esp
+; GISEL-X86-NEXT: retl
+ %1 = fcmp oeq x86_fp80 %x, %y
+ ret i1 %1
+ }
+
+ define i1 @fcmp_x86_fp80_ogt(x86_fp80 %x, x86_fp80 %y) {
+; SDAG-X64-LABEL: fcmp_x86_fp80_ogt:
+; SDAG-X64: ## %bb.0:
+; SDAG-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; SDAG-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; SDAG-X64-NEXT: fucompi %st(1), %st
+; SDAG-X64-NEXT: fstp %st(0)
+; SDAG-X64-NEXT: seta %al
+; SDAG-X64-NEXT: retq
+;
+; FAST-X64-LABEL: fcmp_x86_fp80_ogt:
+; FAST-X64: ## %bb.0:
+; FAST-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; FAST-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; FAST-X64-NEXT: fucompi %st(1), %st
+; FAST-X64-NEXT: fstp %st(0)
+; FAST-X64-NEXT: seta %al
+; FAST-X64-NEXT: retq
+;
+; GISEL-X64-LABEL: fcmp_x86_fp80_ogt:
+; GISEL-X64: ## %bb.0:
+; GISEL-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; GISEL-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; GISEL-X64-NEXT: fxch %st(1)
+; GISEL-X64-NEXT: fucompi %st(1), %st
+; GISEL-X64-NEXT: fstp %st(0)
+; GISEL-X64-NEXT: seta %al
+; GISEL-X64-NEXT: retq
+;
+; SDAG-X86-LABEL: fcmp_x86_fp80_ogt:
+; SDAG-X86: ## %bb.0:
+; SDAG-X86-NEXT: subl $12, %esp
+; SDAG-X86-NEXT: .cfi_def_cfa_offset 16
+; SDAG-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; SDAG-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; SDAG-X86-NEXT: fucompp
+; SDAG-X86-NEXT: fnstsw %ax
+; SDAG-X86-NEXT: ## kill: def $ah killed $ah killed $ax
+; SDAG-X86-NEXT: sahf
+; SDAG-X86-NEXT: seta %al
+; SDAG-X86-NEXT: addl $12, %esp
+; SDAG-X86-NEXT: retl
+;
+; FAST-X86-LABEL: fcmp_x86_fp80_ogt:
+; FAST-X86: ## %bb.0:
+; FAST-X86-NEXT: subl $12, %esp
+; FAST-X86-NEXT: .cfi_def_cfa_offset 16
+; FAST-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; FAST-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; FAST-X86-NEXT: fucompp
+; FAST-X86-NEXT: fnstsw %ax
+; FAST-X86-NEXT: ## kill: def $ah killed $ah killed $ax
+; FAST-X86-NEXT: sahf
+; FAST-X86-NEXT: seta %al
+; FAST-X86-NEXT: addl $12, %esp
+; FAST-X86-NEXT: retl
+;
+; GISEL-X86-LABEL: fcmp_x86_fp80_ogt:
+; GISEL-X86: ## %bb.0:
+; GISEL-X86-NEXT: subl $12, %esp
+; GISEL-X86-NEXT: .cfi_def_cfa_offset 16
+; GISEL-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; GISEL-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; GISEL-X86-NEXT: fxch %st(1)
+; GISEL-X86-NEXT: fucompi %st(1), %st
+; GISEL-X86-NEXT: fstp %st(0)
+; GISEL-X86-NEXT: seta %al
+; GISEL-X86-NEXT: addl $12, %esp
+; GISEL-X86-NEXT: retl
+ %1 = fcmp ogt x86_fp80 %x, %y
+ ret i1 %1
+ }
+
+ define i1 @fcmp_x86_fp80_oge(x86_fp80 %x, x86_fp80 %y) {
+; SDAG-X64-LABEL: fcmp_x86_fp80_oge:
+; SDAG-X64: ## %bb.0:
+; SDAG-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; SDAG-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; SDAG-X64-NEXT: fucompi %st(1), %st
+; SDAG-X64-NEXT: fstp %st(0)
+; SDAG-X64-NEXT: setae %al
+; SDAG-X64-NEXT: retq
+;
+; FAST-X64-LABEL: fcmp_x86_fp80_oge:
+; FAST-X64: ## %bb.0:
+; FAST-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; FAST-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; FAST-X64-NEXT: fucompi %st(1), %st
+; FAST-X64-NEXT: fstp %st(0)
+; FAST-X64-NEXT: setae %al
+; FAST-X64-NEXT: retq
+;
+; GISEL-X64-LABEL: fcmp_x86_fp80_oge:
+; GISEL-X64: ## %bb.0:
+; GISEL-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; GISEL-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; GISEL-X64-NEXT: fxch %st(1)
+; GISEL-X64-NEXT: fucompi %st(1), %st
+; GISEL-X64-NEXT: fstp %st(0)
+; GISEL-X64-NEXT: setae %al
+; GISEL-X64-NEXT: retq
+;
+; SDAG-X86-LABEL: fcmp_x86_fp80_oge:
+; SDAG-X86: ## %bb.0:
+; SDAG-X86-NEXT: subl $12, %esp
+; SDAG-X86-NEXT: .cfi_def_cfa_offset 16
+; SDAG-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; SDAG-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; SDAG-X86-NEXT: fucompp
+; SDAG-X86-NEXT: fnstsw %ax
+; SDAG-X86-NEXT: ## kill: def $ah killed $ah killed $ax
+; SDAG-X86-NEXT: sahf
+; SDAG-X86-NEXT: setae %al
+; SDAG-X86-NEXT: addl $12, %esp
+; SDAG-X86-NEXT: retl
+;
+; FAST-X86-LABEL: fcmp_x86_fp80_oge:
+; FAST-X86: ## %bb.0:
+; FAST-X86-NEXT: subl $12, %esp
+; FAST-X86-NEXT: .cfi_def_cfa_offset 16
+; FAST-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; FAST-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; FAST-X86-NEXT: fucompp
+; FAST-X86-NEXT: fnstsw %ax
+; FAST-X86-NEXT: ## kill: def $ah killed $ah killed $ax
+; FAST-X86-NEXT: sahf
+; FAST-X86-NEXT: setae %al
+; FAST-X86-NEXT: addl $12, %esp
+; FAST-X86-NEXT: retl
+;
+; GISEL-X86-LABEL: fcmp_x86_fp80_oge:
+; GISEL-X86: ## %bb.0:
+; GISEL-X86-NEXT: subl $12, %esp
+; GISEL-X86-NEXT: .cfi_def_cfa_offset 16
+; GISEL-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; GISEL-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; GISEL-X86-NEXT: fxch %st(1)
+; GISEL-X86-NEXT: fucompi %st(1), %st
+; GISEL-X86-NEXT: fstp %st(0)
+; GISEL-X86-NEXT: setae %al
+; GISEL-X86-NEXT: addl $12, %esp
+; GISEL-X86-NEXT: retl
+ %1 = fcmp oge x86_fp80 %x, %y
+ ret i1 %1
+ }
+
+ define i1 @fcmp_x86_fp80_olt(x86_fp80 %x, x86_fp80 %y) {
+; SDAG-X64-LABEL: fcmp_x86_fp80_olt:
+; SDAG-X64: ## %bb.0:
+; SDAG-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; SDAG-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; SDAG-X64-NEXT: fucompi %st(1), %st
+; SDAG-X64-NEXT: fstp %st(0)
+; SDAG-X64-NEXT: seta %al
+; SDAG-X64-NEXT: retq
+;
+; FAST-X64-LABEL: fcmp_x86_fp80_olt:
+; FAST-X64: ## %bb.0:
+; FAST-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; FAST-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; FAST-X64-NEXT: fxch %st(1)
+; FAST-X64-NEXT: fucompi %st(1), %st
+; FAST-X64-NEXT: fstp %st(0)
+; FAST-X64-NEXT: seta %al
+; FAST-X64-NEXT: retq
+;
+; GISEL-X64-LABEL: fcmp_x86_fp80_olt:
+; GISEL-X64: ## %bb.0:
+; GISEL-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; GISEL-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; GISEL-X64-NEXT: fucompi %st(1), %st
+; GISEL-X64-NEXT: fstp %st(0)
+; GISEL-X64-NEXT: seta %al
+; GISEL-X64-NEXT: retq
+;
+; SDAG-X86-LABEL: fcmp_x86_fp80_olt:
+; SDAG-X86: ## %bb.0:
+; SDAG-X86-NEXT: subl $12, %esp
+; SDAG-X86-NEXT: .cfi_def_cfa_offset 16
+; SDAG-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; SDAG-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; SDAG-X86-NEXT: fucompp
+; SDAG-X86-NEXT: fnstsw %ax
+; SDAG-X86-NEXT: ## kill: def $ah killed $ah killed $ax
+; SDAG-X86-NEXT: sahf
+; SDAG-X86-NEXT: seta %al
+; SDAG-X86-NEXT: addl $12, %esp
+; SDAG-X86-NEXT: retl
+;
+; FAST-X86-LABEL: fcmp_x86_fp80_olt:
+; FAST-X86: ## %bb.0:
+; FAST-X86-NEXT: subl $12, %esp
+; FAST-X86-NEXT: .cfi_def_cfa_offset 16
+; FAST-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; FAST-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; FAST-X86-NEXT: fxch %st(1)
+; FAST-X86-NEXT: fucompp
+; FAST-X86-NEXT: fnstsw %ax
+; FAST-X86-NEXT: ## kill: def $ah killed $ah killed $ax
+; FAST-X86-NEXT: sahf
+; FAST-X86-NEXT: seta %al
+; FAST-X86-NEXT: addl $12, %esp
+; FAST-X86-NEXT: retl
+;
+; GISEL-X86-LABEL: fcmp_x86_fp80_olt:
+; GISEL-X86: ## %bb.0:
+; GISEL-X86-NEXT: subl $12, %esp
+; GISEL-X86-NEXT: .cfi_def_cfa_offset 16
+; GISEL-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; GISEL-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; GISEL-X86-NEXT: fucompi %st(1), %st
+; GISEL-X86-NEXT: fstp %st(0)
+; GISEL-X86-NEXT: seta %al
+; GISEL-X86-NEXT: addl $12, %esp
+; GISEL-X86-NEXT: retl
+ %1 = fcmp olt x86_fp80 %x, %y
+ ret i1 %1
+ }
+
+ define i1 @fcmp_x86_fp80_ole(x86_fp80 %x, x86_fp80 %y) {
+; SDAG-X64-LABEL: fcmp_x86_fp80_ole:
+; SDAG-X64: ## %bb.0:
+; SDAG-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; SDAG-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; SDAG-X64-NEXT: fucompi %st(1), %st
+; SDAG-X64-NEXT: fstp %st(0)
+; SDAG-X64-NEXT: setae %al
+; SDAG-X64-NEXT: retq
+;
+; FAST-X64-LABEL: fcmp_x86_fp80_ole:
+; FAST-X64: ## %bb.0:
+; FAST-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; FAST-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; FAST-X64-NEXT: fxch %st(1)
+; FAST-X64-NEXT: fucompi %st(1), %st
+; FAST-X64-NEXT: fstp %st(0)
+; FAST-X64-NEXT: setae %al
+; FAST-X64-NEXT: retq
+;
+; GISEL-X64-LABEL: fcmp_x86_fp80_ole:
+; GISEL-X64: ## %bb.0:
+; GISEL-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; GISEL-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; GISEL-X64-NEXT: fucompi %st(1), %st
+; GISEL-X64-NEXT: fstp %st(0)
+; GISEL-X64-NEXT: setae %al
+; GISEL-X64-NEXT: retq
+;
+; SDAG-X86-LABEL: fcmp_x86_fp80_ole:
+; SDAG-X86: ## %bb.0:
+; SDAG-X86-NEXT: subl $12, %esp
+; SDAG-X86-NEXT: .cfi_def_cfa_offset 16
+; SDAG-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; SDAG-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; SDAG-X86-NEXT: fucompp
+; SDAG-X86-NEXT: fnstsw %ax
+; SDAG-X86-NEXT: ## kill: def $ah killed $ah killed $ax
+; SDAG-X86-NEXT: sahf
+; SDAG-X86-NEXT: setae %al
+; SDAG-X86-NEXT: addl $12, %esp
+; SDAG-X86-NEXT: retl
+;
+; FAST-X86-LABEL: fcmp_x86_fp80_ole:
+; FAST-X86: ## %bb.0:
+; FAST-X86-NEXT: subl $12, %esp
+; FAST-X86-NEXT: .cfi_def_cfa_offset 16
+; FAST-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; FAST-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; FAST-X86-NEXT: fxch %st(1)
+; FAST-X86-NEXT: fucompp
+; FAST-X86-NEXT: fnstsw %ax
+; FAST-X86-NEXT: ## kill: def $ah killed $ah killed $ax
+; FAST-X86-NEXT: sahf
+; FAST-X86-NEXT: setae %al
+; FAST-X86-NEXT: addl $12, %esp
+; FAST-X86-NEXT: retl
+;
+; GISEL-X86-LABEL: fcmp_x86_fp80_ole:
+; GISEL-X86: ## %bb.0:
+; GISEL-X86-NEXT: subl $12, %esp
+; GISEL-X86-NEXT: .cfi_def_cfa_offset 16
+; GISEL-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; GISEL-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; GISEL-X86-NEXT: fucompi %st(1), %st
+; GISEL-X86-NEXT: fstp %st(0)
+; GISEL-X86-NEXT: setae %al
+; GISEL-X86-NEXT: addl $12, %esp
+; GISEL-X86-NEXT: retl
+ %1 = fcmp ole x86_fp80 %x, %y
+ ret i1 %1
+ }
+
+ define i1 @fcmp_x86_fp80_one(x86_fp80 %x, x86_fp80 %y) {
+; SDAG-X64-LABEL: fcmp_x86_fp80_one:
+; SDAG-X64: ## %bb.0:
+; SDAG-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; SDAG-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; SDAG-X64-NEXT: fucompi %st(1), %st
+; SDAG-X64-NEXT: fstp %st(0)
+; SDAG-X64-NEXT: setne %al
+; SDAG-X64-NEXT: retq
+;
+; FAST-X64-LABEL: fcmp_x86_fp80_one:
+; FAST-X64: ## %bb.0:
+; FAST-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; FAST-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; FAST-X64-NEXT: fucompi %st(1), %st
+; FAST-X64-NEXT: fstp %st(0)
+; FAST-X64-NEXT: setne %al
+; FAST-X64-NEXT: retq
+;
+; GISEL-X64-LABEL: fcmp_x86_fp80_one:
+; GISEL-X64: ## %bb.0:
+; GISEL-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; GISEL-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; GISEL-X64-NEXT: fxch %st(1)
+; GISEL-X64-NEXT: fucompi %st(1), %st
+; GISEL-X64-NEXT: fstp %st(0)
+; GISEL-X64-NEXT: setne %al
+; GISEL-X64-NEXT: retq
+;
+; SDAG-X86-LABEL: fcmp_x86_fp80_one:
+; SDAG-X86: ## %bb.0:
+; SDAG-X86-NEXT: subl $12, %esp
+; SDAG-X86-NEXT: .cfi_def_cfa_offset 16
+; SDAG-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; SDAG-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; SDAG-X86-NEXT: fucompp
+; SDAG-X86-NEXT: fnstsw %ax
+; SDAG-X86-NEXT: ## kill: def $ah killed $ah killed $ax
+; SDAG-X86-NEXT: sahf
+; SDAG-X86-NEXT: setne %al
+; SDAG-X86-NEXT: addl $12, %esp
+; SDAG-X86-NEXT: retl
+;
+; FAST-X86-LABEL: fcmp_x86_fp80_one:
+; FAST-X86: ## %bb.0:
+; FAST-X86-NEXT: subl $12, %esp
+; FAST-X86-NEXT: .cfi_def_cfa_offset 16
+; FAST-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; FAST-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; FAST-X86-NEXT: fucompp
+; FAST-X86-NEXT: fnstsw %ax
+; FAST-X86-NEXT: ## kill: def $ah killed $ah killed $ax
+; FAST-X86-NEXT: sahf
+; FAST-X86-NEXT: setne %al
+; FAST-X86-NEXT: addl $12, %esp
+; FAST-X86-NEXT: retl
+;
+; GISEL-X86-LABEL: fcmp_x86_fp80_one:
+; GISEL-X86: ## %bb.0:
+; GISEL-X86-NEXT: subl $12, %esp
+; GISEL-X86-NEXT: .cfi_def_cfa_offset 16
+; GISEL-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; GISEL-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; GISEL-X86-NEXT: fxch %st(1)
+; GISEL-X86-NEXT: fucompi %st(1), %st
+; GISEL-X86-NEXT: fstp %st(0)
+; GISEL-X86-NEXT: setne %al
+; GISEL-X86-NEXT: addl $12, %esp
+; GISEL-X86-NEXT: retl
+ %1 = fcmp one x86_fp80 %x, %y
+ ret i1 %1
+ }
+
+ define i1 @fcmp_x86_fp80_ord(x86_fp80 %x, x86_fp80 %y) {
+; SDAG-X64-LABEL: fcmp_x86_fp80_ord:
+; SDAG-X64: ## %bb.0:
+; SDAG-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; SDAG-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; SDAG-X64-NEXT: fucompi %st(1), %st
+; SDAG-X64-NEXT: fstp %st(0)
+; SDAG-X64-NEXT: setnp %al
+; SDAG-X64-NEXT: retq
+;
+; FAST-X64-LABEL: fcmp_x86_fp80_ord:
+; FAST-X64: ## %bb.0:
+; FAST-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; FAST-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; FAST-X64-NEXT: fucompi %st(1), %st
+; FAST-X64-NEXT: fstp %st(0)
+; FAST-X64-NEXT: setnp %al
+; FAST-X64-NEXT: retq
+;
+; GISEL-X64-LABEL: fcmp_x86_fp80_ord:
+; GISEL-X64: ## %bb.0:
+; GISEL-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; GISEL-X64-NEXT: fldt {{[0-9]+}}(%rsp)
+; GISEL-X64-NEXT: fxch %st(1)
+; GISEL-X64-NEXT: fucompi %st(1), %st
+; GISEL-X64-NEXT: fstp %st(0)
+; GISEL-X64-NEXT: setnp %al
+; GISEL-X64-NEXT: retq
+;
+; SDAG-X86-LABEL: fcmp_x86_fp80_ord:
+; SDAG-X86: ## %bb.0:
+; SDAG-X86-NEXT: subl $12, %esp
+; SDAG-X86-NEXT: .cfi_def_cfa_offset 16
+; SDAG-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; SDAG-X86-NEXT: fldt {{[0-9]+}}(%esp)
+; SDAG-X86-NEXT: fucompp
+; SDAG-X86-NEXT: fnstsw %ax
+; SDAG-X86-NEXT: ## kill: def $ah killed $ah killed $ax
+; SDAG-X86-NEXT:...
[truncated]
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
With this patch, we are enabling comparisons for Long double (80) types on X87 stack. It lowers G_FCMP.