; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041244 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041244
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptioncleavage and polyadenylation specificity factor subunit 3-II
Genome locationchr13:14315943..14323273
RNA-Seq ExpressionLag0041244
SyntenyLag0041244
Gene Ontology termsGO:0010197 - polar nucleus fusion (biological process)
GO:0016180 - snRNA processing (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR011108 - Zn-dependent metallo-hydrolase, RNA specificity domain
IPR022712 - Beta-Casp domain
IPR036866 - Ribonuclease Z/Hydroxyacylglutathione hydrolase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022944195.1 cleavage and polyadenylation specificity factor subunit 3-II isoform X1 [Cucurbita moschata]3.3e-20787.78Show/hide
Query:  LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTK
        LTVQANMYYKMLISWTSQKVKETY+TRNAFDFKNVQ F+RSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSK+NLITLPGYCVAGTIGHKLMSGKPTK
Subjt:  LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTK

Query:  IDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCSTPNF
        IDLDKDTQIDVQCQIHQL+FSPHTD+KGIMDLV FLSPKHVILVHGEKPKM TLKERIHSELGIPCHDPANNET+SISSTL +KAE+SSMFIQSCSTPNF
Subjt:  IDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCSTPNF

Query:  KFLKRNLNNKIDPDLKDASSK-AGASSMLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQNELLLLLGEQEHEVRF
        KFLKRNLN+KI+   K+ SSK  G SSM  RR SNPH KHLNRNLD+KFD SL+C PEL+VSDDRVNEGILVME GK+ KVVHQ+ELLLLLGEQEHEVRF
Subjt:  KFLKRNLNNKIDPDLKDASSK-AGASSMLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQNELLLLLGEQEHEVRF

Query:  ANCIPIYFGSLDETHVIDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENESAVVFWCCSWLVADEILAWKIISI
        ANC PIYFGSLD+THV+DC+SRKS WLSQLSSKLSSELSDRNVQN GEYLQVESFTLSICSKE+CPYRT NRIENESA  F+CCSWLVADE+LAW+IISI
Subjt:  ANCIPIYFGSLDETHVIDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENESAVVFWCCSWLVADEILAWKIISI

Query:  LEKLDLSAT
        LEK DLS+T
Subjt:  LEKLDLSAT

XP_022944196.1 cleavage and polyadenylation specificity factor subunit 3-II isoform X2 [Cucurbita moschata]3.3e-20787.78Show/hide
Query:  LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTK
        LTVQANMYYKMLISWTSQKVKETY+TRNAFDFKNVQ F+RSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSK+NLITLPGYCVAGTIGHKLMSGKPTK
Subjt:  LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTK

Query:  IDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCSTPNF
        IDLDKDTQIDVQCQIHQL+FSPHTD+KGIMDLV FLSPKHVILVHGEKPKM TLKERIHSELGIPCHDPANNET+SISSTL +KAE+SSMFIQSCSTPNF
Subjt:  IDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCSTPNF

Query:  KFLKRNLNNKIDPDLKDASSK-AGASSMLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQNELLLLLGEQEHEVRF
        KFLKRNLN+KI+   K+ SSK  G SSM  RR SNPH KHLNRNLD+KFD SL+C PEL+VSDDRVNEGILVME GK+ KVVHQ+ELLLLLGEQEHEVRF
Subjt:  KFLKRNLNNKIDPDLKDASSK-AGASSMLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQNELLLLLGEQEHEVRF

Query:  ANCIPIYFGSLDETHVIDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENESAVVFWCCSWLVADEILAWKIISI
        ANC PIYFGSLD+THV+DC+SRKS WLSQLSSKLSSELSDRNVQN GEYLQVESFTLSICSKE+CPYRT NRIENESA  F+CCSWLVADE+LAW+IISI
Subjt:  ANCIPIYFGSLDETHVIDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENESAVVFWCCSWLVADEILAWKIISI

Query:  LEKLDLSAT
        LEK DLS+T
Subjt:  LEKLDLSAT

XP_038901162.1 cleavage and polyadenylation specificity factor subunit 3-II isoform X1 [Benincasa hispida]1.7e-20886.68Show/hide
Query:  VIVSLTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSG
        V   LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQ F+RSMIDAPGPC+LFATPGMISGGFSLEVFKRWAPSK NLITLPGYCVAGT+GHKLMSG
Subjt:  VIVSLTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSG

Query:  KPTKIDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCS
        KPTKIDLDKDTQIDVQCQIHQL+FSPHTD+KGIMDLV FLSPKHVILVHGEKPKMATLK+RIHSELGIPC+DPANNET+SISSTL VKAEAS MFIQSCS
Subjt:  KPTKIDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCS

Query:  TPNFKFLKRNLNNKIDPDLKDASSKAGASS-MLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQNELLLLLGEQEH
        TPNFKFLKRNLNNK+DP+LKD S KAG +S + IR+CSN HFK+LNRNLD KFD S +C PELQVSDDRVNEGILVMEKGK+ KV+HQ+E+LLLLGEQEH
Subjt:  TPNFKFLKRNLNNKIDPDLKDASSKAGASS-MLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQNELLLLLGEQEH

Query:  EVRFANCIPIYFGSLDETHVIDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENESAVVFWCCSWLVADEILAWK
        EVRFANC PIYFG+L+ETHV+DCLSRKS WLSQL SKLSSELSD+NVQNLGEYLQVESFTLSICSKE+CPYRT NRIENES +VF CCSWL+ADEILAWK
Subjt:  EVRFANCIPIYFGSLDETHVIDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENESAVVFWCCSWLVADEILAWK

Query:  IISILEKLDLSAT
        IISILEK +L +T
Subjt:  IISILEKLDLSAT

XP_038901169.1 cleavage and polyadenylation specificity factor subunit 3-II isoform X2 [Benincasa hispida]1.7e-20886.68Show/hide
Query:  VIVSLTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSG
        V   LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQ F+RSMIDAPGPC+LFATPGMISGGFSLEVFKRWAPSK NLITLPGYCVAGT+GHKLMSG
Subjt:  VIVSLTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSG

Query:  KPTKIDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCS
        KPTKIDLDKDTQIDVQCQIHQL+FSPHTD+KGIMDLV FLSPKHVILVHGEKPKMATLK+RIHSELGIPC+DPANNET+SISSTL VKAEAS MFIQSCS
Subjt:  KPTKIDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCS

Query:  TPNFKFLKRNLNNKIDPDLKDASSKAGASS-MLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQNELLLLLGEQEH
        TPNFKFLKRNLNNK+DP+LKD S KAG +S + IR+CSN HFK+LNRNLD KFD S +C PELQVSDDRVNEGILVMEKGK+ KV+HQ+E+LLLLGEQEH
Subjt:  TPNFKFLKRNLNNKIDPDLKDASSKAGASS-MLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQNELLLLLGEQEH

Query:  EVRFANCIPIYFGSLDETHVIDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENESAVVFWCCSWLVADEILAWK
        EVRFANC PIYFG+L+ETHV+DCLSRKS WLSQL SKLSSELSD+NVQNLGEYLQVESFTLSICSKE+CPYRT NRIENES +VF CCSWL+ADEILAWK
Subjt:  EVRFANCIPIYFGSLDETHVIDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENESAVVFWCCSWLVADEILAWK

Query:  IISILEKLDLSAT
        IISILEK +L +T
Subjt:  IISILEKLDLSAT

XP_038901174.1 cleavage and polyadenylation specificity factor subunit 3-II isoform X6 [Benincasa hispida]1.7e-20886.68Show/hide
Query:  VIVSLTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSG
        V   LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQ F+RSMIDAPGPC+LFATPGMISGGFSLEVFKRWAPSK NLITLPGYCVAGT+GHKLMSG
Subjt:  VIVSLTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSG

Query:  KPTKIDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCS
        KPTKIDLDKDTQIDVQCQIHQL+FSPHTD+KGIMDLV FLSPKHVILVHGEKPKMATLK+RIHSELGIPC+DPANNET+SISSTL VKAEAS MFIQSCS
Subjt:  KPTKIDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCS

Query:  TPNFKFLKRNLNNKIDPDLKDASSKAGASS-MLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQNELLLLLGEQEH
        TPNFKFLKRNLNNK+DP+LKD S KAG +S + IR+CSN HFK+LNRNLD KFD S +C PELQVSDDRVNEGILVMEKGK+ KV+HQ+E+LLLLGEQEH
Subjt:  TPNFKFLKRNLNNKIDPDLKDASSKAGASS-MLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQNELLLLLGEQEH

Query:  EVRFANCIPIYFGSLDETHVIDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENESAVVFWCCSWLVADEILAWK
        EVRFANC PIYFG+L+ETHV+DCLSRKS WLSQL SKLSSELSD+NVQNLGEYLQVESFTLSICSKE+CPYRT NRIENES +VF CCSWL+ADEILAWK
Subjt:  EVRFANCIPIYFGSLDETHVIDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENESAVVFWCCSWLVADEILAWK

Query:  IISILEKLDLSAT
        IISILEK +L +T
Subjt:  IISILEKLDLSAT

TrEMBL top hitse value%identityAlignment
A0A1S3C9L2 cleavage and polyadenylation specificity factor subunit 3-II3.6e-20785.68Show/hide
Query:  FFEQGSLWHRVIVS--LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYC
        ++E+ +L   + VS  LTVQANMYYKMLISWTSQKVKETY+TRNAFDFKNVQ F+RSMIDAPGPCVLFATPGMIS GFSLEVFKRWAPSK+NLITLPGYC
Subjt:  FFEQGSLWHRVIVS--LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYC

Query:  VAGTIGHKLMSGKPTKIDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVK
        VAGT+GHKLMSGKPTKIDLDKDTQIDVQCQ+HQL+FSPHTD+KGIMDLV FLSPKHVILVHGEKPKMA LKERIHSELGIPCHDPANNET+SISSTL +K
Subjt:  VAGTIGHKLMSGKPTKIDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVK

Query:  AEASSMFIQSCSTPNFKFLKRNLNNKIDPDLKDASSKA-GASSMLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQ
        AEASSMFIQSCSTPNFKFLKRNL +KIDPDLKD S KA   S+MLIR CSNPHFKHLNRNLD KFD SL+C PELQVSDDRVNEGILVME GK+ K +HQ
Subjt:  AEASSMFIQSCSTPNFKFLKRNLNNKIDPDLKDASSKA-GASSMLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQ

Query:  NELLLLLGEQEHEVRFANCIPIYFGSLDETHVIDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENE-SAVVFWC
        +ELLLLLGEQEHEVRFA+C PIYFGSLDE HV+D LSRKS WLSQLS KLS+ELSDRNVQNLGEYLQVES TLSICSKENCPYRT NRIENE SA+VF C
Subjt:  NELLLLLGEQEHEVRFANCIPIYFGSLDETHVIDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENE-SAVVFWC

Query:  CSWLVADEILAWKIISILEKLDLSAT
        CSWLVADEILAWKIISILEK DL +T
Subjt:  CSWLVADEILAWKIISILEKLDLSAT

A0A6J1FTR8 cleavage and polyadenylation specificity factor subunit 3-II isoform X21.6e-20787.78Show/hide
Query:  LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTK
        LTVQANMYYKMLISWTSQKVKETY+TRNAFDFKNVQ F+RSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSK+NLITLPGYCVAGTIGHKLMSGKPTK
Subjt:  LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTK

Query:  IDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCSTPNF
        IDLDKDTQIDVQCQIHQL+FSPHTD+KGIMDLV FLSPKHVILVHGEKPKM TLKERIHSELGIPCHDPANNET+SISSTL +KAE+SSMFIQSCSTPNF
Subjt:  IDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCSTPNF

Query:  KFLKRNLNNKIDPDLKDASSK-AGASSMLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQNELLLLLGEQEHEVRF
        KFLKRNLN+KI+   K+ SSK  G SSM  RR SNPH KHLNRNLD+KFD SL+C PEL+VSDDRVNEGILVME GK+ KVVHQ+ELLLLLGEQEHEVRF
Subjt:  KFLKRNLNNKIDPDLKDASSK-AGASSMLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQNELLLLLGEQEHEVRF

Query:  ANCIPIYFGSLDETHVIDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENESAVVFWCCSWLVADEILAWKIISI
        ANC PIYFGSLD+THV+DC+SRKS WLSQLSSKLSSELSDRNVQN GEYLQVESFTLSICSKE+CPYRT NRIENESA  F+CCSWLVADE+LAW+IISI
Subjt:  ANCIPIYFGSLDETHVIDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENESAVVFWCCSWLVADEILAWKIISI

Query:  LEKLDLSAT
        LEK DLS+T
Subjt:  LEKLDLSAT

A0A6J1FYI7 cleavage and polyadenylation specificity factor subunit 3-II isoform X11.6e-20787.78Show/hide
Query:  LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTK
        LTVQANMYYKMLISWTSQKVKETY+TRNAFDFKNVQ F+RSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSK+NLITLPGYCVAGTIGHKLMSGKPTK
Subjt:  LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTK

Query:  IDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCSTPNF
        IDLDKDTQIDVQCQIHQL+FSPHTD+KGIMDLV FLSPKHVILVHGEKPKM TLKERIHSELGIPCHDPANNET+SISSTL +KAE+SSMFIQSCSTPNF
Subjt:  IDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCSTPNF

Query:  KFLKRNLNNKIDPDLKDASSK-AGASSMLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQNELLLLLGEQEHEVRF
        KFLKRNLN+KI+   K+ SSK  G SSM  RR SNPH KHLNRNLD+KFD SL+C PEL+VSDDRVNEGILVME GK+ KVVHQ+ELLLLLGEQEHEVRF
Subjt:  KFLKRNLNNKIDPDLKDASSK-AGASSMLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQNELLLLLGEQEHEVRF

Query:  ANCIPIYFGSLDETHVIDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENESAVVFWCCSWLVADEILAWKIISI
        ANC PIYFGSLD+THV+DC+SRKS WLSQLSSKLSSELSDRNVQN GEYLQVESFTLSICSKE+CPYRT NRIENESA  F+CCSWLVADE+LAW+IISI
Subjt:  ANCIPIYFGSLDETHVIDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENESAVVFWCCSWLVADEILAWKIISI

Query:  LEKLDLSAT
        LEK DLS+T
Subjt:  LEKLDLSAT

A0A6J1J7M0 cleavage and polyadenylation specificity factor subunit 3-II isoform X17.9e-20787.53Show/hide
Query:  LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTK
        LTVQANMYYKMLISWTSQKVKETY+TRNAFDFKNVQ F+RSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSK+NLITLPGYCVAGTIGHKLMSGKPTK
Subjt:  LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTK

Query:  IDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCSTPNF
        IDLDKDTQIDVQCQIHQL+FSPHTD+KGIMDLV FLSPKHVILVHGEKPKM TLKERIHSELGIPCHDPANNET+SISSTL +KAE+SS FIQSCSTPNF
Subjt:  IDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCSTPNF

Query:  KFLKRNLNNKIDPDLKDASSK-AGASSMLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQNELLLLLGEQEHEVRF
        KFLKRNLN+KI+   K+ SSK  G SSM  RR SNPH KHLNRNLD+KFD SL+C PEL+VSDDRVNEGILVMEKGK+ KVVHQ+ELLLLLGEQEHEVRF
Subjt:  KFLKRNLNNKIDPDLKDASSK-AGASSMLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQNELLLLLGEQEHEVRF

Query:  ANCIPIYFGSLDETHVIDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENESAVVFWCCSWLVADEILAWKIISI
        ANC PIYFGSLD+THV+DC+SRKS WLSQLSSKLSSELSDRNVQN GEYLQVESFTLSICSKE+CPYRT NRIENESA  F+CCSWLV DE+LAW+IISI
Subjt:  ANCIPIYFGSLDETHVIDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENESAVVFWCCSWLVADEILAWKIISI

Query:  LEKLDLSAT
        LEK DLS+T
Subjt:  LEKLDLSAT

A0A6J1JG52 cleavage and polyadenylation specificity factor subunit 3-II isoform X27.9e-20787.53Show/hide
Query:  LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTK
        LTVQANMYYKMLISWTSQKVKETY+TRNAFDFKNVQ F+RSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSK+NLITLPGYCVAGTIGHKLMSGKPTK
Subjt:  LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTK

Query:  IDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCSTPNF
        IDLDKDTQIDVQCQIHQL+FSPHTD+KGIMDLV FLSPKHVILVHGEKPKM TLKERIHSELGIPCHDPANNET+SISSTL +KAE+SS FIQSCSTPNF
Subjt:  IDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCSTPNF

Query:  KFLKRNLNNKIDPDLKDASSK-AGASSMLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQNELLLLLGEQEHEVRF
        KFLKRNLN+KI+   K+ SSK  G SSM  RR SNPH KHLNRNLD+KFD SL+C PEL+VSDDRVNEGILVMEKGK+ KVVHQ+ELLLLLGEQEHEVRF
Subjt:  KFLKRNLNNKIDPDLKDASSK-AGASSMLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQNELLLLLGEQEHEVRF

Query:  ANCIPIYFGSLDETHVIDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENESAVVFWCCSWLVADEILAWKIISI
        ANC PIYFGSLD+THV+DC+SRKS WLSQLSSKLSSELSDRNVQN GEYLQVESFTLSICSKE+CPYRT NRIENESA  F+CCSWLV DE+LAW+IISI
Subjt:  ANCIPIYFGSLDETHVIDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENESAVVFWCCSWLVADEILAWKIISI

Query:  LEKLDLSAT
        LEK DLS+T
Subjt:  LEKLDLSAT

SwissProt top hitse value%identityAlignment
Q2YDM2 Integrator complex subunit 112.2e-4942.13Show/hide
Query:  LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTK
        LT +AN YYK+ I WT+QK+++T+  RN F+FK+++ F+R+  D+PGP V+FATPGM+  G SL++F++WA ++ N++ +PGYCV GT+GHK++SG+  K
Subjt:  LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTK

Query:  IDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCSTPNF
        ++++    ++V+ Q+  +SFS H DAKGIM LV    P++V+LVHGE  KM  LK++I  E  + C+ PAN ET+++ ++  +    S            
Subjt:  IDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCSTPNF

Query:  KFLKRNLNNKIDPDLK
          LKR +   + PD K
Subjt:  KFLKRNLNNKIDPDLK

Q503E1 Integrator complex subunit 112.7e-5046.81Show/hide
Query:  LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTK
        LT +AN YYK+ I+WT+QK+++T+  RN F+FK+++ F+RS  D PGP V+FATPGM+  G SL++FK+WA ++ N++ +PGYCV GT+GHK+++G+  K
Subjt:  LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTK

Query:  IDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEAS
        ++++    +DV+ Q+  +SFS H DAKGIM L+    P++++LVHGE  KM  LK++I  E  I C  PAN ET +I +   V  + S
Subjt:  IDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEAS

Q54YL3 Integrator complex subunit 11 homolog1.2e-5544.73Show/hide
Query:  FFEQGSLWHRVI---VSLTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGY
        ++EQ +L H  I     L  +AN+YYK+ I+WT+QK+K+T+  RN FDFK+++ F+  ++DAPG  VLFATPGM+  G SLEVFK+WAP+++N+  +PGY
Subjt:  FFEQGSLWHRVI---VSLTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGY

Query:  CVAGTIGHKLMS--------GKPTK--IDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNE
        CV GT+G+KL++         KP    +++DK T I+V+C+IH LSFS H DAKGI+ L+   +P++VILVHGEK KM  L ++I  E+G+ C+ PAN  
Subjt:  CVAGTIGHKLMS--------GKPTK--IDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNE

Query:  TISISSTLFVKAEAS-SMFIQSCSTPNFKFLKRNLNN
        TI I +   +  + S ++  +     ++++   NLNN
Subjt:  TISISSTLFVKAEAS-SMFIQSCSTPNFKFLKRNLNN

Q5NVE6 Integrator complex subunit 111.7e-4947.22Show/hide
Query:  LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTK
        LT +AN YYK+ I WT+QK+++T+  RN F+FK+++ F+R+  D PGP V+FATPGM+  G SL++F++WA ++ N++ +PGYCV GT+GHK++SG+  K
Subjt:  LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTK

Query:  IDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISST
        ++++    ++V+ Q+  +SFS H DAKGIM LV    P+ V+LVHGE  KM  LK++I  EL + C+ PAN ET+++ ++
Subjt:  IDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISST

Q8GUU3 Cleavage and polyadenylation specificity factor subunit 3-II2.9e-10549.13Show/hide
Query:  LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTK
        LT+QANMYYKMLISWTSQ VKE ++T N FDFKNV++F+RS+I APGPCVLFATPGM+  GFSLEVFK WAPS +NL+ LPGY VAGT+GHKLM+GKPT 
Subjt:  LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTK

Query:  IDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCSTPNF
        +DL   T++DV+C++HQ++FSPHTDAKGIMDL  FLSPK+V+LVHGEKP M  LKE+I SEL IPC  PAN ET+S +ST ++KA AS MF++SCS PNF
Subjt:  IDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCSTPNF

Query:  KFLKRNLNNKIDPDLKDASSKAGASSMLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQNELLLLLGEQEHEVRFA
        KF                              SN                      +L+V+D R  +G+LV+EK K+AK+VHQ+E+  +L E+ H V  A
Subjt:  KFLKRNLNNKIDPDLKDASSKAGASSMLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQNELLLLLGEQEHEVRFA

Query:  NCIPIYFGSLDETHVIDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENESAVVFWCCSWLVADEILAWKIISIL
        +C P+      E   +D        + QLS+K+   +S   +      LQV SF  S+C K+ C +R+ +   + S  VF CC+W +AD  L W+II+ +
Subjt:  NCIPIYFGSLDETHVIDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENESAVVFWCCSWLVADEILAWKIISIL

Query:  E
        +
Subjt:  E

Arabidopsis top hitse value%identityAlignment
AT1G61010.1 cleavage and polyadenylation specificity factor 73-I1.6e-1830.52Show/hide
Query:  YKMLISWTSQKVKETYSTRNAFDFKNVQNFER-SMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTKIDLDKDT
        Y+  I   + +++  ++  N F FK++         +  GP V+ ATPG +  G S ++F  W   K N   +PGY V GT+  K +  +P ++ L    
Subjt:  YKMLISWTSQKVKETYSTRNAFDFKNVQNFER-SMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTKIDLDKDT

Query:  QIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSE
           +  Q+H +SFS H D       +  L P ++ILVHGE  +M  LK+++ +E
Subjt:  QIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSE

AT1G61010.2 cleavage and polyadenylation specificity factor 73-I1.6e-1830.52Show/hide
Query:  YKMLISWTSQKVKETYSTRNAFDFKNVQNFER-SMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTKIDLDKDT
        Y+  I   + +++  ++  N F FK++         +  GP V+ ATPG +  G S ++F  W   K N   +PGY V GT+  K +  +P ++ L    
Subjt:  YKMLISWTSQKVKETYSTRNAFDFKNVQNFER-SMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTKIDLDKDT

Query:  QIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSE
           +  Q+H +SFS H D       +  L P ++ILVHGE  +M  LK+++ +E
Subjt:  QIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSE

AT1G61010.3 cleavage and polyadenylation specificity factor 73-I1.6e-1830.52Show/hide
Query:  YKMLISWTSQKVKETYSTRNAFDFKNVQNFER-SMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTKIDLDKDT
        Y+  I   + +++  ++  N F FK++         +  GP V+ ATPG +  G S ++F  W   K N   +PGY V GT+  K +  +P ++ L    
Subjt:  YKMLISWTSQKVKETYSTRNAFDFKNVQNFER-SMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTKIDLDKDT

Query:  QIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSE
           +  Q+H +SFS H D       +  L P ++ILVHGE  +M  LK+++ +E
Subjt:  QIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSE

AT2G01730.1 cleavage and polyadenylation specificity factor 73 kDa subunit-II2.1e-10649.13Show/hide
Query:  LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTK
        LT+QANMYYKMLISWTSQ VKE ++T N FDFKNV++F+RS+I APGPCVLFATPGM+  GFSLEVFK WAPS +NL+ LPGY VAGT+GHKLM+GKPT 
Subjt:  LTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTK

Query:  IDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCSTPNF
        +DL   T++DV+C++HQ++FSPHTDAKGIMDL  FLSPK+V+LVHGEKP M  LKE+I SEL IPC  PAN ET+S +ST ++KA AS MF++SCS PNF
Subjt:  IDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCSTPNF

Query:  KFLKRNLNNKIDPDLKDASSKAGASSMLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQNELLLLLGEQEHEVRFA
        KF                              SN                      +L+V+D R  +G+LV+EK K+AK+VHQ+E+  +L E+ H V  A
Subjt:  KFLKRNLNNKIDPDLKDASSKAGASSMLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQNELLLLLGEQEHEVRFA

Query:  NCIPIYFGSLDETHVIDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENESAVVFWCCSWLVADEILAWKIISIL
        +C P+      E   +D        + QLS+K+   +S   +      LQV SF  S+C K+ C +R+ +   + S  VF CC+W +AD  L W+II+ +
Subjt:  NCIPIYFGSLDETHVIDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENESAVVFWCCSWLVADEILAWKIISIL

Query:  E
        +
Subjt:  E

AT5G23880.1 cleavage and polyadenylation specificity factor 1009.2e-0629.59Show/hide
Query:  YYKMLISWTSQKVKETYSTR--NAFDFKNVQ-NFERSMID--APGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTK
        Y K  + W S  + +++ T   NAF  ++V     ++ +D   PGP V+ A+   +  GF+ E+F  WA    NL+        GT+   L S  P K
Subjt:  YYKMLISWTSQKVKETYSTR--NAFDFKNVQ-NFERSMID--APGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMSGKPTK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATCATCTCCAAAATGCAAAGAACCCTAGTCACCCAACCGATTCTACTCTCCTCGCCTCAGACCCTTCGTCTATCACTAAAACATCTGACCTCTTGCCTTTGCCTAA
CCCTTTGATGTTGTTGGCGTTCCCTACCATGCCTTTAGTCCCTATCCCTTTTGTTCCAAATGTTTGTGGTCCGGAGGTCTATTGGAACCCTTCCATGGTCAGTTCTCATT
CCAAGGAGATTTGTGGTCTGGGTTTTGCTTCAATCCCTAGGTTTCAAGGAGTGAGCCTAATAGGAGGAGGCCATCCCTGTTTCTCGAGGAATTTGGTGAGCAATATAAGT
CTCGAGGGTCCATCAAGTGATGTTTTGTTTCAAGACATGCTTACTAGAGAATGGTATGGAGGCATCCTTATGATGTGGGATGGGAGGAAGTTCACTTCGATGGATGCGTT
GAAGGGTGTTTATTTGGTTTCAGTTGCTTTTGATGTTCAGAATTGTGGGGTTGGGTGGATAAGTGGCCTCTACTGTCCCCCCTCCCCTAAAGGGAGAAAGGCTTTTTGGG
ATGAGTTAGGCAGTATATTCTTTGTGGAGATTTTTGGTGCATTGTTGGGGATTTTAATGCGGGAGAGAATGGCGTGTTGTAGACTGGATAGATTTCTTTTCTCCAAAGGT
TGGGGTGATCACTTTTCTGATGGGAGGCAGTTGTTGGGACATAGAATTACTTCTGATCATTGGCCTATCTTCTTGAACCTTGGTAAAGTTTCCTGGGGCCCGTGCCCGTT
TAGATTTGAGAACATGTGGCTTACCCACCCTTCTTTCAAAGCTTCTTCCACTTCGTGGTGGGAGGTCGGGGTGGAAGGGCGGGTTGTCGACGGTTTTGAGAGAAGAGAGA
ATGATTCTAAAGTCGAAGCTGGAATCCTTGATTCAATACTGAGATCCCCATCTCCTCAAGAGAGGACTTGGAGGTGCCCTTCTCTCGAGGAAATCCACAAGGTTATTTTT
GGCTTTGATAGGTCTAAGTCGCCAGGTCCGGACGGTTTCACCATGGCCTTTTATCAGGACAATTGGGTAGGCATTAAGGAGGATCTCTATAAAGCTTTTTGTGAGTTTTA
TGAGAGGGGTATTATAGATGGTACCATTAGAATCATTAGAGTCATTCTTTCATTCTTGTTAGTGGCAAACCTAGAGGGAAGATTTTTGCTACTAGGAGCCTTAGGCAAGG
AGACCCTCTCCTCTTTCCTTTTCATCCTAGTGGTGGATGTCTTAAGTCGGTTAGTATTGACAGCTATAGATAAAGGTGTGGTGGAAGGATTTAAGGTGGGTTTGGATAAC
TTGCAGGTGTCTCATCTCCAATTTGCTGATGATACCCTTTTCTTTTGCTCGGAGAAGGCAAATTCCTTTAGGAATCTGAATTGTTTGTTGTCATTCTTCGAAGCTATCTT
TGGGCTTAAAATTAATTGGCAGAAAGGTTCCATCATTGGGATTAACTGTGAGTGTTCTAAGCTTCTTTCGTGGGCTCGCATCCAACGAAGGGTGGAGGAGGGTGGTGGTG
CTCACCTTGTCGGGTGGGAAGTTGTTTCCAAACCAATTGATGTCGGAAGCTTGGGTATTGGGAACCTTAGACTACGTAACGAGGCTCTTCTGGCGAAGTGGTTGTGGTGC
TTCTTCTTCGAGCAAGGCTCTTTGTGGCATAGGGTCATTGTAAGTTTGACGGTTCAAGCCAACATGTACTATAAAATGCTCATCAGTTGGACCAGCCAGAAAGTTAAAGA
GACGTACTCCACGAGGAATGCATTTGACTTTAAGAATGTTCAAAACTTCGAACGTTCTATGATTGATGCTCCTGGACCTTGTGTTCTCTTTGCTACACCAGGGATGATCA
GTGGTGGATTTTCTCTTGAGGTTTTTAAGCGCTGGGCACCTTCTAAAATGAATCTCATCACATTACCTGGTTACTGTGTGGCCGGAACTATTGGGCACAAGTTAATGTCA
GGTAAACCCACCAAAATTGATCTGGACAAGGACACTCAAATTGATGTGCAATGCCAGATTCACCAACTGTCATTCAGCCCTCATACTGATGCCAAAGGCATTATGGACCT
TGTGAATTTCCTTAGTCCCAAGCATGTGATACTTGTACATGGAGAGAAGCCTAAAATGGCTACTTTAAAGGAGAGGATTCATTCAGAACTGGGAATCCCTTGTCATGATC
CTGCGAATAATGAAACCATATCGATCTCTTCAACTCTTTTTGTCAAAGCAGAAGCCTCGAGCATGTTTATTCAGAGTTGTTCAACTCCCAATTTCAAGTTTTTGAAAAGA
AACTTGAATAATAAGATTGATCCTGATTTAAAAGATGCAAGTTCGAAAGCAGGAGCTTCAAGCATGCTTATACGCAGGTGCTCAAATCCCCATTTCAAGCATTTGAATAG
AAATCTGGATGACAAGTTTGATTGTAGTTTAAATTGTAGGCCAGAGCTACAGGTGAGTGATGATAGAGTGAATGAAGGGATCTTGGTGATGGAAAAAGGTAAAAGAGCAA
AGGTGGTTCACCAGAATGAACTACTGCTTCTCTTGGGAGAACAAGAGCATGAGGTTAGATTTGCTAACTGTATCCCCATATATTTTGGAAGCTTAGATGAGACCCATGTT
ATAGATTGCCTATCTAGAAAATCTTCATGGCTTTCCCAGCTATCTTCAAAGCTTTCAAGTGAACTTTCAGATAGGAATGTTCAAAATCTTGGGGAGTATCTTCAAGTTGA
ATCATTTACACTGTCCATTTGCTCAAAGGAGAATTGCCCTTACAGAACTATAAATAGAATTGAAAATGAATCTGCTGTAGTATTCTGGTGCTGTAGTTGGCTAGTTGCAG
ATGAAATCCTTGCATGGAAAATCATTTCCATCTTGGAGAAGCTTGATCTCAGTGCAACATGA
mRNA sequenceShow/hide mRNA sequence
ATGCATCATCTCCAAAATGCAAAGAACCCTAGTCACCCAACCGATTCTACTCTCCTCGCCTCAGACCCTTCGTCTATCACTAAAACATCTGACCTCTTGCCTTTGCCTAA
CCCTTTGATGTTGTTGGCGTTCCCTACCATGCCTTTAGTCCCTATCCCTTTTGTTCCAAATGTTTGTGGTCCGGAGGTCTATTGGAACCCTTCCATGGTCAGTTCTCATT
CCAAGGAGATTTGTGGTCTGGGTTTTGCTTCAATCCCTAGGTTTCAAGGAGTGAGCCTAATAGGAGGAGGCCATCCCTGTTTCTCGAGGAATTTGGTGAGCAATATAAGT
CTCGAGGGTCCATCAAGTGATGTTTTGTTTCAAGACATGCTTACTAGAGAATGGTATGGAGGCATCCTTATGATGTGGGATGGGAGGAAGTTCACTTCGATGGATGCGTT
GAAGGGTGTTTATTTGGTTTCAGTTGCTTTTGATGTTCAGAATTGTGGGGTTGGGTGGATAAGTGGCCTCTACTGTCCCCCCTCCCCTAAAGGGAGAAAGGCTTTTTGGG
ATGAGTTAGGCAGTATATTCTTTGTGGAGATTTTTGGTGCATTGTTGGGGATTTTAATGCGGGAGAGAATGGCGTGTTGTAGACTGGATAGATTTCTTTTCTCCAAAGGT
TGGGGTGATCACTTTTCTGATGGGAGGCAGTTGTTGGGACATAGAATTACTTCTGATCATTGGCCTATCTTCTTGAACCTTGGTAAAGTTTCCTGGGGCCCGTGCCCGTT
TAGATTTGAGAACATGTGGCTTACCCACCCTTCTTTCAAAGCTTCTTCCACTTCGTGGTGGGAGGTCGGGGTGGAAGGGCGGGTTGTCGACGGTTTTGAGAGAAGAGAGA
ATGATTCTAAAGTCGAAGCTGGAATCCTTGATTCAATACTGAGATCCCCATCTCCTCAAGAGAGGACTTGGAGGTGCCCTTCTCTCGAGGAAATCCACAAGGTTATTTTT
GGCTTTGATAGGTCTAAGTCGCCAGGTCCGGACGGTTTCACCATGGCCTTTTATCAGGACAATTGGGTAGGCATTAAGGAGGATCTCTATAAAGCTTTTTGTGAGTTTTA
TGAGAGGGGTATTATAGATGGTACCATTAGAATCATTAGAGTCATTCTTTCATTCTTGTTAGTGGCAAACCTAGAGGGAAGATTTTTGCTACTAGGAGCCTTAGGCAAGG
AGACCCTCTCCTCTTTCCTTTTCATCCTAGTGGTGGATGTCTTAAGTCGGTTAGTATTGACAGCTATAGATAAAGGTGTGGTGGAAGGATTTAAGGTGGGTTTGGATAAC
TTGCAGGTGTCTCATCTCCAATTTGCTGATGATACCCTTTTCTTTTGCTCGGAGAAGGCAAATTCCTTTAGGAATCTGAATTGTTTGTTGTCATTCTTCGAAGCTATCTT
TGGGCTTAAAATTAATTGGCAGAAAGGTTCCATCATTGGGATTAACTGTGAGTGTTCTAAGCTTCTTTCGTGGGCTCGCATCCAACGAAGGGTGGAGGAGGGTGGTGGTG
CTCACCTTGTCGGGTGGGAAGTTGTTTCCAAACCAATTGATGTCGGAAGCTTGGGTATTGGGAACCTTAGACTACGTAACGAGGCTCTTCTGGCGAAGTGGTTGTGGTGC
TTCTTCTTCGAGCAAGGCTCTTTGTGGCATAGGGTCATTGTAAGTTTGACGGTTCAAGCCAACATGTACTATAAAATGCTCATCAGTTGGACCAGCCAGAAAGTTAAAGA
GACGTACTCCACGAGGAATGCATTTGACTTTAAGAATGTTCAAAACTTCGAACGTTCTATGATTGATGCTCCTGGACCTTGTGTTCTCTTTGCTACACCAGGGATGATCA
GTGGTGGATTTTCTCTTGAGGTTTTTAAGCGCTGGGCACCTTCTAAAATGAATCTCATCACATTACCTGGTTACTGTGTGGCCGGAACTATTGGGCACAAGTTAATGTCA
GGTAAACCCACCAAAATTGATCTGGACAAGGACACTCAAATTGATGTGCAATGCCAGATTCACCAACTGTCATTCAGCCCTCATACTGATGCCAAAGGCATTATGGACCT
TGTGAATTTCCTTAGTCCCAAGCATGTGATACTTGTACATGGAGAGAAGCCTAAAATGGCTACTTTAAAGGAGAGGATTCATTCAGAACTGGGAATCCCTTGTCATGATC
CTGCGAATAATGAAACCATATCGATCTCTTCAACTCTTTTTGTCAAAGCAGAAGCCTCGAGCATGTTTATTCAGAGTTGTTCAACTCCCAATTTCAAGTTTTTGAAAAGA
AACTTGAATAATAAGATTGATCCTGATTTAAAAGATGCAAGTTCGAAAGCAGGAGCTTCAAGCATGCTTATACGCAGGTGCTCAAATCCCCATTTCAAGCATTTGAATAG
AAATCTGGATGACAAGTTTGATTGTAGTTTAAATTGTAGGCCAGAGCTACAGGTGAGTGATGATAGAGTGAATGAAGGGATCTTGGTGATGGAAAAAGGTAAAAGAGCAA
AGGTGGTTCACCAGAATGAACTACTGCTTCTCTTGGGAGAACAAGAGCATGAGGTTAGATTTGCTAACTGTATCCCCATATATTTTGGAAGCTTAGATGAGACCCATGTT
ATAGATTGCCTATCTAGAAAATCTTCATGGCTTTCCCAGCTATCTTCAAAGCTTTCAAGTGAACTTTCAGATAGGAATGTTCAAAATCTTGGGGAGTATCTTCAAGTTGA
ATCATTTACACTGTCCATTTGCTCAAAGGAGAATTGCCCTTACAGAACTATAAATAGAATTGAAAATGAATCTGCTGTAGTATTCTGGTGCTGTAGTTGGCTAGTTGCAG
ATGAAATCCTTGCATGGAAAATCATTTCCATCTTGGAGAAGCTTGATCTCAGTGCAACATGA
Protein sequenceShow/hide protein sequence
MHHLQNAKNPSHPTDSTLLASDPSSITKTSDLLPLPNPLMLLAFPTMPLVPIPFVPNVCGPEVYWNPSMVSSHSKEICGLGFASIPRFQGVSLIGGGHPCFSRNLVSNIS
LEGPSSDVLFQDMLTREWYGGILMMWDGRKFTSMDALKGVYLVSVAFDVQNCGVGWISGLYCPPSPKGRKAFWDELGSIFFVEIFGALLGILMRERMACCRLDRFLFSKG
WGDHFSDGRQLLGHRITSDHWPIFLNLGKVSWGPCPFRFENMWLTHPSFKASSTSWWEVGVEGRVVDGFERRENDSKVEAGILDSILRSPSPQERTWRCPSLEEIHKVIF
GFDRSKSPGPDGFTMAFYQDNWVGIKEDLYKAFCEFYERGIIDGTIRIIRVILSFLLVANLEGRFLLLGALGKETLSSFLFILVVDVLSRLVLTAIDKGVVEGFKVGLDN
LQVSHLQFADDTLFFCSEKANSFRNLNCLLSFFEAIFGLKINWQKGSIIGINCECSKLLSWARIQRRVEEGGGAHLVGWEVVSKPIDVGSLGIGNLRLRNEALLAKWLWC
FFFEQGSLWHRVIVSLTVQANMYYKMLISWTSQKVKETYSTRNAFDFKNVQNFERSMIDAPGPCVLFATPGMISGGFSLEVFKRWAPSKMNLITLPGYCVAGTIGHKLMS
GKPTKIDLDKDTQIDVQCQIHQLSFSPHTDAKGIMDLVNFLSPKHVILVHGEKPKMATLKERIHSELGIPCHDPANNETISISSTLFVKAEASSMFIQSCSTPNFKFLKR
NLNNKIDPDLKDASSKAGASSMLIRRCSNPHFKHLNRNLDDKFDCSLNCRPELQVSDDRVNEGILVMEKGKRAKVVHQNELLLLLGEQEHEVRFANCIPIYFGSLDETHV
IDCLSRKSSWLSQLSSKLSSELSDRNVQNLGEYLQVESFTLSICSKENCPYRTINRIENESAVVFWCCSWLVADEILAWKIISILEKLDLSAT