; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012736 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012736
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDUF4378 domain-containing protein
Genome locationchr1:43835099..43837525
RNA-Seq ExpressionLag0012736
SyntenyLag0012736
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008462543.1 PREDICTED: uncharacterized protein LOC103500875 [Cucumis melo]1.0e-23479.64Show/hide
Query:  MMAQKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANFCRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHI
        MMAQKHLHELLE+DQEPFHLN+YIAEKRVNLKRVSPKT LQVKKRKPISTNSIFP NFCRNACFTSF PSPD RKSPLFEFRSPARNSPCKSPNAIFLHI
Subjt:  MMAQKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANFCRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHI

Query:  PARTAALLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFGHRKSSIRRHITQGETSSYNGRSSYGFWSETN
        PARTA LLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRN  NR T+ACGSG DLASF  RKSSIRR   QGETSS NGRSSYGFWSETN
Subjt:  PARTAALLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFGHRKSSIRRHITQGETSSYNGRSSYGFWSETN

Query:  EEGRSMDLGTSCSSQSEDSEEASVAYFGEDYCESPFRFVLQRSPSYGCRTPDFLSPAASPCRRNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSI
        EEG SMDLGTSCSSQSEDSEE SVAYFGEDYCESPFRFVLQRSPS+GCRTPDFLSPAASPCRRNK                                   
Subjt:  EEGRSMDLGTSCSSQSEDSEEASVAYFGEDYCESPFRFVLQRSPSYGCRTPDFLSPAASPCRRNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSI

Query:  DKIRPDLKFLFKGETVNSAESLKKFQVEEDEEDKEQCSPVSVLDGPFDDSYE----DRERHGD---EDYGLECSYATVQRTKQQLLNKLRRFERLADLDP
                     E  + AESL KFQVEEDEEDKEQCSPVSVLD PFDDSY+    +RER GD   E+Y +ECSYATVQRTKQQLLNKLRRFERLADLDP
Subjt:  DKIRPDLKFLFKGETVNSAESLKKFQVEEDEEDKEQCSPVSVLDGPFDDSYE----DRERHGD---EDYGLECSYATVQRTKQQLLNKLRRFERLADLDP

Query:  VELEKIMQEEEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKEVVNDAGFCKSQQFLPQDMRKLVTDLIAEEEGDRSNVDTREEVIQRVCRRLELW
        +ELEKIM  EEELDE  Y+YFDNEECEYY+ESVQWDNEN++EWFVKEV ++  FCKS+QFLPQD+RKLV DLIAEEE DRS+ +TREEVI+RVC RLELW
Subjt:  VELEKIMQEEEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKEVVNDAGFCKSQQFLPQDMRKLVTDLIAEEEGDRSNVDTREEVIQRVCRRLELW

Query:  KEVEFNTIDMMVEEDLRKEVGEWKKNQEQRGEAASDLELAIFSLLVEELAVELAC
        KEVEFNTIDMMVEEDLRKEVGEWK+NQEQRGEAA+DLELAIFSLLVEELAVELAC
Subjt:  KEVEFNTIDMMVEEDLRKEVGEWKKNQEQRGEAASDLELAIFSLLVEELAVELAC

XP_022143695.1 uncharacterized protein LOC111013540 [Momordica charantia]1.3e-22176.41Show/hide
Query:  MMAQKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANFCRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHI
        MMAQKHLH+LLEEDQEPFHLNSYIAEKRVNLKRVSPK+DLQV KRKPIST SIF  NFCRNACFTSFQPSPDLRKSPLFEF SPARN    SPNAIFLH+
Subjt:  MMAQKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANFCRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHI

Query:  PARTAALLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFGHRKSSIRRHITQGETSSYNGRSSYGFWSETN
        PARTAALLLEAALKIHKQKSS K KK+QIKNQG ARFGSVLKRLTLRNRN TNR+++ACGSGGDLASFG RKSSIRR +TQGETSSYNGRSSYGFWSE+N
Subjt:  PARTAALLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFGHRKSSIRRHITQGETSSYNGRSSYGFWSETN

Query:  EEGRSMDLGTSCSSQSEDSEEASVAYFGEDYCESPFRFVLQRSPSYGCRTPDFLSPAASPCRRNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSI
        EE RSMDLGTSCSSQSEDSEE SVAY G DYCESPFRFVLQRSPSYGCRTP F SPA SPCRRNKE                                  
Subjt:  EEGRSMDLGTSCSSQSEDSEEASVAYFGEDYCESPFRFVLQRSPSYGCRTPDFLSPAASPCRRNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSI

Query:  DKIRPDLKFLFKGETVNSAESLKKFQVEEDEEDKEQCSPVSVLDGPFDDSY----EDRERHGDEDYGLECSYATVQRTKQQLLNKLRRFERLADLDPVEL
                     +T++  ESLKKFQV EDEEDKEQCSPVS+LD PFDDSY    +DR R   EDY LECSYA VQRTKQQLLNKLRRFERLADLDP+EL
Subjt:  DKIRPDLKFLFKGETVNSAESLKKFQVEEDEEDKEQCSPVSVLDGPFDDSY----EDRERHGDEDYGLECSYATVQRTKQQLLNKLRRFERLADLDPVEL

Query:  EKIMQEEEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKEVVNDAGFCKSQQFLPQDMRKLVTDLIAEEEGDRSNVDTREEVIQRVCRRLELWKEV
        EKIM +E++  E+ YDYF +EECEYY   VQW NEN++EWFVKEV +D   CKSQ+FLPQDMRKLV DLIAEEE D+ N +TREEVIQRVC+RLELWKEV
Subjt:  EKIMQEEEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKEVVNDAGFCKSQQFLPQDMRKLVTDLIAEEEGDRSNVDTREEVIQRVCRRLELWKEV

Query:  EFNTIDMMVEEDLRKEVGEWKKNQEQRGEAASDLELAIFSLLVEELAVELA
        EFNTIDMMVEEDL+KEV EWKKNQEQRGEAA DLELAIFSLLVEELAVELA
Subjt:  EFNTIDMMVEEDLRKEVGEWKKNQEQRGEAASDLELAIFSLLVEELAVELA

XP_022925872.1 uncharacterized protein LOC111433152 isoform X1 [Cucurbita moschata]1.5e-22576.03Show/hide
Query:  MMAQKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANFCRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHI
        MM  KHLH+LLEEDQEPFHLN+YIAEKRVNLKRVS KTDLQVKKRKPISTNSIFP NFC+NACFTSFQPSPD RKSPLF+FRSPAR+SPCKSPNAIFLHI
Subjt:  MMAQKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANFCRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHI

Query:  PARTAALLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFGHRKSSIRRHITQGETSSYNGRSSYGFWSETN
        PARTAALLLEAALKIHKQKSS K KK+QIKNQGFARFGSVLKRLTLRNRN  NRET  CG G +LASFG RKSS+RRHI QGETSS+NGRSSYGFWSETN
Subjt:  PARTAALLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFGHRKSSIRRHITQGETSSYNGRSSYGFWSETN

Query:  EEGRSMDLGTSCSSQSEDSEEASVAYFGEDYCESPFRFVLQRSPSYGCRTPDFLSPAASPCRRNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSI
        EEGRSMDLGTSCSSQSEDSEE SVAYFGEDYCESPFRFVLQRSPS+GCRTPDF SPAASPC R KE                                  
Subjt:  EEGRSMDLGTSCSSQSEDSEEASVAYFGEDYCESPFRFVLQRSPSYGCRTPDFLSPAASPCRRNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSI

Query:  DKIRPDLKFLFKGETVNSAESLKKFQVEEDEEDKEQCSPVSVLDGPFDDSYE----DRERHGD-------EDYGLECSYATVQRTKQQLLNKLRRFERLA
                     E VN+AESLKK Q E+DEEDKEQCSPVSVLD PFD SY+    DRER GD       EDYGLECSYATVQRTKQQLLNKLRRFE+LA
Subjt:  DKIRPDLKFLFKGETVNSAESLKKFQVEEDEEDKEQCSPVSVLDGPFDDSYE----DRERHGD-------EDYGLECSYATVQRTKQQLLNKLRRFERLA

Query:  DLDPVELEKIMQEEEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKEVVNDAGFCKSQQFLPQDMRKLVTDLIAEEEGDRSNVDTREEVIQRVCRR
        DLDP+ELEK+M  EEEL+E  +DYF+NEECEYYDES Q  NEN +E FVKEV + A FCKS+ FLP+DMRKLVTDL++EEE DRSN +TRE+VIQRVC+R
Subjt:  DLDPVELEKIMQEEEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKEVVNDAGFCKSQQFLPQDMRKLVTDLIAEEEGDRSNVDTREEVIQRVCRR

Query:  LELWKEVEFNTIDMMVEEDLRKEVGEWKKNQEQRGEAASDLELAIFSLLVEELAVELAC
        LE+WKEV+FNTIDMMVEEDLRKEV EWKKNQ QRGE A+DLE+AIFSLLVEELAVEL+C
Subjt:  LELWKEVEFNTIDMMVEEDLRKEVGEWKKNQEQRGEAASDLELAIFSLLVEELAVELAC

XP_031744144.1 uncharacterized protein LOC101207103 [Cucumis sativus]9.5e-23680Show/hide
Query:  MMAQKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANFCRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHI
        MMAQKHLHELLE+DQEPFHLN+YIAEKRVNLKRVSPKT LQVKKRKPISTNSIFP NFCRNACFTSF PSPD RKSPLFEFRSPARNSPCKSPNAIFLHI
Subjt:  MMAQKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANFCRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHI

Query:  PARTAALLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFGHRKSSIRRHITQGETSSYNGRSSYGFWSETN
        PARTA LLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRN  NRET+ACGSG DLASFG RKSSIRR   QGETSS NGRSSYGFWSETN
Subjt:  PARTAALLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFGHRKSSIRRHITQGETSSYNGRSSYGFWSETN

Query:  EEGRSMDLGTSCSSQSEDSEEASVAYFGEDYCESPFRFVLQRSPSYGCRTPDFLSPAASPCRRNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSI
        EEG SMDLGTSCSSQSEDSEE SVAYFGEDYCESPFRFVLQRSPS+GCRTPDFLSPAASPC RNK                                   
Subjt:  EEGRSMDLGTSCSSQSEDSEEASVAYFGEDYCESPFRFVLQRSPSYGCRTPDFLSPAASPCRRNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSI

Query:  DKIRPDLKFLFKGETVNSAESLKKFQVEEDEEDKEQCSPVSVLDGPFDDSYE----DRERHGD---EDYGLECSYATVQRTKQQLLNKLRRFERLADLDP
                     E +  AESL KFQVEEDEEDKEQCSPVSVLD PFDDSY+    DRER GD   EDY +ECSYATVQRTKQQLLNKLRRFERLADLDP
Subjt:  DKIRPDLKFLFKGETVNSAESLKKFQVEEDEEDKEQCSPVSVLDGPFDDSYE----DRERHGD---EDYGLECSYATVQRTKQQLLNKLRRFERLADLDP

Query:  VELEKIMQEEEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKEVVNDAGFCKSQQFLPQDMRKLVTDLIAEEEGDRSNVDTREEVIQRVCRRLELW
        +ELEKIM EEE+ DE  Y+YFDN ECEYY+ESVQWDNEN++EWFV+EV +DA FCKS+QFLPQDMRKLV DL+AEEE DRS+ +TREEVIQRVC RLELW
Subjt:  VELEKIMQEEEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKEVVNDAGFCKSQQFLPQDMRKLVTDLIAEEEGDRSNVDTREEVIQRVCRRLELW

Query:  KEVEFNTIDMMVEEDLRKEVGEWKKNQEQRGEAASDLELAIFSLLVEELAVELAC
        KEVEFNTIDMMVEEDLRKEVGEWK+NQEQR EAA+DLELAIFSLLVEELAVELAC
Subjt:  KEVEFNTIDMMVEEDLRKEVGEWKKNQEQRGEAASDLELAIFSLLVEELAVELAC

XP_038881414.1 uncharacterized protein LOC120072951 [Benincasa hispida]3.4e-24181.52Show/hide
Query:  MAQKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANFCRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHIP
        MAQKHLHELLEEDQEPFHLN+YIAEKRVNLKRVSPKT LQVKKRKPISTNSIFP NFCRNACFTSF PSPD RKSPLFEFRSPARNSPCKSPNAIFLHIP
Subjt:  MAQKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANFCRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHIP

Query:  ARTAALLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFGHRKSSIRRHITQGETSSYNGRSSYGFWSETNE
        ARTA LLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRN  NRET+ACGSG DLASFG RKSSIRR I QGETSSYNGRSSYGFWSETNE
Subjt:  ARTAALLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFGHRKSSIRRHITQGETSSYNGRSSYGFWSETNE

Query:  EGRSMDLGTSCSSQSEDSEEASVAYFGEDYCESPFRFVLQRSPSYGCRTPDFLSPAASPCRRNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSID
        EGRSMDLGTSCSSQSEDSEE SVAYFGEDYCESPFRFVLQRSPS+GCRTPDFLSPAASPCRRNKE                                   
Subjt:  EGRSMDLGTSCSSQSEDSEEASVAYFGEDYCESPFRFVLQRSPSYGCRTPDFLSPAASPCRRNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSID

Query:  KIRPDLKFLFKGETVNSAESLKKFQVEEDEEDKEQCSPVSVLDGPFDDSY----EDRERHGD-EDYGLECSYATVQRTKQQLLNKLRRFERLADLDPVEL
                    E ++S E L KFQVEEDEEDKEQCSPVSVLD PFDDSY    +DRER  D E+Y LECSYATVQRTKQQLLNKLRRFERLADLDP+EL
Subjt:  KIRPDLKFLFKGETVNSAESLKKFQVEEDEEDKEQCSPVSVLDGPFDDSY----EDRERHGD-EDYGLECSYATVQRTKQQLLNKLRRFERLADLDPVEL

Query:  EKIMQEEEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKEVVNDAGFCKSQQFLPQDMRKLVTDLIAEEEGDRSNVDTREEVIQRVCRRLELWKEV
        EKIM  EEELDE  Y+Y DNEECEYY+ESV+WDNEN +EWFVKEV N+A FCKS+QF+P+DMRKLVTDLIAEEE DR+N DTREEVIQRVC+RLELWKEV
Subjt:  EKIMQEEEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKEVVNDAGFCKSQQFLPQDMRKLVTDLIAEEEGDRSNVDTREEVIQRVCRRLELWKEV

Query:  EFNTIDMMVEEDLRKEVGEWKKNQEQRGEAASDLELAIFSLLVEELAVELAC
        EFNTIDMMVEEDLRKEVGEWK+NQEQRGEAA+DLELAIFSLLVEELAVELAC
Subjt:  EFNTIDMMVEEDLRKEVGEWKKNQEQRGEAASDLELAIFSLLVEELAVELAC

TrEMBL top hitse value%identityAlignment
A0A0A0KFA1 Uncharacterized protein4.6e-23680Show/hide
Query:  MMAQKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANFCRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHI
        MMAQKHLHELLE+DQEPFHLN+YIAEKRVNLKRVSPKT LQVKKRKPISTNSIFP NFCRNACFTSF PSPD RKSPLFEFRSPARNSPCKSPNAIFLHI
Subjt:  MMAQKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANFCRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHI

Query:  PARTAALLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFGHRKSSIRRHITQGETSSYNGRSSYGFWSETN
        PARTA LLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRN  NRET+ACGSG DLASFG RKSSIRR   QGETSS NGRSSYGFWSETN
Subjt:  PARTAALLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFGHRKSSIRRHITQGETSSYNGRSSYGFWSETN

Query:  EEGRSMDLGTSCSSQSEDSEEASVAYFGEDYCESPFRFVLQRSPSYGCRTPDFLSPAASPCRRNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSI
        EEG SMDLGTSCSSQSEDSEE SVAYFGEDYCESPFRFVLQRSPS+GCRTPDFLSPAASPC RNK                                   
Subjt:  EEGRSMDLGTSCSSQSEDSEEASVAYFGEDYCESPFRFVLQRSPSYGCRTPDFLSPAASPCRRNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSI

Query:  DKIRPDLKFLFKGETVNSAESLKKFQVEEDEEDKEQCSPVSVLDGPFDDSYE----DRERHGD---EDYGLECSYATVQRTKQQLLNKLRRFERLADLDP
                     E +  AESL KFQVEEDEEDKEQCSPVSVLD PFDDSY+    DRER GD   EDY +ECSYATVQRTKQQLLNKLRRFERLADLDP
Subjt:  DKIRPDLKFLFKGETVNSAESLKKFQVEEDEEDKEQCSPVSVLDGPFDDSYE----DRERHGD---EDYGLECSYATVQRTKQQLLNKLRRFERLADLDP

Query:  VELEKIMQEEEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKEVVNDAGFCKSQQFLPQDMRKLVTDLIAEEEGDRSNVDTREEVIQRVCRRLELW
        +ELEKIM EEE+ DE  Y+YFDN ECEYY+ESVQWDNEN++EWFV+EV +DA FCKS+QFLPQDMRKLV DL+AEEE DRS+ +TREEVIQRVC RLELW
Subjt:  VELEKIMQEEEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKEVVNDAGFCKSQQFLPQDMRKLVTDLIAEEEGDRSNVDTREEVIQRVCRRLELW

Query:  KEVEFNTIDMMVEEDLRKEVGEWKKNQEQRGEAASDLELAIFSLLVEELAVELAC
        KEVEFNTIDMMVEEDLRKEVGEWK+NQEQR EAA+DLELAIFSLLVEELAVELAC
Subjt:  KEVEFNTIDMMVEEDLRKEVGEWKKNQEQRGEAASDLELAIFSLLVEELAVELAC

A0A1S3CHP7 uncharacterized protein LOC1035008755.1e-23579.64Show/hide
Query:  MMAQKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANFCRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHI
        MMAQKHLHELLE+DQEPFHLN+YIAEKRVNLKRVSPKT LQVKKRKPISTNSIFP NFCRNACFTSF PSPD RKSPLFEFRSPARNSPCKSPNAIFLHI
Subjt:  MMAQKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANFCRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHI

Query:  PARTAALLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFGHRKSSIRRHITQGETSSYNGRSSYGFWSETN
        PARTA LLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRN  NR T+ACGSG DLASF  RKSSIRR   QGETSS NGRSSYGFWSETN
Subjt:  PARTAALLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFGHRKSSIRRHITQGETSSYNGRSSYGFWSETN

Query:  EEGRSMDLGTSCSSQSEDSEEASVAYFGEDYCESPFRFVLQRSPSYGCRTPDFLSPAASPCRRNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSI
        EEG SMDLGTSCSSQSEDSEE SVAYFGEDYCESPFRFVLQRSPS+GCRTPDFLSPAASPCRRNK                                   
Subjt:  EEGRSMDLGTSCSSQSEDSEEASVAYFGEDYCESPFRFVLQRSPSYGCRTPDFLSPAASPCRRNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSI

Query:  DKIRPDLKFLFKGETVNSAESLKKFQVEEDEEDKEQCSPVSVLDGPFDDSYE----DRERHGD---EDYGLECSYATVQRTKQQLLNKLRRFERLADLDP
                     E  + AESL KFQVEEDEEDKEQCSPVSVLD PFDDSY+    +RER GD   E+Y +ECSYATVQRTKQQLLNKLRRFERLADLDP
Subjt:  DKIRPDLKFLFKGETVNSAESLKKFQVEEDEEDKEQCSPVSVLDGPFDDSYE----DRERHGD---EDYGLECSYATVQRTKQQLLNKLRRFERLADLDP

Query:  VELEKIMQEEEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKEVVNDAGFCKSQQFLPQDMRKLVTDLIAEEEGDRSNVDTREEVIQRVCRRLELW
        +ELEKIM  EEELDE  Y+YFDNEECEYY+ESVQWDNEN++EWFVKEV ++  FCKS+QFLPQD+RKLV DLIAEEE DRS+ +TREEVI+RVC RLELW
Subjt:  VELEKIMQEEEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKEVVNDAGFCKSQQFLPQDMRKLVTDLIAEEEGDRSNVDTREEVIQRVCRRLELW

Query:  KEVEFNTIDMMVEEDLRKEVGEWKKNQEQRGEAASDLELAIFSLLVEELAVELAC
        KEVEFNTIDMMVEEDLRKEVGEWK+NQEQRGEAA+DLELAIFSLLVEELAVELAC
Subjt:  KEVEFNTIDMMVEEDLRKEVGEWKKNQEQRGEAASDLELAIFSLLVEELAVELAC

A0A5A7SKT4 Histone-lysine N-methyltransferase SETD1B-like5.1e-23579.64Show/hide
Query:  MMAQKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANFCRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHI
        MMAQKHLHELLE+DQEPFHLN+YIAEKRVNLKRVSPKT LQVKKRKPISTNSIFP NFCRNACFTSF PSPD RKSPLFEFRSPARNSPCKSPNAIFLHI
Subjt:  MMAQKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANFCRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHI

Query:  PARTAALLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFGHRKSSIRRHITQGETSSYNGRSSYGFWSETN
        PARTA LLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRN  NR T+ACGSG DLASF  RKSSIRR   QGETSS NGRSSYGFWSETN
Subjt:  PARTAALLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFGHRKSSIRRHITQGETSSYNGRSSYGFWSETN

Query:  EEGRSMDLGTSCSSQSEDSEEASVAYFGEDYCESPFRFVLQRSPSYGCRTPDFLSPAASPCRRNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSI
        EEG SMDLGTSCSSQSEDSEE SVAYFGEDYCESPFRFVLQRSPS+GCRTPDFLSPAASPCRRNK                                   
Subjt:  EEGRSMDLGTSCSSQSEDSEEASVAYFGEDYCESPFRFVLQRSPSYGCRTPDFLSPAASPCRRNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSI

Query:  DKIRPDLKFLFKGETVNSAESLKKFQVEEDEEDKEQCSPVSVLDGPFDDSYE----DRERHGD---EDYGLECSYATVQRTKQQLLNKLRRFERLADLDP
                     E  + AESL KFQVEEDEEDKEQCSPVSVLD PFDDSY+    +RER GD   E+Y +ECSYATVQRTKQQLLNKLRRFERLADLDP
Subjt:  DKIRPDLKFLFKGETVNSAESLKKFQVEEDEEDKEQCSPVSVLDGPFDDSYE----DRERHGD---EDYGLECSYATVQRTKQQLLNKLRRFERLADLDP

Query:  VELEKIMQEEEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKEVVNDAGFCKSQQFLPQDMRKLVTDLIAEEEGDRSNVDTREEVIQRVCRRLELW
        +ELEKIM  EEELDE  Y+YFDNEECEYY+ESVQWDNEN++EWFVKEV ++  FCKS+QFLPQD+RKLV DLIAEEE DRS+ +TREEVI+RVC RLELW
Subjt:  VELEKIMQEEEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKEVVNDAGFCKSQQFLPQDMRKLVTDLIAEEEGDRSNVDTREEVIQRVCRRLELW

Query:  KEVEFNTIDMMVEEDLRKEVGEWKKNQEQRGEAASDLELAIFSLLVEELAVELAC
        KEVEFNTIDMMVEEDLRKEVGEWK+NQEQRGEAA+DLELAIFSLLVEELAVELAC
Subjt:  KEVEFNTIDMMVEEDLRKEVGEWKKNQEQRGEAASDLELAIFSLLVEELAVELAC

A0A6J1CPH7 uncharacterized protein LOC1110135406.4e-22276.41Show/hide
Query:  MMAQKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANFCRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHI
        MMAQKHLH+LLEEDQEPFHLNSYIAEKRVNLKRVSPK+DLQV KRKPIST SIF  NFCRNACFTSFQPSPDLRKSPLFEF SPARN    SPNAIFLH+
Subjt:  MMAQKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANFCRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHI

Query:  PARTAALLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFGHRKSSIRRHITQGETSSYNGRSSYGFWSETN
        PARTAALLLEAALKIHKQKSS K KK+QIKNQG ARFGSVLKRLTLRNRN TNR+++ACGSGGDLASFG RKSSIRR +TQGETSSYNGRSSYGFWSE+N
Subjt:  PARTAALLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFGHRKSSIRRHITQGETSSYNGRSSYGFWSETN

Query:  EEGRSMDLGTSCSSQSEDSEEASVAYFGEDYCESPFRFVLQRSPSYGCRTPDFLSPAASPCRRNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSI
        EE RSMDLGTSCSSQSEDSEE SVAY G DYCESPFRFVLQRSPSYGCRTP F SPA SPCRRNKE                                  
Subjt:  EEGRSMDLGTSCSSQSEDSEEASVAYFGEDYCESPFRFVLQRSPSYGCRTPDFLSPAASPCRRNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSI

Query:  DKIRPDLKFLFKGETVNSAESLKKFQVEEDEEDKEQCSPVSVLDGPFDDSY----EDRERHGDEDYGLECSYATVQRTKQQLLNKLRRFERLADLDPVEL
                     +T++  ESLKKFQV EDEEDKEQCSPVS+LD PFDDSY    +DR R   EDY LECSYA VQRTKQQLLNKLRRFERLADLDP+EL
Subjt:  DKIRPDLKFLFKGETVNSAESLKKFQVEEDEEDKEQCSPVSVLDGPFDDSY----EDRERHGDEDYGLECSYATVQRTKQQLLNKLRRFERLADLDPVEL

Query:  EKIMQEEEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKEVVNDAGFCKSQQFLPQDMRKLVTDLIAEEEGDRSNVDTREEVIQRVCRRLELWKEV
        EKIM +E++  E+ YDYF +EECEYY   VQW NEN++EWFVKEV +D   CKSQ+FLPQDMRKLV DLIAEEE D+ N +TREEVIQRVC+RLELWKEV
Subjt:  EKIMQEEEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKEVVNDAGFCKSQQFLPQDMRKLVTDLIAEEEGDRSNVDTREEVIQRVCRRLELWKEV

Query:  EFNTIDMMVEEDLRKEVGEWKKNQEQRGEAASDLELAIFSLLVEELAVELA
        EFNTIDMMVEEDL+KEV EWKKNQEQRGEAA DLELAIFSLLVEELAVELA
Subjt:  EFNTIDMMVEEDLRKEVGEWKKNQEQRGEAASDLELAIFSLLVEELAVELA

A0A6J1ECT2 uncharacterized protein LOC111433152 isoform X17.3e-22676.03Show/hide
Query:  MMAQKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANFCRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHI
        MM  KHLH+LLEEDQEPFHLN+YIAEKRVNLKRVS KTDLQVKKRKPISTNSIFP NFC+NACFTSFQPSPD RKSPLF+FRSPAR+SPCKSPNAIFLHI
Subjt:  MMAQKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANFCRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHI

Query:  PARTAALLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFGHRKSSIRRHITQGETSSYNGRSSYGFWSETN
        PARTAALLLEAALKIHKQKSS K KK+QIKNQGFARFGSVLKRLTLRNRN  NRET  CG G +LASFG RKSS+RRHI QGETSS+NGRSSYGFWSETN
Subjt:  PARTAALLLEAALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFGHRKSSIRRHITQGETSSYNGRSSYGFWSETN

Query:  EEGRSMDLGTSCSSQSEDSEEASVAYFGEDYCESPFRFVLQRSPSYGCRTPDFLSPAASPCRRNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSI
        EEGRSMDLGTSCSSQSEDSEE SVAYFGEDYCESPFRFVLQRSPS+GCRTPDF SPAASPC R KE                                  
Subjt:  EEGRSMDLGTSCSSQSEDSEEASVAYFGEDYCESPFRFVLQRSPSYGCRTPDFLSPAASPCRRNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSI

Query:  DKIRPDLKFLFKGETVNSAESLKKFQVEEDEEDKEQCSPVSVLDGPFDDSYE----DRERHGD-------EDYGLECSYATVQRTKQQLLNKLRRFERLA
                     E VN+AESLKK Q E+DEEDKEQCSPVSVLD PFD SY+    DRER GD       EDYGLECSYATVQRTKQQLLNKLRRFE+LA
Subjt:  DKIRPDLKFLFKGETVNSAESLKKFQVEEDEEDKEQCSPVSVLDGPFDDSYE----DRERHGD-------EDYGLECSYATVQRTKQQLLNKLRRFERLA

Query:  DLDPVELEKIMQEEEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKEVVNDAGFCKSQQFLPQDMRKLVTDLIAEEEGDRSNVDTREEVIQRVCRR
        DLDP+ELEK+M  EEEL+E  +DYF+NEECEYYDES Q  NEN +E FVKEV + A FCKS+ FLP+DMRKLVTDL++EEE DRSN +TRE+VIQRVC+R
Subjt:  DLDPVELEKIMQEEEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKEVVNDAGFCKSQQFLPQDMRKLVTDLIAEEEGDRSNVDTREEVIQRVCRR

Query:  LELWKEVEFNTIDMMVEEDLRKEVGEWKKNQEQRGEAASDLELAIFSLLVEELAVELAC
        LE+WKEV+FNTIDMMVEEDLRKEV EWKKNQ QRGE A+DLE+AIFSLLVEELAVEL+C
Subjt:  LELWKEVEFNTIDMMVEEDLRKEVGEWKKNQEQRGEAASDLELAIFSLLVEELAVELAC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G36420.1 unknown protein5.6e-6136.12Show/hide
Query:  QKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANF-CRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHIPA
        +KHLHE LE+DQEPFHLN YI     NL+     +D++VKKRK  +  +  P  F C N+CF +   SPD RKSPLFE RSP +         +FL IPA
Subjt:  QKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANF-CRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHIPA

Query:  RTAALLLEAALKIHKQKS-SSKTKKSQIKNQGFARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFGHRKSSIRRHITQGETSSYNGRSSYGFWSETNE
        RTAA+LL+AA +I KQ+S  +KT K++ +  GF  FGSVLK LT R                               IT+                  N 
Subjt:  RTAALLLEAALKIHKQKS-SSKTKKSQIKNQGFARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFGHRKSSIRRHITQGETSSYNGRSSYGFWSETNE

Query:  EGRSMDLGTSCSSQSEDSEEASVAYFGEDYCESPFRFVLQRSP-SYGCRTPDFLSPAASPCRRNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSI
        +G ++ L       S    E  V    + +CESPF FVLQ +P S G +TP F S A SP RR+ E                       +  SD+++ S+
Subjt:  EGRSMDLGTSCSSQSEDSEEASVAYFGEDYCESPFRFVLQRSP-SYGCRTPDFLSPAASPCRRNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSI

Query:  DKIRPDLKFLFKGETVNSAESLKKFQVEED--EEDKEQCSPVSVLDGPFDDSYEDRERH---GDEDYGLECSYATVQRTKQQLLNKLRRFERLADLDPVE
        +K+R                     Q EED  EEDKEQCSPVSVLD P ++  ED + H    D    L CS+  VQR K++LL KLRRFE+LA LDPVE
Subjt:  DKIRPDLKFLFKGETVNSAESLKKFQVEED--EEDKEQCSPVSVLDGPFDDSYEDRERH---GDEDYGLECSYATVQRTKQQLLNKLRRFERLADLDPVE

Query:  LEKIMQEEEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKEVVNDAGFCKSQQFLPQDMRKLVTDLIAEEEGDRSNVDTREEVIQRVCRRLELWKE
        LE  M EEE+           EE E Y+ES + DN       ++   +D  +    + + ++ R       AE+E  + N + +++       R+ L  E
Subjt:  LEKIMQEEEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKEVVNDAGFCKSQQFLPQDMRKLVTDLIAEEEGDRSNVDTREEVIQRVCRRLELWKE

Query:  VEFNTIDMMVEEDLRKEVGEWKKNQEQRGEAASDLELAIFSLLVEELAVEL
         +   +D +V +DLR+E GEW ++  +  EA SDLE +IF +L++E + EL
Subjt:  VEFNTIDMMVEEDLRKEVGEWKKNQEQRGEAASDLELAIFSLLVEELAVEL

AT5G03670.1 unknown protein9.6e-7739.7Show/hide
Query:  MMAQKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANFCRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHI
        M +Q+HL +LLEEDQEPF L SYI+++R  +   +  T LQVKKR+PIS N+  P+ FCRNACF S + SPD +KSPLFE +SP R     S NAIF++I
Subjt:  MMAQKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANFCRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHI

Query:  PARTAALLLEAALKIHKQKSSSKTKKSQIKNQG--FARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFG------------------------HRKSS
        PARTA++LLEAA++I  QK SS+  K++ +N G  F  FGSVLK+LT    N   RE       G ++S                             +S
Subjt:  PARTAALLLEAALKIHKQKSSSKTKKSQIKNQG--FARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFG------------------------HRKSS

Query:  IRRHITQGETSSYNGRSSYGFWSETNEEG-RSMDL----GTSCSSQSEDSEEASVAYFGED------YCESPFRFVLQRSPSY-GCRTPDFLSPAASPCR
         + H    ET      SS G WSE+   G RS D+      S SS+S  S+E ++   G+D      +CESPF FVLQ  PS  G RTP+F SPAASP  
Subjt:  IRRHITQGETSSYNGRSSYGFWSETNEEG-RSMDL----GTSCSSQSEDSEEASVAYFGED------YCESPFRFVLQRSPSY-GCRTPDFLSPAASPCR

Query:  RNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSIDKIRPDLKFLFKGETVNSAESLKKFQVEEDEEDKEQCSPVSVLDGPFDDSYEDRERHGDEDY
          +  C                             H ++K   ++            E LKK ++EE+EE+KEQ SPVSVLD PF D  +D + H D D 
Subjt:  RNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSIDKIRPDLKFLFKGETVNSAESLKKFQVEEDEEDKEQCSPVSVLDGPFDDSYEDRERHGDEDY

Query:  GLECSYATVQRTKQQLLNKLRRFERLADLDPVELEKIMQE---EEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKEVVNDAGFCKSQQFLPQDMR
         +  S+ +VQ+ K  LL KL RFE+LA LDP+ELEK M +   EEE +E+  +      CE   + V       L+ + +E+V           +P+ + 
Subjt:  GLECSYATVQRTKQQLLNKLRRFERLADLDPVELEKIMQE---EEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKEVVNDAGFCKSQQFLPQDMR

Query:  KLVTDLIAEEEGDRSNVDTREE---VIQRVCRRLELWKEVEFNTIDMMVEEDLRKE-VGEWK-KNQEQRGEAASDLELAIFSLLVEELAVEL
         L++DL AEE    S++D   E   V +RVC RL  W++VE NTIDMMVE D R E +G W+ KN     E   D+E  IF  LVEEL+ ++
Subjt:  KLVTDLIAEEEGDRSNVDTREE---VIQRVCRRLELWKEVEFNTIDMMVEEDLRKE-VGEWK-KNQEQRGEAASDLELAIFSLLVEELAVEL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGCTCAAAAGCACTTGCACGAGTTGCTGGAAGAGGATCAAGAGCCCTTTCATTTGAACAGCTACATTGCGGAGAAACGGGTTAATCTCAAAAGGGTTTCGCCCAA
AACCGATTTGCAAGTCAAGAAACGAAAACCCATCTCCACAAATTCAATTTTCCCGGCAAATTTCTGTAGAAATGCTTGTTTTACGTCCTTCCAGCCCTCGCCGGACCTCA
GGAAGTCTCCGCTCTTTGAGTTTCGTTCTCCCGCGAGGAATAGCCCCTGCAAGAGCCCCAATGCCATTTTCCTCCATATCCCTGCTAGAACGGCCGCCTTGCTTCTTGAA
GCTGCTCTCAAGATTCATAAACAGAAATCGTCTTCCAAAACTAAAAAATCCCAGATTAAGAATCAAGGATTTGCGCGATTTGGGTCGGTTTTAAAGAGATTAACTCTTCG
AAATCGAAACACCACCAACCGTGAAACTCAAGCTTGTGGTAGTGGAGGGGATTTAGCGTCGTTTGGGCATAGAAAAAGCTCCATTCGAAGGCACATAACGCAGGGTGAGA
CGAGTTCCTACAATGGAAGGTCTAGCTATGGCTTCTGGTCTGAAACCAACGAAGAAGGAAGATCAATGGATTTGGGGACTTCATGTAGTAGCCAATCTGAGGATTCAGAG
GAGGCTTCTGTTGCTTATTTTGGAGAAGATTACTGTGAAAGCCCTTTCCGATTTGTTCTTCAGCGAAGCCCCTCATACGGTTGCCGGACGCCGGATTTTCTATCGCCGGC
GGCCTCTCCTTGCCGCCGTAACAAAGAGTATTGCCTTACTGCTTCCTTTTTAGAATACGGTAACTTTTTGCAGTACAAGTTTCTGACCAATTCTTTTGAACTTTTATCTG
ATCAAAGTGATCACTCAATTGACAAAATTAGGCCTGATTTGAAGTTTCTATTCAAGGGCGAAACAGTAAACAGTGCAGAAAGCTTGAAGAAATTTCAGGTCGAAGAAGAC
GAAGAAGATAAGGAGCAATGTAGTCCTGTGTCTGTATTGGACGGTCCTTTTGATGACAGTTATGAAGATCGGGAGAGGCACGGCGACGAAGATTACGGTTTGGAATGCAG
CTATGCAACTGTCCAAAGAACAAAGCAGCAACTGTTAAACAAGCTTCGCAGATTCGAGAGACTCGCCGACTTGGACCCGGTTGAACTTGAGAAAATAATGCAAGAGGAAG
AAGAACTAGACGAGAAGGTTTACGACTACTTCGATAACGAAGAATGTGAATATTACGACGAGTCAGTTCAATGGGATAACGAAAACAACCTAGAATGGTTTGTGAAAGAG
GTAGTGAATGATGCAGGCTTCTGTAAATCCCAACAGTTTCTCCCTCAAGACATGAGGAAACTCGTCACCGATCTCATTGCCGAAGAAGAAGGCGATCGAAGCAATGTCGA
CACGAGAGAGGAGGTGATACAAAGGGTTTGCAGGAGGTTGGAGCTGTGGAAAGAGGTGGAATTCAACACCATAGACATGATGGTGGAAGAAGATTTGAGGAAGGAGGTTG
GTGAGTGGAAGAAAAACCAGGAGCAGAGAGGAGAGGCAGCCAGTGATTTGGAGCTTGCAATCTTCAGCCTGTTGGTGGAGGAATTGGCTGTAGAACTTGCTTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGATGGCTCAAAAGCACTTGCACGAGTTGCTGGAAGAGGATCAAGAGCCCTTTCATTTGAACAGCTACATTGCGGAGAAACGGGTTAATCTCAAAAGGGTTTCGCCCAA
AACCGATTTGCAAGTCAAGAAACGAAAACCCATCTCCACAAATTCAATTTTCCCGGCAAATTTCTGTAGAAATGCTTGTTTTACGTCCTTCCAGCCCTCGCCGGACCTCA
GGAAGTCTCCGCTCTTTGAGTTTCGTTCTCCCGCGAGGAATAGCCCCTGCAAGAGCCCCAATGCCATTTTCCTCCATATCCCTGCTAGAACGGCCGCCTTGCTTCTTGAA
GCTGCTCTCAAGATTCATAAACAGAAATCGTCTTCCAAAACTAAAAAATCCCAGATTAAGAATCAAGGATTTGCGCGATTTGGGTCGGTTTTAAAGAGATTAACTCTTCG
AAATCGAAACACCACCAACCGTGAAACTCAAGCTTGTGGTAGTGGAGGGGATTTAGCGTCGTTTGGGCATAGAAAAAGCTCCATTCGAAGGCACATAACGCAGGGTGAGA
CGAGTTCCTACAATGGAAGGTCTAGCTATGGCTTCTGGTCTGAAACCAACGAAGAAGGAAGATCAATGGATTTGGGGACTTCATGTAGTAGCCAATCTGAGGATTCAGAG
GAGGCTTCTGTTGCTTATTTTGGAGAAGATTACTGTGAAAGCCCTTTCCGATTTGTTCTTCAGCGAAGCCCCTCATACGGTTGCCGGACGCCGGATTTTCTATCGCCGGC
GGCCTCTCCTTGCCGCCGTAACAAAGAGTATTGCCTTACTGCTTCCTTTTTAGAATACGGTAACTTTTTGCAGTACAAGTTTCTGACCAATTCTTTTGAACTTTTATCTG
ATCAAAGTGATCACTCAATTGACAAAATTAGGCCTGATTTGAAGTTTCTATTCAAGGGCGAAACAGTAAACAGTGCAGAAAGCTTGAAGAAATTTCAGGTCGAAGAAGAC
GAAGAAGATAAGGAGCAATGTAGTCCTGTGTCTGTATTGGACGGTCCTTTTGATGACAGTTATGAAGATCGGGAGAGGCACGGCGACGAAGATTACGGTTTGGAATGCAG
CTATGCAACTGTCCAAAGAACAAAGCAGCAACTGTTAAACAAGCTTCGCAGATTCGAGAGACTCGCCGACTTGGACCCGGTTGAACTTGAGAAAATAATGCAAGAGGAAG
AAGAACTAGACGAGAAGGTTTACGACTACTTCGATAACGAAGAATGTGAATATTACGACGAGTCAGTTCAATGGGATAACGAAAACAACCTAGAATGGTTTGTGAAAGAG
GTAGTGAATGATGCAGGCTTCTGTAAATCCCAACAGTTTCTCCCTCAAGACATGAGGAAACTCGTCACCGATCTCATTGCCGAAGAAGAAGGCGATCGAAGCAATGTCGA
CACGAGAGAGGAGGTGATACAAAGGGTTTGCAGGAGGTTGGAGCTGTGGAAAGAGGTGGAATTCAACACCATAGACATGATGGTGGAAGAAGATTTGAGGAAGGAGGTTG
GTGAGTGGAAGAAAAACCAGGAGCAGAGAGGAGAGGCAGCCAGTGATTTGGAGCTTGCAATCTTCAGCCTGTTGGTGGAGGAATTGGCTGTAGAACTTGCTTGTTGA
Protein sequenceShow/hide protein sequence
MMAQKHLHELLEEDQEPFHLNSYIAEKRVNLKRVSPKTDLQVKKRKPISTNSIFPANFCRNACFTSFQPSPDLRKSPLFEFRSPARNSPCKSPNAIFLHIPARTAALLLE
AALKIHKQKSSSKTKKSQIKNQGFARFGSVLKRLTLRNRNTTNRETQACGSGGDLASFGHRKSSIRRHITQGETSSYNGRSSYGFWSETNEEGRSMDLGTSCSSQSEDSE
EASVAYFGEDYCESPFRFVLQRSPSYGCRTPDFLSPAASPCRRNKEYCLTASFLEYGNFLQYKFLTNSFELLSDQSDHSIDKIRPDLKFLFKGETVNSAESLKKFQVEED
EEDKEQCSPVSVLDGPFDDSYEDRERHGDEDYGLECSYATVQRTKQQLLNKLRRFERLADLDPVELEKIMQEEEELDEKVYDYFDNEECEYYDESVQWDNENNLEWFVKE
VVNDAGFCKSQQFLPQDMRKLVTDLIAEEEGDRSNVDTREEVIQRVCRRLELWKEVEFNTIDMMVEEDLRKEVGEWKKNQEQRGEAASDLELAIFSLLVEELAVELAC