; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg15487 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg15487
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionPentatricopeptide repeat (PPR) superfamily protein isoform 2
Genome locationCarg_Chr06:9987496..9994220
RNA-Seq ExpressionCarg15487
SyntenyCarg15487
Gene Ontology termsNA
InterPro domainsIPR037119 - Haem oxygenase HugZ-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597636.1 hypothetical protein SDJN03_10816, partial [Cucurbita argyrosperma subsp. sororia]4.1e-23193.84Show/hide
Query:  MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA
        MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA
Subjt:  MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA

Query:  AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDY
        AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDY
Subjt:  AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDY

Query:  DYVEAVETDLAEFPVDWGVPDVASL-----------------------HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESD
        DYVEAVETDLAEFPVDWGVPDVASL                       HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESE D
Subjt:  DYVEAVETDLAEFPVDWGVPDVASL-----------------------HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESD

Query:  GSSRRSTLYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTF
        GSSRRSTLYRLEIMRIELFSVYGVQSEI L DFQRAEPDVLMHSTAEIVERFSEKGFRCNI+LKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTF
Subjt:  GSSRRSTLYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTF

Query:  RFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSHEGMD
        RFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSHEGMD
Subjt:  RFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSHEGMD

KAG7029078.1 hypothetical protein SDJN02_10261 [Cucurbita argyrosperma subsp. argyrosperma]1.3e-237100Show/hide
Query:  MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA
        MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA
Subjt:  MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA

Query:  AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDY
        AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDY
Subjt:  AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDY

Query:  DYVEAVETDLAEFPVDWGVPDVASLHPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESDGSSRRSTLYRLEIMRIELFSVYG
        DYVEAVETDLAEFPVDWGVPDVASLHPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESDGSSRRSTLYRLEIMRIELFSVYG
Subjt:  DYVEAVETDLAEFPVDWGVPDVASLHPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESDGSSRRSTLYRLEIMRIELFSVYG

Query:  VQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTFRFPFKTRATSEVAAEKQIQQLLF
        VQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTFRFPFKTRATSEVAAEKQIQQLLF
Subjt:  VQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTFRFPFKTRATSEVAAEKQIQQLLF

Query:  PRSRRKKLRSHEGMD
        PRSRRKKLRSHEGMD
Subjt:  PRSRRKKLRSHEGMD

XP_022932761.1 uncharacterized protein At3g49140-like isoform X1 [Cucurbita moschata]4.1e-23193.38Show/hide
Query:  MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA
        MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA
Subjt:  MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA

Query:  AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDY
        AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMD+KMYESRR+IRDY AGDSGNGDIVPFDY
Subjt:  AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDY

Query:  DYVEAVETDLAEFPVDWGVPDVASL-----------------------HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESD
        DY+EAVETDLAEFPVDWGVPDVASL                       HPSNGVSMLGCLRPVYADEESYIRRLFYFE+SEGYNTQWKEGETLSFESESD
Subjt:  DYVEAVETDLAEFPVDWGVPDVASL-----------------------HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESD

Query:  GSSRRSTLYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTF
        GSSRRSTLYRLEIMRIELFSVYGVQSEIGL DFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTF
Subjt:  GSSRRSTLYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTF

Query:  RFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSHEGMD
        RFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSHEGMD
Subjt:  RFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSHEGMD

XP_022932762.1 uncharacterized protein At3g49140-like isoform X2 [Cucurbita moschata]2.9e-22993.15Show/hide
Query:  MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA
        MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA
Subjt:  MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA

Query:  AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDY
        AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMD+KMYESRR+IRDY AGDSGNGDIVPFDY
Subjt:  AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDY

Query:  DYVEAVETDLAEFPVDWGVPDVASL-----------------------HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESD
        DY+EAVETDLAEFPVDWGVPDVASL                       HPSNGVSMLGCLRPVYADEESYIRRLFYFE+SEGYNTQWK GETLSFESESD
Subjt:  DYVEAVETDLAEFPVDWGVPDVASL-----------------------HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESD

Query:  GSSRRSTLYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTF
        GSSRRSTLYRLEIMRIELFSVYGVQSEIGL DFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTF
Subjt:  GSSRRSTLYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTF

Query:  RFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSHEGMD
        RFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSHEGMD
Subjt:  RFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSHEGMD

XP_023538710.1 uncharacterized protein At3g49140-like isoform X1 [Cucurbita pepo subsp. pepo]2.6e-23092.69Show/hide
Query:  MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA
        MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAAD+PDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA
Subjt:  MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA

Query:  AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDY
        AELARTTVEVNSNAL+LFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMD+KMYESRR+IRDY AGDSGNGDIVPFDY
Subjt:  AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDY

Query:  DYVEAVETDLAEFPVDWGVPDVASL-----------------------HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESD
        DY+EAVETDLAEFPVDWGVPDVASL                       HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNT+WKEGETLSFESESD
Subjt:  DYVEAVETDLAEFPVDWGVPDVASL-----------------------HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESD

Query:  GSSRRSTLYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTF
         SSRRSTLYRLEIMRIELFSVYGVQSEIGLQDFQRAEPD+LMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTF
Subjt:  GSSRRSTLYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTF

Query:  RFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSHEGMD
        RFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSHEGMD
Subjt:  RFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSHEGMD

TrEMBL top hitse value%identityAlignment
A0A6J1C800 uncharacterized protein At3g49140 isoform X14.9e-19880.73Show/hide
Query:  MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA
        MAIAVASSL FEGACCSTS+AFTSSWNR S DVRGRNP+FGST+LHWLSKGRD  LSKVSVAADYPDSVPDSSS LTN+GYHPLEDLKV KRAR+TELTA
Subjt:  MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA

Query:  AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDY
        AE+ART VEVNSNAL+LFPGTVH+EPHE VSWDEFQYV+DDYGDL+FE+FD+VNMLEDR A NPVN LIGMDM+MYESRR++ DYNA DSGNGD+VPFDY
Subjt:  AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDY

Query:  DYVEAVETDLAEFPVDWGVPDVASL-----------------------HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWK--EGETLSFESE
        DY+E VETDL++ PVDWGVPDV+SL                       HPSNGVS+LGCLRP +ADEESYIRRLFYFE SEGY T+WK  +GE LSFES+
Subjt:  DYVEAVETDLAEFPVDWGVPDVASL-----------------------HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWK--EGETLSFESE

Query:  SDGSSRRSTLYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVR
        SD SS+RSTLYRLEIMRIELFSVYGVQ+EI LQDFQ AEPD+L+HSTAEIVE FSEKG RCNIALKA CKKRGL VEDAIL+GVDSLG+DVRVCFGTEVR
Subjt:  SDGSSRRSTLYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVR

Query:  TFRFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSH
        TFRFPFK RATSEVAAEKQIQQLLFPRSRRKKLRSH
Subjt:  TFRFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSH

A0A6J1EXA5 uncharacterized protein At3g49140-like isoform X12.0e-23193.38Show/hide
Query:  MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA
        MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA
Subjt:  MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA

Query:  AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDY
        AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMD+KMYESRR+IRDY AGDSGNGDIVPFDY
Subjt:  AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDY

Query:  DYVEAVETDLAEFPVDWGVPDVASL-----------------------HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESD
        DY+EAVETDLAEFPVDWGVPDVASL                       HPSNGVSMLGCLRPVYADEESYIRRLFYFE+SEGYNTQWKEGETLSFESESD
Subjt:  DYVEAVETDLAEFPVDWGVPDVASL-----------------------HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESD

Query:  GSSRRSTLYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTF
        GSSRRSTLYRLEIMRIELFSVYGVQSEIGL DFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTF
Subjt:  GSSRRSTLYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTF

Query:  RFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSHEGMD
        RFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSHEGMD
Subjt:  RFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSHEGMD

A0A6J1EXN6 uncharacterized protein At3g49140-like isoform X21.4e-22993.15Show/hide
Query:  MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA
        MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA
Subjt:  MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA

Query:  AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDY
        AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMD+KMYESRR+IRDY AGDSGNGDIVPFDY
Subjt:  AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDY

Query:  DYVEAVETDLAEFPVDWGVPDVASL-----------------------HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESD
        DY+EAVETDLAEFPVDWGVPDVASL                       HPSNGVSMLGCLRPVYADEESYIRRLFYFE+SEGYNTQWK GETLSFESESD
Subjt:  DYVEAVETDLAEFPVDWGVPDVASL-----------------------HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESD

Query:  GSSRRSTLYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTF
        GSSRRSTLYRLEIMRIELFSVYGVQSEIGL DFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTF
Subjt:  GSSRRSTLYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTF

Query:  RFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSHEGMD
        RFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSHEGMD
Subjt:  RFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSHEGMD

A0A6J1I3M8 uncharacterized protein At3g49140-like isoform X21.9e-22691.78Show/hide
Query:  MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA
        MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGST+LHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA
Subjt:  MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA

Query:  AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDY
        AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRR+IRDYNAGDSGNGDIVPFDY
Subjt:  AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDY

Query:  DYVEAVETDLAEFPVDWGVPDVASL-----------------------HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESD
        DY+E VETDLAEFPVDWGVPDVASL                       HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNT+WK GETL+FESESD
Subjt:  DYVEAVETDLAEFPVDWGVPDVASL-----------------------HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESD

Query:  GSSRRSTLYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTF
         SSRRSTLYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCF TEVRTF
Subjt:  GSSRRSTLYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTF

Query:  RFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSHEGMD
        RFPFKTRATSEVAAEKQIQ+LLFP+SRRK LRSHEG+D
Subjt:  RFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSHEGMD

A0A6J1I4V8 uncharacterized protein At3g49140-like isoform X12.7e-22892.01Show/hide
Query:  MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA
        MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGST+LHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA
Subjt:  MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTA

Query:  AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDY
        AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRR+IRDYNAGDSGNGDIVPFDY
Subjt:  AELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDY

Query:  DYVEAVETDLAEFPVDWGVPDVASL-----------------------HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESD
        DY+E VETDLAEFPVDWGVPDVASL                       HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNT+WKEGETL+FESESD
Subjt:  DYVEAVETDLAEFPVDWGVPDVASL-----------------------HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESD

Query:  GSSRRSTLYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTF
         SSRRSTLYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCF TEVRTF
Subjt:  GSSRRSTLYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTF

Query:  RFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSHEGMD
        RFPFKTRATSEVAAEKQIQ+LLFP+SRRK LRSHEG+D
Subjt:  RFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSHEGMD

SwissProt top hitse value%identityAlignment
Q0WMN5 Uncharacterized protein At3g491405.8e-3927.96Show/hide
Query:  SKVSVAADYPDSVPDSSSSLTNKGYHPLEDLK--VRKRARNTELTAAELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVN
        ++    A+Y DS  D         YHP E+++  + +   ++ L+ AE  RT +EVN+   ++  G++ +  HE++ W +  Y+ D  G+L+F+V +D +
Subjt:  SKVSVAADYPDSVPDSSSSLTNKGYHPLEDLK--VRKRARNTELTAAELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVN

Query:  MLEDRAAPNP-VNVLIGMD-MKMYESRRVI----RDYNAGDSGNGD----IVPFDYDYVEAV---------------------ETDLAEFPVDWG-----
        +++   + N  V V++G D M+M +   ++     D+   D  +GD        D D  E V                     ++D  E   DW      
Subjt:  MLEDRAAPNP-VNVLIGMD-MKMYESRRVI----RDYNAGDSGNGD----IVPFDYDYVEAV---------------------ETDLAEFPVDWG-----

Query:  -----------VPDVASL-------HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETL-----------SFESESDGSSRRS-----T
                   + +VAS         PS G+++ G L  +  ++ S I++     +S       K+ E L             ESE D S          
Subjt:  -----------VPDVASL-------HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETL-----------SFESESDGSSRRS-----T

Query:  LYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFC-KKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTFRFPFKT
         Y+LE++RI+L +  G Q+E+ ++D ++A+PD + H++AEI+ R  E G +   ALK+ C +   ++ E+  L+G+DSLG D+R+C G ++ + RF F T
Subjt:  LYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFC-KKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTFRFPFKT

Query:  RATSEVAAEKQIQQLLFPRSRR
        RATSE  AE QI++LLFP++ +
Subjt:  RATSEVAAEKQIQQLLFPRSRR

Arabidopsis top hitse value%identityAlignment
AT3G49140.1 Pentatricopeptide repeat (PPR) superfamily protein4.1e-4027.96Show/hide
Query:  SKVSVAADYPDSVPDSSSSLTNKGYHPLEDLK--VRKRARNTELTAAELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVN
        ++    A+Y DS  D         YHP E+++  + +   ++ L+ AE  RT +EVN+   ++  G++ +  HE++ W +  Y+ D  G+L+F+V +D +
Subjt:  SKVSVAADYPDSVPDSSSSLTNKGYHPLEDLK--VRKRARNTELTAAELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVN

Query:  MLEDRAAPNP-VNVLIGMD-MKMYESRRVI----RDYNAGDSGNGD----IVPFDYDYVEAV---------------------ETDLAEFPVDWG-----
        +++   + N  V V++G D M+M +   ++     D+   D  +GD        D D  E V                     ++D  E   DW      
Subjt:  MLEDRAAPNP-VNVLIGMD-MKMYESRRVI----RDYNAGDSGNGD----IVPFDYDYVEAV---------------------ETDLAEFPVDWG-----

Query:  -----------VPDVASL-------HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETL-----------SFESESDGSSRRS-----T
                   + +VAS         PS G+++ G L  +  ++ S I++     +S       K+ E L             ESE D S          
Subjt:  -----------VPDVASL-------HPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETL-----------SFESESDGSSRRS-----T

Query:  LYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFC-KKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTFRFPFKT
         Y+LE++RI+L +  G Q+E+ ++D ++A+PD + H++AEI+ R  E G +   ALK+ C +   ++ E+  L+G+DSLG D+R+C G ++ + RF F T
Subjt:  LYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFC-KKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTFRFPFKT

Query:  RATSEVAAEKQIQQLLFPRSRR
        RATSE  AE QI++LLFP++ +
Subjt:  RATSEVAAEKQIQQLLFPRSRR

AT3G59300.1 Pentatricopeptide repeat (PPR) superfamily protein3.9e-11550.93Show/hide
Query:  ASSLAFEGACCSTSHAF--TSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTAAEL
        +SS+ ++    + +  F    S N S    R + P FGS   H  S G D  L+KVSVAADY DSVPDSS      GYHPLEDLK  KR + T+L+A+E+
Subjt:  ASSLAFEGACCSTSHAF--TSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTAAEL

Query:  ARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDYDYV
        ARTTVE NS+A+++FPG +H EPH+H SW EF+YV+DDYGD+FFE+ DD N+LED  A NPV    GMD+  YE+ R   +YN  D GN D + FD  Y 
Subjt:  ARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDYDYV

Query:  EAVETDLAEFPVDWGVPDVAS-----------------------LHPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESDGSS
        E ++++  + P+DWG+PD ++                        +PSNGVS+LGCLRP + DEESYIRRLF  E+ + Y+ + +  +     S  D + 
Subjt:  EAVETDLAEFPVDWGVPDVAS-----------------------LHPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESDGSS

Query:  RRSTLYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTFRFP
          S+LYRLEI+ IEL S+YG +S I LQDFQ AEPD+L+HST+ I+ERF+ +G   +IALKA CKK+GL  E+A L+ VDSLG+DVRV  G +V+T RFP
Subjt:  RRSTLYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTFRFP

Query:  FKTRATSEVAAEKQIQQLLFPRSRRKKLRSHE
        FKTRAT+E+AAEK+I QLLFPRSRR+KL+ H+
Subjt:  FKTRATSEVAAEKQIQQLLFPRSRRKKLRSHE

AT5G24060.1 Pentatricopeptide repeat (PPR) superfamily protein2.3e-3827.63Show/hide
Query:  SKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKV---RKRARNTELTAAELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDL
        S G+    ++    A+Y  S  D         YHP ED++     K   ++ L+  E ART +EVN    ++  G +    HE++ W +  YV D +G++
Subjt:  SKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKV---RKRARNTELTAAELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDL

Query:  FFEV-----------------------FDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGD--------IVPFDYDYVEAVETDLAEFP
        +F+V                       FD + M++D    +P  +  G++ ++ +    + D N GD   G+         V  D D  +   +D  E  
Subjt:  FFEV-----------------------FDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGD--------IVPFDYDYVEAVETDLAEFP

Query:  VDWG----------------VPDVASLHPSN-------GVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESDGSSRRSTL------
         DW                 + +VAS  P N       G+++ G L PV  ++ S I++      S G +   KE E      E  G +    L      
Subjt:  VDWG----------------VPDVASLHPSN-------GVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESDGSSRRSTL------

Query:  -----YRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFC-KKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTFRF
             Y+LEI+RI+L +  G Q+E+ ++D ++A+PDV+  ++  I+ R  E G +   AL++ C +  G++ E+  L+G+DSLG D+R+C G ++ T RF
Subjt:  -----YRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFC-KKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTFRF

Query:  PFKTRATSEVAAEKQIQQLLFPRSRRK
         F  RATSE  AE Q+++LLF  +  K
Subjt:  PFKTRATSEVAAEKQIQQLLFPRSRRK

AT5G24060.2 Pentatricopeptide repeat (PPR) superfamily protein3.0e-3827.68Show/hide
Query:  SKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKV---RKRARNTELTAAELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEV----
        ++    A+Y  S  D         YHP ED++     K   ++ L+  E ART +EVN    ++  G +    HE++ W +  YV D +G+++F+V    
Subjt:  SKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKV---RKRARNTELTAAELARTTVEVNSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEV----

Query:  -------------------FDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGD--------IVPFDYDYVEAVETDLAEFPVDWG----
                           FD + M++D    +P  +  G++ ++ +    + D N GD   G+         V  D D  +   +D  E   DW     
Subjt:  -------------------FDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGD--------IVPFDYDYVEAVETDLAEFPVDWG----

Query:  ------------VPDVASLHPSN-------GVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESDGSSRRSTL-----------YRL
                    + +VAS  P N       G+++ G L PV  ++ S I++      S G +   KE E      E  G +    L           Y+L
Subjt:  ------------VPDVASLHPSN-------GVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESDGSSRRSTL-----------YRL

Query:  EIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFC-KKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTFRFPFKTRATS
        EI+RI+L +  G Q+E+ ++D ++A+PDV+  ++  I+ R  E G +   AL++ C +  G++ E+  L+G+DSLG D+R+C G ++ T RF F  RATS
Subjt:  EIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFSEKGFRCNIALKAFC-KKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTFRFPFKTRATS

Query:  EVAAEKQIQQLLFPRSRRK
        E  AE Q+++LLF  +  K
Subjt:  EVAAEKQIQQLLFPRSRRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATCGCTGTAGCTTCTTCACTTGCCTTCGAAGGGGCTTGTTGCTCGACGTCGCATGCATTCACGAGCAGTTGGAACAGATCTTCTTTCGATGTTCGTGGCAGAAA
TCCAATATTTGGATCAACAGACTTACATTGGTTATCTAAGGGACGTGATCATTGCTTGTCAAAAGTTTCAGTTGCTGCTGATTACCCTGATTCAGTTCCAGATTCATCAA
GTTCCTTGACTAACAAAGGCTATCATCCTCTCGAAGATCTAAAAGTTCGTAAAAGAGCACGAAATACTGAACTCACTGCTGCAGAATTAGCCCGGACGACTGTGGAGGTA
AATAGCAACGCTTTGGTACTATTTCCTGGAACTGTGCACAATGAACCACATGAACACGTATCATGGGACGAGTTTCAATACGTTCTTGACGATTATGGAGATTTGTTTTT
CGAAGTTTTCGACGATGTGAACATGTTAGAAGATCGTGCTGCACCAAATCCTGTGAACGTTTTGATTGGAATGGACATGAAAATGTATGAAAGTAGGAGGGTAATTCGAG
ATTATAATGCGGGAGATAGCGGAAATGGTGATATCGTTCCTTTCGATTATGACTATGTTGAGGCTGTGGAAACTGATTTGGCTGAATTCCCTGTTGATTGGGGAGTTCCT
GATGTAGCTAGCTTGCATCCTTCGAATGGAGTTTCCATGTTGGGATGTCTCAGACCTGTCTATGCTGATGAAGAATCTTATATAAGAAGACTGTTTTACTTTGAAGAGAG
TGAAGGCTACAATACACAATGGAAAGAAGGTGAAACCTTGAGCTTCGAGTCCGAAAGCGATGGAAGCAGCCGAAGATCCACCCTCTACAGGCTGGAGATAATGAGAATTG
AGCTCTTCTCGGTGTACGGAGTTCAGTCTGAAATTGGTTTGCAAGATTTTCAACGTGCCGAACCCGATGTTCTTATGCACTCTACTGCAGAAATTGTCGAGCGTTTTAGT
GAGAAGGGTTTTAGGTGCAATATTGCCCTTAAAGCTTTTTGCAAAAAGAGAGGTCTTCGTGTTGAGGATGCTATTCTGCTCGGAGTCGATAGTCTTGGCATCGATGTGAG
GGTATGTTTCGGAACAGAAGTACGGACTTTTCGATTTCCCTTTAAAACCCGGGCAACATCTGAAGTTGCAGCAGAGAAGCAGATTCAGCAACTCTTGTTCCCACGGTCTC
GTCGTAAAAAACTACGAAGCCATGAGGGGATGGATTGA
mRNA sequenceShow/hide mRNA sequence
CACTCCAGTGTTTTATAGGTGTAAAAAAACAAAATTGATAGACCCATTTGTTATTTTCTATGCTTTTTCAGCTGAGATCTCTGAGTAAAAAAATCAGAAATTTCTGAACA
CGTTTGAAGATCTTCTGCGAATCTCGATGGCCATTTCGGACTTCGAACTATACGTAGAGACTAATACCCATCTCAGCTCGCGTCGACTTTGATCTCTGTCTCTCATGGCA
ATCGCTGTAGCTTCTTCACTTGCCTTCGAAGGGGCTTGTTGCTCGACGTCGCATGCATTCACGAGCAGTTGGAACAGATCTTCTTTCGATGTTCGTGGCAGAAATCCAAT
ATTTGGATCAACAGACTTACATTGGTTATCTAAGGGACGTGATCATTGCTTGTCAAAAGTTTCAGTTGCTGCTGATTACCCTGATTCAGTTCCAGATTCATCAAGTTCCT
TGACTAACAAAGGCTATCATCCTCTCGAAGATCTAAAAGTTCGTAAAAGAGCACGAAATACTGAACTCACTGCTGCAGAATTAGCCCGGACGACTGTGGAGGTAAATAGC
AACGCTTTGGTACTATTTCCTGGAACTGTGCACAATGAACCACATGAACACGTATCATGGGACGAGTTTCAATACGTTCTTGACGATTATGGAGATTTGTTTTTCGAAGT
TTTCGACGATGTGAACATGTTAGAAGATCGTGCTGCACCAAATCCTGTGAACGTTTTGATTGGAATGGACATGAAAATGTATGAAAGTAGGAGGGTAATTCGAGATTATA
ATGCGGGAGATAGCGGAAATGGTGATATCGTTCCTTTCGATTATGACTATGTTGAGGCTGTGGAAACTGATTTGGCTGAATTCCCTGTTGATTGGGGAGTTCCTGATGTA
GCTAGCTTGCATCCTTCGAATGGAGTTTCCATGTTGGGATGTCTCAGACCTGTCTATGCTGATGAAGAATCTTATATAAGAAGACTGTTTTACTTTGAAGAGAGTGAAGG
CTACAATACACAATGGAAAGAAGGTGAAACCTTGAGCTTCGAGTCCGAAAGCGATGGAAGCAGCCGAAGATCCACCCTCTACAGGCTGGAGATAATGAGAATTGAGCTCT
TCTCGGTGTACGGAGTTCAGTCTGAAATTGGTTTGCAAGATTTTCAACGTGCCGAACCCGATGTTCTTATGCACTCTACTGCAGAAATTGTCGAGCGTTTTAGTGAGAAG
GGTTTTAGGTGCAATATTGCCCTTAAAGCTTTTTGCAAAAAGAGAGGTCTTCGTGTTGAGGATGCTATTCTGCTCGGAGTCGATAGTCTTGGCATCGATGTGAGGGTATG
TTTCGGAACAGAAGTACGGACTTTTCGATTTCCCTTTAAAACCCGGGCAACATCTGAAGTTGCAGCAGAGAAGCAGATTCAGCAACTCTTGTTCCCACGGTCTCGTCGTA
AAAAACTACGAAGCCATGAGGGGATGGATTGAGAGATGCCATGAGTTCTTAGAACACCGTGTGCGTTATTTTGAGATTTGGAGGATCTTTGAAATAAACCTCACAGGTTA
AGACGACTTCTTTGCTCAAATCTGTCATAACAGTAGTCTTAGTTCCTCGATTTAGTGAGCATGTCTATATCGTCAACTACAACGAATGGAAAACTCTCTAGATTTTTACT
ATGATCATCAACCCAAAACTTCGTATGACTCGGCGAGGGGTAGTTCATCTATAGGGAAGGCATTGGAAGGGGCAAGTCGAAACTCGAGATAGG
Protein sequenceShow/hide protein sequence
MAIAVASSLAFEGACCSTSHAFTSSWNRSSFDVRGRNPIFGSTDLHWLSKGRDHCLSKVSVAADYPDSVPDSSSSLTNKGYHPLEDLKVRKRARNTELTAAELARTTVEV
NSNALVLFPGTVHNEPHEHVSWDEFQYVLDDYGDLFFEVFDDVNMLEDRAAPNPVNVLIGMDMKMYESRRVIRDYNAGDSGNGDIVPFDYDYVEAVETDLAEFPVDWGVP
DVASLHPSNGVSMLGCLRPVYADEESYIRRLFYFEESEGYNTQWKEGETLSFESESDGSSRRSTLYRLEIMRIELFSVYGVQSEIGLQDFQRAEPDVLMHSTAEIVERFS
EKGFRCNIALKAFCKKRGLRVEDAILLGVDSLGIDVRVCFGTEVRTFRFPFKTRATSEVAAEKQIQQLLFPRSRRKKLRSHEGMD