; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0037830 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0037830
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUnknown protein
Genome locationchr2:9595371..9597173
RNA-Seq ExpressionLag0037830
SyntenyLag0037830
Gene Ontology termsGO:0009793 - embryo development ending in seed dormancy (biological process)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591919.1 hypothetical protein SDJN03_14265, partial [Cucurbita argyrosperma subsp. sororia]8.6e-29185.93Show/hide
Query:  MATPKFPPSKALNPSSPFIHSTSLTPFSNPLLQTLTLKSHQTHKPLSIRSARPNPSFLPISRRISQFPCAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE
        MAT +FP  K LNPSSPF+ STSLTPFSNPLLQTLTLKSHQT KPLSI S  PN S LPI R+ISQFP A  R DIRTFAGRSKKKGGGPSPGRIEGNAE
Subjt:  MATPKFPPSKALNPSSPFIHSTSLTPFSNPLLQTLTLKSHQTHKPLSIRSARPNPSFLPISRRISQFPCAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE

Query:  FRRKLRQNARRKSQKLAESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFSDERVTIIS
        FRRKLR N RRKSQK AESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYSWRGVV+GEPIRGRF+DERVT+I 
Subjt:  FRRKLRQNARRKSQKLAESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFSDERVTIIS

Query:  EVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRPDII
        EVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFR+FWVFVRHPRWRISELPWQQWTLIAEVVLEAGK ERLDKWSLMGRLGNKSRKNITQCAAWMRPDII
Subjt:  EVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRPDII

Query:  YVKKPVYQCRFEPQDEFFQAIMPFLDPKTEEDFLFELENDEGEVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT
        YVKKPVYQCRFEPQ EFFQA+MPFLDPKTE+D LFEL++DEG VEWVTYFGGLCKI+RVNPKAFVDDV NAYEKLS+EKKSKCLEFLLTNHPVPLLHPYT
Subjt:  YVKKPVYQCRFEPQDEFFQAIMPFLDPKTEEDFLFELENDEGEVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT

Query:  KEWKAKLEEEELGCDAP-DDIENRCGNENVITDWIETDDDNEEDVDVEDDNDDEDQPEDVVMETEEEVEDEADKGEGRSGEEDEEYWDERFRKAISSPEE
        KEWKAKLEEEELGCDAP DD ENR  +ENV+ +WIETDD         +D+D ED+ EDVVMET EE EDE D GE ++ EEDE+YWDERFRKAISSPEE
Subjt:  KEWKAKLEEEELGCDAP-DDIENRCGNENVITDWIETDDDNEEDVDVEDDNDDEDQPEDVVMETEEEVEDEADKGEGRSGEEDEEYWDERFRKAISSPEE

Query:  LEKLFKRSAEVSDELYEKQK-ENEGSREGME-DGDETELRGKRAKVRPEEWEQIGYGPWRKRIKRSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDG
        LEKL KRS E SDE YEKQK  N GSR+ ME DGDETELRGKRAKV+PEEWE+IGYGPWRK+IK+SQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL+G
Subjt:  LEKLFKRSAEVSDELYEKQK-ENEGSREGME-DGDETELRGKRAKVRPEEWEQIGYGPWRKRIKRSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDG

Query:  EIGV
        EIGV
Subjt:  EIGV

KAG7024792.1 hypothetical protein SDJN02_13611, partial [Cucurbita argyrosperma subsp. argyrosperma]6.6e-29185.93Show/hide
Query:  MATPKFPPSKALNPSSPFIHSTSLTPFSNPLLQTLTLKSHQTHKPLSIRSARPNPSFLPISRRISQFPCAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE
        MAT +FP  K LNPSSPF+ STSLTPFSNPLLQTLTLKSHQT KPLSI S  PN S LPI R+ISQFP A  R DIRTFAGRSKKKGGGPSPGRIEGNAE
Subjt:  MATPKFPPSKALNPSSPFIHSTSLTPFSNPLLQTLTLKSHQTHKPLSIRSARPNPSFLPISRRISQFPCAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE

Query:  FRRKLRQNARRKSQKLAESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFSDERVTIIS
        FRRKLR N RRKSQK AESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYSWRGVV+GEPIRGRF+DERVT+I 
Subjt:  FRRKLRQNARRKSQKLAESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFSDERVTIIS

Query:  EVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRPDII
        EVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFR+FWVFVRHPRWRISELPWQQWTLIAEVVLEAGK ERLDKWSLMGRLGNKSRKNITQCAAWMRPDI+
Subjt:  EVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRPDII

Query:  YVKKPVYQCRFEPQDEFFQAIMPFLDPKTEEDFLFELENDEGEVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT
        YVKKPVYQCRFEPQ EFFQA+MPFLDPKTE+D LFEL++DEG VEWVTYFGGLCKI+RVNPKAFVDDV NAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT
Subjt:  YVKKPVYQCRFEPQDEFFQAIMPFLDPKTEEDFLFELENDEGEVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT

Query:  KEWKAKLEEEELGCDAP-DDIENRCGNENVITDWIETDDDNEEDVDVEDDNDDEDQPEDVVMETEEEVEDEADKGEGRSGEEDEEYWDERFRKAISSPEE
        KEWKAKLEEEELGCDAP DD ENR  +ENV+ +WIETDD         +D+D ED  EDVVMET EE EDE D GE ++ EEDE+YWDERFRKAISSPEE
Subjt:  KEWKAKLEEEELGCDAP-DDIENRCGNENVITDWIETDDDNEEDVDVEDDNDDEDQPEDVVMETEEEVEDEADKGEGRSGEEDEEYWDERFRKAISSPEE

Query:  LEKLFKRSAEVSDELYEKQK-ENEGSREGME-DGDETELRGKRAKVRPEEWEQIGYGPWRKRIKRSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDG
        LEKL KRS E SDE YEKQK  N GSR+ ME DGDETELRGKRAKV+PEEWE+IGYGPWRK+IK+SQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL+G
Subjt:  LEKLFKRSAEVSDELYEKQK-ENEGSREGME-DGDETELRGKRAKVRPEEWEQIGYGPWRKRIKRSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDG

Query:  EIGV
        EIGV
Subjt:  EIGV

XP_022142781.1 uncharacterized protein LOC111012814 [Momordica charantia]8.3e-29487.4Show/hide
Query:  MATPKFPPSKALNPSSPFIHSTSLTPFSNPLLQTLTLKSHQTHKPLSIRSARPNPSFLPISRRISQFPCAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE
        MAT  F   K LNPSSP      LTPFSNPLLQTLTLK H++HKPLSI SA PNP FLPISR+ISQFP A I RDIRTFAGRSKKKGGG SPGRIEGNAE
Subjt:  MATPKFPPSKALNPSSPFIHSTSLTPFSNPLLQTLTLKSHQTHKPLSIRSARPNPSFLPISRRISQFPCAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE

Query:  FRRKLRQNARRKSQKLAESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFSDERVTIIS
        FRR+LRQNARRKSQK AESHFYRRKNSNSNYADNF+EDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGE+GPYSWRGVVVGEPIRGRF+DERVTIIS
Subjt:  FRRKLRQNARRKSQKLAESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFSDERVTIIS

Query:  EVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIY
        EVKDHEEWEKIEQSEMASDFSEGLQRMD+SKGFRYFWVFVRHPRWRIS+LPWQQWTLIAEVVLEAGKERLDKW+LMGRLGNKSRKNITQCAAWMRPDIIY
Subjt:  EVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIY

Query:  VKKPVYQCRFEPQDEFFQAIMPFLDPKTEEDFLFELENDEGEVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTK
        VKKPVYQCRFEPQDEFFQAIMPFLDPKTE+DFLFEL+NDEG+VEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTK
Subjt:  VKKPVYQCRFEPQDEFFQAIMPFLDPKTEEDFLFELENDEGEVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTK

Query:  EWKAKLEEEELGCDAPDDIENRCG--NENVITDWIETDDDNEEDVDVEDDNDDEDQPEDVVMETEEEVEDEAD-KGEGRSGEEDEEYWDERFRKAISSPE
        EWKAKLEEEELGCDAPDD E R G   ENVI +WIETDDDN       D+ DDEDQ +D+VME E   ED AD K + RS EEDE+YWDERFRKAISSPE
Subjt:  EWKAKLEEEELGCDAPDDIENRCG--NENVITDWIETDDDNEEDVDVEDDNDDEDQPEDVVMETEEEVEDEAD-KGEGRSGEEDEEYWDERFRKAISSPE

Query:  ELEKLFKRSAEVSDELYEKQKENEGSREGMEDGDETELRGKRAKVRPEEWEQIGYGPWRKRIKRSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGE
        E+EKLFKRSAEVSDELYEKQ E    ++GMEDGDETE+RGKRAKVR EEWEQIGYGPWRKRIK+SQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGE
Subjt:  ELEKLFKRSAEVSDELYEKQKENEGSREGMEDGDETELRGKRAKVRPEEWEQIGYGPWRKRIKRSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGE

Query:  IGV
        IGV
Subjt:  IGV

XP_022937202.1 uncharacterized protein LOC111443567 [Cucurbita moschata]1.1e-29085.93Show/hide
Query:  MATPKFPPSKALNPSSPFIHSTSLTPFSNPLLQTLTLKSHQTHKPLSIRSARPNPSFLPISRRISQFPCAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE
        MA  +FP  K LNPSSPF+ STSLTPFSNPLLQTLTLKSHQT KPLSI S  PN S LPI R+ISQFP A  R DIRTFAGRSKKKGGGPSPGRIEGNAE
Subjt:  MATPKFPPSKALNPSSPFIHSTSLTPFSNPLLQTLTLKSHQTHKPLSIRSARPNPSFLPISRRISQFPCAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE

Query:  FRRKLRQNARRKSQKLAESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFSDERVTIIS
        FRRKLR N RRKSQK AESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYSWRGVV+GEPIRGRF+DERVT+I 
Subjt:  FRRKLRQNARRKSQKLAESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFSDERVTIIS

Query:  EVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRPDII
        EVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFR+FWVFVRHPRWRISELPWQQWTLIAEVVLEAGK ERLDKWSLMGRLGNKSRKNITQCAAWMRPDII
Subjt:  EVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRPDII

Query:  YVKKPVYQCRFEPQDEFFQAIMPFLDPKTEEDFLFELENDEGEVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT
        YVKKPVYQCRFEPQ EFFQA+MPFLDPKTE+D LFEL++DEG VEWVTYFGGLCKI+RVNPKAFVDDV NAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT
Subjt:  YVKKPVYQCRFEPQDEFFQAIMPFLDPKTEEDFLFELENDEGEVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT

Query:  KEWKAKLEEEELGCDAP-DDIENRCGNENVITDWIETDDDNEEDVDVEDDNDDEDQPEDVVMETEEEVEDEADKGEGRSGEEDEEYWDERFRKAISSPEE
        KEWKAKLEEEELGCDAP DD ENR  +ENV+ +WIETDD         +D+D ED+ EDVVMET EE EDE D GE ++ EEDE+YWDERFRKAISSPEE
Subjt:  KEWKAKLEEEELGCDAP-DDIENRCGNENVITDWIETDDDNEEDVDVEDDNDDEDQPEDVVMETEEEVEDEADKGEGRSGEEDEEYWDERFRKAISSPEE

Query:  LEKLFKRSAEVSDELYEKQK-ENEGSREGME-DGDETELRGKRAKVRPEEWEQIGYGPWRKRIKRSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDG
        LEKL KRS E SDE YEKQK  N GSR+ ME DGDETELRGKRAKV+PEEWE+IGYGPWRK+IK+SQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL+G
Subjt:  LEKLFKRSAEVSDELYEKQK-ENEGSREGME-DGDETELRGKRAKVRPEEWEQIGYGPWRKRIKRSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDG

Query:  EIGV
        EIGV
Subjt:  EIGV

XP_038898752.1 uncharacterized protein LOC120086270 [Benincasa hispida]4.4e-29586.66Show/hide
Query:  MATPKFPPSKALNPSSPFIHSTSLTPFSNPLLQTLTLKSHQTHKPLSIRSARPNPSFLPISRRISQFPCAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE
        MAT +FP SK LN SS F+HSTSL+PF +PLLQTLTLKSHQTHKPLSIRS  PNPSFLPISR+IS    A   R+IRT AGRSKKKGGGPSPGRIEGNAE
Subjt:  MATPKFPPSKALNPSSPFIHSTSLTPFSNPLLQTLTLKSHQTHKPLSIRSARPNPSFLPISRRISQFPCAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE

Query:  FRRKLRQNARRKSQKLAESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFSDERVTIIS
        FRRKLR NARRKSQKLAESHFYRRK  NSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRF+DERVTIIS
Subjt:  FRRKLRQNARRKSQKLAESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFSDERVTIIS

Query:  EVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIY
        EVKDHEEWEKIEQSEMASDFS+GL RMD+SKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIY
Subjt:  EVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIY

Query:  VKKPVYQCRFEPQDEFFQAIMPFLDPKTEEDFLFELENDE-GEVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT
        VKKPVYQCRFEPQDEFFQAIMPFLDPKTE+DFLFEL++DE G+VEWVTYF GLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT
Subjt:  VKKPVYQCRFEPQDEFFQAIMPFLDPKTEEDFLFELENDE-GEVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT

Query:  KEWKAKLEEEELGCDAPDDIENRCGNENVITDWIETDDDNEEDVDVEDDNDDEDQPED-VVMETEEEVEDEADKGEGRSG----EEDEEYWDERFRKAIS
        KEWKAKLEEEELGCDAPDDIE RCG+ENVIT+WIETDDDN ED        +EDQPE+ VVMETE+E EDE +  +   G    EEDE YWDERFRKAIS
Subjt:  KEWKAKLEEEELGCDAPDDIENRCGNENVITDWIETDDDNEEDVDVEDDNDDEDQPED-VVMETEEEVEDEADKGEGRSG----EEDEEYWDERFRKAIS

Query:  SPEELEKLFKRSAEVSDELYEKQKENEGSRE--GMEDGDETELRGKRAKVRPEEWEQIGYGPWRKRIKRSQIPPELFLRSTVRPFTYRNLVKEIVLTRHA
        SPEELEKLFK SAEV+DE YEK+KE+ GSR    MEDGDETELRGKRAKV+ EEWE IGYGPWRK+IK+S+IPPELFLRSTVRPFTYRNLVKEIVLTRHA
Subjt:  SPEELEKLFKRSAEVSDELYEKQKENEGSRE--GMEDGDETELRGKRAKVRPEEWEQIGYGPWRKRIKRSQIPPELFLRSTVRPFTYRNLVKEIVLTRHA

Query:  ILDGEIG
        ILDGEIG
Subjt:  ILDGEIG

TrEMBL top hitse value%identityAlignment
A0A1S3CKF2 uncharacterized protein LOC1035018142.9e-27681.89Show/hide
Query:  MATPKFPPSKALNPSSPFIHSTSLTPFSNPLLQTLTLKSHQTH--KPLSIRSARPNPSFLPISRRISQFPCAKIRRDIRTFAGRSKKKGGGPSPGRIEGN
        MAT +FP  K LNPSSPF++STSLTPFSNPLLQTLTLKSHQTH  KPLSI S   NP       +IS  P    R DIRT AGRSKK  GGPSPGRIEGN
Subjt:  MATPKFPPSKALNPSSPFIHSTSLTPFSNPLLQTLTLKSHQTH--KPLSIRSARPNPSFLPISRRISQFPCAKIRRDIRTFAGRSKKKGGGPSPGRIEGN

Query:  AEFRRKLRQNARRKSQKLAESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFSDERVTI
        AEFRRKLR NARRKSQKLAESHFYRRK  NSNYADNFSEDELQQIGLGYDRMVRF+EKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRF+DERVTI
Subjt:  AEFRRKLRQNARRKSQKLAESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFSDERVTI

Query:  ISEVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDI
        ISEVKDHEEWEKIEQSEMA+DFS GLQRMD+SKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDI
Subjt:  ISEVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDI

Query:  IYVKKPVYQCRFEPQDEFFQAIMPFLDPKTEEDFLFELENDEGEVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPY
        IYVKKPVYQCRFEPQDEFFQA+MPFLDPKTE+DFLFEL++DEG VEWVTYFGGLCKIVR++PKAFVDDVVNAYEKLSDEKKS CLEFLL+NHPVPLLHPY
Subjt:  IYVKKPVYQCRFEPQDEFFQAIMPFLDPKTEEDFLFELENDEGEVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPY

Query:  TKEWKAKLEEEELGCDAPDDIENRCGNENVITDWIETDDDNEEDVDVEDDNDDEDQPEDVVMETEEEVEDEADKGEGRSGEEDEEYWDERFRKAISSPEE
        TKEWKAKLEEEELGCDAPD++ENR  ++NVIT+WIETD++ E +   E+D   ED  ED     ++E +DE ++G     EEDE YWDERFRKAISSPEE
Subjt:  TKEWKAKLEEEELGCDAPDDIENRCGNENVITDWIETDDDNEEDVDVEDDNDDEDQPEDVVMETEEEVEDEADKGEGRSGEEDEEYWDERFRKAISSPEE

Query:  LEKLFKRSAEVSDELYEKQKENEGSREGMEDGDETELRGKRAKVRPEEWEQIGYGPWRKRIKRSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEI
        LEKLFKRS E++DELYEK+         M+DGDE E+RGKR KV+ EEWE IGYGPWRK+IK+SQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEI
Subjt:  LEKLFKRSAEVSDELYEKQKENEGSREGMEDGDETELRGKRAKVRPEEWEQIGYGPWRKRIKRSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEI

Query:  GV
        GV
Subjt:  GV

A0A5A7VK56 Uncharacterized protein2.9e-27681.89Show/hide
Query:  MATPKFPPSKALNPSSPFIHSTSLTPFSNPLLQTLTLKSHQTH--KPLSIRSARPNPSFLPISRRISQFPCAKIRRDIRTFAGRSKKKGGGPSPGRIEGN
        MAT +FP  K LNPSSPF++STSLTPFSNPLLQTLTLKSHQTH  KPLSI S   NP       +IS  P    R DIRT AGRSKK  GGPSPGRIEGN
Subjt:  MATPKFPPSKALNPSSPFIHSTSLTPFSNPLLQTLTLKSHQTH--KPLSIRSARPNPSFLPISRRISQFPCAKIRRDIRTFAGRSKKKGGGPSPGRIEGN

Query:  AEFRRKLRQNARRKSQKLAESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFSDERVTI
        AEFRRKLR NARRKSQKLAESHFYRRK  NSNYADNFSEDELQQIGLGYDRMVRF+EKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRF+DERVTI
Subjt:  AEFRRKLRQNARRKSQKLAESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFSDERVTI

Query:  ISEVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDI
        ISEVKDHEEWEKIEQSEMA+DFS GLQRMD+SKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDI
Subjt:  ISEVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDI

Query:  IYVKKPVYQCRFEPQDEFFQAIMPFLDPKTEEDFLFELENDEGEVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPY
        IYVKKPVYQCRFEPQDEFFQA+MPFLDPKTE+DFLFEL++DEG VEWVTYFGGLCKIVR++PKAFVDDVVNAYEKLSDEKKS CLEFLL+NHPVPLLHPY
Subjt:  IYVKKPVYQCRFEPQDEFFQAIMPFLDPKTEEDFLFELENDEGEVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPY

Query:  TKEWKAKLEEEELGCDAPDDIENRCGNENVITDWIETDDDNEEDVDVEDDNDDEDQPEDVVMETEEEVEDEADKGEGRSGEEDEEYWDERFRKAISSPEE
        TKEWKAKLEEEELGCDAPD++ENR  ++NVIT+WIETD++ E +   E+D   ED  ED     ++E +DE ++G     EEDE YWDERFRKAISSPEE
Subjt:  TKEWKAKLEEEELGCDAPDDIENRCGNENVITDWIETDDDNEEDVDVEDDNDDEDQPEDVVMETEEEVEDEADKGEGRSGEEDEEYWDERFRKAISSPEE

Query:  LEKLFKRSAEVSDELYEKQKENEGSREGMEDGDETELRGKRAKVRPEEWEQIGYGPWRKRIKRSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEI
        LEKLFKRS E++DELYEK+         M+DGDE E+RGKR KV+ EEWE IGYGPWRK+IK+SQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEI
Subjt:  LEKLFKRSAEVSDELYEKQKENEGSREGMEDGDETELRGKRAKVRPEEWEQIGYGPWRKRIKRSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEI

Query:  GV
        GV
Subjt:  GV

A0A6J1CN80 uncharacterized protein LOC1110128144.0e-29487.4Show/hide
Query:  MATPKFPPSKALNPSSPFIHSTSLTPFSNPLLQTLTLKSHQTHKPLSIRSARPNPSFLPISRRISQFPCAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE
        MAT  F   K LNPSSP      LTPFSNPLLQTLTLK H++HKPLSI SA PNP FLPISR+ISQFP A I RDIRTFAGRSKKKGGG SPGRIEGNAE
Subjt:  MATPKFPPSKALNPSSPFIHSTSLTPFSNPLLQTLTLKSHQTHKPLSIRSARPNPSFLPISRRISQFPCAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE

Query:  FRRKLRQNARRKSQKLAESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFSDERVTIIS
        FRR+LRQNARRKSQK AESHFYRRKNSNSNYADNF+EDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGE+GPYSWRGVVVGEPIRGRF+DERVTIIS
Subjt:  FRRKLRQNARRKSQKLAESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFSDERVTIIS

Query:  EVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIY
        EVKDHEEWEKIEQSEMASDFSEGLQRMD+SKGFRYFWVFVRHPRWRIS+LPWQQWTLIAEVVLEAGKERLDKW+LMGRLGNKSRKNITQCAAWMRPDIIY
Subjt:  EVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIY

Query:  VKKPVYQCRFEPQDEFFQAIMPFLDPKTEEDFLFELENDEGEVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTK
        VKKPVYQCRFEPQDEFFQAIMPFLDPKTE+DFLFEL+NDEG+VEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTK
Subjt:  VKKPVYQCRFEPQDEFFQAIMPFLDPKTEEDFLFELENDEGEVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTK

Query:  EWKAKLEEEELGCDAPDDIENRCG--NENVITDWIETDDDNEEDVDVEDDNDDEDQPEDVVMETEEEVEDEAD-KGEGRSGEEDEEYWDERFRKAISSPE
        EWKAKLEEEELGCDAPDD E R G   ENVI +WIETDDDN       D+ DDEDQ +D+VME E   ED AD K + RS EEDE+YWDERFRKAISSPE
Subjt:  EWKAKLEEEELGCDAPDDIENRCG--NENVITDWIETDDDNEEDVDVEDDNDDEDQPEDVVMETEEEVEDEAD-KGEGRSGEEDEEYWDERFRKAISSPE

Query:  ELEKLFKRSAEVSDELYEKQKENEGSREGMEDGDETELRGKRAKVRPEEWEQIGYGPWRKRIKRSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGE
        E+EKLFKRSAEVSDELYEKQ E    ++GMEDGDETE+RGKRAKVR EEWEQIGYGPWRKRIK+SQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGE
Subjt:  ELEKLFKRSAEVSDELYEKQKENEGSREGMEDGDETELRGKRAKVRPEEWEQIGYGPWRKRIKRSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGE

Query:  IGV
        IGV
Subjt:  IGV

A0A6J1FAH0 uncharacterized protein LOC1114435675.5e-29185.93Show/hide
Query:  MATPKFPPSKALNPSSPFIHSTSLTPFSNPLLQTLTLKSHQTHKPLSIRSARPNPSFLPISRRISQFPCAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE
        MA  +FP  K LNPSSPF+ STSLTPFSNPLLQTLTLKSHQT KPLSI S  PN S LPI R+ISQFP A  R DIRTFAGRSKKKGGGPSPGRIEGNAE
Subjt:  MATPKFPPSKALNPSSPFIHSTSLTPFSNPLLQTLTLKSHQTHKPLSIRSARPNPSFLPISRRISQFPCAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE

Query:  FRRKLRQNARRKSQKLAESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFSDERVTIIS
        FRRKLR N RRKSQK AESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYSWRGVV+GEPIRGRF+DERVT+I 
Subjt:  FRRKLRQNARRKSQKLAESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFSDERVTIIS

Query:  EVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRPDII
        EVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFR+FWVFVRHPRWRISELPWQQWTLIAEVVLEAGK ERLDKWSLMGRLGNKSRKNITQCAAWMRPDII
Subjt:  EVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRPDII

Query:  YVKKPVYQCRFEPQDEFFQAIMPFLDPKTEEDFLFELENDEGEVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT
        YVKKPVYQCRFEPQ EFFQA+MPFLDPKTE+D LFEL++DEG VEWVTYFGGLCKI+RVNPKAFVDDV NAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT
Subjt:  YVKKPVYQCRFEPQDEFFQAIMPFLDPKTEEDFLFELENDEGEVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT

Query:  KEWKAKLEEEELGCDAP-DDIENRCGNENVITDWIETDDDNEEDVDVEDDNDDEDQPEDVVMETEEEVEDEADKGEGRSGEEDEEYWDERFRKAISSPEE
        KEWKAKLEEEELGCDAP DD ENR  +ENV+ +WIETDD         +D+D ED+ EDVVMET EE EDE D GE ++ EEDE+YWDERFRKAISSPEE
Subjt:  KEWKAKLEEEELGCDAP-DDIENRCGNENVITDWIETDDDNEEDVDVEDDNDDEDQPEDVVMETEEEVEDEADKGEGRSGEEDEEYWDERFRKAISSPEE

Query:  LEKLFKRSAEVSDELYEKQK-ENEGSREGME-DGDETELRGKRAKVRPEEWEQIGYGPWRKRIKRSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDG
        LEKL KRS E SDE YEKQK  N GSR+ ME DGDETELRGKRAKV+PEEWE+IGYGPWRK+IK+SQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL+G
Subjt:  LEKLFKRSAEVSDELYEKQK-ENEGSREGME-DGDETELRGKRAKVRPEEWEQIGYGPWRKRIKRSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDG

Query:  EIGV
        EIGV
Subjt:  EIGV

A0A6J1INI9 uncharacterized protein LOC1114768533.5e-29085.53Show/hide
Query:  MATPKFPPSKALNPSSPFIHSTSLTPFSNPLLQ--TLTLKSHQTHKPLSIRSARPNPSFLPISRRISQFPCAKIRRDIRTFAGRSKKKGGGPSPGRIEGN
        MAT +FP  K LNPSSPF+HSTSLTPFSNPLLQ  TLTLKSH+T KPLSI S  PN S LPI R+ISQFP A  R DIRTFAGRSKKKGGG SPGRIEGN
Subjt:  MATPKFPPSKALNPSSPFIHSTSLTPFSNPLLQ--TLTLKSHQTHKPLSIRSARPNPSFLPISRRISQFPCAKIRRDIRTFAGRSKKKGGGPSPGRIEGN

Query:  AEFRRKLRQNARRKSQKLAESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFSDERVTI
        AEFRRKLR N RRKSQK AESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYSWRGVV+GEPIRGRF+DERVT+
Subjt:  AEFRRKLRQNARRKSQKLAESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFSDERVTI

Query:  ISEVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRPD
        I EVKDHEEWEKIEQSEMASDFSEGLQRMDR+KGFR+FWVFVRHPRWRISELPWQQWTLIAEVVLEAGK ERLDKWSLMGRLGNKSRKNITQCAAWMRPD
Subjt:  ISEVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRPD

Query:  IIYVKKPVYQCRFEPQDEFFQAIMPFLDPKTEEDFLFELENDEGEVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHP
        IIYVKKPVYQCRFEPQ EFFQA+MPFLDPKTE+D LFEL++DEG VEWVTYFGGLCKI+RVNPKAFVDDV NAYEKLSDEKKSKCLEFLLTNHPVPLLHP
Subjt:  IIYVKKPVYQCRFEPQDEFFQAIMPFLDPKTEEDFLFELENDEGEVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHP

Query:  YTKEWKAKLEEEELGCDAPDDIE---NRCGNENVITDWIETDDDNEEDVDVEDDNDDEDQPEDVVMETEEEVEDEADKGEGRSGEEDEEYWDERFRKAIS
        YTKEWKAKLEEEELGCDAPDD +   NR  +ENVI +WIETDDDN        D+D ED+ EDVVMET EE EDE D GE ++ EEDE+YWDERFRKAIS
Subjt:  YTKEWKAKLEEEELGCDAPDDIE---NRCGNENVITDWIETDDDNEEDVDVEDDNDDEDQPEDVVMETEEEVEDEADKGEGRSGEEDEEYWDERFRKAIS

Query:  SPEELEKLFKRSAEVSDELYEKQK-ENEGSREGME-DGDETELRGKRAKVRPEEWEQIGYGPWRKRIKRSQIPPELFLRSTVRPFTYRNLVKEIVLTRHA
        SPEELEKL KRS E SDE YEKQK  N GSR+ ME DGDETELRGKRAKV+PEEWE+IGYGPWRK+IK+SQIPPELFLRSTVRPFTYRNLVKEIVLTRHA
Subjt:  SPEELEKLFKRSAEVSDELYEKQK-ENEGSREGME-DGDETELRGKRAKVRPEEWEQIGYGPWRKRIKRSQIPPELFLRSTVRPFTYRNLVKEIVLTRHA

Query:  ILDGEIGV
        IL+GEIGV
Subjt:  ILDGEIGV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G14900.1 unknown protein1.5e-18761.75Show/hide
Query:  RRDIRTFAGRSKKK-GGGPSPGRIEGNAEFRRKLRQNARRKSQKLAESHFYRRKNSN--------SNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRH
        RRD+R  AGRSKKK GGG S GRIEG+++ R+++++NAR KS+KLAES FYR  N+         S++ D F+E+EL+ IGLGYDRMVRFM+KDDP LRH
Subjt:  RRDIRTFAGRSKKK-GGGPSPGRIEGNAEFRRKLRQNARRKSQKLAESHFYRRKNSN--------SNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRH

Query:  PYDWYKYGEFGPYSWRGVVVGEPIRGRFSDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVL
        PYDW+KYGEFGPYSWRGVVVG+P+RG  SDE VT+I EV++HEE+EKIEQ EM   F + ++ +D + G RYFWVFVRHP+WR+SELPW+QWTL++EVV+
Subjt:  PYDWYKYGEFGPYSWRGVVVGEPIRGRFSDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVL

Query:  EAG-KERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKPVYQCRFEPQDEFFQAIMPFLDPKTEEDFLFELENDEGEVEWVTYFGGLCKIVRVNPKA
        EA  K+RLDKW+LMGRLGNKSR  I QCAAW RPDI+YVKKPV+QCRFEPQ++FF +++P+L+P TE  F+ E+E+DEG VE  TY+GGLCK+++V   A
Subjt:  EAG-KERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKPVYQCRFEPQDEFFQAIMPFLDPKTEEDFLFELENDEGEVEWVTYFGGLCKIVRVNPKA

Query:  FVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAPDDIENR-----CGNENVITDWIETDDDNEEDVDVEDDNDDE----D
        FVDDVVNAYEKLSDEKKS+ L+FLL NHP  LLHPYTKEWKAKLEE ELGCDAPD+ E+         +   ++WIE + DN++D D +DD+D E    D
Subjt:  FVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAPDDIENR-----CGNENVITDWIETDDDNEEDVDVEDDNDDE----D

Query:  QPEDVVMETEEEVEDEA--DKGEGRSGEEDEEYWDERFRKAISSPEELEKLFKRSAEVSDELYEKQ-KENEGSREGMEDGDETELRGKRAKVRPEEWEQI
          +++V++ E  VE+++  D+ E    EEDE YW+E+F KA ++ E +EKL + S  VSD+ YEKQ K  E   +G  +GDE E+RGK+AKV+PEEW+ +
Subjt:  QPEDVVMETEEEVEDEA--DKGEGRSGEEDEEYWDERFRKAISSPEELEKLFKRSAEVSDELYEKQ-KENEGSREGMEDGDETELRGKRAKVRPEEWEQI

Query:  GYGPWRKRIKRSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIG
        GYG W K+IK+S+IPPELFLR+ VRPF YRNLVKEIVLTRHAIL+GEIG
Subjt:  GYGPWRKRIKRSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCACTCCAAAATTCCCTCCCTCTAAAGCCCTAAACCCTTCATCTCCATTCATCCACTCCACCTCCCTCACACCATTCTCCAATCCTCTTCTTCAAACCCTAACCCT
AAAATCCCATCAAACCCACAAACCACTCTCTATTCGTTCCGCTCGCCCAAATCCTTCGTTTCTTCCGATATCCCGCCGAATTTCGCAATTCCCATGCGCAAAAATCCGCC
GGGATATCCGGACGTTCGCTGGCCGGAGCAAGAAGAAGGGCGGCGGGCCCTCTCCCGGCCGGATAGAAGGCAACGCCGAGTTCCGGCGGAAGTTGAGGCAAAATGCCCGC
CGGAAGAGCCAGAAGCTCGCCGAGTCCCATTTCTACCGCCGCAAGAACTCGAACAGCAATTACGCCGATAACTTCAGCGAGGATGAGCTTCAGCAGATCGGCCTCGGTTA
CGATAGGATGGTCCGATTCATGGAGAAAGACGACCCGAATCTGCGTCATCCCTACGACTGGTACAAGTACGGCGAGTTCGGCCCGTACTCGTGGCGTGGAGTCGTTGTCG
GCGAGCCGATTCGCGGTCGGTTTTCCGACGAGCGAGTTACGATCATCAGCGAGGTTAAGGATCACGAGGAGTGGGAGAAGATCGAGCAATCAGAAATGGCGTCTGATTTC
AGCGAGGGCTTGCAGCGGATGGACAGGAGCAAAGGGTTTCGGTACTTTTGGGTGTTCGTGAGGCACCCGCGGTGGAGGATTTCGGAGCTGCCATGGCAGCAGTGGACTTT
GATTGCAGAGGTTGTGCTTGAAGCAGGTAAAGAAAGGTTAGATAAATGGAGTTTGATGGGTCGGCTTGGAAACAAGTCAAGAAAGAACATAACTCAATGTGCAGCGTGGA
TGAGACCCGATATCATATATGTGAAAAAACCAGTTTACCAATGCAGATTCGAGCCTCAGGATGAGTTCTTCCAGGCGATAATGCCGTTTCTCGATCCGAAAACAGAGGAA
GATTTTCTGTTTGAGTTGGAGAATGATGAAGGAGAAGTTGAATGGGTGACTTATTTTGGTGGGCTGTGTAAGATTGTGAGGGTGAACCCAAAGGCATTTGTGGATGATGT
GGTGAATGCTTATGAGAAGCTGAGTGACGAGAAGAAATCCAAGTGTTTGGAGTTTCTTTTGACCAACCACCCTGTTCCATTGCTGCATCCATACACAAAAGAGTGGAAGG
CTAAGTTGGAGGAAGAGGAGTTAGGTTGTGATGCTCCGGACGACATCGAAAATCGATGTGGCAACGAAAATGTGATCACGGACTGGATTGAGACTGATGATGACAATGAA
GAGGATGTAGATGTTGAAGACGACAACGATGACGAGGATCAGCCTGAGGATGTGGTGATGGAGACAGAGGAAGAAGTTGAGGACGAGGCGGATAAAGGAGAGGGTCGGAG
TGGAGAAGAAGATGAGGAATATTGGGATGAGAGGTTCAGGAAGGCAATAAGTAGTCCAGAAGAACTGGAGAAGCTGTTCAAACGCAGTGCAGAAGTGAGTGATGAGTTGT
ATGAGAAACAGAAGGAGAATGAGGGAAGCAGAGAGGGCATGGAAGATGGGGATGAGACAGAATTGAGAGGGAAGAGAGCAAAAGTGAGACCAGAAGAATGGGAGCAAATT
GGATATGGGCCATGGAGGAAGAGGATAAAGAGAAGTCAAATTCCTCCAGAGCTGTTTTTGAGATCTACAGTGAGGCCTTTCACTTATAGGAACCTTGTGAAGGAGATTGT
ATTGACCAGACATGCTATTTTGGATGGTGAAATTGGGGTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCACTCCAAAATTCCCTCCCTCTAAAGCCCTAAACCCTTCATCTCCATTCATCCACTCCACCTCCCTCACACCATTCTCCAATCCTCTTCTTCAAACCCTAACCCT
AAAATCCCATCAAACCCACAAACCACTCTCTATTCGTTCCGCTCGCCCAAATCCTTCGTTTCTTCCGATATCCCGCCGAATTTCGCAATTCCCATGCGCAAAAATCCGCC
GGGATATCCGGACGTTCGCTGGCCGGAGCAAGAAGAAGGGCGGCGGGCCCTCTCCCGGCCGGATAGAAGGCAACGCCGAGTTCCGGCGGAAGTTGAGGCAAAATGCCCGC
CGGAAGAGCCAGAAGCTCGCCGAGTCCCATTTCTACCGCCGCAAGAACTCGAACAGCAATTACGCCGATAACTTCAGCGAGGATGAGCTTCAGCAGATCGGCCTCGGTTA
CGATAGGATGGTCCGATTCATGGAGAAAGACGACCCGAATCTGCGTCATCCCTACGACTGGTACAAGTACGGCGAGTTCGGCCCGTACTCGTGGCGTGGAGTCGTTGTCG
GCGAGCCGATTCGCGGTCGGTTTTCCGACGAGCGAGTTACGATCATCAGCGAGGTTAAGGATCACGAGGAGTGGGAGAAGATCGAGCAATCAGAAATGGCGTCTGATTTC
AGCGAGGGCTTGCAGCGGATGGACAGGAGCAAAGGGTTTCGGTACTTTTGGGTGTTCGTGAGGCACCCGCGGTGGAGGATTTCGGAGCTGCCATGGCAGCAGTGGACTTT
GATTGCAGAGGTTGTGCTTGAAGCAGGTAAAGAAAGGTTAGATAAATGGAGTTTGATGGGTCGGCTTGGAAACAAGTCAAGAAAGAACATAACTCAATGTGCAGCGTGGA
TGAGACCCGATATCATATATGTGAAAAAACCAGTTTACCAATGCAGATTCGAGCCTCAGGATGAGTTCTTCCAGGCGATAATGCCGTTTCTCGATCCGAAAACAGAGGAA
GATTTTCTGTTTGAGTTGGAGAATGATGAAGGAGAAGTTGAATGGGTGACTTATTTTGGTGGGCTGTGTAAGATTGTGAGGGTGAACCCAAAGGCATTTGTGGATGATGT
GGTGAATGCTTATGAGAAGCTGAGTGACGAGAAGAAATCCAAGTGTTTGGAGTTTCTTTTGACCAACCACCCTGTTCCATTGCTGCATCCATACACAAAAGAGTGGAAGG
CTAAGTTGGAGGAAGAGGAGTTAGGTTGTGATGCTCCGGACGACATCGAAAATCGATGTGGCAACGAAAATGTGATCACGGACTGGATTGAGACTGATGATGACAATGAA
GAGGATGTAGATGTTGAAGACGACAACGATGACGAGGATCAGCCTGAGGATGTGGTGATGGAGACAGAGGAAGAAGTTGAGGACGAGGCGGATAAAGGAGAGGGTCGGAG
TGGAGAAGAAGATGAGGAATATTGGGATGAGAGGTTCAGGAAGGCAATAAGTAGTCCAGAAGAACTGGAGAAGCTGTTCAAACGCAGTGCAGAAGTGAGTGATGAGTTGT
ATGAGAAACAGAAGGAGAATGAGGGAAGCAGAGAGGGCATGGAAGATGGGGATGAGACAGAATTGAGAGGGAAGAGAGCAAAAGTGAGACCAGAAGAATGGGAGCAAATT
GGATATGGGCCATGGAGGAAGAGGATAAAGAGAAGTCAAATTCCTCCAGAGCTGTTTTTGAGATCTACAGTGAGGCCTTTCACTTATAGGAACCTTGTGAAGGAGATTGT
ATTGACCAGACATGCTATTTTGGATGGTGAAATTGGGGTATGA
Protein sequenceShow/hide protein sequence
MATPKFPPSKALNPSSPFIHSTSLTPFSNPLLQTLTLKSHQTHKPLSIRSARPNPSFLPISRRISQFPCAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAEFRRKLRQNAR
RKSQKLAESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFSDERVTIISEVKDHEEWEKIEQSEMASDF
SEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKPVYQCRFEPQDEFFQAIMPFLDPKTEE
DFLFELENDEGEVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAPDDIENRCGNENVITDWIETDDDNE
EDVDVEDDNDDEDQPEDVVMETEEEVEDEADKGEGRSGEEDEEYWDERFRKAISSPEELEKLFKRSAEVSDELYEKQKENEGSREGMEDGDETELRGKRAKVRPEEWEQI
GYGPWRKRIKRSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV