; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC11G202120 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC11G202120
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionUnknown protein
Genome locationCiama_Chr11:1725266..1727053
RNA-Seq ExpressionCaUC11G202120
SyntenyCaUC11G202120
Gene Ontology termsGO:0009793 - embryo development ending in seed dormancy (biological process)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591919.1 hypothetical protein SDJN03_14265, partial [Cucurbita argyrosperma subsp. sororia]4.7e-28986.7Show/hide
Query:  QFPFSKTLNPSSPFLHSTSFTPFPNPLLQTLTLKSHQTHKPLSIVSGSPNPSFLPISRQISHFQFANSRRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRK
        QFP  KTLNPSSPFL STS TPF NPLLQTLTLKSHQT KPLSI+SG PN S LPI RQIS F FANSR DIRT AGRSKKKGGGPSPGRIEGNAEFRRK
Subjt:  QFPFSKTLNPSSPFLHSTSFTPFPNPLLQTLTLKSHQTHKPLSIVSGSPNPSFLPISRQISHFQFANSRRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRK

Query:  LRHNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKD
        LR+N RRKSQK AESHFYRRK SNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYSWRGVV+GEPIRGRFTDERVT+I EVKD
Subjt:  LRHNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKD

Query:  HEEWEKIEQSEMASDFSEGLQQMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKK
        HEEWEKIEQSEMASDFSEGLQ+MD+SKGFR+FWVFVRHPRWRISELPWQQWTLIAEVVLEAGK ERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKK
Subjt:  HEEWEKIEQSEMASDFSEGLQQMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKK

Query:  PVYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDEGDVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWK
        PVYQCRFEPQ EFFQA+MPFLDPKTEQD LFELQDDEG+VEWVTYFGGLCKI+R+NPKAFVDDV NAYEKLS+EKKSKCLEFLLTNHPVPLLHPYTKEWK
Subjt:  PVYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDEGDVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWK

Query:  AKLEEEELGCDAP-DEIENQRGDENVITEWIETDDDDNEDQPE-EDVIMETDDEEEDEDDKGEDGNEEEDENYWDERFRKAISSPEELEKLFKGSKEVAD
        AKLEEEELGCDAP D+ EN+  DENV+ EWIETDD+D++ + E EDV+MET++E EDE+D GE  NEEEDE+YWDERFRKAISSPEELEKL K S+E +D
Subjt:  AKLEEEELGCDAP-DEIENQRGDENVITEWIETDDDDNEDQPE-EDVIMETDDEEEDEDDKGEDGNEEEDENYWDERFRKAISSPEELEKLFKGSKEVAD

Query:  EFYEKEMEKENVGSRRGTAME-DGDETEMRGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
        EFYEK+ +  N GSR+  AME DGDETE+RGKRAKVK EEWE IGYGPWRKKIKKS+IPPELFLRSTVRPFTYRNLVKEIVLTRHAIL+GEIGV
Subjt:  EFYEKEMEKENVGSRRGTAME-DGDETEMRGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV

KAG7024792.1 hypothetical protein SDJN02_13611, partial [Cucurbita argyrosperma subsp. argyrosperma]3.6e-28986.7Show/hide
Query:  QFPFSKTLNPSSPFLHSTSFTPFPNPLLQTLTLKSHQTHKPLSIVSGSPNPSFLPISRQISHFQFANSRRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRK
        QFP  KTLNPSSPFL STS TPF NPLLQTLTLKSHQT KPLSI+SG PN S LPI RQIS F FANSR DIRT AGRSKKKGGGPSPGRIEGNAEFRRK
Subjt:  QFPFSKTLNPSSPFLHSTSFTPFPNPLLQTLTLKSHQTHKPLSIVSGSPNPSFLPISRQISHFQFANSRRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRK

Query:  LRHNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKD
        LR+N RRKSQK AESHFYRRK SNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYSWRGVV+GEPIRGRFTDERVT+I EVKD
Subjt:  LRHNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKD

Query:  HEEWEKIEQSEMASDFSEGLQQMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKK
        HEEWEKIEQSEMASDFSEGLQ+MD+SKGFR+FWVFVRHPRWRISELPWQQWTLIAEVVLEAGK ERLDKWSLMGRLGNKSRKNITQCAAWMRPDI+YVKK
Subjt:  HEEWEKIEQSEMASDFSEGLQQMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKK

Query:  PVYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDEGDVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWK
        PVYQCRFEPQ EFFQA+MPFLDPKTEQD LFELQDDEG+VEWVTYFGGLCKI+R+NPKAFVDDV NAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWK
Subjt:  PVYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDEGDVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWK

Query:  AKLEEEELGCDAP-DEIENQRGDENVITEWIETDD-DDNEDQPEEDVIMETDDEEEDEDDKGEDGNEEEDENYWDERFRKAISSPEELEKLFKGSKEVAD
        AKLEEEELGCDAP D+ EN+  DENV+ EWIETDD DD+ +   EDV+MET++E EDE+D GE  NEEEDE+YWDERFRKAISSPEELEKL K S+E +D
Subjt:  AKLEEEELGCDAP-DEIENQRGDENVITEWIETDD-DDNEDQPEEDVIMETDDEEEDEDDKGEDGNEEEDENYWDERFRKAISSPEELEKLFKGSKEVAD

Query:  EFYEKEMEKENVGSRRGTAME-DGDETEMRGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
        EFYEK+ +  N GSR+  AME DGDETE+RGKRAKVK EEWE IGYGPWRKKIKKS+IPPELFLRSTVRPFTYRNLVKEIVLTRHAIL+GEIGV
Subjt:  EFYEKEMEKENVGSRRGTAME-DGDETEMRGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV

XP_008463741.1 PREDICTED: uncharacterized protein LOC103501814 [Cucumis melo]2.0e-29287.46Show/hide
Query:  QFPFSKTLNPSSPFLHSTSFTPFPNPLLQTLTLKSHQTH--KPLSIVSGSPNPSFLPISRQISHFQFANSRRDIRTHAGRSKKKGGGPSPGRIEGNAEFR
        QFP  KTLNPSSPFL+STS TPF NPLLQTLTLKSHQTH  KPLSI+SG  NP       QIS     +SR DIRTHAGRSKK  GGPSPGRIEGNAEFR
Subjt:  QFPFSKTLNPSSPFLHSTSFTPFPNPLLQTLTLKSHQTH--KPLSIVSGSPNPSFLPISRQISHFQFANSRRDIRTHAGRSKKKGGGPSPGRIEGNAEFR

Query:  RKLRHNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEV
        RKLRHNARRKSQKLAESHFYRRKK NSNYADNFSEDELQQIGLGYDRMVRF+EKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEV
Subjt:  RKLRHNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEV

Query:  KDHEEWEKIEQSEMASDFSEGLQQMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVK
        KDHEEWEKIEQSEMA+DFS GLQ+MDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVK
Subjt:  KDHEEWEKIEQSEMASDFSEGLQQMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVK

Query:  KPVYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDEGDVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEW
        KPVYQCRFEPQDEFFQA+MPFLDPKTEQDFLFELQDDEG+VEWVTYFGGLCKIVRI+PKAFVDDVVNAYEKLSDEKKS CLEFLL+NHPVPLLHPYTKEW
Subjt:  KPVYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDEGDVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEW

Query:  KAKLEEEELGCDAPDEIENQRGDENVITEWIETDDDDN-EDQPEEDVIMETDDEEED--EDDKGEDGN---EEEDENYWDERFRKAISSPEELEKLFKGS
        KAKLEEEELGCDAPDE+EN+R D+NVITEWIETD+++  EDQPEED++ME  DE++D  +DD+ E+GN   EEEDE+YWDERFRKAISSPEELEKLFK S
Subjt:  KAKLEEEELGCDAPDEIENQRGDENVITEWIETDDDDN-EDQPEEDVIMETDDEEED--EDDKGEDGN---EEEDENYWDERFRKAISSPEELEKLFKGS

Query:  KEVADEFYEKEMEKENVGSRRGTAMEDGDETEMRGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
         E+ADE Y    EKENVG RR TAM+DGDE EMRGKR KVKAEEWEYIGYGPWRKKIKKS+IPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
Subjt:  KEVADEFYEKEMEKENVGSRRGTAMEDGDETEMRGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV

XP_022937202.1 uncharacterized protein LOC111443567 [Cucurbita moschata]9.5e-29086.87Show/hide
Query:  QFPFSKTLNPSSPFLHSTSFTPFPNPLLQTLTLKSHQTHKPLSIVSGSPNPSFLPISRQISHFQFANSRRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRK
        QFP  KTLNPSSPFL STS TPF NPLLQTLTLKSHQT KPLSI+SG PN S LPI RQIS F FANSR DIRT AGRSKKKGGGPSPGRIEGNAEFRRK
Subjt:  QFPFSKTLNPSSPFLHSTSFTPFPNPLLQTLTLKSHQTHKPLSIVSGSPNPSFLPISRQISHFQFANSRRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRK

Query:  LRHNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKD
        LR+N RRKSQK AESHFYRRK SNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYSWRGVV+GEPIRGRFTDERVT+I EVKD
Subjt:  LRHNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKD

Query:  HEEWEKIEQSEMASDFSEGLQQMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKK
        HEEWEKIEQSEMASDFSEGLQ+MD+SKGFR+FWVFVRHPRWRISELPWQQWTLIAEVVLEAGK ERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKK
Subjt:  HEEWEKIEQSEMASDFSEGLQQMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKK

Query:  PVYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDEGDVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWK
        PVYQCRFEPQ EFFQA+MPFLDPKTEQD LFELQDDEG+VEWVTYFGGLCKI+R+NPKAFVDDV NAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWK
Subjt:  PVYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDEGDVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWK

Query:  AKLEEEELGCDAP-DEIENQRGDENVITEWIETDDDDNEDQPE-EDVIMETDDEEEDEDDKGEDGNEEEDENYWDERFRKAISSPEELEKLFKGSKEVAD
        AKLEEEELGCDAP D+ EN+  DENV+ EWIETDD+D++ + E EDV+MET++E EDE+D GE  NEEEDE+YWDERFRKAISSPEELEKL K S+E +D
Subjt:  AKLEEEELGCDAP-DEIENQRGDENVITEWIETDDDDNEDQPE-EDVIMETDDEEEDEDDKGEDGNEEEDENYWDERFRKAISSPEELEKLFKGSKEVAD

Query:  EFYEKEMEKENVGSRRGTAME-DGDETEMRGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
        EFYEK+ +  N GSR+  AME DGDETE+RGKRAKVK EEWE IGYGPWRKKIKKS+IPPELFLRSTVRPFTYRNLVKEIVLTRHAIL+GEIGV
Subjt:  EFYEKEMEKENVGSRRGTAME-DGDETEMRGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV

XP_038898752.1 uncharacterized protein LOC120086270 [Benincasa hispida]0.0e+0092.31Show/hide
Query:  QFPFSKTLNPSSPFLHSTSFTPFPNPLLQTLTLKSHQTHKPLSIVSGSPNPSFLPISRQISHFQFANSRRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRK
        QFP SKTLN SS FLHSTS +PF +PLLQTLTLKSHQTHKPLSI SG PNPSFLPISRQISH QFANS R+IRTHAGRSKKKGGGPSPGRIEGNAEFRRK
Subjt:  QFPFSKTLNPSSPFLHSTSFTPFPNPLLQTLTLKSHQTHKPLSIVSGSPNPSFLPISRQISHFQFANSRRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRK

Query:  LRHNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKD
        LRHNARRKSQKLAESHFYRRKK NSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKD
Subjt:  LRHNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKD

Query:  HEEWEKIEQSEMASDFSEGLQQMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKP
        HEEWEKIEQSEMASDFS+GL +MDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKP
Subjt:  HEEWEKIEQSEMASDFSEGLQQMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKP

Query:  VYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDE-GDVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWK
        VYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDE GDVEWVTYF GLCKIVR+NPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWK
Subjt:  VYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDE-GDVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWK

Query:  AKLEEEELGCDAPDEIENQRGDENVITEWIETDDDD----NEDQPEEDVIMET--DDEEEDEDDKGEDGN--EEEDENYWDERFRKAISSPEELEKLFKG
        AKLEEEELGCDAPD+IE + GDENVITEWIETDDD+     EDQPEE+V+MET  +DE+EDEDDK EDGN  EEEDE YWDERFRKAISSPEELEKLFK 
Subjt:  AKLEEEELGCDAPDEIENQRGDENVITEWIETDDDD----NEDQPEEDVIMET--DDEEEDEDDKGEDGN--EEEDENYWDERFRKAISSPEELEKLFKG

Query:  SKEVADEFYEKEMEKENVGSRRGTAMEDGDETEMRGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIG
        S EVADEFYEK  EKE+VGSRR TAMEDGDETE+RGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIG
Subjt:  SKEVADEFYEKEMEKENVGSRRGTAMEDGDETEMRGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIG

TrEMBL top hitse value%identityAlignment
A0A0A0L3A4 Uncharacterized protein5.1e-28986.62Show/hide
Query:  FPFSKTLNPSSPFLHSTSFTPFPNPLLQTLTLKSHQTH--KPLSIVSGSPNPSFLPISRQISHFQFANSRRDIRTHAGRSKKKGGGPSPGRIEGNAEFRR
        FP  KTLNPSSPFL+STS TPF NPLLQTLTLK H TH  KPLSI+SG   P       QIS F    SR DIRTHAGRSKKK GGPSPGRIEGNA+FRR
Subjt:  FPFSKTLNPSSPFLHSTSFTPFPNPLLQTLTLKSHQTH--KPLSIVSGSPNPSFLPISRQISHFQFANSRRDIRTHAGRSKKKGGGPSPGRIEGNAEFRR

Query:  KLRHNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVK
        KLR NARRK+QKLAESHFYRRKKSN NYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVK
Subjt:  KLRHNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVK

Query:  DHEEWEKIEQSEMASDFSEGLQQMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKK
        DHEEWEKIEQSEMA+DFS GLQ+MDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLE+GKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKK
Subjt:  DHEEWEKIEQSEMASDFSEGLQQMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKK

Query:  PVYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDEGDVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWK
        PVYQCRFEPQDEFFQA+MPFLDPKTEQDFLFELQDDEG+VEWVTYFGGLCKIVRINPKAF+DDVVNAYEKLSDEKKSKCLEFLL+NHPVPLLHPYTKEWK
Subjt:  PVYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDEGDVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWK

Query:  AKLEEEELGCDAPDEIENQRGDENVITEWIETDDDDN-EDQPEEDVIMETDDEEEDE----DDKGEDGN--EEEDENYWDERFRKAISSPEELEKLFKGS
        AKLEEEELGCDAPDE+EN+R D+NVITEWIETD+++  E+QP+ED++ME  DE+EDE    DD+ E+GN  EEEDE YWDERFRKAISSPEELEKLFK S
Subjt:  AKLEEEELGCDAPDEIENQRGDENVITEWIETDDDDN-EDQPEEDVIMETDDEEEDE----DDKGEDGN--EEEDENYWDERFRKAISSPEELEKLFKGS

Query:  KEVADEFYEKEMEKENVGSRRGTAMEDGDETEMRGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
         E+ADE Y    EKENVG RR TAM+DGDE EMRGK+ KVKAEEWEYIGYGPWRKKIKKS+IPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
Subjt:  KEVADEFYEKEMEKENVGSRRGTAMEDGDETEMRGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV

A0A1S3CKF2 uncharacterized protein LOC1035018149.9e-29387.46Show/hide
Query:  QFPFSKTLNPSSPFLHSTSFTPFPNPLLQTLTLKSHQTH--KPLSIVSGSPNPSFLPISRQISHFQFANSRRDIRTHAGRSKKKGGGPSPGRIEGNAEFR
        QFP  KTLNPSSPFL+STS TPF NPLLQTLTLKSHQTH  KPLSI+SG  NP       QIS     +SR DIRTHAGRSKK  GGPSPGRIEGNAEFR
Subjt:  QFPFSKTLNPSSPFLHSTSFTPFPNPLLQTLTLKSHQTH--KPLSIVSGSPNPSFLPISRQISHFQFANSRRDIRTHAGRSKKKGGGPSPGRIEGNAEFR

Query:  RKLRHNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEV
        RKLRHNARRKSQKLAESHFYRRKK NSNYADNFSEDELQQIGLGYDRMVRF+EKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEV
Subjt:  RKLRHNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEV

Query:  KDHEEWEKIEQSEMASDFSEGLQQMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVK
        KDHEEWEKIEQSEMA+DFS GLQ+MDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVK
Subjt:  KDHEEWEKIEQSEMASDFSEGLQQMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVK

Query:  KPVYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDEGDVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEW
        KPVYQCRFEPQDEFFQA+MPFLDPKTEQDFLFELQDDEG+VEWVTYFGGLCKIVRI+PKAFVDDVVNAYEKLSDEKKS CLEFLL+NHPVPLLHPYTKEW
Subjt:  KPVYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDEGDVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEW

Query:  KAKLEEEELGCDAPDEIENQRGDENVITEWIETDDDDN-EDQPEEDVIMETDDEEED--EDDKGEDGN---EEEDENYWDERFRKAISSPEELEKLFKGS
        KAKLEEEELGCDAPDE+EN+R D+NVITEWIETD+++  EDQPEED++ME  DE++D  +DD+ E+GN   EEEDE+YWDERFRKAISSPEELEKLFK S
Subjt:  KAKLEEEELGCDAPDEIENQRGDENVITEWIETDDDDN-EDQPEEDVIMETDDEEED--EDDKGEDGN---EEEDENYWDERFRKAISSPEELEKLFKGS

Query:  KEVADEFYEKEMEKENVGSRRGTAMEDGDETEMRGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
         E+ADE Y    EKENVG RR TAM+DGDE EMRGKR KVKAEEWEYIGYGPWRKKIKKS+IPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
Subjt:  KEVADEFYEKEMEKENVGSRRGTAMEDGDETEMRGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV

A0A5A7VK56 Uncharacterized protein9.9e-29387.46Show/hide
Query:  QFPFSKTLNPSSPFLHSTSFTPFPNPLLQTLTLKSHQTH--KPLSIVSGSPNPSFLPISRQISHFQFANSRRDIRTHAGRSKKKGGGPSPGRIEGNAEFR
        QFP  KTLNPSSPFL+STS TPF NPLLQTLTLKSHQTH  KPLSI+SG  NP       QIS     +SR DIRTHAGRSKK  GGPSPGRIEGNAEFR
Subjt:  QFPFSKTLNPSSPFLHSTSFTPFPNPLLQTLTLKSHQTH--KPLSIVSGSPNPSFLPISRQISHFQFANSRRDIRTHAGRSKKKGGGPSPGRIEGNAEFR

Query:  RKLRHNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEV
        RKLRHNARRKSQKLAESHFYRRKK NSNYADNFSEDELQQIGLGYDRMVRF+EKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEV
Subjt:  RKLRHNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEV

Query:  KDHEEWEKIEQSEMASDFSEGLQQMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVK
        KDHEEWEKIEQSEMA+DFS GLQ+MDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVK
Subjt:  KDHEEWEKIEQSEMASDFSEGLQQMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVK

Query:  KPVYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDEGDVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEW
        KPVYQCRFEPQDEFFQA+MPFLDPKTEQDFLFELQDDEG+VEWVTYFGGLCKIVRI+PKAFVDDVVNAYEKLSDEKKS CLEFLL+NHPVPLLHPYTKEW
Subjt:  KPVYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDEGDVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEW

Query:  KAKLEEEELGCDAPDEIENQRGDENVITEWIETDDDDN-EDQPEEDVIMETDDEEED--EDDKGEDGN---EEEDENYWDERFRKAISSPEELEKLFKGS
        KAKLEEEELGCDAPDE+EN+R D+NVITEWIETD+++  EDQPEED++ME  DE++D  +DD+ E+GN   EEEDE+YWDERFRKAISSPEELEKLFK S
Subjt:  KAKLEEEELGCDAPDEIENQRGDENVITEWIETDDDDN-EDQPEEDVIMETDDEEED--EDDKGEDGN---EEEDENYWDERFRKAISSPEELEKLFKGS

Query:  KEVADEFYEKEMEKENVGSRRGTAMEDGDETEMRGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
         E+ADE Y    EKENVG RR TAM+DGDE EMRGKR KVKAEEWEYIGYGPWRKKIKKS+IPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
Subjt:  KEVADEFYEKEMEKENVGSRRGTAMEDGDETEMRGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV

A0A6J1FAH0 uncharacterized protein LOC1114435674.6e-29086.87Show/hide
Query:  QFPFSKTLNPSSPFLHSTSFTPFPNPLLQTLTLKSHQTHKPLSIVSGSPNPSFLPISRQISHFQFANSRRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRK
        QFP  KTLNPSSPFL STS TPF NPLLQTLTLKSHQT KPLSI+SG PN S LPI RQIS F FANSR DIRT AGRSKKKGGGPSPGRIEGNAEFRRK
Subjt:  QFPFSKTLNPSSPFLHSTSFTPFPNPLLQTLTLKSHQTHKPLSIVSGSPNPSFLPISRQISHFQFANSRRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRK

Query:  LRHNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKD
        LR+N RRKSQK AESHFYRRK SNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYSWRGVV+GEPIRGRFTDERVT+I EVKD
Subjt:  LRHNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKD

Query:  HEEWEKIEQSEMASDFSEGLQQMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKK
        HEEWEKIEQSEMASDFSEGLQ+MD+SKGFR+FWVFVRHPRWRISELPWQQWTLIAEVVLEAGK ERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKK
Subjt:  HEEWEKIEQSEMASDFSEGLQQMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKK

Query:  PVYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDEGDVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWK
        PVYQCRFEPQ EFFQA+MPFLDPKTEQD LFELQDDEG+VEWVTYFGGLCKI+R+NPKAFVDDV NAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWK
Subjt:  PVYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDEGDVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWK

Query:  AKLEEEELGCDAP-DEIENQRGDENVITEWIETDDDDNEDQPE-EDVIMETDDEEEDEDDKGEDGNEEEDENYWDERFRKAISSPEELEKLFKGSKEVAD
        AKLEEEELGCDAP D+ EN+  DENV+ EWIETDD+D++ + E EDV+MET++E EDE+D GE  NEEEDE+YWDERFRKAISSPEELEKL K S+E +D
Subjt:  AKLEEEELGCDAP-DEIENQRGDENVITEWIETDDDDNEDQPE-EDVIMETDDEEEDEDDKGEDGNEEEDENYWDERFRKAISSPEELEKLFKGSKEVAD

Query:  EFYEKEMEKENVGSRRGTAME-DGDETEMRGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
        EFYEK+ +  N GSR+  AME DGDETE+RGKRAKVK EEWE IGYGPWRKKIKKS+IPPELFLRSTVRPFTYRNLVKEIVLTRHAIL+GEIGV
Subjt:  EFYEKEMEKENVGSRRGTAME-DGDETEMRGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV

A0A6J1INI9 uncharacterized protein LOC1114768532.1e-28785.81Show/hide
Query:  QFPFSKTLNPSSPFLHSTSFTPFPNPLLQ--TLTLKSHQTHKPLSIVSGSPNPSFLPISRQISHFQFANSRRDIRTHAGRSKKKGGGPSPGRIEGNAEFR
        QFP  KTLNPSSPFLHSTS TPF NPLLQ  TLTLKSH+T KPLSI+SG PN S LPI RQIS F FANSR DIRT AGRSKKKGGG SPGRIEGNAEFR
Subjt:  QFPFSKTLNPSSPFLHSTSFTPFPNPLLQ--TLTLKSHQTHKPLSIVSGSPNPSFLPISRQISHFQFANSRRDIRTHAGRSKKKGGGPSPGRIEGNAEFR

Query:  RKLRHNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEV
        RKLR+N RRKSQK AESHFYRRK SNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYSWRGVV+GEPIRGRFTDERVT+I EV
Subjt:  RKLRHNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEV

Query:  KDHEEWEKIEQSEMASDFSEGLQQMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYV
        KDHEEWEKIEQSEMASDFSEGLQ+MD++KGFR+FWVFVRHPRWRISELPWQQWTLIAEVVLEAGK ERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYV
Subjt:  KDHEEWEKIEQSEMASDFSEGLQQMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYV

Query:  KKPVYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDEGDVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKE
        KKPVYQCRFEPQ EFFQA+MPFLDPKTEQD LFELQDDEG+VEWVTYFGGLCKI+R+NPKAFVDDV NAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKE
Subjt:  KKPVYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDEGDVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKE

Query:  WKAKLEEEELGCDAPDEIE---NQRGDENVITEWIETDDDDNEDQPE--EDVIMETDDEEEDEDDKGEDGNEEEDENYWDERFRKAISSPEELEKLFKGS
        WKAKLEEEELGCDAPD+ +   N+  DENVI EWIETDDD++ D  +  EDV+MET++E EDE+D GE  NEEEDE+YWDERFRKAISSPEELEKL K S
Subjt:  WKAKLEEEELGCDAPDEIE---NQRGDENVITEWIETDDDDNEDQPE--EDVIMETDDEEEDEDDKGEDGNEEEDENYWDERFRKAISSPEELEKLFKGS

Query:  KEVADEFYEKEMEKENVGSRRGTAME-DGDETEMRGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
        +E +DEFYEK+ +  N+GSR+  AME DGDETE+RGKRAKVK EEWE IGYGPWRKKIKKS+IPPELFLRSTVRPFTYRNLVKEIVLTRHAIL+GEIGV
Subjt:  KEVADEFYEKEMEKENVGSRRGTAME-DGDETEMRGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G14900.1 unknown protein8.0e-18656.73Show/hide
Query:  FSKTLNPSSPFLHSTSFTPFPNPLLQTLTLKSHQTHKPLSIVSGSPNPSFLPISRQISHFQFANSRRDIRTHAGRSKKK-GGGPSPGRIEGNAEFRRKLR
        FSKTLNPS  F  S    P  + + + +++    T +         N +F     ++   +    RRD+R  AGRSKKK GGG S GRIEG+++ R++++
Subjt:  FSKTLNPSSPFLHSTSFTPFPNPLLQTLTLKSHQTHKPLSIVSGSPNPSFLPISRQISHFQFANSRRDIRTHAGRSKKK-GGGPSPGRIEGNAEFRRKLR

Query:  HNARRKSQKLAESHFYR--------RKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTI
         NAR KS+KLAES FYR        R +  S++ D F+E+EL+ IGLGYDRMVRFM+KDDP LRHPYDW+KYGEFGPYSWRGVVVG+P+RG  +DE VT+
Subjt:  HNARRKSQKLAESHFYR--------RKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTI

Query:  ISEVKDHEEWEKIEQSEMASDFSEGLQQMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAG-KERLDKWSLMGRLGNKSRKNITQCAAWMRPD
        I EV++HEE+EKIEQ EM   F + ++++D + G RYFWVFVRHP+WR+SELPW+QWTL++EVV+EA  K+RLDKW+LMGRLGNKSR  I QCAAW RPD
Subjt:  ISEVKDHEEWEKIEQSEMASDFSEGLQQMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAG-KERLDKWSLMGRLGNKSRKNITQCAAWMRPD

Query:  IIYVKKPVYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDEGDVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHP
        I+YVKKPV+QCRFEPQ++FF +++P+L+P TE  F+ E++DDEG VE  TY+GGLCK++++   AFVDDVVNAYEKLSDEKKS+ L+FLL NHP  LLHP
Subjt:  IIYVKKPVYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDEGDVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHP

Query:  YTKEWKAKLEEEELGCDAPDEIENQ-----RGDENVITEWIE----TDDDDNEDQPEEDVIMETDD------------EEEDEDDKGEDGNEEEDENYWD
        YTKEWKAKLEE ELGCDAPDE E++       ++   +EWIE     DDDD++D  ++  + E DD            EE+  +D+ E+ + EEDE YW+
Subjt:  YTKEWKAKLEEEELGCDAPDEIENQ-----RGDENVITEWIE----TDDDDNEDQPEEDVIMETDD------------EEEDEDDKGEDGNEEEDENYWD

Query:  ERFRKAISSPEELEKLFKGSKEVADEFYEKEMEKENVGSRRGTAMEDGDETEMRGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNL
        E+F KA ++ E +EKL + S  V+D+FYEK+++       R     +GDE EMRGK+AKVK EEW+ +GYG W KKIKKS+IPPELFLR+ VRPF YRNL
Subjt:  ERFRKAISSPEELEKLFKGSKEVADEFYEKEMEKENVGSRRGTAMEDGDETEMRGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNL

Query:  VKEIVLTRHAILDGEIG
        VKEIVLTRHAIL+GEIG
Subjt:  VKEIVLTRHAILDGEIG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCATTCCTTTGCAATTCCCTTTCTCCAAAACCCTAAACCCTTCATCTCCATTCCTCCACTCAACCTCCTTCACACCGTTCCCCAATCCCCTTCTTCAAACCTTAAC
CCTAAAATCCCATCAAACGCATAAACCCCTTTCCATTGTATCCGGTTCCCCAAATCCTTCCTTTCTTCCCATCTCCCGCCAAATTTCGCATTTCCAATTTGCAAACTCCC
GTCGGGATATTCGTACGCACGCCGGCCGGAGCAAGAAGAAGGGTGGAGGGCCCTCTCCCGGCCGGATAGAAGGCAACGCCGAGTTTCGACGGAAATTGAGGCACAATGCC
CGCCGGAAGAGCCAGAAGCTCGCCGAGTCCCATTTCTACCGCCGCAAGAAGTCGAACAGCAATTACGCGGATAACTTCAGTGAGGATGAACTTCAGCAGATCGGCCTCGG
CTACGATCGGATGGTCCGATTCATGGAGAAAGACGACCCTAATCTACGCCATCCCTACGACTGGTACAAGTACGGCGAGTTCGGCCCGTACTCGTGGCGTGGAGTCGTTG
TCGGCGAGCCTATTCGTGGGCGGTTCACGGATGAGCGAGTTACGATTATCAGCGAGGTTAAGGATCATGAGGAGTGGGAGAAGATCGAGCAATCAGAAATGGCTTCTGAT
TTCAGCGAGGGATTGCAGCAGATGGACAAGAGCAAAGGGTTTCGGTACTTTTGGGTGTTCGTGAGACACCCGCGGTGGAGGATTTCGGAGCTTCCATGGCAGCAATGGAC
TTTGATTGCAGAGGTTGTACTTGAAGCTGGTAAAGAAAGGTTAGATAAATGGAGCTTGATGGGTCGGCTTGGAAATAAGTCAAGAAAGAATATAACTCAATGTGCAGCTT
GGATGAGACCTGATATCATATATGTGAAAAAGCCTGTTTACCAATGCAGATTTGAGCCTCAGGATGAGTTTTTCCAGGCAATAATGCCATTTCTTGATCCCAAAACAGAG
CAAGATTTTCTGTTTGAGTTGCAGGATGATGAAGGAGACGTTGAATGGGTGACTTATTTTGGTGGCTTGTGTAAGATTGTGAGGATAAACCCAAAGGCATTTGTGGATGA
TGTGGTGAATGCTTATGAGAAGCTGAGTGATGAGAAGAAATCCAAGTGTTTGGAGTTTCTTTTGACTAACCATCCTGTTCCATTGCTGCATCCATATACAAAAGAGTGGA
AGGCTAAGTTGGAGGAGGAGGAGTTGGGTTGTGATGCCCCGGACGAGATCGAGAATCAACGTGGTGATGAAAATGTGATCACAGAGTGGATTGAGACTGATGATGATGAC
AATGAGGATCAGCCTGAGGAGGATGTCATAATGGAGACCGACGATGAGGAGGAGGACGAGGATGATAAAGGAGAGGATGGAAATGAGGAAGAAGATGAGAATTACTGGGA
TGAAAGGTTTAGGAAGGCAATAAGTAGTCCAGAAGAACTGGAGAAGCTGTTTAAAGGCAGTAAAGAAGTGGCTGATGAATTTTATGAGAAGGAGATGGAGAAGGAGAATG
TGGGAAGTAGAAGGGGTACAGCCATGGAAGATGGGGATGAAACAGAAATGAGAGGGAAGAGAGCAAAAGTGAAAGCAGAAGAATGGGAGTACATTGGGTATGGGCCATGG
AGGAAGAAGATAAAGAAAAGTAAAATTCCTCCAGAGCTGTTTTTAAGATCTACAGTAAGGCCTTTCACTTACAGGAACCTTGTGAAGGAAATTGTATTGACAAGGCATGC
TATTTTGGATGGTGAAATTGGGGTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCATTCCTTTGCAATTCCCTTTCTCCAAAACCCTAAACCCTTCATCTCCATTCCTCCACTCAACCTCCTTCACACCGTTCCCCAATCCCCTTCTTCAAACCTTAAC
CCTAAAATCCCATCAAACGCATAAACCCCTTTCCATTGTATCCGGTTCCCCAAATCCTTCCTTTCTTCCCATCTCCCGCCAAATTTCGCATTTCCAATTTGCAAACTCCC
GTCGGGATATTCGTACGCACGCCGGCCGGAGCAAGAAGAAGGGTGGAGGGCCCTCTCCCGGCCGGATAGAAGGCAACGCCGAGTTTCGACGGAAATTGAGGCACAATGCC
CGCCGGAAGAGCCAGAAGCTCGCCGAGTCCCATTTCTACCGCCGCAAGAAGTCGAACAGCAATTACGCGGATAACTTCAGTGAGGATGAACTTCAGCAGATCGGCCTCGG
CTACGATCGGATGGTCCGATTCATGGAGAAAGACGACCCTAATCTACGCCATCCCTACGACTGGTACAAGTACGGCGAGTTCGGCCCGTACTCGTGGCGTGGAGTCGTTG
TCGGCGAGCCTATTCGTGGGCGGTTCACGGATGAGCGAGTTACGATTATCAGCGAGGTTAAGGATCATGAGGAGTGGGAGAAGATCGAGCAATCAGAAATGGCTTCTGAT
TTCAGCGAGGGATTGCAGCAGATGGACAAGAGCAAAGGGTTTCGGTACTTTTGGGTGTTCGTGAGACACCCGCGGTGGAGGATTTCGGAGCTTCCATGGCAGCAATGGAC
TTTGATTGCAGAGGTTGTACTTGAAGCTGGTAAAGAAAGGTTAGATAAATGGAGCTTGATGGGTCGGCTTGGAAATAAGTCAAGAAAGAATATAACTCAATGTGCAGCTT
GGATGAGACCTGATATCATATATGTGAAAAAGCCTGTTTACCAATGCAGATTTGAGCCTCAGGATGAGTTTTTCCAGGCAATAATGCCATTTCTTGATCCCAAAACAGAG
CAAGATTTTCTGTTTGAGTTGCAGGATGATGAAGGAGACGTTGAATGGGTGACTTATTTTGGTGGCTTGTGTAAGATTGTGAGGATAAACCCAAAGGCATTTGTGGATGA
TGTGGTGAATGCTTATGAGAAGCTGAGTGATGAGAAGAAATCCAAGTGTTTGGAGTTTCTTTTGACTAACCATCCTGTTCCATTGCTGCATCCATATACAAAAGAGTGGA
AGGCTAAGTTGGAGGAGGAGGAGTTGGGTTGTGATGCCCCGGACGAGATCGAGAATCAACGTGGTGATGAAAATGTGATCACAGAGTGGATTGAGACTGATGATGATGAC
AATGAGGATCAGCCTGAGGAGGATGTCATAATGGAGACCGACGATGAGGAGGAGGACGAGGATGATAAAGGAGAGGATGGAAATGAGGAAGAAGATGAGAATTACTGGGA
TGAAAGGTTTAGGAAGGCAATAAGTAGTCCAGAAGAACTGGAGAAGCTGTTTAAAGGCAGTAAAGAAGTGGCTGATGAATTTTATGAGAAGGAGATGGAGAAGGAGAATG
TGGGAAGTAGAAGGGGTACAGCCATGGAAGATGGGGATGAAACAGAAATGAGAGGGAAGAGAGCAAAAGTGAAAGCAGAAGAATGGGAGTACATTGGGTATGGGCCATGG
AGGAAGAAGATAAAGAAAAGTAAAATTCCTCCAGAGCTGTTTTTAAGATCTACAGTAAGGCCTTTCACTTACAGGAACCTTGTGAAGGAAATTGTATTGACAAGGCATGC
TATTTTGGATGGTGAAATTGGGGTATGA
Protein sequenceShow/hide protein sequence
MAIPLQFPFSKTLNPSSPFLHSTSFTPFPNPLLQTLTLKSHQTHKPLSIVSGSPNPSFLPISRQISHFQFANSRRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNA
RRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASD
FSEGLQQMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKPVYQCRFEPQDEFFQAIMPFLDPKTE
QDFLFELQDDEGDVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAPDEIENQRGDENVITEWIETDDDD
NEDQPEEDVIMETDDEEEDEDDKGEDGNEEEDENYWDERFRKAISSPEELEKLFKGSKEVADEFYEKEMEKENVGSRRGTAMEDGDETEMRGKRAKVKAEEWEYIGYGPW
RKKIKKSKIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV