; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy7G128530 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy7G128530
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionUnknown protein
Genome locationchrH07:207046..208818
RNA-Seq ExpressionChy7G128530
SyntenyChy7G128530
Gene Ontology termsGO:0009793 - embryo development ending in seed dormancy (biological process)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7024792.1 hypothetical protein SDJN02_13611, partial [Cucurbita argyrosperma subsp. argyrosperma]0.081.02Show/hide
Query:  MATSPFPPPKTLNPSSPFLNSTSLTPFSNPLLQTLTLQPHQTHYYKPLSIISSISYPYQISLI----------SRPDIRTHAGRSNKKPGGPSPGRIEGN
        MATS FP  KTLNPSSPFL STSLTPFSNPLLQTLTL+ HQT   KPLSIIS +     + +           SRPDIRT AGRS KK GGPSPGRIEGN
Subjt:  MATSPFPPPKTLNPSSPFLNSTSLTPFSNPLLQTLTLQPHQTHYYKPLSIISSISYPYQISLI----------SRPDIRTHAGRSNKKPGGPSPGRIEGN

Query:  AEFRRKLRDNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTI
        AEFRRKLR+N RRKSQK AESHFYRRK SNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYSWRGVV+GEPIRGRFTDERVT+
Subjt:  AEFRRKLRDNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTI

Query:  ISEVKDHEEWEKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKD-RLDKWSLMGRLGNKSRKNITQCAAWMRPD
        I EVKDHEEWEKIEQSEMA+DFS GLQRMD+SKGFR+FWVFVRHPRWRISELPWQQWTLIAEVVLEAGK+ RLDKWSLMGRLGNKSRKNITQCAAWMRPD
Subjt:  ISEVKDHEEWEKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKD-RLDKWSLMGRLGNKSRKNITQCAAWMRPD

Query:  IIYVKKPVYQCRFEPQDDFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHP
        I+YVKKPVYQCRFEPQ +FFQA+MPFLDPKTEQD LFELQDDEGNVEWVTYFGGLCKI+R+NPKAFVDDV NAYEKLSDEKKSKCLEFLL+NHPVPLLHP
Subjt:  IIYVKKPVYQCRFEPQDDFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHP

Query:  YTKEWKAKLEEEELGCDAPDE-MENRRRDDNVITEWIETD-NEEEYEEQPNEDIVMEDMDEEEGKDEKDDDEREEGNQEEEEEDEGYWDERFRKAISSPE
        YTKEWKAKLEEEELGCDAPD+  ENR  D+NV+ EWIETD N+++YE+   ED+VME  +E E       DE + G  + EEEDE YWDERFRKAISSPE
Subjt:  YTKEWKAKLEEEELGCDAPDE-MENRRRDDNVITEWIETD-NEEEYEEQPNEDIVMEDMDEEEGKDEKDDDEREEGNQEEEEEDEGYWDERFRKAISSPE

Query:  ELEKLFKRSGEMADELYEKE---NVGRRRATAMKDGDEVEMRGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL
        ELEKL KRS E +DE YEK+   N G R+A    DGDE E+RGK+ KVK EEWE IGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL
Subjt:  ELEKLFKRSGEMADELYEKE---NVGRRRATAMKDGDEVEMRGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL

Query:  DGEIGV
        +GEIGV
Subjt:  DGEIGV

XP_004146025.1 uncharacterized protein LOC101207599 [Cucumis sativus]0.096.78Show/hide
Query:  MATSPFPPPKTLNPSSPFLNSTSLTPFSNPLLQTLTLQPHQTHYYKPLSIISSISYPYQISLISRPDIRTHAGRSNKKPGGPSPGRIEGNAEFRRKLRDN
        MATSPFPPPKTLNPSSPFLNSTSLTPFSNPLLQTLTL+PH THYYKPLSIIS ISYPYQISL SRPDIRTHAGRS KKPGGPSPGRIEGNA+FRRKLRDN
Subjt:  MATSPFPPPKTLNPSSPFLNSTSLTPFSNPLLQTLTLQPHQTHYYKPLSIISSISYPYQISLISRPDIRTHAGRSNKKPGGPSPGRIEGNAEFRRKLRDN

Query:  ARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKDHEEW
        ARRK+QKLAESHFYRRKKSN NYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKDHEEW
Subjt:  ARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKDHEEW

Query:  EKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKDRLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKPVYQC
        EKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLE+GK+RLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKPVYQC
Subjt:  EKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKDRLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKPVYQC

Query:  RFEPQDDFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHPYTKEWKAKLEE
        RFEPQD+FFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPKAF+DDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHPYTKEWKAKLEE
Subjt:  RFEPQDDFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHPYTKEWKAKLEE

Query:  EELGCDAPDEMENRRRDDNVITEWIETDNEEEYEEQPNEDIVMEDMDEEEGKDEKDDDEREEGNQEEEEEDEGYWDERFRKAISSPEELEKLFKRSGEMA
        EELGCDAPDEMENRRRDDNVITEWIETDNEEEYEEQP EDIVMEDMDE+E +DE+DDDE+EEGNQEEEE DEGYWDERFRKAISSPEELEKLFKRSGEMA
Subjt:  EELGCDAPDEMENRRRDDNVITEWIETDNEEEYEEQPNEDIVMEDMDEEEGKDEKDDDEREEGNQEEEEEDEGYWDERFRKAISSPEELEKLFKRSGEMA

Query:  DELYEKENVGRRRATAMKDGDEVEMRGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
        DELYEKENVGRRRATAMKDGDEVEMRGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
Subjt:  DELYEKENVGRRRATAMKDGDEVEMRGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV

XP_008463741.1 PREDICTED: uncharacterized protein LOC103501814 [Cucumis melo]0.094.61Show/hide
Query:  MATSPFPPPKTLNPSSPFLNSTSLTPFSNPLLQTLTLQPHQTHYYKPLSIISSISYPYQISLI----SRPDIRTHAGRSNKKPGGPSPGRIEGNAEFRRK
        MATS FP PKTLNPSSPFLNSTSLTPFSNPLLQTLTL+ HQTHYYKPLSI+S  S PYQISL+    SRPDIRTHAGRS K PGGPSPGRIEGNAEFRRK
Subjt:  MATSPFPPPKTLNPSSPFLNSTSLTPFSNPLLQTLTLQPHQTHYYKPLSIISSISYPYQISLI----SRPDIRTHAGRSNKKPGGPSPGRIEGNAEFRRK

Query:  LRDNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKD
        LR NARRKSQKLAESHFYRRKK NSNYADNFSEDELQQIGLGYDRMVRF+EKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKD
Subjt:  LRDNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKD

Query:  HEEWEKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKDRLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKP
        HEEWEKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK+RLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKP
Subjt:  HEEWEKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKDRLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKP

Query:  VYQCRFEPQDDFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHPYTKEWKA
        VYQCRFEPQD+FFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRI+PKAFVDDVVNAYEKLSDEKKS CLEFLLSNHPVPLLHPYTKEWKA
Subjt:  VYQCRFEPQDDFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHPYTKEWKA

Query:  KLEEEELGCDAPDEMENRRRDDNVITEWIETDNEEEYEEQPNEDIVMEDMDEEEGKDEKDDDEREEGNQEEEEEDEGYWDERFRKAISSPEELEKLFKRS
        KLEEEELGCDAPDEMENRRRDDNVITEWIETDNEEEYE+QP EDIVMEDMDE+  KD++DDDEREEGNQEEEEEDE YWDERFRKAISSPEELEKLFKRS
Subjt:  KLEEEELGCDAPDEMENRRRDDNVITEWIETDNEEEYEEQPNEDIVMEDMDEEEGKDEKDDDEREEGNQEEEEEDEGYWDERFRKAISSPEELEKLFKRS

Query:  GEMADELYEKENVGRRRATAMKDGDEVEMRGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
        GEMADELYEKENVGRRRATAMKDGDE+EMRGK+PKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
Subjt:  GEMADELYEKENVGRRRATAMKDGDEVEMRGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV

XP_022937202.1 uncharacterized protein LOC111443567 [Cucurbita moschata]0.081.02Show/hide
Query:  MATSPFPPPKTLNPSSPFLNSTSLTPFSNPLLQTLTLQPHQTHYYKPLSIISSISYPYQISLI----------SRPDIRTHAGRSNKKPGGPSPGRIEGN
        MA S FP  KTLNPSSPFL STSLTPFSNPLLQTLTL+ HQT   KPLSIIS +     + +           SRPDIRT AGRS KK GGPSPGRIEGN
Subjt:  MATSPFPPPKTLNPSSPFLNSTSLTPFSNPLLQTLTLQPHQTHYYKPLSIISSISYPYQISLI----------SRPDIRTHAGRSNKKPGGPSPGRIEGN

Query:  AEFRRKLRDNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTI
        AEFRRKLR+N RRKSQK AESHFYRRK SNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYSWRGVV+GEPIRGRFTDERVT+
Subjt:  AEFRRKLRDNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTI

Query:  ISEVKDHEEWEKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKD-RLDKWSLMGRLGNKSRKNITQCAAWMRPD
        I EVKDHEEWEKIEQSEMA+DFS GLQRMD+SKGFR+FWVFVRHPRWRISELPWQQWTLIAEVVLEAGK+ RLDKWSLMGRLGNKSRKNITQCAAWMRPD
Subjt:  ISEVKDHEEWEKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKD-RLDKWSLMGRLGNKSRKNITQCAAWMRPD

Query:  IIYVKKPVYQCRFEPQDDFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHP
        IIYVKKPVYQCRFEPQ +FFQA+MPFLDPKTEQD LFELQDDEGNVEWVTYFGGLCKI+R+NPKAFVDDV NAYEKLSDEKKSKCLEFLL+NHPVPLLHP
Subjt:  IIYVKKPVYQCRFEPQDDFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHP

Query:  YTKEWKAKLEEEELGCDAPDE-MENRRRDDNVITEWIETD-NEEEYEEQPNEDIVMEDMDEEEGKDEKDDDEREEGNQEEEEEDEGYWDERFRKAISSPE
        YTKEWKAKLEEEELGCDAPD+  ENR  D+NV+ EWIETD N+++YE++  ED+VME  +E E       DE + G  + EEEDE YWDERFRKAISSPE
Subjt:  YTKEWKAKLEEEELGCDAPDE-MENRRRDDNVITEWIETD-NEEEYEEQPNEDIVMEDMDEEEGKDEKDDDEREEGNQEEEEEDEGYWDERFRKAISSPE

Query:  ELEKLFKRSGEMADELYEKE---NVGRRRATAMKDGDEVEMRGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL
        ELEKL KRS E +DE YEK+   N G R+A    DGDE E+RGK+ KVK EEWE IGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL
Subjt:  ELEKLFKRSGEMADELYEKE---NVGRRRATAMKDGDEVEMRGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL

Query:  DGEIGV
        +GEIGV
Subjt:  DGEIGV

XP_038898752.1 uncharacterized protein LOC120086270 [Benincasa hispida]0.085.62Show/hide
Query:  MATSPFPPPKTLNPSSPFLNSTSLTPFSNPLLQTLTLQPHQTHYYKPLSIISSISYPY------QISLI----SRPDIRTHAGRSNKKPGGPSPGRIEGN
        MATS FP  KTLN SS FL+STSL+PF +PLLQTLTL+ HQTH  KPLSI S    P       QIS +    S  +IRTHAGRS KK GGPSPGRIEGN
Subjt:  MATSPFPPPKTLNPSSPFLNSTSLTPFSNPLLQTLTLQPHQTHYYKPLSIISSISYPY------QISLI----SRPDIRTHAGRSNKKPGGPSPGRIEGN

Query:  AEFRRKLRDNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTI
        AEFRRKLR NARRKSQKLAESHFYRRKK NSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTI
Subjt:  AEFRRKLRDNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTI

Query:  ISEVKDHEEWEKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKDRLDKWSLMGRLGNKSRKNITQCAAWMRPDI
        ISEVKDHEEWEKIEQSEMA+DFS GL RMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK+RLDKWSLMGRLGNKSRKNITQCAAWMRPDI
Subjt:  ISEVKDHEEWEKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKDRLDKWSLMGRLGNKSRKNITQCAAWMRPDI

Query:  IYVKKPVYQCRFEPQDDFFQAMMPFLDPKTEQDFLFELQDDEG-NVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHP
        IYVKKPVYQCRFEPQD+FFQA+MPFLDPKTEQDFLFELQDDEG +VEWVTYF GLCKIVR+NPKAFVDDVVNAYEKLSDEKKSKCLEFLL+NHPVPLLHP
Subjt:  IYVKKPVYQCRFEPQDDFFQAMMPFLDPKTEQDFLFELQDDEG-NVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHP

Query:  YTKEWKAKLEEEELGCDAPDEMENRRRDDNVITEWIETD--NEEEYEE-QPNEDIVMEDMDEEEGKDEKDDDEREEGNQEEEEEDEGYWDERFRKAISSP
        YTKEWKAKLEEEELGCDAPD++E R  D+NVITEWIETD  N E+YEE QP E++VME  DE+E +DE  DD+RE+GNQEEEE DEGYWDERFRKAISSP
Subjt:  YTKEWKAKLEEEELGCDAPDEMENRRRDDNVITEWIETD--NEEEYEE-QPNEDIVMEDMDEEEGKDEKDDDEREEGNQEEEEEDEGYWDERFRKAISSP

Query:  EELEKLFKRSGEMADELYEKE--NVGRRRATAMKDGDEVEMRGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL
        EELEKLFK S E+ADE YEKE  +VG RRATAM+DGDE E+RGK+ KVKAEEWEYIGYGPWRKKIKKS+IPPELFLRSTVRPFTYRNLVKEIVLTRHAIL
Subjt:  EELEKLFKRSGEMADELYEKE--NVGRRRATAMKDGDEVEMRGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL

Query:  DGEIG
        DGEIG
Subjt:  DGEIG

TrEMBL top hitse value%identityAlignment
A0A0A0L3A4 Uncharacterized protein0.0e+0096.78Show/hide
Query:  MATSPFPPPKTLNPSSPFLNSTSLTPFSNPLLQTLTLQPHQTHYYKPLSIISSISYPYQISLISRPDIRTHAGRSNKKPGGPSPGRIEGNAEFRRKLRDN
        MATSPFPPPKTLNPSSPFLNSTSLTPFSNPLLQTLTL+PH THYYKPLSIIS ISYPYQISL SRPDIRTHAGRS KKPGGPSPGRIEGNA+FRRKLRDN
Subjt:  MATSPFPPPKTLNPSSPFLNSTSLTPFSNPLLQTLTLQPHQTHYYKPLSIISSISYPYQISLISRPDIRTHAGRSNKKPGGPSPGRIEGNAEFRRKLRDN

Query:  ARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKDHEEW
        ARRK+QKLAESHFYRRKKSN NYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKDHEEW
Subjt:  ARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKDHEEW

Query:  EKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKDRLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKPVYQC
        EKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLE+GK+RLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKPVYQC
Subjt:  EKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKDRLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKPVYQC

Query:  RFEPQDDFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHPYTKEWKAKLEE
        RFEPQD+FFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPKAF+DDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHPYTKEWKAKLEE
Subjt:  RFEPQDDFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHPYTKEWKAKLEE

Query:  EELGCDAPDEMENRRRDDNVITEWIETDNEEEYEEQPNEDIVMEDMDEEEGKDEKDDDEREEGNQEEEEEDEGYWDERFRKAISSPEELEKLFKRSGEMA
        EELGCDAPDEMENRRRDDNVITEWIETDNEEEYEEQP EDIVMEDMDE+E +DE+DDDE+EEGNQ EEEEDEGYWDERFRKAISSPEELEKLFKRSGEMA
Subjt:  EELGCDAPDEMENRRRDDNVITEWIETDNEEEYEEQPNEDIVMEDMDEEEGKDEKDDDEREEGNQEEEEEDEGYWDERFRKAISSPEELEKLFKRSGEMA

Query:  DELYEKENVGRRRATAMKDGDEVEMRGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
        DELYEKENVGRRRATAMKDGDEVEMRGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
Subjt:  DELYEKENVGRRRATAMKDGDEVEMRGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV

A0A1S3CKF2 uncharacterized protein LOC1035018140.0e+0094.61Show/hide
Query:  MATSPFPPPKTLNPSSPFLNSTSLTPFSNPLLQTLTLQPHQTHYYKPLSIISSISYPYQISLI----SRPDIRTHAGRSNKKPGGPSPGRIEGNAEFRRK
        MATS FP PKTLNPSSPFLNSTSLTPFSNPLLQTLTL+ HQTHYYKPLSI+S  S PYQISL+    SRPDIRTHAGRS K PGGPSPGRIEGNAEFRRK
Subjt:  MATSPFPPPKTLNPSSPFLNSTSLTPFSNPLLQTLTLQPHQTHYYKPLSIISSISYPYQISLI----SRPDIRTHAGRSNKKPGGPSPGRIEGNAEFRRK

Query:  LRDNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKD
        LR NARRKSQKLAESHFYRRKK NSNYADNFSEDELQQIGLGYDRMVRF+EKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKD
Subjt:  LRDNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKD

Query:  HEEWEKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKDRLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKP
        HEEWEKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK+RLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKP
Subjt:  HEEWEKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKDRLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKP

Query:  VYQCRFEPQDDFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHPYTKEWKA
        VYQCRFEPQD+FFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRI+PKAFVDDVVNAYEKLSDEKKS CLEFLLSNHPVPLLHPYTKEWKA
Subjt:  VYQCRFEPQDDFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHPYTKEWKA

Query:  KLEEEELGCDAPDEMENRRRDDNVITEWIETDNEEEYEEQPNEDIVMEDMDEEEGKDEKDDDEREEGNQEEEEEDEGYWDERFRKAISSPEELEKLFKRS
        KLEEEELGCDAPDEMENRRRDDNVITEWIETDNEEEYE+QP EDIVMEDMDE+  KD++DDDEREEGNQEEEEEDE YWDERFRKAISSPEELEKLFKRS
Subjt:  KLEEEELGCDAPDEMENRRRDDNVITEWIETDNEEEYEEQPNEDIVMEDMDEEEGKDEKDDDEREEGNQEEEEEDEGYWDERFRKAISSPEELEKLFKRS

Query:  GEMADELYEKENVGRRRATAMKDGDEVEMRGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
        GEMADELYEKENVGRRRATAMKDGDE+EMRGK+PKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
Subjt:  GEMADELYEKENVGRRRATAMKDGDEVEMRGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV

A0A5A7VK56 Uncharacterized protein0.0e+0094.61Show/hide
Query:  MATSPFPPPKTLNPSSPFLNSTSLTPFSNPLLQTLTLQPHQTHYYKPLSIISSISYPYQISLI----SRPDIRTHAGRSNKKPGGPSPGRIEGNAEFRRK
        MATS FP PKTLNPSSPFLNSTSLTPFSNPLLQTLTL+ HQTHYYKPLSI+S  S PYQISL+    SRPDIRTHAGRS K PGGPSPGRIEGNAEFRRK
Subjt:  MATSPFPPPKTLNPSSPFLNSTSLTPFSNPLLQTLTLQPHQTHYYKPLSIISSISYPYQISLI----SRPDIRTHAGRSNKKPGGPSPGRIEGNAEFRRK

Query:  LRDNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKD
        LR NARRKSQKLAESHFYRRKK NSNYADNFSEDELQQIGLGYDRMVRF+EKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKD
Subjt:  LRDNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKD

Query:  HEEWEKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKDRLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKP
        HEEWEKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK+RLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKP
Subjt:  HEEWEKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKDRLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKP

Query:  VYQCRFEPQDDFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHPYTKEWKA
        VYQCRFEPQD+FFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRI+PKAFVDDVVNAYEKLSDEKKS CLEFLLSNHPVPLLHPYTKEWKA
Subjt:  VYQCRFEPQDDFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHPYTKEWKA

Query:  KLEEEELGCDAPDEMENRRRDDNVITEWIETDNEEEYEEQPNEDIVMEDMDEEEGKDEKDDDEREEGNQEEEEEDEGYWDERFRKAISSPEELEKLFKRS
        KLEEEELGCDAPDEMENRRRDDNVITEWIETDNEEEYE+QP EDIVMEDMDE+  KD++DDDEREEGNQEEEEEDE YWDERFRKAISSPEELEKLFKRS
Subjt:  KLEEEELGCDAPDEMENRRRDDNVITEWIETDNEEEYEEQPNEDIVMEDMDEEEGKDEKDDDEREEGNQEEEEEDEGYWDERFRKAISSPEELEKLFKRS

Query:  GEMADELYEKENVGRRRATAMKDGDEVEMRGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
        GEMADELYEKENVGRRRATAMKDGDE+EMRGK+PKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
Subjt:  GEMADELYEKENVGRRRATAMKDGDEVEMRGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV

A0A6J1CN80 uncharacterized protein LOC1110128143.4e-26980.33Show/hide
Query:  MATSPFPPPKTLNPSSPFLNSTSLTPFSNPLLQTLTLQPHQTHYYKPLSIISSISYP-----------YQISLISRPDIRTHAGRSNKKPGGPSPGRIEG
        MAT  F   KTLNPSSP      LTPFSNPLLQTLTL+PH++H  KPLSI+S+   P           +  ++I R DIRT AGRS KK GG SPGRIEG
Subjt:  MATSPFPPPKTLNPSSPFLNSTSLTPFSNPLLQTLTLQPHQTHYYKPLSIISSISYP-----------YQISLISRPDIRTHAGRSNKKPGGPSPGRIEG

Query:  NAEFRRKLRDNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVT
        NAEFRR+LR NARRKSQK AESHFYRRK SNSNYADNF+EDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGE+GPYSWRGVVVGEPIRGRFTDERVT
Subjt:  NAEFRRKLRDNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVT

Query:  IISEVKDHEEWEKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKDRLDKWSLMGRLGNKSRKNITQCAAWMRPD
        IISEVKDHEEWEKIEQSEMA+DFS GLQRMDKSKGFRYFWVFVRHPRWRIS+LPWQQWTLIAEVVLEAGK+RLDKW+LMGRLGNKSRKNITQCAAWMRPD
Subjt:  IISEVKDHEEWEKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKDRLDKWSLMGRLGNKSRKNITQCAAWMRPD

Query:  IIYVKKPVYQCRFEPQDDFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHP
        IIYVKKPVYQCRFEPQD+FFQA+MPFLDPKTEQDFLFELQ+DEG+VEWVTYFGGLCKIVR+NPKAFVDDVVNAYEKLSDEKKSKCLEFLL+NHPVPLLHP
Subjt:  IIYVKKPVYQCRFEPQDDFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHP

Query:  YTKEWKAKLEEEELGCDAPDEMENRRRDD--NVITEWIET--DNEEEYEEQPNEDIVMEDMDEEEGKDEKDDDEREEGNQEEEEEDEGYWDERFRKAISS
        YTKEWKAKLEEEELGCDAPD+ E RR  D  NVI EWIET  DN+E  +E   +D+VME+  +E+G D K+DD          EEDE YWDERFRKAISS
Subjt:  YTKEWKAKLEEEELGCDAPDEMENRRRDD--NVITEWIET--DNEEEYEEQPNEDIVMEDMDEEEGKDEKDDDEREEGNQEEEEEDEGYWDERFRKAISS

Query:  PEELEKLFKRSGEMADELYEKENVGRRRATAMKDGDEVEMRGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILD
        PEE+EKLFKRS E++DELYEK+         M+DGDE EMRGK+ KV+AEEWE IGYGPWRK+IKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILD
Subjt:  PEELEKLFKRSGEMADELYEKENVGRRRATAMKDGDEVEMRGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILD

Query:  GEIGV
        GEIGV
Subjt:  GEIGV

A0A6J1FAH0 uncharacterized protein LOC1114435671.4e-27081.02Show/hide
Query:  MATSPFPPPKTLNPSSPFLNSTSLTPFSNPLLQTLTLQPHQTHYYKPLSIISSISYPYQISLI----------SRPDIRTHAGRSNKKPGGPSPGRIEGN
        MA S FP  KTLNPSSPFL STSLTPFSNPLLQTLTL+ HQT   KPLSIIS +     + +           SRPDIRT AGRS KK GGPSPGRIEGN
Subjt:  MATSPFPPPKTLNPSSPFLNSTSLTPFSNPLLQTLTLQPHQTHYYKPLSIISSISYPYQISLI----------SRPDIRTHAGRSNKKPGGPSPGRIEGN

Query:  AEFRRKLRDNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTI
        AEFRRKLR+N RRKSQK AESHFYRRK SNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYSWRGVV+GEPIRGRFTDERVT+
Subjt:  AEFRRKLRDNARRKSQKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTI

Query:  ISEVKDHEEWEKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK-DRLDKWSLMGRLGNKSRKNITQCAAWMRPD
        I EVKDHEEWEKIEQSEMA+DFS GLQRMD+SKGFR+FWVFVRHPRWRISELPWQQWTLIAEVVLEAGK +RLDKWSLMGRLGNKSRKNITQCAAWMRPD
Subjt:  ISEVKDHEEWEKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGK-DRLDKWSLMGRLGNKSRKNITQCAAWMRPD

Query:  IIYVKKPVYQCRFEPQDDFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHP
        IIYVKKPVYQCRFEPQ +FFQA+MPFLDPKTEQD LFELQDDEGNVEWVTYFGGLCKI+R+NPKAFVDDV NAYEKLSDEKKSKCLEFLL+NHPVPLLHP
Subjt:  IIYVKKPVYQCRFEPQDDFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHP

Query:  YTKEWKAKLEEEELGCDAP-DEMENRRRDDNVITEWIET-DNEEEYEEQPNEDIVMEDMDEEEGKDEKDDDEREEGNQEEEEEDEGYWDERFRKAISSPE
        YTKEWKAKLEEEELGCDAP D+ ENR  D+NV+ EWIET DN+++YE++  ED+VME  +E E       DE + G  + EEEDE YWDERFRKAISSPE
Subjt:  YTKEWKAKLEEEELGCDAP-DEMENRRRDDNVITEWIET-DNEEEYEEQPNEDIVMEDMDEEEGKDEKDDDEREEGNQEEEEEDEGYWDERFRKAISSPE

Query:  ELEKLFKRSGEMADELYEKE---NVGRRRATAMKDGDEVEMRGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL
        ELEKL KRS E +DE YEK+   N G R+A    DGDE E+RGK+ KVK EEWE IGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL
Subjt:  ELEKLFKRSGEMADELYEKE---NVGRRRATAMKDGDEVEMRGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL

Query:  DGEIGV
        +GEIGV
Subjt:  DGEIGV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G14900.1 unknown protein1.0e-18056.86Show/hide
Query:  KTLNPSSPFLNSTSLTPFSNPLLQTLTLQPHQTHYYKPLSIISSISYPYQISLISRPDIRTHAGRSNKK-PGGPSPGRIEGNAEFRRKLRDNARRKSQKL
        KTLNPS  F  S    P ++ + + +++ P  T      S+  S     +     R D+R  AGRS KK  GG S GRIEG+++ R++++ NAR KS+KL
Subjt:  KTLNPSSPFLNSTSLTPFSNPLLQTLTLQPHQTHYYKPLSIISSISYPYQISLISRPDIRTHAGRSNKK-PGGPSPGRIEGNAEFRRKLRDNARRKSQKL

Query:  AESHFYR--------RKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKDHEEW
        AES FYR        R +  S++ D F+E+EL+ IGLGYDRMVRFM+KDDP LRHPYDW+KYGEFGPYSWRGVVVG+P+RG  +DE VT+I EV++HEE+
Subjt:  AESHFYR--------RKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKDHEEW

Query:  EKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAG-KDRLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKPVYQ
        EKIEQ EM   F   ++ +D + G RYFWVFVRHP+WR+SELPW+QWTL++EVV+EA  K RLDKW+LMGRLGNKSR  I QCAAW RPDI+YVKKPV+Q
Subjt:  EKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAG-KDRLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKPVYQ

Query:  CRFEPQDDFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHPYTKEWKAKLE
        CRFEPQ+DFF +++P+L+P TE  F+ E++DDEG VE  TY+GGLCK++++   AFVDDVVNAYEKLSDEKKS+ L+FLL NHP  LLHPYTKEWKAKLE
Subjt:  CRFEPQDDFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHPYTKEWKAKLE

Query:  EEELGCDAPDEMENR-----RRDDNVITEWI--ETDNEEEYEEQPNEDIVMEDMDEE-------EGKDEKDDDEREEGNQEEEEEDEGYWDERFRKAISS
        E ELGCDAPDE E+        +    +EWI  E DN+++ ++  ++D  +E++D++       EG  E+D  E +E  + + EEDE YW+E+F KA ++
Subjt:  EEELGCDAPDEMENR-----RRDDNVITEWI--ETDNEEEYEEQPNEDIVMEDMDEE-------EGKDEKDDDEREEGNQEEEEEDEGYWDERFRKAISS

Query:  PEELEKLFKRSGEMADELYEKE-NVGRRRATAMKDGDEVEMRGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL
         E +EKL + S  ++D+ YEK+      R     +GDE+EMRGKK KVK EEW+ +GYG W KKIKKS+IPPELFLR+ VRPF YRNLVKEIVLTRHAIL
Subjt:  PEELEKLFKRSGEMADELYEKE-NVGRRRATAMKDGDEVEMRGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL

Query:  DGEIG
        +GEIG
Subjt:  DGEIG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTACTTCCCCATTCCCTCCCCCTAAAACCCTAAACCCTTCTTCTCCATTTCTCAACTCCACCTCCCTCACACCATTCTCCAATCCCCTTCTTCAAACCCTA
ACCCTTCAACCCCATCAAACCCATTATTATAAACCTCTTTCCATCATTTCCAGTATCTCATATCCTTACCAAATTTCCCTTATCTCACGCCCGGACATTCGTACC
CATGCCGGCCGGAGCAACAAGAAACCTGGAGGCCCCTCTCCCGGACGGATTGAAGGCAACGCCGAATTCCGACGGAAACTCAGGGATAATGCCCGCCGTAAAAGC
CAGAAGCTCGCCGAGTCCCATTTCTACCGTCGCAAGAAGTCGAACAGCAATTATGCGGATAACTTCAGTGAGGATGAGCTTCAGCAGATTGGCCTTGGCTACGAT
CGTATGGTTCGATTCATGGAGAAAGATGACCCGAATCTACGCCATCCCTACGATTGGTACAAGTACGGCGAGTTCGGCCCGTACTCGTGGCGTGGAGTCGTCGTC
GGCGAGCCGATTCGTGGACGGTTCACGGATGAACGAGTTACGATTATCAGTGAGGTTAAGGATCATGAGGAGTGGGAGAAGATTGAGCAATCAGAAATGGCTGCT
GATTTCAGCACGGGATTGCAGAGGATGGACAAGAGCAAAGGATTTCGTTACTTTTGGGTGTTCGTGAGACACCCACGGTGGAGAATCTCAGAGCTGCCTTGGCAG
CAATGGACTTTGATTGCAGAGGTTGTACTGGAAGCTGGTAAAGACAGGTTGGATAAATGGAGCTTGATGGGTAGGCTTGGAAACAAGTCAAGAAAGAACATAACT
CAATGTGCAGCTTGGATGAGACCCGATATCATATATGTGAAAAAACCTGTTTATCAGTGCAGATTTGAGCCACAGGATGATTTTTTCCAAGCAATGATGCCATTT
CTTGATCCCAAAACAGAACAAGATTTTCTCTTTGAGTTGCAGGATGACGAAGGAAATGTTGAATGGGTGACTTATTTTGGTGGGTTGTGTAAGATTGTGAGGATA
AATCCAAAGGCATTTGTAGATGATGTAGTGAATGCTTACGAGAAGTTGAGCGATGAGAAGAAATCCAAATGTTTGGAGTTTCTTCTGAGTAACCACCCTGTTCCA
TTGCTGCATCCATATACAAAAGAGTGGAAGGCTAAGTTGGAGGAGGAGGAGTTGGGATGTGATGCCCCAGACGAGATGGAAAATCGACGTAGGGACGACAATGTG
ATCACGGAGTGGATTGAGACTGACAATGAAGAAGAGTACGAGGAGCAACCCAACGAAGATATCGTAATGGAGGACATGGATGAGGAGGAGGGCAAGGACGAAAAG
GATGACGATGAACGAGAGGAGGGAAATCAGGAAGAAGAAGAAGAAGATGAGGGTTACTGGGATGAGAGGTTTAGGAAGGCAATAAGTAGTCCAGAAGAGCTTGAG
AAGCTGTTTAAACGCAGTGGAGAAATGGCTGATGAGTTGTATGAGAAGGAGAATGTGGGGAGAAGAAGGGCAACAGCCATGAAAGACGGGGATGAGGTGGAAATG
AGAGGGAAGAAACCAAAAGTGAAAGCAGAAGAATGGGAGTATATTGGGTATGGGCCATGGAGGAAGAAGATAAAGAAGAGTCAAATTCCTCCAGAGTTGTTCTTG
AGATCTACAGTAAGGCCTTTCACTTATAGGAATCTTGTGAAGGAAATTGTATTGACAAGGCATGCTATTTTGGATGGTGAAATTGGGGTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTACTTCCCCATTCCCTCCCCCTAAAACCCTAAACCCTTCTTCTCCATTTCTCAACTCCACCTCCCTCACACCATTCTCCAATCCCCTTCTTCAAACCCTA
ACCCTTCAACCCCATCAAACCCATTATTATAAACCTCTTTCCATCATTTCCAGTATCTCATATCCTTACCAAATTTCCCTTATCTCACGCCCGGACATTCGTACC
CATGCCGGCCGGAGCAACAAGAAACCTGGAGGCCCCTCTCCCGGACGGATTGAAGGCAACGCCGAATTCCGACGGAAACTCAGGGATAATGCCCGCCGTAAAAGC
CAGAAGCTCGCCGAGTCCCATTTCTACCGTCGCAAGAAGTCGAACAGCAATTATGCGGATAACTTCAGTGAGGATGAGCTTCAGCAGATTGGCCTTGGCTACGAT
CGTATGGTTCGATTCATGGAGAAAGATGACCCGAATCTACGCCATCCCTACGATTGGTACAAGTACGGCGAGTTCGGCCCGTACTCGTGGCGTGGAGTCGTCGTC
GGCGAGCCGATTCGTGGACGGTTCACGGATGAACGAGTTACGATTATCAGTGAGGTTAAGGATCATGAGGAGTGGGAGAAGATTGAGCAATCAGAAATGGCTGCT
GATTTCAGCACGGGATTGCAGAGGATGGACAAGAGCAAAGGATTTCGTTACTTTTGGGTGTTCGTGAGACACCCACGGTGGAGAATCTCAGAGCTGCCTTGGCAG
CAATGGACTTTGATTGCAGAGGTTGTACTGGAAGCTGGTAAAGACAGGTTGGATAAATGGAGCTTGATGGGTAGGCTTGGAAACAAGTCAAGAAAGAACATAACT
CAATGTGCAGCTTGGATGAGACCCGATATCATATATGTGAAAAAACCTGTTTATCAGTGCAGATTTGAGCCACAGGATGATTTTTTCCAAGCAATGATGCCATTT
CTTGATCCCAAAACAGAACAAGATTTTCTCTTTGAGTTGCAGGATGACGAAGGAAATGTTGAATGGGTGACTTATTTTGGTGGGTTGTGTAAGATTGTGAGGATA
AATCCAAAGGCATTTGTAGATGATGTAGTGAATGCTTACGAGAAGTTGAGCGATGAGAAGAAATCCAAATGTTTGGAGTTTCTTCTGAGTAACCACCCTGTTCCA
TTGCTGCATCCATATACAAAAGAGTGGAAGGCTAAGTTGGAGGAGGAGGAGTTGGGATGTGATGCCCCAGACGAGATGGAAAATCGACGTAGGGACGACAATGTG
ATCACGGAGTGGATTGAGACTGACAATGAAGAAGAGTACGAGGAGCAACCCAACGAAGATATCGTAATGGAGGACATGGATGAGGAGGAGGGCAAGGACGAAAAG
GATGACGATGAACGAGAGGAGGGAAATCAGGAAGAAGAAGAAGAAGATGAGGGTTACTGGGATGAGAGGTTTAGGAAGGCAATAAGTAGTCCAGAAGAGCTTGAG
AAGCTGTTTAAACGCAGTGGAGAAATGGCTGATGAGTTGTATGAGAAGGAGAATGTGGGGAGAAGAAGGGCAACAGCCATGAAAGACGGGGATGAGGTGGAAATG
AGAGGGAAGAAACCAAAAGTGAAAGCAGAAGAATGGGAGTATATTGGGTATGGGCCATGGAGGAAGAAGATAAAGAAGAGTCAAATTCCTCCAGAGTTGTTCTTG
AGATCTACAGTAAGGCCTTTCACTTATAGGAATCTTGTGAAGGAAATTGTATTGACAAGGCATGCTATTTTGGATGGTGAAATTGGGGTATGA
Protein sequenceShow/hide protein sequence
MATSPFPPPKTLNPSSPFLNSTSLTPFSNPLLQTLTLQPHQTHYYKPLSIISSISYPYQISLISRPDIRTHAGRSNKKPGGPSPGRIEGNAEFRRKLRDNARRKS
QKLAESHFYRRKKSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMAA
DFSTGLQRMDKSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKDRLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKPVYQCRFEPQDDFFQAMMPF
LDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHPYTKEWKAKLEEEELGCDAPDEMENRRRDDNV
ITEWIETDNEEEYEEQPNEDIVMEDMDEEEGKDEKDDDEREEGNQEEEEEDEGYWDERFRKAISSPEELEKLFKRSGEMADELYEKENVGRRRATAMKDGDEVEM
RGKKPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV