CuGenDBv2

Sed0024004 (gene) of Chayote v1 genome

Gene ID: Sed0024004
Organism: Sechium edule (Chayote v1)
Description: GTP binding
Genome location: LG04:29392504..29416499
RNA-Seq Expression: Sed0024004
Synteny: Sed0024004
Gene Ontology terms: GO:0003677 - DNA binding (molecular function)
InterPro domains: IPR001606 - ARID DNA-binding domain
IPR019341 - Alpha/gamma-adaptin-binding protein p34
IPR036431 - ARID DNA-binding domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_008464616.1 PREDICTED: uncharacterized protein LOC103502455 [Cucumis melo] (e value: 2.7e-80, % identity: 60.98)
Query:  DLLKFDVLLCIENEVDLVP------------------------------------GSSLLGDEELSWETRRSCLDWCIERNIEFIEACASNADFDKCLSI
        DL KF+VLLCI N+VDLVP                                    GSSLLGDE+ SWETRRSCL+WCIERNIEF+EACASNADFDKCLSI
Subjt:  DLLKFDVLLCIENEVDLVP------------------------------------GSSLLGDEELSWETRRSCLDWCIERNIEFIEACASNADFDKCLSI

Query:  DDDVQGVQRLYSALSARMWPGMILKSGDNQAILTKRRRFFALHNLKLSKEESDSELDYEILSGGSAEPWYDR-IQG--LRIECSSIDAGARPKDIDIVKQ
        D D+QGVQRLY ALSA MWPGMIL SGD    +TK     +L   +LS+EESD E+DYEILS GSAEPW D   QG  L  E  SIDAGA  KD  I +Q
Subjt:  DDDVQGVQRLYSALSARMWPGMILKSGDNQAILTKRRRFFALHNLKLSKEESDSELDYEILSGGSAEPWYDR-IQG--LRIECSSIDAGARPKDIDIVKQ

Query:  DQARLDIQEQPEAEENLVAVDGEVDRVTDSCRDDHLGLEDLERIMSERGNIRDGLRLMPDFQRREMAAKLAMKMAAMFGGCSDDEEE
        ++AR +I E  + EEN VA+DGE+D+VT+ C   HL LEDLER+MSE GN+RD LRLMPDFQRREMAAKLA KMAAMF G SDD++E
Subjt:  DQARLDIQEQPEAEENLVAVDGEVDRVTDSCRDDHLGLEDLERIMSERGNIRDGLRLMPDFQRREMAAKLAMKMAAMFGGCSDDEEE

XP_011653760.1 uncharacterized protein LOC101203083 [Cucumis sativus] (e value: 3.0e-79, % identity: 59.93)
Query:  DLLKFDVLLCIENEVDLVP------------------------------------GSSLLGDEELSWETRRSCLDWCIERNIEFIEACASNADFDKCLSI
        DL  FDVLLCI N+VDLVP                                    GSSLLGDE+ SWETRRSCL+WCIERNIEF+EACASNADFD+CLSI
Subjt:  DLLKFDVLLCIENEVDLVP------------------------------------GSSLLGDEELSWETRRSCLDWCIERNIEFIEACASNADFDKCLSI

Query:  DDDVQGVQRLYSALSARMWPGMILKSGDNQAILTKRRRFFALHNLKLSKEESDSELDYEILSGGSAEPWYDR-IQG--LRIECSSIDAGARPKDIDIVKQ
        D D+QGVQRLY ALSA MWPGM LKSGD    +TK     +L   +LS+EESD E+DYE+LS GSAEPW D   QG  L  E  S+DAGA  KD DI +Q
Subjt:  DDDVQGVQRLYSALSARMWPGMILKSGDNQAILTKRRRFFALHNLKLSKEESDSELDYEILSGGSAEPWYDR-IQG--LRIECSSIDAGARPKDIDIVKQ

Query:  DQARLDIQEQPEAEENLVAVDGEVDRVTDSCRDDHLGLEDLERIMSERGNIRDGLRLMPDFQRREMAAKLAMKMAAMFGGCSDDEEE
        ++A  +I +Q + EEN VA+DGE+D+VT+ C   HL LEDLER+MSE GN+RD LRLMPDFQRREMAAKLA KMAAMF G SDD++E
Subjt:  DQARLDIQEQPEAEENLVAVDGEVDRVTDSCRDDHLGLEDLERIMSERGNIRDGLRLMPDFQRREMAAKLAMKMAAMFGGCSDDEEE

XP_022158283.1 uncharacterized protein LOC111024806 [Momordica charantia] (e value: 2.5e-81, % identity: 61.3)
Query:  DLLKFDVLLCIENEVDLVP-------------------------------------GSSLLGDEELSWETRRSCLDWCIERNIEFIEACASNADFDKCLS
        DL KFDVLLCI N+VDLVP                                     GSSLLGDEE SWE RRSCL+WC ERNIEFIEACASN+DFDKCLS
Subjt:  DLLKFDVLLCIENEVDLVP-------------------------------------GSSLLGDEELSWETRRSCLDWCIERNIEFIEACASNADFDKCLS

Query:  IDDDVQGVQRLYSALSARMWPGMILKSGD--NQAILTKRRRFFALHNLKLSKEESDSELDYEILSGGSAEPWYD---RIQGLRIECSSIDAGARPKDIDI
        ID DVQGV+RLY ALSA +WPGM+LKSGD   +  L K          +LS+EESD E+DYEILSGGSAE W D   +      E SSID GARPK IDI
Subjt:  IDDDVQGVQRLYSALSARMWPGMILKSGD--NQAILTKRRRFFALHNLKLSKEESDSELDYEILSGGSAEPWYD---RIQGLRIECSSIDAGARPKDIDI

Query:  VKQDQARLDIQEQPEAEENLVAVDGEVDRVTDSCRDDHLGLEDLERIMSERGNIRDGLRLMPDFQRREMAAKLAMKMAAMFGGCSDDEEETR
         +QD+A  ++ +QPEAE N VAVD EVD+VTD     +L LEDLER+MSE GN+RD LRLMPDFQRREMAAKLA+KMAAMFGG SDDEEETR
Subjt:  VKQDQARLDIQEQPEAEENLVAVDGEVDRVTDSCRDDHLGLEDLERIMSERGNIRDGLRLMPDFQRREMAAKLAMKMAAMFGGCSDDEEETR

XP_022971472.1 uncharacterized protein LOC111470181 [Cucurbita maxima] (e value: 1.2e-80, % identity: 60.9)
Query:  SDLLKFDVLLCIENEVDLVP------------------------------------GSSLLGDEELSWETRRSCLDWCIERNIEFIEACASNADFDKCLS
        +DL KFDVLLCI N+VDLVP                                    GSSLLGDE+ SWETRRSCL+WC+ERNIEFIEACASNADFDKCLS
Subjt:  SDLLKFDVLLCIENEVDLVP------------------------------------GSSLLGDEELSWETRRSCLDWCIERNIEFIEACASNADFDKCLS

Query:  IDDDVQGVQRLYSALSARMWPGMILKSGDNQAILTKRRRFFALHNLKLSKEESDSELDYEILSGGSAEPWYDR-IQGLRI--ECSSIDAGARPKDIDIVK
        ID D+QGVQRLY ALSA MWPGMILKSGD    +TK    F     +LS+EESD E+DYE LS GSAEPW D   +G     E SSID GA PK++D  +
Subjt:  IDDDVQGVQRLYSALSARMWPGMILKSGDNQAILTKRRRFFALHNLKLSKEESDSELDYEILSGGSAEPWYDR-IQGLRI--ECSSIDAGARPKDIDIVK

Query:  QDQARLDIQEQPEAEENLVAVDGEVDRVTDSCRDDHLGLEDLERIMSERGNIRDGLRLMPDFQRREMAAKLAMKMAAMFGGCSDDEEET
        QDQA  + ++Q EAEEN VA +GE D V D C+  H  +EDLER+M E GN+RD LRLMPDFQRREMAAKLA+KMAAMFGG SDDEEET
Subjt:  QDQARLDIQEQPEAEENLVAVDGEVDRVTDSCRDDHLGLEDLERIMSERGNIRDGLRLMPDFQRREMAAKLAMKMAAMFGGCSDDEEET

XP_038883470.1 uncharacterized protein LOC120074423 [Benincasa hispida] (e value: 1.6e-80, % identity: 62.59)
Query:  DLLKFDVLLCIENEVDLVP------------------------------------GSSLLGDEELSWETRRSCLDWCIERNIEFIEACASNADFDKCLSI
        DL K DVLLCI N+VDLVP                                    GSSLLGDE+ SWETRRSCL WCIERNIEFIEACASNADFDKCLSI
Subjt:  DLLKFDVLLCIENEVDLVP------------------------------------GSSLLGDEELSWETRRSCLDWCIERNIEFIEACASNADFDKCLSI

Query:  DDDVQGVQRLYSALSARMWPGMILKSGDNQAILTKRRRFFALHNLKLSKEESDSELDYEILSGGSAEPWYDRIQGLRI--ECSSIDAGARPKDIDIVKQD
        D D+QGVQRLY ALSA MWPGMILKSGD    +TK          +LS+EESD E+DYEILSGGSAEPW D  +G     E SSIDAGA  KD DI +QD
Subjt:  DDDVQGVQRLYSALSARMWPGMILKSGDNQAILTKRRRFFALHNLKLSKEESDSELDYEILSGGSAEPWYDRIQGLRI--ECSSIDAGARPKDIDIVKQD

Query:  QARLDIQEQPEAEENLVAVDGEVDRVTDSCRDDHLGLEDLERIMSERGNIRDGLRLMPDFQRREMAAKLAMKMAAMFGGCSDDEEE
        +A  +I EQP  EEN VA DGE +++TD C    L LEDLER+MSE GN+RD LRLMPDFQRREMAAKLA KMAAMFGG SDD+EE
Subjt:  QARLDIQEQPEAEENLVAVDGEVDRVTDSCRDDHLGLEDLERIMSERGNIRDGLRLMPDFQRREMAAKLAMKMAAMFGGCSDDEEE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L1S1 Uncharacterized protein (e value: 1.5e-79, % identity: 59.93)
Query:  DLLKFDVLLCIENEVDLVP------------------------------------GSSLLGDEELSWETRRSCLDWCIERNIEFIEACASNADFDKCLSI
        DL  FDVLLCI N+VDLVP                                    GSSLLGDE+ SWETRRSCL+WCIERNIEF+EACASNADFD+CLSI
Subjt:  DLLKFDVLLCIENEVDLVP------------------------------------GSSLLGDEELSWETRRSCLDWCIERNIEFIEACASNADFDKCLSI

Query:  DDDVQGVQRLYSALSARMWPGMILKSGDNQAILTKRRRFFALHNLKLSKEESDSELDYEILSGGSAEPWYDR-IQG--LRIECSSIDAGARPKDIDIVKQ
        D D+QGVQRLY ALSA MWPGM LKSGD    +TK     +L   +LS+EESD E+DYE+LS GSAEPW D   QG  L  E  S+DAGA  KD DI +Q
Subjt:  DDDVQGVQRLYSALSARMWPGMILKSGDNQAILTKRRRFFALHNLKLSKEESDSELDYEILSGGSAEPWYDR-IQG--LRIECSSIDAGARPKDIDIVKQ

Query:  DQARLDIQEQPEAEENLVAVDGEVDRVTDSCRDDHLGLEDLERIMSERGNIRDGLRLMPDFQRREMAAKLAMKMAAMFGGCSDDEEE
        ++A  +I +Q + EEN VA+DGE+D+VT+ C   HL LEDLER+MSE GN+RD LRLMPDFQRREMAAKLA KMAAMF G SDD++E
Subjt:  DQARLDIQEQPEAEENLVAVDGEVDRVTDSCRDDHLGLEDLERIMSERGNIRDGLRLMPDFQRREMAAKLAMKMAAMFGGCSDDEEE

A0A1S3CMD7 uncharacterized protein LOC103502455 (e value: 1.3e-80, % identity: 60.98)
Query:  DLLKFDVLLCIENEVDLVP------------------------------------GSSLLGDEELSWETRRSCLDWCIERNIEFIEACASNADFDKCLSI
        DL KF+VLLCI N+VDLVP                                    GSSLLGDE+ SWETRRSCL+WCIERNIEF+EACASNADFDKCLSI
Subjt:  DLLKFDVLLCIENEVDLVP------------------------------------GSSLLGDEELSWETRRSCLDWCIERNIEFIEACASNADFDKCLSI

Query:  DDDVQGVQRLYSALSARMWPGMILKSGDNQAILTKRRRFFALHNLKLSKEESDSELDYEILSGGSAEPWYDR-IQG--LRIECSSIDAGARPKDIDIVKQ
        D D+QGVQRLY ALSA MWPGMIL SGD    +TK     +L   +LS+EESD E+DYEILS GSAEPW D   QG  L  E  SIDAGA  KD  I +Q
Subjt:  DDDVQGVQRLYSALSARMWPGMILKSGDNQAILTKRRRFFALHNLKLSKEESDSELDYEILSGGSAEPWYDR-IQG--LRIECSSIDAGARPKDIDIVKQ

Query:  DQARLDIQEQPEAEENLVAVDGEVDRVTDSCRDDHLGLEDLERIMSERGNIRDGLRLMPDFQRREMAAKLAMKMAAMFGGCSDDEEE
        ++AR +I E  + EEN VA+DGE+D+VT+ C   HL LEDLER+MSE GN+RD LRLMPDFQRREMAAKLA KMAAMF G SDD++E
Subjt:  DQARLDIQEQPEAEENLVAVDGEVDRVTDSCRDDHLGLEDLERIMSERGNIRDGLRLMPDFQRREMAAKLAMKMAAMFGGCSDDEEE

A0A6J1DVN3 uncharacterized protein LOC111024806 (e value: 1.2e-81, % identity: 61.3)
Query:  DLLKFDVLLCIENEVDLVP-------------------------------------GSSLLGDEELSWETRRSCLDWCIERNIEFIEACASNADFDKCLS
        DL KFDVLLCI N+VDLVP                                     GSSLLGDEE SWE RRSCL+WC ERNIEFIEACASN+DFDKCLS
Subjt:  DLLKFDVLLCIENEVDLVP-------------------------------------GSSLLGDEELSWETRRSCLDWCIERNIEFIEACASNADFDKCLS

Query:  IDDDVQGVQRLYSALSARMWPGMILKSGD--NQAILTKRRRFFALHNLKLSKEESDSELDYEILSGGSAEPWYD---RIQGLRIECSSIDAGARPKDIDI
        ID DVQGV+RLY ALSA +WPGM+LKSGD   +  L K          +LS+EESD E+DYEILSGGSAE W D   +      E SSID GARPK IDI
Subjt:  IDDDVQGVQRLYSALSARMWPGMILKSGD--NQAILTKRRRFFALHNLKLSKEESDSELDYEILSGGSAEPWYD---RIQGLRIECSSIDAGARPKDIDI

Query:  VKQDQARLDIQEQPEAEENLVAVDGEVDRVTDSCRDDHLGLEDLERIMSERGNIRDGLRLMPDFQRREMAAKLAMKMAAMFGGCSDDEEETR
         +QD+A  ++ +QPEAE N VAVD EVD+VTD     +L LEDLER+MSE GN+RD LRLMPDFQRREMAAKLA+KMAAMFGG SDDEEETR
Subjt:  VKQDQARLDIQEQPEAEENLVAVDGEVDRVTDSCRDDHLGLEDLERIMSERGNIRDGLRLMPDFQRREMAAKLAMKMAAMFGGCSDDEEETR

A0A6J1E5M4 uncharacterized protein LOC111430989 isoform X1 (e value: 4.2e-79, % identity: 60.28)
Query:  DLLKFDVLLCIENEVDLVP------------------------------------GSSLLGDEELSWETRRSCLDWCIERNIEFIEACASNADFDKCLSI
        D+ KFDVLLCI N+VD+VP                                    GSSLLGDE+ SWETRRSCL+WCIERNIEFIEACASNADFDKCLSI
Subjt:  DLLKFDVLLCIENEVDLVP------------------------------------GSSLLGDEELSWETRRSCLDWCIERNIEFIEACASNADFDKCLSI

Query:  DDDVQGVQRLYSALSARMWPGMILKSGDNQAILTKRRRFFALHNLKLSKEESDSELDYEILSGGSAEPWYD-RIQGLRI--ECSSIDAGARPKDIDIVKQ
        D D+QGVQRLY ALSA MWPGMILKSGD      K          +LS+EESD E+DYEILSGGSAE W D   +G     E SSIDAGA   D+DI +Q
Subjt:  DDDVQGVQRLYSALSARMWPGMILKSGDNQAILTKRRRFFALHNLKLSKEESDSELDYEILSGGSAEPWYD-RIQGLRI--ECSSIDAGARPKDIDIVKQ

Query:  DQARLDIQEQPEAEENLVAVDGEVDRVTDSCRDDHLGLEDLERIMSERGNIRDGLRLMPDFQRREMAAKLAMKMAAMFGGCSDDEEE
        DQA   I E P+AEEN VAVDGE+D++T+     HL LEDLER+MSE GN+RD LRLMPDFQRREMAAKLA+KMA MFG  S+D++E
Subjt:  DQARLDIQEQPEAEENLVAVDGEVDRVTDSCRDDHLGLEDLERIMSERGNIRDGLRLMPDFQRREMAAKLAMKMAAMFGGCSDDEEE

A0A6J1I216 uncharacterized protein LOC111470181 (e value: 5.9e-81, % identity: 60.9)
Query:  SDLLKFDVLLCIENEVDLVP------------------------------------GSSLLGDEELSWETRRSCLDWCIERNIEFIEACASNADFDKCLS
        +DL KFDVLLCI N+VDLVP                                    GSSLLGDE+ SWETRRSCL+WC+ERNIEFIEACASNADFDKCLS
Subjt:  SDLLKFDVLLCIENEVDLVP------------------------------------GSSLLGDEELSWETRRSCLDWCIERNIEFIEACASNADFDKCLS

Query:  IDDDVQGVQRLYSALSARMWPGMILKSGDNQAILTKRRRFFALHNLKLSKEESDSELDYEILSGGSAEPWYDR-IQGLRI--ECSSIDAGARPKDIDIVK
        ID D+QGVQRLY ALSA MWPGMILKSGD    +TK    F     +LS+EESD E+DYE LS GSAEPW D   +G     E SSID GA PK++D  +
Subjt:  IDDDVQGVQRLYSALSARMWPGMILKSGDNQAILTKRRRFFALHNLKLSKEESDSELDYEILSGGSAEPWYDR-IQGLRI--ECSSIDAGARPKDIDIVK

Query:  QDQARLDIQEQPEAEENLVAVDGEVDRVTDSCRDDHLGLEDLERIMSERGNIRDGLRLMPDFQRREMAAKLAMKMAAMFGGCSDDEEET
        QDQA  + ++Q EAEEN VA +GE D V D C+  H  +EDLER+M E GN+RD LRLMPDFQRREMAAKLA+KMAAMFGG SDDEEET
Subjt:  QDQARLDIQEQPEAEENLVAVDGEVDRVTDSCRDDHLGLEDLERIMSERGNIRDGLRLMPDFQRREMAAKLAMKMAAMFGGCSDDEEET

SwissProt top hits (e value, % identity, alignment)
C0SUW7 AT-rich interactive domain-containing protein 6 (e value: 1.2e-11, % identity: 58.93)
Query:  EGSD-SGTEEEQAAFMKELHSFFRERGLEFKPPKFYGKGLNCLKLWRVVTRLGGYD
        +G+D +GT  EQ AF++E+ +F++E  LEFKPPKFYG+ LN LKLWR V  LGGY+
Subjt:  EGSD-SGTEEEQAAFMKELHSFFRERGLEFKPPKFYGKGLNCLKLWRVVTRLGGYD

Q0WNR6 AT-rich interactive domain-containing protein 5 (e value: 1.2e-11, % identity: 58.18)
Query:  EGSDSGTEEEQAAFMKELHSFFRERGLEFKPPKFYGKGLNCLKLWRVVTRLGGYD
        E  ++G  ++Q AF+KE+ +F +E  LEFK PKFYG+ LNCLKLWR V +LGGYD
Subjt:  EGSDSGTEEEQAAFMKELHSFFRERGLEFKPPKFYGKGLNCLKLWRVVTRLGGYD

Q940Y3 AT-rich interactive domain-containing protein 3 (e value: 2.3e-21, % identity: 53.33)
Query:  SSMDAKPEAFGMPETETGSHD--------DGHSNMNHSLDENLIDEGSDSGTEEEQAAFMKELHSFFRERGLEFKPPKFYGKGLNCLKLWRVVTRLGGYD
        +S + K EA G  +   G H         DG      S   +   +G++SGTEE+Q+AFMKEL SFFRER ++FKPPKFYG+GLNCLKLWR VTRLGGYD
Subjt:  SSMDAKPEAFGMPETETGSHD--------DGHSNMNHSLDENLIDEGSDSGTEEEQAAFMKELHSFFRERGLEFKPPKFYGKGLNCLKLWRVVTRLGGYD

Query:  KNGGN
        K  G+
Subjt:  KNGGN

Arabidopsis top hits (e value, % identity, alignment)
AT1G20910.1 ARID/BRIGHT DNA-binding domain-containing protein (e value: 8.9e-13, % identity: 58.93)
Query:  EGSD-SGTEEEQAAFMKELHSFFRERGLEFKPPKFYGKGLNCLKLWRVVTRLGGYD
        +G+D +GT  EQ AF++E+ +F++E  LEFKPPKFYG+ LN LKLWR V  LGGY+
Subjt:  EGSD-SGTEEEQAAFMKELHSFFRERGLEFKPPKFYGKGLNCLKLWRVVTRLGGYD

AT1G76510.1 ARID/BRIGHT DNA-binding domain-containing protein (e value: 8.9e-13, % identity: 58.18)
Query:  EGSDSGTEEEQAAFMKELHSFFRERGLEFKPPKFYGKGLNCLKLWRVVTRLGGYD
        E  ++G  ++Q AF+KE+ +F +E  LEFK PKFYG+ LNCLKLWR V +LGGYD
Subjt:  EGSDSGTEEEQAAFMKELHSFFRERGLEFKPPKFYGKGLNCLKLWRVVTRLGGYD

AT2G17410.1 ARID/BRIGHT DNA-binding domain-containing protein (e value: 1.6e-22, % identity: 53.33)
Query:  SSMDAKPEAFGMPETETGSHD--------DGHSNMNHSLDENLIDEGSDSGTEEEQAAFMKELHSFFRERGLEFKPPKFYGKGLNCLKLWRVVTRLGGYD
        +S + K EA G  +   G H         DG      S   +   +G++SGTEE+Q+AFMKEL SFFRER ++FKPPKFYG+GLNCLKLWR VTRLGGYD
Subjt:  SSMDAKPEAFGMPETETGSHD--------DGHSNMNHSLDENLIDEGSDSGTEEEQAAFMKELHSFFRERGLEFKPPKFYGKGLNCLKLWRVVTRLGGYD

Query:  KNGGN
        K  G+
Subjt:  KNGGN

AT2G17410.2 ARID/BRIGHT DNA-binding domain-containing protein (e value: 1.6e-22, % identity: 53.33)
Query:  SSMDAKPEAFGMPETETGSHD--------DGHSNMNHSLDENLIDEGSDSGTEEEQAAFMKELHSFFRERGLEFKPPKFYGKGLNCLKLWRVVTRLGGYD
        +S + K EA G  +   G H         DG      S   +   +G++SGTEE+Q+AFMKEL SFFRER ++FKPPKFYG+GLNCLKLWR VTRLGGYD
Subjt:  SSMDAKPEAFGMPETETGSHD--------DGHSNMNHSLDENLIDEGSDSGTEEEQAAFMKELHSFFRERGLEFKPPKFYGKGLNCLKLWRVVTRLGGYD

Query:  KNGGN
        K  G+
Subjt:  KNGGN

AT5G65960.1 GTP binding (e value: 2.6e-57, % identity: 43.57)
Query:  ELSSYIGLG---RSSDLLKFDVLLCIENEVDLVP-----------------------------------GSSLLGDEELSWETRRSCLDWCIERNIEFIE
        ELS+ + L      +D+  FD+LLCI N+VD VP                                   GSSLLG E+ S + R +CL+WC E NIEFIE
Subjt:  ELSSYIGLG---RSSDLLKFDVLLCIENEVDLVP-----------------------------------GSSLLGDEELSWETRRSCLDWCIERNIEFIE

Query:  ACASNADFDKCLSIDDDVQGVQRLYSALSARMWPGMILKSGD--NQAILTKRRRFFALHNLKLSKEESDSELDYEILSGGSAEPWYDRIQGLRIECSS--
        ACASN DFDKCLS+D D QGV+RL+ ALSA MWPGMILKSGD  N+ +L +          +LS+EES+ EL+YE+LS GS +PW D I+   +  S   
Subjt:  ACASNADFDKCLSIDDDVQGVQRLYSALSARMWPGMILKSGD--NQAILTKRRRFFALHNLKLSKEESDSELDYEILSGGSAEPWYDRIQGLRIECSS--

Query:  --IDAG--------------ARPKDID-IVKQDQARLDIQEQPEAEENLVAVDGEVDRVTDSCRDDHLGLEDLERIMSERGNIRDGLRLMPDFQRREMAA
           DAG               +P  +D  +    +RL++Q   + +  +V  +  +    + C +     ED+E++MSE GNIRD LRLMPDFQRRE+AA
Subjt:  --IDAG--------------ARPKDID-IVKQDQARLDIQEQPEAEENLVAVDGEVDRVTDSCRDDHLGLEDLERIMSERGNIRDGLRLMPDFQRREMAA

Query:  KLAMKMAAMFGGCSDDEEE
         LAMKMA+MFGG +DDEEE
Subjt:  KLAMKMAAMFGGCSDDEEE


Sequences
CDS sequence
ATGCCTTATTTCATCTTGAAGCAAATGTATGGTTTCGGGGAGAGCGGGTGCATGAATGTTAGAGAGACTAATTTAGTGGTGGCTGTAATGGTGGGGGCTAAAGGCTATGA
TAGTGGCAGACAACAGTTATATGATGCCAAAAATTATGGCATCAGTGGTGGTGGAGATAAATCTACGATTCCTCAACCATCAAATCAAGATTCTGCAAATTCACTTCCGT
GGGATCATGACCAAGGGCCTACTTCTTTTGTCAAGACTGGGCCTGAGGAGTTGCTCTCAAGGGAGTTGGGTCTAGATGCTGCTACTTTCAATACTCACCTGGTAAAGGTT
CAATCCTCCATGGATGCCAAACCTGAGGCATTTGGGATGCCTGAAACTGAAACTGGTTCGCATGATGATGGGCATTCAAACATGAATCACTCGTTAGATGAAAATCTTAT
CGATGAAGGTAGTGATTCAGGAACAGAAGAAGAGCAAGCTGCTTTCATGAAGGAGCTTCATAGCTTCTTTAGAGAAAGAGGCCTGGAATTCAAACCTCCCAAGTTTTATG
GGAAGGGACTGAATTGCCTCAAGTTATGGAGGGTTGTGACTAGACTGGGAGGCTATGACAAGAATGGGGGAAACACGGTTCTTGTACGCAATTCAATGAGGTGGGCTATT
TTCGCAAGTTTTTGTGTTTTGTCTCCCGAGCTATCTTCTTATATCGGATTGGGCCGCTCATCTGATCTGCTGAAATTTGATGTATTATTATGCATAGAAAATGAAGTCGA
TCTTGTTCCAGGAAGTAGTTTATTGGGAGACGAGGAACTCTCATGGGAAACTAGGAGGTCATGTTTGGATTGGTGCATTGAACGCAATATTGAGTTCATTGAGGCTTGTG
CATCTAATGCAGATTTTGATAAATGTTTATCGATTGATGATGATGTACAAGGAGTTCAAAGGCTTTATAGCGCTCTTTCTGCTCGTATGTGGCCTGGAATGATTCTAAAA
TCTGGAGATAACCAGGCCATCCTTACCAAAAGAAGAAGGTTTTTTGCATTACATAATCTTAAATTGTCCAAAGAAGAATCTGATTCTGAACTTGACTACGAAATACTATC
TGGGGGTTCAGCTGAGCCGTGGTATGACAGAATTCAAGGGTTGCGTATCGAATGTTCTTCCATAGATGCTGGAGCTCGTCCAAAGGACATTGACATTGTCAAGCAAGATC
AAGCGCGCCTAGATATCCAGGAACAGCCCGAGGCTGAAGAAAACCTGGTGGCTGTTGATGGAGAAGTTGACCGAGTGACAGATTCCTGCAGGGACGACCATCTTGGCCTT
GAGGATCTAGAACGGATCATGTCTGAGAGAGGGAATATACGCGACGGCTTGAGGTTGATGCCTGATTTCCAAAGAAGAGAAATGGCTGCAAAATTGGCAATGAAAATGGC
GGCCATGTTTGGAGGCTGTAGTGATGATGAAGAGGAAACTAGATGCAAGGAATTTGGTATGCAATTAAATTAA
mRNA sequence
ATGCCTTATTTCATCTTGAAGCAAATGTATGGTTTCGGGGAGAGCGGGTGCATGAATGTTAGAGAGACTAATTTAGTGGTGGCTGTAATGGTGGGGGCTAAAGGCTATGA
TAGTGGCAGACAACAGTTATATGATGCCAAAAATTATGGCATCAGTGGTGGTGGAGATAAATCTACGATTCCTCAACCATCAAATCAAGATTCTGCAAATTCACTTCCGT
GGGATCATGACCAAGGGCCTACTTCTTTTGTCAAGACTGGGCCTGAGGAGTTGCTCTCAAGGGAGTTGGGTCTAGATGCTGCTACTTTCAATACTCACCTGGTAAAGGTT
CAATCCTCCATGGATGCCAAACCTGAGGCATTTGGGATGCCTGAAACTGAAACTGGTTCGCATGATGATGGGCATTCAAACATGAATCACTCGTTAGATGAAAATCTTAT
CGATGAAGGTAGTGATTCAGGAACAGAAGAAGAGCAAGCTGCTTTCATGAAGGAGCTTCATAGCTTCTTTAGAGAAAGAGGCCTGGAATTCAAACCTCCCAAGTTTTATG
GGAAGGGACTGAATTGCCTCAAGTTATGGAGGGTTGTGACTAGACTGGGAGGCTATGACAAGAATGGGGGAAACACGGTTCTTGTACGCAATTCAATGAGGTGGGCTATT
TTCGCAAGTTTTTGTGTTTTGTCTCCCGAGCTATCTTCTTATATCGGATTGGGCCGCTCATCTGATCTGCTGAAATTTGATGTATTATTATGCATAGAAAATGAAGTCGA
TCTTGTTCCAGGAAGTAGTTTATTGGGAGACGAGGAACTCTCATGGGAAACTAGGAGGTCATGTTTGGATTGGTGCATTGAACGCAATATTGAGTTCATTGAGGCTTGTG
CATCTAATGCAGATTTTGATAAATGTTTATCGATTGATGATGATGTACAAGGAGTTCAAAGGCTTTATAGCGCTCTTTCTGCTCGTATGTGGCCTGGAATGATTCTAAAA
TCTGGAGATAACCAGGCCATCCTTACCAAAAGAAGAAGGTTTTTTGCATTACATAATCTTAAATTGTCCAAAGAAGAATCTGATTCTGAACTTGACTACGAAATACTATC
TGGGGGTTCAGCTGAGCCGTGGTATGACAGAATTCAAGGGTTGCGTATCGAATGTTCTTCCATAGATGCTGGAGCTCGTCCAAAGGACATTGACATTGTCAAGCAAGATC
AAGCGCGCCTAGATATCCAGGAACAGCCCGAGGCTGAAGAAAACCTGGTGGCTGTTGATGGAGAAGTTGACCGAGTGACAGATTCCTGCAGGGACGACCATCTTGGCCTT
GAGGATCTAGAACGGATCATGTCTGAGAGAGGGAATATACGCGACGGCTTGAGGTTGATGCCTGATTTCCAAAGAAGAGAAATGGCTGCAAAATTGGCAATGAAAATGGC
GGCCATGTTTGGAGGCTGTAGTGATGATGAAGAGGAAACTAGATGCAAGGAATTTGGTATGCAATTAAATTAA
Protein sequence
MPYFILKQMYGFGESGCMNVRETNLVVAVMVGAKGYDSGRQQLYDAKNYGISGGGDKSTIPQPSNQDSANSLPWDHDQGPTSFVKTGPEELLSRELGLDAATFNTHLVKV
QSSMDAKPEAFGMPETETGSHDDGHSNMNHSLDENLIDEGSDSGTEEEQAAFMKELHSFFRERGLEFKPPKFYGKGLNCLKLWRVVTRLGGYDKNGGNTVLVRNSMRWAI
FASFCVLSPELSSYIGLGRSSDLLKFDVLLCIENEVDLVPGSSLLGDEELSWETRRSCLDWCIERNIEFIEACASNADFDKCLSIDDDVQGVQRLYSALSARMWPGMILK
SGDNQAILTKRRRFFALHNLKLSKEESDSELDYEILSGGSAEPWYDRIQGLRIECSSIDAGARPKDIDIVKQDQARLDIQEQPEAEENLVAVDGEVDRVTDSCRDDHLGL
EDLERIMSERGNIRDGLRLMPDFQRREMAAKLAMKMAAMFGGCSDDEEETRCKEFGMQLN