; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0010969 (gene) of Chayote v1 genome

Gene IDSed0010969
OrganismSechium edule (Chayote v1)
DescriptionUnknown protein
Genome locationLG12:5824086..5828654
RNA-Seq ExpressionSed0010969
SyntenySed0010969
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138115.1 uncharacterized protein LOC111009363 isoform X1 [Momordica charantia]1.8e-9974.22Show/hide
Query:  MGTALFELEHVLRSKQNSLTIEEAASLQTCKSKALRDFTQGCLVGCGVAWAGTWKLNNFIRIILSSGFASALGVWRLNRSLNASVGHILQLDGSRMQKEL
        MG ALFELE VLRSKQNSLTIEEA  LQTCKSKA+RDFT G L G GV WAGTW+LN FIR+ LS G A+  G+WR +RSLN+ V HIL LDGSRMQKEL
Subjt:  MGTALFELEHVLRSKQNSLTIEEAASLQTCKSKALRDFTQGCLVGCGVAWAGTWKLNNFIRIILSSGFASALGVWRLNRSLNASVGHILQLDGSRMQKEL

Query:  ANIVVTKYHKDPITMRHISKHFYYERVFDDSTMDQPKIRWRYRNFFSDEVADAQRTHEKDPKNNLHGNSPHDSSNRDSNSYQSESYGDPDDKENAIEFKP
        ANIVVTKYH DP TM+HISKHFYYE+VFDDST+D+P+IRWRYRNFFSD+VA  QRTH+ D KNNLHGNS H SSN DSNS Q+ SY +PDDK NA+EFKP
Subjt:  ANIVVTKYHKDPITMRHISKHFYYERVFDDSTMDQPKIRWRYRNFFSDEVADAQRTHEKDPKNNLHGNSPHDSSNRDSNSYQSESYGDPDDKENAIEFKP

Query:  VLNRLDTETIADPLDCIFGSLAREEEIQHSSSSSTSTKSQSRSRRYRRRHRRHNQT
        VL +  T+  ADPLDC+FG LA+ EEIQHS+SS+T+ KS SRSRRY RRHRRHNQT
Subjt:  VLNRLDTETIADPLDCIFGSLAREEEIQHSSSSSTSTKSQSRSRRYRRRHRRHNQT

XP_022138116.1 uncharacterized protein LOC111009363 isoform X2 [Momordica charantia]1.1e-9973.11Show/hide
Query:  MGTALFELEHVLRSKQNSLTIEEAASLQTCKSKALRDFTQGCLVGCGVAWAGTWKLNNFIRIILSSGFASALGVWRLNRSLNASVGHILQLDGSRMQKEL
        MG ALFELE VLRSKQNSLTIEEA  LQTCKSKA+RDFT G L G GV WAGTW+LN FIR+ LS G A+  G+WR +RSLN+ V HIL LDGSRMQKEL
Subjt:  MGTALFELEHVLRSKQNSLTIEEAASLQTCKSKALRDFTQGCLVGCGVAWAGTWKLNNFIRIILSSGFASALGVWRLNRSLNASVGHILQLDGSRMQKEL

Query:  ANIVVTKYHKDPITMRHISKHFYYERVFDDSTMDQPKIRWRYRNFFSDEVADAQRTHEKDPKNNLHGNSPHDSSNRDSNSYQSESYGDPDDKENAIEFKP
        ANIVVTKYH DP TM+HISKHFYYE+VFDDST+D+P+IRWRYRNFFSD+VA  QRTH+ D KNNLHGNS H SSN DSNS Q+ SY +PDDK NA+EFKP
Subjt:  ANIVVTKYHKDPITMRHISKHFYYERVFDDSTMDQPKIRWRYRNFFSDEVADAQRTHEKDPKNNLHGNSPHDSSNRDSNSYQSESYGDPDDKENAIEFKP

Query:  VLNRLDTETIADPLDCIFGSLAREEEIQHSSSSSTSTKSQSRSRRYRRRHRRHNQT-IPTSFEH
        VL +  T+  ADPLDC+FG LA+ EEIQHS+SS+T+ KS SRSRRY RRHRRHNQT I  S  H
Subjt:  VLNRLDTETIADPLDCIFGSLAREEEIQHSSSSSTSTKSQSRSRRYRRRHRRHNQT-IPTSFEH

XP_022956077.1 uncharacterized protein LOC111457878 [Cucurbita moschata]1.2e-9873.58Show/hide
Query:  MGTALFELEHVLRSKQNSLTIEEAASLQTCKSKALRDFTQGCLVGCGVAWAGTWKLNNFIRIILSSGFASALGVWRLNRSLNASVGHILQLDGSRMQKEL
        MG ALFELE VLRSKQNSLTIEEA  LQTCKSKA+RDFT G LVG GV WAGTW+LN F+R+ LS G  +  G+ R +RSL++ V HIL LDGSRMQKEL
Subjt:  MGTALFELEHVLRSKQNSLTIEEAASLQTCKSKALRDFTQGCLVGCGVAWAGTWKLNNFIRIILSSGFASALGVWRLNRSLNASVGHILQLDGSRMQKEL

Query:  ANIVVTKYHKDPITMRHISKHFYYERVFDDSTMDQPKIRWRYRNFFSDEVADAQRTHEKDPKNNLHGNSPHDSSNRDSNSYQSESYGDPDDKENAIEFKP
        ANIVVTKYH DP TM+HISKHF+YE VFDDST+D+PKIRWRYRNFFSD+VA AQRTH  DPK+NLHGN  HDSSNRDSN  QS+SYGDPDDK NA EF P
Subjt:  ANIVVTKYHKDPITMRHISKHFYYERVFDDSTMDQPKIRWRYRNFFSDEVADAQRTHEKDPKNNLHGNSPHDSSNRDSNSYQSESYGDPDDKENAIEFKP

Query:  VLNRLDTE-TIADPLDCIFGSLAREEEIQHSSSSSTSTKSQSRSRRYRRRHRRHNQTIPTSFEHV
        VL +   +   ADPLD IFG+L REEEIQHSS+SS S KS  RS+RY RRHRRHNQT+PT FEHV
Subjt:  VLNRLDTE-TIADPLDCIFGSLAREEEIQHSSSSSTSTKSQSRSRRYRRRHRRHNQTIPTSFEHV

XP_023527180.1 uncharacterized protein LOC111790494 isoform X1 [Cucurbita pepo subsp. pepo]2.0e-9873.21Show/hide
Query:  MGTALFELEHVLRSKQNSLTIEEAASLQTCKSKALRDFTQGCLVGCGVAWAGTWKLNNFIRIILSSGFASALGVWRLNRSLNASVGHILQLDGSRMQKEL
        MG ALFELE VLRSKQNSLTIEEA  LQTCKSKA+RDFT G LVG GV WAGTW+LN F+R+ LS G  +  G+ R +RSL++ V HIL LDGSRMQKEL
Subjt:  MGTALFELEHVLRSKQNSLTIEEAASLQTCKSKALRDFTQGCLVGCGVAWAGTWKLNNFIRIILSSGFASALGVWRLNRSLNASVGHILQLDGSRMQKEL

Query:  ANIVVTKYHKDPITMRHISKHFYYERVFDDSTMDQPKIRWRYRNFFSDEVADAQRTHEKDPKNNLHGNSPHDSSNRDSNSYQSESYGDPDDKENAIEFKP
        ANIVVTKYH DP TM+HISKHF+YE VFDDST+D+PKIRWRYRNFFSD+VA AQRTH  DPK+NLHGN  HDSSNRDSN  QS+SYGDPDDK NA EF P
Subjt:  ANIVVTKYHKDPITMRHISKHFYYERVFDDSTMDQPKIRWRYRNFFSDEVADAQRTHEKDPKNNLHGNSPHDSSNRDSNSYQSESYGDPDDKENAIEFKP

Query:  VLNRLDTE-TIADPLDCIFGSLAREEEIQHSSSSSTSTKSQSRSRRYRRRHRRHNQTIPTSFEHV
        VL +   +   ADPLD IFG++ REEEIQHSS+SS S KS  RS+RY RRHRRHNQT+PT FEHV
Subjt:  VLNRLDTE-TIADPLDCIFGSLAREEEIQHSSSSSTSTKSQSRSRRYRRRHRRHNQTIPTSFEHV

XP_038878005.1 uncharacterized protein LOC120070209 isoform X1 [Benincasa hispida]1.3e-9772.28Show/hide
Query:  MGTALFELEHVLRSKQNSLTIEEAASLQTCKSKALRDFTQGCLVGCGVAWAGTWKLNNFIRIILSSGFASALGVWRLNRSLNASVGHILQLDGSRMQKEL
        MG ALF LE VLRSKQNSLTIEEA  LQTC+SKA+RDFT G L+G GV WAGTW+LN FIR+ LS G A+  G+WR + SL + V HIL L GSRMQKEL
Subjt:  MGTALFELEHVLRSKQNSLTIEEAASLQTCKSKALRDFTQGCLVGCGVAWAGTWKLNNFIRIILSSGFASALGVWRLNRSLNASVGHILQLDGSRMQKEL

Query:  ANIVVTKYHKDPITMRHISKHFYYERVFDDSTMDQPKIRWRYRNFFSDEVADAQRTHEKDPKNNLHGNSPHDSSNRDSNSYQSESYGDPDDKENAIEFKP
        ANIVVT+YH DP  M+ ISKHFYYE VFDDST+D+PKIRWR RNFFSD+VA AQRT + DPK+NLHGNS HDSSNRDS++YQS+SYGDPDDK NA+E KP
Subjt:  ANIVVTKYHKDPITMRHISKHFYYERVFDDSTMDQPKIRWRYRNFFSDEVADAQRTHEKDPKNNLHGNSPHDSSNRDSNSYQSESYGDPDDKENAIEFKP

Query:  VLNRLDTE-TIADPLDCIFGSLAR--EEEIQHSSSSSTSTKSQSRSRRYRRRHRRHNQTIPTSFEHV
        VL +  T+ T  DPLDCIFG+LAR  EEEIQHSS+SS S KS SRSRRY RRHR+ NQT+PT+FEHV
Subjt:  VLNRLDTE-TIADPLDCIFGSLAR--EEEIQHSSSSSTSTKSQSRSRRYRRRHRRHNQTIPTSFEHV

TrEMBL top hitse value%identityAlignment
A0A1S3AWL2 uncharacterized protein LOC103483703 isoform X16.5e-8765.28Show/hide
Query:  MGTALFELEHVLRSKQNSLTIEEAASLQTCKSKALRDFTQGCLVGCGVAWAGTWKLNNFIRIILSSGFASALGVWRLNRSLNASVGHILQLDGSRMQKEL
        MG  L ELE+VLRSK N LTIEEA  LQTC+SKA+RDFT G ++G G+ WAGTW+LN F R+ LS G A+  G WR +RSLN+ V +IL LDGSRMQKEL
Subjt:  MGTALFELEHVLRSKQNSLTIEEAASLQTCKSKALRDFTQGCLVGCGVAWAGTWKLNNFIRIILSSGFASALGVWRLNRSLNASVGHILQLDGSRMQKEL

Query:  ANIVVTKYHKDPITMRHISKHFYYERVFDDSTMDQPKIRWRYRNFFSDEVADAQRTHEKDPKNNLHGNSPHDSSNRDSNSYQSESYGDPDDKENAIEFKP
        ANIVVT+YH DP  M++ISKHF+YE VFDDST D+PKIRWRYRNFFSD+VA +QRTH  D       N+ H++S+RDS+++Q +SYGD DDK NA EFKP
Subjt:  ANIVVTKYHKDPITMRHISKHFYYERVFDDSTMDQPKIRWRYRNFFSDEVADAQRTHEKDPKNNLHGNSPHDSSNRDSNSYQSESYGDPDDKENAIEFKP

Query:  VLNRLDTET-IADPLDCIFGSLAREEEIQHSSSSSTSTKSQSRSRRYRRRHRRHNQTIPTSFEHV
        VL +  T++  ADPLDCIFG+LAREEEIQHS+ S+ S K  SRSRRY RRHR+ NQT PT+FE+V
Subjt:  VLNRLDTET-IADPLDCIFGSLAREEEIQHSSSSSTSTKSQSRSRRYRRRHRRHNQTIPTSFEHV

A0A6J1C8I6 uncharacterized protein LOC111009363 isoform X18.7e-10074.22Show/hide
Query:  MGTALFELEHVLRSKQNSLTIEEAASLQTCKSKALRDFTQGCLVGCGVAWAGTWKLNNFIRIILSSGFASALGVWRLNRSLNASVGHILQLDGSRMQKEL
        MG ALFELE VLRSKQNSLTIEEA  LQTCKSKA+RDFT G L G GV WAGTW+LN FIR+ LS G A+  G+WR +RSLN+ V HIL LDGSRMQKEL
Subjt:  MGTALFELEHVLRSKQNSLTIEEAASLQTCKSKALRDFTQGCLVGCGVAWAGTWKLNNFIRIILSSGFASALGVWRLNRSLNASVGHILQLDGSRMQKEL

Query:  ANIVVTKYHKDPITMRHISKHFYYERVFDDSTMDQPKIRWRYRNFFSDEVADAQRTHEKDPKNNLHGNSPHDSSNRDSNSYQSESYGDPDDKENAIEFKP
        ANIVVTKYH DP TM+HISKHFYYE+VFDDST+D+P+IRWRYRNFFSD+VA  QRTH+ D KNNLHGNS H SSN DSNS Q+ SY +PDDK NA+EFKP
Subjt:  ANIVVTKYHKDPITMRHISKHFYYERVFDDSTMDQPKIRWRYRNFFSDEVADAQRTHEKDPKNNLHGNSPHDSSNRDSNSYQSESYGDPDDKENAIEFKP

Query:  VLNRLDTETIADPLDCIFGSLAREEEIQHSSSSSTSTKSQSRSRRYRRRHRRHNQT
        VL +  T+  ADPLDC+FG LA+ EEIQHS+SS+T+ KS SRSRRY RRHRRHNQT
Subjt:  VLNRLDTETIADPLDCIFGSLAREEEIQHSSSSSTSTKSQSRSRRYRRRHRRHNQT

A0A6J1C8T0 uncharacterized protein LOC111009363 isoform X25.1e-10073.11Show/hide
Query:  MGTALFELEHVLRSKQNSLTIEEAASLQTCKSKALRDFTQGCLVGCGVAWAGTWKLNNFIRIILSSGFASALGVWRLNRSLNASVGHILQLDGSRMQKEL
        MG ALFELE VLRSKQNSLTIEEA  LQTCKSKA+RDFT G L G GV WAGTW+LN FIR+ LS G A+  G+WR +RSLN+ V HIL LDGSRMQKEL
Subjt:  MGTALFELEHVLRSKQNSLTIEEAASLQTCKSKALRDFTQGCLVGCGVAWAGTWKLNNFIRIILSSGFASALGVWRLNRSLNASVGHILQLDGSRMQKEL

Query:  ANIVVTKYHKDPITMRHISKHFYYERVFDDSTMDQPKIRWRYRNFFSDEVADAQRTHEKDPKNNLHGNSPHDSSNRDSNSYQSESYGDPDDKENAIEFKP
        ANIVVTKYH DP TM+HISKHFYYE+VFDDST+D+P+IRWRYRNFFSD+VA  QRTH+ D KNNLHGNS H SSN DSNS Q+ SY +PDDK NA+EFKP
Subjt:  ANIVVTKYHKDPITMRHISKHFYYERVFDDSTMDQPKIRWRYRNFFSDEVADAQRTHEKDPKNNLHGNSPHDSSNRDSNSYQSESYGDPDDKENAIEFKP

Query:  VLNRLDTETIADPLDCIFGSLAREEEIQHSSSSSTSTKSQSRSRRYRRRHRRHNQT-IPTSFEH
        VL +  T+  ADPLDC+FG LA+ EEIQHS+SS+T+ KS SRSRRY RRHRRHNQT I  S  H
Subjt:  VLNRLDTETIADPLDCIFGSLAREEEIQHSSSSSTSTKSQSRSRRYRRRHRRHNQT-IPTSFEH

A0A6J1GVC2 uncharacterized protein LOC1114578785.7e-9973.58Show/hide
Query:  MGTALFELEHVLRSKQNSLTIEEAASLQTCKSKALRDFTQGCLVGCGVAWAGTWKLNNFIRIILSSGFASALGVWRLNRSLNASVGHILQLDGSRMQKEL
        MG ALFELE VLRSKQNSLTIEEA  LQTCKSKA+RDFT G LVG GV WAGTW+LN F+R+ LS G  +  G+ R +RSL++ V HIL LDGSRMQKEL
Subjt:  MGTALFELEHVLRSKQNSLTIEEAASLQTCKSKALRDFTQGCLVGCGVAWAGTWKLNNFIRIILSSGFASALGVWRLNRSLNASVGHILQLDGSRMQKEL

Query:  ANIVVTKYHKDPITMRHISKHFYYERVFDDSTMDQPKIRWRYRNFFSDEVADAQRTHEKDPKNNLHGNSPHDSSNRDSNSYQSESYGDPDDKENAIEFKP
        ANIVVTKYH DP TM+HISKHF+YE VFDDST+D+PKIRWRYRNFFSD+VA AQRTH  DPK+NLHGN  HDSSNRDSN  QS+SYGDPDDK NA EF P
Subjt:  ANIVVTKYHKDPITMRHISKHFYYERVFDDSTMDQPKIRWRYRNFFSDEVADAQRTHEKDPKNNLHGNSPHDSSNRDSNSYQSESYGDPDDKENAIEFKP

Query:  VLNRLDTE-TIADPLDCIFGSLAREEEIQHSSSSSTSTKSQSRSRRYRRRHRRHNQTIPTSFEHV
        VL +   +   ADPLD IFG+L REEEIQHSS+SS S KS  RS+RY RRHRRHNQT+PT FEHV
Subjt:  VLNRLDTE-TIADPLDCIFGSLAREEEIQHSSSSSTSTKSQSRSRRYRRRHRRHNQTIPTSFEHV

A0A6J1IXZ4 uncharacterized protein LOC1114795421.5e-9672.08Show/hide
Query:  MGTALFELEHVLRSKQNSLTIEEAASLQTCKSKALRDFTQGCLVGCGVAWAGTWKLNNFIRIILSSGFASALGVWRLNRSLNASVGHILQLDGSRMQKEL
        MG ALFELE VLRSKQNSLTIEEA  LQTCKSKA+RDFT G LVG GV WAGTW+LN F+R+ LS G  +  G+ R +RSL++ V HIL LDGSRMQKEL
Subjt:  MGTALFELEHVLRSKQNSLTIEEAASLQTCKSKALRDFTQGCLVGCGVAWAGTWKLNNFIRIILSSGFASALGVWRLNRSLNASVGHILQLDGSRMQKEL

Query:  ANIVVTKYHKDPITMRHISKHFYYERVFDDSTMDQPKIRWRYRNFFSDEVADAQRTHEKDPKNNLHGNSPHDSSNRDSNSYQSESYGDPDDKENAIEFKP
        ANI+VTK H DP TM+HISKHF+YE VFDDST+D+PKIRWRYRNFFSD+VA AQR H  DPK+NLHGN  HDSSNRDSN  QS+SYG+PDDK NA EF P
Subjt:  ANIVVTKYHKDPITMRHISKHFYYERVFDDSTMDQPKIRWRYRNFFSDEVADAQRTHEKDPKNNLHGNSPHDSSNRDSNSYQSESYGDPDDKENAIEFKP

Query:  VLNRLDTE-TIADPLDCIFGSLAREEEIQHSSSSSTSTKSQSRSRRYRRRHRRHNQTIPTSFEHV
        VL +   +   ADPLD IFG+L REEEIQHSS+SS S KS  RS+RY RRHRRHNQT+PT FEHV
Subjt:  VLNRLDTE-TIADPLDCIFGSLAREEEIQHSSSSSTSTKSQSRSRRYRRRHRRHNQTIPTSFEHV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05430.1 unknown protein2.5e-1430.97Show/hide
Query:  ALFELEHVLRSK--QNSLTIEEAASLQTCKSKALRDFTQGCLVGCGVAWAGTWKL---NNFIRIILSSGFASALGV----WRLNRSLNASVGHILQLDGS
        AL +L  VL SK  Q  +T EE+ ++ +C  KAL        VG G+ W  T KL       R+ L++G A++  V    W  ++   +S+ HIL  D +
Subjt:  ALFELEHVLRSK--QNSLTIEEAASLQTCKSKALRDFTQGCLVGCGVAWAGTWKL---NNFIRIILSSGFASALGV----WRLNRSLNASVGHILQLDGS

Query:  RMQKELANIVVTKYHKDPITMRHISKHFYYERVFDDSTMDQPKIRWRYRNFFSDEVADAQRTHEKDPKNNLHGNSPHDSSNRDSNSYQSESYGDPDDKEN
        RMQKEL N++V     +    + +SKHFY E V+ D   D+P++RWR R  F++  +     +    + N +G  P+ S  R S        G  D  + 
Subjt:  RMQKELANIVVTKYHKDPITMRHISKHFYYERVFDDSTMDQPKIRWRYRNFFSDEVADAQRTHEKDPKNNLHGNSPHDSSNRDSNSYQSESYGDPDDKEN

Query:  AIEFKPVLNRLDTETI-ADPLDCIFGSLAREEEIQHSSSSSTSTKSQSR-SRRYRRRHRRHNQTIPTS
            +      D E    D LD +FG     E I     S  ++K+Q+R  +R +RR R  N+   T+
Subjt:  AIEFKPVLNRLDTETI-ADPLDCIFGSLAREEEIQHSSSSSTSTKSQSR-SRRYRRRHRRHNQTIPTS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCACAGCTTTATTCGAGCTTGAACACGTTCTCAGGTCCAAACAGAACAGTTTGACGATCGAGGAAGCGGCTTCGCTTCAAACATGTAAGTCTAAGGCTCTACGAGA
TTTTACGCAGGGATGTCTCGTTGGATGTGGTGTGGCATGGGCAGGAACATGGAAGCTGAATAACTTTATTCGGATAATTCTTTCCTCAGGATTTGCTTCGGCACTTGGAG
TATGGAGATTAAACAGGTCTCTAAATGCAAGCGTGGGTCATATTCTTCAACTGGATGGAAGTAGAATGCAAAAGGAGCTGGCAAATATTGTGGTGACAAAATATCACAAA
GATCCTATCACGATGCGACACATATCCAAGCATTTTTATTACGAGAGAGTTTTTGATGATTCAACCATGGACCAGCCAAAAATTAGGTGGCGATATCGAAATTTCTTTAG
TGATGAGGTTGCCGATGCTCAGAGGACACATGAAAAGGACCCTAAGAACAACTTGCATGGAAACTCCCCCCATGACTCATCCAACCGCGATTCTAATTCCTACCAGAGTG
AATCCTATGGTGACCCTGATGACAAAGAAAATGCAATTGAATTCAAGCCAGTCCTTAATAGGCTAGACACTGAGACTATCGCTGACCCTCTCGATTGTATCTTCGGTTCA
CTAGCAAGAGAAGAAGAAATCCAACACTCGAGTTCCTCTAGCACATCAACCAAATCTCAGTCTCGTAGCAGAAGATACCGTCGTCGGCATCGAAGACATAATCAAACAAT
ACCAACAAGCTTTGAACACGTCTAA
mRNA sequenceShow/hide mRNA sequence
GAGGGATTTGCTTCTTTACATCTCAGCCTCAAAGCTCTCAGTTGCTCCGTTGGATCGTCGTGTCTCGTGATCGGAGGTCCGCCATGGGCACAGCTTTATTCGAGCTTGAA
CACGTTCTCAGGTCCAAACAGAACAGTTTGACGATCGAGGAAGCGGCTTCGCTTCAAACATGTAAGTCTAAGGCTCTACGAGATTTTACGCAGGGATGTCTCGTTGGATG
TGGTGTGGCATGGGCAGGAACATGGAAGCTGAATAACTTTATTCGGATAATTCTTTCCTCAGGATTTGCTTCGGCACTTGGAGTATGGAGATTAAACAGGTCTCTAAATG
CAAGCGTGGGTCATATTCTTCAACTGGATGGAAGTAGAATGCAAAAGGAGCTGGCAAATATTGTGGTGACAAAATATCACAAAGATCCTATCACGATGCGACACATATCC
AAGCATTTTTATTACGAGAGAGTTTTTGATGATTCAACCATGGACCAGCCAAAAATTAGGTGGCGATATCGAAATTTCTTTAGTGATGAGGTTGCCGATGCTCAGAGGAC
ACATGAAAAGGACCCTAAGAACAACTTGCATGGAAACTCCCCCCATGACTCATCCAACCGCGATTCTAATTCCTACCAGAGTGAATCCTATGGTGACCCTGATGACAAAG
AAAATGCAATTGAATTCAAGCCAGTCCTTAATAGGCTAGACACTGAGACTATCGCTGACCCTCTCGATTGTATCTTCGGTTCACTAGCAAGAGAAGAAGAAATCCAACAC
TCGAGTTCCTCTAGCACATCAACCAAATCTCAGTCTCGTAGCAGAAGATACCGTCGTCGGCATCGAAGACATAATCAAACAATACCAACAAGCTTTGAACACGTCTAATA
CCAGGATATACAAAGCATTGTGCATGGTCCAACAGAGCACTTACCTGTGGATAGCTATATTCCAGAGAACGCTGTTTAGAAGGTTGGAAACTTCAATCTCTTTTCCAGTT
ACTCAATTTTCTTACATCTCTGGAAGATTTGTGGGTGAAAATCTTTGTTTTTTCAGACCGTGTGGAAAACCTTACACACAAACAAGGCGATTATAAATACGCGTCAAGAC
TGGTAAATGCCTTTTTTATGAGAGACTCTTTGACTTTTAATCCCCTAATAACTTGTGATAAACTTCATGCTTTGTCTTTTTTACATTAATATGATATTCCCCCCTTCTTT
TTTATGGCTGTGATTGAAAGATGGAAATTGGAGTTGTCATAATTTGTCTGACTCTTAA
Protein sequenceShow/hide protein sequence
MGTALFELEHVLRSKQNSLTIEEAASLQTCKSKALRDFTQGCLVGCGVAWAGTWKLNNFIRIILSSGFASALGVWRLNRSLNASVGHILQLDGSRMQKELANIVVTKYHK
DPITMRHISKHFYYERVFDDSTMDQPKIRWRYRNFFSDEVADAQRTHEKDPKNNLHGNSPHDSSNRDSNSYQSESYGDPDDKENAIEFKPVLNRLDTETIADPLDCIFGS
LAREEEIQHSSSSSTSTKSQSRSRRYRRRHRRHNQTIPTSFEHV