CuGenDBv2

Tan0007653 (gene) of Snake gourd v1 genome

Gene ID: Tan0007653
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Plant protein of unknown function (DUF863)
Genome location: LG03:72526367..72528829
RNA-Seq Expression: Tan0007653
Synteny: Tan0007653
Gene Ontology terms: NA
InterPro domains: NA
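The genome location field packs the linkage group and the start and end coordinates into one string. A minimal Python sketch of splitting it, assuming the `seqid:start..end` layout seen here and assuming the coordinates are 1-based and inclusive (the page does not say). `Location` and `parse_location` are hypothetical helpers, not part of CuGenDB:

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("LG03:72526367..72528829")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 2463 bp on LG03.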


Homology
GenBank top hits    e value    % identity    Alignment
KAG7034303.1 hypothetical protein SDJN02_04030 [Cucurbita argyrosperma subsp. argyrosperma]    4.1e-91    88.18
Query:  KKGAEIPMEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRSSTHTAKLDLEQPA
        +KGAEIPMEKLPKA+EKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSR SGWNP+SWDKRNEICFRQIYEQDAKNY RS+T T KLDLEQPA
Subjt:  KKGAEIPMEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRSSTHTAKLDLEQPA

Query:  EGEIEANNGALQIINENELELTLGPSSYNTSDSGI-THSSSSTGSSHEGRRTDTKKVKGQEMEVLGVTENSSGYQNGSNRGEKKM--LDYPPWLFQVVSL
        E E EANNGALQIINENELELTLGPSSYNTSDSGI THSSSSTGSSHEGRR D K+V+GQEM VLGV EN SG+QNGS+RGEKKM  +DYPPWLFQ VSL
Subjt:  EGEIEANNGALQIINENELELTLGPSSYNTSDSGI-THSSSSTGSSHEGRRTDTKKVKGQEMEVLGVTENSSGYQNGSNRGEKKM--LDYPPWLFQVVSL

Query:  NMT
        NMT
Subjt:  NMT

XP_022949937.1 uncharacterized protein LOC111453182 [Cucurbita moschata]    2.8e-87    88.27
Query:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRSSTHTAKLDLEQPAEGEIEAN
        MEKLPKA+EKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSR SGWNP+SWDKRNEICFRQIYEQDAKNY RS+T T KLDLEQPAE E EAN
Subjt:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRSSTHTAKLDLEQPAEGEIEAN

Query:  NGALQIINENELELTLGPSSYNTSDSGI-THSSSSTGSSHEGRRTDTKKVKGQEMEVLGVTENSSGYQNGSNRGEKKM--LDYPPWLFQVVSLNMT
        NGALQIINENELELTLGPSSYNTSDSGI THSSSSTGSSHEGRR D K+V+GQEM VLGV EN SG+QNGS+RGEKKM  +DYPPWLFQ VSLNMT
Subjt:  NGALQIINENELELTLGPSSYNTSDSGI-THSSSSTGSSHEGRRTDTKKVKGQEMEVLGVTENSSGYQNGSNRGEKKM--LDYPPWLFQVVSLNMT

XP_023543361.1 uncharacterized protein LOC111803264 [Cucurbita pepo subsp. pepo]    1.4e-86    87.76
Query:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRSSTHTAKLDLEQPAEGEIEAN
        MEKLPKA+EKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMK VEKSR SGWNP+SWDKRNEICFRQIYEQDAKNY RS+T T KLDLEQPAE E EAN
Subjt:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRSSTHTAKLDLEQPAEGEIEAN

Query:  NGALQIINENELELTLGPSSYNTSDSGI-THSSSSTGSSHEGRRTDTKKVKGQEMEVLGVTENSSGYQNGSNRGEKKM--LDYPPWLFQVVSLNMT
        NGALQIINENELELTLGPSSYNTSDSGI THSSSSTGSSHEGRR D K+V+GQEM VLGV EN SG+QNGS+RGEKKM  +DYPPWLFQ VSLNMT
Subjt:  NGALQIINENELELTLGPSSYNTSDSGI-THSSSSTGSSHEGRRTDTKKVKGQEMEVLGVTENSSGYQNGSNRGEKKM--LDYPPWLFQVVSLNMT

XP_038883305.1 uncharacterized protein LOC120074293 isoform X1 [Benincasa hispida]    5.5e-96    86.61
Query:  FGLMIFKPILGTQDSRLYQLK--KGAEIPMEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYE
        F L+I  P     D+ L+Q+K  KGAEIPMEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSR+SGWNPESWDKRNEICFRQIYE
Subjt:  FGLMIFKPILGTQDSRLYQLK--KGAEIPMEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYE

Query:  QDAKNYYRSSTHTAKLDLEQPAEGEIEANNGALQIINENELELTLGPSSYNTSDSGI-THSSSSTGSSHEGRR-TDTKKVKGQEMEVLGVTENSSGYQNG
        QDAKNYYR STH  KLD+EQPAEGE EA NGALQIINENELELTLGPSSYNTSDSG+ THSSSSTGSSHEGRR TD+++VKGQEM VLGVTENSSGY+NG
Subjt:  QDAKNYYRSSTHTAKLDLEQPAEGEIEANNGALQIINENELELTLGPSSYNTSDSGI-THSSSSTGSSHEGRR-TDTKKVKGQEMEVLGVTENSSGYQNG

Query:  SNRGEKKMLDYPPWLFQVVSLNMT
        SNRGEKKMLDYPPWLFQVVSLNMT
Subjt:  SNRGEKKMLDYPPWLFQVVSLNMT

XP_038883306.1 uncharacterized protein LOC120074293 isoform X2 [Benincasa hispida]    7.0e-91    92.31
Query:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRSSTHTAKLDLEQPAEGEIEAN
        MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSR+SGWNPESWDKRNEICFRQIYEQDAKNYYR STH  KLD+EQPAEGE EA 
Subjt:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRSSTHTAKLDLEQPAEGEIEAN

Query:  NGALQIINENELELTLGPSSYNTSDSGI-THSSSSTGSSHEGRR-TDTKKVKGQEMEVLGVTENSSGYQNGSNRGEKKMLDYPPWLFQVVSLNMT
        NGALQIINENELELTLGPSSYNTSDSG+ THSSSSTGSSHEGRR TD+++VKGQEM VLGVTENSSGY+NGSNRGEKKMLDYPPWLFQVVSLNMT
Subjt:  NGALQIINENELELTLGPSSYNTSDSGI-THSSSSTGSSHEGRR-TDTKKVKGQEMEVLGVTENSSGYQNGSNRGEKKMLDYPPWLFQVVSLNMT

TrEMBL top hits    e value    % identity    Alignment
A0A1S3B1P5 uncharacterized protein LOC103484836    3.4e-83    88.21
Query:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRSSTHTAKLDLEQPAEGEIEAN
        MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSR++    ESWDKRNEICFRQIYEQDAKNYYR STHT KLD+EQPAE E E N
Subjt:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRSSTHTAKLDLEQPAEGEIEAN

Query:  NGALQIINENELELTLGPSSYNTSDSG-ITHSSSSTGSSHEGRR-TDTKKVKGQEMEVLGVTENSSGYQNGSNRGEKKMLDYPPWLFQVVSLNMT
        NGA QIINE ELELTLGPSSYNTSDSG  T+SSSSTGSSHEGRR TDTK+VKGQEM  LGVTENSSG QNG+NRGEKKMLDYPPWLFQVVSLNMT
Subjt:  NGALQIINENELELTLGPSSYNTSDSG-ITHSSSSTGSSHEGRR-TDTKKVKGQEMEVLGVTENSSGYQNGSNRGEKKMLDYPPWLFQVVSLNMT

A0A6J1GEC8 uncharacterized protein LOC111453182    1.3e-87    88.27
Query:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRSSTHTAKLDLEQPAEGEIEAN
        MEKLPKA+EKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSR SGWNP+SWDKRNEICFRQIYEQDAKNY RS+T T KLDLEQPAE E EAN
Subjt:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRSSTHTAKLDLEQPAEGEIEAN

Query:  NGALQIINENELELTLGPSSYNTSDSGI-THSSSSTGSSHEGRRTDTKKVKGQEMEVLGVTENSSGYQNGSNRGEKKM--LDYPPWLFQVVSLNMT
        NGALQIINENELELTLGPSSYNTSDSGI THSSSSTGSSHEGRR D K+V+GQEM VLGV EN SG+QNGS+RGEKKM  +DYPPWLFQ VSLNMT
Subjt:  NGALQIINENELELTLGPSSYNTSDSGI-THSSSSTGSSHEGRRTDTKKVKGQEMEVLGVTENSSGYQNGSNRGEKKM--LDYPPWLFQVVSLNMT

A0A6J1HJS3 uncharacterized protein LOC111463569    9.0e-84    85.28
Query:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRSST-HTAKLDLEQPAE-GEIE
        MEKLP+  EKE+MRMAMLKHEETFKQQVHELHRLYRTQKTLMKN+EK+RQSGWNPESWDKRNEICFRQIYEQDAK YYRSST H+ K+DLEQPAE  + E
Subjt:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRSST-HTAKLDLEQPAE-GEIE

Query:  ANNGALQIINENELELTLGPSSYNTSDSGITHSSSSTGSSHEGRRTDTKKV-KGQEMEVLGVTENSSGYQNGSNRGEKKM-LDYPPWLFQVVSLNMT
        +++GALQ INENELELTLGPSSYNTSDSG+THSSSSTGSSHEGRR D+K+V KGQEMEVLGV ENSSGYQNGS++GEKKM LDYPPWLFQVVS NMT
Subjt:  ANNGALQIINENELELTLGPSSYNTSDSGITHSSSSTGSSHEGRRTDTKKV-KGQEMEVLGVTENSSGYQNGSNRGEKKM-LDYPPWLFQVVSLNMT

A0A6J1IUL1 uncharacterized protein LOC111478714    1.1e-84    86.22
Query:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRSSTHTAKLDLEQPAEGEIEAN
        MEKLPKA+EKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSR SGWN +SWDKRNEICFRQIYEQDAKNY RS+T T KLDLEQPAE E EAN
Subjt:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRSSTHTAKLDLEQPAEGEIEAN

Query:  NGALQIINENELELTLGPSSYNTSDSGI-THSSSSTGSSHEGRRTDTKKVKGQEMEVLGVTENSSGYQNGSNRGEKKM--LDYPPWLFQVVSLNMT
        NGALQIINENELELTLGPSSYNTSDSGI THSSSSTGSSHEGRR D  +V+ QEM VLGV EN SG+QNGS+RGEKKM  +DYPPWLFQ VSLN+T
Subjt:  NGALQIINENELELTLGPSSYNTSDSGI-THSSSSTGSSHEGRRTDTKKVKGQEMEVLGVTENSSGYQNGSNRGEKKM--LDYPPWLFQVVSLNMT

A0A6J1KXC4 uncharacterized protein LOC111497211    2.8e-85    85.71
Query:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRSS-THTAKLDLEQPAE-GEIE
        MEKLP+ YEKE+MRMAMLKHEETFKQQVHELHRLYRTQKTLMKN+EKSRQSGWNPESWDKRNEICFRQIY+QDAK YYRSS TH+ K+DLEQPAE  + E
Subjt:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRSS-THTAKLDLEQPAE-GEIE

Query:  ANNGALQIINENELELTLGPSSYNTSDSGITHSSSSTGSSHEGRRTDTKKVKGQEMEVLGVTENSSGYQNGSNRGEKK-MLDYPPWLFQVVSLNMT
        +++GALQ INE ELELTLGPSSYNTSDSG+THSSSSTGSSHEGRR D+K+VKGQEMEVLGV ENSSGYQNGS+RGEKK MLDYPPW+FQVVS NMT
Subjt:  ANNGALQIINENELELTLGPSSYNTSDSGITHSSSSTGSSHEGRRTDTKKVKGQEMEVLGVTENSSGYQNGSNRGEKK-MLDYPPWLFQVVSLNMT

SwissProt top hits    e value    % identity    Alignment
No hits found
Arabidopsis top hits    e value    % identity    Alignment
AT1G26620.1 Plant protein of unknown function (DUF863)    6.7e-07    61.54
Query:  YEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVE
        YEK++M+  ML+HE  FK QVHELHRLYR QK L++ V+
Subjt:  YEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVE

AT1G69360.1 Plant protein of unknown function (DUF863)    8.8e-07    57.5
Query:  AYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVE
        +YE+++++  ML+HE  FK QV+ELHRLYRTQK+LM  V+
Subjt:  AYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVE

AT5G57340.2 unknown protein    2.8e-05    28.64
Query:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRSSTHTAKLDLEQPAEGEIEAN
        M++  +    E +R  M   E+ FKQQV ELHR+Y TQK +M  + K R   W               +  +D       +   + +DL    E  + A 
Subjt:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRSSTHTAKLDLEQPAEGEIEAN

Query:  NGALQIINENELELTL--GPSSYNTSDSGITHSSSSTGSSHEGRRTDTKKVKGQEMEVLGVTENSSGYQNGSNR-----------GEKKMLDYPPWLFQV
              I E+ELELTL  G SS +T+ +  T+      SS    R+ +     Q    L    N++   +G N             EKK    P WLFQ 
Subjt:  NGALQIINENELELTL--GPSSYNTSDSGITHSSSSTGSSHEGRRTDTKKVKGQEMEVLGVTENSSGYQNGSNR-----------GEKKMLDYPPWLFQV

Query:  VSLNMT
        +S+N T
Subjt:  VSLNMT

AT5G67390.1 unknown protein    1.5e-14    33.82
Query:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRS--STHTAKLDLEQPAEGEIE
        MEKL   Y+K+ M+MAMLKHEETFKQQV+ELHRLY+ QK LMKN+E ++ +  N                     N+  S   T   ++D E        
Subjt:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRS--STHTAKLDLEQPAEGEIE

Query:  ANNGALQIINENELELTLGPSSY--------NTSDSGITHSSSSTGSSHEGRRTDTKKVKGQEMEVLGVTENSSGYQNGSNRGEKKM--LDYPPWLFQVV
          N  ++I++E+E+ELTLGPS Y        N      +      GS + GRR+ +    G        + N++       R E+ M      PWL Q +
Subjt:  ANNGALQIINENELELTLGPSSY--------NTSDSGITHSSSSTGSSHEGRRTDTKKVKGQEMEVLGVTENSSGYQNGSNRGEKKM--LDYPPWLFQVV

Query:  SLNM
        +LN+
Subjt:  SLNM

AT5G67390.2 unknown protein    1.5e-14    33.82
Query:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRS--STHTAKLDLEQPAEGEIE
        MEKL   Y+K+ M+MAMLKHEETFKQQV+ELHRLY+ QK LMKN+E ++ +  N                     N+  S   T   ++D E        
Subjt:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRS--STHTAKLDLEQPAEGEIE

Query:  ANNGALQIINENELELTLGPSSY--------NTSDSGITHSSSSTGSSHEGRRTDTKKVKGQEMEVLGVTENSSGYQNGSNRGEKKM--LDYPPWLFQVV
          N  ++I++E+E+ELTLGPS Y        N      +      GS + GRR+ +    G        + N++       R E+ M      PWL Q +
Subjt:  ANNGALQIINENELELTLGPSSY--------NTSDSGITHSSSSTGSSHEGRRTDTKKVKGQEMEVLGVTENSSGYQNGSNRGEKKM--LDYPPWLFQVV

Query:  SLNM
        +LN+
Subjt:  SLNM
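
The % identity column can be related to the middle line of each alignment, where a letter marks an identical residue and `+` or a blank marks a non-identical column. A minimal Python sketch, assuming identity is defined as identical columns divided by alignment length (the usual BLAST convention; the page does not state it) and that the middle line is kept at full alignment width, trailing blanks included. `percent_identity` is a hypothetical helper, shown on the AT1G26620.1 hit:

```python
def percent_identity(midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    Letters mark identical residues; '+' (conservative substitution) and
    blanks (mismatch or gap) count as non-identical columns.
    """
    if not midline:
        raise ValueError("empty midline")
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(midline)


# Middle line of the AT1G26620.1 alignment (39 columns).
midline = "YEK++M+  ML+HE  FK QVHELHRLYR QK L++ V+"
print(f"{percent_identity(midline):.2f}")  # 61.54
```

Here 24 of 39 columns are identical, which matches the 61.54 listed for this hit. For hits split across several Query/Subjt blocks, the middle lines would need to be concatenated before counting.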


Sequences
CDS sequence
ATGGAAAAGCCTTTATCACAAGTTTTTTATTCATTCGTACTGGGGTGTTCTGTCCTCATTATAGCTGAATACCCAATTCATAATGGCGTTGGCCAGTGGTACTCAAAGGA
AATAATTTATTTATATATATTTTTTATCAAACTCGAGACCTTTGGGTTAATGATTTTTAAGCCTATTTTGGGGACTCAAGATTCAAGACTGTATCAGTTAAAAAAAGGGG
CTGAAATCCCAATGGAAAAGCTGCCCAAGGCATATGAAAAGGAGTACATGAGGATGGCTATGTTAAAGCATGAAGAAACATTCAAACAACAGGTACATGAACTTCATCGG
CTTTATCGAACCCAAAAGACTTTAATGAAAAACGTAGAGAAAAGCAGACAAAGTGGATGGAATCCAGAGAGTTGGGACAAAAGGAATGAGATATGTTTCAGACAAATTTA
TGAACAGGATGCAAAAAATTATTACAGATCATCAACTCATACAGCCAAACTAGACTTGGAGCAGCCTGCTGAAGGTGAAATAGAAGCCAATAATGGAGCTTTGCAGATCA
TAAACGAGAATGAACTCGAACTAACTCTAGGGCCTTCAAGTTACAATACTTCAGATTCAGGAATAACCCACTCTTCTTCTTCAACAGGGTCCAGCCATGAGGGAAGACGT
ACGGACACAAAGAAAGTTAAAGGTCAAGAAATGGAGGTTTTGGGTGTGACTGAAAATTCTTCAGGCTACCAAAATGGAAGTAATAGGGGGGAGAAGAAGATGCTAGATTA
CCCTCCTTGGCTTTTTCAAGTTGTGAGCCTTAACATGACTTAA
mRNA sequence
TGGCTATTTCTGAAGCATATCTTTCACGAAACTTGAAGGAGATTGGCTGGAATCTGACCCCAAACATCGCAGAACTTGGGTACGTTTCTGTTCTTGTTCCCCTCTTCCTC
AGTGGGTAAACGAGAGAGAATCTTTTTGTTTCAGTTCAATTTATTTTCCTCTTTCCTTTATTTATTCACTACTCATTTGAGGGGCATGCTGGAGTCTTCCATGTTCCATC
TTTCTCTGTGCAAACGGATTAGGATTTTTTTGCCCAAGCTAGAACTTCTTGGAGGAGATGAAGTTTCCTCTTCTTCTTTTTCTTCATCTTCATCTTCCTCTTCCTCCATT
TCTTCCCTCTTTTTCGTCTTAGCTTTTGGCTTCTTTTGCAGTGTCCTGAAATATAATATAAAAATACATGCGATCGAAGCTCTGCAATTCCATCTGCTGATCATCAGTCG
TTAGGATTACGGATCATTGTTTCAGGTAATTCCTGTGTTTTTTCGCGTTTTGGTTGTATAGAAGCAATTTTGGTTCGTTTTTCTGTTACTCTCTGGATCGATCTGTGTGT
TTAATTCGAGGTTCGATCCAGACATTGTAATTTGTAATTAGAGTTGGTTTTGGTTTAATCTTTGTTTGGTTTTAATGTACTACAGAGTTTCTGTGAAAGCGAATTCTTAT
CGTATGGCTTTTTTTTTAGGTAGCGCATTAAGTGCTCGACGTAATGACTGAATGAGAAAGGGATATTTAGGATAGAAAACAAAAGACCAATGGAAAAGCCTTTATCACAA
GTTTTTTATTCATTCGTACTGGGGTGTTCTGTCCTCATTATAGCTGAATACCCAATTCATAATGGCGTTGGCCAGTGGTACTCAAAGGAAATAATTTATTTATATATATT
TTTTATCAAACTCGAGACCTTTGGGTTAATGATTTTTAAGCCTATTTTGGGGACTCAAGATTCAAGACTGTATCAGTTAAAAAAAGGGGCTGAAATCCCAATGGAAAAGC
TGCCCAAGGCATATGAAAAGGAGTACATGAGGATGGCTATGTTAAAGCATGAAGAAACATTCAAACAACAGGTACATGAACTTCATCGGCTTTATCGAACCCAAAAGACT
TTAATGAAAAACGTAGAGAAAAGCAGACAAAGTGGATGGAATCCAGAGAGTTGGGACAAAAGGAATGAGATATGTTTCAGACAAATTTATGAACAGGATGCAAAAAATTA
TTACAGATCATCAACTCATACAGCCAAACTAGACTTGGAGCAGCCTGCTGAAGGTGAAATAGAAGCCAATAATGGAGCTTTGCAGATCATAAACGAGAATGAACTCGAAC
TAACTCTAGGGCCTTCAAGTTACAATACTTCAGATTCAGGAATAACCCACTCTTCTTCTTCAACAGGGTCCAGCCATGAGGGAAGACGTACGGACACAAAGAAAGTTAAA
GGTCAAGAAATGGAGGTTTTGGGTGTGACTGAAAATTCTTCAGGCTACCAAAATGGAAGTAATAGGGGGGAGAAGAAGATGCTAGATTACCCTCCTTGGCTTTTTCAAGT
TGTGAGCCTTAACATGACTTAATTTTCAAGGTATTAAAAAAAAAAAAACAAATTCCAATTCATTTGAATCAAATTATATGAGAACATCTTTGGTTGCTTCTCATTGACTA
ATCAACAGGGTGATTGATGTTTAGTTTGTATTGAGCTTTCTTCTTCATCTCTTGCTTGTATATTGAGATGTTGATGTGTATTACTTTGTAAATTTTTAACTGAGGAAAGA
AAAATCACTTTTGAATTTCAAGCTACCA
Protein sequence
MEKPLSQVFYSFVLGCSVLIIAEYPIHNGVGQWYSKEIIYLYIFFIKLETFGLMIFKPILGTQDSRLYQLKKGAEIPMEKLPKAYEKEYMRMAMLKHEETFKQQVHELHR
LYRTQKTLMKNVEKSRQSGWNPESWDKRNEICFRQIYEQDAKNYYRSSTHTAKLDLEQPAEGEIEANNGALQIINENELELTLGPSSYNTSDSGITHSSSSTGSSHEGRR
TDTKKVKGQEMEVLGVTENSSGYQNGSNRGEKKMLDYPPWLFQVVSLNMT
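
The protein sequence is the conceptual translation of the CDS. A minimal Python sketch of checking that relationship, assuming the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper, and only the first 27 and last 21 bases of the CDS are used to keep the example short:

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGAAAAGCCTTTATCACAAGTTTTT"))  # first 27 bases: MEKPLSQVF
print(translate("GTGAGCCTTAACATGACTTAA"))        # last 21 bases: VSLNMT
```

Both fragments agree with the start (MEKPLSQVF) and end (VSLNMT) of the protein sequence above. Passing the full CDS, with line breaks, to the same function should reproduce the whole protein under the same assumption.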