CuGenDBv2

Tan0022441 (gene) of Snake gourd v1 genome

Gene ID: Tan0022441
Organism: Trichosanthes anguina (Snake gourd v1)
Description: LysM domain-containing protein
Genome location: LG10:63739292..63743646
RNA-Seq Expression: Tan0022441
Synteny: Tan0022441
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits | e value | % identity | Alignment
KAA0039254.1 uncharacterized protein E6C27_scaffold64G00450 [Cucumis melo var. makuwa] | 8.5e-105 | 74.64
Query:  MEVKLSQRNRADRFSLLPQPTFPSLT--LRNWANTKRSLNNQFRTIKLRWRFQLQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKIM
        MEVK+ QRNRA RFSLLP PT PSLT  LRNWANTK+S NNQ R I LRWRFQL D+SK QLS KHHFVHI+EG ESL+S+ N+NG P +SIVI ++KI+
Subjt:  MEVKLSQRNRADRFSLLPQPTFPSLT--LRNWANTKRSLNNQFRTIKLRWRFQLQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKIM

Query:  DMDLKQKRQDIKIENPRA---IRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR-TSLIVLIPLIVFCARCIIGASYARVIGTSKLKTANKPEGERH
        D DL+QK Q+IKI+N RA   IR  + L+EKLQS+LNGL  YK  F LA  RL PAR TS IVL+PL++FC RCIIGASYARV GT KLK  NK EGERH
Subjt:  DMDLKQKRQDIKIENPRA---IRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR-TSLIVLIPLIVFCARCIIGASYARVIGTSKLKTANKPEGERH

Query:  KFRSGHWRSALRDIRELDGLDSESSIDYTSPS-EEEISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
        KFRSGHWRSALRDIRELDGLD E+ ID TSPS +E+ISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
Subjt:  KFRSGHWRSALRDIRELDGLDSESSIDYTSPS-EEEISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE

XP_008459632.1 PREDICTED: uncharacterized protein LOC103498697 isoform X1 [Cucumis melo] | 1.8e-102 | 71.48
Query:  MEVKLSQRNRADRFSLLPQPTFPSLT--LRNWANTKRSLNNQFRTIKLRWRFQLQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKIM
        MEVK+ QRNRA RFSLLP PT PSLT  LRNWANTK+S NNQ R I LRWRFQL D+SK QLS KHHFVHI+EG ESL+S+ N+NG P +SIVI ++KI+
Subjt:  MEVKLSQRNRADRFSLLPQPTFPSLT--LRNWANTKRSLNNQFRTIKLRWRFQLQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKIM

Query:  DMDLKQKRQDIKIENPRA---IRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR---------TSLIVLIPLIVFCARCIIGASYARVIGTSKLKTA
        D D +QK Q+IKI+NPR    IR  + L+EKLQ++LNGL  YK  F LA  RL P           TS IVL+PL++FC RCIIGASYARV GT KLK  
Subjt:  DMDLKQKRQDIKIENPRA---IRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR---------TSLIVLIPLIVFCARCIIGASYARVIGTSKLKTA

Query:  NKPEGERHKFRSGHWRSALRDIRELDGLDSESSIDYTSPSE-EEISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
        NK EGERHKFRSGHWRSALRDIRELDGLD E+ ID TSPSE E+ISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
Subjt:  NKPEGERHKFRSGHWRSALRDIRELDGLDSESSIDYTSPSE-EEISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE

XP_008459633.1 PREDICTED: uncharacterized protein LOC103498697 isoform X2 [Cucumis melo] | 1.9e-104 | 74.28
Query:  MEVKLSQRNRADRFSLLPQPTFPSLT--LRNWANTKRSLNNQFRTIKLRWRFQLQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKIM
        MEVK+ QRNRA RFSLLP PT PSLT  LRNWANTK+S NNQ R I LRWRFQL D+SK QLS KHHFVHI+EG ESL+S+ N+NG P +SIVI ++KI+
Subjt:  MEVKLSQRNRADRFSLLPQPTFPSLT--LRNWANTKRSLNNQFRTIKLRWRFQLQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKIM

Query:  DMDLKQKRQDIKIENPRA---IRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR-TSLIVLIPLIVFCARCIIGASYARVIGTSKLKTANKPEGERH
        D D +QK Q+IKI+NPR    IR  + L+EKLQ++LNGL  YK  F LA  RL PAR TS IVL+PL++FC RCIIGASYARV GT KLK  NK EGERH
Subjt:  DMDLKQKRQDIKIENPRA---IRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR-TSLIVLIPLIVFCARCIIGASYARVIGTSKLKTANKPEGERH

Query:  KFRSGHWRSALRDIRELDGLDSESSIDYTSPSE-EEISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
        KFRSGHWRSALRDIRELDGLD E+ ID TSPSE E+ISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
Subjt:  KFRSGHWRSALRDIRELDGLDSESSIDYTSPSE-EEISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE

XP_011656102.1 uncharacterized protein LOC101208955 isoform X2 [Cucumis sativus] | 3.7e-100 | 72.92
Query:  MEVKLSQRNRADRFSLLPQPTFPS--LTLRNWANTKRSLNNQFRTIKLRWRFQ-LQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKI
        MEVK+ QRNRA RFSLLP  T PS  L+L NWANTK+S NNQ R I LRWRFQ L D+SK QLS KHHFVHI+EG ESL+S  NQNG P HSIV++++KI
Subjt:  MEVKLSQRNRADRFSLLPQPTFPS--LTLRNWANTKRSLNNQFRTIKLRWRFQ-LQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKI

Query:  MDMDLKQKRQDIKIENP---RAIRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR-TSLIVLIPLIVFCARCIIGASYARVIGTSKLKTANKPEGER
        MD DL+QKRQ+IKI+NP   R IR  + L+EKLQS+LNGL  YK  F LA     PAR TS IVL+PL++FCARCIIGASYAR  GT KLK  +K EGER
Subjt:  MDMDLKQKRQDIKIENP---RAIRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR-TSLIVLIPLIVFCARCIIGASYARVIGTSKLKTANKPEGER

Query:  HKFRSGHWRSALRDIRELDGLDSESSIDYTSPSE-EEISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
         KFRSGHWRSALRDIRELDGLD E+ ID TSPSE E+ISVE+LSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
Subjt:  HKFRSGHWRSALRDIRELDGLDSESSIDYTSPSE-EEISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE

XP_038890844.1 uncharacterized protein LOC120080288 isoform X1 [Benincasa hispida] | 1.6e-103 | 74.64
Query:  MEVKLSQRNRADRF----SLLPQPTFPSLT-LRNWANTKRSLNNQFRTIKLRWRFQLQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSR
        MEVKLSQRNRADRF     LLPQPT PSLT  RNWANT++SL NQFR I LRWRFQLQD+SK+QLS KHH VHIVEG ESL+   NQNG P+HSI ++++
Subjt:  MEVKLSQRNRADRF----SLLPQPTFPSLT-LRNWANTKRSLNNQFRTIKLRWRFQLQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSR

Query:  KIMDMDLKQKRQDIKIENPRAIRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR-TSLIVLIPLIVFCARCIIGASYARVIGTSKLKTANKPEGERH
        +I D DL+QK Q+IKI+NPRAIR  Y L+EKLQS+LN L  YK  F LA   L PAR TS IVL+PLIVFCARCIIGASYARV GTS+L+T +K EG+ H
Subjt:  KIMDMDLKQKRQDIKIENPRAIRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR-TSLIVLIPLIVFCARCIIGASYARVIGTSKLKTANKPEGERH

Query:  KFRSGHWRSALRDIRELDGLDSESSIDYTSPSE-EEISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
        KFRSGHWRSALRDIRELDGLD ES ID  SPSE E+IS EDLSH YKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
Subjt:  KFRSGHWRSALRDIRELDGLDSESSIDYTSPSE-EEISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KSX5 Uncharacterized protein | 1.8e-100 | 72.92
Query:  MEVKLSQRNRADRFSLLPQPTFPS--LTLRNWANTKRSLNNQFRTIKLRWRFQ-LQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKI
        MEVK+ QRNRA RFSLLP  T PS  L+L NWANTK+S NNQ R I LRWRFQ L D+SK QLS KHHFVHI+EG ESL+S  NQNG P HSIV++++KI
Subjt:  MEVKLSQRNRADRFSLLPQPTFPS--LTLRNWANTKRSLNNQFRTIKLRWRFQ-LQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKI

Query:  MDMDLKQKRQDIKIENP---RAIRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR-TSLIVLIPLIVFCARCIIGASYARVIGTSKLKTANKPEGER
        MD DL+QKRQ+IKI+NP   R IR  + L+EKLQS+LNGL  YK  F LA     PAR TS IVL+PL++FCARCIIGASYAR  GT KLK  +K EGER
Subjt:  MDMDLKQKRQDIKIENP---RAIRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR-TSLIVLIPLIVFCARCIIGASYARVIGTSKLKTANKPEGER

Query:  HKFRSGHWRSALRDIRELDGLDSESSIDYTSPSE-EEISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
         KFRSGHWRSALRDIRELDGLD E+ ID TSPSE E+ISVE+LSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
Subjt:  HKFRSGHWRSALRDIRELDGLDSESSIDYTSPSE-EEISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE

A0A1S3CB50 uncharacterized protein LOC103498697 isoform X1 | 8.6e-103 | 71.48
Query:  MEVKLSQRNRADRFSLLPQPTFPSLT--LRNWANTKRSLNNQFRTIKLRWRFQLQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKIM
        MEVK+ QRNRA RFSLLP PT PSLT  LRNWANTK+S NNQ R I LRWRFQL D+SK QLS KHHFVHI+EG ESL+S+ N+NG P +SIVI ++KI+
Subjt:  MEVKLSQRNRADRFSLLPQPTFPSLT--LRNWANTKRSLNNQFRTIKLRWRFQLQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKIM

Query:  DMDLKQKRQDIKIENPRA---IRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR---------TSLIVLIPLIVFCARCIIGASYARVIGTSKLKTA
        D D +QK Q+IKI+NPR    IR  + L+EKLQ++LNGL  YK  F LA  RL P           TS IVL+PL++FC RCIIGASYARV GT KLK  
Subjt:  DMDLKQKRQDIKIENPRA---IRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR---------TSLIVLIPLIVFCARCIIGASYARVIGTSKLKTA

Query:  NKPEGERHKFRSGHWRSALRDIRELDGLDSESSIDYTSPSE-EEISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
        NK EGERHKFRSGHWRSALRDIRELDGLD E+ ID TSPSE E+ISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
Subjt:  NKPEGERHKFRSGHWRSALRDIRELDGLDSESSIDYTSPSE-EEISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE

A0A1S3CBV5 uncharacterized protein LOC103498697 isoform X2 | 9.2e-105 | 74.28
Query:  MEVKLSQRNRADRFSLLPQPTFPSLT--LRNWANTKRSLNNQFRTIKLRWRFQLQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKIM
        MEVK+ QRNRA RFSLLP PT PSLT  LRNWANTK+S NNQ R I LRWRFQL D+SK QLS KHHFVHI+EG ESL+S+ N+NG P +SIVI ++KI+
Subjt:  MEVKLSQRNRADRFSLLPQPTFPSLT--LRNWANTKRSLNNQFRTIKLRWRFQLQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKIM

Query:  DMDLKQKRQDIKIENPRA---IRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR-TSLIVLIPLIVFCARCIIGASYARVIGTSKLKTANKPEGERH
        D D +QK Q+IKI+NPR    IR  + L+EKLQ++LNGL  YK  F LA  RL PAR TS IVL+PL++FC RCIIGASYARV GT KLK  NK EGERH
Subjt:  DMDLKQKRQDIKIENPRA---IRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR-TSLIVLIPLIVFCARCIIGASYARVIGTSKLKTANKPEGERH

Query:  KFRSGHWRSALRDIRELDGLDSESSIDYTSPSE-EEISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
        KFRSGHWRSALRDIRELDGLD E+ ID TSPSE E+ISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
Subjt:  KFRSGHWRSALRDIRELDGLDSESSIDYTSPSE-EEISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE

A0A5A7T6Z4 LysM domain-containing protein | 4.1e-105 | 74.64
Query:  MEVKLSQRNRADRFSLLPQPTFPSLT--LRNWANTKRSLNNQFRTIKLRWRFQLQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKIM
        MEVK+ QRNRA RFSLLP PT PSLT  LRNWANTK+S NNQ R I LRWRFQL D+SK QLS KHHFVHI+EG ESL+S+ N+NG P +SIVI ++KI+
Subjt:  MEVKLSQRNRADRFSLLPQPTFPSLT--LRNWANTKRSLNNQFRTIKLRWRFQLQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKIM

Query:  DMDLKQKRQDIKIENPRA---IRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR-TSLIVLIPLIVFCARCIIGASYARVIGTSKLKTANKPEGERH
        D DL+QK Q+IKI+N RA   IR  + L+EKLQS+LNGL  YK  F LA  RL PAR TS IVL+PL++FC RCIIGASYARV GT KLK  NK EGERH
Subjt:  DMDLKQKRQDIKIENPRA---IRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR-TSLIVLIPLIVFCARCIIGASYARVIGTSKLKTANKPEGERH

Query:  KFRSGHWRSALRDIRELDGLDSESSIDYTSPS-EEEISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
        KFRSGHWRSALRDIRELDGLD E+ ID TSPS +E+ISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
Subjt:  KFRSGHWRSALRDIRELDGLDSESSIDYTSPS-EEEISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE

A0A6J1CKQ6 uncharacterized protein LOC111012032 isoform X3 | 3.9e-95 | 72.43
Query:  MEVKLSQRNRADRFS----LLPQPTFPSLTLRNWANTKRSLNNQFRTIKLRWRFQLQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRK
        ME+K+SQRNRADRFS    LLPQPT PS T R WA  KRS  NQF  + LRWRFQLQD+ +DQ   KHHFV IVEGGE+ +S+L QNGV +HSIVI +RK
Subjt:  MEVKLSQRNRADRFS----LLPQPTFPSLTLRNWANTKRSLNNQFRTIKLRWRFQLQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRK

Query:  IMDMDLKQKRQDIKIENPRAIRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR-TSLIVLIPLIVFCARCIIGASYARVIGTSKLKTANKPEGERHK
        I D DL+ K QD KI NP AIR  Y LQEKLQSSLNGL  YK  F    PRL PAR TS IVL+PLIVFCARCIIGASYARV  TSKLKT +K EGE HK
Subjt:  IMDMDLKQKRQDIKIENPRAIRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPAR-TSLIVLIPLIVFCARCIIGASYARVIGTSKLKTANKPEGERHK

Query:  FRSGHWRSALRDIRELDGLDSESSID---YTSPS-EEEISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRG
        FRSGHWRSALRDIRELDGLDSESS D     SPS +E+ISVEDLSHAYKKLD+DYEKFLSECGLS  GYWRG
Subjt:  FRSGHWRSALRDIRELDGLDSESSID---YTSPS-EEEISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRG

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT4G09970.1 unknown protein | 3.2e-09 | 25.42
Query:  NNQFRTIKLRWRFQLQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGV----PSHSIVISSRKIMDMDLKQKRQDIK---IENPRAIRYRYPLQ-----E
        + +F+    R RF +Q MS+++   KH      +  ESL  +L Q GV    P  S   +S ++ D+D ++K   +    I++   +     L      E
Subjt:  NNQFRTIKLRWRFQLQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGV----PSHSIVISSRKIMDMDLKQKRQDIK---IENPRAIRYRYPLQ-----E

Query:  KLQSSLNGLGKYKMHFTLACPRLCPARTSLIVLIPLIVFCARCIIGASYARVIGTSKLKTANKPEGERHKFRSGHWRSALRDIRE---LDGLDSES-SID
        K   ++              P L      L  L+P++ FC  CIIG  +  +         ++   + H   S  WR+AL D  E    DG DS S    
Subjt:  KLQSSLNGLGKYKMHFTLACPRLCPARTSLIVLIPLIVFCARCIIGASYARVIGTSKLKTANKPEGERHKFRSGHWRSALRDIRE---LDGLDSES-SID

Query:  YTSPSEEEISVEDLSHAYKKLDQDYEKFLSECGLSK
          S ++E  + ++++ AY +++ +Y++FL ECG+ +
Subjt:  YTSPSEEEISVEDLSHAYKKLDQDYEKFLSECGLSK

AT4G09970.2 unknown protein | 3.9e-07 | 25
Query:  RWRFQLQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGV----PSHSIVISSRKIMDMDLKQKRQDIK---IENPRAIRYRYPLQ-----EKLQSSLNGL
        +W F +Q MS+++   KH      +  ESL  +L Q GV    P  S   +S ++ D+D ++K   +    I++   +     L      EK   ++   
Subjt:  RWRFQLQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGV----PSHSIVISSRKIMDMDLKQKRQDIK---IENPRAIRYRYPLQ-----EKLQSSLNGL

Query:  GKYKMHFTLACPRLCPARTSLIVLIPLIVFCARCIIGASYARVIGTSKLKTANKPEGERHKFRSGHWRSALRDIRELDGLDSESSIDYTSPS-EEEISVE
                   P L      L  L+P++ FC  CIIG  +  +         ++   + H   S  WR+AL D  E    D   S+   SP   E  + +
Subjt:  GKYKMHFTLACPRLCPARTSLIVLIPLIVFCARCIIGASYARVIGTSKLKTANKPEGERHKFRSGHWRSALRDIRELDGLDSESSIDYTSPS-EEEISVE

Query:  DLSHAYKKLDQDYEKFLSECGLSK
        +++ AY +++ +Y++FL ECG+ +
Subjt:  DLSHAYKKLDQDYEKFLSECGLSK


Sequences
CDS sequence
ATGGAAGTGAAGCTGAGCCAGAGAAATAGAGCAGACCGTTTTTCTCTCCTCCCACAACCAACTTTCCCTTCTCTAACTCTCAGAAATTGGGCAAACACCAAAAGATCATT
GAACAATCAATTCAGAACCATTAAGCTGAGATGGAGGTTTCAGCTTCAGGATATGTCCAAAGATCAACTCTCCCCCAAGCACCACTTTGTTCATATTGTCGAAGGGGGCG
AAAGCTTGAGTTCGGTTTTGAACCAGAATGGAGTTCCTTCACATTCCATTGTCATATCTAGTAGGAAGATAATGGACATGGATCTAAAACAAAAGAGGCAGGATATCAAG
ATTGAAAACCCTCGAGCGATTAGATATAGATATCCATTGCAAGAAAAGCTTCAGAGTTCTTTGAATGGACTTGGTAAATATAAAATGCATTTCACGCTTGCCTGCCCTCG
ACTATGTCCTGCTAGAACCAGTTTGATAGTTTTGATTCCTCTTATAGTATTTTGTGCTAGATGCATAATTGGTGCCTCTTATGCTAGAGTCATCGGAACATCGAAGCTTA
AAACCGCTAATAAACCAGAGGGAGAACGTCACAAGTTCAGAAGCGGCCATTGGAGATCTGCTCTTCGTGATATAAGGGAATTGGATGGTTTGGATTCTGAGTCATCCATA
GATTATACAAGTCCTTCAGAAGAAGAGATCTCAGTTGAAGATTTGTCACATGCTTACAAGAAACTGGACCAGGATTACGAAAAATTTCTATCAGAGTGTGGACTGAGTAA
ATGGGGCTACTGGCGTGGGGGTACACAGAGACCTGAATAG
mRNA sequence
ATGGAAGTGAAGCTGAGCCAGAGAAATAGAGCAGACCGTTTTTCTCTCCTCCCACAACCAACTTTCCCTTCTCTAACTCTCAGAAATTGGGCAAACACCAAAAGATCATT
GAACAATCAATTCAGAACCATTAAGCTGAGATGGAGGTTTCAGCTTCAGGATATGTCCAAAGATCAACTCTCCCCCAAGCACCACTTTGTTCATATTGTCGAAGGGGGCG
AAAGCTTGAGTTCGGTTTTGAACCAGAATGGAGTTCCTTCACATTCCATTGTCATATCTAGTAGGAAGATAATGGACATGGATCTAAAACAAAAGAGGCAGGATATCAAG
ATTGAAAACCCTCGAGCGATTAGATATAGATATCCATTGCAAGAAAAGCTTCAGAGTTCTTTGAATGGACTTGGTAAATATAAAATGCATTTCACGCTTGCCTGCCCTCG
ACTATGTCCTGCTAGAACCAGTTTGATAGTTTTGATTCCTCTTATAGTATTTTGTGCTAGATGCATAATTGGTGCCTCTTATGCTAGAGTCATCGGAACATCGAAGCTTA
AAACCGCTAATAAACCAGAGGGAGAACGTCACAAGTTCAGAAGCGGCCATTGGAGATCTGCTCTTCGTGATATAAGGGAATTGGATGGTTTGGATTCTGAGTCATCCATA
GATTATACAAGTCCTTCAGAAGAAGAGATCTCAGTTGAAGATTTGTCACATGCTTACAAGAAACTGGACCAGGATTACGAAAAATTTCTATCAGAGTGTGGACTGAGTAA
ATGGGGCTACTGGCGTGGGGGTACACAGAGACCTGAATAG
Protein sequence
MEVKLSQRNRADRFSLLPQPTFPSLTLRNWANTKRSLNNQFRTIKLRWRFQLQDMSKDQLSPKHHFVHIVEGGESLSSVLNQNGVPSHSIVISSRKIMDMDLKQKRQDIK
IENPRAIRYRYPLQEKLQSSLNGLGKYKMHFTLACPRLCPARTSLIVLIPLIVFCARCIIGASYARVIGTSKLKTANKPEGERHKFRSGHWRSALRDIRELDGLDSESSI
DYTSPSEEEISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPE
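
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code; the helper name `translate` and the table construction are illustrative and not part of CuGenDB. It is shown on the first 30 nucleotides of the CDS, which should correspond to the first 10 residues of the protein.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the gene uses the standard nuclear code). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the sequence can be
    pasted exactly as it is wrapped on the page.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the Tan0022441 CDS, copied from the record above.
cds_start = "ATGGAAGTGAAGCTGAGCCAGAGAAATAGA"
print(translate(cds_start))
```

Passing the full CDS (all wrapped lines concatenated) to the same function and comparing the result with the listed protein sequence is a quick consistency check for the record.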