CuGenDBv2

HG10003429 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10003429
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: LysM domain-containing protein
Genome location: Chr08:1227113..1230901
RNA-Seq Expression: HG10003429
Synteny: HG10003429
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR018392 - LysM domain
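
The genome location field can be converted into a span length with a few lines of code. The sketch below is only an illustration: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(text):
    """Split a 'Chr:start..end' string into its parts and a span length.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


# Location string copied from the record above.
chrom, start, end, length = parse_location("Chr08:1227113..1230901")
print(chrom, start, end, length)
```

Under the inclusive-coordinate assumption, the gene spans 3789 bp on Chr08.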


Homology
GenBank top hits (e value, % identity, alignment)
KAA0039254.1 uncharacterized protein E6C27_scaffold64G00450 [Cucumis melo var. makuwa] (e value: 1.9e-120; % identity: 80.5)
Query:  MEVKLSQRNRADRFSLLPKLLPHSTLPSLT--HRNWANTRRSFKNQFRAIAMRWRFQLQDISKDQLSTKHQFVHIIEGRESLTSISNQNGDPAHSIVIAN
        MEVK+ QRNRA RFS    LLPH TLPSLT   RNWANT++SF NQ R I++RWRFQL D+SK QLSTKH FVHI+EG ESL SISN+NGDP +SIVI N
Subjt:  MEVKLSQRNRADRFSLLPKLLPHSTLPSLT--HRNWANTRRSFKNQFRAIAMRWRFQLQDISKDQLSTKHQFVHIIEGRESLTSISNQNGDPAHSIVIAN

Query:  KKIMDTDLEQKGQNIKIQNPRA---IRDVYQLEEKLQSALNGLRNYKKLFAQASSHLPPARTTSFIVLVPLIVFCARCIIGASSARVFGTLKLETVDKRE
        KKI+DTDLEQKGQNIKIQN RA   IRD +QLEEKLQSALNGL+ YKKLFA ASS LPPARTTSFIVLVPL++FC RCIIGAS ARVFGTLKL+ ++K+E
Subjt:  KKIMDTDLEQKGQNIKIQNPRA---IRDVYQLEEKLQSALNGLRNYKKLFAQASSHLPPARTTSFIVLVPLIVFCARCIIGASSARVFGTLKLETVDKRE

Query:  GEHHKFRSGHWRSALRDIREVDGLDCESPIDSSSPE-DEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE
        GE HKFRSGHWRSALRDIRE+DGLDCE+PIDS+SP  DEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE
Subjt:  GEHHKFRSGHWRSALRDIREVDGLDCESPIDSSSPE-DEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE

XP_008459633.1 PREDICTED: uncharacterized protein LOC103498697 isoform X2 [Cucumis melo] (e value: 2.5e-120; % identity: 80.14)
Query:  MEVKLSQRNRADRFSLLPKLLPHSTLPSLT--HRNWANTRRSFKNQFRAIAMRWRFQLQDISKDQLSTKHQFVHIIEGRESLTSISNQNGDPAHSIVIAN
        MEVK+ QRNRA RFS    LLPH TLPSLT   RNWANT++SF NQ R I++RWRFQL D+SK QLSTKH FVHI+EG ESL SISN+NGDP +SIVI N
Subjt:  MEVKLSQRNRADRFSLLPKLLPHSTLPSLT--HRNWANTRRSFKNQFRAIAMRWRFQLQDISKDQLSTKHQFVHIIEGRESLTSISNQNGDPAHSIVIAN

Query:  KKIMDTDLEQKGQNIKIQNPRA---IRDVYQLEEKLQSALNGLRNYKKLFAQASSHLPPARTTSFIVLVPLIVFCARCIIGASSARVFGTLKLETVDKRE
        KKI+DTD EQKGQNIKIQNPR    IRD +QLEEKLQ+ALNGL+ YKKLFA ASS LPPARTTSFIVLVPL++FC RCIIGAS ARVFGTLKL+ ++K+E
Subjt:  KKIMDTDLEQKGQNIKIQNPRA---IRDVYQLEEKLQSALNGLRNYKKLFAQASSHLPPARTTSFIVLVPLIVFCARCIIGASSARVFGTLKLETVDKRE

Query:  GEHHKFRSGHWRSALRDIREVDGLDCESPIDSSSP-EDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE
        GE HKFRSGHWRSALRDIRE+DGLDCE+PIDS+SP EDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE
Subjt:  GEHHKFRSGHWRSALRDIREVDGLDCESPIDSSSP-EDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE

XP_011656102.1 uncharacterized protein LOC101208955 isoform X2 [Cucumis sativus] (e value: 2.1e-119; % identity: 80.78)
Query:  MEVKLSQRNRADRFSLLPKLLPHSTLPSLTHRNWANTRRSFKNQFRAIAMRWRFQ-LQDISKDQLSTKHQFVHIIEGRESLTSISNQNGDPAHSIVIANK
        MEVK+ QRNRA RFSLLP   P S + SL+  NWANT++SF NQ R IA+RWRFQ L DISK QLSTKH FVHI+EG ESLTS SNQNGDP HSIV+ANK
Subjt:  MEVKLSQRNRADRFSLLPKLLPHSTLPSLTHRNWANTRRSFKNQFRAIAMRWRFQ-LQDISKDQLSTKHQFVHIIEGRESLTSISNQNGDPAHSIVIANK

Query:  KIMDTDLEQKGQNIKIQNP---RAIRDVYQLEEKLQSALNGLRNYKKLFAQASSHLPPARTTSFIVLVPLIVFCARCIIGASSARVFGTLKLETVDKREG
        KIMDTDLEQK QNIKIQNP   R IR+ +QLEEKLQSALNGLR YKKLFA ASSH PPARTTSFIVLVPL++FCARCIIGAS AR FGTLKL+ +DK+EG
Subjt:  KIMDTDLEQKGQNIKIQNP---RAIRDVYQLEEKLQSALNGLRNYKKLFAQASSHLPPARTTSFIVLVPLIVFCARCIIGASSARVFGTLKLETVDKREG

Query:  EHHKFRSGHWRSALRDIREVDGLDCESPIDSSSP-EDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE
        E  KFRSGHWRSALRDIRE+DGLDCE+PIDS+SP EDEQISVE+LSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE
Subjt:  EHHKFRSGHWRSALRDIREVDGLDCESPIDSSSP-EDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE

XP_011656104.1 uncharacterized protein LOC101208955 isoform X3 [Cucumis sativus] (e value: 2.1e-119; % identity: 80.78)
Query:  MEVKLSQRNRADRFSLLPKLLPHSTLPSLTHRNWANTRRSFKNQFRAIAMRWRFQ-LQDISKDQLSTKHQFVHIIEG---RESLTSISNQNGDPAHSIVI
        MEVK+ QRNRA RFSLLP   P S + SL+  NWANT++SF NQ R IA+RWRFQ L DISK QLSTKH FVHI+EG    ESLTS SNQNGDP HSIV+
Subjt:  MEVKLSQRNRADRFSLLPKLLPHSTLPSLTHRNWANTRRSFKNQFRAIAMRWRFQ-LQDISKDQLSTKHQFVHIIEG---RESLTSISNQNGDPAHSIVI

Query:  ANKKIMDTDLEQKGQNIKIQNPRAIRDVYQLEEKLQSALNGLRNYKKLFAQASSHLPPARTTSFIVLVPLIVFCARCIIGASSARVFGTLKLETVDKREG
        ANKKIMDTDLEQK QNIKIQNPR IR+ +QLEEKLQSALNGLR YKKLFA ASSH PPARTTSFIVLVPL++FCARCIIGAS AR FGTLKL+ +DK+EG
Subjt:  ANKKIMDTDLEQKGQNIKIQNPRAIRDVYQLEEKLQSALNGLRNYKKLFAQASSHLPPARTTSFIVLVPLIVFCARCIIGASSARVFGTLKLETVDKREG

Query:  EHHKFRSGHWRSALRDIREVDGLDCESPIDSSSP-EDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE
        E  KFRSGHWRSALRDIRE+DGLDCE+PIDS+SP EDEQISVE+LSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE
Subjt:  EHHKFRSGHWRSALRDIREVDGLDCESPIDSSSP-EDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE

XP_038890844.1 uncharacterized protein LOC120080288 isoform X1 [Benincasa hispida] (e value: 7.6e-133; % identity: 88.13)
Query:  MEVKLSQRNRADRFSLLPKLLPHSTLPSLT-HRNWANTRRSFKNQFRAIAMRWRFQLQDISKDQLSTKHQFVHIIEGRESLTSISNQNGDPAHSIVIANK
        MEVKLSQRNRADRF L PKLLP  TLPSLT HRNWANTR+S KNQFRAI +RWRFQLQDISK+QLSTKH  VHI+EG ESLT   NQNGDP HSI +ANK
Subjt:  MEVKLSQRNRADRFSLLPKLLPHSTLPSLT-HRNWANTRRSFKNQFRAIAMRWRFQLQDISKDQLSTKHQFVHIIEGRESLTSISNQNGDPAHSIVIANK

Query:  KIMDTDLEQKGQNIKIQNPRAIRDVYQLEEKLQSALNGLRNYKKLFAQASSHLPPARTTSFIVLVPLIVFCARCIIGASSARVFGTLKLETVDKREGEHH
        +I DTDLEQKGQNIKIQNPRAIRDVYQLEEKLQSALN LRNYKKLFA ASSHLPPARTTSFIVLVPLIVFCARCIIGAS ARVFGT +LETVDKREG+HH
Subjt:  KIMDTDLEQKGQNIKIQNPRAIRDVYQLEEKLQSALNGLRNYKKLFAQASSHLPPARTTSFIVLVPLIVFCARCIIGASSARVFGTLKLETVDKREGEHH

Query:  KFRSGHWRSALRDIREVDGLDCESPIDSSSP-EDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE
        KFRSGHWRSALRDIRE+DGLDCESPIDS SP EDEQIS EDLSH YKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE
Subjt:  KFRSGHWRSALRDIREVDGLDCESPIDSSSP-EDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE
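
The middle line of each alignment block marks identical residues with a letter, similar residues with `+`, and mismatches or gaps with a space. As a rough sketch, a percent identity can be estimated from that line. The helper name `percent_identity` is hypothetical, and the formula (identical columns divided by aligned columns) is an assumption about how the identity column is derived, not something the record states.

```python
def percent_identity(query_line, midline):
    """Estimate percent identity from one aligned query line and its midline.

    Assumption: identity = identical columns / aligned columns * 100,
    where a letter in the midline marks an identical column.
    """
    columns = len(query_line)
    if columns == 0:
        raise ValueError("empty alignment line")
    # Pad the midline so trailing mismatch columns (spaces) are not lost.
    padded = midline.ljust(columns)[:columns]
    identical = sum(1 for ch in padded if ch.isalpha())
    return round(100.0 * identical / columns, 2)


# First 15 columns of the first alignment block above.
print(percent_identity("MEVKLSQRNRADRFS", "MEVK+ QRNRA RFS"))
```

To score a whole hit, the query and middle lines of every block would be concatenated before calling the function, with the `Query:` prefix and leading indentation removed first.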

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KSX5 Uncharacterized protein (e value: 1.0e-119; % identity: 80.78)
Query:  MEVKLSQRNRADRFSLLPKLLPHSTLPSLTHRNWANTRRSFKNQFRAIAMRWRFQ-LQDISKDQLSTKHQFVHIIEGRESLTSISNQNGDPAHSIVIANK
        MEVK+ QRNRA RFSLLP   P S + SL+  NWANT++SF NQ R IA+RWRFQ L DISK QLSTKH FVHI+EG ESLTS SNQNGDP HSIV+ANK
Subjt:  MEVKLSQRNRADRFSLLPKLLPHSTLPSLTHRNWANTRRSFKNQFRAIAMRWRFQ-LQDISKDQLSTKHQFVHIIEGRESLTSISNQNGDPAHSIVIANK

Query:  KIMDTDLEQKGQNIKIQNP---RAIRDVYQLEEKLQSALNGLRNYKKLFAQASSHLPPARTTSFIVLVPLIVFCARCIIGASSARVFGTLKLETVDKREG
        KIMDTDLEQK QNIKIQNP   R IR+ +QLEEKLQSALNGLR YKKLFA ASSH PPARTTSFIVLVPL++FCARCIIGAS AR FGTLKL+ +DK+EG
Subjt:  KIMDTDLEQKGQNIKIQNP---RAIRDVYQLEEKLQSALNGLRNYKKLFAQASSHLPPARTTSFIVLVPLIVFCARCIIGASSARVFGTLKLETVDKREG

Query:  EHHKFRSGHWRSALRDIREVDGLDCESPIDSSSP-EDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE
        E  KFRSGHWRSALRDIRE+DGLDCE+PIDS+SP EDEQISVE+LSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE
Subjt:  EHHKFRSGHWRSALRDIREVDGLDCESPIDSSSP-EDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE

A0A1S3CB50 uncharacterized protein LOC103498697 isoform X1 (e value: 2.0e-118; % identity: 77.93)
Query:  MEVKLSQRNRADRFSLLPKLLPHSTLPSLT--HRNWANTRRSFKNQFRAIAMRWRFQLQDISKDQLSTKHQFVHIIEGRESLTSISNQNGDPAHSIVIAN
        MEVK+ QRNRA RFS    LLPH TLPSLT   RNWANT++SF NQ R I++RWRFQL D+SK QLSTKH FVHI+EG ESL SISN+NGDP +SIVI N
Subjt:  MEVKLSQRNRADRFSLLPKLLPHSTLPSLT--HRNWANTRRSFKNQFRAIAMRWRFQLQDISKDQLSTKHQFVHIIEGRESLTSISNQNGDPAHSIVIAN

Query:  KKIMDTDLEQKGQNIKIQNPRA---IRDVYQLEEKLQSALNGLRNYKKLFAQASSHLPP--------ARTTSFIVLVPLIVFCARCIIGASSARVFGTLK
        KKI+DTD EQKGQNIKIQNPR    IRD +QLEEKLQ+ALNGL+ YKKLFA ASS LPP        ARTTSFIVLVPL++FC RCIIGAS ARVFGTLK
Subjt:  KKIMDTDLEQKGQNIKIQNPRA---IRDVYQLEEKLQSALNGLRNYKKLFAQASSHLPP--------ARTTSFIVLVPLIVFCARCIIGASSARVFGTLK

Query:  LETVDKREGEHHKFRSGHWRSALRDIREVDGLDCESPIDSSSP-EDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE
        L+ ++K+EGE HKFRSGHWRSALRDIRE+DGLDCE+PIDS+SP EDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE
Subjt:  LETVDKREGEHHKFRSGHWRSALRDIREVDGLDCESPIDSSSP-EDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE

A0A1S3CBV5 uncharacterized protein LOC103498697 isoform X2 (e value: 1.2e-120; % identity: 80.14)
Query:  MEVKLSQRNRADRFSLLPKLLPHSTLPSLT--HRNWANTRRSFKNQFRAIAMRWRFQLQDISKDQLSTKHQFVHIIEGRESLTSISNQNGDPAHSIVIAN
        MEVK+ QRNRA RFS    LLPH TLPSLT   RNWANT++SF NQ R I++RWRFQL D+SK QLSTKH FVHI+EG ESL SISN+NGDP +SIVI N
Subjt:  MEVKLSQRNRADRFSLLPKLLPHSTLPSLT--HRNWANTRRSFKNQFRAIAMRWRFQLQDISKDQLSTKHQFVHIIEGRESLTSISNQNGDPAHSIVIAN

Query:  KKIMDTDLEQKGQNIKIQNPRA---IRDVYQLEEKLQSALNGLRNYKKLFAQASSHLPPARTTSFIVLVPLIVFCARCIIGASSARVFGTLKLETVDKRE
        KKI+DTD EQKGQNIKIQNPR    IRD +QLEEKLQ+ALNGL+ YKKLFA ASS LPPARTTSFIVLVPL++FC RCIIGAS ARVFGTLKL+ ++K+E
Subjt:  KKIMDTDLEQKGQNIKIQNPRA---IRDVYQLEEKLQSALNGLRNYKKLFAQASSHLPPARTTSFIVLVPLIVFCARCIIGASSARVFGTLKLETVDKRE

Query:  GEHHKFRSGHWRSALRDIREVDGLDCESPIDSSSP-EDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE
        GE HKFRSGHWRSALRDIRE+DGLDCE+PIDS+SP EDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE
Subjt:  GEHHKFRSGHWRSALRDIREVDGLDCESPIDSSSP-EDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE

A0A5A7T6Z4 LysM domain-containing protein (e value: 9.4e-121; % identity: 80.5)
Query:  MEVKLSQRNRADRFSLLPKLLPHSTLPSLT--HRNWANTRRSFKNQFRAIAMRWRFQLQDISKDQLSTKHQFVHIIEGRESLTSISNQNGDPAHSIVIAN
        MEVK+ QRNRA RFS    LLPH TLPSLT   RNWANT++SF NQ R I++RWRFQL D+SK QLSTKH FVHI+EG ESL SISN+NGDP +SIVI N
Subjt:  MEVKLSQRNRADRFSLLPKLLPHSTLPSLT--HRNWANTRRSFKNQFRAIAMRWRFQLQDISKDQLSTKHQFVHIIEGRESLTSISNQNGDPAHSIVIAN

Query:  KKIMDTDLEQKGQNIKIQNPRA---IRDVYQLEEKLQSALNGLRNYKKLFAQASSHLPPARTTSFIVLVPLIVFCARCIIGASSARVFGTLKLETVDKRE
        KKI+DTDLEQKGQNIKIQN RA   IRD +QLEEKLQSALNGL+ YKKLFA ASS LPPARTTSFIVLVPL++FC RCIIGAS ARVFGTLKL+ ++K+E
Subjt:  KKIMDTDLEQKGQNIKIQNPRA---IRDVYQLEEKLQSALNGLRNYKKLFAQASSHLPPARTTSFIVLVPLIVFCARCIIGASSARVFGTLKLETVDKRE

Query:  GEHHKFRSGHWRSALRDIREVDGLDCESPIDSSSPE-DEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE
        GE HKFRSGHWRSALRDIRE+DGLDCE+PIDS+SP  DEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE
Subjt:  GEHHKFRSGHWRSALRDIREVDGLDCESPIDSSSPE-DEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE

A0A6J1CKQ6 uncharacterized protein LOC111012032 isoform X3 (e value: 1.1e-108; % identity: 76.47)
Query:  MEVKLSQRNRADRFSLLPKLLPHSTLPSLTHRNWANTRRSFKNQFRAIAMRWRFQLQDISKDQLSTKHQFVHIIEGRESLTSISNQNGDPAHSIVIANKK
        ME+K+SQRNRADRFSLLPKLLP  TLPS THR WA  +RS KNQF A+A+RWRFQLQDI +DQ  TKH FV I+EG E+ TSI  QNG   HSIVI N+K
Subjt:  MEVKLSQRNRADRFSLLPKLLPHSTLPSLTHRNWANTRRSFKNQFRAIAMRWRFQLQDISKDQLSTKHQFVHIIEGRESLTSISNQNGDPAHSIVIANKK

Query:  IMDTDLEQKGQNIKIQNPRAIRDVYQLEEKLQSALNGLRNYKKLFAQASSHLPPARTTSFIVLVPLIVFCARCIIGASSARVFGTLKLETVDKREGEHHK
        I DTDLE KGQ+ KI+NP AIRDVYQL+EKLQS+LNGL+NYKKLF   S  LPPARTTSFIVLVPLIVFCARCIIGAS ARV  T KL+T+DK EGEHHK
Subjt:  IMDTDLEQKGQNIKIQNPRAIRDVYQLEEKLQSALNGLRNYKKLFAQASSHLPPARTTSFIVLVPLIVFCARCIIGASSARVFGTLKLETVDKREGEHHK

Query:  FRSGHWRSALRDIREVDGLDCESPID---SSSPE-DEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRG
        FRSGHWRSALRDIRE+DGLD ES  D   S+SP  DEQISVEDLSHAYKKLD+DYEKFLSECGLS  GYWRG
Subjt:  FRSGHWRSALRDIREVDGLDCESPID---SSSPE-DEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRG

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G09970.1 unknown protein (e value: 3.5e-11; % identity: 27.08)
Query:  QFRAIAMRWRFQLQDISKDQLSTKHQFVHIIEGRESLTSISNQNG----DPAHSIVIANKKIMDTDLEQKGQNI---------KIQNPRAIRDVYQLEEK
        +F+  + R RF +Q +S+++  TKH+     +  ESL  I  Q G    +P  S    + ++ D D E+K   +         K+     + D+ ++E+ 
Subjt:  QFRAIAMRWRFQLQDISKDQLSTKHQFVHIIEGRESLTSISNQNG----DPAHSIVIANKKIMDTDLEQKGQNI---------KIQNPRAIRDVYQLEEK

Query:  LQSALNGLRNYKKLFAQASSHLPPARTTSFIV-LVPLIVFCARCIIGASSARVFGTLKLETVDKR---EGEHHKFRSGHWRSALRDIRE---VDGLDCES
         ++    +     L  Q    LP   T   +  L+P++ FC  CIIG           L T+  R   +G HH   S  WR+AL D  E    DG D  S
Subjt:  LQSALNGLRNYKKLFAQASSHLPPARTTSFIV-LVPLIVFCARCIIGASSARVFGTLKLETVDKR---EGEHHKFRSGHWRSALRDIRE---VDGLDCES

Query:  P-IDSSSPEDEQISVEDLSHAYKKLDQDYEKFLSECGLSK
        P    +S   E  + ++++ AY +++ +Y++FL ECG+ +
Subjt:  P-IDSSSPEDEQISVEDLSHAYKKLDQDYEKFLSECGLSK

AT4G09970.2 unknown protein (e value: 1.5e-09; % identity: 27.04)
Query:  IAMRWRFQLQDISKDQLSTKHQFVHIIEGRESLTSISNQNG----DPAHSIVIANKKIMDTDLEQKGQNI---------KIQNPRAIRDVYQLEEKLQSA
        ++ +W F +Q +S+++  TKH+     +  ESL  I  Q G    +P  S    + ++ D D E+K   +         K+     + D+ ++E+  ++ 
Subjt:  IAMRWRFQLQDISKDQLSTKHQFVHIIEGRESLTSISNQNG----DPAHSIVIANKKIMDTDLEQKGQNI---------KIQNPRAIRDVYQLEEKLQSA

Query:  LNGLRNYKKLFAQASSHLPPARTTSFIV-LVPLIVFCARCIIGASSARVFGTLKLETVDKR---EGEHHKFRSGHWRSALRDIREVDGLDCESPIDSSSP
           +     L  Q    LP   T   +  L+P++ FC  CIIG           L T+  R   +G HH   S  WR+AL D  E    D     DS SP
Subjt:  LNGLRNYKKLFAQASSHLPPARTTSFIV-LVPLIVFCARCIIGASSARVFGTLKLETVDKR---EGEHHKFRSGHWRSALRDIREVDGLDCESPIDSSSP

Query:  E-DEQISVEDLSHAYKKLDQDYEKFLSECGLSK
        E  E  + ++++ AY +++ +Y++FL ECG+ +
Subjt:  E-DEQISVEDLSHAYKKLDQDYEKFLSECGLSK


Sequences
CDS sequence
ATGGAAGTGAAGCTCAGCCAGAGAAATAGAGCAGACCGTTTTTCTCTCCTCCCCAAGTTGCTCCCACACTCGACTCTCCCTTCTCTAACTCACAGAAATTGGGCCAACAC
CAGAAGATCATTCAAGAATCAATTCAGAGCCATTGCTATGAGATGGAGGTTTCAACTTCAAGATATATCCAAAGATCAACTCTCCACCAAGCACCAGTTTGTTCATATTA
TAGAAGGGAGGGAGAGCTTGACTTCAATTTCAAATCAGAATGGAGATCCTGCACATTCCATTGTCATAGCTAATAAGAAGATAATGGACACGGATCTAGAACAAAAGGGG
CAGAATATCAAGATTCAAAACCCTCGAGCGATTAGAGATGTATATCAATTAGAAGAAAAGCTTCAAAGTGCTTTGAATGGACTTCGAAACTATAAGAAGCTTTTCGCGCA
AGCCTCCTCTCATCTACCTCCTGCTAGAACCACTAGTTTTATAGTTTTGGTTCCTCTTATAGTATTTTGTGCCAGATGCATAATTGGTGCCTCTTCTGCTAGAGTTTTCG
GAACATTGAAGCTTGAAACCGTTGATAAACGAGAGGGAGAACATCACAAGTTCAGAAGCGGGCACTGGAGATCTGCTCTTCGTGATATAAGGGAAGTGGATGGTTTGGAT
TGTGAGTCCCCTATAGATTCTTCAAGTCCAGAAGATGAACAGATCTCAGTAGAAGATTTATCACATGCTTACAAGAAACTGGACCAGGATTACGAAAAATTTCTATCAGA
ATGTGGACTGAGTAAATGGGGCTACTGGCGTGGGGGTACCCAGAGACCTGAACAGGAATAG
mRNA sequence
ATGGAAGTGAAGCTCAGCCAGAGAAATAGAGCAGACCGTTTTTCTCTCCTCCCCAAGTTGCTCCCACACTCGACTCTCCCTTCTCTAACTCACAGAAATTGGGCCAACAC
CAGAAGATCATTCAAGAATCAATTCAGAGCCATTGCTATGAGATGGAGGTTTCAACTTCAAGATATATCCAAAGATCAACTCTCCACCAAGCACCAGTTTGTTCATATTA
TAGAAGGGAGGGAGAGCTTGACTTCAATTTCAAATCAGAATGGAGATCCTGCACATTCCATTGTCATAGCTAATAAGAAGATAATGGACACGGATCTAGAACAAAAGGGG
CAGAATATCAAGATTCAAAACCCTCGAGCGATTAGAGATGTATATCAATTAGAAGAAAAGCTTCAAAGTGCTTTGAATGGACTTCGAAACTATAAGAAGCTTTTCGCGCA
AGCCTCCTCTCATCTACCTCCTGCTAGAACCACTAGTTTTATAGTTTTGGTTCCTCTTATAGTATTTTGTGCCAGATGCATAATTGGTGCCTCTTCTGCTAGAGTTTTCG
GAACATTGAAGCTTGAAACCGTTGATAAACGAGAGGGAGAACATCACAAGTTCAGAAGCGGGCACTGGAGATCTGCTCTTCGTGATATAAGGGAAGTGGATGGTTTGGAT
TGTGAGTCCCCTATAGATTCTTCAAGTCCAGAAGATGAACAGATCTCAGTAGAAGATTTATCACATGCTTACAAGAAACTGGACCAGGATTACGAAAAATTTCTATCAGA
ATGTGGACTGAGTAAATGGGGCTACTGGCGTGGGGGTACCCAGAGACCTGAACAGGAATAG
Protein sequence
MEVKLSQRNRADRFSLLPKLLPHSTLPSLTHRNWANTRRSFKNQFRAIAMRWRFQLQDISKDQLSTKHQFVHIIEGRESLTSISNQNGDPAHSIVIANKKIMDTDLEQKG
QNIKIQNPRAIRDVYQLEEKLQSALNGLRNYKKLFAQASSHLPPARTTSFIVLVPLIVFCARCIIGASSARVFGTLKLETVDKREGEHHKFRSGHWRSALRDIREVDGLD
CESPIDSSSPEDEQISVEDLSHAYKKLDQDYEKFLSECGLSKWGYWRGGTQRPEQE
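
The relationship between the CDS and protein sequences can be checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene. The function name `translate` is hypothetical, and only the first 30 nucleotides of the CDS are used as the example input.

```python
# Standard genetic code, built in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Assumption: the standard genetic code is appropriate for this gene.
    Whitespace and line breaks in the input are ignored.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if codon not in CODON_TABLE:
            raise ValueError(f"unexpected codon {codon!r} at position {i}")
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed above.
print(translate("ATGGAAGTGAAGCTCAGCCAGAGAAATAGA"))
```

Pasting the full CDS into `translate` would be expected to reproduce the protein sequence listed above, provided the standard code applies. The final `TAG` codon is a stop and contributes no residue.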