CuGenDBv2

Lag0039667 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0039667
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: LysM domain-containing protein
Genome location: chr2:47954142..47957755
RNA-Seq Expression: Lag0039667
Synteny: Lag0039667
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR018392 - LysM domain
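
The genome location field packs a chromosome name and a coordinate pair into one string. A minimal sketch of splitting it and computing the span length follows. The function names `parse_location` and `span_length` are hypothetical, and the length assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


chrom, start, end = parse_location("chr2:47954142..47957755")
print(chrom, start, end, span_length(start, end))  # chr2 47954142 47957755 3614
```

Under that assumption the gene spans 3614 bp on chr2; with half-open coordinates the figure would be one less.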


Homology
GenBank top hits (e value, % identity, alignment)
KAA0039254.1 uncharacterized protein E6C27_scaffold64G00450 [Cucumis melo var. makuwa]; e value: 5.5e-107; % identity: 75.35
Query:  MEVKLSQRNRADRFSLLPKLLPQPTLPSLT--HRNWANTKRSLKNQFRAIALRWRFQLQDVYKDQLSTKHHFVHVVEGSESLTSVSNQNGVPTHSIVIAN
        MEVK+ QRNRA RFS    LLP PTLPSLT   RNWANTK+S  NQ R I+LRWRFQL DV K QLSTKHHFVH++EG+ESL S+SN+NG P +SIVI N
Subjt:  MEVKLSQRNRADRFSLLPKLLPQPTLPSLT--HRNWANTKRSLKNQFRAIALRWRFQLQDVYKDQLSTKHHFVHVVEGSESLTSVSNQNGVPTHSIVIAN

Query:  RKIMDTDQEHKGQDIKIQNSRA---IRDRYQLQEKLQSSLNRLRKYKTLGILATPRLPPARTTSLIVLVPLIVFCARCIIGASYARVFRASGLKTANKRE
        +KI+DTD E KGQ+IKIQNSRA   IRD +QL+EKLQS+LN L+ YK L  LA+ RLPPARTTS IVLVPL++FC RCIIGASYARVF    LK  NK+E
Subjt:  RKIMDTDQEHKGQDIKIQNSRA---IRDRYQLQEKLQSSLNRLRKYKTLGILATPRLPPARTTSLIVLVPLIVFCARCIIGASYARVFRASGLKTANKRE

Query:  GERHKFRSGHWRSALRDIRELDGSDSESSIDSTSPSEEEQISVEDLSHAYRKLDQDYEKFLSECGLSKWGYWRGGGGTQRPEQE
        GERHKFRSGHWRSALRDIRELDG D E+ IDSTSPS +EQISVEDLSHAY+KLDQDYEKFLSECGLSKWGYWR  GGTQRPEQE
Subjt:  GERHKFRSGHWRSALRDIRELDGSDSESSIDSTSPSEEEQISVEDLSHAYRKLDQDYEKFLSECGLSKWGYWRGGGGTQRPEQE

XP_008459633.1 PREDICTED: uncharacterized protein LOC103498697 isoform X2 [Cucumis melo]; e value: 2.1e-106; % identity: 74.65
Query:  MEVKLSQRNRADRFSLLPKLLPQPTLPSLT--HRNWANTKRSLKNQFRAIALRWRFQLQDVYKDQLSTKHHFVHVVEGSESLTSVSNQNGVPTHSIVIAN
        MEVK+ QRNRA RFS    LLP PTLPSLT   RNWANTK+S  NQ R I+LRWRFQL DV K QLSTKHHFVH++EG+ESL S+SN+NG P +SIVI N
Subjt:  MEVKLSQRNRADRFSLLPKLLPQPTLPSLT--HRNWANTKRSLKNQFRAIALRWRFQLQDVYKDQLSTKHHFVHVVEGSESLTSVSNQNGVPTHSIVIAN

Query:  RKIMDTDQEHKGQDIKIQNSRA---IRDRYQLQEKLQSSLNRLRKYKTLGILATPRLPPARTTSLIVLVPLIVFCARCIIGASYARVFRASGLKTANKRE
        +KI+DTD E KGQ+IKIQN R    IRD +QL+EKLQ++LN L+ YK L  LA+ RLPPARTTS IVLVPL++FC RCIIGASYARVF    LK  NK+E
Subjt:  RKIMDTDQEHKGQDIKIQNSRA---IRDRYQLQEKLQSSLNRLRKYKTLGILATPRLPPARTTSLIVLVPLIVFCARCIIGASYARVFRASGLKTANKRE

Query:  GERHKFRSGHWRSALRDIRELDGSDSESSIDSTSPSEEEQISVEDLSHAYRKLDQDYEKFLSECGLSKWGYWRGGGGTQRPEQE
        GERHKFRSGHWRSALRDIRELDG D E+ IDSTSPSE+EQISVEDLSHAY+KLDQDYEKFLSECGLSKWGYWR  GGTQRPEQE
Subjt:  GERHKFRSGHWRSALRDIRELDGSDSESSIDSTSPSEEEQISVEDLSHAYRKLDQDYEKFLSECGLSKWGYWRGGGGTQRPEQE

XP_022141746.1 uncharacterized protein LOC111012032 isoform X2 [Momordica charantia]; e value: 7.2e-107; % identity: 76.64
Query:  MEVKLSQRNRADRFSLLPKLLPQPTLPSLTHRNWANTKRSLKNQFRAIALRWRFQLQDVYKDQLSTKHHFVHVVEG-----SESLTSVSNQNGVPTHSIV
        ME+K+SQRNRADRFSLLPKLLPQPTLPS THR WA  KRS KNQF A+ALRWRFQLQD+ +DQ  TKHHFV +VEG      E+ TS+  QNGV THSIV
Subjt:  MEVKLSQRNRADRFSLLPKLLPQPTLPSLTHRNWANTKRSLKNQFRAIALRWRFQLQDVYKDQLSTKHHFVHVVEG-----SESLTSVSNQNGVPTHSIV

Query:  IANRKIMDTDQEHKGQDIKIQNSRAIRDRYQLQEKLQSSLNRLRKYKTLGILATPRLPPARTTSLIVLVPLIVFCARCIIGASYARVFRASGLKTANKRE
        I NRKI DTD EHKGQD KI+N  AIRD YQLQEKLQSSLN L+ YK L +  +PRLPPARTTS IVLVPLIVFCARCIIGASYARV + S LKT +K E
Subjt:  IANRKIMDTDQEHKGQDIKIQNSRAIRDRYQLQEKLQSSLNRLRKYKTLGILATPRLPPARTTSLIVLVPLIVFCARCIIGASYARVFRASGLKTANKRE

Query:  GERHKFRSGHWRSALRDIRELDGSDSESSIDSTSPSEEEQISVEDLSHAYRKLDQDYEKFLSECGLSKWGYWRG
        GE HKFRSGHWRSALRDIRELDG DSESS D +SPS +EQISVEDLSHAY+KLD+DYEKFLSECGLS  GYWRG
Subjt:  GERHKFRSGHWRSALRDIRELDGSDSESSIDSTSPSEEEQISVEDLSHAYRKLDQDYEKFLSECGLSKWGYWRG

XP_022141747.1 uncharacterized protein LOC111012032 isoform X3 [Momordica charantia]; e value: 1.5e-107; % identity: 77.57
Query:  MEVKLSQRNRADRFSLLPKLLPQPTLPSLTHRNWANTKRSLKNQFRAIALRWRFQLQDVYKDQLSTKHHFVHVVEGSESLTSVSNQNGVPTHSIVIANRK
        ME+K+SQRNRADRFSLLPKLLPQPTLPS THR WA  KRS KNQF A+ALRWRFQLQD+ +DQ  TKHHFV +VEG E+ TS+  QNGV THSIVI NRK
Subjt:  MEVKLSQRNRADRFSLLPKLLPQPTLPSLTHRNWANTKRSLKNQFRAIALRWRFQLQDVYKDQLSTKHHFVHVVEGSESLTSVSNQNGVPTHSIVIANRK

Query:  IMDTDQEHKGQDIKIQNSRAIRDRYQLQEKLQSSLNRLRKYKTLGILATPRLPPARTTSLIVLVPLIVFCARCIIGASYARVFRASGLKTANKREGERHK
        I DTD EHKGQD KI+N  AIRD YQLQEKLQSSLN L+ YK L +  +PRLPPARTTS IVLVPLIVFCARCIIGASYARV + S LKT +K EGE HK
Subjt:  IMDTDQEHKGQDIKIQNSRAIRDRYQLQEKLQSSLNRLRKYKTLGILATPRLPPARTTSLIVLVPLIVFCARCIIGASYARVFRASGLKTANKREGERHK

Query:  FRSGHWRSALRDIRELDGSDSESSID---STSPSEEEQISVEDLSHAYRKLDQDYEKFLSECGLSKWGYWRG
        FRSGHWRSALRDIRELDG DSESS D   S SPS +EQISVEDLSHAY+KLD+DYEKFLSECGLS  GYWRG
Subjt:  FRSGHWRSALRDIRELDGSDSESSID---STSPSEEEQISVEDLSHAYRKLDQDYEKFLSECGLSKWGYWRG

XP_038890844.1 uncharacterized protein LOC120080288 isoform X1 [Benincasa hispida]; e value: 4.5e-117; % identity: 80.71
Query:  MEVKLSQRNRADRFSLLPKLLPQPTLPSLT-HRNWANTKRSLKNQFRAIALRWRFQLQDVYKDQLSTKHHFVHVVEGSESLTSVSNQNGVPTHSIVIANR
        MEVKLSQRNRADRF L PKLLPQPTLPSLT HRNWANT++SLKNQFRAI LRWRFQLQD+ K+QLSTKHH VH+VEGSESLT   NQNG PTHSI +AN+
Subjt:  MEVKLSQRNRADRFSLLPKLLPQPTLPSLT-HRNWANTKRSLKNQFRAIALRWRFQLQDVYKDQLSTKHHFVHVVEGSESLTSVSNQNGVPTHSIVIANR

Query:  KIMDTDQEHKGQDIKIQNSRAIRDRYQLQEKLQSSLNRLRKYKTLGILATPRLPPARTTSLIVLVPLIVFCARCIIGASYARVFRASGLKTANKREGERH
        +I DTD E KGQ+IKIQN RAIRD YQL+EKLQS+LN LR YK L  LA+  LPPARTTS IVLVPLIVFCARCIIGASYARVF  S L+T +KREG+ H
Subjt:  KIMDTDQEHKGQDIKIQNSRAIRDRYQLQEKLQSSLNRLRKYKTLGILATPRLPPARTTSLIVLVPLIVFCARCIIGASYARVFRASGLKTANKREGERH

Query:  KFRSGHWRSALRDIRELDGSDSESSIDSTSPSEEEQISVEDLSHAYRKLDQDYEKFLSECGLSKWGYWRGGGGTQRPEQE
        KFRSGHWRSALRDIRELDG D ES IDS SPSE+EQIS EDLSH Y+KLDQDYEKFLSECGLSKWGYWR  GGTQRPEQE
Subjt:  KFRSGHWRSALRDIRELDGSDSESSIDSTSPSEEEQISVEDLSHAYRKLDQDYEKFLSECGLSKWGYWRGGGGTQRPEQE
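
In these alignment blocks the unlabeled middle line carries a letter where the two residues are identical, `+` where they are similar, and a space otherwise. A rough sketch of deriving a percent identity from one query/midline pair is shown below. The helper name `percent_identity` is hypothetical, and the sketch assumes the reported figure is identical columns divided by alignment length (gaps included). To compare against the table values, the segments of a hit would first have to be concatenated.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percentage of alignment columns whose midline character is a letter."""
    if not query:
        raise ValueError("empty alignment")
    # Trailing spaces are often stripped from the midline, so pad it back
    # to the alignment length given by the query line.
    padded = midline.ljust(len(query))[: len(query)]
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / len(query)


# Toy segment for illustration only (not one of the hits listed here).
print(round(percent_identity("MEVKLSQ", "MEVK+ Q"), 2))  # 71.43
```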

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CBV5 uncharacterized protein LOC103498697 isoform X2; e value: 1.0e-106; % identity: 74.65
Query:  MEVKLSQRNRADRFSLLPKLLPQPTLPSLT--HRNWANTKRSLKNQFRAIALRWRFQLQDVYKDQLSTKHHFVHVVEGSESLTSVSNQNGVPTHSIVIAN
        MEVK+ QRNRA RFS    LLP PTLPSLT   RNWANTK+S  NQ R I+LRWRFQL DV K QLSTKHHFVH++EG+ESL S+SN+NG P +SIVI N
Subjt:  MEVKLSQRNRADRFSLLPKLLPQPTLPSLT--HRNWANTKRSLKNQFRAIALRWRFQLQDVYKDQLSTKHHFVHVVEGSESLTSVSNQNGVPTHSIVIAN

Query:  RKIMDTDQEHKGQDIKIQNSRA---IRDRYQLQEKLQSSLNRLRKYKTLGILATPRLPPARTTSLIVLVPLIVFCARCIIGASYARVFRASGLKTANKRE
        +KI+DTD E KGQ+IKIQN R    IRD +QL+EKLQ++LN L+ YK L  LA+ RLPPARTTS IVLVPL++FC RCIIGASYARVF    LK  NK+E
Subjt:  RKIMDTDQEHKGQDIKIQNSRA---IRDRYQLQEKLQSSLNRLRKYKTLGILATPRLPPARTTSLIVLVPLIVFCARCIIGASYARVFRASGLKTANKRE

Query:  GERHKFRSGHWRSALRDIRELDGSDSESSIDSTSPSEEEQISVEDLSHAYRKLDQDYEKFLSECGLSKWGYWRGGGGTQRPEQE
        GERHKFRSGHWRSALRDIRELDG D E+ IDSTSPSE+EQISVEDLSHAY+KLDQDYEKFLSECGLSKWGYWR  GGTQRPEQE
Subjt:  GERHKFRSGHWRSALRDIRELDGSDSESSIDSTSPSEEEQISVEDLSHAYRKLDQDYEKFLSECGLSKWGYWRGGGGTQRPEQE

A0A5A7T6Z4 LysM domain-containing protein; e value: 2.7e-107; % identity: 75.35
Query:  MEVKLSQRNRADRFSLLPKLLPQPTLPSLT--HRNWANTKRSLKNQFRAIALRWRFQLQDVYKDQLSTKHHFVHVVEGSESLTSVSNQNGVPTHSIVIAN
        MEVK+ QRNRA RFS    LLP PTLPSLT   RNWANTK+S  NQ R I+LRWRFQL DV K QLSTKHHFVH++EG+ESL S+SN+NG P +SIVI N
Subjt:  MEVKLSQRNRADRFSLLPKLLPQPTLPSLT--HRNWANTKRSLKNQFRAIALRWRFQLQDVYKDQLSTKHHFVHVVEGSESLTSVSNQNGVPTHSIVIAN

Query:  RKIMDTDQEHKGQDIKIQNSRA---IRDRYQLQEKLQSSLNRLRKYKTLGILATPRLPPARTTSLIVLVPLIVFCARCIIGASYARVFRASGLKTANKRE
        +KI+DTD E KGQ+IKIQNSRA   IRD +QL+EKLQS+LN L+ YK L  LA+ RLPPARTTS IVLVPL++FC RCIIGASYARVF    LK  NK+E
Subjt:  RKIMDTDQEHKGQDIKIQNSRA---IRDRYQLQEKLQSSLNRLRKYKTLGILATPRLPPARTTSLIVLVPLIVFCARCIIGASYARVFRASGLKTANKRE

Query:  GERHKFRSGHWRSALRDIRELDGSDSESSIDSTSPSEEEQISVEDLSHAYRKLDQDYEKFLSECGLSKWGYWRGGGGTQRPEQE
        GERHKFRSGHWRSALRDIRELDG D E+ IDSTSPS +EQISVEDLSHAY+KLDQDYEKFLSECGLSKWGYWR  GGTQRPEQE
Subjt:  GERHKFRSGHWRSALRDIRELDGSDSESSIDSTSPSEEEQISVEDLSHAYRKLDQDYEKFLSECGLSKWGYWRGGGGTQRPEQE

A0A6J1CIZ2 uncharacterized protein LOC111012032 isoform X1; e value: 5.0e-106; % identity: 76.17
Query:  MEVKLSQRNRADRFSLLPKLLPQPTLPSLTHRNWANTKRSLKNQFRAIALRWRFQLQDVYKDQLSTKHHFVHVVEG-----SESLTSVSNQNGVPTHSIV
        ME+K+SQRNRADRFSLLPKLLPQPTLPS THR WA  KRS KNQF A+ALRWRFQLQD+ +DQ  TKHHFV +VEG      E+ TS+  QNGV THSIV
Subjt:  MEVKLSQRNRADRFSLLPKLLPQPTLPSLTHRNWANTKRSLKNQFRAIALRWRFQLQDVYKDQLSTKHHFVHVVEG-----SESLTSVSNQNGVPTHSIV

Query:  IANRKIMDTDQEHKGQDIKIQNSRAIRDRYQLQEKLQSSLNRLRKYKTLGILATPRLPPARTTSLIVLVPLIVFCARCIIGASYARVFRASGLKTANKRE
        I NRKI DTD EHKGQD KI+N  AIRD YQLQEKLQSSLN L+ YK L +  +PRLPPARTTS IVLVPLIVFCARCIIGASYARV + S LKT +K E
Subjt:  IANRKIMDTDQEHKGQDIKIQNSRAIRDRYQLQEKLQSSLNRLRKYKTLGILATPRLPPARTTSLIVLVPLIVFCARCIIGASYARVFRASGLKTANKRE

Query:  GERHKFRSGHWRSALRDIRELDGSDSESSID---STSPSEEEQISVEDLSHAYRKLDQDYEKFLSECGLSKWGYWRG
        GE HKFRSGHWRSALRDIRELDG DSESS D   S SPS +EQISVEDLSHAY+KLD+DYEKFLSECGLS  GYWRG
Subjt:  GERHKFRSGHWRSALRDIRELDGSDSESSID---STSPSEEEQISVEDLSHAYRKLDQDYEKFLSECGLSKWGYWRG

A0A6J1CK58 uncharacterized protein LOC111012032 isoform X2; e value: 3.5e-107; % identity: 76.64
Query:  MEVKLSQRNRADRFSLLPKLLPQPTLPSLTHRNWANTKRSLKNQFRAIALRWRFQLQDVYKDQLSTKHHFVHVVEG-----SESLTSVSNQNGVPTHSIV
        ME+K+SQRNRADRFSLLPKLLPQPTLPS THR WA  KRS KNQF A+ALRWRFQLQD+ +DQ  TKHHFV +VEG      E+ TS+  QNGV THSIV
Subjt:  MEVKLSQRNRADRFSLLPKLLPQPTLPSLTHRNWANTKRSLKNQFRAIALRWRFQLQDVYKDQLSTKHHFVHVVEG-----SESLTSVSNQNGVPTHSIV

Query:  IANRKIMDTDQEHKGQDIKIQNSRAIRDRYQLQEKLQSSLNRLRKYKTLGILATPRLPPARTTSLIVLVPLIVFCARCIIGASYARVFRASGLKTANKRE
        I NRKI DTD EHKGQD KI+N  AIRD YQLQEKLQSSLN L+ YK L +  +PRLPPARTTS IVLVPLIVFCARCIIGASYARV + S LKT +K E
Subjt:  IANRKIMDTDQEHKGQDIKIQNSRAIRDRYQLQEKLQSSLNRLRKYKTLGILATPRLPPARTTSLIVLVPLIVFCARCIIGASYARVFRASGLKTANKRE

Query:  GERHKFRSGHWRSALRDIRELDGSDSESSIDSTSPSEEEQISVEDLSHAYRKLDQDYEKFLSECGLSKWGYWRG
        GE HKFRSGHWRSALRDIRELDG DSESS D +SPS +EQISVEDLSHAY+KLD+DYEKFLSECGLS  GYWRG
Subjt:  GERHKFRSGHWRSALRDIRELDGSDSESSIDSTSPSEEEQISVEDLSHAYRKLDQDYEKFLSECGLSKWGYWRG

A0A6J1CKQ6 uncharacterized protein LOC111012032 isoform X3; e value: 7.0e-108; % identity: 77.57
Query:  MEVKLSQRNRADRFSLLPKLLPQPTLPSLTHRNWANTKRSLKNQFRAIALRWRFQLQDVYKDQLSTKHHFVHVVEGSESLTSVSNQNGVPTHSIVIANRK
        ME+K+SQRNRADRFSLLPKLLPQPTLPS THR WA  KRS KNQF A+ALRWRFQLQD+ +DQ  TKHHFV +VEG E+ TS+  QNGV THSIVI NRK
Subjt:  MEVKLSQRNRADRFSLLPKLLPQPTLPSLTHRNWANTKRSLKNQFRAIALRWRFQLQDVYKDQLSTKHHFVHVVEGSESLTSVSNQNGVPTHSIVIANRK

Query:  IMDTDQEHKGQDIKIQNSRAIRDRYQLQEKLQSSLNRLRKYKTLGILATPRLPPARTTSLIVLVPLIVFCARCIIGASYARVFRASGLKTANKREGERHK
        I DTD EHKGQD KI+N  AIRD YQLQEKLQSSLN L+ YK L +  +PRLPPARTTS IVLVPLIVFCARCIIGASYARV + S LKT +K EGE HK
Subjt:  IMDTDQEHKGQDIKIQNSRAIRDRYQLQEKLQSSLNRLRKYKTLGILATPRLPPARTTSLIVLVPLIVFCARCIIGASYARVFRASGLKTANKREGERHK

Query:  FRSGHWRSALRDIRELDGSDSESSID---STSPSEEEQISVEDLSHAYRKLDQDYEKFLSECGLSKWGYWRG
        FRSGHWRSALRDIRELDG DSESS D   S SPS +EQISVEDLSHAY+KLD+DYEKFLSECGLS  GYWRG
Subjt:  FRSGHWRSALRDIRELDGSDSESSID---STSPSEEEQISVEDLSHAYRKLDQDYEKFLSECGLSKWGYWRG

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G09970.1 unknown protein; e value: 1.9e-09; % identity: 25.96
Query:  QFRAIALRWRFQLQDVYKDQLSTKHHFVHVVEGSESLTSVSNQNGV----PTHSIVIANRKIMDTDQEHKGQDIKIQNSRAIRDRYQLQEKLQSSLNRLR
        +F+  + R RF +Q + +++  TKH      + SESL  +  Q GV    P  S   +    +D +++H      + +  +   +    E+L   L R+ 
Subjt:  QFRAIALRWRFQLQDVYKDQLSTKHHFVHVVEGSESLTSVSNQNGV----PTHSIVIANRKIMDTDQEHKGQDIKIQNSRAIRDRYQLQEKLQSSLNRLR

Query:  KY-KTLGILAT------PRLPPARTTSLIV-LVPLIVFCARCIIGASYARVFRASGLKTANKREGERHKFRSGHWRSALRDIRE---LDGSDSESSIDST
        KY +T+G            LP   T  L+  L+P++ FC  CIIG  +  +         +++  + H   S  WR+AL D  E    DG DS S     
Subjt:  KY-KTLGILAT------PRLPPARTTSLIV-LVPLIVFCARCIIGASYARVFRASGLKTANKREGERHKFRSGHWRSALRDIRE---LDGSDSESSIDST

Query:  SPSEEEQISVEDLSHAYRKLDQDYEKFLSECGLSK
        + + +E  + ++++ AY +++ +Y++FL ECG+ +
Subjt:  SPSEEEQISVEDLSHAYRKLDQDYEKFLSECGLSK

AT4G09970.2 unknown protein; e value: 9.7e-09; % identity: 26.67
Query:  RWRFQLQDVYKDQLSTKHHFVHVVEGSESLTSVSNQNGV----PTHSIVIANRKIMDTDQEHKGQDIKIQNSRAIRDRYQLQEKLQSSLNRLRKY-KTLG
        +W F +Q + +++  TKH      + SESL  +  Q GV    P  S   +    +D +++H      + +  +   +    E+L   L R+ KY +T+G
Subjt:  RWRFQLQDVYKDQLSTKHHFVHVVEGSESLTSVSNQNGV----PTHSIVIANRKIMDTDQEHKGQDIKIQNSRAIRDRYQLQEKLQSSLNRLRKY-KTLG

Query:  ILAT------PRLPPARTTSLIV-LVPLIVFCARCIIGASYARVFRASGLKTANKREGERHKFRSGHWRSALRDIRELDGSDSESSIDSTSPSEEEQISV
                    LP   T  L+  L+P++ FC  CIIG  +  +         +++  + H   S  WR+AL D  E   SD     DS SP   E  + 
Subjt:  ILAT------PRLPPARTTSLIV-LVPLIVFCARCIIGASYARVFRASGLKTANKREGERHKFRSGHWRSALRDIRELDGSDSESSIDSTSPSEEEQISV

Query:  EDLSHAYRKLDQDYEKFLSECGLSK
        ++++ AY +++ +Y++FL ECG+ +
Subjt:  EDLSHAYRKLDQDYEKFLSECGLSK


Sequences
CDS sequence
ATGGAAGTGAAACTCAGCCAGAGAAATAGAGCAGACCGTTTTTCTCTCCTCCCCAAGTTGCTCCCACAACCAACTCTCCCTTCTCTAACTCACAGAAATTGGGCCAACAC
CAAAAGATCACTGAAGAATCAATTCAGAGCCATTGCGCTGAGATGGAGGTTTCAACTTCAGGATGTATACAAAGATCAACTCTCCACCAAGCACCACTTTGTTCATGTTG
TCGAAGGGAGCGAGAGCTTGACTTCGGTTTCGAATCAGAATGGAGTTCCTACACATTCCATTGTCATAGCTAATAGGAAGATAATGGATACGGATCAAGAACACAAGGGG
CAGGATATCAAGATTCAAAACTCTCGAGCGATTAGAGATAGATATCAATTGCAAGAAAAGCTTCAGAGTTCTTTGAATAGACTTCGTAAATATAAAACGCTTGGCATACT
TGCTACCCCTCGACTACCTCCTGCTAGAACCACTAGTTTGATAGTTTTGGTTCCTCTTATAGTATTTTGTGCCAGATGCATTATTGGTGCCTCTTATGCTAGAGTTTTCA
GAGCATCGGGGCTTAAAACCGCTAATAAACGAGAGGGAGAACGTCACAAGTTCAGAAGTGGCCATTGGAGATCTGCTCTTCGTGATATAAGGGAATTGGATGGTTCGGAT
TCTGAGTCATCCATAGATTCAACTAGTCCTTCAGAAGAAGAACAGATCTCAGTTGAAGATTTGTCACATGCTTACAGGAAACTGGACCAGGATTACGAAAAGTTTCTATC
AGAATGTGGACTGAGTAAATGGGGCTACTGGCGTGGGGGTGGGGGTACCCAGAGACCTGAACAAGAATAG
mRNA sequence
ATGGAAGTGAAACTCAGCCAGAGAAATAGAGCAGACCGTTTTTCTCTCCTCCCCAAGTTGCTCCCACAACCAACTCTCCCTTCTCTAACTCACAGAAATTGGGCCAACAC
CAAAAGATCACTGAAGAATCAATTCAGAGCCATTGCGCTGAGATGGAGGTTTCAACTTCAGGATGTATACAAAGATCAACTCTCCACCAAGCACCACTTTGTTCATGTTG
TCGAAGGGAGCGAGAGCTTGACTTCGGTTTCGAATCAGAATGGAGTTCCTACACATTCCATTGTCATAGCTAATAGGAAGATAATGGATACGGATCAAGAACACAAGGGG
CAGGATATCAAGATTCAAAACTCTCGAGCGATTAGAGATAGATATCAATTGCAAGAAAAGCTTCAGAGTTCTTTGAATAGACTTCGTAAATATAAAACGCTTGGCATACT
TGCTACCCCTCGACTACCTCCTGCTAGAACCACTAGTTTGATAGTTTTGGTTCCTCTTATAGTATTTTGTGCCAGATGCATTATTGGTGCCTCTTATGCTAGAGTTTTCA
GAGCATCGGGGCTTAAAACCGCTAATAAACGAGAGGGAGAACGTCACAAGTTCAGAAGTGGCCATTGGAGATCTGCTCTTCGTGATATAAGGGAATTGGATGGTTCGGAT
TCTGAGTCATCCATAGATTCAACTAGTCCTTCAGAAGAAGAACAGATCTCAGTTGAAGATTTGTCACATGCTTACAGGAAACTGGACCAGGATTACGAAAAGTTTCTATC
AGAATGTGGACTGAGTAAATGGGGCTACTGGCGTGGGGGTGGGGGTACCCAGAGACCTGAACAAGAATAG
Protein sequence
MEVKLSQRNRADRFSLLPKLLPQPTLPSLTHRNWANTKRSLKNQFRAIALRWRFQLQDVYKDQLSTKHHFVHVVEGSESLTSVSNQNGVPTHSIVIANRKIMDTDQEHKG
QDIKIQNSRAIRDRYQLQEKLQSSLNRLRKYKTLGILATPRLPPARTTSLIVLVPLIVFCARCIIGASYARVFRASGLKTANKREGERHKFRSGHWRSALRDIRELDGSD
SESSIDSTSPSEEEQISVEDLSHAYRKLDQDYEKFLSECGLSKWGYWRGGGGTQRPEQE
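
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1) and an input containing only A, C, G and T. The name `translate` is hypothetical and is not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS up to (not including) the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last codons of the CDS listed above.
print(translate("ATGGAAGTGAAACTCAGCCAG"))  # MEVKLSQ
print(translate("CAAGAATAG"))              # QE
```

Passing the full CDS, with its line breaks, should yield a string that can be compared directly with the protein sequence once the protein's own line breaks are removed.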