CuGenDBv2

Lsi01G009120 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi01G009120
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Plant protein of unknown function (DUF863)
Genome location: chr01:7392850..7395979
RNA-Seq Expression: Lsi01G009120
Synteny: Lsi01G009120
Gene Ontology terms: NA
InterPro domains: NA
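
The genome location above is given as a `chrom:start..end` string. As a minimal sketch, it can be split into its parts and the span length computed as follows. The helper names `parse_location` and `span_length` are hypothetical, and the length assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


chrom, start, end = parse_location("chr01:7392850..7395979")
print(chrom, start, end, span_length(start, end))
```

Under that coordinate assumption the gene spans 3130 bp on chr01.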


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG7034303.1 hypothetical protein SDJN02_04030 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 5.5e-96; % identity: 87.21)
Query:  DCFGDSRFKAASDKKVKGAEIPMEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNQESWDKRNEICFRQIYEQDAKNYY
        DC GDSRF  AS K+ KGAEIPMEKLPKA+EKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSR+SGWN +SWDKRNEICFRQIYEQDAKNY 
Subjt:  DCFGDSRFKAASDKKVKGAEIPMEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNQESWDKRNEICFRQIYEQDAKNYY

Query:  RS-THTTKLDMEQPAEDEPEAINGALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRRCTDTKQVKGQEMAVLGVTENSSGYQNGSNRGEKK
        RS T TTKLD+EQPAEDE EA NGALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRR  D KQV+GQEMAVLGV EN SG+QNGS+RGEKK
Subjt:  RS-THTTKLDMEQPAEDEPEAINGALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRRCTDTKQVKGQEMAVLGVTENSSGYQNGSNRGEKK

Query:  M--LDYPPWLFQVVSLNMT
        M  +DYPPWLFQ VSLNMT
Subjt:  M--LDYPPWLFQVVSLNMT

XP_008440360.1 PREDICTED: uncharacterized protein LOC103484836 [Cucumis melo] (e value: 8.3e-92; % identity: 92.78)
Query:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNQESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPAEDEPEAIN
        MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRE+    ESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPAEDEPE  N
Subjt:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNQESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPAEDEPEAIN

Query:  GALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRRCTDTKQVKGQEMAVLGVTENSSGYQNGSNRGEKKMLDYPPWLFQVVSLNMT
        GA QIINE ELELTLGPSSYNTSDSG TT+SSSSTGSSHEGRRCTDTKQVKGQEMA LGVTENSSG QNG+NRGEKKMLDYPPWLFQVVSLNMT
Subjt:  GALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRRCTDTKQVKGQEMAVLGVTENSSGYQNGSNRGEKKMLDYPPWLFQVVSLNMT

XP_022949937.1 uncharacterized protein LOC111453182 [Cucurbita moschata] (e value: 1.4e-86; % identity: 88.83)
Query:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNQESWDKRNEICFRQIYEQDAKNYYRS-THTTKLDMEQPAEDEPEAI
        MEKLPKA+EKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSR+SGWN +SWDKRNEICFRQIYEQDAKNY RS T TTKLD+EQPAEDE EA 
Subjt:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNQESWDKRNEICFRQIYEQDAKNYYRS-THTTKLDMEQPAEDEPEAI

Query:  NGALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRRCTDTKQVKGQEMAVLGVTENSSGYQNGSNRGEKKM--LDYPPWLFQVVSLNMT
        NGALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRR  D KQV+GQEMAVLGV EN SG+QNGS+RGEKKM  +DYPPWLFQ VSLNMT
Subjt:  NGALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRRCTDTKQVKGQEMAVLGVTENSSGYQNGSNRGEKKM--LDYPPWLFQVVSLNMT

XP_038883305.1 uncharacterized protein LOC120074293 isoform X1 [Benincasa hispida] (e value: 7.2e-104; % identity: 96.06)
Query:  KKVKGAEIPMEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNQESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQP
        KKVKGAEIPMEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWN ESWDKRNEICFRQIYEQDAKNYYRSTH TKLDMEQP
Subjt:  KKVKGAEIPMEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNQESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQP

Query:  AEDEPEAINGALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRRCTDTKQVKGQEMAVLGVTENSSGYQNGSNRGEKKMLDYPPWLFQVVSL
        AE EPEAINGALQIINENELELTLGPSSYNTSDSG+TTHSSSSTGSSHEGRRCTD++QVKGQEM VLGVTENSSGY+NGSNRGEKKMLDYPPWLFQVVSL
Subjt:  AEDEPEAINGALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRRCTDTKQVKGQEMAVLGVTENSSGYQNGSNRGEKKMLDYPPWLFQVVSL

Query:  NMT
        NMT
Subjt:  NMT

XP_038883306.1 uncharacterized protein LOC120074293 isoform X2 [Benincasa hispida] (e value: 1.2e-98; % identity: 95.88)
Query:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNQESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPAEDEPEAIN
        MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWN ESWDKRNEICFRQIYEQDAKNYYRSTH TKLDMEQPAE EPEAIN
Subjt:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNQESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPAEDEPEAIN

Query:  GALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRRCTDTKQVKGQEMAVLGVTENSSGYQNGSNRGEKKMLDYPPWLFQVVSLNMT
        GALQIINENELELTLGPSSYNTSDSG+TTHSSSSTGSSHEGRRCTD++QVKGQEM VLGVTENSSGY+NGSNRGEKKMLDYPPWLFQVVSLNMT
Subjt:  GALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRRCTDTKQVKGQEMAVLGVTENSSGYQNGSNRGEKKMLDYPPWLFQVVSLNMT

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0KIH1 Uncharacterized protein (e value: 1.1e-86; % identity: 90.26)
Query:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNQESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPAEDEPEA-I
        MEKLPKAYEKEYMRMAMLKHEETFKQQV ELHRLYRTQKTLMKNVEKSRE+    ESWDKRNEICFRQIYEQDAKNYYRST TTKLDMEQPAEDEPE   
Subjt:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNQESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPAEDEPEA-I

Query:  NGALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRRCTDTKQVKGQEMAVLGVTENSSGYQNGSNRGEKKMLDYPPWLFQVVSLNMT
        NGA QIINE ELELTLGPSSYNTSDSG TT+SSSSTGSSH+ RRCTDTKQVKGQEMA LGVTENSSG QNG+NRGEKKMLDYPPWLFQVVSLNMT
Subjt:  NGALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRRCTDTKQVKGQEMAVLGVTENSSGYQNGSNRGEKKMLDYPPWLFQVVSLNMT

A0A1S3B1P5 uncharacterized protein LOC103484836 (e value: 4.0e-92; % identity: 92.78)
Query:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNQESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPAEDEPEAIN
        MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRE+    ESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPAEDEPE  N
Subjt:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNQESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPAEDEPEAIN

Query:  GALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRRCTDTKQVKGQEMAVLGVTENSSGYQNGSNRGEKKMLDYPPWLFQVVSLNMT
        GA QIINE ELELTLGPSSYNTSDSG TT+SSSSTGSSHEGRRCTDTKQVKGQEMA LGVTENSSG QNG+NRGEKKMLDYPPWLFQVVSLNMT
Subjt:  GALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRRCTDTKQVKGQEMAVLGVTENSSGYQNGSNRGEKKMLDYPPWLFQVVSLNMT

A0A6J1BU22 uncharacterized protein LOC111005359 (e value: 1.4e-81; % identity: 84.42)
Query:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSR--ESGWNQESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPAEDEPEA
        MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSR  ++GWN ESWDKRNEICFRQIYEQDAK+YY+STHTTKLD+EQPAEDE   
Subjt:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSR--ESGWNQESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPAEDEPEA

Query:  I--NGALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRRCTDTKQVKGQEMAVLGVTENSSGYQ-NGSNRGEKKMLDYPPWLFQVVSLNMT
           NG LQIINENE+ELTLGPSSYNTSDSG+T  SSSSTGSSHEGRR  D+KQVKGQEM VLG TE SSGYQ N SNR ++K+LDYPPWLFQVVSLNMT
Subjt:  I--NGALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRRCTDTKQVKGQEMAVLGVTENSSGYQ-NGSNRGEKKMLDYPPWLFQVVSLNMT

A0A6J1GEC8 uncharacterized protein LOC111453182 (e value: 6.6e-87; % identity: 88.83)
Query:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNQESWDKRNEICFRQIYEQDAKNYYRS-THTTKLDMEQPAEDEPEAI
        MEKLPKA+EKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSR+SGWN +SWDKRNEICFRQIYEQDAKNY RS T TTKLD+EQPAEDE EA 
Subjt:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNQESWDKRNEICFRQIYEQDAKNYYRS-THTTKLDMEQPAEDEPEAI

Query:  NGALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRRCTDTKQVKGQEMAVLGVTENSSGYQNGSNRGEKKM--LDYPPWLFQVVSLNMT
        NGALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRR  D KQV+GQEMAVLGV EN SG+QNGS+RGEKKM  +DYPPWLFQ VSLNMT
Subjt:  NGALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRRCTDTKQVKGQEMAVLGVTENSSGYQNGSNRGEKKM--LDYPPWLFQVVSLNMT

A0A6J1IUL1 uncharacterized protein LOC111478714 (e value: 6.2e-85; % identity: 87.31)
Query:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNQESWDKRNEICFRQIYEQDAKNYYRS-THTTKLDMEQPAEDEPEAI
        MEKLPKA+EKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSR+SGWN +SWDKRNEICFRQIYEQDAKNY RS T TTKLD+EQPAEDE EA 
Subjt:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNQESWDKRNEICFRQIYEQDAKNYYRS-THTTKLDMEQPAEDEPEAI

Query:  NGALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRRCTDTKQVKGQEMAVLGVTENSSGYQNGSNRGEKKM--LDYPPWLFQVVSLNMT
        NGALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRR  D  QV+ QEMAVLGV EN SG+QNGS+RGEKKM  +DYPPWLFQ VSLN+T
Subjt:  NGALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRRCTDTKQVKGQEMAVLGVTENSSGYQNGSNRGEKKM--LDYPPWLFQVVSLNMT

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G26620.1 Plant protein of unknown function (DUF863) (e value: 8.8e-07; % identity: 61.54)
Query:  YEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVE
        YEK++M+  ML+HE  FK QVHELHRLYR QK L++ V+
Subjt:  YEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVE

AT1G69360.1 Plant protein of unknown function (DUF863) (e value: 1.1e-06; % identity: 57.5)
Query:  AYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVE
        +YE+++++  ML+HE  FK QV+ELHRLYRTQK+LM  V+
Subjt:  AYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVE

AT5G57340.2 unknown protein (e value: 3.7e-05; % identity: 25.87)
Query:  LNTQFIMALASGTSRKYKIDQTGPLDCFGDSRFKAA----SDKKVKGAEI--PMEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEK
        + T+ +  + +  S    +D +G +    + R +++    +D++ +G  I   M++  +    E +R  M   E+ FKQQV ELHR+Y TQK +M  + K
Subjt:  LNTQFIMALASGTSRKYKIDQTGPLDCFGDSRFKAA----SDKKVKGAEI--PMEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEK

Query:  SRESGWNQESWD-----KRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPAEDEPEAINGALQIINENELELTLGPSSYNTSDSGITTHS---SSSTGSSH
         R   W   + D     +R   C       D +N  R+T TT   +E+                +E EL L++G SS +T+ +  T      SS+T    
Subjt:  SRESGWNQESWD-----KRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPAEDEPEAINGALQIINENELELTLGPSSYNTSDSGITTHS---SSSTGSSH

Query:  EGRRCTDTKQVKGQEMAVLGVTENSSGYQNGSNRGEKKMLD----YPPWLFQVVSLNMT
            C +  Q            + SSG     +      LD     P WLFQ +S+N T
Subjt:  EGRRCTDTKQVKGQEMAVLGVTENSSGYQNGSNRGEKKMLD----YPPWLFQVVSLNMT

AT5G67390.1 unknown protein (e value: 2.5e-14; % identity: 36.72)
Query:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNQESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPAEDEPEAIN
        MEKL   Y+K+ M+MAMLKHEETFKQQV+ELHRLY+ QK LMKN+E ++ +        K N +           N    T   ++D E          N
Subjt:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNQESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPAEDEPEAIN

Query:  GALQIINENELELTLGPSSY----------------------NTSDSGITTHSSSSTGSSHEGRRCTDTKQVKGQEM
          ++I++E+E+ELTLGPS Y                       + +SG  + SSSSTGSS+      + +QV+ + M
Subjt:  GALQIINENELELTLGPSSY----------------------NTSDSGITTHSSSSTGSSHEGRRCTDTKQVKGQEM

AT5G67390.2 unknown protein (e value: 2.5e-14; % identity: 36.72)
Query:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNQESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPAEDEPEAIN
        MEKL   Y+K+ M+MAMLKHEETFKQQV+ELHRLY+ QK LMKN+E ++ +        K N +           N    T   ++D E          N
Subjt:  MEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNQESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPAEDEPEAIN

Query:  GALQIINENELELTLGPSSY----------------------NTSDSGITTHSSSSTGSSHEGRRCTDTKQVKGQEM
          ++I++E+E+ELTLGPS Y                       + +SG  + SSSSTGSS+      + +QV+ + M
Subjt:  GALQIINENELELTLGPSSY----------------------NTSDSGITTHSSSSTGSSHEGRRCTDTKQVKGQEM
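
In the alignments above, the unlabelled middle line marks identical residues with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. The sketch below recomputes % identity as identical columns divided by alignment length, using the Query line and middle line of the AT1G26620.1 hit. The function name `percent_identity` is hypothetical, and the formula is the usual BLAST convention rather than something this record documents.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity = identical columns / alignment length * 100."""
    if len(query) != len(midline):
        raise ValueError("query and midline must cover the same columns")
    # A letter in the midline means query and subject share that residue.
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / len(midline)


# Query line and middle line copied from the AT1G26620.1 alignment.
query = "YEKEYMRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVE"
midline = "YEK++M+  ML+HE  FK QVHELHRLYR QK L++ V+"

print(round(percent_identity(query, midline), 2))  # 61.54
```

Here 24 of 39 columns are identical, which agrees with the 61.54 listed for that hit.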


Sequences
CDS sequence
ATGATTGCAAACAAAAGACCAATGGAAAAGGCTTTATCACAAGTTTTTTTTTATTCGTATTCGGTTGTTCTGTCCTCATTATTGCTGCTGAATACCCAATTCATAATGGC
GTTGGCTAGTGGTACTTCAAGGAAATATAAAATTGATCAAACCGGACCTTTAGACTGTTTTGGGGATTCAAGATTCAAGGCTGCATCAGATAAAAAAGTAAAGGGGGCTG
AAATCCCAATGGAAAAGCTGCCTAAGGCATATGAAAAGGAGTACATGAGGATGGCCATGTTAAAGCATGAAGAAACATTCAAACAACAGGTACATGAACTTCATCGACTT
TATCGAACACAAAAGACCTTAATGAAAAACGTAGAGAAAAGCAGGGAAAGTGGATGGAATCAAGAGAGTTGGGACAAAAGGAATGAAATATGTTTCAGACAAATCTATGA
ACAGGATGCAAAAAATTATTACAGATCAACTCATACGACCAAACTAGACATGGAACAGCCCGCTGAAGATGAACCAGAAGCCATTAATGGAGCTCTGCAGATCATAAACG
AGAATGAACTCGAACTAACTCTAGGGCCTTCAAGTTACAATACTTCAGATTCAGGAATAACCACCCACTCTTCTTCTTCAACAGGGTCCAGCCATGAGGGAAGAAGATGT
ACGGATACAAAGCAAGTTAAAGGTCAAGAAATGGCGGTTTTGGGGGTAACTGAAAATTCCTCAGGCTACCAAAATGGAAGTAATAGAGGAGAGAAGAAGATGCTAGATTA
TCCTCCTTGGCTTTTTCAAGTTGTGAGCCTTAACATGACTTAA
mRNA sequence
TAAAGTCGTTATCAAAATTAGGCTAAAAGTATCAAAAGAAAAACCAAAATAAAATACGTAGAGAGAAAAAAGGAAAAAGGAAAAAAAAAAGAGGTCAATGAAGAGAGAAG
TGGATGTGAATTATATATACCAAGTACTTGGAATCTCGTGCACTGCTGGTGTTGCCTATTTCTGAAGCATATCTTTCACGAAAGTTGAAGGAGATTGGCTGGAATCTGAC
CCCAAACATCGCAGAACTTGGTATTCTGATAAATGGCAAATACATGCAAACTCTGCAATTCCGTCTGCTGATCGCTAGTCGTTACCATAACGGATTGTTTCAGGTAATTC
CGAAGTTCCCCTTCGGTTTTTTGTTACTTTCTTGATCGATCTGTATGTTTAATTCGATATTCGATCCAGCTTACACATCGTAATTTGTAATTAGAGATGGTTTTGGTTTA
ATTTTCATTTGGTTTTTTGAATGTATTACAGAGTTTCTGTGAAAGGGAATTATCGGTATTATTATTATTTTTGTGGGGCTGGGGGAAGCGCATTAAGTGCTCGACGTAAT
GACTGAATGAGAAAGGGATATTTAGGATGATTGCAAACAAAAGACCAATGGAAAAGGCTTTATCACAAGTTTTTTTTTATTCGTATTCGGTTGTTCTGTCCTCATTATTG
CTGCTGAATACCCAATTCATAATGGCGTTGGCTAGTGGTACTTCAAGGAAATATAAAATTGATCAAACCGGACCTTTAGACTGTTTTGGGGATTCAAGATTCAAGGCTGC
ATCAGATAAAAAAGTAAAGGGGGCTGAAATCCCAATGGAAAAGCTGCCTAAGGCATATGAAAAGGAGTACATGAGGATGGCCATGTTAAAGCATGAAGAAACATTCAAAC
AACAGGTACATGAACTTCATCGACTTTATCGAACACAAAAGACCTTAATGAAAAACGTAGAGAAAAGCAGGGAAAGTGGATGGAATCAAGAGAGTTGGGACAAAAGGAAT
GAAATATGTTTCAGACAAATCTATGAACAGGATGCAAAAAATTATTACAGATCAACTCATACGACCAAACTAGACATGGAACAGCCCGCTGAAGATGAACCAGAAGCCAT
TAATGGAGCTCTGCAGATCATAAACGAGAATGAACTCGAACTAACTCTAGGGCCTTCAAGTTACAATACTTCAGATTCAGGAATAACCACCCACTCTTCTTCTTCAACAG
GGTCCAGCCATGAGGGAAGAAGATGTACGGATACAAAGCAAGTTAAAGGTCAAGAAATGGCGGTTTTGGGGGTAACTGAAAATTCCTCAGGCTACCAAAATGGAAGTAAT
AGAGGAGAGAAGAAGATGCTAGATTATCCTCCTTGGCTTTTTCAAGTTGTGAGCCTTAACATGACTTAATTTTCAAGGTATAAAGAAAACCAATTCATTTCAATCAAATT
ATATGAGAGCATCTTTGTTTGCTTCTCATTCACTAATCAAATTGATGCTTTAGTTGTATTGAGCTTCCTTCTTCATCTTGCTTGTATATTGAGATGTTGATGTGTATTAC
TTTGTAAATTTTAACTTTGAGAACAGAATCACTTTTGAATTTCAAGCTATGATTAATGTGGATTGACAGAGAGCTAAGGGTGAATTAGACATATTTTTTAGAATGG
Protein sequence
MIANKRPMEKALSQVFFYSYSVVLSSLLLLNTQFIMALASGTSRKYKIDQTGPLDCFGDSRFKAASDKKVKGAEIPMEKLPKAYEKEYMRMAMLKHEETFKQQVHELHRL
YRTQKTLMKNVEKSRESGWNQESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPAEDEPEAINGALQIINENELELTLGPSSYNTSDSGITTHSSSSTGSSHEGRRC
TDTKQVKGQEMAVLGVTENSSGYQNGSNRGEKKMLDYPPWLFQVVSLNMT
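
The protein sequence corresponds to the CDS read codon by codon. As a rough check, the sketch below translates the first 21 and last 12 nucleotides of the CDS with the standard genetic code (NCBI translation table 1). Using the standard table is an assumption, since the record does not name one, and `translate` is a hypothetical helper.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(translate("ATGATTGCAAACAAAAGACCA"))  # start of the CDS -> MIANKRP
print(translate("AACATGACTTAA"))           # end of the CDS   -> NMT*
```

Both fragments match the start (MIANKRP) and end (NMT) of the protein sequence, with the final TAA read as the stop codon.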