CuGenDBv2

MELO3C025210 (gene) of Melon (DHL92) v4 genome

Gene ID: MELO3C025210
Organism: Cucumis melo DHL92 (Melon (DHL92) v4)
Description: 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
Genome location: chr08:26854219..26856172
RNA-Seq Expression: MELO3C025210
Synteny: MELO3C025210
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR027443 - Isopenicillin N synthase-like superfamily
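
The genome location string can be split into a chromosome name and a pair of coordinates. The following is a minimal sketch, assuming the `chrom:start..end` form seen in this record and assuming 1-based inclusive coordinates (the page itself does not state the convention). The function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr08:26854219..26856172")
print(chrom, start, end, span_length(start, end))  # chr08 26854219 26856172 1954
```

Under the inclusive-coordinate assumption the annotated gene region spans 1954 bp; under a half-open convention it would be one base shorter.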


Homology
GenBank top hits (e value, % identity, alignment)
XP_008463252.1 PREDICTED: uncharacterized protein LOC103501456 [Cucumis melo] (e value: 1.6e-160; % identity: 100)
Query:  MPPPSTVVVRKMALLRTKSRLTIPAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSLPESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEF
        MPPPSTVVVRKMALLRTKSRLTIPAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSLPESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEF
Subjt:  MPPPSTVVVRKMALLRTKSRLTIPAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSLPESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEF

Query:  GAFRIVNHGISGEEVLSVVNEAKSVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGFVG
        GAFRIVNHGISGEEVLSVVNEAKSVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGFVG
Subjt:  GAFRIVNHGISGEEVLSVVNEAKSVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGFVG

Query:  ENVEKLGEKKETIFSIYRYHHHPNDLFERKKDHNTKFSKNERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQFQ
        ENVEKLGEKKETIFSIYRYHHHPNDLFERKKDHNTKFSKNERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQFQ
Subjt:  ENVEKLGEKKETIFSIYRYHHHPNDLFERKKDHNTKFSKNERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQFQ

XP_011653719.1 uncharacterized protein LOC101207912 [Cucumis sativus] (e value: 2.3e-135; % identity: 84.56)
Query:  MPPPSTVVVRKMALLRTKSRLTIPAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSLPESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEF
        MPPPST+ +RKM+LLRT+S LTIPAPPPSPIPT TGSRSA NETFKTFL+NSTHLPQLSLPESRF S  N  PAV+DF+SLVSSGC + VARMLRSV+EF
Subjt:  MPPPSTVVVRKMALLRTKSRLTIPAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSLPESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEF

Query:  GAFRIVNHGISGEEVLSVVNEAK--SVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGF
        GAFRIVNHGISGEEVLSVVN+AK  SVLEDSNKGVDDR WDGDDGNREAILQVRR NDSEVSGNTVVEAETNREIS+KMEKIRRKLEGIGEKLSEILCGF
Subjt:  GAFRIVNHGISGEEVLSVVNEAK--SVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGF

Query:  VGENVEKLGEKKETIFSIYRYHHH---PNDLFERKKDHNTKFSKNERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQFQ
        +GENVEKLG+KKET+FSIYRY+++   PND+FER+ DHNTK SK+ERE DE VMMKLEIPGEHCQFYV+YSC QQKQY+ CFDAAADTIVVTIGKQFQ
Subjt:  VGENVEKLGEKKETIFSIYRYHHH---PNDLFERKKDHNTKFSKNERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQFQ

XP_022950423.1 uncharacterized protein LOC111453527 [Cucurbita moschata] (e value: 1.5e-86; % identity: 67.49)
Query:  MALLRTKSRLTIPAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSLPESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEFGAFRIVNHGIS
        MAL RTKSRLTIPAPPPSPIPT TGSRSA NETFK FLE S HLPQLSLPESRF S  N + AV+DFRSL S   G+A ARMLRSVNEFGAFRIVNHGIS
Subjt:  MALLRTKSRLTIPAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSLPESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEFGAFRIVNHGIS

Query:  GEEVLSVVNEAKSVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGFVGENVEKLGEKKE
        GEE+LSVVNEAKSV ED     DDR W GD GNRE + QVRR NDSE S NTVV+A TNR+ISEKMEKIR KLEGI EK+SE L   +GEN++K G+KKE
Subjt:  GEEVLSVVNEAKSVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGFVGENVEKLGEKKE

Query:  TIFSIYRY---HHHPNDLFERKKDHNTKFSKNERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQ
        TIFSIYRY   H +PN                ERE+  K MM L IP EHCQF +N     Q+  S  FDAAADTIVVT+G+Q
Subjt:  TIFSIYRY---HHHPNDLFERKKDHNTKFSKNERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQ

XP_022978219.1 uncharacterized protein LOC111478267 [Cucurbita maxima] (e value: 2.6e-86; % identity: 66.79)
Query:  MALLRTKSRLTIPAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSLPESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEFGAFRIVNHGIS
        MAL RTKSRLTIPAPPPSPIPT TGSRSA NETFK FLE S HLPQLSLPESRF S  N + AV+DFRSL S   G+A ARMLRS NEFGAFRIVNHGIS
Subjt:  MALLRTKSRLTIPAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSLPESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEFGAFRIVNHGIS

Query:  GEEVLSVVNEAKSVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGFVGENVEKLGEKKE
        GEE+LSVVNEAKSV ED     DDR W GD GNR+ + QVRR NDS+ S  TVV+A TNR+ISEKMEKIR KLEGI EK+SE L   +GEN++K G+KKE
Subjt:  GEEVLSVVNEAKSVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGFVGENVEKLGEKKE

Query:  TIFSIYRYHHHPNDLFERKKDHNTKFSKNERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQ
        TIFSIYRY++H N             +  ER++  K MM L IPGEHCQF +N    QQ   S  FDAAADTIVVT+G+Q
Subjt:  TIFSIYRYHHHPNDLFERKKDHNTKFSKNERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQ

XP_038881407.1 uncharacterized protein LOC120072944, partial [Benincasa hispida] (e value: 9.1e-100; % identity: 74.28)
Query:  LTIPAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSLPESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEFGAFRIVNHGISGEEVLSVVN
        + IPAPPPSPIPT TGSRSA NETFKTFLE S HLPQLSLPESRF SG N TPAVVDFRSLVS G GEA ARMLRSVNEFGAFRIVNHGISGEE+LSVVN
Subjt:  LTIPAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSLPESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEFGAFRIVNHGISGEEVLSVVN

Query:  EAKSVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGFVGENVEKLGEKKETIFSIYRYH
        EAKSVLED NKGVDDR W  +DGNREAILQ+RR NDS+ S NT+V AETNR+IS KME+IR KLEGI EKLSEIL   +GENV+K  +KKE IFSIYRY+
Subjt:  EAKSVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGFVGENVEKLGEKKETIFSIYRYH

Query:  HHPNDLFERKKDHNTKFSKN---ERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQFQ
        ++ N + ER+ ++N   +KN   ERE+D   MM+L IPGEHCQFYVN    QQ+Q S CFDAAADTIVVTIGKQ Q
Subjt:  HHPNDLFERKKDHNTKFSKN---ERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQFQ

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CIU6 uncharacterized protein LOC103501456 (e value: 7.6e-161; % identity: 100)
Query:  MPPPSTVVVRKMALLRTKSRLTIPAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSLPESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEF
        MPPPSTVVVRKMALLRTKSRLTIPAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSLPESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEF
Subjt:  MPPPSTVVVRKMALLRTKSRLTIPAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSLPESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEF

Query:  GAFRIVNHGISGEEVLSVVNEAKSVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGFVG
        GAFRIVNHGISGEEVLSVVNEAKSVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGFVG
Subjt:  GAFRIVNHGISGEEVLSVVNEAKSVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGFVG

Query:  ENVEKLGEKKETIFSIYRYHHHPNDLFERKKDHNTKFSKNERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQFQ
        ENVEKLGEKKETIFSIYRYHHHPNDLFERKKDHNTKFSKNERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQFQ
Subjt:  ENVEKLGEKKETIFSIYRYHHHPNDLFERKKDHNTKFSKNERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQFQ

A0A314Y0U8 Uncharacterized protein (e value: 8.7e-40; % identity: 42.01)
Query:  MALLRTKSRLTIPAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSLP---ESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEFGAFRIVNH
        MA++R +SRLT  APPPSPIPTA GSRSA NE F  FL+    +P L+ P      F+   +  PA VD RSL S    +A+AR+L S  EFGAFRI NH
Subjt:  MALLRTKSRLTIPAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSLP---ESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEFGAFRIVNH

Query:  GISGEEVLSVVNEAKSVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGFVGENVEKLGE
        GIS EE+ SVV EA+SV    N G   RR+    GNRE I  VR       SG  VVE E  R   + MEK+  K+E I E++SE+L     ++VEK   
Subjt:  GISGEEVLSVVNEAKSVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGFVGENVEKLGE

Query:  KKETIFSIYRYHHHPNDLFERKKDH---NTKFSKNERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQFQ
         +     +YRY+H  + + +    +   N   + N     E   + L +P EH QF +     + +  SLCFDA  +T+VVT+G Q +
Subjt:  KKETIFSIYRYHHHPNDLFERKKDH---NTKFSKNERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQFQ

A0A6J1CSG7 uncharacterized protein LOC111013795 (e value: 7.1e-74; % identity: 59.86)
Query:  MALLRTKSRLTIPAPPPSPIPTATGSRSAGNETFKTFLE-NSTHLPQLSLPESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEFGAFRIVNHGI
        MALLRTKSRLTIPAPPPSPIPT TGSRSA NETFK FLE  S  LPQLSLPESRF SG N  PA++D+R L++S  G+AVARMLRS  EFGAFRIVNHGI
Subjt:  MALLRTKSRLTIPAPPPSPIPTATGSRSAGNETFKTFLE-NSTHLPQLSLPESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEFGAFRIVNHGI

Query:  SGEEVLSVVNEAKSVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGFVGE------NVE
        SGEE+LSVV +AKS+LEDS+        + +DG R AI+QVRR      S ++V   E  R  S +MEK+ RK+EGIGEKLSEIL   +GE       V+
Subjt:  SGEEVLSVVNEAKSVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGFVGE------NVE

Query:  KLGEKKETIFSIYRYHHHPNDLFERKKDHNTKFSKNERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQFQ
        K   +KE I SI+RY+++  + F    D   +    ERESDE VMM L IP EHCQF VN       Q S CFD+AADTIVVTIGKQ Q
Subjt:  KLGEKKETIFSIYRYHHHPNDLFERKKDHNTKFSKNERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQFQ

A0A6J1GEV0 uncharacterized protein LOC111453527 (e value: 7.3e-87; % identity: 67.49)
Query:  MALLRTKSRLTIPAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSLPESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEFGAFRIVNHGIS
        MAL RTKSRLTIPAPPPSPIPT TGSRSA NETFK FLE S HLPQLSLPESRF S  N + AV+DFRSL S   G+A ARMLRSVNEFGAFRIVNHGIS
Subjt:  MALLRTKSRLTIPAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSLPESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEFGAFRIVNHGIS

Query:  GEEVLSVVNEAKSVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGFVGENVEKLGEKKE
        GEE+LSVVNEAKSV ED     DDR W GD GNRE + QVRR NDSE S NTVV+A TNR+ISEKMEKIR KLEGI EK+SE L   +GEN++K G+KKE
Subjt:  GEEVLSVVNEAKSVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGFVGENVEKLGEKKE

Query:  TIFSIYRY---HHHPNDLFERKKDHNTKFSKNERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQ
        TIFSIYRY   H +PN                ERE+  K MM L IP EHCQF +N     Q+  S  FDAAADTIVVT+G+Q
Subjt:  TIFSIYRY---HHHPNDLFERKKDHNTKFSKNERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQ

A0A6J1ITI2 uncharacterized protein LOC111478267 (e value: 1.2e-86; % identity: 66.79)
Query:  MALLRTKSRLTIPAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSLPESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEFGAFRIVNHGIS
        MAL RTKSRLTIPAPPPSPIPT TGSRSA NETFK FLE S HLPQLSLPESRF S  N + AV+DFRSL S   G+A ARMLRS NEFGAFRIVNHGIS
Subjt:  MALLRTKSRLTIPAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSLPESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEFGAFRIVNHGIS

Query:  GEEVLSVVNEAKSVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGFVGENVEKLGEKKE
        GEE+LSVVNEAKSV ED     DDR W GD GNR+ + QVRR NDS+ S  TVV+A TNR+ISEKMEKIR KLEGI EK+SE L   +GEN++K G+KKE
Subjt:  GEEVLSVVNEAKSVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGFVGENVEKLGEKKE

Query:  TIFSIYRYHHHPNDLFERKKDHNTKFSKNERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQ
        TIFSIYRY++H N             +  ER++  K MM L IPGEHCQF +N    QQ   S  FDAAADTIVVT+G+Q
Subjt:  TIFSIYRYHHHPNDLFERKKDHNTKFSKNERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQ

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G38500.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 1.2e-33; % identity: 36.45)
Query:  MALLRTKSRLTI-----PAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSLPES----RFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEFGA
        MAL+RT+S+L +     P PPPSPIP A GSR A +E     +E S  +P+L+LPES          +  PA +DFR L S   G +V R++RS  EFGA
Subjt:  MALLRTKSRLTI-----PAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSLPES----RFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEFGA

Query:  FRIVNHGISGEEVLSVVNEAKSVLEDSNKGVDDRRWDGDD----GNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGF
        FR+  HGISGEE+ S+V E+  V      GV + R  G      GNR+ I+ VR   +        +  E  R  S++ME +  KLE I  KL +I+   
Subjt:  FRIVNHGISGEEVLSVVNEAKSVLEDSNKGVDDRRWDGDD----GNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGF

Query:  VGENVEKLGEKK----ETIFSIYRYHHHPNDLFERKKDHNTKFSKNERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQFQ
          EN  +  +KK    E++ S+YRY+H      E   + +    K   E      + L +P ++C+F VN       +  L F A  DTI+VT G+Q +
Subjt:  VGENVEKLGEKK----ETIFSIYRYHHHPNDLFERKKDHNTKFSKNERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQFQ


Sequences
CDS sequence
ATGCCGCCACCCTCTACCGTCGTCGTTAGAAAAATGGCTCTCTTACGCACAAAATCCCGTTTAACAATCCCAGCTCCCCCACCCTCCCCAATCCCAACCGCCACC
GGATCCCGTTCCGCCGGCAACGAAACCTTCAAAACCTTCCTCGAGAACTCCACTCACCTCCCCCAGCTCTCTTTGCCGGAATCCCGCTTCGCCTCCGGCTTCAAT
ACTACTCCCGCCGTCGTCGATTTCCGATCGCTAGTTTCTTCCGGCTGCGGTGAAGCCGTGGCGCGGATGCTTCGGTCCGTCAATGAATTTGGGGCGTTTCGGATC
GTTAATCATGGGATATCCGGGGAGGAGGTTCTGTCGGTGGTGAATGAAGCTAAATCTGTATTGGAAGATAGTAATAAGGGAGTTGATGATCGGAGATGGGACGGG
GACGACGGAAATCGCGAGGCGATTTTGCAGGTGCGGCGGCCGAATGACAGCGAGGTGTCGGGAAATACAGTTGTGGAGGCCGAAACGAACCGGGAAATCAGCGAA
AAGATGGAGAAAATAAGAAGGAAACTAGAAGGCATCGGAGAGAAATTAAGTGAGATATTATGTGGATTCGTGGGAGAGAATGTGGAGAAATTAGGAGAGAAAAAA
GAGACAATTTTTAGCATCTACAGATATCATCATCATCCAAATGATCTGTTTGAAAGGAAAAAGGATCATAATACTAAATTTTCAAAAAATGAGAGAGAAAGTGAT
GAGAAAGTGATGATGAAGCTTGAAATTCCAGGAGAACATTGCCAATTTTATGTGAATTATTCTTGTCAGCAACAAAAACAATACTCACTTTGCTTTGATGCTGCT
GCTGATACCATTGTTGTCACCATTGGTAAACAATTCCAGGTATGTGTT
mRNA sequence
ATTAACTCAAAAAATGCCGCCACCCTCTACCGTCGTCGTTAGAAAAATGGCTCTCTTACGCACAAAATCCCGTTTAACAATCCCAGCTCCCCCACCCTCCCCAAT
CCCAACCGCCACCGGATCCCGTTCCGCCGGCAACGAAACCTTCAAAACCTTCCTCGAGAACTCCACTCACCTCCCCCAGCTCTCTTTGCCGGAATCCCGCTTCGC
CTCCGGCTTCAATACTACTCCCGCCGTCGTCGATTTCCGATCGCTAGTTTCTTCCGGCTGCGGTGAAGCCGTGGCGCGGATGCTTCGGTCCGTCAATGAATTTGG
GGCGTTTCGGATCGTTAATCATGGGATATCCGGGGAGGAGGTTCTGTCGGTGGTGAATGAAGCTAAATCTGTATTGGAAGATAGTAATAAGGGAGTTGATGATCG
GAGATGGGACGGGGACGACGGAAATCGCGAGGCGATTTTGCAGGTGCGGCGGCCGAATGACAGCGAGGTGTCGGGAAATACAGTTGTGGAGGCCGAAACGAACCG
GGAAATCAGCGAAAAGATGGAGAAAATAAGAAGGAAACTAGAAGGCATCGGAGAGAAATTAAGTGAGATATTATGTGGATTCGTGGGAGAGAATGTGGAGAAATT
AGGAGAGAAAAAAGAGACAATTTTTAGCATCTACAGATATCATCATCATCCAAATGATCTGTTTGAAAGGAAAAAGGATCATAATACTAAATTTTCAAAAAATGA
GAGAGAAAGTGATGAGAAAGTGATGATGAAGCTTGAAATTCCAGGAGAACATTGCCAATTTTATGTGAATTATTCTTGTCAGCAACAAAAACAATACTCACTTTG
CTTTGATGCTGCTGCTGATACCATTGTTGTCACCATTGGTAAACAATTCCAGGTATGTGTTCA
Protein sequence
MPPPSTVVVRKMALLRTKSRLTIPAPPPSPIPTATGSRSAGNETFKTFLENSTHLPQLSLPESRFASGFNTTPAVVDFRSLVSSGCGEAVARMLRSVNEFGAFRI
VNHGISGEEVLSVVNEAKSVLEDSNKGVDDRRWDGDDGNREAILQVRRPNDSEVSGNTVVEAETNREISEKMEKIRRKLEGIGEKLSEILCGFVGENVEKLGEKK
ETIFSIYRYHHHPNDLFERKKDHNTKFSKNERESDEKVMMKLEIPGEHCQFYVNYSCQQQKQYSLCFDAAADTIVVTIGKQFQVCV
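
The protein sequence above appears to be a direct codon-by-codon translation of the CDS sequence. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the page does not say which table or tool was used, and the function name `translate` is a hypothetical helper. The CDS as listed has no terminal stop codon, so the sketch does not require one.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1; whitespace and line breaks are ignored."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First six and last four codons of the CDS listed above.
print(translate("ATGCCGCCACCCTCTACC"))  # MPPPST
print(translate("CAGGTATGTGTT"))        # QVCV
```

Passing the full CDS text (line breaks included) to `translate` allows the result to be compared against the protein sequence listed on this page.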