; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g19370 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g19370
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
Genome locationchr1:13438069..13443766
RNA-Seq ExpressionMoc01g19370
SyntenyMoc01g19370
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR027443 - Isopenicillin N synthase-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603537.1 hypothetical protein SDJN03_04146, partial [Cucurbita argyrosperma subsp. sororia]2.3e-7863.67Show/hide
Query:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPD-GDAVARMLRSAGEFGAFRIVNHGI
        MAL RTKSRLTIPAPPPSPIPTGTGSRSAANETFK+FLE KSI LPQLSLPESRF+S  NP  A++D+R LASP  GDA ARMLRSA EFGAFRIVNHGI
Subjt:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPD-GDAVARMLRSAGEFGAFRIVNHGI

Query:  SGEEILSVVKDAKSILEDS--SSERNDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGE
        SGEEILSVV +AKS+ ED   S   + G R  + QVRRR    ASE++V +    R  S +MEK+  K+EGI EK+SE L + MGE         KK  +
Subjt:  SGEEILSVVKDAKSILEDS--SSERNDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGE

Query:  KEAILSIFRYNNNQQNQFDDGGERENEERESDESVMMSLHIPAEHCQFSVNPHTQ-GSFCFDSAADTIVVTIGKQLQEWSMGKLKSARS
        KE I SI+RYNNN QN   +  EREN+ +      MMSLHIP EHCQFS+N H Q  SF FD+AAD IVVT+G+QL E S+GKLKSARS
Subjt:  KEAILSIFRYNNNQQNQFDDGGERENEERESDESVMMSLHIPAEHCQFSVNPHTQ-GSFCFDSAADTIVVTIGKQLQEWSMGKLKSARS

KAG7033722.1 hypothetical protein SDJN02_03447, partial [Cucurbita argyrosperma subsp. argyrosperma]1.6e-7964.01Show/hide
Query:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPD-GDAVARMLRSAGEFGAFRIVNHGI
        MAL RTKSRLTIPAPPPSPIPTGTGSRSAANETFK+FLE KSI LPQLSLPESRF+S  NP  A++D+R LASP  GDA ARMLRSA EFGAFRIVNHGI
Subjt:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPD-GDAVARMLRSAGEFGAFRIVNHGI

Query:  SGEEILSVVKDAKSILEDS--SSERNDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGE
        SGEEILSVV +AKS+ ED   S   + G R  + QVRRR    ASE++V +    R  S +MEK+  K+EGI EK+SE L + MGE         KK  +
Subjt:  SGEEILSVVKDAKSILEDS--SSERNDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGE

Query:  KEAILSIFRYNNNQQNQFDDGGERENEERESDESVMMSLHIPAEHCQFSVNPHTQ-GSFCFDSAADTIVVTIGKQLQEWSMGKLKSARS
        KE I SI+RYNNN QN   +  EREN+ +      MMSLHIP EHCQFS+N H Q  SF FD+AADTIVVT+G+QL E S+GKLKSARS
Subjt:  KEAILSIFRYNNNQQNQFDDGGERENEERESDESVMMSLHIPAEHCQFSVNPHTQ-GSFCFDSAADTIVVTIGKQLQEWSMGKLKSARS

XP_022144007.1 uncharacterized protein LOC111013795 [Momordica charantia]2.4e-152100Show/hide
Query:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPDGDAVARMLRSAGEFGAFRIVNHGIS
        MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPDGDAVARMLRSAGEFGAFRIVNHGIS
Subjt:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPDGDAVARMLRSAGEFGAFRIVNHGIS

Query:  GEEILSVVKDAKSILEDSSSERNDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGEKEA
        GEEILSVVKDAKSILEDSSSERNDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGEKEA
Subjt:  GEEILSVVKDAKSILEDSSSERNDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGEKEA

Query:  ILSIFRYNNNQQNQFDDGGERENEERESDESVMMSLHIPAEHCQFSVNPHTQGSFCFDSAADTIVVTIGKQLQEWSMGKLKSARS
        ILSIFRYNNNQQNQFDDGGERENEERESDESVMMSLHIPAEHCQFSVNPHTQGSFCFDSAADTIVVTIGKQLQEWSMGKLKSARS
Subjt:  ILSIFRYNNNQQNQFDDGGERENEERESDESVMMSLHIPAEHCQFSVNPHTQGSFCFDSAADTIVVTIGKQLQEWSMGKLKSARS

XP_022950423.1 uncharacterized protein LOC111453527 [Cucurbita moschata]4.7e-7963.67Show/hide
Query:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPD-GDAVARMLRSAGEFGAFRIVNHGI
        MAL RTKSRLTIPAPPPSPIPTGTGSRSAANETFK+FLE KSI LPQLSLPESRF+S  NP  A++D+R LASP  GDA ARMLRS  EFGAFRIVNHGI
Subjt:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPD-GDAVARMLRSAGEFGAFRIVNHGI

Query:  SGEEILSVVKDAKSILEDS--SSERNDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGE
        SGEEILSVV +AKS+ ED   S   + G R  + QVRRR    ASE++V +    R  S +MEK+  K+EGI EK+SE L + MGE         KK  +
Subjt:  SGEEILSVVKDAKSILEDS--SSERNDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGE

Query:  KEAILSIFRYNNNQQNQFDDGGERENEERESDESVMMSLHIPAEHCQFSVNPHTQ-GSFCFDSAADTIVVTIGKQLQEWSMGKLKSARS
        KE I SI+RYNNN QN   +  EREN+ +      MMSLHIP EHCQFS+N H Q  SF FD+AADTIVVT+G+QL E S+GKLKSARS
Subjt:  KEAILSIFRYNNNQQNQFDDGGERENEERESDESVMMSLHIPAEHCQFSVNPHTQ-GSFCFDSAADTIVVTIGKQLQEWSMGKLKSARS

XP_038881407.1 uncharacterized protein LOC120072944, partial [Benincasa hispida]1.2e-8263.36Show/hide
Query:  LTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASP-DGDAVARMLRSAGEFGAFRIVNHGISGEEILSVV
        + IPAPPPSPIPTGTGSRSAANETFK FLE KSI LPQLSLPESRF+SG NP PA++D+R L SP  G+A ARMLRS  EFGAFRIVNHGISGEEILSVV
Subjt:  LTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASP-DGDAVARMLRSAGEFGAFRIVNHGISGEEILSVV

Query:  KDAKSILEDSSSE-------RNDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGEKEAI
         +AKS+LED +          NDG R AI+Q+RRR    ASE+++   E  R  SG+ME++  K+EGI EKLSEIL + MGE         KKR +KEAI
Subjt:  KDAKSILEDSSSE-------RNDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGEKEAI

Query:  LSIFRYNNNQ------QNQFDDGGERENEERESDESVMMSLHIPAEHCQFSVNPH--TQGSFCFDSAADTIVVTIGKQLQEWSMGKLKSARS
         SI+RYNNNQ       N  ++  + + EERE+D + MM LHIP EHCQF VN H   Q SFCFD+AADTIVVTIGKQLQE S+GKLKSARS
Subjt:  LSIFRYNNNQ------QNQFDDGGERENEERESDESVMMSLHIPAEHCQFSVNPH--TQGSFCFDSAADTIVVTIGKQLQEWSMGKLKSARS

TrEMBL top hitse value%identityAlignment
A0A1S3CIU6 uncharacterized protein LOC1035014562.5e-7860.8Show/hide
Query:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYR-LLASPDGDAVARMLRSAGEFGAFRIVNHGI
        MALLRTKSRLTIPAPPPSPIPT TGSRSA NETFK FLE  S  LPQLSLPESRF SG N  PA++D+R L++S  G+AVARMLRS  EFGAFRIVNHGI
Subjt:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYR-LLASPDGDAVARMLRSAGEFGAFRIVNHGI

Query:  SGEEILSVVKDAKSILEDSSS-------ERNDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVK
        SGEE+LSVV +AKS+LEDS+        + +DG R AI+QVRR      S ++V   E  R  S +MEK+ RK+EGIGEKLSEIL   +GE       V+
Subjt:  SGEEILSVVKDAKSILEDSSS-------ERNDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVK

Query:  KKRGEKEAILSIFRYNNNQQNQF----DDGGERENEERESDESVMMSLHIPAEHCQFSVN----PHTQGSFCFDSAADTIVVTIGKQLQEWSMGKLKSAR
        K   +KE I SI+RY+++  + F    D   +    ERESDE VMM L IP EHCQF VN       Q S CFD+AADTIVVTIGKQ QE S+GKLKSAR
Subjt:  KKRGEKEAILSIFRYNNNQQNQF----DDGGERENEERESDESVMMSLHIPAEHCQFSVN----PHTQGSFCFDSAADTIVVTIGKQLQEWSMGKLKSAR

Query:  S
        S
Subjt:  S

A0A5J4ZG41 Uncharacterized protein2.8e-4543.06Show/hide
Query:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPDGDAVARMLRSAGEFGAFRIVNHGIS
        MA++RTKS L+ PAPPPSPIPTG G RSAA+ TF E+++ KSIQ+P+L LPE         +PA +DY+ L S DGD+V R+LRSA EFG  RI  HGI 
Subjt:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPDGDAVARMLRSAGEFGAFRIVNHGIS

Query:  GEEILSVVKDAKSILEDSSSERNDGARAAIVQ---VRRRRHGGASE--HSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKR
         ++I S +   +S+   S   + +  R  +     V RR     +E    V   E YR FS EME V  K+E I E+L+ ++SE+  ++       +KK 
Subjt:  GEEILSVVKDAKSILEDSSSERNDGARAAIVQ---VRRRRHGGASE--HSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKR

Query:  GEKEAILSIFRYNNNQQNQFDDGGERENEERESDESVMMSLHIPAEHCQFSVNPHTQGSFCFDSAADTIVVTIGKQLQEWSMGKLKSA
          +E+ILS++RY  N+ ++ D      NEE        +SLH+  E C+FSV   +  S  F+++ DTIVVTIG+QL+EWS+G  KSA
Subjt:  GEKEAILSIFRYNNNQQNQFDDGGERENEERESDESVMMSLHIPAEHCQFSVNPHTQGSFCFDSAADTIVVTIGKQLQEWSMGKLKSA

A0A6J1CSG7 uncharacterized protein LOC1110137951.2e-152100Show/hide
Query:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPDGDAVARMLRSAGEFGAFRIVNHGIS
        MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPDGDAVARMLRSAGEFGAFRIVNHGIS
Subjt:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPDGDAVARMLRSAGEFGAFRIVNHGIS

Query:  GEEILSVVKDAKSILEDSSSERNDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGEKEA
        GEEILSVVKDAKSILEDSSSERNDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGEKEA
Subjt:  GEEILSVVKDAKSILEDSSSERNDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGEKEA

Query:  ILSIFRYNNNQQNQFDDGGERENEERESDESVMMSLHIPAEHCQFSVNPHTQGSFCFDSAADTIVVTIGKQLQEWSMGKLKSARS
        ILSIFRYNNNQQNQFDDGGERENEERESDESVMMSLHIPAEHCQFSVNPHTQGSFCFDSAADTIVVTIGKQLQEWSMGKLKSARS
Subjt:  ILSIFRYNNNQQNQFDDGGERENEERESDESVMMSLHIPAEHCQFSVNPHTQGSFCFDSAADTIVVTIGKQLQEWSMGKLKSARS

A0A6J1GEV0 uncharacterized protein LOC1114535272.3e-7963.67Show/hide
Query:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPD-GDAVARMLRSAGEFGAFRIVNHGI
        MAL RTKSRLTIPAPPPSPIPTGTGSRSAANETFK+FLE KSI LPQLSLPESRF+S  NP  A++D+R LASP  GDA ARMLRS  EFGAFRIVNHGI
Subjt:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPD-GDAVARMLRSAGEFGAFRIVNHGI

Query:  SGEEILSVVKDAKSILEDS--SSERNDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGE
        SGEEILSVV +AKS+ ED   S   + G R  + QVRRR    ASE++V +    R  S +MEK+  K+EGI EK+SE L + MGE         KK  +
Subjt:  SGEEILSVVKDAKSILEDS--SSERNDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGE

Query:  KEAILSIFRYNNNQQNQFDDGGERENEERESDESVMMSLHIPAEHCQFSVNPHTQ-GSFCFDSAADTIVVTIGKQLQEWSMGKLKSARS
        KE I SI+RYNNN QN   +  EREN+ +      MMSLHIP EHCQFS+N H Q  SF FD+AADTIVVT+G+QL E S+GKLKSARS
Subjt:  KEAILSIFRYNNNQQNQFDDGGERENEERESDESVMMSLHIPAEHCQFSVNPHTQ-GSFCFDSAADTIVVTIGKQLQEWSMGKLKSARS

A0A6J1ITI2 uncharacterized protein LOC1114782675.3e-7662.41Show/hide
Query:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPD-GDAVARMLRSAGEFGAFRIVNHGI
        MAL RTKSRLTIPAPPPSPIPTGTGSRSAANETFK+FLE KSI LPQLSLPESRF+S  NP  A++D+R LASP  GDA ARMLRSA EFGAFRIVNHGI
Subjt:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPD-GDAVARMLRSAGEFGAFRIVNHGI

Query:  SGEEILSVVKDAKSILEDS--SSERNDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGE
        SGEEILSVV +AKS+ ED   S   + G R  + QVRRR    AS  +V +    R  S +MEK+  K+EGI EK+SE L E MGE         KK  +
Subjt:  SGEEILSVVKDAKSILEDS--SSERNDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGE

Query:  KEAILSIFRYNNNQQNQFDDGGERENEERESDESVMMSLHIPAEHCQFSVNPHTQ--GSFCFDSAADTIVVTIGKQLQEWSMGKLKSARS
        KE I SI+RYNN+Q     +  ER+N+ +      MMSLHIP EHCQFS+N H Q   SF FD+AADTIVVT+G+QL E S  KLKSARS
Subjt:  KEAILSIFRYNNNQQNQFDDGGERENEERESDESVMMSLHIPAEHCQFSVNPHTQ--GSFCFDSAADTIVVTIGKQLQEWSMGKLKSARS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G38500.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.6e-4340.54Show/hide
Query:  MALLRTKSRLTI-----PAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPES----RFVSGANPLPALLDYRLLASPDGDAVARMLRSAGEFGA
        MAL+RT+S+L +     P PPPSPIP   GSR AA+E   E +E +SIQ+P+L+LPES          + +PA +D+RLLAS    +V R++RSA EFGA
Subjt:  MALLRTKSRLTI-----PAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPES----RFVSGANPLPALLDYRLLASPDGDAVARMLRSAGEFGA

Query:  FRIVNHGISGEEILSVVKDAKSIL-----EDSSSERN-DGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEW
        FR+  HGISGEE+ S+V+++  +       D+   R+  G R  IV VR  +            E YR FS EME VA K+E I  KL +I+ E+     
Subjt:  FRIVNHGISGEEILSVVKDAKSIL-----EDSSSERN-DGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEW

Query:  GEDQKVKKKRGEKEAILSIFRYNNNQQNQFDDGGERENEERESDESVMMSLHIPAEHCQFSVNPHTQGSFCFDSAADTIVVTIGKQLQEWSMGKLK
          D+K+++     E++LS++RYN+  +N  +       E  E      +SLH+PA++C+F VN   +G   F +  DTI+VT G+QL+EWS+G+ K
Subjt:  GEDQKVKKKRGEKEAILSIFRYNNNQQNQFDDGGERENEERESDESVMMSLHIPAEHCQFSVNPHTQGSFCFDSAADTIVVTIGKQLQEWSMGKLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCTCCTCCGCACGAAGAGTCGGTTGACGATTCCGGCACCGCCGCCGTCGCCGATCCCGACCGGCACCGGATCGCGCTCGGCGGCCAACGAGACCTTCAAG
GAATTCCTCGAGACGAAGTCGATTCAACTCCCGCAACTCTCGCTGCCGGAATCTCGCTTCGTCTCCGGCGCCAATCCCTTGCCCGCGCTCCTCGATTACCGATTG
CTCGCGTCTCCCGACGGCGACGCGGTGGCGCGGATGCTCCGTTCGGCCGGCGAGTTCGGCGCCTTTCGGATCGTTAATCACGGAATTTCCGGGGAGGAAATTTTG
TCGGTGGTGAAGGACGCGAAATCTATTCTGGAAGATTCTTCTTCGGAGAGGAATGACGGAGCCCGGGCGGCGATTGTACAGGTCCGCCGTCGGAGACACGGCGGG
GCGTCGGAACATTCGGTTGCGCGGGACGAGGCGTACCGGCACTTTAGCGGAGAGATGGAGAAGGTAGCCAGGAAAGTAGAGGGAATTGGAGAGAAATTGAGTGAG
ATTTTATCAGAAAGCATGGGGGAGGAATGGGGCGAAGATCAGAAGGTGAAGAAGAAGAGAGGAGAAAAAGAGGCAATTTTGAGCATTTTCAGATACAATAATAAT
CAGCAGAATCAATTTGATGATGGAGGAGAGAGAGAAAATGAAGAGAGAGAAAGTGATGAGAGTGTGATGATGAGCCTCCACATTCCAGCAGAGCACTGCCAATTC
TCTGTCAATCCTCACACACAGGGCTCTTTCTGCTTTGATTCTGCTGCTGATACCATTGTTGTCACCATTGGCAAACAGCTCCAGGAATGGAGCATGGGAAAATTA
AAAAGTGCAAGAAGTCACAATTGGGAGCAAATTCCCAAAGAAGCAGCGCCACAACGCCAGTACACAGTGCCATGGCACCTGCTGACAAAGCTTGGACCTATTTCC
GTGATGATAGCGCCGCAACGCTATCTTGTAGCGTCATGGCTCTACGAACTTGCTTGTCCATCACAATTTTGGGACAACGCCACGGCGCTCACTACTGGCGTCATG
GCGCTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTCTCCTCCGCACGAAGAGTCGGTTGACGATTCCGGCACCGCCGCCGTCGCCGATCCCGACCGGCACCGGATCGCGCTCGGCGGCCAACGAGACCTTCAAG
GAATTCCTCGAGACGAAGTCGATTCAACTCCCGCAACTCTCGCTGCCGGAATCTCGCTTCGTCTCCGGCGCCAATCCCTTGCCCGCGCTCCTCGATTACCGATTG
CTCGCGTCTCCCGACGGCGACGCGGTGGCGCGGATGCTCCGTTCGGCCGGCGAGTTCGGCGCCTTTCGGATCGTTAATCACGGAATTTCCGGGGAGGAAATTTTG
TCGGTGGTGAAGGACGCGAAATCTATTCTGGAAGATTCTTCTTCGGAGAGGAATGACGGAGCCCGGGCGGCGATTGTACAGGTCCGCCGTCGGAGACACGGCGGG
GCGTCGGAACATTCGGTTGCGCGGGACGAGGCGTACCGGCACTTTAGCGGAGAGATGGAGAAGGTAGCCAGGAAAGTAGAGGGAATTGGAGAGAAATTGAGTGAG
ATTTTATCAGAAAGCATGGGGGAGGAATGGGGCGAAGATCAGAAGGTGAAGAAGAAGAGAGGAGAAAAAGAGGCAATTTTGAGCATTTTCAGATACAATAATAAT
CAGCAGAATCAATTTGATGATGGAGGAGAGAGAGAAAATGAAGAGAGAGAAAGTGATGAGAGTGTGATGATGAGCCTCCACATTCCAGCAGAGCACTGCCAATTC
TCTGTCAATCCTCACACACAGGGCTCTTTCTGCTTTGATTCTGCTGCTGATACCATTGTTGTCACCATTGGCAAACAGCTCCAGGAATGGAGCATGGGAAAATTA
AAAAGTGCAAGAAGTCACAATTGGGAGCAAATTCCCAAAGAAGCAGCGCCACAACGCCAGTACACAGTGCCATGGCACCTGCTGACAAAGCTTGGACCTATTTCC
GTGATGATAGCGCCGCAACGCTATCTTGTAGCGTCATGGCTCTACGAACTTGCTTGTCCATCACAATTTTGGGACAACGCCACGGCGCTCACTACTGGCGTCATG
GCGCTGTAG
Protein sequenceShow/hide protein sequence
MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPDGDAVARMLRSAGEFGAFRIVNHGISGEEIL
SVVKDAKSILEDSSSERNDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGEKEAILSIFRYNNN
QQNQFDDGGERENEERESDESVMMSLHIPAEHCQFSVNPHTQGSFCFDSAADTIVVTIGKQLQEWSMGKLKSARSHNWEQIPKEAAPQRQYTVPWHLLTKLGPIS
VMIAPQRYLVASWLYELACPSQFWDNATALTTGVMAL