; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS013867 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS013867
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Description2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
Genome locationscaffold607:392100..393735
RNA-Seq ExpressionMS013867
SyntenyMS013867
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR027443 - Isopenicillin N synthase-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603537.1 hypothetical protein SDJN03_04146, partial [Cucurbita argyrosperma subsp. sororia]8.4e-7463.41Show/hide
Query:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPD-GDAVARMLRSAGEFGAFRIVNHGI
        MAL RTKSRLTIPAPPPSPIPTGTGSRSAANETFK+FLE KSI LPQLSLPESRF+S  NP  A++D+R LASP  GDA ARMLRSA EFGAFRIVNHGI
Subjt:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPD-GDAVARMLRSAGEFGAFRIVNHGI

Query:  SGEEILSVVKDAKSILEDS--SSERDDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGE
        SGEEILSVV +AKS+ ED   S   D G R  + QVRRR    ASE++V +    R  S +MEK+  K+EGI EK+SE L + MGE         KK  +
Subjt:  SGEEILSVVKDAKSILEDS--SSERDDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGE

Query:  KEAILSIFRYNNNQQNQFDDGGERENEERENDESVMMSLHIPAEHCQFSVNPHPQ-GSFCFDSAADTIVVTIGKQL
        KE I SI+RYNNN QN           EREN    MMSLHIP EHCQFS+N H Q  SF FD+AAD IVVT+G+QL
Subjt:  KEAILSIFRYNNNQQNQFDDGGERENEERENDESVMMSLHIPAEHCQFSVNPHPQ-GSFCFDSAADTIVVTIGKQL

KAG7033722.1 hypothetical protein SDJN02_03447, partial [Cucurbita argyrosperma subsp. argyrosperma]4.5e-7563.77Show/hide
Query:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPD-GDAVARMLRSAGEFGAFRIVNHGI
        MAL RTKSRLTIPAPPPSPIPTGTGSRSAANETFK+FLE KSI LPQLSLPESRF+S  NP  A++D+R LASP  GDA ARMLRSA EFGAFRIVNHGI
Subjt:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPD-GDAVARMLRSAGEFGAFRIVNHGI

Query:  SGEEILSVVKDAKSILEDS--SSERDDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGE
        SGEEILSVV +AKS+ ED   S   D G R  + QVRRR    ASE++V +    R  S +MEK+  K+EGI EK+SE L + MGE         KK  +
Subjt:  SGEEILSVVKDAKSILEDS--SSERDDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGE

Query:  KEAILSIFRYNNNQQNQFDDGGERENEERENDESVMMSLHIPAEHCQFSVNPHPQ-GSFCFDSAADTIVVTIGKQL
        KE I SI+RYNNN QN           EREN    MMSLHIP EHCQFS+N H Q  SF FD+AADTIVVT+G+QL
Subjt:  KEAILSIFRYNNNQQNQFDDGGERENEERENDESVMMSLHIPAEHCQFSVNPHPQ-GSFCFDSAADTIVVTIGKQL

XP_022144007.1 uncharacterized protein LOC111013795 [Momordica charantia]3.8e-14398.9Show/hide
Query:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPDGDAVARMLRSAGEFGAFRIVNHGIS
        MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPDGDAVARMLRSAGEFGAFRIVNHGIS
Subjt:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPDGDAVARMLRSAGEFGAFRIVNHGIS

Query:  GEEILSVVKDAKSILEDSSSERDDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGEKEA
        GEEILSVVKDAKSILEDSSSER+DGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGEKEA
Subjt:  GEEILSVVKDAKSILEDSSSERDDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGEKEA

Query:  ILSIFRYNNNQQNQFDDGGERENEERENDESVMMSLHIPAEHCQFSVNPHPQGSFCFDSAADTIVVTIGKQLQ
        ILSIFRYNNNQQNQFDDGGERENEERE+DESVMMSLHIPAEHCQFSVNPH QGSFCFDSAADTIVVTIGKQLQ
Subjt:  ILSIFRYNNNQQNQFDDGGERENEERENDESVMMSLHIPAEHCQFSVNPHPQGSFCFDSAADTIVVTIGKQLQ

XP_022950423.1 uncharacterized protein LOC111453527 [Cucurbita moschata]2.2e-7463.41Show/hide
Query:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPD-GDAVARMLRSAGEFGAFRIVNHGI
        MAL RTKSRLTIPAPPPSPIPTGTGSRSAANETFK+FLE KSI LPQLSLPESRF+S  NP  A++D+R LASP  GDA ARMLRS  EFGAFRIVNHGI
Subjt:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPD-GDAVARMLRSAGEFGAFRIVNHGI

Query:  SGEEILSVVKDAKSILEDS--SSERDDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGE
        SGEEILSVV +AKS+ ED   S   D G R  + QVRRR    ASE++V +    R  S +MEK+  K+EGI EK+SE L + MGE         KK  +
Subjt:  SGEEILSVVKDAKSILEDS--SSERDDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGE

Query:  KEAILSIFRYNNNQQNQFDDGGERENEERENDESVMMSLHIPAEHCQFSVNPHPQ-GSFCFDSAADTIVVTIGKQL
        KE I SI+RYNNN QN           EREN    MMSLHIP EHCQFS+N H Q  SF FD+AADTIVVT+G+QL
Subjt:  KEAILSIFRYNNNQQNQFDDGGERENEERENDESVMMSLHIPAEHCQFSVNPHPQ-GSFCFDSAADTIVVTIGKQL

XP_038881407.1 uncharacterized protein LOC120072944, partial [Benincasa hispida]2.1e-7762.86Show/hide
Query:  LTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASP-DGDAVARMLRSAGEFGAFRIVNHGISGEEILSVV
        + IPAPPPSPIPTGTGSRSAANETFK FLE KSI LPQLSLPESRF+SG NP PA++D+R L SP  G+A ARMLRS  EFGAFRIVNHGISGEEILSVV
Subjt:  LTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASP-DGDAVARMLRSAGEFGAFRIVNHGISGEEILSVV

Query:  KDAKSILEDSSSERD-------DGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGEKEAI
         +AKS+LED +   D       DG R AI+Q+RRR    ASE+++   E  R  SG+ME++  K+EGI EKLSEIL + MGE         KKR +KEAI
Subjt:  KDAKSILEDSSSERD-------DGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGEKEAI

Query:  LSIFRYNNNQ------QNQFDDGGERENEERENDESVMMSLHIPAEHCQFSVNPH--PQGSFCFDSAADTIVVTIGKQLQ
         SI+RYNNNQ       N  ++  + + EEREND + MM LHIP EHCQF VN H   Q SFCFD+AADTIVVTIGKQLQ
Subjt:  LSIFRYNNNQ------QNQFDDGGERENEERENDESVMMSLHIPAEHCQFSVNPH--PQGSFCFDSAADTIVVTIGKQLQ

TrEMBL top hitse value%identityAlignment
A0A1S3CIU6 uncharacterized protein LOC1035014561.0e-7258.47Show/hide
Query:  NTAAVRRRPSSSMALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYR-LLASPDGDAVARMLRSAG
        +T  VR+     MALLRTKSRLTIPAPPPSPIPT TGSRSA NETFK FLE  S  LPQLSLPESRF SG N  PA++D+R L++S  G+AVARMLRS  
Subjt:  NTAAVRRRPSSSMALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYR-LLASPDGDAVARMLRSAG

Query:  EFGAFRIVNHGISGEEILSVVKDAKSILEDSSS-------ERDDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSES
        EFGAFRIVNHGISGEE+LSVV +AKS+LEDS+        + DDG R AI+QVRR      S ++V   E  R  S +MEK+ RK+EGIGEKLSEIL   
Subjt:  EFGAFRIVNHGISGEEILSVVKDAKSILEDSSS-------ERDDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSES

Query:  MGEEWGEDQKVKKKRGEKEAILSIFRYNNNQQNQF----DDGGERENEERENDESVMMSLHIPAEHCQFSVN----PHPQGSFCFDSAADTIVVTIGKQL
        +GE       V+K   +KE I SI+RY+++  + F    D   +    ERE+DE VMM L IP EHCQF VN       Q S CFD+AADTIVVTIGKQ 
Subjt:  MGEEWGEDQKVKKKRGEKEAILSIFRYNNNQQNQF----DDGGERENEERENDESVMMSLHIPAEHCQFSVN----PHPQGSFCFDSAADTIVVTIGKQL

Query:  Q
        Q
Subjt:  Q

A0A5J4ZG41 Uncharacterized protein3.0e-4042.09Show/hide
Query:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPDGDAVARMLRSAGEFGAFRIVNHGIS
        MA++RTKS L+ PAPPPSPIPTG G RSAA+ TF E+++ KSIQ+P+L LPE         +PA +DY+ L S DGD+V R+LRSA EFG  RI  HGI 
Subjt:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPDGDAVARMLRSAGEFGAFRIVNHGIS

Query:  GEEILSVVKDAKSILEDSSSERDDGARAAIVQ---VRRRRHGGASE--HSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKR
         ++I S +   +S+   S   + +  R  +     V RR     +E    V   E YR FS EME V  K+E I E+L+ ++SE+  ++       +KK 
Subjt:  GEEILSVVKDAKSILEDSSSERDDGARAAIVQ---VRRRRHGGASE--HSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKR

Query:  GEKEAILSIFRYNNNQQNQFDDGGERENEERENDESVMMSLHIPAEHCQFSVNPHPQGSFCFDSAADTIVVTIGKQLQ
          +E+ILS++RY  N+ ++ D      NEE        +SLH+  E C+FSV      S  F+++ DTIVVTIG+QL+
Subjt:  GEKEAILSIFRYNNNQQNQFDDGGERENEERENDESVMMSLHIPAEHCQFSVNPHPQGSFCFDSAADTIVVTIGKQLQ

A0A6J1CSG7 uncharacterized protein LOC1110137951.9e-14398.9Show/hide
Query:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPDGDAVARMLRSAGEFGAFRIVNHGIS
        MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPDGDAVARMLRSAGEFGAFRIVNHGIS
Subjt:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPDGDAVARMLRSAGEFGAFRIVNHGIS

Query:  GEEILSVVKDAKSILEDSSSERDDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGEKEA
        GEEILSVVKDAKSILEDSSSER+DGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGEKEA
Subjt:  GEEILSVVKDAKSILEDSSSERDDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGEKEA

Query:  ILSIFRYNNNQQNQFDDGGERENEERENDESVMMSLHIPAEHCQFSVNPHPQGSFCFDSAADTIVVTIGKQLQ
        ILSIFRYNNNQQNQFDDGGERENEERE+DESVMMSLHIPAEHCQFSVNPH QGSFCFDSAADTIVVTIGKQLQ
Subjt:  ILSIFRYNNNQQNQFDDGGERENEERENDESVMMSLHIPAEHCQFSVNPHPQGSFCFDSAADTIVVTIGKQLQ

A0A6J1GEV0 uncharacterized protein LOC1114535271.1e-7463.41Show/hide
Query:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPD-GDAVARMLRSAGEFGAFRIVNHGI
        MAL RTKSRLTIPAPPPSPIPTGTGSRSAANETFK+FLE KSI LPQLSLPESRF+S  NP  A++D+R LASP  GDA ARMLRS  EFGAFRIVNHGI
Subjt:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPD-GDAVARMLRSAGEFGAFRIVNHGI

Query:  SGEEILSVVKDAKSILEDS--SSERDDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGE
        SGEEILSVV +AKS+ ED   S   D G R  + QVRRR    ASE++V +    R  S +MEK+  K+EGI EK+SE L + MGE         KK  +
Subjt:  SGEEILSVVKDAKSILEDS--SSERDDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGE

Query:  KEAILSIFRYNNNQQNQFDDGGERENEERENDESVMMSLHIPAEHCQFSVNPHPQ-GSFCFDSAADTIVVTIGKQL
        KE I SI+RYNNN QN           EREN    MMSLHIP EHCQFS+N H Q  SF FD+AADTIVVT+G+QL
Subjt:  KEAILSIFRYNNNQQNQFDDGGERENEERENDESVMMSLHIPAEHCQFSVNPHPQ-GSFCFDSAADTIVVTIGKQL

A0A6J1ITI2 uncharacterized protein LOC1114782674.5e-7362.45Show/hide
Query:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPD-GDAVARMLRSAGEFGAFRIVNHGI
        MAL RTKSRLTIPAPPPSPIPTGTGSRSAANETFK+FLE KSI LPQLSLPESRF+S  NP  A++D+R LASP  GDA ARMLRSA EFGAFRIVNHGI
Subjt:  MALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPD-GDAVARMLRSAGEFGAFRIVNHGI

Query:  SGEEILSVVKDAKSILEDS--SSERDDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGE
        SGEEILSVV +AKS+ ED   S   D G R  + QVRRR    AS  +V +    R  S +MEK+  K+EGI EK+SE L E MGE         KK  +
Subjt:  SGEEILSVVKDAKSILEDS--SSERDDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGE

Query:  KEAILSIFRYNNNQQNQFDDGGERENEERENDESVMMSLHIPAEHCQFSVNPH--PQGSFCFDSAADTIVVTIGKQL
        KE I SI+RYNN+Q             ER+N    MMSLHIP EHCQFS+N H  P  SF FD+AADTIVVT+G+QL
Subjt:  KEAILSIFRYNNNQQNQFDDGGERENEERENDESVMMSLHIPAEHCQFSVNPH--PQGSFCFDSAADTIVVTIGKQL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G38500.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein5.5e-3939.93Show/hide
Query:  MALLRTKSRLTI-----PAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPES----RFVSGANPLPALLDYRLLASPDGDAVARMLRSAGEFGA
        MAL+RT+S+L +     P PPPSPIP   GSR AA+E   E +E +SIQ+P+L+LPES          + +PA +D+RLLAS    +V R++RSA EFGA
Subjt:  MALLRTKSRLTI-----PAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPES----RFVSGANPLPALLDYRLLASPDGDAVARMLRSAGEFGA

Query:  FRIVNHGISGEEILSVVKDAKSIL-----EDSSSERD-DGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEW
        FR+  HGISGEE+ S+V+++  +       D+   R   G R  IV VR  +            E YR FS EME VA K+E I  KL +I+ E+     
Subjt:  FRIVNHGISGEEILSVVKDAKSIL-----EDSSSERD-DGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEW

Query:  GEDQKVKKKRGEKEAILSIFRYNNNQQNQFDDGGERENEERENDESVMMSLHIPAEHCQFSVNPHPQGSFCFDSAADTIVVTIGKQLQ
          D+K+++     E++LS++RYN+  +N  +       E  E      +SLH+PA++C+F VN   +G   F +  DTI+VT G+QL+
Subjt:  GEDQKVKKKRGEKEAILSIFRYNNNQQNQFDDGGERENEERENDESVMMSLHIPAEHCQFSVNPHPQGSFCFDSAADTIVVTIGKQLQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ACAAATACCGCCGCCGTCCGCCGTCGACCGTCCTCTTCCATGGCTCTCCTCCGCACGAAGAGTCGGTTGACGATCCCGGCACCGCCGCCGTCGCCGATCCCGACCGGCAC
CGGATCGCGCTCGGCGGCCAACGAGACCTTCAAGGAATTCCTCGAGACGAAGTCGATTCAACTCCCGCAACTCTCGCTGCCGGAATCTCGCTTCGTCTCCGGCGCCAATC
CCTTGCCCGCGCTCCTCGATTACCGATTGCTCGCGTCTCCCGACGGCGACGCGGTGGCGCGGATGCTCCGTTCGGCCGGCGAGTTCGGCGCCTTTCGGATCGTTAATCAC
GGAATTTCCGGGGAGGAAATTTTGTCGGTGGTGAAGGACGCGAAATCTATTCTGGAAGATTCTTCTTCGGAGAGGGATGACGGAGCCCGGGCGGCGATTGTACAGGTCCG
CCGTCGGAGACACGGCGGGGCGTCGGAACATTCGGTTGCGCGGGACGAGGCGTACCGGCACTTTAGCGGAGAGATGGAGAAGGTAGCCAGGAAAGTAGAGGGAATTGGAG
AGAAATTGAGTGAGATTTTATCAGAAAGCATGGGGGAGGAATGGGGCGAAGATCAGAAGGTGAAGAAGAAGAGAGGAGAAAAAGAGGCAATTTTGAGCATTTTCAGATAC
AATAATAATCAGCAGAATCAATTTGATGATGGAGGAGAGAGAGAAAATGAAGAGAGAGAAAATGATGAGAGTGTGATGATGAGCCTCCACATTCCAGCAGAGCACTGCCA
ATTCTCTGTCAATCCTCACCCACAGGGCTCTTTCTGCTTTGATTCTGCTGCTGATACCATTGTTGTCACCATTGGCAAACAGCTCCAGGTAATTTTATCAATATCC
mRNA sequenceShow/hide mRNA sequence
ACAAATACCGCCGCCGTCCGCCGTCGACCGTCCTCTTCCATGGCTCTCCTCCGCACGAAGAGTCGGTTGACGATCCCGGCACCGCCGCCGTCGCCGATCCCGACCGGCAC
CGGATCGCGCTCGGCGGCCAACGAGACCTTCAAGGAATTCCTCGAGACGAAGTCGATTCAACTCCCGCAACTCTCGCTGCCGGAATCTCGCTTCGTCTCCGGCGCCAATC
CCTTGCCCGCGCTCCTCGATTACCGATTGCTCGCGTCTCCCGACGGCGACGCGGTGGCGCGGATGCTCCGTTCGGCCGGCGAGTTCGGCGCCTTTCGGATCGTTAATCAC
GGAATTTCCGGGGAGGAAATTTTGTCGGTGGTGAAGGACGCGAAATCTATTCTGGAAGATTCTTCTTCGGAGAGGGATGACGGAGCCCGGGCGGCGATTGTACAGGTCCG
CCGTCGGAGACACGGCGGGGCGTCGGAACATTCGGTTGCGCGGGACGAGGCGTACCGGCACTTTAGCGGAGAGATGGAGAAGGTAGCCAGGAAAGTAGAGGGAATTGGAG
AGAAATTGAGTGAGATTTTATCAGAAAGCATGGGGGAGGAATGGGGCGAAGATCAGAAGGTGAAGAAGAAGAGAGGAGAAAAAGAGGCAATTTTGAGCATTTTCAGATAC
AATAATAATCAGCAGAATCAATTTGATGATGGAGGAGAGAGAGAAAATGAAGAGAGAGAAAATGATGAGAGTGTGATGATGAGCCTCCACATTCCAGCAGAGCACTGCCA
ATTCTCTGTCAATCCTCACCCACAGGGCTCTTTCTGCTTTGATTCTGCTGCTGATACCATTGTTGTCACCATTGGCAAACAGCTCCAGGTAATTTTATCAATATCC
Protein sequenceShow/hide protein sequence
TNTAAVRRRPSSSMALLRTKSRLTIPAPPPSPIPTGTGSRSAANETFKEFLETKSIQLPQLSLPESRFVSGANPLPALLDYRLLASPDGDAVARMLRSAGEFGAFRIVNH
GISGEEILSVVKDAKSILEDSSSERDDGARAAIVQVRRRRHGGASEHSVARDEAYRHFSGEMEKVARKVEGIGEKLSEILSESMGEEWGEDQKVKKKRGEKEAILSIFRY
NNNQQNQFDDGGERENEERENDESVMMSLHIPAEHCQFSVNPHPQGSFCFDSAADTIVVTIGKQLQVILSIS