; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g34370 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g34370
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr9:26288083..26291385
RNA-Seq ExpressionMoc09g34370
SyntenyMoc09g34370
Gene Ontology termsGO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143317.1 uncharacterized protein LOC111013216 [Momordica charantia]3.3e-8589.94Show/hide
Query:  WFLLLWEYEEVATSMVIAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDAS
        W + +   EEVATSMVIAWQIW+SRNRSIFRGETIDEQQL RSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLG RWSAP TNCWKLNTDAS
Subjt:  WFLLLWEYEEVATSMVIAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDAS

Query:  WSEEREVGGLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQSINMQSRSPIYLELDSVEVTRLMKKEDVDLT
        WSEEREVGG+GWIL DCRGEIVLAGNCKIREKKEI ALELM IIRGLQ INMQSRSPIYLE DSVEV RLMKKEDVDLT
Subjt:  WSEEREVGGLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQSINMQSRSPIYLELDSVEVTRLMKKEDVDLT

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]3.3e-3742.86Show/hide
Query:  EEVATSMVIAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSEEREVG
        EE   SM+IAWQIW+ RN+SIF+G   + + +  +I  +I ++  + T +    +S   D +L R  E  +  GA+W  P +N WKLNT+A+W  +   G
Subjt:  EEVATSMVIAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSEEREVG

Query:  GLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQSINMQSRSPIYLELDSVEVTRLMKKEDVDLTENIWLVEEAVELAKTRGIVSIAHINRGNN
        G+GWILRD +GE++ A    IR ++ I  LE+MAI  GL++I  +   PI+LE DS+E   L+ ++  D TE IWL+EE  ++ K   IVS+ HI+R  N
Subjt:  GLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQSINMQSRSPIYLELDSVEVTRLMKKEDVDLTENIWLVEEAVELAKTRGIVSIAHINRGNN

Query:  SVAHELARRATHNDEEE
         VAH LARRA  ND  E
Subjt:  SVAHELARRATHNDEEE

XP_022154991.1 uncharacterized protein LOC111022134 isoform X2 [Momordica charantia]5.1e-2234.78Show/hide
Query:  FSVKSAYKLGMELENAN---LASTSNLDSHRQMRWFLLLWEYEEVATSMVIAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQ
        F  KS   L     N N   L   +N  +     W +     EE   SM+IA QIW+ RN+SIF+G   + + +  +I  +I ++  + T     +R  +
Subjt:  FSVKSAYKLGMELENAN---LASTSNLDSHRQMRWFLLLWEYEEVATSMVIAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQ

Query:  NDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSEEREVGGLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQSINMQSRSPIYLELDSVE
        +   + R  +N     ARW  P +N WKLNTDA+W  +    G+GWILRD +GE++  G   IR ++ I  LE+MAI  GL++I  +   PI+LE DS+E
Subjt:  NDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSEEREVGGLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQSINMQSRSPIYLELDSVE

Query:  VTRLMKK
           L+ +
Subjt:  VTRLMKK

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]4.5e-2634.91Show/hide
Query:  EEVATSMVIAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSEEREVG
        E++   ++ +W IW+ RN  IFRGE      + + +  F+            T  S Q++  L    + LN    +W  P  + W LN DASWS+    G
Subjt:  EEVATSMVIAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSEEREVG

Query:  GLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQSI-NMQSRSPIYLELDSVEVTRLMKKEDVDLTENIWLVEEAVELAKTRGIVSIAHINRGN
        G+GWI+R   G+IVLAGN  +     +K LE  AI+ GL+++ N+    P+++E DS EV  L+ ++  DLT+  W+VEE + L  +  I++ A + R  
Subjt:  GLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQSI-NMQSRSPIYLELDSVEVTRLMKKEDVDLTENIWLVEEAVELAKTRGIVSIAHINRGN

Query:  NSVAHELARRAT
        N  AH LA+RA+
Subjt:  NSVAHELARRAT

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]6.9e-3542.22Show/hide
Query:  EEVATSMVIAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSEEREVG
        EE   SM+IAWQIW+ RN+SIF+G   + + +   I  +I ++  + T +    +S   D +L R R   N  GARW  P +N WKLNTDA+W  +   G
Subjt:  EEVATSMVIAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSEEREVG

Query:  GLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQS--------INMQSRSPIYLELDSVEVTRLMKKEDVDLTENIWLVEEAVELAKTRGIVSI
        G+GWILRD +GE++ A    IR ++ I  LE+MAI  GL++        I  +   PI+LE DS+E   L+ ++  D TE IWL+EE  ++ +   IVS+
Subjt:  GLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQS--------INMQSRSPIYLELDSVEVTRLMKKEDVDLTENIWLVEEAVELAKTRGIVSI

Query:  AHINRGNNSVAHELARRATHNDEEE
         HI+R  N VAH+LARRA  ND  E
Subjt:  AHINRGNNSVAHELARRATHNDEEE

TrEMBL top hitse value%identityAlignment
A0A6J1CP26 uncharacterized protein LOC1110134121.6e-3742.86Show/hide
Query:  EEVATSMVIAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSEEREVG
        EE   SM+IAWQIW+ RN+SIF+G   + + +  +I  +I ++  + T +    +S   D +L R  E  +  GA+W  P +N WKLNT+A+W  +   G
Subjt:  EEVATSMVIAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSEEREVG

Query:  GLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQSINMQSRSPIYLELDSVEVTRLMKKEDVDLTENIWLVEEAVELAKTRGIVSIAHINRGNN
        G+GWILRD +GE++ A    IR ++ I  LE+MAI  GL++I  +   PI+LE DS+E   L+ ++  D TE IWL+EE  ++ K   IVS+ HI+R  N
Subjt:  GLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQSINMQSRSPIYLELDSVEVTRLMKKEDVDLTENIWLVEEAVELAKTRGIVSIAHINRGNN

Query:  SVAHELARRATHNDEEE
         VAH LARRA  ND  E
Subjt:  SVAHELARRATHNDEEE

A0A6J1CQG0 uncharacterized protein LOC1110132161.6e-8589.94Show/hide
Query:  WFLLLWEYEEVATSMVIAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDAS
        W + +   EEVATSMVIAWQIW+SRNRSIFRGETIDEQQL RSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLG RWSAP TNCWKLNTDAS
Subjt:  WFLLLWEYEEVATSMVIAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDAS

Query:  WSEEREVGGLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQSINMQSRSPIYLELDSVEVTRLMKKEDVDLT
        WSEEREVGG+GWIL DCRGEIVLAGNCKIREKKEI ALELM IIRGLQ INMQSRSPIYLE DSVEV RLMKKEDVDLT
Subjt:  WSEEREVGGLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQSINMQSRSPIYLELDSVEVTRLMKKEDVDLT

A0A6J1DNV9 uncharacterized protein LOC1110224032.2e-2634.91Show/hide
Query:  EEVATSMVIAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSEEREVG
        E++   ++ +W IW+ RN  IFRGE      + + +  F+            T  S Q++  L    + LN    +W  P  + W LN DASWS+    G
Subjt:  EEVATSMVIAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSEEREVG

Query:  GLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQSI-NMQSRSPIYLELDSVEVTRLMKKEDVDLTENIWLVEEAVELAKTRGIVSIAHINRGN
        G+GWI+R   G+IVLAGN  +     +K LE  AI+ GL+++ N+    P+++E DS EV  L+ ++  DLT+  W+VEE + L  +  I++ A + R  
Subjt:  GLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQSI-NMQSRSPIYLELDSVEVTRLMKKEDVDLTENIWLVEEAVELAKTRGIVSIAHINRGN

Query:  NSVAHELARRAT
        N  AH LA+RA+
Subjt:  NSVAHELARRAT

A0A6J1DQC9 uncharacterized protein LOC111022134 isoform X22.5e-2234.78Show/hide
Query:  FSVKSAYKLGMELENAN---LASTSNLDSHRQMRWFLLLWEYEEVATSMVIAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQ
        F  KS   L     N N   L   +N  +     W +     EE   SM+IA QIW+ RN+SIF+G   + + +  +I  +I ++  + T     +R  +
Subjt:  FSVKSAYKLGMELENAN---LASTSNLDSHRQMRWFLLLWEYEEVATSMVIAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQ

Query:  NDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSEEREVGGLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQSINMQSRSPIYLELDSVE
        +   + R  +N     ARW  P +N WKLNTDA+W  +    G+GWILRD +GE++  G   IR ++ I  LE+MAI  GL++I  +   PI+LE DS+E
Subjt:  NDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSEEREVGGLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQSINMQSRSPIYLELDSVE

Query:  VTRLMKK
           L+ +
Subjt:  VTRLMKK

A0A6J1DSV1 uncharacterized protein LOC1110236083.3e-3542.22Show/hide
Query:  EEVATSMVIAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSEEREVG
        EE   SM+IAWQIW+ RN+SIF+G   + + +   I  +I ++  + T +    +S   D +L R R   N  GARW  P +N WKLNTDA+W  +   G
Subjt:  EEVATSMVIAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSEEREVG

Query:  GLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQS--------INMQSRSPIYLELDSVEVTRLMKKEDVDLTENIWLVEEAVELAKTRGIVSI
        G+GWILRD +GE++ A    IR ++ I  LE+MAI  GL++        I  +   PI+LE DS+E   L+ ++  D TE IWL+EE  ++ +   IVS+
Subjt:  GLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQS--------INMQSRSPIYLELDSVEVTRLMKKEDVDLTENIWLVEEAVELAKTRGIVSI

Query:  AHINRGNNSVAHELARRATHNDEEE
         HI+R  N VAH+LARRA  ND  E
Subjt:  AHINRGNNSVAHELARRATHNDEEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52990.1 thioredoxin family protein5.2e-0430.77Show/hide
Query:  KLNTDASWSEEREVGGLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQSINMQSRSPIYLELDSVEVTRLM
        K N DAS  E   V GLGW++R+ +G ++  G  K + +   +  E  A+I  +Q+ +    + +  E D+  V RL+
Subjt:  KLNTDASWSEEREVGGLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQSINMQSRSPIYLELDSVEVTRLM

AT2G02650.1 Ribonuclease H-like superfamily protein2.9e-0723.19Show/hide
Query:  IAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSEEREVGGLGWILRD
        I W++W SRN  +F+ +        R  +      ++       T      +      R++     ++W+ P     K N D+ +++       GW +R+
Subjt:  IAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSEEREVGGLGWILRD

Query:  CRGEIVLAGNCKIREKKEIKALELMAIIRGLQSINMQSRSPIYLELDSVE-VTRLMKKEDVDLTENIWLVEEAVELAKTRGIVSIAHINRGNNSVAHELA
        C G IVL GN K++        E +  +  LQ I       ++ E DS   VT +   ED  L     L+ +           S+  +NR  NS A  LA
Subjt:  CRGEIVLAGNCKIREKKEIKALELMAIIRGLQSINMQSRSPIYLELDSVE-VTRLMKKEDVDLTENIWLVEEAVELAKTRGIVSIAHINRGNNSVAHELA

Query:  RRATHND
              D
Subjt:  RRATHND

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein6.7e-1226.73Show/hide
Query:  IAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSEEREVGGLGWILRD
        + W++W SRN  +F+G+  D  ++ R  +     + ++ +    TRR  +     P+   NL++   +W AP     K NTDA+W  E    G+GWILR+
Subjt:  IAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSEEREVGGLGWILRD

Query:  CRGEIVLAGNCKIREKKEIKALELMAIIRGLQSINMQSRSPIYLELDSVEVTRLMKKEDVDLTENIWLVEEAVELAKTRGIVSIAHINRGNNSVAHELAR
          G ++  G   +   K +   EL A+   + +++  +   I  E D+  +  L+  +D   T    L E+  +L      V      RG N VA  +AR
Subjt:  CRGEIVLAGNCKIREKKEIKALELMAIIRGLQSINMQSRSPIYLELDSVEVTRLMKKEDVDLTENIWLVEEAVELAKTRGIVSIAHINRGNNSVAHELAR

Query:  RA
         +
Subjt:  RA

AT4G29090.1 Ribonuclease H-like superfamily protein3.3e-1124.54Show/hide
Query:  EVATSMV--IAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSEEREV
        E A+ +V  + W++W +RN  +FRG   + Q++ R        ++++     + R   ++ G  P  + N +  G RW  P     K NTDA+W+ + E 
Subjt:  EVATSMV--IAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSEEREV

Query:  GGLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQSINMQSRSPIYLELDSVEVTRLMKKEDVDLTENIW-----LVEEAVELAKTRGIVSIAH
         G+GW+LR+ +GE+   G   + + K +   EL A+   + S++    + +  E DS  +  ++  ++      IW      +++   L      V    
Subjt:  GGLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQSINMQSRSPIYLELDSVEVTRLMKKEDVDLTENIW-----LVEEAVELAKTRGIVSIAH

Query:  INRGNNSVAHELARRA
        I R  N++A  +AR +
Subjt:  INRGNNSVAHELARRA

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.0e-0825.81Show/hide
Query:  IAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSEEREVGGLGWILRD
        + W+IW S N  +F       Q    ++ + +N   D    +  T  ++Q +G     R        +WS P  +  K N DAS  E   V GLGWILR+
Subjt:  IAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSEEREVGGLGWILRD

Query:  CRGEIVLAGNCKIREKKEIKALELMAIIRGLQSINMQSRSPIYLELDSVEVTRLM
         +G ++  G  K + +   +  E   +I  +Q+        +  E D+  +TR++
Subjt:  CRGEIVLAGNCKIREKKEIKALELMAIIRGLQSINMQSRSPIYLELDSVEVTRLM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACGAAAGATAGAGGGAGACCGCCCGACCTCACTGGAGAGATGGAGCTCTCTCCCCGTTCCAGATCCGACGAAGTGCGGCAATGATGACGGCGGTGGTCGGTGGCC
GGTGGGAAGAAGGGAGAGAGGCAGCTGGAATGAAGAGCTCATCAGGGAGTCCTTCAATGCTCAAGAGGCCGAAGCAATTCTCCAAATCCCTCTTCCTAGACATTGTCGGG
CAGATGAAGTCATTTGGAATAGAGATAAAAGGGGAGTGTTCAGTGTGAAAAGTGCTTATAAATTGGGAATGGAACTTGAGAACGCTAATTTAGCTTCCACTTCCAATTTG
GATTCCCATCGACAAATGAGATGGTTTCTCCTTTTATGGGAATATGAAGAGGTGGCTACTAGCATGGTTATTGCTTGGCAGATATGGGATAGCAGAAACAGAAGCATCTT
TAGAGGAGAAACCATTGATGAACAACAACTAGGCAGATCAATAGTCTTGTTCATAAATTCCAACATAGATAAAGGCACATGCATATCCCAAACGAGAAGGAGCCAACAAA
ATGATGGGTACCTGCCACGAGGAAGGGAGAATCTCAACATGCTTGGCGCTCGATGGAGTGCTCCCCGTACCAATTGCTGGAAACTCAATACAGATGCATCGTGGAGTGAG
GAGCGCGAAGTAGGTGGCCTAGGGTGGATTCTTCGTGACTGTAGGGGGGAGATTGTTTTGGCTGGAAACTGTAAAATCAGGGAGAAGAAGGAAATCAAAGCCCTAGAACT
AATGGCGATAATTCGAGGACTCCAATCCATCAACATGCAAAGTAGAAGTCCAATTTACCTTGAATTAGATTCAGTCGAAGTAACTCGATTAATGAAGAAGGAGGACGTCG
ATCTAACAGAAAATATTTGGCTTGTTGAGGAAGCTGTGGAATTAGCGAAAACGAGAGGAATTGTTTCGATCGCCCACATCAACAGAGGGAATAATTCAGTGGCCCACGAA
TTGGCGAGGAGAGCGACGCACAACGATGAGGAAGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAACGAAAGATAGAGGGAGACCGCCCGACCTCACTGGAGAGATGGAGCTCTCTCCCCGTTCCAGATCCGACGAAGTGCGGCAATGATGACGGCGGTGGTCGGTGGCC
GGTGGGAAGAAGGGAGAGAGGCAGCTGGAATGAAGAGCTCATCAGGGAGTCCTTCAATGCTCAAGAGGCCGAAGCAATTCTCCAAATCCCTCTTCCTAGACATTGTCGGG
CAGATGAAGTCATTTGGAATAGAGATAAAAGGGGAGTGTTCAGTGTGAAAAGTGCTTATAAATTGGGAATGGAACTTGAGAACGCTAATTTAGCTTCCACTTCCAATTTG
GATTCCCATCGACAAATGAGATGGTTTCTCCTTTTATGGGAATATGAAGAGGTGGCTACTAGCATGGTTATTGCTTGGCAGATATGGGATAGCAGAAACAGAAGCATCTT
TAGAGGAGAAACCATTGATGAACAACAACTAGGCAGATCAATAGTCTTGTTCATAAATTCCAACATAGATAAAGGCACATGCATATCCCAAACGAGAAGGAGCCAACAAA
ATGATGGGTACCTGCCACGAGGAAGGGAGAATCTCAACATGCTTGGCGCTCGATGGAGTGCTCCCCGTACCAATTGCTGGAAACTCAATACAGATGCATCGTGGAGTGAG
GAGCGCGAAGTAGGTGGCCTAGGGTGGATTCTTCGTGACTGTAGGGGGGAGATTGTTTTGGCTGGAAACTGTAAAATCAGGGAGAAGAAGGAAATCAAAGCCCTAGAACT
AATGGCGATAATTCGAGGACTCCAATCCATCAACATGCAAAGTAGAAGTCCAATTTACCTTGAATTAGATTCAGTCGAAGTAACTCGATTAATGAAGAAGGAGGACGTCG
ATCTAACAGAAAATATTTGGCTTGTTGAGGAAGCTGTGGAATTAGCGAAAACGAGAGGAATTGTTTCGATCGCCCACATCAACAGAGGGAATAATTCAGTGGCCCACGAA
TTGGCGAGGAGAGCGACGCACAACGATGAGGAAGAATGA
Protein sequenceShow/hide protein sequence
MERKIEGDRPTSLERWSSLPVPDPTKCGNDDGGGRWPVGRRERGSWNEELIRESFNAQEAEAILQIPLPRHCRADEVIWNRDKRGVFSVKSAYKLGMELENANLASTSNL
DSHRQMRWFLLLWEYEEVATSMVIAWQIWDSRNRSIFRGETIDEQQLGRSIVLFINSNIDKGTCISQTRRSQQNDGYLPRGRENLNMLGARWSAPRTNCWKLNTDASWSE
EREVGGLGWILRDCRGEIVLAGNCKIREKKEIKALELMAIIRGLQSINMQSRSPIYLELDSVEVTRLMKKEDVDLTENIWLVEEAVELAKTRGIVSIAHINRGNNSVAHE
LARRATHNDEEE