; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g03620 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g03620
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRibonuclease H
Genome locationchr3:2722725..2724819
RNA-Seq ExpressionMoc03g03620
SyntenyMoc03g03620
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
MQL96670.1 hypothetical protein [Colocasia esculenta]6.8e-3227.67Show/hide
Query:  EDLPPHSVGSYQDLASKFVDQFHYSCTNRRTEAHLSSIKQKPSEGFKAYVASFSNERLRVSNSNESVALWALSQGIKERKLVRSRGEYPTRSMVELMARI
        E LP  S+ S++ L   F D F    +  RT A L  ++QK  E    ++  F  E LR+S  +  +A  AL QG ++  L R+ G     ++ EL++  
Subjt:  EDLPPHSVGSYQDLASKFVDQFHYSCTNRRTEAHLSSIKQKPSEGFKAYVASFSNERLRVSNSNESVALWALSQGIKERKLVRSRGEYPTRSMVELMARI

Query:  NRHIAGEEMLAA-------QSLDKKPGE-ATWRDIGKSAERSVSKRKP-GVTRYLLESMGPR---------------STTMGDAAKKNRSKYCKFYRDHG
         RH A EE LAA       QS  K+P +  + RD  K      S R+P   T Y   ++ P                     D  K++++KYC+F+RDHG
Subjt:  NRHIAGEEMLAA-------QSLDKKPGE-ATWRDIGKSAERSVSKRKP-GVTRYLLESMGPR---------------STTMGDAAKKNRSKYCKFYRDHG

Query:  HDTTDYRDLKDQVESLVQRGYLKEFVDEGPELRHHRRKYNEGKVQLKLKCPKTSSYLYRRGANSQ-----------------------------------
        HDT++ R LKD++E L++RGYL  FV    E    R +  E  +  +    + +      GA  +                                   
Subjt:  HDTTDYRDLKDQVESLVQRGYLKEFVDEGPELRHHRRKYNEGKVQLKLKCPKTSSYLYRRGANSQ-----------------------------------

Query:  --------------IQPHNDALIVVVGIDHIPTHQVLVHGGNLTNVLHWRCYARLGGMSRSSKVVHRCWSASLLNEFILKGSIKLPLTLGEGKEAVIRIE
                        PH+DAL+V   I+H    ++LV  G+  +VL + C+  +G ++     V+              G I+LP+TLG   +AV +  
Subjt:  --------------IQPHNDALIVVVGIDHIPTHQVLVHGGNLTNVLHWRCYARLGGMSRSSKVVHRCWSASLLNEFILKGSIKLPLTLGEGKEAVIRIE

Query:  EFVVIDGPLAYNTIFGRPTIHHFRAVPEGM
        +F+VI+    YN I GR  I   +AVP  +
Subjt:  EFVVIDGPLAYNTIFGRPTIHHFRAVPEGM

MQM14562.1 hypothetical protein [Colocasia esculenta]6.8e-3227.67Show/hide
Query:  EDLPPHSVGSYQDLASKFVDQFHYSCTNRRTEAHLSSIKQKPSEGFKAYVASFSNERLRVSNSNESVALWALSQGIKERKLVRSRGEYPTRSMVELMARI
        E LP  S+ S++ L   F D F    +  R  A L +++QK SE    +V  F  E LR+S  +  +A  AL QG ++  L R+ G     ++ EL++  
Subjt:  EDLPPHSVGSYQDLASKFVDQFHYSCTNRRTEAHLSSIKQKPSEGFKAYVASFSNERLRVSNSNESVALWALSQGIKERKLVRSRGEYPTRSMVELMARI

Query:  NRHIAGEEMLAA-------QSLDKKPGE-ATWRDIGKSAERSVSKRKP-GVTRYLLESMGPR---------------STTMGDAAKKNRSKYCKFYRDHG
         RH A EE LAA       QS  K+P +  + RD  K      S R+P   T Y   ++ P                     D  K++++KYC+F+RDHG
Subjt:  NRHIAGEEMLAA-------QSLDKKPGE-ATWRDIGKSAERSVSKRKP-GVTRYLLESMGPR---------------STTMGDAAKKNRSKYCKFYRDHG

Query:  HDTTDYRDLKDQVESLVQRGYLKEFVDEGPELRHHRRKYNEGKVQLKLKCPKTSSYLYRRGANSQ-----------------------------------
        HDT + R LKD++E L++RGYL+ FV    E    R +  E  +  +    + +      GA  +                                   
Subjt:  HDTTDYRDLKDQVESLVQRGYLKEFVDEGPELRHHRRKYNEGKVQLKLKCPKTSSYLYRRGANSQ-----------------------------------

Query:  --------------IQPHNDALIVVVGIDHIPTHQVLVHGGNLTNVLHWRCYARLGGMSRSSKVVHRCWSASLLNEFILKGSIKLPLTLGEGKEAVIRIE
                        PH DAL+V   I+H    ++LV  G+  +VL + C+  +G  +     V+              G ++LP+TLG   +AV +I 
Subjt:  --------------IQPHNDALIVVVGIDHIPTHQVLVHGGNLTNVLHWRCYARLGGMSRSSKVVHRCWSASLLNEFILKGSIKLPLTLGEGKEAVIRIE

Query:  EFVVIDGPLAYNTIFGRPTIHHFRAVPEGM
        +F+V++    YN I GR  I   +AVP  +
Subjt:  EFVVIDGPLAYNTIFGRPTIHHFRAVPEGM

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]2.0e-3629.12Show/hide
Query:  LPPHSVGSYQDLASKFVDQFHYSCTNRRTEAHLSSIKQKPSEGFKAYVASFSNERLRVSNSNESVALWALSQGIKERKLVRSRGEYPTRSMVELMARINR
        LP   + +Y  L  +F+ QF     +R+T  HL++I+QK  E  + YV  F  E+L+V++ ++  A+     G+ +  L     E    +  E++ +  +
Subjt:  LPPHSVGSYQDLASKFVDQFHYSCTNRRTEAHLSSIKQKPSEGFKAYVASFSNERLRVSNSNESVALWALSQGIKERKLVRSRGEYPTRSMVELMARINR

Query:  HIAGEEMLA-----------------------AQSLDKKPGEATWRDIGKSAERSVSKRKP----------------GVTRYLLESMGPRSTTM-GDAAK
         I G+E+L                        ++S DK P  ++ R   + +  S ++ +P                 +    +E +  R   + GD  K
Subjt:  HIAGEEMLA-----------------------AQSLDKKPGEATWRDIGKSAERSVSKRKP----------------GVTRYLLESMGPRSTTM-GDAAK

Query:  KNRSKYCKFYRDHGHDTTDYRDLKDQVESLVQRGYLKEFV-----------DEGPELRHHRRKYNEGKVQLKLK----------C-----PKTSSYLYRR
        +N  KYC+F+RDHGH+T++Y +LK Q+E L+Q GY K+FV           +E   LR   R+ +   V  K K          C       TSS  +  
Subjt:  KNRSKYCKFYRDHGHDTTDYRDLKDQVESLVQRGYLKEFV-----------DEGPELRHHRRKYNEGKVQLKLK----------C-----PKTSSYLYRR

Query:  GANSQIQ-PHNDALIVVVGIDHIPTHQVLVHGGNLTNVLHWRCYARLGGMSRSSKVVHRCWSASLLNEFILKGSIKLPLTLGEGKEAVIRIEEFVVIDGP
             +  PHNDAL++   ID +   ++LV GG   N+L    Y  LG      K               L+G I LP+++ +    V ++ EFVVIDG 
Subjt:  GANSQIQ-PHNDALIVVVGIDHIPTHQVLVHGGNLTNVLHWRCYARLGGMSRSSKVVHRCWSASLLNEFILKGSIKLPLTLGEGKEAVIRIEEFVVIDGP

Query:  LAYNTIFGRPTIHHFRAVP
         AYN IFGRP IH FRAVP
Subjt:  LAYNTIFGRPTIHHFRAVP

XP_022153957.1 uncharacterized protein LOC111021344 [Momordica charantia]3.2e-3729.34Show/hide
Query:  LPPHSVGSYQDLASKFVDQFHYSCTNRRTEAHLSSIKQKPSEGFKAYVASFSNERLRVSNSNESVALWALSQGIKERKLVRSRGEYPTRSMVELMARINR
        LP  S+ +Y  L  + + QF     +R+T  HL++I+QK  E  + YV  F  E+L+V++ ++  A+     G+ +  L    GE    +  E++ +  +
Subjt:  LPPHSVGSYQDLASKFVDQFHYSCTNRRTEAHLSSIKQKPSEGFKAYVASFSNERLRVSNSNESVALWALSQGIKERKLVRSRGEYPTRSMVELMARINR

Query:  HIAGEEMLAAQS------LDKKPGEATWRDIGKSAERSVSKRKPGVTRYLLESMGP---------RSTTM-------------------------GDAAK
         I G+E+L  ++      +D+K      R     ++   S      T Y     GP          STT+                         GD  K
Subjt:  HIAGEEMLAAQS------LDKKPGEATWRDIGKSAERSVSKRKPGVTRYLLESMGP---------RSTTM-------------------------GDAAK

Query:  KNRSKYCKFYRDHGHDTTDYRDLKDQVESLVQRGYLKEFVDEGPELRHHRRKYNEGKVQLKLKCPKTSSYLYRRGANSQIQPHNDALIVVVGIDHIPTHQ
        +N+ KYC+F+RDHGH+TT   +LK Q+E L+Q  Y K     G + +   R+       ++ + P                PHNDAL++   IDH+   +
Subjt:  KNRSKYCKFYRDHGHDTTDYRDLKDQVESLVQRGYLKEFVDEGPELRHHRRKYNEGKVQLKLKCPKTSSYLYRRGANSQIQPHNDALIVVVGIDHIPTHQ

Query:  VLVHGGNLTNVLHWRCYARLGGMSRSSKVVHRCWSASLLNEFILKGSIKLPLTLGEGKEAVIRIEEFVVIDGPLAYNTIFGRPTIHHFRAVP
        VLV G    N+L    Y  LG      K     +          +G I LP+T+G+    V ++ EFVVIDG  AYN IFGRP IH F AVP
Subjt:  VLVHGGNLTNVLHWRCYARLGGMSRSSKVVHRCWSASLLNEFILKGSIKLPLTLGEGKEAVIRIEEFVVIDGPLAYNTIFGRPTIHHFRAVP

XP_022158513.1 uncharacterized protein LOC111024986 [Momordica charantia]2.3e-3549.75Show/hide
Query:  DAAKKNRSKYCKFYRDHGHDTTDYRDLKDQVESLVQRGYLKEFVDEGPELRHHRRKYNEGKVQLKLKCPKTSSYLYRRGANSQIQPHNDALIVVVGIDHI
        DA KK RS Y KF+RDH H+T + RD+KD+V+SL QRG LKEF+     +       N+ K   +    K          N+Q+QPHND  +VV  IDH 
Subjt:  DAAKKNRSKYCKFYRDHGHDTTDYRDLKDQVESLVQRGYLKEFVDEGPELRHHRRKYNEGKVQLKLKCPKTSSYLYRRGANSQIQPHNDALIVVVGIDHI

Query:  PTHQVLVHGGNLTNVLHWRCYARLGGMSRSSKVVHRCWSASLLNEFI-----LKGSIKLPLTLGEGKEAVIRIEEFVVIDGPLAYNTIFGRPTIHHFRAV
        PT  +LV GG+L NVL       LG    + K   RC  A LL  F      L+GSI+LPLTLGEGKE V RIEEF+VIDG LAYNTI GR  IH+ RAV
Subjt:  PTHQVLVHGGNLTNVLHWRCYARLGGMSRSSKVVHRCWSASLLNEFI-----LKGSIKLPLTLGEGKEAVIRIEEFVVIDGPLAYNTIFGRPTIHHFRAV

Query:  P
        P
Subjt:  P

TrEMBL top hitse value%identityAlignment
A0A6J1DHB3 uncharacterized protein LOC1110204799.9e-3729.12Show/hide
Query:  LPPHSVGSYQDLASKFVDQFHYSCTNRRTEAHLSSIKQKPSEGFKAYVASFSNERLRVSNSNESVALWALSQGIKERKLVRSRGEYPTRSMVELMARINR
        LP   + +Y  L  +F+ QF     +R+T  HL++I+QK  E  + YV  F  E+L+V++ ++  A+     G+ +  L     E    +  E++ +  +
Subjt:  LPPHSVGSYQDLASKFVDQFHYSCTNRRTEAHLSSIKQKPSEGFKAYVASFSNERLRVSNSNESVALWALSQGIKERKLVRSRGEYPTRSMVELMARINR

Query:  HIAGEEMLA-----------------------AQSLDKKPGEATWRDIGKSAERSVSKRKP----------------GVTRYLLESMGPRSTTM-GDAAK
         I G+E+L                        ++S DK P  ++ R   + +  S ++ +P                 +    +E +  R   + GD  K
Subjt:  HIAGEEMLA-----------------------AQSLDKKPGEATWRDIGKSAERSVSKRKP----------------GVTRYLLESMGPRSTTM-GDAAK

Query:  KNRSKYCKFYRDHGHDTTDYRDLKDQVESLVQRGYLKEFV-----------DEGPELRHHRRKYNEGKVQLKLK----------C-----PKTSSYLYRR
        +N  KYC+F+RDHGH+T++Y +LK Q+E L+Q GY K+FV           +E   LR   R+ +   V  K K          C       TSS  +  
Subjt:  KNRSKYCKFYRDHGHDTTDYRDLKDQVESLVQRGYLKEFV-----------DEGPELRHHRRKYNEGKVQLKLK----------C-----PKTSSYLYRR

Query:  GANSQIQ-PHNDALIVVVGIDHIPTHQVLVHGGNLTNVLHWRCYARLGGMSRSSKVVHRCWSASLLNEFILKGSIKLPLTLGEGKEAVIRIEEFVVIDGP
             +  PHNDAL++   ID +   ++LV GG   N+L    Y  LG      K               L+G I LP+++ +    V ++ EFVVIDG 
Subjt:  GANSQIQ-PHNDALIVVVGIDHIPTHQVLVHGGNLTNVLHWRCYARLGGMSRSSKVVHRCWSASLLNEFILKGSIKLPLTLGEGKEAVIRIEEFVVIDGP

Query:  LAYNTIFGRPTIHHFRAVP
         AYN IFGRP IH FRAVP
Subjt:  LAYNTIFGRPTIHHFRAVP

A0A6J1DKD3 uncharacterized protein LOC1110213441.5e-3729.34Show/hide
Query:  LPPHSVGSYQDLASKFVDQFHYSCTNRRTEAHLSSIKQKPSEGFKAYVASFSNERLRVSNSNESVALWALSQGIKERKLVRSRGEYPTRSMVELMARINR
        LP  S+ +Y  L  + + QF     +R+T  HL++I+QK  E  + YV  F  E+L+V++ ++  A+     G+ +  L    GE    +  E++ +  +
Subjt:  LPPHSVGSYQDLASKFVDQFHYSCTNRRTEAHLSSIKQKPSEGFKAYVASFSNERLRVSNSNESVALWALSQGIKERKLVRSRGEYPTRSMVELMARINR

Query:  HIAGEEMLAAQS------LDKKPGEATWRDIGKSAERSVSKRKPGVTRYLLESMGP---------RSTTM-------------------------GDAAK
         I G+E+L  ++      +D+K      R     ++   S      T Y     GP          STT+                         GD  K
Subjt:  HIAGEEMLAAQS------LDKKPGEATWRDIGKSAERSVSKRKPGVTRYLLESMGP---------RSTTM-------------------------GDAAK

Query:  KNRSKYCKFYRDHGHDTTDYRDLKDQVESLVQRGYLKEFVDEGPELRHHRRKYNEGKVQLKLKCPKTSSYLYRRGANSQIQPHNDALIVVVGIDHIPTHQ
        +N+ KYC+F+RDHGH+TT   +LK Q+E L+Q  Y K     G + +   R+       ++ + P                PHNDAL++   IDH+   +
Subjt:  KNRSKYCKFYRDHGHDTTDYRDLKDQVESLVQRGYLKEFVDEGPELRHHRRKYNEGKVQLKLKCPKTSSYLYRRGANSQIQPHNDALIVVVGIDHIPTHQ

Query:  VLVHGGNLTNVLHWRCYARLGGMSRSSKVVHRCWSASLLNEFILKGSIKLPLTLGEGKEAVIRIEEFVVIDGPLAYNTIFGRPTIHHFRAVP
        VLV G    N+L    Y  LG      K     +          +G I LP+T+G+    V ++ EFVVIDG  AYN IFGRP IH F AVP
Subjt:  VLVHGGNLTNVLHWRCYARLGGMSRSSKVVHRCWSASLLNEFILKGSIKLPLTLGEGKEAVIRIEEFVVIDGPLAYNTIFGRPTIHHFRAVP

A0A6J1DTD9 uncharacterized protein LOC1110241448.1e-3127.46Show/hide
Query:  LPPHSVGSYQDLASKFVDQFHYSCTNRRTEAHLSSIKQKPSEGFKAYVASFSNERLRVSNSNESVALWALSQGIKERKLVRSRGEYPTRSMVELMARINR
        L   S+ +Y  L  +F+ QF Y   +R+T  HL++I+QK  E  + YV  F  E+L+V++ ++   +     G+ +  L    GE    +  E++ +  +
Subjt:  LPPHSVGSYQDLASKFVDQFHYSCTNRRTEAHLSSIKQKPSEGFKAYVASFSNERLRVSNSNESVALWALSQGIKERKLVRSRGEYPTRSMVELMARINR

Query:  HIAGEEMLAA-------QSLDKKPG-------EATWRDIGKSAE--------RSVSKRKPGVTRYL-----------------LESMGPRSTTM-GDAAK
         I G+E+L         +S  KKPG       ++ ++D G S++         S S  +    RY                  LE +  R   + GD  K
Subjt:  HIAGEEMLAA-------QSLDKKPG-------EATWRDIGKSAE--------RSVSKRKPGVTRYL-----------------LESMGPRSTTM-GDAAK

Query:  KNRSKYCKFYRDHGHDTTDYRDLKDQVESLVQRGYLKEFVDE--GPELRHHRRKYNEGKVQLKLKCPKTSSYLYRRGANSQIQPHNDALIVVVGIDHIPT
        +N+  YC+F+R+H HDT+   +LK Q+E L+Q GY K++V +     +   + + +      +   P   + ++   +  Q       L+  V +D    
Subjt:  KNRSKYCKFYRDHGHDTTDYRDLKDQVESLVQRGYLKEFVDE--GPELRHHRRKYNEGKVQLKLKCPKTSSYLYRRGANSQIQPHNDALIVVVGIDHIPT

Query:  HQVLVHGGNLTNVLHWRCYARLGGMSRSSKVVHRCWSASLLNEFILKGSIKLPLTLGEGKEAVIRIEEFVVIDGPLAYNTIFGRPTIHHFRAVPEGM
               G   N+L    Y  LG      K               L+  I L +T+G+G   V R+ EFVVIDG  AYN IFGRP IH  RA+P  M
Subjt:  HQVLVHGGNLTNVLHWRCYARLGGMSRSSKVVHRCWSASLLNEFILKGSIKLPLTLGEGKEAVIRIEEFVVIDGPLAYNTIFGRPTIHHFRAVPEGM

A0A6J1DWB0 uncharacterized protein LOC1110249861.1e-3549.75Show/hide
Query:  DAAKKNRSKYCKFYRDHGHDTTDYRDLKDQVESLVQRGYLKEFVDEGPELRHHRRKYNEGKVQLKLKCPKTSSYLYRRGANSQIQPHNDALIVVVGIDHI
        DA KK RS Y KF+RDH H+T + RD+KD+V+SL QRG LKEF+     +       N+ K   +    K          N+Q+QPHND  +VV  IDH 
Subjt:  DAAKKNRSKYCKFYRDHGHDTTDYRDLKDQVESLVQRGYLKEFVDEGPELRHHRRKYNEGKVQLKLKCPKTSSYLYRRGANSQIQPHNDALIVVVGIDHI

Query:  PTHQVLVHGGNLTNVLHWRCYARLGGMSRSSKVVHRCWSASLLNEFI-----LKGSIKLPLTLGEGKEAVIRIEEFVVIDGPLAYNTIFGRPTIHHFRAV
        PT  +LV GG+L NVL       LG    + K   RC  A LL  F      L+GSI+LPLTLGEGKE V RIEEF+VIDG LAYNTI GR  IH+ RAV
Subjt:  PTHQVLVHGGNLTNVLHWRCYARLGGMSRSSKVVHRCWSASLLNEFI-----LKGSIKLPLTLGEGKEAVIRIEEFVVIDGPLAYNTIFGRPTIHHFRAV

Query:  P
        P
Subjt:  P

A0A6J1DZB9 uncharacterized protein LOC1110249041.4e-3027.76Show/hide
Query:  LPPHSVGSYQDLASKFVDQFHYSCTNRRTEAHLSSIKQKPSEGFKAYVASFSNERLRVSNSNESVALWALSQGIKERKLVRSRGEYPTRSMVELMARINR
        LP  S+ +Y  L  +F+ QF     +R+T  HL++I+QK  E  + YV  F  E+L+V++ ++  A+      + +  L    GE    + VE++ +  +
Subjt:  LPPHSVGSYQDLASKFVDQFHYSCTNRRTEAHLSSIKQKPSEGFKAYVASFSNERLRVSNSNESVALWALSQGIKERKLVRSRGEYPTRSMVELMARINR

Query:  HIAGEEMLAAQ-------------SLDKKPGEATWRDIGKSAERS--------------------------VSKRKPGVTRYLLESMGPRSTTM-GDAAK
         I G+E+L  +             S +K+  ++  RD G S+  S                          +S+    +    +E +  R   + GD  K
Subjt:  HIAGEEMLAAQ-------------SLDKKPGEATWRDIGKSAERS--------------------------VSKRKPGVTRYLLESMGPRSTTM-GDAAK

Query:  KNRSKYCKFYRDHGHDTTDYRDLKDQVESLVQRGYLKEFVDEGPELRHHRRKYNEGKVQL---KLKCPKTSSYLY-----RRGANSQIQPHNDALIVVVG
        +N+ KYC+F+RDHGH+TT   +LK Q+E L+Q GY K+FV + P      +K    + +    +   P   + ++      +  N + +   +A   V  
Subjt:  KNRSKYCKFYRDHGHDTTDYRDLKDQVESLVQRGYLKEFVDEGPELRHHRRKYNEGKVQL---KLKCPKTSSYLY-----RRGANSQIQPHNDALIVVVG

Query:  I-DHIPTHQVLVHGGNLTNVLHWRCYARLGGMSRSSKVVHRCWSASLLNEFILK------GSIKLPLTLGEGKEAVIRIEEFVVIDGPLAYNTIFGRPTI
        I +H PT  +     +L  V           +  +  +V     ASL++  +++      G I LP+T+G+    V ++ EFVVIDG  AYN IFGRP I
Subjt:  I-DHIPTHQVLVHGGNLTNVLHWRCYARLGGMSRSSKVVHRCWSASLLNEFILK------GSIKLPLTLGEGKEAVIRIEEFVVIDGPLAYNTIFGRPTI

Query:  HHFRAVP
        H FRAVP
Subjt:  HHFRAVP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTTCCGACAAAGAGTCGTCCTACGAAAACCCCGTCGACGAAGAGCGTCGACGTGACAAAGCCACGGAAAAGCCTGTCCCAACACCCCCAGCCCCTCAAAAGGGAAA
AGGGATATGGGACACAGTCAGCTCGATCCGCCTGGAAAAGCTCTGGGTTCCGCAACGAGATGAGAACGGCCCATGCCCAGATGACTGCCCAAGTGAGGATCTACCTCCGC
ACTCGGTTGGGAGCTACCAGGACCTAGCGAGCAAGTTCGTAGACCAGTTCCATTACTCCTGCACCAACCGGCGAACGGAAGCCCATCTCTCTTCGATCAAGCAGAAGCCA
AGCGAGGGCTTTAAGGCGTATGTCGCAAGCTTCTCGAATGAGAGGCTTCGAGTATCCAACAGTAACGAAAGTGTGGCCCTATGGGCACTCTCCCAAGGCATCAAAGAAAG
AAAGCTTGTACGATCGCGTGGTGAGTACCCCACTAGATCTATGGTCGAACTCATGGCCAGGATCAATAGGCACATAGCTGGAGAGGAGATGCTCGCAGCCCAATCCCTAG
ACAAAAAACCAGGCGAAGCGACTTGGCGCGATATAGGGAAGTCGGCGGAGAGATCGGTATCAAAGAGAAAGCCTGGCGTGACTCGATACTTGCTTGAATCGATGGGACCC
CGGTCGACAACCATGGGCGACGCCGCCAAGAAGAACAGGAGCAAGTATTGCAAATTCTATCGAGACCACGGACATGACACTACCGATTACAGAGACCTCAAGGATCAGGT
CGAGTCCCTGGTTCAAAGAGGATATTTGAAGGAGTTTGTCGATGAAGGCCCGGAGCTTAGACACCATAGACGAAAGTATAATGAAGGTAAAGTACAGCTAAAGCTAAAGT
GTCCAAAGACCAGCAGTTACCTTTACAGACGGGGAGCGAACTCCCAGATCCAGCCGCACAACGACGCCCTTATCGTTGTCGTCGGGATCGACCACATACCTACACATCAA
GTGCTGGTACATGGAGGGAACTTGACAAACGTCCTACACTGGCGGTGCTACGCACGCTTGGGTGGGATGTCACGAAGCTCAAAAGTTGTGCATCGCTGCTGGTCGGCTTC
TCTGTTGAACGAGTTTATCCTGAAAGGAAGCATCAAATTGCCCCTCACTCTGGGGGAAGGAAAAGAAGCAGTCATTCGAATCGAGGAGTTCGTGGTGATCGACGGTCCCC
TTGCTTACAACACCATCTTCGGAAGGCCGACGATACACCACTTCAGGGCTGTCCCTGAAGGGATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGACTTCCGACAAAGAGTCGTCCTACGAAAACCCCGTCGACGAAGAGCGTCGACGTGACAAAGCCACGGAAAAGCCTGTCCCAACACCCCCAGCCCCTCAAAAGGGAAA
AGGGATATGGGACACAGTCAGCTCGATCCGCCTGGAAAAGCTCTGGGTTCCGCAACGAGATGAGAACGGCCCATGCCCAGATGACTGCCCAAGTGAGGATCTACCTCCGC
ACTCGGTTGGGAGCTACCAGGACCTAGCGAGCAAGTTCGTAGACCAGTTCCATTACTCCTGCACCAACCGGCGAACGGAAGCCCATCTCTCTTCGATCAAGCAGAAGCCA
AGCGAGGGCTTTAAGGCGTATGTCGCAAGCTTCTCGAATGAGAGGCTTCGAGTATCCAACAGTAACGAAAGTGTGGCCCTATGGGCACTCTCCCAAGGCATCAAAGAAAG
AAAGCTTGTACGATCGCGTGGTGAGTACCCCACTAGATCTATGGTCGAACTCATGGCCAGGATCAATAGGCACATAGCTGGAGAGGAGATGCTCGCAGCCCAATCCCTAG
ACAAAAAACCAGGCGAAGCGACTTGGCGCGATATAGGGAAGTCGGCGGAGAGATCGGTATCAAAGAGAAAGCCTGGCGTGACTCGATACTTGCTTGAATCGATGGGACCC
CGGTCGACAACCATGGGCGACGCCGCCAAGAAGAACAGGAGCAAGTATTGCAAATTCTATCGAGACCACGGACATGACACTACCGATTACAGAGACCTCAAGGATCAGGT
CGAGTCCCTGGTTCAAAGAGGATATTTGAAGGAGTTTGTCGATGAAGGCCCGGAGCTTAGACACCATAGACGAAAGTATAATGAAGGTAAAGTACAGCTAAAGCTAAAGT
GTCCAAAGACCAGCAGTTACCTTTACAGACGGGGAGCGAACTCCCAGATCCAGCCGCACAACGACGCCCTTATCGTTGTCGTCGGGATCGACCACATACCTACACATCAA
GTGCTGGTACATGGAGGGAACTTGACAAACGTCCTACACTGGCGGTGCTACGCACGCTTGGGTGGGATGTCACGAAGCTCAAAAGTTGTGCATCGCTGCTGGTCGGCTTC
TCTGTTGAACGAGTTTATCCTGAAAGGAAGCATCAAATTGCCCCTCACTCTGGGGGAAGGAAAAGAAGCAGTCATTCGAATCGAGGAGTTCGTGGTGATCGACGGTCCCC
TTGCTTACAACACCATCTTCGGAAGGCCGACGATACACCACTTCAGGGCTGTCCCTGAAGGGATGTGA
Protein sequenceShow/hide protein sequence
MTSDKESSYENPVDEERRRDKATEKPVPTPPAPQKGKGIWDTVSSIRLEKLWVPQRDENGPCPDDCPSEDLPPHSVGSYQDLASKFVDQFHYSCTNRRTEAHLSSIKQKP
SEGFKAYVASFSNERLRVSNSNESVALWALSQGIKERKLVRSRGEYPTRSMVELMARINRHIAGEEMLAAQSLDKKPGEATWRDIGKSAERSVSKRKPGVTRYLLESMGP
RSTTMGDAAKKNRSKYCKFYRDHGHDTTDYRDLKDQVESLVQRGYLKEFVDEGPELRHHRRKYNEGKVQLKLKCPKTSSYLYRRGANSQIQPHNDALIVVVGIDHIPTHQ
VLVHGGNLTNVLHWRCYARLGGMSRSSKVVHRCWSASLLNEFILKGSIKLPLTLGEGKEAVIRIEEFVVIDGPLAYNTIFGRPTIHHFRAVPEGM