CuGenDBv2

Moc06g15380 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g15380
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Ribonuclease H
Genome location: chr6:12148094..12150265
RNA-Seq Expression: Moc06g15380
Synteny: Moc06g15380
Gene Ontology terms: NA
InterPro domains: NA
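
The genome location above follows a `chromosome:start..end` pattern. A minimal sketch of splitting it into its parts and computing the span, assuming the coordinates are 1-based and inclusive (the page does not state this); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool:

```python
import re

def parse_location(location):
    """Split a string like 'chr6:12148094..12150265' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

chrom, start, end = parse_location("chr6:12148094..12150265")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 2,172 bp on chr6.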


Homology
GenBank top hits    e value    % identity    Alignment
XP_022149836.1 uncharacterized protein LOC111018172 [Momordica charantia]    2.2e-40    43.79
Query:  MAKENGSRSGSNENENGDRDLPNDRPGKELERPPLSKKLKTRADARKTKSTPVEVSSSEQMQSQLAKMRAHLKAMIQGLLNAGVSLPSSSVTRRKGKRRR
        MA+ NGS SGSNEN+NGDRDLPN R G++ + PPL +K KT     + +       + +++              +Q +L+A +S  S   T  K     
Subjt:  MAKENGSRSGSNENENGDRDLPNDRPGKELERPPLSKKLKTRADARKTKSTPVEVSSSEQMQSQLAKMRAHLKAMIQGLLNAGVSLPSSSVTRRKGKRRR

Query:  ARILRKSEMSKTRGPKRVDLEKEHLEESRQYELPGRVRPAEKGAEEPTSDVQQGVLHNSARTNEEVVQTTGPYSISSWKQLR----------KKPKEPIR
                          D  K+ ++    YE    +      ++          L   AR   +  +    YSISS KQLR          K  K    
Subjt:  ARILRKSEMSKTRGPKRVDLEKEHLEESRQYELPGRVRPAEKGAEEPTSDVQQGVLHNSARTNEEVVQTTGPYSISSWKQLR----------KKPKEPIR

Query:  DYIKCFLSEQIKVENYTDLLARSAFYNGLTHEKLSWSLAKKPKPTLKGCLDRATKFIEVEDIMMSKEDKYSMKGSPSKSEKKKDEKKNGNGGNG-GNTF-
        DYIK FLSEQIKVE  TDLLARSAF NG+THEKLSWSLAKKP+ TLKGCLDRATKFIE EDIMMSKEDKYS  GSPSK EK KD+KKNGNGGNG GN+F 
Subjt:  DYIKCFLSEQIKVENYTDLLARSAFYNGLTHEKLSWSLAKKPKPTLKGCLDRATKFIEVEDIMMSKEDKYSMKGSPSKSEKKKDEKKNGNGGNG-GNTF-

Query:  -RRPDY-----GRQDDQKSDKP
          R  Y     G++D +++  P
Subjt:  -RRPDY-----GRQDDQKSDKP

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]    9.1e-31    30.21
Query:  LRKKPKEPIRDYIKCFLSEQIKVENYTDLLARSAFYNGLTHEKLSWSLAKKPKPTLKGCLDRATKFIEVEDIMMSK---EDKYSMKGSPSKSEKKKDEKK
        +R+K  E +R+Y+  F  EQ+KV + +D  A   F  GL  E L+  L ++   T    L +  K I+ ++++ +K    +K   +G   K + K D K 
Subjt:  LRKKPKEPIRDYIKCFLSEQIKVENYTDLLARSAFYNGLTHEKLSWSLAKKPKPTLKGCLDRATKFIEVEDIMMSK---EDKYSMKGSPSKSEKKKDEKK

Query:  NGNGGNGGNTFRRPDYGRQDDQKSDKPQRFSTFTQLRVPLSEILANIQDLNLD-LLAESRKMQKDLEQRDKSKYCRYHRDHGHYTSYYYDLRQQIEGLIR
           G +  ++  R DY R+ +   ++ + +  +T   +P+ EIL NI++  ++ LL    K++ D E+R+  KYCR+HRDHGH TS Y++L++QIE LI+
Subjt:  NGNGGNGGNTFRRPDYGRQDDQKSDKPQRFSTFTQLRVPLSEILANIQDLNLD-LLAESRKMQKDLEQRDKSKYCRYHRDHGHYTSYYYDLRQQIEGLIR

Query:  QG---------TSKSISISETKKKVLILP------------AEEGKGPRRWWCF--------SKYTNTVHLQGARL-----------------------G
         G          S S+   E +K++   P             E  +  RR  C         S   N   L+G  L                       G
Subjt:  QG---------TSKSISISETKKKVLILP------------AEEGKGPRRWWCF--------SKYTNTVHLQGARL-----------------------G

Query:  PSSAEEESN--------TASQI----------------LQG-----------------KFEFVIFDDKLTYNAIFDRPILHTLEAVSSTYHQMMKYPTPK
         +SA   S         T SQ+                L+G                   EFV+ D +  YNAIF RPI+H+  AV ST HQ++KY T  
Subjt:  PSSAEEESN--------TASQI----------------LQG-----------------KFEFVIFDDKLTYNAIFDRPILHTLEAVSSTYHQMMKYPTPK

Query:  GVRIVRGEQKTSRECYAAALKGSNTYA
        GV  VRGE KTSRECYA+  K S+  A
Subjt:  GVRIVRGEQKTSRECYAAALKGSNTYA

XP_022154405.1 uncharacterized protein LOC111021682 [Momordica charantia]    6.3e-32    32.24
Query:  FYNGLTHEKLSWSLAKKPKPTLKGCLDRATKFIEVEDIMMSK---EDKYSMKGSPSKSEKKKDEKKNGNGGNGGNTFRRPDYGRQDDQKSDKPQRFSTFT
        F  GL  E L+  L ++   T    L +A K I+ ++++ +K    +K   +  PS+ ++K D K    G +  ++  R DY R D   S + + +  +T
Subjt:  FYNGLTHEKLSWSLAKKPKPTLKGCLDRATKFIEVEDIMMSK---EDKYSMKGSPSKSEKKKDEKKNGNGGNGGNTFRRPDYGRQDDQKSDKPQRFSTFT

Query:  QLRVPLSEILANIQDLNLD-LLAESRKMQKDLEQRDKSKYCRYHRDHGHYTSYYYDLRQQIEGLIRQGTSK---------SISISETKKKVLILPAEEGK
           + +SEIL NI++  ++ LL    K+++DLE+R+K KYC +HRDHGH TS Y++L++QIE LI+ G  K         S+   E KK+    P  + +
Subjt:  QLRVPLSEILANIQDLNLD-LLAESRKMQKDLEQRDKSKYCRYHRDHGHYTSYYYDLRQQIEGLIRQGTSK---------SISISETKKKVLILPAEEGK

Query:  ---------GP----------------RRWWCF---SKYTNTV-----HLQGARLGPSSAEEESNTASQILQGKF------------------------E
                 GP                +R  C     K T ++      L+G  L  + A   +     +L  +                         E
Subjt:  ---------GP----------------RRWWCF---SKYTNTV-----HLQGARLGPSSAEEESNTASQILQGKF------------------------E

Query:  FVIFDDKLTYNAIFDRPILHTLEAVSSTYHQMMKYPTPKGVRIVRGEQKTSRECYAAALKGSNTYA
        FV+ D +  YNAIF RPI+H+   V ST HQ++KY TP GV  VRGEQKT RECYA+ALKGS+  A
Subjt:  FVIFDDKLTYNAIFDRPILHTLEAVSSTYHQMMKYPTPKGVRIVRGEQKTSRECYAAALKGSNTYA

XP_022158257.1 uncharacterized protein LOC111024791 [Momordica charantia]    2.4e-31    32.7
Query:  FLSEQIKVENYTDLLARSAFYNGLTHEKLSWSLAKKPKPTLKGCLDRATKFIEVEDIMMSKEDKYSMKGSPSKS--EKKKDEKKNGNGGNGGNTFRRPDY
        F  EQ+KV + +D  A+  F+ GL  E L+  L ++   T    L +A K I+ ++++  K  +   +    K+  EK++ E K+ + G+  ++  R +Y
Subjt:  FLSEQIKVENYTDLLARSAFYNGLTHEKLSWSLAKKPKPTLKGCLDRATKFIEVEDIMMSKEDKYSMKGSPSKS--EKKKDEKKNGNGGNGGNTFRRPDY

Query:  GRQDDQKSDKPQRFSTFTQLRVPLSEILANIQDLNLD-LLAESRKMQKDLEQRDKSKYCRYHRDHGHYTSYYYDLRQQIEGLIRQGTSKSISISETKKKV
           D+  S + + +  +T   + +SEIL NI++  ++ LL   +K++ DLE+R+K KYC +HRDHGH TS  ++L++QIE LI+ G  K   + +++  +
Subjt:  GRQDDQKSDKPQRFSTFTQLRVPLSEILANIQDLNLD-LLAESRKMQKDLEQRDKSKYCRYHRDHGHYTSYYYDLRQQIEGLIRQGTSKSISISETKKKV

Query:  LILPAE--EGKGPRRWWCFSKYTNTVHLQGARLGPSSAEEESNTASQILQGKFEFVIFDDKLTYNAIFDRPILHTLEAVSSTYHQMMKYPTPKGVRIVRG
        +    E    + P R        NT+   G   G  S  +    A +  +   EFV+ D K  YNAIF RPI+H+  AV ST HQ++KY TP GV  VR 
Subjt:  LILPAE--EGKGPRRWWCFSKYTNTVHLQGARLGPSSAEEESNTASQILQGKFEFVIFDDKLTYNAIFDRPILHTLEAVSSTYHQMMKYPTPKGVRIVRG

Query:  EQKTSRECYAAALKG
         +K          KG
Subjt:  EQKTSRECYAAALKG

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]    1.5e-36    32.32
Query:  LRKKPKEPIRDYIKCFLSEQIKVENYTDLLARSAFYNGLTHEKLSWSLAKKPKPTLKGCLDRATKFIEVEDIMMSKEDKYSMKGSPSK--SEKKKDEKKN
        +R+K +E +R+Y+  F  EQ+KV + +D  A   F   L  E L+  L ++   T    L +A K I+ ++++ +K  +   +    K   EK+K + K+
Subjt:  LRKKPKEPIRDYIKCFLSEQIKVENYTDLLARSAFYNGLTHEKLSWSLAKKPKPTLKGCLDRATKFIEVEDIMMSKEDKYSMKGSPSK--SEKKKDEKKN

Query:  GNGGNGGNTFRRPDYGRQDDQKSDKPQRFSTFTQLRVPLSEILANIQDLNLD-LLAESRKMQKDLEQRDKSKYCRYHRDHGHYTSYYYDLRQQIEGLIRQ
         + G+  +   R +Y R +   S + + +  +T   +P+SEIL NI++  ++ LL    K++ DLE+R+K KYCR+HRDHGH T+  ++L++QIE LI+ 
Subjt:  GNGGNGGNTFRRPDYGRQDDQKSDKPQRFSTFTQLRVPLSEILANIQDLNLD-LLAESRKMQKDLEQRDKSKYCRYHRDHGHYTSYYYDLRQQIEGLIRQ

Query:  G---------TSKSISISETKKKVLILPAEEGK---------GP----------------RRWWCF--------SKYTNTVHLQGARLGPSSA-------
        G          S S+   E +K+    P  E +         GP                RR  C         S       L+G  L  + A       
Subjt:  G---------TSKSISISETKKKVLILPAEEGK---------GP----------------RRWWCF--------SKYTNTVHLQGARLGPSSA-------

Query:  ------------------EEESNTASQILQGKFEFVIFDDKLTYNAIFDRPILHTLEAVSSTYHQMMKYPTPKGVRIVRGEQKTSRECYAAALKGS
                                A+Q+ Q   EFV+ D +  YNAIF RPI+H+  AV ST HQ++KY TP  V +VRGEQKTSRECYA+ALKGS
Subjt:  ------------------EEESNTASQILQGKFEFVIFDDKLTYNAIFDRPILHTLEAVSSTYHQMMKYPTPKGVRIVRGEQKTSRECYAAALKGS
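
In the alignments above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. This is the usual BLAST-style convention and is assumed here, since the page does not define it. A rough sketch of counting identities over one alignment block follows; `percent_identity` is a hypothetical helper, and the value computed from a pasted block need not reproduce the % identity column reported for the whole hit:

```python
def percent_identity(query, midline):
    """Percentage of alignment columns whose midline character is a letter."""
    if not query:
        raise ValueError("empty alignment")
    # Trailing spaces on the midline are often lost in copied text; pad them back.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query line")
    # Letters mark identical residues; '+' and spaces are not counted.
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(query)

# First ten columns of the first GenBank alignment shown above.
print(percent_identity("MAKENGSRSG", "MA+ NGS SG"))
```

The denominator here is the number of alignment columns, gaps included, which is one common way of reporting identity; other tools divide by the shorter sequence length instead.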

TrEMBL top hits    e value    % identity    Alignment
A0A6J1D9M1 uncharacterized protein LOC111018172    1.0e-40    43.79
Query:  MAKENGSRSGSNENENGDRDLPNDRPGKELERPPLSKKLKTRADARKTKSTPVEVSSSEQMQSQLAKMRAHLKAMIQGLLNAGVSLPSSSVTRRKGKRRR
        MA+ NGS SGSNEN+NGDRDLPN R G++ + PPL +K KT     + +       + +++              +Q +L+A +S  S   T  K     
Subjt:  MAKENGSRSGSNENENGDRDLPNDRPGKELERPPLSKKLKTRADARKTKSTPVEVSSSEQMQSQLAKMRAHLKAMIQGLLNAGVSLPSSSVTRRKGKRRR

Query:  ARILRKSEMSKTRGPKRVDLEKEHLEESRQYELPGRVRPAEKGAEEPTSDVQQGVLHNSARTNEEVVQTTGPYSISSWKQLR----------KKPKEPIR
                          D  K+ ++    YE    +      ++          L   AR   +  +    YSISS KQLR          K  K    
Subjt:  ARILRKSEMSKTRGPKRVDLEKEHLEESRQYELPGRVRPAEKGAEEPTSDVQQGVLHNSARTNEEVVQTTGPYSISSWKQLR----------KKPKEPIR

Query:  DYIKCFLSEQIKVENYTDLLARSAFYNGLTHEKLSWSLAKKPKPTLKGCLDRATKFIEVEDIMMSKEDKYSMKGSPSKSEKKKDEKKNGNGGNG-GNTF-
        DYIK FLSEQIKVE  TDLLARSAF NG+THEKLSWSLAKKP+ TLKGCLDRATKFIE EDIMMSKEDKYS  GSPSK EK KD+KKNGNGGNG GN+F 
Subjt:  DYIKCFLSEQIKVENYTDLLARSAFYNGLTHEKLSWSLAKKPKPTLKGCLDRATKFIEVEDIMMSKEDKYSMKGSPSKSEKKKDEKKNGNGGNG-GNTF-

Query:  -RRPDY-----GRQDDQKSDKP
          R  Y     G++D +++  P
Subjt:  -RRPDY-----GRQDDQKSDKP

A0A6J1DHB3 uncharacterized protein LOC111020479    4.4e-31    30.21
Query:  LRKKPKEPIRDYIKCFLSEQIKVENYTDLLARSAFYNGLTHEKLSWSLAKKPKPTLKGCLDRATKFIEVEDIMMSK---EDKYSMKGSPSKSEKKKDEKK
        +R+K  E +R+Y+  F  EQ+KV + +D  A   F  GL  E L+  L ++   T    L +  K I+ ++++ +K    +K   +G   K + K D K 
Subjt:  LRKKPKEPIRDYIKCFLSEQIKVENYTDLLARSAFYNGLTHEKLSWSLAKKPKPTLKGCLDRATKFIEVEDIMMSK---EDKYSMKGSPSKSEKKKDEKK

Query:  NGNGGNGGNTFRRPDYGRQDDQKSDKPQRFSTFTQLRVPLSEILANIQDLNLD-LLAESRKMQKDLEQRDKSKYCRYHRDHGHYTSYYYDLRQQIEGLIR
           G +  ++  R DY R+ +   ++ + +  +T   +P+ EIL NI++  ++ LL    K++ D E+R+  KYCR+HRDHGH TS Y++L++QIE LI+
Subjt:  NGNGGNGGNTFRRPDYGRQDDQKSDKPQRFSTFTQLRVPLSEILANIQDLNLD-LLAESRKMQKDLEQRDKSKYCRYHRDHGHYTSYYYDLRQQIEGLIR

Query:  QG---------TSKSISISETKKKVLILP------------AEEGKGPRRWWCF--------SKYTNTVHLQGARL-----------------------G
         G          S S+   E +K++   P             E  +  RR  C         S   N   L+G  L                       G
Subjt:  QG---------TSKSISISETKKKVLILP------------AEEGKGPRRWWCF--------SKYTNTVHLQGARL-----------------------G

Query:  PSSAEEESN--------TASQI----------------LQG-----------------KFEFVIFDDKLTYNAIFDRPILHTLEAVSSTYHQMMKYPTPK
         +SA   S         T SQ+                L+G                   EFV+ D +  YNAIF RPI+H+  AV ST HQ++KY T  
Subjt:  PSSAEEESN--------TASQI----------------LQG-----------------KFEFVIFDDKLTYNAIFDRPILHTLEAVSSTYHQMMKYPTPK

Query:  GVRIVRGEQKTSRECYAAALKGSNTYA
        GV  VRGE KTSRECYA+  K S+  A
Subjt:  GVRIVRGEQKTSRECYAAALKGSNTYA

A0A6J1DJI4 uncharacterized protein LOC111021682    3.1e-32    32.24
Query:  FYNGLTHEKLSWSLAKKPKPTLKGCLDRATKFIEVEDIMMSK---EDKYSMKGSPSKSEKKKDEKKNGNGGNGGNTFRRPDYGRQDDQKSDKPQRFSTFT
        F  GL  E L+  L ++   T    L +A K I+ ++++ +K    +K   +  PS+ ++K D K    G +  ++  R DY R D   S + + +  +T
Subjt:  FYNGLTHEKLSWSLAKKPKPTLKGCLDRATKFIEVEDIMMSK---EDKYSMKGSPSKSEKKKDEKKNGNGGNGGNTFRRPDYGRQDDQKSDKPQRFSTFT

Query:  QLRVPLSEILANIQDLNLD-LLAESRKMQKDLEQRDKSKYCRYHRDHGHYTSYYYDLRQQIEGLIRQGTSK---------SISISETKKKVLILPAEEGK
           + +SEIL NI++  ++ LL    K+++DLE+R+K KYC +HRDHGH TS Y++L++QIE LI+ G  K         S+   E KK+    P  + +
Subjt:  QLRVPLSEILANIQDLNLD-LLAESRKMQKDLEQRDKSKYCRYHRDHGHYTSYYYDLRQQIEGLIRQGTSK---------SISISETKKKVLILPAEEGK

Query:  ---------GP----------------RRWWCF---SKYTNTV-----HLQGARLGPSSAEEESNTASQILQGKF------------------------E
                 GP                +R  C     K T ++      L+G  L  + A   +     +L  +                         E
Subjt:  ---------GP----------------RRWWCF---SKYTNTV-----HLQGARLGPSSAEEESNTASQILQGKF------------------------E

Query:  FVIFDDKLTYNAIFDRPILHTLEAVSSTYHQMMKYPTPKGVRIVRGEQKTSRECYAAALKGSNTYA
        FV+ D +  YNAIF RPI+H+   V ST HQ++KY TP GV  VRGEQKT RECYA+ALKGS+  A
Subjt:  FVIFDDKLTYNAIFDRPILHTLEAVSSTYHQMMKYPTPKGVRIVRGEQKTSRECYAAALKGSNTYA

A0A6J1DWS1 uncharacterized protein LOC111024791    1.2e-31    32.7
Query:  FLSEQIKVENYTDLLARSAFYNGLTHEKLSWSLAKKPKPTLKGCLDRATKFIEVEDIMMSKEDKYSMKGSPSKS--EKKKDEKKNGNGGNGGNTFRRPDY
        F  EQ+KV + +D  A+  F+ GL  E L+  L ++   T    L +A K I+ ++++  K  +   +    K+  EK++ E K+ + G+  ++  R +Y
Subjt:  FLSEQIKVENYTDLLARSAFYNGLTHEKLSWSLAKKPKPTLKGCLDRATKFIEVEDIMMSKEDKYSMKGSPSKS--EKKKDEKKNGNGGNGGNTFRRPDY

Query:  GRQDDQKSDKPQRFSTFTQLRVPLSEILANIQDLNLD-LLAESRKMQKDLEQRDKSKYCRYHRDHGHYTSYYYDLRQQIEGLIRQGTSKSISISETKKKV
           D+  S + + +  +T   + +SEIL NI++  ++ LL   +K++ DLE+R+K KYC +HRDHGH TS  ++L++QIE LI+ G  K   + +++  +
Subjt:  GRQDDQKSDKPQRFSTFTQLRVPLSEILANIQDLNLD-LLAESRKMQKDLEQRDKSKYCRYHRDHGHYTSYYYDLRQQIEGLIRQGTSKSISISETKKKV

Query:  LILPAE--EGKGPRRWWCFSKYTNTVHLQGARLGPSSAEEESNTASQILQGKFEFVIFDDKLTYNAIFDRPILHTLEAVSSTYHQMMKYPTPKGVRIVRG
        +    E    + P R        NT+   G   G  S  +    A +  +   EFV+ D K  YNAIF RPI+H+  AV ST HQ++KY TP GV  VR 
Subjt:  LILPAE--EGKGPRRWWCFSKYTNTVHLQGARLGPSSAEEESNTASQILQGKFEFVIFDDKLTYNAIFDRPILHTLEAVSSTYHQMMKYPTPKGVRIVRG

Query:  EQKTSRECYAAALKG
         +K          KG
Subjt:  EQKTSRECYAAALKG

A0A6J1DZB9 uncharacterized protein LOC111024904    7.0e-37    32.32
Query:  LRKKPKEPIRDYIKCFLSEQIKVENYTDLLARSAFYNGLTHEKLSWSLAKKPKPTLKGCLDRATKFIEVEDIMMSKEDKYSMKGSPSK--SEKKKDEKKN
        +R+K +E +R+Y+  F  EQ+KV + +D  A   F   L  E L+  L ++   T    L +A K I+ ++++ +K  +   +    K   EK+K + K+
Subjt:  LRKKPKEPIRDYIKCFLSEQIKVENYTDLLARSAFYNGLTHEKLSWSLAKKPKPTLKGCLDRATKFIEVEDIMMSKEDKYSMKGSPSK--SEKKKDEKKN

Query:  GNGGNGGNTFRRPDYGRQDDQKSDKPQRFSTFTQLRVPLSEILANIQDLNLD-LLAESRKMQKDLEQRDKSKYCRYHRDHGHYTSYYYDLRQQIEGLIRQ
         + G+  +   R +Y R +   S + + +  +T   +P+SEIL NI++  ++ LL    K++ DLE+R+K KYCR+HRDHGH T+  ++L++QIE LI+ 
Subjt:  GNGGNGGNTFRRPDYGRQDDQKSDKPQRFSTFTQLRVPLSEILANIQDLNLD-LLAESRKMQKDLEQRDKSKYCRYHRDHGHYTSYYYDLRQQIEGLIRQ

Query:  G---------TSKSISISETKKKVLILPAEEGK---------GP----------------RRWWCF--------SKYTNTVHLQGARLGPSSA-------
        G          S S+   E +K+    P  E +         GP                RR  C         S       L+G  L  + A       
Subjt:  G---------TSKSISISETKKKVLILPAEEGK---------GP----------------RRWWCF--------SKYTNTVHLQGARLGPSSA-------

Query:  ------------------EEESNTASQILQGKFEFVIFDDKLTYNAIFDRPILHTLEAVSSTYHQMMKYPTPKGVRIVRGEQKTSRECYAAALKGS
                                A+Q+ Q   EFV+ D +  YNAIF RPI+H+  AV ST HQ++KY TP  V +VRGEQKTSRECYA+ALKGS
Subjt:  ------------------EEESNTASQILQGKFEFVIFDDKLTYNAIFDRPILHTLEAVSSTYHQMMKYPTPKGVRIVRGEQKTSRECYAAALKGS

SwissProt top hits    e value    % identity    Alignment
No hits found
Arabidopsis top hits    e value    % identity    Alignment
No hits found

Sequences
CDS sequence
ATGGCTAAAGAAAATGGTTCCCGTTCTGGAAGCAATGAGAATGAGAACGGAGATCGTGACCTGCCCAATGACAGACCAGGAAAGGAGTTAGAACGCCCACCACTC
TCGAAGAAGCTGAAGACCCGAGCAGACGCTCGCAAGACCAAGTCCACACCCGTGGAGGTGTCGTCGTCTGAGCAGATGCAGTCACAGTTGGCTAAAATGCGAGCA
CACCTGAAGGCCATGATCCAGGGATTGCTTAATGCAGGGGTCTCGCTACCGAGCTCTAGCGTGACCCGAAGAAAGGGAAAGAGAAGAAGGGCGAGAATCCTCAGA
AAAAGCGAGATGAGCAAGACTCGAGGTCCAAAGCGTGTAGACCTTGAGAAGGAGCATCTAGAGGAATCTAGGCAATATGAGCTTCCAGGTAGAGTTCGACCAGCT
GAAAAGGGAGCTGAAGAGCCAACTTCCGATGTTCAACAAGGCGTTCTCCATAACTCTGCAAGGACCAATGAGGAAGTGGTTCAAACTACTGGCCCCTATTCAATC
TCGAGCTGGAAGCAGCTGCGCAAAAAGCCGAAGGAGCCAATCAGGGACTACATCAAGTGCTTCCTCTCGGAGCAAATTAAGGTCGAGAACTACACCGACCTGCTA
GCTCGGTCTGCCTTCTACAACGGCTTGACACATGAAAAGTTAAGTTGGTCACTAGCAAAGAAGCCCAAGCCGACCTTGAAAGGTTGCTTGGACCGGGCAACGAAA
TTTATTGAAGTTGAGGACATTATGATGTCCAAGGAAGATAAGTACTCGATGAAGGGTTCGCCTTCTAAGTCTGAGAAGAAGAAGGATGAGAAGAAGAATGGCAAT
GGTGGAAATGGCGGTAATACTTTTCGCCGACCTGACTATGGTCGCCAAGATGACCAAAAGAGCGACAAACCTCAAAGATTCAGCACGTTCACACAACTTCGGGTC
CCTTTGTCTGAAATATTGGCCAACATCCAAGATTTAAACTTGGATTTGCTGGCAGAGTCGAGAAAGATGCAAAAAGATCTCGAACAGCGCGACAAGTCAAAATAT
TGCAGGTACCACAGGGATCATGGCCATTACACCTCATACTATTATGACCTGAGGCAACAAATTGAAGGGCTGATTCGACAAGGTACTTCAAAAAGTATATCGATA
AGCGAGACCAAGAAGAAAGTTCTAATCCTTCCAGCAGAAGAGGGAAAGGGTCCTCGTAGATGGTGGTGCTTCAGTAAATATACTAACACTGTCCACCTACAAGGC
GCTCGGTTGGGGCCGAGCTCAGCTGAAGAAGAGTCTAACACCGCTAGTCAGATTCTCCAGGGAAAGTTTGAGTTCGTAATATTCGACGACAAGTTGACTTACAAC
GCCATTTTCGACCGTCCAATTCTACATACCTTAGAAGCTGTGTCGTCGACATATCATCAGATGATGAAGTATCCTACTCCCAAGGGGGTTAGGATCGTTCGGGGT
GAGCAGAAGACTTCACGTGAATGCTACGCTGCTGCCCTAAAGGGGTCGAACACCTACGCCGCCATCATGGGTTAG
mRNA sequence
ATGGCTAAAGAAAATGGTTCCCGTTCTGGAAGCAATGAGAATGAGAACGGAGATCGTGACCTGCCCAATGACAGACCAGGAAAGGAGTTAGAACGCCCACCACTC
TCGAAGAAGCTGAAGACCCGAGCAGACGCTCGCAAGACCAAGTCCACACCCGTGGAGGTGTCGTCGTCTGAGCAGATGCAGTCACAGTTGGCTAAAATGCGAGCA
CACCTGAAGGCCATGATCCAGGGATTGCTTAATGCAGGGGTCTCGCTACCGAGCTCTAGCGTGACCCGAAGAAAGGGAAAGAGAAGAAGGGCGAGAATCCTCAGA
AAAAGCGAGATGAGCAAGACTCGAGGTCCAAAGCGTGTAGACCTTGAGAAGGAGCATCTAGAGGAATCTAGGCAATATGAGCTTCCAGGTAGAGTTCGACCAGCT
GAAAAGGGAGCTGAAGAGCCAACTTCCGATGTTCAACAAGGCGTTCTCCATAACTCTGCAAGGACCAATGAGGAAGTGGTTCAAACTACTGGCCCCTATTCAATC
TCGAGCTGGAAGCAGCTGCGCAAAAAGCCGAAGGAGCCAATCAGGGACTACATCAAGTGCTTCCTCTCGGAGCAAATTAAGGTCGAGAACTACACCGACCTGCTA
GCTCGGTCTGCCTTCTACAACGGCTTGACACATGAAAAGTTAAGTTGGTCACTAGCAAAGAAGCCCAAGCCGACCTTGAAAGGTTGCTTGGACCGGGCAACGAAA
TTTATTGAAGTTGAGGACATTATGATGTCCAAGGAAGATAAGTACTCGATGAAGGGTTCGCCTTCTAAGTCTGAGAAGAAGAAGGATGAGAAGAAGAATGGCAAT
GGTGGAAATGGCGGTAATACTTTTCGCCGACCTGACTATGGTCGCCAAGATGACCAAAAGAGCGACAAACCTCAAAGATTCAGCACGTTCACACAACTTCGGGTC
CCTTTGTCTGAAATATTGGCCAACATCCAAGATTTAAACTTGGATTTGCTGGCAGAGTCGAGAAAGATGCAAAAAGATCTCGAACAGCGCGACAAGTCAAAATAT
TGCAGGTACCACAGGGATCATGGCCATTACACCTCATACTATTATGACCTGAGGCAACAAATTGAAGGGCTGATTCGACAAGGTACTTCAAAAAGTATATCGATA
AGCGAGACCAAGAAGAAAGTTCTAATCCTTCCAGCAGAAGAGGGAAAGGGTCCTCGTAGATGGTGGTGCTTCAGTAAATATACTAACACTGTCCACCTACAAGGC
GCTCGGTTGGGGCCGAGCTCAGCTGAAGAAGAGTCTAACACCGCTAGTCAGATTCTCCAGGGAAAGTTTGAGTTCGTAATATTCGACGACAAGTTGACTTACAAC
GCCATTTTCGACCGTCCAATTCTACATACCTTAGAAGCTGTGTCGTCGACATATCATCAGATGATGAAGTATCCTACTCCCAAGGGGGTTAGGATCGTTCGGGGT
GAGCAGAAGACTTCACGTGAATGCTACGCTGCTGCCCTAAAGGGGTCGAACACCTACGCCGCCATCATGGGTTAG
Protein sequence
MAKENGSRSGSNENENGDRDLPNDRPGKELERPPLSKKLKTRADARKTKSTPVEVSSSEQMQSQLAKMRAHLKAMIQGLLNAGVSLPSSSVTRRKGKRRRARILR
KSEMSKTRGPKRVDLEKEHLEESRQYELPGRVRPAEKGAEEPTSDVQQGVLHNSARTNEEVVQTTGPYSISSWKQLRKKPKEPIRDYIKCFLSEQIKVENYTDLL
ARSAFYNGLTHEKLSWSLAKKPKPTLKGCLDRATKFIEVEDIMMSKEDKYSMKGSPSKSEKKKDEKKNGNGGNGGNTFRRPDYGRQDDQKSDKPQRFSTFTQLRV
PLSEILANIQDLNLDLLAESRKMQKDLEQRDKSKYCRYHRDHGHYTSYYYDLRQQIEGLIRQGTSKSISISETKKKVLILPAEEGKGPRRWWCFSKYTNTVHLQG
ARLGPSSAEEESNTASQILQGKFEFVIFDDKLTYNAIFDRPILHTLEAVSSTYHQMMKYPTPKGVRIVRGEQKTSRECYAAALKGSNTYAAIMG
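
The protein sequence above should correspond to the CDS translated codon by codon. A minimal sketch for checking this, assuming the standard genetic code (NCBI translation table 1); `translate` is an illustrative helper rather than anything provided by CuGenDB, and the two short inputs are the first 21 and the last 12 nucleotides of the CDS listed above:

```python
# Standard genetic code (NCBI table 1), laid out in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Remove line breaks and spaces so a multi-line sequence can be pasted in.
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")  # X for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

print(translate("ATGGCTAAAGAAAATGGTTCC"))  # start of the CDS
print(translate("ATCATGGGTTAG"))           # end of the CDS, including the stop codon
```

Passing the full CDS block to `translate` and comparing the result with the protein block (after joining its lines) is a quick consistency check on a downloaded record.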