; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg011378 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg011378
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRibonuclease H
Genome locationscaffold8:10802178..10805872
RNA-Seq ExpressionSpg011378
SyntenySpg011378
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]2.9e-4745.42Show/hide
Query:  SFKELARAFATQFLGARDQRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDEGLLTSIGESQPRTYVEFMTRAQKFINAEEL
        SFK LARAF TQF+G R + +P   LLT+KQ   ESL+DY+ RF+ E LQVEG  D V+L A +SG++DE L  S G+  P T+ E ++RAQ++++A E 
Subjt:  SFKELARAFATQFLGARDQRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDEGLLTSIGESQPRTYVEFMTRAQKFINAEEL

Query:  LKSKKETMTTESTKYSVCEQDRDKDHNRKKRRTHDNDRGREDPMGKFKEYTPTSIQQEQILMEITNTDLLRHLGKMKTSPEGRDKSQFCLFHRDHGHTTK
          SK+E       K +  +++R  D  +  R    +   ++DP  KF++YTPT++  EQ+LMEI +  LL+   +MK S   R K ++CLFHRDHGH T+
Subjt:  LKSKKETMTTESTKYSVCEQDRDKDHNRKKRRTHDNDRGREDPMGKFKEYTPTSIQQEQILMEITNTDLLRHLGKMKTSPEGRDKSQFCLFHRDHGHTTK

Query:  NCIQLKDEIETLIRRGFLKEFVEDKNQKRPRPTRGGRGRGDDGPPLEIRTI
        +C  LK+E+E LIRRG+LKE+VE+     P+ T+   G  D  P  EIRTI
Subjt:  NCIQLKDEIETLIRRGFLKEFVEDKNQKRPRPTRGGRGRGDDGPPLEIRTI

XP_022159368.1 uncharacterized protein LOC111025785 [Momordica charantia]2.2e-4242.63Show/hide
Query:  SFKELARAFATQFLGARDQRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDEGLLTSIGESQPRTYVEFMTRAQKFINAEEL
        SFK LARAF TQF+G R + +P   LLT+KQ   ESL DY+ RF+ E LQ+E   D V+L A +SG++DE L  S G+  P T+ E ++RAQ++++A E 
Subjt:  SFKELARAFATQFLGARDQRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDEGLLTSIGESQPRTYVEFMTRAQKFINAEEL

Query:  LKSKKETMTTESTKYSVCEQDRDKDHNRKKRRTHDNDRGREDPMGKFKEYTPTSIQQEQILMEITNTDLLRHLGKMKTSPEGRDKSQFCLFHRDHGHTTK
          SK+E       K +  +++R  D  +  R    +   ++DP  KF++YTPT++  EQ+LMEI +  LL+   +MK     R K ++CLFHRDH H T+
Subjt:  LKSKKETMTTESTKYSVCEQDRDKDHNRKKRRTHDNDRGREDPMGKFKEYTPTSIQQEQILMEITNTDLLRHLGKMKTSPEGRDKSQFCLFHRDHGHTTK

Query:  NCIQLKDEIETLIRRGFLKEFVEDKNQKRPRPTRGGRGRGDDGPPLEIRTI
        +   LK+E+E LIRRG+L+E+VE+     P+ T+   G  +  P  EIRTI
Subjt:  NCIQLKDEIETLIRRGFLKEFVEDKNQKRPRPTRGGRGRGDDGPPLEIRTI

XP_024041095.1 uncharacterized protein LOC112098853 [Citrus clementina]4.4e-3537.01Show/hide
Query:  SFKELARAFATQFLGARDQRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDEGLLTSIGESQPRTYVEFMTRAQKFINAEEL
        SF +L+R F + F  AR + KP   LLTVKQ   E+L+DYI R++NE+ QV+GYDDG+AL+ ++ GL+   L  S+ +  P +Y E + RA+K+ NAEE 
Subjt:  SFKELARAFATQFLGARDQRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDEGLLTSIGESQPRTYVEFMTRAQKFINAEEL

Query:  LKSKKETMTTESTKYSVCEQDRDK------DHNRKKRRTHDNDRG-----REDPMGKFKEYTPTSIQQEQILMEITNTDLLRHLGKMKTSPEGRDKSQFC
         K++ +    ESTK    + D+ +      D + ++ +    +R      R     +F  +T  +  +EQILM++ N  L R    MKT+P  R+ +++C
Subjt:  LKSKKETMTTESTKYSVCEQDRDK------DHNRKKRRTHDNDRG-----REDPMGKFKEYTPTSIQQEQILMEITNTDLLRHLGKMKTSPEGRDKSQFC

Query:  LFHRDHGHTTKNCIQLKDEIETLIRRGFLKEFV---EDK-NQKRPRPTRGGRGR
         FH+DHGH T  C +LK++IE+L+R+G L+E+V   ED+   ++P  ++  +G+
Subjt:  LFHRDHGHTTKNCIQLKDEIETLIRRGFLKEFV---EDK-NQKRPRPTRGGRGR

XP_024042801.1 uncharacterized protein LOC112099618 [Citrus clementina]7.7e-4041.44Show/hide
Query:  SFKELARAFATQFLGARDQRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDEGLLTSIGESQPRTYVEFMTRAQKFINAEEL
        SF +L++    QF+GARDQ+ P    L VKQG  ESLKD ITRF+ EV++VE Y D VALT ++ GLQ      S+ ++  RT+ E ++RAQK+ N +EL
Subjt:  SFKELARAFATQFLGARDQRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDEGLLTSIGESQPRTYVEFMTRAQKFINAEEL

Query:  LKSKKETMTTESTKYSVCEQDRDKDHNRKKRRTHDNDRGREDPMGKFKEYTPTSIQQEQILMEITNTDLLRHLGKMKTSPEGRDKSQFCLFHRDHGHTTK
          +K+    ++S + +   + +++    K++ +    +G +    +F  YTP ++ +EQ+LM+I N DLLR    +K +P  RD+S++C +HRDH H  +
Subjt:  LKSKKETMTTESTKYSVCEQDRDKDHNRKKRRTHDNDRGREDPMGKFKEYTPTSIQQEQILMEITNTDLLRHLGKMKTSPEGRDKSQFCLFHRDHGHTTK

Query:  NCIQLKDEIETLIRRGFLKEFV
        +C  LK+EI++LI+RG+LKEFV
Subjt:  NCIQLKDEIETLIRRGFLKEFV

XP_030923026.1 uncharacterized protein LOC115949900 [Quercus lobata]3.7e-3434.23Show/hide
Query:  SFKELARAFATQFLGARDQRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDEGLLTSIGESQPRTYVEFMTRAQKFINAEEL
        +F++L+ +F   F+G +  ++   +LLT+KQG +E+L+ Y+ RF+  +L+V+  DD V LT   +GL+    + S+ ++ P+T  E + +AQK++NAEE 
Subjt:  SFKELARAFATQFLGARDQRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDEGLLTSIGESQPRTYVEFMTRAQKFINAEEL

Query:  LKSKKETMTTESTKYSVCEQDRDKDHNRKKRRTHDNDRGREDPMGKFKEYTPTSIQQEQILMEITNTDLLRHLGKMKTSPEGRDKSQFCLFHRDHGHTTK
        L +      ++  K    +  R +  +R  RR  + +R RED   +  ++TP ++  +QILMEI +   L+    + +SP   DK ++C FH+DHGH T+
Subjt:  LKSKKETMTTESTKYSVCEQDRDKDHNRKKRRTHDNDRGREDPMGKFKEYTPTSIQQEQILMEITNTDLLRHLGKMKTSPEGRDKSQFCLFHRDHGHTTK

Query:  NCIQLKDEIETLIRRGFLKEFVE--DKNQKRPRPTRGGRGRGDDGPPL-------EIRTI
        +C  LK++IE LIR+G L+++V+  D ++   +   GG  R +D P         EIRTI
Subjt:  NCIQLKDEIETLIRRGFLKEFVE--DKNQKRPRPTRGGRGRGDDGPPL-------EIRTI

TrEMBL top hitse value%identityAlignment
A0A2N9ESG4 Ribonuclease H1.2e-3538.43Show/hide
Query:  GSFKELARAFATQFLGARDQRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDEGLLTSIGESQPRTYVEFMTRAQKFINAEE
        GSF +L+R F   F+GA+   +P  +LL +KQ   E+L+ Y+TRF+ E L V+G DD V LTA ISGLQ    L S+ +  P T  E M  AQ+ +N EE
Subjt:  GSFKELARAFATQFLGARDQRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDEGLLTSIGESQPRTYVEFMTRAQKFINAEE

Query:  LLKSKKETMTTESTKYSVCEQDRDKD-HNRKKRRTHDNDRGREDPMG-----KFKEYTPTSIQQEQILMEITNTDLLRHLGKMKTSPEGRDKSQFCLFHR
         L ++ +T+     K      DR  + H+ + +   + +R  ED  G     +F  +TP +   + I M+I N   L+  GK+ T P+ R + ++C FHR
Subjt:  LLKSKKETMTTESTKYSVCEQDRDKD-HNRKKRRTHDNDRGREDPMG-----KFKEYTPTSIQQEQILMEITNTDLLRHLGKMKTSPEGRDKSQFCLFHR

Query:  DHGHTTKNCIQLKDEIETLIRRGFLKEFVEDKNQKRPRPTRGGRGRGDDGPPLEI
        DHGH T++C  LK +IE LI++G L+ FVE K Q+  RP    +G     PP+E+
Subjt:  DHGHTTKNCIQLKDEIETLIRRGFLKEFVEDKNQKRPRPTRGGRGRGDDGPPLEI

A0A2N9F7K6 Reverse transcriptase3.3e-3638.58Show/hide
Query:  GSFKELARAFATQFLGARDQRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDEGLLTSIGESQPRTYVEFMTRAQKFINAEE
        GSF +L+R F   F+G +   +P  +LL VKQ   E+L+ Y+TRF+ E L V+G DD V LTA ISGLQ    L S+ +  P T  E M  AQ+ +N EE
Subjt:  GSFKELARAFATQFLGARDQRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDEGLLTSIGESQPRTYVEFMTRAQKFINAEE

Query:  LLKSKKETMTTESTKYSVCEQDRDKDHNRKKRRTHDNDRGREDPMG-----KFKEYTPTSIQQEQILMEITNTDLLRHLGKMKTSPEGRDKSQFCLFHRD
         L ++ +T+  +  K+   ++  +    R K + H N R +ED  G     +F  +TP +   + I M+I N   L+  GK+ T P+ R + ++C FHRD
Subjt:  LLKSKKETMTTESTKYSVCEQDRDKDHNRKKRRTHDNDRGREDPMG-----KFKEYTPTSIQQEQILMEITNTDLLRHLGKMKTSPEGRDKSQFCLFHRD

Query:  HGHTTKNCIQLKDEIETLIRRGFLKEFVEDKNQKRPRPTRGGRGRGDDGPPLEI
        HGH T++C  LK +IE LI++G L+ FVE + Q+  RP    +G     PP+E+
Subjt:  HGHTTKNCIQLKDEIETLIRRGFLKEFVEDKNQKRPRPTRGGRGRGDDGPPLEI

A0A2N9G9P2 Reverse transcriptase1.1e-3638.58Show/hide
Query:  GSFKELARAFATQFLGARDQRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDEGLLTSIGESQPRTYVEFMTRAQKFINAEE
        GSF +L+R F   F+G +   +P  +LL VKQ   E+L+ Y+TRF+ E L V+G DD V LTA ISGLQ    L S+ +  P T  E M  AQ+++N EE
Subjt:  GSFKELARAFATQFLGARDQRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDEGLLTSIGESQPRTYVEFMTRAQKFINAEE

Query:  LLKSKKETMTTESTKYSVCEQDRDKDHNRKKRRTHDNDRGREDPMG-----KFKEYTPTSIQQEQILMEITNTDLLRHLGKMKTSPEGRDKSQFCLFHRD
         L ++ +T+  +  K+   ++  +    R K + H N R +ED  G     +F  +TP +   + I M+I N   L+  GK+ T P+ R + ++C FHRD
Subjt:  LLKSKKETMTTESTKYSVCEQDRDKDHNRKKRRTHDNDRGREDPMG-----KFKEYTPTSIQQEQILMEITNTDLLRHLGKMKTSPEGRDKSQFCLFHRD

Query:  HGHTTKNCIQLKDEIETLIRRGFLKEFVEDKNQKRPRPTRGGRGRGDDGPPLEI
        HGH T++C  LK +IE LI++G L+ FVE + Q+  RP    +G     PP+E+
Subjt:  HGHTTKNCIQLKDEIETLIRRGFLKEFVEDKNQKRPRPTRGGRGRGDDGPPLEI

A0A6J1DWY0 uncharacterized protein LOC1110252931.4e-4745.42Show/hide
Query:  SFKELARAFATQFLGARDQRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDEGLLTSIGESQPRTYVEFMTRAQKFINAEEL
        SFK LARAF TQF+G R + +P   LLT+KQ   ESL+DY+ RF+ E LQVEG  D V+L A +SG++DE L  S G+  P T+ E ++RAQ++++A E 
Subjt:  SFKELARAFATQFLGARDQRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDEGLLTSIGESQPRTYVEFMTRAQKFINAEEL

Query:  LKSKKETMTTESTKYSVCEQDRDKDHNRKKRRTHDNDRGREDPMGKFKEYTPTSIQQEQILMEITNTDLLRHLGKMKTSPEGRDKSQFCLFHRDHGHTTK
          SK+E       K +  +++R  D  +  R    +   ++DP  KF++YTPT++  EQ+LMEI +  LL+   +MK S   R K ++CLFHRDHGH T+
Subjt:  LKSKKETMTTESTKYSVCEQDRDKDHNRKKRRTHDNDRGREDPMGKFKEYTPTSIQQEQILMEITNTDLLRHLGKMKTSPEGRDKSQFCLFHRDHGHTTK

Query:  NCIQLKDEIETLIRRGFLKEFVEDKNQKRPRPTRGGRGRGDDGPPLEIRTI
        +C  LK+E+E LIRRG+LKE+VE+     P+ T+   G  D  P  EIRTI
Subjt:  NCIQLKDEIETLIRRGFLKEFVEDKNQKRPRPTRGGRGRGDDGPPLEIRTI

A0A6J1DYL6 uncharacterized protein LOC1110257851.0e-4242.63Show/hide
Query:  SFKELARAFATQFLGARDQRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDEGLLTSIGESQPRTYVEFMTRAQKFINAEEL
        SFK LARAF TQF+G R + +P   LLT+KQ   ESL DY+ RF+ E LQ+E   D V+L A +SG++DE L  S G+  P T+ E ++RAQ++++A E 
Subjt:  SFKELARAFATQFLGARDQRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDEGLLTSIGESQPRTYVEFMTRAQKFINAEEL

Query:  LKSKKETMTTESTKYSVCEQDRDKDHNRKKRRTHDNDRGREDPMGKFKEYTPTSIQQEQILMEITNTDLLRHLGKMKTSPEGRDKSQFCLFHRDHGHTTK
          SK+E       K +  +++R  D  +  R    +   ++DP  KF++YTPT++  EQ+LMEI +  LL+   +MK     R K ++CLFHRDH H T+
Subjt:  LKSKKETMTTESTKYSVCEQDRDKDHNRKKRRTHDNDRGREDPMGKFKEYTPTSIQQEQILMEITNTDLLRHLGKMKTSPEGRDKSQFCLFHRDHGHTTK

Query:  NCIQLKDEIETLIRRGFLKEFVEDKNQKRPRPTRGGRGRGDDGPPLEIRTI
        +   LK+E+E LIRRG+L+E+VE+     P+ T+   G  +  P  EIRTI
Subjt:  NCIQLKDEIETLIRRGFLKEFVEDKNQKRPRPTRGGRGRGDDGPPLEIRTI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein5.2e-1020Show/hide
Query:  VWSRQVPSKIKFFCWRLYHDYLPTMINLQKRGMEVSVTCLRCRRGEESSFHAISDCKWVRKQWKLSPF-APIFESQDLKCAADLLWWCRSNLSPASFDE-
        +W  Q   KI+ F W+   + LP    L  R +     C+RC   +E+  H +  C + R  W +S    P+          +L W          +++ 
Subjt:  VWSRQVPSKIKFFCWRLYHDYLPTMINLQKRGMEVSVTCLRCRRGEESSFHAISDCKWVRKQWKLSPF-APIFESQDLKCAADLLWWCRSNLSPASFDE-

Query:  ---FVGLSWWIWNSRNKVVH-----------SKGETGLA------------------------------------VDAATNRNSQSSGISAIIRDERGKM
              L W +W +RN++V             + E  L                                      DA  NR+++  GI  ++R+E+G++
Subjt:  ---FVGLSWWIWNSRNKVVH-----------SKGETGLA------------------------------------VDAATNRNSQSSGISAIIRDERGKM

Query:  MLTALKFLPNVTDVDSVEAMAIRDGLMVARDAGFSRLEIETDSARVAALICSEKVDLSEVGEIVREVRHLLKGFTFFSIRWCRREANQLAHAAAR
             + LP +  V   E  A+R  ++      ++ +  E+DS  +  ++ ++++    +   +++++ LL  FT     +  RE N LA   AR
Subjt:  MLTALKFLPNVTDVDSVEAMAIRDGLMVARDAGFSRLEIETDSARVAALICSEKVDLSEVGEIVREVRHLLKGFTFFSIRWCRREANQLAHAAAR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGCTTAGGATGGTTCTGTTGTTAATGCTCTGTTTGGGGAGCTTGATTCGAAGGTTATTCTTGGGATTCCGAGGCCTAAGCAGACTTAAGGAGACTGGTTCTTGCTC
GAATTCAGAGTTGATTAAGGAGTGGTGGAAGTTTGTGTGGAGTAGGCAAGTTCCATCCAAAATCAAATTCTTTTGTTGGAGGCTCTATCATGATTATCTTCCTACTATGA
TCAATTTACAAAAAAGGGGGATGGAGGTTTCAGTGACCTGTCTGAGATGTCGAAGGGGCGAGGAATCTTCCTTTCATGCTATTTCTGACTGTAAATGGGTGAGGAAGCAG
TGGAAGCTTTCACCGTTTGCTCCTATTTTCGAGTCTCAGGACTTAAAGTGTGCGGCTGATTTATTGTGGTGGTGTAGAAGCAACTTATCCCCGGCCAGCTTCGATGAGTT
CGTGGGGCTGAGTTGGTGGATCTGGAATAGTAGGAATAAGGTGGTTCACTCCAAGGGTGAGACAGGCCTAGCAGTAGATGCGGCAACCAACAGAAATTCACAGTCAAGCG
GTATAAGTGCGATAATTAGAGATGAGAGGGGGAAGATGATGCTTACGGCTTTGAAGTTTCTCCCCAATGTGACCGATGTTGATTCTGTCGAAGCCATGGCTATCAGAGAT
GGATTGATGGTTGCAAGGGACGCTGGATTTAGCAGGTTGGAGATCGAAACCGATTCAGCTCGTGTGGCAGCACTGATTTGTTCTGAGAAGGTGGATCTGTCGGAAGTTGG
AGAGATTGTTCGTGAAGTCCGCCACCTCTTGAAGGGCTTTACCTTCTTCTCGATCAGGTGGTGTCGGAGGGAGGCGAACCAATTGGCTCATGCAGCGGCAAGGCTGGCGG
TGGAGCAGGCAGTCGACGGTATCTGGATCGAGGAAGTGCCGTTACAGCTGGTAATTTTGGACCACTCCGAAGCACTAGGAGCTGATGGAGACAGCCGGGCAAAGATAGGG
CGAGGAAACCGACCCAGAAGAAGACCGGACCAAAGGGTCGGGCCAAATGGTCCGCCTCTTGCCAAGGCCGAGGCTGAGCATAATGGTCGGCCTCTTGCCAAGGCCGAGGC
CGACCATTCAGCCCGTTTGCGCGGCCGAGCTCCTCTTCCTCCGTTCGATCCCTGCAGCCCCTGGACGCCCCGGTTCTACCTGCATCGGAGTCGGTGTGGCAAGCACCACA
CCGGTGTGCAGTGTTTGCTTGTCTCGCAGGTCACGTCTTCCCCCTCTCAAACAAATTTACCGTTGGTGGCACGTGAAGGACGGATGGAGCGCAACAGTGAAGCAGCCGTA
GGAGAGGTATGCCACCAGGCTCGACTTCAATCCCAAGAAGTCGAAATAGCAGCACTCAAAGGAAGGATGGATGATATAAGACAGAATCTTACGGAGATTTTGAGTTTATT
GAAGAAACCCGAGTACTCGGGGAGCAAGGATGACCAACTGTGCAGGGACCCTAAAAAAGGAAAAGGAATGGCGGATGAGGAGCCCGAGCCTAGTCGCAAGAAAGTTCGCA
GAAGCTCGCCATCGAGACCAAAGCAAGGTACACATGTTAAAATCGATGGCAGGGAAAAATCCGAGGCACGGGAAAAGTTCGAGGCCGAGCATAGTCGAGGAGGGCGCGAG
CAAGAGCTGTACAAATGGTTAAAGGAGGAGGATAGTCCTTACAACTCATATAAGAGGACAGGGTCATTCAAAGAGTTGGCACGAGCCTTCGCCACACAGTTTTTGGGGGC
TCGAGATCAACGAAAGCCACAGATCAACCTGCTGACAGTTAAACAAGGACCGAGGGAGAGCCTGAAGGACTATATCACCAGATTCAGTAACGAAGTCCTGCAAGTAGAGG
GCTACGATGATGGAGTTGCACTAACTGCTGTGATTTCAGGATTACAAGATGAGGGGCTGCTTACCTCAATTGGAGAAAGTCAACCACGCACATACGTAGAATTCATGACT
AGGGCACAAAAATTCATAAATGCTGAAGAGTTGCTCAAGTCAAAAAAGGAAACAATGACAACAGAATCCACAAAATATTCGGTATGTGAACAAGACAGAGACAAAGACCA
CAACCGCAAAAAACGGAGAACACATGACAATGATCGAGGGCGAGAAGACCCCATGGGTAAATTCAAAGAATACACCCCCACTTCCATTCAGCAGGAACAAATATTGATGG
AGATTACAAATACGGATCTTCTGAGACATCTTGGAAAAATGAAAACAAGTCCAGAAGGAAGAGATAAAAGCCAATTTTGTCTTTTCCACAGGGACCACGGACACACTACC
AAAAATTGCATCCAGCTTAAAGATGAGATTGAAACGCTGATTCGTCGTGGATTCCTCAAAGAGTTTGTTGAAGACAAGAACCAGAAAAGGCCGAGGCCGACCAGGGGTGG
CCGAGGCCGAGGAGATGATGGACCTCCTTTAGAGATTAGAACCATCTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGCTTAGGATGGTTCTGTTGTTAATGCTCTGTTTGGGGAGCTTGATTCGAAGGTTATTCTTGGGATTCCGAGGCCTAAGCAGACTTAAGGAGACTGGTTCTTGCTC
GAATTCAGAGTTGATTAAGGAGTGGTGGAAGTTTGTGTGGAGTAGGCAAGTTCCATCCAAAATCAAATTCTTTTGTTGGAGGCTCTATCATGATTATCTTCCTACTATGA
TCAATTTACAAAAAAGGGGGATGGAGGTTTCAGTGACCTGTCTGAGATGTCGAAGGGGCGAGGAATCTTCCTTTCATGCTATTTCTGACTGTAAATGGGTGAGGAAGCAG
TGGAAGCTTTCACCGTTTGCTCCTATTTTCGAGTCTCAGGACTTAAAGTGTGCGGCTGATTTATTGTGGTGGTGTAGAAGCAACTTATCCCCGGCCAGCTTCGATGAGTT
CGTGGGGCTGAGTTGGTGGATCTGGAATAGTAGGAATAAGGTGGTTCACTCCAAGGGTGAGACAGGCCTAGCAGTAGATGCGGCAACCAACAGAAATTCACAGTCAAGCG
GTATAAGTGCGATAATTAGAGATGAGAGGGGGAAGATGATGCTTACGGCTTTGAAGTTTCTCCCCAATGTGACCGATGTTGATTCTGTCGAAGCCATGGCTATCAGAGAT
GGATTGATGGTTGCAAGGGACGCTGGATTTAGCAGGTTGGAGATCGAAACCGATTCAGCTCGTGTGGCAGCACTGATTTGTTCTGAGAAGGTGGATCTGTCGGAAGTTGG
AGAGATTGTTCGTGAAGTCCGCCACCTCTTGAAGGGCTTTACCTTCTTCTCGATCAGGTGGTGTCGGAGGGAGGCGAACCAATTGGCTCATGCAGCGGCAAGGCTGGCGG
TGGAGCAGGCAGTCGACGGTATCTGGATCGAGGAAGTGCCGTTACAGCTGGTAATTTTGGACCACTCCGAAGCACTAGGAGCTGATGGAGACAGCCGGGCAAAGATAGGG
CGAGGAAACCGACCCAGAAGAAGACCGGACCAAAGGGTCGGGCCAAATGGTCCGCCTCTTGCCAAGGCCGAGGCTGAGCATAATGGTCGGCCTCTTGCCAAGGCCGAGGC
CGACCATTCAGCCCGTTTGCGCGGCCGAGCTCCTCTTCCTCCGTTCGATCCCTGCAGCCCCTGGACGCCCCGGTTCTACCTGCATCGGAGTCGGTGTGGCAAGCACCACA
CCGGTGTGCAGTGTTTGCTTGTCTCGCAGGTCACGTCTTCCCCCTCTCAAACAAATTTACCGTTGGTGGCACGTGAAGGACGGATGGAGCGCAACAGTGAAGCAGCCGTA
GGAGAGGTATGCCACCAGGCTCGACTTCAATCCCAAGAAGTCGAAATAGCAGCACTCAAAGGAAGGATGGATGATATAAGACAGAATCTTACGGAGATTTTGAGTTTATT
GAAGAAACCCGAGTACTCGGGGAGCAAGGATGACCAACTGTGCAGGGACCCTAAAAAAGGAAAAGGAATGGCGGATGAGGAGCCCGAGCCTAGTCGCAAGAAAGTTCGCA
GAAGCTCGCCATCGAGACCAAAGCAAGGTACACATGTTAAAATCGATGGCAGGGAAAAATCCGAGGCACGGGAAAAGTTCGAGGCCGAGCATAGTCGAGGAGGGCGCGAG
CAAGAGCTGTACAAATGGTTAAAGGAGGAGGATAGTCCTTACAACTCATATAAGAGGACAGGGTCATTCAAAGAGTTGGCACGAGCCTTCGCCACACAGTTTTTGGGGGC
TCGAGATCAACGAAAGCCACAGATCAACCTGCTGACAGTTAAACAAGGACCGAGGGAGAGCCTGAAGGACTATATCACCAGATTCAGTAACGAAGTCCTGCAAGTAGAGG
GCTACGATGATGGAGTTGCACTAACTGCTGTGATTTCAGGATTACAAGATGAGGGGCTGCTTACCTCAATTGGAGAAAGTCAACCACGCACATACGTAGAATTCATGACT
AGGGCACAAAAATTCATAAATGCTGAAGAGTTGCTCAAGTCAAAAAAGGAAACAATGACAACAGAATCCACAAAATATTCGGTATGTGAACAAGACAGAGACAAAGACCA
CAACCGCAAAAAACGGAGAACACATGACAATGATCGAGGGCGAGAAGACCCCATGGGTAAATTCAAAGAATACACCCCCACTTCCATTCAGCAGGAACAAATATTGATGG
AGATTACAAATACGGATCTTCTGAGACATCTTGGAAAAATGAAAACAAGTCCAGAAGGAAGAGATAAAAGCCAATTTTGTCTTTTCCACAGGGACCACGGACACACTACC
AAAAATTGCATCCAGCTTAAAGATGAGATTGAAACGCTGATTCGTCGTGGATTCCTCAAAGAGTTTGTTGAAGACAAGAACCAGAAAAGGCCGAGGCCGACCAGGGGTGG
CCGAGGCCGAGGAGATGATGGACCTCCTTTAGAGATTAGAACCATCTTTTGA
Protein sequenceShow/hide protein sequence
MGLRMVLLLMLCLGSLIRRLFLGFRGLSRLKETGSCSNSELIKEWWKFVWSRQVPSKIKFFCWRLYHDYLPTMINLQKRGMEVSVTCLRCRRGEESSFHAISDCKWVRKQ
WKLSPFAPIFESQDLKCAADLLWWCRSNLSPASFDEFVGLSWWIWNSRNKVVHSKGETGLAVDAATNRNSQSSGISAIIRDERGKMMLTALKFLPNVTDVDSVEAMAIRD
GLMVARDAGFSRLEIETDSARVAALICSEKVDLSEVGEIVREVRHLLKGFTFFSIRWCRREANQLAHAAARLAVEQAVDGIWIEEVPLQLVILDHSEALGADGDSRAKIG
RGNRPRRRPDQRVGPNGPPLAKAEAEHNGRPLAKAEADHSARLRGRAPLPPFDPCSPWTPRFYLHRSRCGKHHTGVQCLLVSQVTSSPSQTNLPLVAREGRMERNSEAAV
GEVCHQARLQSQEVEIAALKGRMDDIRQNLTEILSLLKKPEYSGSKDDQLCRDPKKGKGMADEEPEPSRKKVRRSSPSRPKQGTHVKIDGREKSEAREKFEAEHSRGGRE
QELYKWLKEEDSPYNSYKRTGSFKELARAFATQFLGARDQRKPQINLLTVKQGPRESLKDYITRFSNEVLQVEGYDDGVALTAVISGLQDEGLLTSIGESQPRTYVEFMT
RAQKFINAEELLKSKKETMTTESTKYSVCEQDRDKDHNRKKRRTHDNDRGREDPMGKFKEYTPTSIQQEQILMEITNTDLLRHLGKMKTSPEGRDKSQFCLFHRDHGHTT
KNCIQLKDEIETLIRRGFLKEFVEDKNQKRPRPTRGGRGRGDDGPPLEIRTIF