; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg022542 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg022542
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRibonuclease H
Genome locationscaffold2:13530786..13533639
RNA-Seq ExpressionSpg022542
SyntenySpg022542
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144467.1 uncharacterized protein LOC111014147 [Momordica charantia]7.4e-5144.44Show/hide
Query:  IQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRGKRPAQTEPDKGGNNPPLEIRTILGGPTEGESSRKRK
        I+D  LLK PER+++   +R++++YC+FH  HGH T++C  L++E+E LIR GYLKE+V     + P  T+  +   +P  EIRTI+GGP E ES RKRK
Subjt:  IQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRGKRPAQTEPDKGGNNPPLEIRTILGGPTEGESSRKRK

Query:  AAIREAH----------------MEPGGQ----------------------------GFGGERVSPRGSIELPVTFGEGLQTVTRMISFLVVDCVPAYNA
        A +REA                 +  GG                             GFGGERV P G IE PVTFG G ++VT+M+  LVV+   +YNA
Subjt:  AAIREAH----------------MEPGGQ----------------------------GFGGERVSPRGSIELPVTFGEGLQTVTRMISFLVVDCVPAYNA

Query:  ILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECYFMAFKNIDK
        ILGRPT+H L+A+ STYHQ +KFPT  GVGE++GEQR SRECY+ + ++ D+
Subjt:  ILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECYFMAFKNIDK

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]7.4e-5136.25Show/hide
Query:  KFKVEGYDDGVALTAVISSLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSK--QEERESRGVSLSNRHREDRAKGRQTDDRSRGRHEQSSANGR
        + KV    D  A+   ++SL DE L   +GE  P T+VE + +A++ I  +ELL++K  + E++     LS        + R+ D +SR +   SSA+  
Subjt:  KFKVEGYDDGVALTAVISSLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSK--QEERESRGVSLSNRHREDRAKGRQTDDRSRGRHEQSSANGR

Query:  GRAKAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDT---NLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDR
         R + + L         ++RYT  T  + ++L  I+++    LLKRPE+LR D ++RN+ KYC FH DHGH T  C +L+ +IE LI++GY K+FVG  R
Subjt:  GRAKAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDT---NLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDR

Query:  GKRPAQTEPDKGGNNPPLE------IRTILGGPTEGESSRKRKAAIREAHMEP--------------GGQGFGGERVSPR--------------------
             + E  K    PP        I TI GGP  G+S  KRK   REA  E               G     G  +                       
Subjt:  GKRPAQTEPDKGGNNPPLE------IRTILGGPTEGESSRKRKAAIREAHMEP--------------GGQGFGGERVSPR--------------------

Query:  -GSIELPVTFGEGLQTVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECYFMAFKNIDKKAKAAPTLGDGRGR
         G I+LPVT G+    VT+M  F+V+D   AYNAI GRP +H  +AV ST HQVLK+ T   VG VRGEQ+TSRECY  A K     A  A      RG+
Subjt:  -GSIELPVTFGEGLQTVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECYFMAFKNIDKKAKAAPTLGDGRGR

Query:  ASEG-SDHPME
          E  +D P E
Subjt:  ASEG-SDHPME

XP_024041095.1 uncharacterized protein LOC112098853 [Citrus clementina]4.8e-5031.93Show/hide
Query:  KVEGYDDGVALTAVISSLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGVSLSNRHREDRAKGRQT--DDRSRGRHEQSSANGRGR
        +V+GYDDG+AL+ ++  L+  +L  S+ +  P +Y E + RA++Y +AEE  K++ +E   +G S   + ++D  + R+    D+S  R++        R
Subjt:  KVEGYDDGVALTAVISSLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGVSLSNRHREDRAKGRQT--DDRSRGRHEQSSANGRGR

Query:  AKAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHD----RG
        ++ +D R R    ++F  +T L    EQ+L  +++  L + P  ++++P RRN NKYC FH DHGH T EC +L+++IE+L+R+G L+E+V +     + 
Subjt:  AKAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHD----RG

Query:  KRPAQTEPDKG-----GNNPPLEIRTILGGPTEGESSRKRKAAIREAHMEPGG-----------------------------------------------
        ++P  ++  KG      +    ++  I GGP  G+S + RK   R+A  EP G                                               
Subjt:  KRPAQTEPDKG-----GNNPPLEIRTILGGPTEGESSRKRKAAIREAHMEPGG-----------------------------------------------

Query:  -------------------------------------QGFGGERVSPRGSIELPVTFGEGLQTVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQ
                                              GF G  V P G IEL V+FG+    VT M++F+VVD   +YNA+LGRPTL+ LKA  S YH 
Subjt:  -------------------------------------QGFGGERVSPRGSIELPVTFGEGLQTVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQ

Query:  VLKFPTEEGVGEVRGEQRTSRECYFMAFK
         LKFPTE GVG VRGEQ+ +RECY +AF+
Subjt:  VLKFPTEEGVGEVRGEQRTSRECYFMAFK

XP_030958631.1 uncharacterized protein LOC115980538 [Quercus lobata]1.8e-4932.54Show/hide
Query:  VEGYDDGVALTAVISSLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGVSLSNRHRED--RAKGRQTDDRSRGRHEQSSANGRGRA
        V+  DD + L A  + +  +  ++ + E +P+T  E +  AQ +++AE+ + +K+ +R  R  +   RH E   R K  +T+DR                
Subjt:  VEGYDDGVALTAVISSLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGVSLSNRHRED--RAKGRQTDDRSRGRHEQSSANGRGRA

Query:  KAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRGKRPAQ
        K +D R +  P  +   YTPL A L QVL  I+D   LK PE+++ DP++RN+NKYC FH DHGH T EC  L+ +IE LIR+G LK FVG DR     +
Subjt:  KAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRGKRPAQ

Query:  TEPDKGGNNPPLEIRTILGGPTEGESSRKRKAAI----------------------------------------------------REAHMEPGGQ----
         + ++    P  EIR I+GG   G+SS+ +K  +                                                    R   ++ G      
Subjt:  TEPDKGGNNPPLEIRTILGGPTEGESSRKRKAAI----------------------------------------------------REAHMEPGGQ----

Query:  -----------------------GFGGERVSPRGSIELPVTFGEGLQTVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRG
                               GFGG +V P G+I LPV  G   Q +T+ ++FLVVDC  +YNAI+GRPTL+  KA+ STYH  +KFPTE G+G+ +G
Subjt:  -----------------------GFGGERVSPRGSIELPVTFGEGLQTVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRG

Query:  EQRTSRECYFMAFKNIDKKAK
        +Q  +RECY +A   +D++ +
Subjt:  EQRTSRECYFMAFKNIDKKAK

XP_030963320.1 uncharacterized protein LOC115984434 [Quercus lobata]5.1e-5236.86Show/hide
Query:  VEGYDDGVALTAVISSLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGVSLSNRHRED--RAKGRQTDDRSRGRHEQSSANGRGRA
        V+  DD + L A  + +  +  ++ + E +P+T  E +  AQ +++AE+ + +K+ +R  +  +   RH E   R K  +T+D+                
Subjt:  VEGYDDGVALTAVISSLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGVSLSNRHRED--RAKGRQTDDRSRGRHEQSSANGRGRA

Query:  KAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRGKRPAQ
          KD   +A P A+  +YTPL    +QVL  I+D   LK  E+++ DP++ NRNKYC FH DHGH T EC  L+ +IE LIR+G L+ F+G D      +
Subjt:  KAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRGKRPAQ

Query:  TEPDKGGNNPPLEIRTILGGPTEGESSRK-----------------RKAAIREAHMEPGGQ---GFGGERVSPRGSIELPVTFGEGLQTVTRMISFLVVD
         + ++    P  EIR I+GG +  +SSR+                 ++  +    + P      GFGG +V P G++ LPV  G   Q VT+ +SFLVVD
Subjt:  TEPDKGGNNPPLEIRTILGGPTEGESSRK-----------------RKAAIREAHMEPGGQ---GFGGERVSPRGSIELPVTFGEGLQTVTRMISFLVVD

Query:  CVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECY
        C  +YNAI+GRPTL+  K V STYH  +KFPTE GVG+V+G+Q  +RECY
Subjt:  CVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECY

TrEMBL top hitse value%identityAlignment
A0A2N9ECS2 RNase H domain-containing protein4.4e-4937.4Show/hide
Query:  VEGYDDGVALTAVISSLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGVSLSNRHREDRAKGRQTD--DRSRGRHEQSSANGRGRA
        V+G DD V LTA IS LQ    L S+ +  P +  E M  AQRY++ E+ L+++                 D  K R+ D  DR     E      R   
Subjt:  VEGYDDGVALTAVISSLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGVSLSNRHREDRAKGRQTD--DRSRGRHEQSSANGRGRA

Query:  KAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHD-RGKRPA
        K ++ R R     +F+ +TPL A ++++   I+D   L+ P +L ++PDRR ++KYC FH DHGH T +C  L+ +IE LI++G L+ FV  D R  RP 
Subjt:  KAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHD-RGKRPA

Query:  QTEPDK----GGNNPPL-EIRTILGGPTEGESSR-KRKAAIREAH------------------------MEPGGQGFGGERVSPRGSIELPVTFGEGLQT
        Q  P +        PP+ EI  I GG   G +SR  RKA  R+ H                        +E    GF G  V P G I L +  G   + 
Subjt:  QTEPDK----GGNNPPL-EIRTILGGPTEGESSR-KRKAAIREAH------------------------MEPGGQGFGGERVSPRGSIELPVTFGEGLQT

Query:  VTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECY
         T+ + FLVVDC  AYN I+GRPTL+ L+AV STYH +++FPTE G+GE++G+Q  +RECY
Subjt:  VTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECY

A0A2N9F5L1 RNase H domain-containing protein6.8e-5033.25Show/hide
Query:  HRDSQRRIENEDIEGLIGQMEPPFTDEIMGGEVPHKFKVEGYDDGVALTAVISSLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRG
        H  S +++E E +   + +    F +E M        K++   + V +TA ++ L+    L  + +  P T  E M  A ++++AE+ L++  +    R 
Subjt:  HRDSQRRIENEDIEGLIGQMEPPFTDEIMGGEVPHKFKVEGYDDGVALTAVISSLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRG

Query:  VSLSNRHREDRAKGRQTDDRSRGRHEQSSANGRGRAKAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHG
                      ++T+DR   + E +        +  + +    P  KF  +TPL   ++++L  IQD   L+ P ++RSDP+ R +N YC FH DHG
Subjt:  VSLSNRHREDRAKGRQTDDRSRGRHEQSSANGRGRAKAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHG

Query:  HTTRECIQLRDEIETLIREGYLKEFVGHDRGKRPA-------QTEPDKGGNNPPLEIRTILGGPTEGESSR-KRKAAIREAH------------------
        H T++C+ L +++ETLIR+G L+++V      RPA       Q EP++    P  EIRTI+GGP  G +SR  R+A  R+AH                  
Subjt:  HTTRECIQLRDEIETLIREGYLKEFVGHDRGKRPA-------QTEPDKGGNNPPLEIRTILGGPTEGESSR-KRKAAIREAH------------------

Query:  ---MEPGGQG--------------FGGERVSPRGSIELPVTFGEGLQTVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRG
            E   +G                G++V P G + LP+T G   +TV++ + FLVV+C  AYNAI+GRPTL+ L+AV STYH +LKFPTE G+GEVRG
Subjt:  ---MEPGGQG--------------FGGERVSPRGSIELPVTFGEGLQTVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRG

Query:  EQRTSRECYFMA
        +Q  +RECY ++
Subjt:  EQRTSRECYFMA

A0A2N9GNB7 Ribonuclease H2.2e-4832.86Show/hide
Query:  HRDSQRRIENEDIEGLIGQMEPPFTDEIMGGEVPHKFKVEGYDDGVALTAVISSLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRG
        H  S +++E E +   + +    F +E M        K++   + V +TA ++ L+    L  + +  P T  E M  A ++++AE+ L++  +      
Subjt:  HRDSQRRIENEDIEGLIGQMEPPFTDEIMGGEVPHKFKVEGYDDGVALTAVISSLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRG

Query:  VSLSNRHREDRAKGRQTDDRSRGRHEQSSANGRGRAKAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHG
               R   A+ R+ +   +   + S    R R  A        P  KF  +TPL   ++++L  IQD   L+ P ++RSDP+ R +N YC FH DHG
Subjt:  VSLSNRHREDRAKGRQTDDRSRGRHEQSSANGRGRAKAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHG

Query:  HTTRECIQLRDEIETLIREGYLKEFVGHDRGKRPA-------QTEPDKGGNNPPLEIRTILGGPTEGESSR-KRKAAIREAH------------------
        H T +C+ L++++ETLIR+G L+++V      RP        Q EP++ G  P  EIRTI+GGP  G +SR  RKA  R+ H                  
Subjt:  HTTRECIQLRDEIETLIREGYLKEFVGHDRGKRPA-------QTEPDKGGNNPPLEIRTILGGPTEGESSR-KRKAAIREAH------------------

Query:  ----------------------------------MEPGGQGFGGERVSPRGSIELPVTFGEGLQTVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTY
                                          M+    GF G++V P G + LP+T G   +TV++ + FLVV+C  AYNAI+GRPTL+ L+AV STY
Subjt:  ----------------------------------MEPGGQGFGGERVSPRGSIELPVTFGEGLQTVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTY

Query:  HQVLKFPTEEGVGEVRGEQRTSRECY
        H +LKFPTE G+GEVRG+Q  +RECY
Subjt:  HQVLKFPTEEGVGEVRGEQRTSRECY

A0A6J1CTS4 uncharacterized protein LOC1110141473.6e-5144.44Show/hide
Query:  IQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRGKRPAQTEPDKGGNNPPLEIRTILGGPTEGESSRKRK
        I+D  LLK PER+++   +R++++YC+FH  HGH T++C  L++E+E LIR GYLKE+V     + P  T+  +   +P  EIRTI+GGP E ES RKRK
Subjt:  IQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDRGKRPAQTEPDKGGNNPPLEIRTILGGPTEGESSRKRK

Query:  AAIREAH----------------MEPGGQ----------------------------GFGGERVSPRGSIELPVTFGEGLQTVTRMISFLVVDCVPAYNA
        A +REA                 +  GG                             GFGGERV P G IE PVTFG G ++VT+M+  LVV+   +YNA
Subjt:  AAIREAH----------------MEPGGQ----------------------------GFGGERVSPRGSIELPVTFGEGLQTVTRMISFLVVDCVPAYNA

Query:  ILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECYFMAFKNIDK
        ILGRPT+H L+A+ STYHQ +KFPT  GVGE++GEQR SRECY+ + ++ D+
Subjt:  ILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECYFMAFKNIDK

A0A6J1DZB9 uncharacterized protein LOC1110249043.6e-5136.25Show/hide
Query:  KFKVEGYDDGVALTAVISSLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSK--QEERESRGVSLSNRHREDRAKGRQTDDRSRGRHEQSSANGR
        + KV    D  A+   ++SL DE L   +GE  P T+VE + +A++ I  +ELL++K  + E++     LS        + R+ D +SR +   SSA+  
Subjt:  KFKVEGYDDGVALTAVISSLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSK--QEERESRGVSLSNRHREDRAKGRQTDDRSRGRHEQSSANGR

Query:  GRAKAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDT---NLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDR
         R + + L         ++RYT  T  + ++L  I+++    LLKRPE+LR D ++RN+ KYC FH DHGH T  C +L+ +IE LI++GY K+FVG  R
Subjt:  GRAKAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDT---NLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDR

Query:  GKRPAQTEPDKGGNNPPLE------IRTILGGPTEGESSRKRKAAIREAHMEP--------------GGQGFGGERVSPR--------------------
             + E  K    PP        I TI GGP  G+S  KRK   REA  E               G     G  +                       
Subjt:  GKRPAQTEPDKGGNNPPLE------IRTILGGPTEGESSRKRKAAIREAHMEP--------------GGQGFGGERVSPR--------------------

Query:  -GSIELPVTFGEGLQTVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECYFMAFKNIDKKAKAAPTLGDGRGR
         G I+LPVT G+    VT+M  F+V+D   AYNAI GRP +H  +AV ST HQVLK+ T   VG VRGEQ+TSRECY  A K     A  A      RG+
Subjt:  -GSIELPVTFGEGLQTVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECYFMAFKNIDKKAKAAPTLGDGRGR

Query:  ASEG-SDHPME
          E  +D P E
Subjt:  ASEG-SDHPME

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGACGGTCTTTGGGACGGGTCGGGATGCAAAGGCGGCATCGTGCGTCTTTAGAAACTCGACGGCGAGTTCGGGGGAAGTGACGGGTGTTATGGGCAAAATTAGTCC
TTGGTATTTTATGCAAGATAGGCCCAAAATTAGTCAGGTCCGAAGACGACGGAAGCCAAGCCGACGGAACAGAGGGTCGGAACTTCTACCGGCCCATGGGCATAACCCTC
GGCCTCGGCCCAAGGGAGAGGCCGAGGAAGAGGTCGGCCTCGGCCCATTGCCGAGGCCGACCAGGGCCAAAGGCCCGGAGATGAACTGGAGTCCGGCGATAGTGAAAACC
CTATGGGGAGTCTACAAAAAGGAGGACCACGCACACATTCAGGAGGCCCGAGTCGGTAAGGCGCGAGGAAGAGCACGTGCAAAGAGACCCAAGAAGGGTAAAGGGATAGC
GGACGAAGAAGTAGGGGACTCAGAAAGCGTAACTAGCCGAGTACACCGTCCAGGGGATGGTGAAACCCGAAAAGAGGCTGGACCCAGTTGCAAAAGGATTCGCAGGGGTT
CTCCACAGAAACCAGGGTCAGGTAAGCATGTGGAATATAATGACAGGAGAAATTCGGAGGCTCGGACATGTCCCAGGGCCGAGCAGGACCAGAGGGGGCGAGAGTGGGAG
CTGTTCAGGTGGCTGAAAGAGGAGGACAACCATCGGGACTCCCAAAGAAGAATAGAGAACGAAGACATAGAAGGGTTGATCGGGCAGATGGAGCCGCCCTTCACTGACGA
GATAATGGGAGGGGAGGTGCCACATAAATTTAAGGTAGAAGGTTATGACGACGGAGTCGCCCTGACTGCAGTGATCTCGAGTTTGCAGGATGAGAGGTTGCTCAACTCGA
TTGGCGAAAGCCAACCACGAACGTATGTGGAGTTCATGACTAGAGCACAGAGGTACATCAGTGCCGAGGAGTTGCTCAAGTCCAAGCAGGAAGAAAGAGAGAGTCGAGGA
GTTTCTTTATCCAACCGGCATCGAGAGGATCGGGCAAAGGGGCGCCAAACCGATGATAGAAGCCGAGGTCGACATGAGCAGTCCTCGGCCAATGGCCGAGGCCGAGCAAA
AGCCAAGGATCTGCGGGGCCGTGCAGAGCCGAAAGCCAAGTTCGACAGGTATACCCCACTAACGGCTTCACTTGAACAGGTTTTGGCCGCGATACAGGATACGAACCTGC
TAAAACGTCCGGAAAGGCTGAGGTCAGACCCAGACAGGAGAAACCGAAACAAGTATTGCATGTTCCATGGAGACCACGGTCACACAACTCGGGAGTGCATACAGCTAAGG
GATGAAATAGAAACCCTAATTCGAGAGGGTTACCTCAAGGAGTTCGTGGGACATGATAGGGGGAAGAGGCCAGCGCAGACAGAGCCAGACAAGGGGGGCAACAACCCACC
GTTGGAGATACGAACTATTCTTGGGGGACCCACCGAGGGAGAATCGAGCAGGAAGCGAAAAGCCGCGATTCGAGAAGCACATATGGAGCCTGGAGGGCAAGGCTTTGGCG
GAGAAAGAGTAAGCCCAAGGGGAAGCATTGAGCTGCCGGTGACGTTTGGTGAAGGGCTGCAGACAGTTACTAGAATGATAAGCTTCCTAGTTGTGGACTGTGTCCCAGCA
TACAACGCAATCCTGGGACGGCCAACCCTACATGGGCTCAAAGCAGTAGCCTCAACCTACCATCAAGTCCTGAAGTTCCCAACCGAAGAAGGCGTAGGAGAGGTACGAGG
AGAGCAGAGGACATCAAGAGAGTGCTATTTCATGGCGTTCAAGAATATAGACAAAAAGGCCAAGGCAGCGCCCACCTTGGGAGATGGCCGAGGTCGAGCGTCCGAGGGGT
CGGATCACCCAATGGAGCACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGACGGTCTTTGGGACGGGTCGGGATGCAAAGGCGGCATCGTGCGTCTTTAGAAACTCGACGGCGAGTTCGGGGGAAGTGACGGGTGTTATGGGCAAAATTAGTCC
TTGGTATTTTATGCAAGATAGGCCCAAAATTAGTCAGGTCCGAAGACGACGGAAGCCAAGCCGACGGAACAGAGGGTCGGAACTTCTACCGGCCCATGGGCATAACCCTC
GGCCTCGGCCCAAGGGAGAGGCCGAGGAAGAGGTCGGCCTCGGCCCATTGCCGAGGCCGACCAGGGCCAAAGGCCCGGAGATGAACTGGAGTCCGGCGATAGTGAAAACC
CTATGGGGAGTCTACAAAAAGGAGGACCACGCACACATTCAGGAGGCCCGAGTCGGTAAGGCGCGAGGAAGAGCACGTGCAAAGAGACCCAAGAAGGGTAAAGGGATAGC
GGACGAAGAAGTAGGGGACTCAGAAAGCGTAACTAGCCGAGTACACCGTCCAGGGGATGGTGAAACCCGAAAAGAGGCTGGACCCAGTTGCAAAAGGATTCGCAGGGGTT
CTCCACAGAAACCAGGGTCAGGTAAGCATGTGGAATATAATGACAGGAGAAATTCGGAGGCTCGGACATGTCCCAGGGCCGAGCAGGACCAGAGGGGGCGAGAGTGGGAG
CTGTTCAGGTGGCTGAAAGAGGAGGACAACCATCGGGACTCCCAAAGAAGAATAGAGAACGAAGACATAGAAGGGTTGATCGGGCAGATGGAGCCGCCCTTCACTGACGA
GATAATGGGAGGGGAGGTGCCACATAAATTTAAGGTAGAAGGTTATGACGACGGAGTCGCCCTGACTGCAGTGATCTCGAGTTTGCAGGATGAGAGGTTGCTCAACTCGA
TTGGCGAAAGCCAACCACGAACGTATGTGGAGTTCATGACTAGAGCACAGAGGTACATCAGTGCCGAGGAGTTGCTCAAGTCCAAGCAGGAAGAAAGAGAGAGTCGAGGA
GTTTCTTTATCCAACCGGCATCGAGAGGATCGGGCAAAGGGGCGCCAAACCGATGATAGAAGCCGAGGTCGACATGAGCAGTCCTCGGCCAATGGCCGAGGCCGAGCAAA
AGCCAAGGATCTGCGGGGCCGTGCAGAGCCGAAAGCCAAGTTCGACAGGTATACCCCACTAACGGCTTCACTTGAACAGGTTTTGGCCGCGATACAGGATACGAACCTGC
TAAAACGTCCGGAAAGGCTGAGGTCAGACCCAGACAGGAGAAACCGAAACAAGTATTGCATGTTCCATGGAGACCACGGTCACACAACTCGGGAGTGCATACAGCTAAGG
GATGAAATAGAAACCCTAATTCGAGAGGGTTACCTCAAGGAGTTCGTGGGACATGATAGGGGGAAGAGGCCAGCGCAGACAGAGCCAGACAAGGGGGGCAACAACCCACC
GTTGGAGATACGAACTATTCTTGGGGGACCCACCGAGGGAGAATCGAGCAGGAAGCGAAAAGCCGCGATTCGAGAAGCACATATGGAGCCTGGAGGGCAAGGCTTTGGCG
GAGAAAGAGTAAGCCCAAGGGGAAGCATTGAGCTGCCGGTGACGTTTGGTGAAGGGCTGCAGACAGTTACTAGAATGATAAGCTTCCTAGTTGTGGACTGTGTCCCAGCA
TACAACGCAATCCTGGGACGGCCAACCCTACATGGGCTCAAAGCAGTAGCCTCAACCTACCATCAAGTCCTGAAGTTCCCAACCGAAGAAGGCGTAGGAGAGGTACGAGG
AGAGCAGAGGACATCAAGAGAGTGCTATTTCATGGCGTTCAAGAATATAGACAAAAAGGCCAAGGCAGCGCCCACCTTGGGAGATGGCCGAGGTCGAGCGTCCGAGGGGT
CGGATCACCCAATGGAGCACTGA
Protein sequenceShow/hide protein sequence
MSTVFGTGRDAKAASCVFRNSTASSGEVTGVMGKISPWYFMQDRPKISQVRRRRKPSRRNRGSELLPAHGHNPRPRPKGEAEEEVGLGPLPRPTRAKGPEMNWSPAIVKT
LWGVYKKEDHAHIQEARVGKARGRARAKRPKKGKGIADEEVGDSESVTSRVHRPGDGETRKEAGPSCKRIRRGSPQKPGSGKHVEYNDRRNSEARTCPRAEQDQRGREWE
LFRWLKEEDNHRDSQRRIENEDIEGLIGQMEPPFTDEIMGGEVPHKFKVEGYDDGVALTAVISSLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRG
VSLSNRHREDRAKGRQTDDRSRGRHEQSSANGRGRAKAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPDRRNRNKYCMFHGDHGHTTRECIQLR
DEIETLIREGYLKEFVGHDRGKRPAQTEPDKGGNNPPLEIRTILGGPTEGESSRKRKAAIREAHMEPGGQGFGGERVSPRGSIELPVTFGEGLQTVTRMISFLVVDCVPA
YNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEVRGEQRTSRECYFMAFKNIDKKAKAAPTLGDGRGRASEGSDHPMEH