; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011408 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011408
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRibonuclease H
Genome locationchr1:23894913..23902291
RNA-Seq ExpressionLag0011408
SyntenyLag0011408
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_015383577.1 uncharacterized protein LOC107176071 [Citrus sinensis]4.5e-3634.38Show/hide
Query:  DNHRDSQRRTKNEDIEGLIGQMG----------PPFTDEIMGGDVPHKFKVPNFPRYDGKKDPKQHLDAYLTWMDFHGANEATRCRAFTLTLMGLARQWF
        DN   S  + + + +E ++  MG          PPFT EIM    P  F++P+   YDG+KDP +H++ Y + M+  G + A  CRAF LTL G AR+WF
Subjt:  DNHRDSQRRTKNEDIEGLIGQMG----------PPFTDEIMGGDVPHKFKVPNFPRYDGKKDPKQHLDAYLTWMDFHGANEATRCRAFTLTLMGLARQWF

Query:  SKILRKSIGSFKELVRAFVTQFLGARSRQKPQINLLTVKQGPRESLKDYINRLSNEVLQVEGYDDGVALT--AVISDRGKGRQAEERGQS----------
         ++   SI SF EL+R F   F  AR R KP+  LLTVKQ   ESL++YI R + E  QV+GYDDGV L    ++S   K   AEER QS          
Subjt:  SKILRKSIGSFKELVRAFVTQFLGARSRQKPQINLLTVKQGPRESLKDYINRLSNEVLQVEGYDDGVALT--AVISDRGKGRQAEERGQS----------

Query:  ---------------RHEYSSANGRGRP--EAKDSQGRAELKAKFGRYTPLTTPLEQVLTALHDTNMLKRPNKLRSDPDRSNTTRECIQLRDEIETIIRE
                       R + +S   + RP  +    + R+    +F  YT L  P E +L  + ++ + K P  L+SD               +IE+++R+
Subjt:  ---------------RHEYSSANGRGRP--EAKDSQGRAELKAKFGRYTPLTTPLEQVLTALHDTNMLKRPNKLRSDPDRSNTTRECIQLRDEIETIIRE

Query:  GYLKEFV-GQDREKRPSLPK
        G L+ +V GQ   +   LP+
Subjt:  GYLKEFV-GQDREKRPSLPK

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]1.3e-4637.04Show/hide
Query:  KEEDNHRDSQRRTKNEDIEGLIGQMGPPFTDEIMGGDVPHKFKVPNFPRYDGKKDPKQHLDAYLTWMDFHGANEATRCRAFTLTLMGLARQWFSKILRKS
        + E + +    + K  D+E L+ Q   PFT+EIM   VP KFK+P   ++D   DP  HLDAY  WMD +G +EA RCR F+ TL G AR WF ++ R S
Subjt:  KEEDNHRDSQRRTKNEDIEGLIGQMGPPFTDEIMGGDVPHKFKVPNFPRYDGKKDPKQHLDAYLTWMDFHGANEATRCRAFTLTLMGLARQWFSKILRKS

Query:  IGSFKELVRAFVTQFLGARSRQKPQINLLTVKQGPRESLKDYINRLSNEVLQVEGYDDGVALTAVISD--------------------------------
        I SFK L RAFVTQF+G R R +P   LLT+KQ   ESL+DY+ R + E LQVEG  D V+L A +S                                 
Subjt:  IGSFKELVRAFVTQFLGARSRQKPQINLLTVKQGPRESLKDYINRLSNEVLQVEGYDDGVALTAVISD--------------------------------

Query:  ---RGKGRQAEERGQSRHEYSSANGRG-RPEAKDSQGRAELKAKFGRYTPLTTPLEQVLTALHDTNMLKRPNKLRSDP-------------DRSNTTREC
             K     +R   + E S    +G R E +D   + +   KF +YTP T P+EQVL  + D  +LK P ++++               D  + T++C
Subjt:  ---RGKGRQAEERGQSRHEYSSANGRG-RPEAKDSQGRAELKAKFGRYTPLTTPLEQVLTALHDTNMLKRPNKLRSDP-------------DRSNTTREC

Query:  IQLRDEIETIIREGYLKEFVGQDR
          L++E+E +IR GYLKE+V + +
Subjt:  IQLRDEIETIIREGYLKEFVGQDR

XP_023916366.1 uncharacterized protein LOC112027956 [Quercus suber]1.5e-3633.43Show/hide
Query:  ERELSKWLKEEDNHRDSQRRTKNEDIEGLIGQMGPPFTDEIMGGDVPHKFKVPNFPRYDGKKDPKQHLDAYLTWMDFHGANEATRCRAFTLTLMGLARQW
        E+E+ +  +  D  R++ RRT    +E L+ +   PFT  I G  +P KFK+P+   YDG +DP  H+  + T M   G  +   CRAF  TL G AR W
Subjt:  ERELSKWLKEEDNHRDSQRRTKNEDIEGLIGQMGPPFTDEIMGGDVPHKFKVPNFPRYDGKKDPKQHLDAYLTWMDFHGANEATRCRAFTLTLMGLARQW

Query:  FSKILRKSIGSFKELVRAFVTQFLGARSRQKPQINLLTVKQGPRESLKDYINRLSNEVLQVEGYDDGVALTA----VISDR-------------------
        FSKI   S+ SF+EL + FV  F+G +  ++   +LLT++QG  ESL+ +I R + + L V+  DD + L A    V SD                    
Subjt:  FSKILRKSIGSFKELVRAFVTQFLGARSRQKPQINLLTVKQGPRESLKDYINRLSNEVLQVEGYDDGVALTA----VISDR-------------------

Query:  ------------GKGRQAEERGQSRHEYSSANG----RGR-PEAKDSQGRAELKAKFGRYTPLTTPLEQVLTALHDTNMLKRPNKLRSDPDRSN------
                     K R+  ER ++     S  G    +GR  + KD   +A   A+  +YTPL  PLEQVL  + D   LK P K+R DP++ N      
Subjt:  ------------GKGRQAEERGQSRHEYSSANG----RGR-PEAKDSQGRAELKAKFGRYTPLTTPLEQVLTALHDTNMLKRPNKLRSDPDRSN------

Query:  -------TTRECIQLRDEIETIIREGYLKEFVGQDREKRPSLPKLFPRRVEVRENPKPP
                T EC  L+ +IE +IR+G LK F+G+D +      KL   + ++ E+ +PP
Subjt:  -------TTRECIQLRDEIETIIREGYLKEFVGQDREKRPSLPKLFPRRVEVRENPKPP

XP_024041095.1 uncharacterized protein LOC112098853 [Citrus clementina]6.6e-4030.11Show/hide
Query:  QKGRERELSKWLKEEDNHRDSQRRTKNEDIEGLIGQMG----------PPFTDEIMGGDVPHKFKVPNFPRYDGKKDPKQHLDAYLTWMDFHGANEATRC
        Q+   R +++ +  +    D   + + + +E ++ +MG          PPFT +IM    P +F +P    YDG++DP +HL+ Y T M+  GA++A  C
Subjt:  QKGRERELSKWLKEEDNHRDSQRRTKNEDIEGLIGQMG----------PPFTDEIMGGDVPHKFKVPNFPRYDGKKDPKQHLDAYLTWMDFHGANEATRC

Query:  RAFTLTLMGLARQWFSKILRKSIGSFKELVRAFVTQFLGARSRQKPQINLLTVKQGPRESLKDYINRLSNEVLQVEGYDDGVALTAVISDRGKGR-----
        RAF LTL+G AR+WF ++   SI SF +L R F + F  AR R KP   LLTVKQ   E+L+DYI R +NE+ QV+GYDDG+AL+ ++     G+     
Subjt:  RAFTLTLMGLARQWFSKILRKSIGSFKELVRAFVTQFLGARSRQKPQINLLTVKQGPRESLKDYINRLSNEVLQVEGYDDGVALTAVISDRGKGR-----

Query:  -------------------QAEERGQSRHEYSSANGRGRPEAKDSQ--------------------------GRAELKAKFGRYTPLTTPLEQVLTALHD
                            AEER ++R++    + +G+ +  D +                           R  L ++F  +T L TP EQ+L  + +
Subjt:  -------------------QAEERGQSRHEYSSANGRGRPEAKDSQ--------------------------GRAELKAKFGRYTPLTTPLEQVLTALHD

Query:  TNMLKRPNKLRSDPDRSN-------------TTRECIQLRDEIETIIREGYLKEFVGQDREK
          + + P  ++++P R N              T EC +L+++IE+++R+G L+E+V    ++
Subjt:  TNMLKRPNKLRSDPDRSN-------------TTRECIQLRDEIETIIREGYLKEFVGQDREK

XP_030955724.1 uncharacterized protein LOC115977839 [Quercus lobata]7.6e-3633.7Show/hide
Query:  KEEDNHRDSQRRTKNEDIEGLIGQMGPPFTDEIMGGDVPHKFKVPNFPRYDGKKDPKQHLDAYLTWMDFHGANEATRCRAFTLTLMGLARQWFSKILRKS
        K  +  +++ RRT    IE L+ +   PFT  I G  +P KFK+P+   YDG +DP  H+  + T M   G  +   CRAF  TL G AR WFSKI   S
Subjt:  KEEDNHRDSQRRTKNEDIEGLIGQMGPPFTDEIMGGDVPHKFKVPNFPRYDGKKDPKQHLDAYLTWMDFHGANEATRCRAFTLTLMGLARQWFSKILRKS

Query:  IGSFKELVRAFVTQFLGARSRQKPQINLLTVKQGPRESLKDYINRLSNEVLQVEGYDDGVALTA----VISD------------------RGKGRQAEER
        + SF+EL + FV  F+G +  ++   +LLT++QG  ESL+ +I R + E L V+  DD + L A    + SD                  R +  + E  
Subjt:  IGSFKELVRAFVTQFLGARSRQKPQINLLTVKQGPRESLKDYINRLSNEVLQVEGYDDGVALTA----VISD------------------RGKGRQAEER

Query:  GQSRHEYSSANGRGRPE-AKDSQGRAELKAKFGR---YTPLTTPLEQVLTALHDTNMLKRPNKLRSDPDRSN-------------TTRECIQLRDEIETI
             E +    +GR E  K+  GR       GR   YTPL  PL QVL  + D   LK P K++ DP++ N              T EC  L+ +IE +
Subjt:  GQSRHEYSSANGRGRPE-AKDSQGRAELKAKFGR---YTPLTTPLEQVLTALHDTNMLKRPNKLRSDPDRSN-------------TTRECIQLRDEIETI

Query:  IREGYLKEFVGQDRE--------KRPSLPKLFPRRVEVRENPKPPQAFPQKSVRTPRMNAML
        IR+G LK FVG+DR         +  S P L   R+ V  NP    +  +K+      N  L
Subjt:  IREGYLKEFVGQDRE--------KRPSLPKLFPRRVEVRENPKPPQAFPQKSVRTPRMNAML

TrEMBL top hitse value%identityAlignment
A0A2N9FE79 Ribonuclease H5.1e-3831.44Show/hide
Query:  RDDQTRKEAGPSHKKVRRNSSPGP-VPGMYTVGAEQGQKGRERELSKWLKEEDNHRDSQRRTKNEDIEGLIGQMGPPFTDEIMGGDVPHKFKVPNFPRYD
        R D++ K+  P  +  R NSS  P  P   T   +      E+EL +  K+  + ++S R     +++ L+ +   PF   I    +P +FKVP    +D
Subjt:  RDDQTRKEAGPSHKKVRRNSSPGP-VPGMYTVGAEQGQKGRERELSKWLKEEDNHRDSQRRTKNEDIEGLIGQMGPPFTDEIMGGDVPHKFKVPNFPRYD

Query:  GKKDPKQHLDAYLTWMDFHGANEATRCRAFTLTLMGLARQWFSKILRKSIGSFKELVRAFVTQFLGARSRQKPQINLLTVKQGPRESLKDYINRLSNEVL
        G KDP  +L+A+ T M      E   CRAF L L G AR WF+K+  +SIGSF +L RAF+  F+G++ R +P  +LL+VKQ   ESL+ +++R + E +
Subjt:  GKKDPKQHLDAYLTWMDFHGANEATRCRAFTLTLMGLARQWFSKILRKSIGSFKELVRAFVTQFLGARSRQKPQINLLTVKQGPRESLKDYINRLSNEVL

Query:  QVEGYDDGVALTAVISDRG----------KGRQAEERGQSRHEYSSANGRGRPEAKDSQGRAELKAKFGRYTPLTTPLEQVLTALHDTNMLKRPNKLRSD
        +++   + V +TA +++            + ++AE+R     +         PE K +        KF  +TPL TP++++L  + D   L+ P K+RSD
Subjt:  QVEGYDDGVALTAVISDRG----------KGRQAEERGQSRHEYSSANGRGRPEAKDSQGRAELKAKFGRYTPLTTPLEQVLTALHDTNMLKRPNKLRSD

Query:  P-------------DRSNTTRECIQLRDEIETIIREGYLKEFVGQDREKRPSLPKLFPRRVEVRENPKP
        P             D  + T EC+ L+++IET+IR+G L+++V +    RP+ P   P + E  E+ +P
Subjt:  P-------------DRSNTTRECIQLRDEIETIIREGYLKEFVGQDREKRPSLPKLFPRRVEVRENPKP

A0A2N9H5N1 Integrase catalytic domain-containing protein3.9e-3830.53Show/hide
Query:  RDDQTRKEAGPSHKKVRRNSSPGP-VPGMYTVGAEQGQKGRERELSKWLKEEDNHRDSQRRTKNEDIEGLIGQMGPPFTDEIMGGDVPHKFKVPNFPRYD
        R D++ K+  P  +  R NSS  P  P   T   +      E+EL +  K+  + ++S R     +++ L+ +   PF   I    +P +FKVP    +D
Subjt:  RDDQTRKEAGPSHKKVRRNSSPGP-VPGMYTVGAEQGQKGRERELSKWLKEEDNHRDSQRRTKNEDIEGLIGQMGPPFTDEIMGGDVPHKFKVPNFPRYD

Query:  GKKDPKQHLDAYLTWMDFHGANEATRCRAFTLTLMGLARQWFSKILRKSIGSFKELVRAFVTQFLGARSRQKPQINLLTVKQGPRESLKDYINRLSNEVL
        G KDP  +L+A+ T M      E   CRAF L L G AR WF+K+  +SIGSF +L RAF+  F+G++ R +P  +LL+VKQ   ESL+ +++R + E +
Subjt:  GKKDPKQHLDAYLTWMDFHGANEATRCRAFTLTLMGLARQWFSKILRKSIGSFKELVRAFVTQFLGARSRQKPQINLLTVKQGPRESLKDYINRLSNEVL

Query:  QVEGYDDGVALTAVISDRG----------KGRQAEERGQSRHEYSSANGRGRPEAKDSQGRAELKAKFGRYTPLTTPLEQVLTALHDTNMLKRPNKLRSD
        +++   + V +TA +++            + ++ E+R     +         PE K +        KF  +TPL TP++++L  + D   L+ P K+RSD
Subjt:  QVEGYDDGVALTAVISDRG----------KGRQAEERGQSRHEYSSANGRGRPEAKDSQGRAELKAKFGRYTPLTTPLEQVLTALHDTNMLKRPNKLRSD

Query:  P-------------DRSNTTRECIQLRDEIETIIREGYLKEFVGQDREKRPSLPKLFPRRVEVRENPKPPQAFPQKSVRT
        P             D  + T EC+ L++++ET+IR+G L+++V +    RP+ P +       RE  +P +  P   +RT
Subjt:  P-------------DRSNTTRECIQLRDEIETIIREGYLKEFVGQDREKRPSLPKLFPRRVEVRENPKPPQAFPQKSVRT

A0A2N9HH57 Ribonuclease H1.1e-4031.83Show/hide
Query:  RDDQTRKEAGPSHKKVRRNSSPGP-VPGMYTVGAEQGQKGRERELSKWLKEEDNHRDSQRRTKNEDIEGLIGQMGPPFTDEIMGGDVPHKFKVPNFPRYD
        R D++ K+  P  +  R NSS  P  P   T   +      E+EL +  K+  + ++S R     +++ L+ +   PF   I    +P +FKVP    +D
Subjt:  RDDQTRKEAGPSHKKVRRNSSPGP-VPGMYTVGAEQGQKGRERELSKWLKEEDNHRDSQRRTKNEDIEGLIGQMGPPFTDEIMGGDVPHKFKVPNFPRYD

Query:  GKKDPKQHLDAYLTWMDFHGANEATRCRAFTLTLMGLARQWFSKILRKSIGSFKELVRAFVTQFLGARSRQKPQINLLTVKQGPRESLKDYINRLSNEVL
        G KDP  +L+A+ T M      E   CRAF L L G AR WF+K+  +SIGSF +L RAF+  F+G++ R +P  +LL+VKQ   ESL+ +++R + E +
Subjt:  GKKDPKQHLDAYLTWMDFHGANEATRCRAFTLTLMGLARQWFSKILRKSIGSFKELVRAFVTQFLGARSRQKPQINLLTVKQGPRESLKDYINRLSNEVL

Query:  QVEGYDDGVALTAVIS--DRGKGRQAEERGQSRHEYSSAN-----GRGRPEAKDSQGRAELKAKFGRYTPLTTPLEQVLTALHDTNMLKRPNKLRSDP--
        +++   + V +TA ++   RG GR   E+ +     ++ N      R  P+ ++ +       KF  +TPL TP++++L  + D   L+ P K+RSDP  
Subjt:  QVEGYDDGVALTAVIS--DRGKGRQAEERGQSRHEYSSAN-----GRGRPEAKDSQGRAELKAKFGRYTPLTTPLEQVLTALHDTNMLKRPNKLRSDP--

Query:  -----------DRSNTTRECIQLRDEIETIIREGYLKEFVGQDREKRPSLPKLFPRRVEVRENPKPPQAFPQKSVRT
                   D  + T EC+ L+++IET+IR+G L+++V +    RP+ P         RE  +P +  P   +RT
Subjt:  -----------DRSNTTRECIQLRDEIETIIREGYLKEFVGQDREKRPSLPKLFPRRVEVRENPKPPQAFPQKSVRT

A0A2N9IJR2 Ribonuclease H6.7e-3830.26Show/hide
Query:  RDDQTRKEAGPSHKKVRRNSSPGP-VPGMYTVGAEQGQKGRERELSKWLKEEDNHRDSQRRTKNEDIEGLIGQMGPPFTDEIMGGDVPHKFKVPNFPRYD
        R+DQ  + +  + K    NSS  P  P   T   +      E+EL +  K+  + ++S R     +++ L+ +   PF   I    +P +FKVP    +D
Subjt:  RDDQTRKEAGPSHKKVRRNSSPGP-VPGMYTVGAEQGQKGRERELSKWLKEEDNHRDSQRRTKNEDIEGLIGQMGPPFTDEIMGGDVPHKFKVPNFPRYD

Query:  GKKDPKQHLDAYLTWMDFHGANEATRCRAFTLTLMGLARQWFSKILRKSIGSFKELVRAFVTQFLGARSRQKPQINLLTVKQGPRESLKDYINRLSNEVL
        G KDP  +L+A+ T M      E   CRAF L L G AR WF+K+  +SIGSF +L RAF+  F+G++ R +P  +LL+VKQ   ESL+ +++R + E +
Subjt:  GKKDPKQHLDAYLTWMDFHGANEATRCRAFTLTLMGLARQWFSKILRKSIGSFKELVRAFVTQFLGARSRQKPQINLLTVKQGPRESLKDYINRLSNEVL

Query:  QVEGYDDGVALTAVISDRG----------KGRQAEERGQSRHEYSSANGRGRPEAKDSQGRAELKAKFGRYTPLTTPLEQVLTALHDTNMLKRPNKLRSD
        +++   + V +TA +++            + ++ E+R     +         PE K +        KF  +TPL TP++++L  + D   L+ P K+RSD
Subjt:  QVEGYDDGVALTAVISDRG----------KGRQAEERGQSRHEYSSANGRGRPEAKDSQGRAELKAKFGRYTPLTTPLEQVLTALHDTNMLKRPNKLRSD

Query:  P-------------DRSNTTRECIQLRDEIETIIREGYLKEFVGQDREKRPSLPKLFPRRVEVRENPKPPQAFPQKSVRT
        P             D  + T EC+ L++++ET+IR+G L+++V +    RP+ P +       RE  +P +  P   +RT
Subjt:  P-------------DRSNTTRECIQLRDEIETIIREGYLKEFVGQDREKRPSLPKLFPRRVEVRENPKPPQAFPQKSVRT

A0A6J1DWY0 uncharacterized protein LOC1110252936.1e-4737.04Show/hide
Query:  KEEDNHRDSQRRTKNEDIEGLIGQMGPPFTDEIMGGDVPHKFKVPNFPRYDGKKDPKQHLDAYLTWMDFHGANEATRCRAFTLTLMGLARQWFSKILRKS
        + E + +    + K  D+E L+ Q   PFT+EIM   VP KFK+P   ++D   DP  HLDAY  WMD +G +EA RCR F+ TL G AR WF ++ R S
Subjt:  KEEDNHRDSQRRTKNEDIEGLIGQMGPPFTDEIMGGDVPHKFKVPNFPRYDGKKDPKQHLDAYLTWMDFHGANEATRCRAFTLTLMGLARQWFSKILRKS

Query:  IGSFKELVRAFVTQFLGARSRQKPQINLLTVKQGPRESLKDYINRLSNEVLQVEGYDDGVALTAVISD--------------------------------
        I SFK L RAFVTQF+G R R +P   LLT+KQ   ESL+DY+ R + E LQVEG  D V+L A +S                                 
Subjt:  IGSFKELVRAFVTQFLGARSRQKPQINLLTVKQGPRESLKDYINRLSNEVLQVEGYDDGVALTAVISD--------------------------------

Query:  ---RGKGRQAEERGQSRHEYSSANGRG-RPEAKDSQGRAELKAKFGRYTPLTTPLEQVLTALHDTNMLKRPNKLRSDP-------------DRSNTTREC
             K     +R   + E S    +G R E +D   + +   KF +YTP T P+EQVL  + D  +LK P ++++               D  + T++C
Subjt:  ---RGKGRQAEERGQSRHEYSSANGRG-RPEAKDSQGRAELKAKFGRYTPLTTPLEQVLTALHDTNMLKRPNKLRSDP-------------DRSNTTREC

Query:  IQLRDEIETIIREGYLKEFVGQDR
          L++E+E +IR GYLKE+V + +
Subjt:  IQLRDEIETIIREGYLKEFVGQDR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACCATCCGAGGGATGATCAGACCCGGAAGGAAGCTGGACCTAGCCACAAAAAGGTTCGCAGGAATTCGTCGCCGGGGCCAGTACCAGGTATGTATACTGTCGGGGC
CGAGCAGGGCCAGAAGGGGCGAGAGCGAGAGCTATCCAAGTGGCTCAAAGAGGAAGACAACCATCGTGACTCCCAAAGAAGAACTAAGAATGAAGACATAGAAGGGTTGA
TCGGACAGATGGGACCACCCTTCACTGATGAAATAATGGGAGGAGATGTGCCGCATAAATTCAAGGTACCAAACTTCCCACGGTATGACGGAAAGAAAGATCCAAAACAG
CACTTAGACGCATACCTAACTTGGATGGATTTCCACGGGGCGAACGAAGCGACAAGATGTCGAGCCTTCACATTAACACTCATGGGTTTGGCAAGACAATGGTTTAGCAA
GATCCTGCGGAAATCAATCGGTTCGTTTAAAGAGTTGGTGCGAGCATTTGTTACGCAATTCTTAGGAGCGCGGAGCCGACAGAAGCCTCAGATCAACCTGCTGACAGTAA
AGCAGGGGCCTCGGGAAAGCCTGAAGGATTATATTAACAGACTTAGTAACGAAGTTTTGCAGGTAGAAGGCTATGACGATGGGGTTGCCTTGACCGCTGTAATTTCAGAC
AGAGGGAAGGGACGCCAGGCTGAAGAGAGAGGCCAAAGTCGACACGAGTACTCCTCGGCCAATGGTCGAGGCCGACCAGAGGCCAAGGATTCGCAGGGTCGTGCAGAGTT
GAAAGCCAAGTTTGGCAGGTATACACCGCTAACAACTCCACTTGAACAGGTTTTAACTGCGTTACATGATACAAATATGCTGAAACGCCCAAACAAGTTGAGATCAGACC
CAGATAGGAGTAACACCACCCGGGAATGCATACAGCTAAGGGATGAGATAGAAACCATAATCCGAGAGGGTTACCTTAAGGAATTTGTGGGACAGGACAGAGAAAAAAGA
CCAAGCCTCCCCAAGCTTTTTCCCAGAAGAGTAGAGGTCCGGGAAAATCCCAAGCCTCCCCAAGCTTTTCCTCAGAAGAGTGTACGCACCCCTAGGATGAACGCGATGCT
GGAAGAGAAGCTCGGGAACTTCCCAGGCCTCCCCAAGTTTTCCTCAATCTTCTCAAGTCATGCAGGGAAAATGTTCGGCCTCATGCCAAGGCCGAGGCCGACCATTAGAC
AAGCGATGAGGCTTAGTCTGGCCTCCCCAAGTTTTCTCCAGAAGAGTGTACGCACCCTTGGGATGAACACAGCGCTTGAAGAGAGGCTCGAGAATTTCCTCGGCCTCCCC
AAGTTCCCCCAGAAGAGTGTACGCACCCCTGGGATGAACACAACACTTGAAGATAGGTTGACGCATGCCCTCAACCTCCCCAAGTCCCCAACTGAGTGCAGACACCTTTG
GGATGGAAAAGAAGTAAGCAAACAATGCAGAAGAGTTGGTCGGTTCATGCCAAGGCTGAGGCCGACCATCCAGAGGCCAAGGCCGAGCTCCACCCTCAAAGACCCAGGAT
GTGGCACATGTGGCCAGATGAGAATGATTCTGGCAGAGGCCAAACTTCTAGAGGTTGACTCATCTCCACAACCTCCTCAAGGCCACAGGTTATCTCAACTCATCTGGTTG
GGCACATCTTGTCGAGTCCGAGCACAAACCCTAAAGGATAGCAAAGAAGTCATACATAAGTTGGCACGAGACAAGAGTCGTACTCTTAAATCGGACTTAAGGAGCTCAGC
CTATAGCAGTCGAGGCCAACCATTTGGCCCTCGGGCGTGGGTCGAGCTTGGCCACCTCCCTTCGGTCTTTGATGTCCCTGACCGCCTCGGTTTCGCCTGGCCTGGAGATC
ATTTGAAGTCCGATGTCGACGAAAATCCTAAGAGGAAAGCTATAAAAAGGGAGACCGCACACGCATTCAGGAAGGCCCAAGTTCGTGGGGCCCAAAAACGGAAGGAACTC
AATCAACGGGACAAAGGGTCGGAGCACGCTCCGGCCCCAGGTGAAGACCCTCGGCCTCGGCCCAAGGGAGAGGCCGAGGGTCATGGTCGGCCTCGGCTAGGCCAAATGCC
CAAATTTCCCTTCTGGAGTTGGGGAGACTCGCAACCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCACCATCCGAGGGATGATCAGACCCGGAAGGAAGCTGGACCTAGCCACAAAAAGGTTCGCAGGAATTCGTCGCCGGGGCCAGTACCAGGTATGTATACTGTCGGGGC
CGAGCAGGGCCAGAAGGGGCGAGAGCGAGAGCTATCCAAGTGGCTCAAAGAGGAAGACAACCATCGTGACTCCCAAAGAAGAACTAAGAATGAAGACATAGAAGGGTTGA
TCGGACAGATGGGACCACCCTTCACTGATGAAATAATGGGAGGAGATGTGCCGCATAAATTCAAGGTACCAAACTTCCCACGGTATGACGGAAAGAAAGATCCAAAACAG
CACTTAGACGCATACCTAACTTGGATGGATTTCCACGGGGCGAACGAAGCGACAAGATGTCGAGCCTTCACATTAACACTCATGGGTTTGGCAAGACAATGGTTTAGCAA
GATCCTGCGGAAATCAATCGGTTCGTTTAAAGAGTTGGTGCGAGCATTTGTTACGCAATTCTTAGGAGCGCGGAGCCGACAGAAGCCTCAGATCAACCTGCTGACAGTAA
AGCAGGGGCCTCGGGAAAGCCTGAAGGATTATATTAACAGACTTAGTAACGAAGTTTTGCAGGTAGAAGGCTATGACGATGGGGTTGCCTTGACCGCTGTAATTTCAGAC
AGAGGGAAGGGACGCCAGGCTGAAGAGAGAGGCCAAAGTCGACACGAGTACTCCTCGGCCAATGGTCGAGGCCGACCAGAGGCCAAGGATTCGCAGGGTCGTGCAGAGTT
GAAAGCCAAGTTTGGCAGGTATACACCGCTAACAACTCCACTTGAACAGGTTTTAACTGCGTTACATGATACAAATATGCTGAAACGCCCAAACAAGTTGAGATCAGACC
CAGATAGGAGTAACACCACCCGGGAATGCATACAGCTAAGGGATGAGATAGAAACCATAATCCGAGAGGGTTACCTTAAGGAATTTGTGGGACAGGACAGAGAAAAAAGA
CCAAGCCTCCCCAAGCTTTTTCCCAGAAGAGTAGAGGTCCGGGAAAATCCCAAGCCTCCCCAAGCTTTTCCTCAGAAGAGTGTACGCACCCCTAGGATGAACGCGATGCT
GGAAGAGAAGCTCGGGAACTTCCCAGGCCTCCCCAAGTTTTCCTCAATCTTCTCAAGTCATGCAGGGAAAATGTTCGGCCTCATGCCAAGGCCGAGGCCGACCATTAGAC
AAGCGATGAGGCTTAGTCTGGCCTCCCCAAGTTTTCTCCAGAAGAGTGTACGCACCCTTGGGATGAACACAGCGCTTGAAGAGAGGCTCGAGAATTTCCTCGGCCTCCCC
AAGTTCCCCCAGAAGAGTGTACGCACCCCTGGGATGAACACAACACTTGAAGATAGGTTGACGCATGCCCTCAACCTCCCCAAGTCCCCAACTGAGTGCAGACACCTTTG
GGATGGAAAAGAAGTAAGCAAACAATGCAGAAGAGTTGGTCGGTTCATGCCAAGGCTGAGGCCGACCATCCAGAGGCCAAGGCCGAGCTCCACCCTCAAAGACCCAGGAT
GTGGCACATGTGGCCAGATGAGAATGATTCTGGCAGAGGCCAAACTTCTAGAGGTTGACTCATCTCCACAACCTCCTCAAGGCCACAGGTTATCTCAACTCATCTGGTTG
GGCACATCTTGTCGAGTCCGAGCACAAACCCTAAAGGATAGCAAAGAAGTCATACATAAGTTGGCACGAGACAAGAGTCGTACTCTTAAATCGGACTTAAGGAGCTCAGC
CTATAGCAGTCGAGGCCAACCATTTGGCCCTCGGGCGTGGGTCGAGCTTGGCCACCTCCCTTCGGTCTTTGATGTCCCTGACCGCCTCGGTTTCGCCTGGCCTGGAGATC
ATTTGAAGTCCGATGTCGACGAAAATCCTAAGAGGAAAGCTATAAAAAGGGAGACCGCACACGCATTCAGGAAGGCCCAAGTTCGTGGGGCCCAAAAACGGAAGGAACTC
AATCAACGGGACAAAGGGTCGGAGCACGCTCCGGCCCCAGGTGAAGACCCTCGGCCTCGGCCCAAGGGAGAGGCCGAGGGTCATGGTCGGCCTCGGCTAGGCCAAATGCC
CAAATTTCCCTTCTGGAGTTGGGGAGACTCGCAACCCTAG
Protein sequenceShow/hide protein sequence
MHHPRDDQTRKEAGPSHKKVRRNSSPGPVPGMYTVGAEQGQKGRERELSKWLKEEDNHRDSQRRTKNEDIEGLIGQMGPPFTDEIMGGDVPHKFKVPNFPRYDGKKDPKQ
HLDAYLTWMDFHGANEATRCRAFTLTLMGLARQWFSKILRKSIGSFKELVRAFVTQFLGARSRQKPQINLLTVKQGPRESLKDYINRLSNEVLQVEGYDDGVALTAVISD
RGKGRQAEERGQSRHEYSSANGRGRPEAKDSQGRAELKAKFGRYTPLTTPLEQVLTALHDTNMLKRPNKLRSDPDRSNTTRECIQLRDEIETIIREGYLKEFVGQDREKR
PSLPKLFPRRVEVRENPKPPQAFPQKSVRTPRMNAMLEEKLGNFPGLPKFSSIFSSHAGKMFGLMPRPRPTIRQAMRLSLASPSFLQKSVRTLGMNTALEERLENFLGLP
KFPQKSVRTPGMNTTLEDRLTHALNLPKSPTECRHLWDGKEVSKQCRRVGRFMPRLRPTIQRPRPSSTLKDPGCGTCGQMRMILAEAKLLEVDSSPQPPQGHRLSQLIWL
GTSCRVRAQTLKDSKEVIHKLARDKSRTLKSDLRSSAYSSRGQPFGPRAWVELGHLPSVFDVPDRLGFAWPGDHLKSDVDENPKRKAIKRETAHAFRKAQVRGAQKRKEL
NQRDKGSEHAPAPGEDPRPRPKGEAEGHGRPRLGQMPKFPFWSWGDSQP