; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg031848 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg031848
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold11:39225731..39229180
RNA-Seq ExpressionSpg031848
SyntenySpg031848
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG5515315.1 hypothetical protein RHGRI_036380 [Rhododendron griersonianum]7.5e-2627.19Show/hide
Query:  RSFKFEELWTRHEECSAIIANNGNWKGTDLSFSTLSNDLSTCSIALQNWGRNINKTRKNRIKECKDLLVTEYKKAPNV-NFTNIHAIEFELDNLLLEEEI
        + FKFE +W     C   I +N + +    +    +  L      L+NW +      + R+ E KD L      AP   N     AI   ++ ++  EE+
Subjt:  RSFKFEELWTRHEECSAIIANNGNWKGTDLSFSTLSNDLSTCSIALQNWGRNINKTRKNRIKECKDLLVTEYKKAPNV-NFTNIHAIEFELDNLLLEEEI

Query:  YWRQRSREDWLRWGDRNSKWFHRKASARKKTNEILGIMDENGLW--TDDSQAIKDRFIWEDIPANVHLNEDFNNCLID-------RWMRILEDPSPTNLD
        Y  QRSR +WL +GDRNSK+F+     R++ N+IL +   +G W  +DD   ++    +  + +N+    DF++ + +          RI E+    NL+
Subjt:  YWRQRSREDWLRWGDRNSKWFHRKASARKKTNEILGIMDENGLW--TDDSQAIKDRFIWEDIPANVHLNEDFNNCLID-------RWMRILEDPSPTNLD

Query:  -----LVITTCWSIWSDRNKIIHGESIPPANVRKNWILNYLTDFRRTNVPEV--LKKNPSPSTDPSSTHLNIPSPESDALRIFVDAAWTSNPPETGIGVV
              VI  CW IW  RN++I   +    +V     L    +F          L ++P PST      L+   P + +L+I  DA+W++       G++
Subjt:  -----LVITTCWSIWSDRNKIIHGESIPPANVRKNWILNYLTDFRRTNVPEV--LKKNPSPSTDPSSTHLNIPSPESDALRIFVDAAWTSNPPETGIGVV

Query:  CFNQRGSLEGAAASSFDLDFKAPLAELKAINEGLKLAKALACSKVVIKSDCLQAIKF-VTKASETW------SDVEVLVKEIWDNAESFEMCTFDYIPRV
          + RG+L             A LAE   + E    AKAL  S V I++D  Q I   V++    W      SD+ +L +E        E  +F + PR 
Subjt:  CFNQRGSLEGAAASSFDLDFKAPLAELKAINEGLKLAKALACSKVVIKSDCLQAIKF-VTKASETW------SDVEVLVKEIWDNAESFEMCTFDYIPRV

Query:  HNSLADYIA-KEAKRLGFNAVWVSSIPEWVVSLV
         N  A ++A  +A  +G N  WV   PE + S++
Subjt:  HNSLADYIA-KEAKRLGFNAVWVSSIPEWVVSLV

KAG5537975.1 hypothetical protein RHGRI_025166 [Rhododendron griersonianum]2.0e-2323.48Show/hide
Query:  RKKTRSFKFEELWTRHEECSAIIANNGNWKGTDLSFSTLSNDLSTCSIALQNWGRNINKTRKNRIKECKDLLVTEYKKAP-NVNFTNIHAIEFELDNLLL
        ++  + FKFE +W+ H  C  II+++ +      S   L+  L  C  +L+ W        K +++  K  L       P + N   ++ I+ E++ LL 
Subjt:  RKKTRSFKFEELWTRHEECSAIIANNGNWKGTDLSFSTLSNDLSTCSIALQNWGRNINKTRKNRIKECKDLLVTEYKKAP-NVNFTNIHAIEFELDNLLL

Query:  EEEIYWRQRSREDWLRWGDRNSKWFHRKASARKKTNEILGIMDENGLWTDDSQAIKDRFIWEDIPANVHLNEDFNNCLIDRWMRILEDPSPTNLDLVITT
         EE+Y+ QRSR +WLR+GDRN+ +FH     R++ N++L + D+NG W                        D N  L   +  + +     N++ V+  
Subjt:  EEEIYWRQRSREDWLRWGDRNSKWFHRKASARKKTNEILGIMDENGLWTDDSQAIKDRFIWEDIPANVHLNEDFNNCLIDRWMRILEDPSPTNLDLVITT

Query:  CWSIWSDRNKIIHGESIPPANVRKNWILNYLTDFRRTNVPEV------------LKKNPSPSTDPSSTHLNIPSPESDALRIFVDAAWTSNPPETGIGVV
                       SIP +  R +W+ N++    R  +               ++      T+  ST  +   PE   ++   D A     P+  I VV
Subjt:  CWSIWSDRNKIIHGESIPPANVRKNWILNYLTDFRRTNVPEV------------LKKNPSPSTDPSSTHLNIPSPESDALRIFVDAAWTSNPPETGIGVV

Query:  CFNQRGSLEGAAASSFDLDFKAPLAELKAINEGLKLAKALAC--------SKVVIKSDCLQAIKFVTKASETWSDVEVLVKEIWDNAESFEMCTFDYIPR
          N +G +         LD  + L +  +  +G  LA  LAC        S+ +I+SD    IK  +  +    +   L+K+I   + +F    F ++PR
Subjt:  CFNQRGSLEGAAASSFDLDFKAPLAELKAINEGLKLAKALAC--------SKVVIKSDCLQAIKFVTKASETWSDVEVLVKEIWDNAESFEMCTFDYIPR

Query:  VHNSLADYIAKEAKRLGFNAVWVSSIPEWVVSLVEYDRSISAL
          N  + ++A +A +      WVS +P  +  + + D S+  L
Subjt:  VHNSLADYIAKEAKRLGFNAVWVSSIPEWVVSLVEYDRSISAL

KAG5542538.1 hypothetical protein RHGRI_022172 [Rhododendron griersonianum]4.4e-2627.42Show/hide
Query:  RSFKFEELWTRHEECSAIIANNGNWKGTDLSFSTLSNDLSTCSIALQNWGRNINKTRKNRIKECKDLLVTEYKKAPNV-NFTNIHAIEFELDNLLLEEEI
        + FKFE +W     C   I +N + +    +    +  L      L+NW +      + R+ E KD L      AP   N     AI   ++ ++  EE+
Subjt:  RSFKFEELWTRHEECSAIIANNGNWKGTDLSFSTLSNDLSTCSIALQNWGRNINKTRKNRIKECKDLLVTEYKKAPNV-NFTNIHAIEFELDNLLLEEEI

Query:  YWRQRSREDWLRWGDRNSKWFHRKASARKKTNEILGIMDENGLW--TDDSQAIKDRFIWEDIPANVHLNEDFNNCLID-------RWMRILEDPSPTNLD
        Y  QRSR +WL +GDRNSK+F+     R++ N+IL +   +G W  +DD   ++    +  + +N+    DF++ + +          RI E+    NLD
Subjt:  YWRQRSREDWLRWGDRNSKWFHRKASARKKTNEILGIMDENGLW--TDDSQAIKDRFIWEDIPANVHLNEDFNNCLID-------RWMRILEDPSPTNLD

Query:  -----LVITTCWSIWSDRNKIIHGESIPPANVRKNWILNYLTDFRRTNVPEV--LKKNPSPSTDPSSTHLNIPSPESDALRIFVDAAWTSNPPETGIGVV
              VI  CW IW  RN++I   +    +V     L    +F          L ++P PST      L+   P + +L+I  DA+W++       G++
Subjt:  -----LVITTCWSIWSDRNKIIHGESIPPANVRKNWILNYLTDFRRTNVPEV--LKKNPSPSTDPSSTHLNIPSPESDALRIFVDAAWTSNPPETGIGVV

Query:  CFNQRGSLEGAAASSFDLDFKAPLAELKAINEGLKLAKALACSKVVIKSDCLQAIKF-VTKASETW------SDVEVLVKEIWDNAESFEMCTFDYIPRV
          + RG+L             A LAE   + E    AKAL  S V I++D  Q I   V++    W      SD+ +L +E        E  +F + PR 
Subjt:  CFNQRGSLEGAAASSFDLDFKAPLAELKAINEGLKLAKALACSKVVIKSDCLQAIKF-VTKASETW------SDVEVLVKEIWDNAESFEMCTFDYIPRV

Query:  HNSLADYIA-KEAKRLGFNAVWVSSIPEWVVSLV
         N  A ++A   A  +G N  WV   PE + S++
Subjt:  HNSLADYIA-KEAKRLGFNAVWVSSIPEWVVSLV

KAG7568133.1 Ribonuclease H-like superfamily [Arabidopsis thaliana x Arabidopsis arenosa]7.2e-2124.22Show/hide
Query:  RSFKFEELWTRHEECSAIIANNGNWKGTDLSFSTLSNDLSTCSIALQNWGRNINKTRKNRIKECK-DLLVTEY-KKAPNVNFTNIHAIEFELDNLLLEEE
        R+F+F++ W   E     I+N  N    +L        L+ C  A+  W +N+    +  I+E K +LLV +   + P    T    +   L     +EE
Subjt:  RSFKFEELWTRHEECSAIIANNGNWKGTDLSFSTLSNDLSTCSIALQNWGRNINKTRKNRIKECK-DLLVTEY-KKAPNVNFTNIHAIEFELDNLLLEEE

Query:  IYWRQRSREDWLRWGDRNSKWFHRKASARKKTNEILGIMDENGLWTDDSQAIKDRFI--WEDIPANVHLNEDFNNCLIDRWMRILED-----PSPTNLDL
        +YW Q+SR  W++ GD NSK+FH     R+  N I G+ DENG+W  + + I+   +  +E++       E F   L +    I E       +P     
Subjt:  IYWRQRSREDWLRWGDRNSKWFHRKASARKKTNEILGIMDENGLWTDDSQAIKDRFI--WEDIPANVHLNEDFNNCLIDRWMRILED-----PSPTNLDL

Query:  VITTCWSIWSDRNKIIHGESIPPANVRKNWILNYLTDFRRTNVPEVLKKNPSPSTDPSSTHLNIPSPESD--ALRIFVDAAWTSNPPETGIGVVCFNQRG
        V T  + +  +  K    +   P  V +  +        +T   ++  +N       +   +   S ES+    R FVD +W  +   +G G  C +  G
Subjt:  VITTCWSIWSDRNKIIHGESIPPANVRKNWILNYLTDFRRTNVPEVLKKNPSPSTDPSSTHLNIPSPESD--ALRIFVDAAWTSNPPETGIGVVCFNQRG

Query:  SLEGAAASSFDLDFKAPLAELKAINEGLKLAKALACSKVVIKSDCLQAIKFVTKASETWSDVEVLVKEIWDNAESFEMCTFDYIPRVHNSLADYIAKEAK
              A++         AE++A+   +K        +V   +DC   +K V+  +E W    V ++E   + E F   +   I R  N+ AD +A++ +
Subjt:  SLEGAAASSFDLDFKAPLAELKAINEGLKLAKALACSKVVIKSDCLQAIKFVTKASETWSDVEVLVKEIWDNAESFEMCTFDYIPRVHNSLADYIAKEAK

Query:  RLGFNAVWVSSIP-EWV
            +  +V++IP EW+
Subjt:  RLGFNAVWVSSIP-EWV

RYR79715.1 hypothetical protein Ahy_A01g004533 [Arachis hypogaea]8.6e-2223.99Show/hide
Query:  KTRSFKFEELWTRHEECSAIIANNGN--WKGTDLSFSTLSNDLSTCSIALQNWGR-NINKTRKNRIKECKDLLVTEYKKAPNVNFTNIHAIEFELDNLLL
        +T+ FKFE  W  H  C  II    N     +   ++ LS  ++ C   L  W + N  +      K+   L + E +   +     I  ++ ++  L  
Subjt:  KTRSFKFEELWTRHEECSAIIANNGN--WKGTDLSFSTLSNDLSTCSIALQNWGR-NINKTRKNRIKECKDLLVTEYKKAPNVNFTNIHAIEFELDNLLL

Query:  EEEIYWRQRSREDWLRWGDRNSKWFHRKASARKKTNEILGIMDENGLWTDDSQAIKDRFIWEDIPANVHLNEDFNNCLIDRWMRILEDPSPTNLDLVITT
        +EE +W QRSR  WL++GD+N+ +FH     R+  N I+ + D +G W                       ED          +IL     T  D V+++
Subjt:  EEEIYWRQRSREDWLRWGDRNSKWFHRKASARKKTNEILGIMDENGLWTDDSQAIKDRFIWEDIPANVHLNEDFNNCLIDRWMRILEDPSPTNLDLVITT

Query:  ------CWSIWSDRNKIIHGESIPPANVRKNWILNYLTDFRRTNVPEVLKKNPSPSTDPSSTHLNIPSPESDALRIFVDAAWTSNPPETGIGVVCFNQRG
               W IW  RN+ +H  S P   +          DF        +  N    T      +    P    ++  VDAA+           V  +  G
Subjt:  ------CWSIWSDRNKIIHGESIPPANVRKNWILNYLTDFRRTNVPEVLKKNPSPSTDPSSTHLNIPSPESDALRIFVDAAWTSNPPETGIGVVCFNQRG

Query:  SLEGAAASSFDLDFKAPL-AELKAINEGLKLAKALACSKVVIKSDCLQAIKFVTKASETWSDVEVLVKEIWDNAESFEMCTFDYIPRVHNSLADYIAKEA
        SL   AAS+  +   +PL AE  A+ E L +AK     +++++SD L  I+   K+    ++++V++ +I +   S   C F ++PR  N LA  +A+  
Subjt:  SLEGAAASSFDLDFKAPL-AELKAINEGLKLAKALACSKVVIKSDCLQAIKFVTKASETWSDVEVLVKEIWDNAESFEMCTFDYIPRVHNSLADYIAKEA

Query:  KRLGFNAVWVSSIPEWVVSLV
                W+S  P+ +++++
Subjt:  KRLGFNAVWVSSIPEWVVSLV

TrEMBL top hitse value%identityAlignment
A0A803NSJ4 Uncharacterized protein3.9e-2036.49Show/hide
Query:  FKFEELWTRHEECSAIIANNGNWKGTDLSFSTLSNDLSTCSIALQNWGRNINKTRKNRIKECKDLLVTEYKKAPNVNFTNIHAIEFELDNLLLEEEIYWR
        F FE  W   E+CS I+  N N   +  +   L   L+ C  AL +W +   K    ++KE +D + +      + ++ ++  +E + + LL +EE +W+
Subjt:  FKFEELWTRHEECSAIIANNGNWKGTDLSFSTLSNDLSTCSIALQNWGRNINKTRKNRIKECKDLLVTEYKKAPNVNFTNIHAIEFELDNLLLEEEIYWR

Query:  QRSREDWLRWGDRNSKWFHRKASARKKTNEILGIMDENGLWTDDSQAI
        QRSR  WL+ GDRN+K+FHRKA+ RKK N ILGIMD N  W   ++ +
Subjt:  QRSREDWLRWGDRNSKWFHRKASARKKTNEILGIMDENGLWTDDSQAI

A0A803PI64 Uncharacterized protein1.0e-2036.14Show/hide
Query:  KRKKTRSFKFEELWTRHEECSAIIANNGNWKGTDLSFSTLSNDLSTCSI--ALQNWGRNINKTRKNRIKECKDLLVTEYKKAPNVNFTNIHAIEFELDNL
        K  K   F FEE W   ++C  I+    NW+  D+  S       T  +  AL +W R   K    +IK+ K  L+     +    +  +  IE +L+ +
Subjt:  KRKKTRSFKFEELWTRHEECSAIIANNGNWKGTDLSFSTLSNDLSTCSI--ALQNWGRNINKTRKNRIKECKDLLVTEYKKAPNVNFTNIHAIEFELDNL

Query:  LLEEEIYWRQRSREDWLRWGDRNSKWFHRKASARKKTNEILGIMDENGLWTDDS---QAIKDRFIW
        L ++E YWRQRSR  WL+WGD N+K+FHRKASAR+K N I G+MD+ G+W  ++   Q + + + W
Subjt:  LLEEEIYWRQRSREDWLRWGDRNSKWFHRKASARKKTNEILGIMDENGLWTDDS---QAIKDRFIW

A0A803PUH4 Uncharacterized protein1.0e-2036.49Show/hide
Query:  FKFEELWTRHEECSAIIANNGNWKGTDLSFSTLSNDLSTCSIALQNWGRNINKTRKNRIKECKDLLVTEYKKAPNVNFTNIHAIEFELDNLLLEEEIYWR
        F FE  W   E+C+ I+  + +  G+  +   L + L+ C  ALQ W ++     K+R+KE +D +    +   N ++  +  +E + + LL +EE +WR
Subjt:  FKFEELWTRHEECSAIIANNGNWKGTDLSFSTLSNDLSTCSIALQNWGRNINKTRKNRIKECKDLLVTEYKKAPNVNFTNIHAIEFELDNLLLEEEIYWR

Query:  QRSREDWLRWGDRNSKWFHRKASARKKTNEILGIMDENGLWTDDSQAI
        QRSR  WL+ GDRN+K+FHRKA+ RK+ N ILG++D NG W   ++ +
Subjt:  QRSREDWLRWGDRNSKWFHRKASARKKTNEILGIMDENGLWTDDSQAI

A0A803PV25 Uncharacterized protein4.1e-2234.04Show/hide
Query:  KRKKTRSFKFEELWTRHEECSAIIANNGNWKGTDLSFSTLSNDLSTCSIALQNWGRNINKTRKNRIKECKDLL-VTEYKKAPNVNFTNIHAIEFELDNLL
        K K+   F FEE W + EEC+ I+    + +       +    ++ C  ALQ W R       + +++ K  L      + P V +  I  +E +L+ LL
Subjt:  KRKKTRSFKFEELWTRHEECSAIIANNGNWKGTDLSFSTLSNDLSTCSIALQNWGRNINKTRKNRIKECKDLL-VTEYKKAPNVNFTNIHAIEFELDNLL

Query:  LEEEIYWRQRSREDWLRWGDRNSKWFHRKASARKKTNEILGIMDENGLWTDDSQAIKDRFIWEDIPANVHLNEDFNNCLIDRWMRILE
         ++E YWRQRSR  WL+WGDRN+K+FH KASAR+K NEI G+ D  G+W DD   +    I ED    +  + D N  +++  + +++
Subjt:  LEEEIYWRQRSREDWLRWGDRNSKWFHRKASARKKTNEILGIMDENGLWTDDSQAIKDRFIWEDIPANVHLNEDFNNCLIDRWMRILE

A0A803PWX1 Uncharacterized protein1.1e-2236.27Show/hide
Query:  KRKKTRSFKFEELWTRHEECSAIIANNGNWK-----GTDLSFSTLSNDLSTCSIALQNWGRNINKTRKNRIKECKDLL-VTEYKKAPNVNFTNIHAIEFE
        K K+   F FEE W + EEC+ II N   WK     G  +SF      ++ C  ALQ+W +       N I + K +L     ++ P V +  I  +E +
Subjt:  KRKKTRSFKFEELWTRHEECSAIIANNGNWK-----GTDLSFSTLSNDLSTCSIALQNWGRNINKTRKNRIKECKDLL-VTEYKKAPNVNFTNIHAIEFE

Query:  LDNLLLEEEIYWRQRSREDWLRWGDRNSKWFHRKASARKKTNEILGIMDENGLWTDDSQAIKDRFIWEDIPANVHLNEDFNNCLIDRWMRILE
        L+ LL ++E YWRQRSR  WL+WGDRN+K+FH KAS+R+K NEI G+ D+ G+W DD   +    I ED    + +  D +  ++   + +++
Subjt:  LDNLLLEEEIYWRQRSREDWLRWGDRNSKWFHRKASARKKTNEILGIMDENGLWTDDSQAIKDRFIWEDIPANVHLNEDFNNCLIDRWMRILE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein2.2e-0723.77Show/hide
Query:  LVITTCWSIWSDRNKII-HGESIPPANVRKNWILNYLTDFR-RTNVPEVLKKNPSPSTDPSSTHLNIPSPESDALRIFVDAAWTSNPPETGIGVVCFNQR
        LV    W +W +RN+++  G       V +    + L ++R RT   E       P  + SS     P P    ++   DA W  +    GIG V  N++
Subjt:  LVITTCWSIWSDRNKII-HGESIPPANVRKNWILNYLTDFR-RTNVPEVLKKNPSPSTDPSSTHLNIPSPESDALRIFVDAAWTSNPPETGIGVVCFNQR

Query:  GSLEGAAASSFDLDFKAPLAELKAINEGLKLAKALACSKVVIKSDCLQAIKFVTKASETWSDVEVLVKEIWDNAESFEMCTFDYIPRVHNSLADYIAKEA
        G ++   A +         AEL+A+   +        + V+ +SD  Q +  +    E W  ++  ++++      F    F +IPR  N+LA+ +A+E+
Subjt:  GSLEGAAASSFDLDFKAPLAELKAINEGLKLAKALACSKVVIKSDCLQAIKFVTKASETWSDVEVLVKEIWDNAESFEMCTFDYIPRVHNSLADYIAKEA

Query:  -KRLGFNAVWVSSIPEWVVSLVE
           L ++    S +P W  S ++
Subjt:  -KRLGFNAVWVSSIPEWVVSLVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCCCAAGTGGAGCTAAGTTCAAAAGGAAAAAGACAAGAAGCTTCAAATTTGAAGAGTTGTGGACTAGGCATGAGGAATGTTCCGCTATCATCGCAAATAATGGCAA
TTGGAAAGGTACTGACCTATCATTTTCTACTTTGTCTAATGACCTGTCTACCTGTTCTATAGCCTTACAAAATTGGGGTAGGAATATTAACAAAACAAGGAAAAATAGAA
TAAAAGAGTGTAAGGACTTGCTTGTTACAGAATATAAAAAAGCCCCTAATGTAAACTTTACCAATATCCATGCCATTGAATTTGAACTAGATAATTTGCTTTTAGAAGAG
GAAATTTATTGGCGGCAAAGGTCTAGGGAAGATTGGCTTAGATGGGGTGATAGGAATTCCAAGTGGTTCCATAGGAAAGCTTCAGCTAGGAAGAAAACAAATGAAATCCT
AGGCATTATGGATGAAAATGGGTTATGGACAGATGATTCTCAAGCTATAAAGGACAGATTTATTTGGGAAGATATTCCAGCAAATGTGCACCTGAATGAGGATTTCAACA
ATTGCCTAATCGATAGATGGATGAGAATTCTTGAGGATCCTAGTCCAACTAACCTAGATCTGGTGATAACAACATGCTGGTCAATTTGGAGCGACAGAAACAAAATCATT
CATGGAGAATCTATTCCCCCAGCAAATGTTCGTAAGAATTGGATATTAAATTACCTGACAGATTTTCGAAGAACAAATGTGCCGGAGGTGTTGAAGAAGAATCCTAGTCC
TTCTACGGATCCGTCATCTACGCATCTAAATATCCCCTCTCCAGAATCTGATGCCTTGAGAATTTTCGTTGATGCGGCATGGACTTCCAATCCCCCTGAAACTGGTATTG
GAGTGGTTTGCTTCAATCAGAGAGGTAGTTTGGAAGGAGCAGCGGCTTCATCTTTTGATTTGGATTTTAAAGCTCCCCTTGCTGAGCTGAAAGCCATAAACGAGGGCCTA
AAGCTGGCAAAAGCTCTTGCATGTTCAAAGGTAGTGATTAAGTCAGACTGTCTGCAAGCAATTAAATTTGTGACAAAAGCCTCTGAAACTTGGAGTGATGTGGAAGTGCT
TGTTAAGGAGATTTGGGACAACGCAGAATCATTTGAGATGTGTACTTTTGATTATATCCCCAGAGTGCACAATAGTTTAGCTGATTACATTGCTAAGGAGGCTAAAAGGT
TGGGGTTTAATGCTGTATGGGTGAGTTCAATCCCAGAGTGGGTTGTTTCGTTGGTCGAGTATGACCGTTCTATCTCTGCCCTCGGTGGCGTAATAATATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCCCCAAGTGGAGCTAAGTTCAAAAGGAAAAAGACAAGAAGCTTCAAATTTGAAGAGTTGTGGACTAGGCATGAGGAATGTTCCGCTATCATCGCAAATAATGGCAA
TTGGAAAGGTACTGACCTATCATTTTCTACTTTGTCTAATGACCTGTCTACCTGTTCTATAGCCTTACAAAATTGGGGTAGGAATATTAACAAAACAAGGAAAAATAGAA
TAAAAGAGTGTAAGGACTTGCTTGTTACAGAATATAAAAAAGCCCCTAATGTAAACTTTACCAATATCCATGCCATTGAATTTGAACTAGATAATTTGCTTTTAGAAGAG
GAAATTTATTGGCGGCAAAGGTCTAGGGAAGATTGGCTTAGATGGGGTGATAGGAATTCCAAGTGGTTCCATAGGAAAGCTTCAGCTAGGAAGAAAACAAATGAAATCCT
AGGCATTATGGATGAAAATGGGTTATGGACAGATGATTCTCAAGCTATAAAGGACAGATTTATTTGGGAAGATATTCCAGCAAATGTGCACCTGAATGAGGATTTCAACA
ATTGCCTAATCGATAGATGGATGAGAATTCTTGAGGATCCTAGTCCAACTAACCTAGATCTGGTGATAACAACATGCTGGTCAATTTGGAGCGACAGAAACAAAATCATT
CATGGAGAATCTATTCCCCCAGCAAATGTTCGTAAGAATTGGATATTAAATTACCTGACAGATTTTCGAAGAACAAATGTGCCGGAGGTGTTGAAGAAGAATCCTAGTCC
TTCTACGGATCCGTCATCTACGCATCTAAATATCCCCTCTCCAGAATCTGATGCCTTGAGAATTTTCGTTGATGCGGCATGGACTTCCAATCCCCCTGAAACTGGTATTG
GAGTGGTTTGCTTCAATCAGAGAGGTAGTTTGGAAGGAGCAGCGGCTTCATCTTTTGATTTGGATTTTAAAGCTCCCCTTGCTGAGCTGAAAGCCATAAACGAGGGCCTA
AAGCTGGCAAAAGCTCTTGCATGTTCAAAGGTAGTGATTAAGTCAGACTGTCTGCAAGCAATTAAATTTGTGACAAAAGCCTCTGAAACTTGGAGTGATGTGGAAGTGCT
TGTTAAGGAGATTTGGGACAACGCAGAATCATTTGAGATGTGTACTTTTGATTATATCCCCAGAGTGCACAATAGTTTAGCTGATTACATTGCTAAGGAGGCTAAAAGGT
TGGGGTTTAATGCTGTATGGGTGAGTTCAATCCCAGAGTGGGTTGTTTCGTTGGTCGAGTATGACCGTTCTATCTCTGCCCTCGGTGGCGTAATAATATGA
Protein sequenceShow/hide protein sequence
MSPSGAKFKRKKTRSFKFEELWTRHEECSAIIANNGNWKGTDLSFSTLSNDLSTCSIALQNWGRNINKTRKNRIKECKDLLVTEYKKAPNVNFTNIHAIEFELDNLLLEE
EIYWRQRSREDWLRWGDRNSKWFHRKASARKKTNEILGIMDENGLWTDDSQAIKDRFIWEDIPANVHLNEDFNNCLIDRWMRILEDPSPTNLDLVITTCWSIWSDRNKII
HGESIPPANVRKNWILNYLTDFRRTNVPEVLKKNPSPSTDPSSTHLNIPSPESDALRIFVDAAWTSNPPETGIGVVCFNQRGSLEGAAASSFDLDFKAPLAELKAINEGL
KLAKALACSKVVIKSDCLQAIKFVTKASETWSDVEVLVKEIWDNAESFEMCTFDYIPRVHNSLADYIAKEAKRLGFNAVWVSSIPEWVVSLVEYDRSISALGGVII