CuGenDBv2

Spg006936 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg006936
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold10:38713080..38714035
RNA-Seq Expression: Spg006936
Synteny: Spg006936
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
    IPR002156 - Ribonuclease H domain
    IPR012337 - Ribonuclease H-like superfamily
    IPR026960 - Reverse transcriptase zinc-binding domain
    IPR036397 - Ribonuclease H superfamily
    IPR044730 - Ribonuclease H-like domain, plant type
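
The genome location in this record is written as `scaffold:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention), the string can be split into its parts and the genomic span computed as follows. The names `Location` and `parse_location` are hypothetical helpers introduced for illustration, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Inclusive coordinates, so both end points count.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("scaffold10:38713080..38714035")
print(loc.seqid, loc.start, loc.end, loc.span)  # scaffold10 38713080 38714035 956
```

Under the inclusive-coordinate assumption the gene spans 956 bp on scaffold10.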


Homology
GenBank top hits (columns: hit, e-value, % identity, alignment)
CAB4273082.1 unnamed protein product [Prunus armeniaca] | e-value: 4.1e-20 | % identity: 26.07
Query:  MDVIPRAKISVWKLLRNIAPTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCRSERDAIGVWRRIASQSNDRD-----
        + V P+ +  +WK  RN+   K NL+ +GID    C LC    ES  H+ +  +F++ +W     +L P  +      D I  W  +  +    +     
Subjt:  MDVIPRAKISVWKLLRNIAPTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCRSERDAIGVWRRIASQSNDRD-----

Query:  LKIAAIIIWKLWNYRNSVLFRGRIVDNR---RCIADM--EASIAEM----------------------------SALKLNANASWSSLQGIGGVGWSIRD
        L++ A  +W++W  RN  +F G +++      C  D   E   A+M                              LK+N +A+W S   +GGVGW +RD
Subjt:  LKIAAIIIWKLWNYRNSVLFRGRIVDNR---RCIADM--EASIAEM----------------------------SALKLNANASWSSLQGIGGVGWSIRD

Query:  SEGSLLLARLKSFRKSWLILHFEATTIKEGLSVFRKLFPNSSVPLFVESDSTSLIKLLNHQEPDLSEANLVAGEIVDLVRYGMDEVTFNWFPRSCNRAAY
        S+G ++ A  K   +    L  EA  I+E L        +    L VESDS  +I+++  +       + +  +I  LV        F + PR+CN+AA+
Subjt:  SEGSLLLARLKSFRKSWLILHFEATTIKEGLSVFRKLFPNSSVPLFVESDSTSLIKLLNHQEPDLSEANLVAGEIVDLVRYGMDEVTFNWFPRSCNRAAY

Query:  ALA
         +A
Subjt:  ALA

CAB4309819.1 unnamed protein product [Prunus armeniaca] | e-value: 1.6e-16 | % identity: 28.45
Query:  VIPRAKISVWKLLRNIAPTKLNLISKGIDTN-PVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCRSERDAIGVWRRIASQSNDRDLKIAAI
        V P+ K+ VW++L NI PT+ +L+SKG+  +   CVLC +R ES SH++ D  F+  +W       +P     R+  D       +    + + +++  +
Subjt:  VIPRAKISVWKLLRNIAPTKLNLISKGIDTN-PVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCRSERDAIGVWRRIASQSNDRDLKIAAI

Query:  IIWKLWNYRNSVLFRGRIVDNRRCIADMEASIAEMSALKLNANASWSSLQGIGGVGWSIRDSEGSLLLARLKSFRKSWLILHFEATTIKEGLSVFRKLFP
        ++W LWN RNSV  R  I D +            + A+++N + +++   GIGG G   R++EG  + A    F       H EA  ++E L   R L  
Subjt:  IIWKLWNYRNSVLFRGRIVDNRRCIADMEASIAEMSALKLNANASWSSLQGIGGVGWSIRDSEGSLLLARLKSFRKSWLILHFEATTIKEGLSVFRKLFP

Query:  NSSVPLFVESDSTSLIKLLNHQEPDLSEANLV
            P  +E D+  +++ +  +  D S  NL+
Subjt:  NSSVPLFVESDSTSLIKLLNHQEPDLSEANLV

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis] | e-value: 2.2e-21 | % identity: 26.49
Query:  MDVIPRAKISVWKLLRNIAPTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCRSE--RDAIGVWRRIASQSNDRDLKI
        +D+  + KI +W+ L+NI PT  NL  +     P+C  C+ + E+ SH++ + K ++ IW      L PL++    +  +D     + + S+S+  + ++
Subjt:  MDVIPRAKISVWKLLRNIAPTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCRSE--RDAIGVWRRIASQSNDRDLKI

Query:  AAIIIWKLWNYRNSVLFRGRIVDNRRCIADMEASI---------------------------AEMSALKLNANASWSSLQGIGGVGWSIRDSEGSLLLAR
          +  W +W+ RN  +F G+  D+R   A  ++ +                              + LKLN +A+ S+     G+G  +RD+EG +L   
Subjt:  AAIIIWKLWNYRNSVLFRGRIVDNRRCIADMEASI---------------------------AEMSALKLNANASWSSLQGIGGVGWSIRDSEGSLLLAR

Query:  LKSFRKSWLILHFEATTIKEGLSVFRKLFPNSSVPLFVESDSTSLIKLLNHQEPDLSEANLVAGEIVDLVRYGMDEVTFNWFPRSCNRAAYALAKEAVES
        +K  +    +   EA  I  GL V  ++   SS  L VESD   +++LLN+ +   +E + +  + V        +V F++ PR+CN  A+ALAK A+ +
Subjt:  LKSFRKSWLILHFEATTIKEGLSVFRKLFPNSSVPLFVESDSTSLIKLLNHQEPDLSEANLVAGEIVDLVRYGMDEVTFNWFPRSCNRAAYALAKEAVES

Query:  GS
         S
Subjt:  GS

XP_023904177.1 uncharacterized protein LOC112015942 [Quercus suber] | e-value: 1.6e-16 | % identity: 27.3
Query:  MDVIPRAKISVWKLLRNIAPTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCRSERDAIGVWRRIASQSNDRDLKIAA
        M+V  + KI  W+  + I PT+LNL  + I     C +C    E+  H+IW+   ++ +W+    KL    L    + D + ++  +  +    D+ +  
Subjt:  MDVIPRAKISVWKLLRNIAPTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCRSERDAIGVWRRIASQSNDRDLKIAA

Query:  IIIWKLWNYRNSVLFRGRIVD----NRRCIADMEASIAEMSAL---------------------KLNANASWSSLQGIGGVGWSIRDSEGSLLLARLKSF
        +  W LW+ RNSVL  G++ +    N+R +  +E        L                     K+N +A+  +  G  G G  IR+  G  ++A L + 
Subjt:  IIIWKLWNYRNSVLFRGRIVD----NRRCIADMEASIAEMSAL---------------------KLNANASWSSLQGIGGVGWSIRDSEGSLLLARLKSF

Query:  RKSWLILHFEATTIKEGLSVFRKLFPNSSVPLFVESDSTSLIKLLNHQEPDLSEANLVAGEIVDLVRYGMDEVTFNWFPRSCNRAAYALAKEA
          S +    EA T+    ++   +    +  L +E DS ++IK L+   PDLS    V  +I  L+  G+  V+F+W  R CNR A+ALAK A
Subjt:  RKSWLILHFEATTIKEGLSVFRKLFPNSSVPLFVESDSTSLIKLLNHQEPDLSEANLVAGEIVDLVRYGMDEVTFNWFPRSCNRAAYALAKEA

XP_024043202.1 uncharacterized protein LOC112099905 [Citrus clementina] | e-value: 1.9e-17 | % identity: 26.32
Query:  RAKISVWKLLRNIAPTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCR--SERDAIGVWRRIASQSNDRDLKIAAIII
        + KI +W+ ++N+ PT  NL  + I     C  C +R E+  H +   K +K +W     +L PL  + +     D +G   R+  + +  D+++   I+
Subjt:  RAKISVWKLLRNIAPTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCR--SERDAIGVWRRIASQSNDRDLKIAAIII

Query:  WKLWNYRNSVLFRGRIVDNRRCIADMEASIAEMSALKLNANASWSSLQGIGGVGWSIRDSEGSLLLARLKSFRKSWLILHFEATTIKEGLSVFRKLFPNS
        W +W  RN ++F    +D R  +A  EA        +     +   LQ +  +G  IRDS G  + A +K+ + S  +   EA + + G+ + +++   S
Subjt:  WKLWNYRNSVLFRGRIVDNRRCIADMEASIAEMSALKLNANASWSSLQGIGGVGWSIRDSEGSLLLARLKSFRKSWLILHFEATTIKEGLSVFRKLFPNS

Query:  SVPLFVESDSTSLIKLLNHQEPDLSEANLVAGEIVDLVRYGMDEVTFNWFPRSCNRAAYALAKEAV
           + +E+DS  +  L+N +E +L+E      EI  L +         + PR+CN +A+ LAK A+
Subjt:  SVPLFVESDSTSLIKLLNHQEPDLSEANLVAGEIVDLVRYGMDEVTFNWFPRSCNRAAYALAKEAV

TrEMBL top hits (columns: hit, e-value, % identity, alignment)
A0A2N9GC52 Uncharacterized protein | e-value: 8.4e-19 | % identity: 28.87
Query:  RAKISVWKLLRNIAPTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCRSERDAIGVWRRIASQSNDRDLKIAAIIIWK
        + K+ +WK  +NI PTKLNL  K    +  C LC +  ES  H++ D KF++ +W   L  L        S   A  V   I S ++  D+++   I WK
Subjt:  RAKISVWKLLRNIAPTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCRSERDAIGVWRRIASQSNDRDLKIAAIIIWK

Query:  LWNYRNSVLFRGRIVDNRRCIADMEASIAEM-----------------------SALKLNANASWSSLQGIGGVGWSIRDSEGSLLLARLKSFRKSWLIL
        LW  RN  ++  + V  R  I    + +++                        S  K+N    W SL   GG G  IRDS GS++ A   +  +    L
Subjt:  LWNYRNSVLFRGRIVDNRRCIADMEASIAEM-----------------------SALKLNANASWSSLQGIGGVGWSIRDSEGSLLLARLKSFRKSWLIL

Query:  HFEATTIKEGLSVFRKLFPNSSVPLFVESDSTSLIKLLNHQEPDLSEANLVAGEIVDLVRYGMDEVTFNWFPRSCNRAAYALAKEAVESGS
           A  I++ L + +++  +S   L VE D ++L+  L    P L+    +  EI +L    +  + FN  PRSCN  +++LAKE+    S
Subjt:  HFEATTIKEGLSVFRKLFPNSSVPLFVESDSTSLIKLLNHQEPDLSEANLVAGEIVDLVRYGMDEVTFNWFPRSCNRAAYALAKEAVESGS

A0A2N9HW96 Reverse transcriptase domain-containing protein | e-value: 1.4e-18 | % identity: 28.87
Query:  RAKISVWKLLRNIAPTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCRSERDAIGVWRRIASQSNDRDLKIAAIIIWK
        + K+ +WK  +NI PTKLNL  K    +  C LC +  ES  H++ D KF++ +W   L  L        S   A  V   I S ++  D+++   I WK
Subjt:  RAKISVWKLLRNIAPTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCRSERDAIGVWRRIASQSNDRDLKIAAIIIWK

Query:  LWNYRNSVLFRGRIVDNRRCIADMEASIAEM-----------------------SALKLNANASWSSLQGIGGVGWSIRDSEGSLLLARLKSFRKSWLIL
        LW  RN  ++  + V  R  I    + +++                        S  K+N    W SL   GG G  IRDS GS++ A   +  +    L
Subjt:  LWNYRNSVLFRGRIVDNRRCIADMEASIAEM-----------------------SALKLNANASWSSLQGIGGVGWSIRDSEGSLLLARLKSFRKSWLIL

Query:  HFEATTIKEGLSVFRKLFPNSSVPLFVESDSTSLIKLLNHQEPDLSEANLVAGEIVDLVRYGMDEVTFNWFPRSCNRAAYALAKEAVESGS
           A  I++ L + +++  +S   L VE D ++L+  L    P L+    +  EI +L    +  + FN  PRSCN  +++LAKE+    S
Subjt:  HFEATTIKEGLSVFRKLFPNSSVPLFVESDSTSLIKLLNHQEPDLSEANLVAGEIVDLVRYGMDEVTFNWFPRSCNRAAYALAKEAVESGS

A0A2N9I4S0 Uncharacterized protein | e-value: 8.4e-19 | % identity: 28.87
Query:  RAKISVWKLLRNIAPTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCRSERDAIGVWRRIASQSNDRDLKIAAIIIWK
        + K+ +WK  +NI PTKLNL  K    +  C LC +  ES  H++ D KF++ +W   L  L        S   A  V   I S ++  D+++   I WK
Subjt:  RAKISVWKLLRNIAPTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCRSERDAIGVWRRIASQSNDRDLKIAAIIIWK

Query:  LWNYRNSVLFRGRIVDNRRCIADMEASIAEM-----------------------SALKLNANASWSSLQGIGGVGWSIRDSEGSLLLARLKSFRKSWLIL
        LW  RN  ++  + V  R  I    + +++                        S  K+N    W SL   GG G  IRDS GS++ A   +  +    L
Subjt:  LWNYRNSVLFRGRIVDNRRCIADMEASIAEM-----------------------SALKLNANASWSSLQGIGGVGWSIRDSEGSLLLARLKSFRKSWLIL

Query:  HFEATTIKEGLSVFRKLFPNSSVPLFVESDSTSLIKLLNHQEPDLSEANLVAGEIVDLVRYGMDEVTFNWFPRSCNRAAYALAKEAVESGS
           A  I++ L + +++  +S   L VE D ++L+  L    P L+    +  EI +L    +  + FN  PRSCN  +++LAKE+    S
Subjt:  HFEATTIKEGLSVFRKLFPNSSVPLFVESDSTSLIKLLNHQEPDLSEANLVAGEIVDLVRYGMDEVTFNWFPRSCNRAAYALAKEAVESGS

A0A6J5UAY2 Reverse transcriptase domain-containing protein | e-value: 2.0e-20 | % identity: 26.07
Query:  MDVIPRAKISVWKLLRNIAPTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCRSERDAIGVWRRIASQSNDRD-----
        + V P+ +  +WK  RN+   K NL+ +GID    C LC    ES  H+ +  +F++ +W     +L P  +      D I  W  +  +    +     
Subjt:  MDVIPRAKISVWKLLRNIAPTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCRSERDAIGVWRRIASQSNDRD-----

Query:  LKIAAIIIWKLWNYRNSVLFRGRIVDNR---RCIADM--EASIAEM----------------------------SALKLNANASWSSLQGIGGVGWSIRD
        L++ A  +W++W  RN  +F G +++      C  D   E   A+M                              LK+N +A+W S   +GGVGW +RD
Subjt:  LKIAAIIIWKLWNYRNSVLFRGRIVDNR---RCIADM--EASIAEM----------------------------SALKLNANASWSSLQGIGGVGWSIRD

Query:  SEGSLLLARLKSFRKSWLILHFEATTIKEGLSVFRKLFPNSSVPLFVESDSTSLIKLLNHQEPDLSEANLVAGEIVDLVRYGMDEVTFNWFPRSCNRAAY
        S+G ++ A  K   +    L  EA  I+E L        +    L VESDS  +I+++  +       + +  +I  LV        F + PR+CN+AA+
Subjt:  SEGSLLLARLKSFRKSWLILHFEATTIKEGLSVFRKLFPNSSVPLFVESDSTSLIKLLNHQEPDLSEANLVAGEIVDLVRYGMDEVTFNWFPRSCNRAAY

Query:  ALA
         +A
Subjt:  ALA

A0A6J5WPU6 Reverse transcriptase domain-containing protein | e-value: 8.6e-16 | % identity: 25.17
Query:  MDVIPRAKISVWKLLRNIAPTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCRSERDAIGVWRRIASQSNDRDLKIAA
        +D  P+ K  +W+   NI   + NL  + +     C  C  + E+ +H+ ++  F++A W       + L ++  +  D I  W+ + +  N  +    A
Subjt:  MDVIPRAKISVWKLLRNIAPTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCRSERDAIGVWRRIASQSNDRDLKIAA

Query:  II-----IWKLWNYRNSVLFRGRIVDNRRCIADMEASIAEM----------------------------SALKLNANASWSSLQGIGGVGWSIRDSEGSL
        I      +W++W  RN  +F G   D    +  +   ++E                               LK+N +A+WS+ +  GGVGW IRDS G L
Subjt:  II-----IWKLWNYRNSVLFRGRIVDNRRCIADMEASIAEM----------------------------SALKLNANASWSSLQGIGGVGWSIRDSEGSL

Query:  LLARLKSFRKSWLILHFEATTIKEGLSVFRKLFPNSSVPLFVESDSTSLIKLLNHQEPDLSEANLVAGEIVDLVRYGMDEVTFNWFPRSCNRAAYALA
        L A  +        +  E   I+  LS            + VESDS   I +LN +    S+   +  +I  LV   +  V+F + PRSCN+AA+++A
Subjt:  LLARLKSFRKSWLILHFEATTIKEGLSVFRKLFPNSSVPLFVESDSTSLIKLLNHQEPDLSEANLVAGEIVDLVRYGMDEVTFNWFPRSCNRAAYALA

SwissProt top hits (columns: hit, e-value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: hit, e-value, % identity, alignment)
AT2G02520.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e-value: 2.6e-04 | % identity: 28.12
Query:  IPRAKISVWKLLRNIAPTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKL
        IP+     W  +R+   TK  +IS G    P+C+ C +  E+  H+ +D +F++ +W  F  ++
Subjt:  IPRAKISVWKLLRNIAPTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKL

AT4G10613.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e-value: 8.0e-06 | % identity: 25.77
Query:  PTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCRSERDAIGVWRRIASQSNDRDLK--IAAIIIWKLWNYRNSVL
        PT+  L+S G+  +P+C LC    E+  H+I    FS +IW+    +L    +  R     +  W ++++ S+   ++  +A   +  +W  RN++L
Subjt:  PTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCRSERDAIGVWRRIASQSNDRDLK--IAAIIIWKLWNYRNSVL

AT4G29090.1 Ribonuclease H-like superfamily protein | e-value: 1.5e-12 | % identity: 23.43
Query:  PRAKISVWKLLRNIAPTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCRSERDAIGV---WRRIASQSNDRDLKIAAI
        P+ +  +WK L N  P    L  + +     C+ C S  E+ +H+++   F++  W+      IP+ L      D+I V   W       N +  K + +
Subjt:  PRAKISVWKLLRNIAPTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCRSERDAIGV---WRRIASQSNDRDLKIAAI

Query:  I---IWKLWNYRNSVLFRGRIVDNRRCIADMEASIAEMSA----------------------------LKLNANASWSSLQGIGGVGWSIRDSEGSLLLA
        +   +W+LW  RN ++FRGR  + +  +   E  + E                               +K N +A+W+      G+GW +R+ +G +   
Subjt:  I---IWKLWNYRNSVLFRGRIVDNRRCIADMEASIAEMSA----------------------------LKLNANASWSSLQGIGGVGWSIRDSEGSLLLA

Query:  RLKSFRKSWLILHFEATTIKEGLSVFRKLFPNSSVPLFVESDSTSLIKLLNHQE--PDLSEANLVAGEIVDLVRY--GMDEVTFNWFPRSCNRAAYALAK
          ++  K   +L  E   ++  +    +   N    +  ESDS  LI++LN+ E  P L         I DL R      EV F + PR  N  A  +A+
Subjt:  RLKSFRKSWLILHFEATTIKEGLSVFRKLFPNSSVPLFVESDSTSLIKLLNHQE--PDLSEANLVAGEIVDLVRY--GMDEVTFNWFPRSCNRAAYALAK

Query:  EAV
        E++
Subjt:  EAV


Sequences
CDS sequence
ATGGATGTTATCCCTAGAGCAAAAATTAGTGTGTGGAAGCTCCTCAGAAACATAGCCCCCACCAAGCTAAATCTTATCTCTAAAGGAATTGACACTAACCCCGTTTGCGT
TTTGTGCAGGTCAAGGGCAGAATCTAATTCTCATATGATCTGGGACCATAAGTTTTCTAAAGCTATATGGTCTAAGTTCCTACGAAAGCTTATTCCACTATTGTTAAATT
GCAGGTCAGAAAGAGATGCGATTGGTGTCTGGAGGAGGATTGCTTCCCAAAGCAATGACAGAGACCTTAAGATTGCGGCTATAATTATATGGAAGCTATGGAACTATCGC
AACTCAGTTTTATTCAGAGGCAGAATAGTGGACAATAGAAGATGCATTGCAGATATGGAAGCTAGTATTGCTGAAATGTCAGCGCTGAAGTTAAATGCAAATGCTTCTTG
GAGCAGCTTGCAGGGCATTGGCGGCGTGGGATGGTCAATCCGAGACTCTGAGGGATCTTTGCTTCTCGCAAGGCTCAAGTCCTTCAGGAAGAGCTGGTTGATCCTCCACT
TCGAAGCCACGACAATCAAAGAGGGATTGAGCGTGTTCCGGAAACTTTTCCCCAATTCCTCTGTTCCCCTGTTTGTCGAATCTGACTCAACTTCGTTGATCAAGCTGTTA
AATCATCAAGAACCAGATCTCTCGGAAGCAAACCTGGTTGCGGGCGAGATTGTTGATCTTGTGAGATATGGGATGGATGAAGTAACGTTCAATTGGTTCCCTAGATCATG
CAATAGGGCAGCCTATGCGTTGGCGAAGGAAGCCGTCGAATCCGGCTCGATGATCTCTCAATCGTCTGGATTTGTTTTTATTGCTTTGGATTTTTGTTCGTCTCAATAG
mRNA sequence
ATGGATGTTATCCCTAGAGCAAAAATTAGTGTGTGGAAGCTCCTCAGAAACATAGCCCCCACCAAGCTAAATCTTATCTCTAAAGGAATTGACACTAACCCCGTTTGCGT
TTTGTGCAGGTCAAGGGCAGAATCTAATTCTCATATGATCTGGGACCATAAGTTTTCTAAAGCTATATGGTCTAAGTTCCTACGAAAGCTTATTCCACTATTGTTAAATT
GCAGGTCAGAAAGAGATGCGATTGGTGTCTGGAGGAGGATTGCTTCCCAAAGCAATGACAGAGACCTTAAGATTGCGGCTATAATTATATGGAAGCTATGGAACTATCGC
AACTCAGTTTTATTCAGAGGCAGAATAGTGGACAATAGAAGATGCATTGCAGATATGGAAGCTAGTATTGCTGAAATGTCAGCGCTGAAGTTAAATGCAAATGCTTCTTG
GAGCAGCTTGCAGGGCATTGGCGGCGTGGGATGGTCAATCCGAGACTCTGAGGGATCTTTGCTTCTCGCAAGGCTCAAGTCCTTCAGGAAGAGCTGGTTGATCCTCCACT
TCGAAGCCACGACAATCAAAGAGGGATTGAGCGTGTTCCGGAAACTTTTCCCCAATTCCTCTGTTCCCCTGTTTGTCGAATCTGACTCAACTTCGTTGATCAAGCTGTTA
AATCATCAAGAACCAGATCTCTCGGAAGCAAACCTGGTTGCGGGCGAGATTGTTGATCTTGTGAGATATGGGATGGATGAAGTAACGTTCAATTGGTTCCCTAGATCATG
CAATAGGGCAGCCTATGCGTTGGCGAAGGAAGCCGTCGAATCCGGCTCGATGATCTCTCAATCGTCTGGATTTGTTTTTATTGCTTTGGATTTTTGTTCGTCTCAATAG
Protein sequence
MDVIPRAKISVWKLLRNIAPTKLNLISKGIDTNPVCVLCRSRAESNSHMIWDHKFSKAIWSKFLRKLIPLLLNCRSERDAIGVWRRIASQSNDRDLKIAAIIIWKLWNYR
NSVLFRGRIVDNRRCIADMEASIAEMSALKLNANASWSSLQGIGGVGWSIRDSEGSLLLARLKSFRKSWLILHFEATTIKEGLSVFRKLFPNSSVPLFVESDSTSLIKLL
NHQEPDLSEANLVAGEIVDLVRYGMDEVTFNWFPRSCNRAAYALAKEAVESGSMISQSSGFVFIALDFCSSQ
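
The protein sequence above should be the conceptual translation of the CDS. The sketch below shows one way to check that relationship, assuming the standard genetic code (NCBI translation table 1), which this record does not state explicitly. The function name `translate` is a hypothetical helper, and only the first 24 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS sequence listed in this record.
print(translate("ATGGATGTTATCCCTAGAGCAAAA"))  # MDVIPRAK
```

Passing the full CDS, with its lines joined, to the same function gives a string that can be compared against the protein sequence in this record.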