; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0007608 (gene) of Chayote v1 genome

Gene IDSed0007608
OrganismSechium edule (Chayote v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationLG07:5456411..5461217
RNA-Seq ExpressionSed0007608
SyntenySed0007608
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PPD83812.1 hypothetical protein GOBAR_DD19246 [Gossypium barbadense]3.9e-3129.67Show/hide
Query:  LGKKIGKVEDIDWDGENDWLGPIFRIRVLLDLTEPLQR--------GLKVIWCPIFYEKLPDICFSCGVFGHSSRECSKVASNCTEG----SNLAYGDWL
        +G  IG++  IDW           R++V + +++PL+R        G++ I   + YE+LPD C  CG+ GHS + C+K   N  EG    +NL +G+W+
Subjt:  LGKKIGKVEDIDWDGENDWLGPIFRIRVLLDLTEPLQR--------GLKVIWCPIFYEKLPDICFSCGVFGHSSRECSKVASNCTEG----SNLAYGDWL

Query:  RAPYVKKSGQPTKEEGG--------SIDSMGASGGLGLFWMKEVDMSICSYSRNHVDCQLNYDN-RCWRFSRIYGCLEAHNKHITCELIQRLHNNDDSPW
        RAP   ++        G        ++ S G SGGL + W ++VD++I +YS+ H+D  +  DN    RF+  YG    + +H   ++++++ +  +  W
Subjt:  RAPYVKKSGQPTKEEGG--------SIDSMGASGGLGLFWMKEVDMSICSYSRNHVDCQLNYDN-RCWRFSRIYGCLEAHNKHITCELIQRLHNNDDSPW

Query:  IVGGNLNEAIWSKEKRGGLSLAARSMELLRTTVDDVFLKDLGFSGDIFTWSKKWCRGPTVWKRLDRFLGNQAFCEYFPGCRVTHLDWFGSDHRPICLNLY
        IVGG+ N  + + EK  G       M+     +D + L D+      FTW+        + +RLDRFL +    E  P    + +    SDH  I L+LY
Subjt:  IVGGNLNEAIWSKEKRGGLSLAARSMELLRTTVDDVFLKDLGFSGDIFTWSKKWCRGPTVWKRLDRFLGNQAFCEYFPGCRVTHLDWFGSDHRPICLNLY

TXG57064.1 hypothetical protein EZV62_018377 [Acer yangbiense]6.1e-3241.24Show/hide
Query:  EEGGSIDSMGASGGLGLFWMKEVDMSICSYSRNHVDCQLNYDNR-CWRFSRIYGCLEAHNKHITCELIQRLHNNDDSPWIVGGNLNEAIWSKEKRGGLSL
        E G S+D  G SGGL L W  +  +S+ S+S+ H+D ++  D    WRFS  YGCL   NK  + EL++RL + DD  W+ GG+ NE +  KEK GG + 
Subjt:  EEGGSIDSMGASGGLGLFWMKEVDMSICSYSRNHVDCQLNYDNR-CWRFSRIYGCLEAHNKHITCELIQRLHNNDDSPWIVGGNLNEAIWSKEKRGGLSL

Query:  AARSMELLRTTVDDVFLKDLGFSGDIFTWSKKWCRGPTVWKRLDRFLGNQAFCEYFPGCRVTHLDWFGSDHRPICLN
        +   +   R  +DD  L DLGF G   TW+ +      V +R+DR L + A+ + FPG RV HL +  SDHRP+ L+
Subjt:  AARSMELLRTTVDDVFLKDLGFSGDIFTWSKKWCRGPTVWKRLDRFLGNQAFCEYFPGCRVTHLDWFGSDHRPICLN

TXG69190.1 hypothetical protein EZV62_004125 [Acer yangbiense]6.5e-3428.12Show/hide
Query:  IDSMGASGGLGLFWMKEVDMSICSYSRNHVDCQLNYDN-RCWRFSRIYGCLEAHNKHITCELIQRLHNNDDSPWIVGGNLNEAIWSKEKRGGLSLAARSM
        +D  G SGGL L W  E+D+++ SYSR H+D  +   N + WR +  YG      +     L++RL      PW VGG+ NE +   EK GG +     M
Subjt:  IDSMGASGGLGLFWMKEVDMSICSYSRNHVDCQLNYDN-RCWRFSRIYGCLEAHNKHITCELIQRLHNNDDSPWIVGGNLNEAIWSKEKRGGLSLAARSM

Query:  ELLRTTVDDVFLKDLGFSGDIFTWSKKWCRGPTVWKRLDRFLGNQAFCEYFPGCRVTHLDWFGSDHRPICLNLYA-----------------VASDDYTD
           +  ++D  L+DLGF G  FTWS +      + +RLDR +GN  + + F    + HLD++ SDHRPI L +                      DD+  
Subjt:  ELLRTTVDDVFLKDLGFSGDIFTWSKKWCRGPTVWKRLDRFLGNQAFCEYFPGCRVTHLDWFGSDHRPICLNLYA-----------------VASDDYTD

Query:  GDWDQELLSAFL----WRKDV---ECICSIPICKSYEEDKWIWHYSKD--GDFSVRSVVPPVNVRIEWIRQYLEDFFKSNDMESHKHAEYDSCSAVAPVC
         D+ Q     F+     + D+   E +C +     Y  ++ ++  S     DF V          ++W   +++DF  +  +++    +      VAP  
Subjt:  GDWDQELLSAFL----WRKDV---ECICSIPICKSYEEDKWIWHYSKD--GDFSVRSVVPPVNVRIEWIRQYLEDFFKSNDMESHKHAEYDSCSAVAPVC

Query:  WIAPWPGFLKLNVDASCSPLAPKLGFGLIFRDHMG-----LCKFAYSIFKPVFCDILSAEALALLEGLKVADNLGFHILIIESDSKTLVDAILGNRLSLS
        W     G  K+N DA+    A   G G++ RD  G     LC+    + +P      + EA+A+L G ++A   G     IESDS ++V+ I    +  +
Subjt:  WIAPWPGFLKLNVDASCSPLAPKLGFGLIFRDHMG-----LCKFAYSIFKPVFCDILSAEALALLEGLKVADNLGFHILIIESDSKTLVDAILGNRLSLS

Query:  PKGIILDEI
          G++L +I
Subjt:  PKGIILDEI

XP_010686122.1 PREDICTED: uncharacterized protein LOC104900404 [Beta vulgaris subsp. vulgaris]2.5e-3332.65Show/hide
Query:  RFLGKKIGKVEDIDWDGENDWLGPIFRIRVLLDLTEPLQRGLKV-------IWCPIFYEKLPDICFSCGVFGHSSRECSKVASNCTEGSN--LAYGDWLR
        R +G  IG V +++ DG   W     R+R+LLD+ +PL+R  ++       +   + YE+LP  C++CG+ GH  R+C     N  E  N    +G WLR
Subjt:  RFLGKKIGKVEDIDWDGENDWLGPIFRIRVLLDLTEPLQRGLKV-------IWCPIFYEKLPDICFSCGVFGHSSRECSKVASNCTEGSN--LAYGDWLR

Query:  APYVKKSGQPTKEEGGSIDSMGASGGLGLFWMKEVDMSICSYSRNHVDCQLNYDNRCWRFSRIYGCLEAHNKHITCELIQRLHNNDDSPWIVGGNLNEAI
        A   K      +   G +D +           + +D ++ S+S+NH+   +      WRF  +YG  E  NKH T ELI+ L    D P ++GG+ NE +
Subjt:  APYVKKSGQPTKEEGGSIDSMGASGGLGLFWMKEVDMSICSYSRNHVDCQLNYDNRCWRFSRIYGCLEAHNKHITCELIQRLHNNDDSPWIVGGNLNEAI

Query:  WSKEKRGGLSLAARSMELLRTTVDDVFLKDLGFSGDIFTWSKKWCRGPTVWKRLDRFLGNQAFCEYFPGCRVTHLDWFGSDHRPICLNLYA
           EK+GG     R+M   R  +D   L+DL   G  +TW +       + +RLDRFL +Q + + FP   V HL  + SDH  I L   A
Subjt:  WSKEKRGGLSLAARSMELLRTTVDDVFLKDLGFSGDIFTWSKKWCRGPTVWKRLDRFLGNQAFCEYFPGCRVTHLDWFGSDHRPICLNLYA

XP_023871634.1 uncharacterized protein LOC111984238 [Quercus suber]7.9e-3235.48Show/hide
Query:  GASGGLGLFWMKEVDMSICSYSRNHVDCQLNYDNRC-WRFSRIYGCLEAHNKHITCELIQRLHNNDDSPWIVGGNLNEAIWSKEKRGGLSLAARSMELLR
        G  GG+ +FW K+VD S+ SYS NH+D  +N      WRF+  YG  E  N HI+   ++RL      PWI  G+ NE I + EK GG    +R ME  R
Subjt:  GASGGLGLFWMKEVDMSICSYSRNHVDCQLNYDNRC-WRFSRIYGCLEAHNKHITCELIQRLHNNDDSPWIVGGNLNEAIWSKEKRGGLSLAARSMELLR

Query:  TTVDDVFLKDLGFSGDIFTWSKKWCRGPTVWKRLDRFLGNQAFCEYFPGCRVTHLDWFGSDHRPICLNLYAVASDDYTDGDWDQELLSAFLWRKDVECIC
          +D+   +DLG++G+ FTW      G TVW+R+DR +G   +   FP  +V HL+   SDH+PI ++L  +         ++Q      +W +D     
Subjt:  TTVDDVFLKDLGFSGDIFTWSKKWCRGPTVWKRLDRFLGNQAFCEYFPGCRVTHLDWFGSDHRPICLNLYAVASDDYTDGDWDQELLSAFLWRKDVECIC

Query:  SIPICKSYEEDKWIWHY
            C+   ED W   Y
Subjt:  SIPICKSYEEDKWIWHY

TrEMBL top hitse value%identityAlignment
A0A2N9GPY1 Reverse transcriptase domain-containing protein1.8e-3429.28Show/hide
Query:  LGKKIGKVEDIDWDGENDWLGPIFRIRVLLDLTEPLQRGLKV-------IWCPIFYEKLPDICFSCGVFGHSSRECSKVASN--CTEGSNLAYGDWLRAP
        +G  +G +  +     N   G   R+RV LD+T+PL RG KV        W    YE+LP+ C+ CG   HS ++C     N          +G WLRAP
Subjt:  LGKKIGKVEDIDWDGENDWLGPIFRIRVLLDLTEPLQRGLKV-------IWCPIFYEKLPDICFSCGVFGHSSRECSKVASN--CTEGSNLAYGDWLRAP

Query:  ------------------------YVKKSGQP----------------------TKEEGGSID---------------SMGASGGLGLFWMKEVDMSICS
                                  +  G P                      T ++ G ++               S    GGL LFW KE+ + + S
Subjt:  ------------------------YVKKSGQP----------------------TKEEGGSID---------------SMGASGGLGLFWMKEVDMSICS

Query:  YSRNHVDCQLNYDNR-CWRFSRIYGCLEAHNKHITCELIQRLHNNDDSPWIVGGNLNEAIWSKEKRGGLSLAARSMELLRTTVDDVFLKDLGFSGDIFTW
        +S +H+D  +N + +  WR +  YG  E  N+  +  L++RL +    PW   G+ NE +  +EK+G    + R M+L R  +DD    DLGF+G  FTW
Subjt:  YSRNHVDCQLNYDNR-CWRFSRIYGCLEAHNKHITCELIQRLHNNDDSPWIVGGNLNEAIWSKEKRGGLSLAARSMELLRTTVDDVFLKDLGFSGDIFTW

Query:  SKKWCRGPTVWKRLDRFLGNQAFCEYFPGCRVTHLDWFGSDHRPI
        +     G   W+RLDR +    +   FP  RV HLD   SDH+PI
Subjt:  SKKWCRGPTVWKRLDRFLGNQAFCEYFPGCRVTHLDWFGSDHRPI

A0A2N9HFT1 Uncharacterized protein9.1e-3426.76Show/hide
Query:  MARFLGKKIGKV-----EDIDWDGENDWLGPIFRIRVLLDLTEPLQRGLKVI-------WCPIFYEKLPDICFSCGVFGHSSRECSKVASNCTEGSN--L
        +A  +G  IG+V     E+ +  GEN       RI+V LD+T+PL RG +V        W    YE+LP+ C+ CG+  H  ++CS+       GS+   
Subjt:  MARFLGKKIGKV-----EDIDWDGENDWLGPIFRIRVLLDLTEPLQRGLKVI-------WCPIFYEKLPDICFSCGVFGHSSRECSKVASNCTEGSN--L

Query:  AYGDWLRAPYVKK------------------------------SGQPTKEEGGSIDS-------------------------------------------
         YG WLR    K                               +G P ++  G+ DS                                           
Subjt:  AYGDWLRAPYVKK------------------------------SGQPTKEEGGSIDS-------------------------------------------

Query:  ------------------------------------------MGASGGLGLFWMKEVDMSICSYSRNH-VDCQLNYDNRCWRFSRIYGCLEAHNKHITCE
                                                      GGL LFW + +D+ I SYS +H VD         WRF   YG  + H + ++  
Subjt:  ------------------------------------------MGASGGLGLFWMKEVDMSICSYSRNH-VDCQLNYDNRCWRFSRIYGCLEAHNKHITCE

Query:  LIQRLHNNDDSPWIVGGNLNEAIWSKEKRGGLSLAARSMELLRTTVDDVFLKDLGFSGDIFTWSKKWCRGPTVWKRLDRFLGNQAFCEYFPGCRVTHLDW
        L++ LH   D PW  GG+ NE +  +EK+G +S     M+  R  +DD    DLG+ G  FTW      G TVW++LDR + + A+   FP  RV HLD+
Subjt:  LIQRLHNNDDSPWIVGGNLNEAIWSKEKRGGLSLAARSMELLRTTVDDVFLKDLGFSGDIFTWSKKWCRGPTVWKRLDRFLGNQAFCEYFPGCRVTHLDW

Query:  FGSDHRPICLN
         GSDH+P+ L+
Subjt:  FGSDHRPICLN

A0A5C7IIT4 Uncharacterized protein3.1e-3428.12Show/hide
Query:  IDSMGASGGLGLFWMKEVDMSICSYSRNHVDCQLNYDN-RCWRFSRIYGCLEAHNKHITCELIQRLHNNDDSPWIVGGNLNEAIWSKEKRGGLSLAARSM
        +D  G SGGL L W  E+D+++ SYSR H+D  +   N + WR +  YG      +     L++RL      PW VGG+ NE +   EK GG +     M
Subjt:  IDSMGASGGLGLFWMKEVDMSICSYSRNHVDCQLNYDN-RCWRFSRIYGCLEAHNKHITCELIQRLHNNDDSPWIVGGNLNEAIWSKEKRGGLSLAARSM

Query:  ELLRTTVDDVFLKDLGFSGDIFTWSKKWCRGPTVWKRLDRFLGNQAFCEYFPGCRVTHLDWFGSDHRPICLNLYA-----------------VASDDYTD
           +  ++D  L+DLGF G  FTWS +      + +RLDR +GN  + + F    + HLD++ SDHRPI L +                      DD+  
Subjt:  ELLRTTVDDVFLKDLGFSGDIFTWSKKWCRGPTVWKRLDRFLGNQAFCEYFPGCRVTHLDWFGSDHRPICLNLYA-----------------VASDDYTD

Query:  GDWDQELLSAFL----WRKDV---ECICSIPICKSYEEDKWIWHYSKD--GDFSVRSVVPPVNVRIEWIRQYLEDFFKSNDMESHKHAEYDSCSAVAPVC
         D+ Q     F+     + D+   E +C +     Y  ++ ++  S     DF V          ++W   +++DF  +  +++    +      VAP  
Subjt:  GDWDQELLSAFL----WRKDV---ECICSIPICKSYEEDKWIWHYSKD--GDFSVRSVVPPVNVRIEWIRQYLEDFFKSNDMESHKHAEYDSCSAVAPVC

Query:  WIAPWPGFLKLNVDASCSPLAPKLGFGLIFRDHMG-----LCKFAYSIFKPVFCDILSAEALALLEGLKVADNLGFHILIIESDSKTLVDAILGNRLSLS
        W     G  K+N DA+    A   G G++ RD  G     LC+    + +P      + EA+A+L G ++A   G     IESDS ++V+ I    +  +
Subjt:  WIAPWPGFLKLNVDASCSPLAPKLGFGLIFRDHMG-----LCKFAYSIFKPVFCDILSAEALALLEGLKVADNLGFHILIIESDSKTLVDAILGNRLSLS

Query:  PKGIILDEI
          G++L +I
Subjt:  PKGIILDEI

A0A803MV91 Uncharacterized protein1.4e-3431.2Show/hide
Query:  LGKKIGKVEDIDWDGENDWLGPIFRIRVLLDLTEPLQRGLKVIW-------CPIFYEKLPDICFSCGVFGHSSRECSKVASNCTEGSNLA--YGDWLRA-
        +G+ +G   ++D      W G   RI+VL+D+ +PL+RGL +           I YE+L D CF CG   H+ REC +      +   +   YG WLRA 
Subjt:  LGKKIGKVEDIDWDGENDWLGPIFRIRVLLDLTEPLQRGLKVIW-------CPIFYEKLPDICFSCGVFGHSSRECSKVASNCTEGSNLA--YGDWLRA-

Query:  -----------------------------------PYVKKSGQ-----------PTKEEGGSIDSMG------------ASGGLGLFWMKEVDMSICSYS
                                           P V K GQ           P K+  GS   +G             SGGL L W    D+ I S+S
Subjt:  -----------------------------------PYVKKSGQ-----------PTKEEGGSIDSMG------------ASGGLGLFWMKEVDMSICSYS

Query:  RNHVDCQLNYDN-RCWRFSRIYGCLEAHNKHITCELIQRLHNNDDSPWIVGGNLNEAIWSKEKRGGLSLAARSMELLRTTVDDVFLKDLGFSGDIFTWSK
         NH+D  + + N   W+F+ IYG  +  NK  T  L+  LHNN   PWI GG+ N  + S EK+GG        E+LR  V     +D+G+ G  +TW+ 
Subjt:  RNHVDCQLNYDN-RCWRFSRIYGCLEAHNKHITCELIQRLHNNDDSPWIVGGNLNEAIWSKEKRGGLSLAARSMELLRTTVDDVFLKDLGFSGDIFTWSK

Query:  KWCRGPTVWKRLDRFLGNQAFCEYFPGCRVTHLDWFGSDHRPI
               V  RLDRF  N+ +C  F    V+HL    SDH P+
Subjt:  KWCRGPTVWKRLDRFLGNQAFCEYFPGCRVTHLDWFGSDHRPI

A0A803QD63 Uncharacterized protein1.1e-3432.11Show/hide
Query:  FRIRVLLDLTEPLQRGL------KVIWCPIFYEKLPDICFSCGVFGHSSRECSKVASNCTEGSN---LAYGDWLRAPYVKKSGQPTKEEGGS--IDSMGA
        FR RV + + +P+  G       K IW    YE+ P +CF CG  GHS ++C+K     T   N    AYG WL+A           +  G   +++ G 
Subjt:  FRIRVLLDLTEPLQRGL------KVIWCPIFYEKLPDICFSCGVFGHSSRECSKVASNCTEGSN---LAYGDWLRAPYVKKSGQPTKEEGGS--IDSMGA

Query:  SGGLGLFWMKEVDMSICSYSRNHVDCQLNY-DNRCWRFSRIYGCLEAHNKHITCELIQRLHNNDDSPWIVGGNLNEAIWSKEKRGGLSLAARSMELLRTT
        SG + LFW  +V+  + S+S  H+D  +   D + WRF+  YG  +   +  + +L++RL      PW VGGN NE +  +EK GG S  +  +   R  
Subjt:  SGGLGLFWMKEVDMSICSYSRNHVDCQLNY-DNRCWRFSRIYGCLEAHNKHITCELIQRLHNNDDSPWIVGGNLNEAIWSKEKRGGLSLAARSMELLRTT

Query:  VDDVFLKDLGFSGDIFTWSKKWCRG---PTVWKRLDRFLGNQAFCEYFPGCRVTHLDWFGSDHRPICLNLYAVASDDYTDGDWDQELLSAFLWRKDVEC
        +D   L+D+GF G  +T    WC G     +++RLD+  GN  + E F    V HLD   SDH P+ L     +S       W         W  D EC
Subjt:  VDDVFLKDLGFSGDIFTWSKKWCRG---PTVWKRLDRFLGNQAFCEYFPGCRVTHLDWFGSDHRPICLNLYAVASDDYTDGDWDQELLSAFLWRKDVEC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCGATTTCTTGGCAAGAAGATTGGAAAAGTTGAAGATATTGACTGGGATGGGGAAAATGACTGGTTAGGGCCGATTTTTCGTATTCGTGTCCTTCTTGATTTAAC
GGAACCGCTTCAAAGAGGATTGAAAGTTATTTGGTGTCCAATATTTTACGAAAAACTACCTGATATTTGCTTCAGCTGTGGTGTTTTTGGCCATTCGTCTAGAGAATGTT
CTAAAGTGGCATCTAATTGTACAGAGGGCTCTAATTTGGCTTATGGTGATTGGTTGAGGGCTCCCTATGTCAAGAAATCAGGACAACCAACGAAGGAGGAGGGAGGTTCT
ATAGACAGTATGGGTGCTAGTGGTGGTCTTGGTCTTTTTTGGATGAAGGAAGTGGATATGTCCATTTGTTCTTATTCCCGAAATCATGTTGACTGTCAATTAAACTATGA
TAATAGGTGTTGGCGCTTCTCAAGGATTTATGGCTGTCTAGAGGCTCATAACAAACATATCACTTGTGAATTAATTCAGAGACTCCATAATAATGATGACTCTCCTTGGA
TTGTTGGAGGGAACCTTAATGAAGCTATTTGGTCAAAAGAGAAGAGAGGCGGGTTATCCCTTGCTGCTAGATCTATGGAGTTATTGAGAACCACTGTTGATGATGTATTT
CTCAAAGATCTTGGGTTTTCTGGAGATATTTTTACTTGGTCTAAAAAATGGTGTCGTGGGCCTACAGTTTGGAAAAGGTTAGATCGGTTTTTGGGCAACCAAGCTTTTTG
TGAATATTTTCCTGGTTGTAGAGTTACTCATCTAGACTGGTTTGGTTCTGATCATCGGCCAATTTGTCTTAACTTGTATGCTGTGGCTAGTGATGATTATACTGATGGAG
ATTGGGATCAGGAGCTCCTTTCTGCTTTTCTTTGGAGGAAAGATGTAGAGTGCATTTGTTCAATTCCTATTTGCAAATCCTATGAAGAAGATAAATGGATTTGGCACTAC
TCAAAGGATGGAGATTTCTCGGTTAGAAGTGTTGTCCCCCCAGTTAATGTTCGAATTGAATGGATTCGTCAGTATTTGGAGGATTTTTTTAAATCTAATGATATGGAATC
CCATAAGCATGCTGAGTATGATTCTTGCTCTGCTGTTGCTCCTGTTTGTTGGATTGCGCCATGGCCGGGATTTCTCAAGCTTAATGTCGATGCTTCATGTTCCCCTTTAG
CGCCGAAATTGGGGTTTGGATTAATTTTCAGAGATCACATGGGATTATGCAAATTTGCTTATTCCATTTTCAAACCTGTGTTTTGTGATATCCTCTCAGCGGAAGCTTTG
GCTTTGTTGGAGGGATTGAAGGTGGCTGATAATCTTGGATTCCATATTTTGATTATTGAATCTGATTCAAAGACACTTGTCGATGCTATTCTAGGGAATCGTTTATCTCT
TTCTCCTAAAGGTATTATCCTTGATGAAATTCGTCTGCTGCTAAAGAAATATGGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGCGATTTCTTGGCAAGAAGATTGGAAAAGTTGAAGATATTGACTGGGATGGGGAAAATGACTGGTTAGGGCCGATTTTTCGTATTCGTGTCCTTCTTGATTTAAC
GGAACCGCTTCAAAGAGGATTGAAAGTTATTTGGTGTCCAATATTTTACGAAAAACTACCTGATATTTGCTTCAGCTGTGGTGTTTTTGGCCATTCGTCTAGAGAATGTT
CTAAAGTGGCATCTAATTGTACAGAGGGCTCTAATTTGGCTTATGGTGATTGGTTGAGGGCTCCCTATGTCAAGAAATCAGGACAACCAACGAAGGAGGAGGGAGGTTCT
ATAGACAGTATGGGTGCTAGTGGTGGTCTTGGTCTTTTTTGGATGAAGGAAGTGGATATGTCCATTTGTTCTTATTCCCGAAATCATGTTGACTGTCAATTAAACTATGA
TAATAGGTGTTGGCGCTTCTCAAGGATTTATGGCTGTCTAGAGGCTCATAACAAACATATCACTTGTGAATTAATTCAGAGACTCCATAATAATGATGACTCTCCTTGGA
TTGTTGGAGGGAACCTTAATGAAGCTATTTGGTCAAAAGAGAAGAGAGGCGGGTTATCCCTTGCTGCTAGATCTATGGAGTTATTGAGAACCACTGTTGATGATGTATTT
CTCAAAGATCTTGGGTTTTCTGGAGATATTTTTACTTGGTCTAAAAAATGGTGTCGTGGGCCTACAGTTTGGAAAAGGTTAGATCGGTTTTTGGGCAACCAAGCTTTTTG
TGAATATTTTCCTGGTTGTAGAGTTACTCATCTAGACTGGTTTGGTTCTGATCATCGGCCAATTTGTCTTAACTTGTATGCTGTGGCTAGTGATGATTATACTGATGGAG
ATTGGGATCAGGAGCTCCTTTCTGCTTTTCTTTGGAGGAAAGATGTAGAGTGCATTTGTTCAATTCCTATTTGCAAATCCTATGAAGAAGATAAATGGATTTGGCACTAC
TCAAAGGATGGAGATTTCTCGGTTAGAAGTGTTGTCCCCCCAGTTAATGTTCGAATTGAATGGATTCGTCAGTATTTGGAGGATTTTTTTAAATCTAATGATATGGAATC
CCATAAGCATGCTGAGTATGATTCTTGCTCTGCTGTTGCTCCTGTTTGTTGGATTGCGCCATGGCCGGGATTTCTCAAGCTTAATGTCGATGCTTCATGTTCCCCTTTAG
CGCCGAAATTGGGGTTTGGATTAATTTTCAGAGATCACATGGGATTATGCAAATTTGCTTATTCCATTTTCAAACCTGTGTTTTGTGATATCCTCTCAGCGGAAGCTTTG
GCTTTGTTGGAGGGATTGAAGGTGGCTGATAATCTTGGATTCCATATTTTGATTATTGAATCTGATTCAAAGACACTTGTCGATGCTATTCTAGGGAATCGTTTATCTCT
TTCTCCTAAAGGTATTATCCTTGATGAAATTCGTCTGCTGCTAAAGAAATATGGCTAA
Protein sequenceShow/hide protein sequence
MARFLGKKIGKVEDIDWDGENDWLGPIFRIRVLLDLTEPLQRGLKVIWCPIFYEKLPDICFSCGVFGHSSRECSKVASNCTEGSNLAYGDWLRAPYVKKSGQPTKEEGGS
IDSMGASGGLGLFWMKEVDMSICSYSRNHVDCQLNYDNRCWRFSRIYGCLEAHNKHITCELIQRLHNNDDSPWIVGGNLNEAIWSKEKRGGLSLAARSMELLRTTVDDVF
LKDLGFSGDIFTWSKKWCRGPTVWKRLDRFLGNQAFCEYFPGCRVTHLDWFGSDHRPICLNLYAVASDDYTDGDWDQELLSAFLWRKDVECICSIPICKSYEEDKWIWHY
SKDGDFSVRSVVPPVNVRIEWIRQYLEDFFKSNDMESHKHAEYDSCSAVAPVCWIAPWPGFLKLNVDASCSPLAPKLGFGLIFRDHMGLCKFAYSIFKPVFCDILSAEAL
ALLEGLKVADNLGFHILIIESDSKTLVDAILGNRLSLSPKGIILDEIRLLLKKYG