; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0018517 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0018517
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionReverse transcriptase
Genome locationchr01:23269945..23272135
RNA-Seq ExpressionIVF0018517
SyntenyIVF0018517
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026229.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]3.68e-23470.59Show/hide
Query:  MSRGGKKRRLKDLKTQQIPGGRDLENEESEGESSHQQGDITIDEQLSLNNNLEKTFGIKRLKALGATTFARTTDPADAEGWMNVLEKCFKVMRCLDDRKV
        MSRGGKKRRLKDLKTQQIPGGRDLENEESE                                ALGATTFARTTDPADAEGWMNVLEKCFK    +     
Subjt:  MSRGGKKRRLKDLKTQQIPGGRDLENEESEGESSHQQGDITIDEQLSLNNNLEKTFGIKRLKALGATTFARTTDPADAEGWMNVLEKCFKVMRCLDDRKV

Query:  ELATFLLKKEADYWWRTFESQYHDTNEINWNEFKNAFYEKYYPRSYKDAKRSEFFMIEDEIDKRFEDGLREEIQTPVTASSEWKEFSKLVETVIRVEKSL
                                           A YEK Y +  K A      MIEDEIDKRFEDGLREEIQTPVTASSEWKEFSKLVETVIRVEKSL
Subjt:  ELATFLLKKEADYWWRTFESQYHDTNEINWNEFKNAFYEKYYPRSYKDAKRSEFFMIEDEIDKRFEDGLREEIQTPVTASSEWKEFSKLVETVIRVEKSL

Query:  AERKNEKEVSRNTRHRGSFDYGPNTSATSQVSSSSNRTTGNSGGPRMWQQREPVSSHNKNPTCL----NCGKSHTGECLWGANVCYNCGQPGHFRKECPQ
        AERKNEKEVSRN           +T+ T  VS   +   G +G   M   R     H K    L       + H  ECLWGANVCYNCGQPGHFRKECPQ
Subjt:  AERKNEKEVSRNTRHRGSFDYGPNTSATSQVSSSSNRTTGNSGGPRMWQQREPVSSHNKNPTCL----NCGKSHTGECLWGANVCYNCGQPGHFRKECPQ

Query:  LHKDRK--------------KGQAGRPRVQGMVFAMTQQEAHKTPEIVTGMLSILGKYARVLIDPGATHSFISTMFTARIDKNLELLPSELLIATPVGKT
        LHKDRK               GQAGRPRVQGMVFAMTQQEAHKTPEIVTGMLSILGKYARVLIDPGATHSFISTMFTARIDKNLELLPSELLIATPVGKT
Subjt:  LHKDRK--------------KGQAGRPRVQGMVFAMTQQEAHKTPEIVTGMLSILGKYARVLIDPGATHSFISTMFTARIDKNLELLPSELLIATPVGKT

Query:  LVANNVYRNCVAQIDGKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRRPGED----VIHFSGSKPRLEDIPVVQEFSDDLHGLPPNREIE
        LVANNVYRNCVAQIDGKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRRPGED    V+   GSKPRLEDIPVVQEFSDDLHGLPPNREIE
Subjt:  LVANNVYRNCVAQIDGKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRRPGED----VIHFSGSKPRLEDIPVVQEFSDDLHGLPPNREIE

Query:  FSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSVSPWGSPSLFVKKKDGTL
        FSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSVSPWGSPSLFVKKKDGTL
Subjt:  FSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSVSPWGSPSLFVKKKDGTL

KAA0042309.1 reverse transcriptase [Cucumis melo var. makuwa]8.90e-19048.68Show/hide
Query:  EKTFGIKRLKALGATTFARTTDPADAEGWMNVLEKCFKVMRCLDDRKVELATFLLKKEADYWWRTFESQYHDTNEINWNEFKNAFYEKYYPRSYKDAKRS
        EK +GI+RLK LGAT F   TDP DAE W+N+ EKCF VM C ++RKV LATFLL+KEA+ WW++  ++  D   ++W  F+  F +KYYP +Y +AKR 
Subjt:  EKTFGIKRLKALGATTFARTTDPADAEGWMNVLEKCFKVMRCLDDRKVELATFLLKKEADYWWRTFESQYHDTNEINWNEFKNAFYEKYYPRSYKDAKRS

Query:  EFF-------------------------MIEDEIDK--RFEDGLREEIQTPVTASSEWKEFSKLVETVIRVEKSLAERKNEKEVSRNTRHRGSFDYGPNT
        EF                          +I  E D+  RFE GLR EI+TPVTA ++W  FS+LVET +RVE+S+ E K+  E+SR T     F      
Subjt:  EFF-------------------------MIEDEIDK--RFEDGLREEIQTPVTASSEWKEFSKLVETVIRVEKSLAERKNEKEVSRNTRHRGSFDYGPNT

Query:  SATSQVSSSSNRTTGNSGGPRMWQQREPVSSHNKNPTCLNCGKSHTGECLWGANVCYNCGQPGHFRKECPQLHKD-------------------------
          T  ++ SS +   N  G +   + E ++S  +   C +CG++H G+CL GA VCY CGQPGHF+K+CPQL+                           
Subjt:  SATSQVSSSSNRTTGNSGGPRMWQQREPVSSHNKNPTCLNCGKSHTGECLWGANVCYNCGQPGHFRKECPQLHKD-------------------------

Query:  ---RKKGQAGRPRVQGMVFAMTQQEAHKTPEIVTGMLSILGKYARVLIDPGATHSFISTMFTARIDKNLELLPSELLIATPVGKTLVANNVYRNCVAQID
           R+KG  GRPR QG  +AMTQQEA   P+++ G + I    A VL DPGATHSF+ ++F  ++++ LE L   L I TPVG  L+ N V RNC   ++
Subjt:  ---RKKGQAGRPRVQGMVFAMTQQEAHKTPEIVTGMLSILGKYARVLIDPGATHSFISTMFTARIDKNLELLPSELLIATPVGKTLVANNVYRNCVAQID

Query:  GKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRRPGEDVIHFSG-----------------------------------SKPRLEDIPVVQ
        G  L  +L+PL++   DVILGMDFL  H+AS+DCHRKEV FR+PG   + F G                                    K + ED+PVV+
Subjt:  GKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRRPGEDVIHFSG-----------------------------------SKPRLEDIPVVQ

Query:  EF----SDDLHGLPPNREIEFSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSVSPWGSPSLFVKKKDGTLRLCVDYRQLNKMTIRNKYP
        EF     DDL GLPP+REIEF+I+L+PGTAPISQ PYRMAP +LKELK QLQELVDKGYI+PSVSPWG+  LFVKKKDGTLRLC DYRQLNK+TIRNKYP
Subjt:  EF----SDDLHGLPPNREIEFSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSVSPWGSPSLFVKKKDGTLRLCVDYRQLNKMTIRNKYP

Query:  LPRINDLFDQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFRT
        LPRI+DLFDQL+GA++F K+DL SGYHQLK++E D+ KTAFRT
Subjt:  LPRINDLFDQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFRT

KAA0060440.1 retrotransposon protein [Cucumis melo var. makuwa]1.06e-19651.25Show/hide
Query:  EKTFGIKRLKALGATTFARTTDPADAEGWMNVLEKCFKVMRCLDDRKVELATFLLKKEADYWWRTFESQYHDTNEINWNEFKNAFYEKYYPRSYKDAKRS
        EK +GI+RLK LGAT F  +TDP D E W+N+LEKCF +M C ++RKV LATFLL+KE + WW++  ++  D   ++W  F+  F +KYYP +Y +AKR 
Subjt:  EKTFGIKRLKALGATTFARTTDPADAEGWMNVLEKCFKVMRCLDDRKVELATFLLKKEADYWWRTFESQYHDTNEINWNEFKNAFYEKYYPRSYKDAKRS

Query:  EFFMIEDEIDKRFEDGLREEIQTPVTASSEWKEFSKLVETVIRVEKSLAERKNEKEVSRNTRHRGSF---------DYGPNTSATSQ-VSSSSNRTTGNS
        EF   E +  +RFE GLR EI+TPVTA ++W  FS+LVET +RVE+S+ E K+  E+SR T     F          YG      SQ + S   R+T   
Subjt:  EFFMIEDEIDKRFEDGLREEIQTPVTASSEWKEFSKLVETVIRVEKSLAERKNEKEVSRNTRHRGSF---------DYGPNTSATSQ-VSSSSNRTTGNS

Query:  GGPRMWQQREPVSSHNKNPTCLNCGKSHTGECLWGANVCYNCGQPGHFRKECPQLHKD----------------------------RKKGQAGRPRVQGM
           R    +E ++S  +   C +CG++H G+CL GA VCY CGQPGHF+K+CPQL+                              R+KG  GRPR QG 
Subjt:  GGPRMWQQREPVSSHNKNPTCLNCGKSHTGECLWGANVCYNCGQPGHFRKECPQLHKD----------------------------RKKGQAGRPRVQGM

Query:  VFAMTQQEAHKTPEIVTGMLSILGKYARVLIDPGATHSFISTMFTARIDKNLELLPSELLIATPVGKTLVANNVYRNCVAQIDGKELTANLIPLDIHEFD
         +AMTQQEA   P+++TG + I    A VL DPGATHSF+S++F  ++++ LE L   L I TPVG  L+ N V RNC   ++G  L  +L+PL++   D
Subjt:  VFAMTQQEAHKTPEIVTGMLSILGKYARVLIDPGATHSFISTMFTARIDKNLELLPSELLIATPVGKTLVANNVYRNCVAQIDGKELTANLIPLDIHEFD

Query:  VILGMDFLSVHHASIDCHRKEVAFRRPG----------EDVIHFSGSKPRLEDIPVVQEF----SDDLHGLPPNREIEFSIDLIPGTAPISQPPYRMAPK
        VILGMDFL  H+AS+D HRKEV F++P             ++     K + ED+PVV+EF     DDL GLPP+REIEF+I+L+PG APISQ PYR+AP 
Subjt:  VILGMDFLSVHHASIDCHRKEVAFRRPG----------EDVIHFSGSKPRLEDIPVVQEF----SDDLHGLPPNREIEFSIDLIPGTAPISQPPYRMAPK

Query:  DLKELKTQLQELVDKGYIQPSVSPWGSPSLFVKKKDGTLRLCVDYRQLNKMTIRNKYPLPRINDLFDQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFR
        +LKELK QLQELVDKGYI+PSVSPWG+P LFVKKKDGTLRLC+DYRQLNK+TIRNKYPLPRI+DLFDQL+GA++F K+DL SGYHQLK++E D+ KTAFR
Subjt:  DLKELKTQLQELVDKGYIQPSVSPWGSPSLFVKKKDGTLRLCVDYRQLNKMTIRNKYPLPRINDLFDQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFR

Query:  T
        T
Subjt:  T

TYK06334.1 putative polyprotein [Cucumis melo var. makuwa]1.88e-18947.84Show/hide
Query:  EKTFGIKRLKALGATTFARTTDPADAEGWMNVLEKCFKVMRCLDDRKVELATFLLKKEADYWWRTFESQYHDTNEINWNEFKNAFYEKYYPRSYKDAKRS
        EK +GI+RLK LGAT F  +TDPADAE W+N+LEKCF VM C ++RKV LATFLL+KEA+ WW++  ++  D   ++W  F+  F +KYYP +Y +AKR 
Subjt:  EKTFGIKRLKALGATTFARTTDPADAEGWMNVLEKCFKVMRCLDDRKVELATFLLKKEADYWWRTFESQYHDTNEINWNEFKNAFYEKYYPRSYKDAKRS

Query:  EFF-------------------------MIEDEIDK--RFEDGLREEIQTPVTASSEWKEFSKLVETVIRVEKSLAERKNEKEVSRNTRHRGSFDYGPNT
        EF                          +I  E D+  RFE GLR EI+TPVTA ++W  FS+LVET +RVE+S+ E K+  E+SR T     F      
Subjt:  EFF-------------------------MIEDEIDK--RFEDGLREEIQTPVTASSEWKEFSKLVETVIRVEKSLAERKNEKEVSRNTRHRGSFDYGPNT

Query:  SATSQVSSSSNRTTGN-SGGP---------------------------RMWQQREPVSSHNKNPTCLNCGKSHTGECLWGANVCYNCGQPGHFRKECPQL
          T  ++ SS +   N SGG                            R    +E ++S  +   C +CG++H G+CL GA VCY CGQPGHF+K+CPQL
Subjt:  SATSQVSSSSNRTTGN-SGGP---------------------------RMWQQREPVSSHNKNPTCLNCGKSHTGECLWGANVCYNCGQPGHFRKECPQL

Query:  HKD----------------------------RKKGQAGRPRVQGMVFAMTQQEAHKTPEIVTGMLSILGKYARVLIDPGATHSFISTMFTARIDKNLELL
        +                              R+KG  GRPR QG V+AMTQQE    P+++TG + I    A VL DPGATHSF+S++F  ++++ LE L
Subjt:  HKD----------------------------RKKGQAGRPRVQGMVFAMTQQEAHKTPEIVTGMLSILGKYARVLIDPGATHSFISTMFTARIDKNLELL

Query:  PSELLIATPVGKTLVANNVYRNCVAQIDGKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRRPGEDVIHFSG-------------------
           L I TPVG  L+ N V RNC   ++G  L  +L+PL++   DVILGMDFL  H+AS+DCHRKEV FR+PG   + F G                   
Subjt:  PSELLIATPVGKTLVANNVYRNCVAQIDGKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRRPGEDVIHFSG-------------------

Query:  ----------------SKPRLEDIPVVQEF----SDDLHGLPPNREIEFSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSVSPWGSPSL
                         K + ED+PVV+EF     DDL GLPP+REIEF+I+L+PGTAPISQ PYRMAP +LKELK QLQELVDKGYI+PSVSPWG+P L
Subjt:  ----------------SKPRLEDIPVVQEF----SDDLHGLPPNREIEFSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSVSPWGSPSL

Query:  FVKKKDGTLRLCVDYRQLNKMTIRNKYPLPRINDLFDQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFRT
        FVKKKDGTLRLC+DYRQLNK+TIRNKYPLPRI+DLFDQL+GA++F K+DL SGYHQLK++E D+ KTAFRT
Subjt:  FVKKKDGTLRLCVDYRQLNKMTIRNKYPLPRINDLFDQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFRT

TYK27507.1 reverse transcriptase [Cucumis melo var. makuwa]1.34e-18947.84Show/hide
Query:  EKTFGIKRLKALGATTFARTTDPADAEGWMNVLEKCFKVMRCLDDRKVELATFLLKKEADYWWRTFESQYHDTNEINWNEFKNAFYEKYYPRSYKDAKRS
        EK +GI+RLK LGAT F  +TDPADAE W+N+LEKCF VM C ++RKV LATFLL+KEA+ WW++  ++  D   ++W  F+  F +KYYP +Y +AKR 
Subjt:  EKTFGIKRLKALGATTFARTTDPADAEGWMNVLEKCFKVMRCLDDRKVELATFLLKKEADYWWRTFESQYHDTNEINWNEFKNAFYEKYYPRSYKDAKRS

Query:  EFF-------------------------MIEDEIDK--RFEDGLREEIQTPVTASSEWKEFSKLVETVIRVEKSLAERKNEKEVSRNTRHRGSFDYGPNT
        EF                          +I  E D+  RFE GLR EI+TPVTA ++W  FS+LVET +RVE+S+ E K+  E+SR T     F      
Subjt:  EFF-------------------------MIEDEIDK--RFEDGLREEIQTPVTASSEWKEFSKLVETVIRVEKSLAERKNEKEVSRNTRHRGSFDYGPNT

Query:  SATSQVSSSSNRTTGN-SGGP---------------------------RMWQQREPVSSHNKNPTCLNCGKSHTGECLWGANVCYNCGQPGHFRKECPQL
          T  ++ SS +   N SGG                            R    +E ++S  +   C +CG++H G+CL GA VCY CGQPGHF+K+CPQL
Subjt:  SATSQVSSSSNRTTGN-SGGP---------------------------RMWQQREPVSSHNKNPTCLNCGKSHTGECLWGANVCYNCGQPGHFRKECPQL

Query:  HKD----------------------------RKKGQAGRPRVQGMVFAMTQQEAHKTPEIVTGMLSILGKYARVLIDPGATHSFISTMFTARIDKNLELL
        +                              R+KG  GRPR QG V+AMTQQE    P+++TG + I    A VL DPGATHSF+S++F  ++++ LE L
Subjt:  HKD----------------------------RKKGQAGRPRVQGMVFAMTQQEAHKTPEIVTGMLSILGKYARVLIDPGATHSFISTMFTARIDKNLELL

Query:  PSELLIATPVGKTLVANNVYRNCVAQIDGKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRRPGEDVIHFSG-------------------
           L I TPVG  L+ N V RNC   ++G  L  +L+PL++   DVILGMDFL  H+AS+DCHRKEV FR+PG   + F G                   
Subjt:  PSELLIATPVGKTLVANNVYRNCVAQIDGKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRRPGEDVIHFSG-------------------

Query:  ----------------SKPRLEDIPVVQEF----SDDLHGLPPNREIEFSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSVSPWGSPSL
                         K + ED+PVV+EF     DDL GLPP+REIEF+I+L+PGTAPISQ PYRMAP +LKELK QLQELVDKGYI+PSVSPWG+P L
Subjt:  ----------------SKPRLEDIPVVQEF----SDDLHGLPPNREIEFSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSVSPWGSPSL

Query:  FVKKKDGTLRLCVDYRQLNKMTIRNKYPLPRINDLFDQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFRT
        FVKKKDGTLRLC+DYRQLNK+TIRNKYPLPRI+DLFDQL+GA++F K+DL SGYHQLK++E D+ KTAFRT
Subjt:  FVKKKDGTLRLCVDYRQLNKMTIRNKYPLPRINDLFDQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFRT

TrEMBL top hitse value%identityAlignment
A0A5A7SLT2 DNA/RNA polymerases superfamily protein1.3e-19670.59Show/hide
Query:  MSRGGKKRRLKDLKTQQIPGGRDLENEESEGESSHQQGDITIDEQLSLNNNLEKTFGIKRLKALGATTFARTTDPADAEGWMNVLEKCFKVMRCLDDRKV
        MSRGGKKRRLKDLKTQQIPGGRDLENEESE                                ALGATTFARTTDPADAEGWMNVLEKCFK          
Subjt:  MSRGGKKRRLKDLKTQQIPGGRDLENEESEGESSHQQGDITIDEQLSLNNNLEKTFGIKRLKALGATTFARTTDPADAEGWMNVLEKCFKVMRCLDDRKV

Query:  ELATFLLKKEADYWWRTFESQYHDTNEINWNEFKNAFYEKYYPRSYKDAKRSEFFMIEDEIDKRFEDGLREEIQTPVTASSEWKEFSKLVETVIRVEKSL
                                           A YEK Y +  K A      MIEDEIDKRFEDGLREEIQTPVTASSEWKEFSKLVETVIRVEKSL
Subjt:  ELATFLLKKEADYWWRTFESQYHDTNEINWNEFKNAFYEKYYPRSYKDAKRSEFFMIEDEIDKRFEDGLREEIQTPVTASSEWKEFSKLVETVIRVEKSL

Query:  AERKNEKEVSRNTRHRGSFDYGPNTSATSQVSSSSNRTTGNSGGPRMWQQREPVSSHNKNPTCL----NCGKSHTGECLWGANVCYNCGQPGHFRKECPQ
        AERKNEKEVSRN           +T+ T  VS   +   G +G   M   R     H K    L       + H  ECLWGANVCYNCGQPGHFRKECPQ
Subjt:  AERKNEKEVSRNTRHRGSFDYGPNTSATSQVSSSSNRTTGNSGGPRMWQQREPVSSHNKNPTCL----NCGKSHTGECLWGANVCYNCGQPGHFRKECPQ

Query:  LHKDRK--------------KGQAGRPRVQGMVFAMTQQEAHKTPEIVTGMLSILGKYARVLIDPGATHSFISTMFTARIDKNLELLPSELLIATPVGKT
        LHKDRK               GQAGRPRVQGMVFAMTQQEAHKTPEIVTGMLSILGKYARVLIDPGATHSFISTMFTARIDKNLELLPSELLIATPVGKT
Subjt:  LHKDRK--------------KGQAGRPRVQGMVFAMTQQEAHKTPEIVTGMLSILGKYARVLIDPGATHSFISTMFTARIDKNLELLPSELLIATPVGKT

Query:  LVANNVYRNCVAQIDGKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRRPGED----VIHFSGSKPRLEDIPVVQEFSDDLHGLPPNREIE
        LVANNVYRNCVAQIDGKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRRPGED    V+   GSKPRLEDIPVVQEFSDDLHGLPPNREIE
Subjt:  LVANNVYRNCVAQIDGKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRRPGED----VIHFSGSKPRLEDIPVVQEFSDDLHGLPPNREIE

Query:  FSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSVSPWGSPSLFVKKKDGTL
        FSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSVSPWGSPSLFVKKKDGTL
Subjt:  FSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSVSPWGSPSLFVKKKDGTL

A0A5A7TLH7 Reverse transcriptase2.7e-16548.85Show/hide
Query:  EKTFGIKRLKALGATTFARTTDPADAEGWMNVLEKCFKVMRCLDDRKVELATFLLKKEADYWWRTFESQYHDTNEINWNEFKNAFYEKYYPRSYKDAKRS
        EK +GI+RLK LGAT F  +TDPADAE W+N+LEKCF VM C ++RKV LATFLL+KEA+ WW++  ++  D   ++W  F+  F +KYYP +Y +AKR 
Subjt:  EKTFGIKRLKALGATTFARTTDPADAEGWMNVLEKCFKVMRCLDDRKVELATFLLKKEADYWWRTFESQYHDTNEINWNEFKNAFYEKYYPRSYKDAKRS

Query:  EFFMI---------------------------EDEIDKRFEDGLREEIQTPVTASSEWKEFSKLVETVIRVEKSLAERKNEKEVSRNTRHRGSF------
        EF  +                           E +  +RFE GLR EI+TPVT  ++W  FS+LVET +RVE+S+AE K+  E+SR T     F      
Subjt:  EFFMI---------------------------EDEIDKRFEDGLREEIQTPVTASSEWKEFSKLVETVIRVEKSLAERKNEKEVSRNTRHRGSF------

Query:  ---DYGPNTSATSQ-VSSSSNRTTGNSGGPRMWQQREPVSSHNKNPTCLNCGKSHTGECLWGANVCYNCGQPGHFRKECPQLHKD---------------
            YG      SQ + S   RTT      R    +E ++S  +   C +CG++H G+CL GA VCY CGQPGHF+K+CPQL+                 
Subjt:  ---DYGPNTSATSQ-VSSSSNRTTGNSGGPRMWQQREPVSSHNKNPTCLNCGKSHTGECLWGANVCYNCGQPGHFRKECPQLHKD---------------

Query:  -------------RKKGQAGRPRVQGMVFAMTQQEAHKTPEIVTGMLSILGKYARVLIDPGATHSFISTMFTARIDKNLELLPSELLIATPVGKTLVANN
                     R+KG  GRPR QG V+AMTQQE    P+++TG + I    A VL DPGATHSF+S++F  ++++ LE L   L I TPVG  L+ N 
Subjt:  -------------RKKGQAGRPRVQGMVFAMTQQEAHKTPEIVTGMLSILGKYARVLIDPGATHSFISTMFTARIDKNLELLPSELLIATPVGKTLVANN

Query:  VYRNCVAQIDGKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRRPGEDVIHFSG-----------------------------------SK
        V RNC   ++G  L  +L+PL++   DVILGMDFL  H+AS+DCHRKEV FR+PG   + F G                                    K
Subjt:  VYRNCVAQIDGKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRRPGEDVIHFSG-----------------------------------SK

Query:  PRLEDIPVVQEFSD----DLHGLPPNREIEFSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSVSPWGSPSLFVKKKDGTLRLCVDYRQL
         + ED+PVV+EF D    DL GLPP+REIEF+I+L+PGTAPISQ PYRMAP +LKELK QLQELVDKGYI+PSVSPWG+P LFVKKKDGTLRLC+DYRQL
Subjt:  PRLEDIPVVQEFSD----DLHGLPPNREIEFSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSVSPWGSPSLFVKKKDGTLRLCVDYRQL

Query:  NKMTIRNKYPLPRINDLFDQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFRT
        NK+TIRNKYPLPRI DLFDQL+GA++F K+DL SGYHQLK++E D+ KT FRT
Subjt:  NKMTIRNKYPLPRINDLFDQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFRT

A0A5A7U2V7 Reverse transcriptase1.8e-16447.84Show/hide
Query:  EKTFGIKRLKALGATTFARTTDPADAEGWMNVLEKCFKVMRCLDDRKVELATFLLKKEADYWWRTFESQYHDTNEINWNEFKNAFYEKYYPRSYKDAKRS
        EK +GI+RLK LGAT F  +TDPADAE W+N+LEKCF VM C ++RKV LATFLL+KEA+ WW++  ++  D   ++W  F+  F +KYYP +Y +AKR 
Subjt:  EKTFGIKRLKALGATTFARTTDPADAEGWMNVLEKCFKVMRCLDDRKVELATFLLKKEADYWWRTFESQYHDTNEINWNEFKNAFYEKYYPRSYKDAKRS

Query:  EFF-------------------------MIEDEID--KRFEDGLREEIQTPVTASSEWKEFSKLVETVIRVEKSLAERKNEKEVSRNTRHRGSFDYGPNT
        EF                          +I  E D  +RFE GLR EI+TPVTA ++W  FS+LVET +RVE+S+ E K+  E+SR T     F      
Subjt:  EFF-------------------------MIEDEID--KRFEDGLREEIQTPVTASSEWKEFSKLVETVIRVEKSLAERKNEKEVSRNTRHRGSFDYGPNT

Query:  SATSQVSSSSNRTTGN-SGGP---------------------------RMWQQREPVSSHNKNPTCLNCGKSHTGECLWGANVCYNCGQPGHFRKECPQL
          T  ++ SS +   N SGG                            R    +E ++S  +   C +CG++H G+CL GA VCY CGQPGHF+K+CPQL
Subjt:  SATSQVSSSSNRTTGN-SGGP---------------------------RMWQQREPVSSHNKNPTCLNCGKSHTGECLWGANVCYNCGQPGHFRKECPQL

Query:  HKD----------------------------RKKGQAGRPRVQGMVFAMTQQEAHKTPEIVTGMLSILGKYARVLIDPGATHSFISTMFTARIDKNLELL
        +                              R+KG  GRPR QG V+AMTQQE    P+++TG + I    A VL DPGATHSF+S++F  ++++ LE L
Subjt:  HKD----------------------------RKKGQAGRPRVQGMVFAMTQQEAHKTPEIVTGMLSILGKYARVLIDPGATHSFISTMFTARIDKNLELL

Query:  PSELLIATPVGKTLVANNVYRNCVAQIDGKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRRPGEDVIHFSG-------------------
           L I TPVG  L+ N V RNC   ++G  L  +L+PL++   DVILGMDFL  H+AS+DCHRKEV FR+PG   + F G                   
Subjt:  PSELLIATPVGKTLVANNVYRNCVAQIDGKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRRPGEDVIHFSG-------------------

Query:  ----------------SKPRLEDIPVVQE----FSDDLHGLPPNREIEFSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSVSPWGSPSL
                         K + ED+PVV+E    F DDL GLPP+REIEF+I+L+PGTAPISQ PYRMAP +LKELK QLQELVDKGYI+PSVSPWG+P L
Subjt:  ----------------SKPRLEDIPVVQE----FSDDLHGLPPNREIEFSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSVSPWGSPSL

Query:  FVKKKDGTLRLCVDYRQLNKMTIRNKYPLPRINDLFDQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFRT
        FVKKKDGTLRLC+DYRQLNK+TIRNKYPLPRI+DLFDQL+GA++F K+DL SGYHQLK++E D+ KTAFRT
Subjt:  FVKKKDGTLRLCVDYRQLNKMTIRNKYPLPRINDLFDQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFRT

A0A5A7UZA9 Retrotransposon protein1.1e-16651.34Show/hide
Query:  EKTFGIKRLKALGATTFARTTDPADAEGWMNVLEKCFKVMRCLDDRKVELATFLLKKEADYWWRTFESQYHDTNEINWNEFKNAFYEKYYPRSYKDAKRS
        EK +GI+RLK LGAT F  +TDP D E W+N+LEKCF +M C ++RKV LATFLL+KE + WW++  ++  D   ++W  F+  F +KYYP +Y +AKR 
Subjt:  EKTFGIKRLKALGATTFARTTDPADAEGWMNVLEKCFKVMRCLDDRKVELATFLLKKEADYWWRTFESQYHDTNEINWNEFKNAFYEKYYPRSYKDAKRS

Query:  EFFMIEDEIDKRFEDGLREEIQTPVTASSEWKEFSKLVETVIRVEKSLAERKNEKEVSRNTRHRGSF---DYGPNTSATSQVSSSSNRTTGN--SGGPRM
        EF   E +  +RFE GLR EI+TPVTA ++W  FS+LVET +RVE+S+ E K+  E+SR T     F       N S  S     S R          R 
Subjt:  EFFMIEDEIDKRFEDGLREEIQTPVTASSEWKEFSKLVETVIRVEKSLAERKNEKEVSRNTRHRGSF---DYGPNTSATSQVSSSSNRTTGN--SGGPRM

Query:  WQQREPVSSHNKNPTCLNCGKSHTGECLWGANVCYNCGQPGHFRKECPQLHKD----------------------------RKKGQAGRPRVQGMVFAMT
           +E ++S  +   C +CG++H G+CL GA VCY CGQPGHF+K+CPQL+                              R+KG  GRPR QG  +AMT
Subjt:  WQQREPVSSHNKNPTCLNCGKSHTGECLWGANVCYNCGQPGHFRKECPQLHKD----------------------------RKKGQAGRPRVQGMVFAMT

Query:  QQEAHKTPEIVTGMLSILGKYARVLIDPGATHSFISTMFTARIDKNLELLPSELLIATPVGKTLVANNVYRNCVAQIDGKELTANLIPLDIHEFDVILGM
        QQEA   P+++TG + I    A VL DPGATHSF+S++F  ++++ LE L   L I TPVG  L+ N V RNC   ++G  L  +L+PL++   DVILGM
Subjt:  QQEAHKTPEIVTGMLSILGKYARVLIDPGATHSFISTMFTARIDKNLELLPSELLIATPVGKTLVANNVYRNCVAQIDGKELTANLIPLDIHEFDVILGM

Query:  DFLSVHHASIDCHRKEVAFRRPG----------EDVIHFSGSKPRLEDIPVVQE----FSDDLHGLPPNREIEFSIDLIPGTAPISQPPYRMAPKDLKEL
        DFL  H+AS+D HRKEV F++P             ++     K + ED+PVV+E    F DDL GLPP+REIEF+I+L+PG APISQ PYR+AP +LKEL
Subjt:  DFLSVHHASIDCHRKEVAFRRPG----------EDVIHFSGSKPRLEDIPVVQE----FSDDLHGLPPNREIEFSIDLIPGTAPISQPPYRMAPKDLKEL

Query:  KTQLQELVDKGYIQPSVSPWGSPSLFVKKKDGTLRLCVDYRQLNKMTIRNKYPLPRINDLFDQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFRT
        K QLQELVDKGYI+PSVSPWG+P LFVKKKDGTLRLC+DYRQLNK+TIRNKYPLPRI+DLFDQL+GA++F K+DL SGYHQLK++E D+ KTAFRT
Subjt:  KTQLQELVDKGYIQPSVSPWGSPSLFVKKKDGTLRLCVDYRQLNKMTIRNKYPLPRINDLFDQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFRT

A0A5D3BS67 Reverse transcriptase1.8e-16447.84Show/hide
Query:  EKTFGIKRLKALGATTFARTTDPADAEGWMNVLEKCFKVMRCLDDRKVELATFLLKKEADYWWRTFESQYHDTNEINWNEFKNAFYEKYYPRSYKDAKRS
        EK +GI+RLK LGAT F  +TDPADAE W+N+LEKCF VM C ++RKV LATFLL+KEA+ WW++  ++  D   ++W  F+  F +KYYP +Y +AKR 
Subjt:  EKTFGIKRLKALGATTFARTTDPADAEGWMNVLEKCFKVMRCLDDRKVELATFLLKKEADYWWRTFESQYHDTNEINWNEFKNAFYEKYYPRSYKDAKRS

Query:  EFF-------------------------MIEDEID--KRFEDGLREEIQTPVTASSEWKEFSKLVETVIRVEKSLAERKNEKEVSRNTRHRGSFDYGPNT
        EF                          +I  E D  +RFE GLR EI+TPVTA ++W  FS+LVET +RVE+S+ E K+  E+SR T     F      
Subjt:  EFF-------------------------MIEDEID--KRFEDGLREEIQTPVTASSEWKEFSKLVETVIRVEKSLAERKNEKEVSRNTRHRGSFDYGPNT

Query:  SATSQVSSSSNRTTGN-SGGP---------------------------RMWQQREPVSSHNKNPTCLNCGKSHTGECLWGANVCYNCGQPGHFRKECPQL
          T  ++ SS +   N SGG                            R    +E ++S  +   C +CG++H G+CL GA VCY CGQPGHF+K+CPQL
Subjt:  SATSQVSSSSNRTTGN-SGGP---------------------------RMWQQREPVSSHNKNPTCLNCGKSHTGECLWGANVCYNCGQPGHFRKECPQL

Query:  HKD----------------------------RKKGQAGRPRVQGMVFAMTQQEAHKTPEIVTGMLSILGKYARVLIDPGATHSFISTMFTARIDKNLELL
        +                              R+KG  GRPR QG V+AMTQQE    P+++TG + I    A VL DPGATHSF+S++F  ++++ LE L
Subjt:  HKD----------------------------RKKGQAGRPRVQGMVFAMTQQEAHKTPEIVTGMLSILGKYARVLIDPGATHSFISTMFTARIDKNLELL

Query:  PSELLIATPVGKTLVANNVYRNCVAQIDGKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRRPGEDVIHFSG-------------------
           L I TPVG  L+ N V RNC   ++G  L  +L+PL++   DVILGMDFL  H+AS+DCHRKEV FR+PG   + F G                   
Subjt:  PSELLIATPVGKTLVANNVYRNCVAQIDGKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRRPGEDVIHFSG-------------------

Query:  ----------------SKPRLEDIPVVQE----FSDDLHGLPPNREIEFSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSVSPWGSPSL
                         K + ED+PVV+E    F DDL GLPP+REIEF+I+L+PGTAPISQ PYRMAP +LKELK QLQELVDKGYI+PSVSPWG+P L
Subjt:  ----------------SKPRLEDIPVVQE----FSDDLHGLPPNREIEFSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSVSPWGSPSL

Query:  FVKKKDGTLRLCVDYRQLNKMTIRNKYPLPRINDLFDQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFRT
        FVKKKDGTLRLC+DYRQLNK+TIRNKYPLPRI+DLFDQL+GA++F K+DL SGYHQLK++E D+ KTAFRT
Subjt:  FVKKKDGTLRLCVDYRQLNKMTIRNKYPLPRINDLFDQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFRT

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein3.6e-2129.86Show/hide
Query:  LIDPGATHSFISTMFTARIDKNLELLPSELLIATPVGKTLVANNVYRNCVAQIDGKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRRPGE
        LID GA  + I T  T R  K    LP+      P  K+++   VY N        ++    I L+I    + +  +FL V   S   H   ++F    +
Subjt:  LIDPGATHSFISTMFTARIDKNLELLPSELLIATPVGKTLVANNVYRNCVAQIDGKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRRPGE

Query:  DVIHFSGSK---------------PRLEDIPVVQEFSD-----DLHGLP-PNREIEFSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSV
        + I  S SK               P L DI   +EF D     +   LP P + +EF ++L      +    Y + P  ++ +  ++ + +  G I+ S 
Subjt:  DVIHFSGSK---------------PRLEDIPVVQEFSD-----DLHGLP-PNREIEFSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSV

Query:  SPWGSPSLFVKKKDGTLRLCVDYRQLNKMTIRNKYPLPRINDLFDQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFR
        +    P +FV KK+GTLR+ VDY+ LNK    N YPLP I  L  +++G+++F K+DL S YH +++++ D  K AFR
Subjt:  SPWGSPSLFVKKKDGTLRLCVDYRQLNKMTIRNKYPLPRINDLFDQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFR

P0CT35 Transposon Tf2-2 polyprotein3.6e-2129.86Show/hide
Query:  LIDPGATHSFISTMFTARIDKNLELLPSELLIATPVGKTLVANNVYRNCVAQIDGKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRRPGE
        LID GA  + I T  T R  K    LP+      P  K+++   VY N        ++    I L+I    + +  +FL V   S   H   ++F    +
Subjt:  LIDPGATHSFISTMFTARIDKNLELLPSELLIATPVGKTLVANNVYRNCVAQIDGKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRRPGE

Query:  DVIHFSGSK---------------PRLEDIPVVQEFSD-----DLHGLP-PNREIEFSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSV
        + I  S SK               P L DI   +EF D     +   LP P + +EF ++L      +    Y + P  ++ +  ++ + +  G I+ S 
Subjt:  DVIHFSGSK---------------PRLEDIPVVQEFSD-----DLHGLP-PNREIEFSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSV

Query:  SPWGSPSLFVKKKDGTLRLCVDYRQLNKMTIRNKYPLPRINDLFDQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFR
        +    P +FV KK+GTLR+ VDY+ LNK    N YPLP I  L  +++G+++F K+DL S YH +++++ D  K AFR
Subjt:  SPWGSPSLFVKKKDGTLRLCVDYRQLNKMTIRNKYPLPRINDLFDQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFR

P0CT41 Transposon Tf2-12 polyprotein3.6e-2129.86Show/hide
Query:  LIDPGATHSFISTMFTARIDKNLELLPSELLIATPVGKTLVANNVYRNCVAQIDGKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRRPGE
        LID GA  + I T  T R  K    LP+      P  K+++   VY N        ++    I L+I    + +  +FL V   S   H   ++F    +
Subjt:  LIDPGATHSFISTMFTARIDKNLELLPSELLIATPVGKTLVANNVYRNCVAQIDGKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRRPGE

Query:  DVIHFSGSK---------------PRLEDIPVVQEFSD-----DLHGLP-PNREIEFSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSV
        + I  S SK               P L DI   +EF D     +   LP P + +EF ++L      +    Y + P  ++ +  ++ + +  G I+ S 
Subjt:  DVIHFSGSK---------------PRLEDIPVVQEFSD-----DLHGLP-PNREIEFSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSV

Query:  SPWGSPSLFVKKKDGTLRLCVDYRQLNKMTIRNKYPLPRINDLFDQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFR
        +    P +FV KK+GTLR+ VDY+ LNK    N YPLP I  L  +++G+++F K+DL S YH +++++ D  K AFR
Subjt:  SPWGSPSLFVKKKDGTLRLCVDYRQLNKMTIRNKYPLPRINDLFDQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFR

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.8e-2644.44Show/hide
Query:  DLHGLPPNREIEFSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSVSPWGSPSLFVKKKDGTLRLCVDYRQLNKMTIRNKYPLPRINDLF
        D++ +P   +IE      PG       PY +  K+ +E+   +Q+L+D  +I PS SP  SP + V KKDGT RLCVDYR LNK TI + +PLPRI++L 
Subjt:  DLHGLPPNREIEFSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSVSPWGSPSLFVKKKDGTLRLCVDYRQLNKMTIRNKYPLPRINDLF

Query:  DQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFRT
         ++  A +F  +DLHSGYHQ+ ++  D  KTAF T
Subjt:  DQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFRT

Q99315 Transposon Ty3-G Gag-Pol polyprotein2.8e-2644.44Show/hide
Query:  DLHGLPPNREIEFSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSVSPWGSPSLFVKKKDGTLRLCVDYRQLNKMTIRNKYPLPRINDLF
        D++ +P   +IE      PG       PY +  K+ +E+   +Q+L+D  +I PS SP  SP + V KKDGT RLCVDYR LNK TI + +PLPRI++L 
Subjt:  DLHGLPPNREIEFSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSVSPWGSPSLFVKKKDGTLRLCVDYRQLNKMTIRNKYPLPRINDLF

Query:  DQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFRT
         ++  A +F  +DLHSGYHQ+ ++  D  KTAF T
Subjt:  DQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFRT

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTAGGGGTGGCAAAAAGCGTAGGCTTAAGGACCTTAAAACTCAACAAATTCCTGGTGGACGAGATTTAGAAAATGAAGAATCAGAGGGTGAGTCCAGTCACCAACA
GGGTGATATAACTATAGATGAACAGTTGTCATTGAATAACAATCTAGAAAAGACTTTTGGTATAAAACGATTAAAAGCACTAGGTGCAACTACGTTTGCGAGGACAACGG
ATCCTGCTGATGCTGAAGGATGGATGAATGTGTTGGAAAAATGTTTTAAAGTAATGAGATGTCTGGATGATAGGAAAGTAGAACTTGCTACATTTTTGCTTAAAAAAGAA
GCAGATTATTGGTGGAGAACATTCGAAAGTCAATATCATGATACAAATGAAATCAACTGGAATGAGTTTAAAAATGCGTTCTATGAGAAATATTATCCACGATCATATAA
GGATGCAAAGCGATCTGAATTTTTCATGATTGAAGATGAGATTGACAAAAGATTTGAAGATGGGTTGCGAGAAGAAATTCAAACTCCTGTTACAGCTAGTTCTGAGTGGA
AAGAATTTTCGAAGTTAGTGGAAACGGTTATAAGAGTTGAGAAAAGTTTGGCAGAGAGAAAGAATGAAAAAGAAGTCTCGAGAAATACAAGGCATCGCGGGTCATTCGAT
TATGGACCTAACACGAGTGCAACATCGCAAGTGTCAAGCAGCTCTAATAGAACTACAGGGAACTCAGGAGGTCCACGTATGTGGCAACAGCGAGAACCAGTGTCCAGCCA
TAATAAAAATCCTACATGTTTAAATTGTGGTAAATCTCACACAGGTGAATGTTTGTGGGGAGCTAATGTTTGCTATAATTGTGGTCAACCCGGACACTTTCGCAAGGAAT
GCCCTCAGCTACATAAGGATCGTAAGAAAGGGCAAGCTGGCAGACCTCGAGTTCAAGGAATGGTTTTTGCCATGACACAACAAGAAGCTCATAAGACACCAGAGATAGTG
ACGGGTATGTTATCTATCCTTGGAAAATATGCTCGTGTTTTAATAGACCCTGGTGCGACACATTCATTTATTTCTACCATGTTCACTGCTCGAATTGATAAGAATTTGGA
ATTATTGCCTAGTGAATTACTAATTGCAACACCTGTTGGCAAAACCCTAGTAGCAAATAATGTTTATCGTAATTGTGTCGCCCAAATAGATGGTAAAGAGTTAACTGCTA
ACTTGATACCACTTGATATCCATGAATTTGATGTTATATTGGGAATGGATTTTCTATCTGTACATCATGCATCGATTGACTGTCATCGGAAAGAGGTTGCATTTAGACGA
CCTGGTGAAGATGTAATCCATTTTAGTGGGAGTAAACCAAGATTGGAAGATATTCCAGTTGTGCAAGAATTTTCAGATGATTTACATGGTTTACCTCCAAATAGAGAGAT
TGAATTTTCAATTGATCTAATTCCTGGAACAGCACCAATATCACAACCTCCATACAGAATGGCACCTAAAGACCTGAAAGAATTGAAGACGCAGTTACAAGAGTTGGTAG
ACAAGGGTTATATCCAACCGAGTGTATCACCATGGGGTTCACCATCGCTGTTTGTGAAAAAGAAGGATGGCACCTTGAGGTTGTGTGTTGATTACAGACAATTGAACAAG
ATGACTATACGCAATAAATATCCATTGCCTCGAATAAATGATTTATTCGATCAGCTTAAAGGTGCCTCAGTATTTTTAAAAGTTGATCTACATTCAGGATATCATCAACT
TAAGATAAAGGAAGTAGATGTTCCAAAAACAGCATTCAGAACATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTAGGGGTGGCAAAAAGCGTAGGCTTAAGGACCTTAAAACTCAACAAATTCCTGGTGGACGAGATTTAGAAAATGAAGAATCAGAGGGTGAGTCCAGTCACCAACA
GGGTGATATAACTATAGATGAACAGTTGTCATTGAATAACAATCTAGAAAAGACTTTTGGTATAAAACGATTAAAAGCACTAGGTGCAACTACGTTTGCGAGGACAACGG
ATCCTGCTGATGCTGAAGGATGGATGAATGTGTTGGAAAAATGTTTTAAAGTAATGAGATGTCTGGATGATAGGAAAGTAGAACTTGCTACATTTTTGCTTAAAAAAGAA
GCAGATTATTGGTGGAGAACATTCGAAAGTCAATATCATGATACAAATGAAATCAACTGGAATGAGTTTAAAAATGCGTTCTATGAGAAATATTATCCACGATCATATAA
GGATGCAAAGCGATCTGAATTTTTCATGATTGAAGATGAGATTGACAAAAGATTTGAAGATGGGTTGCGAGAAGAAATTCAAACTCCTGTTACAGCTAGTTCTGAGTGGA
AAGAATTTTCGAAGTTAGTGGAAACGGTTATAAGAGTTGAGAAAAGTTTGGCAGAGAGAAAGAATGAAAAAGAAGTCTCGAGAAATACAAGGCATCGCGGGTCATTCGAT
TATGGACCTAACACGAGTGCAACATCGCAAGTGTCAAGCAGCTCTAATAGAACTACAGGGAACTCAGGAGGTCCACGTATGTGGCAACAGCGAGAACCAGTGTCCAGCCA
TAATAAAAATCCTACATGTTTAAATTGTGGTAAATCTCACACAGGTGAATGTTTGTGGGGAGCTAATGTTTGCTATAATTGTGGTCAACCCGGACACTTTCGCAAGGAAT
GCCCTCAGCTACATAAGGATCGTAAGAAAGGGCAAGCTGGCAGACCTCGAGTTCAAGGAATGGTTTTTGCCATGACACAACAAGAAGCTCATAAGACACCAGAGATAGTG
ACGGGTATGTTATCTATCCTTGGAAAATATGCTCGTGTTTTAATAGACCCTGGTGCGACACATTCATTTATTTCTACCATGTTCACTGCTCGAATTGATAAGAATTTGGA
ATTATTGCCTAGTGAATTACTAATTGCAACACCTGTTGGCAAAACCCTAGTAGCAAATAATGTTTATCGTAATTGTGTCGCCCAAATAGATGGTAAAGAGTTAACTGCTA
ACTTGATACCACTTGATATCCATGAATTTGATGTTATATTGGGAATGGATTTTCTATCTGTACATCATGCATCGATTGACTGTCATCGGAAAGAGGTTGCATTTAGACGA
CCTGGTGAAGATGTAATCCATTTTAGTGGGAGTAAACCAAGATTGGAAGATATTCCAGTTGTGCAAGAATTTTCAGATGATTTACATGGTTTACCTCCAAATAGAGAGAT
TGAATTTTCAATTGATCTAATTCCTGGAACAGCACCAATATCACAACCTCCATACAGAATGGCACCTAAAGACCTGAAAGAATTGAAGACGCAGTTACAAGAGTTGGTAG
ACAAGGGTTATATCCAACCGAGTGTATCACCATGGGGTTCACCATCGCTGTTTGTGAAAAAGAAGGATGGCACCTTGAGGTTGTGTGTTGATTACAGACAATTGAACAAG
ATGACTATACGCAATAAATATCCATTGCCTCGAATAAATGATTTATTCGATCAGCTTAAAGGTGCCTCAGTATTTTTAAAAGTTGATCTACATTCAGGATATCATCAACT
TAAGATAAAGGAAGTAGATGTTCCAAAAACAGCATTCAGAACATGA
Protein sequenceShow/hide protein sequence
MSRGGKKRRLKDLKTQQIPGGRDLENEESEGESSHQQGDITIDEQLSLNNNLEKTFGIKRLKALGATTFARTTDPADAEGWMNVLEKCFKVMRCLDDRKVELATFLLKKE
ADYWWRTFESQYHDTNEINWNEFKNAFYEKYYPRSYKDAKRSEFFMIEDEIDKRFEDGLREEIQTPVTASSEWKEFSKLVETVIRVEKSLAERKNEKEVSRNTRHRGSFD
YGPNTSATSQVSSSSNRTTGNSGGPRMWQQREPVSSHNKNPTCLNCGKSHTGECLWGANVCYNCGQPGHFRKECPQLHKDRKKGQAGRPRVQGMVFAMTQQEAHKTPEIV
TGMLSILGKYARVLIDPGATHSFISTMFTARIDKNLELLPSELLIATPVGKTLVANNVYRNCVAQIDGKELTANLIPLDIHEFDVILGMDFLSVHHASIDCHRKEVAFRR
PGEDVIHFSGSKPRLEDIPVVQEFSDDLHGLPPNREIEFSIDLIPGTAPISQPPYRMAPKDLKELKTQLQELVDKGYIQPSVSPWGSPSLFVKKKDGTLRLCVDYRQLNK
MTIRNKYPLPRINDLFDQLKGASVFLKVDLHSGYHQLKIKEVDVPKTAFRT