; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G24400 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G24400
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr3:21470553..21474131
RNA-Seq ExpressionCSPI03G24400
SyntenyCSPI03G24400
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046851.1 uncharacterized protein E6C27_scaffold19358G00020 [Cucumis melo var. makuwa]7.3e-30855.72Show/hide
Query:  FTSNYSNSGVGRMWVMWKRNRFVFTTHEVTDQFISSLVTDLISGESRKVMCVYASNSNIERRDLWRRMTQIFAGWRGPGMVMGDFSTIIIHSEAFEGAPN
        ++ +YSNSGVGR+WVMWK+ RF F TH + ++F+   +TDL+ G   +V+CVYASNS+ ERR LWR + +I + W   G+VMGDF+ I +HSEAF G+P 
Subjt:  FTSNYSNSGVGRMWVMWKRNRFVFTTHEVTDQFISSLVTDLISGESRKVMCVYASNSNIERRDLWRRMTQIFAGWRGPGMVMGDFSTIIIHSEAFEGAPN

Query:  CGDMEEFDMAIGEADLVEPSVQENWFTWTSKIHGSGLMRRLDRILVNDEGLSAWPNMRVNVLPWGISDHSSILVYPSYQWSQRVVSFRFFNHWVEDSSFL
         G+MEEFD+AI +ADLVEPSVQ NWFTWTSK+ GSG++RRLDR+LVNDE LSAWP MR+NVLPWGISDHS IL YPS+Q + RVVSFRFFNHWVE+ SF+
Subjt:  CGDMEEFDMAIGEADLVEPSVQENWFTWTSKIHGSGLMRRLDRILVNDEGLSAWPNMRVNVLPWGISDHSSILVYPSYQWSQRVVSFRFFNHWVEDSSFL

Query:  DVVSSVWTRDSRVSSIVSFVKNLRNLKPILRNHFDKHIQTISEDVRLAKDTMDQAQREVEINPLSEELSNQASLATVNFW--------------------
        +VV+ +W+R   VS +VS ++NL +LKPILR  F +HI+++SE+V +AK+ MD AQREVE NPLS+ LS QASLAT  FW                    
Subjt:  DVVSSVWTRDSRVSSIVSFVKNLRNLKPILRNHFDKHIQTISEDVRLAKDTMDQAQREVEINPLSEELSNQASLATVNFW--------------------

Query:  ---------RMFVRSRQSSNALRSVVDPNGTQLTNHDQVTQVVDNYLKNTLGFQNIGYKELSPSIEDIVQFRWTDECYQALQALIGREEVRRVMFFMDSG
                    VRSR S N+L S+VD +G+++++HD V Q+  NY  N+LG Q IGY+ELSP I+DIVQF+W++EC QALQ  I REEVRRV+F MDSG
Subjt:  ---------RMFVRSRQSSNALRSVVDPNGTQLTNHDQVTQVVDNYLKNTLGFQNIGYKELSPSIEDIVQFRWTDECYQALQALIGREEVRRVMFFMDSG

Query:  KAPGPDGYS-------------DFCDAILHFFETNYFPQGVNTTAITLIPKRNGADRLEDF--------------------------------ISTFISW
        KAPGPDG+S             DFC+A+LHFFET Y P GVN TAITLIPK  GA+RLEDF                                 S FI  
Subjt:  KAPGPDGYS-------------DFCDAILHFFETNYFPQGVNTTAITLIPKRNGADRLEDF--------------------------------ISTFISW

Query:  RSIIDNILLCLELLGRYHMNTRKPRCTMKVDLQKAYDSVNWDFLFGLLIAIETPLRFVSW----------------------------------------
        RSII+NILLC EL+G YH+N+ KPRCT+KVDLQKAYDSVNWDFLFGLLIAI TPL+FVSW                                        
Subjt:  RSIIDNILLCLELLGRYHMNTRKPRCTMKVDLQKAYDSVNWDFLFGLLIAIETPLRFVSW----------------------------------------

Query:  ------------------FHQFCEKVRITHLTFADDLIIFCATEKYSMSFIKETIQRFGELLGLFANLRKSSIFLVGVNSVEASPLEA------------
                          FH  CEKV++THLTFADDL+IFCA ++ S+SFI+E +Q+FGE  GLFAN RKSSIF+VGVN+  AS L A            
Subjt:  ------------------FHQFCEKVRITHLTFADDLIIFCATEKYSMSFIKETIQRFGELLGLFANLRKSSIFLVGVNSVEASPLEA------------

Query:  -------RRLQSSDCDSLIQWITSRIRSWSARVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKILRSYLWRGKEEGRGGVKVAWNEVCLPFDE
                 L+S DC  LIQ ITSRIRSW+ARVLSFAGRLQLV SVLRSLQVYWASVFVLPA  H +VDKILRSYLWRGKEEGRGG+KVAW +VCLPF+E
Subjt:  -------RRLQSSDCDSLIQWITSRIRSWSARVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKILRSYLWRGKEEGRGGVKVAWNEVCLPFDE

Query:  EGFAICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKGRSLWEIDVGVGRSWCFRVILRKRDFLKDHVRMEVGDGMRSR-------------VWLD
         G  I DG S NIA+T+KI   LL   GSLWVAW+EAYILKG+SLW++D  VGRSWC R ILRKR+ +K HV   V     SR              WL 
Subjt:  EGFAICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKGRSLWEIDVGVGRSWCFRVILRKRDFLKDHVRMEVGDGMRSR-------------VWLD

Query:  PWLQGGLIIQQYGKRGVRPCESVEDRWVWIPGSHDGFLVTSAWETIRPRSIRVGWSGLLWGGGNIPYHSFCAWLAIRDRLDIRDRLNRWDRSIPLSCDRV
        P +   LI      + V PC SV D WVW+PG   GF + SAWE I PR  RV W GLLWGGGNIP HSFCAWLAI+DRLD RDRL+RWD SIPLSC  +
Subjt:  PWLQGGLIIQQYGKRGVRPCESVEDRWVWIPGSHDGFLVTSAWETIRPRSIRVGWSGLLWGGGNIPYHSFCAWLAIRDRLDIRDRLNRWDRSIPLSCDRV

Query:  LRGGV
         +GGV
Subjt:  LRGGV

KAA0057642.1 reverse transcriptase [Cucumis melo var. makuwa]2.9e-24848.77Show/hide
Query:  NSFGSLMEEEKGDKWALSIVDGSPLPLRVDNSSMVLLSNPCDDNPLGAQAPIGSYDYLFGRRILTRSLVGLVIRGG----FTSNYSNSGVGRMWVMWKRN
        NSFGSL+E    DKWALSI++GSP   R      + L +    + LG+ + +G   +L   R+   +   +  R G    ++ +YSNSGVGR+WVMWK+N
Subjt:  NSFGSLMEEEKGDKWALSIVDGSPLPLRVDNSSMVLLSNPCDDNPLGAQAPIGSYDYLFGRRILTRSLVGLVIRGG----FTSNYSNSGVGRMWVMWKRN

Query:  RFVFTTHEVTDQFISSLVTDLISGESRKVMCVYASNSNIERRDLWRRMTQIFAGWRGPGMVMGDFSTIIIHSEAFEGAPNCGDMEEFDMAIGEADLVEPS
        RF F+TH   +QFI+  V                                          VM DF+ I  HSEA  G+P  G+ME+FDMAI +ADLVEPS
Subjt:  RFVFTTHEVTDQFISSLVTDLISGESRKVMCVYASNSNIERRDLWRRMTQIFAGWRGPGMVMGDFSTIIIHSEAFEGAPNCGDMEEFDMAIGEADLVEPS

Query:  VQENWFTWTSKIHGSGLMRRLDRILVNDEGLSAWPNMRVNVLPWGISDHSSILVYPSYQWSQRVVSFRFFNHWVEDSSFLDVVSSVWTRDSRVSSIVSFV
        VQ NWFTWTSK+ GSG+MRRLDR+L+ND+ LSAWP M VNVLPWGISDHS IL+YPS+Q + +VVSFR FNHWV+D SFL                    
Subjt:  VQENWFTWTSKIHGSGLMRRLDRILVNDEGLSAWPNMRVNVLPWGISDHSSILVYPSYQWSQRVVSFRFFNHWVEDSSFLDVVSSVWTRDSRVSSIVSFV

Query:  KNLRNLKPILRNHFDKHIQTISEDVRLAKDTMDQAQREVEINPLSEELSNQASLATVNFW-RMFVRSRQSSNALRSVVDPNGTQLTNHDQVTQVVDNYLK
                     F +HI+++SE+VR+AK+ MD AQREVE NP+S+ LS QASLAT  FW  + +   +    L  VVD   +++++HD V Q+  NY  
Subjt:  KNLRNLKPILRNHFDKHIQTISEDVRLAKDTMDQAQREVEINPLSEELSNQASLATVNFW-RMFVRSRQSSNALRSVVDPNGTQLTNHDQVTQVVDNYLK

Query:  NTLGFQNIGYKELSPSIEDIVQFRWTDECYQALQALIGREEVRRVMFFMDSGKAPGPDGYS-------------DFCDAILHFFETNYFPQGVNTTAITL
        N+LG Q IGY+EL+P I+DIVQF+W++EC QALQ  I REEVRRV+F MDSGKAPGPDG+S             DFCDA+LHFFET Y P GVN TAITL
Subjt:  NTLGFQNIGYKELSPSIEDIVQFRWTDECYQALQALIGREEVRRVMFFMDSGKAPGPDGYS-------------DFCDAILHFFETNYFPQGVNTTAITL

Query:  IPKRNGADRLEDF--------------------------------ISTFISWRSIIDNILLCLELLGRYHMNTRKPRCTMKVDLQKAYDSVNWDFLFGLL
        IPK NGA+RLEDF                                 S FI  RSII+NILLC EL+G YH+N+ KPRCT+KVDLQKAYDSVNWDFLFGL 
Subjt:  IPKRNGADRLEDF--------------------------------ISTFISWRSIIDNILLCLELLGRYHMNTRKPRCTMKVDLQKAYDSVNWDFLFGLL

Query:  IAIETPLRFVSWFHQFCEKVRIT---------------HLTFADDLIIFCATE------------KYSMSFIKETIQRFGELLGLFANLRKSSIFLVGVN
        I+I TPL+FVSW          +                +   D L +F                  S  F     +RFGEL GLFAN RKSSIF+ GVN
Subjt:  IAIETPLRFVSWFHQFCEKVRIT---------------HLTFADDLIIFCATE------------KYSMSFIKETIQRFGELLGLFANLRKSSIFLVGVN

Query:  SVEAS--------------------PLEARRLQSSDCDSLIQWITSRIRSWSARVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKILRSYLWR
        +  AS                    PL   RL+S+DC  LIQ ITSRIRS SARVLSFAGRLQLVCSVL SLQVYWA VFVLPA  H             
Subjt:  SVEAS--------------------PLEARRLQSSDCDSLIQWITSRIRSWSARVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKILRSYLWR

Query:  GKEEGRGGVKVAWNEVCLPFDEEGFAICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKGRSLWEIDVGVGRSWCFRVILRKRDFLKDHVRMEVGD
                            +E G  I DG++   AST+KILWL+L  SGSLWVAWVEAY+LKGRSLW++D  VGRSWC R ILRK++ LK HVRM+VG+
Subjt:  GKEEGRGGVKVAWNEVCLPFDEEGFAICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKGRSLWEIDVGVGRSWCFRVILRKRDFLKDHVRMEVGD

Query:  GMRSRVWLDPWLQGGLIIQQYGKRGVRPCES-----------VEDRWVWIPGSHDGFLVTSAWETIRPRSIRVGWSGLLWGGGNIPYHSFCAWLAIRDRL
        G R RVWLDPWLQ G I++Q G+R +    S            +  W+W  G   GF + SAWE IRPR  RV W GLLWGGGNIP HSFCAWLAI+DRL
Subjt:  GMRSRVWLDPWLQGGLIIQQYGKRGVRPCES-----------VEDRWVWIPGSHDGFLVTSAWETIRPRSIRVGWSGLLWGGGNIPYHSFCAWLAIRDRL

Query:  DIRDRLNRWDRSIPLSC
          RDR +RWD S+PLSC
Subjt:  DIRDRLNRWDRSIPLSC

TYK28099.1 uncharacterized protein E5676_scaffold1467G00020 [Cucumis melo var. makuwa]5.1e-24550.26Show/hide
Query:  MTNSFGSLMEEEKGDKWALSIVDGSPLPLRVDNSSMVLLSNPCDDNPLGAQAPIGSYDYLFGRRILTRSLVGLVIR----GGFTSNYSNSGVGRMWVMWK
        M NSFGSL+E    DKWALSI++GS  PL+VD  + VL          G  + I  +  L   R+   +   +  R      ++ +YSNSGVGR+WVMWK
Subjt:  MTNSFGSLMEEEKGDKWALSIVDGSPLPLRVDNSSMVLLSNPCDDNPLGAQAPIGSYDYLFGRRILTRSLVGLVIR----GGFTSNYSNSGVGRMWVMWK

Query:  RNRFVFTTHEVTDQFISSLVTDLISGESRKVMCVYASNSNIERRDLWRRMTQIFAGWRGPGMVMGDFSTIIIHSEAFEGAPNCGDMEEFDMAIGEADLVE
        +NRF F+TH + +QF++  +TDL+SG   +V CVYASNSNIERR LW R+ +I + W  PG+VM DF+ I +HSEAF G+P  G+ME+F++AI +ADLVE
Subjt:  RNRFVFTTHEVTDQFISSLVTDLISGESRKVMCVYASNSNIERRDLWRRMTQIFAGWRGPGMVMGDFSTIIIHSEAFEGAPNCGDMEEFDMAIGEADLVE

Query:  PSVQENWFTWTSKIHGSGLMRRLDRILVNDEGLSAWPNMRVNVLPWGISDHSSILVYPSYQWSQRVVSFRFFNHWVEDSSFLDVVSSVWTRDSRVSSIVS
        PSVQ NWFTWTSK+ GSG++RRLDR+LVND+ LS WP M VNVLPWGISDH  IL YPS+Q S +VVSFRFFNHWVED SF++VV+ +W+R   VS +V 
Subjt:  PSVQENWFTWTSKIHGSGLMRRLDRILVNDEGLSAWPNMRVNVLPWGISDHSSILVYPSYQWSQRVVSFRFFNHWVEDSSFLDVVSSVWTRDSRVSSIVS

Query:  FVKNLRNLKPILRNHFDKHIQTISEDVRLAKDTMDQAQREVEINPLSEELSNQASLATVNFWRMFVRSRQSSNALRSVVDPNGTQLTNHDQVTQVVDNYL
         ++NL  LKPILR  F +HI+ +SE+VR+ K+ MD AQRE+ +                                                      NY 
Subjt:  FVKNLRNLKPILRNHFDKHIQTISEDVRLAKDTMDQAQREVEINPLSEELSNQASLATVNFWRMFVRSRQSSNALRSVVDPNGTQLTNHDQVTQVVDNYL

Query:  KNTLGFQNIGYKELSPSIEDIVQFRWTDECYQALQALIGREEVRRVMFFMDSGKAPGPDGYSDFCDAILHFFETNYFPQGVNTTAITLIPKRNGADRLED
        +N+LG Q IGY+ELSP I+DIVQF+W++EC QALQ  I REEVRRV+F MDSGKAPGPDG+S                 G+N TAITLIPK NGA+RLED
Subjt:  KNTLGFQNIGYKELSPSIEDIVQFRWTDECYQALQALIGREEVRRVMFFMDSGKAPGPDGYSDFCDAILHFFETNYFPQGVNTTAITLIPKRNGADRLED

Query:  F--------------------------------ISTFISWRSIIDNILLCLELLGRYHMNTRKPRCTMKVDLQKAY---------DSVNWDFLFGLLIAI
        F                                 S FI  RSII+NILLC EL+     +   P   +   L+  +         D ++  FLF +++ +
Subjt:  F--------------------------------ISTFISWRSIIDNILLCLELLGRYHMNTRKPRCTMKVDLQKAY---------DSVNWDFLFGLLIAI

Query:  ------ETPLRFVSWFHQFCEKVRITHLTFADDLIIFCATEKYSMSFIKETIQRFGELLGLFANLRKSSIFLVGVNSVEAS-------------------
              + P  F   FH  CEKV++THLTFADDL+IFCA ++ S+ FI++ +Q+FGEL GLFAN RKSSIF+ GVN+  AS                   
Subjt:  ------ETPLRFVSWFHQFCEKVRITHLTFADDLIIFCATEKYSMSFIKETIQRFGELLGLFANLRKSSIFLVGVNSVEAS-------------------

Query:  -PLEARRLQSSDCDSLIQWITSRIRSWSARVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKILRSYLWRGKEEGRGGVKVAWNEVCLPFDEEG
         PL   RL+S+DC  LIQ ITS+IRSW+ARVLSFAGRLQLV SVLRSLQVYWASVFVLPA  H +VDKILRSYLWRGKEEGRGG+KVAW +VCLPF+E G
Subjt:  -PLEARRLQSSDCDSLIQWITSRIRSWSARVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKILRSYLWRGKEEGRGGVKVAWNEVCLPFDEEG

Query:  FAICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKGRSLWEIDVGVGRSWCFRVILRKRDFLKDHVRMEVGDGMRSR---VWLDPWLQGGLIIQQY
        F I DG S NIAST+KILWL+L  SGSLWVAWVEAYILKGRSLW++D  VGRSWC   ILR               G  SR   VWL P +   LI    
Subjt:  FAICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKGRSLWEIDVGVGRSWCFRVILRKRDFLKDHVRMEVGDGMRSR---VWLDPWLQGGLIIQQY

Query:  GKRGVRPCESVEDRWVWIPGSHDGFLVTSAWETIRPRSIRVGWSG
          + V PC SV D  VW+ G   GF ++SAWE IRPR  RV W G
Subjt:  GKRGVRPCESVEDRWVWIPGSHDGFLVTSAWETIRPRSIRVGWSG

XP_031737043.1 uncharacterized protein LOC116402131 [Cucumis sativus]4.1e-27157.01Show/hide
Query:  MTQIFAGWRGPGMVMGDFSTIIIHSEAFEGAPNCGDMEEFDMAIGEADLVEPSVQENWFTWTSKIHGSGLMRRLDRILVNDEGLSAWPNMRVNVLPWGIS
        M +I AGWRG G+VMGDF+TI +HSEAF GA N GDMEE DM I EADLVEP+VQ NWFTWT+KIHG GLMRRLDRILVNDEGLS WPNMRVNVLPWGIS
Subjt:  MTQIFAGWRGPGMVMGDFSTIIIHSEAFEGAPNCGDMEEFDMAIGEADLVEPSVQENWFTWTSKIHGSGLMRRLDRILVNDEGLSAWPNMRVNVLPWGIS

Query:  DHSSILVYPSYQWSQRVVSFRFFNHWVEDSSFLDVVSSVWTRDSRVSSIVSFVKNLRNLKPILRNHFDKHIQTISEDVRLAKDTMDQAQREVEINPLSEE
        +HS ILVYPS Q SQ VVSFRFFNHWV++SSF+DVVSS WT+D+RVS IV+ V+NLRNLK ILRNHF KHI+TISEDV L +          E+    + 
Subjt:  DHSSILVYPSYQWSQRVVSFRFFNHWVEDSSFLDVVSSVWTRDSRVSSIVSFVKNLRNLKPILRNHFDKHIQTISEDVRLAKDTMDQAQREVEINPLSEE

Query:  LSNQASLATVNFWRMFVRSRQSSNALRSVVDPNGTQLTNHDQVTQVVDNYLKNTLGFQNIGYKELSPSIEDIVQFRWTDECYQALQALIGREEVRRVMFF
           + S          V++RQS NAL S++DP+G +LTNHDQV+                                      QALQ  IGREEVRR +F 
Subjt:  LSNQASLATVNFWRMFVRSRQSSNALRSVVDPNGTQLTNHDQVTQVVDNYLKNTLGFQNIGYKELSPSIEDIVQFRWTDECYQALQALIGREEVRRVMFF

Query:  MDSGKAPGPDGYSDFCDAILHFFE-----TNYFPQGVNTTAITLIPKRNGADRLEDF--------------------------------ISTFISWRSII
        MDSGKAPGPDGYS      + FF+     TNYFPQ VNT AITLIPKRNGADRLEDF                                 S FI  RSII
Subjt:  MDSGKAPGPDGYSDFCDAILHFFE-----TNYFPQGVNTTAITLIPKRNGADRLEDF--------------------------------ISTFISWRSII

Query:  DNILLCLELLGRYHMNTRKPRCTMKVDLQKAYDSVNWDFLFGLLIAIETPLRFVSW--------------------------------------------
        DNILLC EL+G YH++   PRCT+KVDLQKAYDSVNWDFLFGLLIAI   +RFVSW                                            
Subjt:  DNILLCLELLGRYHMNTRKPRCTMKVDLQKAYDSVNWDFLFGLLIAIETPLRFVSW--------------------------------------------

Query:  --------------FHQFCEKVRITHLTFADDLIIFCATEKYSMSFIKETIQRFGELLGLFANLRKSSIFLVGVNSVEASPLEAR---------------
                      FHQFCEKV++THLTFADDL+IFCA + YSMSFIKETI+RFGEL GLFANL KSSIFLVGVNS +AS L A                
Subjt:  --------------FHQFCEKVRITHLTFADDLIIFCATEKYSMSFIKETIQRFGELLGLFANLRKSSIFLVGVNSVEASPLEAR---------------

Query:  -----RLQSSDCDSLIQWITSRIRSWSARVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKILRSYLWRGKEEGRGGVKVAWNEVCLPFDEEGF
             RLQS DCD LIQ ITSRIRSWSARVLSFAGRLQLV SVLRSLQVYWASVF+LP K HRDVDKILRSYLWRGKEEGRGG KVAW+EVCLPFDE G 
Subjt:  -----RLQSSDCDSLIQWITSRIRSWSARVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKILRSYLWRGKEEGRGGVKVAWNEVCLPFDEEGF

Query:  AICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKGRSL--WEIDVGVGRSWC-----FRVILRKRDFLKDHVRMEVGDGMRSRVWLDPWLQGGLII
        AI DGSS NIAST+KILWLLLVKSGSLWVAWVEAYILKGRS+  W +D+GV    C      ++ILR R  +     +E G           W+QGG II
Subjt:  AICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKGRSL--WEIDVGVGRSWC-----FRVILRKRDFLKDHVRMEVGDGMRSRVWLDPWLQGGLII

Query:  QQYGKR----------------------------------------GVRPCESVEDRWVWIPGSHDGFLVTSAWETIRPRSIRVGWSGLLWGGGNIPYHS
        QQ+G+R                                        GVRP  SVEDRWVW+PGS D F + SAWETIRP S RVGWSGLLW  GNIP HS
Subjt:  QQYGKR----------------------------------------GVRPCESVEDRWVWIPGSHDGFLVTSAWETIRPRSIRVGWSGLLWGGGNIPYHS

Query:  FCAWLAIRDRLDIRDRLNRWDRSIPLSCDRVLRGG
        F AWLAIRDRL  RDRL++WDRSIPLSC  +L GG
Subjt:  FCAWLAIRDRLDIRDRLNRWDRSIPLSCDRVLRGG

XP_031740402.1 uncharacterized protein LOC116403409 [Cucumis sativus]1.2e-24665.48Show/hide
Query:  MRVNVLPWGISDHSSILVYPSYQWSQRVVSFRFFNHWVEDSSFLDVVSSVWTRDSRVSSIVSFVKNLRNLKPILRNHFDKHIQTISEDVRLAKDTMDQAQ
        MRVNVLPWGISDHS ILVYPS Q SQ+VVSFRFFNHWV+++SF+DVVSS WT+D+RVS IV+ V+NLRNLK ILR HF +HI+TISEDVRLA DTMD+A+
Subjt:  MRVNVLPWGISDHSSILVYPSYQWSQRVVSFRFFNHWVEDSSFLDVVSSVWTRDSRVSSIVSFVKNLRNLKPILRNHFDKHIQTISEDVRLAKDTMDQAQ

Query:  REVEINPLSEELSNQASLATVNFWRMF-----------------------------VRSRQSSNALRSVVDPNGTQLTNHDQVTQVVDNYLKNTLGFQNI
        RE+E N LSEE SN ASLATVNFW+                               VRSRQSSNALRSV+DP+G +LTNHDQVTQVV NY K++LG QNI
Subjt:  REVEINPLSEELSNQASLATVNFWRMF-----------------------------VRSRQSSNALRSVVDPNGTQLTNHDQVTQVVDNYLKNTLGFQNI

Query:  GYKELSPSIEDIVQFRWTDECYQALQALIGREEVRRVMFFMDSGKAPGPDGYS-------------DFCDAILHFFETNYFPQGVNTTAITLIPKRNGAD
         Y ELS SIE+IVQFRWT+EC QALQ+ IGREEVRRV+F MD GKAPGPDGYS              FCD +LHFFETNYFPQGVNTTAITLIPKRNGAD
Subjt:  GYKELSPSIEDIVQFRWTDECYQALQALIGREEVRRVMFFMDSGKAPGPDGYS-------------DFCDAILHFFETNYFPQGVNTTAITLIPKRNGAD

Query:  RLEDFI--------------------------------STFISWRSIIDNILLCLELLGRYHMNTRKPRCTMKVDLQKAYDSVNWDFLFGLLIAIETPLR
        RLEDF                                   FI  RSIIDNILLC EL+G YH++  KPRCTMKVDLQKAYD VNWDFLFGLLIAI     
Subjt:  RLEDFI--------------------------------STFISWRSIIDNILLCLELLGRYHMNTRKPRCTMKVDLQKAYDSVNWDFLFGLLIAIETPLR

Query:  FVSWFHQFCEKVRITHLTFADDLIIFCATEKYSMSFIKETIQRFGELLGLFANLRKSSIFLVGVNSVEAS--------------------PLEARRLQSS
                            DDL+IFC  + +SMSFIKETI+RFGEL GLFANL KS IFLVGVNS +AS                    PL +RRL+SS
Subjt:  FVSWFHQFCEKVRITHLTFADDLIIFCATEKYSMSFIKETIQRFGELLGLFANLRKSSIFLVGVNSVEAS--------------------PLEARRLQSS

Query:  DCDSLIQWITSRIRSWSARVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKILRSYLWRGKEEGRGGVKVAWNEVCLPFDEEGFAICDGSSCNI
        DCD LIQ ITSRIRSWSARVLSFAGRLQLV SVLRSLQVYWASVF+LP K HRDVDKILR+YLWRG EEGRGG KVAW+EVCLPFDE G  I DGSS NI
Subjt:  DCDSLIQWITSRIRSWSARVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKILRSYLWRGKEEGRGGVKVAWNEVCLPFDEEGFAICDGSSCNI

Query:  ASTMKILWLLLVKSGSLWVAWVEAYILKGRSLWEIDVGVGRSWCFRVILRKRDFLKDHVRMEVGDGMRSRVWLDPWLQGGLIIQQYGKRGVRPCESVEDR
        AST+KILWLLLVKSGSLWVAWVE+YILKGRSL EID GV RSWCFR ILRKRD LKDHV+MEVG+G + RVWL PW+QGG IIQQ+G+R +    S   R
Subjt:  ASTMKILWLLLVKSGSLWVAWVEAYILKGRSLWEIDVGVGRSWCFRVILRKRDFLKDHVRMEVGDGMRSRVWLDPWLQGGLIIQQYGKRGVRPCESVEDR

Query:  W
        W
Subjt:  W

TrEMBL top hitse value%identityAlignment
A0A5A7SPE5 Reverse transcriptase domain-containing protein2.5e-24550.26Show/hide
Query:  MTNSFGSLMEEEKGDKWALSIVDGSPLPLRVDNSSMVLLSNPCDDNPLGAQAPIGSYDYLFGRRILTRSLVGLVIR----GGFTSNYSNSGVGRMWVMWK
        M NSFGSL+E    DKWALSI++GS  PL+VD  + VL          G  + I  +  L   R+   +   +  R      ++ +YSNSGVGR+WVMWK
Subjt:  MTNSFGSLMEEEKGDKWALSIVDGSPLPLRVDNSSMVLLSNPCDDNPLGAQAPIGSYDYLFGRRILTRSLVGLVIR----GGFTSNYSNSGVGRMWVMWK

Query:  RNRFVFTTHEVTDQFISSLVTDLISGESRKVMCVYASNSNIERRDLWRRMTQIFAGWRGPGMVMGDFSTIIIHSEAFEGAPNCGDMEEFDMAIGEADLVE
        +NRF F+TH + +QF++  +TDL+SG   +V CVYASNSNIERR LW R+ +I + W  PG+VM DF+ I +HSEAF G+P  G+ME+F++AI +ADLVE
Subjt:  RNRFVFTTHEVTDQFISSLVTDLISGESRKVMCVYASNSNIERRDLWRRMTQIFAGWRGPGMVMGDFSTIIIHSEAFEGAPNCGDMEEFDMAIGEADLVE

Query:  PSVQENWFTWTSKIHGSGLMRRLDRILVNDEGLSAWPNMRVNVLPWGISDHSSILVYPSYQWSQRVVSFRFFNHWVEDSSFLDVVSSVWTRDSRVSSIVS
        PSVQ NWFTWTSK+ GSG++RRLDR+LVND+ LS WP M VNVLPWGISDH  IL YPS+Q S +VVSFRFFNHWVED SF++VV+ +W+R   VS +V 
Subjt:  PSVQENWFTWTSKIHGSGLMRRLDRILVNDEGLSAWPNMRVNVLPWGISDHSSILVYPSYQWSQRVVSFRFFNHWVEDSSFLDVVSSVWTRDSRVSSIVS

Query:  FVKNLRNLKPILRNHFDKHIQTISEDVRLAKDTMDQAQREVEINPLSEELSNQASLATVNFWRMFVRSRQSSNALRSVVDPNGTQLTNHDQVTQVVDNYL
         ++NL  LKPILR  F +HI+ +SE+VR+ K+ MD AQRE+ +                                                      NY 
Subjt:  FVKNLRNLKPILRNHFDKHIQTISEDVRLAKDTMDQAQREVEINPLSEELSNQASLATVNFWRMFVRSRQSSNALRSVVDPNGTQLTNHDQVTQVVDNYL

Query:  KNTLGFQNIGYKELSPSIEDIVQFRWTDECYQALQALIGREEVRRVMFFMDSGKAPGPDGYSDFCDAILHFFETNYFPQGVNTTAITLIPKRNGADRLED
        +N+LG Q IGY+ELSP I+DIVQF+W++EC QALQ  I REEVRRV+F MDSGKAPGPDG+S                 G+N TAITLIPK NGA+RLED
Subjt:  KNTLGFQNIGYKELSPSIEDIVQFRWTDECYQALQALIGREEVRRVMFFMDSGKAPGPDGYSDFCDAILHFFETNYFPQGVNTTAITLIPKRNGADRLED

Query:  F--------------------------------ISTFISWRSIIDNILLCLELLGRYHMNTRKPRCTMKVDLQKAY---------DSVNWDFLFGLLIAI
        F                                 S FI  RSII+NILLC EL+     +   P   +   L+  +         D ++  FLF +++ +
Subjt:  F--------------------------------ISTFISWRSIIDNILLCLELLGRYHMNTRKPRCTMKVDLQKAY---------DSVNWDFLFGLLIAI

Query:  ------ETPLRFVSWFHQFCEKVRITHLTFADDLIIFCATEKYSMSFIKETIQRFGELLGLFANLRKSSIFLVGVNSVEAS-------------------
              + P  F   FH  CEKV++THLTFADDL+IFCA ++ S+ FI++ +Q+FGEL GLFAN RKSSIF+ GVN+  AS                   
Subjt:  ------ETPLRFVSWFHQFCEKVRITHLTFADDLIIFCATEKYSMSFIKETIQRFGELLGLFANLRKSSIFLVGVNSVEAS-------------------

Query:  -PLEARRLQSSDCDSLIQWITSRIRSWSARVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKILRSYLWRGKEEGRGGVKVAWNEVCLPFDEEG
         PL   RL+S+DC  LIQ ITS+IRSW+ARVLSFAGRLQLV SVLRSLQVYWASVFVLPA  H +VDKILRSYLWRGKEEGRGG+KVAW +VCLPF+E G
Subjt:  -PLEARRLQSSDCDSLIQWITSRIRSWSARVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKILRSYLWRGKEEGRGGVKVAWNEVCLPFDEEG

Query:  FAICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKGRSLWEIDVGVGRSWCFRVILRKRDFLKDHVRMEVGDGMRSR---VWLDPWLQGGLIIQQY
        F I DG S NIAST+KILWL+L  SGSLWVAWVEAYILKGRSLW++D  VGRSWC   ILR               G  SR   VWL P +   LI    
Subjt:  FAICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKGRSLWEIDVGVGRSWCFRVILRKRDFLKDHVRMEVGDGMRSR---VWLDPWLQGGLIIQQY

Query:  GKRGVRPCESVEDRWVWIPGSHDGFLVTSAWETIRPRSIRVGWSG
          + V PC SV D  VW+ G   GF ++SAWE IRPR  RV W G
Subjt:  GKRGVRPCESVEDRWVWIPGSHDGFLVTSAWETIRPRSIRVGWSG

A0A5A7TKU4 Non-LTR retroelement reverse transcriptase-like protein3.3e-24246.81Show/hide
Query:  MTMTNSFGSLMEEEKGDKWALSIVDGSPLPLRVDNSSMVLLSNPCDDNPLGAQAPIGSYDYLFGRRILTRSLVGLVIRGG----FTSNYSNSGVGRMWVM
        +TM +SFGSL+E    DKWAL+IV+ SPLP++V   ++         + LG+ +    +  L   R+   +   +  R G    F+ +YSNS +GR+WVM
Subjt:  MTMTNSFGSLMEEEKGDKWALSIVDGSPLPLRVDNSSMVLLSNPCDDNPLGAQAPIGSYDYLFGRRILTRSLVGLVIRGG----FTSNYSNSGVGRMWVM

Query:  WKRNRFVFTTHEVTDQFISSLVTDLISGESRKVMCVYASNSNIERRDLWRRMTQIFAGWRGPGMVMGDFSTIIIHSEAFEGAPNCGDMEEFDMAIGEADL
        WK+NRF F+T  V  QF++  + DL+ G   +V CVYASNSNIERR LWR++ +I +GW  PG+VMGDF  II+HSEAFE AP  G+ME FD+AI +ADL
Subjt:  WKRNRFVFTTHEVTDQFISSLVTDLISGESRKVMCVYASNSNIERRDLWRRMTQIFAGWRGPGMVMGDFSTIIIHSEAFEGAPNCGDMEEFDMAIGEADL

Query:  VEPSVQENWFTWTSKIHGSGLMRRLDRILVNDEGLSAWPNMRVNVLPWGISDHSSILVYPSYQWSQRVVSFRFFNHWVEDSSFLDVVSSVWTRDSRVSSI
        +EP VQ                       VND+ L AWPN+ VNVL WGISDHS IL YPS+Q ++RV SFRFFNHWVED SF  VV  +W R   VS +
Subjt:  VEPSVQENWFTWTSKIHGSGLMRRLDRILVNDEGLSAWPNMRVNVLPWGISDHSSILVYPSYQWSQRVVSFRFFNHWVEDSSFLDVVSSVWTRDSRVSSI

Query:  VSFVKNLRNLKPILRNHFDKHIQTISEDVRLAKDTMDQAQREVEINPLSEELSNQASLATVNFWRMFVRSRQSSNALRSVV------DPNGTQLTNHDQV
        VSF++NL+  KPIL  HF                                       LAT  FW M VR  ++S   +S +      D N T        
Subjt:  VSFVKNLRNLKPILRNHFDKHIQTISEDVRLAKDTMDQAQREVEINPLSEELSNQASLATVNFWRMFVRSRQSSNALRSVV------DPNGTQLTNHDQV

Query:  TQVVDNYLKNTLGFQNIGYKELSPSIEDIVQFRWTDECYQALQALIGREEVRRVMFFMDSGKAPGPDGYSDFCDAILHFFETNYFPQGVNTTAITLIPKR
          +  NY +N+LG Q IGY+ELSP I+ IVQFRW+ ECYQALQ  I  EEVRRV+F MDSGKAPGPDG+S              F +GVN T +TLIPKR
Subjt:  TQVVDNYLKNTLGFQNIGYKELSPSIEDIVQFRWTDECYQALQALIGREEVRRVMFFMDSGKAPGPDGYSDFCDAILHFFETNYFPQGVNTTAITLIPKR

Query:  NGADRLEDF--------ISTFISWRSIIDNILLCLEL---LGRYHMNTRKPRCTMKVDLQKAYDSVNWDFLFGLLIAIETPLRFVSW-------------
         GA+R+E+F        I   IS + + D + + L     +  YH+N+ KP CT+KVDLQKAYDS+NWDFLFGLLIAI TPL+FVSW             
Subjt:  NGADRLEDF--------ISTFISWRSIIDNILLCLEL---LGRYHMNTRKPRCTMKVDLQKAYDSVNWDFLFGLLIAIETPLRFVSW-------------

Query:  ---------------------------------------------FHQFCEKVRITHLTFADDLIIFCATEKYSMSFIKETIQRFGELLGLFANLRKSSI
                                                     FHQ CEKV++T LTFADDL+IFC  ++ S+SF++ET+Q+FGELLGL+ANL K SI
Subjt:  ---------------------------------------------FHQFCEKVRITHLTFADDLIIFCATEKYSMSFIKETIQRFGELLGLFANLRKSSI

Query:  FLVGVNSVEAS--------------------PLEARRLQSSDCDSLIQWITSRIRSWSARVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKIL
        F+ G  +  AS                    PL   RL+ SDC  LIQ ITSRIRSW+ARVLSFAGR QLV SV RSLQVYWASVFVLP+  H  VDKIL
Subjt:  FLVGVNSVEAS--------------------PLEARRLQSSDCDSLIQWITSRIRSWSARVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKIL

Query:  RSYLWRGKEEGRGGVKVAWNEVCLPFDEEGFAICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKGRSLWEIDVGVGRSWCFRVILRKRDFLKDHV
        RSYLWR                          + DG S NI ST+KILWLL   S SLWVAWVEAYILK +SLW +D GV RSWC R ILRKRD LK HV
Subjt:  RSYLWRGKEEGRGGVKVAWNEVCLPFDEEGFAICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKGRSLWEIDVGVGRSWCFRVILRKRDFLKDHV

Query:  RMEVGDGMRSRVWLDPWLQGGLIIQQYGKR----------------------------------------GVRPCESVEDRWVWIPGSHDGFLVTSAWET
         +EVGDG   RVWLDPWLQG  I++Q G+R                                         VRPC SV DRWVW+PG   GF + SA +T
Subjt:  RMEVGDGMRSRVWLDPWLQGGLIIQQYGKR----------------------------------------GVRPCESVEDRWVWIPGSHDGFLVTSAWET

Query:  IRPRSIRVGWSGLLWGGGNIPYHSFCAWLAIRDRLDIRDRLNRWDRSIPLS
        IRPR  RV W GLLWGGGN+P HSFCAWL I+++L  RDRL+RWD S+P+S
Subjt:  IRPRSIRVGWSGLLWGGGNIPYHSFCAWLAIRDRLDIRDRLNRWDRSIPLS

A0A5A7TZS0 Reverse transcriptase domain-containing protein3.5e-30855.72Show/hide
Query:  FTSNYSNSGVGRMWVMWKRNRFVFTTHEVTDQFISSLVTDLISGESRKVMCVYASNSNIERRDLWRRMTQIFAGWRGPGMVMGDFSTIIIHSEAFEGAPN
        ++ +YSNSGVGR+WVMWK+ RF F TH + ++F+   +TDL+ G   +V+CVYASNS+ ERR LWR + +I + W   G+VMGDF+ I +HSEAF G+P 
Subjt:  FTSNYSNSGVGRMWVMWKRNRFVFTTHEVTDQFISSLVTDLISGESRKVMCVYASNSNIERRDLWRRMTQIFAGWRGPGMVMGDFSTIIIHSEAFEGAPN

Query:  CGDMEEFDMAIGEADLVEPSVQENWFTWTSKIHGSGLMRRLDRILVNDEGLSAWPNMRVNVLPWGISDHSSILVYPSYQWSQRVVSFRFFNHWVEDSSFL
         G+MEEFD+AI +ADLVEPSVQ NWFTWTSK+ GSG++RRLDR+LVNDE LSAWP MR+NVLPWGISDHS IL YPS+Q + RVVSFRFFNHWVE+ SF+
Subjt:  CGDMEEFDMAIGEADLVEPSVQENWFTWTSKIHGSGLMRRLDRILVNDEGLSAWPNMRVNVLPWGISDHSSILVYPSYQWSQRVVSFRFFNHWVEDSSFL

Query:  DVVSSVWTRDSRVSSIVSFVKNLRNLKPILRNHFDKHIQTISEDVRLAKDTMDQAQREVEINPLSEELSNQASLATVNFW--------------------
        +VV+ +W+R   VS +VS ++NL +LKPILR  F +HI+++SE+V +AK+ MD AQREVE NPLS+ LS QASLAT  FW                    
Subjt:  DVVSSVWTRDSRVSSIVSFVKNLRNLKPILRNHFDKHIQTISEDVRLAKDTMDQAQREVEINPLSEELSNQASLATVNFW--------------------

Query:  ---------RMFVRSRQSSNALRSVVDPNGTQLTNHDQVTQVVDNYLKNTLGFQNIGYKELSPSIEDIVQFRWTDECYQALQALIGREEVRRVMFFMDSG
                    VRSR S N+L S+VD +G+++++HD V Q+  NY  N+LG Q IGY+ELSP I+DIVQF+W++EC QALQ  I REEVRRV+F MDSG
Subjt:  ---------RMFVRSRQSSNALRSVVDPNGTQLTNHDQVTQVVDNYLKNTLGFQNIGYKELSPSIEDIVQFRWTDECYQALQALIGREEVRRVMFFMDSG

Query:  KAPGPDGYS-------------DFCDAILHFFETNYFPQGVNTTAITLIPKRNGADRLEDF--------------------------------ISTFISW
        KAPGPDG+S             DFC+A+LHFFET Y P GVN TAITLIPK  GA+RLEDF                                 S FI  
Subjt:  KAPGPDGYS-------------DFCDAILHFFETNYFPQGVNTTAITLIPKRNGADRLEDF--------------------------------ISTFISW

Query:  RSIIDNILLCLELLGRYHMNTRKPRCTMKVDLQKAYDSVNWDFLFGLLIAIETPLRFVSW----------------------------------------
        RSII+NILLC EL+G YH+N+ KPRCT+KVDLQKAYDSVNWDFLFGLLIAI TPL+FVSW                                        
Subjt:  RSIIDNILLCLELLGRYHMNTRKPRCTMKVDLQKAYDSVNWDFLFGLLIAIETPLRFVSW----------------------------------------

Query:  ------------------FHQFCEKVRITHLTFADDLIIFCATEKYSMSFIKETIQRFGELLGLFANLRKSSIFLVGVNSVEASPLEA------------
                          FH  CEKV++THLTFADDL+IFCA ++ S+SFI+E +Q+FGE  GLFAN RKSSIF+VGVN+  AS L A            
Subjt:  ------------------FHQFCEKVRITHLTFADDLIIFCATEKYSMSFIKETIQRFGELLGLFANLRKSSIFLVGVNSVEASPLEA------------

Query:  -------RRLQSSDCDSLIQWITSRIRSWSARVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKILRSYLWRGKEEGRGGVKVAWNEVCLPFDE
                 L+S DC  LIQ ITSRIRSW+ARVLSFAGRLQLV SVLRSLQVYWASVFVLPA  H +VDKILRSYLWRGKEEGRGG+KVAW +VCLPF+E
Subjt:  -------RRLQSSDCDSLIQWITSRIRSWSARVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKILRSYLWRGKEEGRGGVKVAWNEVCLPFDE

Query:  EGFAICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKGRSLWEIDVGVGRSWCFRVILRKRDFLKDHVRMEVGDGMRSR-------------VWLD
         G  I DG S NIA+T+KI   LL   GSLWVAW+EAYILKG+SLW++D  VGRSWC R ILRKR+ +K HV   V     SR              WL 
Subjt:  EGFAICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKGRSLWEIDVGVGRSWCFRVILRKRDFLKDHVRMEVGDGMRSR-------------VWLD

Query:  PWLQGGLIIQQYGKRGVRPCESVEDRWVWIPGSHDGFLVTSAWETIRPRSIRVGWSGLLWGGGNIPYHSFCAWLAIRDRLDIRDRLNRWDRSIPLSCDRV
        P +   LI      + V PC SV D WVW+PG   GF + SAWE I PR  RV W GLLWGGGNIP HSFCAWLAI+DRLD RDRL+RWD SIPLSC  +
Subjt:  PWLQGGLIIQQYGKRGVRPCESVEDRWVWIPGSHDGFLVTSAWETIRPRSIRVGWSGLLWGGGNIPYHSFCAWLAIRDRLDIRDRLNRWDRSIPLSCDRV

Query:  LRGGV
         +GGV
Subjt:  LRGGV

A0A5A7UP65 Reverse transcriptase1.4e-24848.77Show/hide
Query:  NSFGSLMEEEKGDKWALSIVDGSPLPLRVDNSSMVLLSNPCDDNPLGAQAPIGSYDYLFGRRILTRSLVGLVIRGG----FTSNYSNSGVGRMWVMWKRN
        NSFGSL+E    DKWALSI++GSP   R      + L +    + LG+ + +G   +L   R+   +   +  R G    ++ +YSNSGVGR+WVMWK+N
Subjt:  NSFGSLMEEEKGDKWALSIVDGSPLPLRVDNSSMVLLSNPCDDNPLGAQAPIGSYDYLFGRRILTRSLVGLVIRGG----FTSNYSNSGVGRMWVMWKRN

Query:  RFVFTTHEVTDQFISSLVTDLISGESRKVMCVYASNSNIERRDLWRRMTQIFAGWRGPGMVMGDFSTIIIHSEAFEGAPNCGDMEEFDMAIGEADLVEPS
        RF F+TH   +QFI+  V                                          VM DF+ I  HSEA  G+P  G+ME+FDMAI +ADLVEPS
Subjt:  RFVFTTHEVTDQFISSLVTDLISGESRKVMCVYASNSNIERRDLWRRMTQIFAGWRGPGMVMGDFSTIIIHSEAFEGAPNCGDMEEFDMAIGEADLVEPS

Query:  VQENWFTWTSKIHGSGLMRRLDRILVNDEGLSAWPNMRVNVLPWGISDHSSILVYPSYQWSQRVVSFRFFNHWVEDSSFLDVVSSVWTRDSRVSSIVSFV
        VQ NWFTWTSK+ GSG+MRRLDR+L+ND+ LSAWP M VNVLPWGISDHS IL+YPS+Q + +VVSFR FNHWV+D SFL                    
Subjt:  VQENWFTWTSKIHGSGLMRRLDRILVNDEGLSAWPNMRVNVLPWGISDHSSILVYPSYQWSQRVVSFRFFNHWVEDSSFLDVVSSVWTRDSRVSSIVSFV

Query:  KNLRNLKPILRNHFDKHIQTISEDVRLAKDTMDQAQREVEINPLSEELSNQASLATVNFW-RMFVRSRQSSNALRSVVDPNGTQLTNHDQVTQVVDNYLK
                     F +HI+++SE+VR+AK+ MD AQREVE NP+S+ LS QASLAT  FW  + +   +    L  VVD   +++++HD V Q+  NY  
Subjt:  KNLRNLKPILRNHFDKHIQTISEDVRLAKDTMDQAQREVEINPLSEELSNQASLATVNFW-RMFVRSRQSSNALRSVVDPNGTQLTNHDQVTQVVDNYLK

Query:  NTLGFQNIGYKELSPSIEDIVQFRWTDECYQALQALIGREEVRRVMFFMDSGKAPGPDGYS-------------DFCDAILHFFETNYFPQGVNTTAITL
        N+LG Q IGY+EL+P I+DIVQF+W++EC QALQ  I REEVRRV+F MDSGKAPGPDG+S             DFCDA+LHFFET Y P GVN TAITL
Subjt:  NTLGFQNIGYKELSPSIEDIVQFRWTDECYQALQALIGREEVRRVMFFMDSGKAPGPDGYS-------------DFCDAILHFFETNYFPQGVNTTAITL

Query:  IPKRNGADRLEDF--------------------------------ISTFISWRSIIDNILLCLELLGRYHMNTRKPRCTMKVDLQKAYDSVNWDFLFGLL
        IPK NGA+RLEDF                                 S FI  RSII+NILLC EL+G YH+N+ KPRCT+KVDLQKAYDSVNWDFLFGL 
Subjt:  IPKRNGADRLEDF--------------------------------ISTFISWRSIIDNILLCLELLGRYHMNTRKPRCTMKVDLQKAYDSVNWDFLFGLL

Query:  IAIETPLRFVSWFHQFCEKVRIT---------------HLTFADDLIIFCATE------------KYSMSFIKETIQRFGELLGLFANLRKSSIFLVGVN
        I+I TPL+FVSW          +                +   D L +F                  S  F     +RFGEL GLFAN RKSSIF+ GVN
Subjt:  IAIETPLRFVSWFHQFCEKVRIT---------------HLTFADDLIIFCATE------------KYSMSFIKETIQRFGELLGLFANLRKSSIFLVGVN

Query:  SVEAS--------------------PLEARRLQSSDCDSLIQWITSRIRSWSARVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKILRSYLWR
        +  AS                    PL   RL+S+DC  LIQ ITSRIRS SARVLSFAGRLQLVCSVL SLQVYWA VFVLPA  H             
Subjt:  SVEAS--------------------PLEARRLQSSDCDSLIQWITSRIRSWSARVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKILRSYLWR

Query:  GKEEGRGGVKVAWNEVCLPFDEEGFAICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKGRSLWEIDVGVGRSWCFRVILRKRDFLKDHVRMEVGD
                            +E G  I DG++   AST+KILWL+L  SGSLWVAWVEAY+LKGRSLW++D  VGRSWC R ILRK++ LK HVRM+VG+
Subjt:  GKEEGRGGVKVAWNEVCLPFDEEGFAICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKGRSLWEIDVGVGRSWCFRVILRKRDFLKDHVRMEVGD

Query:  GMRSRVWLDPWLQGGLIIQQYGKRGVRPCES-----------VEDRWVWIPGSHDGFLVTSAWETIRPRSIRVGWSGLLWGGGNIPYHSFCAWLAIRDRL
        G R RVWLDPWLQ G I++Q G+R +    S            +  W+W  G   GF + SAWE IRPR  RV W GLLWGGGNIP HSFCAWLAI+DRL
Subjt:  GMRSRVWLDPWLQGGLIIQQYGKRGVRPCES-----------VEDRWVWIPGSHDGFLVTSAWETIRPRSIRVGWSGLLWGGGNIPYHSFCAWLAIRDRL

Query:  DIRDRLNRWDRSIPLSC
          RDR +RWD S+PLSC
Subjt:  DIRDRLNRWDRSIPLSC

A0A5D3DXE4 Reverse transcriptase domain-containing protein2.5e-24550.26Show/hide
Query:  MTNSFGSLMEEEKGDKWALSIVDGSPLPLRVDNSSMVLLSNPCDDNPLGAQAPIGSYDYLFGRRILTRSLVGLVIR----GGFTSNYSNSGVGRMWVMWK
        M NSFGSL+E    DKWALSI++GS  PL+VD  + VL          G  + I  +  L   R+   +   +  R      ++ +YSNSGVGR+WVMWK
Subjt:  MTNSFGSLMEEEKGDKWALSIVDGSPLPLRVDNSSMVLLSNPCDDNPLGAQAPIGSYDYLFGRRILTRSLVGLVIR----GGFTSNYSNSGVGRMWVMWK

Query:  RNRFVFTTHEVTDQFISSLVTDLISGESRKVMCVYASNSNIERRDLWRRMTQIFAGWRGPGMVMGDFSTIIIHSEAFEGAPNCGDMEEFDMAIGEADLVE
        +NRF F+TH + +QF++  +TDL+SG   +V CVYASNSNIERR LW R+ +I + W  PG+VM DF+ I +HSEAF G+P  G+ME+F++AI +ADLVE
Subjt:  RNRFVFTTHEVTDQFISSLVTDLISGESRKVMCVYASNSNIERRDLWRRMTQIFAGWRGPGMVMGDFSTIIIHSEAFEGAPNCGDMEEFDMAIGEADLVE

Query:  PSVQENWFTWTSKIHGSGLMRRLDRILVNDEGLSAWPNMRVNVLPWGISDHSSILVYPSYQWSQRVVSFRFFNHWVEDSSFLDVVSSVWTRDSRVSSIVS
        PSVQ NWFTWTSK+ GSG++RRLDR+LVND+ LS WP M VNVLPWGISDH  IL YPS+Q S +VVSFRFFNHWVED SF++VV+ +W+R   VS +V 
Subjt:  PSVQENWFTWTSKIHGSGLMRRLDRILVNDEGLSAWPNMRVNVLPWGISDHSSILVYPSYQWSQRVVSFRFFNHWVEDSSFLDVVSSVWTRDSRVSSIVS

Query:  FVKNLRNLKPILRNHFDKHIQTISEDVRLAKDTMDQAQREVEINPLSEELSNQASLATVNFWRMFVRSRQSSNALRSVVDPNGTQLTNHDQVTQVVDNYL
         ++NL  LKPILR  F +HI+ +SE+VR+ K+ MD AQRE+ +                                                      NY 
Subjt:  FVKNLRNLKPILRNHFDKHIQTISEDVRLAKDTMDQAQREVEINPLSEELSNQASLATVNFWRMFVRSRQSSNALRSVVDPNGTQLTNHDQVTQVVDNYL

Query:  KNTLGFQNIGYKELSPSIEDIVQFRWTDECYQALQALIGREEVRRVMFFMDSGKAPGPDGYSDFCDAILHFFETNYFPQGVNTTAITLIPKRNGADRLED
        +N+LG Q IGY+ELSP I+DIVQF+W++EC QALQ  I REEVRRV+F MDSGKAPGPDG+S                 G+N TAITLIPK NGA+RLED
Subjt:  KNTLGFQNIGYKELSPSIEDIVQFRWTDECYQALQALIGREEVRRVMFFMDSGKAPGPDGYSDFCDAILHFFETNYFPQGVNTTAITLIPKRNGADRLED

Query:  F--------------------------------ISTFISWRSIIDNILLCLELLGRYHMNTRKPRCTMKVDLQKAY---------DSVNWDFLFGLLIAI
        F                                 S FI  RSII+NILLC EL+     +   P   +   L+  +         D ++  FLF +++ +
Subjt:  F--------------------------------ISTFISWRSIIDNILLCLELLGRYHMNTRKPRCTMKVDLQKAY---------DSVNWDFLFGLLIAI

Query:  ------ETPLRFVSWFHQFCEKVRITHLTFADDLIIFCATEKYSMSFIKETIQRFGELLGLFANLRKSSIFLVGVNSVEAS-------------------
              + P  F   FH  CEKV++THLTFADDL+IFCA ++ S+ FI++ +Q+FGEL GLFAN RKSSIF+ GVN+  AS                   
Subjt:  ------ETPLRFVSWFHQFCEKVRITHLTFADDLIIFCATEKYSMSFIKETIQRFGELLGLFANLRKSSIFLVGVNSVEAS-------------------

Query:  -PLEARRLQSSDCDSLIQWITSRIRSWSARVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKILRSYLWRGKEEGRGGVKVAWNEVCLPFDEEG
         PL   RL+S+DC  LIQ ITS+IRSW+ARVLSFAGRLQLV SVLRSLQVYWASVFVLPA  H +VDKILRSYLWRGKEEGRGG+KVAW +VCLPF+E G
Subjt:  -PLEARRLQSSDCDSLIQWITSRIRSWSARVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKILRSYLWRGKEEGRGGVKVAWNEVCLPFDEEG

Query:  FAICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKGRSLWEIDVGVGRSWCFRVILRKRDFLKDHVRMEVGDGMRSR---VWLDPWLQGGLIIQQY
        F I DG S NIAST+KILWL+L  SGSLWVAWVEAYILKGRSLW++D  VGRSWC   ILR               G  SR   VWL P +   LI    
Subjt:  FAICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKGRSLWEIDVGVGRSWCFRVILRKRDFLKDHVRMEVGDGMRSR---VWLDPWLQGGLIIQQY

Query:  GKRGVRPCESVEDRWVWIPGSHDGFLVTSAWETIRPRSIRVGWSG
          + V PC SV D  VW+ G   GF ++SAWE IRPR  RV W G
Subjt:  GKRGVRPCESVEDRWVWIPGSHDGFLVTSAWETIRPRSIRVGWSG

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657502.5e-2429.68Show/hide
Query:  PLEARRLQSSDCDSLIQWITSRIRSWSARVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKILRSYLWRGKEEGRGGVKVAWNEVCLPFDEEGF
        P+  +R+       +++ ++SR+  W  + LSFAGRL L  +VL S+ V+  S  +LP      +D++ R++LW    E +    V W++VC P  E G 
Subjt:  PLEARRLQSSDCDSLIQWITSRIRSWSARVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKILRSYLWRGKEEGRGGVKVAWNEVCLPFDEEGF

Query:  AICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKGR---SLWEIDVGVGRSWCFRVILRKRDFLKDHVRMEVGDGMRSRVWLDPWLQGGLIIQQYG
         +    S N A   K+ W LL +  SLW   ++     G    S W I  G   S    + +  RD +   V    GDG + R W D W+ G  +++   
Subjt:  AICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKGR---SLWEIDVGVGRSWCFRVILRKRDFLKDHVRMEVGDGMRSRVWLDPWLQGGLIIQQYG

Query:  KRGVRPCESVEDRWVWIPG
              C++V  + +WIPG
Subjt:  KRGVRPCESVEDRWVWIPG

P93295 Uncharacterized mitochondrial protein AtMg003101.0e-0625.87Show/hide
Query:  SLQVYWASVFVLPAKFHRDVDKILRSYLWRGKEEGRGGVKVAWNEVC-LPFDEEGFAICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKGRSLWE
        +L VY  S F L     + +   +  + W   E  R    VAW ++C    D+ G    D    N A   K  + ++ +  +L    + +      S+ E
Subjt:  SLQVYWASVFVLPAKFHRDVDKILRSYLWRGKEEGRGGVKVAWNEVC-LPFDEEGFAICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKGRSLWE

Query:  IDVGVGRSWCFRVILRKRDFLKDHVRMEVGDGMRSRVWLDPWL
          VG   S+ +R I+  R+ L   +   +GDG+ ++VWLD W+
Subjt:  IDVGVGRSWCFRVILRKRDFLKDHVRMEVGDGMRSRVWLDPWL

Arabidopsis top hitse value%identityAlignment
AT1G40390.1 DNAse I-like superfamily protein4.2e-1124.85Show/hide
Query:  NSNIERRDLWRRMTQIFAG---WRGPGMVMGDFSTIIIHSEAFEGAP---NCGDMEEFDMAIGEADLVEPSVQENWFTWTSKIHGSGLMRRLDRILVNDE
        N+  ERR LW  +T++ A       P +V+GDF+ I   +E +   P   +   +E+    + ++DLV+   +   +TW++    + ++R+LDR +VN  
Subjt:  NSNIERRDLWRRMTQIFAG---WRGPGMVMGDFSTIIIHSEAFEGAP---NCGDMEEFDMAIGEADLVEPSVQENWFTWTSKIHGSGLMRRLDRILVNDE

Query:  GLSAWPNMRVNVLPWGISDHSSILVYPSYQWS-QRVVSFRFFNHWVEDSSFLDVVSSVWTRDSRVSSIV
         L+ +P       P   SDH++ +V  +      +  SF++F+       F+  + + W ++  V S +
Subjt:  GLSAWPNMRVNVLPWGISDHSSILVYPSYQWS-QRVVSFRFFNHWVEDSSFLDVVSSVWTRDSRVSSIV

AT1G43760.1 DNAse I-like superfamily protein4.8e-2324.03Show/hide
Query:  MVMGDFSTIIIHSEAFE----GAPNCGDMEEFDMAIGEADLVEPSVQENWFTWTSKIHGSGLMRRLDRILVNDEGLSAWPNMRVNVLPWGISDHSSILVY
        +++GDF  I   S+ +       P  G +EEF   + ++DLV+   +   +TW++    + ++R+LDR + N +  S++P+        G+SDHS  ++ 
Subjt:  MVMGDFSTIIIHSEAFE----GAPNCGDMEEFDMAIGEADLVEPSVQENWFTWTSKIHGSGLMRRLDRILVNDEGLSAWPNMRVNVLPWGISDHSSILVY

Query:  PSYQWSQRVVSFRFFNHWVEDSSFLDVVSSVWTRDSRVSS-IVSFVKNLRNLKPILRNHFDKHIQTISEDVRLAKDTMDQAQREVEINPLS---------
              +    FR+F+      +FL  ++  W     V S + S  ++L+  K   +    +    I    + A D+++  Q ++  NP           
Subjt:  PSYQWSQRVVSFRFFNHWVEDSSFLDVVSSVWTRDSRVSS-IVSFVKNLRNLKPILRNHFDKHIQTISEDVRLAKDTMDQAQREVEINPLS---------

Query:  -----------EELSNQASL--------ATVNFWRMFVRSRQSSNALRSVVDPNGTQLTNHDQVTQVVDNYLKNTLGF-QNIGYKELSPSIEDIVQFRWT
                   E    Q S         A   F+   + + Q+ N ++ +   +  ++ N  QV +++  Y  + LG   +I   +    I+DI  FR  
Subjt:  -----------EELSNQASL--------ATVNFWRMFVRSRQSSNALRSVVDPNGTQLTNHDQVTQVVDNYLKNTLGF-QNIGYKELSPSIEDIVQFRWT

Query:  DECYQALQALIGREEVRRVMFFMDSGKAPGPDGYS-------------DFCDAILHFFETNYFPQGVNTTAITLIPKRNGADRLEDF
        D     L AL   +E+   +F M   KAPGPD ++                 A+  FF T +  +  N TAITLIPK  G D+L  F
Subjt:  DECYQALQALIGREEVRRVMFFMDSGKAPGPDGYS-------------DFCDAILHFFETNYFPQGVNTTAITLIPKRNGADRLEDF

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.3e-3228.4Show/hide
Query:  PLEARRLQSSDCDSLIQWITSRIRSWSARVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKILRSYLWRGKEEGRGGVKVAWNEVCLPFDEEGF
        PL  +++ +SD   L++ I  RI  W+AR LSFAGRLQL+ SV+ SL  +W S F LP+   +++D I  S+LW G E      KVAW++VC P DE G 
Subjt:  PLEARRLQSSDCDSLIQWITSRIRSWSARVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKILRSYLWRGKEEGRGGVKVAWNEVCLPFDEEGF

Query:  AICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKGRSLWEIDVGVGRSWCFRVILRKRDFLKDHVRMEVGDGMRSRVWLDPWLQGGLIIQQYGKRG
         I      N               GS W        + G +          SW ++ IL+ R      V+ ++ +G  +  W D W + G +I   G RG
Subjt:  AICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKGRSLWEIDVGVGRSWCFRVILRKRDFLKDHVRMEVGDGMRSRVWLDPWLQGGLIIQQYGKRG

Query:  V------------------RPCESVEDRWVWIPG-----SHDG--------------------FLVTSAWETIRPRSIRVGWSGLLWGGGNIPYHSFCAW
                           RP     D  + I        H G                    F     W   R   ++V W   +W     P +S  AW
Subjt:  V------------------RPCESVEDRWVWIPG-----SHDG--------------------FLVTSAWETIRPRSIRVGWSGLLWGGGNIPYHSFCAW

Query:  LAIRDRLDIRDRLNRWDRSIPLSC
        +AI++RL   DR+  W+     SC
Subjt:  LAIRDRLDIRDRLNRWDRSIPLSC

AT3G25720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.2e-1030.97Show/hide
Query:  VCLPFDEEGFAICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKG------RSLWEIDVGVGRSWCFRVILRKRDFLKDHVRMEVGDGMRSRVWLD
        VCLP  E G  +   S  N    +K++W L    GSLWV W   + L+G         W     V  SW ++ +LR R   +  +R  +G+G+ +R W D
Subjt:  VCLPFDEEGFAICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILKG------RSLWEIDVGVGRSWCFRVILRKRDFLKDHVRMEVGDGMRSRVWLD

Query:  PWLQGGLIIQQYG
         W   G ++   G
Subjt:  PWLQGGLIIQQYG

AT5G16486.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein7.1e-1131.45Show/hide
Query:  VGDGMRSRVWLDPWLQGGL----IIQQYGKRGVRPCESV--------EDRWVWIPGSHD---GFLVTSAWETIRPRSIRVGWSGLLWGGGNIPYHSFCAW
        V D +R  VW   W+ G      IIQ    +   P  SV        +D ++W  G  +   GF   + W  + P   +V W   +W  G IP H+F +W
Subjt:  VGDGMRSRVWLDPWLQGGL----IIQQYGKRGVRPCESV--------EDRWVWIPGSHD---GFLVTSAWETIRPRSIRVGWSGLLWGGGNIPYHSFCAW

Query:  LAIRDRLDIRDRLNRWDRSIPLSC
        + IR RL  RD+L  W   +P  C
Subjt:  LAIRDRLDIRDRLNRWDRSIPLSC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTATGACAAACTCTTTTGGCAGCCTCATGGAGGAGGAAAAAGGGGATAAATGGGCACTATCTATAGTGGATGGCTCTCCTCTGCCCCTACGCGTAGATAACTCTTC
TATGGTTCTGTTGAGCAATCCTTGTGATGACAATCCTCTTGGAGCACAAGCTCCCATAGGATCTTATGATTATTTGTTCGGAAGGAGAATTTTAACTCGATCTCTGGTAG
GATTGGTGATTCGTGGGGGGTTCACTAGTAACTATAGCAACAGTGGAGTAGGTCGAATGTGGGTTATGTGGAAACGTAACAGATTCGTGTTCACGACTCACGAGGTTACC
GATCAGTTTATTTCTAGTTTGGTCACAGATTTGATTTCAGGGGAGAGTAGAAAGGTTATGTGTGTGTATGCCTCTAATAGTAATATAGAGAGACGTGATTTGTGGAGGCG
CATGACTCAGATTTTTGCTGGTTGGAGAGGGCCGGGTATGGTCATGGGGGATTTTAGTACCATCATAATTCATTCTGAAGCCTTCGAGGGCGCTCCAAATTGTGGGGATA
TGGAGGAATTTGACATGGCTATTGGAGAGGCGGACCTTGTTGAACCCTCGGTTCAGGAAAACTGGTTTACCTGGACTAGCAAGATCCATGGGTCGGGTTTGATGAGGAGG
CTAGATCGTATCCTAGTGAATGATGAGGGCCTTAGTGCTTGGCCTAACATGAGGGTTAACGTTCTCCCTTGGGGTATTTCTGATCATTCTTCCATCCTTGTTTATCCCAG
CTACCAATGGAGTCAGCGGGTGGTCTCTTTTCGTTTCTTTAATCATTGGGTTGAAGATTCTTCCTTTTTGGATGTAGTTTCCTCTGTTTGGACCAGGGACTCTAGAGTTT
CTTCGATTGTGAGTTTTGTGAAGAACTTAAGGAATCTCAAACCGATTCTACGCAACCATTTTGATAAGCACATCCAGACCATAAGTGAGGATGTCCGTCTTGCTAAGGAT
ACCATGGATCAAGCTCAGAGAGAGGTAGAGATAAACCCTCTGTCGGAGGAGCTAAGTAATCAAGCTAGTTTAGCTACGGTGAATTTTTGGAGAATGTTTGTTCGATCAAG
GCAGAGCAGTAATGCATTGCGTTCAGTTGTTGATCCAAATGGTACTCAGTTGACTAACCATGATCAGGTGACTCAAGTGGTAGATAATTATTTGAAAAATACTCTGGGAT
TCCAGAATATTGGTTACAAAGAGCTTTCTCCTAGTATTGAGGATATTGTTCAGTTTAGATGGACTGATGAGTGCTACCAGGCCCTTCAGGCGTTAATTGGTAGGGAAGAG
GTGAGACGCGTTATGTTTTTCATGGATAGTGGAAAGGCACCAGGACCTGATGGGTATTCGGATTTCTGTGATGCCATCTTACACTTCTTTGAGACTAATTACTTTCCTCA
AGGGGTGAATACTACTGCTATTACGCTTATTCCTAAAAGGAATGGTGCTGATCGGTTGGAGGATTTCATTTCTACTTTCATCTCTTGGAGGAGTATTATTGATAATATTC
TTCTTTGTCTGGAGCTTCTAGGGCGTTACCATATGAACACAAGAAAACCTCGGTGCACTATGAAGGTTGACCTCCAAAAAGCATATGATTCTGTTAATTGGGATTTCCTC
TTTGGACTGTTGATTGCTATTGAAACCCCTTTGAGATTTGTGAGTTGGTTCCATCAGTTTTGTGAGAAGGTCAGAATAACTCATCTTACTTTTGCGGATGATCTAATCAT
CTTTTGTGCTACTGAAAAATATTCTATGAGTTTCATAAAAGAAACTATTCAGAGGTTTGGTGAGCTATTGGGGTTGTTTGCAAATCTTAGGAAAAGTTCTATTTTTCTTG
TGGGGGTTAATAGTGTGGAAGCTTCTCCGTTAGAAGCGAGAAGGTTACAGAGTTCTGATTGTGATTCCCTTATTCAATGGATAACCAGTCGTATTCGGTCTTGGTCTGCT
AGAGTGTTATCTTTTGCAGGCAGGCTTCAACTTGTTTGCTCTGTTCTTCGGAGTCTACAAGTTTATTGGGCTAGTGTTTTCGTGCTTCCTGCGAAATTCCACAGAGATGT
TGATAAGATCTTGAGGTCATATCTGTGGAGAGGTAAGGAGGAGGGTAGAGGTGGTGTAAAAGTGGCCTGGAATGAGGTTTGCCTTCCTTTTGATGAGGAAGGTTTTGCTA
TTTGCGATGGATCTTCTTGTAATATAGCGAGCACAATGAAGATCTTATGGTTGTTGCTCGTGAAATCTGGCTCTTTATGGGTTGCTTGGGTGGAAGCTTATATCCTTAAA
GGGAGATCGTTATGGGAGATTGATGTTGGGGTTGGTCGATCTTGGTGTTTTAGGGTCATCTTGCGTAAGCGGGATTTCCTTAAAGATCATGTTAGGATGGAGGTGGGAGA
TGGAATGAGGAGTAGAGTGTGGTTAGATCCATGGTTACAGGGTGGTTTGATCATTCAGCAGTATGGAAAGAGGGGAGTTAGGCCGTGTGAGAGTGTTGAGGATAGGTGGG
TCTGGATTCCGGGTAGTCACGATGGGTTTTTGGTTACTAGTGCGTGGGAGACTATTCGTCCTCGTAGTATTAGGGTTGGCTGGTCGGGATTACTGTGGGGTGGGGGAAAT
ATTCCTTATCATTCCTTTTGTGCTTGGTTGGCTATTAGAGATAGATTGGATATTAGGGATAGGTTAAATCGGTGGGATAGGTCAATTCCTTTGTCTTGTGATCGGGTATT
GAGGGGTGGAGTTGTCTTGGATTTGCCATCAGGGAATAGGGAAGGGGGTGAGGAAAAAGTTGTGGCGTCTTCTTTGGTGTGCTACGATTTACTTCATTTGGCAAGAGCGT
AA
mRNA sequenceShow/hide mRNA sequence
ATGACTATGACAAACTCTTTTGGCAGCCTCATGGAGGAGGAAAAAGGGGATAAATGGGCACTATCTATAGTGGATGGCTCTCCTCTGCCCCTACGCGTAGATAACTCTTC
TATGGTTCTGTTGAGCAATCCTTGTGATGACAATCCTCTTGGAGCACAAGCTCCCATAGGATCTTATGATTATTTGTTCGGAAGGAGAATTTTAACTCGATCTCTGGTAG
GATTGGTGATTCGTGGGGGGTTCACTAGTAACTATAGCAACAGTGGAGTAGGTCGAATGTGGGTTATGTGGAAACGTAACAGATTCGTGTTCACGACTCACGAGGTTACC
GATCAGTTTATTTCTAGTTTGGTCACAGATTTGATTTCAGGGGAGAGTAGAAAGGTTATGTGTGTGTATGCCTCTAATAGTAATATAGAGAGACGTGATTTGTGGAGGCG
CATGACTCAGATTTTTGCTGGTTGGAGAGGGCCGGGTATGGTCATGGGGGATTTTAGTACCATCATAATTCATTCTGAAGCCTTCGAGGGCGCTCCAAATTGTGGGGATA
TGGAGGAATTTGACATGGCTATTGGAGAGGCGGACCTTGTTGAACCCTCGGTTCAGGAAAACTGGTTTACCTGGACTAGCAAGATCCATGGGTCGGGTTTGATGAGGAGG
CTAGATCGTATCCTAGTGAATGATGAGGGCCTTAGTGCTTGGCCTAACATGAGGGTTAACGTTCTCCCTTGGGGTATTTCTGATCATTCTTCCATCCTTGTTTATCCCAG
CTACCAATGGAGTCAGCGGGTGGTCTCTTTTCGTTTCTTTAATCATTGGGTTGAAGATTCTTCCTTTTTGGATGTAGTTTCCTCTGTTTGGACCAGGGACTCTAGAGTTT
CTTCGATTGTGAGTTTTGTGAAGAACTTAAGGAATCTCAAACCGATTCTACGCAACCATTTTGATAAGCACATCCAGACCATAAGTGAGGATGTCCGTCTTGCTAAGGAT
ACCATGGATCAAGCTCAGAGAGAGGTAGAGATAAACCCTCTGTCGGAGGAGCTAAGTAATCAAGCTAGTTTAGCTACGGTGAATTTTTGGAGAATGTTTGTTCGATCAAG
GCAGAGCAGTAATGCATTGCGTTCAGTTGTTGATCCAAATGGTACTCAGTTGACTAACCATGATCAGGTGACTCAAGTGGTAGATAATTATTTGAAAAATACTCTGGGAT
TCCAGAATATTGGTTACAAAGAGCTTTCTCCTAGTATTGAGGATATTGTTCAGTTTAGATGGACTGATGAGTGCTACCAGGCCCTTCAGGCGTTAATTGGTAGGGAAGAG
GTGAGACGCGTTATGTTTTTCATGGATAGTGGAAAGGCACCAGGACCTGATGGGTATTCGGATTTCTGTGATGCCATCTTACACTTCTTTGAGACTAATTACTTTCCTCA
AGGGGTGAATACTACTGCTATTACGCTTATTCCTAAAAGGAATGGTGCTGATCGGTTGGAGGATTTCATTTCTACTTTCATCTCTTGGAGGAGTATTATTGATAATATTC
TTCTTTGTCTGGAGCTTCTAGGGCGTTACCATATGAACACAAGAAAACCTCGGTGCACTATGAAGGTTGACCTCCAAAAAGCATATGATTCTGTTAATTGGGATTTCCTC
TTTGGACTGTTGATTGCTATTGAAACCCCTTTGAGATTTGTGAGTTGGTTCCATCAGTTTTGTGAGAAGGTCAGAATAACTCATCTTACTTTTGCGGATGATCTAATCAT
CTTTTGTGCTACTGAAAAATATTCTATGAGTTTCATAAAAGAAACTATTCAGAGGTTTGGTGAGCTATTGGGGTTGTTTGCAAATCTTAGGAAAAGTTCTATTTTTCTTG
TGGGGGTTAATAGTGTGGAAGCTTCTCCGTTAGAAGCGAGAAGGTTACAGAGTTCTGATTGTGATTCCCTTATTCAATGGATAACCAGTCGTATTCGGTCTTGGTCTGCT
AGAGTGTTATCTTTTGCAGGCAGGCTTCAACTTGTTTGCTCTGTTCTTCGGAGTCTACAAGTTTATTGGGCTAGTGTTTTCGTGCTTCCTGCGAAATTCCACAGAGATGT
TGATAAGATCTTGAGGTCATATCTGTGGAGAGGTAAGGAGGAGGGTAGAGGTGGTGTAAAAGTGGCCTGGAATGAGGTTTGCCTTCCTTTTGATGAGGAAGGTTTTGCTA
TTTGCGATGGATCTTCTTGTAATATAGCGAGCACAATGAAGATCTTATGGTTGTTGCTCGTGAAATCTGGCTCTTTATGGGTTGCTTGGGTGGAAGCTTATATCCTTAAA
GGGAGATCGTTATGGGAGATTGATGTTGGGGTTGGTCGATCTTGGTGTTTTAGGGTCATCTTGCGTAAGCGGGATTTCCTTAAAGATCATGTTAGGATGGAGGTGGGAGA
TGGAATGAGGAGTAGAGTGTGGTTAGATCCATGGTTACAGGGTGGTTTGATCATTCAGCAGTATGGAAAGAGGGGAGTTAGGCCGTGTGAGAGTGTTGAGGATAGGTGGG
TCTGGATTCCGGGTAGTCACGATGGGTTTTTGGTTACTAGTGCGTGGGAGACTATTCGTCCTCGTAGTATTAGGGTTGGCTGGTCGGGATTACTGTGGGGTGGGGGAAAT
ATTCCTTATCATTCCTTTTGTGCTTGGTTGGCTATTAGAGATAGATTGGATATTAGGGATAGGTTAAATCGGTGGGATAGGTCAATTCCTTTGTCTTGTGATCGGGTATT
GAGGGGTGGAGTTGTCTTGGATTTGCCATCAGGGAATAGGGAAGGGGGTGAGGAAAAAGTTGTGGCGTCTTCTTTGGTGTGCTACGATTTACTTCATTTGGCAAGAGCGT
AA
Protein sequenceShow/hide protein sequence
MTMTNSFGSLMEEEKGDKWALSIVDGSPLPLRVDNSSMVLLSNPCDDNPLGAQAPIGSYDYLFGRRILTRSLVGLVIRGGFTSNYSNSGVGRMWVMWKRNRFVFTTHEVT
DQFISSLVTDLISGESRKVMCVYASNSNIERRDLWRRMTQIFAGWRGPGMVMGDFSTIIIHSEAFEGAPNCGDMEEFDMAIGEADLVEPSVQENWFTWTSKIHGSGLMRR
LDRILVNDEGLSAWPNMRVNVLPWGISDHSSILVYPSYQWSQRVVSFRFFNHWVEDSSFLDVVSSVWTRDSRVSSIVSFVKNLRNLKPILRNHFDKHIQTISEDVRLAKD
TMDQAQREVEINPLSEELSNQASLATVNFWRMFVRSRQSSNALRSVVDPNGTQLTNHDQVTQVVDNYLKNTLGFQNIGYKELSPSIEDIVQFRWTDECYQALQALIGREE
VRRVMFFMDSGKAPGPDGYSDFCDAILHFFETNYFPQGVNTTAITLIPKRNGADRLEDFISTFISWRSIIDNILLCLELLGRYHMNTRKPRCTMKVDLQKAYDSVNWDFL
FGLLIAIETPLRFVSWFHQFCEKVRITHLTFADDLIIFCATEKYSMSFIKETIQRFGELLGLFANLRKSSIFLVGVNSVEASPLEARRLQSSDCDSLIQWITSRIRSWSA
RVLSFAGRLQLVCSVLRSLQVYWASVFVLPAKFHRDVDKILRSYLWRGKEEGRGGVKVAWNEVCLPFDEEGFAICDGSSCNIASTMKILWLLLVKSGSLWVAWVEAYILK
GRSLWEIDVGVGRSWCFRVILRKRDFLKDHVRMEVGDGMRSRVWLDPWLQGGLIIQQYGKRGVRPCESVEDRWVWIPGSHDGFLVTSAWETIRPRSIRVGWSGLLWGGGN
IPYHSFCAWLAIRDRLDIRDRLNRWDRSIPLSCDRVLRGGVVLDLPSGNREGGEEKVVASSLVCYDLLHLARA