CuGenDBv2

Lag0005532 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0005532
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr6:20856417..20862087
RNA-Seq Expression: Lag0005532
Synteny: Lag0005532
Gene Ontology terms:
GO:0090304 - nucleic acid metabolic process (biological process)
GO:0016740 - transferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domains:
IPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
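The genome location above uses a `seqid:start..end` notation. As a minimal sketch, the string can be split into its parts and the span length computed. The helper names (`Region`, `parse_region`) are hypothetical, and the length assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the span.
        return self.end - self.start + 1


def parse_region(text: str) -> Region:
    """Parse a 'seqid:start..end' string such as 'chr6:20856417..20862087'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:  # tolerate reversed coordinates
        start, end = end, start
    return Region(seqid, start, end)


region = parse_region("chr6:20856417..20862087")
print(region.seqid, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption, the gene spans a little under 5.7 kb on chr6.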


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0037581.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 1.0e-288; % identity: 58.22)
Query:  YFVEIELPVPDTLPTSAESSKSSSIT---LFSKSSYVKAAPRGGQRVS------------LEEDDVHWLHSIFWAKLAGGPGGGVMPTRTSRRRRQNQDG
        + VEIELPVPD LPTSAESS+S+S T   L+ +S +V+   +   +              +  DDV WLH+IF AK AGGPGGGV               
Subjt:  YFVEIELPVPDTLPTSAESSKSSSIT---LFSKSSYVKAAPRGGQRVS------------LEEDDVHWLHSIFWAKLAGGPGGGVMPTRTSRRRRQNQDG

Query:  SQNATQSQSERGSSNPRGQNEARSER-FSRSAQEICRLERAGPSDPKKMYGIERLKKLRATVFEGSMDPTDVEVWLNMLEKCFDVMSCPEERKVRLATFL
            T  QS+    +    +  R     S   +EI R E+AGPSD +KMYGIERLKKL ATVFEGS DP D EVWLNMLEKCFDVMSCP+ERKV+LATFL
Subjt:  SQNATQSQSERGSSNPRGQNEARSER-FSRSAQEICRLERAGPSDPKKMYGIERLKKLRATVFEGSMDPTDVEVWLNMLEKCFDVMSCPEERKVRLATFL

Query:  LQKEAEGWWKSILARRSDA--------------------------------------LVEYERKYTELSRCVDVIVASESDRCRRFERGLRFEIRTPVTA
        L KEAEGWWKSI+ARR+DA                                      + +YERKYTELSR  ++IVASESDRC RFERGLRFEIRTPVTA
Subjt:  LQKEAEGWWKSILARRSDA--------------------------------------LVEYERKYTELSRCVDVIVASESDRCRRFERGLRFEIRTPVTA

Query:  IANWTNFSQLVETALRVEQSIIEEKAALEPSRGTPTT------------------------------------------------------NVARPRTGQ
        IA W NFSQLVETALRV+QSI+EEK+A+E SRG  TT                                                      +VAR RTGQ
Subjt:  IANWTNFSQLVETALRVEQSIIEEKAALEPSRGTPTT------------------------------------------------------NVARPRTGQ

Query:  ESVASESRRTPCVSCGKNNRGQCIVGAG-----------------------------------------AEEGSSGARQKGVVDKPRQKGKVYAMTQQEA
        ESVASES+RTPCVSCGK+++G+C++GAG                                         A E +SGARQKGVV +PRQ+GKVYAMTQQEA
Subjt:  ESVASESRRTPCVSCGKNNRGQCIVGAG-----------------------------------------AEEGSSGARQKGVVDKPRQKGKVYAMTQQEA

Query:  EDAPDVITGMVLICNVPAHVLLDPDAMHSFVSSMFLTKLNRMLESLSEELVIYTPVGDVLLVNEVLRDGEVLVEGLCMLVDLLPLELQALDVILGMNFLF
        EDAPDVIT  +LICNVP  VLLD  A HSFVSSMFLTK+NRMLE L EELVI TPVGDVLLV+EVL D EV+VEGL M VDLLPLELQ LDVILGM+FLF
Subjt:  EDAPDVITGMVLICNVPAHVLLDPDAMHSFVSSMFLTKLNRMLESLSEELVIYTPVGDVLLVNEVLRDGEVLVEGLCMLVDLLPLELQALDVILGMNFLF

Query:  THYVSMDCHRKEVTFRKPCLTEVVFRGERKIIPTSWISALKVEMLLRKGCITFLAHVVKVQEEKLKPKDVTAVNEYLDVFPTDLSGLPPDREVEFTIELF
        THY SM+CHRKEVTF+KP  TEV+F+GER IIPTS ISALK E LLRKGC  FLAHVV+VQEEKLKP+DV  VNEYLDVFP DLSGL PDREVEFTIEL 
Subjt:  THYVSMDCHRKEVTFRKPCLTEVVFRGERKIIPTSWISALKVEMLLRKGCITFLAHVVKVQEEKLKPKDVTAVNEYLDVFPTDLSGLPPDREVEFTIELF

Query:  PGTTPISQAPYRMASSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDY------------------------RGAVVFSKIDLRSGY
        PGT PISQAPYRMA SELKELKV+ QELVDKGYIRPSVS WGAP+LFVKKKDGTLRLCIDY                        RGA+VFSKIDLRSGY
Subjt:  PGTTPISQAPYRMASSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDY------------------------RGAVVFSKIDLRSGY

Query:  HQLKVGESDIPKTAFRMRYGHYEFLVMPFGLTNAPDVFMDLMN-----------------------------SEVRI-----------------------
        HQLKV +S+IPKTAFR RYGHYEFLVMPFGLTNAP VFMDL+N                               +RI                       
Subjt:  HQLKVGESDIPKTAFRMRYGHYEFLVMPFGLTNAPDVFMDLMN-----------------------------SEVRI-----------------------

Query:  -----PPVEADGVTVDLHKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQELKKRLVTAPILTLLVTGKE
               V A GV+VD  KVE VVNWERP  ATEV SFLGLAGYYRRFVED S+L LPLTALTRKNA+FEW DKCEQSFQELKKRLVTAPILTL ++GKE
Subjt:  -----PPVEADGVTVDLHKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQELKKRLVTAPILTLLVTGKE

Query:  YVIYCDASRQ
        YVIYCDASRQ
Subjt:  YVIYCDASRQ

KAA0041108.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 1.4e-285; % identity: 61.08)
Query:  GVMPTRTSRRRRQNQDGSQNATQSQSERGSSNPRGQNEARSERFSRSAQEICRLERAGPSDPKKMYGIERLKKLRATVFEGSMDPTDVEVWLNMLEKCFD
        GVMP RT RRRRQNQDG Q  TQ  S   SS    +  A +E+F+R+ QEI R +RA PSDP+K YGIERLKKL ATVFEGS DP D E WLNMLEKCFD
Subjt:  GVMPTRTSRRRRQNQDGSQNATQSQSERGSSNPRGQNEARSERFSRSAQEICRLERAGPSDPKKMYGIERLKKLRATVFEGSMDPTDVEVWLNMLEKCFD

Query:  VMSCPEERKVRLATFLLQKEAEGWWKSILARRSDA--------------------------------------LVEYERKYTELSRCVDVIVASESDRCR
        VM+CPEERKVRLATFLLQKEAEGWWKSILARRSDA                                      + EYERKYTELSR  DVI+ASESDRCR
Subjt:  VMSCPEERKVRLATFLLQKEAEGWWKSILARRSDA--------------------------------------LVEYERKYTELSRCVDVIVASESDRCR

Query:  RFERGLRFEIRTPVTAIANWTNFSQLVETALRVEQSIIEEKAALEPSRGT--------------------------------------------------
        RFERGLRFEIRTPVTAIA WTNFSQLVETALRVEQSI EEK+A+E SRGT                                                  
Subjt:  RFERGLRFEIRTPVTAIANWTNFSQLVETALRVEQSIIEEKAALEPSRGT--------------------------------------------------

Query:  ----PTTNVARPRTGQESVASESRRTPCVSCGKNNRGQCIVGAGA-----------------------------------------EEGSSGARQKGVVD
            P  +  R + GQES+AS  RR PC SCG+N+RGQC+VGAG                                           EG+SGARQKGVV 
Subjt:  ----PTTNVARPRTGQESVASESRRTPCVSCGKNNRGQCIVGAGA-----------------------------------------EEGSSGARQKGVVD

Query:  KPRQKGKVYAMTQQEAEDAPDVITGMVLICNVPAHVLLDPDAMHSFVSSMFLTKLNRMLESLSEELVIYTPVGDVLLVNEVLRDGEVLVEGLCMLVDLLP
        +PRQ+GKVYAMTQQE EDAPDVITG +LICNVPA VL DP A HSFVSS+FLTKLNRMLE LSE L IYTPVGDVLLVNEVLR+ EVLVEG+ +LVDLLP
Subjt:  KPRQKGKVYAMTQQEAEDAPDVITGMVLICNVPAHVLLDPDAMHSFVSSMFLTKLNRMLESLSEELVIYTPVGDVLLVNEVLRDGEVLVEGLCMLVDLLP

Query:  LELQALDVILGMNFLFTHYVSMDCHRKEVTFRKPCLTEVVFRGERKIIPTSWISALKVEMLLRKGCITFLAHVVKVQEEKLKPKDVTAVNEYLDVFPTDL
        LELQ LDVILGM+FLF HY SMDCHRKEV FRKP   EVVFRG RK +  S IS LK E LLRKGC  FLAH+V VQ EKLKP+DV  V E+LDVFP DL
Subjt:  LELQALDVILGMNFLFTHYVSMDCHRKEVTFRKPCLTEVVFRGERKIIPTSWISALKVEMLLRKGCITFLAHVVKVQEEKLKPKDVTAVNEYLDVFPTDL

Query:  SGLPPDREVEFTIELFPGTTPISQAPYRMASSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDY-----------------------
        SGLPPDRE+EFTIEL PGT PISQAPYRMA SELKELK++ QELVDKGYIRPSVSPWGAP+LFVKKKDGTLRLCIDY                       
Subjt:  SGLPPDREVEFTIELFPGTTPISQAPYRMASSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDY-----------------------

Query:  -RGAVVFSKIDLRSGYHQLKVGESDIPKTAFRMRYGHYEFLVMPFGLTNAPDVFMDLMN-----------------------------SEVRI-------
         RGA +FSKIDLRSGYHQLKV ESDI KTAFR RYGHYEF VMPFGLTNAP VFMDLMN                               +RI       
Subjt:  -RGAVVFSKIDLRSGYHQLKVGESDIPKTAFRMRYGHYEFLVMPFGLTNAPDVFMDLMN-----------------------------SEVRI-------

Query:  ---------------------PPVEADGVTVDLHKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQELKK
                               V A GV+VD  KVEAVVNWERP  ATEV SFLGLAGYYRRF+ED SRL LPLTALTRKN KFEWSDKCEQSFQELKK
Subjt:  ---------------------PPVEADGVTVDLHKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQELKK

Query:  RLVTAPILTLLVTGKEYVIYCDASR
        RLVTAPIL L VTGK+YVIYCDASR
Subjt:  RLVTAPILTLLVTGKEYVIYCDASR

KAA0042295.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] (e value: 1.6e-286; % identity: 63.47)
Query:  VMPTRTSRRRRQNQDGSQNATQSQSERGSSNPRGQNEARSERFSRSAQEICRLERAGPSDPKKMYGIERLKKLRATVFEGSMDPTDVEVWLNMLEKCFDV
        VMP RT RRRRQNQDG Q  TQ  S   SS    +  A +E+F+R+ QEI R +RA PSDP+K YGIERLKKL ATVFEGS DP D E WLNMLEKCFDV
Subjt:  VMPTRTSRRRRQNQDGSQNATQSQSERGSSNPRGQNEARSERFSRSAQEICRLERAGPSDPKKMYGIERLKKLRATVFEGSMDPTDVEVWLNMLEKCFDV

Query:  MSCPEERKVRLATFLLQKEAEGWWKSILARRSDA--------------------------------------LVEYERKYTELSRCVDVIVASESDRCRR
        M+CPEERKVRLATFLLQKEAEGWWKSILARRSDA                                      + EYERKYT+LSR  DVI+A ESDRCRR
Subjt:  MSCPEERKVRLATFLLQKEAEGWWKSILARRSDA--------------------------------------LVEYERKYTELSRCVDVIVASESDRCRR

Query:  FERGLRFEIRTPVTAIANWTNFSQLVETALRVEQSIIEEKAALEPSRGT-------------------------------PTTNVARPRTGQESVASESR
        FERGLRFEIRTPVT IA WTNFSQLVETALRVEQSI EEK+A+E SRGT                               P     R + GQES+AS  R
Subjt:  FERGLRFEIRTPVTAIANWTNFSQLVETALRVEQSIIEEKAALEPSRGT-------------------------------PTTNVARPRTGQESVASESR

Query:  RTPCVSCGKNNRGQCIVGAGA-----------------------------------------EEGSSGARQKGVVDKPRQKGKVYAMTQQEAEDAPDVIT
        R PC SCG+N+RGQC+VGAG                                           EG+SGARQKGVV +PRQ+GKVYAMTQQE EDAPDVIT
Subjt:  RTPCVSCGKNNRGQCIVGAGA-----------------------------------------EEGSSGARQKGVVDKPRQKGKVYAMTQQEAEDAPDVIT

Query:  GMVLICNVPAHVLLDPDAMHSFVSSMFLTKLNRMLESLSEELVIYTPVGDVLLVNEVLRDGEVLVEGLCMLVDLLPLELQALDVILGMNFLFTHYVSMDC
        G +LICNVPA VL DP A HSFVSS+FLTKLNRMLE LSE L IYTPVGDVLLVNEVLR+ EVLVEG+ +LVDLLPLELQ LDVILGM+FLF HY SMDC
Subjt:  GMVLICNVPAHVLLDPDAMHSFVSSMFLTKLNRMLESLSEELVIYTPVGDVLLVNEVLRDGEVLVEGLCMLVDLLPLELQALDVILGMNFLFTHYVSMDC

Query:  HRKEVTFRKPCLTEVVFRGERKIIPTSWISALKVEMLLRKGCITFLAHVVKVQEEKLKPKDVTAVNEYLDVFPTDLSGLPPDREVEFTIELFPGTTPISQ
        HRKEV FRKP   EVVFRG RK +  S IS LK E LLRKGC  FLAH+V VQ EKLKP+DV  V E+LDVFP DLSGLPPDRE+EFTIEL PGT PISQ
Subjt:  HRKEVTFRKPCLTEVVFRGERKIIPTSWISALKVEMLLRKGCITFLAHVVKVQEEKLKPKDVTAVNEYLDVFPTDLSGLPPDREVEFTIELFPGTTPISQ

Query:  APYRMASSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDY------------------------RGAVVFSKIDLRSGYHQLKVGES
        APYRMA SELKELK++ QELVDKGYIRPSVSPWGAP+LFVKKKDGTLRLCIDY                        RGA +FSKIDLRSGYHQLKV ES
Subjt:  APYRMASSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDY------------------------RGAVVFSKIDLRSGYHQLKVGES

Query:  DIPKTAFRMRYGHYEFLVMPFGLTNAPDVFMDLMN---------------SEVRIPPVE-----------------ADGVTVDLHKVEAVVNWERPTRAT
        DI KT FR RYGHYEF VMPFGLTNAP VFMDLMN                ++ +  V+                 A GV+VD  KVEAVVNWERP  AT
Subjt:  DIPKTAFRMRYGHYEFLVMPFGLTNAPDVFMDLMN---------------SEVRIPPVE-----------------ADGVTVDLHKVEAVVNWERPTRAT

Query:  EVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQELKKRLVTAPILTLLVTGKEYVIYCDASR
        EV SFLGLAGYYRRF+ED SRL LPLTALTRKN KFEWSDKCEQSFQELKKRLVT PIL L VTGK+ VIYCDASR
Subjt:  EVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQELKKRLVTAPILTLLVTGKEYVIYCDASR

KAA0056684.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] (e value: 2.2e-291; % identity: 58.79)
Query:  VEIELPVPDTLPTSAE----SSKSSSITLFSKSSYVKAAPRGGQRVSLEEDDVHWLHSIFWAKLAGGPGGGVMPTRTSRRRRQNQDGSQNATQSQSERGS
        VEI+ PVPD LP           +S  T  + SS+   + +  +++ ++++ V  LH+IF +K+A     GVMP RT RRRRQNQDG Q  TQ  S   S
Subjt:  VEIELPVPDTLPTSAE----SSKSSSITLFSKSSYVKAAPRGGQRVSLEEDDVHWLHSIFWAKLAGGPGGGVMPTRTSRRRRQNQDGSQNATQSQSERGS

Query:  SNPRGQNEARSERFSRSAQEICRLERAGPSDPKKMYGIERLKKLRATVFEGSMDPTDVEVWLNMLEKCFDVMSCPEERKVRLATFLLQKEAEGWWKSILA
        S    +  A +E+F+R+ QEI R +RA PSDP+K YGIERLKKL ATVFEGS DP D E WLNMLEKCFDVM+CPEERKVRLATFLLQKEAEGWWKSILA
Subjt:  SNPRGQNEARSERFSRSAQEICRLERAGPSDPKKMYGIERLKKLRATVFEGSMDPTDVEVWLNMLEKCFDVMSCPEERKVRLATFLLQKEAEGWWKSILA

Query:  RRSDA--------------------------------------LVEYERKYTELSRCVDVIVASESDRCRRFERGLRFEIRTPVTAIANWTNFSQLVETA
        RRSDA                                      + EYERKYTELSR  DVI+ASESDRCRRFERGLRFEIRTPVTAIA WTNFSQLVETA
Subjt:  RRSDA--------------------------------------LVEYERKYTELSRCVDVIVASESDRCRRFERGLRFEIRTPVTAIANWTNFSQLVETA

Query:  LRVEQSIIEEKAALEPSRGT------------------------------------------------------PTTNVARPRTGQESVASESRRTPCVS
        LRVEQSI EEK+A+E SRGT                                                      P  +  R + GQES+AS  RR PC S
Subjt:  LRVEQSIIEEKAALEPSRGT------------------------------------------------------PTTNVARPRTGQESVASESRRTPCVS

Query:  CGKNNRGQCIVGAGA-----------------------------------------EEGSSGARQKGVVDKPRQKGKVYAMTQQEAEDAPDVITGMVLIC
        CG+N+RGQC+VGAG                                           EG+SGARQKGVV +PRQ+GKVYAMTQQE EDAPDVITG +LIC
Subjt:  CGKNNRGQCIVGAGA-----------------------------------------EEGSSGARQKGVVDKPRQKGKVYAMTQQEAEDAPDVITGMVLIC

Query:  NVPAHVLLDPDAMHSFVSSMFLTKLNRMLESLSEELVIYTPVGDVLLVNEVLRDGEVLVEGLCMLVDLLPLELQALDVILGMNFLFTHYVSMDCHRKEVT
        NVPA VL DP A HSFVSS+FLTKLNRMLE LSE L IYTPVGDVLLVNEVLR+ EVLVEG+ +LVDLLPLELQ LDVILGM+FLF HY SMDCHRKEV 
Subjt:  NVPAHVLLDPDAMHSFVSSMFLTKLNRMLESLSEELVIYTPVGDVLLVNEVLRDGEVLVEGLCMLVDLLPLELQALDVILGMNFLFTHYVSMDCHRKEVT

Query:  FRKPCLTEVVFRGERKIIPTSWISALKVEMLLRKGCITFLAHVVKVQEEKLKPKDVTAVNEYLDVFPTDLSGLPPDREVEFTIELFPGTTPISQAPYRMA
        FRKP   EVVFRG RK +  S IS LK E LLRKGC  FLAH+V VQ EKLKP+DV  V E+LDVFP DLSGLPPDRE+EFTIEL PGT PISQAPYRMA
Subjt:  FRKPCLTEVVFRGERKIIPTSWISALKVEMLLRKGCITFLAHVVKVQEEKLKPKDVTAVNEYLDVFPTDLSGLPPDREVEFTIELFPGTTPISQAPYRMA

Query:  SSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDY------------------------RGAVVFSKIDLRSGYHQLKVGESDIPKTA
         SELKELK++ QELVDKGYIRPSVSPWGAP+LFVKKKDGTLRLCIDY                        RGA +FSKIDLRSGYHQLKV ESDI KTA
Subjt:  SSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDY------------------------RGAVVFSKIDLRSGYHQLKVGESDIPKTA

Query:  FRMRYGHYEFLVMPFGLTNAPDVFMDLMN-----------------------------SEVRI----------------------------PPVEADGVT
        FR RYGHYEF VMPFGLTNAP VFMDLMN                               +RI                              V A GV+
Subjt:  FRMRYGHYEFLVMPFGLTNAPDVFMDLMN-----------------------------SEVRI----------------------------PPVEADGVT

Query:  VDLHKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQELKKRLVTAPILTLLVTGKEYVIYCDASR
        VD  KVEAVVNWERP  ATEV SFLGLAGYYRRF+ED SRL LPLTALTRKN KFEWSDKCEQSFQELKKRLVTAPIL L VTGK+YVIYCDASR
Subjt:  VDLHKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQELKKRLVTAPILTLLVTGKEYVIYCDASR

TYK03091.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 1.6e-289; % identity: 58.32)
Query:  YFVEIELPVPDTLPTSAESSKSSSIT---LFSKSSYVKAAPRGGQRVS------------LEEDDVHWLHSIFWAKLAGGPGGGVMPTRTSRRRRQNQDG
        + VEIELPVPD LPTSAESS+S+S T   L+ +S +V+   +   +              +  DDV WLH+IF AK AGGPGGGV               
Subjt:  YFVEIELPVPDTLPTSAESSKSSSIT---LFSKSSYVKAAPRGGQRVS------------LEEDDVHWLHSIFWAKLAGGPGGGVMPTRTSRRRRQNQDG

Query:  SQNATQSQSERGSSNPRGQNEARSER-FSRSAQEICRLERAGPSDPKKMYGIERLKKLRATVFEGSMDPTDVEVWLNMLEKCFDVMSCPEERKVRLATFL
            T  QS+    +    +  R     S   +EI R E+AGPSD +KMYGIERLKKL ATVFEGS DP D EVWLNMLEKCFDVMSCP+ERKV+LATFL
Subjt:  SQNATQSQSERGSSNPRGQNEARSER-FSRSAQEICRLERAGPSDPKKMYGIERLKKLRATVFEGSMDPTDVEVWLNMLEKCFDVMSCPEERKVRLATFL

Query:  LQKEAEGWWKSILARRSDA--------------------------------------LVEYERKYTELSRCVDVIVASESDRCRRFERGLRFEIRTPVTA
        LQKEAEGWWKSI+ARR+DA                                      + +YERKYTELSR  ++IVASESDRC RFERGLRFEIRTPVTA
Subjt:  LQKEAEGWWKSILARRSDA--------------------------------------LVEYERKYTELSRCVDVIVASESDRCRRFERGLRFEIRTPVTA

Query:  IANWTNFSQLVETALRVEQSIIEEKAALEPSRGTPTT------------------------------------------------------NVARPRTGQ
        IA W NFSQLVETALRV+QSI+EEK+A+E SRG  TT                                                      +VAR RTGQ
Subjt:  IANWTNFSQLVETALRVEQSIIEEKAALEPSRGTPTT------------------------------------------------------NVARPRTGQ

Query:  ESVASESRRTPCVSCGKNNRGQCIVGAG-----------------------------------------AEEGSSGARQKGVVDKPRQKGKVYAMTQQEA
        ESVASES+RTPCVSCGK+++G+C++GAG                                         A E +SGARQKGVV +PRQ+GKVYAMTQQEA
Subjt:  ESVASESRRTPCVSCGKNNRGQCIVGAG-----------------------------------------AEEGSSGARQKGVVDKPRQKGKVYAMTQQEA

Query:  EDAPDVITGMVLICNVPAHVLLDPDAMHSFVSSMFLTKLNRMLESLSEELVIYTPVGDVLLVNEVLRDGEVLVEGLCMLVDLLPLELQALDVILGMNFLF
        EDAPDVIT  +LICNVP  VLLD  A HSFVSSMFLTK+NRMLE L EELVI TPVGDVLLV+EVL D EV+VEGL M VDLLPLELQ LDVILGM+FLF
Subjt:  EDAPDVITGMVLICNVPAHVLLDPDAMHSFVSSMFLTKLNRMLESLSEELVIYTPVGDVLLVNEVLRDGEVLVEGLCMLVDLLPLELQALDVILGMNFLF

Query:  THYVSMDCHRKEVTFRKPCLTEVVFRGERKIIPTSWISALKVEMLLRKGCITFLAHVVKVQEEKLKPKDVTAVNEYLDVFPTDLSGLPPDREVEFTIELF
        THY SM+CHRKEVTF+KP  TEV+F+GER IIPTS ISALK E LLRKGC  FLAHVV+VQEEKLKP+DV  VNEYLDVFP DLSGL PDREVEFTIEL 
Subjt:  THYVSMDCHRKEVTFRKPCLTEVVFRGERKIIPTSWISALKVEMLLRKGCITFLAHVVKVQEEKLKPKDVTAVNEYLDVFPTDLSGLPPDREVEFTIELF

Query:  PGTTPISQAPYRMASSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDY------------------------RGAVVFSKIDLRSGY
        PGT PISQAPYRMA SELKELKV+ QELVDKGYIRPSVS WGAP+LFVKKKDGTLRLCIDY                        RGA+VFSKIDLRSGY
Subjt:  PGTTPISQAPYRMASSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDY------------------------RGAVVFSKIDLRSGY

Query:  HQLKVGESDIPKTAFRMRYGHYEFLVMPFGLTNAPDVFMDLMN-----------------------------SEVRI-----------------------
        HQLKV +S+IPKTAFR RYGHYEFLVMPFGLTNAP VFMDL+N                               +RI                       
Subjt:  HQLKVGESDIPKTAFRMRYGHYEFLVMPFGLTNAPDVFMDLMN-----------------------------SEVRI-----------------------

Query:  -----PPVEADGVTVDLHKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQELKKRLVTAPILTLLVTGKE
               V A GV+VD  KVE VVNWERP  ATEV SFLGLAGYYRRFVED S+L LPLTALTRKNA+FEW DKCEQSFQELKKRLVTAPILTL ++GKE
Subjt:  -----PPVEADGVTVDLHKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQELKKRLVTAPILTLLVTGKE

Query:  YVIYCDASRQ
        YVIYCDASRQ
Subjt:  YVIYCDASRQ
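In the alignments above, the unlabelled middle line marks each column: a letter where query and subject residues are identical, `+` for a conservative substitution, and a space otherwise. The following is a rough sketch of how such a midline could be tallied. The function name `identity_stats` is hypothetical, and the fragment is only the first 20 columns of the KAA0037581.1 alignment, so the result is not comparable to the full-alignment % identity reported for each hit.

```python
from typing import Tuple


def identity_stats(query: str, midline: str) -> Tuple[int, int, int]:
    """Return (identical, conservative, columns) for one alignment row.

    Assumes the BLAST-style convention that the midline repeats the residue
    letter at identical columns and shows '+' at conservative substitutions.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    conservative = midline.count("+")
    return identical, conservative, len(query)


# First 20 columns of the first GenBank alignment shown on this page.
query = "YFVEIELPVPDTLPTSAESS"
midline = "+ VEIELPVPD LPTSAESS"

identical, conservative, columns = identity_stats(query, midline)
print(f"{identical}/{columns} identical, {conservative} conservative")
```

Summing these counts over every row of an alignment, gap columns included, would give a percentage in the same spirit as the page's % identity column, although the exact denominator the database uses is not stated here.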

TrEMBL top hits (columns: e value, % identity, alignment)
A0A5A7T7M6 Reverse transcriptase (e value: 4.9e-289; % identity: 58.22)
Query:  YFVEIELPVPDTLPTSAESSKSSSIT---LFSKSSYVKAAPRGGQRVS------------LEEDDVHWLHSIFWAKLAGGPGGGVMPTRTSRRRRQNQDG
        + VEIELPVPD LPTSAESS+S+S T   L+ +S +V+   +   +              +  DDV WLH+IF AK AGGPGGGV               
Subjt:  YFVEIELPVPDTLPTSAESSKSSSIT---LFSKSSYVKAAPRGGQRVS------------LEEDDVHWLHSIFWAKLAGGPGGGVMPTRTSRRRRQNQDG

Query:  SQNATQSQSERGSSNPRGQNEARSER-FSRSAQEICRLERAGPSDPKKMYGIERLKKLRATVFEGSMDPTDVEVWLNMLEKCFDVMSCPEERKVRLATFL
            T  QS+    +    +  R     S   +EI R E+AGPSD +KMYGIERLKKL ATVFEGS DP D EVWLNMLEKCFDVMSCP+ERKV+LATFL
Subjt:  SQNATQSQSERGSSNPRGQNEARSER-FSRSAQEICRLERAGPSDPKKMYGIERLKKLRATVFEGSMDPTDVEVWLNMLEKCFDVMSCPEERKVRLATFL

Query:  LQKEAEGWWKSILARRSDA--------------------------------------LVEYERKYTELSRCVDVIVASESDRCRRFERGLRFEIRTPVTA
        L KEAEGWWKSI+ARR+DA                                      + +YERKYTELSR  ++IVASESDRC RFERGLRFEIRTPVTA
Subjt:  LQKEAEGWWKSILARRSDA--------------------------------------LVEYERKYTELSRCVDVIVASESDRCRRFERGLRFEIRTPVTA

Query:  IANWTNFSQLVETALRVEQSIIEEKAALEPSRGTPTT------------------------------------------------------NVARPRTGQ
        IA W NFSQLVETALRV+QSI+EEK+A+E SRG  TT                                                      +VAR RTGQ
Subjt:  IANWTNFSQLVETALRVEQSIIEEKAALEPSRGTPTT------------------------------------------------------NVARPRTGQ

Query:  ESVASESRRTPCVSCGKNNRGQCIVGAG-----------------------------------------AEEGSSGARQKGVVDKPRQKGKVYAMTQQEA
        ESVASES+RTPCVSCGK+++G+C++GAG                                         A E +SGARQKGVV +PRQ+GKVYAMTQQEA
Subjt:  ESVASESRRTPCVSCGKNNRGQCIVGAG-----------------------------------------AEEGSSGARQKGVVDKPRQKGKVYAMTQQEA

Query:  EDAPDVITGMVLICNVPAHVLLDPDAMHSFVSSMFLTKLNRMLESLSEELVIYTPVGDVLLVNEVLRDGEVLVEGLCMLVDLLPLELQALDVILGMNFLF
        EDAPDVIT  +LICNVP  VLLD  A HSFVSSMFLTK+NRMLE L EELVI TPVGDVLLV+EVL D EV+VEGL M VDLLPLELQ LDVILGM+FLF
Subjt:  EDAPDVITGMVLICNVPAHVLLDPDAMHSFVSSMFLTKLNRMLESLSEELVIYTPVGDVLLVNEVLRDGEVLVEGLCMLVDLLPLELQALDVILGMNFLF

Query:  THYVSMDCHRKEVTFRKPCLTEVVFRGERKIIPTSWISALKVEMLLRKGCITFLAHVVKVQEEKLKPKDVTAVNEYLDVFPTDLSGLPPDREVEFTIELF
        THY SM+CHRKEVTF+KP  TEV+F+GER IIPTS ISALK E LLRKGC  FLAHVV+VQEEKLKP+DV  VNEYLDVFP DLSGL PDREVEFTIEL 
Subjt:  THYVSMDCHRKEVTFRKPCLTEVVFRGERKIIPTSWISALKVEMLLRKGCITFLAHVVKVQEEKLKPKDVTAVNEYLDVFPTDLSGLPPDREVEFTIELF

Query:  PGTTPISQAPYRMASSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDY------------------------RGAVVFSKIDLRSGY
        PGT PISQAPYRMA SELKELKV+ QELVDKGYIRPSVS WGAP+LFVKKKDGTLRLCIDY                        RGA+VFSKIDLRSGY
Subjt:  PGTTPISQAPYRMASSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDY------------------------RGAVVFSKIDLRSGY

Query:  HQLKVGESDIPKTAFRMRYGHYEFLVMPFGLTNAPDVFMDLMN-----------------------------SEVRI-----------------------
        HQLKV +S+IPKTAFR RYGHYEFLVMPFGLTNAP VFMDL+N                               +RI                       
Subjt:  HQLKVGESDIPKTAFRMRYGHYEFLVMPFGLTNAPDVFMDLMN-----------------------------SEVRI-----------------------

Query:  -----PPVEADGVTVDLHKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQELKKRLVTAPILTLLVTGKE
               V A GV+VD  KVE VVNWERP  ATEV SFLGLAGYYRRFVED S+L LPLTALTRKNA+FEW DKCEQSFQELKKRLVTAPILTL ++GKE
Subjt:  -----PPVEADGVTVDLHKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQELKKRLVTAPILTLLVTGKE

Query:  YVIYCDASRQ
        YVIYCDASRQ
Subjt:  YVIYCDASRQ

A0A5A7TDR2 Reverse transcriptase (e value: 6.6e-286; % identity: 61.08)
Query:  GVMPTRTSRRRRQNQDGSQNATQSQSERGSSNPRGQNEARSERFSRSAQEICRLERAGPSDPKKMYGIERLKKLRATVFEGSMDPTDVEVWLNMLEKCFD
        GVMP RT RRRRQNQDG Q  TQ  S   SS    +  A +E+F+R+ QEI R +RA PSDP+K YGIERLKKL ATVFEGS DP D E WLNMLEKCFD
Subjt:  GVMPTRTSRRRRQNQDGSQNATQSQSERGSSNPRGQNEARSERFSRSAQEICRLERAGPSDPKKMYGIERLKKLRATVFEGSMDPTDVEVWLNMLEKCFD

Query:  VMSCPEERKVRLATFLLQKEAEGWWKSILARRSDA--------------------------------------LVEYERKYTELSRCVDVIVASESDRCR
        VM+CPEERKVRLATFLLQKEAEGWWKSILARRSDA                                      + EYERKYTELSR  DVI+ASESDRCR
Subjt:  VMSCPEERKVRLATFLLQKEAEGWWKSILARRSDA--------------------------------------LVEYERKYTELSRCVDVIVASESDRCR

Query:  RFERGLRFEIRTPVTAIANWTNFSQLVETALRVEQSIIEEKAALEPSRGT--------------------------------------------------
        RFERGLRFEIRTPVTAIA WTNFSQLVETALRVEQSI EEK+A+E SRGT                                                  
Subjt:  RFERGLRFEIRTPVTAIANWTNFSQLVETALRVEQSIIEEKAALEPSRGT--------------------------------------------------

Query:  ----PTTNVARPRTGQESVASESRRTPCVSCGKNNRGQCIVGAGA-----------------------------------------EEGSSGARQKGVVD
            P  +  R + GQES+AS  RR PC SCG+N+RGQC+VGAG                                           EG+SGARQKGVV 
Subjt:  ----PTTNVARPRTGQESVASESRRTPCVSCGKNNRGQCIVGAGA-----------------------------------------EEGSSGARQKGVVD

Query:  KPRQKGKVYAMTQQEAEDAPDVITGMVLICNVPAHVLLDPDAMHSFVSSMFLTKLNRMLESLSEELVIYTPVGDVLLVNEVLRDGEVLVEGLCMLVDLLP
        +PRQ+GKVYAMTQQE EDAPDVITG +LICNVPA VL DP A HSFVSS+FLTKLNRMLE LSE L IYTPVGDVLLVNEVLR+ EVLVEG+ +LVDLLP
Subjt:  KPRQKGKVYAMTQQEAEDAPDVITGMVLICNVPAHVLLDPDAMHSFVSSMFLTKLNRMLESLSEELVIYTPVGDVLLVNEVLRDGEVLVEGLCMLVDLLP

Query:  LELQALDVILGMNFLFTHYVSMDCHRKEVTFRKPCLTEVVFRGERKIIPTSWISALKVEMLLRKGCITFLAHVVKVQEEKLKPKDVTAVNEYLDVFPTDL
        LELQ LDVILGM+FLF HY SMDCHRKEV FRKP   EVVFRG RK +  S IS LK E LLRKGC  FLAH+V VQ EKLKP+DV  V E+LDVFP DL
Subjt:  LELQALDVILGMNFLFTHYVSMDCHRKEVTFRKPCLTEVVFRGERKIIPTSWISALKVEMLLRKGCITFLAHVVKVQEEKLKPKDVTAVNEYLDVFPTDL

Query:  SGLPPDREVEFTIELFPGTTPISQAPYRMASSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDY-----------------------
        SGLPPDRE+EFTIEL PGT PISQAPYRMA SELKELK++ QELVDKGYIRPSVSPWGAP+LFVKKKDGTLRLCIDY                       
Subjt:  SGLPPDREVEFTIELFPGTTPISQAPYRMASSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDY-----------------------

Query:  -RGAVVFSKIDLRSGYHQLKVGESDIPKTAFRMRYGHYEFLVMPFGLTNAPDVFMDLMN-----------------------------SEVRI-------
         RGA +FSKIDLRSGYHQLKV ESDI KTAFR RYGHYEF VMPFGLTNAP VFMDLMN                               +RI       
Subjt:  -RGAVVFSKIDLRSGYHQLKVGESDIPKTAFRMRYGHYEFLVMPFGLTNAPDVFMDLMN-----------------------------SEVRI-------

Query:  ---------------------PPVEADGVTVDLHKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQELKK
                               V A GV+VD  KVEAVVNWERP  ATEV SFLGLAGYYRRF+ED SRL LPLTALTRKN KFEWSDKCEQSFQELKK
Subjt:  ---------------------PPVEADGVTVDLHKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQELKK

Query:  RLVTAPILTLLVTGKEYVIYCDASR
        RLVTAPIL L VTGK+YVIYCDASR
Subjt:  RLVTAPILTLLVTGKEYVIYCDASR

A0A5A7TLH7 Reverse transcriptase (e value: 7.8e-287; % identity: 63.47)
Query:  VMPTRTSRRRRQNQDGSQNATQSQSERGSSNPRGQNEARSERFSRSAQEICRLERAGPSDPKKMYGIERLKKLRATVFEGSMDPTDVEVWLNMLEKCFDV
        VMP RT RRRRQNQDG Q  TQ  S   SS    +  A +E+F+R+ QEI R +RA PSDP+K YGIERLKKL ATVFEGS DP D E WLNMLEKCFDV
Subjt:  VMPTRTSRRRRQNQDGSQNATQSQSERGSSNPRGQNEARSERFSRSAQEICRLERAGPSDPKKMYGIERLKKLRATVFEGSMDPTDVEVWLNMLEKCFDV

Query:  MSCPEERKVRLATFLLQKEAEGWWKSILARRSDA--------------------------------------LVEYERKYTELSRCVDVIVASESDRCRR
        M+CPEERKVRLATFLLQKEAEGWWKSILARRSDA                                      + EYERKYT+LSR  DVI+A ESDRCRR
Subjt:  MSCPEERKVRLATFLLQKEAEGWWKSILARRSDA--------------------------------------LVEYERKYTELSRCVDVIVASESDRCRR

Query:  FERGLRFEIRTPVTAIANWTNFSQLVETALRVEQSIIEEKAALEPSRGT-------------------------------PTTNVARPRTGQESVASESR
        FERGLRFEIRTPVT IA WTNFSQLVETALRVEQSI EEK+A+E SRGT                               P     R + GQES+AS  R
Subjt:  FERGLRFEIRTPVTAIANWTNFSQLVETALRVEQSIIEEKAALEPSRGT-------------------------------PTTNVARPRTGQESVASESR

Query:  RTPCVSCGKNNRGQCIVGAGA-----------------------------------------EEGSSGARQKGVVDKPRQKGKVYAMTQQEAEDAPDVIT
        R PC SCG+N+RGQC+VGAG                                           EG+SGARQKGVV +PRQ+GKVYAMTQQE EDAPDVIT
Subjt:  RTPCVSCGKNNRGQCIVGAGA-----------------------------------------EEGSSGARQKGVVDKPRQKGKVYAMTQQEAEDAPDVIT

Query:  GMVLICNVPAHVLLDPDAMHSFVSSMFLTKLNRMLESLSEELVIYTPVGDVLLVNEVLRDGEVLVEGLCMLVDLLPLELQALDVILGMNFLFTHYVSMDC
        G +LICNVPA VL DP A HSFVSS+FLTKLNRMLE LSE L IYTPVGDVLLVNEVLR+ EVLVEG+ +LVDLLPLELQ LDVILGM+FLF HY SMDC
Subjt:  GMVLICNVPAHVLLDPDAMHSFVSSMFLTKLNRMLESLSEELVIYTPVGDVLLVNEVLRDGEVLVEGLCMLVDLLPLELQALDVILGMNFLFTHYVSMDC

Query:  HRKEVTFRKPCLTEVVFRGERKIIPTSWISALKVEMLLRKGCITFLAHVVKVQEEKLKPKDVTAVNEYLDVFPTDLSGLPPDREVEFTIELFPGTTPISQ
        HRKEV FRKP   EVVFRG RK +  S IS LK E LLRKGC  FLAH+V VQ EKLKP+DV  V E+LDVFP DLSGLPPDRE+EFTIEL PGT PISQ
Subjt:  HRKEVTFRKPCLTEVVFRGERKIIPTSWISALKVEMLLRKGCITFLAHVVKVQEEKLKPKDVTAVNEYLDVFPTDLSGLPPDREVEFTIELFPGTTPISQ

Query:  APYRMASSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDY------------------------RGAVVFSKIDLRSGYHQLKVGES
        APYRMA SELKELK++ QELVDKGYIRPSVSPWGAP+LFVKKKDGTLRLCIDY                        RGA +FSKIDLRSGYHQLKV ES
Subjt:  APYRMASSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDY------------------------RGAVVFSKIDLRSGYHQLKVGES

Query:  DIPKTAFRMRYGHYEFLVMPFGLTNAPDVFMDLMN---------------SEVRIPPVE-----------------ADGVTVDLHKVEAVVNWERPTRAT
        DI KT FR RYGHYEF VMPFGLTNAP VFMDLMN                ++ +  V+                 A GV+VD  KVEAVVNWERP  AT
Subjt:  DIPKTAFRMRYGHYEFLVMPFGLTNAPDVFMDLMN---------------SEVRIPPVE-----------------ADGVTVDLHKVEAVVNWERPTRAT

Query:  EVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQELKKRLVTAPILTLLVTGKEYVIYCDASR
        EV SFLGLAGYYRRF+ED SRL LPLTALTRKN KFEWSDKCEQSFQELKKRLVT PIL L VTGK+ VIYCDASR
Subjt:  EVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQELKKRLVTAPILTLLVTGKEYVIYCDASR

A0A5A7UNA3 Reverse transcriptase (e value: 1.1e-291; % identity: 58.79)
Query:  VEIELPVPDTLPTSAE----SSKSSSITLFSKSSYVKAAPRGGQRVSLEEDDVHWLHSIFWAKLAGGPGGGVMPTRTSRRRRQNQDGSQNATQSQSERGS
        VEI+ PVPD LP           +S  T  + SS+   + +  +++ ++++ V  LH+IF +K+A     GVMP RT RRRRQNQDG Q  TQ  S   S
Subjt:  VEIELPVPDTLPTSAE----SSKSSSITLFSKSSYVKAAPRGGQRVSLEEDDVHWLHSIFWAKLAGGPGGGVMPTRTSRRRRQNQDGSQNATQSQSERGS

Query:  SNPRGQNEARSERFSRSAQEICRLERAGPSDPKKMYGIERLKKLRATVFEGSMDPTDVEVWLNMLEKCFDVMSCPEERKVRLATFLLQKEAEGWWKSILA
        S    +  A +E+F+R+ QEI R +RA PSDP+K YGIERLKKL ATVFEGS DP D E WLNMLEKCFDVM+CPEERKVRLATFLLQKEAEGWWKSILA
Subjt:  SNPRGQNEARSERFSRSAQEICRLERAGPSDPKKMYGIERLKKLRATVFEGSMDPTDVEVWLNMLEKCFDVMSCPEERKVRLATFLLQKEAEGWWKSILA

Query:  RRSDA--------------------------------------LVEYERKYTELSRCVDVIVASESDRCRRFERGLRFEIRTPVTAIANWTNFSQLVETA
        RRSDA                                      + EYERKYTELSR  DVI+ASESDRCRRFERGLRFEIRTPVTAIA WTNFSQLVETA
Subjt:  RRSDA--------------------------------------LVEYERKYTELSRCVDVIVASESDRCRRFERGLRFEIRTPVTAIANWTNFSQLVETA

Query:  LRVEQSIIEEKAALEPSRGT------------------------------------------------------PTTNVARPRTGQESVASESRRTPCVS
        LRVEQSI EEK+A+E SRGT                                                      P  +  R + GQES+AS  RR PC S
Subjt:  LRVEQSIIEEKAALEPSRGT------------------------------------------------------PTTNVARPRTGQESVASESRRTPCVS

Query:  CGKNNRGQCIVGAGA-----------------------------------------EEGSSGARQKGVVDKPRQKGKVYAMTQQEAEDAPDVITGMVLIC
        CG+N+RGQC+VGAG                                           EG+SGARQKGVV +PRQ+GKVYAMTQQE EDAPDVITG +LIC
Subjt:  CGKNNRGQCIVGAGA-----------------------------------------EEGSSGARQKGVVDKPRQKGKVYAMTQQEAEDAPDVITGMVLIC

Query:  NVPAHVLLDPDAMHSFVSSMFLTKLNRMLESLSEELVIYTPVGDVLLVNEVLRDGEVLVEGLCMLVDLLPLELQALDVILGMNFLFTHYVSMDCHRKEVT
        NVPA VL DP A HSFVSS+FLTKLNRMLE LSE L IYTPVGDVLLVNEVLR+ EVLVEG+ +LVDLLPLELQ LDVILGM+FLF HY SMDCHRKEV 
Subjt:  NVPAHVLLDPDAMHSFVSSMFLTKLNRMLESLSEELVIYTPVGDVLLVNEVLRDGEVLVEGLCMLVDLLPLELQALDVILGMNFLFTHYVSMDCHRKEVT

Query:  FRKPCLTEVVFRGERKIIPTSWISALKVEMLLRKGCITFLAHVVKVQEEKLKPKDVTAVNEYLDVFPTDLSGLPPDREVEFTIELFPGTTPISQAPYRMA
        FRKP   EVVFRG RK +  S IS LK E LLRKGC  FLAH+V VQ EKLKP+DV  V E+LDVFP DLSGLPPDRE+EFTIEL PGT PISQAPYRMA
Subjt:  FRKPCLTEVVFRGERKIIPTSWISALKVEMLLRKGCITFLAHVVKVQEEKLKPKDVTAVNEYLDVFPTDLSGLPPDREVEFTIELFPGTTPISQAPYRMA

Query:  SSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDY------------------------RGAVVFSKIDLRSGYHQLKVGESDIPKTA
         SELKELK++ QELVDKGYIRPSVSPWGAP+LFVKKKDGTLRLCIDY                        RGA +FSKIDLRSGYHQLKV ESDI KTA
Subjt:  SSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDY------------------------RGAVVFSKIDLRSGYHQLKVGESDIPKTA

Query:  FRMRYGHYEFLVMPFGLTNAPDVFMDLMN-----------------------------SEVRI----------------------------PPVEADGVT
        FR RYGHYEF VMPFGLTNAP VFMDLMN                               +RI                              V A GV+
Subjt:  FRMRYGHYEFLVMPFGLTNAPDVFMDLMN-----------------------------SEVRI----------------------------PPVEADGVT

Query:  VDLHKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQELKKRLVTAPILTLLVTGKEYVIYCDASR
        VD  KVEAVVNWERP  ATEV SFLGLAGYYRRF+ED SRL LPLTALTRKN KFEWSDKCEQSFQELKKRLVTAPIL L VTGK+YVIYCDASR
Subjt:  VDLHKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQELKKRLVTAPILTLLVTGKEYVIYCDASR

A0A5D3BTP3 Reverse transcriptase (e value: 7.5e-290; % identity: 58.32)
Query:  YFVEIELPVPDTLPTSAESSKSSSIT---LFSKSSYVKAAPRGGQRVS------------LEEDDVHWLHSIFWAKLAGGPGGGVMPTRTSRRRRQNQDG
        + VEIELPVPD LPTSAESS+S+S T   L+ +S +V+   +   +              +  DDV WLH+IF AK AGGPGGGV               
Subjt:  YFVEIELPVPDTLPTSAESSKSSSIT---LFSKSSYVKAAPRGGQRVS------------LEEDDVHWLHSIFWAKLAGGPGGGVMPTRTSRRRRQNQDG

Query:  SQNATQSQSERGSSNPRGQNEARSER-FSRSAQEICRLERAGPSDPKKMYGIERLKKLRATVFEGSMDPTDVEVWLNMLEKCFDVMSCPEERKVRLATFL
            T  QS+    +    +  R     S   +EI R E+AGPSD +KMYGIERLKKL ATVFEGS DP D EVWLNMLEKCFDVMSCP+ERKV+LATFL
Subjt:  SQNATQSQSERGSSNPRGQNEARSER-FSRSAQEICRLERAGPSDPKKMYGIERLKKLRATVFEGSMDPTDVEVWLNMLEKCFDVMSCPEERKVRLATFL

Query:  LQKEAEGWWKSILARRSDA--------------------------------------LVEYERKYTELSRCVDVIVASESDRCRRFERGLRFEIRTPVTA
        LQKEAEGWWKSI+ARR+DA                                      + +YERKYTELSR  ++IVASESDRC RFERGLRFEIRTPVTA
Subjt:  LQKEAEGWWKSILARRSDA--------------------------------------LVEYERKYTELSRCVDVIVASESDRCRRFERGLRFEIRTPVTA

Query:  IANWTNFSQLVETALRVEQSIIEEKAALEPSRGTPTT------------------------------------------------------NVARPRTGQ
        IA W NFSQLVETALRV+QSI+EEK+A+E SRG  TT                                                      +VAR RTGQ
Subjt:  IANWTNFSQLVETALRVEQSIIEEKAALEPSRGTPTT------------------------------------------------------NVARPRTGQ

Query:  ESVASESRRTPCVSCGKNNRGQCIVGAG-----------------------------------------AEEGSSGARQKGVVDKPRQKGKVYAMTQQEA
        ESVASES+RTPCVSCGK+++G+C++GAG                                         A E +SGARQKGVV +PRQ+GKVYAMTQQEA
Subjt:  ESVASESRRTPCVSCGKNNRGQCIVGAG-----------------------------------------AEEGSSGARQKGVVDKPRQKGKVYAMTQQEA

Query:  EDAPDVITGMVLICNVPAHVLLDPDAMHSFVSSMFLTKLNRMLESLSEELVIYTPVGDVLLVNEVLRDGEVLVEGLCMLVDLLPLELQALDVILGMNFLF
        EDAPDVIT  +LICNVP  VLLD  A HSFVSSMFLTK+NRMLE L EELVI TPVGDVLLV+EVL D EV+VEGL M VDLLPLELQ LDVILGM+FLF
Subjt:  EDAPDVITGMVLICNVPAHVLLDPDAMHSFVSSMFLTKLNRMLESLSEELVIYTPVGDVLLVNEVLRDGEVLVEGLCMLVDLLPLELQALDVILGMNFLF

Query:  THYVSMDCHRKEVTFRKPCLTEVVFRGERKIIPTSWISALKVEMLLRKGCITFLAHVVKVQEEKLKPKDVTAVNEYLDVFPTDLSGLPPDREVEFTIELF
        THY SM+CHRKEVTF+KP  TEV+F+GER IIPTS ISALK E LLRKGC  FLAHVV+VQEEKLKP+DV  VNEYLDVFP DLSGL PDREVEFTIEL 
Subjt:  THYVSMDCHRKEVTFRKPCLTEVVFRGERKIIPTSWISALKVEMLLRKGCITFLAHVVKVQEEKLKPKDVTAVNEYLDVFPTDLSGLPPDREVEFTIELF

Query:  PGTTPISQAPYRMASSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDY------------------------RGAVVFSKIDLRSGY
        PGT PISQAPYRMA SELKELKV+ QELVDKGYIRPSVS WGAP+LFVKKKDGTLRLCIDY                        RGA+VFSKIDLRSGY
Subjt:  PGTTPISQAPYRMASSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDY------------------------RGAVVFSKIDLRSGY

Query:  HQLKVGESDIPKTAFRMRYGHYEFLVMPFGLTNAPDVFMDLMN-----------------------------SEVRI-----------------------
        HQLKV +S+IPKTAFR RYGHYEFLVMPFGLTNAP VFMDL+N                               +RI                       
Subjt:  HQLKVGESDIPKTAFRMRYGHYEFLVMPFGLTNAPDVFMDLMN-----------------------------SEVRI-----------------------

Query:  -----PPVEADGVTVDLHKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQELKKRLVTAPILTLLVTGKE
               V A GV+VD  KVE VVNWERP  ATEV SFLGLAGYYRRFVED S+L LPLTALTRKNA+FEW DKCEQSFQELKKRLVTAPILTL ++GKE
Subjt:  -----PPVEADGVTVDLHKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQELKKRLVTAPILTLLVTGKE

Query:  YVIYCDASRQ
        YVIYCDASRQ
Subjt:  YVIYCDASRQ

SwissProt top hits    e value    %identity    Alignment
P0CT41 Transposon Tf2-12 polyprotein    2.7e-26    24.65
Query:  LAHVVKVQEEKLKPKDVTAVNEYLDV-FPTDLSGLP-PDREVEFTIELFPGTTPISQAPYRMASSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKK
        L+ + KV     +P+      E+ D+   T+   LP P + +EF +EL      +    Y +   +++ +  E  + +  G IR S +    P++FV KK
Subjt:  LAHVVKVQEEKLKPKDVTAVNEYLDV-FPTDLSGLP-PDREVEFTIELFPGTTPISQAPYRMASSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKK

Query:  DGTLRLCIDYR------------------------GAVVFSKIDLRSGYHQLKVGESDIPKTAFRMRYGHYEFLVMPFGLTNAP----------------
        +GTLR+ +DY+                        G+ +F+K+DL+S YH ++V + D  K AFR   G +E+LVMP+G++ AP                
Subjt:  DGTLRLCIDYR------------------------GAVVFSKIDLRSGYHQLKVGESDIPKTAFRMRYGHYEFLVMPFGLTNAP----------------

Query:  -------------------------DVFMDLMNSEVRIPPVEAD----------------GVTVDLHKVEAVVNWERPTRATEVCSFLGLAGYYRRFVED
                                 DV   L N+ + I   + +                G T     ++ V+ W++P    E+  FLG   Y R+F+  
Subjt:  -------------------------DVFMDLMNSEVRIPPVEAD----------------GVTVDLHKVEAVVNWERPTRATEVCSFLGLAGYYRRFVED

Query:  LSRLLLPLTALTRKNAKFEWSDKCEQSFQELKKRLVTAPILTLLVTGKEYVIYCDAS
         S+L  PL  L +K+ +++W+    Q+ + +K+ LV+ P+L      K+ ++  DAS
Subjt:  LSRLLLPLTALTRKNAKFEWSDKCEQSFQELKKRLVTAPILTLLVTGKEYVIYCDAS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94    2.9e-36    44.25
Query:  MRRIPYTSAVGSLMYVMLCTRPNVCYVVGIASRYQSNPGLDHWTIVKGIIKYLRRTRDYMLVFRAEELVLTVYTDYDFQSDKDSRKSTLGSVFTLNGGVV
        M ++PY+SAVGSLMY M+CTRP++ + VG+ SR+  NPG +HW  VK I++YLR T    L F   + +L  YTD D   D D+RKS+ G +FT +GG +
Subjt:  MRRIPYTSAVGSLMYVMLCTRPNVCYVVGIASRYQSNPGLDHWTIVKGIIKYLRRTRDYMLVFRAEELVLTVYTDYDFQSDKDSRKSTLGSVFTLNGGVV

Query:  VWRSIKQGCIADSTMEAKYVVACEAAKEAVWLRKFLTDLEVVPNMNLLITLYFDNSGAIANSKEPRSHKREHEI
         W+S  Q C+A ST EA+Y+ A E  KE +WL++FL +L +         +Y D+  AI  SK    H R   I
Subjt:  VWRSIKQGCIADSTMEAKYVVACEAAKEAVWLRKFLTDLEVVPNMNLLITLYFDNSGAIANSKEPRSHKREHEI

P20825 Retrovirus-related Pol polyprotein from transposon 297    7.1e-27    28.53
Query:  TPISQAPYRMASSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKD-----GTLRLCIDYR------------------------GAVVFSKIDLRS
        +PI    Y +A +   E++ + QE++++G IR S SP+ +P   V KK         R+ IDYR                            F+ IDL  
Subjt:  TPISQAPYRMASSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKD-----GTLRLCIDYR------------------------GAVVFSKIDLRS

Query:  GYHQLKVGESDIPKTAFRMRYGHYEFLVMPFGLTNAPDVFMDLMNSEVRI--------------------------------------------------
        G+HQ+++ E  I KTAF  + GHYE+L MPFGL NAP  F   MN+ +R                                                   
Subjt:  GYHQLKVGESDIPKTAFRMRYGHYEFLVMPFGLTNAPDVFMDLMNSEVRI--------------------------------------------------

Query:  -------PPVEADGVTVDLHKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCE--QSFQELKKRLVTAPILTLLV
                 V  DG+  +  KV+A+V++  PT+  E+ +FLGL GYYR+F+ + + +  P+T+  +K  K + + K E  ++F++LK  ++  PIL L  
Subjt:  -------PPVEADGVTVDLHKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCE--QSFQELKKRLVTAPILTLLV

Query:  TGKEYVIYCDAS
          K++V+  DAS
Subjt:  TGKEYVIYCDAS

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein    3.4e-29    27.83
Query:  PTDLSGLPPDREVEFTIELFPGTTPISQAPYRMASSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYR------------------
        P D++ +P    V+  IE+ PG       PY +     +E+    Q+L+D  +I PS SP  +P++ V KKDGT RLC+DYR                  
Subjt:  PTDLSGLPPDREVEFTIELFPGTTPISQAPYRMASSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYR------------------

Query:  ------GAVVFSKIDLRSGYHQLKVGESDIPKTAFRMRYGHYEFLVMPFGLTNAP--------DVFMDLMNSEVRIPPV---------------------
               A +F+ +DL SGYHQ+ +   D  KTAF    G YE+ VMPFGL NAP        D F DL    V +  +                     
Subjt:  ------GAVVFSKIDLRSGYHQLKVGESDIPKTAFRMRYGHYEFLVMPFGLTNAP--------DVFMDLMNSEVRIPPV---------------------

Query:  -------------------EADGVTVDL-------HKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQEL
                           E  G ++ +       HK  A+ ++  P    +   FLG+  YYRRF+ + S++  P+        K +W++K +++ ++L
Subjt:  -------------------EADGVTVDL-------HKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQEL

Query:  KKRLVTAPILTLLVTGKEYVIYCDASR
        K  L  +P+L        Y +  DAS+
Subjt:  KKRLVTAPILTLLVTGKEYVIYCDASR

Q99315 Transposon Ty3-G Gag-Pol polyprotein    5.8e-29    27.83
Query:  PTDLSGLPPDREVEFTIELFPGTTPISQAPYRMASSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYR------------------
        P D++ +P    V+  IE+ PG       PY +     +E+    Q+L+D  +I PS SP  +P++ V KKDGT RLC+DYR                  
Subjt:  PTDLSGLPPDREVEFTIELFPGTTPISQAPYRMASSELKELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYR------------------

Query:  ------GAVVFSKIDLRSGYHQLKVGESDIPKTAFRMRYGHYEFLVMPFGLTNAP--------DVFMDLMNSEVRIPPV---------------------
               A +F+ +DL SGYHQ+ +   D  KTAF    G YE+ VMPFGL NAP        D F DL    V +  +                     
Subjt:  ------GAVVFSKIDLRSGYHQLKVGESDIPKTAFRMRYGHYEFLVMPFGLTNAP--------DVFMDLMNSEVRIPPV---------------------

Query:  -------------------EADGVTVDL-------HKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQEL
                           E  G ++ +       HK  A+ ++  P    +   FLG+  YYRRF+ + S++  P+        K +W++K +++  +L
Subjt:  -------------------EADGVTVDL-------HKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQEL

Query:  KKRLVTAPILTLLVTGKEYVIYCDASR
        K  L  +P+L        Y +  DAS+
Subjt:  KKRLVTAPILTLLVTGKEYVIYCDASR

Arabidopsis top hits    e value    %identity    Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    4.4e-16    31.76
Query:  YTSAVGSLMYVMLCTRPNVCYVVGIASRYQSNPGLDHWTIVKGIIKYLRRTRDYMLVFRAE-ELVLTVYTDYDFQSDKDSRKSTLGSVFTLNGGVVVWRS
        Y   +G LMY+ + TR ++ + V   S++   P L H   V  I+ Y++ T    L + ++ E+ L V++D  FQS KD+R+ST G    L   ++ W+S
Subjt:  YTSAVGSLMYVMLCTRPNVCYVVGIASRYQSNPGLDHWTIVKGIIKYLRRTRDYMLVFRAE-ELVLTVYTDYDFQSDKDSRKSTLGSVFTLNGGVVVWRS

Query:  IKQGCIADSTMEAKYVVACEAAKEAVWLRKFLTDLEVVPNMNLLITLYFDNSGAIANSKEPRSHKREHEI
         KQ  ++ S+ EA+Y     A  E +WL +F  +L++  +   L  L+ DN+ AI  +     H+R   I
Subjt:  IKQGCIADSTMEAKYVVACEAAKEAVWLRKFLTDLEVVPNMNLLITLYFDNSGAIANSKEPRSHKREHEI

ATMG00810.1 DNA/RNA polymerases superfamily protein    1.5e-08    25.98
Query:  YTSAVGSLMYVMLCTRPNVCYVVGIASRYQSNPGLDHWTIVKGIIKYLRRTRDY-MLVFRAEELVLTVYTDYDFQSDKDSRKSTLGSVFTLNGGVVVWRS
        + S VG+L Y+ L TRP++ Y V I  +    P L  + ++K +++Y++ T  + + + +  +L +  + D D+     +R+ST G    L   ++ W +
Subjt:  YTSAVGSLMYVMLCTRPNVCYVVGIASRYQSNPGLDHWTIVKGIIKYLRRTRDY-MLVFRAEELVLTVYTDYDFQSDKDSRKSTLGSVFTLNGGVVVWRS

Query:  IKQGCIADSTMEAKYVVACEAAKEAVW
         +Q  ++ S+ E +Y      A E  W
Subjt:  IKQGCIADSTMEAKYVVACEAAKEAVW

ATMG00860.1 DNA/RNA polymerases superfamily protein    1.4e-14    41.38
Query:  VEADGVTVDLHKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQELKKRLVTAPILTL
        +  +GV+ D  K+EA+V W  P   TE+  FLGL GYYRRFV++  +++ PLT L +KN+  +W++    +F+ LK  + T P+L L
Subjt:  VEADGVTVDLHKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQELKKRLVTAPILTL
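
The %identity column can be checked against an alignment's midline when the whole alignment fits in one block, as it does for ATMG00860.1. The sketch below assumes the usual BLAST pairwise convention (a letter on the midline marks an identical residue, `+` marks a conservative substitution) and takes the alignment length to be the length of the Query line, gaps included. The function name `blast_identity` is illustrative and is not part of any tool referenced on this page.

```python
def blast_identity(query: str, midline: str) -> tuple[int, int, float]:
    """Return (identities, positives, percent identity) for one aligned block.

    Assumes the block is the complete alignment and that `query` and
    `midline` are column-aligned, as in BLAST pairwise text output.
    """
    # Trailing spaces on the midline are often trimmed; pad to the query length.
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)   # letters = exact matches
    positives = identities + midline.count("+")        # '+' = conservative change
    percent = round(100 * identities / len(query), 2)
    return identities, positives, percent


# Query and midline of the ATMG00860.1 hit, copied from the alignment above.
QUERY = ("VEADGVTVDLHKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQ"
         "SFQELKKRLVTAPILTL")
MIDLINE = ("+  +GV+ D  K+EA+V W  P   TE+  FLGL GYYRRFV++  +++ PLT L +KN+  +W++    "
           "+F+ LK  + T P+L L")

identities, positives, percent = blast_identity(QUERY, MIDLINE)
print(identities, len(QUERY), percent)  # 36 87 41.38
```

The result, 36 identities over 87 aligned columns, agrees with the 41.38 reported for this hit. For alignments split across several blocks, the counts would need to be summed over all blocks before dividing.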


Sequences
CDS sequence
ATGAGACGGATTCCCTACACCTCTGCAGTGGGTAGCTTAATGTATGTTATGCTCTGTACGAGGCCTAACGTCTGTTATGTTGTAGGGATTGCGAGCAGATATCAGTCAAA
TCCAGGGTTAGACCACTGGACCATCGTTAAAGGAATCATCAAGTATCTTAGAAGAACGAGGGACTACATGCTCGTGTTTAGGGCTGAAGAGTTAGTTCTCACTGTATACA
CCGACTATGACTTCCAGTCTGACAAGGATTCTAGGAAATCAACGCTAGGGTCAGTGTTCACTCTAAACGGGGGAGTTGTAGTTTGGAGAAGTATAAAGCAAGGATGCATA
GCAGACTCCACCATGGAGGCTAAGTATGTAGTTGCTTGTGAAGCAGCTAAGGAAGCTGTTTGGCTAAGGAAGTTCTTGACTGATTTGGAAGTTGTTCCAAATATGAACTT
GCTAATTACGCTATACTTTGACAACAGTGGGGCTATAGCCAATTCAAAGGAACCTCGCAGCCACAAACGAGAGCACGAAATTTTCAAGCAAAGTAGGAGGATCTCTGGTT
TTTTCTGGTTTTTCGAGCATTCTGGGGTGTACAAGTTGAATCAGAAGCTCTATTTTGTAGAGATCGAGCTCCCGGTGCCTGATACACTGCCAACGTCTGCTGAAAGTTCC
AAATCAAGCTCCATTACGTTATTTTCGAAATCGAGTTATGTCAAGGCTGCACCTCGTGGGGGTCAACGAGTATCGTTAGAGGAGGACGACGTCCATTGGCTTCACTCCAT
CTTTTGGGCTAAGCTAGCAGGTGGTCCGGGAGGGGGAGTCATGCCAACACGTACCAGCAGACGACGCAGGCAGAATCAGGACGGGTCGCAGAATGCTACCCAAAGTCAAT
CTGAAAGGGGATCCAGTAACCCGAGAGGTCAGAACGAGGCCAGGAGTGAGCGATTTTCTAGATCTGCACAGGAGATATGTAGGCTAGAGAGAGCAGGGCCTAGTGATCCG
AAAAAGATGTATGGGATTGAACGGTTGAAGAAGTTAAGAGCCACAGTGTTTGAGGGTTCCATGGATCCAACTGACGTCGAGGTCTGGTTAAATATGTTGGAGAAATGCTT
CGACGTGATGAGTTGTCCTGAGGAGCGAAAAGTCAGATTAGCCACATTTCTTTTGCAGAAGGAGGCGGAGGGATGGTGGAAATCCATTTTAGCCAGGCGCAGTGATGCAC
TGGTCGAGTATGAGAGAAAGTATACCGAGCTTTCACGGTGTGTCGATGTGATTGTGGCATCTGAGAGTGATAGGTGTCGCAGGTTTGAGAGAGGGCTGCGTTTTGAGATA
CGTACGCCAGTTACCGCTATTGCCAATTGGACAAATTTTTCCCAGCTAGTAGAAACTGCCTTACGTGTGGAGCAGAGTATCATAGAGGAAAAGGCGGCATTAGAGCCTAG
TCGTGGAACTCCAACAACTAATGTAGCAAGACCGCGAACGGGTCAAGAGTCCGTTGCTAGTGAATCCAGGAGAACCCCATGTGTAAGTTGCGGCAAGAATAATCGGGGTC
AGTGTATTGTTGGCGCCGGTGCAGAAGAAGGCAGCAGTGGTGCAAGGCAAAAGGGAGTTGTGGACAAACCTAGGCAGAAGGGAAAAGTCTACGCCATGACTCAACAGGAA
GCAGAGGATGCACCAGACGTGATTACTGGTATGGTTTTGATTTGTAATGTACCTGCACATGTTTTATTAGATCCAGACGCTATGCATTCCTTTGTTTCTAGTATGTTCTT
AACTAAGCTAAATAGGATGCTAGAGTCTTTATCTGAGGAGTTAGTCATATACACTCCAGTTGGTGACGTTTTATTAGTTAATGAAGTGTTGCGTGATGGTGAGGTTTTAG
TAGAAGGTCTTTGTATGTTAGTGGATCTTCTTCCCCTAGAGTTGCAGGCGTTGGATGTGATTTTGGGAATGAATTTCTTATTCACTCACTATGTTTCTATGGATTGCCAT
AGGAAGGAAGTGACTTTTAGGAAACCATGTTTGACTGAAGTTGTTTTCAGGGGTGAGAGGAAGATTATTCCTACGAGTTGGATTTCAGCTCTGAAAGTTGAGATGTTGTT
AAGGAAGGGTTGCATAACGTTTCTTGCACACGTAGTAAAAGTGCAAGAAGAAAAACTGAAACCAAAAGATGTTACTGCGGTGAATGAATATCTTGATGTTTTTCCAACTG
ATCTATCGGGTTTGCCACCTGATAGAGAGGTGGAGTTCACTATTGAATTGTTCCCAGGAACAACACCTATTTCACAGGCACCGTACAGAATGGCTTCGAGCGAGCTTAAG
GAGCTGAAGGTGGAGCCGCAAGAACTAGTTGATAAGGGATACATCAGGCCTAGTGTATCACCTTGGGGAGCTCCAATGTTATTCGTGAAGAAGAAAGATGGTACCCTGAG
ATTATGCATTGATTACAGGGGAGCAGTAGTGTTCTCTAAGATTGATCTGAGGTCAGGATACCACCAGTTGAAGGTTGGGGAATCAGATATTCCTAAGACAGCATTCAGGA
TGAGGTATGGGCACTATGAGTTTTTAGTGATGCCATTTGGTTTAACGAATGCGCCAGACGTTTTCATGGACCTCATGAACTCTGAGGTCAGGATACCACCAGTTGAAGCG
GACGGAGTTACTGTTGATCTGCATAAAGTGGAAGCTGTTGTCAATTGGGAAAGACCAACTAGAGCAACAGAGGTATGTAGTTTCCTAGGCCTGGCCGGATACTACAGACG
TTTTGTTGAGGATTTATCACGATTATTATTACCCTTGACAGCTTTGACAAGGAAGAATGCTAAGTTTGAGTGGTCGGATAAATGCGAACAGAGTTTCCAGGAACTGAAGA
AGAGATTAGTGACAGCGCCTATTCTGACACTTCTTGTAACAGGGAAGGAGTATGTGATCTATTGTGACGCTTCGAGGCAATGA
mRNA sequence
ATGAGACGGATTCCCTACACCTCTGCAGTGGGTAGCTTAATGTATGTTATGCTCTGTACGAGGCCTAACGTCTGTTATGTTGTAGGGATTGCGAGCAGATATCAGTCAAA
TCCAGGGTTAGACCACTGGACCATCGTTAAAGGAATCATCAAGTATCTTAGAAGAACGAGGGACTACATGCTCGTGTTTAGGGCTGAAGAGTTAGTTCTCACTGTATACA
CCGACTATGACTTCCAGTCTGACAAGGATTCTAGGAAATCAACGCTAGGGTCAGTGTTCACTCTAAACGGGGGAGTTGTAGTTTGGAGAAGTATAAAGCAAGGATGCATA
GCAGACTCCACCATGGAGGCTAAGTATGTAGTTGCTTGTGAAGCAGCTAAGGAAGCTGTTTGGCTAAGGAAGTTCTTGACTGATTTGGAAGTTGTTCCAAATATGAACTT
GCTAATTACGCTATACTTTGACAACAGTGGGGCTATAGCCAATTCAAAGGAACCTCGCAGCCACAAACGAGAGCACGAAATTTTCAAGCAAAGTAGGAGGATCTCTGGTT
TTTTCTGGTTTTTCGAGCATTCTGGGGTGTACAAGTTGAATCAGAAGCTCTATTTTGTAGAGATCGAGCTCCCGGTGCCTGATACACTGCCAACGTCTGCTGAAAGTTCC
AAATCAAGCTCCATTACGTTATTTTCGAAATCGAGTTATGTCAAGGCTGCACCTCGTGGGGGTCAACGAGTATCGTTAGAGGAGGACGACGTCCATTGGCTTCACTCCAT
CTTTTGGGCTAAGCTAGCAGGTGGTCCGGGAGGGGGAGTCATGCCAACACGTACCAGCAGACGACGCAGGCAGAATCAGGACGGGTCGCAGAATGCTACCCAAAGTCAAT
CTGAAAGGGGATCCAGTAACCCGAGAGGTCAGAACGAGGCCAGGAGTGAGCGATTTTCTAGATCTGCACAGGAGATATGTAGGCTAGAGAGAGCAGGGCCTAGTGATCCG
AAAAAGATGTATGGGATTGAACGGTTGAAGAAGTTAAGAGCCACAGTGTTTGAGGGTTCCATGGATCCAACTGACGTCGAGGTCTGGTTAAATATGTTGGAGAAATGCTT
CGACGTGATGAGTTGTCCTGAGGAGCGAAAAGTCAGATTAGCCACATTTCTTTTGCAGAAGGAGGCGGAGGGATGGTGGAAATCCATTTTAGCCAGGCGCAGTGATGCAC
TGGTCGAGTATGAGAGAAAGTATACCGAGCTTTCACGGTGTGTCGATGTGATTGTGGCATCTGAGAGTGATAGGTGTCGCAGGTTTGAGAGAGGGCTGCGTTTTGAGATA
CGTACGCCAGTTACCGCTATTGCCAATTGGACAAATTTTTCCCAGCTAGTAGAAACTGCCTTACGTGTGGAGCAGAGTATCATAGAGGAAAAGGCGGCATTAGAGCCTAG
TCGTGGAACTCCAACAACTAATGTAGCAAGACCGCGAACGGGTCAAGAGTCCGTTGCTAGTGAATCCAGGAGAACCCCATGTGTAAGTTGCGGCAAGAATAATCGGGGTC
AGTGTATTGTTGGCGCCGGTGCAGAAGAAGGCAGCAGTGGTGCAAGGCAAAAGGGAGTTGTGGACAAACCTAGGCAGAAGGGAAAAGTCTACGCCATGACTCAACAGGAA
GCAGAGGATGCACCAGACGTGATTACTGGTATGGTTTTGATTTGTAATGTACCTGCACATGTTTTATTAGATCCAGACGCTATGCATTCCTTTGTTTCTAGTATGTTCTT
AACTAAGCTAAATAGGATGCTAGAGTCTTTATCTGAGGAGTTAGTCATATACACTCCAGTTGGTGACGTTTTATTAGTTAATGAAGTGTTGCGTGATGGTGAGGTTTTAG
TAGAAGGTCTTTGTATGTTAGTGGATCTTCTTCCCCTAGAGTTGCAGGCGTTGGATGTGATTTTGGGAATGAATTTCTTATTCACTCACTATGTTTCTATGGATTGCCAT
AGGAAGGAAGTGACTTTTAGGAAACCATGTTTGACTGAAGTTGTTTTCAGGGGTGAGAGGAAGATTATTCCTACGAGTTGGATTTCAGCTCTGAAAGTTGAGATGTTGTT
AAGGAAGGGTTGCATAACGTTTCTTGCACACGTAGTAAAAGTGCAAGAAGAAAAACTGAAACCAAAAGATGTTACTGCGGTGAATGAATATCTTGATGTTTTTCCAACTG
ATCTATCGGGTTTGCCACCTGATAGAGAGGTGGAGTTCACTATTGAATTGTTCCCAGGAACAACACCTATTTCACAGGCACCGTACAGAATGGCTTCGAGCGAGCTTAAG
GAGCTGAAGGTGGAGCCGCAAGAACTAGTTGATAAGGGATACATCAGGCCTAGTGTATCACCTTGGGGAGCTCCAATGTTATTCGTGAAGAAGAAAGATGGTACCCTGAG
ATTATGCATTGATTACAGGGGAGCAGTAGTGTTCTCTAAGATTGATCTGAGGTCAGGATACCACCAGTTGAAGGTTGGGGAATCAGATATTCCTAAGACAGCATTCAGGA
TGAGGTATGGGCACTATGAGTTTTTAGTGATGCCATTTGGTTTAACGAATGCGCCAGACGTTTTCATGGACCTCATGAACTCTGAGGTCAGGATACCACCAGTTGAAGCG
GACGGAGTTACTGTTGATCTGCATAAAGTGGAAGCTGTTGTCAATTGGGAAAGACCAACTAGAGCAACAGAGGTATGTAGTTTCCTAGGCCTGGCCGGATACTACAGACG
TTTTGTTGAGGATTTATCACGATTATTATTACCCTTGACAGCTTTGACAAGGAAGAATGCTAAGTTTGAGTGGTCGGATAAATGCGAACAGAGTTTCCAGGAACTGAAGA
AGAGATTAGTGACAGCGCCTATTCTGACACTTCTTGTAACAGGGAAGGAGTATGTGATCTATTGTGACGCTTCGAGGCAATGA
Protein sequence
MRRIPYTSAVGSLMYVMLCTRPNVCYVVGIASRYQSNPGLDHWTIVKGIIKYLRRTRDYMLVFRAEELVLTVYTDYDFQSDKDSRKSTLGSVFTLNGGVVVWRSIKQGCI
ADSTMEAKYVVACEAAKEAVWLRKFLTDLEVVPNMNLLITLYFDNSGAIANSKEPRSHKREHEIFKQSRRISGFFWFFEHSGVYKLNQKLYFVEIELPVPDTLPTSAESS
KSSSITLFSKSSYVKAAPRGGQRVSLEEDDVHWLHSIFWAKLAGGPGGGVMPTRTSRRRRQNQDGSQNATQSQSERGSSNPRGQNEARSERFSRSAQEICRLERAGPSDP
KKMYGIERLKKLRATVFEGSMDPTDVEVWLNMLEKCFDVMSCPEERKVRLATFLLQKEAEGWWKSILARRSDALVEYERKYTELSRCVDVIVASESDRCRRFERGLRFEI
RTPVTAIANWTNFSQLVETALRVEQSIIEEKAALEPSRGTPTTNVARPRTGQESVASESRRTPCVSCGKNNRGQCIVGAGAEEGSSGARQKGVVDKPRQKGKVYAMTQQE
AEDAPDVITGMVLICNVPAHVLLDPDAMHSFVSSMFLTKLNRMLESLSEELVIYTPVGDVLLVNEVLRDGEVLVEGLCMLVDLLPLELQALDVILGMNFLFTHYVSMDCH
RKEVTFRKPCLTEVVFRGERKIIPTSWISALKVEMLLRKGCITFLAHVVKVQEEKLKPKDVTAVNEYLDVFPTDLSGLPPDREVEFTIELFPGTTPISQAPYRMASSELK
ELKVEPQELVDKGYIRPSVSPWGAPMLFVKKKDGTLRLCIDYRGAVVFSKIDLRSGYHQLKVGESDIPKTAFRMRYGHYEFLVMPFGLTNAPDVFMDLMNSEVRIPPVEA
DGVTVDLHKVEAVVNWERPTRATEVCSFLGLAGYYRRFVEDLSRLLLPLTALTRKNAKFEWSDKCEQSFQELKKRLVTAPILTLLVTGKEYVIYCDASRQ
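
The protein sequence should be the conceptual translation of the CDS. A minimal sketch for checking this is given below, assuming the standard genetic code (NCBI translation table 1) and a CDS that starts in frame; the `translate` helper is illustrative, and a library such as Biopython could be used instead.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, dropping the terminal stop codon if present."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks inside the sequence
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.rstrip("*")


# First and last codons of the CDS listed above.
print(translate("ATGAGACGGATTCCCTAC"))  # MRRIPY
print(translate("GCTTCGAGGCAATGA"))     # ASRQ
```

Both fragments match the start (`MRRIPY`) and end (`ASRQ`) of the protein sequence shown above. Passing the full CDS, with its line breaks, to `translate` would allow the same comparison over the whole sequence.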