CuGenDBv2

Tan0012747 (gene) of Snake gourd v1 genome

Gene ID: Tan0012747
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Reverse transcriptase
Genome location: LG02:39690238..39691487
RNA-Seq Expression: Tan0012747
Synteny: Tan0012747
Gene Ontology terms:
GO:0090304 - nucleic acid metabolic process (biological process)
GO:0016740 - transferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
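
The genome location in this record follows a `scaffold:start..end` pattern. The snippet below is a minimal sketch of splitting such a string and computing the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state, and the function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int, int]:
    """Split 'scaffold:start..end' into (scaffold, start, end, span length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    scaffold = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate is smaller than start coordinate")
    return scaffold, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("LG02:39690238..39691487"))
```

Under the inclusive-coordinate assumption, the location listed for Tan0012747 spans 1250 positions on LG02.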


Homology
GenBank top hits (e value, % identity, alignment)
XP_016646912.1 PREDICTED: uncharacterized protein LOC103318979 [Prunus mume]; e value: 8.4e-99; % identity: 50.64
Query:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAE----
        ML+RLAG  YYCFLDGYSGYNQI I P+DQ+K TF CP+GTFA+RRMPFGLCNAPATFQRCM+ IFSDM++R +E+FMDDFSV G ++++ L N      
Subjt:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAE----

Query:  QVLREKI--------------------CAGFF---------IDCQAIVYLVRERKTILVCEIAVTFEKLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVG
        +V R KI                     AGF+         I       L+++ +     +    F  LK+ L+T PV++ P+WE  FE+MC+ASD+A+G
Subjt:  QVLREKI--------------------CAGFF---------IDCQAIVYLVRERKTILVCEIAVTFEKLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVG

Query:  AIL-------------------------AKGEKEF---VFAFDKFRSYLMGSKVVVHTDHSALKYLLTKKDAKPRLIRWILLLQEFDLEIQDRKGSENQV
        A+L                         A  EKE    VFA DKFRSYL+G+KV+V+TDH+ALK+LL KK+AKPRLIRW+LLLQEFD+EI+D+KGSEN V
Subjt:  AIL-------------------------AKGEKEF---VFAFDKFRSYLMGSKVVVHTDHSALKYLLTKKDAKPRLIRWILLLQEFDLEIQDRKGSENQV

Query:  AYHLSRIQGDDQDRE----IKEIFADEQVIKVDIHRD---PWYADFVNYLACGISPPDASSHQRKKIFHDVKHYFWDEPYLYKLGPDQI
        A HLSR+  +D+  E    I E F DEQ+  ++  ++   PWYADFVNYLACGI PP  S +Q+KK    +KHY+WD+PYL+K GPDQI
Subjt:  AYHLSRIQGDDQDRE----IKEIFADEQVIKVDIHRD---PWYADFVNYLACGISPPDASSHQRKKIFHDVKHYFWDEPYLYKLGPDQI

XP_020426529.1 uncharacterized protein LOC109950808 [Prunus persica]; e value: 2.3e-96; % identity: 50.39
Query:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAE----
        ML+RLAG  YYCFLDGYSGYNQI I P+DQ+K TF CP+GTFA+RRMPFGLCNAPATFQRCMM IFSDM++R +E+FMDDFSV G ++++ L N      
Subjt:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAE----

Query:  QVLREKI--------------------CAGFF---------IDCQAIVYLVRERKTILVCEIAVTFEKLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVG
        +V R KI                     AGF+         I       L+++ +     +    F  LK  L+T PV++ P+WE  FE+MC+ASD+A+G
Subjt:  QVLREKI--------------------CAGFF---------IDCQAIVYLVRERKTILVCEIAVTFEKLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVG

Query:  AIL-------------------------AKGEKEF---VFAFDKFRSYLMGSKVVVHTDHSALKYLLTKKDAKPRLIRWILLLQEFDLEIQDRKGSENQV
        A+L                         A  EKE    VFA DKFRSYL+G KV+V+TDH+ALK+LL KK+AK RLIRW+LLLQEFD+EI+D+KGSEN V
Subjt:  AIL-------------------------AKGEKEF---VFAFDKFRSYLMGSKVVVHTDHSALKYLLTKKDAKPRLIRWILLLQEFDLEIQDRKGSENQV

Query:  AYHLSRIQGDDQDRE----IKEIFADEQVIKVDIHR---DPWYADFVNYLACGISPPDASSHQRKKIFHDVKHYFWDEPYLYKLGPDQI
        A HLSR+  +D+  E    I E F DEQ+  +   +    PWYA+FVNYLACGI PP+ S +Q+KK    VKHY+WD+PYL+K GPDQ+
Subjt:  AYHLSRIQGDDQDRE----IKEIFADEQVIKVDIHR---DPWYADFVNYLACGISPPDASSHQRKKIFHDVKHYFWDEPYLYKLGPDQI

XP_022151603.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515 [Momordica charantia]; e value: 1.2e-97; % identity: 51.17
Query:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAE----
        MLDRLAGK++YCFLDGYSG N+ITI P+DQ+K TF CPYGTFAFRRMPFGLCNAPATFQRCMM IFSDM++  LE+FMDDFSV G +++A L N E    
Subjt:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAE----

Query:  QVLREKI-------------------CAGFF---------IDCQAIVYLVRERKTILVCEIAVTFEKLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVGA
        +V   KI                     GF+         I       L  +R  +   +   +FE LK  LS+ P+++EPNW+  FE+MC+ASD+A+GA
Subjt:  QVLREKI-------------------CAGFF---------IDCQAIVYLVRERKTILVCEIAVTFEKLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVGA

Query:  ILAK-------------------------GEKE---FVFAFDKFRSYLMGSKVVVHTDHSALKYLLTKKDAKPRLIRWILLLQEFDLEIQDRKGSENQVA
        +L +                          EKE    VF FDKFR YL+G+KV+V TDHSALKYL  KKDAKPRLIRWILLLQEFD+E++DRKG+ENQVA
Subjt:  ILAK-------------------------GEKE---FVFAFDKFRSYLMGSKVVVHTDHSALKYLLTKKDAKPRLIRWILLLQEFDLEIQDRKGSENQVA

Query:  YHLSRIQGD--DQDREIKEIFADEQVIKVDIHRDPWYADFVNYLACGISPPDASSHQRKKIFHDVKHYFWDEPYLYKLGPDQI
         HLSR++    D    I+E F DEQ+++VD    PWYAD  NYL   + P + +  Q KK+ ++V+ Y WDEP+LYKLGPD I
Subjt:  YHLSRIQGD--DQDREIKEIFADEQVIKVDIHRDPWYADFVNYLACGISPPDASSHQRKKIFHDVKHYFWDEPYLYKLGPDQI

XP_022864608.1 uncharacterized protein LOC111384552 [Olea europaea var. sylvestris]; e value: 1.3e-96; % identity: 47.72
Query:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAEQVLR
        ML+RLAG  +YCFLDGYSGYNQI + P+DQ+K TF CPYGTFA+ RMPFGLCNAPATFQRCMM IFSDMI+R +++FMDDFSV G +++  L +   VL+
Subjt:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAEQVLR

Query:  EKICAGFFIDCQAIVYLVRE-----------------------------------------------------RKTILVCEIAV-------------TFE
                ++ +   ++V+E                                                     + T  +C + +              F 
Subjt:  EKICAGFFIDCQAIVYLVRE-----------------------------------------------------RKTILVCEIAV-------------TFE

Query:  KLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVGAIL-------------------------AKGEKEF---VFAFDKFRSYLMGSKVVVHTDHSALKYLL
         LKE L+T PV+V P+WE  FE+MC+ASD AVGA+L                         A  EKE    VFAFDKFRSYL+GSK++V+TDH+ALKYLL
Subjt:  KLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVGAIL-------------------------AKGEKEF---VFAFDKFRSYLMGSKVVVHTDHSALKYLL

Query:  TKKDAKPRLIRWILLLQEFDLEIQDRKGSENQVAYHLSRIQ-GDDQDR-EIKEIFADEQVIKVDIHRDPWYADFVNYLACGISPPDASSHQRKKIFHDVK
        TKKDAKPRLIRW+LLLQEFD+EI+D+KG+EN VA HLSR++ G+ +D  +I EIF DEQ+++VD    PWYAD VNYLA  I PPD SSHQRKK F ++K
Subjt:  TKKDAKPRLIRWILLLQEFDLEIQDRKGSENQVAYHLSRIQ-GDDQDR-EIKEIFADEQVIKVDIHRDPWYADFVNYLACGISPPDASSHQRKKIFHDVK

Query:  HYFWDEPYLYKLGPDQI
        +YF+++P LY+ G DQI
Subjt:  HYFWDEPYLYKLGPDQI

XP_038982209.1 uncharacterized protein LOC120110689 [Phoenix dactylifera]; e value: 1.7e-96; % identity: 50.91
Query:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAEQVLR
        +L+RLAG  YYCFLDGYSGYNQI+I P+DQ+K TF CPYGTFAFRRMPFGLCNAPATFQRCMM IFSD +++ +E+FMDDFSV G ++++ L N  +VL+
Subjt:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAEQVLR

Query:  EKICAGFFIDCQAIVYL--------VRERKTIL-------------------VCEI-------------AVTFEKLKEVLSTTPVMVEPNWEKSFEVMCE
             G  +D   I  +        V+  ++ L                   +C +                F +LK+ L + P+M  P+W   FE+MC+
Subjt:  EKICAGFFIDCQAIVYL--------VRERKTIL-------------------VCEI-------------AVTFEKLKEVLSTTPVMVEPNWEKSFEVMCE

Query:  ASDFAVGAIL-------------------------AKGEKEF---VFAFDKFRSYLMGSKVVVHTDHSALKYLLTKKDAKPRLIRWILLLQEFDLEIQDR
        ASDFA+GA+L                         A  EKE    VFAFDKFRSYL+GSKV+V+TDHSA+KYLL KKDAKPRLIRW+LLLQEFDLEI+D+
Subjt:  ASDFAVGAIL-------------------------AKGEKEF---VFAFDKFRSYLMGSKVVVHTDHSALKYLLTKKDAKPRLIRWILLLQEFDLEIQDR

Query:  KGSENQVAYHLSRIQGDDQDRE--IKEIFADEQVIKVDIHRDPWYADFVNYLACGISPPDASSHQRKKIFHDVKHYFWDEPYLYK
        +G EN VA HLSR++G  +  E  I E F+DEQ++ V +   PWYA+FVNYL  GI PPD S HQ+KK   DVKHYFW EP LYK
Subjt:  KGSENQVAYHLSRIQGDDQDRE--IKEIFADEQVIKVDIHRDPWYADFVNYLACGISPPDASSHQRKKIFHDVKHYFWDEPYLYK

TrEMBL top hits (e value, % identity, alignment)
A0A4Y1RS99 Transposable element protein; e value: 2.5e-96; % identity: 47.39
Query:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAEQVL-
        ML+RLAG  YYCFLDGYSGYNQI I P+DQ+K TF CP+GTFA+RRMPFGLCNAPATFQRCMM IFSDM++R +E+FMDDFSV G ++++ L N   VL 
Subjt:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAEQVL-

Query:  ------------------REKICAGFFIDCQAIVYLVRERKTI----------------------------------LVCEIAV-------------TFE
                          +E I  G  I  + I     + +TI                                   +C++ +              F 
Subjt:  ------------------REKICAGFFIDCQAIVYLVRERKTI----------------------------------LVCEIAV-------------TFE

Query:  KLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVGAIL-------------------------AKGEKEF---VFAFDKFRSYLMGSKVVVHTDHSALKYLL
         LK  L+T PV++ P+WE  FE+MC+ASD+A+GA+L                         A  EKE    VFA DKFRSYL+G+KV+V+TDH+ALK+LL
Subjt:  KLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVGAIL-------------------------AKGEKEF---VFAFDKFRSYLMGSKVVVHTDHSALKYLL

Query:  TKKDAKPRLIRWILLLQEFDLEIQDRKGSENQVAYHLSRIQGDDQDRE----IKEIFADEQVIKVDIHRD---PWYADFVNYLACGISPPDASSHQRKKI
         KK+AKPRLIRW+LLLQEFD+EI+D+KGSEN VA HLSR+  +D+  E    I E F DEQ+  +   ++   PWYADFVNYLACGI PPD S +Q+KK 
Subjt:  TKKDAKPRLIRWILLLQEFDLEIQDRKGSENQVAYHLSRIQGDDQDRE----IKEIFADEQVIKVDIHRD---PWYADFVNYLACGISPPDASSHQRKKI

Query:  FHDVKHYFWDEPYLYKLGPDQI
           VKHY+WD+PYL+K GPDQ+
Subjt:  FHDVKHYFWDEPYLYKLGPDQI

A0A6J1DCL7 LOW QUALITY PROTEIN: uncharacterized protein LOC111019515; e value: 5.9e-98; % identity: 51.17
Query:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAE----
        MLDRLAGK++YCFLDGYSG N+ITI P+DQ+K TF CPYGTFAFRRMPFGLCNAPATFQRCMM IFSDM++  LE+FMDDFSV G +++A L N E    
Subjt:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAE----

Query:  QVLREKI-------------------CAGFF---------IDCQAIVYLVRERKTILVCEIAVTFEKLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVGA
        +V   KI                     GF+         I       L  +R  +   +   +FE LK  LS+ P+++EPNW+  FE+MC+ASD+A+GA
Subjt:  QVLREKI-------------------CAGFF---------IDCQAIVYLVRERKTILVCEIAVTFEKLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVGA

Query:  ILAK-------------------------GEKE---FVFAFDKFRSYLMGSKVVVHTDHSALKYLLTKKDAKPRLIRWILLLQEFDLEIQDRKGSENQVA
        +L +                          EKE    VF FDKFR YL+G+KV+V TDHSALKYL  KKDAKPRLIRWILLLQEFD+E++DRKG+ENQVA
Subjt:  ILAK-------------------------GEKE---FVFAFDKFRSYLMGSKVVVHTDHSALKYLLTKKDAKPRLIRWILLLQEFDLEIQDRKGSENQVA

Query:  YHLSRIQGD--DQDREIKEIFADEQVIKVDIHRDPWYADFVNYLACGISPPDASSHQRKKIFHDVKHYFWDEPYLYKLGPDQI
         HLSR++    D    I+E F DEQ+++VD    PWYAD  NYL   + P + +  Q KK+ ++V+ Y WDEP+LYKLGPD I
Subjt:  YHLSRIQGD--DQDREIKEIFADEQVIKVDIHRDPWYADFVNYLACGISPPDASSHQRKKIFHDVKHYFWDEPYLYKLGPDQI

A0A6P8CBX2 Reverse transcriptase; e value: 2.5e-96; % identity: 47.95
Query:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAEQVLR
        ML++LAG +YYCFLDGYSGYNQI I P+DQ+K TF CPYGTFAFRRMPFGLCNAPATFQRCMM IFSDM++  +EIFMDDFSV G+++E+ L N   VL+
Subjt:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAEQVLR

Query:  EKICAGFFIDCQAIVYLVRE---------RKTILVCEIAV---------------------------------------------------------TFE
                ++ +   ++VRE         +K I V    V                                                          F 
Subjt:  EKICAGFFIDCQAIVYLVRE---------RKTILVCEIAV---------------------------------------------------------TFE

Query:  KLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVGAIL-------------------------AKGEKEF---VFAFDKFRSYLMGSKVVVHTDHSALKYLL
         LKE L++ PV+V PNWE  FE+MC+ASD+AVGA+L                         A  EKE    +FA DKFR YL+GSK++V+TDH+ALKYL 
Subjt:  KLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVGAIL-------------------------AKGEKEF---VFAFDKFRSYLMGSKVVVHTDHSALKYLL

Query:  TKKDAKPRLIRWILLLQEFDLEIQDRKGSENQVAYHLSRIQGDDQDREIKEIFADEQVIKVDIHRDPWYADFVNYLACGISPPDASSHQRKKIFHDVKHY
         K DAKPRLIRWILLLQEFDLEI+D KG+EN VA HLSR++ D  D  I E F DEQ+   +I   PWYAD VNY+   I+P   SS Q+KK  HDVK+Y
Subjt:  TKKDAKPRLIRWILLLQEFDLEIQDRKGSENQVAYHLSRIQGDDQDREIKEIFADEQVIKVDIHRDPWYADFVNYLACGISPPDASSHQRKKIFHDVKHY

Query:  FWDEPYLYKLGPDQI
        FWDEPYL+K   DQ+
Subjt:  FWDEPYLYKLGPDQI

A0A6P8CDK6 Reverse transcriptase; e value: 1.4e-96; % identity: 48.19
Query:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAEQVLR
        ML++LAG +YYCFLDGYSGYNQI I P+DQ+K TF CPYGTFAFRRMPFGLCNAPATFQRCMM IFSDMI+  +EIFMDDFSV G+++E+ L N   VL+
Subjt:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAEQVLR

Query:  EKICAGFFIDCQAIVYLVRE---------RKTILVCEIAV---------------------------------------------------------TFE
                ++ +   ++VRE         +K I V    V                                                          F 
Subjt:  EKICAGFFIDCQAIVYLVRE---------RKTILVCEIAV---------------------------------------------------------TFE

Query:  KLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVGAIL-------------------------AKGEKEF---VFAFDKFRSYLMGSKVVVHTDHSALKYLL
         LKE L++ PV+V PNWE  FE+MC+ASD+AVGA+L                         A  EKE    +FA DKFR YL+GSK++V+TDH+ALKYL 
Subjt:  KLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVGAIL-------------------------AKGEKEF---VFAFDKFRSYLMGSKVVVHTDHSALKYLL

Query:  TKKDAKPRLIRWILLLQEFDLEIQDRKGSENQVAYHLSRIQGDDQDREIKEIFADEQVIKVDIHRDPWYADFVNYLACGISPPDASSHQRKKIFHDVKHY
         K DAKPRLIRWILLLQEFDLEI+D KG+EN VA HLSR++ D  D  I E F DEQ+   +I   PWYAD VNY+   I+P   SS Q+KK  HDVK+Y
Subjt:  TKKDAKPRLIRWILLLQEFDLEIQDRKGSENQVAYHLSRIQGDDQDREIKEIFADEQVIKVDIHRDPWYADFVNYLACGISPPDASSHQRKKIFHDVKHY

Query:  FWDEPYLYKLGPDQI
        FWDEPYL+K   DQ+
Subjt:  FWDEPYLYKLGPDQI

A0A6P8CP09 uncharacterized protein LOC116192868; e value: 2.5e-96; % identity: 47.95
Query:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAEQVLR
        ML++LAG +YYCFLDGYSGYNQI I P+DQ+K TF CPYGTFAFRRMPFGLCNAPATFQRCMM IFSDM++  +EIFMDDFSV G+++E+ L N   VL+
Subjt:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAEQVLR

Query:  EKICAGFFIDCQAIVYLVRE---------RKTILVCEIAV---------------------------------------------------------TFE
                ++ +   ++VRE         +K I V    V                                                          F 
Subjt:  EKICAGFFIDCQAIVYLVRE---------RKTILVCEIAV---------------------------------------------------------TFE

Query:  KLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVGAIL-------------------------AKGEKEF---VFAFDKFRSYLMGSKVVVHTDHSALKYLL
         LKE L++ PV+V PNWE  FE+MC+ASD+AVGA+L                         A  EKE    +FA DKFR YL+GSK++V+TDH+ALKYL 
Subjt:  KLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVGAIL-------------------------AKGEKEF---VFAFDKFRSYLMGSKVVVHTDHSALKYLL

Query:  TKKDAKPRLIRWILLLQEFDLEIQDRKGSENQVAYHLSRIQGDDQDREIKEIFADEQVIKVDIHRDPWYADFVNYLACGISPPDASSHQRKKIFHDVKHY
         K DAKPRLIRWILLLQEFDLEI+D KG+EN VA HLSR++ D  D  I E F DEQ+   +I   PWYAD VNY+   I+P   SS Q+KK  HDVK+Y
Subjt:  TKKDAKPRLIRWILLLQEFDLEIQDRKGSENQVAYHLSRIQGDDQDREIKEIFADEQVIKVDIHRDPWYADFVNYLACGISPPDASSHQRKKIFHDVKHY

Query:  FWDEPYLYKLGPDQI
        FWDEPYL+K   DQ+
Subjt:  FWDEPYLYKLGPDQI

SwissProt top hits (e value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6; e value: 7.5e-26; % identity: 26.27
Query:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAEQVLR
        +L +L    Y+  +D   G++QI +DP+   K  F   +G + + RMPFGL NAPATFQRCM  I   ++ +   +++DD  V   + +  L++   V  
Subjt:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAEQVLR

Query:  EKICAGFFIDCQAIVYLVRE-------------------------------------------------------RKTILVC------------EIAVTF
        +   A   +      +L +E                                                        K +  C            E    F
Subjt:  EKICAGFFIDCQAIVYLVRE-------------------------------------------------------RKTILVC------------EIAVTF

Query:  EKLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVGAILAKG---------------------EKE---FVFAFDKFRSYLMGSKVVVHTDHSALKYLLTKK
        +KLK ++S  P++  P++ K F +  +ASD A+GA+L++                      EKE    V+A   FR YL+G    + +DH  L +L   K
Subjt:  EKLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVGAILAKG---------------------EKE---FVFAFDKFRSYLMGSKVVVHTDHSALKYLLTKK

Query:  DAKPRLIRWILLLQEFDLEIQDRKGSENQVAYHLSRIQGDDQDREIKEIFADEQ
        D   +L RW + L EFD +I+  KG EN VA  LSRI       +++E +  EQ
Subjt:  DAKPRLIRWILLLQEFDLEIQDRKGSENQVAYHLSRIQGDDQDREIKEIFADEQ

P10394 Retrovirus-related Pol polyprotein from transposon 412; e value: 7.1e-16; % identity: 25.28
Query:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAEQV--
        +LD+L   +Y+  LD  SG++QI +D   +D  +F    G++ F R+PFGL  AP +FQR M   FS +      ++MDD  V G + +  L+N  +V  
Subjt:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAEQV--

Query:  --------LREKICAGFFID-----------------------------------------CQAIVYLVR-----ERKTILVCEIAVTFE----------
                L  + C+ F  +                                         C      ++      R    +C+  V FE          
Subjt:  --------LREKICAGFFID-----------------------------------------CQAIVYLVR-----ERKTILVCEIAVTFE----------

Query:  KLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVGAIL------------------AKGEKE----------FVFAFDKFRSYLMGSKVVVHTDHSALKYLL
         LK  L    ++  P++ K F +  +AS  A GA+L                   KGE              +A   FR Y+ G    V TDH  L YL 
Subjt:  KLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVGAIL------------------AKGEKE----------FVFAFDKFRSYLMGSKVVVHTDHSALKYLL

Query:  TKKDAKPRLIRWILLLQEFDLEIQDRKGSENQVAYHLSRIQGDDQDREIKEI
        +  +   +L R  L L+E++  ++  KG +N VA  LSRI      +E+K+I
Subjt:  TKKDAKPRLIRWILLLQEFDLEIQDRKGSENQVAYHLSRIQGDDQDREIKEI

P20825 Retrovirus-related Pol polyprotein from transposon 297; e value: 4.6e-23; % identity: 25.15
Query:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAEQVLR
        +L +L   +Y+  +D   G++QI +D +   K  F    G + + RMPFGL NAPATFQRCM  I   ++ +   +++DD  +   +    L + + V  
Subjt:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAEQVLR

Query:  EKICAGFFIDCQAIVYLVRE-------------------------------------------------------RKTILVC------------EIAVTF
        +   A   +      +L +E                                                        K +  C            E    F
Subjt:  EKICAGFFIDCQAIVYLVRE-------------------------------------------------------RKTILVC------------EIAVTF

Query:  EKLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVGAILAKG---------------------EKE---FVFAFDKFRSYLMGSKVVVHTDHSALKYLLTKK
        EKLK ++   P++  P++EK F +  +AS+ A+GA+L++                      EKE    V+A   FR YL+G + ++ +DH  L++L   K
Subjt:  EKLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVGAILAKG---------------------EKE---FVFAFDKFRSYLMGSKVVVHTDHSALKYLLTKK

Query:  DAKPRLIRWILLLQEFDLEIQDRKGSENQVAYHLSRIQ
        +   +L RW + L E+  +I   KG EN VA  LSRI+
Subjt:  DAKPRLIRWILLLQEFDLEIQDRKGSENQVAYHLSRIQ

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus; e value: 1.3e-17; % identity: 24.79
Query:  LDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAEQVLRE
        L  L   +Y+  LD  SG++QI +   D  K  F    G + F R+PFGL NAPA FQR +  I  + I +   +++DD  V  E Y+   +N   VL  
Subjt:  LDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAEQVLRE

Query:  KICAGF--------FIDCQA--IVYL------------------------VRERKTIL------------------------------------------
           A          F+D Q   + Y+                        V+E K  L                                          
Subjt:  KICAGF--------FIDCQA--IVYL------------------------VRERKTIL------------------------------------------

Query:  VCEIAV-TFEKLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVGAILAKG-------------------------EKE---FVFAFDKFRSYLMGSKVV-V
        + E A+ +F  LK +L ++ ++  P + K F +  +AS++A+GA+L++                          EKE    +++ D  R+YL G+  + V
Subjt:  VCEIAV-TFEKLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVGAILAKG-------------------------EKE---FVFAFDKFRSYLMGSKVV-V

Query:  HTDHSALKYLLTKKDAKPRLIRWILLLQEFDLEIQDRKGSENQVAYHLSRI
        +TDH  L + L  ++   +L RW   ++E++ E+  + G  N VA  LSRI
Subjt:  HTDHSALKYLLTKKDAKPRLIRWILLLQEFDLEIQDRKGSENQVAYHLSRI

Q99315 Transposon Ty3-G Gag-Pol polyprotein; e value: 3.1e-19; % identity: 24.78
Query:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSV----------HGETYEA
        +L R+   + +  LD +SGY+QI ++P+D+ K  F+ P G + +  MPFGL NAP+TF R M   F D+  R + +++DD  +          H +T   
Subjt:  MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSV----------HGETYEA

Query:  GLRNAEQVLREKIC---------AGFFIDCQAIVYL------VRERKT--------------------------------ILVCEIA-------VTFEKL
         L+N   ++++K C          G+ I  Q I  L      +R+  T                                + +C+ +          +KL
Subjt:  GLRNAEQVLREKIC---------AGFFIDCQAIVYL------VRERKT--------------------------------ILVCEIA-------VTFEKL

Query:  KEVLSTTPVMVEPNWEKSFEVMCEASDFAVGAILAK---------------------------GEKE---FVFAFDKFRSYLMGSKVVVHTDHSALKYLL
        K+ L  +PV+V  N + ++ +  +AS   +GA+L +                           GE E    + A   FR  L G    + TDH +L  L 
Subjt:  KEVLSTTPVMVEPNWEKSFEVMCEASDFAVGAILAK---------------------------GEKE---FVFAFDKFRSYLMGSKVVVHTDHSALKYLL

Query:  TKKDAKPRLIRWILLLQEFDLEIQDRKGSENQVAYHLSR
         K +   R+ RW+  L  +D  ++   G +N VA  +SR
Subjt:  TKKDAKPRLIRWILLLQEFDLEIQDRKGSENQVAYHLSR

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCTTGACCGTTTAGCAGGGAAGGAATATTATTGTTTCCTCGATGGATATTCAGGGTACAACCAGATTACCATCGACCCTCAAGACCAGGACAAGATAACTTTTATATG
TCCCTATGGAACATTTGCCTTCCGGCGGATGCCATTTGGGTTATGCAACGCCCCTGCTACATTTCAAAGATGCATGATGCCAATCTTCTCAGATATGATCAAGAGGACAT
TGGAGATCTTTATGGATGACTTCTCAGTACACGGAGAGACTTACGAAGCAGGTCTAAGGAATGCGGAGCAGGTGTTGCGTGAGAAGATTTGTGCAGGGTTTTTCATAGAT
TGCCAAGCCATTGTGTACCTTGTTAGAGAAAGAAAAACCATTCTTGTTTGTGAAATTGCTGTGACATTTGAGAAACTAAAAGAAGTGTTGAGCACGACTCCTGTCATGGT
GGAACCCAACTGGGAGAAATCATTTGAGGTAATGTGTGAGGCCAGTGACTTCGCGGTGGGAGCCATCCTTGCCAAAGGCGAGAAAGAATTTGTGTTTGCCTTCGATAAAT
TTAGATCGTATCTCATGGGCAGCAAAGTGGTGGTACACACAGATCATTCTGCACTGAAATATCTCCTCACCAAAAAGGATGCAAAACCCAGACTGATCCGATGGATATTA
CTACTTCAAGAGTTTGATTTAGAGATCCAAGACCGGAAGGGAAGCGAGAATCAGGTGGCATATCATTTATCACGAATTCAAGGAGATGATCAAGATAGGGAAATTAAAGA
AATATTTGCAGATGAACAAGTTATCAAGGTTGACATACATAGAGACCCATGGTACGCAGATTTTGTCAATTACTTGGCATGTGGAATCAGCCCACCAGACGCATCGTCGC
ATCAACGAAAGAAGATTTTCCATGACGTCAAGCATTATTTTTGGGATGAACCATATCTTTATAAGCTGGGGCCAGATCAGATCTGGAGTTGA
mRNA sequence
ATGCTTGACCGTTTAGCAGGGAAGGAATATTATTGTTTCCTCGATGGATATTCAGGGTACAACCAGATTACCATCGACCCTCAAGACCAGGACAAGATAACTTTTATATG
TCCCTATGGAACATTTGCCTTCCGGCGGATGCCATTTGGGTTATGCAACGCCCCTGCTACATTTCAAAGATGCATGATGCCAATCTTCTCAGATATGATCAAGAGGACAT
TGGAGATCTTTATGGATGACTTCTCAGTACACGGAGAGACTTACGAAGCAGGTCTAAGGAATGCGGAGCAGGTGTTGCGTGAGAAGATTTGTGCAGGGTTTTTCATAGAT
TGCCAAGCCATTGTGTACCTTGTTAGAGAAAGAAAAACCATTCTTGTTTGTGAAATTGCTGTGACATTTGAGAAACTAAAAGAAGTGTTGAGCACGACTCCTGTCATGGT
GGAACCCAACTGGGAGAAATCATTTGAGGTAATGTGTGAGGCCAGTGACTTCGCGGTGGGAGCCATCCTTGCCAAAGGCGAGAAAGAATTTGTGTTTGCCTTCGATAAAT
TTAGATCGTATCTCATGGGCAGCAAAGTGGTGGTACACACAGATCATTCTGCACTGAAATATCTCCTCACCAAAAAGGATGCAAAACCCAGACTGATCCGATGGATATTA
CTACTTCAAGAGTTTGATTTAGAGATCCAAGACCGGAAGGGAAGCGAGAATCAGGTGGCATATCATTTATCACGAATTCAAGGAGATGATCAAGATAGGGAAATTAAAGA
AATATTTGCAGATGAACAAGTTATCAAGGTTGACATACATAGAGACCCATGGTACGCAGATTTTGTCAATTACTTGGCATGTGGAATCAGCCCACCAGACGCATCGTCGC
ATCAACGAAAGAAGATTTTCCATGACGTCAAGCATTATTTTTGGGATGAACCATATCTTTATAAGCTGGGGCCAGATCAGATCTGGAGTTGA
Protein sequence
MLDRLAGKEYYCFLDGYSGYNQITIDPQDQDKITFICPYGTFAFRRMPFGLCNAPATFQRCMMPIFSDMIKRTLEIFMDDFSVHGETYEAGLRNAEQVLREKICAGFFID
CQAIVYLVRERKTILVCEIAVTFEKLKEVLSTTPVMVEPNWEKSFEVMCEASDFAVGAILAKGEKEFVFAFDKFRSYLMGSKVVVHTDHSALKYLLTKKDAKPRLIRWIL
LLQEFDLEIQDRKGSENQVAYHLSRIQGDDQDREIKEIFADEQVIKVDIHRDPWYADFVNYLACGISPPDASSHQRKKIFHDVKHYFWDEPYLYKLGPDQIWS
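
The CDS and protein sequences listed for this gene can be cross-checked by translating the CDS. The snippet below is a minimal sketch that assumes the standard nuclear genetic code applies (an assumption, since the record does not name a translation table). The names `CODON_TABLE` and `translate` are hypothetical and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
# Assumption: the standard nuclear code is the right table for this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence pasted
    as wrapped lines can be passed in directly. Codons containing
    characters other than A, C, G, T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nucleotides of the CDS sequence listed above.
    print(translate("ATGCTTGACCGTTTAGCAGGGAAGGAATAT"))
```

Translating the first 30 nucleotides of the CDS gives MLDRLAGKEY, which matches the start of the listed protein sequence. Passing the full CDS in place of the fragment extends the same check to the whole record.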