CuGenDBv2

Pay0006895 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0006895
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Reverse transcriptase
Genome location: chr01:10372720..10373696
RNA-Seq Expression: Pay0006895
Synteny: Pay0006895
Gene Ontology terms: GO:0005488 - binding (molecular function)
InterPro domains: IPR041373 - Reverse transcriptase, RNase H-like domain
                  IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
                  IPR043502 - DNA/RNA polymerase superfamily
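
The genome location above can be turned into a span length with a few lines of Python. This is a minimal sketch: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Split a 'chrom:start..end' string and compute the span length.

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    in the record), so length = end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    return chrom, start, end, end - start + 1

chrom, start, end, span = parse_location("chr01:10372720..10373696")
print(chrom, span)  # chr01 977
```

Under that assumption the gene spans 977 bp, which is longer than the 759-nt CDS listed for this gene.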


Homology
GenBank top hits (e value, % identity, alignment)
MCH92231.1 hypothetical protein [Trifolium medium] (e value: 3.0e-75; % identity: 55.35)
Query:  MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSFD----------------------------------------AYVRSFLGSVGFYRRFI
        M FGLCNA GTFQRCM+SIFS FIE C++VFMDDFTVYG SF+                                          +RSFLG  GFYRRFI
Subjt:  MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSFD----------------------------------------AYVRSFLGSVGFYRRFI

Query:  KDFSKIALSLTNLLQKDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEIIC----------LEQMIDKKLHAIDYAYRTLNQAQSNYTTTEKEFI
        KDFSKIAL L+NLLQKD+ F  DD CK+AFD LK+ L ST ++Q P+W  PFEI+C          L Q +DK  H I YA RTL+ AQSNYTTTEKE +
Subjt:  KDFSKIALSLTNLLQKDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEIIC----------LEQMIDKKLHAIDYAYRTLNQAQSNYTTTEKEFI

Query:  AIVFAFDKFKSYIIGSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRI
        AIVFA DKF+SY++GS ++++TDH  +KYL+ K ++KPRL+RW+LLLQEF+L IKD+ GA N VADHLSRI
Subjt:  AIVFAFDKFKSYIIGSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRI

XP_017233173.1 PREDICTED: uncharacterized protein LOC108207223 [Daucus carota subsp. sativus] (e value: 2.3e-75; % identity: 56.49)
Query:  MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSFD--------------------------AYVRSFLGSVGFYRRFIKDFSKIALSLTNLL
        +SFG CNA GTFQRCMMSIFS+++ K I+VFMDDF+V+GDSFD                            +RSFLG  GFYRRFI+DFSKI   LTNLL
Subjt:  MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSFD--------------------------AYVRSFLGSVGFYRRFIKDFSKIALSLTNLL

Query:  QKDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEIIC----------LEQMIDKKLHAIDYAYRTLNQAQSNYTTTEKEFIAIVFAFDKFKSYII
        QKDV F  +D CK AF++LKQ+L S+ I+Q+P+W+LPFEI+C          L Q  DKK+HAI YA RTLN AQ NY TTEKE +A+VFA +KF+SY++
Subjt:  QKDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEIIC----------LEQMIDKKLHAIDYAYRTLNQAQSNYTTTEKEFIAIVFAFDKFKSYII

Query:  GSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRIYKSKK
        G+ +IV+TDH  +KYL++KKE+KPRL+RW+LLLQEF+L IKD+KG+ N VADHLSR+ + ++
Subjt:  GSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRIYKSKK

XP_019054205.1 PREDICTED: uncharacterized protein LOC109113903 [Nelumbo nucifera] (e value: 3.3e-74; % identity: 54.45)
Query:  MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSFDAY-------------------------------------------------------
        M FGLCNALGTFQRCMMSIFS FIEKCI++FMDDFTVYGDSFD Y                                                       
Subjt:  MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSFDAY-------------------------------------------------------

Query:  ----VRSFLGSVGFYRRFIKDFSKIALSLTNLLQKDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEII----------CLEQMIDKKLHAIDYA
            + SFLG  GFY RFI+DFSKIAL L+NLLQKDVPF   D CK+AF  LK+ L   LI+Q P+WNLPFEI+           L Q ID K H I YA
Subjt:  ----VRSFLGSVGFYRRFIKDFSKIALSLTNLLQKDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEII----------CLEQMIDKKLHAIDYA

Query:  YRTLNQAQSNYTTTEKEFIAIVFAFDKFKSYIIGSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRIYK
         +TLN AQSNYTTTEKE +AIVFA DKF+SY++GS +IV++DH  +KYL++ KESKPRL+RW+LLLQEF+L IKD+KG  NSVADHLSR+ K
Subjt:  YRTLNQAQSNYTTTEKEFIAIVFAFDKFKSYIIGSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRIYK

XP_019198426.1 PREDICTED: uncharacterized protein LOC109192308 [Ipomoea nil] (e value: 3.3e-74; % identity: 53.74)
Query:  MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSFDA-----------------------YVRSFLGSVGFYRRFIKDFSKIALSLTNLLQKD
        M FGLCNA GTFQRCMMSIFS FIE CI+VFMDDFTVYG    A                        VRSFLG  GFYR+FIKDFS+IA+ L+ LLQK+
Subjt:  MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSFDA-----------------------YVRSFLGSVGFYRRFIKDFSKIALSLTNLLQKD

Query:  VPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEIIC----------LEQMIDKKLHAIDYAYRTLNQAQSNYTTTEKEFIAIVFAFDKFKSYIIGSP
        V F     C++AFD LK  L S  ++Q  NW LPFEI+C          L Q + K+ H I YA RTLN AQ +Y+TTEKE +A++FA DKF+SY++GS 
Subjt:  VPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEIIC----------LEQMIDKKLHAIDYAYRTLNQAQSNYTTTEKEFIAIVFAFDKFKSYIIGSP

Query:  IIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRIYKSKKVCQFEKTSLMNISFKQIFKHHG
        +IV++DHT +KYL+ +KESKPRL+RW+LLLQEF+LTIKDRKGA N VADHLSR+ + + +    +    N S +Q+F+  G
Subjt:  IIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRIYKSKKVCQFEKTSLMNISFKQIFKHHG

XP_024927724.1 LOW QUALITY PROTEIN: uncharacterized protein LOC112491025 [Ziziphus jujuba] (e value: 4.3e-74; % identity: 56.51)
Query:  MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSFDAY---------------------------------VRSFLGSVGFYRRFIKDFSKIA
        M FGLCNA  TFQRCMMSIFSN +E  I++FMDDF+V+GDSF +                                  +RSFLG  GFYRRFIKDFSKI 
Subjt:  MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSFDAY---------------------------------VRSFLGSVGFYRRFIKDFSKIA

Query:  LSLTNLLQKDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEIIC----------LEQMIDKKLHAIDYAYRTLNQAQSNYTTTEKEFIAIVFAFD
          L NLL KDVPF+ DD C  AF+ LK++L+   I  +PNWNLPFEI+C          L Q  DKKLH I YA RTLN AQ NY TTEKE +AIVFAFD
Subjt:  LSLTNLLQKDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEIIC----------LEQMIDKKLHAIDYAYRTLNQAQSNYTTTEKEFIAIVFAFD

Query:  KFKSYIIGSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRIYKSKK
        KF++Y++GS  IVYTDH+ ++YL+SKKESKPRL+RWVLLLQEF+L I D+KG  N VADHLSR+  S++
Subjt:  KFKSYIIGSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRIYKSKK

TrEMBL top hits (e value, % identity, alignment)
A0A0L9TXH6 Reverse transcriptase (e value: 3.9e-73; % identity: 48.79)
Query:  MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSFDA--------------------------------------------------------
        M FGLCNA GTFQRCM+SIFS+F+E CI+VFMDDFTVYG SFD                                                         
Subjt:  MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSFDA--------------------------------------------------------

Query:  -------YVRSFLGSVGFYRRFIKDFSKIALSLTNLLQKDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEIIC----------LEQMIDKKLHA
                VRSFLG  GFYRRFIKDFSK  L L+ LLQKD+ F  DD CK+AFD LKQ L++T I+Q+P+W  PFE++C          L Q IDK    
Subjt:  -------YVRSFLGSVGFYRRFIKDFSKIALSLTNLLQKDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEIIC----------LEQMIDKKLHA

Query:  IDYAYRTLNQAQSNYTTTEKEFIAIVFAFDKFKSYIIGSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRIYKSKKV
        I YA RTL+ AQ+NYTTTEKE +AIVFA DKF+SY++GSP+IV+TDH  +K+L+ K ESKPRL+RWVLLLQEF+L IKDR GA+N VADHLSRI +++  
Subjt:  IDYAYRTLNQAQSNYTTTEKEFIAIVFAFDKFKSYIIGSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRIYKSKKV

Query:  C-------QFEKTSLMNISFKQIFKHHGTP
                 F   SL+ +S       H TP
Subjt:  C-------QFEKTSLMNISFKQIFKHHGTP

A0A1U8Q7Z8 Reverse transcriptase (e value: 1.6e-74; % identity: 54.45)
Query:  MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSFDAY-------------------------------------------------------
        M FGLCNALGTFQRCMMSIFS FIEKCI++FMDDFTVYGDSFD Y                                                       
Subjt:  MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSFDAY-------------------------------------------------------

Query:  ----VRSFLGSVGFYRRFIKDFSKIALSLTNLLQKDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEII----------CLEQMIDKKLHAIDYA
            + SFLG  GFY RFI+DFSKIAL L+NLLQKDVPF   D CK+AF  LK+ L   LI+Q P+WNLPFEI+           L Q ID K H I YA
Subjt:  ----VRSFLGSVGFYRRFIKDFSKIALSLTNLLQKDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEII----------CLEQMIDKKLHAIDYA

Query:  YRTLNQAQSNYTTTEKEFIAIVFAFDKFKSYIIGSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRIYK
         +TLN AQSNYTTTEKE +AIVFA DKF+SY++GS +IV++DH  +KYL++ KESKPRL+RW+LLLQEF+L IKD+KG  NSVADHLSR+ K
Subjt:  YRTLNQAQSNYTTTEKEFIAIVFAFDKFKSYIIGSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRIYK

A0A392MXJ5 Uncharacterized protein (Fragment) (e value: 1.4e-75; % identity: 55.35)
Query:  MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSFD----------------------------------------AYVRSFLGSVGFYRRFI
        M FGLCNA GTFQRCM+SIFS FIE C++VFMDDFTVYG SF+                                          +RSFLG  GFYRRFI
Subjt:  MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSFD----------------------------------------AYVRSFLGSVGFYRRFI

Query:  KDFSKIALSLTNLLQKDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEIIC----------LEQMIDKKLHAIDYAYRTLNQAQSNYTTTEKEFI
        KDFSKIAL L+NLLQKD+ F  DD CK+AFD LK+ L ST ++Q P+W  PFEI+C          L Q +DK  H I YA RTL+ AQSNYTTTEKE +
Subjt:  KDFSKIALSLTNLLQKDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEIIC----------LEQMIDKKLHAIDYAYRTLNQAQSNYTTTEKEFI

Query:  AIVFAFDKFKSYIIGSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRI
        AIVFA DKF+SY++GS ++++TDH  +KYL+ K ++KPRL+RW+LLLQEF+L IKD+ GA N VADHLSRI
Subjt:  AIVFAFDKFKSYIIGSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRI

A0A6P6FPE5 Reverse transcriptase (e value: 7.8e-74; % identity: 56.77)
Query:  MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSFDAY------------------------------VRSFLGSVGFYRRFIKDFSKIALSL
        M FGLCNA  TFQRCMMSIFSN +E  I++FMDDF+V+GDSF +                               +RSFLG  GFYRRFIKDFSKI   L
Subjt:  MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSFDAY------------------------------VRSFLGSVGFYRRFIKDFSKIALSL

Query:  TNLLQKDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEIIC----------LEQMIDKKLHAIDYAYRTLNQAQSNYTTTEKEFIAIVFAFDKFK
         NLL KDVPF+ DD C  AF+ LK++L+   I+ + NWNLPFEI+C          L Q  DKKLH I YA RTLN  Q NY TTEKE +AIVFAFDKF+
Subjt:  TNLLQKDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEIIC----------LEQMIDKKLHAIDYAYRTLNQAQSNYTTTEKEFIAIVFAFDKFK

Query:  SYIIGSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRIYKSKK
        +Y++GS  IVYTDH+ +KYL+SKKESKPRL+RWVLLLQEF+L I D+KG  N VADHLSR+  S++
Subjt:  SYIIGSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRIYKSKK

A0A6P6G0T4 Reverse transcriptase (e value: 2.1e-74; % identity: 56.51)
Query:  MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSFDAY---------------------------------VRSFLGSVGFYRRFIKDFSKIA
        M FGLCNA  TFQRCMMSIFSN +E  I++FMDDF+V+GDSF +                                  +RSFLG  GFYRRFIKDFSKI 
Subjt:  MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSFDAY---------------------------------VRSFLGSVGFYRRFIKDFSKIA

Query:  LSLTNLLQKDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEIIC----------LEQMIDKKLHAIDYAYRTLNQAQSNYTTTEKEFIAIVFAFD
          L NLL KDVPF+ DD C  AF+ LK++L+   I  +PNWNLPFEI+C          L Q  DKKLH I YA RTLN AQ NY TTEKE +AIVFAFD
Subjt:  LSLTNLLQKDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEIIC----------LEQMIDKKLHAIDYAYRTLNQAQSNYTTTEKEFIAIVFAFD

Query:  KFKSYIIGSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRIYKSKK
        KF++Y++GS  IVYTDH+ ++YL+SKKESKPRL+RWVLLLQEF+L I D+KG  N VADHLSR+  S++
Subjt:  KFKSYIIGSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRIYKSKK

SwissProt top hits (e value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 (e value: 4.5e-26; % identity: 28.87)
Query:  MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSFDAY-------------------------------------------------------
        M FGL NA  TFQRCM  I    + K   V++DD  V+  S D +                                                       
Subjt:  MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSFDAY-------------------------------------------------------

Query:  --------VRSFLGSVGFYRRFIKDFSKIALSLTNLLQKDVPF-LIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEI------ICLEQMIDKKLHAIDY
                +++FLG  G+YR+FI +F+ IA  +T  L+K++     +     AF  LK  +    IL+ P++   F +      + L  ++ +  H + Y
Subjt:  --------VRSFLGSVGFYRRFIKDFSKIALSLTNLLQKDVPF-LIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEI------ICLEQMIDKKLHAIDY

Query:  AYRTLNQAQSNYTTTEKEFIAIVFAFDKFKSYIIGSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRI
          RTLN+ + NY+T EKE +AIV+A   F+ Y++G    + +DH  + +L   K+   +L RW + L EF+  IK  KG  N VAD LSRI
Subjt:  AYRTLNQAQSNYTTTEKEFIAIVFAFDKFKSYIIGSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRI

P10394 Retrovirus-related Pol polyprotein from transposon 412 (e value: 1.4e-24; % identity: 37.84)
Query:  RSFLGSVGFYRRFIKDFSKIALSLTNLLQKDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEI----------ICLEQMIDKKLHAIDYAYRTLN
        R F+    +YRRFIK+F+  +  +T L +K+VPF   D C+KAF  LK +L++  +LQ P+++  F I            L Q  +     + YA R   
Subjt:  RSFLGSVGFYRRFIKDFSKIALSLTNLLQKDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEI----------ICLEQMIDKKLHAIDYAYRTLN

Query:  QAQSNYTTTEKEFIAIVFAFDKFKSYIIGSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRI
        + +SN +TTE+E  AI +A   F+ YI G    V TDH  + YL S      +L R  L L+E+N T++  KG +N VAD LSRI
Subjt:  QAQSNYTTTEKEFIAIVFAFDKFKSYIIGSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRI

P10401 Retrovirus-related Pol polyprotein from transposon gypsy (e value: 1.3e-20; % identity: 31.96)
Query:  VRSFLGSVGFYRRFIKDFSKIALSLTNLLQ-----------KDVPFLIDDNCKKAFDDLKQRLVS-TLILQSPNWNLPFEIIC------LEQMIDKKLHA
        VRSFLG   +YR FIKDF+ IA  +T++L+           K +P   ++  + AF  L+  L S  +IL+ P++  PF++        +  ++ ++   
Subjt:  VRSFLGSVGFYRRFIKDFSKIALSLTNLLQ-----------KDVPFLIDDNCKKAFDDLKQRLVS-TLILQSPNWNLPFEIIC------LEQMIDKKLHA

Query:  IDYAYRTLNQAQSNYTTTEKEFIAIVFAFDKFKSYIIGS-PIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSR
        I    RTL Q + NY T E+E +AIV+A  K ++++ GS  I ++TDH  + + V+ + +  ++ RW   + + N  +  + G  N VAD LSR
Subjt:  IDYAYRTLNQAQSNYTTTEKEFIAIVFAFDKFKSYIIGS-PIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSR

P20825 Retrovirus-related Pol polyprotein from transposon 297 (e value: 3.8e-25; % identity: 28.91)
Query:  MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSF----------------------------------------------------------
        M FGL NA  TFQRCM +I    + K   V++DD  ++  S                                                           
Subjt:  MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSF----------------------------------------------------------

Query:  -----DAYVRSFLGSVGFYRRFIKDFSKIALSLTNLLQKDVPFLIDDNCK----KAFDDLKQRLVSTLILQSPNWNLPFEI------ICLEQMIDKKLHA
             D  +R+FLG  G+YR+FI +++ IA  +T+ L+K       D  K    +AF+ LK  ++   ILQ P++   F +      + L  ++ +  H 
Subjt:  -----DAYVRSFLGSVGFYRRFIKDFSKIALSLTNLLQKDVPFLIDDNCK----KAFDDLKQRLVSTLILQSPNWNLPFEI------ICLEQMIDKKLHA

Query:  IDYAYRTLNQAQSNYTTTEKEFIAIVFAFDKFKSYIIGSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRI
        I +  RTLN  + NY+  EKE +AIV+A   F+ Y++G   ++ +DH  +++L + KE   +L RW + L E+   I   KG  NSVAD LSRI
Subjt:  IDYAYRTLNQAQSNYTTTEKEFIAIVFAFDKFKSYIIGSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRI

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus (e value: 3.2e-24; % identity: 33.33)
Query:  VRSFLGSVGFYRRFIKDFSKIALSLTNLLQ-----------KDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEI----------ICLEQMIDKK
        ++ FLG   +YR+FI+D++K+A  LTNL +             VP  +D+   ++F+DLK  L S+ IL  P +  PF +            L Q    +
Subjt:  VRSFLGSVGFYRRFIKDFSKIALSLTNLLQ-----------KDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPFEI----------ICLEQMIDKK

Query:  LHAIDYAYRTLNQAQSNYTTTEKEFIAIVFAFDKFKSYIIGSPII-VYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRI
           I Y  R+LN+ + NY T EKE +AI+++ D  ++Y+ G+  I VYTDH  + + +  +    +L RW   ++E+N  +  + G +N VAD LSRI
Subjt:  LHAIDYAYRTLNQAQSNYTTTEKEFIAIVFAFDKFKSYIIGSPII-VYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSRI

Arabidopsis top hits (e value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein (e value: 1.5e-05; % identity: 38.81)
Query:  VRSFLGSVGFYRRFIKDFSKIALSLTNLLQKDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPF
        +R FLG  G+YRRF+K++ KI   LT LL+K+      +    AF  LK  + +  +L  P+  LPF
Subjt:  VRSFLGSVGFYRRFIKDFSKIALSLTNLLQKDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNLPF


Sequences
CDS sequence
ATGTCATTTGGTCTTTGTAATGCACTGGGCACCTTCCAACGTTGCATGATGAGCATATTTTCCAACTTTATTGAAAAATGCATTAAAGTTTTCATGGATGATTTCACAGT
TTATGGTGATAGTTTTGATGCATATGTTAGATCATTCCTTGGTAGTGTCGGCTTTTATAGACGATTTATAAAAGATTTTTCTAAAATTGCTTTGTCTCTAACTAATCTCT
TGCAAAAAGATGTACCATTTCTGATTGATGACAATTGTAAGAAGGCATTTGATGATCTCAAACAAAGGTTAGTCTCTACCCTTATCCTTCAATCTCCTAATTGGAATTTA
CCTTTCGAAATAATCTGTTTAGAACAAATGATAGATAAAAAATTGCATGCTATAGACTATGCATATAGGACCCTTAACCAAGCACAATCTAACTACACCACAACTGAAAA
AGAATTTATTGCTATCGTTTTTGCTTTTGATAAGTTTAAAAGCTACATTATTGGCTCCCCAATAATTGTTTACACTGATCATACAGTGGTTAAGTATCTTGTATCAAAAA
AAGAATCAAAACCAAGACTTGTTCGATGGGTTTTACTTTTGCAAGAATTCAACCTAACCATCAAGGATAGAAAAGGAGCCAACAATTCTGTAGCCGACCATCTTAGTCGA
ATTTACAAAAGCAAGAAAGTATGCCAATTCGAGAAGACTTCCCTGATGAACATCTCATTCAAACAGATCTTCAAGCACCATGGTACACCGATATTGTAA
mRNA sequence
ATGTCATTTGGTCTTTGTAATGCACTGGGCACCTTCCAACGTTGCATGATGAGCATATTTTCCAACTTTATTGAAAAATGCATTAAAGTTTTCATGGATGATTTCACAGT
TTATGGTGATAGTTTTGATGCATATGTTAGATCATTCCTTGGTAGTGTCGGCTTTTATAGACGATTTATAAAAGATTTTTCTAAAATTGCTTTGTCTCTAACTAATCTCT
TGCAAAAAGATGTACCATTTCTGATTGATGACAATTGTAAGAAGGCATTTGATGATCTCAAACAAAGGTTAGTCTCTACCCTTATCCTTCAATCTCCTAATTGGAATTTA
CCTTTCGAAATAATCTGTTTAGAACAAATGATAGATAAAAAATTGCATGCTATAGACTATGCATATAGGACCCTTAACCAAGCACAATCTAACTACACCACAACTGAAAA
AGAATTTATTGCTATCGTTTTTGCTTTTGATAAGTTTAAAAGCTACATTATTGGCTCCCCAATAATTGTTTACACTGATCATACAGTGGTTAAGTATCTTGTATCAAAAA
AAGAATCAAAACCAAGACTTGTTCGATGGGTTTTACTTTTGCAAGAATTCAACCTAACCATCAAGGATAGAAAAGGAGCCAACAATTCTGTAGCCGACCATCTTAGTCGA
ATTTACAAAAGCAAGAAAGTATGCCAATTCGAGAAGACTTCCCTGATGAACATCTCATTCAAACAGATCTTCAAGCACCATGGTACACCGATATTGTAA
Protein sequence
MSFGLCNALGTFQRCMMSIFSNFIEKCIKVFMDDFTVYGDSFDAYVRSFLGSVGFYRRFIKDFSKIALSLTNLLQKDVPFLIDDNCKKAFDDLKQRLVSTLILQSPNWNL
PFEIICLEQMIDKKLHAIDYAYRTLNQAQSNYTTTEKEFIAIVFAFDKFKSYIIGSPIIVYTDHTVVKYLVSKKESKPRLVRWVLLLQEFNLTIKDRKGANNSVADHLSR
IYKSKKVCQFEKTSLMNISFKQIFKHHGTPIL
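
The protein sequence above can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code, which the record does not state explicitly. The function name `translate` is illustrative, and only the first 30 and last 12 nucleotides of the CDS are used, to keep the example short.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)

# First 30 nt and last 12 nt of the CDS listed above.
print(translate("ATGTCATTTGGTCTTTGTAATGCACTGGGC"))  # MSFGLCNALG
print(translate("CCGATATTGTAA"))                    # PIL
```

Both fragments agree with the start (MSFGLCNALG) and end (PIL) of the listed protein sequence. Passing the full 759-nt CDS to the same function should reproduce all 252 residues.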