; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026151 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026151
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr10:30816856..30822902
RNA-Seq ExpressionLag0026151
SyntenyLag0026151
Gene Ontology termsGO:0016740 - transferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN21854.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]1.1e-20148.26Show/hide
Query:  KLQTFYVGLNKNSQVLVDSSANGALLRKTYDEAHAILDQIERNNYEWGTADDKRRRPIKTSSGSFEVDPLTPVNAKIDALTTKMDALTASTTPLIAQLNA
        ++ TFY GL +  +  +D     + L  T  E H +L+ +  N+YE      +R  P K ++G  EVD +T +NAKID L   M                
Subjt:  KLQTFYVGLNKNSQVLVDSSANGALLRKTYDEAHAILDQIERNNYEWGTADDKRRRPIKTSSGSFEVDPLTPVNAKIDALTTKMDALTASTTPLIAQLNA

Query:  VGCGICGEGHTHDQCPSNPKSIFYVG--------------------------------------PQGAVRTT----------HFQAHTIKDGEITQTFHG
             CGEGH  DQCP + +SI +V                                        QG    T            QA T+++G   Q    
Subjt:  VGCGICGEGHTHDQCPSNPKSIFYVG--------------------------------------PQGAVRTT----------HFQAHTIKDGEITQTFHG

Query:  IVAQINSHNKAQEEKVVEEEEPKKTE-PHAQREPNAYKLAVPFPQRLRKKNDEAQFKKFLNFLRELHINIPLVDALEKMPSYAKFLKDILTNKRQWKEFE
            +    K++E++V  EE+ K+ E P    +P    L  PFPQRL+K+  + QF KFL   ++LHINIP  +ALE+MPSY KF+KDIL+ KR+  ++E
Subjt:  IVAQINSHNKAQEEKVVEEEEPKKTE-PHAQREPNAYKLAVPFPQRLRKKNDEAQFKKFLNFLRELHINIPLVDALEKMPSYAKFLKDILTNKRQWKEFE

Query:  TVALTSVYSAILANKLPPKLKDPGSFTIHCSIGGIDVDRALCDLGANINLMPLPVYKPLGIGEARPNTVTLQLADRSVVHLEGKIEDVLVKVDKFIFPID
        TVALT   SAI+ NKLPPKLKDPG              RALCDLGA+INLMP  +Y+ LG+ EA+P ++TLQLADRS+ + +G IED+LVKVDKFIFP D
Subjt:  TVALTSVYSAILANKLPPKLKDPGSFTIHCSIGGIDVDRALCDLGANINLMPLPVYKPLGIGEARPNTVTLQLADRSVVHLEGKIEDVLVKVDKFIFPID

Query:  FIILDYKADREVPIILGRPFLATGQTLIHVQKRELTMHM----LLSNAFNTTREEAKEEQVEDNCILAQTTGK----FHALDLKER--------------
        F++LD + D EVPIILGRPFLATG+TLI VQK ELTM +    +  N F   +   + ++     +     GK       LD  ER              
Subjt:  FIILDYKADREVPIILGRPFLATGQTLIHVQKRELTMHM----LLSNAFNTTREEAKEEQVEDNCILAQTTGK----FHALDLKER--------------

Query:  --ISTL------------------------PSIAQPPVLELKPLPTHLKYRFLGEEESLSVIISSKLNQPQEHLLMQVLAKHKKAIGWSLADIKGISPSY
          + TL                        PSI +PP LELKPLP+HL Y +LGE ++L VIISS L+  Q   L++VL  HK AIGW++ADIKGISPS+
Subjt:  --ISTL------------------------PSIAQPPVLELKPLPTHLKYRFLGEEESLSVIISSKLNQPQEHLLMQVLAKHKKAIGWSLADIKGISPSY

Query:  CMPKIRLLDESNEFIERQRRLNPAMMKVVRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGGMTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHF
        CM KI L D+    +E QRRLNP M +VV+KEIIKWL+AG+IYPI+D  W+SPVQCVPKKGG+TVV N +NE IP++T TGWR+CMDYRKLN  T+KDHF
Subjt:  CMPKIRLLDESNEFIERQRRLNPAMMKVVRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGGMTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHF

Query:  PLPFIDQMLDRLVGQTYYYFLDGYSGYNQISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLT
        PLPFIDQMLDRL G+ +Y FLDGYSGYNQI+IAPEDQEKT FTCPYGTFAFRR+PF LCNA  TFQRCMM IF++++E  ++VFMDDFS+YG SF +CL 
Subjt:  PLPFIDQMLDRLVGQTYYYFLDGYSGYNQISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLT

Query:  QLERVSEQCEETNLVLNWEKCHFIVEEGIVLCH
         L  V ++CE+TNLVLNWEKCHF+V+EGIVL H
Subjt:  QLERVSEQCEETNLVLNWEKCHFIVEEGIVLCH

PIN22487.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]1.7e-20847.9Show/hide
Query:  KLQTFYVGLNKNSQVLVDSSANGALLRKTYDEAHAILDQIERNNYEWGTADDKRRRPIKTSSGSFEVDPLTPVNAKIDALTTKMDALTASTTPLIAQLNA
        ++ TFY GL +  +  +D     + L  T  E H +L+ +  N+YE      +R  P K ++G  EVD +T +NAKID L   M     +      Q   
Subjt:  KLQTFYVGLNKNSQVLVDSSANGALLRKTYDEAHAILDQIERNNYEWGTADDKRRRPIKTSSGSFEVDPLTPVNAKIDALTTKMDALTASTTPLIAQLNA

Query:  VGCGICGEGHTHDQCPSNPKSIFYV-----------------------------------------GPQGAVR-------------TTHFQAHTIKDGEI
        V C  CGEGH  DQCP + +SI +V                                         G Q  V+                F A T  + + 
Subjt:  VGCGICGEGHTHDQCPSNPKSIFYV-----------------------------------------GPQGAVR-------------TTHFQAHTIKDGEI

Query:  TQTFHGIVAQ-INSH---------------------------------------NKAQEEKVVEEEEPKKTE-PHAQREPNAYKLAVPFPQRLRKKNDEA
         +T  G +A  INS                                         K++E++V+ EE+ K+ E P    +P    L  PFPQRL+K+  E 
Subjt:  TQTFHGIVAQ-INSH---------------------------------------NKAQEEKVVEEEEPKKTE-PHAQREPNAYKLAVPFPQRLRKKNDEA

Query:  QFKKFLNFLRELHINIPLVDALEKMPSYAKFLKDILTNKRQWKEFETVALTSVYSAILANKLPPKLKDPGSFTIHCSIGGIDVDRALCDLGANINLMPLP
        QF KFL   ++LHINIP  +ALE+MPSY KF+KDIL+ KR+  ++ETVALT   SAI+ NKLPPKLKDPGSFTI C+IG     RALCDLGA+INLMP  
Subjt:  QFKKFLNFLRELHINIPLVDALEKMPSYAKFLKDILTNKRQWKEFETVALTSVYSAILANKLPPKLKDPGSFTIHCSIGGIDVDRALCDLGANINLMPLP

Query:  VYKPLGIGEARPNTVTLQLADRSVVHLEGKIEDVLVKVDKFIFPIDFIILDYKADREVPIILGRPFLATGQTLIHVQKRELTMHMLLSNAFNTTREEAKE
        +Y+ LG+GEA+P ++TLQLADRS+ + +G IED+LVKVDKFIFP DF++LD + D EVPIILGRPFLATG+TLI VQK ELTM +          +  K 
Subjt:  VYKPLGIGEARPNTVTLQLADRSVVHLEGKIEDVLVKVDKFIFPIDFIILDYKADREVPIILGRPFLATGQTLIHVQKRELTMHMLLSNAFNTTREEAKE

Query:  EQVEDNCILAQTTGKFHA----------------LDL----------------------KERISTL----------PSIAQPPVLELKPLPTHLKYRFLG
            D C       K                   LDL                        R+ +L          PSI  PP LELKPLP+HL Y +LG
Subjt:  EQVEDNCILAQTTGKFHA----------------LDL----------------------KERISTL----------PSIAQPPVLELKPLPTHLKYRFLG

Query:  EEESLSVIISSKLNQPQEHLLMQVLAKHKKAIGWSLADIKGISPSYCMPKIRLLDESNEFIERQRRLNPAMMKVVRKEIIKWLNAGVIYPIADSRWVSPV
        E ++L VIISS L+  Q   L++VL  HK AIGW++ADIKGISPS+CM KI L D+    +E QRRLNP M +VV+KEIIKWL+AG+IYPI+DS WVSPV
Subjt:  EEESLSVIISSKLNQPQEHLLMQVLAKHKKAIGWSLADIKGISPSYCMPKIRLLDESNEFIERQRRLNPAMMKVVRKEIIKWLNAGVIYPIADSRWVSPV

Query:  QCVPKKGGMTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHFPLPFIDQMLDRLVGQTYYYFLDGYSGYNQISIAPEDQEKTAFTCPYGTFAFRRM
        QCVPKKGG+TVV N +NELIP+RT TGWR+CMDYRKLN  T+KDHFPLPFIDQMLDRL G+ +Y FLDGYSGYNQI+IAPEDQEKT FTCPYGTFAFRRM
Subjt:  QCVPKKGGMTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHFPLPFIDQMLDRLVGQTYYYFLDGYSGYNQISIAPEDQEKTAFTCPYGTFAFRRM

Query:  PFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLTQLERVSEQCEETNLVLNWEKCHFIVEEGIVLCH
        PF LCNA  TFQRCMM IF++++E  ++VFMDDFS+YG SF +CL  L  V ++CE+TNL+LNWEKCHF+V+EGIVL H
Subjt:  PFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLTQLERVSEQCEETNLVLNWEKCHFIVEEGIVLCH

XP_022156989.1 uncharacterized protein LOC111023818 [Momordica charantia]2.6e-20163.05Show/hide
Query:  PNAYKLAVPFPQRLRKKNDEAQFKKFLNFLRELHINIPLVDALEKMPSYAKFLKDILTNKRQWKEFETVALTSVYSAILANKLPPKLKDPGSFTIHCSIG
        P  +++  P+PQRL+KKN + QF +FL  L++LHINIPL++ALE+MP+Y KFLKDIL  KR+  EFE VALT  +SAIL  KLP K+ DPGSFTI   IG
Subjt:  PNAYKLAVPFPQRLRKKNDEAQFKKFLNFLRELHINIPLVDALEKMPSYAKFLKDILTNKRQWKEFETVALTSVYSAILANKLPPKLKDPGSFTIHCSIG

Query:  GIDVDRALCDLGANINLMPLPVYKPLGIGEARPNTVTLQLADRSVVHLEGKIEDVLVKVDKFIFPIDFIILDYKADREVPIILGRPFLATGQTLIHVQKR
        G +V  ALCDLGA+INLMPL VY+ LGIGEARP TVTLQLADRS+ +LEGKIEDVLV+VDKFIFP DFIILDY+AD+E+PIILGRPFL+TG+ LI V   
Subjt:  GIDVDRALCDLGANINLMPLPVYKPLGIGEARPNTVTLQLADRSVVHLEGKIEDVLVKVDKFIFPIDFIILDYKADREVPIILGRPFLATGQTLIHVQKR

Query:  ELTMHM----LLSNAFNTTR-----EEAKEEQVEDNCIL--AQTTGKFHALD------LKERIST--LPSIAQPPVLELKPLPTHLKYRFLGEEESLSVI
        ELT+ +    +  + FN+ +     EE    ++ D+ +    QT    + L+      +K+R+     PS+ + P LELK LP+HLKY +LGE E+L V 
Subjt:  ELTMHM----LLSNAFNTTR-----EEAKEEQVEDNCIL--AQTTGKFHALD------LKERIST--LPSIAQPPVLELKPLPTHLKYRFLGEEESLSVI

Query:  ISSKLNQPQEHLLMQVLAKHKKAIGWSLADIKGISPSYCMPKIRLLDESNEFIERQRRLNPAMMKVVRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGG
        I++ L + +E  L+++L  HKKAIGW+LADIKGISPSYCM KI L +     IE QRRLNPAM +VV+KEIIKWL+AG+IYPIAD   +SPVQCVPKKGG
Subjt:  ISSKLNQPQEHLLMQVLAKHKKAIGWSLADIKGISPSYCMPKIRLLDESNEFIERQRRLNPAMMKVVRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGG

Query:  MTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHFPLPFIDQMLDRLVGQTYYYFLDGYSGYNQISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNAL
        +TVV N NNELIP+RT TGW ICMDYRKLN  TKKDHFPLPFIDQMLD LVGQ YYY LDGY+GYNQI+I P+DQ+KT FTCPYGTF+FRRMPF LCNA 
Subjt:  MTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHFPLPFIDQMLDRLVGQTYYYFLDGYSGYNQISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNAL

Query:  ETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLTQLERVSEQCEETNLVLNWEKCHFIVEEGIVLCH
         TFQRCMM IF +L+E  V+VFMDDFS++ K F++ L+ LE+V  +CE+TNLVLNWEKCHF+V EGIVL H
Subjt:  ETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLTQLERVSEQCEETNLVLNWEKCHFIVEEGIVLCH

XP_023522102.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785979 [Cucurbita pepo subsp. pepo]2.2e-21348.63Show/hide
Query:  LQTFYVGLNKNSQVLVDSSANGALLRKTYDEAHAILDQIERNNYEWGTADDKRRRPIKTSSGSFEVDPLTPVNAKIDALTTKMDALT----------AST
        ++TFY GLN  ++ +VD+SANGA+L KTY+EA+ IL++I  NN +W    D R  P + + G  EVD L+ +NA++ ++T  +  L             T
Subjt:  LQTFYVGLNKNSQVLVDSSANGALLRKTYDEAHAILDQIERNNYEWGTADDKRRRPIKTSSGSFEVDPLTPVNAKIDALTTKMDALT----------AST

Query:  TPLIAQLNAVGCGICGE-----------------------GHTHDQCPSNP------------------------KSIFYVGPQGAVRTTHFQAHTIKDG
          +I Q  A  C +  E                       G   ++  + P                        +S   +  +G     H  + + +  
Subjt:  TPLIAQLNAVGCGICGE-----------------------GHTHDQCPSNP------------------------KSIFYVGPQGAVRTTHFQAHTIKDG

Query:  EITQTFHGIVAQINSHNKAQEEKVVEEE---------EPKKTEPHAQ--REPNAYKLAVPFPQRLRKKNDEAQFKKFLNFLRELHINIPLVDALEKMPSY
        +  Q     V Q   HNK   E  V++E         +P KT+      +E   Y    PFPQR+++K +EA F+KF++  +E+HINIPLV+AL++MP+Y
Subjt:  EITQTFHGIVAQINSHNKAQEEKVVEEE---------EPKKTEPHAQ--REPNAYKLAVPFPQRLRKKNDEAQFKKFLNFLRELHINIPLVDALEKMPSY

Query:  AKFLKDILTNKRQWKEFETVALTSVYSAILANKLPPKLKDPGSFTIHCSIGGIDVDRALCDLGANINLMPLPVYKPLGIGEARPNTVTLQLADRSVVHLE
         KFLKD+LTN+R+++EF+ V L    SAIL NK+P K KDPGSFTI  SIGG  + RALCDLG++INLMPL +YK LGIGEARP TVTLQLADRS  + E
Subjt:  AKFLKDILTNKRQWKEFETVALTSVYSAILANKLPPKLKDPGSFTIHCSIGGIDVDRALCDLGANINLMPLPVYKPLGIGEARPNTVTLQLADRSVVHLE

Query:  GKIEDVLVKVDKFIFPIDFIILDYKADREVPIILGRPFLATGQTLIHVQKRELTMHM--------------------LLSNAFNTTREEAKEE-------
        GKIED+L++VDKFIFP DFIILDY+AD +VPIILGRPFL TG+TL+ V K  +T+ M                      S  +  T + A EE       
Subjt:  GKIEDVLVKVDKFIFPIDFIILDYKADREVPIILGRPFLATGQTLIHVQKRELTMHM--------------------LLSNAFNTTREEAKEE-------

Query:  ---------QVEDNCILAQTTGKFHALDLKERIST--LPSIAQPPVLELKPLPTHLKYRFLGEEESLSVIISSKLNQPQEHLLMQVLAKHKKAIGWSLAD
                 ++E   +L + +  F +L+ + R S+   PSI + P L+LKPLP +LKY +LG++++L +IIS+ L+  QE +L++ L KHK AIGW+LAD
Subjt:  ---------QVEDNCILAQTTGKFHALDLKERIST--LPSIAQPPVLELKPLPTHLKYRFLGEEESLSVIISSKLNQPQEHLLMQVLAKHKKAIGWSLAD

Query:  IKGISPSYCMPKIRLLDESNEFIERQRRLNPAMMKVVRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGGMTVVANANNELIPSRTTTGWRICMDYRKLN
        IKGISPS CM KI+L +   + IE+QRRLNP M +VVRKEI+KWL+AG+IYPIA+S  VSP+QCVPKKGG+TV+AN NNELI +R   GWRICMDYR+LN
Subjt:  IKGISPSYCMPKIRLLDESNEFIERQRRLNPAMMKVVRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGGMTVVANANNELIPSRTTTGWRICMDYRKLN

Query:  AITKKDHFPLPFIDQMLDRLVGQTYYYFLDGYSGYNQISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYG
          T+KDHFPLPFIDQMLDRL G+++Y FLDGYSGYNQI+I+PEDQEKT FTCPYG FAFRRMPF LCNA  TFQRCMM IF++++E  +++FMDDFS+YG
Subjt:  AITKKDHFPLPFIDQMLDRLVGQTYYYFLDGYSGYNQISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYG

Query:  KSFADCLTQLERVSEQCEETNLVLNWEKCHFIVEEGIVLCH
        +SF  CL  L +V ++CEE NLVLNWEKCHF+V EGIVL H
Subjt:  KSFADCLTQLERVSEQCEETNLVLNWEKCHFIVEEGIVLCH

XP_030497826.1 LOW QUALITY PROTEIN: uncharacterized protein LOC115713483 [Cannabis sativa]3.4e-20651.91Show/hide
Query:  KLQTFYVGLNKNSQVLVDSSANGALLRKTYDEAHAILDQIERNNYEWGTADDKRRRPIKTSSGSFEVDPLTPVNAKIDALTTKMDALTAS-------TTP
        ++  FY GL   ++ L+D++A GA +RK+ +EA  +L+++   N +W T     R P K  +G  EVD +T + A+ +   ++M   T +        T 
Subjt:  KLQTFYVGLNKNSQVLVDSSANGALLRKTYDEAHAILDQIERNNYEWGTADDKRRRPIKTSSGSFEVDPLTPVNAKIDALTTKMDALTAS-------TTP

Query:  LIAQLNAVGCGICGEGHTHDQCPSNPKSIFYVGPQGAVRTT-------HFQAHTIKDGEITQTFHGIVAQINSHNKAQEEKVVEEEEPKKTEPHAQREPN
        L+ Q          +  T         +     PQG + +T       + +A T++ G   +++ G  +Q    N   E+     E+ K T+   Q+E +
Subjt:  LIAQLNAVGCGICGEGHTHDQCPSNPKSIFYVGPQGAVRTT-------HFQAHTIKDGEITQTFHGIVAQINSHNKAQEEKVVEEEEPKKTEPHAQREPN

Query:  A-----YKLAVPFPQRLRKKNDEAQFKKFLNFLRELHINIPLVDALEKMPSYAKFLKDILTNKRQWKEFETVALTSVYSAILANKLPPKLKDPGSFTIHC
              + + +P+PQRLRK N + QF KFL   R+LHINIP  +ALE+MPSY KF+K+IL+ KR+ ++FETVALT   SAIL  KLPPKLKDPGSFTI C
Subjt:  A-----YKLAVPFPQRLRKKNDEAQFKKFLNFLRELHINIPLVDALEKMPSYAKFLKDILTNKRQWKEFETVALTSVYSAILANKLPPKLKDPGSFTIHC

Query:  SIGGIDVDRALCDLGANINLMPLPVYKPLGIGEARPNTVTLQLADRSVVHLEGKIEDVLVKVDKFIFPIDFIILDYKADREVPIILGRPFLATGQTLIHV
        +IG I+   ALCDLGA+INLMPL V+K L +GEA+P TVTLQLADRS+ H  G IEDVLVKVDKFIFP DFI+LD + D  VPIILGRPFLATG+ LI +
Subjt:  SIGGIDVDRALCDLGANINLMPLPVYKPLGIGEARPNTVTLQLADRSVVHLEGKIEDVLVKVDKFIFPIDFIILDYKADREVPIILGRPFLATGQTLIHV

Query:  QKRELTMHMLLSNAFNTTREEAKEEQVEDNCILAQTTGKFHALDLK--ERISTL-----PSIAQPPVLELKPLPTHLKYRFLGEEESLSVIISSKLNQPQ
           E   H+         +EE  E+  ++    A     +  L+ +  E +  +     PS  +PP LELK LP HL+Y +LGE ++L VI++S L+  +
Subjt:  QKRELTMHMLLSNAFNTTREEAKEEQVEDNCILAQTTGKFHALDLK--ERISTL-----PSIAQPPVLELKPLPTHLKYRFLGEEESLSVIISSKLNQPQ

Query:  EHLLMQVLAKHKKAIGWSLADIKGISPSYCMPKIRLLDESNEFIERQRRLNPAMMKVVRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGGMTVVANANN
           L++VL KHKKAIGW+LADIKGISPS  M +I + +     I+ QRRLNP M +VVRKE++KWL+AGV YPI+DS+WVSPVQ VPKKGGMTVV N  N
Subjt:  EHLLMQVLAKHKKAIGWSLADIKGISPSYCMPKIRLLDESNEFIERQRRLNPAMMKVVRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGGMTVVANANN

Query:  ELIPSRTTTGWRICMDYRKLNAITKKDHFPLPFIDQMLDRLVGQTYYYFLDGYSGYNQISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNALETFQRCMMT
        ELIP+RT TGWRIC+DYRKLN  T+KDHFPLPFIDQMLD+L GQ YY FLDGYSGY+QI+IAPEDQEKT FTCPYGTFAFRRMPF LCNA  TFQRCMM 
Subjt:  ELIPSRTTTGWRICMDYRKLNAITKKDHFPLPFIDQMLDRLVGQTYYYFLDGYSGYNQISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNALETFQRCMMT

Query:  IFSELLEKSVKVFMDDFSIYGKSFADCLTQLERVSEQCEETNLVLNWEKCHFIVEEGIV
        IFS+L+EK ++VFMDDFS++G SF  CL+ LE V  +CE++NLVLNWEKCHF+V EGIV
Subjt:  IFSELLEKSVKVFMDDFSIYGKSFADCLTQLERVSEQCEETNLVLNWEKCHFIVEEGIV

TrEMBL top hitse value%identityAlignment
A0A2G9G6G2 Reverse transcriptase2.3e-20048.54Show/hide
Query:  KLQTFYVGLNKNSQVLVDSSANGALLRKTYDEAHAILDQIERNNYEWGTADDKRRRPIKTSSGSFEVDPLTPVNAKIDALTTKMDALTASTTPLIAQLNA
        ++ TFY GL +  +  +D     + L  T  E H +L+ +  N+YE      +R  P K + G  EVD +T +NAKID L   M     +      Q   
Subjt:  KLQTFYVGLNKNSQVLVDSSANGALLRKTYDEAHAILDQIERNNYEWGTADDKRRRPIKTSSGSFEVDPLTPVNAKIDALTTKMDALTASTTPLIAQLNA

Query:  VGCGICGEGHTHDQCPSNPKSIFYV----------------------------GPQGAVRTTHFQAH---------TIKDGEITQTFHGIVAQINSHNKA
        V C  CGEGH  DQCP + +SI +V                              QG      FQ             K   + +T    +A   ++ K 
Subjt:  VGCGICGEGHTHDQCPSNPKSIFYV----------------------------GPQGAVRTTHFQAH---------TIKDGEITQTFHGIVAQINSHNKA

Query:  QEEKV----------VEEEEPKKTEPHAQ-----REPNAYKLAVPFPQRLRKKN--DEAQFKKFLNFLRELHINIPLVDALEKMPSYAKFLKDILTNKRQ
         E ++           +   P  TEP+++     R     +  V  P + ++K    E + K+    L +LHINIP  +ALE+MPSY KF+KDIL+ KR+
Subjt:  QEEKV----------VEEEEPKKTEPHAQ-----REPNAYKLAVPFPQRLRKKN--DEAQFKKFLNFLRELHINIPLVDALEKMPSYAKFLKDILTNKRQ

Query:  WKEFETVALTSVYSAILANKLPPKLKDPGSFTIHCSIGGIDVDRALCDLGANINLMPLPVYKPLGIGEARPNTVTLQLADRSVVHLEGKIEDVLVKVDKF
          ++E V LT   S I+ NKLPPKLK+PGSFTI C+IG     RALCDLGA+INLMP  +Y+ LG+GEA+P ++TLQLADRS+ + +G I+D+LVKVDKF
Subjt:  WKEFETVALTSVYSAILANKLPPKLKDPGSFTIHCSIGGIDVDRALCDLGANINLMPLPVYKPLGIGEARPNTVTLQLADRSVVHLEGKIEDVLVKVDKF

Query:  IFPIDFIILDYKADREVPIILGRPFLATGQTLIHVQKR--------ELTMHMLLSN-------------AFNTTREEAKEEQVEDNCILAQTTG-----K
        IFP DF++LD + D EVPIILGRPFLATG+TLI VQK         E     L  N                    +  +E+ E +C + +T       K
Subjt:  IFPIDFIILDYKADREVPIILGRPFLATGQTLIHVQKR--------ELTMHMLLSN-------------AFNTTREEAKEEQVEDNCILAQTTG-----K

Query:  FHALDLKERIS----TLPSIAQPPVLELKPLPTHLKYRFLGEEESLSVIISSKLNQPQEHLLMQVLAKHKKAIGWSLADIKGISPSYCMPKIRLLDESNE
           ++  ER +      PSI +PP LELKPLP+HL Y +LGE ++L VIISS L+  Q   L++VL  HK  IGW++ADIKGISPS+CM KI L D+   
Subjt:  FHALDLKERIS----TLPSIAQPPVLELKPLPTHLKYRFLGEEESLSVIISSKLNQPQEHLLMQVLAKHKKAIGWSLADIKGISPSYCMPKIRLLDESNE

Query:  FIERQRRLNPAMMKVVRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGGMTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHFPLPFIDQMLDRLV
         IE QRRLNP M +VV+KEIIKWL+AG+IYPI+DS WVSPVQCVPKKGG+TVV N +NELIP+RT TGWR+CMDYRKLN  T+KDHFPLPFIDQMLDRL 
Subjt:  FIERQRRLNPAMMKVVRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGGMTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHFPLPFIDQMLDRLV

Query:  GQTYYYFLDGYSGYNQISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLTQLERVSEQCEETN
        G+ +Y FLDGYSGYNQI+IAPEDQEKT FTCPYGTFAFRRMPF LCNA  TFQRCMM IF++++E  ++VFMD+FS+YG SF +CL  L  V ++CE+TN
Subjt:  GQTYYYFLDGYSGYNQISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLTQLERVSEQCEETN

Query:  LVLNWEKCHFIVEEGIVLCH
        LVLNWEKCHF+V+EGIVL H
Subjt:  LVLNWEKCHFIVEEGIVLCH

A0A2G9HWC5 DNA-directed DNA polymerase1.0e-20048.16Show/hide
Query:  KLQTFYVGLNKNSQVLVDSSANGALLRKTYDEAHAILDQIERNNYEWGTADDKRRRPIKTSSGSFEVDPLTPVNAKIDALTTKMDALTASTTPLIAQLNA
        ++ TFY GL +  +  +D     + L  T  E H +L+ +  N+YE      +R  P K ++G  EVD +T +NAKID L   M     +      Q   
Subjt:  KLQTFYVGLNKNSQVLVDSSANGALLRKTYDEAHAILDQIERNNYEWGTADDKRRRPIKTSSGSFEVDPLTPVNAKIDALTTKMDALTASTTPLIAQLNA

Query:  VGCGICGEGHTHDQCPSNPKSIFYVGPQGAVRTTHFQAHTIKDGEITQTFHGIVAQINSHNKAQEEKVVEEEEPKKTEPHAQREPNAYKLAVPF--PQRL
        V C  CGE +  DQCP + +SI +V      +   + ++T   G      H   +  N+  +    +  +  +    +P  +++P+  +  + F      
Subjt:  VGCGICGEGHTHDQCPSNPKSIFYVGPQGAVRTTHFQAHTIKDGEITQTFHGIVAQINSHNKAQEEKVVEEEEPKKTEPHAQREPNAYKLAVPF--PQRL

Query:  RKKNDEAQFKKFLNFLR----------------------------ELHINIPLVDALEKMPSYAKFLKDILTNKRQWKEFETVALTSVYSAILANKLPPK
          K  E Q  +F N +                             +LHINIP  +ALE+MPSY KF+KDIL+ KR+  ++ETVALT  YSAI+ NKLPPK
Subjt:  RKKNDEAQFKKFLNFLR----------------------------ELHINIPLVDALEKMPSYAKFLKDILTNKRQWKEFETVALTSVYSAILANKLPPK

Query:  LKDPGSFTIHCSIGGIDVDRALCDLGANINLMPLPVYKPLGIGEARPNTVTLQLADRSVVHLEGKIEDVLVKVDKFIFPIDFIILDYKADREVPIILGRP
        LKDPGSFTI C+IG     RALCDLGA+INLMP  +Y+ LG+GEA+P ++TLQLADRS+ + +G IED+LVKVDKFIFP D ++LD + D E+ IILGRP
Subjt:  LKDPGSFTIHCSIGGIDVDRALCDLGANINLMPLPVYKPLGIGEARPNTVTLQLADRSVVHLEGKIEDVLVKVDKFIFPIDFIILDYKADREVPIILGRP

Query:  FLATGQTLIHVQKRELTMHMLLSNAFNTTREEAKEEQVEDNCILAQTTGKF--------HALDLKER----------------ISTL-------------
        FLATG+TLI VQK ELTM +          +  K     D C        F          LD  ER                + TL             
Subjt:  FLATGQTLIHVQKRELTMHMLLSNAFNTTREEAKEEQVEDNCILAQTTGKF--------HALDLKER----------------ISTL-------------

Query:  -----------PSIAQPPVLELKPLPTHLKYRFLGEEESLSVIISSKLNQPQEHLLMQVLAKHKKAIGWSLADIKGISPSYCMPKIRLLDESNEFIERQR
                   PSI +PP LELKPLP+HL Y +LGE ++L VIISS L+  Q   L++VL  H+ AIGW++ADIKGISPS+CM KI L D+    +E QR
Subjt:  -----------PSIAQPPVLELKPLPTHLKYRFLGEEESLSVIISSKLNQPQEHLLMQVLAKHKKAIGWSLADIKGISPSYCMPKIRLLDESNEFIERQR

Query:  RLNPAMMKVVRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGGMTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHFPLPFIDQMLDRLVGQTYYY
        RLNP M +VV+KEIIKWL+AG+IYPI+DS WVSPVQCVPKKGG+TVV N +NELIP+RT TGWR CMDYRKLN  T+KDHFPLPFIDQMLDRL G+ +Y 
Subjt:  RLNPAMMKVVRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGGMTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHFPLPFIDQMLDRLVGQTYYY

Query:  FLDGYSGYNQISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLTQLERVSEQCEETNLVLNWE
        FLDGYSGYNQI+IAPEDQEK  FTCPYGTFAFRRMPF LCNA  TFQRCMM IF++++E  +++FMDDFS+YG SF +CL  L  + ++CE+TNLVLNWE
Subjt:  FLDGYSGYNQISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLTQLERVSEQCEETNLVLNWE

Query:  KCHFIVEEGIVLCH
        KCHF+V+EGIVL H
Subjt:  KCHFIVEEGIVLCH

A0A2G9HWF8 Reverse transcriptase5.6e-20248.26Show/hide
Query:  KLQTFYVGLNKNSQVLVDSSANGALLRKTYDEAHAILDQIERNNYEWGTADDKRRRPIKTSSGSFEVDPLTPVNAKIDALTTKMDALTASTTPLIAQLNA
        ++ TFY GL +  +  +D     + L  T  E H +L+ +  N+YE      +R  P K ++G  EVD +T +NAKID L   M                
Subjt:  KLQTFYVGLNKNSQVLVDSSANGALLRKTYDEAHAILDQIERNNYEWGTADDKRRRPIKTSSGSFEVDPLTPVNAKIDALTTKMDALTASTTPLIAQLNA

Query:  VGCGICGEGHTHDQCPSNPKSIFYVG--------------------------------------PQGAVRTT----------HFQAHTIKDGEITQTFHG
             CGEGH  DQCP + +SI +V                                        QG    T            QA T+++G   Q    
Subjt:  VGCGICGEGHTHDQCPSNPKSIFYVG--------------------------------------PQGAVRTT----------HFQAHTIKDGEITQTFHG

Query:  IVAQINSHNKAQEEKVVEEEEPKKTE-PHAQREPNAYKLAVPFPQRLRKKNDEAQFKKFLNFLRELHINIPLVDALEKMPSYAKFLKDILTNKRQWKEFE
            +    K++E++V  EE+ K+ E P    +P    L  PFPQRL+K+  + QF KFL   ++LHINIP  +ALE+MPSY KF+KDIL+ KR+  ++E
Subjt:  IVAQINSHNKAQEEKVVEEEEPKKTE-PHAQREPNAYKLAVPFPQRLRKKNDEAQFKKFLNFLRELHINIPLVDALEKMPSYAKFLKDILTNKRQWKEFE

Query:  TVALTSVYSAILANKLPPKLKDPGSFTIHCSIGGIDVDRALCDLGANINLMPLPVYKPLGIGEARPNTVTLQLADRSVVHLEGKIEDVLVKVDKFIFPID
        TVALT   SAI+ NKLPPKLKDPG              RALCDLGA+INLMP  +Y+ LG+ EA+P ++TLQLADRS+ + +G IED+LVKVDKFIFP D
Subjt:  TVALTSVYSAILANKLPPKLKDPGSFTIHCSIGGIDVDRALCDLGANINLMPLPVYKPLGIGEARPNTVTLQLADRSVVHLEGKIEDVLVKVDKFIFPID

Query:  FIILDYKADREVPIILGRPFLATGQTLIHVQKRELTMHM----LLSNAFNTTREEAKEEQVEDNCILAQTTGK----FHALDLKER--------------
        F++LD + D EVPIILGRPFLATG+TLI VQK ELTM +    +  N F   +   + ++     +     GK       LD  ER              
Subjt:  FIILDYKADREVPIILGRPFLATGQTLIHVQKRELTMHM----LLSNAFNTTREEAKEEQVEDNCILAQTTGK----FHALDLKER--------------

Query:  --ISTL------------------------PSIAQPPVLELKPLPTHLKYRFLGEEESLSVIISSKLNQPQEHLLMQVLAKHKKAIGWSLADIKGISPSY
          + TL                        PSI +PP LELKPLP+HL Y +LGE ++L VIISS L+  Q   L++VL  HK AIGW++ADIKGISPS+
Subjt:  --ISTL------------------------PSIAQPPVLELKPLPTHLKYRFLGEEESLSVIISSKLNQPQEHLLMQVLAKHKKAIGWSLADIKGISPSY

Query:  CMPKIRLLDESNEFIERQRRLNPAMMKVVRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGGMTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHF
        CM KI L D+    +E QRRLNP M +VV+KEIIKWL+AG+IYPI+D  W+SPVQCVPKKGG+TVV N +NE IP++T TGWR+CMDYRKLN  T+KDHF
Subjt:  CMPKIRLLDESNEFIERQRRLNPAMMKVVRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGGMTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHF

Query:  PLPFIDQMLDRLVGQTYYYFLDGYSGYNQISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLT
        PLPFIDQMLDRL G+ +Y FLDGYSGYNQI+IAPEDQEKT FTCPYGTFAFRR+PF LCNA  TFQRCMM IF++++E  ++VFMDDFS+YG SF +CL 
Subjt:  PLPFIDQMLDRLVGQTYYYFLDGYSGYNQISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLT

Query:  QLERVSEQCEETNLVLNWEKCHFIVEEGIVLCH
         L  V ++CE+TNLVLNWEKCHF+V+EGIVL H
Subjt:  QLERVSEQCEETNLVLNWEKCHFIVEEGIVLCH

A0A2G9HYA0 Reverse transcriptase8.0e-20947.9Show/hide
Query:  KLQTFYVGLNKNSQVLVDSSANGALLRKTYDEAHAILDQIERNNYEWGTADDKRRRPIKTSSGSFEVDPLTPVNAKIDALTTKMDALTASTTPLIAQLNA
        ++ TFY GL +  +  +D     + L  T  E H +L+ +  N+YE      +R  P K ++G  EVD +T +NAKID L   M     +      Q   
Subjt:  KLQTFYVGLNKNSQVLVDSSANGALLRKTYDEAHAILDQIERNNYEWGTADDKRRRPIKTSSGSFEVDPLTPVNAKIDALTTKMDALTASTTPLIAQLNA

Query:  VGCGICGEGHTHDQCPSNPKSIFYV-----------------------------------------GPQGAVR-------------TTHFQAHTIKDGEI
        V C  CGEGH  DQCP + +SI +V                                         G Q  V+                F A T  + + 
Subjt:  VGCGICGEGHTHDQCPSNPKSIFYV-----------------------------------------GPQGAVR-------------TTHFQAHTIKDGEI

Query:  TQTFHGIVAQ-INSH---------------------------------------NKAQEEKVVEEEEPKKTE-PHAQREPNAYKLAVPFPQRLRKKNDEA
         +T  G +A  INS                                         K++E++V+ EE+ K+ E P    +P    L  PFPQRL+K+  E 
Subjt:  TQTFHGIVAQ-INSH---------------------------------------NKAQEEKVVEEEEPKKTE-PHAQREPNAYKLAVPFPQRLRKKNDEA

Query:  QFKKFLNFLRELHINIPLVDALEKMPSYAKFLKDILTNKRQWKEFETVALTSVYSAILANKLPPKLKDPGSFTIHCSIGGIDVDRALCDLGANINLMPLP
        QF KFL   ++LHINIP  +ALE+MPSY KF+KDIL+ KR+  ++ETVALT   SAI+ NKLPPKLKDPGSFTI C+IG     RALCDLGA+INLMP  
Subjt:  QFKKFLNFLRELHINIPLVDALEKMPSYAKFLKDILTNKRQWKEFETVALTSVYSAILANKLPPKLKDPGSFTIHCSIGGIDVDRALCDLGANINLMPLP

Query:  VYKPLGIGEARPNTVTLQLADRSVVHLEGKIEDVLVKVDKFIFPIDFIILDYKADREVPIILGRPFLATGQTLIHVQKRELTMHMLLSNAFNTTREEAKE
        +Y+ LG+GEA+P ++TLQLADRS+ + +G IED+LVKVDKFIFP DF++LD + D EVPIILGRPFLATG+TLI VQK ELTM +          +  K 
Subjt:  VYKPLGIGEARPNTVTLQLADRSVVHLEGKIEDVLVKVDKFIFPIDFIILDYKADREVPIILGRPFLATGQTLIHVQKRELTMHMLLSNAFNTTREEAKE

Query:  EQVEDNCILAQTTGKFHA----------------LDL----------------------KERISTL----------PSIAQPPVLELKPLPTHLKYRFLG
            D C       K                   LDL                        R+ +L          PSI  PP LELKPLP+HL Y +LG
Subjt:  EQVEDNCILAQTTGKFHA----------------LDL----------------------KERISTL----------PSIAQPPVLELKPLPTHLKYRFLG

Query:  EEESLSVIISSKLNQPQEHLLMQVLAKHKKAIGWSLADIKGISPSYCMPKIRLLDESNEFIERQRRLNPAMMKVVRKEIIKWLNAGVIYPIADSRWVSPV
        E ++L VIISS L+  Q   L++VL  HK AIGW++ADIKGISPS+CM KI L D+    +E QRRLNP M +VV+KEIIKWL+AG+IYPI+DS WVSPV
Subjt:  EEESLSVIISSKLNQPQEHLLMQVLAKHKKAIGWSLADIKGISPSYCMPKIRLLDESNEFIERQRRLNPAMMKVVRKEIIKWLNAGVIYPIADSRWVSPV

Query:  QCVPKKGGMTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHFPLPFIDQMLDRLVGQTYYYFLDGYSGYNQISIAPEDQEKTAFTCPYGTFAFRRM
        QCVPKKGG+TVV N +NELIP+RT TGWR+CMDYRKLN  T+KDHFPLPFIDQMLDRL G+ +Y FLDGYSGYNQI+IAPEDQEKT FTCPYGTFAFRRM
Subjt:  QCVPKKGGMTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHFPLPFIDQMLDRLVGQTYYYFLDGYSGYNQISIAPEDQEKTAFTCPYGTFAFRRM

Query:  PFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLTQLERVSEQCEETNLVLNWEKCHFIVEEGIVLCH
        PF LCNA  TFQRCMM IF++++E  ++VFMDDFS+YG SF +CL  L  V ++CE+TNL+LNWEKCHF+V+EGIVL H
Subjt:  PFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLTQLERVSEQCEETNLVLNWEKCHFIVEEGIVLCH

A0A6J1DV77 uncharacterized protein LOC1110238181.2e-20163.05Show/hide
Query:  PNAYKLAVPFPQRLRKKNDEAQFKKFLNFLRELHINIPLVDALEKMPSYAKFLKDILTNKRQWKEFETVALTSVYSAILANKLPPKLKDPGSFTIHCSIG
        P  +++  P+PQRL+KKN + QF +FL  L++LHINIPL++ALE+MP+Y KFLKDIL  KR+  EFE VALT  +SAIL  KLP K+ DPGSFTI   IG
Subjt:  PNAYKLAVPFPQRLRKKNDEAQFKKFLNFLRELHINIPLVDALEKMPSYAKFLKDILTNKRQWKEFETVALTSVYSAILANKLPPKLKDPGSFTIHCSIG

Query:  GIDVDRALCDLGANINLMPLPVYKPLGIGEARPNTVTLQLADRSVVHLEGKIEDVLVKVDKFIFPIDFIILDYKADREVPIILGRPFLATGQTLIHVQKR
        G +V  ALCDLGA+INLMPL VY+ LGIGEARP TVTLQLADRS+ +LEGKIEDVLV+VDKFIFP DFIILDY+AD+E+PIILGRPFL+TG+ LI V   
Subjt:  GIDVDRALCDLGANINLMPLPVYKPLGIGEARPNTVTLQLADRSVVHLEGKIEDVLVKVDKFIFPIDFIILDYKADREVPIILGRPFLATGQTLIHVQKR

Query:  ELTMHM----LLSNAFNTTR-----EEAKEEQVEDNCIL--AQTTGKFHALD------LKERIST--LPSIAQPPVLELKPLPTHLKYRFLGEEESLSVI
        ELT+ +    +  + FN+ +     EE    ++ D+ +    QT    + L+      +K+R+     PS+ + P LELK LP+HLKY +LGE E+L V 
Subjt:  ELTMHM----LLSNAFNTTR-----EEAKEEQVEDNCIL--AQTTGKFHALD------LKERIST--LPSIAQPPVLELKPLPTHLKYRFLGEEESLSVI

Query:  ISSKLNQPQEHLLMQVLAKHKKAIGWSLADIKGISPSYCMPKIRLLDESNEFIERQRRLNPAMMKVVRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGG
        I++ L + +E  L+++L  HKKAIGW+LADIKGISPSYCM KI L +     IE QRRLNPAM +VV+KEIIKWL+AG+IYPIAD   +SPVQCVPKKGG
Subjt:  ISSKLNQPQEHLLMQVLAKHKKAIGWSLADIKGISPSYCMPKIRLLDESNEFIERQRRLNPAMMKVVRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGG

Query:  MTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHFPLPFIDQMLDRLVGQTYYYFLDGYSGYNQISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNAL
        +TVV N NNELIP+RT TGW ICMDYRKLN  TKKDHFPLPFIDQMLD LVGQ YYY LDGY+GYNQI+I P+DQ+KT FTCPYGTF+FRRMPF LCNA 
Subjt:  MTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHFPLPFIDQMLDRLVGQTYYYFLDGYSGYNQISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNAL

Query:  ETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLTQLERVSEQCEETNLVLNWEKCHFIVEEGIVLCH
         TFQRCMM IF +L+E  V+VFMDDFS++ K F++ L+ LE+V  +CE+TNLVLNWEKCHF+V EGIVL H
Subjt:  ETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLTQLERVSEQCEETNLVLNWEKCHFIVEEGIVLCH

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.63.0e-2737.07Show/hide
Query:  VRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGGMTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHFPLPFIDQMLDRLVGQTYYYFLDGYSGYN
        V  +I   LN G+I   ++S + SP+  VPKK       +A+ +         +RI +DYRKLN IT  D  P+P +D++L +L    Y+  +D   G++
Subjt:  VRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGGMTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHFPLPFIDQMLDRLVGQTYYYFLDGYSGYN

Query:  QISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLTQLERVSEQCEETNLVLNWEKCHFIVEEG
        QI + PE   KTAF+  +G + + RMPF L NA  TFQRCM  I   LL K   V++DD  ++  S  + L  L  V E+  + NL L  +KC F+ +E 
Subjt:  QISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLTQLERVSEQCEETNLVLNWEKCHFIVEEG

Query:  IVLCH
          L H
Subjt:  IVLCH

P10394 Retrovirus-related Pol polyprotein from transposon 4122.4e-2431.3Show/hide
Query:  KIRLLDESNEFIERQRRLNPAMMKVVRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGGMTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHFPLP
        ++RL D+   + +  R  + + ++ ++ ++ K +   ++ P + S++ SP+  VPKK              P+     WR+ +DYR++N     D FPLP
Subjt:  KIRLLDESNEFIERQRRLNPAMMKVVRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGGMTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHFPLP

Query:  FIDQMLDRLVGQTYYYFLDGYSGYNQISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLTQLE
         ID +LD+L    Y+  LD  SG++QI +    ++ T+F+   G++ F R+PF L  A  +FQR M   FS +      ++MDD  + G S    L  L 
Subjt:  FIDQMLDRLVGQTYYYFLDGYSGYNQISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLTQLE

Query:  RVSEQCEETNLVLNWEKCHFIVEEGIVLCH
         V  +C E NL L+ EKC F + E   L H
Subjt:  RVSEQCEETNLVLNWEKCHFIVEEGIVLCH

P20825 Retrovirus-related Pol polyprotein from transposon 2971.9e-2635.61Show/hide
Query:  VRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGGMTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHFPLPFIDQMLDRLVGQTYYYFLDGYSGYN
        V  ++ + LN G+I   ++S + SP   VPKK      + AN           +R+ +DYRKLN IT  D +P+P +D++L +L    Y+  +D   G++
Subjt:  VRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGGMTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHFPLPFIDQMLDRLVGQTYYYFLDGYSGYN

Query:  QISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLTQLERVSEQCEETNLVLNWEKCHFIVEEG
        QI +  E   KTAF+   G + + RMPF L NA  TFQRCM  I   LL K   V++DD  I+  S  + L  ++ V  +  + NL L  +KC F+ +E 
Subjt:  QISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLTQLERVSEQCEETNLVLNWEKCHFIVEEG

Query:  IVLCH
          L H
Subjt:  IVLCH

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.6e-2537.19Show/hide
Query:  VRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGGMTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHFPLPFIDQMLDRLVGQTYYYFLDGYSGYN
        + K + K L+   I P + S   SPV  VPKK G                   +R+C+DYR LN  T  D FPLP ID +L R+     +  LD +SGY+
Subjt:  VRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGGMTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHFPLPFIDQMLDRLVGQTYYYFLDGYSGYN

Query:  QISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLTQLERVSEQCEETNLVLNWEKCHFIVEE
        QI + P+D+ KTAF  P G + +  MPF L NA  TF R M   F +L  + V V++DD  I+ +S  +    L+ V E+ +  NL++  +KC F  EE
Subjt:  QISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLTQLERVSEQCEETNLVLNWEKCHFIVEE

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.6e-2537.19Show/hide
Query:  VRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGGMTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHFPLPFIDQMLDRLVGQTYYYFLDGYSGYN
        + K + K L+   I P + S   SPV  VPKK G                   +R+C+DYR LN  T  D FPLP ID +L R+     +  LD +SGY+
Subjt:  VRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGGMTVVANANNELIPSRTTTGWRICMDYRKLNAITKKDHFPLPFIDQMLDRLVGQTYYYFLDGYSGYN

Query:  QISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLTQLERVSEQCEETNLVLNWEKCHFIVEE
        QI + P+D+ KTAF  P G + +  MPF L NA  TF R M   F +L  + V V++DD  I+ +S  +    L+ V E+ +  NL++  +KC F  EE
Subjt:  QISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYGKSFADCLTQLERVSEQCEETNLVLNWEKCHFIVEE

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGCAGTGAGAAGAATGCAACTAAGAATTTTCCAGCGAAGAAACTTTAAGGAGGCTGTTGCGTTTTCGTTCGTAGGAGCGTCGTTGGCGAAGAACGGTCAAGTCTA
CAACGAAGTTTCTGCTCCCAGTTTTGCTGCAGCACAAAATTTGCTCCCTCCAGTCGCCCAGCAGCATAATCTACACAAAATAATTTCCAATGCTTCCATTCTTCCAAATG
GGTTAAAACATGTAACTAGAGGTTGCATGCGTGCTGCTCGACAAATCGATTTTGAAGTTGATCCTGAAATTGACAGAACATATCATAGAAGAAGAAGAAATCGAAGAGCT
CGAAGACAATCAGGAGAAATGGCAGCACCCAACCAACTAAGCAGACTAGTAAATCCCATCCAAATGACCGATGATAGTGCCAGAGGAATTAGAGATTATGCAGCCCCTGC
AAACTGTAATTTTAATCCAGGGATAGTACAACCCACTCTAAAACAGAAAGGAGTAACGCGTGAGCAATTACAAGTCATCTTATTTCCTTATTCGTTAAGAGATGCGGCGA
AATTACAAACCTTCTACGTGGGCCTTAACAAGAACTCGCAAGTGTTAGTGGATTCTTCTGCAAATGGTGCGCTACTCAGGAAAACATATGATGAAGCTCACGCAATCCTC
GATCAGATTGAGCGCAATAATTATGAGTGGGGCACTGCGGACGACAAAAGAAGGAGACCCATTAAAACCAGCTCGGGAAGTTTCGAGGTTGACCCATTAACTCCAGTCAA
TGCAAAGATTGATGCGTTGACAACCAAAATGGATGCGCTAACAGCTAGCACAACACCTTTGATCGCGCAACTCAATGCAGTTGGGTGTGGGATTTGTGGAGAAGGACACA
CGCATGACCAGTGTCCTTCGAACCCAAAATCAATTTTCTATGTTGGACCGCAAGGAGCTGTACGCACAACCCATTTTCAAGCACATACAATCAAGGATGGAGAAATCACC
CAAACTTTTCATGGAATAGTGGCGCAAATAAACAGCCACAATAAGGCGCAAGAAGAGAAAGTAGTTGAAGAAGAAGAACCCAAAAAGACGGAGCCTCACGCACAAAGGGA
GCCTAACGCATACAAGCTTGCAGTACCCTTTCCTCAAAGATTAAGGAAGAAAAATGATGAAGCCCAATTCAAGAAATTTTTAAATTTCTTGCGAGAGTTGCACATCAACA
TTCCACTTGTGGATGCATTAGAGAAGATGCCTAGCTACGCAAAGTTCTTGAAAGACATTCTGACGAATAAAAGACAATGGAAAGAATTTGAGACTGTTGCGTTGACAAGT
GTATATAGCGCAATACTTGCCAACAAACTCCCACCTAAATTGAAAGATCCTGGGAGCTTCACAATTCATTGTTCAATAGGAGGAATCGATGTTGATAGAGCATTATGCGA
CCTTGGCGCAAACATAAATTTAATGCCATTACCAGTTTACAAGCCCCTAGGAATCGGAGAAGCAAGACCGAACACTGTCACGCTTCAATTGGCAGACAGATCTGTGGTCC
ACCTCGAAGGAAAAATTGAAGATGTTTTGGTAAAAGTTGACAAATTTATTTTCCCTATAGATTTTATCATACTTGATTACAAAGCTGACAGAGAAGTTCCCATTATTTTG
GGAAGACCGTTCCTTGCAACCGGCCAAACATTGATTCACGTCCAGAAAAGGGAGCTTACCATGCACATGCTACTGTCAAATGCGTTCAACACTACGAGGGAGGAGGCAAA
GGAAGAACAAGTTGAGGATAACTGCATCCTGGCGCAAACCACAGGAAAATTCCATGCGCTAGATCTCAAAGAAAGAATCTCCACCTTACCTTCCATAGCGCAACCCCCTG
TGCTAGAGCTCAAGCCACTCCCAACGCATTTGAAGTATAGGTTTTTAGGAGAAGAAGAATCCCTCTCAGTTATCATTTCATCAAAGCTTAATCAACCACAAGAACACCTT
TTAATGCAGGTCCTCGCAAAGCACAAGAAGGCCATTGGATGGAGTCTCGCAGATATAAAAGGAATTAGCCCCTCGTATTGCATGCCTAAAATTCGCTTGCTAGACGAATC
CAATGAATTTATTGAGCGGCAAAGAAGATTAAATCCTGCGATGATGAAAGTTGTGCGAAAAGAAATTATCAAGTGGCTCAACGCAGGCGTGATATATCCCATCGCAGATA
GCAGATGGGTGAGCCCGGTGCAATGCGTACCTAAGAAAGGAGGAATGACAGTGGTAGCCAACGCAAACAATGAATTGATTCCATCACGCACCACTACTGGGTGGCGCATA
TGCATGGATTACCGCAAGCTTAACGCAATCACGAAGAAGGATCATTTCCCTCTTCCATTCATTGATCAAATGTTGGATCGACTTGTGGGACAGACATATTATTATTTTCT
TGACGGGTACTCAGGGTACAACCAAATATCTATTGCACCGGAAGACCAAGAAAAGACAGCCTTCACCTGCCCTTATGGAACCTTTGCGTTTAGAAGGATGCCTTTTTGGT
TGTGCAATGCGCTAGAGACATTCCAGAGATGTATGATGACCATCTTTTCAGAATTGCTAGAAAAGTCAGTGAAAGTTTTTATGGACGACTTCTCAATATATGGAAAGTCT
TTCGCAGACTGTCTAACACAACTGGAACGAGTATCAGAGCAATGCGAGGAGACTAATCTAGTGCTAAATTGGGAGAAATGTCATTTTATAGTAGAAGAAGGCATCGTCCT
ATGCCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTGCAGTGAGAAGAATGCAACTAAGAATTTTCCAGCGAAGAAACTTTAAGGAGGCTGTTGCGTTTTCGTTCGTAGGAGCGTCGTTGGCGAAGAACGGTCAAGTCTA
CAACGAAGTTTCTGCTCCCAGTTTTGCTGCAGCACAAAATTTGCTCCCTCCAGTCGCCCAGCAGCATAATCTACACAAAATAATTTCCAATGCTTCCATTCTTCCAAATG
GGTTAAAACATGTAACTAGAGGTTGCATGCGTGCTGCTCGACAAATCGATTTTGAAGTTGATCCTGAAATTGACAGAACATATCATAGAAGAAGAAGAAATCGAAGAGCT
CGAAGACAATCAGGAGAAATGGCAGCACCCAACCAACTAAGCAGACTAGTAAATCCCATCCAAATGACCGATGATAGTGCCAGAGGAATTAGAGATTATGCAGCCCCTGC
AAACTGTAATTTTAATCCAGGGATAGTACAACCCACTCTAAAACAGAAAGGAGTAACGCGTGAGCAATTACAAGTCATCTTATTTCCTTATTCGTTAAGAGATGCGGCGA
AATTACAAACCTTCTACGTGGGCCTTAACAAGAACTCGCAAGTGTTAGTGGATTCTTCTGCAAATGGTGCGCTACTCAGGAAAACATATGATGAAGCTCACGCAATCCTC
GATCAGATTGAGCGCAATAATTATGAGTGGGGCACTGCGGACGACAAAAGAAGGAGACCCATTAAAACCAGCTCGGGAAGTTTCGAGGTTGACCCATTAACTCCAGTCAA
TGCAAAGATTGATGCGTTGACAACCAAAATGGATGCGCTAACAGCTAGCACAACACCTTTGATCGCGCAACTCAATGCAGTTGGGTGTGGGATTTGTGGAGAAGGACACA
CGCATGACCAGTGTCCTTCGAACCCAAAATCAATTTTCTATGTTGGACCGCAAGGAGCTGTACGCACAACCCATTTTCAAGCACATACAATCAAGGATGGAGAAATCACC
CAAACTTTTCATGGAATAGTGGCGCAAATAAACAGCCACAATAAGGCGCAAGAAGAGAAAGTAGTTGAAGAAGAAGAACCCAAAAAGACGGAGCCTCACGCACAAAGGGA
GCCTAACGCATACAAGCTTGCAGTACCCTTTCCTCAAAGATTAAGGAAGAAAAATGATGAAGCCCAATTCAAGAAATTTTTAAATTTCTTGCGAGAGTTGCACATCAACA
TTCCACTTGTGGATGCATTAGAGAAGATGCCTAGCTACGCAAAGTTCTTGAAAGACATTCTGACGAATAAAAGACAATGGAAAGAATTTGAGACTGTTGCGTTGACAAGT
GTATATAGCGCAATACTTGCCAACAAACTCCCACCTAAATTGAAAGATCCTGGGAGCTTCACAATTCATTGTTCAATAGGAGGAATCGATGTTGATAGAGCATTATGCGA
CCTTGGCGCAAACATAAATTTAATGCCATTACCAGTTTACAAGCCCCTAGGAATCGGAGAAGCAAGACCGAACACTGTCACGCTTCAATTGGCAGACAGATCTGTGGTCC
ACCTCGAAGGAAAAATTGAAGATGTTTTGGTAAAAGTTGACAAATTTATTTTCCCTATAGATTTTATCATACTTGATTACAAAGCTGACAGAGAAGTTCCCATTATTTTG
GGAAGACCGTTCCTTGCAACCGGCCAAACATTGATTCACGTCCAGAAAAGGGAGCTTACCATGCACATGCTACTGTCAAATGCGTTCAACACTACGAGGGAGGAGGCAAA
GGAAGAACAAGTTGAGGATAACTGCATCCTGGCGCAAACCACAGGAAAATTCCATGCGCTAGATCTCAAAGAAAGAATCTCCACCTTACCTTCCATAGCGCAACCCCCTG
TGCTAGAGCTCAAGCCACTCCCAACGCATTTGAAGTATAGGTTTTTAGGAGAAGAAGAATCCCTCTCAGTTATCATTTCATCAAAGCTTAATCAACCACAAGAACACCTT
TTAATGCAGGTCCTCGCAAAGCACAAGAAGGCCATTGGATGGAGTCTCGCAGATATAAAAGGAATTAGCCCCTCGTATTGCATGCCTAAAATTCGCTTGCTAGACGAATC
CAATGAATTTATTGAGCGGCAAAGAAGATTAAATCCTGCGATGATGAAAGTTGTGCGAAAAGAAATTATCAAGTGGCTCAACGCAGGCGTGATATATCCCATCGCAGATA
GCAGATGGGTGAGCCCGGTGCAATGCGTACCTAAGAAAGGAGGAATGACAGTGGTAGCCAACGCAAACAATGAATTGATTCCATCACGCACCACTACTGGGTGGCGCATA
TGCATGGATTACCGCAAGCTTAACGCAATCACGAAGAAGGATCATTTCCCTCTTCCATTCATTGATCAAATGTTGGATCGACTTGTGGGACAGACATATTATTATTTTCT
TGACGGGTACTCAGGGTACAACCAAATATCTATTGCACCGGAAGACCAAGAAAAGACAGCCTTCACCTGCCCTTATGGAACCTTTGCGTTTAGAAGGATGCCTTTTTGGT
TGTGCAATGCGCTAGAGACATTCCAGAGATGTATGATGACCATCTTTTCAGAATTGCTAGAAAAGTCAGTGAAAGTTTTTATGGACGACTTCTCAATATATGGAAAGTCT
TTCGCAGACTGTCTAACACAACTGGAACGAGTATCAGAGCAATGCGAGGAGACTAATCTAGTGCTAAATTGGGAGAAATGTCATTTTATAGTAGAAGAAGGCATCGTCCT
ATGCCATTGA
Protein sequenceShow/hide protein sequence
MSAVRRMQLRIFQRRNFKEAVAFSFVGASLAKNGQVYNEVSAPSFAAAQNLLPPVAQQHNLHKIISNASILPNGLKHVTRGCMRAARQIDFEVDPEIDRTYHRRRRNRRA
RRQSGEMAAPNQLSRLVNPIQMTDDSARGIRDYAAPANCNFNPGIVQPTLKQKGVTREQLQVILFPYSLRDAAKLQTFYVGLNKNSQVLVDSSANGALLRKTYDEAHAIL
DQIERNNYEWGTADDKRRRPIKTSSGSFEVDPLTPVNAKIDALTTKMDALTASTTPLIAQLNAVGCGICGEGHTHDQCPSNPKSIFYVGPQGAVRTTHFQAHTIKDGEIT
QTFHGIVAQINSHNKAQEEKVVEEEEPKKTEPHAQREPNAYKLAVPFPQRLRKKNDEAQFKKFLNFLRELHINIPLVDALEKMPSYAKFLKDILTNKRQWKEFETVALTS
VYSAILANKLPPKLKDPGSFTIHCSIGGIDVDRALCDLGANINLMPLPVYKPLGIGEARPNTVTLQLADRSVVHLEGKIEDVLVKVDKFIFPIDFIILDYKADREVPIIL
GRPFLATGQTLIHVQKRELTMHMLLSNAFNTTREEAKEEQVEDNCILAQTTGKFHALDLKERISTLPSIAQPPVLELKPLPTHLKYRFLGEEESLSVIISSKLNQPQEHL
LMQVLAKHKKAIGWSLADIKGISPSYCMPKIRLLDESNEFIERQRRLNPAMMKVVRKEIIKWLNAGVIYPIADSRWVSPVQCVPKKGGMTVVANANNELIPSRTTTGWRI
CMDYRKLNAITKKDHFPLPFIDQMLDRLVGQTYYYFLDGYSGYNQISIAPEDQEKTAFTCPYGTFAFRRMPFWLCNALETFQRCMMTIFSELLEKSVKVFMDDFSIYGKS
FADCLTQLERVSEQCEETNLVLNWEKCHFIVEEGIVLCH