; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022499 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022499
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr7:30978375..30987299
RNA-Seq ExpressionLag0022499
SyntenyLag0022499
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041588 - Integrase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7567979.1 Integrase catalytic core [Arabidopsis thaliana x Arabidopsis arenosa]5.0e-10032.54Show/hide
Query:  YDGYISDMVERTLEVFMDDFSVFGETGRHSFYTKTLYYYLTR-TLAVSKLHIKIQDKKGTENQVADHLS--RLGEEAQVKKGLDIQELVADEQ-------
        ++ + S +V   + V+ D  ++     RH +  K     L R  L + +  ++I DKKG EN VADHLS  R+ +E  +   +  ++L+A  Q       
Subjt:  YDGYISDMVERTLEVFMDDFSVFGETGRHSFYTKTLYYYLTR-TLAVSKLHIKIQDKKGTENQVADHLS--RLGEEAQVKKGLDIQELVADEQ-------

Query:  ---ILEVRAV--ELPWFADYVNYLVSGLKPPEETAQKPKKFLKDVKDYNWDELYLYKLGPDQILRWCVTEEDVPLILEACHSAPYGGHFFGLKIASKVLQ
           +++V A+  +LPW+ADYVNYLVSG +PP  ++ + KKF KD+  + WDE YLY L  D+I R C++E++   IL  CH + YGGHF   K  SK+LQ
Subjt:  ---ILEVRAV--ELPWFADYVNYLVSGLKPPEETAQKPKKFLKDVKDYNWDELYLYKLGPDQILRWCVTEEDVPLILEACHSAPYGGHFFGLKIASKVLQ

Query:  SRYFRPSLFKYAHLFTKNCDQCQRTGNLTAKSEMLLNFILEVEIFGVWGIDFMCPFPPSFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTR
        + ++ P++FK    F   CD CQR GN++ ++EM  N ILEVEIF VWGIDFM PFP S+GN YIL+AVDYVS W+EAIA+ T D++VV+K+F   +F R
Subjt:  SRYFRPSLFKYAHLFTKNCDQCQRTGNLTAKSEMLLNFILEVEIFGVWGIDFMCPFPPSFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTR

Query:  CGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVELFNIEIKAILEKTVDPSKR--------------------IG-------------
         G  R +I+D G+HF+NK+  NL  K+ ++H+VAT YHP+T+GQVE+ N EIKAILEKTV  +++                    IG             
Subjt:  CGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVELFNIEIKAILEKTVDPSKR--------------------IG-------------

Query:  ----------------HYDL-------------------------------------------------------TRLELFLGKLKSRWSGPFEIVRIFD
                        ++D+                                                       +RL LF GKLKSRWSGPF +  +  
Subjt:  ----------------HYDL-------------------------------------------------------TRLELFLGKLKSRWSGPFEIVRIFD

Query:  YGTVKLMDNQG---IISRTLNQVL------DTKKINPKQACMEQEERDPRVFRRRLWTKKPLLKLPLPQQCRPSREPIAEKMLEGKCKEEGFQEGCTRKA
        YG + L    G   +  + L + +      +   I   ++C EQ +           +     +  + +QC+ S   + +                +R  
Subjt:  YGTVKLMDNQG---IISRTLNQVL------DTKKINPKQACMEQEERDPRVFRRRLWTKKPLLKLPLPQQCRPSREPIAEKMLEGKCKEEGFQEGCTRKA

Query:  SDHRTPQDVFEEMIRQGSPIGEILALI-STR--DLLWELKARPRSGNQARRRGDRDGDVRCWRIRAFLSLSLNSWIARPQIEAENFEMKPVMFQMMQIVG
        ++    Q    E     S      +   STR   L+ ++K     G     R   DGD       A  +      I  P ++  NFE+K  +  M+Q   
Subjt:  SDHRTPQDVFEEMIRQGSPIGEILALI-STR--DLLWELKARPRSGNQARRRGDRDGDVRCWRIRAFLSLSLNSWIARPQIEAENFEMKPVMFQMMQIVG

Query:  QFHGFSFEDHHLHLKSFLGVSDSFVIQGVSRDVLRLTLFPYSLRDGAKAWLNSFAPRSIRTWNKLAEKFLSKYFPPTRNAKLRSEIVGFRQLEDETFSEA
        +FHG   ED   HL +F  +     I GVS D  +L LFP+SL D A  W  +    SI +W+   + FL+K+F  +R A+LR++I  F Q + E+  EA
Subjt:  QFHGFSFEDHHLHLKSFLGVSDSFVIQGVSRDVLRLTLFPYSLRDGAKAWLNSFAPRSIRTWNKLAEKFLSKYFPPTRNAKLRSEIVGFRQLEDETFSEA

Query:  WE
        WE
Subjt:  WE

KAG7568035.1 Integrase catalytic core [Arabidopsis thaliana x Arabidopsis arenosa]4.4e-9631.42Show/hide
Query:  YDGYISDMVERTLEVFMDDFSVFGETGRHSFYTKTLYYYLTR-TLAVSKLHIKIQDKKGTENQVADHLS--RLGEEAQVKKGLDIQELVADEQ-------
        ++ + S +V   + V+ D  ++     RH +  K     L R  L + +  ++I DKKG EN VADHLS  R+ +E  +   +  ++L+A  Q       
Subjt:  YDGYISDMVERTLEVFMDDFSVFGETGRHSFYTKTLYYYLTR-TLAVSKLHIKIQDKKGTENQVADHLS--RLGEEAQVKKGLDIQELVADEQ-------

Query:  ---ILEVRAV--ELPWFADYVNYLVSGLKPPEETAQKPKKFLKDVKDYNWDELYLYKLGPDQILRWCVTEEDVPLILEACHSAPYGGHFFGLKIASKVLQ
           +++V A+  +LPW+ADYVNYLVSG +PP  ++ + KKF KD+  + WDE YLY L  D+I R CV+E++   IL   H + YGGHF   K  SK+LQ
Subjt:  ---ILEVRAV--ELPWFADYVNYLVSGLKPPEETAQKPKKFLKDVKDYNWDELYLYKLGPDQILRWCVTEEDVPLILEACHSAPYGGHFFGLKIASKVLQ

Query:  SRYFRPSLFKYAHLFTKNCDQCQRTGNLTAKSEMLLNFILEVEIFGVWGIDFMCPFPPSFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTR
        + ++ P++F+ A  F   CD CQR GN++ ++EM  N ILEVEIF VWGIDFM PFP S+GN YIL+AVDYVSKW+EAIA+ T D++VV+K+F   +F R
Subjt:  SRYFRPSLFKYAHLFTKNCDQCQRTGNLTAKSEMLLNFILEVEIFGVWGIDFMCPFPPSFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTR

Query:  CGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVELFNIEIKAILEKTVDPSKR--------------------IG-------------
         G  R +ISD G+HF+NK+  NL  K+ ++H+VAT YHP+T+GQVE+ N EIKAILEKTV  +++                    IG             
Subjt:  CGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVELFNIEIKAILEKTVDPSKR--------------------IG-------------

Query:  ----------------HYDL-------------------------------------------------------TRLELFLGKLKSRWSGPFEIVRIFD
                        ++D+                                                       +RL LF GKLKSRWSGPF +  +  
Subjt:  ----------------HYDL-------------------------------------------------------TRLELFLGKLKSRWSGPFEIVRIFD

Query:  YGTVKLMDNQG---IISRTLNQVLDTKKI-------NPKQACMEQEERDP---------------RVFRRRLWTKKPLLKLPLP---QQC----------
        YG + L    G   +  + L + +  + I        P+ A  E E+ D                R     + T      LP P   Q C          
Subjt:  YGTVKLMDNQG---IISRTLNQVLDTKKI-------NPKQACMEQEERDP---------------RVFRRRLWTKKPLLKLPLP---QQC----------

Query:  --------------RPSREPIAEKMLEGKCKEEGFQEGCTRKASDHRTPQDVFEEMI-RQGSPIGEILALISTRDLLWELKARPRSGNQARRRG------
                        +R P+AE+           Q+  TR  S   +P  +F  +   + SP  ++  L+  R  + E      S N   RRG      
Subjt:  --------------RPSREPIAEKMLEGKCKEEGFQEGCTRKASDHRTPQDVFEEMI-RQGSPIGEILALISTRDLLWELKARPRSGNQARRRG------

Query:  -----------DRDGDVRCW---RIRAFLSLSLNSWIAR--------------------------------------------PQIEAENFEMKPVMFQM
                    +   V C+   R R   +L  N  I R                                            P ++  NFE+K  +  M
Subjt:  -----------DRDGDVRCW---RIRAFLSLSLNSWIAR--------------------------------------------PQIEAENFEMKPVMFQM

Query:  MQIVGQFHGFSFEDHHLHLKSFLGVSDSFVIQGVSRDVLRLTLFPYSLRDGAKAWLNSFAPRSIRTWNKLAEKFLSKYFPPTRNAKLRSEIVGFRQLEDE
        +Q   +FHG   ED   HL  F  +     I GVS D  +L LFP+SL D A  W  +    SI TW+   + FL+K+F  +R A+LR+EI GF Q   E
Subjt:  MQIVGQFHGFSFEDHHLHLKSFLGVSDSFVIQGVSRDVLRLTLFPYSLRDGAKAWLNSFAPRSIRTWNKLAEKFLSKYFPPTRNAKLRSEIVGFRQLEDE

Query:  TFSEAWE
        TF EAW+
Subjt:  TFSEAWE

KAG7578951.1 Ribonuclease H-like superfamily [Arabidopsis thaliana x Arabidopsis arenosa]2.6e-9631.58Show/hide
Query:  YDGYISDMVERTLEVFMDDFSVFGETGRHSFYTKTLYYYLTR-TLAVSKLHIKIQDKKGTENQVADHLSRLGEEAQVKKGLDIQELVADEQILEVRAVEL
        ++ + S +V   + V+ D  ++     RH +  K     L R  L + +  ++I DKKG EN  ADHLSR+    ++++ + I + + +EQ+L +++ E+
Subjt:  YDGYISDMVERTLEVFMDDFSVFGETGRHSFYTKTLYYYLTR-TLAVSKLHIKIQDKKGTENQVADHLSRLGEEAQVKKGLDIQELVADEQILEVRAVEL

Query:  ------------------PWFADYVNYLVSGLKPPEETAQKPKKFLKDVKDYNWDELYLYKLGPDQILRWCVTEEDVPLILEACHSAPYGGHFFGLKIAS
                          PW+AD VNYL+ G  P    A + KKF +D+  Y WDE YLYK G D + R C+ EE+V  +LE CH + YGGHF   K   
Subjt:  ------------------PWFADYVNYLVSGLKPPEETAQKPKKFLKDVKDYNWDELYLYKLGPDQILRWCVTEEDVPLILEACHSAPYGGHFFGLKIAS

Query:  KVLQSRYFRPSLFKYAHLFTKNCDQCQRTGNLTAKSEMLLNFILEVEIFGVWGIDFMCPF-PPSFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMK
        KVLQ+  + PS+FK A+ F   CD CQR GN+T ++EM  N ILEVE+F VWGIDFM PF P S+GN YIL+AVDYVSKW+EAIA+ T D KVV+K+F  
Subjt:  KVLQSRYFRPSLFKYAHLFTKNCDQCQRTGNLTAKSEMLLNFILEVEIFGVWGIDFMCPF-PPSFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMK

Query:  NMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVELFNIEIKAILEKTVDPSKR--------------------IGH-------
         +F R G  + +ISD GSHF+NK+  +L  K+ ++H+VAT YHP+T+GQVE+ N +IKAIL + V  SKR                    IG        
Subjt:  NMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVELFNIEIKAILEKTVDPSKR--------------------IGH-------

Query:  ---------------------------------YDL--------------------------------------------TRLELFLGKLKSRWSGPFEI
                                          DL                                            +RL+LF GKLKSRWSGPF I
Subjt:  ---------------------------------YDL--------------------------------------------TRLELFLGKLKSRWSGPFEI

Query:  VRIFDYGTVKLMDNQGIISRTLNQVLDTKKINPKQACMEQ----------------------------------------------EERDPRVFRRRLWT
          +  +GTV L    G   R   Q +  K I   ++C EQ                                              E+    V  +   +
Subjt:  VRIFDYGTVKLMDNQGIISRTLNQVLDTKKINPKQACMEQ----------------------------------------------EERDPRVFRRRLWT

Query:  KKPLLKLPLP------QQCRPSREPIAEKMLEG-KCKEEGFQEGCTRKASD----------HRTPQDVFEEMIR-----QGSPIGEILALISTRDLLWEL
         +   + P P      + CR   E     +LE  +       E CTR   +            + + V+    R     +   + E L   +   L  EL
Subjt:  KKPLLKLPLP------QQCRPSREPIAEKMLEG-KCKEEGFQEGCTRKASD----------HRTPQDVFEEMIR-----QGSPIGEILALISTRDLLWEL

Query:  K-ARPRSGNQAR-------RRGDRDGDVRCWRIRAF---LSLSLNSWIARPQIEAENFEMKPVMFQMMQIVGQFHGFSFEDHHLHLKSFLGVSDSFVIQG
        K  R R G +         +  D  G      I A     +      I  P ++  NFE+K  +  M+Q   +FHG   ED   HL +F  +     I G
Subjt:  K-ARPRSGNQAR-------RRGDRDGDVRCWRIRAF---LSLSLNSWIARPQIEAENFEMKPVMFQMMQIVGQFHGFSFEDHHLHLKSFLGVSDSFVIQG

Query:  VSRDVLRLTLFPYSLRDGAKAWLNSFAPRSIRTWNKLAEKFLSKYFPPTRNAKLRSEIVGFRQLEDETFSEAWE
        VS D  +L LFP+SL D A  W  +    SI +W+   + FL+K+F  +R A+LR+EI  F Q + E+  EAWE
Subjt:  VSRDVLRLTLFPYSLRDGAKAWLNSFAPRSIRTWNKLAEKFLSKYFPPTRNAKLRSEIVGFRQLEDETFSEAWE

PIN21854.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]3.7e-9547.38Show/hide
Query:  LAVSKLHIKIQDKKGTENQVADHLSRLGEEAQVKKGLDIQELVADEQILEVRAVELPWFADYVNYLVSGLKPPEETAQKPKKFLKDVKDYNWDELYLYKL
        L + +  ++I+D+KG ENQ+ADHLSRL   A+  +   I +   DEQ+L + A ++PW+AD VNYL  G+ P + +AQ+ KKFL D + Y WD+ +L+K 
Subjt:  LAVSKLHIKIQDKKGTENQVADHLSRLGEEAQVKKGLDIQELVADEQILEVRAVELPWFADYVNYLVSGLKPPEETAQKPKKFLKDVKDYNWDELYLYKL

Query:  GPDQILRWCVTEEDVPLILEACHSAPYGGHFFGLKIASKVLQSRYFRPSLFKYAHLFTKNCDQCQRTGNLTAKSEMLLNFILEVEIFGVWGIDFMCPFPP
        GPD ILR CV E ++  I E CH++PYGGHF   + A+K+LQS +F P+LFK  H F  NCD+CQRTGN++ + EM L  ILEVE+F VWGIDFM PF P
Subjt:  GPDQILRWCVTEEDVPLILEACHSAPYGGHFFGLKIASKVLQSRYFRPSLFKYAHLFTKNCDQCQRTGNLTAKSEMLLNFILEVEIFGVWGIDFMCPFPP

Query:  SFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVELFNIEIKAILEK
        SFGN+YIL+AVDY+SKW+EA+A    D+KVV+    KN+FTR GT R IISD G+HF N+    L SKY ++H+++T YHP+T+GQVE+ N EIK  LEK
Subjt:  SFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVELFNIEIKAILEK

Query:  TV-----DPSKRI----------------------------------------GHYDL---TRLELFLGKLKSRWSGPFEIVRIFDYGTVKLMDNQGIIS
        TV     D SKR+                                        G Y L   +RL+LF  KLKSRWS PF I  +  +G V+L +NQ   +
Subjt:  TV-----DPSKRI----------------------------------------GHYDL---TRLELFLGKLKSRWSGPFEIVRIFDYGTVKLMDNQGIIS

Query:  R
        R
Subjt:  R

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]2.2e-9557.33Show/hide
Query:  LAVSKLHIKIQDKKGTENQVADHLSRLGEEAQVKKGLDIQELVADEQILEVRAVELPWFADYVNYLVSGLKPPEETAQKPKKFLKDVKDYNWDELYLYKL
        L + +  ++++DKKG+EN VADHLSRL E+ +V+  L IQE   DEQ+     ++LPW+AD VN+L   + PP+ T  + KKFL DVK Y WDE  L+K 
Subjt:  LAVSKLHIKIQDKKGTENQVADHLSRLGEEAQVKKGLDIQELVADEQILEVRAVELPWFADYVNYLVSGLKPPEETAQKPKKFLKDVKDYNWDELYLYKL

Query:  GPDQILRWCVTEEDVPLILEACHSAPYGGHFFGLKIASKVLQSRYFRPSLFKYAHLFTKNCDQCQRTGNLTAKSEMLLNFILEVEIFGVWGIDFMCPFPP
         PDQI+R CV EE++  IL  CHS+ YGGHF   + A+KVLQS +F PS+F+ ++   K CD+CQR GN++ + E+ L  ILEVE+F VWGIDFM PFPP
Subjt:  GPDQILRWCVTEEDVPLILEACHSAPYGGHFFGLKIASKVLQSRYFRPSLFKYAHLFTKNCDQCQRTGNLTAKSEMLLNFILEVEIFGVWGIDFMCPFPP

Query:  SFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVELFNIEIKAILEK
        SFG +YILLAVDYVSKW+EAIATTT DAKVV+K   KN+FTR GT R IISDEG+HF NK+  NL SKY ++H++A AYHP+TNGQ E+ N EIK ILEK
Subjt:  SFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVELFNIEIKAILEK

Query:  TVDPSKR
        TV+ +++
Subjt:  TVDPSKR

TrEMBL top hitse value%identityAlignment
A0A2G9FWY3 Reverse transcriptase3.4e-9455.37Show/hide
Query:  LAVSKLHIKIQDKKGTENQVADHLSRLGEEAQVKKGLDIQELVADEQILEVRAVELPWFADYVNYLVSGLKPPEETAQKPKKFLKDVKDYNWDELYLYKL
        L + +  ++I+D+KGTENQ+ADHLSRL   A+  +   I +   DEQ+L + A ++PW+AD VNYL  G+ P + +AQ+ KKFL D + Y WD+ +L+K 
Subjt:  LAVSKLHIKIQDKKGTENQVADHLSRLGEEAQVKKGLDIQELVADEQILEVRAVELPWFADYVNYLVSGLKPPEETAQKPKKFLKDVKDYNWDELYLYKL

Query:  GPDQILRWCVTEEDVPLILEACHSAPYGGHFFGLKIASKVLQSRYFRPSLFKYAHLFTKNCDQCQRTGNLTAKSEMLLNFILEVEIFGVWGIDFMCPFPP
        GPD ILR CV E ++  ILE CH++PYGGHF G + A+K+LQS +F P+LFK AH F  NCD+CQRTGN++ + EM LN ILEVE+F VWGIDFM PF P
Subjt:  GPDQILRWCVTEEDVPLILEACHSAPYGGHFFGLKIASKVLQSRYFRPSLFKYAHLFTKNCDQCQRTGNLTAKSEMLLNFILEVEIFGVWGIDFMCPFPP

Query:  SFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVELFNIEIKAILEK
        SFGN+YIL+AVDYVSKW+EA A    D+KVV+    KN+FTR GT R IISD G+HF N+    L SKY ++H+++T YHP+T+GQVE+ N EIK ILEK
Subjt:  SFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVELFNIEIKAILEK

Query:  TVDPSKR
        TV  +++
Subjt:  TVDPSKR

A0A2G9HBV9 DNA-directed DNA polymerase6.4e-9354.43Show/hide
Query:  VSKLHIKIQDKKGTENQVADHLSRLGEEAQVKKGLDIQELVADEQILEVRAVELPWFADYVNYLVSGLKPPEETAQKPKKFLKDVKDYNWDELYLYKLGP
        + +  ++I+D+KGTENQ+ADHLSRL   A++ +   I +  +DEQ+L + A ++PW+AD VNYL  G+ P + +AQ+ KK L D + Y WD+L+L+K GP
Subjt:  VSKLHIKIQDKKGTENQVADHLSRLGEEAQVKKGLDIQELVADEQILEVRAVELPWFADYVNYLVSGLKPPEETAQKPKKFLKDVKDYNWDELYLYKLGP

Query:  DQILRWCVTEEDVPLILEACHSAPYGGHFFGLKIASKVLQSRYFRPSLFKYAHLFTKNCDQCQRTGNLTAKSEMLLNFILEVEIFGVWGIDFMCPFPPSF
        D ILR CV E ++  ILE CH++PYGGHF G + A+K+LQS +F P+LFK A+ F  NCD+CQRTGN++ + EM LN ILEVE+F VWGIDFM  F PSF
Subjt:  DQILRWCVTEEDVPLILEACHSAPYGGHFFGLKIASKVLQSRYFRPSLFKYAHLFTKNCDQCQRTGNLTAKSEMLLNFILEVEIFGVWGIDFMCPFPPSF

Query:  GNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVELFNIEIKAILEKTV
        GN+YIL+AVDYVSKW+EA+A    D+KVV+    KN+FTR GT R IIS+ G+HF N+    L SKY ++H+++T YHP+T+GQVE+ N EIK ILEKTV
Subjt:  GNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVELFNIEIKAILEKTV

Query:  DPSKR
          +++
Subjt:  DPSKR

A0A2G9HWF8 Reverse transcriptase1.8e-9547.38Show/hide
Query:  LAVSKLHIKIQDKKGTENQVADHLSRLGEEAQVKKGLDIQELVADEQILEVRAVELPWFADYVNYLVSGLKPPEETAQKPKKFLKDVKDYNWDELYLYKL
        L + +  ++I+D+KG ENQ+ADHLSRL   A+  +   I +   DEQ+L + A ++PW+AD VNYL  G+ P + +AQ+ KKFL D + Y WD+ +L+K 
Subjt:  LAVSKLHIKIQDKKGTENQVADHLSRLGEEAQVKKGLDIQELVADEQILEVRAVELPWFADYVNYLVSGLKPPEETAQKPKKFLKDVKDYNWDELYLYKL

Query:  GPDQILRWCVTEEDVPLILEACHSAPYGGHFFGLKIASKVLQSRYFRPSLFKYAHLFTKNCDQCQRTGNLTAKSEMLLNFILEVEIFGVWGIDFMCPFPP
        GPD ILR CV E ++  I E CH++PYGGHF   + A+K+LQS +F P+LFK  H F  NCD+CQRTGN++ + EM L  ILEVE+F VWGIDFM PF P
Subjt:  GPDQILRWCVTEEDVPLILEACHSAPYGGHFFGLKIASKVLQSRYFRPSLFKYAHLFTKNCDQCQRTGNLTAKSEMLLNFILEVEIFGVWGIDFMCPFPP

Query:  SFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVELFNIEIKAILEK
        SFGN+YIL+AVDY+SKW+EA+A    D+KVV+    KN+FTR GT R IISD G+HF N+    L SKY ++H+++T YHP+T+GQVE+ N EIK  LEK
Subjt:  SFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVELFNIEIKAILEK

Query:  TV-----DPSKRI----------------------------------------GHYDL---TRLELFLGKLKSRWSGPFEIVRIFDYGTVKLMDNQGIIS
        TV     D SKR+                                        G Y L   +RL+LF  KLKSRWS PF I  +  +G V+L +NQ   +
Subjt:  TV-----DPSKRI----------------------------------------GHYDL---TRLELFLGKLKSRWSGPFEIVRIFDYGTVKLMDNQGIIS

Query:  R
        R
Subjt:  R

A0A2K3NJZ5 Integrase catalytic domain-containing protein (Fragment)3.3e-8954.4Show/hide
Query:  LAVSKLHIKIQDKKGTENQVADHLSRLGEEAQVKKGLDIQELVADEQILEVRAVELPWFADYVNYLVSGLKPPEETAQKPKKFLKDVKDYNWDELYLYKL
        L + +  ++I+DKKG+EN VADHLSRL    + ++   IQ+L  DE IL V     PWFAD+ NY+V    P + T+Q+ KKFL D K Y WDE +LYK 
Subjt:  LAVSKLHIKIQDKKGTENQVADHLSRLGEEAQVKKGLDIQELVADEQILEVRAVELPWFADYVNYLVSGLKPPEETAQKPKKFLKDVKDYNWDELYLYKL

Query:  GPDQILRWCVTEEDVPLILEACHSAPYGGHFFGLKIASKVLQSRYFRPSLFKYAHLFTKNCDQCQRTGNLTAKSEMLLNFILEVEIFGVWGIDFMCPFPP
        G D +LR CV E++   +L  CH + YGGHF G + A+KVLQS  F P+LFK A  + K CD+CQRTGN++ ++EM  N +LEVEIF VWGIDFM PFP 
Subjt:  GPDQILRWCVTEEDVPLILEACHSAPYGGHFFGLKIASKVLQSRYFRPSLFKYAHLFTKNCDQCQRTGNLTAKSEMLLNFILEVEIFGVWGIDFMCPFPP

Query:  SFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVELFNIEIKAILEK
        S+   YIL+AVDYVSKW+EAIAT T DA+VV+    KN+F+R G  R +ISDEG+HFLN+ +  L  KYN+ HR+AT YHP+T+GQVE+ N +IK ILEK
Subjt:  SFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVELFNIEIKAILEK

Query:  TVDPSKR
        TV+ S++
Subjt:  TVDPSKR

A0A803R2M6 Uncharacterized protein8.3e-9356.96Show/hide
Query:  LAVSKLHIKIQDKKGTENQVADHLSRLG-EEAQVKKGLDIQELVADEQILEVR-AVELPWFADYVNYLVSGLKPPEETAQKPKKFLKDVKDYNWDELYLY
        L + +  + I+DKKGTEN VADHLSRL  EE+Q  K + I E   DEQ+  VR ++ +PW+ADYVN+L + + PPE + Q+ KKF  +VK Y W+E  LY
Subjt:  LAVSKLHIKIQDKKGTENQVADHLSRLG-EEAQVKKGLDIQELVADEQILEVR-AVELPWFADYVNYLVSGLKPPEETAQKPKKFLKDVKDYNWDELYLY

Query:  KLGPDQILRWCVTEEDVPLILEACHSAPYGGHFFGLKIASKVLQSRYFRPSLFKYAHLFTKNCDQCQRTGNLTAKSEMLLNFILEVEIFGVWGIDFMCPF
        K   DQI+R CV EE++  IL  CH+ P GGHF G + A+KVLQS +F P+LFK A  F K CD+CQRTGN++ ++EM L  ILEVE+F VWGIDFM PF
Subjt:  KLGPDQILRWCVTEEDVPLILEACHSAPYGGHFFGLKIASKVLQSRYFRPSLFKYAHLFTKNCDQCQRTGNLTAKSEMLLNFILEVEIFGVWGIDFMCPF

Query:  PPSFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVELFNIEIKAIL
        P SF N+YILLAVDYVSKW+EA AT   D K V++   KN+FTR GT R IISDEGSHF NK    L S+Y +RHR A  YHP++NGQ E+ N EIK IL
Subjt:  PPSFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVELFNIEIKAIL

Query:  EKTVDPSKR
        EKTV  S++
Subjt:  EKTVDPSKR

SwissProt top hitse value%identityAlignment
P08361 Gag-Pol polyprotein7.7e-1136.94Show/hide
Query:  WGIDFMCPFPPSFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVEL
        W IDF    P  +G  Y+L+ VD  S WIEA  T    AKVV K  ++ +F R G  + + +D G  F++K+   +     I  ++  AY P+++GQVE 
Subjt:  WGIDFMCPFPPSFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVEL

Query:  FNIEIKAILEK
         N  IK  L K
Subjt:  FNIEIKAILEK

P10394 Retrovirus-related Pol polyprotein from transposon 4121.3e-1026.26Show/hide
Query:  EEDVPLILEACHSAPYGGHFFGLKIASKVLQSRYFRPSLFKYAHLFTKNCDQCQRTGNLT-AKSEMLLNFILEVEIFGVWGIDFMCPFPPS-FGNIYILL
        E++   IL   H  P  G   G+      ++  Y+  ++ KY   + + C +CQ+       K+ M +    E   F    +D + P P S  GN Y + 
Subjt:  EEDVPLILEACHSAPYGGHFFGLKIASKVLQSRYFRPSLFKYAHLFTKNCDQCQRTGNLT-AKSEMLLNFILEVEIFGVWGIDFMCPFPPS-FGNIYILL

Query:  AVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVE
         +  ++K++ AI      AK V K   ++   + G  +  I+D G+ + N II +L     I++  +TA+H +T G VE
Subjt:  AVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVE

P26808 Gag-Pol polyprotein1.3e-1035.14Show/hide
Query:  WGIDFMCPFPPSFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVEL
        W IDF    P  +G  Y+L+ VD  S W+EA  T    AKVV K  ++ +F R G  + + +D G  F++K+   +     +  ++  AY P+++GQVE 
Subjt:  WGIDFMCPFPPSFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVEL

Query:  FNIEIKAILEK
         N  IK  L K
Subjt:  FNIEIKAILEK

P26810 Gag-Pol polyprotein1.3e-1035.14Show/hide
Query:  WGIDFMCPFPPSFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVEL
        W IDF    P  +G  Y+L+ VD  S W+EA  T    AKVV K  ++ +F R G  + + +D G  F++K+   +     +  ++  AY P+++GQVE 
Subjt:  WGIDFMCPFPPSFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAYHPKTNGQVEL

Query:  FNIEIKAILEK
         N  IK  L K
Subjt:  FNIEIKAILEK

P92516 Uncharacterized mitochondrial protein AtMg007505.9e-1157.14Show/hide
Query:  VLQSRYFRPSLFKYAHLFTKNCDQCQRTGNLTAKSEMLLNFILEVEIFGVWGIDFM
        VLQ+ ++ P+ FK AH F  +CD CQR GN T ++EM  +FILEVE+F VWGI FM
Subjt:  VLQSRYFRPSLFKYAHLFTKNCDQCQRTGNLTAKSEMLLNFILEVEIFGVWGIDFM

Arabidopsis top hitse value%identityAlignment
ATMG00750.1 GAG/POL/ENV polyprotein4.2e-1257.14Show/hide
Query:  VLQSRYFRPSLFKYAHLFTKNCDQCQRTGNLTAKSEMLLNFILEVEIFGVWGIDFM
        VLQ+ ++ P+ FK AH F  +CD CQR GN T ++EM  +FILEVE+F VWGI FM
Subjt:  VLQSRYFRPSLFKYAHLFTKNCDQCQRTGNLTAKSEMLLNFILEVEIFGVWGIDFM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACCAGCCACCTTTAAAGGTGTATGATGGCTATATTTCCGATATGGTAGAACGAACATTGGAAGTGTTTATGGACGACTTCTCTGTCTTTGGTGAAACAGGACGACA
CTCGTTTTACACGAAAACACTATATTATTACTTGACCCGTACACTTGCGGTAAGCAAATTGCACATCAAAATCCAAGACAAAAAGGGAACCGAGAATCAAGTAGCAGACC
ACCTATCAAGACTTGGAGAGGAAGCACAAGTGAAGAAAGGGCTGGATATTCAAGAATTGGTTGCAGATGAGCAAATTCTGGAAGTAAGAGCAGTTGAACTTCCATGGTTT
GCAGATTATGTGAACTACTTAGTCAGTGGACTAAAACCCCCGGAAGAAACTGCTCAAAAGCCTAAGAAGTTTTTGAAAGATGTCAAGGACTACAACTGGGATGAGTTATA
CTTGTACAAGCTAGGCCCAGATCAAATACTTAGATGGTGCGTAACAGAAGAAGATGTACCACTCATATTGGAAGCCTGTCACTCAGCTCCATATGGAGGACACTTTTTCG
GACTAAAGATTGCATCCAAAGTCCTCCAATCTAGGTACTTTAGGCCAAGTCTCTTCAAATATGCACACCTCTTCACAAAGAACTGCGATCAATGTCAAAGAACCGGTAAT
CTGACAGCCAAGAGCGAAATGCTTCTAAATTTCATACTCGAAGTAGAAATCTTTGGTGTATGGGGGATAGACTTTATGTGCCCATTCCCACCGTCCTTTGGAAACATTTA
CATTTTGCTTGCAGTTGACTACGTATCAAAATGGATCGAGGCCATAGCTACAACAACAATTGATGCAAAGGTGGTAATGAAAATTTTCATGAAGAATATGTTCACAAGAT
GCGGAACCTCTCGTTTCATCATAAGCGATGAAGGATCTCACTTTTTAAACAAGATAATAGCCAACCTATTTTCCAAATACAATATCAGGCATAGAGTAGCCACTGCTTAC
CATCCTAAAACGAATGGACAAGTAGAGCTCTTTAACATTGAGATCAAAGCTATTTTGGAAAAGACGGTAGACCCTTCTAAAAGGATTGGTCACTACGACTTAACGAGACT
TGAGTTATTTTTGGGAAAACTGAAATCAAGATGGTCGGGACCATTCGAGATCGTCAGAATCTTTGATTATGGAACCGTAAAGCTGATGGATAATCAAGGCATCATTTCAA
GAACGCTGAACCAAGTGCTCGACACCAAGAAGATCAACCCGAAGCAAGCATGCATGGAACAAGAGGAACGAGACCCACGGGTTTTTCGCCGGCGATTGTGGACCAAAAAA
CCACTGCTCAAACTCCCTCTTCCTCAACAATGTCGACCATCTCGAGAGCCAATAGCTGAGAAAATGCTTGAGGGAAAGTGCAAGGAAGAAGGATTCCAAGAAGGTTGCAC
CCGAAAAGCCTCTGATCACCGAACCCCTCAGGATGTGTTCGAAGAGATGATCCGGCAGGGGAGTCCAATTGGTGAGATTTTAGCCCTAATTTCGACTAGAGACCTTTTGT
GGGAGCTAAAGGCAAGGCCAAGGAGCGGGAATCAAGCGAGAAGACGTGGAGATCGTGACGGGGACGTGCGGTGTTGGCGGATTCGAGCTTTCCTTTCTCTTTCCCTTAAC
TCTTGGATTGCACGTCCCCAAATTGAGGCAGAAAATTTTGAAATGAAACCGGTAATGTTTCAGATGATGCAAATCGTGGGTCAGTTCCATGGTTTTTCATTTGAAGACCA
TCATTTACATCTTAAGTCTTTCCTAGGAGTTAGTGATTCTTTTGTAATTCAAGGAGTGTCTAGAGATGTCCTTAGATTAACTTTGTTTCCGTATTCTCTTAGAGATGGAG
CAAAGGCATGGTTAAATTCTTTTGCTCCACGATCAATTAGGACATGGAATAAGTTAGCGGAAAAATTCCTTAGTAAATATTTTCCACCAACTAGGAATGCTAAATTGAGG
AGCGAAATAGTAGGGTTTAGGCAACTTGAAGATGAAACTTTTAGTGAGGCTTGGGAGAGTTGTCAGTGGTCAGATGTTAGAGCCTCAAATAAAAAGGCGAAGAGTGTGTT
AGAGGTTGATGGTGTGTCAACCATTAGGGATGATATTGCAATGTTAGCTAACGCTCTTCAAAATGTGACAGTGGTTAGTCATCAGCAGCCGCCAGCTCTGGAGCCTGCTG
CAGTGGTGAACCAAGTTGCAGAGGAAGCATGTATCTATTGTGGTCAAGAGCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTAATCGTACG
AATAACCCTTATTCTAACTTTTATAATCCAGGTTGGCGCAACCACCCCAACTTCGCATGGGGAGGACAAGGAAGTAATGTGCAAGCACAGCAAAAGGGGAACCAATCAGG
ATTTGCTAAAGCATAG
mRNA sequenceShow/hide mRNA sequence
ATGCACCAGCCACCTTTAAAGGTGTATGATGGCTATATTTCCGATATGGTAGAACGAACATTGGAAGTGTTTATGGACGACTTCTCTGTCTTTGGTGAAACAGGACGACA
CTCGTTTTACACGAAAACACTATATTATTACTTGACCCGTACACTTGCGGTAAGCAAATTGCACATCAAAATCCAAGACAAAAAGGGAACCGAGAATCAAGTAGCAGACC
ACCTATCAAGACTTGGAGAGGAAGCACAAGTGAAGAAAGGGCTGGATATTCAAGAATTGGTTGCAGATGAGCAAATTCTGGAAGTAAGAGCAGTTGAACTTCCATGGTTT
GCAGATTATGTGAACTACTTAGTCAGTGGACTAAAACCCCCGGAAGAAACTGCTCAAAAGCCTAAGAAGTTTTTGAAAGATGTCAAGGACTACAACTGGGATGAGTTATA
CTTGTACAAGCTAGGCCCAGATCAAATACTTAGATGGTGCGTAACAGAAGAAGATGTACCACTCATATTGGAAGCCTGTCACTCAGCTCCATATGGAGGACACTTTTTCG
GACTAAAGATTGCATCCAAAGTCCTCCAATCTAGGTACTTTAGGCCAAGTCTCTTCAAATATGCACACCTCTTCACAAAGAACTGCGATCAATGTCAAAGAACCGGTAAT
CTGACAGCCAAGAGCGAAATGCTTCTAAATTTCATACTCGAAGTAGAAATCTTTGGTGTATGGGGGATAGACTTTATGTGCCCATTCCCACCGTCCTTTGGAAACATTTA
CATTTTGCTTGCAGTTGACTACGTATCAAAATGGATCGAGGCCATAGCTACAACAACAATTGATGCAAAGGTGGTAATGAAAATTTTCATGAAGAATATGTTCACAAGAT
GCGGAACCTCTCGTTTCATCATAAGCGATGAAGGATCTCACTTTTTAAACAAGATAATAGCCAACCTATTTTCCAAATACAATATCAGGCATAGAGTAGCCACTGCTTAC
CATCCTAAAACGAATGGACAAGTAGAGCTCTTTAACATTGAGATCAAAGCTATTTTGGAAAAGACGGTAGACCCTTCTAAAAGGATTGGTCACTACGACTTAACGAGACT
TGAGTTATTTTTGGGAAAACTGAAATCAAGATGGTCGGGACCATTCGAGATCGTCAGAATCTTTGATTATGGAACCGTAAAGCTGATGGATAATCAAGGCATCATTTCAA
GAACGCTGAACCAAGTGCTCGACACCAAGAAGATCAACCCGAAGCAAGCATGCATGGAACAAGAGGAACGAGACCCACGGGTTTTTCGCCGGCGATTGTGGACCAAAAAA
CCACTGCTCAAACTCCCTCTTCCTCAACAATGTCGACCATCTCGAGAGCCAATAGCTGAGAAAATGCTTGAGGGAAAGTGCAAGGAAGAAGGATTCCAAGAAGGTTGCAC
CCGAAAAGCCTCTGATCACCGAACCCCTCAGGATGTGTTCGAAGAGATGATCCGGCAGGGGAGTCCAATTGGTGAGATTTTAGCCCTAATTTCGACTAGAGACCTTTTGT
GGGAGCTAAAGGCAAGGCCAAGGAGCGGGAATCAAGCGAGAAGACGTGGAGATCGTGACGGGGACGTGCGGTGTTGGCGGATTCGAGCTTTCCTTTCTCTTTCCCTTAAC
TCTTGGATTGCACGTCCCCAAATTGAGGCAGAAAATTTTGAAATGAAACCGGTAATGTTTCAGATGATGCAAATCGTGGGTCAGTTCCATGGTTTTTCATTTGAAGACCA
TCATTTACATCTTAAGTCTTTCCTAGGAGTTAGTGATTCTTTTGTAATTCAAGGAGTGTCTAGAGATGTCCTTAGATTAACTTTGTTTCCGTATTCTCTTAGAGATGGAG
CAAAGGCATGGTTAAATTCTTTTGCTCCACGATCAATTAGGACATGGAATAAGTTAGCGGAAAAATTCCTTAGTAAATATTTTCCACCAACTAGGAATGCTAAATTGAGG
AGCGAAATAGTAGGGTTTAGGCAACTTGAAGATGAAACTTTTAGTGAGGCTTGGGAGAGTTGTCAGTGGTCAGATGTTAGAGCCTCAAATAAAAAGGCGAAGAGTGTGTT
AGAGGTTGATGGTGTGTCAACCATTAGGGATGATATTGCAATGTTAGCTAACGCTCTTCAAAATGTGACAGTGGTTAGTCATCAGCAGCCGCCAGCTCTGGAGCCTGCTG
CAGTGGTGAACCAAGTTGCAGAGGAAGCATGTATCTATTGTGGTCAAGAGCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTAATCGTACG
AATAACCCTTATTCTAACTTTTATAATCCAGGTTGGCGCAACCACCCCAACTTCGCATGGGGAGGACAAGGAAGTAATGTGCAAGCACAGCAAAAGGGGAACCAATCAGG
ATTTGCTAAAGCATAG
Protein sequenceShow/hide protein sequence
MHQPPLKVYDGYISDMVERTLEVFMDDFSVFGETGRHSFYTKTLYYYLTRTLAVSKLHIKIQDKKGTENQVADHLSRLGEEAQVKKGLDIQELVADEQILEVRAVELPWF
ADYVNYLVSGLKPPEETAQKPKKFLKDVKDYNWDELYLYKLGPDQILRWCVTEEDVPLILEACHSAPYGGHFFGLKIASKVLQSRYFRPSLFKYAHLFTKNCDQCQRTGN
LTAKSEMLLNFILEVEIFGVWGIDFMCPFPPSFGNIYILLAVDYVSKWIEAIATTTIDAKVVMKIFMKNMFTRCGTSRFIISDEGSHFLNKIIANLFSKYNIRHRVATAY
HPKTNGQVELFNIEIKAILEKTVDPSKRIGHYDLTRLELFLGKLKSRWSGPFEIVRIFDYGTVKLMDNQGIISRTLNQVLDTKKINPKQACMEQEERDPRVFRRRLWTKK
PLLKLPLPQQCRPSREPIAEKMLEGKCKEEGFQEGCTRKASDHRTPQDVFEEMIRQGSPIGEILALISTRDLLWELKARPRSGNQARRRGDRDGDVRCWRIRAFLSLSLN
SWIARPQIEAENFEMKPVMFQMMQIVGQFHGFSFEDHHLHLKSFLGVSDSFVIQGVSRDVLRLTLFPYSLRDGAKAWLNSFAPRSIRTWNKLAEKFLSKYFPPTRNAKLR
SEIVGFRQLEDETFSEAWESCQWSDVRASNKKAKSVLEVDGVSTIRDDIAMLANALQNVTVVSHQQPPALEPAAVVNQVAEEACIYCGQEHNYEFCPSNPASVFFVGNRT
NNPYSNFYNPGWRNHPNFAWGGQGSNVQAQQKGNQSGFAKA