CuGenDBv2

Lag0024846 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0024846
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr10:6329158..6336982
RNA-Seq Expression: Lag0024846
Synteny: Lag0024846
Gene Ontology terms:
    GO:0006259 - DNA metabolic process (biological process)
    GO:0016020 - membrane (cellular component)
    GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domains:
    IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
    IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e-value, % identity, alignment)
CAN73924.1 hypothetical protein VITISV_041509 [Vitis vinifera] | e-value: 7.8e-154 | identity: 57.08%
Query:  GSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPPGYIDPSCPHHVCKLNKAIYGLKQAPR
        GSI R+KARLVA+GF Q+PG+D+F+TFSPVVK  TIR+IL++AV+  WS+RQLD  NAFLNG L+EEV+M+QP G+++P+ P +VCKL+KA+YGLKQAPR
Subjt:  GSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPPGYIDPSCPHHVCKLNKAIYGLKQAPR

Query:  AWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFFLGIQVNYLSSGILLTQEKYINDLLH
        AW   L+ ALL +GFQ+SR+DTSL+ +    D++ LLVYVDD+++TG+N +L+SH I+ L  +FAL+DLG LS+FLGIQ   L S + L Q KYI DLL+
Subjt:  AWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFFLGIQVNYLSSGILLTQEKYINDLLH

Query:  KLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNLDISA
        + ++E  KPAP+P  +G+ LS +DG  L+DP  YR T+GALQY+T TRPDIA+ VN   QF+ +P+D+HW AVKR+LRY+ GT HLGL  Q  +++++  
Subjt:  KLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNLDISA

Query:  YSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSKPVLWCDNLSAGALATNPVFHARTKHI
        YSDADWAS  DDR+S + YCVFLG NL+SWSS KQ  +++SS ESEYR L   TAE++W++ LL E+ C  +S P+LWCDN SA  LA NPVFH+R+KHI
Subjt:  YSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSKPVLWCDNLSAGALATNPVFHARTKHI

Query:  EIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAELPARLRGN
        E+D+HF+R++VL+  L++ YVP+ DQLAD  TK +  +QF   RSKL +   P  LRG+
Subjt:  EIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAELPARLRGN

KYP35708.1 Copia protein [Cajanus cajan] | e-value: 2.7e-154 | identity: 55.83%
Query:  WSLVSLDVLIKFSDTTTNLPNNTTLQGSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPP
        W+LV         D+           G+I+R KARLVAKGF Q+ G+D+ ETFSPVVK+STIR+IL+IAV   W ++QLD NNAFLNG L + VYM+QP 
Subjt:  WSLVSLDVLIKFSDTTTNLPNNTTLQGSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPP

Query:  GYIDPSCPHHVCKLNKAIYGLKQAPRAWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSF
        G+ DP+ P+HVCKL+KAIYGLKQAPRAW ++LK+ALL WGFQN++SDTSL+  +    + FLL+YVDD+++TGNN   +   +  L+  F+LKDLG+L +
Subjt:  GYIDPSCPHHVCKLNKAIYGLKQAPRAWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSF

Query:  FLGIQVNYLSSGILLTQEKYINDLLHKLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVK
        FLGI+V+  ++G+ L Q KYI DLL K ++E+  P P+P V GK  ++ +G+ L +P  +R  IGALQYLT TRPDIA+ VN LSQF+  PT  HW  +K
Subjt:  FLGIQVNYLSSGILLTQEKYINDLLHKLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVK

Query:  RVLRYISGTKHLGLLIQSGSNLDISAYSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSK
        R+LRY+ GT H  L I+  + LDI+ +SDADWA+++DDRKS+A  CVFLG+ LVSWSS+KQ  ++RSSTESEYRALA   AE+ W + LLTE+      K
Subjt:  RVLRYISGTKHLGLLIQSGSNLDISAYSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSK

Query:  PVLWCDNLSAGALATNPVFHARTKHIEIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAELPA
        P+LWCDNLSA ALA+NPV HARTKHIE DVH++RDQVL+  + V YVP+ADQ+ADCLTK ++H++F   R KLG+   P+
Subjt:  PVLWCDNLSAGALATNPVFHARTKHIEIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAELPA

KYP75364.1 Copia protein [Cajanus cajan] | e-value: 7.8e-154 | identity: 53.85%
Query:  TSPSVFFWSLVSLDV-----LIKFSDTTTNLPNNTTLQGSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNG
        T P+  F+ L+   +        +   + +  NN+T   + +R KARLVAKGF Q+ GVDF ETF+PV+KAST+R+IL+IAV   W +RQ+D NNAFLNG
Subjt:  TSPSVFFWSLVSLDV-----LIKFSDTTTNLPNNTTLQGSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNG

Query:  TLDEEVYMSQPPGYIDPSCPHHVCKLNKAIYGLKQAPRAWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDG
         L E V+M QP G+ DP+ PHHVCK  KAIYGLKQAPRAW + LK ALL WGFQN+RSD+SL+F +    + FLL+YVDD+++TG+N   +   I  L+ 
Subjt:  TLDEEVYMSQPPGYIDPSCPHHVCKLNKAIYGLKQAPRAWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDG

Query:  RFALKDLGRLSFFLGIQVNYLSSGILLTQEKYINDLLHKLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFL
         FALKDLG+L +FLGI+V+  S G+ L Q KYI D+L K +++     P+P V G+  +  +G+P+ +P ++R  IGALQYLT +RPDIAY VN L Q++
Subjt:  RFALKDLGRLSFFLGIQVNYLSSGILLTQEKYINDLLHKLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFL

Query:  KQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNLDISAYSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQ
          PTD HW  +KR+LRY+ GT +  L I+S ++LDI+ +SDA WA +IDDRKS+A  CVFLGE+L++WSS+KQ  ++RSSTESEYRAL    AEV W+K 
Subjt:  KQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNLDISAYSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQ

Query:  LLTEIGCSSSSKPVLWCDNLSAGALATNPVFHARTKHIEIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAELPARLRGNVKE
        LL EI      KP+LWCDNLSA ALA+NPV HAR+KHIE+D+HF+RDQVL+  + + YVP++DQ+ADCLTK ++HS+F   R KLG+ + P RLRG +K+
Subjt:  LLTEIGCSSSSKPVLWCDNLSAGALATNPVFHARTKHIEIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAELPARLRGNVKE

Query:  INQPCKI
         N+ C I
Subjt:  INQPCKI

PNX77541.1 retrofit protein [Trifolium pratense] | e-value: 2.2e-156 | identity: 56.49%
Query:  VLIKFSDTTTNLPNNTTLQ------GSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPPG
        +L+ + D T  + +    +      GSI+R KARLVAKGF Q+ G+D+ ETFSPVVKAST+R+ILSIAV   W +RQLD NNAFLNG L E VYM QP G
Subjt:  VLIKFSDTTTNLPNNTTLQ------GSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPPG

Query:  YIDPSCPHHVCKLNKAIYGLKQAPRAWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFF
        ++DP+ P+H+CKL+KAIYGLKQAPRAW ++LK+ALL+WGFQN++SD+SL+  K    + FLL+YVDD+++TGNN+  +   I  L+  F+LKDLG+L +F
Subjt:  YIDPSCPHHVCKLNKAIYGLKQAPRAWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFF

Query:  LGIQVNYLSSGILLTQEKYINDLLHKLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKR
        LGI+V    SG+ L Q KYI DLL K ++++    P+P + G+  ++ +G+ L DP V+R  IG LQYLT TRPDIA+ VN LSQ++  PT  HW  +KR
Subjt:  LGIQVNYLSSGILLTQEKYINDLLHKLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKR

Query:  VLRYISGTKHLGLLIQSGSNLDISAYSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSKP
        +LRY+ GT +  L I+  ++LDI+ +SDADWA++IDDRKS++  CVFLGE L+SWSS+KQ  ++RSSTESEYRALA   AE+ W++ LLTE+      KP
Subjt:  VLRYISGTKHLGLLIQSGSNLDISAYSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSKP

Query:  VLWCDNLSAGALATNPVFHARTKHIEIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAELP
        +LWCDNLSA ALA+NPV HAR+KHIEIDVH++RDQVL+  + V YVPTADQ+ADCLTKP+SH++F   R KLG+   P
Subjt:  VLWCDNLSAGALATNPVFHARTKHIEIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAELP

RVW82925.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | e-value: 1.3e-153 | identity: 56.05%
Query:  GSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPPGYIDPSCPHHVCKLNKAIYGLKQAPR
        GSI R+KARLVA+GF Q+PG+D+F+TFSPVVK  TIR+IL++AV+  WS+RQLD  NAFLNG L+EEV+M+QP G+++P+ P +VCKL+KA+YGLKQAPR
Subjt:  GSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPPGYIDPSCPHHVCKLNKAIYGLKQAPR

Query:  AWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFFLGIQVNYLSSGILLTQEKYINDLLH
        AW   L+ ALL +GFQ+SR+DTSL+ +    D++ LLVYVDD+++TG+N  L+SH I+ L  +FAL+DLG LS+FLGIQ   L S + L Q KYI DLL+
Subjt:  AWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFFLGIQVNYLSSGILLTQEKYINDLLH

Query:  KLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNLDISA
        + ++E  KPAP+P  +G+ LS +DG  L+DP  YR T+GALQY+T TRPDIA+ VN   QF+ +P+D+HW AVKR+LRY+ GT HLGL  Q  +++++  
Subjt:  KLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNLDISA

Query:  YSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSKPVLWCDNLSAGALATNPVFHARTKHI
        YSDADWAS  DDR+S + YCVFLG NL+SWSS KQ  +++SS ESEYR L   TAE++W++ LL E+ C  +S P+LWCDN SA  LA NPVFH+R+KHI
Subjt:  YSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSKPVLWCDNLSAGALATNPVFHARTKHI

Query:  EIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAELPARLRGNVKEINQPCKISE
        E+D+HF+R++VL+  L++ YVP+ DQLAD  TK +  +QF   RSKL +   P  LRG + +   P  + E
Subjt:  EIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAELPARLRGNVKEINQPCKISE

TrEMBL top hits (e-value, % identity, alignment)
A0A151QZC2 Copia protein | e-value: 1.3e-154 | identity: 55.83%
Query:  WSLVSLDVLIKFSDTTTNLPNNTTLQGSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPP
        W+LV         D+           G+I+R KARLVAKGF Q+ G+D+ ETFSPVVK+STIR+IL+IAV   W ++QLD NNAFLNG L + VYM+QP 
Subjt:  WSLVSLDVLIKFSDTTTNLPNNTTLQGSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPP

Query:  GYIDPSCPHHVCKLNKAIYGLKQAPRAWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSF
        G+ DP+ P+HVCKL+KAIYGLKQAPRAW ++LK+ALL WGFQN++SDTSL+  +    + FLL+YVDD+++TGNN   +   +  L+  F+LKDLG+L +
Subjt:  GYIDPSCPHHVCKLNKAIYGLKQAPRAWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSF

Query:  FLGIQVNYLSSGILLTQEKYINDLLHKLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVK
        FLGI+V+  ++G+ L Q KYI DLL K ++E+  P P+P V GK  ++ +G+ L +P  +R  IGALQYLT TRPDIA+ VN LSQF+  PT  HW  +K
Subjt:  FLGIQVNYLSSGILLTQEKYINDLLHKLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVK

Query:  RVLRYISGTKHLGLLIQSGSNLDISAYSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSK
        R+LRY+ GT H  L I+  + LDI+ +SDADWA+++DDRKS+A  CVFLG+ LVSWSS+KQ  ++RSSTESEYRALA   AE+ W + LLTE+      K
Subjt:  RVLRYISGTKHLGLLIQSGSNLDISAYSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSK

Query:  PVLWCDNLSAGALATNPVFHARTKHIEIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAELPA
        P+LWCDNLSA ALA+NPV HARTKHIE DVH++RDQVL+  + V YVP+ADQ+ADCLTK ++H++F   R KLG+   P+
Subjt:  PVLWCDNLSAGALATNPVFHARTKHIEIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAELPA

A0A151U7U2 Copia protein | e-value: 3.8e-154 | identity: 53.85%
Query:  TSPSVFFWSLVSLDV-----LIKFSDTTTNLPNNTTLQGSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNG
        T P+  F+ L+   +        +   + +  NN+T   + +R KARLVAKGF Q+ GVDF ETF+PV+KAST+R+IL+IAV   W +RQ+D NNAFLNG
Subjt:  TSPSVFFWSLVSLDV-----LIKFSDTTTNLPNNTTLQGSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNG

Query:  TLDEEVYMSQPPGYIDPSCPHHVCKLNKAIYGLKQAPRAWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDG
         L E V+M QP G+ DP+ PHHVCK  KAIYGLKQAPRAW + LK ALL WGFQN+RSD+SL+F +    + FLL+YVDD+++TG+N   +   I  L+ 
Subjt:  TLDEEVYMSQPPGYIDPSCPHHVCKLNKAIYGLKQAPRAWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDG

Query:  RFALKDLGRLSFFLGIQVNYLSSGILLTQEKYINDLLHKLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFL
         FALKDLG+L +FLGI+V+  S G+ L Q KYI D+L K +++     P+P V G+  +  +G+P+ +P ++R  IGALQYLT +RPDIAY VN L Q++
Subjt:  RFALKDLGRLSFFLGIQVNYLSSGILLTQEKYINDLLHKLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFL

Query:  KQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNLDISAYSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQ
          PTD HW  +KR+LRY+ GT +  L I+S ++LDI+ +SDA WA +IDDRKS+A  CVFLGE+L++WSS+KQ  ++RSSTESEYRAL    AEV W+K 
Subjt:  KQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNLDISAYSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQ

Query:  LLTEIGCSSSSKPVLWCDNLSAGALATNPVFHARTKHIEIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAELPARLRGNVKE
        LL EI      KP+LWCDNLSA ALA+NPV HAR+KHIE+D+HF+RDQVL+  + + YVP++DQ+ADCLTK ++HS+F   R KLG+ + P RLRG +K+
Subjt:  LLTEIGCSSSSKPVLWCDNLSAGALATNPVFHARTKHIEIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAELPARLRGNVKE

Query:  INQPCKI
         N+ C I
Subjt:  INQPCKI

A0A2K3LG81 Retrofit protein | e-value: 1.1e-156 | identity: 56.49%
Query:  VLIKFSDTTTNLPNNTTLQ------GSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPPG
        +L+ + D T  + +    +      GSI+R KARLVAKGF Q+ G+D+ ETFSPVVKAST+R+ILSIAV   W +RQLD NNAFLNG L E VYM QP G
Subjt:  VLIKFSDTTTNLPNNTTLQ------GSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPPG

Query:  YIDPSCPHHVCKLNKAIYGLKQAPRAWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFF
        ++DP+ P+H+CKL+KAIYGLKQAPRAW ++LK+ALL+WGFQN++SD+SL+  K    + FLL+YVDD+++TGNN+  +   I  L+  F+LKDLG+L +F
Subjt:  YIDPSCPHHVCKLNKAIYGLKQAPRAWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFF

Query:  LGIQVNYLSSGILLTQEKYINDLLHKLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKR
        LGI+V    SG+ L Q KYI DLL K ++++    P+P + G+  ++ +G+ L DP V+R  IG LQYLT TRPDIA+ VN LSQ++  PT  HW  +KR
Subjt:  LGIQVNYLSSGILLTQEKYINDLLHKLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKR

Query:  VLRYISGTKHLGLLIQSGSNLDISAYSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSKP
        +LRY+ GT +  L I+  ++LDI+ +SDADWA++IDDRKS++  CVFLGE L+SWSS+KQ  ++RSSTESEYRALA   AE+ W++ LLTE+      KP
Subjt:  VLRYISGTKHLGLLIQSGSNLDISAYSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSKP

Query:  VLWCDNLSAGALATNPVFHARTKHIEIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAELP
        +LWCDNLSA ALA+NPV HAR+KHIEIDVH++RDQVL+  + V YVPTADQ+ADCLTKP+SH++F   R KLG+   P
Subjt:  VLWCDNLSAGALATNPVFHARTKHIEIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAELP

A0A803NU97 Uncharacterized protein | e-value: 1.5e-158 | identity: 59.52%
Query:  GSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPPGYIDPSCPHHVCKLNKAIYGLKQAPR
        GS+  YK+RLVAKG+ Q+PG+D+ ETFSPVVK  T+R +LS+AVT  W ++QLD +NAFLNG L E VYM QP G+ DP+ P+HVC L+KAIYGLKQAPR
Subjt:  GSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPPGYIDPSCPHHVCKLNKAIYGLKQAPR

Query:  AWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFFLGIQVNYLSSGILLTQEKYINDLLH
        AW   L++ LL WGFQ SRSDTSL+ Y  G ++I LLVYVDD+++TG N  L+S L+T L+  FALKDLG + +FLGI++   +SG+ L+Q KYI DLL 
Subjt:  AWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFFLGIQVNYLSSGILLTQEKYINDLLH

Query:  KLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNLDISA
        +L+L+ +K  P+P    + LSLTDG P  DP +YRST+GALQYLT TRPD+A+I+N LSQF+  PT  HW A KR+LRY+ GT   GLL++  + +DISA
Subjt:  KLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNLDISA

Query:  YSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSKPVLWCDNLSAGALATNPVFHARTKHI
        YSDADWA+ IDDRKS   Y VFLG NLVSWS+KKQ  +ARSSTESE+R+LA+T AE+ WL  LL+E+  S +S PV+W DN  A ALA NP+FHAR+KHI
Subjt:  YSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSKPVLWCDNLSAGALATNPVFHARTKHI

Query:  EIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAELPARLRGNVKE
        EID+HFVRDQ+L   L VRYVP  DQ+AD LTK +S  +F+Y + KL +A  P RLRG++ +
Subjt:  EIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAELPARLRGNVKE

A5AYB0 Integrase catalytic domain-containing protein | e-value: 3.8e-154 | identity: 57.08%
Query:  GSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPPGYIDPSCPHHVCKLNKAIYGLKQAPR
        GSI R+KARLVA+GF Q+PG+D+F+TFSPVVK  TIR+IL++AV+  WS+RQLD  NAFLNG L+EEV+M+QP G+++P+ P +VCKL+KA+YGLKQAPR
Subjt:  GSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPPGYIDPSCPHHVCKLNKAIYGLKQAPR

Query:  AWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFFLGIQVNYLSSGILLTQEKYINDLLH
        AW   L+ ALL +GFQ+SR+DTSL+ +    D++ LLVYVDD+++TG+N +L+SH I+ L  +FAL+DLG LS+FLGIQ   L S + L Q KYI DLL+
Subjt:  AWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFFLGIQVNYLSSGILLTQEKYINDLLH

Query:  KLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNLDISA
        + ++E  KPAP+P  +G+ LS +DG  L+DP  YR T+GALQY+T TRPDIA+ VN   QF+ +P+D+HW AVKR+LRY+ GT HLGL  Q  +++++  
Subjt:  KLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNLDISA

Query:  YSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSKPVLWCDNLSAGALATNPVFHARTKHI
        YSDADWAS  DDR+S + YCVFLG NL+SWSS KQ  +++SS ESEYR L   TAE++W++ LL E+ C  +S P+LWCDN SA  LA NPVFH+R+KHI
Subjt:  YSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSKPVLWCDNLSAGALATNPVFHARTKHI

Query:  EIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAELPARLRGN
        E+D+HF+R++VL+  L++ YVP+ DQLAD  TK +  +QF   RSKL +   P  LRG+
Subjt:  EIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAELPARLRGN

SwissProt top hits (e-value, % identity, alignment)
P04146 Copia protein | e-value: 2.2e-82 | identity: 39.35%
Query:  GSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPPGYIDPSC-PHHVCKLNKAIYGLKQAP
        G+  RYKARLVA+GF Q   +D+ ETF+PV + S+ R ILS+ +     + Q+D   AFLNGTL EE+YM  P G    SC   +VCKLNKAIYGLKQA 
Subjt:  GSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPPGYIDPSC-PHHVCKLNKAIYGLKQAP

Query:  RAWTNTLKSALLSWGFQNSRSDTSLYFYKRG--CDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFFLGIQVNYLSSGILLTQEKYIND
        R W    + AL    F NS  D  +Y   +G   + I++L+YVDDVV+   +   M++    L  +F + DL  +  F+GI++      I L+Q  Y+  
Subjt:  RAWTNTLKSALLSWGFQNSRSDTSLYFYKRG--CDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFFLGIQVNYLSSGILLTQEKYIND

Query:  LLHKLELEDLK--PAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQY-LTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLIQSGS
        +L K  +E+      P P  +   L L   +    P   RS IG L Y +  TRPD+   VN LS++  +     W  +KRVLRY+ GT  + L+ +   
Subjt:  LLHKLELEDLK--PAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQY-LTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLIQSGS

Query:  NLD--ISAYSDADWASNIDDRKSVAAYCVFLGE-NLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSKPVLWCDNLSAGALATNP
          +  I  Y D+DWA +  DRKS   Y   + + NL+ W++K+Q ++A SSTE+EY AL     E +WLK LLT I     +   ++ DN    ++A NP
Subjt:  NLD--ISAYSDADWASNIDDRKSVAAYCVFLGE-NLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSKPVLWCDNLSAGALATNP

Query:  VFHARTKHIEIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAE
          H R KHI+I  HF R+QV    + + Y+PT +QLAD  TKP+  ++F+  R KLGL +
Subjt:  VFHARTKHIEIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value: 3.6e-85 | identity: 40.74%
Query:  RYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPPGYIDPSCPHHVCKLNKAIYGLKQAPRAWTN
        RYKARLV KGF Q  G+DF E FSPVVK ++IR ILS+A +    + QLD   AFL+G L+EE+YM QP G+      H VCKLNK++YGLKQAPR W  
Subjt:  RYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPPGYIDPSCPHHVCKLNKAIYGLKQAPRAWTN

Query:  TLKSALLSWGFQNSRSDTSLYFYK-RGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFFLGIQV--NYLSSGILLTQEKYINDLLHK
           S + S  +  + SD  +YF +    + I LL+YVDD+++ G +  L++ L   L   F +KDLG     LG+++     S  + L+QEKYI  +L +
Subjt:  TLKSALLSWGFQNSRSDTSLYFYK-RGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFFLGIQV--NYLSSGILLTQEKYINDLLHK

Query:  LELEDLKPAPSPCVVGKHLSLT----------DGQPLADPFVYRSTIGALQY-LTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLI
          +++ KP  +P  +  HL L+           G     P  Y S +G+L Y +  TRPDIA+ V  +S+FL+ P   HW AVK +LRY+ GT     L 
Subjt:  LELEDLKPAPSPCVVGKHLSLT----------DGQPLADPFVYRSTIGALQY-LTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLI

Query:  QSGSNLDISAYSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSKPVLWCDNLSAGALATN
          GS+  +  Y+DAD A +ID+RKS   Y        +SW SK Q  +A S+TE+EY A   T  E+IWLK+ L E+G     + V++CD+ SA  L+ N
Subjt:  QSGSNLDISAYSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSKPVLWCDNLSAGALATN

Query:  PVFHARTKHIEIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGL
         ++HARTKHI++  H++R+ V   SL+V  + T +  AD LTK +  ++F   +  +G+
Subjt:  PVFHARTKHIEIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGL

P92519 Uncharacterized mitochondrial protein AtMg00810 | e-value: 7.5e-51 | identity: 44.69%
Query:  IFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFFLGIQVNYLSSGILLTQEKYINDLLHKLELEDLKPAPSPCVVGKHLSLTDGQPLADPFV
        ++LL+YVDD+++TG+++ L++ LI  L   F++KDLG + +FLGIQ+    SG+ L+Q KY   +L+   + D KP  +P  +  + S++  +   DP  
Subjt:  IFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFFLGIQVNYLSSGILLTQEKYINDLLHKLELEDLKPAPSPCVVGKHLSLTDGQPLADPFV

Query:  YRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNLDISAYSDADWASNIDDRKSVAAYCVFLGENLVSWSSK
        +RS +GALQYLT TRPDI+Y VN + Q + +PT   ++ +KRVLRY+ GT   GL I   S L++ A+ D+DWA     R+S   +C FLG N++SWS+K
Subjt:  YRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNLDISAYSDADWASNIDDRKSVAAYCVFLGENLVSWSSK

Query:  KQTAIARSSTESEYRALAHTTAEVIW
        +Q  ++RSSTE+EYRALA T AE+ W
Subjt:  KQTAIARSSTESEYRALAHTTAEVIW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e-value: 2.1e-138 | identity: 51.88%
Query:  GSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPPGYIDPSCPHHVCKLNKAIYGLKQAPR
        GS+ RYKARLVAKG++Q PG+D+ ETFSPV+K+++IR++L +AV + W +RQLD NNAFL GTL ++VYMSQPPG+ID   P++VCKL KA+YGLKQAPR
Subjt:  GSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPPGYIDPSCPHHVCKLNKAIYGLKQAPR

Query:  AWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFFLGIQVNYLSSGILLTQEKYINDLLH
        AW   L++ LL+ GF NS SDTSL+  +RG  ++++LVYVDD+++TGN+  L+ + +  L  RF++KD   L +FLGI+   + +G+ L+Q +YI DLL 
Subjt:  AWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFFLGIQVNYLSSGILLTQEKYINDLLH

Query:  KLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNLDISA
        +  +   KP  +P      LSL  G  L DP  YR  +G+LQYL  TRPDI+Y VN LSQF+  PT+ H  A+KR+LRY++GT + G+ ++ G+ L + A
Subjt:  KLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNLDISA

Query:  YSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSKPVLWCDNLSAGALATNPVFHARTKHI
        YSDADWA + DD  S   Y V+LG + +SWSSKKQ  + RSSTE+EYR++A+T++E+ W+  LLTE+G   +  PV++CDN+ A  L  NPVFH+R KHI
Subjt:  YSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSKPVLWCDNLSAGALATNPVFHARTKHI

Query:  EIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAELP
         ID HF+R+QV  G+L V +V T DQLAD LTKP+S + F    SK+G+  +P
Subjt:  EIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAELP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e-value: 5.1e-140 | identity: 51.53%
Query:  GSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPPGYIDPSCPHHVCKLNKAIYGLKQAPR
        GS+ RYKARLVAKG++Q PG+D+ ETFSPV+K+++IR++L +AV + W +RQLD NNAFL GTL +EVYMSQPPG++D   P +VC+L KAIYGLKQAPR
Subjt:  GSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPPGYIDPSCPHHVCKLNKAIYGLKQAPR

Query:  AWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFFLGIQVNYLSSGILLTQEKYINDLLH
        AW   L++ LL+ GF NS SDTSL+  +RG  +I++LVYVDD+++TGN+ +L+ H +  L  RF++K+   L +FLGI+   +  G+ L+Q +Y  DLL 
Subjt:  AWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFFLGIQVNYLSSGILLTQEKYINDLLH

Query:  KLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNLDISA
        +  +   KP  +P      L+L  G  L DP  YR  +G+LQYL  TRPD++Y VN LSQ++  PTD HWNA+KRVLRY++GT   G+ ++ G+ L + A
Subjt:  KLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNLDISA

Query:  YSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSKPVLWCDNLSAGALATNPVFHARTKHI
        YSDADWA + DD  S   Y V+LG + +SWSSKKQ  + RSSTE+EYR++A+T++E+ W+  LLTE+G   S  PV++CDN+ A  L  NPVFH+R KHI
Subjt:  YSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSKPVLWCDNLSAGALATNPVFHARTKHI

Query:  EIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAELPARLRG
         +D HF+R+QV  G+L V +V T DQLAD LTKP+S   F     K+G+ ++P    G
Subjt:  EIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKLGLAELPARLRG

Arabidopsis top hits (e-value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e-value: 7.6e-99 | identity: 43.07%
Query:  GSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPPGYI----DPSCPHHVCKLNKAIYGLK
        G+I+RYKARLVAKG+ Q  G+DF ETFSPV K +++++IL+I+    +++ QLD +NAFLNG LDEE+YM  PPGY     D   P+ VC L K+IYGLK
Subjt:  GSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPPGYI----DPSCPHHVCKLNKAIYGLK

Query:  QAPRAWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFFLGIQVNYLSSGILLTQEKYIN
        QA R W       L+ +GF  S SD + +        + +LVYVDD+++  NND  +  L + L   F L+DLG L +FLG+++   ++GI + Q KY  
Subjt:  QAPRAWTNTLKSALLSWGFQNSRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFFLGIQVNYLSSGILLTQEKYIN

Query:  DLLHKLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNL
        DLL +  L   KP+  P       S   G    D   YR  IG L YL  TR DI++ VN LSQF + P   H  AV ++L YI GT   GL   S + +
Subjt:  DLLHKLELEDLKPAPSPCVVGKHLSLTDGQPLADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNL

Query:  DISAYSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSKPVLWCDNLSAGALATNPVFHAR
         +  +SDA + S  D R+S   YC+FLG +L+SW SKKQ  +++SS E+EYRAL+  T E++WL Q   E+    S   +L+CDN +A  +ATN VFH R
Subjt:  DISAYSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTAIARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSKPVLWCDNLSAGALATNPVFHAR

Query:  TKHIEIDVHFVRDQ-VLKGSLEVRYVPTADQLADCLTK---PISHSQFLYHRSKLGLAELPA
        TKHIE D H VR++ V + +L   +    +Q  D  T+   PI     +Y  S  GLA L A
Subjt:  TKHIEIDVHFVRDQ-VLKGSLEVRYVPTADQLADCLTK---PISHSQFLYHRSKLGLAELPA

ATMG00240.1 Gag-Pol-related retrotransposon family protein | e-value: 4.9e-13 | identity: 44.87%
Query:  YLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNLDISAYSDADWASNIDDRKSVAAYC
        YLT TRPD+ + VN LSQF          AV +VL Y+ GT   GL   + S+L + A++D+DWAS  D R+SV  +C
Subjt:  YLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNLDISAYSDADWASNIDDRKSVAAYC

ATMG00810.1 DNA/RNA polymerases superfamily protein | e-value: 5.3e-52 | identity: 44.69%
Query:  IFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFFLGIQVNYLSSGILLTQEKYINDLLHKLELEDLKPAPSPCVVGKHLSLTDGQPLADPFV
        ++LL+YVDD+++TG+++ L++ LI  L   F++KDLG + +FLGIQ+    SG+ L+Q KY   +L+   + D KP  +P  +  + S++  +   DP  
Subjt:  IFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFFLGIQVNYLSSGILLTQEKYINDLLHKLELEDLKPAPSPCVVGKHLSLTDGQPLADPFV

Query:  YRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNLDISAYSDADWASNIDDRKSVAAYCVFLGENLVSWSSK
        +RS +GALQYLT TRPDI+Y VN + Q + +PT   ++ +KRVLRY+ GT   GL I   S L++ A+ D+DWA     R+S   +C FLG N++SWS+K
Subjt:  YRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNLDISAYSDADWASNIDDRKSVAAYCVFLGENLVSWSSK

Query:  KQTAIARSSTESEYRALAHTTAEVIW
        +Q  ++RSSTE+EYRALA T AE+ W
Subjt:  KQTAIARSSTESEYRALAHTTAEVIW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | e-value: 5.6e-09 | identity: 62.79%
Query:  GSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIA
        G++ R KARLVAKGFHQ  G+ F ET+SPVV+ +TIR IL++A
Subjt:  GSIQRYKARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIA


Sequences
CDS sequence
ATGTGGTGCTCGTCACATCGCCTCAGATGCTTAAGTCAGAAAACTGATGGAAAAGCTAAAGAGCAAGAAGAGAAGTCGAGAATAGAGTTCGGGATTCTTCCTTCAGTGAT
GAAGAATGATTTAAATACTTGTTCCTGCCCTAGGGTTTTTAGGAATTCAGAGGCGTTTCGGGCTAAACCAGGTGAAACCGGGGTGGCTAGGGGCAGCAGGGACCGAACGG
AGGGGGAAGAGCTCAGCCTCGGCCTTGGCCGAGCCCGAGCATATGGGTCGGGCCCAATTGGGCCGACCCTATGGTCTGTTCATCCTCTGGGGTTGGTTTTTCGGTCGTAT
TTTTGCCCGGTTGTCCTCATCGGCTCCTTGTACATCAGAGTGGTAATTTTGGACCACACAGACGCACAAAGAACTGACGATGACAATCGGACAGAGCTAAGACCAAAAGA
GCCAAAAGAAGGAAGACCGGACCAAAGGGTCGGGCCATGTTATGGTATGCCTCGCCCTTTTGCCGAGGCCGAGCATTCAGCCCGCTTGCGCGGGCCGAGCCCAGTGACCT
CTTTTTGGTCCCTGATGTCCCGGATCGCCTGGCATCGGAGGCAGTGTGGCCTACACCACGCCGGTGTGCAGCGGTTTTTGTTGATCTTGCAAGTCACGTCTTTCCCAGTT
TCTACAAATTCATTGTTGGTGTCACGTGAAGGTCAGATAATTTTGGATCATTCCGATATGCAAAGAGCTGACGAGGACAACGCAATCCCACCCAAAGAGACAATCAGGAA
ACGGATCTCGGAGCAAGAACAGGCCAAAGGGTTGGTCCAAGGCTGGAGGGATCGGGCCTTGGCCCAACCCCCACGGTTGGCCTCGGTCGATCCGTGCGGGACGAGTCCTT
CCGTCTTCTTTTGGTCCCTGGTGTCTCTGGACGTCCTGATAAAATTTAGTGACACCACAACTAATCTACCAAATAATACAACGCTACAAGGCTCAATACAACGCTACAAG
GCTCGCTTGGTTGCCAAGGGATTCCATCAAAGCCCGGGTGTCGATTTCTTCGAAACCTTTAGCCCTGTGGTCAAGGCTTCTACTATTCGAGTAATTCTTAGCATTGCAGT
TACAAAAGGATGGTCGATGAGGCAACTTGACTTCAATAATGCCTTTCTAAATGGCACTTTGGATGAAGAGGTATATATGTCCCAGCCACCCGGTTATATTGATCCCTCTT
GCCCTCATCACGTGTGTAAACTCAACAAGGCCATCTATGGCCTAAAGCAGGCTCCTCGGGCGTGGACAAACACTCTCAAATCTGCTCTTCTTTCTTGGGGATTTCAGAAT
TCAAGGTCAGATACTTCTCTTTATTTTTACAAACGCGGCTGTGATGTTATCTTTCTGCTAGTGTATGTCGATGATGTAGTGATGACTGGAAATAATGATCTGTTGATGTC
TCACCTAATTACTGTTTTGGATGGGCGTTTTGCTTTAAAAGACTTGGGGCGGCTAAGTTTCTTTCTGGGGATTCAGGTTAATTACTTATCTTCTGGAATTTTACTCACTC
AAGAGAAATATATAAATGATTTGTTGCATAAGCTGGAGCTTGAGGATCTTAAACCAGCCCCATCTCCCTGTGTTGTCGGCAAGCACTTGTCCTTGACTGATGGCCAGCCC
CTCGCAGATCCCTTTGTGTATCGCAGCACCATTGGTGCTTTACAGTACCTCACAACCACGCGTCCAGATATTGCCTATATAGTAAATCATCTTAGTCAATTCTTGAAGCA
ACCTACTGACATACATTGGAATGCTGTGAAGAGAGTCTTACGGTATATTAGTGGCACGAAGCATCTTGGTTTATTGATTCAATCTGGTTCGAATCTTGATATTTCAGCTT
ACTCTGATGCTGACTGGGCATCTAATATTGATGACCGTAAATCAGTTGCTGCCTACTGTGTATTTCTAGGTGAAAATTTAGTGTCATGGTCATCTAAAAAGCAAACGGCG
ATTGCTCGTTCAAGCACAGAGTCGGAATATAGAGCCTTAGCTCATACTACTGCTGAAGTTATATGGCTTAAACAGCTTCTGACCGAAATTGGTTGTTCTTCCTCCTCTAA
ACCTGTTCTCTGGTGTGATAATCTAAGTGCCGGGGCACTTGCCACTAATCCGGTGTTCCATGCTCGCACAAAACACATTGAGATCGATGTTCACTTTGTGCGAGATCAAG
TTCTTAAAGGAAGTCTTGAGGTCCGTTATGTTCCAACAGCTGACCAGCTTGCAGATTGCTTAACTAAACCAATCTCTCACTCTCAGTTTCTATATCATCGCTCCAAACTC
GGACTAGCTGAACTACCCGCTCGTTTGAGGGGGAATGTTAAGGAGATTAATCAACCATGTAAGATTAGTGAAGGTGACTCAATTAATCAATCCAAATGA
mRNA sequence
ATGTGGTGCTCGTCACATCGCCTCAGATGCTTAAGTCAGAAAACTGATGGAAAAGCTAAAGAGCAAGAAGAGAAGTCGAGAATAGAGTTCGGGATTCTTCCTTCAGTGAT
GAAGAATGATTTAAATACTTGTTCCTGCCCTAGGGTTTTTAGGAATTCAGAGGCGTTTCGGGCTAAACCAGGTGAAACCGGGGTGGCTAGGGGCAGCAGGGACCGAACGG
AGGGGGAAGAGCTCAGCCTCGGCCTTGGCCGAGCCCGAGCATATGGGTCGGGCCCAATTGGGCCGACCCTATGGTCTGTTCATCCTCTGGGGTTGGTTTTTCGGTCGTAT
TTTTGCCCGGTTGTCCTCATCGGCTCCTTGTACATCAGAGTGGTAATTTTGGACCACACAGACGCACAAAGAACTGACGATGACAATCGGACAGAGCTAAGACCAAAAGA
GCCAAAAGAAGGAAGACCGGACCAAAGGGTCGGGCCATGTTATGGTATGCCTCGCCCTTTTGCCGAGGCCGAGCATTCAGCCCGCTTGCGCGGGCCGAGCCCAGTGACCT
CTTTTTGGTCCCTGATGTCCCGGATCGCCTGGCATCGGAGGCAGTGTGGCCTACACCACGCCGGTGTGCAGCGGTTTTTGTTGATCTTGCAAGTCACGTCTTTCCCAGTT
TCTACAAATTCATTGTTGGTGTCACGTGAAGGTCAGATAATTTTGGATCATTCCGATATGCAAAGAGCTGACGAGGACAACGCAATCCCACCCAAAGAGACAATCAGGAA
ACGGATCTCGGAGCAAGAACAGGCCAAAGGGTTGGTCCAAGGCTGGAGGGATCGGGCCTTGGCCCAACCCCCACGGTTGGCCTCGGTCGATCCGTGCGGGACGAGTCCTT
CCGTCTTCTTTTGGTCCCTGGTGTCTCTGGACGTCCTGATAAAATTTAGTGACACCACAACTAATCTACCAAATAATACAACGCTACAAGGCTCAATACAACGCTACAAG
GCTCGCTTGGTTGCCAAGGGATTCCATCAAAGCCCGGGTGTCGATTTCTTCGAAACCTTTAGCCCTGTGGTCAAGGCTTCTACTATTCGAGTAATTCTTAGCATTGCAGT
TACAAAAGGATGGTCGATGAGGCAACTTGACTTCAATAATGCCTTTCTAAATGGCACTTTGGATGAAGAGGTATATATGTCCCAGCCACCCGGTTATATTGATCCCTCTT
GCCCTCATCACGTGTGTAAACTCAACAAGGCCATCTATGGCCTAAAGCAGGCTCCTCGGGCGTGGACAAACACTCTCAAATCTGCTCTTCTTTCTTGGGGATTTCAGAAT
TCAAGGTCAGATACTTCTCTTTATTTTTACAAACGCGGCTGTGATGTTATCTTTCTGCTAGTGTATGTCGATGATGTAGTGATGACTGGAAATAATGATCTGTTGATGTC
TCACCTAATTACTGTTTTGGATGGGCGTTTTGCTTTAAAAGACTTGGGGCGGCTAAGTTTCTTTCTGGGGATTCAGGTTAATTACTTATCTTCTGGAATTTTACTCACTC
AAGAGAAATATATAAATGATTTGTTGCATAAGCTGGAGCTTGAGGATCTTAAACCAGCCCCATCTCCCTGTGTTGTCGGCAAGCACTTGTCCTTGACTGATGGCCAGCCC
CTCGCAGATCCCTTTGTGTATCGCAGCACCATTGGTGCTTTACAGTACCTCACAACCACGCGTCCAGATATTGCCTATATAGTAAATCATCTTAGTCAATTCTTGAAGCA
ACCTACTGACATACATTGGAATGCTGTGAAGAGAGTCTTACGGTATATTAGTGGCACGAAGCATCTTGGTTTATTGATTCAATCTGGTTCGAATCTTGATATTTCAGCTT
ACTCTGATGCTGACTGGGCATCTAATATTGATGACCGTAAATCAGTTGCTGCCTACTGTGTATTTCTAGGTGAAAATTTAGTGTCATGGTCATCTAAAAAGCAAACGGCG
ATTGCTCGTTCAAGCACAGAGTCGGAATATAGAGCCTTAGCTCATACTACTGCTGAAGTTATATGGCTTAAACAGCTTCTGACCGAAATTGGTTGTTCTTCCTCCTCTAA
ACCTGTTCTCTGGTGTGATAATCTAAGTGCCGGGGCACTTGCCACTAATCCGGTGTTCCATGCTCGCACAAAACACATTGAGATCGATGTTCACTTTGTGCGAGATCAAG
TTCTTAAAGGAAGTCTTGAGGTCCGTTATGTTCCAACAGCTGACCAGCTTGCAGATTGCTTAACTAAACCAATCTCTCACTCTCAGTTTCTATATCATCGCTCCAAACTC
GGACTAGCTGAACTACCCGCTCGTTTGAGGGGGAATGTTAAGGAGATTAATCAACCATGTAAGATTAGTGAAGGTGACTCAATTAATCAATCCAAATGA
Protein sequence
MWCSSHRLRCLSQKTDGKAKEQEEKSRIEFGILPSVMKNDLNTCSCPRVFRNSEAFRAKPGETGVARGSRDRTEGEELSLGLGRARAYGSGPIGPTLWSVHPLGLVFRSY
FCPVVLIGSLYIRVVILDHTDAQRTDDDNRTELRPKEPKEGRPDQRVGPCYGMPRPFAEAEHSARLRGPSPVTSFWSLMSRIAWHRRQCGLHHAGVQRFLLILQVTSFPV
STNSLLVSREGQIILDHSDMQRADEDNAIPPKETIRKRISEQEQAKGLVQGWRDRALAQPPRLASVDPCGTSPSVFFWSLVSLDVLIKFSDTTTNLPNNTTLQGSIQRYK
ARLVAKGFHQSPGVDFFETFSPVVKASTIRVILSIAVTKGWSMRQLDFNNAFLNGTLDEEVYMSQPPGYIDPSCPHHVCKLNKAIYGLKQAPRAWTNTLKSALLSWGFQN
SRSDTSLYFYKRGCDVIFLLVYVDDVVMTGNNDLLMSHLITVLDGRFALKDLGRLSFFLGIQVNYLSSGILLTQEKYINDLLHKLELEDLKPAPSPCVVGKHLSLTDGQP
LADPFVYRSTIGALQYLTTTRPDIAYIVNHLSQFLKQPTDIHWNAVKRVLRYISGTKHLGLLIQSGSNLDISAYSDADWASNIDDRKSVAAYCVFLGENLVSWSSKKQTA
IARSSTESEYRALAHTTAEVIWLKQLLTEIGCSSSSKPVLWCDNLSAGALATNPVFHARTKHIEIDVHFVRDQVLKGSLEVRYVPTADQLADCLTKPISHSQFLYHRSKL
GLAELPARLRGNVKEINQPCKISEGDSINQSK