; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027808 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027808
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr8:5361911..5363113
RNA-Seq ExpressionLag0027808
SyntenyLag0027808
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7542996.1 Ribonuclease H-like superfamily [Arabidopsis thaliana x Arabidopsis arenosa]1.4e-6639.55Show/hide
Query:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKVTLEDELTNFLPNFVLPKLFNVD---TSLDHGIS
        +R+FG LC+ STLP H+TKFS R  P VF+GYP  +K +++  ++  + + SR+V+FHENIF F  V L+D   +  P+ +LP L   D   +S  H I 
Subjt:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKVTLEDELTNFLPNFVLPKLFNVD---TSLDHGIS

Query:  ETFSLDNSTTVVGDRNETSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHIHEASPKQSYVSFL
        +++    S++ + +   +S   P S  N+ V T S+            + H+                S+ H+    +   +P+  + +  P    V   
Subjt:  ETFSLDNSTTVVGDRNETSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHIHEASPKQSYVSFL

Query:  RRSTRATKQPTYLQDFHCHLASSSNLPPTPM---RHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWSIVPLPP
         R  R  K P+YL  +HC+L  + N  PTP+   R+PL + + Y  LAP FQ F L+I     P  + QA+  E+W++A  +ELTA+E + TWS+V LPP
Subjt:  RRSTRATKQPTYLQDFHCHLASSSNLPPTPM---RHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWSIVPLPP

Query:  KKHSIGCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLLGYQPP
          + +GCKWV+ IK+ A+G++ERYKARLVAK YTQ+EG+D+ +TFS VAKL T+++LL V   +N +L Q+DV+NAFLHG+L EE+YM L  GY PP
Subjt:  KKHSIGCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLLGYQPP

RVW23791.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]3.4e-6542.01Show/hide
Query:  RLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKVTLEDELTNFLPNFVLPKLFNVDTSLDHGISETFS
        R FGCL FASTL +H+TKF  R+   V +GYPP +K YRL+ +  ++ F S+D IFHE IF FH  T  D+L +  PN VL                   
Subjt:  RLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKVTLEDELTNFLPNFVLPKLFNVDTSLDHGISETFS

Query:  LDNSTTVVGDRNETSLGQPVSIANNTVLTESTIQPPDLDP-LQPPTMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHIHEASPKQSYVSFLRRS
                          P+S    + +++S +  P +D  +  P    L P      ++ F + ++   D +          +H ++P  + V  LRRS
Subjt:  LDNSTTVVGDRNETSLGQPVSIANNTVLTESTIQPPDLDP-LQPPTMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHIHEASPKQSYVSFLRRS

Query:  TRATKQPTYLQDFHCHLASSSNLPPTPMRHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWSIVPLPPKKHSIG
         R +K P YL+DFH +L S  +LP     +PL  Y+ +  L+ S                    V F HWR AM+ EL AME N TWS+V LP  KHSIG
Subjt:  TRATKQPTYLQDFHCHLASSSNLPPTPMRHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWSIVPLPPKKHSIG

Query:  CKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLLGY
        CKW+YK+K K++GSIER+KA +VAK YTQ+EGLDF ETFS VAKLVT++VLL +   Q   LVQLDVNNAFL+G LFEEVYMDL LGY
Subjt:  CKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLLGY

RVX06074.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]2.0e-6541.15Show/hide
Query:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKVTLEDELTNFLPNFVLPKLFNVDTSLDHGISETF
        +R+FGCLC+ STL +++TKFS R   VVF+GYP   K Y+L  IE R    SR+VIFHE IF F K                    N  +SLD  IS   
Subjt:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKVTLEDELTNFLPNFVLPKLFNVDTSLDHGISETF

Query:  SLDNSTTVVGDRNETSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHIHEASPKQSYVSFLRRS
          D     +   N+ S         ++VL     QPP            LQ                    V PS                       R 
Subjt:  SLDNSTTVVGDRNETSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHIHEASPKQSYVSFLRRS

Query:  TRATKQPTYLQDFHCHLASS-SNLPPTPMRHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWSIVPLPPKKHSI
        TR +KQP+YL+D+HC L +S +++      HP+Q+++ Y +L+PS++ F+LS+     P  + +A     WR AM+ EL A+E N TWSIV LP  KH +
Subjt:  TRATKQPTYLQDFHCHLASS-SNLPPTPMRHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWSIVPLPPKKHSI

Query:  GCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLLGYQPPKASIVQGS
        GCKWVYKIKHKANG+IERYKARLVAK YTQ+EG+D+++TFS VAKLVT+++LL +   +   L QLDVNNAFLHGDL EEVYM L  GY     S+   +
Subjt:  GCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLLGYQPPKASIVQGS

Query:  V
        V
Subjt:  V

TYK16758.1 Copia protein [Cucumis melo var. makuwa]2.4e-7943.88Show/hide
Query:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKVTLEDELTNFLPNFVLPKLFNVDTSLDHGISETF
        +++FG LC+AS+LP +++KF  R  P VF+GYP  MKAY+L+ IE ++ F SRDVIFHE  F FH +       + LP F LPK F+      H      
Subjt:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKVTLEDELTNFLPNFVLPKLFNVDTSLDHGISETF

Query:  SLDNSTTVVGDRNETSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHIHEASPKQSYVSFLRRS
                      T+L  P ++  NT  T            + P+   ++ +D    + +  +    + D+  +     SQ   +  P  +  + +R+S
Subjt:  SLDNSTTVVGDRNETSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHIHEASPKQSYVSFLRRS

Query:  TRATKQPTYLQDFHCHLASSSNLPPT--PMRHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWSIVPLPPKKHS
        TR TK P+YLQ +HC L ++ + P T    ++P+  Y+ Y  L+P+++   L +       +YH+AV  + WR+AM+ EL AME N TWSIVPLP  K+S
Subjt:  TRATKQPTYLQDFHCHLASSSNLPPT--PMRHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWSIVPLPPKKHS

Query:  IGCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLLGYQP
        IGC+WVYKIKHK +GSIERYKARLVAK YTQ+EGLD+ ETFS VAK+VT++ LLT+ VS+   LVQLDVNNAFLHG+LFEEVYMDL LGY+P
Subjt:  IGCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLLGYQP

XP_022154919.1 uncharacterized protein LOC111022065 [Momordica charantia]2.2e-9348.39Show/hide
Query:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKVTLEDELTNFLPNFVLPKLFN-VDTSL----DH-
        +++FGCLCF ST P +++KF  R    VFVGYPP MK Y+L+ IE +RFF SRDVIFHE+IF FH V++   + +  P  V+PK ++ VDTS     DH 
Subjt:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKVTLEDELTNFLPNFVLPKLFN-VDTSL----DH-

Query:  ------GISETFSLDNSTTVVGDRNETSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHT--LQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHIHE
               +S T +  + T V+ D      G P+ +AN +    S I   +   + P  +    + P   L ++   VD +      +PS ++        
Subjt:  ------GISETFSLDNSTTVVGDRNETSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHT--LQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHIHE

Query:  ASPKQSYVSFLRRSTRATKQPTYLQDFHCHLASSSNLPPTPMRHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNT
                  LRRS+R  ++P+YL+D+HC L  +++   + + +PLQ Y+ Y+ L+ S++ F LS+  ++ PQ+YHQAVPF HWR+AM  EL AMEAN+T
Subjt:  ASPKQSYVSFLRRSTRATKQPTYLQDFHCHLASSSNLPPTPMRHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNT

Query:  WSIVPLPPKKHSIGCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLL
        WS+VPLP + HSIGCKW+YK+KHK++GSIERYKARLVAK YTQ+EGLD+IETFS VAKLVT++VLLT+ VS N +LVQLDVNNAFLHGDLFEEVYMDL L
Subjt:  WSIVPLPPKKHSIGCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLL

Query:  GYQ
        GY+
Subjt:  GYQ

TrEMBL top hitse value%identityAlignment
A0A2N9EHN7 Integrase catalytic domain-containing protein1.6e-7645.84Show/hide
Query:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKVTLEDELTNFLPNFVLPKLFNVDTSLDHGISETF
        +++FGCLCFASTL SH+TKF  R    VF+GYP  +K Y+L  +   + F SRDV+FHE IF F   T   + T FL             S    IS T 
Subjt:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKVTLEDELTNFLPNFVLPKLFNVDTSLDHGISETF

Query:  SLDNSTTVVGDRNETSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPS-QHIHEASPKQSYVSFLRR
                           P  I + +++ +  +    + P  P    +  P   LP    F D S H   ++ + + +PS  HI   SP QS  S LRR
Subjt:  SLDNSTTVVGDRNETSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPS-QHIHEASPKQSYVSFLRR

Query:  STRATKQPTYLQDFHCHLA---SSSNLPP-----TPMRHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWSIVP
        STR  K PTYLQD+HC LA    S++ PP     TP  +PL   + Y  L+P+ + F LS+ A   P  +HQA    HW++AM  EL A+EANNTW++ P
Subjt:  STRATKQPTYLQDFHCHLA---SSSNLPP-----TPMRHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWSIVP

Query:  LPPKKHSIGCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLLGY
        LPP KH IGCKWVYK+K K++GS+ERYKARLVAK YTQ+EGLD+ ETFS VAK  T+R LL V  ++N SL QLDVNNAFLHGDL EEVYM L LG+
Subjt:  LPPKKHSIGCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLLGY

A0A2N9F124 Integrase catalytic domain-containing protein2.8e-7342.28Show/hide
Query:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKVTLEDELTNFLP-NFVLPKLFNVDTSLDHGISET
        +++FGCL +ASTLPSH+TKF  R  P VF+GYP   K Y+LF++   + F SRDV+FHE++F F           F P +F  P   + D S     S  
Subjt:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKVTLEDELTNFLP-NFVLPKLFNVDTSLDHGISET

Query:  FSLDNSTTVVGDRNETSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHIHEASPKQSYVSFLRR
        F +D       + +  S   P      T+ T  ++ P       PP  H        P ++  +    H     P   I+P+  I    P Q      R+
Subjt:  FSLDNSTTVVGDRNETSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHIHEASPKQSYVSFLRR

Query:  STRATKQPTYLQDFHCHLASSSNLPPTPMRHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWSIVPLPPKKHSI
        STR+T  P+YLQD+HCHLAS++  P   + +P+   + YS L+P+ + +T++I A   P++YH+AV   HW  AME EL A+EAN+TW +  LP  KH I
Subjt:  STRATKQPTYLQDFHCHLASSSNLPPTPMRHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWSIVPLPPKKHSI

Query:  GCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLLGYQPPKAS
        GCKWVYKIK K++GSIERYKARLVAK Y Q EG+D+ ETFS VAKLVT+R  + +  ++   L QLDVNNAFLHG+L EEV+M L  G++  + S
Subjt:  GCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLLGYQPPKAS

A0A2N9G1Y1 Integrase catalytic domain-containing protein6.0e-7645.57Show/hide
Query:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKVTLEDELTNFLPNFVLPKLFNVDTSLDHGISETF
        +++FGCLCFASTL SH+TKF  R    VF+GYP  +K Y+L  +   + F SRDV+FHE IF F   T   + T FL             S    IS T 
Subjt:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKVTLEDELTNFLPNFVLPKLFNVDTSLDHGISETF

Query:  SLDNSTTVVGDRNETSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPS-QHIHEASPKQSYVSFLRR
                           P  I + +++ +  +    + P  P    +  P   LP    F D S H   ++ + + +PS  HI   SP QS  S LRR
Subjt:  SLDNSTTVVGDRNETSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPS-QHIHEASPKQSYVSFLRR

Query:  STRATKQPTYLQDFHCHLA---SSSNLPPTPMR---HPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWSIVPLP
        STR  K  TYLQD+HC LA    S++ PPT      +PL   + Y  L+P+ + F LS+ A   P  +HQA    HW++AM  EL A+EANNTW++ PLP
Subjt:  STRATKQPTYLQDFHCHLA---SSSNLPPTPMR---HPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWSIVPLP

Query:  PKKHSIGCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLLGY
        P KH IGCKWVYK+K K++GS+ERYKARLVAK YTQ+EGLD+ ETFS VAK  T+R LL V  ++N SL QLDVNNAFLHGDL EEVYM L LG+
Subjt:  PKKHSIGCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLLGY

A0A5D3CZP1 Copia protein1.2e-7943.88Show/hide
Query:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKVTLEDELTNFLPNFVLPKLFNVDTSLDHGISETF
        +++FG LC+AS+LP +++KF  R  P VF+GYP  MKAY+L+ IE ++ F SRDVIFHE  F FH +       + LP F LPK F+      H      
Subjt:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKVTLEDELTNFLPNFVLPKLFNVDTSLDHGISETF

Query:  SLDNSTTVVGDRNETSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHIHEASPKQSYVSFLRRS
                      T+L  P ++  NT  T            + P+   ++ +D    + +  +    + D+  +     SQ   +  P  +  + +R+S
Subjt:  SLDNSTTVVGDRNETSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHIHEASPKQSYVSFLRRS

Query:  TRATKQPTYLQDFHCHLASSSNLPPT--PMRHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWSIVPLPPKKHS
        TR TK P+YLQ +HC L ++ + P T    ++P+  Y+ Y  L+P+++   L +       +YH+AV  + WR+AM+ EL AME N TWSIVPLP  K+S
Subjt:  TRATKQPTYLQDFHCHLASSSNLPPT--PMRHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWSIVPLPPKKHS

Query:  IGCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLLGYQP
        IGC+WVYKIKHK +GSIERYKARLVAK YTQ+EGLD+ ETFS VAK+VT++ LLT+ VS+   LVQLDVNNAFLHG+LFEEVYMDL LGY+P
Subjt:  IGCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLLGYQP

A0A6J1DNP7 uncharacterized protein LOC1110220651.1e-9348.39Show/hide
Query:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKVTLEDELTNFLPNFVLPKLFN-VDTSL----DH-
        +++FGCLCF ST P +++KF  R    VFVGYPP MK Y+L+ IE +RFF SRDVIFHE+IF FH V++   + +  P  V+PK ++ VDTS     DH 
Subjt:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKVTLEDELTNFLPNFVLPKLFN-VDTSL----DH-

Query:  ------GISETFSLDNSTTVVGDRNETSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHT--LQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHIHE
               +S T +  + T V+ D      G P+ +AN +    S I   +   + P  +    + P   L ++   VD +      +PS ++        
Subjt:  ------GISETFSLDNSTTVVGDRNETSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHT--LQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHIHE

Query:  ASPKQSYVSFLRRSTRATKQPTYLQDFHCHLASSSNLPPTPMRHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNT
                  LRRS+R  ++P+YL+D+HC L  +++   + + +PLQ Y+ Y+ L+ S++ F LS+  ++ PQ+YHQAVPF HWR+AM  EL AMEAN+T
Subjt:  ASPKQSYVSFLRRSTRATKQPTYLQDFHCHLASSSNLPPTPMRHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNT

Query:  WSIVPLPPKKHSIGCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLL
        WS+VPLP + HSIGCKW+YK+KHK++GSIERYKARLVAK YTQ+EGLD+IETFS VAKLVT++VLLT+ VS N +LVQLDVNNAFLHGDLFEEVYMDL L
Subjt:  WSIVPLPPKKHSIGCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLL

Query:  GYQ
        GY+
Subjt:  GYQ

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.2e-2225.56Show/hide
Query:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFL------FHKVTLEDELTNFLPNFVLPKLFNVDTSLDH
        +R+FG   +   + + Q KF  ++   +FVGY P    ++L+     +F  +RDV+  E   +      F  V L+D   +   NF       + T    
Subjt:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFL------FHKVTLEDELTNFLPNFVLPKLFNVDTSLDH

Query:  GISETFSLDNSTTVVGDRNETSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHIHEA----SPK
          +E+   DN   +   +   +   P + +   + TE   +  + D +Q          D   S   F+++S                H++E+    +P 
Subjt:  GISETFSLDNSTTVVGDRNETSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHIHEA----SPK

Query:  QSYVSFLRRSTR--ATKQPTYLQDFHCHLASSSNLPPTPMRHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWS
        +S  S      +      PT           S  L   P     +     +++  +  T    +  +F    Y        W +A+ TEL A + NNTW+
Subjt:  QSYVSFLRRSTR--ATKQPTYLQDFHCHLASSSNLPPTPMRHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWS

Query:  IVPLPPKKHSIGCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLLG
        I   P  K+ +  +WV+ +K+   G+  RYKARLVA+ +TQK  +D+ ETF+ VA++ + R +L++V+  NL + Q+DV  AFL+G L EE+YM L  G
Subjt:  IVPLPPKKHSIGCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLLG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.9e-2726.67Show/hide
Query:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKVTLEDELTNFLPNFV-LPKLFNVDTSLDHGISET
        +++FGC  FA      +TK   ++ P +F+GY      YRL+   +++   SRDV+F E+         E      +PNFV +P   N  T         
Subjt:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKVTLEDELTNFLPNFV-LPKLFNVDTSLDHGISET

Query:  FSLDNSTTVVGDRNETSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHIHEASPKQSYVSFLRR
         S +++T  V ++ E    QP       V+ +       ++ ++ PT    Q   L  SE   V+   +                    P   YV     
Subjt:  FSLDNSTTVVGDRNETSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHIHEASPKQSYVSFLRR

Query:  STRATKQPTYLQDFHCHLASSSNLPPTPMRHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWSIVPLPPKKHSI
             ++P  L++               + HP +N +                                    AM+ E+ +++ N T+ +V LP  K  +
Subjt:  STRATKQPTYLQDFHCHLASSSNLPPTPMRHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWSIVPLPPKKHSI

Query:  GCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLLGYQ
         CKWV+K+K   +  + RYKARLV K + QK+G+DF E FS V K+ +IR +L++  S +L + QLDV  AFLHGDL EE+YM+   G++
Subjt:  GCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLLGYQ

P92520 Uncharacterized mitochondrial protein AtMg008203.5e-1739.66Show/hide
Query:  SRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWSIVPLPPKKHSIGCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIET
        ++L P + + T++      P+    A+    W  AM+ EL A+  N TW +VP P  ++ +GCKWV+K K  ++G+++R KARLVAK + Q+EG+ F+ET
Subjt:  SRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWSIVPLPPKKHSIGCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIET

Query:  FSSVAKLVTIRVLLTV
        +S V +  TIR +L V
Subjt:  FSSVAKLVTIRVLLTV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.4e-3430.82Show/hide
Query:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLF--HKVTL-------EDELTNFLPNFVLPKLFNVDTS
        +R+FGC C+    P +Q K   ++   VF+GY     AY   H++  R + SR V F EN F F  +  TL        +    + P+  LP    V  +
Subjt:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLF--HKVTL-------EDELTNFLPNFVLPKLFNVDTS

Query:  LDHGISETFSLDNSTTVVGDRNE--TSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHIHEAS-
                 +   S+     RN   +S     S +++   +     P    P QP T  T        S+N   +  T+          +PSQ     S 
Subjt:  LDHGISETFSLDNSTTVVGDRNE--TSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHIHEAS-

Query:  PKQSYVSFLRRSTRATKQPTYLQDFHCHLASSSNLPPTPMRH---PLQNYIPYSRLAP------------------SFQTFTLSILANFMPQYYHQAVPF
        P QS  S    +T A+             +S+S  PP+ + H   PL   +  +  AP                     +  +S+ A   P+   QA+  
Subjt:  PKQSYVSFLRRSTRATKQPTYLQDFHCHLASSSNLPPTPMRH---PLQNYIPYSRLAP------------------SFQTFTLSILANFMPQYYHQAVPF

Query:  EHWRDAMETELTAMEANNTWSIVPLPPKKHSI-GCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLD
        E WR+AM +E+ A   N+TW +VP PP   +I GC+W++  K+ ++GS+ RYKARLVAK Y Q+ GLD+ ETFS V K  +IR++L V V ++  + QLD
Subjt:  EHWRDAMETELTAMEANNTWSIVPLPPKKHSI-GCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLD

Query:  VNNAFLHGDLFEEVYMDLLLGYQPP
        VNNAFL G L ++VYM      QPP
Subjt:  VNNAFLHGDLFEEVYMDLLLGYQPP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.4e-3429.81Show/hide
Query:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKV-----TLEDELTNFLPNFVLPKLFNVDTSLDHG
        +++FGC C+    P ++ K   ++    F+GY     AY   HI   R +TSR V F E  F F        T +++ ++  PN+  P    + T     
Subjt:  MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKV-----TLEDELTNFLPNFVLPKLFNVDTSLDHG

Query:  ISETFSLDNSTTVVGDRNETSLGQPVS--------IANNTVLTESTIQPPDLDPLQPP---TMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHI
           T  +  +   +G   +TS   P S        ++++ + + S   P   +P  P       T QP     S +     S   ++  P+ + +P+   
Subjt:  ISETFSLDNSTTVVGDRNETSLGQPVS--------IANNTVLTESTIQPPDLDPLQPP---TMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHI

Query:  HEASPKQSYVSFLRRSTRATKQPTYLQDFHCHLASSSNLPPTPMRHPLQNYIPYSRLAP------------------SFQTFTLSILANFMPQYYHQAVP
          +   QS +S    S       T + + +   +SS++ PP P   P    I  +  AP                     ++  S+ AN  P+   QA+ 
Subjt:  HEASPKQSYVSFLRRSTRATKQPTYLQDFHCHLASSSNLPPTPMRHPLQNYIPYSRLAP------------------SFQTFTLSILANFMPQYYHQAVP

Query:  FEHWRDAMETELTAMEANNTWSIVPLPPKKHSI-GCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQL
         + WR AM +E+ A   N+TW +VP PP   +I GC+W++  K  ++GS+ RYKARLVAK Y Q+ GLD+ ETFS V K  +IR++L V V ++  + QL
Subjt:  FEHWRDAMETELTAMEANNTWSIVPLPPKKHSI-GCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQL

Query:  DVNNAFLHGDLFEEVYMDLLLGYQPP
        DVNNAFL G L +EVYM      QPP
Subjt:  DVNNAFLHGDLFEEVYMDLLLGYQPP

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 85.2e-4845.79Show/hide
Query:  SCAINPSQHIHEASPKQSYVSFLRRSTRATKQPTYLQDFHCHLASSSNLPPTPMRHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAM
        S  I PS +I    P+ S    +  S R T++P YLQD++CH  +S  +      H +  ++ Y +++P + +F + I     P  Y++A  F  W  AM
Subjt:  SCAINPSQHIHEASPKQSYVSFLRRSTRATKQPTYLQDFHCHLASSSNLPPTPMRHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAM

Query:  ETELTAMEANNTWSIVPLPPKKHSIGCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHG
        + E+ AME  +TW I  LPP K  IGCKWVYKIK+ ++G+IERYKARLVAK YTQ+EG+DFIETFS V KL +++++L +    N +L QLD++NAFL+G
Subjt:  ETELTAMEANNTWSIVPLPPKKHSIGCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHG

Query:  DLFEEVYMDLLLGY
        DL EE+YM L  GY
Subjt:  DLFEEVYMDLLLGY

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.5e-1839.66Show/hide
Query:  SRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWSIVPLPPKKHSIGCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIET
        ++L P + + T++      P+    A+    W  AM+ EL A+  N TW +VP P  ++ +GCKWV+K K  ++G+++R KARLVAK + Q+EG+ F+ET
Subjt:  SRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWSIVPLPPKKHSIGCKWVYKIKHKANGSIERYKARLVAKRYTQKEGLDFIET

Query:  FSSVAKLVTIRVLLTV
        +S V +  TIR +L V
Subjt:  FSSVAKLVTIRVLLTV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTTTGTTTGGTTGTTTGTGTTTTGCCTCCACTTTGCCTTCACATCAGACAAAATTTTCACATCGTACCACTCCTGTTGTTTTTGTTGGCTATCCACCAAGGATGAA
GGCTTATCGCTTATTTCATATTGAGCAACGCCGTTTTTTTACCTCCCGTGATGTTATATTTCATGAGAATATCTTTCTCTTTCATAAGGTTACTTTGGAAGATGAGTTGA
CTAATTTTCTTCCGAATTTTGTTTTACCCAAGCTGTTTAATGTGGATACTTCTTTGGATCATGGGATTTCTGAAACTTTTTCTCTTGATAATTCTACAACAGTAGTGGGC
GACAGAAATGAGACTTCTTTGGGTCAACCAGTTTCTATTGCAAATAACACAGTTTTGACAGAATCCACCATACAACCACCTGATTTGGATCCCCTACAGCCACCCACCAT
GCATACACTACAACCTGATGACTTGTTGCCCTCTGAAAATGTGTTTGTTGATCAGTCGACACATCACCATGATGTTGAGCCTTCATGTGCCATTAATCCCTCACAACACA
TTCATGAAGCTTCTCCTAAGCAATCATATGTCTCTTTTCTTCGACGTTCAACACGTGCAACCAAGCAACCTACTTATTTGCAAGATTTTCATTGTCATCTTGCATCTTCT
TCAAATCTCCCTCCCACTCCAATGCGACATCCTCTTCAGAATTATATACCTTATTCAAGATTGGCCCCTTCTTTTCAGACCTTCACATTGAGTATTTTAGCCAACTTTAT
GCCTCAGTACTATCATCAGGCTGTTCCCTTTGAACATTGGCGTGACGCTATGGAGACCGAATTAACTGCAATGGAGGCAAACAATACTTGGTCTATTGTTCCTCTTCCAC
CCAAAAAGCATTCTATTGGCTGTAAATGGGTGTATAAAATTAAGCACAAGGCTAATGGATCAATTGAACGCTACAAAGCTCGCCTTGTTGCTAAAAGATATACCCAAAAA
GAAGGGTTGGACTTTATCGAAACCTTTTCTTCTGTTGCTAAACTTGTAACGATACGTGTTCTTCTTACTGTTGTTGTATCTCAAAATTTGTCTCTTGTTCAATTGGACGT
TAATAATGCCTTTCTCCATGGTGACTTATTTGAGGAGGTATACATGGATTTACTTTTGGGATATCAACCTCCTAAGGCTTCTATTGTTCAGGGGAGCGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCGTTTGTTTGGTTGTTTGTGTTTTGCCTCCACTTTGCCTTCACATCAGACAAAATTTTCACATCGTACCACTCCTGTTGTTTTTGTTGGCTATCCACCAAGGATGAA
GGCTTATCGCTTATTTCATATTGAGCAACGCCGTTTTTTTACCTCCCGTGATGTTATATTTCATGAGAATATCTTTCTCTTTCATAAGGTTACTTTGGAAGATGAGTTGA
CTAATTTTCTTCCGAATTTTGTTTTACCCAAGCTGTTTAATGTGGATACTTCTTTGGATCATGGGATTTCTGAAACTTTTTCTCTTGATAATTCTACAACAGTAGTGGGC
GACAGAAATGAGACTTCTTTGGGTCAACCAGTTTCTATTGCAAATAACACAGTTTTGACAGAATCCACCATACAACCACCTGATTTGGATCCCCTACAGCCACCCACCAT
GCATACACTACAACCTGATGACTTGTTGCCCTCTGAAAATGTGTTTGTTGATCAGTCGACACATCACCATGATGTTGAGCCTTCATGTGCCATTAATCCCTCACAACACA
TTCATGAAGCTTCTCCTAAGCAATCATATGTCTCTTTTCTTCGACGTTCAACACGTGCAACCAAGCAACCTACTTATTTGCAAGATTTTCATTGTCATCTTGCATCTTCT
TCAAATCTCCCTCCCACTCCAATGCGACATCCTCTTCAGAATTATATACCTTATTCAAGATTGGCCCCTTCTTTTCAGACCTTCACATTGAGTATTTTAGCCAACTTTAT
GCCTCAGTACTATCATCAGGCTGTTCCCTTTGAACATTGGCGTGACGCTATGGAGACCGAATTAACTGCAATGGAGGCAAACAATACTTGGTCTATTGTTCCTCTTCCAC
CCAAAAAGCATTCTATTGGCTGTAAATGGGTGTATAAAATTAAGCACAAGGCTAATGGATCAATTGAACGCTACAAAGCTCGCCTTGTTGCTAAAAGATATACCCAAAAA
GAAGGGTTGGACTTTATCGAAACCTTTTCTTCTGTTGCTAAACTTGTAACGATACGTGTTCTTCTTACTGTTGTTGTATCTCAAAATTTGTCTCTTGTTCAATTGGACGT
TAATAATGCCTTTCTCCATGGTGACTTATTTGAGGAGGTATACATGGATTTACTTTTGGGATATCAACCTCCTAAGGCTTCTATTGTTCAGGGGAGCGTTTAG
Protein sequenceShow/hide protein sequence
MRLFGCLCFASTLPSHQTKFSHRTTPVVFVGYPPRMKAYRLFHIEQRRFFTSRDVIFHENIFLFHKVTLEDELTNFLPNFVLPKLFNVDTSLDHGISETFSLDNSTTVVG
DRNETSLGQPVSIANNTVLTESTIQPPDLDPLQPPTMHTLQPDDLLPSENVFVDQSTHHHDVEPSCAINPSQHIHEASPKQSYVSFLRRSTRATKQPTYLQDFHCHLASS
SNLPPTPMRHPLQNYIPYSRLAPSFQTFTLSILANFMPQYYHQAVPFEHWRDAMETELTAMEANNTWSIVPLPPKKHSIGCKWVYKIKHKANGSIERYKARLVAKRYTQK
EGLDFIETFSSVAKLVTIRVLLTVVVSQNLSLVQLDVNNAFLHGDLFEEVYMDLLLGYQPPKASIVQGSV