CuGenDBv2

Lag0018788 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0018788
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE2
Genome location: chr5:34361140..34366937
RNA-Seq Expression: Lag0018788
Synteny: Lag0018788
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR025724 - GAG-pre-integrase domain
  IPR036397 - Ribonuclease H superfamily
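The genome location string can be split into its parts to get the length of the gene span. The following is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr5:34361140..34366937")
length = span_length(start, end)
print(chrom, start, end, length)
```

Under the inclusive-coordinate assumption the span covers 5798 bp on chr5.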


Homology
GenBank top hits | e value | % identity
KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa] | 6.0e-105 | 57.83
Query:  MEMFNSHLQAAKTEPITVASAVSHATGICLLASSSVKSLGDCWIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPDLVLRDVLF
        M + N+HLQAA T PIT A+A++H +GI  L S + +S  D WIIDSGASRHICH ++LF NW   + + V+LP  + + VD +GDI+I+  L L+DVLF
Subjt:  MEMFNSHLQAAKTEPITVASAVSHATGICLLASSSVKSLGDCWIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPDLVLRDVLF

Query:  IPEFAYNLVSVSCLLQTGLFSVLFADSHCLIQDKRLLRVIGRADCRHGLYILSDSA-ASVASFTAATGSVPASTWHLRLGHISHKRLASLKDTLCFKDSV
        + +FAYNL+SVSCLL T   S+ F  + C+IQD     +IG+A C++GLY+L+  A  +  +      ++   TWH RLGH+S K L+SL  TLC  +  
Subjt:  IPEFAYNLVSVSCLLQTGLFSVLFADSHCLIQDKRLLRVIGRADCRHGLYILSDSA-ASVASFTAATGSVPASTWHLRLGHISHKRLASLKDTLCFKDSV

Query:  SHNKPCDVCPLAKQKKLSFTSNNHVASNVFDLLHADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDN
         H+  C VCPLAKQK+LSF SNN+VAS+ FDL+HADIWGPF  P+Y GY+YFLTLVDD  RFTWVY+LRQKSDVL I+P+FF++IETQFSKVIK FRSDN
Subjt:  SHNKPCDVCPLAKQKKLSFTSNNHVASNVFDLLHADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDN

Query:  APEFRFTKFFASTGTLHQFSCVERPQQNAVVE
        APE + T+FFA  GT+HQFSCVE+PQQN+VVE
Subjt:  APEFRFTKFFASTGTLHQFSCVERPQQNAVVE

KZV37633.1 hypothetical protein F511_38248 [Dorcoceras hygrometricum] | 2.3e-72 | 45.43
Query:  NSHLQAAKTEPITVASAVSHATGICLLASSSVKSLGDCWIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPDLVLRDVLFIPEF
        N H++  + EP      +S  TGIC   S +       W++D+GA+ HIC S ++FH+ R +    ++LP   T+ V  +G + ++ DL+L DVL++PEF
Subjt:  NSHLQAAKTEPITVASAVSHATGICLLASSSVKSLGDCWIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPDLVLRDVLFIPEF

Query:  AYNLVSVSCLLQTGLFSVLFADSHCLIQDKRLLRVIGRADCRHGLYILSDSAASVASFTAATGSVPAS-TWHLRLGHISHKRLASLKDTLCFKDSVSHNK
         +NL+S+S L      SV F  + C IQD +  ++IG       LY+L  S + V +      SVP S  WHLR+GH S  +L+SLKD L F  +     
Subjt:  AYNLVSVSCLLQTGLFSVLFADSHCLIQDKRLLRVIGRADCRHGLYILSDSAASVASFTAATGSVPAS-TWHLRLGHISHKRLASLKDTLCFKDSVSHNK

Query:  PCDVCPLAKQKKLSFTSNNHVASNVFDLLHADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEF
        PC VC L+KQK+L F SNN +  + F+LLH D+WGPF   + +GYR+FLT+VDD +RFTWVYLLR KSDV +I P F +M++TQF   IK  RSDNAPE 
Subjt:  PCDVCPLAKQKKLSFTSNNHVASNVFDLLHADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEF

Query:  RFTKFFASTGTLHQFSCVERPQQNAVVE
         F  FF   G +H +SCVERPQQN+VVE
Subjt:  RFTKFFASTGTLHQFSCVERPQQNAVVE

RVW82526.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | 1.1e-71 | 45.71
Query:  AVSHATGICLLASSSVKSLGDCWIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPDLVLRDVLFIPEFAYNLVSVSCLLQTGLF
        ++S+ TGI  L+ SS       WI+DSGA+ H+C + ++FH+        V LPT   + +  +G I +SP LVL  VL+IP F +NL+S+S L QT  F
Subjt:  AVSHATGICLLASSSVKSLGDCWIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPDLVLRDVLFIPEFAYNLVSVSCLLQTGLF

Query:  SVLFADSHCLIQDKRLLRVIGRADCRHGLYILSDSA-ASVASFTAATGSVPA---STWHLRLGHISHKRLASLKDTLCFKDSVSHNKPCDVCPLAKQKKL
        S  F    C IQD    ++IG    +  LY+L  S   S++S      +  A     WH RL H S+ +L+ LK  L  + + + N  C +CPLAKQK+L
Subjt:  SVLFADSHCLIQDKRLLRVIGRADCRHGLYILSDSA-ASVASFTAATGSVPA---STWHLRLGHISHKRLASLKDTLCFKDSVSHNKPCDVCPLAKQKKL

Query:  SFTSNNHVASNVFDLLHADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEFRFTKFFASTGTLH
         F  +N+++S+ FDL+H DIWGPFH PT++G+RYFLT+VDD +R TWV+LLR KSDV TI P+FF M++T+F   IK  RSDNAPE   +  F     LH
Subjt:  SFTSNNHVASNVFDLLHADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEFRFTKFFASTGTLH

Query:  QFSCVERPQQNAVVE
         FSCVE PQQN+VVE
Subjt:  QFSCVERPQQNAVVE

XP_012856897.1 PREDICTED: uncharacterized protein LOC105976150 [Erythranthe guttata] | 2.2e-70 | 47.3
Query:  VSHATGICLLASSSVKSLGDCWIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPDLVLRDVLFIPEFAYNLVSVSCLLQTGLFS
        +S  +GICL AS         WIIDSGASRHIC+ + LF +  K++   V+LP    V V+++GD+ +S DL+L++V ++P F +NL+SVS LL     +
Subjt:  VSHATGICLLASSSVKSLGDCWIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPDLVLRDVLFIPEFAYNLVSVSCLLQTGLFS

Query:  VLFADSHCLIQDKRLLRVIGRADCRHGLYILSDSAASVASFTAATGSVPASTWHLRLGHISHKRLASLKDTLCFKDSVSHNKP-----CDVCPLAKQKKL
        V+F  +  LIQDK  L+ IG+                      +   V A+ WH RLGHI   +L    D L  K S++ +KP     C +CP+AKQK+L
Subjt:  VLFADSHCLIQDKRLLRVIGRADCRHGLYILSDSAASVASFTAATGSVPASTWHLRLGHISHKRLASLKDTLCFKDSVSHNKP-----CDVCPLAKQKKL

Query:  SFTSNNHVASNVFDLLHADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEFRFTKFFASTGTLH
         F  ++ V+S++FDL+H DIWGP+   ++NGY+YF+TLVDD SRFTWV+LL+ KSDVLT IP FF M++TQF+  IK FRSDNA E +FT+ F+  G LH
Subjt:  SFTSNNHVASNVFDLLHADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEFRFTKFFASTGTLH

Query:  QFSCVERPQQNAVVE
        QFSCV  PQQNAVVE
Subjt:  QFSCVERPQQNAVVE

XP_012857659.1 PREDICTED: uncharacterized protein LOC105976934 [Erythranthe guttata] | 7.6e-76 | 47.83
Query:  KTEPITVASAVSHATGICLLASSSVKS-LGDCWIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPDLVLRDVLFIPEFAYNLVS
        K   I   S +S  TGICL  +    S +   WI+DSGASRHICH+++LF N + +    VVLP +  V V+ +GD++++  LVL +V ++PEF +NLVS
Subjt:  KTEPITVASAVSHATGICLLASSSVKS-LGDCWIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPDLVLRDVLFIPEFAYNLVS

Query:  VSCLLQTGLFSVLFADSHCLIQDKRLLRVIGRADCRHGLYILSDSAASVASFTAATGSVPASTWHLRLGHISHKRLASLKDTLCFK-DSVSHNKPCDVCP
        VS LL    + V+F +    IQD RL+  IG+ +   GLY+L   +AS     A    + A+ WH RLGHI   +LA L        D +S +  C VCP
Subjt:  VSCLLQTGLFSVLFADSHCLIQDKRLLRVIGRADCRHGLYILSDSAASVASFTAATGSVPASTWHLRLGHISHKRLASLKDTLCFK-DSVSHNKPCDVCP

Query:  LAKQKKLSFTSNNHVASNVFDLLHADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEFRFTKFF
        LAKQK+L F++++ V++ +FDL+H DIWGPF  P+Y+G+ YF+TLVDD SRFTWV+LL+ KS+V+T++PRF KM+  QF K IK FRSDNA E +F   F
Subjt:  LAKQKKLSFTSNNHVASNVFDLLHADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEFRFTKFF

Query:  ASTGTLHQFSCVERPQQNAVVE
           G +HQFSCV  PQQNA+VE
Subjt:  ASTGTLHQFSCVERPQQNAVVE

TrEMBL top hits | e value | % identity
A0A2Z7BSS5 Integrase catalytic domain-containing protein | 1.1e-72 | 45.43
Query:  NSHLQAAKTEPITVASAVSHATGICLLASSSVKSLGDCWIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPDLVLRDVLFIPEF
        N H++  + EP      +S  TGIC   S +       W++D+GA+ HIC S ++FH+ R +    ++LP   T+ V  +G + ++ DL+L DVL++PEF
Subjt:  NSHLQAAKTEPITVASAVSHATGICLLASSSVKSLGDCWIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPDLVLRDVLFIPEF

Query:  AYNLVSVSCLLQTGLFSVLFADSHCLIQDKRLLRVIGRADCRHGLYILSDSAASVASFTAATGSVPAS-TWHLRLGHISHKRLASLKDTLCFKDSVSHNK
         +NL+S+S L      SV F  + C IQD +  ++IG       LY+L  S + V +      SVP S  WHLR+GH S  +L+SLKD L F  +     
Subjt:  AYNLVSVSCLLQTGLFSVLFADSHCLIQDKRLLRVIGRADCRHGLYILSDSAASVASFTAATGSVPAS-TWHLRLGHISHKRLASLKDTLCFKDSVSHNK

Query:  PCDVCPLAKQKKLSFTSNNHVASNVFDLLHADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEF
        PC VC L+KQK+L F SNN +  + F+LLH D+WGPF   + +GYR+FLT+VDD +RFTWVYLLR KSDV +I P F +M++TQF   IK  RSDNAPE 
Subjt:  PCDVCPLAKQKKLSFTSNNHVASNVFDLLHADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEF

Query:  RFTKFFASTGTLHQFSCVERPQQNAVVE
         F  FF   G +H +SCVERPQQN+VVE
Subjt:  RFTKFFASTGTLHQFSCVERPQQNAVVE

A0A2Z7BVC8 Integrase catalytic domain-containing protein (Fragment) | 2.4e-67 | 43.38
Query:  HLQAAKTEPITVASAVSHATGICLLASSSVKSLGDCWIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPDLVLRDVLFIPEFAY
        H    + EP    S +S  TGIC   S         W++D+GA+ HIC S +LFH+ R ++   + LP  +T+ V  VG + ++ DL+L DVL++P F +
Subjt:  HLQAAKTEPITVASAVSHATGICLLASSSVKSLGDCWIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPDLVLRDVLFIPEFAY

Query:  NLVSVSCLLQTGLFSVLFADSHCLIQDKRLLRVIGRADCRHGLYILSDSAASVASFTAATGSVPASTWHLRLGHISHKRLASLKDTLCFKDSVSHNKPCD
        NL+SVS L     +SV F    C IQD +  R+IG       LY+L  S + + + + +   +    WHLR+GH S  RL+ LKD + F        PC 
Subjt:  NLVSVSCLLQTGLFSVLFADSHCLIQDKRLLRVIGRADCRHGLYILSDSAASVASFTAATGSVPASTWHLRLGHISHKRLASLKDTLCFKDSVSHNKPCD

Query:  VCPLAKQKKLSFTSNNHVASNVFDLLHADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEFRFT
        VC L+KQK+L F S+N + +  F+LLH DIWGPF     +GYR+FLT+VDD +RFTWVY+L  KS+V +I P F +MI T+F   IK  RSDNAPE  F 
Subjt:  VCPLAKQKKLSFTSNNHVASNVFDLLHADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEFRFT

Query:  KFFASTGTLHQFSCVERPQQNAVVE
          F   G +H  SCVE PQQN+VVE
Subjt:  KFFASTGTLHQFSCVERPQQNAVVE

A0A438HDI8 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 5.5e-72 | 45.71
Query:  AVSHATGICLLASSSVKSLGDCWIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPDLVLRDVLFIPEFAYNLVSVSCLLQTGLF
        ++S+ TGI  L+ SS       WI+DSGA+ H+C + ++FH+        V LPT   + +  +G I +SP LVL  VL+IP F +NL+S+S L QT  F
Subjt:  AVSHATGICLLASSSVKSLGDCWIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPDLVLRDVLFIPEFAYNLVSVSCLLQTGLF

Query:  SVLFADSHCLIQDKRLLRVIGRADCRHGLYILSDSA-ASVASFTAATGSVPA---STWHLRLGHISHKRLASLKDTLCFKDSVSHNKPCDVCPLAKQKKL
        S  F    C IQD    ++IG    +  LY+L  S   S++S      +  A     WH RL H S+ +L+ LK  L  + + + N  C +CPLAKQK+L
Subjt:  SVLFADSHCLIQDKRLLRVIGRADCRHGLYILSDSA-ASVASFTAATGSVPA---STWHLRLGHISHKRLASLKDTLCFKDSVSHNKPCDVCPLAKQKKL

Query:  SFTSNNHVASNVFDLLHADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEFRFTKFFASTGTLH
         F  +N+++S+ FDL+H DIWGPFH PT++G+RYFLT+VDD +R TWV+LLR KSDV TI P+FF M++T+F   IK  RSDNAPE   +  F     LH
Subjt:  SFTSNNHVASNVFDLLHADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEFRFTKFFASTGTLH

Query:  QFSCVERPQQNAVVE
         FSCVE PQQN+VVE
Subjt:  QFSCVERPQQNAVVE

A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 8 | 2.9e-105 | 57.83
Query:  MEMFNSHLQAAKTEPITVASAVSHATGICLLASSSVKSLGDCWIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPDLVLRDVLF
        M + N+HLQAA T PIT A+A++H +GI  L S + +S  D WIIDSGASRHICH ++LF NW   + + V+LP  + + VD +GDI+I+  L L+DVLF
Subjt:  MEMFNSHLQAAKTEPITVASAVSHATGICLLASSSVKSLGDCWIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPDLVLRDVLF

Query:  IPEFAYNLVSVSCLLQTGLFSVLFADSHCLIQDKRLLRVIGRADCRHGLYILSDSA-ASVASFTAATGSVPASTWHLRLGHISHKRLASLKDTLCFKDSV
        + +FAYNL+SVSCLL T   S+ F  + C+IQD     +IG+A C++GLY+L+  A  +  +      ++   TWH RLGH+S K L+SL  TLC  +  
Subjt:  IPEFAYNLVSVSCLLQTGLFSVLFADSHCLIQDKRLLRVIGRADCRHGLYILSDSA-ASVASFTAATGSVPASTWHLRLGHISHKRLASLKDTLCFKDSV

Query:  SHNKPCDVCPLAKQKKLSFTSNNHVASNVFDLLHADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDN
         H+  C VCPLAKQK+LSF SNN+VAS+ FDL+HADIWGPF  P+Y GY+YFLTLVDD  RFTWVY+LRQKSDVL I+P+FF++IETQFSKVIK FRSDN
Subjt:  SHNKPCDVCPLAKQKKLSFTSNNHVASNVFDLLHADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDN

Query:  APEFRFTKFFASTGTLHQFSCVERPQQNAVVE
        APE + T+FFA  GT+HQFSCVE+PQQN+VVE
Subjt:  APEFRFTKFFASTGTLHQFSCVERPQQNAVVE

A0A6D2HNE3 Uncharacterized protein | 1.2e-66 | 46.64
Query:  DCWIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPDLVLRDVLFIPEFAYNLVSVSCLLQTGLFSVLFADSHCLIQDKRLLRVI
        + WIIDSGAS H+C   ALF     +    V LP    V +     + IS  L+L +VL +P+F +NL+SVSCL++T   S  F  + CLIQ+     +I
Subjt:  DCWIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPDLVLRDVLFIPEFAYNLVSVSCLLQTGLFSVLFADSHCLIQDKRLLRVI

Query:  GRADCRHGLYILSD------SAASVASFTAATGSVPASTWHLRLGHISHKRLASLKDTL-CFKDSVSHNKPCDVCPLAKQKKLSFTSNNHVASNVFDLLH
        GRA   H LYIL +      SA+S  S    +     S WH RLGH S   L  LK  L  FKD  S    C VCPLAKQ++L++ S+N++AS  FDL+H
Subjt:  GRADCRHGLYILSD------SAASVASFTAATGSVPASTWHLRLGHISHKRLASLKDTL-CFKDSVSHNKPCDVCPLAKQKKLSFTSNNHVASNVFDLLH

Query:  ADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEFRFTKFFASTGTLHQFSCVERPQQNAVVE
         DIWGPF   +  GYRYFLT+VDD +R TW+Y+LR KSDV T+ P F  +I TQ++  +K  RSDNAPE  FT      G +HQ SC   PQQN+VVE
Subjt:  ADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEFRFTKFFASTGTLHQFSCVERPQQNAVVE

SwissProt top hits | e value | % identity
P04146 Copia protein | 4.0e-19 | 27.68
Query:  VASAVSHATGICLLASSSVKSLGDC-WIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCV--DFV-----GDIRISPD--LVLRDVLFIPEFAYN
        V +A SH     +   ++   + +C +++DSGAS H+ +  +L+      D + VV P    V    +F+     G +R+  D  + L DVLF  E A N
Subjt:  VASAVSHATGICLLASSSVKSLGDC-WIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCV--DFV-----GDIRISPD--LVLRDVLFIPEFAYN

Query:  LVSVSCLLQTGLFSVLFADSHCLIQDKRLLRVIGRADCRHGLYILSDSAASVASFTAATGSVPASTWHLRLGHISHKRLASLKDTLCFKD-SVSHN----
        L+SV  L + G+ S+ F  S   I  K  L V+  +   + + +++  A S+     A        WH R GHIS  +L  +K    F D S+ +N    
Subjt:  LVSVSCLLQTGLFSVLFADSHCLIQDKRLLRVIGRADCRHGLYILSDSAASVASFTAATGSVPASTWHLRLGHISHKRLASLKDTLCFKD-SVSHN----

Query:  -KPCDVCPLAKQKKLSF---TSNNHVASNVFDLLHADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSD
         + C+ C   KQ +L F       H+   +F ++H+D+ GP    T +   YF+  VD  + +   YL++ KSDV ++   F    E  F+  +     D
Subjt:  -KPCDVCPLAKQKKLSF---TSNNHVASNVFDLLHADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSD

Query:  NAPEF---RFTKFFASTGTLHQFSCVERPQQNAVVE
        N  E+      +F    G  +  +    PQ N V E
Subjt:  NAPEF---RFTKFFASTGTLHQFSCVERPQQNAVVE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.4e-32 | 31.31
Query:  WIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPD----LVLRDVLFIPEFAYNLVSVSCLLQTGLFSVLFADSHCLIQDKRLLR
        W++D+ AS H    R LF  +   D   V +       +  +GDI I  +    LVL+DV  +P+   NL+S   L + G +   FA+    +    L  
Subjt:  WIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPD----LVLRDVLFIPEFAYNLVSVSCLLQTGLFSVLFADSHCLIQDKRLLR

Query:  VIGRADCRHGLYILSDSAASVASFTAATGSVPASTWHLRLGHISHKRLASL-KDTLCFKDSVSHNKPCDVCPLAKQKKLSFTSNNHVASNVFDLLHADIW
        VI +   R  LY  +++        AA   +    WH R+GH+S K L  L K +L      +  KPCD C   KQ ++SF +++    N+ DL+++D+ 
Subjt:  VIGRADCRHGLYILSDSAASVASFTAATGSVPASTWHLRLGHISHKRLASL-KDTLCFKDSVSHNKPCDVCPLAKQKKLSFTSNNHVASNVFDLLHADIW

Query:  GPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEF---RFTKFFASTGTLHQFSCVERPQQNAVVE
        GP    +  G +YF+T +DDASR  WVY+L+ K  V  +  +F  ++E +  + +K  RSDN  E+    F ++ +S G  H+ +    PQ N V E
Subjt:  GPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEF---RFTKFFASTGTLHQFSCVERPQQNAVVE

Q12491 Transposon Ty2-B Gag-Pol polyprotein | 1.7e-09 | 21.84
Query:  LGDCWIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPDLVLR---DVLFIPEFAYNLVSVSCLLQTGLFSVLFADSHCLIQDKR
        L D  +IDSGAS+ +  S    H+      I +V      + ++ +G++  +     +     L  P  AY+L+S+S L    + +  F  +     D  
Subjt:  LGDCWIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPDLVLR---DVLFIPEFAYNLVSVSCLLQTGLFSVLFADSHCLIQDKR

Query:  LLRVIGRADCRHGLYI-----------LSDSAASVASFTAATGSVPASTWHLRLGHISHKRL-ASLKDTLCFKDSVSHNKPCDV------------CPLA
        +L  I     +HG +            +S    +  + + +    P    H  LGH + + +  SLK     K++V++ K  D+            C + 
Subjt:  LLRVIGRADCRHGLYI-----------LSDSAASVASFTAATGSVPASTWHLRLGHISHKRL-ASLKDTLCFKDSVSHNKPCDV------------CPLA

Query:  KQKKLSFTSNNHV----ASNVFDLLHADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLL--RQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEF--
        K  K      + +    +   F  LH DI+GP H    +   YF++  D+ +RF WVY L  R++  +L +       I+ QF+  +   + D   E+  
Subjt:  KQKKLSFTSNNHV----ASNVFDLLHADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLL--RQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEF--

Query:  -RFTKFFASTGTLHQFSCVERPQQNAVVELNQLSQYHSTPTNIHSQSL
            KFF + G    ++     + + V E    +  +   T +H   L
Subjt:  -RFTKFFASTGTLHQFSCVERPQQNAVVELNQLSQYHSTPTNIHSQSL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 4.3e-21 | 25.93
Query:  WIIDSGASRHICHSRALFHNWRKIDPIC----VVLPTAYTVCVDFVGDIRISP---DLVLRDVLFIPEFAYNLVSVSCLLQTGLFSVLFADSHCLIQDKR
        W++DSGA+ HI      F+N     P      V++    T+ +   G   +S     L L ++L++P    NL+SV  L      SV F  +   ++D  
Subjt:  WIIDSGASRHICHSRALFHNWRKIDPIC----VVLPTAYTVCVDFVGDIRISP---DLVLRDVLFIPEFAYNLVSVSCLLQTGLFSVLFADSHCLIQDKR

Query:  LLRVIGRADCRHGLYILS-DSAASVASFTAATGSVPASTWHLRLGHISHKRLASLKDTLCFKDSVSHNK--PCDVCPLAKQKKLSFTSNNHVASNVFDLL
            + +   +  LY     S+  V+ F + +     S+WH RLGH +   L S+            +K   C  C + K  K+ F+ +   ++   + +
Subjt:  LLRVIGRADCRHGLYILS-DSAASVASFTAATGSVPASTWHLRLGHISHKRLASLKDTLCFKDSVSHNK--PCDVCPLAKQKKLSFTSNNHVASNVFDLL

Query:  HADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEF-RFTKFFASTGTLHQFSCVERPQQNAVVE
        ++D+W      +++ YRY++  VD  +R+TW+Y L+QKS V      F  ++E +F   I  F SDN  EF    ++F+  G  H  S    P+ N    
Subjt:  HADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEF-RFTKFFASTGTLHQFSCVERPQQNAVVE

Query:  LNQLSQYHSTPTNIHSQSLKRVPR
        L++    H   T +   S   +P+
Subjt:  LNQLSQYHSTPTNIHSQSLKRVPR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 3.6e-20 | 27.33
Query:  WIIDSGASRHICHSRALFHNWRKIDPIC----VVLPTAYTVCVDFVGDIRI---SPDLVLRDVLFIPEFAYNLVSVSCLLQTGLFSVLFADSHCLIQDKR
        W++DSGA+ HI      F+N     P      V++    T+ +   G   +   S  L L  VL++P    NL+SV  L  T   SV F  +   ++D  
Subjt:  WIIDSGASRHICHSRALFHNWRKIDPIC----VVLPTAYTVCVDFVGDIRI---SPDLVLRDVLFIPEFAYNLVSVSCLLQTGLFSVLFADSHCLIQDKR

Query:  LLRVIGRADCRHGLYILS-DSAASVASFTAATGSVPASTWHLRLGHISHKRLASL--KDTLCFKDSVSHNKPCDVCPLAKQKKLSFTSNNHVASNVFDLL
            + +   +  LY     S+ +V+ F +       S+WH RLGH S   L S+    +L   +       C  C + K  K+ F+++   +S   + +
Subjt:  LLRVIGRADCRHGLYILS-DSAASVASFTAATGSVPASTWHLRLGHISHKRLASL--KDTLCFKDSVSHNKPCDVCPLAKQKKLSFTSNNHVASNVFDLL

Query:  HADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEFRFTK-FFASTGTLHQFSCVERPQQNAVVE
        ++D+W      + + YRY++  VD  +R+TW+Y L+QKS V      F  ++E +F   I    SDN  EF   + + +  G  H  S    P+ N + E
Subjt:  HADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEFRFTK-FFASTGTLHQFSCVERPQQNAVVE

Arabidopsis top hits | e value | % identity
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | 8.9e-06 | 31.52
Query:  LNQLSQYHSTPTNIHSQSLKRVPRFLKETVDMRLNIQPVSKFSITAYSNADWANDSRGLKTSHGICTNLGETLGSSWSLKKQNVVSKFSESA
        +N+LSQ+   P   H Q++ ++  ++K TV   L     ++  +  +S+A + +     ++++G C  LG +L  SW  KKQ VVSK S  A
Subjt:  LNQLSQYHSTPTNIHSQSLKRVPRFLKETVDMRLNIQPVSKFSITAYSNADWANDSRGLKTSHGICTNLGETLGSSWSLKKQNVVSKFSESA

ATMG00810.1 DNA/RNA polymerases superfamily protein | 4.0e-06 | 34.62
Query:  QFSCVERPQQNAVVELNQLSQYHSTPTNIHSQSLKRVPRFLKETVDMRLNIQPVSKFSITAYSNADWANDSRGLKTSHGICTNLGETLGSSWSLKKQNVV
        Q+  + RP  +  V  N + Q    PT      LKRV R++K T+   L I   SK ++ A+ ++DWA  +   +++ G CT LG  +  SWS K+Q  V
Subjt:  QFSCVERPQQNAVVELNQLSQYHSTPTNIHSQSLKRVPRFLKETVDMRLNIQPVSKFSITAYSNADWANDSRGLKTSHGICTNLGETLGSSWSLKKQNVV

Query:  SKFS
        S+ S
Subjt:  SKFS
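In the alignments above, the unlabeled row between each Query and Subjt row is the BLAST midline: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal sketch of counting identities and positives from one midline segment. The function name `midline_stats` is hypothetical, and the example uses only the final segment of the KAA0065480.1 alignment, so its percentage describes that segment alone and is not expected to equal the whole-alignment % identity reported in the table.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one BLAST midline segment.

    Letters are identities, '+' are conservative substitutions, and
    spaces are mismatches or gaps. Positives include identities.
    """
    columns = len(midline)
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, columns


# Midline of the last segment of the KAA0065480.1 alignment (32 columns).
midline = "APE + T+FFA  GT+HQFSCVE+PQQN+VVE"
identities, positives, columns = midline_stats(midline)
percent_identity = 100.0 * identities / columns
print(identities, positives, columns, round(percent_identity, 2))
```

To score a full alignment this way, the midline segments would be concatenated with leading padding removed and internal spaces preserved, since each space is an alignment column.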


Sequences
CDS sequence
ATGGAAATGTTCAATTCTCACCTACAAGCCGCAAAAACAGAACCTATTACCGTGGCTTCAGCTGTGTCACATGCTACAGGTATTTGCCTTTTAGCCTCTTCTTCGGTAAA
ATCTCTTGGTGATTGTTGGATTATCGATTCTGGCGCGTCCCGACATATTTGTCATTCTCGTGCCTTGTTTCATAATTGGCGTAAAATTGACCCTATTTGTGTTGTTCTTC
CAACTGCCTACACAGTATGTGTGGATTTTGTGGGTGACATTCGTATATCCCCTGACTTAGTCCTTCGAGATGTTCTCTTTATTCCTGAGTTTGCTTACAACTTGGTTTCG
GTGAGCTGCTTGCTTCAAACAGGTCTTTTTTCTGTTCTGTTTGCTGATTCTCATTGCCTGATACAGGACAAGCGGCTATTGAGAGTGATTGGCAGGGCTGATTGTCGTCA
TGGCCTTTACATTCTTTCTGATTCTGCAGCTTCTGTGGCTTCCTTTACTGCTGCCACAGGTTCTGTTCCTGCTTCTACATGGCATCTCCGCTTAGGACATATATCACATA
AACGACTAGCTTCTTTGAAGGACACTTTATGTTTTAAGGACTCTGTTTCTCATAATAAACCATGTGACGTTTGTCCTCTTGCTAAACAAAAGAAGCTGTCATTTACATCA
AACAATCATGTTGCTTCAAATGTTTTTGATTTGTTGCACGCCGATATTTGGGGCCCTTTTCATACACCCACATATAATGGATATCGATACTTTCTGACTTTGGTGGATGA
TGCTTCTAGATTTACCTGGGTCTATTTACTGCGTCAAAAGTCAGATGTCCTCACTATTATCCCTAGATTTTTCAAAATGATTGAAACACAATTCTCAAAGGTCATAAAGT
GTTTTCGATCGGATAATGCCCCAGAATTCAGGTTTACTAAATTTTTTGCATCCACTGGGACTTTACACCAGTTTTCTTGCGTTGAGCGTCCTCAACAAAATGCAGTGGTT
GAGCTGAATCAGCTCAGCCAATATCATTCAACTCCCACAAATATTCACAGTCAATCCCTAAAACGAGTTCCTAGGTTCCTAAAAGAAACAGTGGATATGAGATTGAACAT
ACAACCAGTCAGCAAGTTCTCAATCACAGCCTATTCCAATGCAGATTGGGCAAATGACTCAAGAGGATTGAAAACCAGTCATGGAATTTGTACAAACCTCGGCGAAACGT
TGGGGTCGTCGTGGTCATTGAAGAAACAAAATGTAGTCTCCAAATTCAGTGAATCCGCCATTGCAAGTAGCCTGTGTACCATGCAAGAACGAAACATTCAAAATCAATTA
ACGGTTTGCGAGAGACAAATTGTTGAGGAAGCAGCTGCTGGAAATTCGTTACATGGCGATTGGCGATTCCGACGACCCCAATGGCTCCGGCGGTTCCTTCTCTCTCGTTC
GACTCCGACAACTCCAACGGCTCCGGCGTGTCCAGAAACTCTGCGGCGTCTTCATCCCCGTTCTAGCAGCTCCGACTCAACGACGTCTTCATCCCGTGGATTTTTCAGGC
CGTTTCTAGCAGCTCCGACGACTCCACCAATCAAGATTTTTCGAAGAGGGGAGAGGGTTTTTTGGAGAGTTCTCGAAGATGGAGAGGGTTTTACTGGTTGGTATATTCGC
CGGCGTCGGAGAAGACGAGTTTTCCCGGTAGATTCCCGGTCGAGATCGAAAACCGGGAAATACGGTTCCGTAAGCTTTGAACCGATTTCCTACGAGAAAACCGGTTCAAA
CCTTACAGGATTCCCGAAGAGATTGACTGAAACCCAATTAGATATGTTTAGGCAAACCGTATTTGGCCCTATATTAGACAGCAACATATTATTTAATGGTCAGTTAATCC
ACCATCTACTACTTAGGGAGGTTGAGGATCCCAGGAAGGATGTAATTAGTTTCGATATATTTGGAAATAAGGTGTCGTTTGACAAGGAAGAATTCGATCTAATCACTGGA
TTTAGACACAATAGGAGGATAGTTGATAGACATGAGTCGGGGGTTAGATTGAGGCGTCTGTACTTTAATGAAGGTAGCGCTCGCATATTTTATCGAGCTAGCAATGTTTG
GGCGGGAGAGGAAACAAAAATTCAATTGGTCTTTATTGGGTATCGTGGACGATTGGGAGATATTCTGCAATTATGA
mRNA sequence
ATGGAAATGTTCAATTCTCACCTACAAGCCGCAAAAACAGAACCTATTACCGTGGCTTCAGCTGTGTCACATGCTACAGGTATTTGCCTTTTAGCCTCTTCTTCGGTAAA
ATCTCTTGGTGATTGTTGGATTATCGATTCTGGCGCGTCCCGACATATTTGTCATTCTCGTGCCTTGTTTCATAATTGGCGTAAAATTGACCCTATTTGTGTTGTTCTTC
CAACTGCCTACACAGTATGTGTGGATTTTGTGGGTGACATTCGTATATCCCCTGACTTAGTCCTTCGAGATGTTCTCTTTATTCCTGAGTTTGCTTACAACTTGGTTTCG
GTGAGCTGCTTGCTTCAAACAGGTCTTTTTTCTGTTCTGTTTGCTGATTCTCATTGCCTGATACAGGACAAGCGGCTATTGAGAGTGATTGGCAGGGCTGATTGTCGTCA
TGGCCTTTACATTCTTTCTGATTCTGCAGCTTCTGTGGCTTCCTTTACTGCTGCCACAGGTTCTGTTCCTGCTTCTACATGGCATCTCCGCTTAGGACATATATCACATA
AACGACTAGCTTCTTTGAAGGACACTTTATGTTTTAAGGACTCTGTTTCTCATAATAAACCATGTGACGTTTGTCCTCTTGCTAAACAAAAGAAGCTGTCATTTACATCA
AACAATCATGTTGCTTCAAATGTTTTTGATTTGTTGCACGCCGATATTTGGGGCCCTTTTCATACACCCACATATAATGGATATCGATACTTTCTGACTTTGGTGGATGA
TGCTTCTAGATTTACCTGGGTCTATTTACTGCGTCAAAAGTCAGATGTCCTCACTATTATCCCTAGATTTTTCAAAATGATTGAAACACAATTCTCAAAGGTCATAAAGT
GTTTTCGATCGGATAATGCCCCAGAATTCAGGTTTACTAAATTTTTTGCATCCACTGGGACTTTACACCAGTTTTCTTGCGTTGAGCGTCCTCAACAAAATGCAGTGGTT
GAGCTGAATCAGCTCAGCCAATATCATTCAACTCCCACAAATATTCACAGTCAATCCCTAAAACGAGTTCCTAGGTTCCTAAAAGAAACAGTGGATATGAGATTGAACAT
ACAACCAGTCAGCAAGTTCTCAATCACAGCCTATTCCAATGCAGATTGGGCAAATGACTCAAGAGGATTGAAAACCAGTCATGGAATTTGTACAAACCTCGGCGAAACGT
TGGGGTCGTCGTGGTCATTGAAGAAACAAAATGTAGTCTCCAAATTCAGTGAATCCGCCATTGCAAGTAGCCTGTGTACCATGCAAGAACGAAACATTCAAAATCAATTA
ACGGTTTGCGAGAGACAAATTGTTGAGGAAGCAGCTGCTGGAAATTCGTTACATGGCGATTGGCGATTCCGACGACCCCAATGGCTCCGGCGGTTCCTTCTCTCTCGTTC
GACTCCGACAACTCCAACGGCTCCGGCGTGTCCAGAAACTCTGCGGCGTCTTCATCCCCGTTCTAGCAGCTCCGACTCAACGACGTCTTCATCCCGTGGATTTTTCAGGC
CGTTTCTAGCAGCTCCGACGACTCCACCAATCAAGATTTTTCGAAGAGGGGAGAGGGTTTTTTGGAGAGTTCTCGAAGATGGAGAGGGTTTTACTGGTTGGTATATTCGC
CGGCGTCGGAGAAGACGAGTTTTCCCGGTAGATTCCCGGTCGAGATCGAAAACCGGGAAATACGGTTCCGTAAGCTTTGAACCGATTTCCTACGAGAAAACCGGTTCAAA
CCTTACAGGATTCCCGAAGAGATTGACTGAAACCCAATTAGATATGTTTAGGCAAACCGTATTTGGCCCTATATTAGACAGCAACATATTATTTAATGGTCAGTTAATCC
ACCATCTACTACTTAGGGAGGTTGAGGATCCCAGGAAGGATGTAATTAGTTTCGATATATTTGGAAATAAGGTGTCGTTTGACAAGGAAGAATTCGATCTAATCACTGGA
TTTAGACACAATAGGAGGATAGTTGATAGACATGAGTCGGGGGTTAGATTGAGGCGTCTGTACTTTAATGAAGGTAGCGCTCGCATATTTTATCGAGCTAGCAATGTTTG
GGCGGGAGAGGAAACAAAAATTCAATTGGTCTTTATTGGGTATCGTGGACGATTGGGAGATATTCTGCAATTATGA
Protein sequence
MEMFNSHLQAAKTEPITVASAVSHATGICLLASSSVKSLGDCWIIDSGASRHICHSRALFHNWRKIDPICVVLPTAYTVCVDFVGDIRISPDLVLRDVLFIPEFAYNLVS
VSCLLQTGLFSVLFADSHCLIQDKRLLRVIGRADCRHGLYILSDSAASVASFTAATGSVPASTWHLRLGHISHKRLASLKDTLCFKDSVSHNKPCDVCPLAKQKKLSFTS
NNHVASNVFDLLHADIWGPFHTPTYNGYRYFLTLVDDASRFTWVYLLRQKSDVLTIIPRFFKMIETQFSKVIKCFRSDNAPEFRFTKFFASTGTLHQFSCVERPQQNAVV
ELNQLSQYHSTPTNIHSQSLKRVPRFLKETVDMRLNIQPVSKFSITAYSNADWANDSRGLKTSHGICTNLGETLGSSWSLKKQNVVSKFSESAIASSLCTMQERNIQNQL
TVCERQIVEEAAAGNSLHGDWRFRRPQWLRRFLLSRSTPTTPTAPACPETLRRLHPRSSSSDSTTSSSRGFFRPFLAAPTTPPIKIFRRGERVFWRVLEDGEGFTGWYIR
RRRRRRVFPVDSRSRSKTGKYGSVSFEPISYEKTGSNLTGFPKRLTETQLDMFRQTVFGPILDSNILFNGQLIHHLLLREVEDPRKDVISFDIFGNKVSFDKEEFDLITG
FRHNRRIVDRHESGVRLRRLYFNEGSARIFYRASNVWAGEETKIQLVFIGYRGRLGDILQL
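The protein sequence corresponds to the CDS read codon by codon. The following is a minimal sketch that spot-checks this on the first 30 and last 18 nucleotides of the CDS above, assuming the standard genetic code (NCBI translation table 1). The record does not state which table was used, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (assumed), listed in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGAAATGTTCAATTCTCACCTACAAGCC"  # first 30 nt of the CDS
cds_end = "GATATTCTGCAATTATGA"                # last 18 nt, ending in the TGA stop
print(translate(cds_start), translate(cds_end))
```

The two fragments translate to `MEMFNSHLQA` and `DILQL`, which match the first ten and last five residues of the protein sequence.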