; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031685 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031685
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr11:11886548..11894094
RNA-Seq ExpressionLag0031685
SyntenyLag0031685
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7542996.1 Ribonuclease H-like superfamily [Arabidopsis thaliana x Arabidopsis arenosa]1.4e-6231.94Show/hide
Query:  IFGTSGLKVVQNNQQQNNFKSRKERPICTHCGVQGHTIDRCYKLHGYPPG------------YRQKGDQRSLQQQK------------------------
        +F TSG   +       N+K  K + +CTHCG+ GH +++CYK+HGYPPG            Y Q+  Q S QQ                          
Subjt:  IFGTSGLKVVQNNQQQNNFKSRKERPICTHCGVQGHTIDRCYKLHGYPPG------------YRQKGDQRSLQQQK------------------------

Query:  ------------PETASSVTSTTLIAPST------NTVDAIA----QCHSLFVMLQSQLTTAKSDSDVAT----SYLACTYLNSSPHGPWIIDLGASTHI
                    P+  +++ +  + AP T      N +D  +    Q   +   LQS     +S S++AT      +AC   +         D GA++H+
Subjt:  ------------PETASSVTSTTLIAPST------NTVDAIA----QCHSLFVMLQSQLTTAKSDSDVAT----SYLACTYLNSSPHGPWIIDLGASTHI

Query:  CFNKASFTTLFP-VATSVVLPDNTRISVNYAGSVVLLGLICLHRERSTLKTIGRGSLCDGLYMLDDSN---TTLNSTSCASVTQKLSSSLWHSRLGHLSF
        C +   F    P +  +V LP+  R+ + + G++ L   + L+ + +    +G+ S+   LY+L  ++   + ++    AS+T     +LWH RLGH S 
Subjt:  CFNKASFTTLFP-VATSVVLPDNTRISVNYAGSVVLLGLICLHRERSTLKTIGRGSLCDGLYMLDDSN---TTLNSTSCASVTQKLSSSLWHSRLGHLSF

Query:  LGV-------------LHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHVLKWQTPYVVWNGSLPDYSLMKVFGC
          +             LH  S    P+QNSVVERKHQH+LNVAR+L FQS VP+ +W EC  T V+LIN+TPS +L  ++PY      +P+YS+++VFG 
Subjt:  LGV-------------LHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHVLKWQTPYVVWNGSLPDYSLMKVFGC

Query:  LCFASTLSANRSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPFHTVTFSNDQPDLFSHIVL-------PRSFDVDNFSSKTDPPT
        LC+ STL  +R+KF+PRA P+V +GYPS  +  +++      ++ISR+VVFHE++FPF  V   + Q D+F H +L       P S  + N      P T
Subjt:  LCFASTLSANRSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPFHTVTFSNDQPDLFSHIVL-------PRSFDVDNFSSKTDPPT

Query:  -------IPTSIHTNP----TSPDMNSIPSSETDSSSIDIPNTSSHVSTPKCS------------SRPTKLPTYLKDYHCSLLTDS-PFPTYSTKYPL
                P+S HT P     SP   S  SS   SSS    ++++ +S+P                R  K P+YL  YHC L  ++ P P  ST+YPL
Subjt:  -------IPTSIHTNP----TSPDMNSIPSSETDSSSIDIPNTSSHVSTPKCS------------SRPTKLPTYLKDYHCSLLTDS-PFPTYSTKYPL

RVX02013.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]8.8e-6836.46Show/hide
Query:  NFKSRKERPICTHCGVQGHTIDRCYKLHGYPPGYRQKGDQRSLQQQKPETASSVTSTTLIAPSTNTVDAIAQCHSLFVMLQSQLTTAKSDSDVAT-----
        N K +K++P C+H G+ GHT+D+CYKL+GYPPGY+ K      + Q  +T+S  T  +  A S     + AQC  L  +L SQL     D+ +AT     
Subjt:  NFKSRKERPICTHCGVQGHTIDRCYKLHGYPPGYRQKGDQRSLQQQKPETASSVTSTTLIAPSTNTVDAIAQCHSLFVMLQSQLTTAKSDSDVAT-----

Query:  ------------SYLACTYLNSSPHGPWIIDLGASTHICFNKASFTTLFPVA-TSVVLPDNTRISVNYAGSVVLLGLICLHRERSTLKTIGRGSLCDGLY
                    S  + ++ NS     W++D GA+ H+C +  SF +  P + +SV LP+   +S++  GSV L   I       TL  +          
Subjt:  ------------SYLACTYLNSSPHGPWIIDLGASTHICFNKASFTTLFPVA-TSVVLPDNTRISVNYAGSVVLLGLICLHRERSTLKTIGRGSLCDGLY

Query:  MLDDSNTTLNSTSCASVTQKLSSSLWHSRLGHLSFLGVLHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHVLKW
        +    N  + S    +  +   S  + ++       G+L   SC   P+QNSVVERKHQHILNVARAL FQS +PL +WS C LT VYLIN+TPS +L  
Subjt:  MLDDSNTTLNSTSCASVTQKLSSSLWHSRLGHLSFLGVLHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHVLKW

Query:  QTPYVVWNGSLPDYSLMKVFGCLCFASTLSANRSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPFHTVTFSNDQPDLFSHIVLPR
        +TP+ +     P YS +K FGCLC+ASTL+ + +KF+PRAI  V +GYP  Y++ +++     + FISRDV+FHES+FPF          D FS +VLP 
Subjt:  QTPYVVWNGSLPDYSLMKVFGCLCFASTLSANRSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPFHTVTFSNDQPDLFSHIVLPR

Query:  SFDVDNFSSKTDPPTIPTSIHTNPTSPDMNSIPSSETDSSSIDIPNTSSHVSTPKCSSRPTKLPTYLKDYHCSLLTDSPFPTYSTKYPLHQ
                       +P SI +N ++      PS    S S+D P            +R  + P+YL+DYHCS    + F + ST +PL Q
Subjt:  SFDVDNFSSKTDPPTIPTSIHTNPTSPDMNSIPSSETDSSSIDIPNTSSHVSTPKCSSRPTKLPTYLKDYHCSLLTDSPFPTYSTKYPLHQ

RVX21347.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]5.3e-7338.34Show/hide
Query:  NFKSRKERPICTHCGVQGHTIDRCYKLHGYPPGYRQKGDQRSLQQQKPETASSVTSTTLIAPSTNTVDAIAQCHSLFVMLQSQLTTAKSDSDVAT-----
        N K +K+RP C+HCG+ GHT+D+CYKL+GYPPGY+ K      + Q  +T+S  T  +  A S     + AQC  L  +L SQL     D+  AT     
Subjt:  NFKSRKERPICTHCGVQGHTIDRCYKLHGYPPGYRQKGDQRSLQQQKPETASSVTSTTLIAPSTNTVDAIAQCHSLFVMLQSQLTTAKSDSDVAT-----

Query:  ------------SYLACTYLNSSPHGPWIIDLGASTHICFNKASFTTLFPVA-TSVVLPDNTRISVNYAGSVVLLGLICLHRERSTLKTIGRGSLCDGLY
                    S  + ++ NS     W++D+G + H+C +  SF +  P + +SV LP+   +S++  GSV L   I L  + S  K IG G     LY
Subjt:  ------------SYLACTYLNSSPHGPWIIDLGASTHICFNKASFTTLFPVA-TSVVLPDNTRISVNYAGSVVLLGLICLHRERSTLKTIGRGSLCDGLY

Query:  MLDDSNTTLNSTSCASV--TQKLSSSLWHSRLGHLSFLGVLHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHVL
        +LD S+    S   ++     +L  S +    G LSF    H ++    P+QNSVVERKHQHILNVA AL FQS +PL +WS C LT VYLIN+TPS +L
Subjt:  MLDDSNTTLNSTSCASV--TQKLSSSLWHSRLGHLSFLGVLHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHVL

Query:  KWQTPYVVWNGSLPDYSLMKVFGCLCFASTLSANRSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPFHTVTFSNDQPDLFSHIVL
          +TP+ +     P YS +K F CLC+ASTL+ +R+KF+PRAI  V +GYP  Y++ +++     + FISRDV+FHES+FPF          D FS +VL
Subjt:  KWQTPYVVWNGSLPDYSLMKVFGCLCFASTLSANRSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPFHTVTFSNDQPDLFSHIVL

Query:  PRSFDVDNFSSKTDPPTIPTSIHTNPTSPDMNSIPSSETDSSSIDIPNTSSHVSTPKCSSRPTKLPTYLKDYHCSLLTDSPFPTYSTKYPLHQ
        P                +P SI +N ++      PS    S S+D P            +R  + P+YL+DYHCS    + F + ST +PL Q
Subjt:  PRSFDVDNFSSKTDPPTIPTSIHTNPTSPDMNSIPSSETDSSSIDIPNTSSHVSTPKCSSRPTKLPTYLKDYHCSLLTDSPFPTYSTKYPLHQ

XP_012837652.1 PREDICTED: uncharacterized protein LOC105958190 [Erythranthe guttata]5.0e-7136.9Show/hide
Query:  QNNQQQ------NNFKSRKERPICTHCGVQGHTIDRCYKLHGYPPGYRQKGDQRSLQQ-QKPETASSVTSTTLIAPSTN-----TVDAI------AQCHS
        QN QQ+      N F  +K+RP CTHC   GHTI+ CYKLHGYP GY+ K  Q S  +      AS +     ++ S++     +VD++       QC  
Subjt:  QNNQQQ------NNFKSRKERPICTHCGVQGHTIDRCYKLHGYPPGYRQKGDQRSLQQ-QKPETASSVTSTTLIAPSTN-----TVDAI------AQCHS

Query:  LFVMLQSQLTTAKSDSDV----------ATSYLACTYLNSSPHGPWIIDLGASTHICFNKASFTTLFPVA-TSVVLPDNTRISVNYAGSVVLLGLICLHR
        L  +  S L+   +   +           T   A   L    H  WI+D GAS HI  +K+ F +L  +   SV LPDN+   V++ G  + L  I LH 
Subjt:  LFVMLQSQLTTAKSDSDV----------ATSYLACTYLNSSPHGPWIIDLGASTHICFNKASFTTLFPVA-TSVVLPDNTRISVNYAGSVVLLGLICLHR

Query:  ------------ERSTLKTIGRGSLCDGLYMLDDSNTTLNSTSCASVTQKLSSSLWHSRLGHLS------------------------------------
                      + LK IG+G   DGLY+LD ++    + S +S+   +S+S WH+RLGHLS                                    
Subjt:  ------------ERSTLKTIGRGSLCDGLYMLDDSNTTLNSTSCASVTQKLSSSLWHSRLGHLS------------------------------------

Query:  --FLGVLHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHVLKWQTPYVVWNGSLPDYSLMKVFGCLCFASTLSAN
           LGV+HQ+SCV  P+QNSVVERKHQH+LNVARAL+FQS+VP+HFWSEC LT  YLIN+TPS +L   TP+         Y++++VFGCL F STLS +
Subjt:  --FLGVLHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHVLKWQTPYVVWNGSLPDYSLMKVFGCLCFASTLSAN

Query:  RSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPF--HTVTFSNDQPDLFSHIVLPRSFDVDNFSSKTDPPTIPTSIHTNPTS
        RSKF PRA   + +GYP+  +  +++  +   +F+SR+V+FHE++FPF  +    +N   + F  +VLP    V +FS     P I +    N +S
Subjt:  RSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPF--HTVTFSNDQPDLFSHIVLPRSFDVDNFSSKTDPPTIPTSIHTNPTS

XP_022154919.1 uncharacterized protein LOC111022065 [Momordica charantia]7.4e-6734.31Show/hide
Query:  AAEIFGTSGLKVVQNNQQQNNFKSRKERPICTHCGVQGHTIDRCYKLHGYPPGYRQKGDQRSLQQQKPETASSVTSTTLIAPSTNTVDAIA-----QCHS
        A+ +  TS     + N    +   RK++ +CTHCG+ GHT+D+CYKLH YPPGYR    + +        ++   S ++ A  +   +++A     QC  
Subjt:  AAEIFGTSGLKVVQNNQQQNNFKSRKERPICTHCGVQGHTIDRCYKLHGYPPGYRQKGDQRSLQQQKPETASSVTSTTLIAPSTNTVDAIA-----QCHS

Query:  LFVMLQSQLTTAK--SDSDVATSYLACTYLNSSPHGPWIIDLGASTHICFNKASFTTLFPVATSVVLPDNTRISVNYAGSVVLLGLICLHRERSTLKTIG
        L  +LQS LTT K  SD+D  TS++A T                                                                    KTI 
Subjt:  LFVMLQSQLTTAK--SDSDVATSYLACTYLNSSPHGPWIIDLGASTHICFNKASFTTLFPVATSVVLPDNTRISVNYAGSVVLLGLICLHRERSTLKTIG

Query:  RGSLCDGLYMLDDSNTTLNSTSCASVTQKLSSSLWHSRLGHLSFLGVLHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLIN
        R           D    L+               +HS+       GVLHQFSCV  PEQNSVVERKHQH+LNVAR+L+FQSRVP  FW EC LT  YLIN
Subjt:  RGSLCDGLYMLDDSNTTLNSTSCASVTQKLSSSLWHSRLGHLSFLGVLHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLIN

Query:  QTPSHVLKWQTPYVVWNGSLPDYSLMKVFGCLCFASTLSANRSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPFHTVTFSNDQPD
        +TP+ VL W TPY    G   DYS +KVFGCLCF ST   NRSKF PRA+ +V +GYP   +  ++        F+SRDV+FHES+FPFHTV+ ++   D
Subjt:  QTPSHVLKWQTPYVVWNGSLPDYSLMKVFGCLCFASTLSANRSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPFHTVTFSNDQPD

Query:  LFSHIVLPRSFDV---------DNFSSKTDPPTIPTSIHTNPT------SP-------DMNSIPSSETDSSSI---------------DIPNTSSHVSTP
         F  +V+P+S+D+         D+ +  T     PTS   +PT      SP       D N  P    + S+I               D+ N  S V  P
Subjt:  LFSHIVLPRSFDV---------DNFSSKTDPPTIPTSIHTNPT------SP-------DMNSIPSSETDSSSI---------------DIPNTSSHVSTP

Query:  ----------KCSSRPTKLPTYLKDYHCSLLTDSPFPTYSTKYPLHQY
                  + SSR  + P+YL+DYHC L+  +     S  YPL +Y
Subjt:  ----------KCSSRPTKLPTYLKDYHCSLLTDSPFPTYSTKYPLHQY

TrEMBL top hitse value%identityAlignment
A0A2N9GKP6 Ribonuclease H2.3e-6632.36Show/hide
Query:  QNNFKSRKERPICTHCGVQGHTIDRCYKLHGYPPGYRQKGDQRSLQQQKPETASSVTSTTL---IAPSTNTVDAIAQCHSLFVMLQSQ--LTTAKSDSDV
        + NF+ R ++P C HCG  GHT+D+CYK+HGYPPGY+ KG    +  Q     S   +  +   ++P        AQC  L  +L ++  L+      + 
Subjt:  QNNFKSRKERPICTHCGVQGHTIDRCYKLHGYPPGYRQKGDQRSLQQQKPETASSVTSTTL---IAPSTNTVDAIAQCHSLFVMLQSQ--LTTAKSDSDV

Query:  ATSYLACTYLNSSPHGP-----------WIIDLGASTHICFNKASFTTLFPVA-TSVVLPDNTRISVNYAGSVVL---------LGLICLHRERSTLKTI
        + +Y A     SS   P           WIID GA+ H+  + +  TT+  +  TSV LP+   +SV + G+V +         L   CL +  +  + I
Subjt:  ATSYLACTYLNSSPHGP-----------WIIDLGASTHICFNKASFTTLFPVA-TSVVLPDNTRISVNYAGSVVL---------LGLICLHRERSTLKTI

Query:  GRGSLCDGLYMLDDS------------------------NTTLNSTSCASVTQKLSSSLWHSRLGHLSF-------------------------------
        G G L  GLYML  S                        NT+L ++   +  Q  +  LWH RLGH SF                               
Subjt:  GRGSLCDGLYMLDDS------------------------NTTLNSTSCASVTQKLSSSLWHSRLGHLSF-------------------------------

Query:  ------------------------------------LGVLHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHVLK
                                             GV+HQ SC   P+QNSVVERKHQH+LNVARA+ FQS +P  FW EC LT  Y+IN+ PS +L 
Subjt:  ------------------------------------LGVLHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHVLK

Query:  WQTPYVVWNGSLPDYSLMKVFGCLCFASTLSANRSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPF-------HTVTFSNDQPDL
         +TP+ +     P YS +KVFGCL +ASTL ++R+KF  RA+  V +GYP   +  ++        F+SRDVVFHE +FPF       H  +  +D  D 
Subjt:  WQTPYVVWNGSLPDYSLMKVFGCLCFASTLSANRSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPF-------HTVTFSNDQPDL

Query:  FSHIVLPRSFDV-----------DNFSSKTDPP-TIPTSIHTNPTSPDMNSIPSS--ETDSSSIDIPNTSSHVSTP----KCSSRPTKLPTYLKDYHCSL
        F      +  DV           D+ +S +DPP ++    HT+  S  + S+P     +DSS  D+P     V  P    + SSRP    +YL+DYHCSL
Subjt:  FSHIVLPRSFDV-----------DNFSSKTDPP-TIPTSIHTNPTSPDMNSIPSS--ETDSSSIDIPNTSSHVSTP----KCSSRPTKLPTYLKDYHCSL

Query:  LTDSPFPTYSTKYPL
        ++  P P  ST YP+
Subjt:  LTDSPFPTYSTKYPL

A0A2N9HL88 Integrase catalytic domain-containing protein2.2e-6434.19Show/hide
Query:  RKERPICTHCGVQGHTIDRCYKLHGYPPGYRQKGDQRSLQQQKPETASSVTSTTLIAPSTNTVDAIAQCHSLFVMLQSQLTTAKSDSDVATSYLACTYLN
        +KERP+C+HCG+ GHT+++CYK+HGYPPGY+ KG            A+ VT+ T                                             N
Subjt:  RKERPICTHCGVQGHTIDRCYKLHGYPPGYRQKGDQRSLQQQKPETASSVTSTTLIAPSTNTVDAIAQCHSLFVMLQSQLTTAKSDSDVATSYLACTYLN

Query:  SSPHGPWIIDLGASTHICFNKASFTTL-FPVATSVVLPDNTRISVNYAGSVVLLGLICLHRERSTLKTIGRGSLCDGLYMLDDSNT--------------
        +  +G W+ID GA+ H+ ++   FT +   + T+V LP+              L L+         K IGRG   +GLY+L+D ++              
Subjt:  SSPHGPWIIDLGASTHICFNKASFTTL-FPVATSVVLPDNTRISVNYAGSVVLLGLICLHRERSTLKTIGRGSLCDGLYMLDDSNT--------------

Query:  -TLNSTSCASVTQKLSSSLWHSRLGHLSFL---------GVLHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHV
          L  +  A V  + ++ +   R  +             GV+H  SCVA P+QNSVVERKHQHILNVARAL FQS +PL +WS+C L   YLIN+TPS V
Subjt:  -TLNSTSCASVTQKLSSSLWHSRLGHLSFL---------GVLHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHV

Query:  LKWQTPYVVWNGSLPDYSLMKVFGCLCFASTLSANRSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPFHTVTFSNDQPDLFSHIV
        LK +TP+ +   + P YS ++VFGCLC+ASTL+ N SKF PRA   + +GYP   +  +++      +FISRDVVFHE++FPF  ++    Q D F+ +V
Subjt:  LKWQTPYVVWNGSLPDYSLMKVFGCLCFASTLSANRSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPFHTVTFSNDQPDLFSHIV

Query:  LPRSFDVDNFSSKTDPPTIPTSIHTNPTSPDMNSIPSSETDSSSIDIPN---------------TSSHVSTPKCSSRPTKLPTYLKDYHCSLLTDSPFPT
        LP S    +         I +    + TS   + I + +T S ++ I N               T+    + + S+R +  P YL DYHC+L   SP P 
Subjt:  LPRSFDVDNFSSKTDPPTIPTSIHTNPTSPDMNSIPSSETDSSSIDIPN---------------TSSHVSTPKCSSRPTKLPTYLKDYHCSLLTDSPFPT

Query:  YST
        +ST
Subjt:  YST

A0A438IZ95 Retrovirus-related Pol polyprotein from transposon RE14.2e-6836.46Show/hide
Query:  NFKSRKERPICTHCGVQGHTIDRCYKLHGYPPGYRQKGDQRSLQQQKPETASSVTSTTLIAPSTNTVDAIAQCHSLFVMLQSQLTTAKSDSDVAT-----
        N K +K++P C+H G+ GHT+D+CYKL+GYPPGY+ K      + Q  +T+S  T  +  A S     + AQC  L  +L SQL     D+ +AT     
Subjt:  NFKSRKERPICTHCGVQGHTIDRCYKLHGYPPGYRQKGDQRSLQQQKPETASSVTSTTLIAPSTNTVDAIAQCHSLFVMLQSQLTTAKSDSDVAT-----

Query:  ------------SYLACTYLNSSPHGPWIIDLGASTHICFNKASFTTLFPVA-TSVVLPDNTRISVNYAGSVVLLGLICLHRERSTLKTIGRGSLCDGLY
                    S  + ++ NS     W++D GA+ H+C +  SF +  P + +SV LP+   +S++  GSV L   I       TL  +          
Subjt:  ------------SYLACTYLNSSPHGPWIIDLGASTHICFNKASFTTLFPVA-TSVVLPDNTRISVNYAGSVVLLGLICLHRERSTLKTIGRGSLCDGLY

Query:  MLDDSNTTLNSTSCASVTQKLSSSLWHSRLGHLSFLGVLHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHVLKW
        +    N  + S    +  +   S  + ++       G+L   SC   P+QNSVVERKHQHILNVARAL FQS +PL +WS C LT VYLIN+TPS +L  
Subjt:  MLDDSNTTLNSTSCASVTQKLSSSLWHSRLGHLSFLGVLHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHVLKW

Query:  QTPYVVWNGSLPDYSLMKVFGCLCFASTLSANRSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPFHTVTFSNDQPDLFSHIVLPR
        +TP+ +     P YS +K FGCLC+ASTL+ + +KF+PRAI  V +GYP  Y++ +++     + FISRDV+FHES+FPF          D FS +VLP 
Subjt:  QTPYVVWNGSLPDYSLMKVFGCLCFASTLSANRSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPFHTVTFSNDQPDLFSHIVLPR

Query:  SFDVDNFSSKTDPPTIPTSIHTNPTSPDMNSIPSSETDSSSIDIPNTSSHVSTPKCSSRPTKLPTYLKDYHCSLLTDSPFPTYSTKYPLHQ
                       +P SI +N ++      PS    S S+D P            +R  + P+YL+DYHCS    + F + ST +PL Q
Subjt:  SFDVDNFSSKTDPPTIPTSIHTNPTSPDMNSIPSSETDSSSIDIPNTSSHVSTPKCSSRPTKLPTYLKDYHCSLLTDSPFPTYSTKYPLHQ

A0A438KJI9 Retrovirus-related Pol polyprotein from transposon TNT 1-942.6e-7338.34Show/hide
Query:  NFKSRKERPICTHCGVQGHTIDRCYKLHGYPPGYRQKGDQRSLQQQKPETASSVTSTTLIAPSTNTVDAIAQCHSLFVMLQSQLTTAKSDSDVAT-----
        N K +K+RP C+HCG+ GHT+D+CYKL+GYPPGY+ K      + Q  +T+S  T  +  A S     + AQC  L  +L SQL     D+  AT     
Subjt:  NFKSRKERPICTHCGVQGHTIDRCYKLHGYPPGYRQKGDQRSLQQQKPETASSVTSTTLIAPSTNTVDAIAQCHSLFVMLQSQLTTAKSDSDVAT-----

Query:  ------------SYLACTYLNSSPHGPWIIDLGASTHICFNKASFTTLFPVA-TSVVLPDNTRISVNYAGSVVLLGLICLHRERSTLKTIGRGSLCDGLY
                    S  + ++ NS     W++D+G + H+C +  SF +  P + +SV LP+   +S++  GSV L   I L  + S  K IG G     LY
Subjt:  ------------SYLACTYLNSSPHGPWIIDLGASTHICFNKASFTTLFPVA-TSVVLPDNTRISVNYAGSVVLLGLICLHRERSTLKTIGRGSLCDGLY

Query:  MLDDSNTTLNSTSCASV--TQKLSSSLWHSRLGHLSFLGVLHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHVL
        +LD S+    S   ++     +L  S +    G LSF    H ++    P+QNSVVERKHQHILNVA AL FQS +PL +WS C LT VYLIN+TPS +L
Subjt:  MLDDSNTTLNSTSCASV--TQKLSSSLWHSRLGHLSFLGVLHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHVL

Query:  KWQTPYVVWNGSLPDYSLMKVFGCLCFASTLSANRSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPFHTVTFSNDQPDLFSHIVL
          +TP+ +     P YS +K F CLC+ASTL+ +R+KF+PRAI  V +GYP  Y++ +++     + FISRDV+FHES+FPF          D FS +VL
Subjt:  KWQTPYVVWNGSLPDYSLMKVFGCLCFASTLSANRSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPFHTVTFSNDQPDLFSHIVL

Query:  PRSFDVDNFSSKTDPPTIPTSIHTNPTSPDMNSIPSSETDSSSIDIPNTSSHVSTPKCSSRPTKLPTYLKDYHCSLLTDSPFPTYSTKYPLHQ
        P                +P SI +N ++      PS    S S+D P            +R  + P+YL+DYHCS    + F + ST +PL Q
Subjt:  PRSFDVDNFSSKTDPPTIPTSIHTNPTSPDMNSIPSSETDSSSIDIPNTSSHVSTPKCSSRPTKLPTYLKDYHCSLLTDSPFPTYSTKYPLHQ

A0A6J1DNP7 uncharacterized protein LOC1110220653.6e-6734.31Show/hide
Query:  AAEIFGTSGLKVVQNNQQQNNFKSRKERPICTHCGVQGHTIDRCYKLHGYPPGYRQKGDQRSLQQQKPETASSVTSTTLIAPSTNTVDAIA-----QCHS
        A+ +  TS     + N    +   RK++ +CTHCG+ GHT+D+CYKLH YPPGYR    + +        ++   S ++ A  +   +++A     QC  
Subjt:  AAEIFGTSGLKVVQNNQQQNNFKSRKERPICTHCGVQGHTIDRCYKLHGYPPGYRQKGDQRSLQQQKPETASSVTSTTLIAPSTNTVDAIA-----QCHS

Query:  LFVMLQSQLTTAK--SDSDVATSYLACTYLNSSPHGPWIIDLGASTHICFNKASFTTLFPVATSVVLPDNTRISVNYAGSVVLLGLICLHRERSTLKTIG
        L  +LQS LTT K  SD+D  TS++A T                                                                    KTI 
Subjt:  LFVMLQSQLTTAK--SDSDVATSYLACTYLNSSPHGPWIIDLGASTHICFNKASFTTLFPVATSVVLPDNTRISVNYAGSVVLLGLICLHRERSTLKTIG

Query:  RGSLCDGLYMLDDSNTTLNSTSCASVTQKLSSSLWHSRLGHLSFLGVLHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLIN
        R           D    L+               +HS+       GVLHQFSCV  PEQNSVVERKHQH+LNVAR+L+FQSRVP  FW EC LT  YLIN
Subjt:  RGSLCDGLYMLDDSNTTLNSTSCASVTQKLSSSLWHSRLGHLSFLGVLHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLIN

Query:  QTPSHVLKWQTPYVVWNGSLPDYSLMKVFGCLCFASTLSANRSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPFHTVTFSNDQPD
        +TP+ VL W TPY    G   DYS +KVFGCLCF ST   NRSKF PRA+ +V +GYP   +  ++        F+SRDV+FHES+FPFHTV+ ++   D
Subjt:  QTPSHVLKWQTPYVVWNGSLPDYSLMKVFGCLCFASTLSANRSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPFHTVTFSNDQPD

Query:  LFSHIVLPRSFDV---------DNFSSKTDPPTIPTSIHTNPT------SP-------DMNSIPSSETDSSSI---------------DIPNTSSHVSTP
         F  +V+P+S+D+         D+ +  T     PTS   +PT      SP       D N  P    + S+I               D+ N  S V  P
Subjt:  LFSHIVLPRSFDV---------DNFSSKTDPPTIPTSIHTNPT------SP-------DMNSIPSSETDSSSI---------------DIPNTSSHVSTP

Query:  ----------KCSSRPTKLPTYLKDYHCSLLTDSPFPTYSTKYPLHQY
                  + SSR  + P+YL+DYHC L+  +     S  YPL +Y
Subjt:  ----------KCSSRPTKLPTYLKDYHCSLLTDSPFPTYSTKYPLHQY

SwissProt top hitse value%identityAlignment
P92512 Uncharacterized mitochondrial protein AtMg007101.1e-0430Show/hide
Query:  ILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHVLKWQTPYVVWNGSLPDYSLMKVFGCLCFASTLSANRSKFAPRA
        I+   R++  +  +P  F ++   T V++IN+ PS  + +  P  VW  S+P YS ++ FGC+ +   +  +  K  PRA
Subjt:  ILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHVLKWQTPYVVWNGSLPDYSLMKVFGCLCFASTLSANRSKFAPRA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.6e-1728.93Show/hide
Query:  SLWHSRLGHLSFLGVLHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHVLKWQTPYVVWNGSLPDYSLMKVFGCL
        +LW     + S  G+ H  S    PE N + ERKH+HI+     L   + +P  +W       VYLIN+ P+ +L+ ++P+    G+ P+Y  ++VFGC 
Subjt:  SLWHSRLGHLSFLGVLHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHVLKWQTPYVVWNGSLPDYSLMKVFGCL

Query:  CFASTLSANRSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPF--HTVTFSNDQPD-------LFSHIVLPRSFDVDNFSSKTDPP
        C+      N+ K   ++   V +GY     +   +  +   ++ISR V F E+ FPF  +  T S  Q            H  LP    V    S +DP 
Subjt:  CFASTLSANRSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPF--HTVTFSNDQPD-------LFSHIVLPRSFDVDNFSSKTDPP

Query:  TIPTSIHTNPTSPDMNS-IPSSETDSS-SIDIPNTSSHVSTPKCSSRPTKLPTYLK-DYHCSLLTDSPFPTYSTKYPLHQ
           T   ++P++P  NS + SS  DSS S   P++    +  +   +PT  PT  +   H S  T    PT  +   L Q
Subjt:  TIPTSIHTNPTSPDMNS-IPSSETDSS-SIDIPNTSSHVSTPKCSSRPTKLPTYLK-DYHCSLLTDSPFPTYSTKYPLHQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.1e-2032.64Show/hide
Query:  HLSFLGVLHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHVLKWQTPYVVWNGSLPDYSLMKVFGCLCFASTLSA
        +LS  G+ H  S    PE N + ERKH+HI+ +   L   + VP  +W       VYLIN+ P+ +L+ Q+P+    G  P+Y  +KVFGC C+      
Subjt:  HLSFLGVLHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHVLKWQTPYVVWNGSLPDYSLMKVFGCLCFASTLSA

Query:  NRSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPFHTVTF---------SNDQPDLFSHIVLPRSFDVDNFSSKTDPPTIPTSIHT
        NR K   ++     MGY     +   +      ++ SR V F E  FPF T  F         S+  P+  SH  LP +  V        PP +   + T
Subjt:  NRSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPFHTVTF---------SNDQPDLFSHIVLPRSFDVDNFSSKTDPPTIPTSIHT

Query:  NPTSPDMNSIPSSETDSSSIDIPNTSSHVSTPKCSSRPT
        +P  P   S P   T  SS ++P  SS +S+P  SS PT
Subjt:  NPTSPDMNSIPSSETDSSSIDIPNTSSHVSTPKCSSRPT

Arabidopsis top hitse value%identityAlignment
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein7.8e-0630Show/hide
Query:  ILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHVLKWQTPYVVWNGSLPDYSLMKVFGCLCFASTLSANRSKFAPRA
        I+   R++  +  +P  F ++   T V++IN+ PS  + +  P  VW  S+P YS ++ FGC+ +   +  +  K  PRA
Subjt:  ILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHVLKWQTPYVVWNGSLPDYSLMKVFGCLCFASTLSANRSKFAPRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGCATGCCGAAGTCGATGGGTTGGGACAACGGATTTGCATGGTAGGGGTTGGCCGGAGTCCTTTCGAGCTTGGTGGGGGTGGTTTTCGAAGCTCCATCTCCCTCTT
TTATTCCTCTTCTCATCAATTGGGGGCGGACTCCTACGCCAAGGAAAGTGGTTCTTTCCTCACACTTGGTTCTCCATCTCCTCCAAGTAAGTTTGGTGGAGTTGGGTTGA
TTGATTGGAAGTGTTTGGATGCTTATTGTTGGTCTGATTTTCCGCAAGCGCACGGGTCGTCACAAAAGTCGTCGCAGCGTCGCGACGCTACACAGACAGCGTCGCGACGC
TATCGCAACTTTGGAGCGTGCTTTGCTGATTACCTAGCGTCGTTTTGGAAAATTTTGGTTCAGGCCATTTTACGTCGTTATGCTGCCGAAATTTTTGGTACATCTGGTTT
AAAAGTGGTTCAGAACAATCAGCAACAAAATAATTTCAAATCTCGCAAAGAACGCCCAATTTGCACTCACTGTGGAGTGCAAGGTCATACAATTGATCGTTGCTACAAAC
TTCATGGCTATCCTCCTGGATATCGGCAAAAGGGAGATCAACGTTCCCTACAGCAACAGAAGCCCGAGACTGCCTCTTCGGTGACTTCTACGACTCTCATTGCTCCTTCT
ACGAACACTGTTGATGCCATTGCTCAATGTCATAGCCTTTTTGTTATGCTTCAGTCCCAGTTAACTACCGCCAAGAGTGATTCAGATGTTGCTACGTCTTATTTAGCATG
CACCTATCTGAATTCCTCTCCTCATGGCCCTTGGATAATCGATTTGGGCGCATCAACTCATATTTGTTTTAACAAAGCATCATTTACTACTCTATTTCCTGTTGCAACTT
CTGTGGTTCTACCAGATAATACACGTATCTCTGTGAATTATGCTGGTTCTGTGGTTCTACTTGGTTTGATATGTCTTCATCGAGAAAGGTCCACTTTGAAGACGATTGGC
AGGGGTAGTTTATGTGATGGTCTTTATATGCTCGATGACAGTAACACTACCCTTAATTCAACTAGTTGTGCGTCTGTTACACAGAAGCTATCATCTAGTTTGTGGCATTC
CAGGCTTGGTCATCTCTCTTTCCTTGGAGTTCTTCATCAGTTTTCCTGTGTGGCGAGACCTGAGCAAAATTCGGTTGTTGAACGAAAGCATCAGCATATATTAAATGTGG
CTAGAGCACTATTTTTTCAGTCTCGTGTTCCTTTACATTTCTGGAGTGAATGCAAATTGACTGTTGTATATTTAATTAATCAAACTCCATCACATGTTTTGAAATGGCAA
ACTCCTTATGTTGTCTGGAATGGATCTTTGCCTGATTATTCGTTGATGAAAGTCTTTGGATGTCTCTGCTTTGCATCCACTCTATCTGCTAATCGGTCTAAGTTTGCTCC
TCGAGCTATACCTGCTGTTCTTATGGGATATCCCTCCTGGTATGAAAGCTTACAAATTGTTTGGTATAGAAAATTGTCGATTTTCATTTCTCGAGATGTGGTCTTTCATG
AGTCTGTGTTTCCATTTCATACAGTAACTTTCTCCAATGATCAACCTGACTTGTTTTCCCACATAGTTCTACCTCGATCATTTGATGTAGACAATTTTTCCTCTAAGACT
GATCCCCCAACCATCCCTACGTCTATACATACCAATCCTACCTCCCCAGACATGAATTCTATACCTTCTTCCGAAACTGATTCCTCATCCATTGATATACCAAACACATC
CTCTCATGTCTCAACACCTAAGTGCTCCTCGAGGCCAACTAAATTACCCACTTATCTCAAAGATTATCATTGTTCCCTCCTTACCGATTCCCCTTTTCCTACTTACTCCA
CCAAATATCCTTTACACCAATATACTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTGCATGCCGAAGTCGATGGGTTGGGACAACGGATTTGCATGGTAGGGGTTGGCCGGAGTCCTTTCGAGCTTGGTGGGGGTGGTTTTCGAAGCTCCATCTCCCTCTT
TTATTCCTCTTCTCATCAATTGGGGGCGGACTCCTACGCCAAGGAAAGTGGTTCTTTCCTCACACTTGGTTCTCCATCTCCTCCAAGTAAGTTTGGTGGAGTTGGGTTGA
TTGATTGGAAGTGTTTGGATGCTTATTGTTGGTCTGATTTTCCGCAAGCGCACGGGTCGTCACAAAAGTCGTCGCAGCGTCGCGACGCTACACAGACAGCGTCGCGACGC
TATCGCAACTTTGGAGCGTGCTTTGCTGATTACCTAGCGTCGTTTTGGAAAATTTTGGTTCAGGCCATTTTACGTCGTTATGCTGCCGAAATTTTTGGTACATCTGGTTT
AAAAGTGGTTCAGAACAATCAGCAACAAAATAATTTCAAATCTCGCAAAGAACGCCCAATTTGCACTCACTGTGGAGTGCAAGGTCATACAATTGATCGTTGCTACAAAC
TTCATGGCTATCCTCCTGGATATCGGCAAAAGGGAGATCAACGTTCCCTACAGCAACAGAAGCCCGAGACTGCCTCTTCGGTGACTTCTACGACTCTCATTGCTCCTTCT
ACGAACACTGTTGATGCCATTGCTCAATGTCATAGCCTTTTTGTTATGCTTCAGTCCCAGTTAACTACCGCCAAGAGTGATTCAGATGTTGCTACGTCTTATTTAGCATG
CACCTATCTGAATTCCTCTCCTCATGGCCCTTGGATAATCGATTTGGGCGCATCAACTCATATTTGTTTTAACAAAGCATCATTTACTACTCTATTTCCTGTTGCAACTT
CTGTGGTTCTACCAGATAATACACGTATCTCTGTGAATTATGCTGGTTCTGTGGTTCTACTTGGTTTGATATGTCTTCATCGAGAAAGGTCCACTTTGAAGACGATTGGC
AGGGGTAGTTTATGTGATGGTCTTTATATGCTCGATGACAGTAACACTACCCTTAATTCAACTAGTTGTGCGTCTGTTACACAGAAGCTATCATCTAGTTTGTGGCATTC
CAGGCTTGGTCATCTCTCTTTCCTTGGAGTTCTTCATCAGTTTTCCTGTGTGGCGAGACCTGAGCAAAATTCGGTTGTTGAACGAAAGCATCAGCATATATTAAATGTGG
CTAGAGCACTATTTTTTCAGTCTCGTGTTCCTTTACATTTCTGGAGTGAATGCAAATTGACTGTTGTATATTTAATTAATCAAACTCCATCACATGTTTTGAAATGGCAA
ACTCCTTATGTTGTCTGGAATGGATCTTTGCCTGATTATTCGTTGATGAAAGTCTTTGGATGTCTCTGCTTTGCATCCACTCTATCTGCTAATCGGTCTAAGTTTGCTCC
TCGAGCTATACCTGCTGTTCTTATGGGATATCCCTCCTGGTATGAAAGCTTACAAATTGTTTGGTATAGAAAATTGTCGATTTTCATTTCTCGAGATGTGGTCTTTCATG
AGTCTGTGTTTCCATTTCATACAGTAACTTTCTCCAATGATCAACCTGACTTGTTTTCCCACATAGTTCTACCTCGATCATTTGATGTAGACAATTTTTCCTCTAAGACT
GATCCCCCAACCATCCCTACGTCTATACATACCAATCCTACCTCCCCAGACATGAATTCTATACCTTCTTCCGAAACTGATTCCTCATCCATTGATATACCAAACACATC
CTCTCATGTCTCAACACCTAAGTGCTCCTCGAGGCCAACTAAATTACCCACTTATCTCAAAGATTATCATTGTTCCCTCCTTACCGATTCCCCTTTTCCTACTTACTCCA
CCAAATATCCTTTACACCAATATACTTAG
Protein sequenceShow/hide protein sequence
MLHAEVDGLGQRICMVGVGRSPFELGGGGFRSSISLFYSSSHQLGADSYAKESGSFLTLGSPSPPSKFGGVGLIDWKCLDAYCWSDFPQAHGSSQKSSQRRDATQTASRR
YRNFGACFADYLASFWKILVQAILRRYAAEIFGTSGLKVVQNNQQQNNFKSRKERPICTHCGVQGHTIDRCYKLHGYPPGYRQKGDQRSLQQQKPETASSVTSTTLIAPS
TNTVDAIAQCHSLFVMLQSQLTTAKSDSDVATSYLACTYLNSSPHGPWIIDLGASTHICFNKASFTTLFPVATSVVLPDNTRISVNYAGSVVLLGLICLHRERSTLKTIG
RGSLCDGLYMLDDSNTTLNSTSCASVTQKLSSSLWHSRLGHLSFLGVLHQFSCVARPEQNSVVERKHQHILNVARALFFQSRVPLHFWSECKLTVVYLINQTPSHVLKWQ
TPYVVWNGSLPDYSLMKVFGCLCFASTLSANRSKFAPRAIPAVLMGYPSWYESLQIVWYRKLSIFISRDVVFHESVFPFHTVTFSNDQPDLFSHIVLPRSFDVDNFSSKT
DPPTIPTSIHTNPTSPDMNSIPSSETDSSSIDIPNTSSHVSTPKCSSRPTKLPTYLKDYHCSLLTDSPFPTYSTKYPLHQYT