CuGenDBv2

Lag0008307 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0008307
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr9:17184248..17186069
RNA-Seq Expression: Lag0008307
Synteny: Lag0008307
Gene Ontology terms: NA
InterPro domains: NA
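
The genome location field above follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention), the genomic span of the gene could be computed as follows. The function names `parse_location` and `span_length` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import Tuple


def parse_location(loc: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(loc: str) -> int:
    """Length of the genomic span in bp.

    Assumes 1-based, inclusive coordinates (not confirmed by the page).
    """
    _, start, end = parse_location(loc)
    return abs(end - start) + 1


if __name__ == "__main__":
    print(span_length("chr9:17184248..17186069"))
```

Under that assumption the span is longer than the CDS listed under Sequences, which would be consistent with the locus containing non-coding sequence such as an intron.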


Homology
GenBank top hits | e-value | % identity | Alignment (shown below each hit)
CAN61322.1 hypothetical protein VITISV_012106 [Vitis vinifera] | 4.6e-57 | 35.13
Query:  FPSLTPPLNIKLSDSNYLLWKNELLNHIIAFDLESFIDGTPAPPKFMDVAQLQVNPQFTLWQKYNRILMSWMYSSLNDDKLGEIIGCSSAFDIWDHLRTV
        +  L   L +KL  +NY+LW++++ N I A   E FIDGT   P+  D++   +NP F  W++ +R ++SW+YSSL    + +IIG +++   W+ L ++
Subjt:  FPSLTPPLNIKLSDSNYLLWKNELLNHIIAFDLESFIDGTPAPPKFMDVAQLQVNPQFTLWQKYNRILMSWMYSSLNDDKLGEIIGCSSAFDIWDHLRTV

Query:  YESSSTTRIMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYNLFVTSIQNRNDRSSLADVRSLLLAYGARLEKQSSI
        + SSS  RIM LR +L+  +K  +++  Y+ +IK  AD  +AIGEP+S +D +  +L GLG++YN  VT+I  R+D+ SL  + S+LLA+  RLE+QSSI
Subjt:  YESSSTTRIMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYNLFVTSIQNRNDRSSLADVRSLLLAYGARLEKQSSI

Query:  DSLNLVQANLANLTTNSNTRRQNRVSPFSNPRPPFTRPPAPFPFPQPFSNPSNSNSVLGRPQSSPCPKW--HTPSSQNRPQCQICRKLGHTTLVCYNRLN
        + ++      AN  ++SN R   R   F+  R             Q +S  +N+ +  GR +     +      S   +PQCQ+C K GHT  +CY+R +
Subjt:  DSLNLVQANLANLTTNSNTRRQNRVSPFSNPRPPFTRPPAPFPFPQPFSNPSNSNSVLGRPQSSPCPKW--HTPSSQNRPQCQICRKLGHTTLVCYNRLN

Query:  PMYQAPSSSVAPQAFFNQFTQPPQPALPCLVVSDSPSSVSISSHPDEAWYMDYGATHHMTPDLNNLQQSSPYFGSEQVVIGNGPHRSFGS
          +Q   ++++     +      Q  +P +V S S      ++  DE+WY+D GA+HH+T +L NL  +SPY G+++V IGNG H S  +
Subjt:  PMYQAPSSSVAPQAFFNQFTQPPQPALPCLVVSDSPSSVSISSHPDEAWYMDYGATHHMTPDLNNLQQSSPYFGSEQVVIGNGPHRSFGS

GFZ12741.1 UBX domain-containing protein [Actinidia rufa] | 2.0e-65 | 38.94
Query:  PPAPSP----MLPFFPTYNLNPPIFPQHPVTPSPFPSLTPPLNIKLSDSNYLLWKNELLNHIIAFDLESFIDGT-PAPPKFMDVAQLQVNPQFTLWQKYN
        PPAP P     LP       NP I    P+     PS+  PL +KL D NY++WK +LLN +IA  LE F+DG+   PP+F+D  Q Q NP+F  WQ+YN
Subjt:  PPAPSP----MLPFFPTYNLNPPIFPQHPVTPSPFPSLTPPLNIKLSDSNYLLWKNELLNHIIAFDLESFIDGT-PAPPKFMDVAQLQVNPQFTLWQKYN

Query:  RILMSWMYSSLNDDKLGEIIGCSSAFDIWDHLRTVYESSSTTRIMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYN
        R++MSW+Y+S+N+  LG+I+G +SA  IW+ L  +Y ++S   +  LR+ L+ I+K+GLT   Y+ + + + +  ++IGEP++Y DHL Y L GLG +YN
Subjt:  RILMSWMYSSLNDDKLGEIIGCSSAFDIWDHLRTVYESSSTTRIMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYN

Query:  LFVTSIQNRNDRSSLADVRSLLLAYGARLEKQSSIDSLNLVQANLANLTTNSNTRRQNRVSPFSNPRPPFTRPPAPFPFPQPFSNPSNSNSVLGRPQSSP
         FVTSIQ++  R S              +E+ +S  SL               TR+    +P +N           FP    +S+P   N     P  SP
Subjt:  LFVTSIQNRNDRSSLADVRSLLLAYGARLEKQSSIDSLNLVQANLANLTTNSNTRRQNRVSPFSNPRPPFTRPPAPFPFPQPFSNPSNSNSVLGRPQSSP

Query:  CPKWHTPSSQNRPQCQICRKLGHTTLVCYNRLNPMYQAPSSSVAPQAFFNQFTQPPQPALPCLVVSDSPSSVSISSHPDEAWYMDYGATHHMTPDLNNLQ
         P     S + RP+CQIC K GHT   CY+  N  YQ P     P   FN +  P     P L  S +  ++ + S PD +WYMD GA+HH TPDLN L 
Subjt:  CPKWHTPSSQNRPQCQICRKLGHTTLVCYNRLNPMYQAPSSSVAPQAFFNQFTQPPQPALPCLVVSDSPSSVSISSHPDEAWYMDYGATHHMTPDLNNLQ

Query:  QSSPYFGSEQVVIGNG
         +SPY G +QV +GNG
Subjt:  QSSPYFGSEQVVIGNG

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | 8.9e-61 | 39.08
Query:  YPPAPSPMLPFFPTYNLNPPIFPQHPVTPSPFPSLTPPLNIKLSDSNYLLWKNELLNHIIAFDLESFID-GTPAPPKFMDVAQLQVNPQFTLWQKYNRIL
        +PP P+         N NP   PQ      P PSL+  L+IKL ++N LL K++LLN IIA  LE FID    +PPK++D A  QVNP+F  W + N+++
Subjt:  YPPAPSPMLPFFPTYNLNPPIFPQHPVTPSPFPSLTPPLNIKLSDSNYLLWKNELLNHIIAFDLESFID-GTPAPPKFMDVAQLQVNPQFTLWQKYNRIL

Query:  MSWMYSSLNDDKLGEIIGCSSAFDIWDHLRTVYESSSTTRIMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYNLFV
        MSW+YSSL    +G+I+  S+A DIW  L   YES S   +M L SQL++I+K  + +S+YL+++K V D+F+ IGEPLSYRD L  ILEGL  EY+ FV
Subjt:  MSWMYSSLNDDKLGEIIGCSSAFDIWDHLRTVYESSSTTRIMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYNLFV

Query:  TSIQNRNDRSSLADVRSLLLAYGARLEKQSSIDSLNLVQANLANLTTNSNTRRQNRVSPFSNPRPPFTRPPAPFPFPQPFSNPSNSNSVLGRPQSSPCPK
        TSI NR+DR SL +V SLL  Y  RL ++S   +LN  QA                     NPR                  P  +NS+           
Subjt:  TSIQNRNDRSSLADVRSLLLAYGARLEKQSSIDSLNLVQANLANLTTNSNTRRQNRVSPFSNPRPPFTRPPAPFPFPQPFSNPSNSNSVLGRPQSSPCPK

Query:  WHTPSSQNRPQCQICRKLGHTTLVCYNRLNPMYQAPSSSVAPQAFFNQFTQPPQPALPCLVVSDSPSSVSISSHPDEAWYMDYGATHHMTPDLNNLQQSS
                 PQCQIC K GH  L  Y+R N  Y  P    A     N   Q   P    L  S +P+ +S S   D +WYMD GATHH TP+  ++  + 
Subjt:  WHTPSSQNRPQCQICRKLGHTTLVCYNRLNPMYQAPSSSVAPQAFFNQFTQPPQPALPCLVVSDSPSSVSISSHPDEAWYMDYGATHHMTPDLNNLQQSS

Query:  PYFGSEQVVIGN
         Y   +  ++GN
Subjt:  PYFGSEQVVIGN

RVW95765.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | 4.6e-57 | 35.13
Query:  FPSLTPPLNIKLSDSNYLLWKNELLNHIIAFDLESFIDGTPAPPKFMDVAQLQVNPQFTLWQKYNRILMSWMYSSLNDDKLGEIIGCSSAFDIWDHLRTV
        +  L   L +KL  +NY+LW++++ N I A   E FIDGT   P+  D++   +NP F  W++ +R ++SW+YSSL    + +IIG +++   W+ L ++
Subjt:  FPSLTPPLNIKLSDSNYLLWKNELLNHIIAFDLESFIDGTPAPPKFMDVAQLQVNPQFTLWQKYNRILMSWMYSSLNDDKLGEIIGCSSAFDIWDHLRTV

Query:  YESSSTTRIMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYNLFVTSIQNRNDRSSLADVRSLLLAYGARLEKQSSI
        + SSS  RIM LR +L+  +K  +++  Y+ +IK  AD  +AIGEP+S +D +  +L GLG++YN  VT+I  R+D+ SL  + S+LLA+  RLE+QSSI
Subjt:  YESSSTTRIMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYNLFVTSIQNRNDRSSLADVRSLLLAYGARLEKQSSI

Query:  DSLNLVQANLANLTTNSNTRRQNRVSPFSNPRPPFTRPPAPFPFPQPFSNPSNSNSVLGRPQSSPCPKW--HTPSSQNRPQCQICRKLGHTTLVCYNRLN
        + ++      AN  ++SN R   R   F+  R             Q +S  +N+ +  GR +     +      S   +PQCQ+C K GHT  +CY+R +
Subjt:  DSLNLVQANLANLTTNSNTRRQNRVSPFSNPRPPFTRPPAPFPFPQPFSNPSNSNSVLGRPQSSPCPKW--HTPSSQNRPQCQICRKLGHTTLVCYNRLN

Query:  PMYQAPSSSVAPQAFFNQFTQPPQPALPCLVVSDSPSSVSISSHPDEAWYMDYGATHHMTPDLNNLQQSSPYFGSEQVVIGNGPHRSFGS
          +Q   ++++     +      Q  +P +V S S      ++  DE+WY+D GA+HH+T +L NL  +SPY G+++V IGNG H S  +
Subjt:  PMYQAPSSSVAPQAFFNQFTQPPQPALPCLVVSDSPSSVSISSHPDEAWYMDYGATHHMTPDLNNLQQSSPYFGSEQVVIGNGPHRSFGS

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia] | 1.0e-101 | 49.51
Query:  FPTYNLNPPIFPQHPVTPSPFPSLTPPLNIKLSDSNYLLWKNELLNHIIAFDLESFIDGT-PAPPKFMDVAQLQVNPQFTLWQKYNRILMSWMYSSLNDD
        FP    N    P +P + +PFP+L  PLN+KL+D+N+LLWKN+LLN +IA  L  ++DGT   PP+F+D  QLQ NP +  W++YNR+LM W+YSSL+++
Subjt:  FPTYNLNPPIFPQHPVTPSPFPSLTPPLNIKLSDSNYLLWKNELLNHIIAFDLESFIDGT-PAPPKFMDVAQLQVNPQFTLWQKYNRILMSWMYSSLNDD

Query:  KLGEIIGCSSAFDIWDHLRTVYESSSTTRIMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYNLFVTSIQNRNDRSS
        K+GE++   +  DIW  L  VY+S +T RIMGL+++L+ +RKDG +VSQYLA+IK++ADKF+A+GEPLSYRDHLA++L+GLG+EYN FVTSI NR D  S
Subjt:  KLGEIIGCSSAFDIWDHLRTVYESSSTTRIMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYNLFVTSIQNRNDRSS

Query:  LADVRSLLLAYGARLEKQSSIDSLNLVQANLANLTTNSNTRRQNRVSPFSNPRPPFTRP-PAPFPFPQPFSNPSNSNSVLGRPQSSPCPKWHTPSSQNRP
        L DVRSLLLAY ARL+KQ+++D LN+ QANL NL+   N++R         P P F+ P      FP    + + S S+LG+PQS    KW    S ++ 
Subjt:  LADVRSLLLAYGARLEKQSSIDSLNLVQANLANLTTNSNTRRQNRVSPFSNPRPPFTRP-PAPFPFPQPFSNPSNSNSVLGRPQSSPCPKWHTPSSQNRP

Query:  QCQICRKLGHTTLVCYNRLNPMYQAPSSSVAPQAFFNQFTQPPQPALPCLVVSDSPSSVSISSHPDEAWYMDYGATHHMTPDLNNLQQSSPYFGSEQVVI
        QCQIC KLGH+  VCY+R N  Y     + +PQA ++     P            PSS     HPDE+W+MD GATHHMTPD + L   +PY G EQV +
Subjt:  QCQICRKLGHTTLVCYNRLNPMYQAPSSSVAPQAFFNQFTQPPQPALPCLVVSDSPSSVSISSHPDEAWYMDYGATHHMTPDLNNLQQSSPYFGSEQVVI

Query:  GNGPHRSFGSNS
        GNG     GS+S
Subjt:  GNGPHRSFGSNS

TrEMBL top hits | e-value | % identity | Alignment (shown below each hit)
A0A438CVN7 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 2.2e-57 | 35.13
Query:  FPSLTPPLNIKLSDSNYLLWKNELLNHIIAFDLESFIDGTPAPPKFMDVAQLQVNPQFTLWQKYNRILMSWMYSSLNDDKLGEIIGCSSAFDIWDHLRTV
        +  L   L +KL  +NY+LW++++ N I A   E FIDGT   P+  D++   +NP F  W++ +R ++SW+YSSL    + +IIG +++   W+ L ++
Subjt:  FPSLTPPLNIKLSDSNYLLWKNELLNHIIAFDLESFIDGTPAPPKFMDVAQLQVNPQFTLWQKYNRILMSWMYSSLNDDKLGEIIGCSSAFDIWDHLRTV

Query:  YESSSTTRIMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYNLFVTSIQNRNDRSSLADVRSLLLAYGARLEKQSSI
        + SSS  RIM LR +L+  +K  +++  Y+ +IK  AD  +AIGEP+S +D +  +L GLG++YN  VT+I  R+D+ SL  + S+LLA+  RLE+QSSI
Subjt:  YESSSTTRIMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYNLFVTSIQNRNDRSSLADVRSLLLAYGARLEKQSSI

Query:  DSLNLVQANLANLTTNSNTRRQNRVSPFSNPRPPFTRPPAPFPFPQPFSNPSNSNSVLGRPQSSPCPKW--HTPSSQNRPQCQICRKLGHTTLVCYNRLN
        + ++      AN  ++SN R   R   F+  R             Q +S  +N+ +  GR +     +      S   +PQCQ+C K GHT  +CY+R +
Subjt:  DSLNLVQANLANLTTNSNTRRQNRVSPFSNPRPPFTRPPAPFPFPQPFSNPSNSNSVLGRPQSSPCPKW--HTPSSQNRPQCQICRKLGHTTLVCYNRLN

Query:  PMYQAPSSSVAPQAFFNQFTQPPQPALPCLVVSDSPSSVSISSHPDEAWYMDYGATHHMTPDLNNLQQSSPYFGSEQVVIGNGPHRSFGS
          +Q   ++++     +      Q  +P +V S S      ++  DE+WY+D GA+HH+T +L NL  +SPY G+++V IGNG H S  +
Subjt:  PMYQAPSSSVAPQAFFNQFTQPPQPALPCLVVSDSPSSVSISSHPDEAWYMDYGATHHMTPDLNNLQQSSPYFGSEQVVIGNGPHRSFGS

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE1 | 4.3e-61 | 39.08
Query:  YPPAPSPMLPFFPTYNLNPPIFPQHPVTPSPFPSLTPPLNIKLSDSNYLLWKNELLNHIIAFDLESFID-GTPAPPKFMDVAQLQVNPQFTLWQKYNRIL
        +PP P+         N NP   PQ      P PSL+  L+IKL ++N LL K++LLN IIA  LE FID    +PPK++D A  QVNP+F  W + N+++
Subjt:  YPPAPSPMLPFFPTYNLNPPIFPQHPVTPSPFPSLTPPLNIKLSDSNYLLWKNELLNHIIAFDLESFID-GTPAPPKFMDVAQLQVNPQFTLWQKYNRIL

Query:  MSWMYSSLNDDKLGEIIGCSSAFDIWDHLRTVYESSSTTRIMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYNLFV
        MSW+YSSL    +G+I+  S+A DIW  L   YES S   +M L SQL++I+K  + +S+YL+++K V D+F+ IGEPLSYRD L  ILEGL  EY+ FV
Subjt:  MSWMYSSLNDDKLGEIIGCSSAFDIWDHLRTVYESSSTTRIMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYNLFV

Query:  TSIQNRNDRSSLADVRSLLLAYGARLEKQSSIDSLNLVQANLANLTTNSNTRRQNRVSPFSNPRPPFTRPPAPFPFPQPFSNPSNSNSVLGRPQSSPCPK
        TSI NR+DR SL +V SLL  Y  RL ++S   +LN  QA                     NPR                  P  +NS+           
Subjt:  TSIQNRNDRSSLADVRSLLLAYGARLEKQSSIDSLNLVQANLANLTTNSNTRRQNRVSPFSNPRPPFTRPPAPFPFPQPFSNPSNSNSVLGRPQSSPCPK

Query:  WHTPSSQNRPQCQICRKLGHTTLVCYNRLNPMYQAPSSSVAPQAFFNQFTQPPQPALPCLVVSDSPSSVSISSHPDEAWYMDYGATHHMTPDLNNLQQSS
                 PQCQIC K GH  L  Y+R N  Y  P    A     N   Q   P    L  S +P+ +S S   D +WYMD GATHH TP+  ++  + 
Subjt:  WHTPSSQNRPQCQICRKLGHTTLVCYNRLNPMYQAPSSSVAPQAFFNQFTQPPQPALPCLVVSDSPSSVSISSHPDEAWYMDYGATHHMTPDLNNLQQSS

Query:  PYFGSEQVVIGN
         Y   +  ++GN
Subjt:  PYFGSEQVVIGN

A0A6J1DQX7 uncharacterized protein LOC111022315 | 5.0e-102 | 49.51
Query:  FPTYNLNPPIFPQHPVTPSPFPSLTPPLNIKLSDSNYLLWKNELLNHIIAFDLESFIDGT-PAPPKFMDVAQLQVNPQFTLWQKYNRILMSWMYSSLNDD
        FP    N    P +P + +PFP+L  PLN+KL+D+N+LLWKN+LLN +IA  L  ++DGT   PP+F+D  QLQ NP +  W++YNR+LM W+YSSL+++
Subjt:  FPTYNLNPPIFPQHPVTPSPFPSLTPPLNIKLSDSNYLLWKNELLNHIIAFDLESFIDGT-PAPPKFMDVAQLQVNPQFTLWQKYNRILMSWMYSSLNDD

Query:  KLGEIIGCSSAFDIWDHLRTVYESSSTTRIMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYNLFVTSIQNRNDRSS
        K+GE++   +  DIW  L  VY+S +T RIMGL+++L+ +RKDG +VSQYLA+IK++ADKF+A+GEPLSYRDHLA++L+GLG+EYN FVTSI NR D  S
Subjt:  KLGEIIGCSSAFDIWDHLRTVYESSSTTRIMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYNLFVTSIQNRNDRSS

Query:  LADVRSLLLAYGARLEKQSSIDSLNLVQANLANLTTNSNTRRQNRVSPFSNPRPPFTRP-PAPFPFPQPFSNPSNSNSVLGRPQSSPCPKWHTPSSQNRP
        L DVRSLLLAY ARL+KQ+++D LN+ QANL NL+   N++R         P P F+ P      FP    + + S S+LG+PQS    KW    S ++ 
Subjt:  LADVRSLLLAYGARLEKQSSIDSLNLVQANLANLTTNSNTRRQNRVSPFSNPRPPFTRP-PAPFPFPQPFSNPSNSNSVLGRPQSSPCPKWHTPSSQNRP

Query:  QCQICRKLGHTTLVCYNRLNPMYQAPSSSVAPQAFFNQFTQPPQPALPCLVVSDSPSSVSISSHPDEAWYMDYGATHHMTPDLNNLQQSSPYFGSEQVVI
        QCQIC KLGH+  VCY+R N  Y     + +PQA ++     P            PSS     HPDE+W+MD GATHHMTPD + L   +PY G EQV +
Subjt:  QCQICRKLGHTTLVCYNRLNPMYQAPSSSVAPQAFFNQFTQPPQPALPCLVVSDSPSSVSISSHPDEAWYMDYGATHHMTPDLNNLQQSSPYFGSEQVVI

Query:  GNGPHRSFGSNS
        GNG     GS+S
Subjt:  GNGPHRSFGSNS

A0A7J0GPN0 UBX domain-containing protein | 9.9e-66 | 38.94
Query:  PPAPSP----MLPFFPTYNLNPPIFPQHPVTPSPFPSLTPPLNIKLSDSNYLLWKNELLNHIIAFDLESFIDGT-PAPPKFMDVAQLQVNPQFTLWQKYN
        PPAP P     LP       NP I    P+     PS+  PL +KL D NY++WK +LLN +IA  LE F+DG+   PP+F+D  Q Q NP+F  WQ+YN
Subjt:  PPAPSP----MLPFFPTYNLNPPIFPQHPVTPSPFPSLTPPLNIKLSDSNYLLWKNELLNHIIAFDLESFIDGT-PAPPKFMDVAQLQVNPQFTLWQKYN

Query:  RILMSWMYSSLNDDKLGEIIGCSSAFDIWDHLRTVYESSSTTRIMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYN
        R++MSW+Y+S+N+  LG+I+G +SA  IW+ L  +Y ++S   +  LR+ L+ I+K+GLT   Y+ + + + +  ++IGEP++Y DHL Y L GLG +YN
Subjt:  RILMSWMYSSLNDDKLGEIIGCSSAFDIWDHLRTVYESSSTTRIMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYN

Query:  LFVTSIQNRNDRSSLADVRSLLLAYGARLEKQSSIDSLNLVQANLANLTTNSNTRRQNRVSPFSNPRPPFTRPPAPFPFPQPFSNPSNSNSVLGRPQSSP
         FVTSIQ++  R S              +E+ +S  SL               TR+    +P +N           FP    +S+P   N     P  SP
Subjt:  LFVTSIQNRNDRSSLADVRSLLLAYGARLEKQSSIDSLNLVQANLANLTTNSNTRRQNRVSPFSNPRPPFTRPPAPFPFPQPFSNPSNSNSVLGRPQSSP

Query:  CPKWHTPSSQNRPQCQICRKLGHTTLVCYNRLNPMYQAPSSSVAPQAFFNQFTQPPQPALPCLVVSDSPSSVSISSHPDEAWYMDYGATHHMTPDLNNLQ
         P     S + RP+CQIC K GHT   CY+  N  YQ P     P   FN +  P     P L  S +  ++ + S PD +WYMD GA+HH TPDLN L 
Subjt:  CPKWHTPSSQNRPQCQICRKLGHTTLVCYNRLNPMYQAPSSSVAPQAFFNQFTQPPQPALPCLVVSDSPSSVSISSHPDEAWYMDYGATHHMTPDLNNLQ

Query:  QSSPYFGSEQVVIGNG
         +SPY G +QV +GNG
Subjt:  QSSPYFGSEQVVIGNG

A5BFR8 Integrase catalytic domain-containing protein | 2.2e-57 | 35.13
Query:  FPSLTPPLNIKLSDSNYLLWKNELLNHIIAFDLESFIDGTPAPPKFMDVAQLQVNPQFTLWQKYNRILMSWMYSSLNDDKLGEIIGCSSAFDIWDHLRTV
        +  L   L +KL  +NY+LW++++ N I A   E FIDGT   P+  D++   +NP F  W++ +R ++SW+YSSL    + +IIG +++   W+ L ++
Subjt:  FPSLTPPLNIKLSDSNYLLWKNELLNHIIAFDLESFIDGTPAPPKFMDVAQLQVNPQFTLWQKYNRILMSWMYSSLNDDKLGEIIGCSSAFDIWDHLRTV

Query:  YESSSTTRIMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYNLFVTSIQNRNDRSSLADVRSLLLAYGARLEKQSSI
        + SSS  RIM LR +L+  +K  +++  Y+ +IK  AD  +AIGEP+S +D +  +L GLG++YN  VT+I  R+D+ SL  + S+LLA+  RLE+QSSI
Subjt:  YESSSTTRIMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYNLFVTSIQNRNDRSSLADVRSLLLAYGARLEKQSSI

Query:  DSLNLVQANLANLTTNSNTRRQNRVSPFSNPRPPFTRPPAPFPFPQPFSNPSNSNSVLGRPQSSPCPKW--HTPSSQNRPQCQICRKLGHTTLVCYNRLN
        + ++      AN  ++SN R   R   F+  R             Q +S  +N+ +  GR +     +      S   +PQCQ+C K GHT  +CY+R +
Subjt:  DSLNLVQANLANLTTNSNTRRQNRVSPFSNPRPPFTRPPAPFPFPQPFSNPSNSNSVLGRPQSSPCPKW--HTPSSQNRPQCQICRKLGHTTLVCYNRLN

Query:  PMYQAPSSSVAPQAFFNQFTQPPQPALPCLVVSDSPSSVSISSHPDEAWYMDYGATHHMTPDLNNLQQSSPYFGSEQVVIGNGPHRSFGS
          +Q   ++++     +      Q  +P +V S S      ++  DE+WY+D GA+HH+T +L NL  +SPY G+++V IGNG H S  +
Subjt:  PMYQAPSSSVAPQAFFNQFTQPPQPALPCLVVSDSPSSVSISSHPDEAWYMDYGATHHMTPDLNNLQQSSPYFGSEQVVIGNGPHRSFGS

SwissProt top hits | e-value | % identity | Alignment (shown below each hit)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 7.6e-31 | 25.68
Query:  KLSDSNYLLWKNELLNHIIAFDLESFIDG-TPAPPKFMDV-AQLQVNPQFTLWQKYNRILMSWMYSSLNDDKLGEIIGCSSAFDIWDHLRTVYESSSTTR
        KL+ +NYL+W  ++      ++L  F+DG T  PP  +   A  +VNP +T W++ ++++ S +  +++      +   ++A  IW+ LR +Y + S   
Subjt:  KLSDSNYLLWKNELLNHIIAFDLESFIDG-TPAPPKFMDV-AQLQVNPQFTLWQKYNRILMSWMYSSLNDDKLGEIIGCSSAFDIWDHLRTVYESSSTTR

Query:  IMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYNLFVTSIQNRNDRSSLADVRSLLLAYGARLEKQSSIDSLNLVQA
        +  LR+QL++  K   T+  Y+  +    D+ + +G+P+ + + +  +LE L  EY   +  I  ++   +L ++   LL + +++   SS   + +   
Subjt:  IMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYNLFVTSIQNRNDRSSLADVRSLLLAYGARLEKQSSIDSLNLVQA

Query:  NLA--NLTTNSNTRRQNRVSPFSNPRPPFTRPPAPFPFPQPFSNPSNSNSVLGRPQSSPCPKWHTPSSQNRP---QCQICRKLGHTTLVCYNRLNPMYQA
         ++  N TT +N    NR + + N                     +N+NS   +P       +H  ++Q++P   +CQIC   GH+   C        Q 
Subjt:  NLA--NLTTNSNTRRQNRVSPFSNPRPPFTRPPAPFPFPQPFSNPSNSNSVLGRPQSSPCPKWHTPSSQNRP---QCQICRKLGHTTLVCYNRLNPMYQA

Query:  PSSSVAPQAFFNQFTQPPQPALPCLVVSDSPSSVSISS-HPDEAWYMDYGATHHMTPDLNNLQQSSPYFGSEQVVIGNG---PHRSFGSNSMLKRAKPER
          SSV  Q       QPP P  P        +++++ S +    W +D GATHH+T D NNL    PY G + V++ +G   P    GS S+  +++P  
Subjt:  PSSSVAPQAFFNQFTQPPQPALPCLVVSDSPSSVSISS-HPDEAWYMDYGATHHMTPDLNNLQQSSPYFGSEQVVIGNG---PHRSFGSNSMLKRAKPER

Query:  KMTIL
           IL
Subjt:  KMTIL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 1.0e-22 | 24.93
Query:  KLSDSNYLLWKNELLNHIIAFDLESFIDG-TPAPPKFMDV-AQLQVNPQFTLWQKYNRILMSWMYSSLNDDKLGEIIGCSSAFDIWDHLRTVYESSSTTR
        KL+ +NYL+W  ++      ++L  F+DG TP PP  +   A  +VNP +T W++ ++++ S +  +++      +   ++A  IW+ LR +Y + S   
Subjt:  KLSDSNYLLWKNELLNHIIAFDLESFIDG-TPAPPKFMDV-AQLQVNPQFTLWQKYNRILMSWMYSSLNDDKLGEIIGCSSAFDIWDHLRTVYESSSTTR

Query:  IMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYNLFVTSIQNRNDRSSLADVRSLLLAYGARLEKQSSIDSLNLVQA
          G  +QLR I +                D+ + +G+P+ + + +  +LE L  +Y   +  I  ++   SL ++   L+   ++L   +S + + +   
Subjt:  IMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYNLFVTSIQNRNDRSSLADVRSLLLAYGARLEKQSSIDSLNLVQA

Query:  NLANLTTNSNTRRQNRVSPFSNPRPPFTRPPAPFPFPQPFSNPSNSNSVLGRPQSSPCPKWHTPSSQNRPQCQICRKLGHTTLVCYNRLNPMYQAPSSSV
         + +  TN+N  + NR    +                    N +N+ S   +P SS     +        +CQIC   GH+   C     P      S+ 
Subjt:  NLANLTTNSNTRRQNRVSPFSNPRPPFTRPPAPFPFPQPFSNPSNSNSVLGRPQSSPCPKWHTPSSQNRPQCQICRKLGHTTLVCYNRLNPMYQAPSSSV

Query:  APQAFFNQFTQPPQPALPCLVVSDSPSSVSISSHPDEAWYMDYGATHHMTPDLNNLQQSSPYFGSEQVVIGNG
          Q   + FT P QP     V  +SP + +        W +D GATHH+T D NNL    PY G + V+I +G
Subjt:  APQAFFNQFTQPPQPALPCLVVSDSPSSVSISSHPDEAWYMDYGATHHMTPDLNNLQQSSPYFGSEQVVIGNG

Arabidopsis top hits | e-value | % identity | Alignment (shown below each hit)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | 8.4e-09 | 25.64
Query:  NLNPPIFPQHPVTPSP---FPSLTPPLNIKLSDSNYLLWKNELLNHIIAFDLESFIDGT-PAPPKFMDVAQLQVNPQFTLWQKYNRILMSWMYSSLNDDK
        +++P   P  P    P    PS      +   + NY+ WK    + +       FIDGT P P  F        +P +  W++ N ++M W+ +S+ D  
Subjt:  NLNPPIFPQHPVTPSP---FPSLTPPLNIKLSDSNYLLWKNELLNHIIAFDLESFIDGT-PAPPKFMDVAQLQVNPQFTLWQKYNRILMSWMYSSLNDDK

Query:  LGEIIGCSSAFDIWDHLRTVYESSSTTRIMGLRSQLRKIRKDGLTVSQYLAQIKDV
        L  ++   +A  +W+ LR V+      +I  LR +L  +R+ G +V +Y  ++  V
Subjt:  LGEIIGCSSAFDIWDHLRTVYESSSTTRIMGLRSQLRKIRKDGLTVSQYLAQIKDV

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 1.4e-16 | 26.18
Query:  PLNIKLSDSNYLLWKNELLNHIIAFDLESFIDGTPAPPKFMDVAQLQVNPQFTLWQKYNRILMSWMYSSLNDDKL-GEIIGCSSAFDIWDHLRTVYESSS
        P+ + + +SNY  W+   L H ++FD+   IDGT  P    DV           WQK + I+   +Y +L   +  G  +  S++ DIW  ++  + ++ 
Subjt:  PLNIKLSDSNYLLWKNELLNHIIAFDLESFIDGTPAPPKFMDVAQLQVNPQFTLWQKYNRILMSWMYSSLNDDKL-GEIIGCSSAFDIWDHLRTVYESSS

Query:  TTRIMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYNLFVTSIQNRNDRSSLADVRSLLLAYGARLEK
          R + L S+LR      + V+ Y  ++K +AD    +  P++ R+ + Y+L GL  +++  +  I++R    S  D  ++L     RL++
Subjt:  TTRIMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYNLFVTSIQNRNDRSSLADVRSLLLAYGARLEK


Sequences
CDS sequence
ATGGCGTCTCCTTCAAGTTCTTCAACCTTAGTCTCAAACGGAATTCCGATCATATCCTCTTCCGCCACTCAACCTGTTATTTCTCCTATTACTTCCCCTTCTTCTTCTCA
ACGCTTAAACCTTAATCCCCAAACCCCCTTATCGAACACCCAAACCAGACCTCTCAACCCTAATGTCCCTCCTTTTTCGACTGGCGTCAATTCCAGTTTTCTTCCTCATC
CCAACATTATAAATCAACCACGTGGTCCTGCTGCTCCTACAGGATTTGCTTACCCTCCAGCTCCCTCGCCTATGTTACCATTTTTTCCGACGTATAACTTGAACCCTCCT
ATATTTCCTCAGCATCCTGTAACACCCTCTCCCTTTCCATCACTGACCCCTCCCCTCAACATTAAGCTCTCTGATTCGAACTACCTACTGTGGAAGAATGAACTGCTAAA
TCACATTATAGCCTTTGATTTGGAATCGTTCATTGATGGCACACCTGCTCCCCCAAAATTTATGGATGTTGCTCAACTCCAGGTAAATCCCCAATTTACTCTTTGGCAAA
AATATAATAGAATTCTTATGAGTTGGATGTACTCTTCCTTGAATGATGACAAACTCGGTGAAATTATTGGGTGTTCTTCTGCTTTCGATATATGGGATCATCTCCGTACT
GTTTATGAATCATCTTCCACCACAAGAATTATGGGTCTTCGGTCTCAACTTCGGAAAATCAGAAAGGATGGCTTAACAGTCTCTCAGTATCTTGCTCAGATTAAGGATGT
TGCCGACAAGTTTTCAGCCATTGGCGAACCCTTATCCTATCGAGATCACCTTGCGTATATTCTAGAAGGGCTTGGAACAGAGTATAACCTTTTTGTGACCTCCATTCAGA
ATAGAAACGATCGTTCATCTCTTGCCGACGTACGTAGTCTGTTGTTAGCTTATGGCGCAAGACTTGAAAAGCAATCCTCAATTGATAGCCTTAATCTGGTTCAAGCCAAC
CTTGCCAATTTAACCACAAATTCAAATACCAGAAGACAAAATCGTGTCTCCCCCTTTTCCAATCCTCGTCCCCCATTTACCAGACCTCCTGCACCTTTTCCTTTTCCGCA
ACCCTTTTCGAATCCTTCAAATAGCAATAGTGTTCTAGGTCGGCCACAGTCGTCTCCTTGTCCAAAATGGCACACCCCCTCTTCTCAAAATAGACCTCAATGCCAGATTT
GCAGGAAGCTTGGTCATACGACGCTAGTTTGTTACAATCGCTTAAATCCAATGTATCAAGCTCCCTCTTCCTCTGTTGCTCCTCAAGCATTTTTTAATCAGTTTACTCAG
CCTCCTCAGCCAGCCTTACCTTGCCTTGTCGTCTCTGACTCCCCCTCATCTGTTTCGATTTCTTCTCACCCTGATGAAGCTTGGTATATGGACTACGGTGCCACTCACCA
TATGACACCGGATCTCAATAATCTTCAACAATCCAGTCCCTATTTCGGCAGTGAGCAGGTTGTGATTGGTAATGGTCCTCACCGGAGCTTTGGAAGCAATTCGATGCTTA
AACGAGCGAAACCGGAGCGTAAAATGACCATTCTGCCCCTGGAGCTTCAACAACGTCGAGACGCTTGCCTCACAGCGTCGCGACGCTAG
mRNA sequence
ATGGCGTCTCCTTCAAGTTCTTCAACCTTAGTCTCAAACGGAATTCCGATCATATCCTCTTCCGCCACTCAACCTGTTATTTCTCCTATTACTTCCCCTTCTTCTTCTCA
ACGCTTAAACCTTAATCCCCAAACCCCCTTATCGAACACCCAAACCAGACCTCTCAACCCTAATGTCCCTCCTTTTTCGACTGGCGTCAATTCCAGTTTTCTTCCTCATC
CCAACATTATAAATCAACCACGTGGTCCTGCTGCTCCTACAGGATTTGCTTACCCTCCAGCTCCCTCGCCTATGTTACCATTTTTTCCGACGTATAACTTGAACCCTCCT
ATATTTCCTCAGCATCCTGTAACACCCTCTCCCTTTCCATCACTGACCCCTCCCCTCAACATTAAGCTCTCTGATTCGAACTACCTACTGTGGAAGAATGAACTGCTAAA
TCACATTATAGCCTTTGATTTGGAATCGTTCATTGATGGCACACCTGCTCCCCCAAAATTTATGGATGTTGCTCAACTCCAGGTAAATCCCCAATTTACTCTTTGGCAAA
AATATAATAGAATTCTTATGAGTTGGATGTACTCTTCCTTGAATGATGACAAACTCGGTGAAATTATTGGGTGTTCTTCTGCTTTCGATATATGGGATCATCTCCGTACT
GTTTATGAATCATCTTCCACCACAAGAATTATGGGTCTTCGGTCTCAACTTCGGAAAATCAGAAAGGATGGCTTAACAGTCTCTCAGTATCTTGCTCAGATTAAGGATGT
TGCCGACAAGTTTTCAGCCATTGGCGAACCCTTATCCTATCGAGATCACCTTGCGTATATTCTAGAAGGGCTTGGAACAGAGTATAACCTTTTTGTGACCTCCATTCAGA
ATAGAAACGATCGTTCATCTCTTGCCGACGTACGTAGTCTGTTGTTAGCTTATGGCGCAAGACTTGAAAAGCAATCCTCAATTGATAGCCTTAATCTGGTTCAAGCCAAC
CTTGCCAATTTAACCACAAATTCAAATACCAGAAGACAAAATCGTGTCTCCCCCTTTTCCAATCCTCGTCCCCCATTTACCAGACCTCCTGCACCTTTTCCTTTTCCGCA
ACCCTTTTCGAATCCTTCAAATAGCAATAGTGTTCTAGGTCGGCCACAGTCGTCTCCTTGTCCAAAATGGCACACCCCCTCTTCTCAAAATAGACCTCAATGCCAGATTT
GCAGGAAGCTTGGTCATACGACGCTAGTTTGTTACAATCGCTTAAATCCAATGTATCAAGCTCCCTCTTCCTCTGTTGCTCCTCAAGCATTTTTTAATCAGTTTACTCAG
CCTCCTCAGCCAGCCTTACCTTGCCTTGTCGTCTCTGACTCCCCCTCATCTGTTTCGATTTCTTCTCACCCTGATGAAGCTTGGTATATGGACTACGGTGCCACTCACCA
TATGACACCGGATCTCAATAATCTTCAACAATCCAGTCCCTATTTCGGCAGTGAGCAGGTTGTGATTGGTAATGGTCCTCACCGGAGCTTTGGAAGCAATTCGATGCTTA
AACGAGCGAAACCGGAGCGTAAAATGACCATTCTGCCCCTGGAGCTTCAACAACGTCGAGACGCTTGCCTCACAGCGTCGCGACGCTAG
Protein sequence
MASPSSSSTLVSNGIPIISSSATQPVISPITSPSSSQRLNLNPQTPLSNTQTRPLNPNVPPFSTGVNSSFLPHPNIINQPRGPAAPTGFAYPPAPSPMLPFFPTYNLNPP
IFPQHPVTPSPFPSLTPPLNIKLSDSNYLLWKNELLNHIIAFDLESFIDGTPAPPKFMDVAQLQVNPQFTLWQKYNRILMSWMYSSLNDDKLGEIIGCSSAFDIWDHLRT
VYESSSTTRIMGLRSQLRKIRKDGLTVSQYLAQIKDVADKFSAIGEPLSYRDHLAYILEGLGTEYNLFVTSIQNRNDRSSLADVRSLLLAYGARLEKQSSIDSLNLVQAN
LANLTTNSNTRRQNRVSPFSNPRPPFTRPPAPFPFPQPFSNPSNSNSVLGRPQSSPCPKWHTPSSQNRPQCQICRKLGHTTLVCYNRLNPMYQAPSSSVAPQAFFNQFTQ
PPQPALPCLVVSDSPSSVSISSHPDEAWYMDYGATHHMTPDLNNLQQSSPYFGSEQVVIGNGPHRSFGSNSMLKRAKPERKMTILPLELQQRRDACLTASRR