; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035147 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035147
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr3:15724916..15726207
RNA-Seq ExpressionLag0035147
SyntenyLag0035147
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8516701.1 hypothetical protein F0562_016793 [Nyssa sinensis]2.3e-4433.73Show/hide
Query:  MLHAHSLFDIVDGSKSCPNEFLKDSDGC--------------------------------------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAK
        +L AHSL   +DG+  CPN+F++D  G                                        +S+E W A E+RFS+ T S+I +LKSALH+++K
Subjt:  MLHAHSLFDIVDGSKSCPNEFLKDSDGC--------------------------------------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAK

Query:  SPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKLSTASINPTAMFARGVNQNQS
           +SID Y+ +IK+  D L +VSV ++DED+L+Y LNGL  ++N+FKTS RT+S ++TL+E++A+LK + + IE  +K + +   P AM A     N S
Subjt:  SPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKLSTASINPTAMFARGVNQNQS

Query:  SRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG--------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA
        S  RG + +N +G+       S RG    S G + S NFG        + P +S     +S     ++        L+C   ++ SYQG+ P  +L AM+
Subjt:  SRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG--------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA

Query:  IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSANLLSVSQCCVDNNFVFTFG
           +  S  +   W  D+G   H+T D + L     + G + +T+AN Q L  + +G S++        ++++LCVP ++ NLLSV Q C DN+  F F 
Subjt:  IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSANLLSVSQCCVDNNFVFTFG

Query:  ANWFTIQDKDTGQIL
        +  F IQDK T Q+L
Subjt:  ANWFTIQDKDTGQIL

KAA8519786.1 hypothetical protein F0562_014124 [Nyssa sinensis]1.0e-4433.73Show/hide
Query:  MLHAHSLFDIVDGSKSCPNEFLKDSDGC--------------------------------------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAK
        +L AHSL   +DG+  CPN+F++D  G                                        +S+E W A E+RFS+ T S+I +LKSALH+++K
Subjt:  MLHAHSLFDIVDGSKSCPNEFLKDSDGC--------------------------------------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAK

Query:  SPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKLSTASINPTAMFARGVNQNQS
           +SID Y+ +IK+  D L +VSV ++DED+L+Y LNGL  ++N+FKTS RT+S ++TL+E++A+LK + + IE  +K + +   P AM A     N S
Subjt:  SPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKLSTASINPTAMFARGVNQNQS

Query:  SRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG--------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA
        S  RG + +N +G+       S RG    S G + S NFG        + P +S     +S     ++        L+C   ++ SYQG+ P  +L AM+
Subjt:  SRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG--------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA

Query:  IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSANLLSVSQCCVDNNFVFTFG
           +  S  +   W  D+G   H+T D + L     + G + +T+AN Q L  + +G S++        ++++LCVP ++ NLLSV Q C DN+  F F 
Subjt:  IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSANLLSVSQCCVDNNFVFTFG

Query:  ANWFTIQDKDTGQIL
        +  F IQDK T Q+L
Subjt:  ANWFTIQDKDTGQIL

KAA8521875.1 hypothetical protein F0562_012811 [Nyssa sinensis]2.3e-4433.73Show/hide
Query:  MLHAHSLFDIVDGSKSCPNEFLKDSDGC--------------------------------------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAK
        +L AHSL   +DG+  CPN+F++D  G                                        +S+E W A E+RFS+ T S+I +LKSALH+++K
Subjt:  MLHAHSLFDIVDGSKSCPNEFLKDSDGC--------------------------------------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAK

Query:  SPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKLSTASINPTAMFARGVNQNQS
           +SID Y+ +IK+  D L +VSV ++DED+L+Y LNGL  ++N+FKTS RT+S ++TL+E++A+LK + + IE  +K + +   P AM A     N S
Subjt:  SPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKLSTASINPTAMFARGVNQNQS

Query:  SRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG--------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA
        S  RG + +N +G+       S RG    S G + S NFG        + P +S     +S     ++        L+C   ++ SYQG+ P  +L AM+
Subjt:  SRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG--------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA

Query:  IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSANLLSVSQCCVDNNFVFTFG
           +  S  +   W  D+G   H+T D + L     + G + +T+AN Q L  + +G S++        ++++LCVP ++ NLLSV Q C DN+  F F 
Subjt:  IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSANLLSVSQCCVDNNFVFTFG

Query:  ANWFTIQDKDTGQIL
        +  F IQDK T Q+L
Subjt:  ANWFTIQDKDTGQIL

KAA8524269.1 hypothetical protein F0562_010692 [Nyssa sinensis]1.0e-4433.73Show/hide
Query:  MLHAHSLFDIVDGSKSCPNEFLKDSDGC--------------------------------------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAK
        +L AHSL   +DG+  CPN+F++D  G                                        +S+E W A E+RFS+ T S+I +LKSALH+++K
Subjt:  MLHAHSLFDIVDGSKSCPNEFLKDSDGC--------------------------------------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAK

Query:  SPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKLSTASINPTAMFARGVNQNQS
           +SID Y+ +IK+  D L +VSV ++DED+L+Y LNGL  ++N+FKTS RT+S ++TL+E++A+LK + + IE  +K + +   P AM A     N S
Subjt:  SPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKLSTASINPTAMFARGVNQNQS

Query:  SRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG--------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA
        S  RG + +N +G+       S RG    S G + S NFG        + P +S     +S     ++        L+C   ++ SYQG+ P  +L AM+
Subjt:  SRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG--------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA

Query:  IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSANLLSVSQCCVDNNFVFTFG
           +  S  +   W  D+G   H+T D + L     + G + +T+AN Q L  + +G S++        ++++LCVP ++ NLLSV Q C DN+  F F 
Subjt:  IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSANLLSVSQCCVDNNFVFTFG

Query:  ANWFTIQDKDTGQIL
        +  F IQDK T Q+L
Subjt:  ANWFTIQDKDTGQIL

KAA8535282.1 hypothetical protein F0562_030285 [Nyssa sinensis]1.7e-4433.73Show/hide
Query:  MLHAHSLFDIVDGSKSCPNEFLKDSDGC--------------------------------------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAK
        +L AHSL   +DG+  CPN+F++D  G                                        +S+E W A E+RFS+ T S+I +LKSALH+++K
Subjt:  MLHAHSLFDIVDGSKSCPNEFLKDSDGC--------------------------------------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAK

Query:  SPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKLSTASINPTAMFARGVNQNQS
           +SID Y+ +IK   D L +VSV ++DED+L+Y LNGL  ++N+FKTS RT+S ++TL+E++A+LK + + IE  +K + +   P AM A     N S
Subjt:  SPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKLSTASINPTAMFARGVNQNQS

Query:  SRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG--------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA
        S  RG + +N +G+       S RG    S G + S NFG        + P +S     +S     ++        L+C   ++ SYQG+ P  +L AM+
Subjt:  SRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG--------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA

Query:  IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSANLLSVSQCCVDNNFVFTFG
           +  S  +   W  D+G   H+T D + L     + G + +T+AN Q L  + +G S++        ++++LCVP ++ NLLSV Q C DN+  F F 
Subjt:  IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSANLLSVSQCCVDNNFVFTFG

Query:  ANWFTIQDKDTGQIL
        +  F IQDK T Q+L
Subjt:  ANWFTIQDKDTGQIL

TrEMBL top hitse value%identityAlignment
A0A2N9I6U7 Uncharacterized protein8.4e-4534.42Show/hide
Query:  MLHAHSLFDIVDGSKSCPNEFLKDSD--GCKSSKEVWTAREKRFSSLTHSHIHELKSALHSVAKSPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYT
        +L A+++   VDG++ CP +F+ +S+  G  ++  VW+  EKR++S + S+I  LK  LH++ K   +S++ +L ++K+  D+L  V V++D+E++L   
Subjt:  MLHAHSLFDIVDGSKSCPNEFLKDSD--GCKSSKEVWTAREKRFSSLTHSHIHELKSALHSVAKSPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYT

Query:  LNGLSSKFNSFKTSFRTRSGSVTLDELHAL-------LKSKLKFIEQHNKLS-TASIN-PTAMFA----RGVNQNQSSRGRGRNQNNQAGQFNSGRG--N
        L GL  ++++F T+ RTR+ + + +++H L       LKS +   + H+ ++  A+ N   A+F+    RG  +N  +RGRGRN N      N GRG  N
Subjt:  LNGLSSKFNSFKTSFRTRSGSVTLDELHAL-------LKSKLKFIEQHNKLS-TASIN-PTAMFA----RGVNQNQSSRGRGRNQNNQAGQFNSGRG--N

Query:  SDGSQGGYSSANFGRNPGRSLPNLQSSWSWSSRLLNCLNLSYQGRHPPSKLAAMAIA-NDPSSTTATWLADSGCNTHVTPDTSCLALNSNFNGKEVLTVA
        + G   G SS +F RN         +S   +    + ++ +YQG+ PPSKLAAMA   N  +S  + W++D+G   H TP+ S +  + ++ G ++ TV 
Subjt:  SDGSQGGYSSANFGRNPGRSLPNLQSSWSWSSRLLNCLNLSYQGRHPPSKLAAMAIA-NDPSSTTATWLADSGCNTHVTPDTSCLALNSNFNGKEVLTVA

Query:  NDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSANLLSVSQCCVDNNFVFTFGANWFTIQDKDTGQIL
        N   LP    G S L        +  +LCVP +S+NLLSV++ C DNN  F F A+ F I+D  TG++L
Subjt:  NDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSANLLSVSQCCVDNNFVFTFGANWFTIQDKDTGQIL

A0A5J4ZPW7 Retrotran_gag_3 domain-containing protein4.9e-4533.73Show/hide
Query:  MLHAHSLFDIVDGSKSCPNEFLKDSDGC--------------------------------------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAK
        +L AHSL   +DG+  CPN+F++D  G                                        +S+E W A E+RFS+ T S+I +LKSALH+++K
Subjt:  MLHAHSLFDIVDGSKSCPNEFLKDSDGC--------------------------------------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAK

Query:  SPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKLSTASINPTAMFARGVNQNQS
           +SID Y+ +IK+  D L +VSV ++DED+L+Y LNGL  ++N+FKTS RT+S ++TL+E++A+LK + + IE  +K + +   P AM A     N S
Subjt:  SPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKLSTASINPTAMFARGVNQNQS

Query:  SRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG--------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA
        S  RG + +N +G+       S RG    S G + S NFG        + P +S     +S     ++        L+C   ++ SYQG+ P  +L AM+
Subjt:  SRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG--------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA

Query:  IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSANLLSVSQCCVDNNFVFTFG
           +  S  +   W  D+G   H+T D + L     + G + +T+AN Q L  + +G S++        ++++LCVP ++ NLLSV Q C DN+  F F 
Subjt:  IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSANLLSVSQCCVDNNFVFTFG

Query:  ANWFTIQDKDTGQIL
        +  F IQDK T Q+L
Subjt:  ANWFTIQDKDTGQIL

A0A5J4ZT09 Flavin-containing monooxygenase1.1e-4433.73Show/hide
Query:  MLHAHSLFDIVDGSKSCPNEFLKDSDGC--------------------------------------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAK
        +L AHSL   +DG+  CPN+F++D  G                                        +S+E W A E+RFS+ T S+I +LKSALH+++K
Subjt:  MLHAHSLFDIVDGSKSCPNEFLKDSDGC--------------------------------------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAK

Query:  SPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKLSTASINPTAMFARGVNQNQS
           +SID Y+ +IK+  D L +VSV ++DED+L+Y LNGL  ++N+FKTS RT+S ++TL+E++A+LK + + IE  +K + +   P AM A     N S
Subjt:  SPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKLSTASINPTAMFARGVNQNQS

Query:  SRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG--------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA
        S  RG + +N +G+       S RG    S G + S NFG        + P +S     +S     ++        L+C   ++ SYQG+ P  +L AM+
Subjt:  SRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG--------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA

Query:  IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSANLLSVSQCCVDNNFVFTFG
           +  S  +   W  D+G   H+T D + L     + G + +T+AN Q L  + +G S++        ++++LCVP ++ NLLSV Q C DN+  F F 
Subjt:  IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSANLLSVSQCCVDNNFVFTFG

Query:  ANWFTIQDKDTGQIL
        +  F IQDK T Q+L
Subjt:  ANWFTIQDKDTGQIL

A0A5J5A1U7 Integrase catalytic domain-containing protein4.9e-4533.73Show/hide
Query:  MLHAHSLFDIVDGSKSCPNEFLKDSDGC--------------------------------------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAK
        +L AHSL   +DG+  CPN+F++D  G                                        +S+E W A E+RFS+ T S+I +LKSALH+++K
Subjt:  MLHAHSLFDIVDGSKSCPNEFLKDSDGC--------------------------------------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAK

Query:  SPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKLSTASINPTAMFARGVNQNQS
           +SID Y+ +IK+  D L +VSV ++DED+L+Y LNGL  ++N+FKTS RT+S ++TL+E++A+LK + + IE  +K + +   P AM A     N S
Subjt:  SPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKLSTASINPTAMFARGVNQNQS

Query:  SRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG--------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA
        S  RG + +N +G+       S RG    S G + S NFG        + P +S     +S     ++        L+C   ++ SYQG+ P  +L AM+
Subjt:  SRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG--------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA

Query:  IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSANLLSVSQCCVDNNFVFTFG
           +  S  +   W  D+G   H+T D + L     + G + +T+AN Q L  + +G S++        ++++LCVP ++ NLLSV Q C DN+  F F 
Subjt:  IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSANLLSVSQCCVDNNFVFTFG

Query:  ANWFTIQDKDTGQIL
        +  F IQDK T Q+L
Subjt:  ANWFTIQDKDTGQIL

A0A5J5B049 Retrotran_gag_3 domain-containing protein8.4e-4533.73Show/hide
Query:  MLHAHSLFDIVDGSKSCPNEFLKDSDGC--------------------------------------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAK
        +L AHSL   +DG+  CPN+F++D  G                                        +S+E W A E+RFS+ T S+I +LKSALH+++K
Subjt:  MLHAHSLFDIVDGSKSCPNEFLKDSDGC--------------------------------------KSSKEVWTAREKRFSSLTHSHIHELKSALHSVAK

Query:  SPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKLSTASINPTAMFARGVNQNQS
           +SID Y+ +IK   D L +VSV ++DED+L+Y LNGL  ++N+FKTS RT+S ++TL+E++A+LK + + IE  +K + +   P AM A     N S
Subjt:  SPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALLKSKLKFIEQHNKLSTASINPTAMFARGVNQNQS

Query:  SRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG--------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA
        S  RG + +N +G+       S RG    S G + S NFG        + P +S     +S     ++        L+C   ++ SYQG+ P  +L AM+
Subjt:  SRGRGRNQNNQAGQFN-----SGRGNSDGSQGGYSSANFG--------RNPGRSLPNLQSSWSWSSRL--------LNC---LNLSYQGRHPPSKLAAMA

Query:  IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSANLLSVSQCCVDNNFVFTFG
           +  S  +   W  D+G   H+T D + L     + G + +T+AN Q L  + +G S++        ++++LCVP ++ NLLSV Q C DN+  F F 
Subjt:  IANDPSSTTAT--WLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSANLLSVSQCCVDNNFVFTFG

Query:  ANWFTIQDKDTGQIL
        +  F IQDK T Q+L
Subjt:  ANWFTIQDKDTGQIL

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.3e-1424.48Show/hide
Query:  SSKEVWTAREKRFSSLTHSHIHELKSALHSVAKSPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELH-AL
        ++ ++W    K +++ ++ H+ +L++ L    K  T++ID+Y+  +    D+L  +   +D ++ +   L  L  ++         +    TL E+H  L
Subjt:  SSKEVWTAREKRFSSLTHSHIHELKSALHSVAKSPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELH-AL

Query:  LKSKLKFIEQHNKLSTASINPTAMFARGVNQNQSSRGRGRNQNNQAGQFNSGRGNSDGSQGGYSSANFGRNPGRSLPNLQS---------SWSWSSRLLN
        L  + K +     +S+A++ P  + A  V+   ++     N  N+  ++++   N++      SS NF  N  +S P L           S    S+L +
Subjt:  LKSKLKFIEQHNKLSTASINPTAMFARGVNQNQSSRGRGRNQNNQAGQFNSGRGNSDGSQGGYSSANFGRNPGRSLPNLQS---------SWSWSSRLLN

Query:  CLNLSYQGRHPPSKL------AAMAIANDPSSTTATWLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVP
         L+ S   + PPS        A +A+ +  SS    WL DSG   H+T D + L+L+  + G + + VA+   +P +  G ++L T    L++ ++L VP
Subjt:  CLNLSYQGRHPPSKL------AAMAIANDPSSTTATWLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVP

Query:  DLSANLLSVSQCCVDNNFVFTFGANWFTIQDKDTG
        ++  NL+SV + C  N     F    F ++D +TG
Subjt:  DLSANLLSVSQCCVDNNFVFTFGANWFTIQDKDTG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.6e-0822.05Show/hide
Query:  SSKEVWTAREKRFSSLTHSHIHELKSALHSVAKSPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALL
        ++ ++W    K +++ ++ H+ +L+                ++ R     D+L  +   +D ++ +   L  L   +         +    +L E+H  L
Subjt:  SSKEVWTAREKRFSSLTHSHIHELKSALHSVAKSPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFKTSFRTRSGSVTLDELHALL

Query:  KSKLKFIEQHNKLSTASINPTAMFAR--GVNQNQSSRGRGRNQNNQAGQFNSGRGNSDGSQGGYSSANFGRNPGRSLPNLQSSWSWSSRLLNCLNL----
         ++   +   N      I    +  R    N+NQ++RG  RN NN   + NS + +S GS+         R P   L   Q           C  L    
Subjt:  KSKLKFIEQHNKLSTASINPTAMFAR--GVNQNQSSRGRGRNQNNQAGQFNSGRGNSDGSQGGYSSANFGRNPGRSLPNLQSSWSWSSRLLNCLNL----

Query:  ----SYQGRHP--PSKLAAMAIANDPSSTTATWLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSA
              Q   P  P +  A    N P +    WL DSG   H+T D + L+ +  + G + + +A+   +P    G ++L T    L ++ +L VP++  
Subjt:  ----SYQGRHP--PSKLAAMAIANDPSSTTATWLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSA

Query:  NLLSVSQCCVDNNFVFTFGANWFTIQDKDTG
        NL+SV + C  N     F    F ++D +TG
Subjt:  NLLSVSQCCVDNNFVFTFGANWFTIQDKDTG

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTCACGCGCATTCCCTCTTTGATATAGTTGATGGATCAAAGTCGTGTCCCAATGAGTTCTTGAAAGATAGTGATGGATGCAAGTCTTCCAAAGAGGTATGGACTGC
CCGAGAAAAGCGTTTTTCCTCTCTTACACACTCTCATATTCATGAATTGAAATCGGCACTACACTCTGTAGCTAAGAGCCCGACTGAGTCAATTGATGAGTATTTGATTC
GAATCAAAGAGATTGTTGATAAACTTGTCACTGTCTCTGTGAAAGTTGATGATGAAGACTTACTTCTGTATACTCTAAATGGTCTGTCATCTAAATTTAACTCTTTCAAG
ACCTCATTTCGTACAAGAAGTGGTTCTGTAACGCTTGACGAATTACACGCCTTACTGAAGTCTAAATTGAAATTTATTGAGCAACATAACAAATTATCAACTGCCTCCAT
CAATCCTACAGCAATGTTCGCTCGAGGTGTCAATCAGAATCAGTCCTCTCGCGGTCGTGGTCGAAATCAAAATAATCAAGCAGGTCAGTTTAATTCTGGCCGTGGCAATT
CAGATGGGAGCCAAGGAGGTTATTCTAGTGCAAATTTTGGGAGAAATCCTGGTCGTAGTCTGCCAAATTTGCAATCATCCTGGTCATGGAGCTCTCGATTGCTAAATTGT
CTCAATCTATCTTATCAAGGTCGCCATCCCCCTTCCAAGCTCGCTGCAATGGCTATTGCTAATGATCCCTCATCAACCACAGCTACTTGGCTTGCCGACAGCGGATGTAA
CACTCATGTTACCCCTGACACTTCTTGTTTGGCCTTGAATTCTAACTTCAATGGCAAAGAGGTTCTTACTGTTGCAAACGACCAAGGACTCCCAGCTGCTCAAGCTGGTA
TCAGTACTCTTCTTACACCTCAAAGTGATCTTCATATGTCCAGTTTATTATGTGTACCGGATCTTTCAGCCAACTTACTATCTGTCTCTCAGTGTTGTGTGGATAATAAT
TTTGTTTTCACTTTTGGTGCTAATTGGTTTACAATTCAGGACAAGGACACGGGCCAAATTTTAGCTACAAGTGGAAAAATCCTGGCCGTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTTCACGCGCATTCCCTCTTTGATATAGTTGATGGATCAAAGTCGTGTCCCAATGAGTTCTTGAAAGATAGTGATGGATGCAAGTCTTCCAAAGAGGTATGGACTGC
CCGAGAAAAGCGTTTTTCCTCTCTTACACACTCTCATATTCATGAATTGAAATCGGCACTACACTCTGTAGCTAAGAGCCCGACTGAGTCAATTGATGAGTATTTGATTC
GAATCAAAGAGATTGTTGATAAACTTGTCACTGTCTCTGTGAAAGTTGATGATGAAGACTTACTTCTGTATACTCTAAATGGTCTGTCATCTAAATTTAACTCTTTCAAG
ACCTCATTTCGTACAAGAAGTGGTTCTGTAACGCTTGACGAATTACACGCCTTACTGAAGTCTAAATTGAAATTTATTGAGCAACATAACAAATTATCAACTGCCTCCAT
CAATCCTACAGCAATGTTCGCTCGAGGTGTCAATCAGAATCAGTCCTCTCGCGGTCGTGGTCGAAATCAAAATAATCAAGCAGGTCAGTTTAATTCTGGCCGTGGCAATT
CAGATGGGAGCCAAGGAGGTTATTCTAGTGCAAATTTTGGGAGAAATCCTGGTCGTAGTCTGCCAAATTTGCAATCATCCTGGTCATGGAGCTCTCGATTGCTAAATTGT
CTCAATCTATCTTATCAAGGTCGCCATCCCCCTTCCAAGCTCGCTGCAATGGCTATTGCTAATGATCCCTCATCAACCACAGCTACTTGGCTTGCCGACAGCGGATGTAA
CACTCATGTTACCCCTGACACTTCTTGTTTGGCCTTGAATTCTAACTTCAATGGCAAAGAGGTTCTTACTGTTGCAAACGACCAAGGACTCCCAGCTGCTCAAGCTGGTA
TCAGTACTCTTCTTACACCTCAAAGTGATCTTCATATGTCCAGTTTATTATGTGTACCGGATCTTTCAGCCAACTTACTATCTGTCTCTCAGTGTTGTGTGGATAATAAT
TTTGTTTTCACTTTTGGTGCTAATTGGTTTACAATTCAGGACAAGGACACGGGCCAAATTTTAGCTACAAGTGGAAAAATCCTGGCCGTGTAG
Protein sequenceShow/hide protein sequence
MLHAHSLFDIVDGSKSCPNEFLKDSDGCKSSKEVWTAREKRFSSLTHSHIHELKSALHSVAKSPTESIDEYLIRIKEIVDKLVTVSVKVDDEDLLLYTLNGLSSKFNSFK
TSFRTRSGSVTLDELHALLKSKLKFIEQHNKLSTASINPTAMFARGVNQNQSSRGRGRNQNNQAGQFNSGRGNSDGSQGGYSSANFGRNPGRSLPNLQSSWSWSSRLLNC
LNLSYQGRHPPSKLAAMAIANDPSSTTATWLADSGCNTHVTPDTSCLALNSNFNGKEVLTVANDQGLPAAQAGISTLLTPQSDLHMSSLLCVPDLSANLLSVSQCCVDNN
FVFTFGANWFTIQDKDTGQILATSGKILAV