CuGenDBv2

Lag0024059 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0024059
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr10:197351..230810
RNA-Seq Expression: Lag0024059
Synteny: Lag0024059
Gene Ontology terms:
  GO:0006071 - glycerol metabolic process (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0005737 - cytoplasm (cellular component)
  GO:0043231 - intracellular membrane-bounded organelle (cellular component)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004371 - glycerone kinase activity (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR004006 - DhaK domain
  IPR004007 - DhaL domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
  IPR025724 - GAG-pre-integrase domain
  IPR036117 - DhaL domain superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits    e value    % identity    Alignment
XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]    0.0e+00    65.07
Query:  HMTGNPRLFSTIYPSTSSPNVTIADGTTSPGSGIGNVRLTNSISLSSILSIPQFSFNLISVSKLTRDLNCCVSFFPGYCLFQDLTTKRTIGKWRESRGLY
        HMTGN  LFS        P+VT+ADG+TS   G G + LT S SLSS+L +P  SFNLIS S+LT DLNC V FF GYCLFQD  TK+ IG+  ES GLY
Subjt:  HMTGNPRLFSTIYPSTSSPNVTIADGTTSPGSGIGNVRLTNSISLSSILSIPQFSFNLISVSKLTRDLNCCVSFFPGYCLFQDLTTKRTIGKWRESRGLY

Query:  IFEPEISTVVACSKVSSPFEDHCRLGHPSISVLKSLRPQFHSLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDVWGPCPIESKRGFRYFVTFVD
        +F+ ++S  VAC  V SPFE HCRLGHPS+ VLK L P+F SLSSL C+SCQFAKFHR+S  PR  KRA APFEL+H D+WGPCP+ S+ GFRYFVTFVD
Subjt:  IFEPEISTVVACSKVSSPFEDHCRLGHPSISVLKSLRPQFHSLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDVWGPCPIESKRGFRYFVTFVD

Query:  DFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERKNRHLLETARAVMFQINVPKY
        D SR+TWLYLMKNRSELLSHF  FH EI+ QFN S+K LR+DNA EYFSH+L SYL E+GI+HQSSC DTPSQN VAERKNRHLLETARA+ FQ++V K 
Subjt:  DFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERKNRHLLETARAVMFQINVPKY

Query:  FWADAVSTACFLINRMPSSVLKGE----------------------------------------------------------------------------
        FW DAVSTACFLINRMPSSVL GE                                                                            
Subjt:  FWADAVSTACFLINRMPSSVLKGE----------------------------------------------------------------------------

Query:  ----------CNEEDDDFLVYTILS-SYEPSTDSPPSIPVSTRPPITQVYSRR--PTPSVTCPEPEASSSLDPGTSDDLPIALRKGKRQCTYPISNFVSY
                  C  EDD+  +Y + S +   STD  PS     RP I+QVYSRR  P PS +CP     SS DP  SDDLPIALRKGKR+CTYP+S+F+SY
Subjt:  ----------CNEEDDDFLVYTILS-SYEPSTDSPPSIPVSTRPPITQVYSRR--PTPSVTCPEPEASSSLDPGTSDDLPIALRKGKRQCTYPISNFVSY

Query:  SHLSSSSCSFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARLKARFVAKGHAQTYGVDYSDT
          LS S+ +F+ SL+S S+P +VHEALSHPGW+ AM+EEM ALDDN TWD VS P GKK IGCKWVFAVK+NPDG+VARLKAR VAKG+AQ YG DYSDT
Subjt:  SHLSSSSCSFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARLKARFVAKGHAQTYGVDYSDT

Query:  FSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKV------------------------IINFGMRKSKSDHSVFY
        FSP+AKL S+ LF+ +A  + W LHQLDIKN FLHGDL+EEVYMEQPPGFVAQGE+ KV                        ++ FGM+KS SDHSVFY
Subjt:  FSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKV------------------------IINFGMRKSKSDHSVFY

Query:  KRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLGAKPCSTPMMPNLQLTKEGEL
        +RSE G++LLVVYVDDIVITG+D  GI +LKTF+  QF+TKDLG LK+FLGIEV+RSKKGI LSQRKYV+DLL+E GKLGAKP  TPMMPN QL KEGEL
Subjt:  KRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLGAKPCSTPMMPNLQLTKEGEL

Query:  LKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWAGSRKDRRSTSGYCVFVGGNL
         KDPERYRRLVGKLNYLTVTRPDI YSVS+VSQFMSSPTVDHWAA+EQILCYLKAAPGRG+LYKD+GH  VECFS+A WAGSR+DRRSTSGYCVFVGGNL
Subjt:  LKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWAGSRKDRRSTSGYCVFVGGNL

Query:  VSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHF
        VSWKSKKQNVVSRSSAESEYRAMAQSVCE+VWI QLL E+GF IT P KLWCDNQ ALHIASNPVFHERTKHIEVDCHF
Subjt:  VSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHF

XP_031744754.1 uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus]    0.0e+00    65.52
Query:  FQDLTTKRTIGKWRESRGLYIFEPEISTVVACSKVSSPFEDHCRLGHPSISVLKSLRPQFHSLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDV
        F+D  TK+ IG+  ES GLY+F+ ++S  VAC  V SPFE HCRLGHPS+ VLK L P+F SLSSL C+SCQFAKFHR+S  PR  KRA APFEL+H D+
Subjt:  FQDLTTKRTIGKWRESRGLYIFEPEISTVVACSKVSSPFEDHCRLGHPSISVLKSLRPQFHSLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDV

Query:  WGPCPIESKRGFRYFVTFVDDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERK
        WGPCP+ S+ GFRYFVTFVDD SR+TWLYLMKNRSELLSHF  FH EI+ QFN S+K LR+DNA EYFSH+L SYL E+GI+HQSSC DTPSQN VAERK
Subjt:  WGPCPIESKRGFRYFVTFVDDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERK

Query:  NRHLLETARAVMFQINVPKYFWADAVSTACFLINRMPSSVLKGE--------------------------------------------------------
        NRHLLETARA+ FQ++V K FW DAVSTACFLINRMPSSVL GE                                                        
Subjt:  NRHLLETARAVMFQINVPKYFWADAVSTACFLINRMPSSVLKGE--------------------------------------------------------

Query:  ------------------------------CNEEDDDFLVYTILS-SYEPSTDSPPSIPVSTRPPITQVYSRR--PTPSVTCPEPEASSSLDPGTSDDLP
                                      C  EDD+  +Y + S +   STD  PS     RP I+QVYSRR  P PS +CP     SS DP  SDDLP
Subjt:  ------------------------------CNEEDDDFLVYTILS-SYEPSTDSPPSIPVSTRPPITQVYSRR--PTPSVTCPEPEASSSLDPGTSDDLP

Query:  IALRKGKRQCTYPISNFVSYSHLSSSSCSFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARL
        IALRKGKR+CTYP+S+F+SY  LS S+ +F+ SL+S S+P +VHEALSHPGW+ AM+EEM ALDDN TWD VS P GKK IGCKWVFAVK+NPDG+VARL
Subjt:  IALRKGKRQCTYPISNFVSYSHLSSSSCSFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARL

Query:  KARFVAKGHAQTYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKV---------------------
        KAR VAKG+AQ YG DYSDTFSP+AKL S+ LF+ +A  + W LHQLDIKN FLHGDL+EEVYMEQPPGFVAQGE+ KV                     
Subjt:  KARFVAKGHAQTYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKV---------------------

Query:  ---IINFGMRKSKSDHSVFYKRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLG
           ++ FGM+KS SDHSVFY+RSE G++LLVVYVDDIVITG+D  GI +LKTF+  QF+TKDLG LK+FLGIEV+RSKKGI LSQRKYV+DLL+E GKLG
Subjt:  ---IINFGMRKSKSDHSVFYKRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLG

Query:  AKPCSTPMMPNLQLTKEGELLKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWA
        AKP  TPMMPN QL KEGEL KDPERYRRLVGKLNYLTVTRPDI YSVS+VSQFMSSPTVDHWAA+EQILCYLKAAPGRG+LYKD+GH  VECFS+A WA
Subjt:  AKPCSTPMMPNLQLTKEGELLKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWA

Query:  GSRKDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHF
        GSR+DRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCE+VWI QLL E+GF IT P KLWCDNQ ALHIASNPVFHERTKHIEVDCHF
Subjt:  GSRKDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHF

XP_031744755.1 uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus]    0.0e+00    65.52
Query:  FQDLTTKRTIGKWRESRGLYIFEPEISTVVACSKVSSPFEDHCRLGHPSISVLKSLRPQFHSLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDV
        F+D  TK+ IG+  ES GLY+F+ ++S  VAC  V SPFE HCRLGHPS+ VLK L P+F SLSSL C+SCQFAKFHR+S  PR  KRA APFEL+H D+
Subjt:  FQDLTTKRTIGKWRESRGLYIFEPEISTVVACSKVSSPFEDHCRLGHPSISVLKSLRPQFHSLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDV

Query:  WGPCPIESKRGFRYFVTFVDDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERK
        WGPCP+ S+ GFRYFVTFVDD SR+TWLYLMKNRSELLSHF  FH EI+ QFN S+K LR+DNA EYFSH+L SYL E+GI+HQSSC DTPSQN VAERK
Subjt:  WGPCPIESKRGFRYFVTFVDDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERK

Query:  NRHLLETARAVMFQINVPKYFWADAVSTACFLINRMPSSVLKGE--------------------------------------------------------
        NRHLLETARA+ FQ++V K FW DAVSTACFLINRMPSSVL GE                                                        
Subjt:  NRHLLETARAVMFQINVPKYFWADAVSTACFLINRMPSSVLKGE--------------------------------------------------------

Query:  ------------------------------CNEEDDDFLVYTILS-SYEPSTDSPPSIPVSTRPPITQVYSRR--PTPSVTCPEPEASSSLDPGTSDDLP
                                      C  EDD+  +Y + S +   STD  PS     RP I+QVYSRR  P PS +CP     SS DP  SDDLP
Subjt:  ------------------------------CNEEDDDFLVYTILS-SYEPSTDSPPSIPVSTRPPITQVYSRR--PTPSVTCPEPEASSSLDPGTSDDLP

Query:  IALRKGKRQCTYPISNFVSYSHLSSSSCSFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARL
        IALRKGKR+CTYP+S+F+SY  LS S+ +F+ SL+S S+P +VHEALSHPGW+ AM+EEM ALDDN TWD VS P GKK IGCKWVFAVK+NPDG+VARL
Subjt:  IALRKGKRQCTYPISNFVSYSHLSSSSCSFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARL

Query:  KARFVAKGHAQTYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKV---------------------
        KAR VAKG+AQ YG DYSDTFSP+AKL S+ LF+ +A  + W LHQLDIKN FLHGDL+EEVYMEQPPGFVAQGE+ KV                     
Subjt:  KARFVAKGHAQTYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKV---------------------

Query:  ---IINFGMRKSKSDHSVFYKRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLG
           ++ FGM+KS SDHSVFY+RSE G++LLVVYVDDIVITG+D  GI +LKTF+  QF+TKDLG LK+FLGIEV+RSKKGI LSQRKYV+DLL+E GKLG
Subjt:  ---IINFGMRKSKSDHSVFYKRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLG

Query:  AKPCSTPMMPNLQLTKEGELLKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWA
        AKP  TPMMPN QL KEGEL KDPERYRRLVGKLNYLTVTRPDI YSVS+VSQFMSSPTVDHWAA+EQILCYLKAAPGRG+LYKD+GH  VECFS+A WA
Subjt:  AKPCSTPMMPNLQLTKEGELLKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWA

Query:  GSRKDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHF
        GSR+DRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCE+VWI QLL E+GF IT P KLWCDNQ ALHIASNPVFHERTKHIEVDCHF
Subjt:  GSRKDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHF

XP_031744756.1 uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus]    0.0e+00    65.52
Query:  FQDLTTKRTIGKWRESRGLYIFEPEISTVVACSKVSSPFEDHCRLGHPSISVLKSLRPQFHSLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDV
        F+D  TK+ IG+  ES GLY+F+ ++S  VAC  V SPFE HCRLGHPS+ VLK L P+F SLSSL C+SCQFAKFHR+S  PR  KRA APFEL+H D+
Subjt:  FQDLTTKRTIGKWRESRGLYIFEPEISTVVACSKVSSPFEDHCRLGHPSISVLKSLRPQFHSLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDV

Query:  WGPCPIESKRGFRYFVTFVDDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERK
        WGPCP+ S+ GFRYFVTFVDD SR+TWLYLMKNRSELLSHF  FH EI+ QFN S+K LR+DNA EYFSH+L SYL E+GI+HQSSC DTPSQN VAERK
Subjt:  WGPCPIESKRGFRYFVTFVDDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERK

Query:  NRHLLETARAVMFQINVPKYFWADAVSTACFLINRMPSSVLKGE--------------------------------------------------------
        NRHLLETARA+ FQ++V K FW DAVSTACFLINRMPSSVL GE                                                        
Subjt:  NRHLLETARAVMFQINVPKYFWADAVSTACFLINRMPSSVLKGE--------------------------------------------------------

Query:  ------------------------------CNEEDDDFLVYTILS-SYEPSTDSPPSIPVSTRPPITQVYSRR--PTPSVTCPEPEASSSLDPGTSDDLP
                                      C  EDD+  +Y + S +   STD  PS     RP I+QVYSRR  P PS +CP     SS DP  SDDLP
Subjt:  ------------------------------CNEEDDDFLVYTILS-SYEPSTDSPPSIPVSTRPPITQVYSRR--PTPSVTCPEPEASSSLDPGTSDDLP

Query:  IALRKGKRQCTYPISNFVSYSHLSSSSCSFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARL
        IALRKGKR+CTYP+S+F+SY  LS S+ +F+ SL+S S+P +VHEALSHPGW+ AM+EEM ALDDN TWD VS P GKK IGCKWVFAVK+NPDG+VARL
Subjt:  IALRKGKRQCTYPISNFVSYSHLSSSSCSFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARL

Query:  KARFVAKGHAQTYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKV---------------------
        KAR VAKG+AQ YG DYSDTFSP+AKL S+ LF+ +A  + W LHQLDIKN FLHGDL+EEVYMEQPPGFVAQGE+ KV                     
Subjt:  KARFVAKGHAQTYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKV---------------------

Query:  ---IINFGMRKSKSDHSVFYKRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLG
           ++ FGM+KS SDHSVFY+RSE G++LLVVYVDDIVITG+D  GI +LKTF+  QF+TKDLG LK+FLGIEV+RSKKGI LSQRKYV+DLL+E GKLG
Subjt:  ---IINFGMRKSKSDHSVFYKRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLG

Query:  AKPCSTPMMPNLQLTKEGELLKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWA
        AKP  TPMMPN QL KEGEL KDPERYRRLVGKLNYLTVTRPDI YSVS+VSQFMSSPTVDHWAA+EQILCYLKAAPGRG+LYKD+GH  VECFS+A WA
Subjt:  AKPCSTPMMPNLQLTKEGELLKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWA

Query:  GSRKDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHF
        GSR+DRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCE+VWI QLL E+GF IT P KLWCDNQ ALHIASNPVFHERTKHIEVDCHF
Subjt:  GSRKDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHF

XP_031744758.1 uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus]    0.0e+00    65.59
Query:  QDLTTKRTIGKWRESRGLYIFEPEISTVVACSKVSSPFEDHCRLGHPSISVLKSLRPQFHSLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDVW
        QD  TK+ IG+  ES GLY+F+ ++S  VAC  V SPFE HCRLGHPS+ VLK L P+F SLSSL C+SCQFAKFHR+S  PR  KRA APFEL+H D+W
Subjt:  QDLTTKRTIGKWRESRGLYIFEPEISTVVACSKVSSPFEDHCRLGHPSISVLKSLRPQFHSLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDVW

Query:  GPCPIESKRGFRYFVTFVDDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERKN
        GPCP+ S+ GFRYFVTFVDD SR+TWLYLMKNRSELLSHF  FH EI+ QFN S+K LR+DNA EYFSH+L SYL E+GI+HQSSC DTPSQN VAERKN
Subjt:  GPCPIESKRGFRYFVTFVDDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERKN

Query:  RHLLETARAVMFQINVPKYFWADAVSTACFLINRMPSSVLKGE---------------------------------------------------------
        RHLLETARA+ FQ++V K FW DAVSTACFLINRMPSSVL GE                                                         
Subjt:  RHLLETARAVMFQINVPKYFWADAVSTACFLINRMPSSVLKGE---------------------------------------------------------

Query:  -----------------------------CNEEDDDFLVYTILS-SYEPSTDSPPSIPVSTRPPITQVYSRR--PTPSVTCPEPEASSSLDPGTSDDLPI
                                     C  EDD+  +Y + S +   STD  PS     RP I+QVYSRR  P PS +CP     SS DP  SDDLPI
Subjt:  -----------------------------CNEEDDDFLVYTILS-SYEPSTDSPPSIPVSTRPPITQVYSRR--PTPSVTCPEPEASSSLDPGTSDDLPI

Query:  ALRKGKRQCTYPISNFVSYSHLSSSSCSFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARLK
        ALRKGKR+CTYP+S+F+SY  LS S+ +F+ SL+S S+P +VHEALSHPGW+ AM+EEM ALDDN TWD VS P GKK IGCKWVFAVK+NPDG+VARLK
Subjt:  ALRKGKRQCTYPISNFVSYSHLSSSSCSFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARLK

Query:  ARFVAKGHAQTYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKV----------------------
        AR VAKG+AQ YG DYSDTFSP+AKL S+ LF+ +A  + W LHQLDIKN FLHGDL+EEVYMEQPPGFVAQGE+ KV                      
Subjt:  ARFVAKGHAQTYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKV----------------------

Query:  --IINFGMRKSKSDHSVFYKRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLGA
          ++ FGM+KS SDHSVFY+RSE G++LLVVYVDDIVITG+D  GI +LKTF+  QF+TKDLG LK+FLGIEV+RSKKGI LSQRKYV+DLL+E GKLGA
Subjt:  --IINFGMRKSKSDHSVFYKRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLGA

Query:  KPCSTPMMPNLQLTKEGELLKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWAG
        KP  TPMMPN QL KEGEL KDPERYRRLVGKLNYLTVTRPDI YSVS+VSQFMSSPTVDHWAA+EQILCYLKAAPGRG+LYKD+GH  VECFS+A WAG
Subjt:  KPCSTPMMPNLQLTKEGELLKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWAG

Query:  SRKDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHF
        SR+DRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCE+VWI QLL E+GF IT P KLWCDNQ ALHIASNPVFHERTKHIEVDCHF
Subjt:  SRKDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHF

TrEMBL top hits    e value    % identity    Alignment
A0A438DZQ8 Retrovirus-related Pol polyprotein from transposon TNT 1-94    0.0e+00    59.3
Query:  DHMTGNPRLFSTIYPSTSSPNVTIADGTTSPGSGIGNVRLTNSISLSSILSIPQFSFNLISVSKLTRDLNCCVSFFPGYCLFQDLTTKRTIGKWRESRGL
        DHMTGN + FST + + S+P VT+ADG+T    G G V+ T+SI+LSS+L++P  +FNLISVSKLT++LNC VSFFP +C+FQDL TKRT GK   S GL
Subjt:  DHMTGNPRLFSTIYPSTSSPNVTIADGTTSPGSGIGNVRLTNSISLSSILSIPQFSFNLISVSKLTRDLNCCVSFFPGYCLFQDLTTKRTIGKWRESRGL

Query:  YIFEPEISTVVACSKVSSPFEDHCRLGHPSISVLKSLRPQFHSLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDVWGPCPIESKRGFRYFVTFV
        YI +  +   VAC   +SP E HCRLGHPS+ VLK L PQF +L SL CESC FAK HR SL PR  KR  + FEL+H DVWGPCP+ S+ GFRYFVTFV
Subjt:  YIFEPEISTVVACSKVSSPFEDHCRLGHPSISVLKSLRPQFHSLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDVWGPCPIESKRGFRYFVTFV

Query:  DDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERKNRHLLETARAVMFQINVPK
        DDFSR+TW+Y MKNRSE+ SHF  F AEI+TQ++ S+K+LRSDN KEY S++  +Y+  +GILHQ+SCVDTPSQN VAERKNRHLLETARA+MFQ+ VPK
Subjt:  DDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERKNRHLLETARAVMFQINVPK

Query:  YFWADAVSTACFLINRMPSSVLKGE---------------------------------------------------------------------------
         FWADAVSTACFLINRMP+ VLKG+                                                                           
Subjt:  YFWADAVSTACFLINRMPSSVLKGE---------------------------------------------------------------------------

Query:  -----------CNEEDDDFLVYTILSS-----YEPSTDSPPSI----------PVSTRPPITQVYSRRPTPSVTCPEPEASSSLDPGTSDDLPIALRKGK
                    +EED+++LVY +++S          DS  S+          P   +PPI Q+YSRRP  + TCP P  SSS DP +  DLPI+L KGK
Subjt:  -----------CNEEDDDFLVYTILSS-----YEPSTDSPPSI----------PVSTRPPITQVYSRRPTPSVTCPEPEASSSLDPGTSDDLPIALRKGK

Query:  RQC--TYPISNFVSYSHLSSSSCSFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARLKARFV
        R C   Y I+NFVSY HLSSSS   +AS+ S+SVPKTV EAL+HPGW+ AM+EE+ AL+DN TW  V LP GKK +GCKWVFAVKV+PDGSVARLKAR V
Subjt:  RQC--TYPISNFVSYSHLSSSSCSFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARLKARFV

Query:  AKGHAQTYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKV------------------------II
        A+G+AQTYGVDYSDTFSP+AKL SV LFI +A    W +HQLDIKN FLHGDLEEEVY+EQPPGFVAQGE GKV                        I 
Subjt:  AKGHAQTYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKV------------------------II

Query:  NFGMRKSKSDHSVFYKRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLGAKPCS
         FGM KS+ DHSVFYK+S  G+ILLVVYV+DIVIT +D +GI  LKTF+HS+FHTKDLG LK+FLGIEV RSKK + LSQRKYV+DLL E GK+ AKPC+
Subjt:  NFGMRKSKSDHSVFYKRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLGAKPCS

Query:  TPMMPNLQL-TKEGELLKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWAGSRK
        TPM+PN+QL   +G+   +PER RR+VGKLNYLTVTRPDI Y+VS+VSQF  +PT+ HWAALEQILCYLK APG G+LY   GH  +ECFS+  WAGS+ 
Subjt:  TPMMPNLQL-TKEGELLKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWAGSRK

Query:  DRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHF
        DRRST+GYCVF GGNLV+WKSKKQ+VVSRSSAESEYRAMAQ+ CE++WI QLL E+G   T P KLWCDNQ ALHIA+NPV+HERTKHIEVDCHF
Subjt:  DRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHF

A0A438GAA6 Retrovirus-related Pol polyprotein from transposon TNT 1-94    0.0e+00    60.1
Query:  DHMTGNPRLFSTIYPSTSSPNVTIADGTTSPGSGIGNVRLTNSISLSSILSIPQFSFNLISVSKLTRDLNCCVSFFPGYCLFQDLTTKRTIGKWRESRGL
        DHMTGN + FST + + S+P VT+ADG+T    G G V+ T+SI+LSS+L++P  +FNLISVSKLT++LNC VSFFP +C+FQDL TKRT GK   S GL
Subjt:  DHMTGNPRLFSTIYPSTSSPNVTIADGTTSPGSGIGNVRLTNSISLSSILSIPQFSFNLISVSKLTRDLNCCVSFFPGYCLFQDLTTKRTIGKWRESRGL

Query:  YIFEPEISTVVACSKVSSPFEDHCRLGHPSISVLKSLRPQFHSLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDVWGPCPIESKRGFRYFVTFV
        YI +  +   VAC   +SP E HCRLGHPS+ VLK L PQF +L SL CESC FAK HR SL PR  KRA + FEL+H DVWGPCP+ S+ GFRYFVTFV
Subjt:  YIFEPEISTVVACSKVSSPFEDHCRLGHPSISVLKSLRPQFHSLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDVWGPCPIESKRGFRYFVTFV

Query:  DDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERKNRHLLETARAVMFQINVPK
        DDFSR+TW+Y MKNRSE+ SHF  F AEI+TQ++ S+K+LRSDN KEY S++  +Y+  +GILHQ+SCVDTPSQN VAERKNRHLLETARA+MFQ+ VPK
Subjt:  DDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERKNRHLLETARAVMFQINVPK

Query:  YFWADAVSTACFLINRMPSSVLKGE---------------------------------------------------------------------------
         FWADAVSTACFLINRMP+ VLK +                                                                           
Subjt:  YFWADAVSTACFLINRMPSSVLKGE---------------------------------------------------------------------------

Query:  -----------CNEEDDDFLVYTILSS-----YEPSTDSPPSI----------PVSTRPPITQVYSRRPTPSVTCPEPEASSSLDPGTSDDLPIALRKGK
                    +EED+++LVY +++S          DS  S+          P   +PPI QVYSRRP  + TCP P  SSS DP +  DLPI+LRKGK
Subjt:  -----------CNEEDDDFLVYTILSS-----YEPSTDSPPSI----------PVSTRPPITQVYSRRPTPSVTCPEPEASSSLDPGTSDDLPIALRKGK

Query:  RQC--TYPISNFVSYSHLSSSSCSFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARLKARFV
        R C   Y I+NFVSY HLSSSS   +AS+ S+SVPKTV EAL+HPGW+ AM+EE+ AL+DN TW  V LP GKK +GCKWVFAVKVNPDGSVARLKAR V
Subjt:  RQC--TYPISNFVSYSHLSSSSCSFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARLKARFV

Query:  AKGHAQTYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKV------------------------II
        A+G+AQTYGVDYSDTFSP+AKL SV LFI +A    W +HQLDIKN FLHGDLEEEVY+EQPPGFVAQGE GKV                        I 
Subjt:  AKGHAQTYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKV------------------------II

Query:  NFGMRKSKSDHSVFYKRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLGAKPCS
         FGM KS+ DHSVFYK+S  G+ILLVVYVDDIVITG+D +GI  LKTF+HS+FHTKDLG LK+FLGIEV RSKKG+ LSQRKYV+DLL E GK+ AKPC+
Subjt:  NFGMRKSKSDHSVFYKRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLGAKPCS

Query:  TPMMPNLQL-TKEGELLKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWAGSRK
        TPM+PN+QL   +G+   +PERYRR+VGKLNYLTVTRPDI Y+VS+VSQF S+PT+ HWAALEQILCYLK APG G+LY   GH  +ECFS+A WAGS+ 
Subjt:  TPMMPNLQL-TKEGELLKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWAGSRK

Query:  DRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHF
        DRRST+GYCVF GGNLV+WKSKKQ+VVSRSSAESEYRAM+Q+ CE++WI QLL E+G   T P KLWCDNQ ALHIA+NPV+HERTKHIEVDCHF
Subjt:  DRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHF

A0A438GAQ9 Retrovirus-related Pol polyprotein from transposon TNT 1-94    0.0e+00    60.45
Query:  DHMTGNPRLFSTIYPSTSSPNVTIADGTTSPGSGIGNVRLTNSISLSSILSIPQFSFNLISVSKLTRDLNCCVSFFPGYCLFQDLTTKRTIGKWRESRGL
        DHMTGN + FST + + S+P VT+ADG+T    G G V+ T+SI+LSS+L++P  +FNLISVSKLT++LNC VSFFP +C+FQDL TKRT GK   S GL
Subjt:  DHMTGNPRLFSTIYPSTSSPNVTIADGTTSPGSGIGNVRLTNSISLSSILSIPQFSFNLISVSKLTRDLNCCVSFFPGYCLFQDLTTKRTIGKWRESRGL

Query:  YIFEPEISTVVACSKVSSPFEDHCRLGHPSISVLKSLRPQFHSLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDVWGPCPIESKRGFRYFVTFV
        YI +  +   VAC   +SP E HCRLGHPS+ VLK L PQF +L SL CESC FAK HR SL PR  KRA + FEL+H DVWGPCP+ S+ GFRYFVTFV
Subjt:  YIFEPEISTVVACSKVSSPFEDHCRLGHPSISVLKSLRPQFHSLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDVWGPCPIESKRGFRYFVTFV

Query:  DDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERKNRHLLETARAVMFQINVPK
        DDFSR+TW+Y MKNRSE+ SHF  F AEI+TQ++ S+K+LRSDN KEY S++  +Y+  +GILHQ+SCVDTP QN VAERKNRHLLET RA+MFQ+ VPK
Subjt:  DDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERKNRHLLETARAVMFQINVPK

Query:  YFWADAVSTACFLINRMPSSVLKGE---------------------------------------------------------------------------
         FWADAVSTACFLINRMP+ VLKG+                                                                           
Subjt:  YFWADAVSTACFLINRMPSSVLKGE---------------------------------------------------------------------------

Query:  -----------CNEEDDDFLVYTILSS-----YEPSTDSPPSI----------PVSTRPPITQVYSRRPTPSVTCPEPEASSSLDPGTSDDLPIALRKGK
                    +EED+++LVY +++S          DS  S+          P   +PPI QVYS RP  + TCP P  SSS DP +  DLPI+LRKGK
Subjt:  -----------CNEEDDDFLVYTILSS-----YEPSTDSPPSI----------PVSTRPPITQVYSRRPTPSVTCPEPEASSSLDPGTSDDLPIALRKGK

Query:  RQC--TYPISNFVSYSHLSSSSCSFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARLKARFV
        R C   Y I+NFVSY HLSSSS   +AS+ S+SVPKTV EAL+HPGW+ AM+EE+ AL+DN TW  V LP GKK +GCKWVFAVKVNPDGSVARLKAR V
Subjt:  RQC--TYPISNFVSYSHLSSSSCSFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARLKARFV

Query:  AKGHAQTYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKVIINFGMRKSKSDHSVFYKRSENGVIL
        A+G+AQTYGVDYSDTFSP+AKL SV LFI +A    W +HQLDIKN FLHGDLEEEVY+EQPP        GK I  FGM KS+ DHSVFYK+S  G+IL
Subjt:  AKGHAQTYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKVIINFGMRKSKSDHSVFYKRSENGVIL

Query:  LVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLGAKPCSTPMMPNLQL-TKEGELLKDPERYR
        LVVYVDDIVITG+D +GI  LKTF+HS+FHTKDLG LK+FLGIEV RSKKG+ LSQRKYV+DLL E GK+ AKPC+TPM+PN+QL   +G+   +PERYR
Subjt:  LVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLGAKPCSTPMMPNLQL-TKEGELLKDPERYR

Query:  RLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWAGSRKDRRSTSGYCVFVGGNLVSWKSKKQ
        R+VGKLNYL VTRPDI Y+VS+VSQF S+PT+ HWAALEQILCYLK APG G+LY   GH  +ECFS+A WAGS+ DRRST+GYCVF GGNLV+WKSKKQ
Subjt:  RLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWAGSRKDRRSTSGYCVFVGGNLVSWKSKKQ

Query:  NVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHF
        +VVSRSSAESE RAMAQ+ CE++WI QLL E+G   T P KLWCDNQ ALHIA+NPV+HERTKHIEVDCHF
Subjt:  NVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHF

A0A438IRR9 Retrovirus-related Pol polyprotein from transposon TNT 1-94    0.0e+00    59.8
Query:  DHMTGNPRLFSTIYPSTSSPNVTIADGTTSPGSGIGNVRLTNSISLSSILSIPQFSFNLISVSKLTRDLNCCVSFFPGYCLFQDLTTKRTIGKWRESRGL
        DHMTGN + FST + + S+P VT+ADG+T    G G V+ T+SI+LSS+L++P  +FNLISVSKLT++LN  VSFFP +C+FQDL TKRT GK   S GL
Subjt:  DHMTGNPRLFSTIYPSTSSPNVTIADGTTSPGSGIGNVRLTNSISLSSILSIPQFSFNLISVSKLTRDLNCCVSFFPGYCLFQDLTTKRTIGKWRESRGL

Query:  YIFEPEISTVVACSKVSSPFEDHCRLGHPSISVLKSLRPQFHSLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDVWGPCPIESKRGFRYFVTFV
        YI +  +   VAC   +SP E HC+LGHPS+ VLK L PQF +L SL CESC FAK HR SL PR  KRA + FEL+H DVWGPCP+ S+ GFRYFVTFV
Subjt:  YIFEPEISTVVACSKVSSPFEDHCRLGHPSISVLKSLRPQFHSLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDVWGPCPIESKRGFRYFVTFV

Query:  DDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERKNRHLLETARAVMFQINVPK
        DDFSR+TW+Y MKNRSE+ SHF  F AEI+TQ++ S+K+LRSDN KEY S++  +Y+ ++GILHQ+SCVDTPSQN VAERKNRHLLETARA+MFQ+ VPK
Subjt:  DDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERKNRHLLETARAVMFQINVPK

Query:  YFWADAVSTACFLINRMPSSVLKGE---------------------------------------------------------------------------
         FWADAVSTACFLINRMP+ VLKG+                                                                           
Subjt:  YFWADAVSTACFLINRMPSSVLKGE---------------------------------------------------------------------------

Query:  -----------CNEEDDDFLVYTILSS-----YEPSTDSPPSI----------PVSTRPPITQVYSRRPTPSVTCPEPEASSSLDPGTSDDLPIALRKGK
                    +EED+++LVY +++S          DS  S+          P   +PPI QVYSRRP  + TCP P  SSS DP +  DLPI+LRKGK
Subjt:  -----------CNEEDDDFLVYTILSS-----YEPSTDSPPSI----------PVSTRPPITQVYSRRPTPSVTCPEPEASSSLDPGTSDDLPIALRKGK

Query:  R--QCTYPISNFVSYSHLSSSSCSFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARLKARFV
        R  +  Y I+NFVSY HLSSSS   +AS+ S+SVPKTV EAL+HPGW+ AM+EE+ AL+DN TW  V LP GKK +GCKWVFAVKVN DGSVARLKAR V
Subjt:  R--QCTYPISNFVSYSHLSSSSCSFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARLKARFV

Query:  AKGHAQTYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKV------------------------II
        A+G+AQTYGVDYSDTFSP+AKL SV LFI +A    W +HQLDIKN FLHGDLEEEVY+EQPPGFVAQGE GKV                        I 
Subjt:  AKGHAQTYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKV------------------------II

Query:  NFGMRKSKSDHSVFYKRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLGAKPCS
         FGM KS+ DHSVFYK+S  G+ILLVVYVDDIVITG+D +GI  LKTF+HS+FHTKDLG LK+FLGIEV RSKKG+ LSQRKYV+DLL E GK+ AKPC+
Subjt:  NFGMRKSKSDHSVFYKRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLGAKPCS

Query:  TPMMPNLQL-TKEGELLKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWAGSRK
        TPM+PN+QL   +G+   +PERYRR+VGKLNYLTVTRPDI Y+VS+VSQF S+PT+ HWAALEQILCYLK APG G+LY   GH  +ECFS+A WAGS+ 
Subjt:  TPMMPNLQL-TKEGELLKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWAGSRK

Query:  DRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHF
        DRRST+GYCVF GGNLV+WKSKKQ+VVSRSSAESEYRAMAQ+ CE++WI QLL E+G   T P KLWCDNQ ALHIA+NP++HERTKHIEVDCHF
Subjt:  DRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHF

B0FBS2 Uncharacterized protein    0.0e+00    60.2
Query:  DHMTGNPRLFSTIYPSTSSPNVTIADGTTSPGSGIGNVRLTNSISLSSILSIPQFSFNLISVSKLTRDLNCCVSFFPGYCLFQDLTTKRTIGKWRESRGL
        DHMTGN + FST + + S+P VT+ADG+T    G G V+ T+SI+LSS+L++P  +FNLISVSKLT++LNC VSFFP +C+FQDL TKRT GK   S GL
Subjt:  DHMTGNPRLFSTIYPSTSSPNVTIADGTTSPGSGIGNVRLTNSISLSSILSIPQFSFNLISVSKLTRDLNCCVSFFPGYCLFQDLTTKRTIGKWRESRGL

Query:  YIFEPEISTVVACSKVSSPFEDHCRLGHPSISVLKSLRPQFHSLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDVWGPCPIESKRGFRYFVTFV
        YI +  +   VAC   +SP E HCRLGHPS+ VLK L PQF +L SL CESC FAK HR SL PR  KRA + FEL+H DVWGPCP+ S+ GFRYFVTFV
Subjt:  YIFEPEISTVVACSKVSSPFEDHCRLGHPSISVLKSLRPQFHSLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDVWGPCPIESKRGFRYFVTFV

Query:  DDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERKNRHLLETARAVMFQINVPK
        DDFSR+TW+Y MKNRSE+ SHF  F AEI+TQ++ S+K+LRSDN KEY S++  +Y+  +GILHQ+SCVDTPSQN VAERKNRHLLETARA+MFQ+ VPK
Subjt:  DDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERKNRHLLETARAVMFQINVPK

Query:  YFWADAVSTACFLINRMPSSVLKGE---------------------------------------------------------------------------
         FWADAVSTACFLINRMP+ VLKG+                                                                           
Subjt:  YFWADAVSTACFLINRMPSSVLKGE---------------------------------------------------------------------------

Query:  -----------CNEEDDDFLVYTILSS-----YEPSTDSPPSI----------PVSTRPPITQVYSRRPTPSVTCPEPEASSSLDPGTSDDLPIALRKGK
                    +EED+++LVY +++S          DS  S+          P   +PPI QVYSRRP  + TCP P  SSS DP +  DLPI+LRKGK
Subjt:  -----------CNEEDDDFLVYTILSS-----YEPSTDSPPSI----------PVSTRPPITQVYSRRPTPSVTCPEPEASSSLDPGTSDDLPIALRKGK

Query:  RQC--TYPISNFVSYSHLSSSSCSFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARLKARFV
        R C   Y I+NFVSY HLSSSS   +AS+ S+SVPKTV EAL+HPGW+ AM+EE+ AL+DN TW  V LP GKK +GCKWVFAVKVNPDGSVARLKAR V
Subjt:  RQC--TYPISNFVSYSHLSSSSCSFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARLKARFV

Query:  AKGHAQTYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKV------------------------II
        A+G+AQTYGVDYSDTFSP+AKL SV LFI +A    W +HQLDIKN FLHGDLEEEVY+EQPPGFVAQGE GKV                        I 
Subjt:  AKGHAQTYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKV------------------------II

Query:  NFGMRKSKSDHSVFYKRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLGAKPCS
         FGM KS+ DHSVFYK+S  G+ILLVVYVDDIVITG+D +GI  LKTF+HS+FHTKDLG LK+FLGIEV RSKKG+ LSQRKYV+DLL E GK+ AKPC+
Subjt:  NFGMRKSKSDHSVFYKRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLGAKPCS

Query:  TPMMPNLQL-TKEGELLKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWAGSRK
        TPM+PN+QL   +G+   +PERYRR+VGKLNYLTVTRPDI Y+VS+VSQF S+PT+ HWAALEQILCYLK APG G+LY   GH  +ECFS+A WAGS+ 
Subjt:  TPMMPNLQL-TKEGELLKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWAGSRK

Query:  DRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHF
        DRRST+GYCVF GGNLV+WKSKKQ+VVSRSSAESEYRAM+Q+ CE++WI QLL E+G   T P KLWCDNQ ALHIA+NPV+HERTKHIEVDCHF
Subjt:  DRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHF

SwissProt top hits | e value | %identity | Alignment
O04059 Putative 3,4-dihydroxy-2-butanone kinase | e value: 1.1e-133 | %identity: 71.93
Query:  ETNYVPITRGNRVVLMVNGLGATPMMELMIASGKAVPKLQLEHGLAVDRVYTGSFMTSLDMAGFSITIMKSDETVLQWLDAATKAPCWPIGADGSHPPAK
        ETNYVPITRG+RVVL++NGLGATP+MELMI +GKAVP+LQLEHGLAVDRVYTGSFMTSLDMAGFSI++MK+D+ +L  LDA TKAP WP+GA+G+ PPAK
Subjt:  ETNYVPITRGNRVVLMVNGLGATPMMELMIASGKAVPKLQLEHGLAVDRVYTGSFMTSLDMAGFSITIMKSDETVLQWLDAATKAPCWPIGADGSHPPAK

Query:  IPVPIPPPALATKNGETLGGPLQLNQQGIILEAAIEAAAKAVINLKDPLNDWDSKVGDGDCGSTMFRGATAILEDI-KCYPLNDAAETMNEIGSSIGRVM
        IPVP+ PP+ + K  +TL  P +L+ QG ILE AIEAAA  V+NL+D LN+WD+KVGDGDCGSTMFRGA AILED+ K YPLND AET+NEIG+SIGRVM
Subjt:  IPVPIPPPALATKNGETLGGPLQLNQQGIILEAAIEAAAKAVINLKDPLNDWDSKVGDGDCGSTMFRGATAILEDI-KCYPLNDAAETMNEIGSSIGRVM

Query:  GGTSGIIYTIFFKAAYTKLKSSSVDDITPQKWAEALEASIAAISKYGGATAGYRTLLDALIPASEVLRKRLDAGENPTTAFIHSSEAALAGAEATKNMQA
        GGTSGI+Y+IF KAAY KLK ++   +T   WA+ALEA+IAA+SKYGGA+AGYRTLLDALIPA   L++RL+AG++P  AFI S+EAA AGAE+TK+MQA
Subjt:  GGTSGIIYTIFFKAAYTKLKSSSVDDITPQKWAEALEASIAAISKYGGATAGYRTLLDALIPASEVLRKRLDAGENPTTAFIHSSEAALAGAEATKNMQA

Query:  LAGRSSYVFGEILAAVPDPGAMAAAAWYRAAAIAVRDKCQAA
         AGRS+YV G+ILA+VPDPGAMAAAAWYRAAA+AV++K   A
Subjt:  LAGRSSYVFGEILAAVPDPGAMAAAAWYRAAAIAVRDKCQAA

P04146 Copia protein | e value: 4.0e-80 | %identity: 27.17
Query:  LSSLACESCQFAKFHRVSLYPRATK-RANAPFELIHFDVWGPCPIESKRGFRYFVTFVDDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRS
        LS   CE C   K  R+       K     P  ++H DV GP    +     YFV FVD F+     YL+K +S++ S F++F A+    FN  +  L  
Subjt:  LSSLACESCQFAKFHRVSLYPRATK-RANAPFELIHFDVWGPCPIESKRGFRYFVTFVDDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRS

Query:  DNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERKNRHLLETARAVMFQINVPKYFWADAVSTACFLINRMPSSVLKGECNE--------------
        DN +EY S+ +  +  + GI +  +   TP  N V+ER  R + E AR ++    + K FW +AV TA +LINR+PS  L                    
Subjt:  DNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERKNRHLLETARAVMFQINVPKYFWADAVSTACFLINRMPSSVLKGECNE--------------

Query:  -----------------EDDDFLVYTILSSYEPS--------------------------------------TDSPPS----IPVSTRPPITQVY-----
                         + DD    +I   YEP+                                       DS  S     P  +R  I   +     
Subjt:  -----------------EDDDFLVYTILSSYEPS--------------------------------------TDSPPS----IPVSTRPPITQVY-----

Query:  ----------------------SRR------PTPSVTCP-----------------------------------------EPEASSSL------DPGTSD
                              SR+      P  S  C                                          E E +  L      +P  +D
Subjt:  ----------------------SRR------PTPSVTCP-----------------------------------------EPEASSSL------DPGTSD

Query:  DLPIALRKGKRQCTYPISNFVSYSHLSSSSCSFLASLQSV--SVPKTVHEAL---SHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVN
         + I  R+ +R  T P    +SY+   +S    + +  ++   VP +  E         W  A+  E+NA   N TW     P  K  +  +WVF+VK N
Subjt:  DLPIALRKGKRQCTYPISNFVSYSHLSSSSCSFLASLQSV--SVPKTVHEAL---SHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVN

Query:  PDGSVARLKARFVAKGHAQTYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGEN----GKVIINFGMRK-
          G+  R KAR VA+G  Q Y +DY +TF+P+A+++S    + L   ++  +HQ+D+K  FL+G L+EE+YM  P G     +N     K I  +G+++ 
Subjt:  PDGSVARLKARFVAKGHAQTYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGEN----GKVIINFGMRK-

Query:  -------------------SKSDHSVFY--KRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVI
                           S  D  ++   K + N  I +++YVDD+VI   D + +   K ++  +F   DL  +KHF+GI +   +  I LSQ  YV 
Subjt:  -------------------SKSDHSVFY--KRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVI

Query:  DLLTEKGKLGAKPCSTPMMP--NLQLTKEGELLKDPERYRRLVGKLNYLTV-TRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKD--
         +L++         STP+    N +L    E    P   R L+G L Y+ + TRPD+T +V+I+S++ S    + W  L+++L YLK      L++K   
Subjt:  DLLTEKGKLGAKPCSTPMMP--NLQLTKEGELLKDPERYRRLVGKLNYLTV-TRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKD--

Query:  YGHMNVECFSNAGWAGSRKDRRSTSGYCV-FVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNP
             +  + ++ WAGS  DR+ST+GY       NL+ W +K+QN V+ SS E+EY A+ ++V E +W++ LL  +   +  P K++ DNQ  + IA+NP
Subjt:  YGHMNVECFSNAGWAGSRKDRRSTSGYCV-FVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNP

Query:  VFHERTKHIEVDCHFETNYV
          H+R KHI++  HF    V
Subjt:  VFHERTKHIEVDCHFETNYV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 3.4e-119 | %identity: 31.26
Query:  TIADGTTSPG--SGIGNVRLTNSISLSSILS----IPQFSFNLISVSKLTRDLNCCVSFFPGYCLFQDLTTKRTIGKWRESRGLYIFEPEI-------ST
        T+  G TS    +GIG++ +  ++  + +L     +P    NLIS   L RD       +  Y   Q         KWR ++G  +    +       + 
Subjt:  TIADGTTSPG--SGIGNVRLTNSISLSSILS----IPQFSFNLISVSKLTRDLNCCVSFFPGYCLFQDLTTKRTIGKWRESRGLYIFEPEI-------ST

Query:  VVACSKVSSPFED-------HCRLGHPS-----ISVLKSLRPQFHSLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDVWGPCPIESKRGFRYFV
           C    +  +D       H R+GH S     I   KSL       +   C+ C F K HRVS    + ++ N   +L++ DV GP  IES  G +YFV
Subjt:  VVACSKVSSPFED-------HCRLGHPS-----ISVLKSLRPQFHSLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDVWGPCPIESKRGFRYFV

Query:  TFVDDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERKNRHLLETARAVMFQIN
        TF+DD SR  W+Y++K + ++   F+ FHA +  +    LK LRSDN  EY S     Y   HGI H+ +   TP  N VAER NR ++E  R+++    
Subjt:  TFVDDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERKNRHLLETARAVMFQIN

Query:  VPKYFWADAVSTACFLINRMPSSVLKGECNE------------------------------EDDDFLVYTILSSY-------------------------
        +PK FW +AV TAC+LINR PS  L  E  E                              + DD  +  I   Y                         
Subjt:  VPKYFWADAVSTACFLINRMPSSVLKGECNE------------------------------EDDDFLVYTILSSY-------------------------

Query:  -----EPSTDSPPSIPVSTRPPITQV--YSRRPTPSVTCPEPEASSSLDPGTSDDLPIALRKGKRQCTYPISNFVSYSHLSSSSCSFLASLQSVSV----
               + D    +     P    +   S  PT + +  +  +     PG   +    L +G  +  +P      +  L  S    + S +  S     
Subjt:  -----EPSTDSPPSIPVSTRPPITQV--YSRRPTPSVTCPEPEASSSLDPGTSDDLPIALRKGKRQCTYPISNFVSYSHLSSSSCSFLASLQSVSV----

Query:  ------PKTVHEALSHP---GWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARLKARFVAKGHAQTYGVDYSDTFSPIAKLASV
              P+++ E LSHP       AM EEM +L  N T+  V LP GK+P+ CKWVF +K + D  + R KAR V KG  Q  G+D+ + FSP+ K+ S+
Subjt:  ------PKTVHEALSHP---GWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARLKARFVAKGHAQTYGVDYSDTFSPIAKLASV

Query:  WLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKVI-IN---FGMR--------------------KSKSDHSVFYKR-SENGVIL
           + LA      + QLD+K  FLHGDLEEE+YMEQP GF   G+   V  +N   +G++                    K+ SD  V++KR SEN  I+
Subjt:  WLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKVI-IN---FGMR--------------------KSKSDHSVFYKR-SENGVIL

Query:  LVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSK--KGILLSQRKYVIDLLTEKGKLGAKPCSTPMMPNLQLTK---------EG
        L++YVDD++I G D   I  LK  +   F  KDLG  +  LG++++R +  + + LSQ KY+  +L       AKP STP+  +L+L+K         +G
Subjt:  LVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSK--KGILLSQRKYVIDLLTEKGKLGAKPCSTPMMPNLQLTK---------EG

Query:  ELLKDPERYRRLVGKLNYLTV-TRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWAGSRKDRRSTSGYCVFVG
         + K P  Y   VG L Y  V TRPDI ++V +VS+F+ +P  +HW A++ IL YL+   G  L +     + ++ +++A  AG   +R+S++GY     
Subjt:  ELLKDPERYRRLVGKLNYLTV-TRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWAGSRKDRRSTSGYCVFVG

Query:  GNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHF
        G  +SW+SK Q  V+ S+ E+EY A  ++  E++W+++ L ELG        ++CD+Q+A+ ++ N ++H RTKHI+V  H+
Subjt:  GNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHF

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 6.8e-136 | %identity: 31.37
Query:  HMTGNPRLFSTIYPSTSSPNVTIADGTTSPGSGIGNVRLTNS---ISLSSILSIPQFSFNLISVSKLTRDLNCCVSFFPGYCLFQDLTTKRTIGKWRESR
        H+T +    S   P T   +V +ADG+T P S  G+  L+     ++L +IL +P    NLISV +L       V FFP     +DL T   + + +   
Subjt:  HMTGNPRLFSTIYPSTSSPNVTIADGTTSPGSGIGNVRLTNS---ISLSSILSIPQFSFNLISVSKLTRDLNCCVSFFPGYCLFQDLTTKRTIGKWRESR

Query:  GLYIF----EPEISTVVACSKVSSPFEDHCRLGHPSISVLKSLRPQFH------SLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDVWGPCPIE
         LY +       +S   + S  ++    H RLGHP+ S+L S+   +       S   L+C  C   K ++V  + ++T  +  P E I+ DVW   PI 
Subjt:  GLYIF----EPEISTVVACSKVSSPFEDHCRLGHPSISVLKSLRPQFH------SLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDVWGPCPIE

Query:  SKRGFRYFVTFVDDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERKNRHLLET
        S   +RY+V FVD F+R TWLY +K +S++   F  F   +  +F   +    SDN  E+   AL  Y  +HGI H +S   TP  N ++ERK+RH++ET
Subjt:  SKRGFRYFVTFVDDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERKNRHLLET

Query:  ARAVMFQINVPKYFWADAVSTACFLINRMPSSVLKGEC-------------------------------NEEDDD-----FLVYTILSS-----------
           ++   ++PK +W  A + A +LINR+P+ +L+ E                                ++ DD      FL Y++  S           
Subjt:  ARAVMFQINVPKYFWADAVSTACFLINRMPSSVLKGEC-------------------------------NEEDDD-----FLVYTILSS-----------

Query:  ------------------------------------YEPSTDSPPSIPVSTRPPITQV-YSRRPTPSVTCP---EPEASSSLDPGTSDDLPIA------L
                                            + P T  P   PV   P  +   ++  P  S + P      +SS+LD   S   P +       
Subjt:  ------------------------------------YEPSTDSPPSIPVSTRPPITQV-YSRRPTPSVTCP---EPEASSSLDPGTSDDLPIA------L

Query:  RKGKRQCTYP-------------------------ISNFVSYSHLSSSSC--------------------------------------------------
        + G +  T P                         ++  +S    SSSS                                                   
Subjt:  RKGKRQCTYP-------------------------ISNFVSYSHLSSSSC--------------------------------------------------

Query:  ---------SFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPI-GCKWVFAVKVNPDGSVARLKARFVAKGHAQTYGVDYS
                 S   SL + S P+T  +AL    WR AM  E+NA   N TWD V  P     I GC+W+F  K N DGS+ R KAR VAKG+ Q  G+DY+
Subjt:  ---------SFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPI-GCKWVFAVKVNPDGSVARLKARFVAKGHAQTYGVDYS

Query:  DTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKV------------------------IINFGMRKSKSDHSV
        +TFSP+ K  S+ + + +A    WP+ QLD+ N FL G L ++VYM QPPGF+ +     V                        ++  G   S SD S+
Subjt:  DTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKV------------------------IINFGMRKSKSDHSV

Query:  FYKRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLGAKPCSTPMMPNLQLT-KE
        F  +    ++ ++VYVDDI+ITG+D + +      +  +F  KD   L +FLGIE  R   G+ LSQR+Y++DLL     + AKP +TPM P+ +L+   
Subjt:  FYKRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLGAKPCSTPMMPNLQLT-KE

Query:  GELLKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWAGSRKDRRSTSGYCVFVG
        G  L DP  YR +VG L YL  TRPDI+Y+V+ +SQFM  PT +H  AL++IL YL   P  G+  K    +++  +S+A WAG + D  ST+GY V++G
Subjt:  GELLKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWAGSRKDRRSTSGYCVFVG

Query:  GNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHFETNYV
         + +SW SKKQ  V RSS E+EYR++A +  E+ WI  LL ELG  +T P  ++CDN  A ++ +NPVFH R KHI +D HF  N V
Subjt:  GNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHFETNYV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 2.5e-138 | %identity: 32.39
Query:  HMTGNPRLFSTIYPSTSSPNVTIADGTTSPGSGIGNVRL---TNSISLSSILSIPQFSFNLISVSKLTRDLNCCVSFFPGYCLFQDLTTKRTI--GKWRE
        H+T +    S   P T   +V IADG+T P +  G+  L   + S+ L+ +L +P    NLISV +L       V FFP     +DL T   +  GK ++
Subjt:  HMTGNPRLFSTIYPSTSSPNVTIADGTTSPGSGIGNVRL---TNSISLSSILSIPQFSFNLISVSKLTRDLNCCVSFFPGYCLFQDLTTKRTI--GKWRE

Query:  SRGLYIFEPEISTVVACSKVSSPFED------HCRLGHPSISVLKSLRPQFHSL-------SSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDVWG
             ++E  I++  A S  +SP         H RLGHPS+++L S+    HSL         L+C  C   K H+V  +  +T  ++ P E I+ DVW 
Subjt:  SRGLYIFEPEISTVVACSKVSSPFED------HCRLGHPSISVLKSLRPQFHSL-------SSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDVWG

Query:  PCPIESKRGFRYFVTFVDDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERKNR
          PI S   +RY+V FVD F+R TWLY +K +S++   F  F + +  +F   +  L SDN  E+    L  YL +HGI H +S   TP  N ++ERK+R
Subjt:  PCPIESKRGFRYFVTFVDDFSRVTWLYLMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERKNR

Query:  HLLETARAVMFQINVPKYFWADAVSTACFLINRMPSSVL------------------------------------KGECNEEDDDFLVYT----------
        H++E    ++   +VPK +W  A S A +LINR+P+ +L                                    K E   +   F+ Y+          
Subjt:  HLLETARAVMFQINVPKYFWADAVSTACFLINRMPSSVL------------------------------------KGECNEEDDDFLVYT----------

Query:  ---------------------------ILSSYEPSTDSPPSIP-----------------------VSTRPP-------ITQVYSR--------------
                                   + +S E  +DS P+ P                        S RPP        TQV S               
Subjt:  ---------------------------ILSSYEPSTDSPPSIP-----------------------VSTRPP-------ITQVYSR--------------

Query:  ---------RPT-------------PSVTCPEPEASSSLDPGTSDDLPIALRKGKRQCTYPISNFVSYSHLSSSSC------------------------
                 +PT             P +  P P + S   P  +  LP +        T   S     S  SSS+                         
Subjt:  ---------RPT-------------PSVTCPEPEASSSLDPGTSDDLPIALRKGKRQCTYPISNFVSYSHLSSSSC------------------------

Query:  ------------------SFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPI-GCKWVFAVKVNPDGSVARLKARFVAKGH
                          S+  SL + S P+T  +A+    WR AM  E+NA   N TWD V  P     I GC+W+F  K N DGS+ R KAR VAKG+
Subjt:  ------------------SFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPI-GCKWVFAVKVNPDGSVARLKARFVAKGH

Query:  AQTYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKV------------------------IINFGM
         Q  G+DY++TFSP+ K  S+ + + +A    WP+ QLD+ N FL G L +EVYM QPPGFV +     V                        ++  G 
Subjt:  AQTYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVAQGENGKV------------------------IINFGM

Query:  RKSKSDHSVFYKRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLGAKPCSTPMM
          S SD S+F  +    +I ++VYVDDI+ITG+DT  ++     +  +F  K+   L +FLGIE  R  +G+ LSQR+Y +DLL     L AKP +TPM 
Subjt:  RKSKSDHSVFYKRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLGAKPCSTPMM

Query:  PNLQLT-KEGELLKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWAGSRKDRRS
         + +LT   G  L DP  YR +VG L YL  TRPD++Y+V+ +SQ+M  PT DHW AL+++L YL   P  G+  K    +++  +S+A WAG   D  S
Subjt:  PNLQLT-KEGELLKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWAGSRKDRRS

Query:  TSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHFETNYV
        T+GY V++G + +SW SKKQ  V RSS E+EYR++A +  EL WI  LL ELG  ++ P  ++CDN  A ++ +NPVFH R KHI +D HF  N V
Subjt:  TSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHFETNYV

Arabidopsis top hits | e value | %identity | Alignment
AT1G48430.1 Dihydroxyacetone kinase | e value: 1.2e-132 | %identity: 71.76
Query:  ETNYVPITRGNRVVLMVNGLGATPMMELMIASGKAVPKLQLEHGLAVDRVYTGSFMTSLDMAGFSITIMKSDETVLQWLDAATKAPCWPIGADGSHPPAK
        ETNYVPITRGN VVLM+NGLG TP+MELMIA+GKAVPKLQLE+GLAVDRVYTGSFMTSLDMAGFSI+IMK+D+++L+ LDA T AP WP+G DGS PPAK
Subjt:  ETNYVPITRGNRVVLMVNGLGATPMMELMIASGKAVPKLQLEHGLAVDRVYTGSFMTSLDMAGFSITIMKSDETVLQWLDAATKAPCWPIGADGSHPPAK

Query:  IPVPIPPPALATKNGETLGGPLQLNQQGIILEAAIEAAAKAVINLKDPLNDWDSKVGDGDCGSTMFRGATAILEDI-KCYPLNDAAETMNEIGSSIGRVM
        IPVP+  P  +TKN E+   P +L+QQG ILEAAIEAAA  VINLKD LN+WD KVGDGDCGSTM RGATAILED+ K YPLNDAAET++EIGSSI RVM
Subjt:  IPVPIPPPALATKNGETLGGPLQLNQQGIILEAAIEAAAKAVINLKDPLNDWDSKVGDGDCGSTMFRGATAILEDI-KCYPLNDAAETMNEIGSSIGRVM

Query:  GGTSGIIYTIFFKAAYTKLKSSSVDDITPQKWAEALEASIAAISKYGGATAGYRTLLDALIPASEVLRKRLDAGENPTTAFIHSSEAALAGAEATKNMQA
        GGTSGIIY +  KAAY +LK++S  + T + W+EAL++SI+A+SKYGGATAGYRT+LDALIPAS+VL ++L  GE+P  AF+ S+EAA AGAE+T +M+A
Subjt:  GGTSGIIYTIFFKAAYTKLKSSSVDDITPQKWAEALEASIAAISKYGGATAGYRTLLDALIPASEVLRKRLDAGENPTTAFIHSSEAALAGAEATKNMQA

Query:  LAGRSSYVFGEILAAVPDPGAMAAAAWYRAAAIAVRDKCQ
         AGRSSYV  EI A++PDPGAMAAAAWY AAA AV+++ Q
Subjt:  LAGRSSYVFGEILAAVPDPGAMAAAAWYRAAAIAVRDKCQ

AT3G17770.1 Dihydroxyacetone kinase | e value: 4.7e-132 | %identity: 70.26
Query:  ETNYVPITRGNRVVLMVNGLGATPMMELMIASGKAVPKLQLEHGLAVDRVYTGSFMTSLDMAGFSITIMKSDETVLQWLDAATKAPCWPIGADGSHPPAK
        ETNYVPITRGNRVVLMVNGLG TP+MELMIA+GKAVPKLQLE GLAVDRVYTG FMTSLDMAGFSI+IMK+D ++L  LDA TKAP WP+G DG+ PPAK
Subjt:  ETNYVPITRGNRVVLMVNGLGATPMMELMIASGKAVPKLQLEHGLAVDRVYTGSFMTSLDMAGFSITIMKSDETVLQWLDAATKAPCWPIGADGSHPPAK

Query:  IPVPIPPPALATKNGETLGGPLQLNQQGIILEAAIEAAAKAVINLKDPLNDWDSKVGDGDCGSTMFRGATAILEDIK-CYPLNDAAETMNEIGSSIGRVM
        IPVP+ PP+ + K+ E+   PL+L+++G +LEAAI+AAA  +I+LKD LN+WD KVGDGDCGSTM+RGATAILED+K  YPLNDAAET+NEIG SI R M
Subjt:  IPVPIPPPALATKNGETLGGPLQLNQQGIILEAAIEAAAKAVINLKDPLNDWDSKVGDGDCGSTMFRGATAILEDIK-CYPLNDAAETMNEIGSSIGRVM

Query:  GGTSGIIYTIFFKAAYTKLKSSSVDDITPQKWAEALEASIAAISKYGGATAGYRTLLDALIPASEVLRKRLDAGENPTTAFIHSSEAALAGAEATKNMQA
        GGTSGIIY +  KAAY +LK+++  ++TP+ W+EAL++SIA++SKYGGATAGYRT+LDALIPAS+VL ++L AGE+P +AFI S EAA AGAE+T  MQA
Subjt:  GGTSGIIYTIFFKAAYTKLKSSSVDDITPQKWAEALEASIAAISKYGGATAGYRTLLDALIPASEVLRKRLDAGENPTTAFIHSSEAALAGAEATKNMQA

Query:  LAGRSSYVFGEILAAVPDPGAMAAAAWYRAAAIAVRDKCQAAS
         AGRSSYV  E LA VPDPGAMAAA WY AAA AV+++ + +S
Subjt:  LAGRSSYVFGEILAAVPDPGAMAAAAWYRAAAIAVRDKCQAAS

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e value: 5.3e-120 | %identity: 44.31
Query:  YPISNFVSYSHLSSSSCSFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARLKARFVAKGHAQ
        + IS F+SY  +S    SFL  +     P T +EA     W  AM +E+ A++   TW+  +LP  KKPIGCKWV+ +K N DG++ R KAR VAKG+ Q
Subjt:  YPISNFVSYSHLSSSSCSFLASLQSVSVPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARLKARFVAKGHAQ

Query:  TYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVA-QGEN---------------------------GKVIINF
          G+D+ +TFSP+ KL SV L + ++ I+++ LHQLDI N FL+GDL+EE+YM+ PPG+ A QG++                              +I F
Subjt:  TYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLDIKNVFLHGDLEEEVYMEQPPGFVA-QGEN---------------------------GKVIINF

Query:  GMRKSKSDHSVFYKRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLGAKPCSTP
        G  +S SDH+ F K +    + ++VYVDDI+I  ++ + +  LK+ + S F  +DLG LK+FLG+E+ RS  GI + QRKY +DLL E G LG KP S P
Subjt:  GMRKSKSDHSVFYKRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLGAKPCSTP

Query:  MMPNLQLT-KEGELLKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWAGSRKDR
        M P++  +   G    D + YRRL+G+L YL +TR DI+++V+ +SQF  +P + H  A+ +IL Y+K   G+GL Y     M ++ FS+A +   +  R
Subjt:  MMPNLQLT-KEGELLKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWAGSRKDR

Query:  RSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCH
        RST+GYC+F+G +L+SWKSKKQ VVS+SSAE+EYRA++ +  E++W+ Q   EL   ++ PT L+CDN  A+HIA+N VFHERTKHIE DCH
Subjt:  RSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCH

ATMG00810.1 DNA/RNA polymerases superfamily protein | e value: 6.0e-47 | %identity: 42.6
Query:  LVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLGAKPCSTPMMPNLQLTKEGELLKDPERYRR
        L++YVDDI++TG   + +  L   + S F  KDLG + +FLGI++     G+ LSQ KY   +L   G L  KP STP+   L  +       DP  +R 
Subjt:  LVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGILLSQRKYVIDLLTEKGKLGAKPCSTPMMPNLQLTKEGELLKDPERYRR

Query:  LVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWAGSRKDRRSTSGYCVFVGGNLVSWKSKKQN
        +VG L YLT+TRPDI+Y+V+IV Q M  PT+  +  L+++L Y+K     GL       +NV+ F ++ WAG    RRST+G+C F+G N++SW +K+Q 
Subjt:  LVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVECFSNAGWAGSRKDRRSTSGYCVFVGGNLVSWKSKKQN

Query:  VVSRSSAESEYRAMAQSVCELVW
         VSRSS E+EYRA+A +  EL W
Subjt:  VVSRSSAESEYRAMAQSVCELVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | e value: 1.4e-19 | %identity: 48.35
Query:  PKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARLKARFVAKGHAQTYGVDYSDTFSPIAKLASV
        PK+V  AL  PGW  AM EE++AL  N TW  V  PV +  +GCKWVF  K++ DG++ RLKAR VAKG  Q  G+ + +T+SP+ + A++
Subjt:  PKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARLKARFVAKGHAQTYGVDYSDTFSPIAKLASV
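
The %identity values listed for each hit can be spot-checked against an alignment block. The sketch below is illustrative only and is not the database's own method. It assumes the usual BLAST-style convention that the middle line of each block repeats the residue letter at identical columns, shows `+` for conservative substitutions, and is blank elsewhere. The function name `percent_identity` is hypothetical.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity of an alignment, computed from its midline.

    Assumption: the midline carries the residue letter where query and hit
    are identical, '+' for a conservative substitution, and a space for a
    mismatch or gap. Gap columns count toward the alignment length.
    """
    if not query:
        raise ValueError("empty alignment row")
    # Text dumps often strip trailing spaces, so pad the midline back out.
    midline = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return 100.0 * identical / len(query)


if __name__ == "__main__":
    # Tiny synthetic row: two of four columns are identical.
    print(percent_identity("ACDE", "A+ E"))
```

For an alignment that spans several rows, concatenate the query rows and the midline rows (each padded to its row width) before calling the function, rather than averaging per-row percentages.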


Sequences
CDS sequence
ATGTCTTCTTTCCTCCTCCTTCGACATCCCGATCACATGACTGGTAATCCCCGTTTATTCTCTACCATTTATCCATCTACATCTTCCCCTAATGTCACTATAGCTGATGG
AACCACATCTCCAGGCTCAGGAATCGGCAATGTTCGACTTACCAACTCAATTTCGCTGTCTTCCATTTTGAGTATACCACAGTTTTCCTTTAACTTGATTTCTGTTAGTA
AACTTACTCGTGACCTTAATTGTTGTGTCTCATTTTTCCCTGGTTATTGCTTATTTCAGGATCTTACGACGAAGAGGACTATTGGTAAATGGCGTGAATCTAGAGGTCTC
TACATCTTTGAACCAGAAATATCAACCGTCGTTGCATGTTCTAAAGTGTCATCTCCATTTGAAGATCATTGTCGTTTGGGTCATCCTTCTATTTCAGTATTGAAGAGTCT
TCGTCCTCAGTTTCATTCTCTGTCTTCTTTAGCTTGTGAGTCATGTCAGTTTGCTAAGTTCCATCGTGTGAGTCTGTATCCCCGTGCAACTAAACGAGCTAATGCTCCAT
TTGAATTAATTCATTTTGATGTTTGGGGTCCGTGTCCTATTGAGTCAAAGAGAGGGTTTAGATATTTTGTTACCTTTGTCGATGATTTTTCTCGCGTAACTTGGTTATAT
TTAATGAAAAATCGTTCTGAGCTGCTTTCTCATTTTCGTAACTTCCATGCTGAAATCCGAACTCAATTTAACGGTTCTCTTAAAGTTCTACGGAGTGATAATGCTAAAGA
ATACTTCTCTCATGCTCTCAATTCTTATTTAGATGAACATGGCATCCTCCATCAATCCTCCTGTGTTGATACTCCATCTCAAAATTGGGTTGCAGAAAGAAAGAACCGTC
ATCTCCTTGAAACAGCAAGAGCCGTAATGTTCCAGATAAATGTTCCAAAGTATTTTTGGGCCGATGCTGTATCCACAGCTTGTTTCTTAATAAATCGCATGCCCTCCTCA
GTTCTTAAGGGGGAGTGTAATGAGGAGGATGATGATTTTCTTGTCTATACCATTCTCTCCTCTTATGAGCCTTCTACTGATTCACCTCCATCTATACCTGTTTCTACTCG
TCCACCTATTACTCAGGTTTATTCTCGACGACCAACCCCTTCAGTTACATGCCCTGAACCAGAGGCTTCTTCGTCATTGGATCCAGGAACGAGCGATGACCTTCCCATTG
CTCTTCGTAAAGGTAAACGTCAATGCACTTATCCTATTTCCAATTTTGTTTCATATAGTCATTTGTCATCTTCTTCCTGTTCTTTTCTTGCATCTTTGCAATCGGTATCT
GTTCCTAAGACTGTTCATGAAGCTTTGTCTCATCCTGGTTGGCGTGCTGCAATGGTTGAGGAGATGAATGCCTTAGATGACAATTGTACTTGGGATTTTGTTTCACTTCC
TGTAGGAAAGAAGCCTATTGGTTGTAAGTGGGTATTTGCGGTCAAGGTCAATCCAGATGGATCTGTGGCTCGCTTGAAAGCTCGCTTTGTTGCTAAAGGTCATGCACAAA
CCTATGGAGTTGACTATTCTGATACCTTTTCTCCCATTGCTAAATTGGCTTCTGTTTGGTTGTTCATTTTGTTGGCATGCATTCATCATTGGCCCTTGCATCAACTTGAT
ATTAAAAATGTCTTTCTTCATGGTGATCTTGAAGAAGAGGTGTATATGGAGCAACCACCAGGGTTTGTTGCTCAGGGGGAGAATGGAAAGGTGATTATAAATTTTGGGAT
GAGGAAAAGTAAATCAGATCATTCTGTTTTTTATAAGCGATCTGAAAATGGTGTTATCTTGTTGGTTGTGTATGTTGATGATATTGTTATTACTGGCGATGACACATCAG
GTATCCAAGCACTCAAAACCTTTGTCCATAGTCAATTCCATACAAAAGATTTGGGAACGTTGAAACACTTTCTAGGAATTGAGGTAATAAGGAGCAAGAAGGGGATCTTG
TTATCACAGAGAAAATATGTAATTGACTTGTTGACTGAAAAAGGGAAGTTAGGGGCTAAACCATGTAGTACCCCAATGATGCCGAACTTACAGCTTACGAAAGAGGGAGA
GTTACTTAAAGATCCTGAAAGGTATAGAAGGTTAGTAGGAAAACTCAATTATCTTACAGTGACTCGGCCTGACATAACTTATTCAGTGAGTATTGTGAGCCAGTTTATGT
CTTCTCCTACAGTTGATCATTGGGCTGCATTGGAACAAATTTTGTGTTATTTGAAGGCAGCTCCTGGACGTGGTTTATTATATAAGGATTATGGGCACATGAACGTTGAA
TGCTTCTCAAATGCTGGCTGGGCAGGATCTAGAAAAGATAGAAGATCAACCTCAGGATATTGTGTATTTGTGGGTGGAAATTTGGTTTCTTGGAAAAGTAAAAAACAAAA
TGTGGTGTCACGTTCGAGTGCCGAATCAGAATATAGAGCAATGGCACAGTCTGTATGTGAATTAGTCTGGATACGTCAACTTCTTGTTGAATTGGGATTTGATATCACAA
CACCGACAAAATTGTGGTGTGATAATCAAACAGCTCTTCATATTGCATCTAATCCAGTATTTCATGAGCGGACTAAACACATTGAGGTTGACTGTCACTTTGAAACTAAC
TATGTTCCAATAACACGAGGTAATCGAGTGGTGCTCATGGTCAACGGGTTAGGAGCTACCCCGATGATGGAATTGATGATTGCATCTGGGAAAGCGGTTCCTAAGTTGCA
GCTGGAGCATGGGTTGGCTGTTGATAGAGTGTACACTGGATCATTTATGACTTCTCTTGACATGGCAGGTTTTTCAATTACCATCATGAAATCAGATGAAACAGTTTTGC
AATGGTTGGATGCTGCAACCAAGGCTCCTTGTTGGCCCATTGGTGCTGATGGCAGTCACCCACCTGCCAAAATACCTGTTCCAATACCACCACCAGCTCTTGCTACAAAG
AATGGGGAGACATTGGGTGGACCCCTTCAACTAAATCAACAAGGCATCATTCTAGAGGCTGCAATTGAGGCAGCTGCCAAAGCAGTGATCAATCTCAAGGACCCATTAAA
TGACTGGGATAGCAAAGTGGGCGATGGTGATTGTGGGTCAACGATGTTTAGGGGTGCAACAGCTATTCTTGAGGACATAAAATGCTATCCACTGAACGATGCTGCTGAGA
CAATGAATGAAATTGGATCATCCATTGGAAGAGTTATGGGAGGAACTAGTGGGATCATATATACAATATTTTTCAAGGCAGCATATACAAAATTGAAATCAAGCAGCGTA
GATGACATCACCCCGCAAAAATGGGCTGAAGCATTAGAAGCTTCCATAGCTGCAATTAGCAAGTATGGTGGGGCTACGGCTGGTTATCGAACATTACTTGATGCCCTAAT
TCCAGCATCTGAAGTTCTTAGGAAGAGGTTAGATGCTGGTGAAAATCCCACCACCGCGTTTATTCATTCATCTGAAGCAGCATTGGCTGGCGCTGAAGCAACAAAAAACA
TGCAGGCTCTGGCTGGCCGTTCGTCATATGTATTTGGGGAAATCCTCGCTGCAGTTCCAGATCCAGGTGCAATGGCTGCAGCAGCATGGTATAGAGCTGCGGCTATAGCT
GTCAGGGACAAGTGCCAGGCTGCTTCATGA
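
The CDS above begins with ATG and ends with TGA, as expected for a complete coding sequence. A minimal sketch of how such basic properties might be checked after copying the sequence out of the page is given below. It uses only the standard library, assumes the standard stop codons (TAA, TAG, TGA), and the function name `check_cds` is hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumption: standard genetic code


def check_cds(cds: str) -> dict:
    """Report basic sanity properties of a coding sequence.

    Whitespace and line breaks are removed first, so the sequence can be
    pasted as wrapped lines.
    """
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    return {
        "length": len(seq),
        "multiple_of_three": len(seq) % 3 == 0,
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": seq[-3:] in STOP_CODONS,
        # Stop codons in frame before the final codon suggest a problem.
        "internal_stops": sum(1 for c in codons[:-1] if c in STOP_CODONS),
    }


if __name__ == "__main__":
    # Synthetic example, not the gene's sequence.
    print(check_cds("ATG AAA\nTGA"))
```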
mRNA sequence
ATGTCTTCTTTCCTCCTCCTTCGACATCCCGATCACATGACTGGTAATCCCCGTTTATTCTCTACCATTTATCCATCTACATCTTCCCCTAATGTCACTATAGCTGATGG
AACCACATCTCCAGGCTCAGGAATCGGCAATGTTCGACTTACCAACTCAATTTCGCTGTCTTCCATTTTGAGTATACCACAGTTTTCCTTTAACTTGATTTCTGTTAGTA
AACTTACTCGTGACCTTAATTGTTGTGTCTCATTTTTCCCTGGTTATTGCTTATTTCAGGATCTTACGACGAAGAGGACTATTGGTAAATGGCGTGAATCTAGAGGTCTC
TACATCTTTGAACCAGAAATATCAACCGTCGTTGCATGTTCTAAAGTGTCATCTCCATTTGAAGATCATTGTCGTTTGGGTCATCCTTCTATTTCAGTATTGAAGAGTCT
TCGTCCTCAGTTTCATTCTCTGTCTTCTTTAGCTTGTGAGTCATGTCAGTTTGCTAAGTTCCATCGTGTGAGTCTGTATCCCCGTGCAACTAAACGAGCTAATGCTCCAT
TTGAATTAATTCATTTTGATGTTTGGGGTCCGTGTCCTATTGAGTCAAAGAGAGGGTTTAGATATTTTGTTACCTTTGTCGATGATTTTTCTCGCGTAACTTGGTTATAT
TTAATGAAAAATCGTTCTGAGCTGCTTTCTCATTTTCGTAACTTCCATGCTGAAATCCGAACTCAATTTAACGGTTCTCTTAAAGTTCTACGGAGTGATAATGCTAAAGA
ATACTTCTCTCATGCTCTCAATTCTTATTTAGATGAACATGGCATCCTCCATCAATCCTCCTGTGTTGATACTCCATCTCAAAATTGGGTTGCAGAAAGAAAGAACCGTC
ATCTCCTTGAAACAGCAAGAGCCGTAATGTTCCAGATAAATGTTCCAAAGTATTTTTGGGCCGATGCTGTATCCACAGCTTGTTTCTTAATAAATCGCATGCCCTCCTCA
GTTCTTAAGGGGGAGTGTAATGAGGAGGATGATGATTTTCTTGTCTATACCATTCTCTCCTCTTATGAGCCTTCTACTGATTCACCTCCATCTATACCTGTTTCTACTCG
TCCACCTATTACTCAGGTTTATTCTCGACGACCAACCCCTTCAGTTACATGCCCTGAACCAGAGGCTTCTTCGTCATTGGATCCAGGAACGAGCGATGACCTTCCCATTG
CTCTTCGTAAAGGTAAACGTCAATGCACTTATCCTATTTCCAATTTTGTTTCATATAGTCATTTGTCATCTTCTTCCTGTTCTTTTCTTGCATCTTTGCAATCGGTATCT
GTTCCTAAGACTGTTCATGAAGCTTTGTCTCATCCTGGTTGGCGTGCTGCAATGGTTGAGGAGATGAATGCCTTAGATGACAATTGTACTTGGGATTTTGTTTCACTTCC
TGTAGGAAAGAAGCCTATTGGTTGTAAGTGGGTATTTGCGGTCAAGGTCAATCCAGATGGATCTGTGGCTCGCTTGAAAGCTCGCTTTGTTGCTAAAGGTCATGCACAAA
CCTATGGAGTTGACTATTCTGATACCTTTTCTCCCATTGCTAAATTGGCTTCTGTTTGGTTGTTCATTTTGTTGGCATGCATTCATCATTGGCCCTTGCATCAACTTGAT
ATTAAAAATGTCTTTCTTCATGGTGATCTTGAAGAAGAGGTGTATATGGAGCAACCACCAGGGTTTGTTGCTCAGGGGGAGAATGGAAAGGTGATTATAAATTTTGGGAT
GAGGAAAAGTAAATCAGATCATTCTGTTTTTTATAAGCGATCTGAAAATGGTGTTATCTTGTTGGTTGTGTATGTTGATGATATTGTTATTACTGGCGATGACACATCAG
GTATCCAAGCACTCAAAACCTTTGTCCATAGTCAATTCCATACAAAAGATTTGGGAACGTTGAAACACTTTCTAGGAATTGAGGTAATAAGGAGCAAGAAGGGGATCTTG
TTATCACAGAGAAAATATGTAATTGACTTGTTGACTGAAAAAGGGAAGTTAGGGGCTAAACCATGTAGTACCCCAATGATGCCGAACTTACAGCTTACGAAAGAGGGAGA
GTTACTTAAAGATCCTGAAAGGTATAGAAGGTTAGTAGGAAAACTCAATTATCTTACAGTGACTCGGCCTGACATAACTTATTCAGTGAGTATTGTGAGCCAGTTTATGT
CTTCTCCTACAGTTGATCATTGGGCTGCATTGGAACAAATTTTGTGTTATTTGAAGGCAGCTCCTGGACGTGGTTTATTATATAAGGATTATGGGCACATGAACGTTGAA
TGCTTCTCAAATGCTGGCTGGGCAGGATCTAGAAAAGATAGAAGATCAACCTCAGGATATTGTGTATTTGTGGGTGGAAATTTGGTTTCTTGGAAAAGTAAAAAACAAAA
TGTGGTGTCACGTTCGAGTGCCGAATCAGAATATAGAGCAATGGCACAGTCTGTATGTGAATTAGTCTGGATACGTCAACTTCTTGTTGAATTGGGATTTGATATCACAA
CACCGACAAAATTGTGGTGTGATAATCAAACAGCTCTTCATATTGCATCTAATCCAGTATTTCATGAGCGGACTAAACACATTGAGGTTGACTGTCACTTTGAAACTAAC
TATGTTCCAATAACACGAGGTAATCGAGTGGTGCTCATGGTCAACGGGTTAGGAGCTACCCCGATGATGGAATTGATGATTGCATCTGGGAAAGCGGTTCCTAAGTTGCA
GCTGGAGCATGGGTTGGCTGTTGATAGAGTGTACACTGGATCATTTATGACTTCTCTTGACATGGCAGGTTTTTCAATTACCATCATGAAATCAGATGAAACAGTTTTGC
AATGGTTGGATGCTGCAACCAAGGCTCCTTGTTGGCCCATTGGTGCTGATGGCAGTCACCCACCTGCCAAAATACCTGTTCCAATACCACCACCAGCTCTTGCTACAAAG
AATGGGGAGACATTGGGTGGACCCCTTCAACTAAATCAACAAGGCATCATTCTAGAGGCTGCAATTGAGGCAGCTGCCAAAGCAGTGATCAATCTCAAGGACCCATTAAA
TGACTGGGATAGCAAAGTGGGCGATGGTGATTGTGGGTCAACGATGTTTAGGGGTGCAACAGCTATTCTTGAGGACATAAAATGCTATCCACTGAACGATGCTGCTGAGA
CAATGAATGAAATTGGATCATCCATTGGAAGAGTTATGGGAGGAACTAGTGGGATCATATATACAATATTTTTCAAGGCAGCATATACAAAATTGAAATCAAGCAGCGTA
GATGACATCACCCCGCAAAAATGGGCTGAAGCATTAGAAGCTTCCATAGCTGCAATTAGCAAGTATGGTGGGGCTACGGCTGGTTATCGAACATTACTTGATGCCCTAAT
TCCAGCATCTGAAGTTCTTAGGAAGAGGTTAGATGCTGGTGAAAATCCCACCACCGCGTTTATTCATTCATCTGAAGCAGCATTGGCTGGCGCTGAAGCAACAAAAAACA
TGCAGGCTCTGGCTGGCCGTTCGTCATATGTATTTGGGGAAATCCTCGCTGCAGTTCCAGATCCAGGTGCAATGGCTGCAGCAGCATGGTATAGAGCTGCGGCTATAGCT
GTCAGGGACAAGTGCCAGGCTGCTTCATGA
Protein sequence
MSSFLLLRHPDHMTGNPRLFSTIYPSTSSPNVTIADGTTSPGSGIGNVRLTNSISLSSILSIPQFSFNLISVSKLTRDLNCCVSFFPGYCLFQDLTTKRTIGKWRESRGL
YIFEPEISTVVACSKVSSPFEDHCRLGHPSISVLKSLRPQFHSLSSLACESCQFAKFHRVSLYPRATKRANAPFELIHFDVWGPCPIESKRGFRYFVTFVDDFSRVTWLY
LMKNRSELLSHFRNFHAEIRTQFNGSLKVLRSDNAKEYFSHALNSYLDEHGILHQSSCVDTPSQNWVAERKNRHLLETARAVMFQINVPKYFWADAVSTACFLINRMPSS
VLKGECNEEDDDFLVYTILSSYEPSTDSPPSIPVSTRPPITQVYSRRPTPSVTCPEPEASSSLDPGTSDDLPIALRKGKRQCTYPISNFVSYSHLSSSSCSFLASLQSVS
VPKTVHEALSHPGWRAAMVEEMNALDDNCTWDFVSLPVGKKPIGCKWVFAVKVNPDGSVARLKARFVAKGHAQTYGVDYSDTFSPIAKLASVWLFILLACIHHWPLHQLD
IKNVFLHGDLEEEVYMEQPPGFVAQGENGKVIINFGMRKSKSDHSVFYKRSENGVILLVVYVDDIVITGDDTSGIQALKTFVHSQFHTKDLGTLKHFLGIEVIRSKKGIL
LSQRKYVIDLLTEKGKLGAKPCSTPMMPNLQLTKEGELLKDPERYRRLVGKLNYLTVTRPDITYSVSIVSQFMSSPTVDHWAALEQILCYLKAAPGRGLLYKDYGHMNVE
CFSNAGWAGSRKDRRSTSGYCVFVGGNLVSWKSKKQNVVSRSSAESEYRAMAQSVCELVWIRQLLVELGFDITTPTKLWCDNQTALHIASNPVFHERTKHIEVDCHFETN
YVPITRGNRVVLMVNGLGATPMMELMIASGKAVPKLQLEHGLAVDRVYTGSFMTSLDMAGFSITIMKSDETVLQWLDAATKAPCWPIGADGSHPPAKIPVPIPPPALATK
NGETLGGPLQLNQQGIILEAAIEAAAKAVINLKDPLNDWDSKVGDGDCGSTMFRGATAILEDIKCYPLNDAAETMNEIGSSIGRVMGGTSGIIYTIFFKAAYTKLKSSSV
DDITPQKWAEALEASIAAISKYGGATAGYRTLLDALIPASEVLRKRLDAGENPTTAFIHSSEAALAGAEATKNMQALAGRSSYVFGEILAAVPDPGAMAAAAWYRAAAIA
VRDKCQAAS