; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0021132 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0021132
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionPolyprotein
Genome locationchr10:6350798..6354183
RNA-Seq ExpressionPay0021132
SyntenyPay0021132
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025964.1 uncharacterized protein E6C27_scaffold34G002790 [Cucumis melo var. makuwa]0.0e+0081.05Show/hide
Query:  MSAEEG-----------------KEQHSPTRRSKSKGSAVREHVDTPLTNLEQGMEDVQLAVGRLSDNFEELVQENAETTSVAKEMIENMGRTFQKELKE
        MSAEEG                 KEQHSPTRRSKSKG AVREHVDT LTNLEQGMEDVQLA                                  +ELKE
Subjt:  MSAEEG-----------------KEQHSPTRRSKSKGSAVREHVDTPLTNLEQGMEDVQLAVGRLSDNFEELVQENAETTSVAKEMIENMGRTFQKELKE

Query:  LASMVTTLKAFVEGELHDLHTKSISLEMRFDALCVECRSKHATSYAPSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGVHNDEAR
        LAS VTTLKAFVEGELH+LHTKSIS E R DALCVECRSKH  S APSTSTHPTTSGTSNIKVPKPDVYNGVRNAT+VDNFLFGLERYFVALGV +DEAR
Subjt:  LASMVTTLKAFVEGELHDLHTKSISLEMRFDALCVECRSKHATSYAPSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGVHNDEAR

Query:  INHAPTFLRDAAQLWWR--------------------------------QSRGKLRRLRHTGSILDYVKEFTTLMLEIGDLPKKE--------------I
        INHAPTFLRDAAQLWWR                                +SRGKLRRLRHTGSILDYVKEFTTLMLEIGDLP+KE              I
Subjt:  INHAPTFLRDAAQLWWR--------------------------------QSRGKLRRLRHTGSILDYVKEFTTLMLEIGDLPKKE--------------I

Query:  ELDRRNVQTLDDGIAVAETLVDYSAQSMGKKPGPEKHGGKPDKTKSFGRKDGGKVKTFQWRNGKNDGAHRGESSNPSKPCFICKGPHWTRDCPNRKALNA
        ELDRRNVQTLDD IA  ETLVDYSAQS GKKPGPEKHGGKPDKTK+FGRKDGGKVKTFQW+NGKNDGAHRGE SNP KPCFICKGPHWTRDCPNRKALNA
Subjt:  ELDRRNVQTLDDGIAVAETLVDYSAQSMGKKPGPEKHGGKPDKTKSFGRKDGGKVKTFQWRNGKNDGAHRGESSNPSKPCFICKGPHWTRDCPNRKALNA

Query:  LVAKFQEVKQVEDASEPQIGSMQQIGVM-KETTAEHKGLLYGSVRIEGKEATAMFDTGASHNFMDVQEAKRLGLKFKEETGTVKVVNAKEQVIHGIAKGV
        LVAKFQEVKQVEDA  PQIGSMQQIGVM KETT EHKGLLYGS+RIEGKEA AMFDTG SHNFMDVQEAKRLGLKFKEETG VKVVNAKEQ IHG+AKGV
Subjt:  LVAKFQEVKQVEDASEPQIGSMQQIGVM-KETTAEHKGLLYGSVRIEGKEATAMFDTGASHNFMDVQEAKRLGLKFKEETGTVKVVNAKEQVIHGIAKGV

Query:  LVKIGDWQKRLDFSILPMDDFHIVLGLGFFDKVVTLLDSNRGTLSIIDGLMTTIPVRRGKPVKMLSALQFKRGVSKNECYVATMKAVETEEAKSDEPPVP
        LVKIGDWQKRLDFS+LPMDDF+IVLGLGFFDKVVTLLDSN+GTLSIIDGLMTTIP+RRGKPVKMLSALQFKRGV+KN+CYVATMK +E EEAK DEPPVP
Subjt:  LVKIGDWQKRLDFSILPMDDFHIVLGLGFFDKVVTLLDSNRGTLSIIDGLMTTIPVRRGKPVKMLSALQFKRGVSKNECYVATMKAVETEEAKSDEPPVP

Query:  DNIQNVLDEYKDIMPAELPKKLPPRRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPY------------------DYRALN
        DNIQ VLDEYKDIMP+ELPKKLPPRR VDH+IELEPGAKPP MAPYRMAP EL+ELRRQLKELLD GYIQPSKAPY                  DYRALN
Subjt:  DNIQNVLDEYKDIMPAELPKKLPPRRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPY------------------DYRALN

Query:  KITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYS
        KITIKN                    SKIDL SGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYS
Subjt:  KITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYS

Query:  QTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKK
        QTLEEHVQHLRQVFQV R+NELYIKLEKCSFAKQEV+FLGHWIKEGKLMMDNAKVRAILEWKAPTK+PEL+SFLGFVNYY RFIKGYSGIAAPLTNLLKK
Subjt:  QTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKK

Query:  NQTWDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSKFTV
        NQTWDWTEECQ+AFDRLKH VSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQD HPIAFESRKLND ER YTVQEKEMTA+VH LRTWRHYLLGSKFTV
Subjt:  NQTWDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSKFTV

Query:  MTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALSLKAELNTITTSMPTSDFLERIKEGMQHDELAKNLLKLAKEGKTRRFWENNG
        MTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGR N+VADALS KA+LN I TSMPTS+FLERIKEGMQHDELAKNLLKLAK+GKTRRFWEN+G
Subjt:  MTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALSLKAELNTITTSMPTSDFLERIKEGMQHDELAKNLLKLAKEGKTRRFWENNG

Query:  TLLTIGNRLFVPKWGSLRKDILKECHDSL
        TLLTIGNRLFVP+WG+LRKD+L+ECHDSL
Subjt:  TLLTIGNRLFVPKWGSLRKDILKECHDSL

KAA0040659.1 uncharacterized protein E6C27_scaffold370G00130 [Cucumis melo var. makuwa]0.0e+0085.75Show/hide
Query:  MSAEEG-----------------KEQHSPTRRSKSKGSAVREHVDTPLTNLEQGMEDVQLAVGRLSDNFEELVQENAETTSVAKEMIENMGRTFQKELKE
        MSAEEG                 KEQHSPTRRSKSKG AVREHVDT LTNLEQGMEDVQLAVGRLS+ FEELVQENAE TSVAKEMIE+MGRTFQKEL+E
Subjt:  MSAEEG-----------------KEQHSPTRRSKSKGSAVREHVDTPLTNLEQGMEDVQLAVGRLSDNFEELVQENAETTSVAKEMIENMGRTFQKELKE

Query:  LASMVTTLKAFVEGELHDLHTKSISLEMRFDALCVECRSKHATSYAPSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGVHNDEAR
        LAS VTTLKAFVEGELHDLHTKSIS E R DALCVECRSKH  S APSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGV +DEAR
Subjt:  LASMVTTLKAFVEGELHDLHTKSISLEMRFDALCVECRSKHATSYAPSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGVHNDEAR

Query:  INHAPTFLRDAAQLWWR--------------------------------QSRGKLRRLRHTGSILDYVKEFTTLMLEIGDLPKKE--------------I
        INHAPTFLRDAAQLWWR                                +SRGKLRRLRHTGSILDYVKEFTTLMLEIGDLP+KE              I
Subjt:  INHAPTFLRDAAQLWWR--------------------------------QSRGKLRRLRHTGSILDYVKEFTTLMLEIGDLPKKE--------------I

Query:  ELDRRNVQTLDDGIAVAETLVDYSAQSMGKKPGPEKHGGKPDKTKSFGRKDGGKVKTFQWRNGKNDGAHRGESSNPSKPCFICKGPHWTRDCPNRKALNA
        ELDRRNVQTLDD IAVAETLVDYSAQS GKKPGPEKHGGK DKTK+FGRKDGGKVKTFQW+NGKNDGAHRGESSNP KPCFICKGPHWTRDCPNRKALNA
Subjt:  ELDRRNVQTLDDGIAVAETLVDYSAQSMGKKPGPEKHGGKPDKTKSFGRKDGGKVKTFQWRNGKNDGAHRGESSNPSKPCFICKGPHWTRDCPNRKALNA

Query:  LVAKFQEVKQVEDASEPQIGSMQQIGVM-KETTAEHKGLLYGSVRIEGKEATAMFDTGASHNFMDVQEAKRLGLKFKEETGTVKVVNAKEQVIHGIAKGV
        LVAKFQE+KQVEDA  PQIGSMQQIGVM KETT EHKGLLYGS+RIEGKEAT MFDTGASHNF+DVQEAKRLGLKFKEETG VKVVNAKEQ IHG+AKGV
Subjt:  LVAKFQEVKQVEDASEPQIGSMQQIGVM-KETTAEHKGLLYGSVRIEGKEATAMFDTGASHNFMDVQEAKRLGLKFKEETGTVKVVNAKEQVIHGIAKGV

Query:  LVKIGDWQKRLDFSILPMDDFHIVLGLGFFDKVVTLLDSNRGTLSIIDGLMTTIPVRRGKPVKMLSALQFKRGVSKNECYVATMKAVETEEAKSDEPPVP
        LVKIGDWQKRLDFS+LPMDDF+IVLGLGFFDKVVTLLDSNRGTLSIIDGLMTTIP+RRGKPV+MLSALQFKRGV+KN+CYV TMK +E EEAK+DEPPVP
Subjt:  LVKIGDWQKRLDFSILPMDDFHIVLGLGFFDKVVTLLDSNRGTLSIIDGLMTTIPVRRGKPVKMLSALQFKRGVSKNECYVATMKAVETEEAKSDEPPVP

Query:  DNIQNVLDEYKDIMPAELPKKLPPRRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPY------------------DYRALN
        DNIQ VLDEYKDIMP+ELPKKLPPRR VDH+IELEPGAKPP MAPYRMAPPEL+ELRRQLKELLDAGYIQPSKAPY                  DYRALN
Subjt:  DNIQNVLDEYKDIMPAELPKKLPPRRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPY------------------DYRALN

Query:  KITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYS
        KITIKNRY IPLIAD FDQLG ARWFSKIDL SGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQ FLDRFVVVYLDDIVVYS
Subjt:  KITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYS

Query:  QTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKK
        QTLEEHVQHLRQVFQV RDNELYIKLEKCSFAKQEVEFL HWIKEGKLMMDNAKVRAILEWKAPTK+PEL+SFLGFVNYYRRFIKGYSGI APLTNLLKK
Subjt:  QTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKK

Query:  NQTWDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSKFTV
        NQTWDWTEECQ+AFDRLKH VSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQD HPIAFES KLNDTER YTVQEKEMTA+VHCLRTWRHYLLGSKFT+
Subjt:  NQTWDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSKFTV

Query:  MTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALSLKAELNTITTSMPTSDFLERIKEGMQHDELAKNLLKLAKEGKTRRF
        MTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGR N+VADALS KAELNTIT SMPTS+FLERIKEGMQHDELA+NLLKLAKEGKTRRF
Subjt:  MTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALSLKAELNTITTSMPTSDFLERIKEGMQHDELAKNLLKLAKEGKTRRF

KAA0050296.1 reverse transcriptase [Cucumis melo var. makuwa]0.0e+0087.21Show/hide
Query:  KEQHSPTRRSKSKGSAVREHVDTPLTNLEQGMEDVQLAVGRLSDNFEELVQENAETTSVAKEMIENMGRTFQKELKELASMVTTLKAFVEGELHDLHTKS
        KEQHSPTRRSKSKGSAVREHVDT LTNLEQGMEDVQLAVGRLSDNFEELVQENAE TSVAKEMIENMGRTFQKELKEL+S VTTLKAFVEGELHDLHTKS
Subjt:  KEQHSPTRRSKSKGSAVREHVDTPLTNLEQGMEDVQLAVGRLSDNFEELVQENAETTSVAKEMIENMGRTFQKELKELASMVTTLKAFVEGELHDLHTKS

Query:  ISLEMRFDALCVECRSKHATSYAPSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGVHNDEARINHAPTFLRDAAQLWWR------
        ISLEMR DALCVECRSKHATSYAPSTSTHPTTSGTSNIKVPKPDVYNGVRNAT+VDNFLFGLERYFVALGV +DEARINHA TFLRDAAQLWWR      
Subjt:  ISLEMRFDALCVECRSKHATSYAPSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGVHNDEARINHAPTFLRDAAQLWWR------

Query:  --------------------------QSRGKLRRLRHTGSILDYVKEFTTLMLEIGDLPKKE--------------IELDRRNVQTLDDGIAVAETLVDY
                                  +SRGKLRRLRHTGSILDYVKEFTTLML+IGDLP+KE              IELDRRNVQTLDD IAVAETLVDY
Subjt:  --------------------------QSRGKLRRLRHTGSILDYVKEFTTLMLEIGDLPKKE--------------IELDRRNVQTLDDGIAVAETLVDY

Query:  SAQSMGKKPGPEKHGGKPDKTKSFGRKDGGKVKTFQWRNGKNDGAHRGESSNPSKPCFICKGPHWTRDCPNRKALNALVAKFQEVKQVEDASEPQIGSMQ
        SAQS GKKPGPEKHGGKPDKTKSFG KDGGKVKTFQWRNGKNDGAHRGESSNPSKPCFICKGPHWTRDCPNRKALNALVAKFQEVKQVEDA  PQIGSMQ
Subjt:  SAQSMGKKPGPEKHGGKPDKTKSFGRKDGGKVKTFQWRNGKNDGAHRGESSNPSKPCFICKGPHWTRDCPNRKALNALVAKFQEVKQVEDASEPQIGSMQ

Query:  QIGVMKETTAEHKGLLYGSVRIEGKEATAMFDTGASHNFMDVQEAKRLGLKFKEETGTVKVVNAKEQVIHGIAKGVLVKIGDWQKRLDFSILPMDDFHIV
        QIGVMKETT EHK LLYGSVRIEGKEATAMFDTGASHNFMDVQEAKRLGLKFKEETGTVKVVNAKEQVIHGIAKGVLVKIGDWQKRLDFSILPMDDFHIV
Subjt:  QIGVMKETTAEHKGLLYGSVRIEGKEATAMFDTGASHNFMDVQEAKRLGLKFKEETGTVKVVNAKEQVIHGIAKGVLVKIGDWQKRLDFSILPMDDFHIV

Query:  LGLGFFDKVVTLLDSNRGTLSIIDGLMTTIPVRRGKPVKMLSALQFKRGVSKNECYVATMKAVETEEAKSDEPPVPDNIQNVLDEYKDIMPAELPKKLPP
        LGLGFFDKVVTLLDSNRGTLSIIDGLMTTI VRRGKPVKMLSALQFKRGVSKNECYVATMKAVETEEAKSDEPPVPDNIQNVLDEYKDIM AELPKKLPP
Subjt:  LGLGFFDKVVTLLDSNRGTLSIIDGLMTTIPVRRGKPVKMLSALQFKRGVSKNECYVATMKAVETEEAKSDEPPVPDNIQNVLDEYKDIMPAELPKKLPP

Query:  RRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPYDYRALNKITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQ
         R VDHKIELEPGAKPP MAPYRMAP E KELRRQLKELLDAGYIQP        ALNKITIKNRY IPLIA+ FDQLGNARWFSKIDLHSGYYQVRIKQ
Subjt:  RRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPYDYRALNKITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQ

Query:  GDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYSQTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHW
        GDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYSQTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHW
Subjt:  GDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYSQTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHW

Query:  IKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKKNQTWDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDA
        IKEGKLMMDNA                               KGYSGI APLTNLLKKNQTWDWTEECQRAFDRLKH + EEPVMVLADHTKPFEVHTDA
Subjt:  IKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKKNQTWDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDA

Query:  SDFAIGGVLMQDDHPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGREN
        SDFAIGGVLMQDDHPIAFESRKLNDTER YTVQEKEMTA+VHCLRTWRHYLLGSKFTV+TDNVATSYFQTQKKLTPKQA+WQDFLAEFDFKLEYKPGR N
Subjt:  SDFAIGGVLMQDDHPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGREN

Query:  IVADALSLKAELNTITTSMPTSDFLERIKEGMQHDELAKNLLKLAKEGKTRRFWENNGTLLTIGNRLFVPKWGSLRKDI
        IVADALSLKAEL+TITTSMPTSDFLERIKEGMQHDELAKNLLKLAKEGKTRRF ENNGTLLTI +  +   W  ++ DI
Subjt:  IVADALSLKAELNTITTSMPTSDFLERIKEGMQHDELAKNLLKLAKEGKTRRFWENNGTLLTIGNRLFVPKWGSLRKDI

KAA0065760.1 polyprotein [Cucumis melo var. makuwa]0.0e+0085.3Show/hide
Query:  MSAEEG-----------------KEQHSPTRRSKSKGSAVREHVDTPLTNLEQGMEDVQLAVGRLSDNFEELVQENAETTSVAKEMIENMGRTFQKELKE
        MSAEEG                 KEQHSPTRRSKSKG AVREHVDT LTNLEQGMEDVQLAVGRLS+NFEELVQENAE TSVAKEMIE+MGRTFQ+ELKE
Subjt:  MSAEEG-----------------KEQHSPTRRSKSKGSAVREHVDTPLTNLEQGMEDVQLAVGRLSDNFEELVQENAETTSVAKEMIENMGRTFQKELKE

Query:  LASMVTTLKAFVEGELHDLHTKSISLEMRFDALCVECRSKHATSYAPSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGVHNDEAR
        L S VTTLKAFVEGELH+LHTKSIS E R DALCVECRSKH  S APSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGV +DEAR
Subjt:  LASMVTTLKAFVEGELHDLHTKSISLEMRFDALCVECRSKHATSYAPSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGVHNDEAR

Query:  INHAPTFLRDAAQLWWR--------------------------------QSRGKLRRLRHTGSILDYVKEFTTLMLEIGDLPKKE--------------I
        INHAPTFLRDAAQLWWR                                +SRGKLRRLRHTGSIL+YVKEFTTLMLEIGDLP+KE              I
Subjt:  INHAPTFLRDAAQLWWR--------------------------------QSRGKLRRLRHTGSILDYVKEFTTLMLEIGDLPKKE--------------I

Query:  ELDRRNVQTLDDGIAVAETLVDYSAQSMGKKPGPEKHGGKPDKTKSFGRKDGGKVKTFQWRNGKNDGAHRGESSNPSKPCFICKGPHWTRDCPNRKALNA
        ELDRRNVQTLDD IA AETLVDYSAQS GKKPGPEK+GGK DKTK+FGR+DGGKVKTFQW+NGKNDGAHRGESSNP KPCFICKGPHWTRDCPN+KALNA
Subjt:  ELDRRNVQTLDDGIAVAETLVDYSAQSMGKKPGPEKHGGKPDKTKSFGRKDGGKVKTFQWRNGKNDGAHRGESSNPSKPCFICKGPHWTRDCPNRKALNA

Query:  LVAKFQEVKQVEDASEPQIGSMQQIGVM-KETTAEHKGLLYGSVRIEGKEATAMFDTGASHNFMDVQEAKRLGLKFKEETGTVKVVNAKEQVIHGIAKGV
        LVAKFQE+KQVEDA  PQIGSMQQIGVM KETT EHKGLLYGS+RIEGKEATAMFDTGASHNFMDVQEAKRLGLK+KEETGTVKVVNAKEQ IHG+AKGV
Subjt:  LVAKFQEVKQVEDASEPQIGSMQQIGVM-KETTAEHKGLLYGSVRIEGKEATAMFDTGASHNFMDVQEAKRLGLKFKEETGTVKVVNAKEQVIHGIAKGV

Query:  LVKIGDWQKRLDFSILPMDDFHIVLGLGFFDKVVTLLDSNRGTLSIIDGLMTTIPVRRGKPVKMLSALQFKRGVSKNECYVATMKAVETEEAKSDEPPVP
        LVKIGDWQKRLDFS+LPMDDF+IVLGLGFFDKVVTLLDSNRGTLSIIDGLMTTIP+RRGKP+KMLSALQFKRGV+KN+CYVATMK +E EEAK+DEPPVP
Subjt:  LVKIGDWQKRLDFSILPMDDFHIVLGLGFFDKVVTLLDSNRGTLSIIDGLMTTIPVRRGKPVKMLSALQFKRGVSKNECYVATMKAVETEEAKSDEPPVP

Query:  DNIQNVLDEYKDIMPAELPKKLPPRRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPY------------------DYRALN
        DNIQ VLDEYKDIMP+ELPKKLPPRR VDH+IELEPGAKPP MAPYRMAP EL+ELRRQLKELLDAGYIQPSKAPY                  DYRALN
Subjt:  DNIQNVLDEYKDIMPAELPKKLPPRRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPY------------------DYRALN

Query:  KITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYS
        KITIKNRY IPLIAD FDQLG ARWFSKIDL SGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYS
Subjt:  KITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYS

Query:  QTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKK
        QTLEEHVQHL+QVFQV RDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNA+VRAILEWKAPTKVPEL+SFLGFVNYYRRFIKGYS IAAPLTNLLKK
Subjt:  QTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKK

Query:  NQTWDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSKFTV
        NQTW WTEECQ+AFDRLKH VSEE VMVLADHTKPFEVHTDASDFAIGGVL+QD HPIAFESRKLNDTER YTVQEKEMTA+VHCLRTWRHYLLGSKFTV
Subjt:  NQTWDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSKFTV

Query:  MTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALSLKAELNTITTSMPTSDFLERIKEGMQHDELAKNLLKLAKEGKTRRFWENNG
        MTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGR N+VADALS KAELNTIT SMPTS+FLERIKEGMQHDELAKNLLKLAKEGKTRRFWEN+G
Subjt:  MTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALSLKAELNTITTSMPTSDFLERIKEGMQHDELAKNLLKLAKEGKTRRFWENNG

Query:  TLLTIGNRLFVPKWGSLRKDILKECHDSL
        TLLT GNRLFV +WG+LRKD+L+ECHDSL
Subjt:  TLLTIGNRLFVPKWGSLRKDILKECHDSL

XP_008446938.1 PREDICTED: uncharacterized protein LOC103489499 [Cucumis melo]0.0e+0085.03Show/hide
Query:  MSAEEG-----------------KEQHSPTRRSKSKGSAVREHVDTPLTNLEQGMEDVQLAVGRLSDNFEELVQENAETTSVAKEMIENMGRTFQKELKE
        MSAEEG                 KEQHSPTRRSKSKG AVREHVDT LTNLEQGMEDVQLAVGRLS+NFEELV ENAE TSVAKEMIE+MGRTFQKELKE
Subjt:  MSAEEG-----------------KEQHSPTRRSKSKGSAVREHVDTPLTNLEQGMEDVQLAVGRLSDNFEELVQENAETTSVAKEMIENMGRTFQKELKE

Query:  LASMVTTLKAFVEGELHDLHTKSISLEMRFDALCVECRSKHATSYAPSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGVHNDEAR
        LAS VTTLKAFVEGELH+LHTKSIS E R DALCVECRSKH  S APS STHPTTSGTSNIKVPKPDVYNGVRN+TVVDNFLF LERYFVALGV +DEAR
Subjt:  LASMVTTLKAFVEGELHDLHTKSISLEMRFDALCVECRSKHATSYAPSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGVHNDEAR

Query:  INHAPTFLRDAAQLWWR--------------------------------QSRGKLRRLRHTGSILDYVKEFTTLMLEIGDLPKKE--------------I
        INH PTFLRDAAQLWWR                                +SRGKLRRLRHT SILDYVKEFTTLMLEIGDLP+KE              I
Subjt:  INHAPTFLRDAAQLWWR--------------------------------QSRGKLRRLRHTGSILDYVKEFTTLMLEIGDLPKKE--------------I

Query:  ELDRRNVQTLDDGIAVAETLVDYSAQSMGKKPGPEKHGGKPDKTKSFGRKDGGKVKTFQWRNGKNDGAHRGESSNPSKPCFICKGPHWTRDCPNRKALNA
        ELDRRNVQTLDD IA AETLVDYS Q  GKKPGPEKHGGKPDKTK+FGRKDGGKVKTFQW+NGKNDGAHRGESSNP KP FICKGPHWTRDCPNRKALNA
Subjt:  ELDRRNVQTLDDGIAVAETLVDYSAQSMGKKPGPEKHGGKPDKTKSFGRKDGGKVKTFQWRNGKNDGAHRGESSNPSKPCFICKGPHWTRDCPNRKALNA

Query:  LVAKFQEVKQVEDASEPQIGSMQQIGVM-KETTAEHKGLLYGSVRIEGKEATAMFDTGASHNFMDVQEAKRLGLKFKEETGTVKVVNAKEQVIHGIAKGV
        LV KFQE+KQVEDA  PQIGSMQQI +M KETT EHKGLLYGS+RIEGKEATAMFDTGAS+NFMDVQEAKRLGLKFKEETG VKVV AKEQ IHG+AKGV
Subjt:  LVAKFQEVKQVEDASEPQIGSMQQIGVM-KETTAEHKGLLYGSVRIEGKEATAMFDTGASHNFMDVQEAKRLGLKFKEETGTVKVVNAKEQVIHGIAKGV

Query:  LVKIGDWQKRLDFSILPMDDFHIVLGLGFFDKVVTLLDSNRGTLSIIDGLMTTIPVRRGKPVKMLSALQFKRGVSKNECYVATMKAVETEEAKSDEPPVP
        LVKIGDWQKRLDF +LPMDDFHIVLGLGFFDKVVTLLDSNRGTLSIIDGLMTTIP+RRGKPVKMLSALQFKRGV+KN+CYVATMK +E E+ K DEPPVP
Subjt:  LVKIGDWQKRLDFSILPMDDFHIVLGLGFFDKVVTLLDSNRGTLSIIDGLMTTIPVRRGKPVKMLSALQFKRGVSKNECYVATMKAVETEEAKSDEPPVP

Query:  DNIQNVLDEYKDIMPAELPKKLPPRRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPY------------------DYRALN
        DNIQ VLDEYKDIMP+ELPKKLPPRR VDH+IELEPGAKPP MAPYRMAPPEL+ELRRQLKELLDAGYIQPSKAPY                  DYRALN
Subjt:  DNIQNVLDEYKDIMPAELPKKLPPRRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPY------------------DYRALN

Query:  KITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYS
        KITIKNRY IPLIAD FDQLG ARWFSKIDL SGYYQVRIKQGD+AKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFV+VYLDDIVVYS
Subjt:  KITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYS

Query:  QTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKK
        QTLEEHV HLRQVFQV RDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPEL+SFLGFVNYYRRFIKGYSGIAAPLTNLLKK
Subjt:  QTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKK

Query:  NQTWDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSKFTV
        NQTWDWTEECQRAFD+LKH VSEEPVMVL DHTKPF+VHTDASDFAIGGVLMQDDHPIAFESRKLNDTER YTVQEKEMTA+VHCLRTWRHYLLGSKFTV
Subjt:  NQTWDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSKFTV

Query:  MTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALSLKAELNTITTSMPTSDFLERIKEGMQHDELAKNLLKLAKEGKTRRFWENNG
        MTDNVATSYFQTQKKLTPKQARWQDFLAEF+FKLEYKPGR N+VADALS KAELN ITTSMPTS+FLERIKEGMQHDELAKNLLKLAKEGKTRRFWENNG
Subjt:  MTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALSLKAELNTITTSMPTSDFLERIKEGMQHDELAKNLLKLAKEGKTRRFWENNG

Query:  TLLTIGNRLFVPKWGSLRKDILKECHDSL
        TLLTIGNRLFVP+WG+LRKD+L+ECHDSL
Subjt:  TLLTIGNRLFVPKWGSLRKDILKECHDSL

TrEMBL top hitse value%identityAlignment
A0A1S3BG92 uncharacterized protein LOC1034894990.0e+0085.03Show/hide
Query:  MSAEEG-----------------KEQHSPTRRSKSKGSAVREHVDTPLTNLEQGMEDVQLAVGRLSDNFEELVQENAETTSVAKEMIENMGRTFQKELKE
        MSAEEG                 KEQHSPTRRSKSKG AVREHVDT LTNLEQGMEDVQLAVGRLS+NFEELV ENAE TSVAKEMIE+MGRTFQKELKE
Subjt:  MSAEEG-----------------KEQHSPTRRSKSKGSAVREHVDTPLTNLEQGMEDVQLAVGRLSDNFEELVQENAETTSVAKEMIENMGRTFQKELKE

Query:  LASMVTTLKAFVEGELHDLHTKSISLEMRFDALCVECRSKHATSYAPSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGVHNDEAR
        LAS VTTLKAFVEGELH+LHTKSIS E R DALCVECRSKH  S APS STHPTTSGTSNIKVPKPDVYNGVRN+TVVDNFLF LERYFVALGV +DEAR
Subjt:  LASMVTTLKAFVEGELHDLHTKSISLEMRFDALCVECRSKHATSYAPSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGVHNDEAR

Query:  INHAPTFLRDAAQLWWR--------------------------------QSRGKLRRLRHTGSILDYVKEFTTLMLEIGDLPKKE--------------I
        INH PTFLRDAAQLWWR                                +SRGKLRRLRHT SILDYVKEFTTLMLEIGDLP+KE              I
Subjt:  INHAPTFLRDAAQLWWR--------------------------------QSRGKLRRLRHTGSILDYVKEFTTLMLEIGDLPKKE--------------I

Query:  ELDRRNVQTLDDGIAVAETLVDYSAQSMGKKPGPEKHGGKPDKTKSFGRKDGGKVKTFQWRNGKNDGAHRGESSNPSKPCFICKGPHWTRDCPNRKALNA
        ELDRRNVQTLDD IA AETLVDYS Q  GKKPGPEKHGGKPDKTK+FGRKDGGKVKTFQW+NGKNDGAHRGESSNP KP FICKGPHWTRDCPNRKALNA
Subjt:  ELDRRNVQTLDDGIAVAETLVDYSAQSMGKKPGPEKHGGKPDKTKSFGRKDGGKVKTFQWRNGKNDGAHRGESSNPSKPCFICKGPHWTRDCPNRKALNA

Query:  LVAKFQEVKQVEDASEPQIGSMQQIGVM-KETTAEHKGLLYGSVRIEGKEATAMFDTGASHNFMDVQEAKRLGLKFKEETGTVKVVNAKEQVIHGIAKGV
        LV KFQE+KQVEDA  PQIGSMQQI +M KETT EHKGLLYGS+RIEGKEATAMFDTGAS+NFMDVQEAKRLGLKFKEETG VKVV AKEQ IHG+AKGV
Subjt:  LVAKFQEVKQVEDASEPQIGSMQQIGVM-KETTAEHKGLLYGSVRIEGKEATAMFDTGASHNFMDVQEAKRLGLKFKEETGTVKVVNAKEQVIHGIAKGV

Query:  LVKIGDWQKRLDFSILPMDDFHIVLGLGFFDKVVTLLDSNRGTLSIIDGLMTTIPVRRGKPVKMLSALQFKRGVSKNECYVATMKAVETEEAKSDEPPVP
        LVKIGDWQKRLDF +LPMDDFHIVLGLGFFDKVVTLLDSNRGTLSIIDGLMTTIP+RRGKPVKMLSALQFKRGV+KN+CYVATMK +E E+ K DEPPVP
Subjt:  LVKIGDWQKRLDFSILPMDDFHIVLGLGFFDKVVTLLDSNRGTLSIIDGLMTTIPVRRGKPVKMLSALQFKRGVSKNECYVATMKAVETEEAKSDEPPVP

Query:  DNIQNVLDEYKDIMPAELPKKLPPRRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPY------------------DYRALN
        DNIQ VLDEYKDIMP+ELPKKLPPRR VDH+IELEPGAKPP MAPYRMAPPEL+ELRRQLKELLDAGYIQPSKAPY                  DYRALN
Subjt:  DNIQNVLDEYKDIMPAELPKKLPPRRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPY------------------DYRALN

Query:  KITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYS
        KITIKNRY IPLIAD FDQLG ARWFSKIDL SGYYQVRIKQGD+AKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFV+VYLDDIVVYS
Subjt:  KITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYS

Query:  QTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKK
        QTLEEHV HLRQVFQV RDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPEL+SFLGFVNYYRRFIKGYSGIAAPLTNLLKK
Subjt:  QTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKK

Query:  NQTWDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSKFTV
        NQTWDWTEECQRAFD+LKH VSEEPVMVL DHTKPF+VHTDASDFAIGGVLMQDDHPIAFESRKLNDTER YTVQEKEMTA+VHCLRTWRHYLLGSKFTV
Subjt:  NQTWDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSKFTV

Query:  MTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALSLKAELNTITTSMPTSDFLERIKEGMQHDELAKNLLKLAKEGKTRRFWENNG
        MTDNVATSYFQTQKKLTPKQARWQDFLAEF+FKLEYKPGR N+VADALS KAELN ITTSMPTS+FLERIKEGMQHDELAKNLLKLAKEGKTRRFWENNG
Subjt:  MTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALSLKAELNTITTSMPTSDFLERIKEGMQHDELAKNLLKLAKEGKTRRFWENNG

Query:  TLLTIGNRLFVPKWGSLRKDILKECHDSL
        TLLTIGNRLFVP+WG+LRKD+L+ECHDSL
Subjt:  TLLTIGNRLFVPKWGSLRKDILKECHDSL

A0A5A7SJU9 Uncharacterized protein0.0e+0081.05Show/hide
Query:  MSAEEG-----------------KEQHSPTRRSKSKGSAVREHVDTPLTNLEQGMEDVQLAVGRLSDNFEELVQENAETTSVAKEMIENMGRTFQKELKE
        MSAEEG                 KEQHSPTRRSKSKG AVREHVDT LTNLEQGMEDVQLA                                  +ELKE
Subjt:  MSAEEG-----------------KEQHSPTRRSKSKGSAVREHVDTPLTNLEQGMEDVQLAVGRLSDNFEELVQENAETTSVAKEMIENMGRTFQKELKE

Query:  LASMVTTLKAFVEGELHDLHTKSISLEMRFDALCVECRSKHATSYAPSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGVHNDEAR
        LAS VTTLKAFVEGELH+LHTKSIS E R DALCVECRSKH  S APSTSTHPTTSGTSNIKVPKPDVYNGVRNAT+VDNFLFGLERYFVALGV +DEAR
Subjt:  LASMVTTLKAFVEGELHDLHTKSISLEMRFDALCVECRSKHATSYAPSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGVHNDEAR

Query:  INHAPTFLRDAAQLWWR--------------------------------QSRGKLRRLRHTGSILDYVKEFTTLMLEIGDLPKKE--------------I
        INHAPTFLRDAAQLWWR                                +SRGKLRRLRHTGSILDYVKEFTTLMLEIGDLP+KE              I
Subjt:  INHAPTFLRDAAQLWWR--------------------------------QSRGKLRRLRHTGSILDYVKEFTTLMLEIGDLPKKE--------------I

Query:  ELDRRNVQTLDDGIAVAETLVDYSAQSMGKKPGPEKHGGKPDKTKSFGRKDGGKVKTFQWRNGKNDGAHRGESSNPSKPCFICKGPHWTRDCPNRKALNA
        ELDRRNVQTLDD IA  ETLVDYSAQS GKKPGPEKHGGKPDKTK+FGRKDGGKVKTFQW+NGKNDGAHRGE SNP KPCFICKGPHWTRDCPNRKALNA
Subjt:  ELDRRNVQTLDDGIAVAETLVDYSAQSMGKKPGPEKHGGKPDKTKSFGRKDGGKVKTFQWRNGKNDGAHRGESSNPSKPCFICKGPHWTRDCPNRKALNA

Query:  LVAKFQEVKQVEDASEPQIGSMQQIGVM-KETTAEHKGLLYGSVRIEGKEATAMFDTGASHNFMDVQEAKRLGLKFKEETGTVKVVNAKEQVIHGIAKGV
        LVAKFQEVKQVEDA  PQIGSMQQIGVM KETT EHKGLLYGS+RIEGKEA AMFDTG SHNFMDVQEAKRLGLKFKEETG VKVVNAKEQ IHG+AKGV
Subjt:  LVAKFQEVKQVEDASEPQIGSMQQIGVM-KETTAEHKGLLYGSVRIEGKEATAMFDTGASHNFMDVQEAKRLGLKFKEETGTVKVVNAKEQVIHGIAKGV

Query:  LVKIGDWQKRLDFSILPMDDFHIVLGLGFFDKVVTLLDSNRGTLSIIDGLMTTIPVRRGKPVKMLSALQFKRGVSKNECYVATMKAVETEEAKSDEPPVP
        LVKIGDWQKRLDFS+LPMDDF+IVLGLGFFDKVVTLLDSN+GTLSIIDGLMTTIP+RRGKPVKMLSALQFKRGV+KN+CYVATMK +E EEAK DEPPVP
Subjt:  LVKIGDWQKRLDFSILPMDDFHIVLGLGFFDKVVTLLDSNRGTLSIIDGLMTTIPVRRGKPVKMLSALQFKRGVSKNECYVATMKAVETEEAKSDEPPVP

Query:  DNIQNVLDEYKDIMPAELPKKLPPRRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPY------------------DYRALN
        DNIQ VLDEYKDIMP+ELPKKLPPRR VDH+IELEPGAKPP MAPYRMAP EL+ELRRQLKELLD GYIQPSKAPY                  DYRALN
Subjt:  DNIQNVLDEYKDIMPAELPKKLPPRRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPY------------------DYRALN

Query:  KITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYS
        KITIKN                    SKIDL SGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYS
Subjt:  KITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYS

Query:  QTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKK
        QTLEEHVQHLRQVFQV R+NELYIKLEKCSFAKQEV+FLGHWIKEGKLMMDNAKVRAILEWKAPTK+PEL+SFLGFVNYY RFIKGYSGIAAPLTNLLKK
Subjt:  QTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKK

Query:  NQTWDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSKFTV
        NQTWDWTEECQ+AFDRLKH VSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQD HPIAFESRKLND ER YTVQEKEMTA+VH LRTWRHYLLGSKFTV
Subjt:  NQTWDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSKFTV

Query:  MTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALSLKAELNTITTSMPTSDFLERIKEGMQHDELAKNLLKLAKEGKTRRFWENNG
        MTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGR N+VADALS KA+LN I TSMPTS+FLERIKEGMQHDELAKNLLKLAK+GKTRRFWEN+G
Subjt:  MTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALSLKAELNTITTSMPTSDFLERIKEGMQHDELAKNLLKLAKEGKTRRFWENNG

Query:  TLLTIGNRLFVPKWGSLRKDILKECHDSL
        TLLTIGNRLFVP+WG+LRKD+L+ECHDSL
Subjt:  TLLTIGNRLFVPKWGSLRKDILKECHDSL

A0A5A7THC0 Reverse transcriptase domain-containing protein0.0e+0085.75Show/hide
Query:  MSAEEG-----------------KEQHSPTRRSKSKGSAVREHVDTPLTNLEQGMEDVQLAVGRLSDNFEELVQENAETTSVAKEMIENMGRTFQKELKE
        MSAEEG                 KEQHSPTRRSKSKG AVREHVDT LTNLEQGMEDVQLAVGRLS+ FEELVQENAE TSVAKEMIE+MGRTFQKEL+E
Subjt:  MSAEEG-----------------KEQHSPTRRSKSKGSAVREHVDTPLTNLEQGMEDVQLAVGRLSDNFEELVQENAETTSVAKEMIENMGRTFQKELKE

Query:  LASMVTTLKAFVEGELHDLHTKSISLEMRFDALCVECRSKHATSYAPSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGVHNDEAR
        LAS VTTLKAFVEGELHDLHTKSIS E R DALCVECRSKH  S APSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGV +DEAR
Subjt:  LASMVTTLKAFVEGELHDLHTKSISLEMRFDALCVECRSKHATSYAPSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGVHNDEAR

Query:  INHAPTFLRDAAQLWWR--------------------------------QSRGKLRRLRHTGSILDYVKEFTTLMLEIGDLPKKE--------------I
        INHAPTFLRDAAQLWWR                                +SRGKLRRLRHTGSILDYVKEFTTLMLEIGDLP+KE              I
Subjt:  INHAPTFLRDAAQLWWR--------------------------------QSRGKLRRLRHTGSILDYVKEFTTLMLEIGDLPKKE--------------I

Query:  ELDRRNVQTLDDGIAVAETLVDYSAQSMGKKPGPEKHGGKPDKTKSFGRKDGGKVKTFQWRNGKNDGAHRGESSNPSKPCFICKGPHWTRDCPNRKALNA
        ELDRRNVQTLDD IAVAETLVDYSAQS GKKPGPEKHGGK DKTK+FGRKDGGKVKTFQW+NGKNDGAHRGESSNP KPCFICKGPHWTRDCPNRKALNA
Subjt:  ELDRRNVQTLDDGIAVAETLVDYSAQSMGKKPGPEKHGGKPDKTKSFGRKDGGKVKTFQWRNGKNDGAHRGESSNPSKPCFICKGPHWTRDCPNRKALNA

Query:  LVAKFQEVKQVEDASEPQIGSMQQIGVM-KETTAEHKGLLYGSVRIEGKEATAMFDTGASHNFMDVQEAKRLGLKFKEETGTVKVVNAKEQVIHGIAKGV
        LVAKFQE+KQVEDA  PQIGSMQQIGVM KETT EHKGLLYGS+RIEGKEAT MFDTGASHNF+DVQEAKRLGLKFKEETG VKVVNAKEQ IHG+AKGV
Subjt:  LVAKFQEVKQVEDASEPQIGSMQQIGVM-KETTAEHKGLLYGSVRIEGKEATAMFDTGASHNFMDVQEAKRLGLKFKEETGTVKVVNAKEQVIHGIAKGV

Query:  LVKIGDWQKRLDFSILPMDDFHIVLGLGFFDKVVTLLDSNRGTLSIIDGLMTTIPVRRGKPVKMLSALQFKRGVSKNECYVATMKAVETEEAKSDEPPVP
        LVKIGDWQKRLDFS+LPMDDF+IVLGLGFFDKVVTLLDSNRGTLSIIDGLMTTIP+RRGKPV+MLSALQFKRGV+KN+CYV TMK +E EEAK+DEPPVP
Subjt:  LVKIGDWQKRLDFSILPMDDFHIVLGLGFFDKVVTLLDSNRGTLSIIDGLMTTIPVRRGKPVKMLSALQFKRGVSKNECYVATMKAVETEEAKSDEPPVP

Query:  DNIQNVLDEYKDIMPAELPKKLPPRRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPY------------------DYRALN
        DNIQ VLDEYKDIMP+ELPKKLPPRR VDH+IELEPGAKPP MAPYRMAPPEL+ELRRQLKELLDAGYIQPSKAPY                  DYRALN
Subjt:  DNIQNVLDEYKDIMPAELPKKLPPRRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPY------------------DYRALN

Query:  KITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYS
        KITIKNRY IPLIAD FDQLG ARWFSKIDL SGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQ FLDRFVVVYLDDIVVYS
Subjt:  KITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYS

Query:  QTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKK
        QTLEEHVQHLRQVFQV RDNELYIKLEKCSFAKQEVEFL HWIKEGKLMMDNAKVRAILEWKAPTK+PEL+SFLGFVNYYRRFIKGYSGI APLTNLLKK
Subjt:  QTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKK

Query:  NQTWDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSKFTV
        NQTWDWTEECQ+AFDRLKH VSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQD HPIAFES KLNDTER YTVQEKEMTA+VHCLRTWRHYLLGSKFT+
Subjt:  NQTWDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSKFTV

Query:  MTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALSLKAELNTITTSMPTSDFLERIKEGMQHDELAKNLLKLAKEGKTRRF
        MTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGR N+VADALS KAELNTIT SMPTS+FLERIKEGMQHDELA+NLLKLAKEGKTRRF
Subjt:  MTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALSLKAELNTITTSMPTSDFLERIKEGMQHDELAKNLLKLAKEGKTRRF

A0A5A7VEX8 Polyprotein0.0e+0085.3Show/hide
Query:  MSAEEG-----------------KEQHSPTRRSKSKGSAVREHVDTPLTNLEQGMEDVQLAVGRLSDNFEELVQENAETTSVAKEMIENMGRTFQKELKE
        MSAEEG                 KEQHSPTRRSKSKG AVREHVDT LTNLEQGMEDVQLAVGRLS+NFEELVQENAE TSVAKEMIE+MGRTFQ+ELKE
Subjt:  MSAEEG-----------------KEQHSPTRRSKSKGSAVREHVDTPLTNLEQGMEDVQLAVGRLSDNFEELVQENAETTSVAKEMIENMGRTFQKELKE

Query:  LASMVTTLKAFVEGELHDLHTKSISLEMRFDALCVECRSKHATSYAPSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGVHNDEAR
        L S VTTLKAFVEGELH+LHTKSIS E R DALCVECRSKH  S APSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGV +DEAR
Subjt:  LASMVTTLKAFVEGELHDLHTKSISLEMRFDALCVECRSKHATSYAPSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGVHNDEAR

Query:  INHAPTFLRDAAQLWWR--------------------------------QSRGKLRRLRHTGSILDYVKEFTTLMLEIGDLPKKE--------------I
        INHAPTFLRDAAQLWWR                                +SRGKLRRLRHTGSIL+YVKEFTTLMLEIGDLP+KE              I
Subjt:  INHAPTFLRDAAQLWWR--------------------------------QSRGKLRRLRHTGSILDYVKEFTTLMLEIGDLPKKE--------------I

Query:  ELDRRNVQTLDDGIAVAETLVDYSAQSMGKKPGPEKHGGKPDKTKSFGRKDGGKVKTFQWRNGKNDGAHRGESSNPSKPCFICKGPHWTRDCPNRKALNA
        ELDRRNVQTLDD IA AETLVDYSAQS GKKPGPEK+GGK DKTK+FGR+DGGKVKTFQW+NGKNDGAHRGESSNP KPCFICKGPHWTRDCPN+KALNA
Subjt:  ELDRRNVQTLDDGIAVAETLVDYSAQSMGKKPGPEKHGGKPDKTKSFGRKDGGKVKTFQWRNGKNDGAHRGESSNPSKPCFICKGPHWTRDCPNRKALNA

Query:  LVAKFQEVKQVEDASEPQIGSMQQIGVM-KETTAEHKGLLYGSVRIEGKEATAMFDTGASHNFMDVQEAKRLGLKFKEETGTVKVVNAKEQVIHGIAKGV
        LVAKFQE+KQVEDA  PQIGSMQQIGVM KETT EHKGLLYGS+RIEGKEATAMFDTGASHNFMDVQEAKRLGLK+KEETGTVKVVNAKEQ IHG+AKGV
Subjt:  LVAKFQEVKQVEDASEPQIGSMQQIGVM-KETTAEHKGLLYGSVRIEGKEATAMFDTGASHNFMDVQEAKRLGLKFKEETGTVKVVNAKEQVIHGIAKGV

Query:  LVKIGDWQKRLDFSILPMDDFHIVLGLGFFDKVVTLLDSNRGTLSIIDGLMTTIPVRRGKPVKMLSALQFKRGVSKNECYVATMKAVETEEAKSDEPPVP
        LVKIGDWQKRLDFS+LPMDDF+IVLGLGFFDKVVTLLDSNRGTLSIIDGLMTTIP+RRGKP+KMLSALQFKRGV+KN+CYVATMK +E EEAK+DEPPVP
Subjt:  LVKIGDWQKRLDFSILPMDDFHIVLGLGFFDKVVTLLDSNRGTLSIIDGLMTTIPVRRGKPVKMLSALQFKRGVSKNECYVATMKAVETEEAKSDEPPVP

Query:  DNIQNVLDEYKDIMPAELPKKLPPRRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPY------------------DYRALN
        DNIQ VLDEYKDIMP+ELPKKLPPRR VDH+IELEPGAKPP MAPYRMAP EL+ELRRQLKELLDAGYIQPSKAPY                  DYRALN
Subjt:  DNIQNVLDEYKDIMPAELPKKLPPRRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPY------------------DYRALN

Query:  KITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYS
        KITIKNRY IPLIAD FDQLG ARWFSKIDL SGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYS
Subjt:  KITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYS

Query:  QTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKK
        QTLEEHVQHL+QVFQV RDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNA+VRAILEWKAPTKVPEL+SFLGFVNYYRRFIKGYS IAAPLTNLLKK
Subjt:  QTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKK

Query:  NQTWDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSKFTV
        NQTW WTEECQ+AFDRLKH VSEE VMVLADHTKPFEVHTDASDFAIGGVL+QD HPIAFESRKLNDTER YTVQEKEMTA+VHCLRTWRHYLLGSKFTV
Subjt:  NQTWDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSKFTV

Query:  MTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALSLKAELNTITTSMPTSDFLERIKEGMQHDELAKNLLKLAKEGKTRRFWENNG
        MTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGR N+VADALS KAELNTIT SMPTS+FLERIKEGMQHDELAKNLLKLAKEGKTRRFWEN+G
Subjt:  MTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALSLKAELNTITTSMPTSDFLERIKEGMQHDELAKNLLKLAKEGKTRRFWENNG

Query:  TLLTIGNRLFVPKWGSLRKDILKECHDSL
        TLLT GNRLFV +WG+LRKD+L+ECHDSL
Subjt:  TLLTIGNRLFVPKWGSLRKDILKECHDSL

A0A5D3BWR0 Reverse transcriptase0.0e+0087.21Show/hide
Query:  KEQHSPTRRSKSKGSAVREHVDTPLTNLEQGMEDVQLAVGRLSDNFEELVQENAETTSVAKEMIENMGRTFQKELKELASMVTTLKAFVEGELHDLHTKS
        KEQHSPTRRSKSKGSAVREHVDT LTNLEQGMEDVQLAVGRLSDNFEELVQENAE TSVAKEMIENMGRTFQKELKEL+S VTTLKAFVEGELHDLHTKS
Subjt:  KEQHSPTRRSKSKGSAVREHVDTPLTNLEQGMEDVQLAVGRLSDNFEELVQENAETTSVAKEMIENMGRTFQKELKELASMVTTLKAFVEGELHDLHTKS

Query:  ISLEMRFDALCVECRSKHATSYAPSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGVHNDEARINHAPTFLRDAAQLWWR------
        ISLEMR DALCVECRSKHATSYAPSTSTHPTTSGTSNIKVPKPDVYNGVRNAT+VDNFLFGLERYFVALGV +DEARINHA TFLRDAAQLWWR      
Subjt:  ISLEMRFDALCVECRSKHATSYAPSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGVHNDEARINHAPTFLRDAAQLWWR------

Query:  --------------------------QSRGKLRRLRHTGSILDYVKEFTTLMLEIGDLPKKE--------------IELDRRNVQTLDDGIAVAETLVDY
                                  +SRGKLRRLRHTGSILDYVKEFTTLML+IGDLP+KE              IELDRRNVQTLDD IAVAETLVDY
Subjt:  --------------------------QSRGKLRRLRHTGSILDYVKEFTTLMLEIGDLPKKE--------------IELDRRNVQTLDDGIAVAETLVDY

Query:  SAQSMGKKPGPEKHGGKPDKTKSFGRKDGGKVKTFQWRNGKNDGAHRGESSNPSKPCFICKGPHWTRDCPNRKALNALVAKFQEVKQVEDASEPQIGSMQ
        SAQS GKKPGPEKHGGKPDKTKSFG KDGGKVKTFQWRNGKNDGAHRGESSNPSKPCFICKGPHWTRDCPNRKALNALVAKFQEVKQVEDA  PQIGSMQ
Subjt:  SAQSMGKKPGPEKHGGKPDKTKSFGRKDGGKVKTFQWRNGKNDGAHRGESSNPSKPCFICKGPHWTRDCPNRKALNALVAKFQEVKQVEDASEPQIGSMQ

Query:  QIGVMKETTAEHKGLLYGSVRIEGKEATAMFDTGASHNFMDVQEAKRLGLKFKEETGTVKVVNAKEQVIHGIAKGVLVKIGDWQKRLDFSILPMDDFHIV
        QIGVMKETT EHK LLYGSVRIEGKEATAMFDTGASHNFMDVQEAKRLGLKFKEETGTVKVVNAKEQVIHGIAKGVLVKIGDWQKRLDFSILPMDDFHIV
Subjt:  QIGVMKETTAEHKGLLYGSVRIEGKEATAMFDTGASHNFMDVQEAKRLGLKFKEETGTVKVVNAKEQVIHGIAKGVLVKIGDWQKRLDFSILPMDDFHIV

Query:  LGLGFFDKVVTLLDSNRGTLSIIDGLMTTIPVRRGKPVKMLSALQFKRGVSKNECYVATMKAVETEEAKSDEPPVPDNIQNVLDEYKDIMPAELPKKLPP
        LGLGFFDKVVTLLDSNRGTLSIIDGLMTTI VRRGKPVKMLSALQFKRGVSKNECYVATMKAVETEEAKSDEPPVPDNIQNVLDEYKDIM AELPKKLPP
Subjt:  LGLGFFDKVVTLLDSNRGTLSIIDGLMTTIPVRRGKPVKMLSALQFKRGVSKNECYVATMKAVETEEAKSDEPPVPDNIQNVLDEYKDIMPAELPKKLPP

Query:  RRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPYDYRALNKITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQ
         R VDHKIELEPGAKPP MAPYRMAP E KELRRQLKELLDAGYIQP        ALNKITIKNRY IPLIA+ FDQLGNARWFSKIDLHSGYYQVRIKQ
Subjt:  RRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPYDYRALNKITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQ

Query:  GDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYSQTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHW
        GDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYSQTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHW
Subjt:  GDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYSQTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHW

Query:  IKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKKNQTWDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDA
        IKEGKLMMDNA                               KGYSGI APLTNLLKKNQTWDWTEECQRAFDRLKH + EEPVMVLADHTKPFEVHTDA
Subjt:  IKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKKNQTWDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDA

Query:  SDFAIGGVLMQDDHPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGREN
        SDFAIGGVLMQDDHPIAFESRKLNDTER YTVQEKEMTA+VHCLRTWRHYLLGSKFTV+TDNVATSYFQTQKKLTPKQA+WQDFLAEFDFKLEYKPGR N
Subjt:  SDFAIGGVLMQDDHPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGREN

Query:  IVADALSLKAELNTITTSMPTSDFLERIKEGMQHDELAKNLLKLAKEGKTRRFWENNGTLLTIGNRLFVPKWGSLRKDI
        IVADALSLKAEL+TITTSMPTSDFLERIKEGMQHDELAKNLLKLAKEGKTRRF ENNGTLLTI +  +   W  ++ DI
Subjt:  IVADALSLKAELNTITTSMPTSDFLERIKEGMQHDELAKNLLKLAKEGKTRRFWENNGTLLTIGNRLFVPKWGSLRKDI

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.68.5e-8739.33Show/hide
Query:  VLDEYKDIMPAELPKKLPPRRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPY-----------------------DYRALN
        +L +Y DI   E   KL       H I  +     P  + Y       +E+  Q++++L+ G I+ S +PY                       DYR LN
Subjt:  VLDEYKDIMPAELPKKLPPRRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPY-----------------------DYRALN

Query:  KITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYS
        +IT+ +R+ IP + +   +LG   +F+ IDL  G++Q+ +     +KTA  T++G YE+L MPFGL NAPATF   MN + +P L++  +VYLDDI+V+S
Subjt:  KITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYS

Query:  QTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKK
         +L+EH+Q L  VF+      L ++L+KC F KQE  FLGH +    +  +  K+ AI ++  PTK  E+++FLG   YYR+FI  ++ IA P+T  LKK
Subjt:  QTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKK

Query:  NQTWDWTE-ECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSKFT
        N   D T  E   AF +LK+++SE+P++ + D TK F + TDASD A+G VL QD HP+++ SR LN+ E +Y+  EKE+ A+V   +T+RHYLLG  F 
Subjt:  NQTWDWTE-ECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSKFT

Query:  VMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALS
        + +D+   S+    K    K  RW+  L+EFDF ++Y  G+EN VADALS
Subjt:  VMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALS

P0CT41 Transposon Tf2-12 polyprotein1.2e-7531.35Show/hide
Query:  EPPVPDNIQNVLDEYKDIMPAELPKKLP-PRRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKA------------------PY
        EP +PD    +  E+KDI      +KLP P + ++ ++EL        +  Y + P +++ +  ++ + L +G I+ SKA                    
Subjt:  EPPVPDNIQNVLDEYKDIMPAELPKKLP-PRRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKA------------------PY

Query:  DYRALNKITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLD
        DY+ LNK    N Y +PLI     ++  +  F+K+DL S Y+ +R+++GDE K A     G +E+LVMP+G++ APA F   +N +     +  VV Y+D
Subjt:  DYRALNKITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLD

Query:  DIVVYSQTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPL
        DI+++S++  EHV+H++ V Q  ++  L I   KC F + +V+F+G+ I E         +  +L+WK P    EL+ FLG VNY R+FI   S +  PL
Subjt:  DIVVYSQTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPL

Query:  TNLLKKNQTWDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQ---DD--HPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTW
         NLLKK+  W WT    +A + +K  +   PV+   D +K   + TDASD A+G VL Q   DD  +P+ + S K++  + +Y+V +KEM A++  L+ W
Subjt:  TNLLKKNQTWDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQ---DD--HPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTW

Query:  RHYLLGS--KFTVMTD--NVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALSLKAE-------------LNTITTSMPTSDFLERIKE
        RHYL  +   F ++TD  N+        +    + ARWQ FL +F+F++ Y+PG  N +ADALS   +             +N +     T DF  ++  
Subjt:  RHYLLGS--KFTVMTD--NVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALSLKAE-------------LNTITTSMPTSDFLERIKE

Query:  GMQHDELAKNLLKLAKEGKTRRFWENNGTLLTIGNRLFVPKWGSLRKDILKECHD
           +D    NLL    +         +G L+   +++ +P    L + I+K+ H+
Subjt:  GMQHDELAKNLLKLAKEGKTRRFWENNGTLLTIGNRLFVPKWGSLRKDILKECHD

P20825 Retrovirus-related Pol polyprotein from transposon 2974.0e-8437.83Show/hide
Query:  IQNVLDEYKDIMPAELPKKLPPRRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPY-----------------------DYR
        ++ +L++++++   E  +KL     + H +     + P     Y +A     E+  Q++E+L+ G I+ S +PY                       DYR
Subjt:  IQNVLDEYKDIMPAELPKKLPPRRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPY-----------------------DYR

Query:  ALNKITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIV
         LN+ITI +RY IP + +   +LG  ++F+ IDL  G++Q+ + +   +KTA  T+ G YE+L MPFGL NAPATF   MN + +P L++  +VYLDDI+
Subjt:  ALNKITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIV

Query:  VYSQTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNL
        ++S +L EH+  ++ VF    D  L ++L+KC F K+E  FLGH +    +  +  KV+AI+ +  PTK  E+++FLG   YYR+FI  Y+ IA P+T+ 
Subjt:  VYSQTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNL

Query:  LKKNQTWDWTE-ECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGS
        LKK    D  + E   AF++LK ++  +P++ L D  K F + TDAS+ A+G VL Q+ HPI+F SR LND E +Y+  EKE+ A+V   +T+RHYLLG 
Subjt:  LKKNQTWDWTE-ECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHPIAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGS

Query:  KFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALS-LKAELN
        +F + +D+    +    K+   K  RW+  L+E+ FK++Y  G+EN VADALS +K E N
Subjt:  KFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALS-LKAELN

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein9.4e-7836.18Show/hide
Query:  EYKDIMPAELPKKLPPRRV----VDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAP------------------YDYRALNKITI
        +Y++I+  +LP +  P  +    V H IE++PGA+ P + PY +     +E+ + +++LLD  +I PSK+P                   DYR LNK TI
Subjt:  EYKDIMPAELPKKLPPRRV----VDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAP------------------YDYRALNKITI

Query:  KNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYSQTLE
         + + +P I +   ++GNA+ F+ +DLHSGY+Q+ ++  D  KTA VT  G YE+ VMPFGL NAP+TF   M   F+    RFV VYLDDI+++S++ E
Subjt:  KNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYSQTLE

Query:  EHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLL-KKNQT
        EH +HL  V +  ++  L +K +KC FA +E EFLG+ I   K+     K  AI ++  P  V + Q FLG +NYYRRFI   S IA P+   +  K+Q 
Subjt:  EHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLL-KKNQT

Query:  WDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHP------IAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSK
          WTE+  +A ++LK  +   PV+V  ++   + + TDAS   IG VL + D+       + + S+ L   +++Y   E E+  ++  L  +R+ L G  
Subjt:  WDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHP------IAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSK

Query:  FTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALSLKAELNTITTSMPTSDFLERIKEGMQHDELAKNLLKLAKE
        FT+ TD+++    Q + +   +  RW D LA +DF LEY  G +N+VADA+S      T  TS P     E  K   + D L   +L   KE
Subjt:  FTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALSLKAELNTITTSMPTSDFLERIKEGMQHDELAKNLLKLAKE

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.9e-7836.38Show/hide
Query:  EYKDIMPAELPKKLPPRRV----VDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAP------------------YDYRALNKITI
        +Y++I+  +LP +  P  +    V H IE++PGA+ P + PY +     +E+ + +++LLD  +I PSK+P                   DYR LNK TI
Subjt:  EYKDIMPAELPKKLPPRRV----VDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAP------------------YDYRALNKITI

Query:  KNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYSQTLE
         + + +P I +   ++GNA+ F+ +DLHSGY+Q+ ++  D  KTA VT  G YE+ VMPFGL NAP+TF   M   F+    RFV VYLDDI+++S++ E
Subjt:  KNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQGDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYSQTLE

Query:  EHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLL-KKNQT
        EH +HL  V +  ++  L +K +KC FA +E EFLG+ I   K+     K  AI ++  P  V + Q FLG +NYYRRFI   S IA P+   +  K+Q 
Subjt:  EHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLL-KKNQT

Query:  WDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHP------IAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSK
          WTE+  +A D+LK  +   PV+V  ++   + + TDAS   IG VL + D+       + + S+ L   +++Y   E E+  ++  L  +R+ L G  
Subjt:  WDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHP------IAFESRKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSK

Query:  FTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALSLKAELNTITTSMPTSDFLERIKEGMQHDELAKNLLKLAKE
        FT+ TD+++    Q + +   +  RW D LA +DF LEY  G +N+VADA+S      T  TS P     E  K   + D L   +L   KE
Subjt:  FTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALSLKAELNTITTSMPTSDFLERIKEGMQHDELAKNLLKLAKE

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.3e-2140.15Show/hide
Query:  VQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLG--HWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKKNQTW
        + HL  V Q++  ++ Y   +KC+F + ++ +LG  H I    +  D AK+ A++ W  P    EL+ FLG   YYRRF+K Y  I  PLT LLKKN + 
Subjt:  VQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLG--HWIKEGKLMMDNAKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKKNQTW

Query:  DWTEECQRAFDRLKHVVSEEPVMVLADHTKPF
         WTE    AF  LK  V+  PV+ L D   PF
Subjt:  DWTEECQRAFDRLKHVVSEEPVMVLADHTKPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGCTGAAGAAGGAAAAGAGCAACATTCTCCTACTCGACGCTCGAAATCAAAAGGTTCTGCAGTAAGGGAGCATGTCGACACTCCCCTTACCAACCTTGAGCAAGG
TATGGAGGATGTTCAGCTCGCGGTTGGTAGGCTGAGTGATAATTTTGAGGAATTGGTGCAAGAAAATGCCGAAACCACCTCGGTTGCCAAGGAGATGATTGAGAACATGG
GACGGACATTCCAGAAGGAGTTGAAAGAGCTTGCATCCATGGTGACAACCTTAAAGGCATTCGTGGAAGGTGAGTTACACGATCTCCACACTAAATCTATCTCTTTAGAA
ATGAGATTTGATGCCCTTTGTGTAGAATGTCGTTCCAAACATGCTACTAGCTATGCTCCCTCAACGAGCACACACCCCACTACGTCAGGCACTTCTAACATCAAAGTGCC
CAAACCCGACGTTTATAATGGCGTAAGAAATGCCACCGTCGTGGATAACTTCTTATTTGGCCTCGAAAGATACTTTGTCGCGTTAGGGGTGCACAATGACGAGGCAAGGA
TAAACCATGCCCCAACCTTCCTAAGGGATGCTGCACAATTGTGGTGGCGTCAATCTCGAGGTAAACTCCGTCGTCTCCGACACACCGGTAGTATTCTTGATTATGTAAAA
GAATTCACTACCTTAATGCTCGAGATAGGCGATCTACCCAAAAAAGAAATCGAGCTGGACCGTCGTAATGTCCAGACCCTTGACGATGGTATTGCTGTGGCAGAAACGCT
TGTTGATTACTCTGCCCAATCAATGGGGAAGAAGCCTGGTCCAGAGAAACATGGAGGAAAACCGGATAAGACCAAGAGCTTTGGTCGCAAGGACGGGGGAAAGGTGAAGA
CTTTCCAGTGGAGGAATGGCAAGAATGATGGTGCACATCGAGGTGAGTCCTCTAACCCTTCTAAACCCTGCTTTATCTGTAAAGGACCACATTGGACTAGAGATTGTCCA
AATCGAAAGGCATTAAATGCCCTCGTTGCAAAATTCCAAGAGGTAAAGCAAGTAGAAGATGCATCAGAACCCCAAATCGGGTCAATGCAGCAAATTGGGGTGATGAAGGA
GACTACTGCAGAACATAAGGGTCTCCTATACGGCAGCGTTCGAATAGAAGGAAAAGAGGCCACAGCCATGTTCGATACAGGAGCATCCCATAATTTTATGGATGTTCAAG
AGGCTAAGAGGTTAGGCCTCAAGTTTAAAGAAGAGACCGGGACCGTCAAAGTAGTAAACGCCAAGGAACAGGTCATTCATGGTATAGCCAAGGGAGTACTAGTGAAGATT
GGTGATTGGCAGAAGAGGTTAGACTTCTCCATCCTGCCCATGGACGATTTTCATATTGTACTTGGCTTGGGTTTCTTCGACAAGGTCGTAACCCTTCTCGACTCCAACCG
AGGCACACTATCCATTATAGATGGATTAATGACAACAATCCCAGTAAGGAGAGGGAAGCCAGTGAAGATGTTGTCAGCGTTACAATTCAAAAGAGGAGTCTCAAAGAACG
AATGCTATGTAGCAACCATGAAGGCCGTAGAAACCGAGGAAGCAAAATCTGACGAGCCTCCGGTACCGGACAACATACAGAATGTCCTTGACGAGTACAAGGACATAATG
CCTGCAGAGCTGCCAAAGAAACTTCCCCCAAGAAGGGTAGTAGACCACAAAATTGAACTAGAACCAGGAGCTAAACCACCTACCATGGCTCCTTATCGGATGGCCCCTCC
AGAACTTAAGGAGTTACGAAGACAGTTGAAAGAACTCCTCGATGCCGGATATATACAACCGTCAAAGGCCCCCTATGACTACAGGGCTCTCAACAAGATCACTATCAAGA
ACAGGTACTCCATCCCTCTTATTGCTGACCCTTTCGACCAGCTAGGAAATGCCCGATGGTTTAGCAAGATCGACCTACACTCCGGGTATTACCAGGTCAGAATAAAGCAA
GGGGACGAAGCAAAAACTGCATGTGTGACCAGGTATGGTGCTTATGAATTCTTAGTTATGCCTTTTGGCCTAACGAACGCACCAGCCACGTTCTGCACACTCATGAATAA
ACTCTTCCAGCCGTTCTTAGACCGATTTGTTGTAGTATATCTGGATGACATTGTTGTCTACAGTCAAACATTAGAAGAGCATGTCCAACACTTGAGACAAGTCTTCCAAG
TATTCCGCGACAATGAACTATACATAAAGCTGGAAAAATGTTCCTTTGCCAAGCAAGAAGTCGAGTTCCTTGGACATTGGATAAAAGAAGGCAAATTAATGATGGACAAT
GCCAAGGTACGTGCGATACTAGAGTGGAAGGCCCCAACTAAAGTACCCGAACTACAGTCTTTCTTGGGGTTCGTAAACTATTACCGGAGATTTATCAAGGGGTACTCTGG
CATCGCTGCCCCTCTCACCAACCTATTGAAGAAAAATCAAACGTGGGATTGGACGGAGGAGTGCCAGCGAGCATTTGACAGGTTAAAACATGTCGTCTCAGAGGAACCCG
TAATGGTACTGGCAGATCACACTAAACCCTTTGAAGTACACACTGATGCCTCAGACTTTGCAATAGGGGGAGTCCTGATGCAAGACGACCATCCTATAGCGTTCGAGAGT
CGAAAGTTGAACGACACCGAACGGCATTACACCGTGCAAGAAAAGGAAATGACAGCCATGGTCCACTGCCTAAGAACTTGGAGGCATTACCTTCTAGGTAGTAAATTTAC
GGTTATGACCGACAACGTTGCCACGAGCTACTTCCAGACTCAGAAGAAACTAACACCAAAGCAAGCTCGATGGCAAGACTTTCTAGCTGAATTTGACTTCAAGCTAGAAT
ATAAACCTGGGAGGGAAAATATTGTTGCCGATGCCCTCAGCCTCAAAGCCGAATTAAACACAATCACGACAAGCATGCCCACAAGCGACTTCCTTGAACGAATCAAGGAA
GGTATGCAACATGACGAGCTGGCCAAGAATCTGCTGAAATTGGCCAAGGAGGGAAAAACCAGGAGGTTTTGGGAGAACAACGGTACCCTCCTTACAATTGGTAACCGCCT
ATTTGTGCCAAAATGGGGATCATTGCGGAAGGACATCTTAAAAGAATGCCACGATTCCTTATAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGGCTGAAGAAGGAAAAGAGCAACATTCTCCTACTCGACGCTCGAAATCAAAAGGTTCTGCAGTAAGGGAGCATGTCGACACTCCCCTTACCAACCTTGAGCAAGG
TATGGAGGATGTTCAGCTCGCGGTTGGTAGGCTGAGTGATAATTTTGAGGAATTGGTGCAAGAAAATGCCGAAACCACCTCGGTTGCCAAGGAGATGATTGAGAACATGG
GACGGACATTCCAGAAGGAGTTGAAAGAGCTTGCATCCATGGTGACAACCTTAAAGGCATTCGTGGAAGGTGAGTTACACGATCTCCACACTAAATCTATCTCTTTAGAA
ATGAGATTTGATGCCCTTTGTGTAGAATGTCGTTCCAAACATGCTACTAGCTATGCTCCCTCAACGAGCACACACCCCACTACGTCAGGCACTTCTAACATCAAAGTGCC
CAAACCCGACGTTTATAATGGCGTAAGAAATGCCACCGTCGTGGATAACTTCTTATTTGGCCTCGAAAGATACTTTGTCGCGTTAGGGGTGCACAATGACGAGGCAAGGA
TAAACCATGCCCCAACCTTCCTAAGGGATGCTGCACAATTGTGGTGGCGTCAATCTCGAGGTAAACTCCGTCGTCTCCGACACACCGGTAGTATTCTTGATTATGTAAAA
GAATTCACTACCTTAATGCTCGAGATAGGCGATCTACCCAAAAAAGAAATCGAGCTGGACCGTCGTAATGTCCAGACCCTTGACGATGGTATTGCTGTGGCAGAAACGCT
TGTTGATTACTCTGCCCAATCAATGGGGAAGAAGCCTGGTCCAGAGAAACATGGAGGAAAACCGGATAAGACCAAGAGCTTTGGTCGCAAGGACGGGGGAAAGGTGAAGA
CTTTCCAGTGGAGGAATGGCAAGAATGATGGTGCACATCGAGGTGAGTCCTCTAACCCTTCTAAACCCTGCTTTATCTGTAAAGGACCACATTGGACTAGAGATTGTCCA
AATCGAAAGGCATTAAATGCCCTCGTTGCAAAATTCCAAGAGGTAAAGCAAGTAGAAGATGCATCAGAACCCCAAATCGGGTCAATGCAGCAAATTGGGGTGATGAAGGA
GACTACTGCAGAACATAAGGGTCTCCTATACGGCAGCGTTCGAATAGAAGGAAAAGAGGCCACAGCCATGTTCGATACAGGAGCATCCCATAATTTTATGGATGTTCAAG
AGGCTAAGAGGTTAGGCCTCAAGTTTAAAGAAGAGACCGGGACCGTCAAAGTAGTAAACGCCAAGGAACAGGTCATTCATGGTATAGCCAAGGGAGTACTAGTGAAGATT
GGTGATTGGCAGAAGAGGTTAGACTTCTCCATCCTGCCCATGGACGATTTTCATATTGTACTTGGCTTGGGTTTCTTCGACAAGGTCGTAACCCTTCTCGACTCCAACCG
AGGCACACTATCCATTATAGATGGATTAATGACAACAATCCCAGTAAGGAGAGGGAAGCCAGTGAAGATGTTGTCAGCGTTACAATTCAAAAGAGGAGTCTCAAAGAACG
AATGCTATGTAGCAACCATGAAGGCCGTAGAAACCGAGGAAGCAAAATCTGACGAGCCTCCGGTACCGGACAACATACAGAATGTCCTTGACGAGTACAAGGACATAATG
CCTGCAGAGCTGCCAAAGAAACTTCCCCCAAGAAGGGTAGTAGACCACAAAATTGAACTAGAACCAGGAGCTAAACCACCTACCATGGCTCCTTATCGGATGGCCCCTCC
AGAACTTAAGGAGTTACGAAGACAGTTGAAAGAACTCCTCGATGCCGGATATATACAACCGTCAAAGGCCCCCTATGACTACAGGGCTCTCAACAAGATCACTATCAAGA
ACAGGTACTCCATCCCTCTTATTGCTGACCCTTTCGACCAGCTAGGAAATGCCCGATGGTTTAGCAAGATCGACCTACACTCCGGGTATTACCAGGTCAGAATAAAGCAA
GGGGACGAAGCAAAAACTGCATGTGTGACCAGGTATGGTGCTTATGAATTCTTAGTTATGCCTTTTGGCCTAACGAACGCACCAGCCACGTTCTGCACACTCATGAATAA
ACTCTTCCAGCCGTTCTTAGACCGATTTGTTGTAGTATATCTGGATGACATTGTTGTCTACAGTCAAACATTAGAAGAGCATGTCCAACACTTGAGACAAGTCTTCCAAG
TATTCCGCGACAATGAACTATACATAAAGCTGGAAAAATGTTCCTTTGCCAAGCAAGAAGTCGAGTTCCTTGGACATTGGATAAAAGAAGGCAAATTAATGATGGACAAT
GCCAAGGTACGTGCGATACTAGAGTGGAAGGCCCCAACTAAAGTACCCGAACTACAGTCTTTCTTGGGGTTCGTAAACTATTACCGGAGATTTATCAAGGGGTACTCTGG
CATCGCTGCCCCTCTCACCAACCTATTGAAGAAAAATCAAACGTGGGATTGGACGGAGGAGTGCCAGCGAGCATTTGACAGGTTAAAACATGTCGTCTCAGAGGAACCCG
TAATGGTACTGGCAGATCACACTAAACCCTTTGAAGTACACACTGATGCCTCAGACTTTGCAATAGGGGGAGTCCTGATGCAAGACGACCATCCTATAGCGTTCGAGAGT
CGAAAGTTGAACGACACCGAACGGCATTACACCGTGCAAGAAAAGGAAATGACAGCCATGGTCCACTGCCTAAGAACTTGGAGGCATTACCTTCTAGGTAGTAAATTTAC
GGTTATGACCGACAACGTTGCCACGAGCTACTTCCAGACTCAGAAGAAACTAACACCAAAGCAAGCTCGATGGCAAGACTTTCTAGCTGAATTTGACTTCAAGCTAGAAT
ATAAACCTGGGAGGGAAAATATTGTTGCCGATGCCCTCAGCCTCAAAGCCGAATTAAACACAATCACGACAAGCATGCCCACAAGCGACTTCCTTGAACGAATCAAGGAA
GGTATGCAACATGACGAGCTGGCCAAGAATCTGCTGAAATTGGCCAAGGAGGGAAAAACCAGGAGGTTTTGGGAGAACAACGGTACCCTCCTTACAATTGGTAACCGCCT
ATTTGTGCCAAAATGGGGATCATTGCGGAAGGACATCTTAAAAGAATGCCACGATTCCTTATAG
Protein sequenceShow/hide protein sequence
MSAEEGKEQHSPTRRSKSKGSAVREHVDTPLTNLEQGMEDVQLAVGRLSDNFEELVQENAETTSVAKEMIENMGRTFQKELKELASMVTTLKAFVEGELHDLHTKSISLE
MRFDALCVECRSKHATSYAPSTSTHPTTSGTSNIKVPKPDVYNGVRNATVVDNFLFGLERYFVALGVHNDEARINHAPTFLRDAAQLWWRQSRGKLRRLRHTGSILDYVK
EFTTLMLEIGDLPKKEIELDRRNVQTLDDGIAVAETLVDYSAQSMGKKPGPEKHGGKPDKTKSFGRKDGGKVKTFQWRNGKNDGAHRGESSNPSKPCFICKGPHWTRDCP
NRKALNALVAKFQEVKQVEDASEPQIGSMQQIGVMKETTAEHKGLLYGSVRIEGKEATAMFDTGASHNFMDVQEAKRLGLKFKEETGTVKVVNAKEQVIHGIAKGVLVKI
GDWQKRLDFSILPMDDFHIVLGLGFFDKVVTLLDSNRGTLSIIDGLMTTIPVRRGKPVKMLSALQFKRGVSKNECYVATMKAVETEEAKSDEPPVPDNIQNVLDEYKDIM
PAELPKKLPPRRVVDHKIELEPGAKPPTMAPYRMAPPELKELRRQLKELLDAGYIQPSKAPYDYRALNKITIKNRYSIPLIADPFDQLGNARWFSKIDLHSGYYQVRIKQ
GDEAKTACVTRYGAYEFLVMPFGLTNAPATFCTLMNKLFQPFLDRFVVVYLDDIVVYSQTLEEHVQHLRQVFQVFRDNELYIKLEKCSFAKQEVEFLGHWIKEGKLMMDN
AKVRAILEWKAPTKVPELQSFLGFVNYYRRFIKGYSGIAAPLTNLLKKNQTWDWTEECQRAFDRLKHVVSEEPVMVLADHTKPFEVHTDASDFAIGGVLMQDDHPIAFES
RKLNDTERHYTVQEKEMTAMVHCLRTWRHYLLGSKFTVMTDNVATSYFQTQKKLTPKQARWQDFLAEFDFKLEYKPGRENIVADALSLKAELNTITTSMPTSDFLERIKE
GMQHDELAKNLLKLAKEGKTRRFWENNGTLLTIGNRLFVPKWGSLRKDILKECHDSL