; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi04G017470 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi04G017470
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionGolgin subfamily A member 6-like protein 22
Genome locationchr04:24753833..24764700
RNA-Seq ExpressionLsi04G017470
SyntenyLsi04G017470
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136491.1 uncharacterized protein LOC101222062 isoform X2 [Cucumis sativus]2.9e-15171Show/hide
Query:  MGKSDRSQASIERRNWGKIFNGLTQMLRTQQNQLETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQTEFYL
        MGKSDRS+ SIERRNWGKIFNGLTQMLRTQQNQLETLV ERKLLEDRVKMQHERW ADIRLYEDH+SQM+D+L LQDMERS Q SKSDLL GMKQTE Y+
Subjt:  MGKSDRSQASIERRNWGKIFNGLTQMLRTQQNQLETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQTEFYL

Query:  CRLKIGFCLFSPFNLLKLRIVKAFGRIKFMIPHRRNLTSPGTLSQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAELEDFKSFFDDLISHRNSNPQEAS
        CRLKI                                                                  +HSEAELEDFKSFFDD I+H+NS  QE+ 
Subjt:  CRLKIGFCLFSPFNLLKLRIVKAFGRIKFMIPHRRNLTSPGTLSQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAELEDFKSFFDDLISHRNSNPQEAS

Query:  LRSAAEPAEANGGRESVLSACGNIDEARRSKALEGEVRRLRCEHEKLASEKSLEVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERAHLKVEKLLAT
        LRSA+EPAEANGG E  +S  GN DE RRS+ALE EVRR R E+EKLASEKS EVSALV E KFVWNQYNV+E DYSSKLK+K SELERAHLKVE+LLAT
Subjt:  LRSAAEPAEANGGRESVLSACGNIDEARRSKALEGEVRRLRCEHEKLASEKSLEVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERAHLKVEKLLAT

Query:  LEQLQSSNNEKDGVIATLRNQVGKMETDSCKLKDEISRLSHNLEVQRKSMNASATPVLNPCKAGTRPSSLGGKNGTKSRSNVIVNKDTSSAQPSHSGNQK
        LEQLQSSNNEKD VIA LRNQVGKMETDS KLKDEISRLSH+LEVQRKS+NA+ATPVL PCKAG R S LGGKNG++SRSNVIVNKD  SAQPSHSGNQ 
Subjt:  LEQLQSSNNEKDGVIATLRNQVGKMETDSCKLKDEISRLSHNLEVQRKSMNASATPVLNPCKAGTRPSSLGGKNGTKSRSNVIVNKDTSSAQPSHSGNQK

Query:  KRGADDISDPGTPRLFTSSFKVPKLKNEINL
        KRGA DISDPGTPRLFTSSFKVPKLKNEINL
Subjt:  KRGADDISDPGTPRLFTSSFKVPKLKNEINL

XP_008466482.1 PREDICTED: uncharacterized protein LOC103503874 [Cucumis melo]5.9e-14971Show/hide
Query:  MGKSDRSQASIERRNWGKIFNGLTQMLRTQQNQLETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQTEFYL
        MGK+DRS+ASIER+NWGKIFNGLTQMLRTQQNQLETLV ERKLLEDRVKMQHERW ADIRLYEDHISQ++D+LLL DMERSLQ SKSDLL+GMKQTE Y+
Subjt:  MGKSDRSQASIERRNWGKIFNGLTQMLRTQQNQLETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQTEFYL

Query:  CRLKIGFCLFSPFNLLKLRIVKAFGRIKFMIPHRRNLTSPGTLSQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAELEDFKSFFDDLISHRNSNPQEAS
        CRLKI                                                                  + SEAE EDFKS FDD I+H+NSN QE+ 
Subjt:  CRLKIGFCLFSPFNLLKLRIVKAFGRIKFMIPHRRNLTSPGTLSQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAELEDFKSFFDDLISHRNSNPQEAS

Query:  LRSAAEPAEANGGRESVLSACGNIDEARRSKALEGEVRRLRCEHEKLASEKSLEVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERAHLKVEKLLAT
        LRSA+EPAEANGGRE VLS  GN DE RRSKALE EVRRLR E+EKLASEKS EVSALV EKKFVW+QYNV+E DYSSKLK+KQSELE A LKVE+LLAT
Subjt:  LRSAAEPAEANGGRESVLSACGNIDEARRSKALEGEVRRLRCEHEKLASEKSLEVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERAHLKVEKLLAT

Query:  LEQLQSSNNEKDGVIATLRNQVGKMETDSCKLKDEISRLSHNLEVQRKSMNASATPVLNPCKAGTRPSSLGGKNGTKSRSNVIVNKDTSSAQPSHSGNQK
        LEQLQ+SN+EKD VIATL+NQVGKMETDSCKLKDEISRLS++LEVQRKS+N +ATPVL PCKA TR S LG KN +KSRSNV VNKD SSAQPSHSGNQK
Subjt:  LEQLQSSNNEKDGVIATLRNQVGKMETDSCKLKDEISRLSHNLEVQRKSMNASATPVLNPCKAGTRPSSLGGKNGTKSRSNVIVNKDTSSAQPSHSGNQK

Query:  KRGADDISDPGTPRLFTSSFKVPKLKNEINL
        KRGADDISDPGTPRLFTSSFKVPKLKNEINL
Subjt:  KRGADDISDPGTPRLFTSSFKVPKLKNEINL

XP_038900070.1 keratin, type II cytoskeletal I-like isoform X1 [Benincasa hispida]8.0e-16275.23Show/hide
Query:  MGKSDRSQASIERRNWGKIFNGLTQMLRTQQNQLETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQTEFYL
        MGK+D SQASIERRNWGKIFNGLTQMLR QQNQLETLV ERKLLEDRVKMQHERW AD RLYEDHISQMKD+LLLQDMERSLQ SKSDLLTGMKQTE YL
Subjt:  MGKSDRSQASIERRNWGKIFNGLTQMLRTQQNQLETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQTEFYL

Query:  CRLKIGFCLFSPFNLLKLRIVKAFGRIKFMIPHRRNLTSPGTLSQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAELEDFKSFFDDLISHRNSNPQEAS
         RLKI                                                                  +HSEAELEDFKSFFDDLISH+NSN QE+S
Subjt:  CRLKIGFCLFSPFNLLKLRIVKAFGRIKFMIPHRRNLTSPGTLSQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAELEDFKSFFDDLISHRNSNPQEAS

Query:  LRSAAEPAEANGGRESVLSACGNIDEARRS-KALEGEVRRLRCEHEKLASEKSLEVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERAHLKVEKLLA
        L SA+EP EANGGRE VLS  GN +E RRS K LEGEVRRLRCE+EKLASEKSLEVSALVAEKKFVWNQYNV+E+D+SSKLKSKQSELERAHLK+EKLLA
Subjt:  LRSAAEPAEANGGRESVLSACGNIDEARRS-KALEGEVRRLRCEHEKLASEKSLEVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERAHLKVEKLLA

Query:  TLEQLQSSNNEKDGVIATLRNQVGKMETDSCKLKDEISRLSHNLEVQRKSMNASATPVLNPCKAGTRPSSLGGKNGTKSRSNVIVNKDTSSAQPSHSGNQ
        TLEQLQ+SNNEKDGVIATLRNQVG MET+SCKLKDEISRLSH+LEVQRKS+NA+ATPVLNPCKAG+RPS LGGKNGTK+RSNV VNK TSSAQPSHSGNQ
Subjt:  TLEQLQSSNNEKDGVIATLRNQVGKMETDSCKLKDEISRLSHNLEVQRKSMNASATPVLNPCKAGTRPSSLGGKNGTKSRSNVIVNKDTSSAQPSHSGNQ

Query:  KKRGADDISDPGTPRLFTSSFKVPKLKNEINL
        KKRGADDISDPGTPRLFTSSFKVPKLKNEINL
Subjt:  KKRGADDISDPGTPRLFTSSFKVPKLKNEINL

XP_038900071.1 keratin, type II cytoskeletal I-like isoform X2 [Benincasa hispida]7.5e-15274.22Show/hide
Query:  MGKSDRSQASIERRNWGKIFNGLTQMLRTQQNQLETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQTEFYL
        MGK+D SQASIERRNWGKIFNGLTQMLR QQNQLETLV ERKLLEDRVKMQHERW AD RLYEDHISQMKD+LLLQDMERSLQ SKSDLLTGMKQTE YL
Subjt:  MGKSDRSQASIERRNWGKIFNGLTQMLRTQQNQLETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQTEFYL

Query:  CRLKIGFCLFSPFNLLKLRIVKAFGRIKFMIPHRRNLTSPGTLSQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAELEDFKSFFDDLISHRNSNPQEAS
         RLKI                                                                  +HSEAELEDFKSFFDDLISH+NSN QE+S
Subjt:  CRLKIGFCLFSPFNLLKLRIVKAFGRIKFMIPHRRNLTSPGTLSQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAELEDFKSFFDDLISHRNSNPQEAS

Query:  LRSAAEPAEANGGRESVLSACGNIDEARRS-KALEGEVRRLRCEHEKLASEKSLEVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERAHLKVEKLLA
        L SA+EP EANGGRE VLS  GN +E RRS K LEGEVRRLRCE+EKLASEKSLEVSALVAEKKFVWNQYNV+E+D+SSKLKSKQSELERAHLK+EKLLA
Subjt:  LRSAAEPAEANGGRESVLSACGNIDEARRS-KALEGEVRRLRCEHEKLASEKSLEVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERAHLKVEKLLA

Query:  TLEQLQSSNNEKDGVIATLRNQVGKMETDSCKLKDEISRLSHNLEVQRKSMNASATPVLNPCKAGTRPSSLGGKNGTKSRSNVIVNKDTSSAQPSHSGNQ
        TLEQLQ+SNNEKDGVIATLRNQVG MET+SCKLKDEISRLSH+LEVQRKS+NA+ATPVLNPCKAG+RPS LGGKNGTK+RSNV VNK TSSAQPSHSGNQ
Subjt:  TLEQLQSSNNEKDGVIATLRNQVGKMETDSCKLKDEISRLSHNLEVQRKSMNASATPVLNPCKAGTRPSSLGGKNGTKSRSNVIVNKDTSSAQPSHSGNQ

Query:  KKRGADDISDPGTPR
        KKRGADDISDPGTPR
Subjt:  KKRGADDISDPGTPR

XP_038900072.1 keratin, type II cytoskeletal I-like isoform X3 [Benincasa hispida]2.9e-15174.15Show/hide
Query:  MGKSDRSQASIERRNWGKIFNGLTQMLRTQQNQLETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQTEFYL
        MGK+D SQASIERRNWGKIFNGLTQMLR QQNQLETLV ERKLLEDRVKMQHERW AD RLYEDHISQMKD+LLLQDMERSLQ SKSDLLTGMKQTE YL
Subjt:  MGKSDRSQASIERRNWGKIFNGLTQMLRTQQNQLETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQTEFYL

Query:  CRLKIGFCLFSPFNLLKLRIVKAFGRIKFMIPHRRNLTSPGTLSQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAELEDFKSFFDDLISHRNSNPQEAS
         RLKI                                                                  +HSEAELEDFKSFFDDLISH+NSN QE+S
Subjt:  CRLKIGFCLFSPFNLLKLRIVKAFGRIKFMIPHRRNLTSPGTLSQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAELEDFKSFFDDLISHRNSNPQEAS

Query:  LRSAAEPAEANGGRESVLSACGNIDEARRS-KALEGEVRRLRCEHEKLASEKSLEVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERAHLKVEKLLA
        L SA+EP EANGGRE VLS  GN +E RRS K LEGEVRRLRCE+EKLASEKSLEVSALVAEKKFVWNQYNV+E+D+SSKLKSKQSELERAHLK+EKLLA
Subjt:  LRSAAEPAEANGGRESVLSACGNIDEARRS-KALEGEVRRLRCEHEKLASEKSLEVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERAHLKVEKLLA

Query:  TLEQLQSSNNEKDGVIATLRNQVGKMETDSCKLKDEISRLSHNLEVQRKSMNASATPVLNPCKAGTRPSSLGGKNGTKSRSNVIVNKDTSSAQPSHSGNQ
        TLEQLQ+SNNEKDGVIATLRNQVG MET+SCKLKDEISRLSH+LEVQRKS+NA+ATPVLNPCKAG+RPS LGGKNGTK+RSNV VNK TSSAQPSHSGNQ
Subjt:  TLEQLQSSNNEKDGVIATLRNQVGKMETDSCKLKDEISRLSHNLEVQRKSMNASATPVLNPCKAGTRPSSLGGKNGTKSRSNVIVNKDTSSAQPSHSGNQ

Query:  KKRGADDISDPGTP
        KKRGADDISDPGTP
Subjt:  KKRGADDISDPGTP

TrEMBL top hitse value%identityAlignment
A0A0A0LH58 Uncharacterized protein1.4e-15171Show/hide
Query:  MGKSDRSQASIERRNWGKIFNGLTQMLRTQQNQLETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQTEFYL
        MGKSDRS+ SIERRNWGKIFNGLTQMLRTQQNQLETLV ERKLLEDRVKMQHERW ADIRLYEDH+SQM+D+L LQDMERS Q SKSDLL GMKQTE Y+
Subjt:  MGKSDRSQASIERRNWGKIFNGLTQMLRTQQNQLETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQTEFYL

Query:  CRLKIGFCLFSPFNLLKLRIVKAFGRIKFMIPHRRNLTSPGTLSQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAELEDFKSFFDDLISHRNSNPQEAS
        CRLKI                                                                  +HSEAELEDFKSFFDD I+H+NS  QE+ 
Subjt:  CRLKIGFCLFSPFNLLKLRIVKAFGRIKFMIPHRRNLTSPGTLSQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAELEDFKSFFDDLISHRNSNPQEAS

Query:  LRSAAEPAEANGGRESVLSACGNIDEARRSKALEGEVRRLRCEHEKLASEKSLEVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERAHLKVEKLLAT
        LRSA+EPAEANGG E  +S  GN DE RRS+ALE EVRR R E+EKLASEKS EVSALV E KFVWNQYNV+E DYSSKLK+K SELERAHLKVE+LLAT
Subjt:  LRSAAEPAEANGGRESVLSACGNIDEARRSKALEGEVRRLRCEHEKLASEKSLEVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERAHLKVEKLLAT

Query:  LEQLQSSNNEKDGVIATLRNQVGKMETDSCKLKDEISRLSHNLEVQRKSMNASATPVLNPCKAGTRPSSLGGKNGTKSRSNVIVNKDTSSAQPSHSGNQK
        LEQLQSSNNEKD VIA LRNQVGKMETDS KLKDEISRLSH+LEVQRKS+NA+ATPVL PCKAG R S LGGKNG++SRSNVIVNKD  SAQPSHSGNQ 
Subjt:  LEQLQSSNNEKDGVIATLRNQVGKMETDSCKLKDEISRLSHNLEVQRKSMNASATPVLNPCKAGTRPSSLGGKNGTKSRSNVIVNKDTSSAQPSHSGNQK

Query:  KRGADDISDPGTPRLFTSSFKVPKLKNEINL
        KRGA DISDPGTPRLFTSSFKVPKLKNEINL
Subjt:  KRGADDISDPGTPRLFTSSFKVPKLKNEINL

A0A1S3CRD4 uncharacterized protein LOC1035038742.9e-14971Show/hide
Query:  MGKSDRSQASIERRNWGKIFNGLTQMLRTQQNQLETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQTEFYL
        MGK+DRS+ASIER+NWGKIFNGLTQMLRTQQNQLETLV ERKLLEDRVKMQHERW ADIRLYEDHISQ++D+LLL DMERSLQ SKSDLL+GMKQTE Y+
Subjt:  MGKSDRSQASIERRNWGKIFNGLTQMLRTQQNQLETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQTEFYL

Query:  CRLKIGFCLFSPFNLLKLRIVKAFGRIKFMIPHRRNLTSPGTLSQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAELEDFKSFFDDLISHRNSNPQEAS
        CRLKI                                                                  + SEAE EDFKS FDD I+H+NSN QE+ 
Subjt:  CRLKIGFCLFSPFNLLKLRIVKAFGRIKFMIPHRRNLTSPGTLSQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAELEDFKSFFDDLISHRNSNPQEAS

Query:  LRSAAEPAEANGGRESVLSACGNIDEARRSKALEGEVRRLRCEHEKLASEKSLEVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERAHLKVEKLLAT
        LRSA+EPAEANGGRE VLS  GN DE RRSKALE EVRRLR E+EKLASEKS EVSALV EKKFVW+QYNV+E DYSSKLK+KQSELE A LKVE+LLAT
Subjt:  LRSAAEPAEANGGRESVLSACGNIDEARRSKALEGEVRRLRCEHEKLASEKSLEVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERAHLKVEKLLAT

Query:  LEQLQSSNNEKDGVIATLRNQVGKMETDSCKLKDEISRLSHNLEVQRKSMNASATPVLNPCKAGTRPSSLGGKNGTKSRSNVIVNKDTSSAQPSHSGNQK
        LEQLQ+SN+EKD VIATL+NQVGKMETDSCKLKDEISRLS++LEVQRKS+N +ATPVL PCKA TR S LG KN +KSRSNV VNKD SSAQPSHSGNQK
Subjt:  LEQLQSSNNEKDGVIATLRNQVGKMETDSCKLKDEISRLSHNLEVQRKSMNASATPVLNPCKAGTRPSSLGGKNGTKSRSNVIVNKDTSSAQPSHSGNQK

Query:  KRGADDISDPGTPRLFTSSFKVPKLKNEINL
        KRGADDISDPGTPRLFTSSFKVPKLKNEINL
Subjt:  KRGADDISDPGTPRLFTSSFKVPKLKNEINL

A0A5A7VA23 Putative Cytomatrix protein-related6.4e-14970.77Show/hide
Query:  MGKSDRSQASIERRNWGKIFNGLTQMLRTQQNQLETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQTEFYL
        MGK+DRS+ASIER+NWGKIFNGLTQMLRTQQNQLETLV ERKLLEDRVKMQHERW ADIRLYEDHISQ++D+LLL DMERSLQ SKSDLL+GMKQTE Y+
Subjt:  MGKSDRSQASIERRNWGKIFNGLTQMLRTQQNQLETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQTEFYL

Query:  CRLKIGFCLFSPFNLLKLRIVKAFGRIKFMIPHRRNLTSPGTLSQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAELEDFKSFFDDLISHRNSNPQEAS
        CRLKI                                                                  + SEAE EDFKS FDD I+H+NSN QE+ 
Subjt:  CRLKIGFCLFSPFNLLKLRIVKAFGRIKFMIPHRRNLTSPGTLSQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAELEDFKSFFDDLISHRNSNPQEAS

Query:  LRSAAEPAEANGGRESVLSACGNIDEARRSKALEGEVRRLRCEHEKLASEKSLEVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERAHLKVEKLLAT
        LRSA+EPAEANGGRE V+S  GN DE RRSKALE EVRRLR E+EKLASEKS EVSALV EKKFVW+QYNV+E DYSSKLK+KQSELE A LKVE+LLAT
Subjt:  LRSAAEPAEANGGRESVLSACGNIDEARRSKALEGEVRRLRCEHEKLASEKSLEVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERAHLKVEKLLAT

Query:  LEQLQSSNNEKDGVIATLRNQVGKMETDSCKLKDEISRLSHNLEVQRKSMNASATPVLNPCKAGTRPSSLGGKNGTKSRSNVIVNKDTSSAQPSHSGNQK
        LEQLQ+SN+EKD VIATL+NQVGKMETDSCKLKDEISRLS++LEVQRKS+N +ATPVL PCKA TR S LG KN +KSRSNV VNKD SSAQPSHSGNQK
Subjt:  LEQLQSSNNEKDGVIATLRNQVGKMETDSCKLKDEISRLSHNLEVQRKSMNASATPVLNPCKAGTRPSSLGGKNGTKSRSNVIVNKDTSSAQPSHSGNQK

Query:  KRGADDISDPGTPRLFTSSFKVPKLKNEINL
        KRGADDISDPGTPRLFTSSFKVPKLKNEINL
Subjt:  KRGADDISDPGTPRLFTSSFKVPKLKNEINL

A0A5D3E6L6 Putative Cytomatrix protein-related2.9e-14971Show/hide
Query:  MGKSDRSQASIERRNWGKIFNGLTQMLRTQQNQLETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQTEFYL
        MGK+DRS+ASIER+NWGKIFNGLTQMLRTQQNQLETLV ERKLLEDRVKMQHERW ADIRLYEDHISQ++D+LLL DMERSLQ SKSDLL+GMKQTE Y+
Subjt:  MGKSDRSQASIERRNWGKIFNGLTQMLRTQQNQLETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQTEFYL

Query:  CRLKIGFCLFSPFNLLKLRIVKAFGRIKFMIPHRRNLTSPGTLSQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAELEDFKSFFDDLISHRNSNPQEAS
        CRLKI                                                                  + SEAE EDFKS FDD I+H+NSN QE+ 
Subjt:  CRLKIGFCLFSPFNLLKLRIVKAFGRIKFMIPHRRNLTSPGTLSQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAELEDFKSFFDDLISHRNSNPQEAS

Query:  LRSAAEPAEANGGRESVLSACGNIDEARRSKALEGEVRRLRCEHEKLASEKSLEVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERAHLKVEKLLAT
        LRSA+EPAEANGGRE VLS  GN DE RRSKALE EVRRLR E+EKLASEKS EVSALV EKKFVW+QYNV+E DYSSKLK+KQSELE A LKVE+LLAT
Subjt:  LRSAAEPAEANGGRESVLSACGNIDEARRSKALEGEVRRLRCEHEKLASEKSLEVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERAHLKVEKLLAT

Query:  LEQLQSSNNEKDGVIATLRNQVGKMETDSCKLKDEISRLSHNLEVQRKSMNASATPVLNPCKAGTRPSSLGGKNGTKSRSNVIVNKDTSSAQPSHSGNQK
        LEQLQ+SN+EKD VIATL+NQVGKMETDSCKLKDEISRLS++LEVQRKS+N +ATPVL PCKA TR S LG KN +KSRSNV VNKD SSAQPSHSGNQK
Subjt:  LEQLQSSNNEKDGVIATLRNQVGKMETDSCKLKDEISRLSHNLEVQRKSMNASATPVLNPCKAGTRPSSLGGKNGTKSRSNVIVNKDTSSAQPSHSGNQK

Query:  KRGADDISDPGTPRLFTSSFKVPKLKNEINL
        KRGADDISDPGTPRLFTSSFKVPKLKNEINL
Subjt:  KRGADDISDPGTPRLFTSSFKVPKLKNEINL

A0A6J1BRF6 protein MLP1-like isoform X25.4e-14066.82Show/hide
Query:  MGKSDRSQASIERRNWGKIFNGLTQMLRTQQNQLETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQTEFYL
        MGKS+ S+ASI+RRNWGKIFN +TQMLRTQQNQLETLVKERKLLEDRV+ QHERW ADIRLYEDHISQMKD+LLL++MERSL+ SKSDLL GMKQTE YL
Subjt:  MGKSDRSQASIERRNWGKIFNGLTQMLRTQQNQLETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQTEFYL

Query:  CRLKIGFCLFSPFNLLKLRIVKAFGRIKFMIPHRRNLTSPGTLSQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAELEDFKSFFDDLISHRNSNPQEAS
        CRLKI                                                                  + SE ELEDFKSFFDDL+SH+ S+PQE  
Subjt:  CRLKIGFCLFSPFNLLKLRIVKAFGRIKFMIPHRRNLTSPGTLSQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAELEDFKSFFDDLISHRNSNPQEAS

Query:  LRSAAEPAEANGGRESVLSACGNIDEARRSKALEGEVRRLRCEHEKLASEKSLEVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERAHLKVEKLLAT
        LR+A+E AEANG RES  +A G  D  RR KALEGEVRRLR E+EKLASEK  EVSALVAEKKFVWNQYNVME+ Y+SKLKSK SELE A+ K+EKL+AT
Subjt:  LRSAAEPAEANGGRESVLSACGNIDEARRSKALEGEVRRLRCEHEKLASEKSLEVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERAHLKVEKLLAT

Query:  LEQLQSSNNEKDGVIATLRNQVGKMETDSCKLKDEISRLSHNLEVQRKSMNASATPVLNPCKAGTRPSSLGGKNGTKSRSNVIVNKDTSSAQPSHSGNQK
        LEQLQ+S+NEKDG+IATLR+QVGKMETDS KLK+E+S+LSH LEVQRKSMNA+ATPVLN C AGTRPSSLGGKN  K+RSNV +NKD SSAQ S SGN+K
Subjt:  LEQLQSSNNEKDGVIATLRNQVGKMETDSCKLKDEISRLSHNLEVQRKSMNASATPVLNPCKAGTRPSSLGGKNGTKSRSNVIVNKDTSSAQPSHSGNQK

Query:  KRGADDISDPGTPRLFTSSFKVPKLKNEINL
        +RG D IS+PGTPRLFTS+FKVPKLKNEINL
Subjt:  KRGADDISDPGTPRLFTSSFKVPKLKNEINL

SwissProt top hitse value%identityAlignment
P25386 Intracellular protein transport protein USO14.2e-0423.3Show/hide
Query:  QMLRTQQNQLETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTG----MKQTEFYLCRLKIGFCLFSPFNLLKLRI
        Q L + +  LE+L KE + L  ++K   E+     R Y + ISQ+ D++     E      K+D L G    MK T      LK      S  + L L+I
Subjt:  QMLRTQQNQLETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTG----MKQTEFYLCRLKIGFCLFSPFNLLKLRI

Query:  VKAFGRIKFMIPHRRNLTSPGTLSQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAEL---EDFKSFFDDLISHRNSNPQE---------ASLRSAAEPA
         +           ++N T+  +L +  ++V   T    + +D C +   EV   E +L   ED  S + +L        +E           L      +
Subjt:  VKAFGRIKFMIPHRRNLTSPGTLSQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAEL---EDFKSFFDDLISHRNSNPQE---------ASLRSAAEPA

Query:  EANGGRESVLSACGNIDEARRSKALEGEVRRLRCEHEKLASEKSLEVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERA----HLKVEKLLATLEQL
        +A    ES LS         R  A E        + EKL +E  ++  A   E+K +    + +  +YS K+ + + EL R      LK +++  T  +L
Subjt:  EANGGRESVLSACGNIDEARRSKALEGEVRRLRCEHEKLASEKSLEVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERA----HLKVEKLLATLEQL

Query:  QSSNNEKDGVIATLRNQVGKMETDSCKLKDEISRLSHNL
        +  +   D ++   +N +  ++ +    KD+I+R    L
Subjt:  QSSNNEKDGVIATLRNQVGKMETDSCKLKDEISRLSHNL

Arabidopsis top hitse value%identityAlignment
AT1G19980.1 cytomatrix protein-related1.1e-4732.79Show/hide
Query:  SIERRNWGKIFNGLTQMLRTQQNQLETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQTEFYLCRLKIGFCL
        S ER NW  IF  L ++L+T+Q+QLE+L+K++K+LE  +K  +E W +D+R YED +S M  ++    M + L+T KS+LL G+K+ +  LC LK+    
Subjt:  SIERRNWGKIFNGLTQMLRTQQNQLETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQTEFYLCRLKIGFCL

Query:  FSPFNLLKLRIVKAFGRIKFMIPHRRNLTSPGTLSQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAELEDFKSFFDDLISHRNSNPQEASLRSAAEPAE
                                                                      +HS  EL+DFK++FD L  + N                
Subjt:  FSPFNLLKLRIVKAFGRIKFMIPHRRNLTSPGTLSQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAELEDFKSFFDDLISHRNSNPQEASLRSAAEPAE

Query:  ANGGRESVLSACGNIDEARRSKALEGEVRRLRCEHEKLASEKSLEVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERAHLKVEKLLATLEQLQSSNN
               V S  GN  E    K+LE ++R+L+ E+EKLASEK  EVS L+ E  F WNQ+  +E +++ KLK K  E+ +A+ K+  L++  EQLQSSN 
Subjt:  ANGGRESVLSACGNIDEARRSKALEGEVRRLRCEHEKLASEKSLEVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERAHLKVEKLLATLEQLQSSNN

Query:  EKDGVIATLRNQVGKMETDSCKLKDEISRLSHNLEVQRKSMNASATPVLNPCKAGTRPSSLGGKNGTKSRSNVIVNKDTSSAQPSH------SGNQKKRG
        EKD  I+ L+ ++ +MET+S K  +EIS+L+ +LE  +KS     TPVL  C    + S     NG    S++   KD S+A  ++      S  ++ + 
Subjt:  EKDGVIATLRNQVGKMETDSCKLKDEISRLSHNLEVQRKSMNASATPVLNPCKAGTRPSSLGGKNGTKSRSNVIVNKDTSSAQPSH------SGNQKKRG

Query:  ADDISDPGTPRLFTSSFKVPKLKNEIN
           +S    P+LFTS+F++PKLK+  N
Subjt:  ADDISDPGTPRLFTSSFKVPKLKNEIN

AT1G19990.1 unknown protein1.6e-2744.13Show/hide
Query:  SEDQKPIKKAKVELDESDDGMSLGALLQEKRKKLLNVGSKLFSKPKKEELQGVDGLGKSPKMDSGSASKGTKVKKEERFNSFGDDFDEKPVKKSSAAKRD
        SED   +K  K++ +  +D  SL +  ++K     N GSK   K KKEE    D   K P   S S S+   VKK+E  +    D ++KPV K +++   
Subjt:  SEDQKPIKKAKVELDESDDGMSLGALLQEKRKKLLNVGSKLFSKPKKEELQGVDGLGKSPKMDSGSASKGTKVKKEERFNSFGDDFDEKPVKKSSAAKRD

Query:  MELKKKKKVKEEEKSRSSKEELDSLKKKRKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQLPHSEMAQ--MMESGLLSKEEAKKVFEKKQKKAPLQKLS
        +  + KK  KEEE             KK++E+KVYDLPGQKR+ P+ERDPLRIFYE+L+KQ+P S+MAQ  +MESGLL  E+AKKV EKK +K    KLS
Subjt:  MELKKKKKVKEEEKSRSSKEELDSLKKKRKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQLPHSEMAQ--MMESGLLSKEEAKKVFEKKQKKAPLQKLS

Query:  SPVKTVSAV-----KSVTKTAIVNKTVQSSP--VSSNKTTKVDSKVITKLSKKRKSKNESSEDESDDDIIISRSIKKKPRA
        SPVK+ ++      KSVT   +  K VQ SP    SNK    DSK  TK     K K  S +D+SDDD + SR + KK RA
Subjt:  SPVKTVSAV-----KSVTKTAIVNKTVQSSP--VSSNKTTKVDSKVITKLSKKRKSKNESSEDESDDDIIISRSIKKKPRA

AT5G11600.1 unknown protein6.4e-1640.51Show/hide
Query:  KSRSSKEELDSLK-KKRKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQLPHSEMAQ--MMESGLLSKEEAKKVFEKKQKKAPLQKLSSPVKTVSAVKSV
        K++++   +  +K K ++EKKVY L GQK DPPEER+PLRIFYE+L KQ+P SEMA+  +ME G+LS E+AK+ FEKKQ+K    ++ +P K+     S 
Subjt:  KSRSSKEELDSLK-KKRKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQLPHSEMAQ--MMESGLLSKEEAKKVFEKKQKKAPLQKLSSPVKTVSAVKSV

Query:  TKTAIVNKTVQSSPVSSNKTTKVDSKVITKLSKKRKSKNESSEDESDDDIIISRSIKK
               K   S   S++K   +D+        ++K K    +D+ DDD I+S   +K
Subjt:  TKTAIVNKTVQSSPVSSNKTTKVDSKVITKLSKKRKSKNESSEDESDDDIIISRSIKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGACCCAATGAGCACAACTCAAGAACCTTGCAACTCCCAAAGCAAACACATAATGTGCGGTAAACGGCTCAACAATCTAGGAGAGAGAACAAACAGAGAAAACAA
GTTATTGAAAGCTTTTGAAAAGTATCAGGGAGGGAAACAACAGGATAAACTTGAAATACCAAAGATACCAGTATCACTTCAGCATGGCAACCTGCAAAATCCCTTGTTGG
ATGGATCACAAAAGATAGCAGAGCACAGGAACAACAGAGTGTCTGGTTTACCTTTCAAGAACACTCAAACAAGCAGCAAGATCTCCCACATCATAAAGTTTCCAAAAAGG
TCGAACAATATTAAGAAGAAGGTCAATGCTAAACCAACTGCAGAACAGCTGGTCCAGGAATTAAGACCAAGAAATGGCCACACCTGCACAAGTCCTCTCCTTCATGAGTT
TATAAATGAGGACGGAGATTCCGATGGCGTGAACAGCCTCGGCAGCGACGAAAAGATTGTCGTGGTCATGAACCACAAAGCGCAAGAAAACAAGAGCAGCCATTCCAGAT
ACGACGGCCAGAAAGGCCTTGATCTTGGGGGGTTGCCGGCGAACCCATGTGGCGACGGCCTGAATTGGCCTCCTGGTTGCCTTCATTTCGGGGTTCTCTCTCTTTTCTTC
AGTCTCGTGTGTGTGTGTGTGTGTGTCTCTCGGCGCGCAGAGGGTAGAAGAGTCAGATCCGGGAAGAAGACCCGTGGATCCATCTCATGCCCCATCAAGATCCCAATGCT
AAATCTTCCCATTCATTTCGTGTCGACACGTGAAAGCAATTTCTCCCGTTCATTCAACGGTCACTTTATTCACCGTCGAACACTCTCTCTTGTTCGCTTGTTTACTGTTC
TGTGGATCGGAATGGGGAAGAGCGATAGATCACAAGCTTCGATTGAACGGCGAAACTGGGGCAAAATCTTCAATGGTCTAACACAAATGCTACGTACGCAACAAAATCAG
CTCGAAACACTCGTCAAGGAGCGAAAACTTCTCGAAGATCGCGTTAAAATGCAGCACGAACGATGGTTCGCTGATATTCGTCTTTACGAGGACCATATCTCTCAGATGAA
GGATGATTTGTTATTGCAAGATATGGAACGCTCACTTCAAACCTCGAAATCAGATCTGCTAACTGGAATGAAGCAGACGGAGTTCTATCTCTGCCGACTGAAAATAGGTT
TTTGTTTATTTTCCCCTTTTAATCTTTTGAAGTTACGAATAGTCAAAGCATTTGGTAGGATAAAATTCATGATACCTCATAGGCGCAATCTCACTTCCCCTGGTACCTTA
AGTCAAATTTCGAGGAATGTAAATCGAAATACGACACTAAGAAGTAAGTTCAAGGACGCTTGTTTCTACACTAGCAACGAAGTCAAACATTCAGAAGCAGAGTTGGAAGA
TTTCAAATCTTTCTTTGACGATCTTATCTCTCATAGAAACTCCAATCCACAAGAAGCATCTTTGAGAAGTGCAGCAGAACCAGCTGAGGCAAATGGTGGAAGAGAAAGTG
TCTTGTCCGCATGTGGAAATATAGATGAAGCCAGACGTTCTAAGGCATTGGAGGGTGAAGTAAGGAGGTTGAGGTGCGAGCATGAAAAACTTGCCTCAGAAAAGAGTTTG
GAGGTGTCTGCACTGGTGGCTGAGAAGAAATTTGTATGGAATCAGTATAATGTTATGGAAGATGATTACTCAAGTAAATTGAAGAGTAAGCAGTCAGAGCTTGAACGTGC
ACACCTAAAGGTAGAGAAACTTCTAGCCACATTGGAACAACTACAAAGCTCAAACAATGAGAAGGATGGTGTTATTGCAACGTTAAGAAACCAAGTGGGGAAGATGGAAA
CTGACTCATGTAAATTAAAAGACGAAATTTCCAGACTTTCACACAATTTAGAAGTGCAAAGGAAGTCTATGAATGCAAGTGCCACACCTGTGCTGAACCCATGCAAGGCA
GGAACTAGGCCATCTAGTTTGGGAGGCAAAAATGGCACGAAGAGCAGAAGTAATGTCATTGTCAACAAAGACACATCTTCTGCACAACCTTCTCATTCGGGAAACCAAAA
GAAGAGAGGCGCTGATGATATTTCAGATCCAGGGACTCCAAGGTTGTTTACCTCTAGTTTCAAGGTCCCTAAACTGAAGAACGAAATCAATTTGACGGTTCCACAGCAAT
ATCAGCAGCAGAAACAGCAACACCGCGGCACCCCAAGAAGAACGACGACGCCGAGAAATGCGAGCAGCGGCATAGGTGGGAGAGACATGGAAGCCATTGTTCTCTCAGCC
CATCCATACTTCCACTCGGGACCTCTTTCCTCCACCACCCAACTCATGCAATGGACACATGGAGGCCATGCAATGGACTTACAAATCCCAGAGGATGGTCATATAATGTC
TTCTGAGGATCAGAAGCCGATAAAGAAAGCCAAAGTGGAGCTAGACGAATCGGATGACGGAATGAGCCTTGGTGCCCTTTTACAAGAAAAGAGGAAGAAACTCTTAAATG
TGGGTTCTAAACTTTTCTCAAAGCCGAAGAAGGAAGAACTTCAGGGAGTAGATGGGTTGGGAAAATCTCCCAAAATGGATTCTGGGTCTGCCTCCAAGGGCACCAAGGTT
AAGAAAGAAGAGCGTTTCAACTCCTTCGGTGACGATTTTGACGAAAAGCCTGTCAAAAAGAGCTCTGCTGCGAAACGTGATATGGAACTGAAGAAGAAGAAGAAAGTGAA
GGAGGAGGAGAAGAGCAGGAGCTCCAAGGAGGAGTTGGATAGCCTGAAAAAAAAGAGAAAGGAAAAGAAGGTATATGATTTGCCTGGTCAGAAGCGAGATCCTCCAGAAG
AGAGAGACCCCTTGAGAATTTTCTATGAAACGCTCCACAAGCAACTTCCCCACAGTGAGATGGCACAGATGATGGAGTCTGGTTTGTTATCCAAAGAAGAAGCTAAGAAA
GTTTTTGAAAAGAAGCAGAAGAAGGCTCCATTGCAAAAGTTGAGCTCCCCAGTGAAGACTGTGAGTGCTGTAAAGAGCGTCACAAAGACCGCCATTGTTAACAAAACTGT
CCAATCTTCTCCAGTTTCTTCAAATAAAACGACAAAAGTCGACTCCAAAGTCATTACGAAACTGTCGAAGAAGCGAAAGTCCAAAAATGAAAGTTCTGAAGATGAATCAG
ACGACGACATTATAATCAGTAGAAGCATAAAAAAGAAGCCAAGAGCAGCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGACCCAATGAGCACAACTCAAGAACCTTGCAACTCCCAAAGCAAACACATAATGTGCGGTAAACGGCTCAACAATCTAGGAGAGAGAACAAACAGAGAAAACAA
GTTATTGAAAGCTTTTGAAAAGTATCAGGGAGGGAAACAACAGGATAAACTTGAAATACCAAAGATACCAGTATCACTTCAGCATGGCAACCTGCAAAATCCCTTGTTGG
ATGGATCACAAAAGATAGCAGAGCACAGGAACAACAGAGTGTCTGGTTTACCTTTCAAGAACACTCAAACAAGCAGCAAGATCTCCCACATCATAAAGTTTCCAAAAAGG
TCGAACAATATTAAGAAGAAGGTCAATGCTAAACCAACTGCAGAACAGCTGGTCCAGGAATTAAGACCAAGAAATGGCCACACCTGCACAAGTCCTCTCCTTCATGAGTT
TATAAATGAGGACGGAGATTCCGATGGCGTGAACAGCCTCGGCAGCGACGAAAAGATTGTCGTGGTCATGAACCACAAAGCGCAAGAAAACAAGAGCAGCCATTCCAGAT
ACGACGGCCAGAAAGGCCTTGATCTTGGGGGGTTGCCGGCGAACCCATGTGGCGACGGCCTGAATTGGCCTCCTGGTTGCCTTCATTTCGGGGTTCTCTCTCTTTTCTTC
AGTCTCGTGTGTGTGTGTGTGTGTGTCTCTCGGCGCGCAGAGGGTAGAAGAGTCAGATCCGGGAAGAAGACCCGTGGATCCATCTCATGCCCCATCAAGATCCCAATGCT
AAATCTTCCCATTCATTTCGTGTCGACACGTGAAAGCAATTTCTCCCGTTCATTCAACGGTCACTTTATTCACCGTCGAACACTCTCTCTTGTTCGCTTGTTTACTGTTC
TGTGGATCGGAATGGGGAAGAGCGATAGATCACAAGCTTCGATTGAACGGCGAAACTGGGGCAAAATCTTCAATGGTCTAACACAAATGCTACGTACGCAACAAAATCAG
CTCGAAACACTCGTCAAGGAGCGAAAACTTCTCGAAGATCGCGTTAAAATGCAGCACGAACGATGGTTCGCTGATATTCGTCTTTACGAGGACCATATCTCTCAGATGAA
GGATGATTTGTTATTGCAAGATATGGAACGCTCACTTCAAACCTCGAAATCAGATCTGCTAACTGGAATGAAGCAGACGGAGTTCTATCTCTGCCGACTGAAAATAGGTT
TTTGTTTATTTTCCCCTTTTAATCTTTTGAAGTTACGAATAGTCAAAGCATTTGGTAGGATAAAATTCATGATACCTCATAGGCGCAATCTCACTTCCCCTGGTACCTTA
AGTCAAATTTCGAGGAATGTAAATCGAAATACGACACTAAGAAGTAAGTTCAAGGACGCTTGTTTCTACACTAGCAACGAAGTCAAACATTCAGAAGCAGAGTTGGAAGA
TTTCAAATCTTTCTTTGACGATCTTATCTCTCATAGAAACTCCAATCCACAAGAAGCATCTTTGAGAAGTGCAGCAGAACCAGCTGAGGCAAATGGTGGAAGAGAAAGTG
TCTTGTCCGCATGTGGAAATATAGATGAAGCCAGACGTTCTAAGGCATTGGAGGGTGAAGTAAGGAGGTTGAGGTGCGAGCATGAAAAACTTGCCTCAGAAAAGAGTTTG
GAGGTGTCTGCACTGGTGGCTGAGAAGAAATTTGTATGGAATCAGTATAATGTTATGGAAGATGATTACTCAAGTAAATTGAAGAGTAAGCAGTCAGAGCTTGAACGTGC
ACACCTAAAGGTAGAGAAACTTCTAGCCACATTGGAACAACTACAAAGCTCAAACAATGAGAAGGATGGTGTTATTGCAACGTTAAGAAACCAAGTGGGGAAGATGGAAA
CTGACTCATGTAAATTAAAAGACGAAATTTCCAGACTTTCACACAATTTAGAAGTGCAAAGGAAGTCTATGAATGCAAGTGCCACACCTGTGCTGAACCCATGCAAGGCA
GGAACTAGGCCATCTAGTTTGGGAGGCAAAAATGGCACGAAGAGCAGAAGTAATGTCATTGTCAACAAAGACACATCTTCTGCACAACCTTCTCATTCGGGAAACCAAAA
GAAGAGAGGCGCTGATGATATTTCAGATCCAGGGACTCCAAGGTTGTTTACCTCTAGTTTCAAGGTCCCTAAACTGAAGAACGAAATCAATTTGACGGTTCCACAGCAAT
ATCAGCAGCAGAAACAGCAACACCGCGGCACCCCAAGAAGAACGACGACGCCGAGAAATGCGAGCAGCGGCATAGGTGGGAGAGACATGGAAGCCATTGTTCTCTCAGCC
CATCCATACTTCCACTCGGGACCTCTTTCCTCCACCACCCAACTCATGCAATGGACACATGGAGGCCATGCAATGGACTTACAAATCCCAGAGGATGGTCATATAATGTC
TTCTGAGGATCAGAAGCCGATAAAGAAAGCCAAAGTGGAGCTAGACGAATCGGATGACGGAATGAGCCTTGGTGCCCTTTTACAAGAAAAGAGGAAGAAACTCTTAAATG
TGGGTTCTAAACTTTTCTCAAAGCCGAAGAAGGAAGAACTTCAGGGAGTAGATGGGTTGGGAAAATCTCCCAAAATGGATTCTGGGTCTGCCTCCAAGGGCACCAAGGTT
AAGAAAGAAGAGCGTTTCAACTCCTTCGGTGACGATTTTGACGAAAAGCCTGTCAAAAAGAGCTCTGCTGCGAAACGTGATATGGAACTGAAGAAGAAGAAGAAAGTGAA
GGAGGAGGAGAAGAGCAGGAGCTCCAAGGAGGAGTTGGATAGCCTGAAAAAAAAGAGAAAGGAAAAGAAGGTATATGATTTGCCTGGTCAGAAGCGAGATCCTCCAGAAG
AGAGAGACCCCTTGAGAATTTTCTATGAAACGCTCCACAAGCAACTTCCCCACAGTGAGATGGCACAGATGATGGAGTCTGGTTTGTTATCCAAAGAAGAAGCTAAGAAA
GTTTTTGAAAAGAAGCAGAAGAAGGCTCCATTGCAAAAGTTGAGCTCCCCAGTGAAGACTGTGAGTGCTGTAAAGAGCGTCACAAAGACCGCCATTGTTAACAAAACTGT
CCAATCTTCTCCAGTTTCTTCAAATAAAACGACAAAAGTCGACTCCAAAGTCATTACGAAACTGTCGAAGAAGCGAAAGTCCAAAAATGAAAGTTCTGAAGATGAATCAG
ACGACGACATTATAATCAGTAGAAGCATAAAAAAGAAGCCAAGAGCAGCCTAATGCATGTGTAAATGGACAATCCCTTCTATTCAGAAACTGATTTTTAATTTCTGATTA
GTTGTTCCAAAACATTGCATCAACTTTTTTAATGATGGTGACACTCATTTGGTCAATTTTCTCCCACTATATTCATATATGATATATTCGTGCCTATTTACAACTACAGT
TCTTGTGCCTGATACTAGGCTTCATGATTTGCTTGTGGGAGCAATTAGAAATAGAAGACTTTTGATTACAA
Protein sequenceShow/hide protein sequence
MKDPMSTTQEPCNSQSKHIMCGKRLNNLGERTNRENKLLKAFEKYQGGKQQDKLEIPKIPVSLQHGNLQNPLLDGSQKIAEHRNNRVSGLPFKNTQTSSKISHIIKFPKR
SNNIKKKVNAKPTAEQLVQELRPRNGHTCTSPLLHEFINEDGDSDGVNSLGSDEKIVVVMNHKAQENKSSHSRYDGQKGLDLGGLPANPCGDGLNWPPGCLHFGVLSLFF
SLVCVCVCVSRRAEGRRVRSGKKTRGSISCPIKIPMLNLPIHFVSTRESNFSRSFNGHFIHRRTLSLVRLFTVLWIGMGKSDRSQASIERRNWGKIFNGLTQMLRTQQNQ
LETLVKERKLLEDRVKMQHERWFADIRLYEDHISQMKDDLLLQDMERSLQTSKSDLLTGMKQTEFYLCRLKIGFCLFSPFNLLKLRIVKAFGRIKFMIPHRRNLTSPGTL
SQISRNVNRNTTLRSKFKDACFYTSNEVKHSEAELEDFKSFFDDLISHRNSNPQEASLRSAAEPAEANGGRESVLSACGNIDEARRSKALEGEVRRLRCEHEKLASEKSL
EVSALVAEKKFVWNQYNVMEDDYSSKLKSKQSELERAHLKVEKLLATLEQLQSSNNEKDGVIATLRNQVGKMETDSCKLKDEISRLSHNLEVQRKSMNASATPVLNPCKA
GTRPSSLGGKNGTKSRSNVIVNKDTSSAQPSHSGNQKKRGADDISDPGTPRLFTSSFKVPKLKNEINLTVPQQYQQQKQQHRGTPRRTTTPRNASSGIGGRDMEAIVLSA
HPYFHSGPLSSTTQLMQWTHGGHAMDLQIPEDGHIMSSEDQKPIKKAKVELDESDDGMSLGALLQEKRKKLLNVGSKLFSKPKKEELQGVDGLGKSPKMDSGSASKGTKV
KKEERFNSFGDDFDEKPVKKSSAAKRDMELKKKKKVKEEEKSRSSKEELDSLKKKRKEKKVYDLPGQKRDPPEERDPLRIFYETLHKQLPHSEMAQMMESGLLSKEEAKK
VFEKKQKKAPLQKLSSPVKTVSAVKSVTKTAIVNKTVQSSPVSSNKTTKVDSKVITKLSKKRKSKNESSEDESDDDIIISRSIKKKPRAA