CuGenDBv2

CaUC11G202140 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC11G202140
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Unknown protein
Genome location: Ciama_Chr11:1734017..1736684
RNA-Seq Expression: CaUC11G202140
Synteny: CaUC11G202140
Gene Ontology terms: NA
InterPro domains: NA
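The genome location string can be split into a sequence name and coordinates. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location):
    """Split 'seqid:start..end' into (seqid, start, end).

    Hypothetical helper; assumes the 'name:start..end' layout shown in
    the record above.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("Ciama_Chr11:1734017..1736684")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that coordinate assumption the gene spans 2668 bp on Ciama_Chr11.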


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_038899317.1 uncharacterized protein LOC120086655 isoform X1 [Benincasa hispida] (e value: 2.2e-243; % identity: 75.81)
Query:  MNPYSENILTEEVLYLHSLWRQGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQPVA
        M+PYSE  LTEEVL+LH+LWR+GPPRNPKP HNHSSTVVAAAANRNPSNKRP DPKNR NKKKKPR EP Q SGPEWPCPEPVQ QPSTSSGWP I+PVA
Subjt:  MNPYSENILTEEVLYLHSLWRQGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQPVA

Query:  TLAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK
        T A  PVSSEERAN AALQLQYKGS+ACRGFFARNADSGSDEE EEEE     +GEMMESEEYKFFLK+FVENDELRGYYEKN ESGLFCCLVCGGM ++
Subjt:  TLAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK

Query:  KSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEH-------------DDIDKKNE
        K GK+FKNCVGLVQHSISISRTKKK+AHRAFGQVVCRVFGWDI+RLPTIVLKGEPL RSLA+SG+LKVQPEENHVAKEH             DDI+KKNE
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEH-------------DDIDKKNE

Query:  VVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLP-----VSESILKACKEFFAAFFTSMSDDDVSENN
        VV +D  +QKLEEE+ AEDPTSN+KD+ SG+NDDACK NDVKLQAENTDNS+ GM ESN EMDNLP     V ESILKACKEF AAFFTSMSD+DVSENN
Subjt:  VVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLP-----VSESILKACKEFFAAFFTSMSDDDVSENN

Query:  LIDRDRVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLV
        LID + VEE +EFKFFLKLF ENESLRRYYEN YDDGEFFCLAC GAGKKML SFKTCGRLLQHTTSLGK+K+ KKPVQKPHIAKMLKMKM+AHRA S V
Subjt:  LIDRDRVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLV

Query:  ICKVLGWDIEKLPAVVLKGKALGRSLTKSDVSK--DESVGNAIDNTTEADVP--------------------VEDDY----------------------G
        ICKVLGWDIEKLPAVVLKG+ LGRSLTK+D +K  DESVGN++DNT E D                      VEDD                       G
Subjt:  ICKVLGWDIEKLPAVVLKGKALGRSLTKSDVSK--DESVGNAIDNTTEADVP--------------------VEDDY----------------------G

Query:  VKETDSMKVDSNGEVT
        VKETDSMKVDSNGE T
Subjt:  VKETDSMKVDSNGEVT

XP_038899319.1 uncharacterized protein LOC120086655 isoform X2 [Benincasa hispida] (e value: 6.7e-245; % identity: 76.06)
Query:  MNPYSENILTEEVLYLHSLWRQGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQPVA
        M+PYSE  LTEEVL+LH+LWR+GPPRNPKP HNHSSTVVAAAANRNPSNKRP DPKNR NKKKKPR EP Q SGPEWPCPEPVQ QPSTSSGWP I+PVA
Subjt:  MNPYSENILTEEVLYLHSLWRQGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQPVA

Query:  TLAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK
        T A  PVSSEERAN AALQLQYKGS+ACRGFFARNADSGSDEE EEEE     +GEMMESEEYKFFLK+FVENDELRGYYEKN ESGLFCCLVCGGM ++
Subjt:  TLAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK

Query:  KSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEH-------------DDIDKKNE
        K GK+FKNCVGLVQHSISISRTKKK+AHRAFGQVVCRVFGWDI+RLPTIVLKGEPL RSLA+SG+LKVQPEENHVAKEH             DDI+KKNE
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEH-------------DDIDKKNE

Query:  VVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLP-----VSESILKACKEFFAAFFTSMSDDDVSENN
        VV +D  +QKLEEE+ AEDPTSN+KD+ SG+NDDACK NDVKLQAENTDNS+ GM ESN EMDNLP     V ESILKACKEF AAFFTSMSD+DVSENN
Subjt:  VVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLP-----VSESILKACKEFFAAFFTSMSDDDVSENN

Query:  LIDRDRVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLV
        LID + VEE +EFKFFLKLF ENESLRRYYEN YDDGEFFCLAC GAGKKML SFKTCGRLLQHTTSLGK+K+ KKPVQKPHIAKMLKMKM+AHRA S V
Subjt:  LIDRDRVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLV

Query:  ICKVLGWDIEKLPAVVLKGKALGRSLTKSDVSKDESVGNAIDNTTEADVP--------------------VEDDY----------------------GVK
        ICKVLGWDIEKLPAVVLKG+ LGRSLTK+D +KDESVGN++DNT E D                      VEDD                       GVK
Subjt:  ICKVLGWDIEKLPAVVLKGKALGRSLTKSDVSKDESVGNAIDNTTEADVP--------------------VEDDY----------------------GVK

Query:  ETDSMKVDSNGEVT
        ETDSMKVDSNGE T
Subjt:  ETDSMKVDSNGEVT

XP_038899320.1 uncharacterized protein LOC120086655 isoform X3 [Benincasa hispida] (e value: 7.7e-241; % identity: 75.49)
Query:  MNPYSENILTEEVLYLHSLWRQGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQPVA
        M+PYSE  LTEEVL+LH+LWR+GPPRNPKP HNHSSTVVAAAANRNPSNKRP DPKNR NKKKKPR EP Q SGPEWPCPEPVQ QPSTSSGWP I+PVA
Subjt:  MNPYSENILTEEVLYLHSLWRQGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQPVA

Query:  TLAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK
        T A  PVSSEERAN AALQLQYKGS+ACRGFFARNADSGSDEE EEEE     +GEMMESEEYKFFLK+FVENDELRGYYEKN ESGLFCCLVCGGM ++
Subjt:  TLAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK

Query:  KSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEH-------------DDIDKKNE
        K GK+FKNCVGLVQHSISISRTKKK+AHRAFGQVVCRVFGWDI+RLPTIVLKGEPL RSLA+SG+LK  PEENHVAKEH             DDI+KKNE
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEH-------------DDIDKKNE

Query:  VVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLP-----VSESILKACKEFFAAFFTSMSDDDVSENN
        VV +D  +QKLEEE+ AEDPTSN+KD+ SG+NDDACK NDVKLQAENTDNS+ GM ESN EMDNLP     V ESILKACKEF AAFFTSMSD+DVSENN
Subjt:  VVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLP-----VSESILKACKEFFAAFFTSMSDDDVSENN

Query:  LIDRDRVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLV
        LID + VEE +EFKFFLKLF ENESLRRYYEN YDDGEFFCLAC GAGKKML SFKTCGRLLQHTTSLGK+K+ KKPVQKPHIAKMLKMKM+AHRA S V
Subjt:  LIDRDRVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLV

Query:  ICKVLGWDIEKLPAVVLKGKALGRSLTKSDVSK--DESVGNAIDNTTEADVP--------------------VEDDY----------------------G
        ICKVLGWDIEKLPAVVLKG+ LGRSLTK+D +K  DESVGN++DNT E D                      VEDD                       G
Subjt:  ICKVLGWDIEKLPAVVLKGKALGRSLTKSDVSK--DESVGNAIDNTTEADVP--------------------VEDDY----------------------G

Query:  VKETDSMKVDSNGEVT
        VKETDSMKVDSNGE T
Subjt:  VKETDSMKVDSNGEVT

XP_038899321.1 uncharacterized protein LOC120086655 isoform X4 [Benincasa hispida] (e value: 3.0e-245; % identity: 76.43)
Query:  MNPYSENILTEEVLYLHSLWRQGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQPVA
        M+PYSE  LTEEVL+LH+LWR+GPPRNPKP HNHSSTVVAAAANRNPSNKRP DPKNR NKKKKPR EP Q SGPEWPCPEPVQ QPSTSSGWP I+PVA
Subjt:  MNPYSENILTEEVLYLHSLWRQGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQPVA

Query:  TLAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK
        T A  PVSSEERAN AALQLQYKGS+ACRGFFARNADSGSDEE EEEE     +GEMMESEEYKFFLK+FVENDELRGYYEKN ESGLFCCLVCGGM ++
Subjt:  TLAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK

Query:  KSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEH-------------DDIDKKNE
        K GK+FKNCVGLVQHSISISRTKKK+AHRAFGQVVCRVFGWDI+RLPTIVLKGEPL RSLA+SG+LKVQPEENHVAKEH             DDI+KKNE
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEH-------------DDIDKKNE

Query:  VVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDVSENNLIDRD
        VV +D  +QKLEEE+ AEDPTSN+KD+ SG+NDDACK NDVKLQAENTDNS+ GM ESN EMDNLPV ESILKACKEF AAFFTSMSD+DVSENNLID +
Subjt:  VVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDVSENNLIDRD

Query:  RVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVL
         VEE +EFKFFLKLF ENESLRRYYEN YDDGEFFCLAC GAGKKML SFKTCGRLLQHTTSLGK+K+ KKPVQKPHIAKMLKMKM+AHRA S VICKVL
Subjt:  RVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVL

Query:  GWDIEKLPAVVLKGKALGRSLTKSDVSK--DESVGNAIDNTTEADVP--------------------VEDDY----------------------GVKETD
        GWDIEKLPAVVLKG+ LGRSLTK+D +K  DESVGN++DNT E D                      VEDD                       GVKETD
Subjt:  GWDIEKLPAVVLKGKALGRSLTKSDVSK--DESVGNAIDNTTEADVP--------------------VEDDY----------------------GVKETD

Query:  SMKVDSNGEVT
        SMKVDSNGE T
Subjt:  SMKVDSNGEVT

XP_038899322.1 uncharacterized protein LOC120086655 isoform X5 [Benincasa hispida] (e value: 3.2e-223; % identity: 71.52)
Query:  MNPYSENILTEEVLYLHSLWRQGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQPVA
        M+PYSE  LTEEVL+LH+LWR+GPPRNPKP HNHSSTVVAAAANRNPSNKRP DPKNR NKKKKPR EP Q SGPEWPCPEPVQ QPSTSSGWP I+PVA
Subjt:  MNPYSENILTEEVLYLHSLWRQGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQPVA

Query:  TLAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK
        T A  PVSSEERAN AALQLQYKGS+ACRGFFARNADSGSDEE EEEE     +GEMMESEEYKFFLK+FVENDELRGYYEKN ESGLFCCLVCGGM ++
Subjt:  TLAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRK

Query:  KSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEH-------------DDIDKKNE
        K GK+FKNCVGLVQHSISISRTKKK+AHRAFGQVVCRVFGWDI+RLPTIVLKGEPL RSLA+SG+LKVQPEENHVAKEH             DDI+KKNE
Subjt:  KSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEH-------------DDIDKKNE

Query:  VVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDVSENNLIDRD
        VV +D  +QKLEEE+ AEDPTSN+KD+ SG+                                   V ESILKACKEF AAFFTSMSD+DVSENNLID +
Subjt:  VVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDVSENNLIDRD

Query:  RVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVL
         VEE +EFKFFLKLF ENESLRRYYEN YDDGEFFCLAC GAGKKML SFKTCGRLLQHTTSLGK+K+ KKPVQKPHIAKMLKMKM+AHRA S VICKVL
Subjt:  RVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVL

Query:  GWDIEKLPAVVLKGKALGRSLTKSDVSK--DESVGNAIDNTTEADVP--------------------VEDDY----------------------GVKETD
        GWDIEKLPAVVLKG+ LGRSLTK+D +K  DESVGN++DNT E D                      VEDD                       GVKETD
Subjt:  GWDIEKLPAVVLKGKALGRSLTKSDVSK--DESVGNAIDNTTEADVP--------------------VEDDY----------------------GVKETD

Query:  SMKVDSNGEVT
        SMKVDSNGE T
Subjt:  SMKVDSNGEVT

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S3CJZ0 uncharacterized protein LOC103501816 isoform X1 (e value: 2.3e-198; % identity: 73.2)
Query:  MNPYSENILTEEVLYLHSLWRQGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKN---KKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQ
        M+PYS+  LT+EVLYLHSLW +GPPRNPKPTH+HSST   A A+ NPSNKRP DP  RKN   KKKKPR +PPQ SGPEWPCPEPVQ QPSTSSGWP IQ
Subjt:  MNPYSENILTEEVLYLHSLWRQGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKN---KKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQ

Query:  PVATLAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGM
        PVAT A Q VSSEER N AALQLQYKGS+ACR FFARNADSGSDEEEEEEEE   +DGEMMES+EY FFLKMFVEN+ELR YYEKN ESGLFCCLVC GM
Subjt:  PVATLAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGM

Query:  GRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEHDDIDKKNEV--VSVDENE
        G+KK GK+FKNC+ LVQHSISIS TKKK+AHRAFG VV RVFGWDI+RLPTIVLKGEPL RSLANSGDLKVQPEE HV       D KNEV  VSV+E+E
Subjt:  GRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEHDDIDKKNEV--VSVDENE

Query:  QKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDVSENNLIDRDRVEECKEF
        QKLEE K AEDPTSN+KD+ SGENDDA KD DVKLQ EN DNSISGMGESN EMDNL V  +IL+ACKEF AAFF SM+DDDVSE      D  EE +EF
Subjt:  QKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDVSENNLIDRDRVEECKEF

Query:  KFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVLGWDIEKLP
        KFFLKLF ENE+LRRYYEN Y DGEF CLACE AG+K +  FKTC RLLQH+T LGK+ + +K  QKP   K+LKM MLAHRAY+ V+CKVLG DI+ LP
Subjt:  KFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVLGWDIEKLP

Query:  AVVLKGKALGRSLTKSDVSKDESVGNAIDNTTEADVPVEDD
        A+VL G+ALG SLTKSDVSK +   +    ++ AD  VEDD
Subjt:  AVVLKGKALGRSLTKSDVSKDESVGNAIDNTTEADVPVEDD

A0A1S3CJZ1 uncharacterized protein LOC103501816 isoform X3 (e value: 8.1e-196; % identity: 72.83)
Query:  MNPYSENILTEEVLYLHSLWRQGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKN---KKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQ
        M+PYS+  LT+EVLYLHSLW +GPPRNPKPTH+HSST   A A+ NPSNKRP DP  RKN   KKKKPR +PPQ SGPEWPCPEPVQ QPSTSSGWP IQ
Subjt:  MNPYSENILTEEVLYLHSLWRQGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKN---KKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQ

Query:  PVATLAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGM
        PVAT A Q VSSEER N AALQLQYKGS+ACR FFARNADSGSDEEEEEEEE   +DGEMMES+EY FFLKMFVEN+ELR YYEKN ESGLFCCLVC GM
Subjt:  PVATLAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGM

Query:  GRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEHDDIDKKNEV--VSVDENE
        G+KK GK+FKNC+ LVQHSISIS TKKK+AHRAFG VV RVFGWDI+RLPTIVLKGEPL RSLANSGDLK  PEE HV       D KNEV  VSV+E+E
Subjt:  GRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEHDDIDKKNEV--VSVDENE

Query:  QKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDVSENNLIDRDRVEECKEF
        QKLEE K AEDPTSN+KD+ SGENDDA KD DVKLQ EN DNSISGMGESN EMDNL V  +IL+ACKEF AAFF SM+DDDVSE      D  EE +EF
Subjt:  QKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDVSENNLIDRDRVEECKEF

Query:  KFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVLGWDIEKLP
        KFFLKLF ENE+LRRYYEN Y DGEF CLACE AG+K +  FKTC RLLQH+T LGK+ + +K  QKP   K+LKM MLAHRAY+ V+CKVLG DI+ LP
Subjt:  KFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVLGWDIEKLP

Query:  AVVLKGKALGRSLTKSDVSKDESVGNAIDNTTEADVPVEDD
        A+VL G+ALG SLTKSDVSK +   +    ++ AD  VEDD
Subjt:  AVVLKGKALGRSLTKSDVSKDESVGNAIDNTTEADVPVEDD

A0A1S3CJZ2 uncharacterized protein LOC103501816 isoform X2 (e value: 1.3e-198; % identity: 73.57)
Query:  MNPYSENILTEEVLYLHSLWRQGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKN---KKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQ
        M+PYS+  LT+EVLYLHSLW +GPPRNPKPTH+HSST   A A+ NPSNKRP DP  RKN   KKKKPR +PPQ SGPEWPCPEPVQ QPSTSSGWP IQ
Subjt:  MNPYSENILTEEVLYLHSLWRQGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKN---KKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQ

Query:  PVATLAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGM
        PVAT A Q VSSEER N AALQLQYKGS+ACR FFARNADSGSDEEEEEEEE   +DGEMMES+EY FFLKMFVEN+ELR YYEKN ESGLFCCLVC GM
Subjt:  PVATLAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGM

Query:  GRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEHDDIDKKNEV--VSVDENE
        G+KK GK+FKNC+ LVQHSISIS TKKK+AHRAFG VV RVFGWDI+RLPTIVLKGEPL RSLANSGDLKVQPEE HV       D KNEV  VSV+E+E
Subjt:  GRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEHDDIDKKNEV--VSVDENE

Query:  QKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDVSENNLIDRDRVEECKEF
        QKLEE K AEDPTSN+KD+ SGENDDA KD DVKLQ EN DNSISGMGESN EMDNL V  +IL+ACKEF AAFF SM+DDDVSE      D  EE +EF
Subjt:  QKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDVSENNLIDRDRVEECKEF

Query:  KFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVLGWDIEKLP
        KFFLKLF ENE+LRRYYEN Y DGEF CLACE AG+K +  FKTC RLLQH+T LGK+ + +K  QKP   K+LKM MLAHRAY+ V+CKVLG DI+ LP
Subjt:  KFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVLGWDIEKLP

Query:  AVVLKGKALGRSLTKSDVSKDESVGNAIDNTTEADVPVEDD
        A+VL G+ALG SLTKSDVSKD+S  +    ++ AD  VEDD
Subjt:  AVVLKGKALGRSLTKSDVSKDESVGNAIDNTTEADVPVEDD

A0A5D3DXE1 Uncharacterized protein (e value: 2.5e-197; % identity: 75)
Query:  MNPYSENILTEEVLYLHSLWRQGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKN---KKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQ
        M+PYS+  LT+EVLYLHSLW +GPPRNPKPTH+HSST   A A+ NPSNKRP DP  RKN   KKKKPR +PPQ SGPEWPCPEPVQ QPSTSSGWP IQ
Subjt:  MNPYSENILTEEVLYLHSLWRQGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKN---KKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQ

Query:  PVATLAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGM
        PVAT A Q VSSEER N AALQLQYKGS+ACR FFARNADSGSDEEEEEEEE   +DGEMMES+EY FFLKMFVEN+ELR YYEKN ESGLFCCLVC GM
Subjt:  PVATLAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGM

Query:  GRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEHDDIDKKNEV--VSVDENE
        G+KK GK+FKNC+ LVQHSISIS TKKK+AHRAFG VV RVFGWDI+RLPTIVLKGEPL RSLANSGDLKVQPEE HV       D KNEV  VSV+E+E
Subjt:  GRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEHDDIDKKNEV--VSVDENE

Query:  QKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDVSENNLIDRDRVEECKEF
        QKLEE K AEDPTSN+KD+ SGENDDA KD DVKLQ EN DNSISGMGESN EMDNL V  +IL+ACKEF AAFF SM+DDDVSE      D  EE +EF
Subjt:  QKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDVSENNLIDRDRVEECKEF

Query:  KFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVLGWDIEKLP
        KFFLKLF ENE+LRRYYEN Y DGEF CLACE AG+K +  FKTC RLLQH+T LGK+ + +K  QKP   K+LKM MLAHRAY+ V+CKVLG DI+ LP
Subjt:  KFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVLGWDIEKLP

Query:  AVVLKGKALGRSLTKSDVSK
        A+VL G+ALG SLTKSDVSK
Subjt:  AVVLKGKALGRSLTKSDVSK

A0A6J1CJP3 uncharacterized protein LOC111012232 isoform X2 (e value: 9.0e-187; % identity: 66.61)
Query:  MNPYSENILTEEVLYLHSLWRQGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPK--NRKNKKKKPRPEP--PQHSGPEWPCPEPVQYQPSTSSGWPSI
        M+PY E  LTEEVL+LHSLWR+GPP+N K   NHS+  VA  ANR PSNKRP  P+    K KKKKPRP P  PQ SGPEWPCPEPVQ QPSTSSGWP+I
Subjt:  MNPYSENILTEEVLYLHSLWRQGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPK--NRKNKKKKPRPEP--PQHSGPEWPCPEPVQYQPSTSSGWPSI

Query:  QPVATLAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGG
        QP AT A QPVSSEERA  +ALQLQYK  +ACRGFFARNADSGS+ EEEEEEE E NDG + + EEYKFFLKMFVEN EL  YYEKN E G FCCLVCGG
Subjt:  QPVATLAPQPVSSEERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGG

Query:  MGRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEHD-DIDKKNEVVSVDENE
        MG+KKSGKRFK+CVGLVQHSISISRTKKK+AHRAFG V+CRV GWD++RLP IVLKGEPL RSLA+SG+ +VQPE+NHVAKE    +  +N+     +NE
Subjt:  MGRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEHD-DIDKKNEVVSVDENE

Query:  QKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDVSENNLIDRDRVEECKEF
        +KLEE+KAAEDP SNAK+ SSGEN + CK+NDV +Q ENTDNSI GMG    EM NLPV + I KACKEFFA F  S SD+      L D D +EE +EF
Subjt:  QKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDVSENNLIDRDRVEECKEF

Query:  KFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVLGWDIEKLP
        KFFLKLF EN+ LR YYE+ Y+DGEF CLACEGAGKK    FKTCGRLLQH+TSL K+++G+        AKMLKMK LAHRAYS  +CKVLGWD+E+LP
Subjt:  KFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVLGWDIEKLP

Query:  AVVLKGKALGRSLTKSDVSKDESVGNAIDNTTEADVPVEDDYGVKETDSMKVDS
        +VVLKG+ LGRSLTK  VSKDE +GN   N + +  P+E+  G  E   ++ D+
Subjt:  AVVLKGKALGRSLTKSDVSKDESVGNAIDNTTEADVPVEDDYGVKETDSMKVDS

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G78810.1 unknown protein (e value: 2.3e-54; % identity: 32.13)
Query:  MNPYSENILTEEVLYLHSLWRQGPP-RNPKPTHNHS-------------------------STVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSG
        MN Y +  L +EV+YLHSLW QGPP R P P+ N +                           V     +RNP+N     P+N  N  K+PRP+    SG
Subjt:  MNPYSENILTEEVLYLHSLWRQGPP-RNPKPTHNHS-------------------------STVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSG

Query:  PEWPCPEPVQYQPSTSSGWPSIQPVATLAPQPVSSEERANFAALQLQYKGSEACRGFFARNAD------SGSDEEE--EEEEEAEGNDGEMMESEEYKFF
         EWP  + V   PST SGWP  +P      +P+S+EE+   AA  LQ      CR FF R +       +G DE E  E +E+      E   S+E++F 
Subjt:  PEWPCPEPVQYQPSTSSGWPSIQPVATLAPQPVSSEERANFAALQLQYKGSEACRGFFARNAD------SGSDEEE--EEEEEAEGNDGEMMESEEYKFF

Query:  LKMFVENDELRGYYEKNSESGLFCCLVCGGMGRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDL
         ++F EN +L+ YYEKN+ +G F CLVCGG+G +KS ++FK+C+ L+QHS++I +T  K  HRA  QVVC V GWD+N                      
Subjt:  LKMFVENDELRGYYEKNSESGLFCCLVCGGMGRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDL

Query:  KVQPEENHVAKEHDDIDKKNEVVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFF
                           N VVS  ++ Q + E   A +P S++K                K Q  +         E + +   L + ++  +A K+ F
Subjt:  KVQPEENHVAKEHDDIDKKNEVVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFF

Query:  AAFFTSMSD--DDVSENNLIDRDRVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGA-GKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKP
            T  +D  ++  + NL         +E +   K+F EN  L+ YYE  Y+ G F CL C  A  KKML  FK C  ++QH T               
Subjt:  AAFFTSMSD--DDVSENNLIDRDRVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGA-GKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKP

Query:  HIAKMLKMKMLAHRAYSLVICKVLGWDIEKLPAVVLKGKAL--------GRSLTKSDVSK---DESVGNAIDNTTEADVPVE
           K+ KMK+ AH+ ++  +C++LGWD E LP  V+KG A             T S V +   ++  GN  DN  EA+  VE
Subjt:  HIAKMLKMKMLAHRAYSLVICKVLGWDIEKLPAVVLKGKAL--------GRSLTKSDVSK---DESVGNAIDNTTEADVPVE

AT1G78810.2 unknown protein (e value: 6.2e-55; % identity: 32.59)
Query:  MNPYSENILTEEVLYLHSLWRQGPP-RNPKPTHNHS-------------------------STVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSG
        MN Y +  L +EV+YLHSLW QGPP R P P+ N +                           V     +RNP+N     P+N  N  K+PRP+    SG
Subjt:  MNPYSENILTEEVLYLHSLWRQGPP-RNPKPTHNHS-------------------------STVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSG

Query:  PEWPCPEPVQYQPSTSSGWPSIQPVATLAPQPVSSEERANFAALQLQYKGSEACRGFFARNAD------SGSDEEE--EEEEEAEGNDGEMMESEEYKFF
         EWP  + V   PST SGWP  +P      +P+S+EE+   AA  LQ      CR FF R +       +G DE E  E +E+      E   S+E++F 
Subjt:  PEWPCPEPVQYQPSTSSGWPSIQPVATLAPQPVSSEERANFAALQLQYKGSEACRGFFARNAD------SGSDEEE--EEEEEAEGNDGEMMESEEYKFF

Query:  LKMFVENDELRGYYEKNSESGLFCCLVCGGMGRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDL
         ++F EN +L+ YYEKN+ +G F CLVCGG+G +KS ++FK+C+ L+QHS++I +T  K  HRA  QVVC V GWD+N                      
Subjt:  LKMFVENDELRGYYEKNSESGLFCCLVCGGMGRKKSGKRFKNCVGLVQHSISISRTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDL

Query:  KVQPEENHVAKEHDDIDKKNEVVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFF
                           N VVS  ++ Q + E   A +P S++K                K Q  +         E + +   L + ++  +A K+ F
Subjt:  KVQPEENHVAKEHDDIDKKNEVVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKLQAENTDNSISGMGESNTEMDNLPVSESILKACKEFF

Query:  AAFFTSMSD--DDVSENNLIDRDRVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGA-GKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKP
            T  +D  ++  + NL         +E +   K+F EN  L+ YYE  Y+ G F CL C  A  KKML  FK C  ++QH T               
Subjt:  AAFFTSMSD--DDVSENNLIDRDRVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGA-GKKMLNSFKTCGRLLQHTTSLGKSKMGKKPVQKP

Query:  HIAKMLKMKMLAHRAYSLVICKVLGWDIEKLPAVVLKGKA
           K+ KMK+ AH+ ++  +C++LGWD E LP  V+KG A
Subjt:  HIAKMLKMKMLAHRAYSLVICKVLGWDIEKLPAVVLKGKA


Sequences
CDS sequence
ATGAATCCCTACTCCGAGAATATACTCACAGAAGAGGTTCTCTATCTTCACTCTCTCTGGCGTCAAGGCCCGCCGAGGAACCCTAAACCCACTCACAACCATTCATCCAC
CGTCGTCGCCGCTGCCGCGAATCGGAACCCCTCCAACAAGAGACCTAGAGATCCAAAGAATCGAAAGAACAAGAAGAAAAAACCACGCCCCGAGCCACCGCAACACTCCG
GCCCCGAGTGGCCCTGTCCGGAGCCGGTTCAATATCAGCCCTCCACGTCATCTGGGTGGCCGTCAATTCAGCCTGTTGCCACTCTGGCGCCTCAGCCTGTGTCTTCTGAA
GAGCGAGCAAATTTTGCGGCGTTGCAATTGCAGTACAAGGGTTCCGAGGCTTGCCGGGGATTTTTCGCTAGAAATGCCGATTCGGGGAGCGACGAAGAGGAGGAGGAGGA
GGAGGAAGCTGAGGGTAATGATGGGGAAATGATGGAAAGTGAAGAATATAAATTCTTTTTGAAGATGTTTGTGGAGAATGACGAACTTAGGGGTTATTACGAGAAGAATT
CTGAAAGTGGGTTGTTTTGTTGCTTGGTTTGTGGTGGAATGGGGAGAAAGAAATCTGGGAAAAGGTTTAAGAACTGCGTTGGGCTTGTTCAACATTCGATTTCGATATCG
AGGACGAAGAAGAAGCAGGCTCATAGGGCTTTTGGGCAGGTCGTATGCAGGGTTTTTGGTTGGGATATTAATCGACTTCCGACGATTGTGTTGAAGGGCGAGCCCCTTGG
TCGATCATTAGCCAATTCTGGAGACTTGAAGGTTCAGCCAGAGGAAAATCATGTGGCTAAAGAGCATGATGACATTGATAAGAAGAATGAAGTGGTTTCAGTGGATGAGA
ATGAACAGAAATTGGAGGAAGAAAAGGCAGCTGAAGATCCTACTTCTAATGCTAAAGATGTGAGTTCTGGAGAGAATGATGATGCCTGCAAAGATAACGATGTCAAACTG
CAAGCAGAAAATACAGATAATTCAATATCAGGCATGGGAGAAAGCAATACAGAAATGGATAATTTGCCTGTGTCAGAGTCGATTTTGAAAGCATGTAAAGAATTTTTTGC
GGCCTTCTTTACATCTATGAGCGACGATGATGTTAGTGAAAACAACTTAATCGACAGAGATAGAGTTGAGGAATGCAAAGAGTTCAAATTCTTTTTAAAGTTGTTCATCG
AGAACGAAAGCTTGAGAAGATATTACGAGAACAAATATGATGATGGAGAATTTTTCTGTTTAGCTTGTGAAGGTGCAGGAAAGAAAATGTTAAATAGTTTTAAGACATGT
GGCCGCCTTCTCCAGCATACAACTTCTCTAGGGAAGAGCAAAATGGGGAAAAAACCGGTTCAGAAGCCTCACATTGCTAAAATGTTGAAAATGAAAATGCTGGCTCATAG
GGCATATAGTTTAGTTATATGTAAGGTTCTTGGTTGGGACATTGAAAAGCTTCCTGCAGTCGTGTTAAAAGGCAAAGCTCTTGGTCGTTCCTTAACGAAGTCAGACGTGT
CAAAGGACGAATCTGTTGGCAATGCAATTGATAATACGACAGAAGCGGATGTTCCTGTAGAAGACGACTATGGTGTCAAAGAAACTGATTCTATGAAGGTTGATAGCAAT
GGTGAAGTTACTTTGAAGGATGATGCCGTGGATGTGTAA
mRNA sequence
ATAATTAAACCAGTACCGAACTGGTTGAAATTTAAATTAGAAGTATAAATCCTACAACTAAATTACGAAGAAAATACGTAATTTTCGGGGAAACAAAACTCGTGAGAGTT
GAGTTGGGACTGGAAGTTGGGAACGGAGACGATGATACCAAACCTCCATTACATTCTCTCTTGATTCCGCCATTTTTCCACCAATGAATCCCTACTCCGAGAATATACTC
ACAGAAGAGGTTCTCTATCTTCACTCTCTCTGGCGTCAAGGCCCGCCGAGGAACCCTAAACCCACTCACAACCATTCATCCACCGTCGTCGCCGCTGCCGCGAATCGGAA
CCCCTCCAACAAGAGACCTAGAGATCCAAAGAATCGAAAGAACAAGAAGAAAAAACCACGCCCCGAGCCACCGCAACACTCCGGCCCCGAGTGGCCCTGTCCGGAGCCGG
TTCAATATCAGCCCTCCACGTCATCTGGGTGGCCGTCAATTCAGCCTGTTGCCACTCTGGCGCCTCAGCCTGTGTCTTCTGAAGAGCGAGCAAATTTTGCGGCGTTGCAA
TTGCAGTACAAGGGTTCCGAGGCTTGCCGGGGATTTTTCGCTAGAAATGCCGATTCGGGGAGCGACGAAGAGGAGGAGGAGGAGGAGGAAGCTGAGGGTAATGATGGGGA
AATGATGGAAAGTGAAGAATATAAATTCTTTTTGAAGATGTTTGTGGAGAATGACGAACTTAGGGGTTATTACGAGAAGAATTCTGAAAGTGGGTTGTTTTGTTGCTTGG
TTTGTGGTGGAATGGGGAGAAAGAAATCTGGGAAAAGGTTTAAGAACTGCGTTGGGCTTGTTCAACATTCGATTTCGATATCGAGGACGAAGAAGAAGCAGGCTCATAGG
GCTTTTGGGCAGGTCGTATGCAGGGTTTTTGGTTGGGATATTAATCGACTTCCGACGATTGTGTTGAAGGGCGAGCCCCTTGGTCGATCATTAGCCAATTCTGGAGACTT
GAAGGTTCAGCCAGAGGAAAATCATGTGGCTAAAGAGCATGATGACATTGATAAGAAGAATGAAGTGGTTTCAGTGGATGAGAATGAACAGAAATTGGAGGAAGAAAAGG
CAGCTGAAGATCCTACTTCTAATGCTAAAGATGTGAGTTCTGGAGAGAATGATGATGCCTGCAAAGATAACGATGTCAAACTGCAAGCAGAAAATACAGATAATTCAATA
TCAGGCATGGGAGAAAGCAATACAGAAATGGATAATTTGCCTGTGTCAGAGTCGATTTTGAAAGCATGTAAAGAATTTTTTGCGGCCTTCTTTACATCTATGAGCGACGA
TGATGTTAGTGAAAACAACTTAATCGACAGAGATAGAGTTGAGGAATGCAAAGAGTTCAAATTCTTTTTAAAGTTGTTCATCGAGAACGAAAGCTTGAGAAGATATTACG
AGAACAAATATGATGATGGAGAATTTTTCTGTTTAGCTTGTGAAGGTGCAGGAAAGAAAATGTTAAATAGTTTTAAGACATGTGGCCGCCTTCTCCAGCATACAACTTCT
CTAGGGAAGAGCAAAATGGGGAAAAAACCGGTTCAGAAGCCTCACATTGCTAAAATGTTGAAAATGAAAATGCTGGCTCATAGGGCATATAGTTTAGTTATATGTAAGGT
TCTTGGTTGGGACATTGAAAAGCTTCCTGCAGTCGTGTTAAAAGGCAAAGCTCTTGGTCGTTCCTTAACGAAGTCAGACGTGTCAAAGGACGAATCTGTTGGCAATGCAA
TTGATAATACGACAGAAGCGGATGTTCCTGTAGAAGACGACTATGGTGTCAAAGAAACTGATTCTATGAAGGTTGATAGCAATGGTGAAGTTACTTTGAAGGATGATGCC
GTGGATGTGTAA
Protein sequence
MNPYSENILTEEVLYLHSLWRQGPPRNPKPTHNHSSTVVAAAANRNPSNKRPRDPKNRKNKKKKPRPEPPQHSGPEWPCPEPVQYQPSTSSGWPSIQPVATLAPQPVSSE
ERANFAALQLQYKGSEACRGFFARNADSGSDEEEEEEEEAEGNDGEMMESEEYKFFLKMFVENDELRGYYEKNSESGLFCCLVCGGMGRKKSGKRFKNCVGLVQHSISIS
RTKKKQAHRAFGQVVCRVFGWDINRLPTIVLKGEPLGRSLANSGDLKVQPEENHVAKEHDDIDKKNEVVSVDENEQKLEEEKAAEDPTSNAKDVSSGENDDACKDNDVKL
QAENTDNSISGMGESNTEMDNLPVSESILKACKEFFAAFFTSMSDDDVSENNLIDRDRVEECKEFKFFLKLFIENESLRRYYENKYDDGEFFCLACEGAGKKMLNSFKTC
GRLLQHTTSLGKSKMGKKPVQKPHIAKMLKMKMLAHRAYSLVICKVLGWDIEKLPAVVLKGKALGRSLTKSDVSKDESVGNAIDNTTEADVPVEDDYGVKETDSMKVDSN
GEVTLKDDAVDV
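
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The sketch below is illustrative and not part of the database record: the function name `translate` is hypothetical, the standard nuclear codon table is assumed, and only the first 30 and last 39 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Hypothetical helper; assumes an in-frame sequence containing only
    A, C, G and T. Trailing bases that do not fill a codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and last 39 nt of the CDS sequence listed above.
cds_start = "ATGAATCCCTACTCCGAGAATATACTCACA"
cds_end = "GGTGAAGTTACTTTGAAGGATGATGCCGTGGATGTGTAA"

print(translate(cds_start))  # MNPYSENILT
print(translate(cds_end))    # GEVTLKDDAVDV
```

Both fragments agree with the start and end of the protein sequence listed above, and the CDS ends with the stop codon TAA. Running `translate` on the full CDS would be the complete version of this check.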