; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh09G008180 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh09G008180
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionUnknown protein
Genome locationCmo_Chr09:4201164..4205598
RNA-Seq ExpressionCmoCh09G008180
SyntenyCmoCh09G008180
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591921.1 hypothetical protein SDJN03_14267, partial [Cucurbita argyrosperma subsp. sororia]1.2e-29197.87Show/hide
Query:  MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL
        MNPYSEERLTEEVLYLHSLW RGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL
Subjt:  MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL

Query:  VSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNC
        VSSEER NRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNC
Subjt:  VSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNC

Query:  IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
        IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLA SGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
Subjt:  IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA

Query:  EDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLQVPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE
        EDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVL VPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE
Subjt:  EDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLQVPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE

Query:  FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLK---DNP
        FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLK EGHGCSLTKLDVLK   D+P
Subjt:  FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLK---DNP

Query:  VGNAGDNTNEVDDPVR
        VGNAGDN NEVDDPV+
Subjt:  VGNAGDNTNEVDDPVR

KAG7024795.1 hypothetical protein SDJN02_13614, partial [Cucurbita argyrosperma subsp. argyrosperma]5.2e-28498.99Show/hide
Query:  MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL
        MNPYSEERLTEEVLYLHSLW RGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL
Subjt:  MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL

Query:  VSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNC
        VSSEER NRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNC
Subjt:  VSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNC

Query:  IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
        IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLA SGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
Subjt:  IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA

Query:  EDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLQVPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE
        EDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVL VPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE
Subjt:  EDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLQVPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE

Query:  FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLK
        FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAI+LKGEGHGCSLTKLDVLK
Subjt:  FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLK

XP_022937203.1 uncharacterized protein LOC111443568 isoform X1 [Cucurbita moschata]1.0e-27994.04Show/hide
Query:  MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL
        MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL
Subjt:  MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL

Query:  VSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNC
        VSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNC
Subjt:  VSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNC

Query:  IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
        IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
Subjt:  IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA

Query:  EDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLQVPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE
        EDLISGE                               VPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE
Subjt:  EDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLQVPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE

Query:  FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLKDNPVGN
        FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLKDNPVGN
Subjt:  FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLKDNPVGN

Query:  AGDNTNEVDDPVRDDSTEID
        AGDNTNEVDDPVRDDSTEID
Subjt:  AGDNTNEVDDPVRDDSTEID

XP_022937204.1 uncharacterized protein LOC111443568 isoform X2 [Cucurbita moschata]4.7e-27793.65Show/hide
Query:  MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL
        MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL
Subjt:  MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL

Query:  VSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNC
        VSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNC
Subjt:  VSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNC

Query:  IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
        IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFK  PEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
Subjt:  IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA

Query:  EDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLQVPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE
        EDLISGE                               VPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE
Subjt:  EDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLQVPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE

Query:  FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLKDNPVGN
        FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLKDNPVGN
Subjt:  FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLKDNPVGN

Query:  AGDNTNEVDDPVRDDSTEID
        AGDNTNEVDDPVRDDSTEID
Subjt:  AGDNTNEVDDPVRDDSTEID

XP_023535254.1 uncharacterized protein LOC111796743 isoform X1 [Cucurbita pepo subsp. pepo]1.1e-27392.31Show/hide
Query:  MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL
        MNPYSEERLTEEVLYLHSLW+RGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL
Subjt:  MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL

Query:  VSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNC
        VSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKN EDGLFCCLVC GMGKKKSGKRFKNC
Subjt:  VSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNC

Query:  IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
        IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLA SGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
Subjt:  IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA

Query:  EDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLQVPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE
        EDLISGE                               VPESI EACEEFFAA LTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE
Subjt:  EDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLQVPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE

Query:  FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLKDNPVGN
        FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLK+KMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSL KLDVLKD+PVGN
Subjt:  FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLKDNPVGN

Query:  AGDNTNEVDDPVRDDSTEID
        AGDNTNEVDDPVRDDSTEID
Subjt:  AGDNTNEVDDPVRDDSTEID

TrEMBL top hitse value%identityAlignment
A0A1S3CJZ0 uncharacterized protein LOC103501816 isoform X11.5e-16461.85Show/hide
Query:  MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVA--AATNKRPRD---TKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPM-PCA
        M+PYS+ERLT+EVLYLHSLW RGPPR PKPT  + STAVA    +NKRP D    KN+ +KKKKPR +P QD+GPEWPCPEPVQNQPSTSSGWPP+ P A
Subjt:  MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVA--AATNKRPRD---TKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPM-PCA

Query:  TPAARLVSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSG
        TPAA+LVSSEER N  ALQLQYKG +ACR+F  RNADSGSDEE EEEE +DGE+MES+EY FFL +F+EN+ELR YYEKNCE GLFCCLVC GMGKKK G
Subjt:  TPAARLVSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSG

Query:  KRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKW
        K+FKNC+ LV HS SIS TKKK AHRAFG  V RVFGWDIDRLPTIVL GEPLSRSLA SGD K QPEE  V  +      NE V++  +E    +EQK 
Subjt:  KRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKW

Query:  EEEKTAEDLISGEK---TKNDDSSVVVTECRKHVVSSDELI--------QLDVLQVPESITEACEEFFAAFLTSMADDDVSE---NNAIEEREEFKFFLK
        EE KTAED  S  K   +  +D +   T+ +  V ++D  I        ++D L V  +I  AC+EF AAF  SM DDDVSE    +  EEREEFKFFLK
Subjt:  EEEKTAEDLISGEK---TKNDDSSVVVTECRKHVVSSDELI--------QLDVLQVPESITEACEEFFAAFLTSMADDDVSE---NNAIEEREEFKFFLK

Query:  LFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKG
        LF ENE+LRRYY+N Y DGEF+CL CE AG+K ++ FKTC RLL+H+T  GKN   K+  KP   K+LK+ MLAHRAY+ V+C+VLG DI+ LPAIVL G
Subjt:  LFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKG

Query:  EGHGCSLTKLDVLKDNPVGNAGDNTNEVDDPVRDDSTEID
        E  G SLTK DV K     +    ++  DD V DDSTE++
Subjt:  EGHGCSLTKLDVLKDNPVGNAGDNTNEVDDPVRDDSTEID

A0A6J1FAI7 uncharacterized protein LOC111443568 isoform X22.3e-27793.65Show/hide
Query:  MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL
        MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL
Subjt:  MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL

Query:  VSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNC
        VSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNC
Subjt:  VSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNC

Query:  IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
        IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFK  PEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
Subjt:  IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA

Query:  EDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLQVPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE
        EDLISGE                               VPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE
Subjt:  EDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLQVPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE

Query:  FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLKDNPVGN
        FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLKDNPVGN
Subjt:  FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLKDNPVGN

Query:  AGDNTNEVDDPVRDDSTEID
        AGDNTNEVDDPVRDDSTEID
Subjt:  AGDNTNEVDDPVRDDSTEID

A0A6J1FFD4 uncharacterized protein LOC111443568 isoform X14.9e-28094.04Show/hide
Query:  MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL
        MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL
Subjt:  MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL

Query:  VSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNC
        VSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNC
Subjt:  VSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNC

Query:  IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
        IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
Subjt:  IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA

Query:  EDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLQVPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE
        EDLISGE                               VPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE
Subjt:  EDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLQVPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE

Query:  FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLKDNPVGN
        FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLKDNPVGN
Subjt:  FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLKDNPVGN

Query:  AGDNTNEVDDPVRDDSTEID
        AGDNTNEVDDPVRDDSTEID
Subjt:  AGDNTNEVDDPVRDDSTEID

A0A6J1IMA4 uncharacterized protein LOC111476868 isoform X12.7e-26289.04Show/hide
Query:  MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL
        MNPYSEERLTEEVLYLHSLW+RGPPRGPKPTRYYLSTAVAAATNKRPRD KNR+QKKKK R EPLQDTGPEWP PEPVQNQP TSSGWPPMPCATPAARL
Subjt:  MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL

Query:  VSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNC
        VSSEERANRVALQLQY GIEACRRFL RNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKN EDGLFCCLVCGGMGKKKSGKRFKNC
Subjt:  VSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNC

Query:  IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
        IGLVHHSNSISRTKKKVAHRAFGQA+CRVFGWDIDRLPTIVLNGEPLSRSLA SGDFKDQPEE+QVAEEHDSWV  ENVAI ND+IDMKNEQKWEEEKTA
Subjt:  IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA

Query:  EDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLQVPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE
        E+ ISGE                               VPESI EACEEFFAAFLTSMADDDVSENNAIEE EEFKFFLKLFIENESLRRYYKNKYDDGE
Subjt:  EDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLQVPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE

Query:  FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLKDNPVGN
        FSCLVC+GAGKKTLRSFKTCVRLLRHTTY GKNKTG KRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLKD+PVGN
Subjt:  FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLKDNPVGN

Query:  AGDNTNEVDDPVRDDSTEID
        AGDNTNEVDDPV+DDSTEID
Subjt:  AGDNTNEVDDPVRDDSTEID

A0A6J1INL5 uncharacterized protein LOC111476868 isoform X21.3e-25988.65Show/hide
Query:  MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL
        MNPYSEERLTEEVLYLHSLW+RGPPRGPKPTRYYLSTAVAAATNKRPRD KNR+QKKKK R EPLQDTGPEWP PEPVQNQP TSSGWPPMPCATPAARL
Subjt:  MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARL

Query:  VSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNC
        VSSEERANRVALQLQY GIEACRRFL RNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKN EDGLFCCLVCGGMGKKKSGKRFKNC
Subjt:  VSSEERANRVALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNC

Query:  IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA
        IGLVHHSNSISRTKKKVAHRAFGQA+CRVFGWDIDRLPTIVLNGEPLSRSLA SGDFK  PEE+QVAEEHDSWV  ENVAI ND+IDMKNEQKWEEEKTA
Subjt:  IGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTA

Query:  EDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLQVPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE
        E+ ISGE                               VPESI EACEEFFAAFLTSMADDDVSENNAIEE EEFKFFLKLFIENESLRRYYKNKYDDGE
Subjt:  EDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLQVPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGE

Query:  FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLKDNPVGN
        FSCLVC+GAGKKTLRSFKTCVRLLRHTTY GKNKTG KRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLKD+PVGN
Subjt:  FSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLKDNPVGN

Query:  AGDNTNEVDDPVRDDSTEID
        AGDNTNEVDDPV+DDSTEID
Subjt:  AGDNTNEVDDPVRDDSTEID

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G78810.1 unknown protein2.4e-5331.15Show/hide
Query:  MNPYSEERLTEEVLYLHSLWQRGPP-RGPKPTRYY---------------------LSTAVAAAT----NKRPRDTKNRKQKKKKPRLEPLQDTGPEWPC
        MN Y +E L +EV+YLHSLW +GPP R P P+  +                     L +   A T    ++ P + +N     K+PR     D+G EWP 
Subjt:  MNPYSEERLTEEVLYLHSLWQRGPP-RGPKPTRYY---------------------LSTAVAAAT----NKRPRDTKNRKQKKKKPRLEPLQDTGPEWPC

Query:  PEPVQNQPSTSSGWPP-MPCATPAARLVSSEERANRVALQLQYKGIEACRRFLIRNAD------SGSDEEVEEEEGNDGEIME------SEEYKFFLNLF
         + V   PST SGWP   PC     R +S+EE+    A  LQ      CR F  R +       +G DE  E +EG++ + +E      S+E++F   +F
Subjt:  PEPVQNQPSTSSGWPP-MPCATPAARLVSSEERANRVALQLQYKGIEACRRFLIRNAD------SGSDEEVEEEEGNDGEIME------SEEYKFFLNLF

Query:  MENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQP
         EN +L+ YYEKN  +G F CLVCGG+G +KS ++FK+C+ L+ HS +I +T  K+ HRA  Q VC V GWD+                           
Subjt:  MENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQP

Query:  EENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTAEDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLQVPESITEACEEFFAAFLTSMADD
                                    N      +K ++ ++ G      DS   + + ++ V+S +E  +  VLQ+ ++ +EA ++ F    T  A D
Subjt:  EENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTAEDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLQVPESITEACEEFFAAFLTSMADD

Query:  DVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGA-GKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSL
           EN      EE +   K+F EN  L+ YY+  Y+ G F CLVC  A  KK L+ FK C  +++H T                 K+ K+K+ AH+ ++ 
Subjt:  DVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGA-GKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSL

Query:  VICQVLGWDIEKLPAIVLKG
         +C++LGWD E LP  V+KG
Subjt:  VICQVLGWDIEKLPAIVLKG

AT1G78810.2 unknown protein2.4e-5331.15Show/hide
Query:  MNPYSEERLTEEVLYLHSLWQRGPP-RGPKPTRYY---------------------LSTAVAAAT----NKRPRDTKNRKQKKKKPRLEPLQDTGPEWPC
        MN Y +E L +EV+YLHSLW +GPP R P P+  +                     L +   A T    ++ P + +N     K+PR     D+G EWP 
Subjt:  MNPYSEERLTEEVLYLHSLWQRGPP-RGPKPTRYY---------------------LSTAVAAAT----NKRPRDTKNRKQKKKKPRLEPLQDTGPEWPC

Query:  PEPVQNQPSTSSGWPP-MPCATPAARLVSSEERANRVALQLQYKGIEACRRFLIRNAD------SGSDEEVEEEEGNDGEIME------SEEYKFFLNLF
         + V   PST SGWP   PC     R +S+EE+    A  LQ      CR F  R +       +G DE  E +EG++ + +E      S+E++F   +F
Subjt:  PEPVQNQPSTSSGWPP-MPCATPAARLVSSEERANRVALQLQYKGIEACRRFLIRNAD------SGSDEEVEEEEGNDGEIME------SEEYKFFLNLF

Query:  MENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQP
         EN +L+ YYEKN  +G F CLVCGG+G +KS ++FK+C+ L+ HS +I +T  K+ HRA  Q VC V GWD+                           
Subjt:  MENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHRAFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQP

Query:  EENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTAEDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLQVPESITEACEEFFAAFLTSMADD
                                    N      +K ++ ++ G      DS   + + ++ V+S +E  +  VLQ+ ++ +EA ++ F    T  A D
Subjt:  EENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTAEDLISGEKTKNDDSSVVVTECRKHVVSSDELIQLDVLQVPESITEACEEFFAAFLTSMADD

Query:  DVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGA-GKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSL
           EN      EE +   K+F EN  L+ YY+  Y+ G F CLVC  A  KK L+ FK C  +++H T                 K+ K+K+ AH+ ++ 
Subjt:  DVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGA-GKKTLRSFKTCVRLLRHTTYPGKNKTGKKRVKPHIAKMLKIKMLAHRAYSL

Query:  VICQVLGWDIEKLPAIVLKG
         +C++LGWD E LP  V+KG
Subjt:  VICQVLGWDIEKLPAIVLKG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCCTTACTCCGAGGAAAGACTCACCGAAGAGGTTCTCTATCTCCACTCTCTGTGGCAGCGAGGTCCGCCGAGGGGCCCTAAGCCCACTCGCTATTATTTATCCAC
CGCCGTCGCCGCTGCTACGAATAAGAGACCCAGAGACACAAAGAATCGAAAGCAAAAGAAGAAGAAGCCACGCCTCGAGCCATTACAAGACACCGGCCCCGAATGGCCCT
GCCCGGAGCCAGTGCAAAATCAGCCCTCGACGTCATCTGGGTGGCCGCCAATGCCCTGTGCTACTCCGGCGGCTCGGCTGGTGTCGTCTGAAGAGCGAGCAAATCGTGTG
GCGTTGCAATTGCAGTACAAGGGTATCGAGGCTTGCCGGAGATTTCTCATTAGAAATGCCGATTCAGGGAGTGATGAAGAGGTGGAGGAGGAAGAGGGGAATGATGGGGA
GATTATGGAAAGTGAAGAGTACAAATTCTTTTTGAATCTGTTCATGGAGAATGATGAACTTAGGGGCTATTACGAGAAGAATTGTGAAGATGGGTTGTTTTGTTGCTTGG
TTTGTGGTGGAATGGGGAAGAAGAAATCTGGGAAAAGGTTTAAGAACTGCATTGGGCTTGTTCATCATTCGAATTCGATATCAAGGACGAAGAAGAAGGTGGCTCATAGG
GCTTTTGGACAGGCCGTATGCAGGGTTTTTGGTTGGGATATTGATCGACTTCCAACCATTGTGTTGAATGGCGAGCCTCTCAGTCGATCATTAGCCACCTCTGGAGATTT
TAAGGATCAGCCAGAGGAAAATCAGGTGGCTGAAGAACATGATTCTTGGGTTCATAATGAAAATGTAGCCATTTTGAATGATGAAATTGATATGAAGAATGAACAGAAAT
GGGAGGAAGAAAAGACAGCTGAAGATTTGATTTCTGGCGAGAAAACGAAGAACGATGATTCCTCGGTGGTCGTAACCGAATGCCGAAAACATGTAGTTTCTTCTGATGAG
CTGATACAGTTGGATGTGTTGCAGGTACCCGAGTCGATTACGGAAGCATGTGAAGAATTTTTTGCTGCCTTCTTGACATCTATGGCTGACGACGATGTTAGCGAAAACAA
CGCAATCGAGGAACGCGAAGAGTTCAAATTCTTTTTAAAGCTGTTCATTGAGAATGAAAGCTTGAGAAGATATTACAAGAACAAGTATGATGATGGAGAATTTTCGTGTT
TAGTTTGTGAAGGAGCGGGAAAGAAAACGTTGAGGAGTTTTAAGACGTGCGTTCGCCTTCTCCGACATACAACTTATCCTGGGAAGAACAAAACAGGGAAAAAACGGGTT
AAGCCTCACATTGCTAAGATGTTGAAAATAAAGATGCTGGCTCATAGAGCATATAGTTTAGTTATATGCCAGGTTCTTGGTTGGGACATAGAAAAGCTTCCTGCAATCGT
GTTAAAAGGCGAAGGCCATGGTTGTTCGTTAACGAAGCTAGACGTGTTGAAAGACAACCCGGTTGGCAATGCAGGTGATAATACGAACGAAGTAGATGATCCTGTGAGAG
ATGACTCCACTGAGATCGACTAA
mRNA sequenceShow/hide mRNA sequence
GAATGAACACAGTTTAAATCTCAGAAGCATTTTATCAAACAGGTAGTGAACCCTGTAAAAAGACAACATCTCTTTCACTGGATGAGCAGTCAACGTTTCCCCATTTCAAA
GAATCGAAAACAGACACTACTGAAGTTTTGTATTCGAACATGCCTTCAAGAGCGCGAACCCATTAATCTCCTCGGCGGAAGAAACAGGGTCGGAACCAAAAGAAGCGCAA
TCAGAATGAAACCAACGGGTCATGTCGAATCCCCGCCCATCTCTGCTCACAAAACCCAACCTCAGTGCCTTTGCAGGAATGGTTTTGGAGCAGTTCTTGCATGTGGAACG
ATTCGACTTGGCGTACTCGGCAACAATTTTTACAGCAGATGTAGACATTGGTGGACCTTGAAAGAAAATACGGCGACTGTGAGAAGAGAAACGAGAGAAATGGCTCAGTG
GGGACTTGGGAATTTGAAGGAGGGAAGAAAGTATGTGGTTGTTACAGTAGGTCATATGACTGGTACGGACTGTGGAGCCGATCAGCGTACGGACGAGGGTAGAATGCACA
AGGTAGAAGACGACGAGCGACGAAATTCCCCTTCTAAATTCTAATTGAGATATATATATAGAGAGAGATATATATATATAGAGAGAGAGAGAGAGAGAGAGAGCACATAA
ATCATAAGCGCCTATTTGTGATTTATGATCGGAGAGGATGAGACGAGCGATGATCCCCAACCTCTATTACTCTTGATTCCATCATCTTTCCACCAATGAATCCTTACTCC
GAGGAAAGACTCACCGAAGAGGTTCTCTATCTCCACTCTCTGTGGCAGCGAGGTCCGCCGAGGGGCCCTAAGCCCACTCGCTATTATTTATCCACCGCCGTCGCCGCTGC
TACGAATAAGAGACCCAGAGACACAAAGAATCGAAAGCAAAAGAAGAAGAAGCCACGCCTCGAGCCATTACAAGACACCGGCCCCGAATGGCCCTGCCCGGAGCCAGTGC
AAAATCAGCCCTCGACGTCATCTGGGTGGCCGCCAATGCCCTGTGCTACTCCGGCGGCTCGGCTGGTGTCGTCTGAAGAGCGAGCAAATCGTGTGGCGTTGCAATTGCAG
TACAAGGGTATCGAGGCTTGCCGGAGATTTCTCATTAGAAATGCCGATTCAGGGAGTGATGAAGAGGTGGAGGAGGAAGAGGGGAATGATGGGGAGATTATGGAAAGTGA
AGAGTACAAATTCTTTTTGAATCTGTTCATGGAGAATGATGAACTTAGGGGCTATTACGAGAAGAATTGTGAAGATGGGTTGTTTTGTTGCTTGGTTTGTGGTGGAATGG
GGAAGAAGAAATCTGGGAAAAGGTTTAAGAACTGCATTGGGCTTGTTCATCATTCGAATTCGATATCAAGGACGAAGAAGAAGGTGGCTCATAGGGCTTTTGGACAGGCC
GTATGCAGGGTTTTTGGTTGGGATATTGATCGACTTCCAACCATTGTGTTGAATGGCGAGCCTCTCAGTCGATCATTAGCCACCTCTGGAGATTTTAAGGATCAGCCAGA
GGAAAATCAGGTGGCTGAAGAACATGATTCTTGGGTTCATAATGAAAATGTAGCCATTTTGAATGATGAAATTGATATGAAGAATGAACAGAAATGGGAGGAAGAAAAGA
CAGCTGAAGATTTGATTTCTGGCGAGAAAACGAAGAACGATGATTCCTCGGTGGTCGTAACCGAATGCCGAAAACATGTAGTTTCTTCTGATGAGCTGATACAGTTGGAT
GTGTTGCAGGTACCCGAGTCGATTACGGAAGCATGTGAAGAATTTTTTGCTGCCTTCTTGACATCTATGGCTGACGACGATGTTAGCGAAAACAACGCAATCGAGGAACG
CGAAGAGTTCAAATTCTTTTTAAAGCTGTTCATTGAGAATGAAAGCTTGAGAAGATATTACAAGAACAAGTATGATGATGGAGAATTTTCGTGTTTAGTTTGTGAAGGAG
CGGGAAAGAAAACGTTGAGGAGTTTTAAGACGTGCGTTCGCCTTCTCCGACATACAACTTATCCTGGGAAGAACAAAACAGGGAAAAAACGGGTTAAGCCTCACATTGCT
AAGATGTTGAAAATAAAGATGCTGGCTCATAGAGCATATAGTTTAGTTATATGCCAGGTTCTTGGTTGGGACATAGAAAAGCTTCCTGCAATCGTGTTAAAAGGCGAAGG
CCATGGTTGTTCGTTAACGAAGCTAGACGTGTTGAAAGACAACCCGGTTGGCAATGCAGGTGATAATACGAACGAAGTAGATGATCCTGTGAGAGATGACTCCACTGAGA
TCGACTAAGTTCACAACCAATCCGTCGGTGCAGTCAGGATGATACTGGAGAAGATGACTCGAAAAAGGTTTTGGCTCTGCCCCGAGCTCGTTATAACCAAGTCGGTCTCT
AATGGCGACTGAAGATGAGAGGAGCAGGGCAAGGATGATTCACAACGTAACAGGGAGGAGAGTTTCAGTTTCTTTCATTTCTAATTTTCTTTTTAAGTTCATTTCATTTA
TATCGAGGCGAATCAAATTACATGAATTTCGAACAAATTTAGATTGAAATTCATCGACTCCCATTAATTTCTTGTCAAAATTCATGAATCCCGACCACATTTCACTTGGA
ATTCATGATTCCTAACCATTTTCGACGCATATTTATAAATTTTGATTGAGATTATGGTTGTGAACAGGTCAAAATTTTTTCATTTAATAATAATTTTTAAAATAACTTAA
TTATTACACAAAAATATTATTATTATTATTTATCATTTTTTTCAAATACAACCTTGAC
Protein sequenceShow/hide protein sequence
MNPYSEERLTEEVLYLHSLWQRGPPRGPKPTRYYLSTAVAAATNKRPRDTKNRKQKKKKPRLEPLQDTGPEWPCPEPVQNQPSTSSGWPPMPCATPAARLVSSEERANRV
ALQLQYKGIEACRRFLIRNADSGSDEEVEEEEGNDGEIMESEEYKFFLNLFMENDELRGYYEKNCEDGLFCCLVCGGMGKKKSGKRFKNCIGLVHHSNSISRTKKKVAHR
AFGQAVCRVFGWDIDRLPTIVLNGEPLSRSLATSGDFKDQPEENQVAEEHDSWVHNENVAILNDEIDMKNEQKWEEEKTAEDLISGEKTKNDDSSVVVTECRKHVVSSDE
LIQLDVLQVPESITEACEEFFAAFLTSMADDDVSENNAIEEREEFKFFLKLFIENESLRRYYKNKYDDGEFSCLVCEGAGKKTLRSFKTCVRLLRHTTYPGKNKTGKKRV
KPHIAKMLKIKMLAHRAYSLVICQVLGWDIEKLPAIVLKGEGHGCSLTKLDVLKDNPVGNAGDNTNEVDDPVRDDSTEID