; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g23370 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g23370
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr4:16895583..16901559
RNA-Seq ExpressionMoc04g23370
SyntenyMoc04g23370
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]1.8e-25289.84Show/hide
Query:  QEEIDQLRGQLDAQVEALKPKCEQKESPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSAR
        +EE DQLRGQLDAQVEALK KCEQKE PLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASDAIKCRAF+IAL GSAR
Subjt:  QEEIDQLRGQLDAQVEALKPKCEQKESPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSAR

Query:  LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGDEAPTTFAEVL
        LWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLG+EAP TFAEVL
Subjt:  LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGDEAPTTFAEVL

Query:  QKAKKVIDGQELLRTKTGRPEGKIGRGRSGKDIEKAGPKSKDKGSFSSGRAAYRRPENGPTRSRPYERFTLTTIPISEILTNIEESGMEKLLKRPEKLRG
        QKAKKVIDGQELLRTKTGRPE KIGRGRSGKDIE A PKSKDKGSFSSGRA YRR ENGPTRSRPYERFT TTIPISEILTNIEESGMEKLLKRPEKLRG
Subjt:  QKAKKVIDGQELLRTKTGRPEGKIGRGRSGKDIEKAGPKSKDKGSFSSGRAAYRRPENGPTRSRPYERFTLTTIPISEILTNIEESGMEKLLKRPEKLRG

Query:  APERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRLAVINTIFGGPSGSQSGHKRKD--------
        APERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDR AVINTIFGGPSG QSG KRK+        
Subjt:  APERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRLAVINTIFGGPSGSQSGHKRKD--------

Query:  -----------------ADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLRQ
                         ADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEG IDLPVTL Q
Subjt:  -----------------ADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLRQ

Query:  DQTQVTQMAEFV
        DQTQVTQMAEFV
Subjt:  DQTQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]1.1e-24677.58Show/hide
Query:  QEEIDQLRGQLDAQVEALKPKCEQKESPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSAR
        +EE DQLRG+L+AQVEALK KCEQKE PLNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL GSAR
Subjt:  QEEIDQLRGQLDAQVEALKPKCEQKESPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSAR

Query:  LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGDEAPTTFAEVL
        LW                                                     FQE+QLKVA  SDDSAMCYFLTGLADEALTVKLG EAP TFAEVL
Subjt:  LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGDEAPTTFAEVL

Query:  QKAKKVIDGQELLRTKTGRPEGKIGRGRSGKDIEKAGPKSKDKGSFSSGRAAYRRPENGPTRSRPYERFTLTTIPISEILTNIEESGMEKLLKRPEKLRG
        QKAKKVIDGQELLRTKTGRPE  I RGRSGKD EKA  KSKDKGSFSSGRA +RR  NGPTRSRPYERFT TTIPISEILTNIEESGMEKLLKRPEKLRG
Subjt:  QKAKKVIDGQELLRTKTGRPEGKIGRGRSGKDIEKAGPKSKDKGSFSSGRAAYRRPENGPTRSRPYERFTLTTIPISEILTNIEESGMEKLLKRPEKLRG

Query:  APERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRLAVINTIFGGPSGSQSGHKRKD--------
        APERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DR AVINTIFGGPSG QSGHKRK+        
Subjt:  APERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRLAVINTIFGGPSGSQSGHKRKD--------

Query:  -----------------ADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLRQ
                         ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFS ESVIPEGCIDLPVTL  
Subjt:  -----------------ADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLRQ

Query:  DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTNSRECYASALKGSSVCTIEPLASGDGTLEFKADLPRREFAA
        DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ  SRECYASALKGSSVC +E L S DGTLEFKA+LPRREFAA
Subjt:  DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTNSRECYASALKGSSVCTIEPLASGDGTLEFKADLPRREFAA

Query:  PTEELELVPLL
        PTEELELVPLL
Subjt:  PTEELELVPLL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]1.1e-20484.98Show/hide
Query:  MCYFLTGLADEALTVKLGDEAPTTFAEVLQKAKKVIDGQELLRTKTGRPEGKIGRGRSGKDIEKAGPKSKDKGSFSSGRAAYRRPENGPTRSRPYERFTL
        MCYFLTGLADEALTVKL +EAP TFAEVLQKAKKVIDGQELLRT       KIG+GRSGKD+E   PKSKDKGSFS+GRA YRR ENGPTRSRPYERFT 
Subjt:  MCYFLTGLADEALTVKLGDEAPTTFAEVLQKAKKVIDGQELLRTKTGRPEGKIGRGRSGKDIEKAGPKSKDKGSFSSGRAAYRRPENGPTRSRPYERFTL

Query:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRL
        TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDR 
Subjt:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRL

Query:  AVINTIFGGPSGSQSGHKRK-------------------------DADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPSG QSGHKRK                          ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGSQSGHKRK-------------------------DADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KKSPTPLVGFSGESVIPEGCIDLPVTLRQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTNSRECYASALKG
        KKSPTPLVGFSGESV+PEGCIDLPVTL QDQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQT SRECYAS LKG
Subjt:  KKSPTPLVGFSGESVIPEGCIDLPVTLRQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTNSRECYASALKG

Query:  SSVCTIEPLASGDGTLEFKADLPRREFAAPTEELELVPLLSPEKQL
        +SVC +E L S DGTLEF+ADLP REFAAP EELELVPLLS EKQ+
Subjt:  SSVCTIEPLASGDGTLEFKADLPRREFAAPTEELELVPLLSPEKQL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]5.5e-24178.09Show/hide
Query:  QEEIDQLRGQLDAQVEALKPKCEQKESPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSAR
        +EE DQL+ + DAQVEALK +CE+KES  +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIAL GSAR
Subjt:  QEEIDQLRGQLDAQVEALKPKCEQKESPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSAR

Query:  LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGDEAPTTFAEVL
        LWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL +EAP TFAEVL
Subjt:  LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGDEAPTTFAEVL

Query:  QKAKKVIDGQELLRTKTGRPEGKIGRGRSGKDIEKAGPKSKDKG-SFSSGRAAYRRPENGPTRSRPYERFTLTTIPISEILTNIEESGMEKLLKRPEKLR
        QK KKVIDGQELLRTKTGRPE  I +GR+GKD  KA  KS+DKG S SS R  YRR  +   +SRPYE +T TTIPI EILTNIEE+GMEKLLKRPEKLR
Subjt:  QKAKKVIDGQELLRTKTGRPEGKIGRGRSGKDIEKAGPKSKDKG-SFSSGRAAYRRPENGPTRSRPYERFTLTTIPISEILTNIEESGMEKLLKRPEKLR

Query:  GAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRLAVIN------------TIFGGPSGSQSG
        G PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DR AVIN                      S 
Subjt:  GAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRLAVIN------------TIFGGPSGSQSG

Query:  HKRKDADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLRQDQTQVTQMAEFV
             ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGES+  EGCIDLPV++RQD TQVTQMAEFV
Subjt:  HKRKDADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLRQDQTQVTQMAEFV

Query:  VIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTNSRECYASALKGSSVCTIE
        VIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVC +E
Subjt:  VIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTNSRECYASALKGSSVCTIE

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]1.4e-20471.69Show/hide
Query:  MDFQAASDAIKCRAFQIALIGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIAL GSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALIGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGDEAPTTFAEVLQKAKKVIDGQELLRTKTGRPEGKIGRGRSGKDIEKAGPKSKDKGSFSS-GRAAYRRPENGPTRSRPYERFTLTTIP
        T LADE LTVKLG+EAPTTF EVLQKAKKVIDGQELLRTKTGRPE +I + +  ++  KA  KS+DKGS SS  R  YRR E+GP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGDEAPTTFAEVLQKAKKVIDGQELLRTKTGRPEGKIGRGRSGKDIEKAGPKSKDKGSFSS-GRAAYRRPENGPTRSRPYERFTLTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRLAVIN
        ISEILTNIEESGMEKLLKRPEKLRG  E+R+K+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DR AVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRLAVIN

Query:  TIFGGPSGSQSGHKRK-------------------------DADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP
        TIFGGP+G QSG+KRK                         DADLE VHLPHNDALVIA LIDH +VRRVL+DG                          
Subjt:  TIFGGPSGSQSGHKRK-------------------------DADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP

Query:  TPLVGFSGESVIPEGCIDLPVTLRQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTNSRECYASALKGSSVC
                      GCIDLPVT+ QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN VG VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLRQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTNSRECYASALKGSSVC

Query:  TIEPLASGDGTLEFKADLP---RREFAAPTEELELVPLLSPEKQ
         +E   +     E +ADLP   +R+F  PTEELELVPLLSPE+Q
Subjt:  TIEPLASGDGTLEFKADLP---RREFAAPTEELELVPLLSPEKQ

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088138.8e-25389.84Show/hide
Query:  QEEIDQLRGQLDAQVEALKPKCEQKESPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSAR
        +EE DQLRGQLDAQVEALK KCEQKE PLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASDAIKCRAF+IAL GSAR
Subjt:  QEEIDQLRGQLDAQVEALKPKCEQKESPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSAR

Query:  LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGDEAPTTFAEVL
        LWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLG+EAP TFAEVL
Subjt:  LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGDEAPTTFAEVL

Query:  QKAKKVIDGQELLRTKTGRPEGKIGRGRSGKDIEKAGPKSKDKGSFSSGRAAYRRPENGPTRSRPYERFTLTTIPISEILTNIEESGMEKLLKRPEKLRG
        QKAKKVIDGQELLRTKTGRPE KIGRGRSGKDIE A PKSKDKGSFSSGRA YRR ENGPTRSRPYERFT TTIPISEILTNIEESGMEKLLKRPEKLRG
Subjt:  QKAKKVIDGQELLRTKTGRPEGKIGRGRSGKDIEKAGPKSKDKGSFSSGRAAYRRPENGPTRSRPYERFTLTTIPISEILTNIEESGMEKLLKRPEKLRG

Query:  APERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRLAVINTIFGGPSGSQSGHKRKD--------
        APERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDR AVINTIFGGPSG QSG KRK+        
Subjt:  APERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRLAVINTIFGGPSGSQSGHKRKD--------

Query:  -----------------ADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLRQ
                         ADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEG IDLPVTL Q
Subjt:  -----------------ADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLRQ

Query:  DQTQVTQMAEFV
        DQTQVTQMAEFV
Subjt:  DQTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188235.5e-24777.58Show/hide
Query:  QEEIDQLRGQLDAQVEALKPKCEQKESPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSAR
        +EE DQLRG+L+AQVEALK KCEQKE PLNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL GSAR
Subjt:  QEEIDQLRGQLDAQVEALKPKCEQKESPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSAR

Query:  LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGDEAPTTFAEVL
        LW                                                     FQE+QLKVA  SDDSAMCYFLTGLADEALTVKLG EAP TFAEVL
Subjt:  LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGDEAPTTFAEVL

Query:  QKAKKVIDGQELLRTKTGRPEGKIGRGRSGKDIEKAGPKSKDKGSFSSGRAAYRRPENGPTRSRPYERFTLTTIPISEILTNIEESGMEKLLKRPEKLRG
        QKAKKVIDGQELLRTKTGRPE  I RGRSGKD EKA  KSKDKGSFSSGRA +RR  NGPTRSRPYERFT TTIPISEILTNIEESGMEKLLKRPEKLRG
Subjt:  QKAKKVIDGQELLRTKTGRPEGKIGRGRSGKDIEKAGPKSKDKGSFSSGRAAYRRPENGPTRSRPYERFTLTTIPISEILTNIEESGMEKLLKRPEKLRG

Query:  APERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRLAVINTIFGGPSGSQSGHKRKD--------
        APERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DR AVINTIFGGPSG QSGHKRK+        
Subjt:  APERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRLAVINTIFGGPSGSQSGHKRKD--------

Query:  -----------------ADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLRQ
                         ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFS ESVIPEGCIDLPVTL  
Subjt:  -----------------ADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLRQ

Query:  DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTNSRECYASALKGSSVCTIEPLASGDGTLEFKADLPRREFAA
        DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ  SRECYASALKGSSVC +E L S DGTLEFKA+LPRREFAA
Subjt:  DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTNSRECYASALKGSSVCTIEPLASGDGTLEFKADLPRREFAA

Query:  PTEELELVPLL
        PTEELELVPLL
Subjt:  PTEELELVPLL

A0A6J1DD03 uncharacterized protein LOC1110198995.2e-20584.98Show/hide
Query:  MCYFLTGLADEALTVKLGDEAPTTFAEVLQKAKKVIDGQELLRTKTGRPEGKIGRGRSGKDIEKAGPKSKDKGSFSSGRAAYRRPENGPTRSRPYERFTL
        MCYFLTGLADEALTVKL +EAP TFAEVLQKAKKVIDGQELLRT       KIG+GRSGKD+E   PKSKDKGSFS+GRA YRR ENGPTRSRPYERFT 
Subjt:  MCYFLTGLADEALTVKLGDEAPTTFAEVLQKAKKVIDGQELLRTKTGRPEGKIGRGRSGKDIEKAGPKSKDKGSFSSGRAAYRRPENGPTRSRPYERFTL

Query:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRL
        TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDR 
Subjt:  TTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRL

Query:  AVINTIFGGPSGSQSGHKRK-------------------------DADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPSG QSGHKRK                          ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGSQSGHKRK-------------------------DADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KKSPTPLVGFSGESVIPEGCIDLPVTLRQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTNSRECYASALKG
        KKSPTPLVGFSGESV+PEGCIDLPVTL QDQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQT SRECYAS LKG
Subjt:  KKSPTPLVGFSGESVIPEGCIDLPVTLRQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTNSRECYASALKG

Query:  SSVCTIEPLASGDGTLEFKADLPRREFAAPTEELELVPLLSPEKQL
        +SVC +E L S DGTLEF+ADLP REFAAP EELELVPLLS EKQ+
Subjt:  SSVCTIEPLASGDGTLEFKADLPRREFAAPTEELELVPLLSPEKQL

A0A6J1DHB3 uncharacterized protein LOC1110204792.7e-24178.09Show/hide
Query:  QEEIDQLRGQLDAQVEALKPKCEQKESPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSAR
        +EE DQL+ + DAQVEALK +CE+KES  +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIAL GSAR
Subjt:  QEEIDQLRGQLDAQVEALKPKCEQKESPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSAR

Query:  LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGDEAPTTFAEVL
        LWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL +EAP TFAEVL
Subjt:  LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGDEAPTTFAEVL

Query:  QKAKKVIDGQELLRTKTGRPEGKIGRGRSGKDIEKAGPKSKDKG-SFSSGRAAYRRPENGPTRSRPYERFTLTTIPISEILTNIEESGMEKLLKRPEKLR
        QK KKVIDGQELLRTKTGRPE  I +GR+GKD  KA  KS+DKG S SS R  YRR  +   +SRPYE +T TTIPI EILTNIEE+GMEKLLKRPEKLR
Subjt:  QKAKKVIDGQELLRTKTGRPEGKIGRGRSGKDIEKAGPKSKDKG-SFSSGRAAYRRPENGPTRSRPYERFTLTTIPISEILTNIEESGMEKLLKRPEKLR

Query:  GAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRLAVIN------------TIFGGPSGSQSG
        G PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DR AVIN                      S 
Subjt:  GAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRLAVIN------------TIFGGPSGSQSG

Query:  HKRKDADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLRQDQTQVTQMAEFV
             ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGES+  EGCIDLPV++RQD TQVTQMAEFV
Subjt:  HKRKDADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLRQDQTQVTQMAEFV

Query:  VIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTNSRECYASALKGSSVCTIE
        VIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVC +E
Subjt:  VIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTNSRECYASALKGSSVCTIE

A0A6J1DZB9 uncharacterized protein LOC1110249046.8e-20571.69Show/hide
Query:  MDFQAASDAIKCRAFQIALIGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIAL GSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALIGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTVKLGDEAPTTFAEVLQKAKKVIDGQELLRTKTGRPEGKIGRGRSGKDIEKAGPKSKDKGSFSS-GRAAYRRPENGPTRSRPYERFTLTTIP
        T LADE LTVKLG+EAPTTF EVLQKAKKVIDGQELLRTKTGRPE +I + +  ++  KA  KS+DKGS SS  R  YRR E+GP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGDEAPTTFAEVLQKAKKVIDGQELLRTKTGRPEGKIGRGRSGKDIEKAGPKSKDKGSFSS-GRAAYRRPENGPTRSRPYERFTLTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRLAVIN
        ISEILTNIEESGMEKLLKRPEKLRG  E+R+K+KYCRFHR+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DR AVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRLAVIN

Query:  TIFGGPSGSQSGHKRK-------------------------DADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP
        TIFGGP+G QSG+KRK                         DADLE VHLPHNDALVIA LIDH +VRRVL+DG                          
Subjt:  TIFGGPSGSQSGHKRK-------------------------DADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSP

Query:  TPLVGFSGESVIPEGCIDLPVTLRQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTNSRECYASALKGSSVC
                      GCIDLPVT+ QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN VG VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLRQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTNSRECYASALKGSSVC

Query:  TIEPLASGDGTLEFKADLP---RREFAAPTEELELVPLLSPEKQ
         +E   +     E +ADLP   +R+F  PTEELELVPLLSPE+Q
Subjt:  TIEPLASGDGTLEFKADLP---RREFAAPTEELELVPLLSPEKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCAAGAGAAGGAAGCAAGGCTGGATGAATTTGGTGGAGAAGTTGCTGGAATTTATAGCTGTGCAGAATTGGTCGGCACGAATCACCGCGCCTGTCCTACCA
CCTGCGCACCCTCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGAACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCCCCGACAAGTGAGAACTTGGAC
GCACTCCAGAGAGAAATGGAGGCAATGCGCACGAAAATGCGGTCCATGGAGGAAATGTATAACGAAATGATATTAGCCGCAGGCGCAGGGTCCCGATCTGAGAAC
CGAGTGACGCGCGTTGGCATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGA
GGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAGAAGGACAGGAGGAGATCGACCAGCTGAGGGGCCAGCTCGACGCTCAGGTGGAGGCC
TTAAAGCCAAAATGTGAGCAGAAAGAAAGTCCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGATGTTTTGGAAGCACCGATCCCTCCGAAGTTC
AAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAAGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGATGCAATCAAA
TGCCGCGCTTTTCAGATCGCGCTTATTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTC
GCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACAAGATTCCAG
GAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGATGAGGCC
CCGACCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAAGGAAAGATCGGCCGGGGC
AGAAGTGGAAAAGATATAGAAAAGGCAGGTCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGCGTATCGGAGGCCGGAGAACGGACCTACCAGG
AGCCGACCTTACGAACGCTTCACCCTGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCCGGAATGGAAAAACTACTCAAGCGTCCTGAGAAG
CTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAG
GATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGGCAGAGAAAAAGGAAGAGCGGAAGCGTTCAAGGACGCCGCCCCGGCGC
ACTGACCGACTTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGAGTCAGTCCGGACATAAAAGAAAGGATGCAGACTTGGAGGAGGTCCACCTGCCCCAC
AATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTC
GCCTTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACG
CTTAGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTT
CGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACAGTCCGAGGAGAACAGACCAATTCGAGGGAGTGCTATGCCTCC
GCACTCAAAGGCTCATCGGTCTGCACCATCGAACCTCTCGCCAGTGGGGATGGGACGCTCGAGTTCAAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACT
GAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAATTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCC
TCGATCTCAGAGCTAGATCTGATGGAGATCGGTGCTCCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAG
CGCAGAAAGTTGGCAAGGCGAGCAGCTCAGTTCGCGGTCCGAGGTGGAGCATTGTACCGACGCGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAG
GGCCTGAGGGTCCAAACCCATGTGGGTGCCCTTGATCCGACCTGGGAGGGCCCGTTTGAGGTCAAGGGCATAGTCCGACCTGGGACGTACGTATTGGCCGATCTG
AAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAACGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGATCAAGAGAAGGAAGCAAGGCTGGATGAATTTGGTGGAGAAGTTGCTGGAATTTATAGCTGTGCAGAATTGGTCGGCACGAATCACCGCGCCTGTCCTACCA
CCTGCGCACCCTCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGAACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCCCCGACAAGTGAGAACTTGGAC
GCACTCCAGAGAGAAATGGAGGCAATGCGCACGAAAATGCGGTCCATGGAGGAAATGTATAACGAAATGATATTAGCCGCAGGCGCAGGGTCCCGATCTGAGAAC
CGAGTGACGCGCGTTGGCATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGA
GGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAGAAGGACAGGAGGAGATCGACCAGCTGAGGGGCCAGCTCGACGCTCAGGTGGAGGCC
TTAAAGCCAAAATGTGAGCAGAAAGAAAGTCCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGATGTTTTGGAAGCACCGATCCCTCCGAAGTTC
AAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAAGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGATGCAATCAAA
TGCCGCGCTTTTCAGATCGCGCTTATTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTC
GCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACAAGATTCCAG
GAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGATGAGGCC
CCGACCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAAGGAAAGATCGGCCGGGGC
AGAAGTGGAAAAGATATAGAAAAGGCAGGTCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGCGTATCGGAGGCCGGAGAACGGACCTACCAGG
AGCCGACCTTACGAACGCTTCACCCTGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCCGGAATGGAAAAACTACTCAAGCGTCCTGAGAAG
CTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAG
GATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGGCAGAGAAAAAGGAAGAGCGGAAGCGTTCAAGGACGCCGCCCCGGCGC
ACTGACCGACTTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGAGTCAGTCCGGACATAAAAGAAAGGATGCAGACTTGGAGGAGGTCCACCTGCCCCAC
AATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTC
GCCTTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACG
CTTAGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTT
CGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACAGTCCGAGGAGAACAGACCAATTCGAGGGAGTGCTATGCCTCC
GCACTCAAAGGCTCATCGGTCTGCACCATCGAACCTCTCGCCAGTGGGGATGGGACGCTCGAGTTCAAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACT
GAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAATTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCC
TCGATCTCAGAGCTAGATCTGATGGAGATCGGTGCTCCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAG
CGCAGAAAGTTGGCAAGGCGAGCAGCTCAGTTCGCGGTCCGAGGTGGAGCATTGTACCGACGCGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAG
GGCCTGAGGGTCCAAACCCATGTGGGTGCCCTTGATCCGACCTGGGAGGGCCCGTTTGAGGTCAAGGGCATAGTCCGACCTGGGACGTACGTATTGGCCGATCTG
AAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAACGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MIKRRKQGWMNLVEKLLEFIAVQNWSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPAPAPTSENLDALQREMEAMRTKMRSMEEMYNEMILAAGAGSRSEN
RVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLREGQEEIDQLRGQLDAQVEALKPKCEQKESPLNDGDLGESPFTSDVLEAPIPPKF
KAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALIGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQ
EEQLKVAHCSDDSAMCYFLTGLADEALTVKLGDEAPTTFAEVLQKAKKVIDGQELLRTKTGRPEGKIGRGRSGKDIEKAGPKSKDKGSFSSGRAAYRRPENGPTR
SRPYERFTLTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRR
TDRLAVINTIFGGPSGSQSGHKRKDADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVT
LRQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTNSRECYASALKGSSVCTIEPLASGDGTLEFKADLPRREFAAPT
EELELVPLLSPEKQLASAYETDLARSVPVEILDNPSISELDLMEIGAPESSWMDPIADFIRGNSPQDPKERRKLARRAAQFAVRGGALYRRGFSLPLLRCLTPEE
GLRVQTHVGALDPTWEGPFEVKGIVRPGTYVLADLKGDVLAHPWNAEHLKRYYP