; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g41380 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g41380
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr8:31713233..31715683
RNA-Seq ExpressionMoc08g41380
SyntenyMoc08g41380
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]4.1e-27592.23Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQMEALKVKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQ+EALK KCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQMEALKVKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALT SARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETL+EYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIERADPKSKDKGSFSSGRAEYRRVESGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIE ADPKSKDKGSFSSGRAEYRR E+GPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIERADPKSKDKGSFSSGRAEYRRVESGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLNRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRASSVEKKKERKRSRTPPRRTDRPAVINTIFGGPNGG
        ESGMEKLL RPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPR SS EKK+ERKRSRTPPRRTDRPAVINTIFGGP+GG
Subjt:  ESGMEKLLNRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRASSVEKKKERKRSRTPPRRTDRPAVINTIFGGPNGG

Query:  QSGHKRKELARAARREVCIIREQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANIMSLPTYLALGWTRSQLKRSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQ PTCPITF+GADLEEVHLPHNDALVIAPLIDH+VV RVLVDGG SANI+SLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANIMSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVITEGCIDLPVTLGQDRTRVTQMAEFV
        SVI EG IDLPVTLGQD+T+VTQMAEFV
Subjt:  SVITEGCIDLPVTLGQDRTRVTQMAEFV

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia]1.1e-22494.31Show/hide
Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQ
        KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDF AASDAIKCRAFQIALT SARLWYRRLPARSISTYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FSSR Y KKT THLATIRQKEG TL+EYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAKKVIDGQELLRTKTGRP+RKI
Subjt:  FSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRSGKDIERADPKSKDKGSFSSGRAEYRRVESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLNRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
        GRGRSGKD+ERADPKSKDKGSFSSGRAEYRR ESGPT+SRPYERFTPTTIPISEILTNIEESGMEKLL RPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Subjt:  GRGRSGKDIERADPKSKDKGSFSSGRAEYRRVESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLNRPEKLRGAPERRSKDKYCRFHREHGHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPRASSVEKKKERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFNGADLEEVH
        WELKRQIEDLIQDGYFKKFVGKPR SS EKK+ERKRSRTPPRRTDRPAVINTIFGGP+GGQSGHKRKELARAARREVCIIREQGPTCPITF+GAD EEVH
Subjt:  WELKRQIEDLIQDGYFKKFVGKPRASSVEKKKERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFNGADLEEVH

Query:  LPHNDALVIAPLIDHMVVRRVL
        LPHNDA VIAPLIDH+VVRRVL
Subjt:  LPHNDALVIAPLIDHMVVRRVL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]1.2e-27179.71Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQMEALKVKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQ+EALK KCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQMEALKVKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALT SARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIERADPKSKDKGSFSSGRAEYRRVESGPTRSRPYERFTPTTIPISEIL
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKD E+AD KSKDKGSFSSGRAE+RR  +GPTRSRPYERFTPTTIPISEIL
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIERADPKSKDKGSFSSGRAEYRRVESGPTRSRPYERFTPTTIPISEIL

Query:  TNIEESGMEKLLNRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRASSVEKKKERKRSRTPPRRTDRPAVINTIFGG
        TNIEESGMEKLL RPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPR SS EKK+ERK SRTP RR DRPAVINTIFGG
Subjt:  TNIEESGMEKLLNRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRASSVEKKKERKRSRTPPRRTDRPAVINTIFGG

Query:  PNGGQSGHKRKELARAARREVCIIREQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANIMSLPTYLALGWTRSQLKRSPTPLVG
        P+GGQSGHKRKELARAARREVCIIREQ PTCPITF+ ADLEEVHLPHNDALVIAPLIDH+VVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVG
Subjt:  PNGGQSGHKRKELARAARREVCIIREQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANIMSLPTYLALGWTRSQLKRSPTPLVG

Query:  FSGESVITEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGKPVIHSFRAIPSTLHQVLKYLTPNGVGTVRGEQAASRECYASALKGSSVCALETL
        FS ESVI EGCIDLPVTLG D+T+VTQMAEFVVIDGRSAYNAIFG+P+IHSFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVITEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGKPVIHSFRAIPSTLHQVLKYLTPNGVGTVRGEQAASRECYASALKGSSVCALETL

Query:  AGRDGALEFEADLPRKEFAAPTKELELIPLL
          RDG LEF+A+LPR+EFAAPT+ELEL+PLL
Subjt:  AGRDGALEFEADLPRKEFAAPTKELELIPLL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]3.6e-22388.79Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIERADPKSKDKGSFSSGRAEYRRVESGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG+GRSGKD+E  DPKSKDKGSFS+GRAEYRR E+GPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIERADPKSKDKGSFSSGRAEYRRVESGPTRSRPYERFTP

Query:  TTIPISEILTNIEESGMEKLLNRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRASSVEKKKERKRSRTPPRRTDRP
        TTIPISEILTNIEESGMEKLL RPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPR SS EKK+ERKRSRTPPRRTDRP
Subjt:  TTIPISEILTNIEESGMEKLLNRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRASSVEKKKERKRSRTPPRRTDRP

Query:  AVINTIFGGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANIMSLPTYLALGWTRSQL
        AVINTIFGGP+GGQSGHKRK+LARAARREVCIIREQ PTCPITF+ ADL EVHLPHNDALVIAPLIDH+VVRRVLVDGGASANI+SLPTYLALGWTRSQL
Subjt:  AVINTIFGGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANIMSLPTYLALGWTRSQL

Query:  KRSPTPLVGFSGESVITEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGKPVIHSFRAIPSTLHQVLKYLTPNGVGTVRGEQAASRECYASALKG
        K+SPTPLVGFSGESV+ EGCIDLPVTLGQD+TRVTQMAEFVV+DGRSAYNAIFG+P+IHSFRAIPSTLHQVLKY TPNGVGTVRGEQ ASRECYAS LKG
Subjt:  KRSPTPLVGFSGESVITEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGKPVIHSFRAIPSTLHQVLKYLTPNGVGTVRGEQAASRECYASALKG

Query:  SSVCALETLAGRDGALEFEADLPRKEFAAPTKELELIPLLSPAKQV
        +SVCALETL  RDG LEFEADLP +EFAAP +ELEL+PLLS  KQV
Subjt:  SSVCALETLAGRDGALEFEADLPRKEFAAPTKELELIPLLSPAKQV

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]2.9e-27366.11Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGTRGSAPAPTSENFDALQREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGTRGSAPAPTSENFDALQREMEAMR

Query:  TQMRSIEEMYNEMMLAAGAGSRSENRVTRVDVREQRGSHLGCPEDNESEGYTRQRGDLREHLNRKRGSSLRRGQSPSRSHRSSNQQAESSHNP--AGIIT
                                                                                              AESS+NP   G+IT
Subjt:  TQMRSIEEMYNEMMLAAGAGSRSENRVTRVDVREQRGSHLGCPEDNESEGYTRQRGDLREHLNRKRGSSLRRGQSPSRSHRSSNQQAESSHNP--AGIIT

Query:  REEFDQLRGELDAQMEALKVKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTSSAR
        REEFDQL+ + DAQ+EALK +CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIALT SAR
Subjt:  REEFDQLRGELDAQMEALKVKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTSSAR

Query:  LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVL
        LWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETL+EYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPATFAEVL
Subjt:  LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVL

Query:  QKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIERADPKSKDKG-SFSSGRAEYRRVESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLNRPEKLR
        QK KKVIDGQELLRTKTGRPE+ I +GR+GKD  +AD KS+DKG S SS R +YRR  S   +SRPYE +TPTTIPI EILTNIEE+GMEKLL RPEKLR
Subjt:  QKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIERADPKSKDKG-SFSSGRAEYRRVESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLNRPEKLR

Query:  GAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRASSVEKKKERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELARAARR
        G PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++SVEKK+ERKR RTPPRR DRPAVIN             K+KELAR ARR
Subjt:  GAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRASSVEKKKERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELARAARR

Query:  EVCIIREQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANIMSLPTYLALGWTRSQLKRSPTPLVGFSGESVITEGCIDLPVTLG
        EVCIIREQ PT  I FN ADLE VHLPHNDALVIAPLID ++VRR+LVDGGASANI+SL TYLALGWTRSQLK+SPTPLVGFSGES+  EGCIDLPV++ 
Subjt:  EVCIIREQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANIMSLPTYLALGWTRSQLKRSPTPLVGFSGESVITEGCIDLPVTLG

Query:  QDRTRVTQMAEFVVIDGRSAYNAIFGKPVIHSFRAIPSTLHQVLKYLTPNGVGTVRGEQAASRECYASALKGSSVCALETLAGRD
        QD T+VTQMAEFVVIDGRSAYNAIFG+P+IHSFRA+PSTLHQVLKY T NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  QDRTRVTQMAEFVVIDGRSAYNAIFGKPVIHSFRAIPSTLHQVLKYLTPNGVGTVRGEQAASRECYASALKGSSVCALETLAGRD

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088132.0e-27592.23Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQMEALKVKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQ+EALK KCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQMEALKVKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALT SARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETL+EYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIERADPKSKDKGSFSSGRAEYRRVESGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIE ADPKSKDKGSFSSGRAEYRR E+GPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIERADPKSKDKGSFSSGRAEYRRVESGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLNRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRASSVEKKKERKRSRTPPRRTDRPAVINTIFGGPNGG
        ESGMEKLL RPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPR SS EKK+ERKRSRTPPRRTDRPAVINTIFGGP+GG
Subjt:  ESGMEKLLNRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRASSVEKKKERKRSRTPPRRTDRPAVINTIFGGPNGG

Query:  QSGHKRKELARAARREVCIIREQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANIMSLPTYLALGWTRSQLKRSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQ PTCPITF+GADLEEVHLPHNDALVIAPLIDH+VV RVLVDGG SANI+SLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANIMSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVITEGCIDLPVTLGQDRTRVTQMAEFV
        SVI EG IDLPVTLGQD+T+VTQMAEFV
Subjt:  SVITEGCIDLPVTLGQDRTRVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188235.9e-27279.71Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQMEALKVKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQ+EALK KCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQMEALKVKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALT SARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIERADPKSKDKGSFSSGRAEYRRVESGPTRSRPYERFTPTTIPISEIL
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKD E+AD KSKDKGSFSSGRAE+RR  +GPTRSRPYERFTPTTIPISEIL
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIERADPKSKDKGSFSSGRAEYRRVESGPTRSRPYERFTPTTIPISEIL

Query:  TNIEESGMEKLLNRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRASSVEKKKERKRSRTPPRRTDRPAVINTIFGG
        TNIEESGMEKLL RPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPR SS EKK+ERK SRTP RR DRPAVINTIFGG
Subjt:  TNIEESGMEKLLNRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRASSVEKKKERKRSRTPPRRTDRPAVINTIFGG

Query:  PNGGQSGHKRKELARAARREVCIIREQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANIMSLPTYLALGWTRSQLKRSPTPLVG
        P+GGQSGHKRKELARAARREVCIIREQ PTCPITF+ ADLEEVHLPHNDALVIAPLIDH+VVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVG
Subjt:  PNGGQSGHKRKELARAARREVCIIREQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANIMSLPTYLALGWTRSQLKRSPTPLVG

Query:  FSGESVITEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGKPVIHSFRAIPSTLHQVLKYLTPNGVGTVRGEQAASRECYASALKGSSVCALETL
        FS ESVI EGCIDLPVTLG D+T+VTQMAEFVVIDGRSAYNAIFG+P+IHSFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVITEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGKPVIHSFRAIPSTLHQVLKYLTPNGVGTVRGEQAASRECYASALKGSSVCALETL

Query:  AGRDGALEFEADLPRKEFAAPTKELELIPLL
          RDG LEF+A+LPR+EFAAPT+ELEL+PLL
Subjt:  AGRDGALEFEADLPRKEFAAPTKELELIPLL

A0A6J1D9W7 uncharacterized protein LOC1110187085.4e-22594.31Show/hide
Query:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQ
        KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDF AASDAIKCRAFQIALT SARLWYRRLPARSISTYSQLRREFLAQ
Subjt:  KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FSSR Y KKT THLATIRQKEG TL+EYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAKKVIDGQELLRTKTGRP+RKI
Subjt:  FSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRSGKDIERADPKSKDKGSFSSGRAEYRRVESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLNRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
        GRGRSGKD+ERADPKSKDKGSFSSGRAEYRR ESGPT+SRPYERFTPTTIPISEILTNIEESGMEKLL RPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Subjt:  GRGRSGKDIERADPKSKDKGSFSSGRAEYRRVESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLNRPEKLRGAPERRSKDKYCRFHREHGHNTSDC

Query:  WELKRQIEDLIQDGYFKKFVGKPRASSVEKKKERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFNGADLEEVH
        WELKRQIEDLIQDGYFKKFVGKPR SS EKK+ERKRSRTPPRRTDRPAVINTIFGGP+GGQSGHKRKELARAARREVCIIREQGPTCPITF+GAD EEVH
Subjt:  WELKRQIEDLIQDGYFKKFVGKPRASSVEKKKERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFNGADLEEVH

Query:  LPHNDALVIAPLIDHMVVRRVL
        LPHNDA VIAPLIDH+VVRRVL
Subjt:  LPHNDALVIAPLIDHMVVRRVL

A0A6J1DD03 uncharacterized protein LOC1110198991.7e-22388.79Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIERADPKSKDKGSFSSGRAEYRRVESGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG+GRSGKD+E  DPKSKDKGSFS+GRAEYRR E+GPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIERADPKSKDKGSFSSGRAEYRRVESGPTRSRPYERFTP

Query:  TTIPISEILTNIEESGMEKLLNRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRASSVEKKKERKRSRTPPRRTDRP
        TTIPISEILTNIEESGMEKLL RPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPR SS EKK+ERKRSRTPPRRTDRP
Subjt:  TTIPISEILTNIEESGMEKLLNRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRASSVEKKKERKRSRTPPRRTDRP

Query:  AVINTIFGGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANIMSLPTYLALGWTRSQL
        AVINTIFGGP+GGQSGHKRK+LARAARREVCIIREQ PTCPITF+ ADL EVHLPHNDALVIAPLIDH+VVRRVLVDGGASANI+SLPTYLALGWTRSQL
Subjt:  AVINTIFGGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANIMSLPTYLALGWTRSQL

Query:  KRSPTPLVGFSGESVITEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGKPVIHSFRAIPSTLHQVLKYLTPNGVGTVRGEQAASRECYASALKG
        K+SPTPLVGFSGESV+ EGCIDLPVTLGQD+TRVTQMAEFVV+DGRSAYNAIFG+P+IHSFRAIPSTLHQVLKY TPNGVGTVRGEQ ASRECYAS LKG
Subjt:  KRSPTPLVGFSGESVITEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGKPVIHSFRAIPSTLHQVLKYLTPNGVGTVRGEQAASRECYASALKG

Query:  SSVCALETLAGRDGALEFEADLPRKEFAAPTKELELIPLLSPAKQV
        +SVCALETL  RDG LEFEADLP +EFAAP +ELEL+PLLS  KQV
Subjt:  SSVCALETLAGRDGALEFEADLPRKEFAAPTKELELIPLLSPAKQV

A0A6J1DHB3 uncharacterized protein LOC1110204791.4e-27366.11Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGTRGSAPAPTSENFDALQREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGTRGSAPAPTSENFDALQREMEAMR

Query:  TQMRSIEEMYNEMMLAAGAGSRSENRVTRVDVREQRGSHLGCPEDNESEGYTRQRGDLREHLNRKRGSSLRRGQSPSRSHRSSNQQAESSHNP--AGIIT
                                                                                              AESS+NP   G+IT
Subjt:  TQMRSIEEMYNEMMLAAGAGSRSENRVTRVDVREQRGSHLGCPEDNESEGYTRQRGDLREHLNRKRGSSLRRGQSPSRSHRSSNQQAESSHNP--AGIIT

Query:  REEFDQLRGELDAQMEALKVKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTSSAR
        REEFDQL+ + DAQ+EALK +CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIALT SAR
Subjt:  REEFDQLRGELDAQMEALKVKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTSSAR

Query:  LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVL
        LWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETL+EYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPATFAEVL
Subjt:  LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVL

Query:  QKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIERADPKSKDKG-SFSSGRAEYRRVESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLNRPEKLR
        QK KKVIDGQELLRTKTGRPE+ I +GR+GKD  +AD KS+DKG S SS R +YRR  S   +SRPYE +TPTTIPI EILTNIEE+GMEKLL RPEKLR
Subjt:  QKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIERADPKSKDKG-SFSSGRAEYRRVESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLNRPEKLR

Query:  GAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRASSVEKKKERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELARAARR
        G PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++SVEKK+ERKR RTPPRR DRPAVIN             K+KELAR ARR
Subjt:  GAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRASSVEKKKERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELARAARR

Query:  EVCIIREQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANIMSLPTYLALGWTRSQLKRSPTPLVGFSGESVITEGCIDLPVTLG
        EVCIIREQ PT  I FN ADLE VHLPHNDALVIAPLID ++VRR+LVDGGASANI+SL TYLALGWTRSQLK+SPTPLVGFSGES+  EGCIDLPV++ 
Subjt:  EVCIIREQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANIMSLPTYLALGWTRSQLKRSPTPLVGFSGESVITEGCIDLPVTLG

Query:  QDRTRVTQMAEFVVIDGRSAYNAIFGKPVIHSFRAIPSTLHQVLKYLTPNGVGTVRGEQAASRECYASALKGSSVCALETLAGRD
        QD T+VTQMAEFVVIDGRSAYNAIFG+P+IHSFRA+PSTLHQVLKY T NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  QDRTRVTQMAEFVVIDGRSAYNAIFGKPVIHSFRAIPSTLHQVLKYLTPNGVGTVRGEQAASRECYASALKGSSVCALETLAGRD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGACCGAAGGACTCTGGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCGACGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCACCCGGGGTTCAGCCCCGGCTCCAACGAGCGAAAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATAGAGGAAATGTAT
AACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTAACGCGCGTGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCTGTCCCGAAGACAACGA
GAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAGAGGGCAGTCACCATCCCGCTCACACAGGAGCTCCA
ACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGAGAGCTCGACGCTCAGATGGAGGCCTTAAAGGTCAAATGT
GAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCC
TTATGATGGGACGAAGGACCCCAAAGACTATGTTGAGGTTTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTA
CTAGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAA
AAGACAGCAACACATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCAGGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGA
CTCGGCCATGTGCTATTTCCTCACCGGCCTAGCCGACGAAGCCCTCACAGTGAAACTCGGGGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAG
TCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAGGGCAGATCCCAAGTCCAAGGAC
AAGGGATCCTTTTCCAGCGGCCGAGCAGAGTATCGAAGGGTGGAGAGCGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCGACCACGATTCCAATTTCCGAGAT
CCTAACGAACATCGAGGAATCTGGAATGGAAAAACTACTCAATCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCACCGGG
AGCACGGACATAACACGTCAGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGGCTAGCTCGGTA
GAGAAAAAGAAAGAGCGAAAGCGCTCAAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAACGGGGGTCAGTCCGGACATAA
AAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCAACGGTGCAGACTTGGAGGAGGTCCACCTGC
CCCACAATGATGCACTTGTGATTGCTCCCCTGATTGATCATATGGTGGTCAGGAGAGTGTTGGTAGACGGGGGCGCATCCGCTAACATCATGTCCTTACCGACCTACCTC
GCCTTGGGATGGACGAGGTCGCAATTGAAGAGAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCACAGAGGGTTGCATCGACTTGCCGGTCACGCTGGG
GCAGGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAAACCCGTCATCCACTCATTTCGGGCCATTC
CCTCAACACTGCATCAAGTTTTGAAGTATCTCACCCCCAATGGCGTGGGCACGGTTAGAGGAGAACAGGCCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAGGGCTCA
TCGGTCTGCGCCCTCGAAACGCTCGCCGGTAGGGATGGGGCGCTCGAGTTCGAGGCCGACCTGCCAAGGAAGGAGTTTGCCGCACCCACTAAGGAGCTCGAGCTTATTCC
TCTGCTTAGTCCCGCGAAGCAGGTAAGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGACCGAAGGACTCTGGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCGACGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCACCCGGGGTTCAGCCCCGGCTCCAACGAGCGAAAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATAGAGGAAATGTAT
AACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTAACGCGCGTGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCTGTCCCGAAGACAACGA
GAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAGAGGGCAGTCACCATCCCGCTCACACAGGAGCTCCA
ACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGAGAGCTCGACGCTCAGATGGAGGCCTTAAAGGTCAAATGT
GAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCC
TTATGATGGGACGAAGGACCCCAAAGACTATGTTGAGGTTTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTA
CTAGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAA
AAGACAGCAACACATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCAGGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGA
CTCGGCCATGTGCTATTTCCTCACCGGCCTAGCCGACGAAGCCCTCACAGTGAAACTCGGGGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAG
TCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAGGGCAGATCCCAAGTCCAAGGAC
AAGGGATCCTTTTCCAGCGGCCGAGCAGAGTATCGAAGGGTGGAGAGCGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCGACCACGATTCCAATTTCCGAGAT
CCTAACGAACATCGAGGAATCTGGAATGGAAAAACTACTCAATCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCACCGGG
AGCACGGACATAACACGTCAGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGGCTAGCTCGGTA
GAGAAAAAGAAAGAGCGAAAGCGCTCAAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAACGGGGGTCAGTCCGGACATAA
AAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCAACGGTGCAGACTTGGAGGAGGTCCACCTGC
CCCACAATGATGCACTTGTGATTGCTCCCCTGATTGATCATATGGTGGTCAGGAGAGTGTTGGTAGACGGGGGCGCATCCGCTAACATCATGTCCTTACCGACCTACCTC
GCCTTGGGATGGACGAGGTCGCAATTGAAGAGAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCACAGAGGGTTGCATCGACTTGCCGGTCACGCTGGG
GCAGGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAAACCCGTCATCCACTCATTTCGGGCCATTC
CCTCAACACTGCATCAAGTTTTGAAGTATCTCACCCCCAATGGCGTGGGCACGGTTAGAGGAGAACAGGCCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAGGGCTCA
TCGGTCTGCGCCCTCGAAACGCTCGCCGGTAGGGATGGGGCGCTCGAGTTCGAGGCCGACCTGCCAAGGAAGGAGTTTGCCGCACCCACTAAGGAGCTCGAGCTTATTCC
TCTGCTTAGTCCCGCGAAGCAGGTAAGCTAG
Protein sequenceShow/hide protein sequence
MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGTRGSAPAPTSENFDALQREMEAMRTQMRSIEEMY
NEMMLAAGAGSRSENRVTRVDVREQRGSHLGCPEDNESEGYTRQRGDLREHLNRKRGSSLRRGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQMEALKVKC
EQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTSSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDK
KTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIERADPKSKD
KGSFSSGRAEYRRVESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLNRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRASSV
EKKKERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELARAARREVCIIREQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHMVVRRVLVDGGASANIMSLPTYL
ALGWTRSQLKRSPTPLVGFSGESVITEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGKPVIHSFRAIPSTLHQVLKYLTPNGVGTVRGEQAASRECYASALKGS
SVCALETLAGRDGALEFEADLPRKEFAAPTKELELIPLLSPAKQVS