; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g14080 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g14080
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr8:10692744..10697090
RNA-Seq ExpressionMoc08g14080
SyntenyMoc08g14080
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]2.6e-25086.93Show/hide
Query:  QAESSHNPATSAGVITREEFDQLRDQLDAQVEALKAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESS NPAT AGVITREEFDQLR QLDAQVEALKAKCEQKEG LNDG+LGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHNPATSAGVITREEFDQLRDQLDAQVEALKAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTDSARLWYRRLPTRSISTYSQLRREFLTQFSSRHYDKKIANHLATIRQKEGETLREYVARFQEEQLKVAHCSDDSAMCYFLTGLSDEAH
        AIKCRAF+IALT SARLWYRRLP  SISTYSQLRREFL  FSSRHYDKK A HLATIRQKEGETLREYV RFQEEQLKVAHCSDDSAMCYFLTGL+DEA 
Subjt:  AIKCRAFQIALTDSARLWYRRLPTRSISTYSQLRREFLTQFSSRHYDKKIANHLATIRQKEGETLREYVARFQEEQLKVAHCSDDSAMCYFLTGLSDEAH

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPRSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTKIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIE ADP+SKDKGSFSSGRAEYRRAENGPT SRPYERFTPT IPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPRSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTKIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIEDLIQDGYFKKFLGKPRTGSAEKKEERKCSRTPPRRTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIE+LIQDGYFKKF+GKPRT SAEKKEERK SRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIEDLIQDGYFKKFLGKPRTGSAEKKEERKCSRTPPRRTDRPAVINTIFGGPSGG

Query:  HSGHKRKELARAARREVCIIRE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
         SG KRKELARAARREVCIIRE                                     RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  HSGHKRKELARAARREVCIIRE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIQEGCIDLPVTLGHDQTQVTQMAEFV
        SVI EG IDLPVTLG DQTQVTQMAEFV
Subjt:  SVIQEGCIDLPVTLGHDQTQVTQMAEFV

XP_022141796.1 uncharacterized protein LOC111012081 [Momordica charantia]8.5e-20191.41Show/hide
Query:  VITREEFDQLRDQLDAQVEALKAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTD
        +ITREEFDQLR QLDAQ EALKAKCEQKEG LNDG+LGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQA SDAIKCRAFQIALT 
Subjt:  VITREEFDQLRDQLDAQVEALKAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTD

Query:  SARLWYRRLPTRSISTYSQLRREFLTQFSSRHYDKKIANHLATIRQKEGETLREYVARFQEEQLKVAHCSDDSAMCYFLTGLSDEAHTVKLGEEAPATFA
        SARLWYRRLP RSISTYSQLRREFL QFSSRHYDKK A HLATIRQKEGETLREYV RFQEEQLKVAHCSDDSAMCYFLTGL+DEA TVKLGEEAP+TF 
Subjt:  SARLWYRRLPTRSISTYSQLRREFLTQFSSRHYDKKIANHLATIRQKEGETLREYVARFQEEQLKVAHCSDDSAMCYFLTGLSDEAHTVKLGEEAPATFA

Query:  EVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPRSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTKIPISEILTNIEESGMEKLLKRPEK
        EVLQK KKVIDG ELLRTKTGRPERKI RGRSGKDIEK DP+SKDKGSFSSGR EYRRAENGPT SRPYERFTPT IPI EILT IEESGMEKLLKRPEK
Subjt:  EVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPRSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTKIPISEILTNIEESGMEKLLKRPEK

Query:  LRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIEDLIQDGYFKKFLGKPRTGSAEKKEERKCSRTPPRRTDRPAVINTIFGGPSGGHSGHKRKEL
        LR   ERRSKDKYCRFHREHGHNTSD WELKRQIEDLIQDGYFKKF+GKPRT SAEKKEERK SRTPPRRTDRPAVINTIFGGPSGG SGHKRKEL
Subjt:  LRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIEDLIQDGYFKKFLGKPRTGSAEKKEERKCSRTPPRRTDRPAVINTIFGGPSGGHSGHKRKEL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]2.2e-24976.23Show/hide
Query:  SSNQQAESSHNPATSAGVITREEFDQLRDQLDAQVEALKAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPAT  GVITREEFDQLR +L+AQVEALKAKCEQKEG LNDG+LGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPATSAGVITREEFDQLRDQLDAQVEALKAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTDSARLWYRRLPTRSISTYSQLRREFLTQFSSRHYDKKIANHLATIRQKEGETLREYVARFQEEQLKVAHCSDDSAMCYFLTGLS
        AASDAIKCRAFQIALT SARLW                                                     FQE+QLKVA  SDDSAMCYFLTGL+
Subjt:  AASDAIKCRAFQIALTDSARLWYRRLPTRSISTYSQLRREFLTQFSSRHYDKKIANHLATIRQKEGETLREYVARFQEEQLKVAHCSDDSAMCYFLTGLS

Query:  DEAHTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPRSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTKIPISEIL
        DEA TVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKD EKAD +SKDKGSFSSGRAE+RRA NGPT SRPYERFTPT IPISEIL
Subjt:  DEAHTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPRSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTKIPISEIL

Query:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIEDLIQDGYFKKFLGKPRTGSAEKKEERKCSRTPPRRTDRPAVINTIFGG
        TNIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKF+GKPRT SAEKKEERK SRTP RR DRPAVINTIFGG
Subjt:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIEDLIQDGYFKKFLGKPRTGSAEKKEERKCSRTPPRRTDRPAVINTIFGG

Query:  PSGGHSGHKRKELARAARREVCIIRE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG
        PSGG SGHKRKELARAARREVCIIRE                                     RVLVD G SANI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  PSGGHSGHKRKELARAARREVCIIRE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG

Query:  FSGESVIQEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYNVIFGRPIIHLFRAIPSTLHQILKYFTPNGVGTVRGEQTASRECYASALKGSSVCALETL
        FS ESVI EGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYN IFGRPIIH FRAIPSTLHQ+LKY TPNGVG VRGEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVIQEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYNVIFGRPIIHLFRAIPSTLHQILKYFTPNGVGTVRGEQTASRECYASALKGSSVCALETL

Query:  AGRDGTLEFEADLPRRESAAPTEELELVPLL
          RDGTLEF+A+LPRRE AAPTEELELVPLL
Subjt:  AGRDGTLEFEADLPRRESAAPTEELELVPLL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]9.5e-25364.4Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITATVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT  VLPPAHP+ SK                                    
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITATVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRSMDEMGSHLGPVEEEHPEDIESEGHTRRGGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATSAGVITREEFDQLRDQLDAQVEALKAK
                                                                        AESS+NP T  GVITREEFDQL+ + DAQVEALKA+
Subjt:  TQMRSMDEMGSHLGPVEEEHPEDIESEGHTRRGGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATSAGVITREEFDQLRDQLDAQVEALKAK

Query:  CEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTDSARLWYRRLPTRSISTYSQLRREF
        CE+KE S +DG+LGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIALT SARLWYRRLP R ISTYSQLR+EF
Subjt:  CEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTDSARLWYRRLPTRSISTYSQLRREF

Query:  LTQFSSRHYDKKIANHLATIRQKEGETLREYVARFQEEQLKVAHCSDDSAMCYFLTGLSDEAHTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE
        ++QFSSRHYD+K   HLATIRQKEGETLREYV RF EEQLKVAHCSDDSAMCYFLTGL+DE  TVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE
Subjt:  LTQFSSRHYDKKIANHLATIRQKEGETLREYVARFQEEQLKVAHCSDDSAMCYFLTGLSDEAHTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE

Query:  RKIGRGRSGKDIEKADPRSKDKG-SFSSGRAEYRRAENGPTSSRPYERFTPTKIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHN
        + I +GR+GKD  KAD +S+DKG S SS R +YRR+ +    SRPYE +TPT IPI EILTNIEE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHN
Subjt:  RKIGRGRSGKDIEKADPRSKDKG-SFSSGRAEYRRAENGPTSSRPYERFTPTKIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHN

Query:  TSDYWELKRQIEDLIQDGYFKKFLGKPRTGSAEKKEERKCSRTPPRRTDRPAVINTIFGGPSGGHSGHKRKELARAARREVCIIRE--------------
        TS+YWELKRQIEDLIQDGYFKKF+GKPR+ S EKKEERK  RTPPRR DRPAVIN             K+KELAR ARREVCIIRE              
Subjt:  TSDYWELKRQIEDLIQDGYFKKFLGKPRTGSAEKKEERKCSRTPPRRTDRPAVINTIFGGPSGGHSGHKRKELARAARREVCIIRE--------------

Query:  -----------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIQEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAY
                               R+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGES+  EGCIDLPV++  D TQVTQMAEFVVIDGRSAY
Subjt:  -----------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIQEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAY

Query:  NVIFGRPIIHLFRAIPSTLHQILKYFTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD
        N IFGRPIIH FRA+PSTLHQ+LKY T NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  NVIFGRPIIHLFRAIPSTLHQILKYFTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia]5.3e-21193.09Show/hide
Query:  GVITREEFDQLRDQLDAQVEALKAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
        G+ITREEFDQLR +LDAQVEALKAKCEQK+ SLNDG+LGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
Subjt:  GVITREEFDQLRDQLDAQVEALKAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT

Query:  DSARLWYRRLPTRSISTYSQLRREFLTQFSSRHYDKKIANHLATIRQKEGETLREYVARFQEEQLKVAHCSDDSAMCYFLTGLSDEAHTVKLGEEAPATF
         SARLWYRRLP RSISTYSQLRREFL QFSSRHYDKK A HLATIRQKEGETLREYV RFQEEQLKVAHCSDDSAMCYFLTGL+DEA TVKLGEEAPATF
Subjt:  DSARLWYRRLPTRSISTYSQLRREFLTQFSSRHYDKKIANHLATIRQKEGETLREYVARFQEEQLKVAHCSDDSAMCYFLTGLSDEAHTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPRSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTKIPISEILTNIEESGMEKLLKRPE
        AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD+E+ADP+SKDKGSFSSGRAEYRRAENGPT SRPYERFTPT IPI EILTNIEESGMEKLLKRPE
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPRSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTKIPISEILTNIEESGMEKLLKRPE

Query:  KLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIEDLIQDGYFKKFLGKPRTGSAEKKEERKCSRTPPRRTDRPAVINTIFGGPSGGHSGHKRKELARA
        KLRGAPERRSKDKYCRFHREHGHNTSD+WELKRQIEDLIQDGYFKKF+GKPRT SAEKKEERK SRTPPRRTDRPAVINTIFGGPSGG  GHKRKELARA
Subjt:  KLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIEDLIQDGYFKKFLGKPRTGSAEKKEERKCSRTPPRRTDRPAVINTIFGGPSGGHSGHKRKELARA

Query:  ARREV
        ARRE+
Subjt:  ARREV

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.3e-25086.93Show/hide
Query:  QAESSHNPATSAGVITREEFDQLRDQLDAQVEALKAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESS NPAT AGVITREEFDQLR QLDAQVEALKAKCEQKEG LNDG+LGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHNPATSAGVITREEFDQLRDQLDAQVEALKAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTDSARLWYRRLPTRSISTYSQLRREFLTQFSSRHYDKKIANHLATIRQKEGETLREYVARFQEEQLKVAHCSDDSAMCYFLTGLSDEAH
        AIKCRAF+IALT SARLWYRRLP  SISTYSQLRREFL  FSSRHYDKK A HLATIRQKEGETLREYV RFQEEQLKVAHCSDDSAMCYFLTGL+DEA 
Subjt:  AIKCRAFQIALTDSARLWYRRLPTRSISTYSQLRREFLTQFSSRHYDKKIANHLATIRQKEGETLREYVARFQEEQLKVAHCSDDSAMCYFLTGLSDEAH

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPRSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTKIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIE ADP+SKDKGSFSSGRAEYRRAENGPT SRPYERFTPT IPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPRSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTKIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIEDLIQDGYFKKFLGKPRTGSAEKKEERKCSRTPPRRTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIE+LIQDGYFKKF+GKPRT SAEKKEERK SRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIEDLIQDGYFKKFLGKPRTGSAEKKEERKCSRTPPRRTDRPAVINTIFGGPSGG

Query:  HSGHKRKELARAARREVCIIRE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
         SG KRKELARAARREVCIIRE                                     RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  HSGHKRKELARAARREVCIIRE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIQEGCIDLPVTLGHDQTQVTQMAEFV
        SVI EG IDLPVTLG DQTQVTQMAEFV
Subjt:  SVIQEGCIDLPVTLGHDQTQVTQMAEFV

A0A6J1CKB3 uncharacterized protein LOC1110120814.1e-20191.41Show/hide
Query:  VITREEFDQLRDQLDAQVEALKAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTD
        +ITREEFDQLR QLDAQ EALKAKCEQKEG LNDG+LGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQA SDAIKCRAFQIALT 
Subjt:  VITREEFDQLRDQLDAQVEALKAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTD

Query:  SARLWYRRLPTRSISTYSQLRREFLTQFSSRHYDKKIANHLATIRQKEGETLREYVARFQEEQLKVAHCSDDSAMCYFLTGLSDEAHTVKLGEEAPATFA
        SARLWYRRLP RSISTYSQLRREFL QFSSRHYDKK A HLATIRQKEGETLREYV RFQEEQLKVAHCSDDSAMCYFLTGL+DEA TVKLGEEAP+TF 
Subjt:  SARLWYRRLPTRSISTYSQLRREFLTQFSSRHYDKKIANHLATIRQKEGETLREYVARFQEEQLKVAHCSDDSAMCYFLTGLSDEAHTVKLGEEAPATFA

Query:  EVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPRSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTKIPISEILTNIEESGMEKLLKRPEK
        EVLQK KKVIDG ELLRTKTGRPERKI RGRSGKDIEK DP+SKDKGSFSSGR EYRRAENGPT SRPYERFTPT IPI EILT IEESGMEKLLKRPEK
Subjt:  EVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPRSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTKIPISEILTNIEESGMEKLLKRPEK

Query:  LRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIEDLIQDGYFKKFLGKPRTGSAEKKEERKCSRTPPRRTDRPAVINTIFGGPSGGHSGHKRKEL
        LR   ERRSKDKYCRFHREHGHNTSD WELKRQIEDLIQDGYFKKF+GKPRT SAEKKEERK SRTPPRRTDRPAVINTIFGGPSGG SGHKRKEL
Subjt:  LRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIEDLIQDGYFKKFLGKPRTGSAEKKEERKCSRTPPRRTDRPAVINTIFGGPSGGHSGHKRKEL

A0A6J1D9E1 uncharacterized protein LOC1110188231.1e-24976.23Show/hide
Query:  SSNQQAESSHNPATSAGVITREEFDQLRDQLDAQVEALKAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ
        SSNQQAESSHNPAT  GVITREEFDQLR +L+AQVEALKAKCEQKEG LNDG+LGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSHNPATSAGVITREEFDQLRDQLDAQVEALKAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTDSARLWYRRLPTRSISTYSQLRREFLTQFSSRHYDKKIANHLATIRQKEGETLREYVARFQEEQLKVAHCSDDSAMCYFLTGLS
        AASDAIKCRAFQIALT SARLW                                                     FQE+QLKVA  SDDSAMCYFLTGL+
Subjt:  AASDAIKCRAFQIALTDSARLWYRRLPTRSISTYSQLRREFLTQFSSRHYDKKIANHLATIRQKEGETLREYVARFQEEQLKVAHCSDDSAMCYFLTGLS

Query:  DEAHTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPRSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTKIPISEIL
        DEA TVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKD EKAD +SKDKGSFSSGRAE+RRA NGPT SRPYERFTPT IPISEIL
Subjt:  DEAHTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPRSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTKIPISEIL

Query:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIEDLIQDGYFKKFLGKPRTGSAEKKEERKCSRTPPRRTDRPAVINTIFGG
        TNIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKF+GKPRT SAEKKEERK SRTP RR DRPAVINTIFGG
Subjt:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIEDLIQDGYFKKFLGKPRTGSAEKKEERKCSRTPPRRTDRPAVINTIFGG

Query:  PSGGHSGHKRKELARAARREVCIIRE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG
        PSGG SGHKRKELARAARREVCIIRE                                     RVLVD G SANI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  PSGGHSGHKRKELARAARREVCIIRE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG

Query:  FSGESVIQEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYNVIFGRPIIHLFRAIPSTLHQILKYFTPNGVGTVRGEQTASRECYASALKGSSVCALETL
        FS ESVI EGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYN IFGRPIIH FRAIPSTLHQ+LKY TPNGVG VRGEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVIQEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYNVIFGRPIIHLFRAIPSTLHQILKYFTPNGVGTVRGEQTASRECYASALKGSSVCALETL

Query:  AGRDGTLEFEADLPRRESAAPTEELELVPLL
          RDGTLEF+A+LPRRE AAPTEELELVPLL
Subjt:  AGRDGTLEFEADLPRRESAAPTEELELVPLL

A0A6J1DHB3 uncharacterized protein LOC1110204794.6e-25364.4Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITATVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT  VLPPAHP+ SK                                    
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITATVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRSMDEMGSHLGPVEEEHPEDIESEGHTRRGGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATSAGVITREEFDQLRDQLDAQVEALKAK
                                                                        AESS+NP T  GVITREEFDQL+ + DAQVEALKA+
Subjt:  TQMRSMDEMGSHLGPVEEEHPEDIESEGHTRRGGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATSAGVITREEFDQLRDQLDAQVEALKAK

Query:  CEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTDSARLWYRRLPTRSISTYSQLRREF
        CE+KE S +DG+LGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIALT SARLWYRRLP R ISTYSQLR+EF
Subjt:  CEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTDSARLWYRRLPTRSISTYSQLRREF

Query:  LTQFSSRHYDKKIANHLATIRQKEGETLREYVARFQEEQLKVAHCSDDSAMCYFLTGLSDEAHTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE
        ++QFSSRHYD+K   HLATIRQKEGETLREYV RF EEQLKVAHCSDDSAMCYFLTGL+DE  TVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE
Subjt:  LTQFSSRHYDKKIANHLATIRQKEGETLREYVARFQEEQLKVAHCSDDSAMCYFLTGLSDEAHTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE

Query:  RKIGRGRSGKDIEKADPRSKDKG-SFSSGRAEYRRAENGPTSSRPYERFTPTKIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHN
        + I +GR+GKD  KAD +S+DKG S SS R +YRR+ +    SRPYE +TPT IPI EILTNIEE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHN
Subjt:  RKIGRGRSGKDIEKADPRSKDKG-SFSSGRAEYRRAENGPTSSRPYERFTPTKIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHN

Query:  TSDYWELKRQIEDLIQDGYFKKFLGKPRTGSAEKKEERKCSRTPPRRTDRPAVINTIFGGPSGGHSGHKRKELARAARREVCIIRE--------------
        TS+YWELKRQIEDLIQDGYFKKF+GKPR+ S EKKEERK  RTPPRR DRPAVIN             K+KELAR ARREVCIIRE              
Subjt:  TSDYWELKRQIEDLIQDGYFKKFLGKPRTGSAEKKEERKCSRTPPRRTDRPAVINTIFGGPSGGHSGHKRKELARAARREVCIIRE--------------

Query:  -----------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIQEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAY
                               R+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGES+  EGCIDLPV++  D TQVTQMAEFVVIDGRSAY
Subjt:  -----------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIQEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAY

Query:  NVIFGRPIIHLFRAIPSTLHQILKYFTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD
        N IFGRPIIH FRA+PSTLHQ+LKY T NGVGTVRGE   SRECYAS  K SSVCALE    RD
Subjt:  NVIFGRPIIHLFRAIPSTLHQILKYFTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRD

A0A6J1DS95 uncharacterized protein LOC1110234212.6e-21193.09Show/hide
Query:  GVITREEFDQLRDQLDAQVEALKAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
        G+ITREEFDQLR +LDAQVEALKAKCEQK+ SLNDG+LGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
Subjt:  GVITREEFDQLRDQLDAQVEALKAKCEQKEGSLNDGNLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT

Query:  DSARLWYRRLPTRSISTYSQLRREFLTQFSSRHYDKKIANHLATIRQKEGETLREYVARFQEEQLKVAHCSDDSAMCYFLTGLSDEAHTVKLGEEAPATF
         SARLWYRRLP RSISTYSQLRREFL QFSSRHYDKK A HLATIRQKEGETLREYV RFQEEQLKVAHCSDDSAMCYFLTGL+DEA TVKLGEEAPATF
Subjt:  DSARLWYRRLPTRSISTYSQLRREFLTQFSSRHYDKKIANHLATIRQKEGETLREYVARFQEEQLKVAHCSDDSAMCYFLTGLSDEAHTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPRSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTKIPISEILTNIEESGMEKLLKRPE
        AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD+E+ADP+SKDKGSFSSGRAEYRRAENGPT SRPYERFTPT IPI EILTNIEESGMEKLLKRPE
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPRSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTKIPISEILTNIEESGMEKLLKRPE

Query:  KLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIEDLIQDGYFKKFLGKPRTGSAEKKEERKCSRTPPRRTDRPAVINTIFGGPSGGHSGHKRKELARA
        KLRGAPERRSKDKYCRFHREHGHNTSD+WELKRQIEDLIQDGYFKKF+GKPRT SAEKKEERK SRTPPRRTDRPAVINTIFGGPSGG  GHKRKELARA
Subjt:  KLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIEDLIQDGYFKKFLGKPRTGSAEKKEERKCSRTPPRRTDRPAVINTIFGGPSGGHSGHKRKELARA

Query:  ARREV
        ARRE+
Subjt:  ARREV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCGAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGACTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGATGAAATGGGT
TCCCACCTCGGCCCAGTCGAGGAAGAACATCCCGAAGACATCGAGAGCGAGGGACACACTCGCCGGGGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCATC
TCTCCGAAAAGGACAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTTCTGCAGGAGTGATAACAAGGGAGGAGTTCG
ACCAGCTGAGGGACCAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTTCACTGAACGATGGCAACTTGGGAGAATCGCCTTTCACCTCG
GACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCAAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGA
TTTCCAGGCGGCATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCACTTACTGACAGCGCGCGATTGTGGTATCGGAGACTGCCAACCAGGTCAATCTCAACTTACT
CTCAGCTGAGAAGGGAGTTCCTCACCCAGTTCTCTTCTCGGCATTATGACAAAAAGATAGCAAACCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAA
TATGTCGCCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTATCCGACGAGGCCCACACGGTGAAACT
TGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCG
GCCGGGGTAGAAGTGGAAAAGATATAGAAAAGGCAGATCCAAGGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACC
AGCAGCCGACCTTACGAACGCTTCACTCCGACCAAGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAATCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCT
TCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGTTTCCATCGGGAGCACGGCCATAACACGTCGGACTACTGGGAGTTGAAGCGCCAAATTGAGGATCTAA
TTCAAGATGGCTACTTCAAGAAATTTTTGGGAAAGCCGAGGACTGGCTCGGCAGAGAAAAAGGAAGAGCGGAAGTGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCT
GCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGAGGTCATTCCGGACATAAAAGAAAGGAGCTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGAGGGT
GCTGGTAGACGGAGGCGCATCTGCTAACATCCTATCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCT
CTGGAGAATCGGTCATCCAAGAGGGTTGCATCGACTTGCCGGTCACACTTGGGCATGACCAAACTCAGGTCACCCAAATGGCAGAGTTCGTGGTAATTGACGGTAGATCG
GCCTATAACGTCATCTTTGGGAGACCCATCATCCACCTATTTCGGGCCATTCCCTCGACACTTCATCAAATTTTGAAGTATTTCACCCCCAATGGCGTGGGCACAGTCCG
AGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCAGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCG
ACCTGCCGAGGAGGGAGTCTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAAGGGCAACTCACCGCAAGACCCCAAGGAGCGCAGAAA
GTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGTACCGACGCGGCTTTTCCCTACCTCTATTGAGATGCCTAACCCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCGAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGACTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGATGAAATGGGT
TCCCACCTCGGCCCAGTCGAGGAAGAACATCCCGAAGACATCGAGAGCGAGGGACACACTCGCCGGGGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCATC
TCTCCGAAAAGGACAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTTCTGCAGGAGTGATAACAAGGGAGGAGTTCG
ACCAGCTGAGGGACCAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTTCACTGAACGATGGCAACTTGGGAGAATCGCCTTTCACCTCG
GACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCAAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGA
TTTCCAGGCGGCATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCACTTACTGACAGCGCGCGATTGTGGTATCGGAGACTGCCAACCAGGTCAATCTCAACTTACT
CTCAGCTGAGAAGGGAGTTCCTCACCCAGTTCTCTTCTCGGCATTATGACAAAAAGATAGCAAACCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAA
TATGTCGCCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTATCCGACGAGGCCCACACGGTGAAACT
TGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCG
GCCGGGGTAGAAGTGGAAAAGATATAGAAAAGGCAGATCCAAGGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACC
AGCAGCCGACCTTACGAACGCTTCACTCCGACCAAGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAATCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCT
TCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGTTTCCATCGGGAGCACGGCCATAACACGTCGGACTACTGGGAGTTGAAGCGCCAAATTGAGGATCTAA
TTCAAGATGGCTACTTCAAGAAATTTTTGGGAAAGCCGAGGACTGGCTCGGCAGAGAAAAAGGAAGAGCGGAAGTGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCT
GCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGAGGTCATTCCGGACATAAAAGAAAGGAGCTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGAGGGT
GCTGGTAGACGGAGGCGCATCTGCTAACATCCTATCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCT
CTGGAGAATCGGTCATCCAAGAGGGTTGCATCGACTTGCCGGTCACACTTGGGCATGACCAAACTCAGGTCACCCAAATGGCAGAGTTCGTGGTAATTGACGGTAGATCG
GCCTATAACGTCATCTTTGGGAGACCCATCATCCACCTATTTCGGGCCATTCCCTCGACACTTCATCAAATTTTGAAGTATTTCACCCCCAATGGCGTGGGCACAGTCCG
AGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCAGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCG
ACCTGCCGAGGAGGGAGTCTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAAGGGCAACTCACCGCAAGACCCCAAGGAGCGCAGAAA
GTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGTACCGACGCGGCTTTTCCCTACCTCTATTGAGATGCCTAACCCCTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITATVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRSMDEMG
SHLGPVEEEHPEDIESEGHTRRGGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATSAGVITREEFDQLRDQLDAQVEALKAKCEQKEGSLNDGNLGESPFTS
DVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTDSARLWYRRLPTRSISTYSQLRREFLTQFSSRHYDKKIANHLATIRQKEGETLRE
YVARFQEEQLKVAHCSDDSAMCYFLTGLSDEAHTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPRSKDKGSFSSGRAEYRRAENGPT
SSRPYERFTPTKIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIEDLIQDGYFKKFLGKPRTGSAEKKEERKCSRTPPRRTDRP
AVINTIFGGPSGGHSGHKRKELARAARREVCIIRERVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIQEGCIDLPVTLGHDQTQVTQMAEFVVIDGRS
AYNVIFGRPIIHLFRAIPSTLHQILKYFTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGRDGTLEFEADLPRRESAAPTEELELVPLLSPEKQGQLTARPQGAQK
VGKASSSVRGPRWSIVPTRLFPTSIEMPNP