CuGenDBv2

Moc05g22280 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc05g22280
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr5:15975217..15980761
RNA-Seq Expression: Moc05g22280
Synteny: Moc05g22280
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
                  IPR021109 - Aspartic peptidase domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia] | e value: 3.8e-244 | % identity: 85.61
Query:  QAESSHNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKANTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESS NPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEG LNDGDLGESPFTSDVLEAPIPPKFKA TVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKANTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRVFQIALIGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLTDEAL
        AIKCR F+IAL GSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGL DEAL
Subjt:  AIKCRVFQIALIGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLTDEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADLK-----------------------SRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIE AD K                       SRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADLK-----------------------SRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPR TDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQRPTCPIT-----------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVRFSGE
        QSG KRKELARAARREVCIIREQRPTCPIT                             RVLVDGG SANILSLPTYLALGWTRSQLKKS TPLV FSGE
Subjt:  QSGHKRKELARAARREVCIIREQRPTCPIT-----------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVRFSGE

Query:  SVIPEGCVDLPVTLGQDQTPVTQMAEFV
        SVIPEG +DLPVTLGQDQT VTQMAEFV
Subjt:  SVIPEGCVDLPVTLGQDQTPVTQMAEFV

XP_022141796.1 uncharacterized protein LOC111012081 [Momordica charantia] | e value: 1.3e-191 | % identity: 88.64
Query:  VITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKANTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFQIALIG
        +ITREEFDQLRGQLDAQ EALKAKCEQKEG LNDGDLGESPFTSDVLEAPIPPKFKA TVKPYDGSKDPKDYVEVFEGLMDFQA SDAIKCR FQIAL G
Subjt:  VITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKANTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFQIALIG

Query:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLTDEALTVKLGEEAPATFA
        SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGL DEALTVKLGEEAP+TF 
Subjt:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLTDEALTVKLGEEAPATFA

Query:  EVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADLK-----------------------SRPYERFTPTTIPISEILTNIEESGMEKLLKRPEK
        EVLQK KKVIDG ELLRTKTGRPERKI RGRSGKDIEK D K                       SRPYERFTPTTIPI EILT IEESGMEKLLKRPEK
Subjt:  EVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADLK-----------------------SRPYERFTPTTIPISEILTNIEESGMEKLLKRPEK

Query:  LRGAPERRSKDKYCRFHREHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKEL
        LR   ERRSKDKYCRFHREHGHNTSDCWELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPR TDRPAVINTIFGGPSGGQSGHKRKEL
Subjt:  LRGAPERRSKDKYCRFHREHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKEL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia] | e value: 8.5e-236 | % identity: 73.64
Query:  STNQQAESSHNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKANTVKPYDGSKDPKDYVEVFEGLMDFQ
        S+NQQAESSHNPATP GVITREEFDQLRG+L+AQVEALKAKCEQKEG LNDGDLGESPFTSDVLEAP        TVK YDGSKDPKDYVEVFEGLMDFQ
Subjt:  STNQQAESSHNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKANTVKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRVFQIALIGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLT
        AASDAIKCR FQIAL GSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGL 
Subjt:  AASDAIKCRVFQIALIGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLT

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADLK-----------------------SRPYERFTPTTIPISEIL
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKD EKADLK                       SRPYERFTPTTIPISEIL
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADLK-----------------------SRPYERFTPTTIPISEIL

Query:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRCTDRPAVINTIFGG
        TNIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELK QIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP R  DRPAVINTIFGG
Subjt:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRCTDRPAVINTIFGG

Query:  PSGGQSGHKRKELARAARREVCIIREQRPTCPIT-----------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVR
        PSGGQSGHKRKELARAARREVCIIREQRPTCPIT                             RVLVD G SANI+SL TYLALGWTRSQLKKS TPLV 
Subjt:  PSGGQSGHKRKELARAARREVCIIREQRPTCPIT-----------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVR

Query:  FSGESVIPEGCVDLPVTLGQDQTPVTQMAEFVVIDGRSANNAIFGRPIIHSFRALPSTLHQVFKYSTPNGVGTVGGEQTASRECYASALKGSSVCALETL
        FS ESVIPEGC+DLPVTLG DQT VTQMAEFVVIDGRSA NAIFGRPIIHSFRA+PSTLHQV KYSTPNGVG V GEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVIPEGCVDLPVTLGQDQTPVTQMAEFVVIDGRSANNAIFGRPIIHSFRALPSTLHQVFKYSTPNGVGTVGGEQTASRECYASALKGSSVCALETL

Query:  ASRDETLEFEADLPRR--SVPVEILD
         SRD TLEF+A+LPRR  + P E L+
Subjt:  ASRDETLEFEADLPRR--SVPVEILD

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] | e value: 5.0e-252 | % identity: 65.51
Query:  MVQPANSTNTADRRTLAASDAHQREVGAVVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKRGARGPAPAPPSENFNALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAHP+ SK                                    
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAVVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKRGARGPAPAPPSENFNALQREMEAMR

Query:  TQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGQSPSRSHRSTNQQAESSHNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLG
                                                          AESS+NP TP GVITREEFDQL+ + DAQVEALKA+CE+KE S +DGDLG
Subjt:  TQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGQSPSRSHRSTNQQAESSHNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLG

Query:  ESPFTSDVLEAPIPPKFKANTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFQIALIGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTA
        E  F+SD+LEA IPPKFK  T+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC  FQIAL GSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT 
Subjt:  ESPFTSDVLEAPIPPKFKANTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFQIALIGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTA

Query:  THLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLTDEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEK
        THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGL DE LTVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD  K
Subjt:  THLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLTDEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEK

Query:  ADLK------------------------SRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKCQIEDL
        AD K                        SRPYE +TPTTIPI EILTNIEE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELK QIEDL
Subjt:  ADLK------------------------SRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKCQIEDL

Query:  IQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPIT--------------------
        IQDGYFKKFVGKPR++S EKKEERKR RTPPR  DRPAVIN             K+KELAR ARREVCIIREQRPT  I                     
Subjt:  IQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPIT--------------------

Query:  ---------RVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVRFSGESVIPEGCVDLPVTLGQDQTPVTQMAEFVVIDGRSANNAIFGRPIIHSFRA
                 R+LVDGGASANILSL TYLALGWTRSQLKKS TPLV FSGES+  EGC+DLPV++ QD T VTQMAEFVVIDGRSA NAIFGRPIIHSFRA
Subjt:  ---------RVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVRFSGESVIPEGCVDLPVTLGQDQTPVTQMAEFVVIDGRSANNAIFGRPIIHSFRA

Query:  LPSTLHQVFKYSTPNGVGTVGGEQTASRECYASALKGSSVCALETLASRDE
        +PSTLHQV KYST NGVGTV GE   SRECYAS  K SSVCALE    RDE
Subjt:  LPSTLHQVFKYSTPNGVGTVGGEQTASRECYASALKGSSVCALETLASRDE

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia] | e value: 2.2e-199 | % identity: 88.59
Query:  GVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKANTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFQIALI
        G+ITREEFDQLRG+LDAQVEALKAKCEQK+ SLNDGDLGESPFTSDVLEAPIPPKFKA TVKPYDG+KDPKDYVEVFEGLMDFQAASDAIKCR FQIAL 
Subjt:  GVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKANTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFQIALI

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLTDEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGL DEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLTDEALTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADLK-----------------------SRPYERFTPTTIPISEILTNIEESGMEKLLKRPE
        AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD+E+AD K                       SRPYERFTPTTIPI EILTNIEESGMEKLLKRPE
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADLK-----------------------SRPYERFTPTTIPISEILTNIEESGMEKLLKRPE

Query:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARA
        KLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPR TDRPAVINTIFGGPSGGQ GHKRKELARA
Subjt:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARA

Query:  ARREVCIIREQR
        ARRE+   +E R
Subjt:  ARREVCIIREQR

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C7X5 uncharacterized protein LOC111008813 | e value: 1.8e-244 | % identity: 85.61
Query:  QAESSHNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKANTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESS NPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEG LNDGDLGESPFTSDVLEAPIPPKFKA TVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSHNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKANTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRVFQIALIGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLTDEAL
        AIKCR F+IAL GSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGL DEAL
Subjt:  AIKCRVFQIALIGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLTDEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADLK-----------------------SRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIE AD K                       SRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADLK-----------------------SRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPR TDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREQRPTCPIT-----------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVRFSGE
        QSG KRKELARAARREVCIIREQRPTCPIT                             RVLVDGG SANILSLPTYLALGWTRSQLKKS TPLV FSGE
Subjt:  QSGHKRKELARAARREVCIIREQRPTCPIT-----------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVRFSGE

Query:  SVIPEGCVDLPVTLGQDQTPVTQMAEFV
        SVIPEG +DLPVTLGQDQT VTQMAEFV
Subjt:  SVIPEGCVDLPVTLGQDQTPVTQMAEFV

A0A6J1CKB3 uncharacterized protein LOC111012081 | e value: 6.2e-192 | % identity: 88.64
Query:  VITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKANTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFQIALIG
        +ITREEFDQLRGQLDAQ EALKAKCEQKEG LNDGDLGESPFTSDVLEAPIPPKFKA TVKPYDGSKDPKDYVEVFEGLMDFQA SDAIKCR FQIAL G
Subjt:  VITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKANTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFQIALIG

Query:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLTDEALTVKLGEEAPATFA
        SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGL DEALTVKLGEEAP+TF 
Subjt:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLTDEALTVKLGEEAPATFA

Query:  EVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADLK-----------------------SRPYERFTPTTIPISEILTNIEESGMEKLLKRPEK
        EVLQK KKVIDG ELLRTKTGRPERKI RGRSGKDIEK D K                       SRPYERFTPTTIPI EILT IEESGMEKLLKRPEK
Subjt:  EVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADLK-----------------------SRPYERFTPTTIPISEILTNIEESGMEKLLKRPEK

Query:  LRGAPERRSKDKYCRFHREHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKEL
        LR   ERRSKDKYCRFHREHGHNTSDCWELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPR TDRPAVINTIFGGPSGGQSGHKRKEL
Subjt:  LRGAPERRSKDKYCRFHREHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKEL

A0A6J1D9E1 uncharacterized protein LOC111018823 | e value: 4.1e-236 | % identity: 73.64
Query:  STNQQAESSHNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKANTVKPYDGSKDPKDYVEVFEGLMDFQ
        S+NQQAESSHNPATP GVITREEFDQLRG+L+AQVEALKAKCEQKEG LNDGDLGESPFTSDVLEAP        TVK YDGSKDPKDYVEVFEGLMDFQ
Subjt:  STNQQAESSHNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKANTVKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRVFQIALIGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLT
        AASDAIKCR FQIAL GSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGL 
Subjt:  AASDAIKCRVFQIALIGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLT

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADLK-----------------------SRPYERFTPTTIPISEIL
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKD EKADLK                       SRPYERFTPTTIPISEIL
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADLK-----------------------SRPYERFTPTTIPISEIL

Query:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRCTDRPAVINTIFGG
        TNIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELK QIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP R  DRPAVINTIFGG
Subjt:  TNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRCTDRPAVINTIFGG

Query:  PSGGQSGHKRKELARAARREVCIIREQRPTCPIT-----------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVR
        PSGGQSGHKRKELARAARREVCIIREQRPTCPIT                             RVLVD G SANI+SL TYLALGWTRSQLKKS TPLV 
Subjt:  PSGGQSGHKRKELARAARREVCIIREQRPTCPIT-----------------------------RVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVR

Query:  FSGESVIPEGCVDLPVTLGQDQTPVTQMAEFVVIDGRSANNAIFGRPIIHSFRALPSTLHQVFKYSTPNGVGTVGGEQTASRECYASALKGSSVCALETL
        FS ESVIPEGC+DLPVTLG DQT VTQMAEFVVIDGRSA NAIFGRPIIHSFRA+PSTLHQV KYSTPNGVG V GEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVIPEGCVDLPVTLGQDQTPVTQMAEFVVIDGRSANNAIFGRPIIHSFRALPSTLHQVFKYSTPNGVGTVGGEQTASRECYASALKGSSVCALETL

Query:  ASRDETLEFEADLPRR--SVPVEILD
         SRD TLEF+A+LPRR  + P E L+
Subjt:  ASRDETLEFEADLPRR--SVPVEILD

A0A6J1DHB3 uncharacterized protein LOC111020479 | e value: 2.4e-252 | % identity: 65.51
Query:  MVQPANSTNTADRRTLAASDAHQREVGAVVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKRGARGPAPAPPSENFNALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAHP+ SK                                    
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAVVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKRGARGPAPAPPSENFNALQREMEAMR

Query:  TQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGQSPSRSHRSTNQQAESSHNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLG
                                                          AESS+NP TP GVITREEFDQL+ + DAQVEALKA+CE+KE S +DGDLG
Subjt:  TQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGQSPSRSHRSTNQQAESSHNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLG

Query:  ESPFTSDVLEAPIPPKFKANTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFQIALIGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTA
        E  F+SD+LEA IPPKFK  T+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC  FQIAL GSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT 
Subjt:  ESPFTSDVLEAPIPPKFKANTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFQIALIGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTA

Query:  THLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLTDEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEK
        THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGL DE LTVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD  K
Subjt:  THLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLTDEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEK

Query:  ADLK------------------------SRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKCQIEDL
        AD K                        SRPYE +TPTTIPI EILTNIEE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELK QIEDL
Subjt:  ADLK------------------------SRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKCQIEDL

Query:  IQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPIT--------------------
        IQDGYFKKFVGKPR++S EKKEERKR RTPPR  DRPAVIN             K+KELAR ARREVCIIREQRPT  I                     
Subjt:  IQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPIT--------------------

Query:  ---------RVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVRFSGESVIPEGCVDLPVTLGQDQTPVTQMAEFVVIDGRSANNAIFGRPIIHSFRA
                 R+LVDGGASANILSL TYLALGWTRSQLKKS TPLV FSGES+  EGC+DLPV++ QD T VTQMAEFVVIDGRSA NAIFGRPIIHSFRA
Subjt:  ---------RVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVRFSGESVIPEGCVDLPVTLGQDQTPVTQMAEFVVIDGRSANNAIFGRPIIHSFRA

Query:  LPSTLHQVFKYSTPNGVGTVGGEQTASRECYASALKGSSVCALETLASRDE
        +PSTLHQV KYST NGVGTV GE   SRECYAS  K SSVCALE    RDE
Subjt:  LPSTLHQVFKYSTPNGVGTVGGEQTASRECYASALKGSSVCALETLASRDE

A0A6J1DS95 uncharacterized protein LOC111023421 | e value: 1.1e-199 | % identity: 88.59
Query:  GVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKANTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFQIALI
        G+ITREEFDQLRG+LDAQVEALKAKCEQK+ SLNDGDLGESPFTSDVLEAPIPPKFKA TVKPYDG+KDPKDYVEVFEGLMDFQAASDAIKCR FQIAL 
Subjt:  GVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKANTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFQIALI

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLTDEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGL DEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLTDEALTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADLK-----------------------SRPYERFTPTTIPISEILTNIEESGMEKLLKRPE
        AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD+E+AD K                       SRPYERFTPTTIPI EILTNIEESGMEKLLKRPE
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADLK-----------------------SRPYERFTPTTIPISEILTNIEESGMEKLLKRPE

Query:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARA
        KLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPR TDRPAVINTIFGGPSGGQ GHKRKELARA
Subjt:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRCTDRPAVINTIFGGPSGGQSGHKRKELARA

Query:  ARREVCIIREQR
        ARRE+   +E R
Subjt:  ARREVCIIREQR

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGTTCAACCAGCGAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGTAGTGGTAGAGGGGCAAGGT
CACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGT
GGGACCTCTAAGAGGGGCGCCCGGGGTCCAGCCCCGGCTCCACCAAGTGAGAACTTTAACGCGCTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCC
ATGGAGGAAATGTATAACGAGATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGACAGTCACCA
TCCCGCTCGCACAGGAGCACCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCTGCAGGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAG
CTCGACGCTCAGGTAGAGGCCTTGAAGGCCAAATGTGAGCAGAAAGAAGGTTCACTAAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCAGACGTTTTGGAA
GCACCGATCCCTCCGAAGTTCAAAGCTAATACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAA
GCGGCATCAGACGCAATCAAATGTCGCGTCTTTCAGATCGCACTTATTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCT
CAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCAACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGA
GAATATGTCACCAGATTCCAGGAGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAACCGACGAAGCCCTCACG
GTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAAGAGCTCCTCCGAACCAAAACCGGCCGACCA
GAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCTCAAGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAG
ATCCTAACGAACATCGAGGAATCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTC
CATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGTGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAAAAATTTGTGGGAAAGCCCAGG
ACCAGCTCGGCAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGTGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGG
GGTCAGTCCGGACACAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAACAGAGGCCAACCTGCCCAATCACGAGGGTGCTAGTA
GACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCTGACACCGCTGGTTCGGTTCTCT
GGAGAATCGGTCATCCCGGAGGGTTGCGTCGACTTACCGGTCACGCTTGGGCAGGACCAAACTCCGGTCACCCAAATGGCCGAGTTCGTGGTGATTGACGGTAGA
TCGGCCAATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCCTTCCCTCGACACTTCATCAAGTTTTTAAGTATTCCACCCCCAATGGCGTGGGC
ACGGTCGGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCGCCAGTAGGGATGAGACGCTC
GAGTTCGAGGCCGACCTGCCGAGGAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAAAGCCAGATCTGATGGAGATCGGCGCTCCAGAGTCATCA
TGGATGGACCCGATTGCGGACTTCATTAGGGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGGCAGAATGGTC
AGACATTACAACGCCCGCATTCGACCTCGGACCTTCCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACCCATGTGGGTGCCCTTGATCCGACCTGGGCGGGC
CCGTTTGAGGTCAAGGGCATAGTTCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTAT
TATCCTTGA
mRNA sequence
ATGGTTCAACCAGCGAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGTAGTGGTAGAGGGGCAAGGT
CACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGT
GGGACCTCTAAGAGGGGCGCCCGGGGTCCAGCCCCGGCTCCACCAAGTGAGAACTTTAACGCGCTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCC
ATGGAGGAAATGTATAACGAGATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGACAGTCACCA
TCCCGCTCGCACAGGAGCACCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCTGCAGGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAG
CTCGACGCTCAGGTAGAGGCCTTGAAGGCCAAATGTGAGCAGAAAGAAGGTTCACTAAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCAGACGTTTTGGAA
GCACCGATCCCTCCGAAGTTCAAAGCTAATACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAA
GCGGCATCAGACGCAATCAAATGTCGCGTCTTTCAGATCGCACTTATTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCT
CAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCAACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGA
GAATATGTCACCAGATTCCAGGAGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAACCGACGAAGCCCTCACG
GTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAAGAGCTCCTCCGAACCAAAACCGGCCGACCA
GAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCTCAAGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAG
ATCCTAACGAACATCGAGGAATCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTC
CATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGTGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAAAAATTTGTGGGAAAGCCCAGG
ACCAGCTCGGCAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGTGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGG
GGTCAGTCCGGACACAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAACAGAGGCCAACCTGCCCAATCACGAGGGTGCTAGTA
GACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCTGACACCGCTGGTTCGGTTCTCT
GGAGAATCGGTCATCCCGGAGGGTTGCGTCGACTTACCGGTCACGCTTGGGCAGGACCAAACTCCGGTCACCCAAATGGCCGAGTTCGTGGTGATTGACGGTAGA
TCGGCCAATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCCTTCCCTCGACACTTCATCAAGTTTTTAAGTATTCCACCCCCAATGGCGTGGGC
ACGGTCGGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCGCCAGTAGGGATGAGACGCTC
GAGTTCGAGGCCGACCTGCCGAGGAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAAAGCCAGATCTGATGGAGATCGGCGCTCCAGAGTCATCA
TGGATGGACCCGATTGCGGACTTCATTAGGGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGGCAGAATGGTC
AGACATTACAACGCCCGCATTCGACCTCGGACCTTCCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACCCATGTGGGTGCCCTTGATCCGACCTGGGCGGGC
CCGTTTGAGGTCAAGGGCATAGTTCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTAT
TATCCTTGA
Protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAVVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKRGARGPAPAPPSENFNALQREMEAMRTQMRS
MEEMYNEMILAAGAGSRSENRMTRIDIREQRGQSPSRSHRSTNQQAESSHNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLE
APIPPKFKANTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRVFQIALIGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLR
EYVTRFQEEQLKVAHCSDDSAMCYFLTGLTDEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADLKSRPYERFTPTTIPISE
ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRCTDRPAVINTIFGGPSG
GQSGHKRKELARAARREVCIIREQRPTCPITRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVRFSGESVIPEGCVDLPVTLGQDQTPVTQMAEFVVIDGR
SANNAIFGRPIIHSFRALPSTLHQVFKYSTPNGVGTVGGEQTASRECYASALKGSSVCALETLASRDETLEFEADLPRRSVPVEILDNPSISKPDLMEIGAPESS
WMDPIADFIRGNSPQDPKERRKLARRAARFVGRMVRHYNARIRPRTFQVGHLVLRRVQTHVGALDPTWAGPFEVKGIVRPGTYILADLKGDVLAHPWNAEHLKRY
YP