CuGenDBv2

Moc07g19050 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g19050
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr7:13666825..13672103
RNA-Seq Expression: Moc07g19050
Synteny: Moc07g19050
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
                  IPR021109 - Aspartic peptidase domain superfamily
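
The genome location field above follows a "chromosome:start..end" pattern. As a minimal sketch, the snippet below splits such a string into its parts and computes the length of the span. The function names are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr7:13666825..13672103")
length = span_length(start, end)
print(chrom, start, end, length)
```

The span covers the whole gene locus, so it can be longer than the CDS listed under Sequences if the gene model contains introns.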


Homology
GenBank top hits (e value, % identity, alignment)
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia] (e value: 3.7e-244; % identity: 85.23)
Query:  QAESSHN---PAGMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGTKDPKDYVDVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEA IPPKFKAPTVKPYDG+KDPKDYV+VFE LMDFQAASD
Subjt:  QAESSHN---PAGMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGTKDPKDYVDVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYNKKTATHLAAIRQKEGETLREYVTRFQEEQLKVARCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHY+KKTATHLA IRQKEGETLREYVTRFQEEQLKVA CSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYNKKTATHLAAIRQKEGETLREYVTRFQEEQLKVARCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIK
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNI+
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIK

Query:  DSGMEKLLRRPEKFRGASERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        +SGMEKLL+RPEK RGA ERR+KDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  DSGMEKLLRRPEKFRGASERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARTARREVCIIKE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
        QSG KRKELAR ARREVCII+E                                     RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELARTARREVCIIKE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQNQTRITQMVEFV
        SVIPEG IDLPVTLGQ+QT++TQM EFV
Subjt:  SVIPEGCIDLPVTLGQNQTRITQMVEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia] (e value: 2.3e-246; % identity: 71.39)
Query:  SSNQQAESSHNPA---GMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGTKDPKDYVDVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYV+VFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGTKDPKDYVDVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYNKKTATHLAAIRQKEGETLREYVTRFQEEQLKVARCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA+ SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYNKKTATHLAAIRQKEGETLREYVTRFQEEQLKVARCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKDE+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEILT
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT

Query:  NIKDSGMEKLLRRPEKFRGASERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP
        NI++SGMEKLL+RPEK RGA ERRNKDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKP TSSAEKKEERK SRTP RR DRPAVINTIFGGP
Subjt:  NIKDSGMEKLLRRPEKFRGASERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP

Query:  SGGQSGHKRKELARTARREVCIIKE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF
        SGGQSGHKRKELAR ARREVCII+E                                     RVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGF
Subjt:  SGGQSGHKRKELARTARREVCIIKE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF

Query:  SGESVIPEGCIDLPVTLGQNQTRITQMVEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQIASRECYTATLKGPSVCALETL-
        S ESVIPEGCIDLPVTLG +QT++TQM EFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVG VRGEQIASRECY + LKG SVCALETL 
Subjt:  SGESVIPEGCIDLPVTLGQNQTRITQMVEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQIASRECYTATLKGPSVCALETL-

Query:  -RDGTLEFEADLPRKKFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDKPSILEPDLMEIGAPEP
         RDGTLEF+A+LPR++FAAPTEELELVPLL  +      +E +L     +  +D       D+   G PEP
Subjt:  -RDGTLEFEADLPRKKFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDKPSILEPDLMEIGAPEP

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] (e value: 1.7e-233; % identity: 72.17)
Query:  QAESSHNP--AGMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGTKDPKDYVDVFEGLMDFQAASDA
        +AESS+NP   G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYV+VFE LMDFQAA+DA
Subjt:  QAESSHNP--AGMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGTKDPKDYVDVFEGLMDFQAASDA

Query:  IKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYNKKTATHLAAIRQKEGETLREYVTRFQEEQLKVARCSDDSAMCYFLTGLADEALT
        IKC AFQIALTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHY++KT THLA IRQKEGETLREYVTRF EEQLKVA CSDDSAMCYFLTGLADE LT
Subjt:  IKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYNKKTATHLAAIRQKEGETLREYVTRFQEEQLKVARCSDDSAMCYFLTGLADEALT

Query:  VKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIK
        VKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ +AD KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNI+
Subjt:  VKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIK

Query:  DSGMEKLLRRPEKFRGASERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        ++GMEKLL+RPEK RG  E+RN DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKP ++S EKKEERKR RTPPRR DRPAVIN         
Subjt:  DSGMEKLLRRPEKFRGASERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARTARREVCIIKE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
            K+KELAR ARREVCII+E                                     R+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELARTARREVCIIKE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQNQTRITQMVEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQIASRECYTATLKGPSVCALE--TLRD
        S+  EGCIDLPV++ Q+ T++TQM EFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY T NGVGTVRGE   SRECY +  K  SVCALE  T+RD
Subjt:  SVIPEGCIDLPVTLGQNQTRITQMVEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQIASRECYTATLKGPSVCALE--TLRD

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia] (e value: 1.3e-212; % identity: 95.06)
Query:  GMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGTKDPKDYVDVFEGLMDFQAASDAIKCRAFQIALT
        G+ITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEA IPPKFKAPTVKPYDGTKDPKDYV+VFEGLMDFQAASDAIKCRAFQIALT
Subjt:  GMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGTKDPKDYVDVFEGLMDFQAASDAIKCRAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYNKKTATHLAAIRQKEGETLREYVTRFQEEQLKVARCSDDSAMCYFLTGLADEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRHY+KKTATHLA IRQKEGETLREYVTRFQEEQLKVA CSDDSAMCYFLTGLADEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYNKKTATHLAAIRQKEGETLREYVTRFQEEQLKVARCSDDSAMCYFLTGLADEALTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIKDSGMEKLLRRPE
        AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPI EILTNI++SGMEKLL+RPE
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIKDSGMEKLLRRPE

Query:  KFRGASERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELART
        K RGA ERR+KDKYCRFHREHGHNTSD WELKRQIEDLIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQ GHKRKELAR 
Subjt:  KFRGASERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELART

Query:  ARREV
        ARRE+
Subjt:  ARREV

XP_022158652.1 uncharacterized protein LOC111025109 [Momordica charantia] (e value: 4.2e-203; % identity: 85.78)
Query:  NRKRGSSLRKGQSSSRSHRSSNQQAESSHNPAGMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGTK
        + KRGSSLRKGQS SRSHRSSNQQAESSHNPAG+ITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGE PFTSDVLEA IPPKFKAPTVKPYDGTK
Subjt:  NRKRGSSLRKGQSSSRSHRSSNQQAESSHNPAGMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGTK

Query:  DPKDYVDVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYNKKTATHLAAIRQKEGETLREYVTRFQEEQLKVA
        DPKDYV+VFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA                                 RQKE ETLREYVTRFQEEQLKVA
Subjt:  DPKDYVDVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYNKKTATHLAAIRQKEGETLREYVTRFQEEQLKVA

Query:  RCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSR
         CSDDSAMCYF TGLADEALTVKLGEEAP TFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD ERADPKSKDKGSFSSGRAEYRRAENGPT  R
Subjt:  RCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSR

Query:  PYERFTPTTIPISEILTNIKDSGMEKLLRRPEKFRGASERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTP
        PYERFTPTTIPIS ILTNI++SGMEKLL+R EK RGA ERR KDKYCRFHREHGHNTS+CWELKRQIEDLIQDGYFKKFVG P TSSAEKKEERKRSRTP
Subjt:  PYERFTPTTIPISEILTNIKDSGMEKLLRRPEKFRGASERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTP

Query:  PRRTDRPAVINTIFGGPSGGQSGHKRKELARTARREVCIIKER
        PRRTDRPAVINTIFGGPSGGQS HKRK+LAR ARREVCII+E+
Subjt:  PRRTDRPAVINTIFGGPSGGQSGHKRKELARTARREVCIIKER

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C7X5 uncharacterized protein LOC111008813 (e value: 1.8e-244; % identity: 85.23)
Query:  QAESSHN---PAGMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGTKDPKDYVDVFEGLMDFQAASD
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEA IPPKFKAPTVKPYDG+KDPKDYV+VFE LMDFQAASD
Subjt:  QAESSHN---PAGMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGTKDPKDYVDVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYNKKTATHLAAIRQKEGETLREYVTRFQEEQLKVARCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHY+KKTATHLA IRQKEGETLREYVTRFQEEQLKVA CSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYNKKTATHLAAIRQKEGETLREYVTRFQEEQLKVARCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIK
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNI+
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIK

Query:  DSGMEKLLRRPEKFRGASERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        +SGMEKLL+RPEK RGA ERR+KDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  DSGMEKLLRRPEKFRGASERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARTARREVCIIKE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
        QSG KRKELAR ARREVCII+E                                     RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELARTARREVCIIKE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQNQTRITQMVEFV
        SVIPEG IDLPVTLGQ+QT++TQM EFV
Subjt:  SVIPEGCIDLPVTLGQNQTRITQMVEFV

A0A6J1D9E1 uncharacterized protein LOC111018823 (e value: 1.1e-246; % identity: 71.39)
Query:  SSNQQAESSHNPA---GMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGTKDPKDYVDVFEGLMDFQ
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYV+VFEGLMDFQ
Subjt:  SSNQQAESSHNPA---GMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGTKDPKDYVDVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYNKKTATHLAAIRQKEGETLREYVTRFQEEQLKVARCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA+ SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYNKKTATHLAAIRQKEGETLREYVTRFQEEQLKVARCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKDE+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEILT
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT

Query:  NIKDSGMEKLLRRPEKFRGASERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP
        NI++SGMEKLL+RPEK RGA ERRNKDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKP TSSAEKKEERK SRTP RR DRPAVINTIFGGP
Subjt:  NIKDSGMEKLLRRPEKFRGASERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP

Query:  SGGQSGHKRKELARTARREVCIIKE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF
        SGGQSGHKRKELAR ARREVCII+E                                     RVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGF
Subjt:  SGGQSGHKRKELARTARREVCIIKE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGF

Query:  SGESVIPEGCIDLPVTLGQNQTRITQMVEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQIASRECYTATLKGPSVCALETL-
        S ESVIPEGCIDLPVTLG +QT++TQM EFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVG VRGEQIASRECY + LKG SVCALETL 
Subjt:  SGESVIPEGCIDLPVTLGQNQTRITQMVEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQIASRECYTATLKGPSVCALETL-

Query:  -RDGTLEFEADLPRKKFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDKPSILEPDLMEIGAPEP
         RDGTLEF+A+LPR++FAAPTEELELVPLL  +      +E +L     +  +D       D+   G PEP
Subjt:  -RDGTLEFEADLPRKKFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDKPSILEPDLMEIGAPEP

A0A6J1DHB3 uncharacterized protein LOC111020479 (e value: 8.4e-234; % identity: 72.17)
Query:  QAESSHNP--AGMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGTKDPKDYVDVFEGLMDFQAASDA
        +AESS+NP   G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYV+VFE LMDFQAA+DA
Subjt:  QAESSHNP--AGMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGTKDPKDYVDVFEGLMDFQAASDA

Query:  IKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYNKKTATHLAAIRQKEGETLREYVTRFQEEQLKVARCSDDSAMCYFLTGLADEALT
        IKC AFQIALTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHY++KT THLA IRQKEGETLREYVTRF EEQLKVA CSDDSAMCYFLTGLADE LT
Subjt:  IKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYNKKTATHLAAIRQKEGETLREYVTRFQEEQLKVARCSDDSAMCYFLTGLADEALT

Query:  VKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIK
        VKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ +AD KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNI+
Subjt:  VKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIK

Query:  DSGMEKLLRRPEKFRGASERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        ++GMEKLL+RPEK RG  E+RN DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKP ++S EKKEERKR RTPPRR DRPAVIN         
Subjt:  DSGMEKLLRRPEKFRGASERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARTARREVCIIKE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
            K+KELAR ARREVCII+E                                     R+LVDGGASANILSL TYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRKELARTARREVCIIKE-------------------------------------RVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQNQTRITQMVEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQIASRECYTATLKGPSVCALE--TLRD
        S+  EGCIDLPV++ Q+ T++TQM EFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY T NGVGTVRGE   SRECY +  K  SVCALE  T+RD
Subjt:  SVIPEGCIDLPVTLGQNQTRITQMVEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQIASRECYTATLKGPSVCALE--TLRD

A0A6J1DS95 uncharacterized protein LOC111023421 (e value: 6.2e-213; % identity: 95.06)
Query:  GMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGTKDPKDYVDVFEGLMDFQAASDAIKCRAFQIALT
        G+ITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEA IPPKFKAPTVKPYDGTKDPKDYV+VFEGLMDFQAASDAIKCRAFQIALT
Subjt:  GMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGTKDPKDYVDVFEGLMDFQAASDAIKCRAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYNKKTATHLAAIRQKEGETLREYVTRFQEEQLKVARCSDDSAMCYFLTGLADEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRHY+KKTATHLA IRQKEGETLREYVTRFQEEQLKVA CSDDSAMCYFLTGLADEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYNKKTATHLAAIRQKEGETLREYVTRFQEEQLKVARCSDDSAMCYFLTGLADEALTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIKDSGMEKLLRRPE
        AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPI EILTNI++SGMEKLL+RPE
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIKDSGMEKLLRRPE

Query:  KFRGASERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELART
        K RGA ERR+KDKYCRFHREHGHNTSD WELKRQIEDLIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQ GHKRKELAR 
Subjt:  KFRGASERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELART

Query:  ARREV
        ARRE+
Subjt:  ARREV

A0A6J1DXR9 uncharacterized protein LOC111025109 (e value: 2.0e-203; % identity: 85.78)
Query:  NRKRGSSLRKGQSSSRSHRSSNQQAESSHNPAGMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGTK
        + KRGSSLRKGQS SRSHRSSNQQAESSHNPAG+ITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGE PFTSDVLEA IPPKFKAPTVKPYDGTK
Subjt:  NRKRGSSLRKGQSSSRSHRSSNQQAESSHNPAGMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGTK

Query:  DPKDYVDVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYNKKTATHLAAIRQKEGETLREYVTRFQEEQLKVA
        DPKDYV+VFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA                                 RQKE ETLREYVTRFQEEQLKVA
Subjt:  DPKDYVDVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYNKKTATHLAAIRQKEGETLREYVTRFQEEQLKVA

Query:  RCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSR
         CSDDSAMCYF TGLADEALTVKLGEEAP TFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD ERADPKSKDKGSFSSGRAEYRRAENGPT  R
Subjt:  RCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSR

Query:  PYERFTPTTIPISEILTNIKDSGMEKLLRRPEKFRGASERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTP
        PYERFTPTTIPIS ILTNI++SGMEKLL+R EK RGA ERR KDKYCRFHREHGHNTS+CWELKRQIEDLIQDGYFKKFVG P TSSAEKKEERKRSRTP
Subjt:  PYERFTPTTIPISEILTNIKDSGMEKLLRRPEKFRGASERRNKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTP

Query:  PRRTDRPAVINTIFGGPSGGQSGHKRKELARTARREVCIIKER
        PRRTDRPAVINTIFGGPSGGQS HKRK+LAR ARREVCII+E+
Subjt:  PRRTDRPAVINTIFGGPSGGQSGHKRKELARTARREVCIIKER

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found
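
In BLAST-style pairwise output such as the alignments above, the unlabeled middle line conventionally repeats the residue letter where the two sequences are identical, shows "+" for a conservative substitution, and is blank otherwise. The sketch below tallies those symbols for one alignment segment. The function name is hypothetical, and the percentage it returns covers only the segment passed in, so it will not necessarily match the % identity the page reports for the full hit.

```python
def midline_stats(midline: str, aligned_length: int) -> dict:
    """Count identities and positives from a BLAST-style match line.

    Letters mark identical positions, '+' marks conservative substitutions.
    aligned_length is the number of alignment columns (gaps included).
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return {
        "identities": identities,
        "positives": positives,
        "percent_identity": 100.0 * identities / aligned_length,
    }


# Final segment of the XP_022156542.1 alignment shown above.
query = "ARREV"
midline = "ARRE+"
stats = midline_stats(midline, len(query))
print(stats)
```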

Sequences
CDS sequence
ATGCGCTCCATGGAGGCAATGTATAACGAAATGGTGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGGGCTACGCGCATGGACGTACGCGAGCAAAGGGGT
TCCCACCTCGGCCCAGCGGAGGAGGAACGTCCCGAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCACGAGCATCTGAATAGAAAGAGAGGC
TCATCTCTCCGAAAAGGGCAGTCGTCGTCCCGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCCGCAGGGATGATCACAAGGGAGGAGTTC
GACCAGCTGAGGGGGGAGCTCGATGCTCAAGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTC
ACCTCGGACGTTTTGGAAGCATCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGATGTCTTTGAA
GGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGG
TCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTACAACAAAAAGACAGCGACCCATCTCGCCGCCATCAGGCAGAAG
GAGGGGGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACGCTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTA
GCCGATGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGA
ACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAAGGATCCTTCTCCAGCGGCCGA
GCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTCCGAGATCCTAACGAACATCAAAGAT
TCTGGAATGGAGAAACTACTCAGGCGTCCGGAGAAATTTCGGGGAGCCTCGGAGAGGCGCAACAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAAC
ACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAAGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGGACCAGCTCAGCAGAGAAAAAG
GAAGAGCGAAAGCGTTCAAGGACGCCACCTCGGCGCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGTGGTCAATCCGGACATAAAAGA
AAGGAGTTAGCCCGTACAGCCAGACGCGAGGTGTGCATCATCAAGGAGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTC
GCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACA
CTGGGGCAGAACCAAACCCGGATCACTCAAATGGTCGAGTTCGTGGTAGTTGACGGTAGGTCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTT
CGGGCCATTCCTTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAAATCGCTTCGAGGGAGTGTTACACCGCC
ACACTCAAAGGCCCATCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGAAGTTTGCCGCGCCCACTGAGGAG
CTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCAGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAAACCCTCGATC
TTAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAACCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGA
AAGTTGGCAAGACGGGCAGCTCGGTTCGTGGGCAGGATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGG
GTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAAGGCCCATTTGAGATCAAATGCATAGTCCGACCTGGGACGTATGTATTGGCCGATCTGAAAGGAGAT
GTCCTCGCGCACCCGTGGAACGCGGAACACCTGAAGCGTTATTATCCTTGA
mRNA sequence
ATGCGCTCCATGGAGGCAATGTATAACGAAATGGTGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGGGCTACGCGCATGGACGTACGCGAGCAAAGGGGT
TCCCACCTCGGCCCAGCGGAGGAGGAACGTCCCGAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCACGAGCATCTGAATAGAAAGAGAGGC
TCATCTCTCCGAAAAGGGCAGTCGTCGTCCCGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCCGCAGGGATGATCACAAGGGAGGAGTTC
GACCAGCTGAGGGGGGAGCTCGATGCTCAAGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTC
ACCTCGGACGTTTTGGAAGCATCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGATGTCTTTGAA
GGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGG
TCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTACAACAAAAAGACAGCGACCCATCTCGCCGCCATCAGGCAGAAG
GAGGGGGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACGCTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTA
GCCGATGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGA
ACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAAGGATCCTTCTCCAGCGGCCGA
GCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTCCGAGATCCTAACGAACATCAAAGAT
TCTGGAATGGAGAAACTACTCAGGCGTCCGGAGAAATTTCGGGGAGCCTCGGAGAGGCGCAACAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAAC
ACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAAGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGGACCAGCTCAGCAGAGAAAAAG
GAAGAGCGAAAGCGTTCAAGGACGCCACCTCGGCGCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGTGGTCAATCCGGACATAAAAGA
AAGGAGTTAGCCCGTACAGCCAGACGCGAGGTGTGCATCATCAAGGAGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTC
GCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACA
CTGGGGCAGAACCAAACCCGGATCACTCAAATGGTCGAGTTCGTGGTAGTTGACGGTAGGTCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTT
CGGGCCATTCCTTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAAATCGCTTCGAGGGAGTGTTACACCGCC
ACACTCAAAGGCCCATCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGAAGTTTGCCGCGCCCACTGAGGAG
CTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCAGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAAACCCTCGATC
TTAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAACCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGA
AAGTTGGCAAGACGGGCAGCTCGGTTCGTGGGCAGGATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGG
GTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAAGGCCCATTTGAGATCAAATGCATAGTCCGACCTGGGACGTATGTATTGGCCGATCTGAAAGGAGAT
GTCCTCGCGCACCCGTGGAACGCGGAACACCTGAAGCGTTATTATCCTTGA
Protein sequence
MRSMEAMYNEMVLAAGAGSRSENRATRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLHEHLNRKRGSSLRKGQSSSRSHRSSNQQAESSHNPAGMITREEF
DQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEASIPPKFKAPTVKPYDGTKDPKDYVDVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPAR
SISTYSQLRREFLAQFSSRHYNKKTATHLAAIRQKEGETLREYVTRFQEEQLKVARCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLR
TKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIKDSGMEKLLRRPEKFRGASERRNKDKYCRFHREHGHN
TSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARTARREVCIIKERVLVDGGASANILSLPTYL
ALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMVEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQIASRECYTA
TLKGPSVCALETLRDGTLEFEADLPRKKFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDKPSILEPDLMEIGAPEPSWMDPIADFIRGNSPQDPKERR
KLARRAARFVGRMARHYNARVRPRAFQVGHLVLRRVQTHVGALDPAWEGPFEIKCIVRPGTYVLADLKGDVLAHPWNAEHLKRYYP
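
The protein sequence is expected to be the translation of the CDS. As a minimal sketch for checking that, the snippet below translates a nucleotide string codon by codon, assuming the standard nuclear genetic code (an assumption; the page does not name a translation table). The function and variable names are hypothetical, and only the first and last stretches of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 51 nt of the CDS listed above.
cds_start = "ATGCGCTCCATGGAGGCAATGTATAACGAA"
cds_end = "GTCCTCGCGCACCCGTGGAACGCGGAACACCTGAAGCGTTATTATCCTTGA"
print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the protein sequence, after joining its lines, is one way to confirm that the two listings agree.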