CuGenDBv2

Moc09g26310 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g26310
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr9:19658369..19663336
RNA-Seq Expression: Moc09g26310
Synteny: Moc09g26310
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
                  IPR021109 - Aspartic peptidase domain superfamily
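The genome location field above has the form chromosome:start..end. As a minimal sketch, the snippet below parses such a string and computes the span length. The helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based coordinates that are inclusive at both ends, which the record itself does not state.

```python
import re

# Hypothetical helper pattern for strings of the form "chrom:start..end".
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (chromosome, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return match.group("chrom"), int(match.group("start")), int(match.group("end"))


def span_length(start, end):
    """Length of the interval.

    Assumption: coordinates are 1-based and inclusive at both ends.
    """
    return end - start + 1


chrom, start, end = parse_location("chr9:19658369..19663336")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the locus spans 4,968 bp.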


Homology
GenBank top hits | e value | % identity | Alignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia] | 4.7e-242 | 85.42
Query:  QAESSQN---PAGMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS+N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSQN---PAGMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLVDEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGL DEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLVDEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI RGRSGKD E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        +SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKP T SAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAA-------------------------------DA------------RRVLVDGGASANILSLPTYLALGWTRSQLKRSLTPLVGFSGE
        QSG KRKELARAA                               DA             RVLVDGG SANILSLPTYLALGWTRSQLK+S TPLVGFSGE
Subjt:  QSGHKRKELARAA-------------------------------DA------------RRVLVDGGASANILSLPTYLALGWTRSQLKRSLTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQNQTRITEMAEFV
        SVIPEG IDLPVTLGQ+QT++T+MAEFV
Subjt:  SVIPEGCIDLPVTLGQNQTRITEMAEFV

XP_022141796.1 uncharacterized protein LOC111012081 [Momordica charantia] | 4.2e-206 | 93.69
Query:  MITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTG
        MITREEFDQLRG+LDAQ EALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDFQA SDAIKCRAFQIALTG
Subjt:  MITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTG

Query:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLVDEALTVKLGEEAPATFA
        SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGL DEALTVKLGEEAP+TF 
Subjt:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLVDEALTVKLGEEAPATFA

Query:  EVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEK
        EVLQK KKVIDG ELLRTKTGRPERKI RGRSGKD E+ DPKSKDKGSFSSGR EYRRAENGPTRSRPYERFTPTTIPI EILT IE+SGMEKLLKRPEK
Subjt:  EVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEK

Query:  LRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL
        LR   ERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKP T SAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL
Subjt:  LRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia] | 2.1e-242 | 74.6
Query:  SSYQQAESSQNPA---GMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SS QQAESS NPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSYQQAESSQNPA---GMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLV
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGL 
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLV

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER IDRGRSGKDE+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEILT
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT

Query:  NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGP
        NIE+SGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKP T SAEKKEERK SRTP RR DRPAVINTIFGGP
Subjt:  NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGP

Query:  SGGQSGHKRKELARAA-------------------------------DA------------RRVLVDGGASANILSLPTYLALGWTRSQLKRSLTPLVGF
        SGGQSGHKRKELARAA                               DA            RRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGF
Subjt:  SGGQSGHKRKELARAA-------------------------------DA------------RRVLVDGGASANILSLPTYLALGWTRSQLKRSLTPLVGF

Query:  SGESVIPEGCIDLPVTLGQNQTRITEMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYAAALKGPSVCALETL-
        S ESVIPEGCIDLPVTLG +QT++T+MAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYA+ALKG SVCALETL 
Subjt:  SGESVIPEGCIDLPVTLGQNQTRITEMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYAAALKGPSVCALETL-

Query:  -RDGTLEFEADLPRKEFAAPTEELELVPLL
         RDGTLEF+A+LPR+EFAAPTEELELVPLL
Subjt:  -RDGTLEFEADLPRKEFAAPTEELELVPLL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] | 9.9e-232 | 72.5
Query:  QAESSQNP--AGMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDA
        +AESS NP   G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DA
Subjt:  QAESSQNP--AGMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDA

Query:  IKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLVDEALT
        IKC AFQIALTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGL DE LT
Subjt:  IKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLVDEALT

Query:  VKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        VKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ ID+GR+GKD+ +AD KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNIE
Subjt:  VKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        ++GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKP + S EKKEERKR RTPPRR DRPAVIN         
Subjt:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAA-------------------------------DA------------RRVLVDGGASANILSLPTYLALGWTRSQLKRSLTPLVGFSGE
            K+KELAR A                               DA            RR+LVDGGASANILSL TYLALGWTRSQLK+S TPLVGFSGE
Subjt:  QSGHKRKELARAA-------------------------------DA------------RRVLVDGGASANILSLPTYLALGWTRSQLKRSLTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQNQTRITEMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYAAALKGPSVCALE--TLRD
        S+  EGCIDLPV++ Q+ T++T+MAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY T NGVGTVRGE   SRECYA+  K  SVCALE  T+RD
Subjt:  SVIPEGCIDLPVTLGQNQTRITEMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYAAALKGPSVCALE--TLRD

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] | 6.6e-02 | 38.82
Query:  EGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSEDFDALQ----REMEAMRTQMRSMEAMYNE
        EGQGH+ L  EPL RSARIT P LPPAHP+ SKA       +           T E+FD L+     ++EA++ +    E+ +++
Subjt:  EGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSEDFDALQ----REMEAMRTQMRSMEAMYNE

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] | 1.7e-215 | 97.26
Query:  GMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
        G+ITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
Subjt:  GMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLVDEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGL DEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLVDEALTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPE
        AEVLQKAKKVIDGQELLRTKTGRPERKI RGRSGKD ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPI EILTNIE+SGMEKLLKRPE
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPE

Query:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARA
        KLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIEDLIQDGYFKKFVGKP T SAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQ GHKRKELARA
Subjt:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARA

Query:  A
        A
Subjt:  A

TrEMBL top hits | e value | % identity | Alignment
A0A6J1C7X5 uncharacterized protein LOC111008813 | 2.3e-242 | 85.42
Query:  QAESSQN---PAGMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD
        +AESS+N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASD
Subjt:  QAESSQN---PAGMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLVDEAL
        AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGL DEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLVDEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI RGRSGKD E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        +SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKP T SAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAA-------------------------------DA------------RRVLVDGGASANILSLPTYLALGWTRSQLKRSLTPLVGFSGE
        QSG KRKELARAA                               DA             RVLVDGG SANILSLPTYLALGWTRSQLK+S TPLVGFSGE
Subjt:  QSGHKRKELARAA-------------------------------DA------------RRVLVDGGASANILSLPTYLALGWTRSQLKRSLTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQNQTRITEMAEFV
        SVIPEG IDLPVTLGQ+QT++T+MAEFV
Subjt:  SVIPEGCIDLPVTLGQNQTRITEMAEFV

A0A6J1CKB3 uncharacterized protein LOC111012081 | 2.0e-206 | 93.69
Query:  MITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTG
        MITREEFDQLRG+LDAQ EALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDFQA SDAIKCRAFQIALTG
Subjt:  MITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTG

Query:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLVDEALTVKLGEEAPATFA
        SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGL DEALTVKLGEEAP+TF 
Subjt:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLVDEALTVKLGEEAPATFA

Query:  EVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEK
        EVLQK KKVIDG ELLRTKTGRPERKI RGRSGKD E+ DPKSKDKGSFSSGR EYRRAENGPTRSRPYERFTPTTIPI EILT IE+SGMEKLLKRPEK
Subjt:  EVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEK

Query:  LRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL
        LR   ERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKP T SAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL
Subjt:  LRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL

A0A6J1D9E1 uncharacterized protein LOC111018823 | 1.0e-242 | 74.6
Query:  SSYQQAESSQNPA---GMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ
        SS QQAESS NPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGESPFTSDVLE        APTVK YDG+KDPKDYVEVFEGLMDFQ
Subjt:  SSYQQAESSQNPA---GMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLV
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGL 
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLV

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER IDRGRSGKDE+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEILT
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT

Query:  NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGP
        NIE+SGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKP T SAEKKEERK SRTP RR DRPAVINTIFGGP
Subjt:  NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGP

Query:  SGGQSGHKRKELARAA-------------------------------DA------------RRVLVDGGASANILSLPTYLALGWTRSQLKRSLTPLVGF
        SGGQSGHKRKELARAA                               DA            RRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGF
Subjt:  SGGQSGHKRKELARAA-------------------------------DA------------RRVLVDGGASANILSLPTYLALGWTRSQLKRSLTPLVGF

Query:  SGESVIPEGCIDLPVTLGQNQTRITEMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYAAALKGPSVCALETL-
        S ESVIPEGCIDLPVTLG +QT++T+MAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYA+ALKG SVCALETL 
Subjt:  SGESVIPEGCIDLPVTLGQNQTRITEMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYAAALKGPSVCALETL-

Query:  -RDGTLEFEADLPRKEFAAPTEELELVPLL
         RDGTLEF+A+LPR+EFAAPTEELELVPLL
Subjt:  -RDGTLEFEADLPRKEFAAPTEELELVPLL

A0A6J1DHB3 uncharacterized protein LOC111020479 | 4.8e-232 | 72.5
Query:  QAESSQNP--AGMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDA
        +AESS NP   G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DA
Subjt:  QAESSQNP--AGMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDA

Query:  IKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLVDEALT
        IKC AFQIALTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGL DE LT
Subjt:  IKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLVDEALT

Query:  VKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        VKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ ID+GR+GKD+ +AD KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNIE
Subjt:  VKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        ++GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKP + S EKKEERKR RTPPRR DRPAVIN         
Subjt:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAA-------------------------------DA------------RRVLVDGGASANILSLPTYLALGWTRSQLKRSLTPLVGFSGE
            K+KELAR A                               DA            RR+LVDGGASANILSL TYLALGWTRSQLK+S TPLVGFSGE
Subjt:  QSGHKRKELARAA-------------------------------DA------------RRVLVDGGASANILSLPTYLALGWTRSQLKRSLTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQNQTRITEMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYAAALKGPSVCALE--TLRD
        S+  EGCIDLPV++ Q+ T++T+MAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY T NGVGTVRGE   SRECYA+  K  SVCALE  T+RD
Subjt:  SVIPEGCIDLPVTLGQNQTRITEMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYAAALKGPSVCALE--TLRD

A0A6J1DHB3 uncharacterized protein LOC111020479 | 3.2e-02 | 38.82
Query:  EGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSEDFDALQ----REMEAMRTQMRSMEAMYNE
        EGQGH+ L  EPL RSARIT P LPPAHP+ SKA       +           T E+FD L+     ++EA++ +    E+ +++
Subjt:  EGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSEDFDALQ----REMEAMRTQMRSMEAMYNE

A0A6J1DHB3 uncharacterized protein LOC111020479 | 8.2e-216 | 97.26
Query:  GMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
        G+ITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
Subjt:  GMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT

Query:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLVDEALTVKLGEEAPATF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGL DEALTVKLGEEAPATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLVDEALTVKLGEEAPATF

Query:  AEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPE
        AEVLQKAKKVIDGQELLRTKTGRPERKI RGRSGKD ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPI EILTNIE+SGMEKLLKRPE
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPE

Query:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARA
        KLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIEDLIQDGYFKKFVGKP T SAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQ GHKRKELARA
Subjt:  KLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARA

Query:  A
        A
Subjt:  A

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGCCAAGTGTGAGGGCCGAGGTGAGCCCGGCTCAGGTCCGACCCACCGGGGAGCCCGGTCCGCCCAAGTGGTCAGGTCGGTCCGGAGGCCGGGTTCGAGCTACAACCAG
AAACACACTGTTCGAACATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTGGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGAG
AGGGGCAAGGTCACGACGGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGCCCTGCCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGT
CGAGGTGGGACTTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGGATTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACGCAAATGCGCTC
CATGGAGGCGATGTATAACGAAATGGTGCTAGCTACAGGCGCGGGGTCCCGATCTGAAAATCGGGCGATGCGCATGGACGTACGCGAGCAAAGGGGCTCCCACCTCGGCC
CGACCGAGGAGGAACGTCCCGAAGACAACGGGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGCGAGCATCTCAACCGAAAGAGAGGCTCGTCTCTCCAAAAAGGG
CAGTCGCCATCCCGCTCCCACAGGAGCTCCTACCAGCAGGCTGAATCCTCTCAGAACCCCGCAGGGATGATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGA
TGCTCAAGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCGATCC
CCCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAAGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCA
ATCAAATGCCGCGCCTTCCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCCCAGCTGAGAAGGGAGTTCCT
CGCCCAGTTCTCTTCTCGGCACTACGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGGGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGG
AGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACAGGTCTAGTCGATGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACC
TTCGCTGAGGTGCTCCAGAAGGCAAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGACCGGGGCAGAAGTGGAAAAGA
TGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGCGGTCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCCTACGAGCGCTTCA
CCCCAACAACGATTCCAATCTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGT
AAGGACAAGTATTGCCGCTTTCATCGGGAGCACGGCCACAACACGTCAGACTGCTGGGAATTGAAGCGTCAAATTGAAGATCTGATTCAAGACGGCTACTTCAAGAAGTT
TGTGGGAAAGCCCGGGACCGGCTCAGCAGAGAAAAAGGAGGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATACCATCTTTGGAG
GGCCAAGCGGGGGTCAATCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCGACGCGAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCCAACATCCTGTCCTTACCG
ACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCTGACGCCGCTGGTTGGGTTCTCTGGAGAGTCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGT
CACGCTGGGGCAGAACCAAACCCGGATCACTGAAATGGCCGAGTTCGTGGTAGTTGACGGTAGGTCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCCTTTC
GGGCCATTCCTTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAACGGCGTGGGCACGGTCCGAGGAGAACAAGCCGCTTCGAGGGAGTGTTATGCCGCCGCACTC
AAAGGCCCATCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGT
TCCTCTGCTTAGTCCCGAGAAGCAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGAAGGGCAGCTCGGTTCGTGATCCGAGATGCGGCATTGTAC
CGACGTGGCTTTTCCCTGCCTCTGTTGAAATGCCTAACCTCTGA
mRNA sequence
ATGCCAAGTGTGAGGGCCGAGGTGAGCCCGGCTCAGGTCCGACCCACCGGGGAGCCCGGTCCGCCCAAGTGGTCAGGTCGGTCCGGAGGCCGGGTTCGAGCTACAACCAG
AAACACACTGTTCGAACATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTGGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGAG
AGGGGCAAGGTCACGACGGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGCCCTGCCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGT
CGAGGTGGGACTTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGGATTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACGCAAATGCGCTC
CATGGAGGCGATGTATAACGAAATGGTGCTAGCTACAGGCGCGGGGTCCCGATCTGAAAATCGGGCGATGCGCATGGACGTACGCGAGCAAAGGGGCTCCCACCTCGGCC
CGACCGAGGAGGAACGTCCCGAAGACAACGGGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGCGAGCATCTCAACCGAAAGAGAGGCTCGTCTCTCCAAAAAGGG
CAGTCGCCATCCCGCTCCCACAGGAGCTCCTACCAGCAGGCTGAATCCTCTCAGAACCCCGCAGGGATGATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGA
TGCTCAAGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCGATCC
CCCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAAGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCA
ATCAAATGCCGCGCCTTCCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCCCAGCTGAGAAGGGAGTTCCT
CGCCCAGTTCTCTTCTCGGCACTACGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGGGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGG
AGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACAGGTCTAGTCGATGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACC
TTCGCTGAGGTGCTCCAGAAGGCAAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGACCGGGGCAGAAGTGGAAAAGA
TGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGCGGTCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCCTACGAGCGCTTCA
CCCCAACAACGATTCCAATCTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGT
AAGGACAAGTATTGCCGCTTTCATCGGGAGCACGGCCACAACACGTCAGACTGCTGGGAATTGAAGCGTCAAATTGAAGATCTGATTCAAGACGGCTACTTCAAGAAGTT
TGTGGGAAAGCCCGGGACCGGCTCAGCAGAGAAAAAGGAGGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATACCATCTTTGGAG
GGCCAAGCGGGGGTCAATCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCGACGCGAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCCAACATCCTGTCCTTACCG
ACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCTGACGCCGCTGGTTGGGTTCTCTGGAGAGTCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGT
CACGCTGGGGCAGAACCAAACCCGGATCACTGAAATGGCCGAGTTCGTGGTAGTTGACGGTAGGTCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCCTTTC
GGGCCATTCCTTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAACGGCGTGGGCACGGTCCGAGGAGAACAAGCCGCTTCGAGGGAGTGTTATGCCGCCGCACTC
AAAGGCCCATCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGT
TCCTCTGCTTAGTCCCGAGAAGCAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGAAGGGCAGCTCGGTTCGTGATCCGAGATGCGGCATTGTAC
CGACGTGGCTTTTCCCTGCCTCTGTTGAAATGCCTAACCTCTGA
Protein sequence
MPSVRAEVSPAQVRPTGEPGPPKWSGRSGGRVRATTRNTLFEHGSTSEFDQYDGSKDSGCQRCPPEGGRSSGGEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRG
RGGTSKKGARGPAPAPTSEDFDALQREMEAMRTQMRSMEAMYNEMVLATGAGSRSENRAMRMDVREQRGSHLGPTEEERPEDNGSEGYTRQRGDLREHLNRKRGSSLQKG
QSPSRSHRSSYQQAESSQNPAGMITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDA
IKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLVDEALTVKLGEEAPAT
FAEVLQKAKKVIDGQELLRTKTGRPERKIDRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRS
KDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAADARRVLVDGGASANILSLP
TYLALGWTRSQLKRSLTPLVGFSGESVIPEGCIDLPVTLGQNQTRITEMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYAAAL
KGPSVCALETLRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQGQLTTRPQGAQKVGKKGSSVRDPRCGIVPTWLFPASVEMPNL
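The CDS and protein sequences listed above can be checked against each other for length consistency. The following is a minimal sketch, not part of the database record: the function name `cds_matches_protein_length` is hypothetical, and it assumes the CDS includes its terminal stop codon (the listed CDS ends in TGA) and that line breaks in the listing are only for display.

```python
# Standard stop codons in the nuclear genetic code.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_matches_protein_length(cds, protein):
    """Return True if the CDS length is consistent with the protein length.

    Assumptions (labeled, not taken from the record):
    - whitespace and line breaks in either sequence are display-only;
    - the CDS ends with a stop codon that is not represented in the protein.
    """
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()
    if len(cds) % 3 != 0:
        return False
    if cds[-3:] not in STOP_CODONS:
        return False
    # One codon per residue, plus the terminal stop codon.
    return len(cds) // 3 == len(protein) + 1


# Toy input taken from the last five codons of the CDS and the last four
# residues of the protein listed above.
print(cds_matches_protein_length("ATGCCTAACCTCTGA", "MPNL"))
```

To check the full record, pass the complete CDS and protein strings; the function strips the line breaks before comparing lengths.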