; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g14550 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g14550
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr3:9797951..9803435
RNA-Seq ExpressionMoc03g14550
SyntenyMoc03g14550
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]1.3e-21878.98Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLEAPIPLKFKAPTVKPYDGTKDPKDYVE-------------
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDL ESPFTSDVLEAPIP KFKAPTVKPYDG+KDPKDYVE             
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLEAPIPLKFKAPTVKPYDGTKDPKDYVE-------------

Query:  --------------------------------LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
                                        LRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  --------------------------------LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPSTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP+TIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPSTIPISEILTNIE

Query:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPA------------
        +SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPA            
Subjt:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPA------------

Query:  -----------------------GPTCPITFDGADLEEVHLPYNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGFSEE
                                PTCPITFDGADLEEVHLP+NDALVIAPLIDHVVV RVLVD G SANILSLPTYLALGWTRSQLK+SPTPLVGFS E
Subjt:  -----------------------GPTCPITFDGADLEEVHLPYNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGFSEE

Query:  SVIPEGCIDLPVTLGQDQTRITQMAEFV
        SVIPEG IDLPVTLGQDQT++TQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTRITQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]4.9e-23473.06Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLEAPIPLKFKAPTVKPYDGTKDPKDYVELRREFLAQF
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDL ESPFTSDVLE        APTVK YDG+KDPKDYVE+    +   
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLEAPIPLKFKAPTVKPYDGTKDPKDYVELRREFLAQF

Query:  SSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIG
             D + A+     R  +          FQE+QLKVA  SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I 
Subjt:  SSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIG

Query:  RGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPSTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWE
        RGRSGKDE+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTP+TIPISEILTNIE+SGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WE
Subjt:  RGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPSTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWE

Query:  LKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPA-----------------------------------GPTCPITFDGADLEEVHLP
        LKRQIEDLIQD YFKKFVGKP TSSAEKKEERK SRTP RR DRPA                                    PTCPITFD ADLEEVHLP
Subjt:  LKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPA-----------------------------------GPTCPITFDGADLEEVHLP

Query:  YNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGFSEESVIPEGCIDLPVTLGQDQTRITQMAEFVVVDGRSAYNAIFGR
        +NDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGFS ESVIPEGCIDLPVTLG DQT++TQMAEFVV+DGRSAYNAIFGR
Subjt:  YNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGFSEESVIPEGCIDLPVTLGQDQTRITQMAEFVVVDGRSAYNAIFGR

Query:  PIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQIASRECYAAALKGSSVCALGTL--RDGTLEVEADLPRKEFAAPTKELELVPLLSPEKQTDLARSVPVE
        PIIHSFRAIPSTLHQVLKY T NGVG VRGEQIASRECYA+ALKGSSVCAL TL  RDGTLE +A+LPR+EFAAPT+ELELVPLL  +   ++     ++
Subjt:  PIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQIASRECYAAALKGSSVCALGTL--RDGTLEVEADLPRKEFAAPTKELELVPLLSPEKQTDLARSVPVE

Query:  ILDNPSILEPDLMEIGAPEP
           + + ++ D+   G PEP
Subjt:  ILDNPSILEPDLMEIGAPEP

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]2.2e-19482.37Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG+GRSGKD E  DPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP

Query:  STIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRP
        +TIPISEILTNIE+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRP
Subjt:  STIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRP

Query:  A-----------------------------------GPTCPITFDGADLEEVHLPYNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQL
        A                                    PTCPITFD ADL EVHLP+NDALVIAPLIDHVVVRRVLVD GASANILSLPTYLALGWTRSQL
Subjt:  A-----------------------------------GPTCPITFDGADLEEVHLPYNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQL

Query:  KRSPTPLVGFSEESVIPEGCIDLPVTLGQDQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQIASRECYAAALKG
        K+SPTPLVGFS ESV+PEGCIDLPVTLGQDQTR+TQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY T NGVGTVRGEQ ASRECYA+ LKG
Subjt:  KRSPTPLVGFSEESVIPEGCIDLPVTLGQDQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQIASRECYAAALKG

Query:  SSVCALGTL--RDGTLEVEADLPRKEFAAPTKELELVPLLSPEKQTDL
        +SVCAL TL  RDGTLE EADLP +EFAAP +ELELVPLLS EKQ  L
Subjt:  SSVCALGTL--RDGTLEVEADLPRKEFAAPTKELELVPLLSPEKQTDL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]7.0e-22558.43Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGATAVEGQGHDVLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L  EPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGATAVEGQGHDVLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRSMEAMYNEMVLAAGAGSRSENQATRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRSMEAMYNEMVLAAGAGSRSENQATRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLEAPIPLKFKAPTVKPYDGTKDPKDYVE------------------------
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDL E  F+SD+LEA IP KFK PT+KPYDG+KDPKDYVE                        
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLEAPIPLKFKAPTVKPYDGTKDPKDYVE------------------------

Query:  ---------------------LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
                             LR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  ---------------------LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPSTIPISEILTNIEDSGMEKLLKR
        FAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ +AD KS+DKG S SS R +YRR+ +   +SRPYE +TP+TIPI EILTNIE++GMEKLLKR
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPSTIPISEILTNIEDSGMEKLLKR

Query:  PEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPA----------------------
        PEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKP ++S EKKEERKR RTPPRR DRPA                      
Subjt:  PEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPA----------------------

Query:  GPTCPITFDGADLEEVHLPYNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGFSEESVIPEGCIDLPVTLGQDQTRITQ
         PT  I F+ ADLE VHLP+NDALVIAPLID V+VRR+LVD GASANILSL TYLALGWTRSQLK+SPTPLVGFS ES+  EGCIDLPV++ QD T++TQ
Subjt:  GPTCPITFDGADLEEVHLPYNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGFSEESVIPEGCIDLPVTLGQDQTRITQ

Query:  MAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQIASRECYAAALKGSSVCAL--GTLRD
        MAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY TLNGVGTVRGE   SRECYA+  K SSVCAL   T+RD
Subjt:  MAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQIASRECYAAALKGSSVCAL--GTLRD

XP_022157676.1 uncharacterized protein LOC111024332 [Momordica charantia]7.1e-17779.31Show/hide
Query:  LADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPSTIPISEI
        +ADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP+TIPISEI
Subjt:  LADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPSTIPISEI

Query:  LTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPA-------
        LTNIEDSGMEKLLKRPEKLRGAPERRSKDK                                       TSSAEKKEERKRSRTPPRRTDRPA       
Subjt:  LTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPA-------

Query:  ----------------------------GPTCPITFDGADLEEVHLPYNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLV
                                    GPTCPITFDGADLEEVHLP+NDALVIAPLIDHVVVRRVLVD GASANILSLPTYLALGWTRSQLKRSPTPLV
Subjt:  ----------------------------GPTCPITFDGADLEEVHLPYNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLV

Query:  GFSEESVIPEGCIDLPVTLGQDQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQIASRECYAAALKGSSVCALGT
        GFS ESVIPEGCIDLPVTLGQDQTR+TQM EFVVVDGRS YNAIFGRPIIHSFR IPSTLHQVLKY T NGVGTVRGEQ  SRECYAAALKGSSVCAL T
Subjt:  GFSEESVIPEGCIDLPVTLGQDQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQIASRECYAAALKGSSVCALGT

Query:  LRDGTLEVEADLPRKEFAAPTKELELVPLLSPEKQ
        LRDGTLE+EADLPRKEFAAPT+ELELVPLLSPEKQ
Subjt:  LRDGTLEVEADLPRKEFAAPTKELELVPLLSPEKQ

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088136.2e-21978.98Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLEAPIPLKFKAPTVKPYDGTKDPKDYVE-------------
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDL ESPFTSDVLEAPIP KFKAPTVKPYDG+KDPKDYVE             
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLEAPIPLKFKAPTVKPYDGTKDPKDYVE-------------

Query:  --------------------------------LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
                                        LRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  --------------------------------LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPSTIPISEILTNIE
        TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP+TIPISEILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPSTIPISEILTNIE

Query:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPA------------
        +SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPA            
Subjt:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPA------------

Query:  -----------------------GPTCPITFDGADLEEVHLPYNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGFSEE
                                PTCPITFDGADLEEVHLP+NDALVIAPLIDHVVV RVLVD G SANILSLPTYLALGWTRSQLK+SPTPLVGFS E
Subjt:  -----------------------GPTCPITFDGADLEEVHLPYNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGFSEE

Query:  SVIPEGCIDLPVTLGQDQTRITQMAEFV
        SVIPEG IDLPVTLGQDQT++TQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTRITQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188232.4e-23473.06Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLEAPIPLKFKAPTVKPYDGTKDPKDYVELRREFLAQF
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDL ESPFTSDVLE        APTVK YDG+KDPKDYVE+    +   
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLEAPIPLKFKAPTVKPYDGTKDPKDYVELRREFLAQF

Query:  SSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIG
             D + A+     R  +          FQE+QLKVA  SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I 
Subjt:  SSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIG

Query:  RGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPSTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWE
        RGRSGKDE+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTP+TIPISEILTNIE+SGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WE
Subjt:  RGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPSTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWE

Query:  LKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPA-----------------------------------GPTCPITFDGADLEEVHLP
        LKRQIEDLIQD YFKKFVGKP TSSAEKKEERK SRTP RR DRPA                                    PTCPITFD ADLEEVHLP
Subjt:  LKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPA-----------------------------------GPTCPITFDGADLEEVHLP

Query:  YNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGFSEESVIPEGCIDLPVTLGQDQTRITQMAEFVVVDGRSAYNAIFGR
        +NDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGFS ESVIPEGCIDLPVTLG DQT++TQMAEFVV+DGRSAYNAIFGR
Subjt:  YNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGFSEESVIPEGCIDLPVTLGQDQTRITQMAEFVVVDGRSAYNAIFGR

Query:  PIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQIASRECYAAALKGSSVCALGTL--RDGTLEVEADLPRKEFAAPTKELELVPLLSPEKQTDLARSVPVE
        PIIHSFRAIPSTLHQVLKY T NGVG VRGEQIASRECYA+ALKGSSVCAL TL  RDGTLE +A+LPR+EFAAPT+ELELVPLL  +   ++     ++
Subjt:  PIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQIASRECYAAALKGSSVCALGTL--RDGTLEVEADLPRKEFAAPTKELELVPLLSPEKQTDLARSVPVE

Query:  ILDNPSILEPDLMEIGAPEP
           + + ++ D+   G PEP
Subjt:  ILDNPSILEPDLMEIGAPEP

A0A6J1DD03 uncharacterized protein LOC1110198991.1e-19482.37Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT       KIG+GRSGKD E  DPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP

Query:  STIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRP
        +TIPISEILTNIE+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRP
Subjt:  STIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRP

Query:  A-----------------------------------GPTCPITFDGADLEEVHLPYNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQL
        A                                    PTCPITFD ADL EVHLP+NDALVIAPLIDHVVVRRVLVD GASANILSLPTYLALGWTRSQL
Subjt:  A-----------------------------------GPTCPITFDGADLEEVHLPYNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQL

Query:  KRSPTPLVGFSEESVIPEGCIDLPVTLGQDQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQIASRECYAAALKG
        K+SPTPLVGFS ESV+PEGCIDLPVTLGQDQTR+TQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKY T NGVGTVRGEQ ASRECYA+ LKG
Subjt:  KRSPTPLVGFSEESVIPEGCIDLPVTLGQDQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQIASRECYAAALKG

Query:  SSVCALGTL--RDGTLEVEADLPRKEFAAPTKELELVPLLSPEKQTDL
        +SVCAL TL  RDGTLE EADLP +EFAAP +ELELVPLLS EKQ  L
Subjt:  SSVCALGTL--RDGTLEVEADLPRKEFAAPTKELELVPLLSPEKQTDL

A0A6J1DHB3 uncharacterized protein LOC1110204793.4e-22558.43Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGATAVEGQGHDVLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L  EPL RSARIT P LPPAHP+ SK                                    
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGATAVEGQGHDVLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRSMEAMYNEMVLAAGAGSRSENQATRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--
                                                                                                   AESS+NP  
Subjt:  TQMRSMEAMYNEMVLAAGAGSRSENQATRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNP--

Query:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLEAPIPLKFKAPTVKPYDGTKDPKDYVE------------------------
         G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDL E  F+SD+LEA IP KFK PT+KPYDG+KDPKDYVE                        
Subjt:  AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLEAPIPLKFKAPTVKPYDGTKDPKDYVE------------------------

Query:  ---------------------LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT
                             LR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPAT
Subjt:  ---------------------LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPAT

Query:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPSTIPISEILTNIEDSGMEKLLKR
        FAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ +AD KS+DKG S SS R +YRR+ +   +SRPYE +TP+TIPI EILTNIE++GMEKLLKR
Subjt:  FAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPSTIPISEILTNIEDSGMEKLLKR

Query:  PEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPA----------------------
        PEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKP ++S EKKEERKR RTPPRR DRPA                      
Subjt:  PEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPA----------------------

Query:  GPTCPITFDGADLEEVHLPYNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGFSEESVIPEGCIDLPVTLGQDQTRITQ
         PT  I F+ ADLE VHLP+NDALVIAPLID V+VRR+LVD GASANILSL TYLALGWTRSQLK+SPTPLVGFS ES+  EGCIDLPV++ QD T++TQ
Subjt:  GPTCPITFDGADLEEVHLPYNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGFSEESVIPEGCIDLPVTLGQDQTRITQ

Query:  MAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQIASRECYAAALKGSSVCAL--GTLRD
        MAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY TLNGVGTVRGE   SRECYA+  K SSVCAL   T+RD
Subjt:  MAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQIASRECYAAALKGSSVCAL--GTLRD

A0A6J1DYW5 uncharacterized protein LOC1110243323.5e-17779.31Show/hide
Query:  LADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPSTIPISEI
        +ADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTP+TIPISEI
Subjt:  LADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPSTIPISEI

Query:  LTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPA-------
        LTNIEDSGMEKLLKRPEKLRGAPERRSKDK                                       TSSAEKKEERKRSRTPPRRTDRPA       
Subjt:  LTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPA-------

Query:  ----------------------------GPTCPITFDGADLEEVHLPYNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLV
                                    GPTCPITFDGADLEEVHLP+NDALVIAPLIDHVVVRRVLVD GASANILSLPTYLALGWTRSQLKRSPTPLV
Subjt:  ----------------------------GPTCPITFDGADLEEVHLPYNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLV

Query:  GFSEESVIPEGCIDLPVTLGQDQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQIASRECYAAALKGSSVCALGT
        GFS ESVIPEGCIDLPVTLGQDQTR+TQM EFVVVDGRS YNAIFGRPIIHSFR IPSTLHQVLKY T NGVGTVRGEQ  SRECYAAALKGSSVCAL T
Subjt:  GFSEESVIPEGCIDLPVTLGQDQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQIASRECYAAALKGSSVCALGT

Query:  LRDGTLEVEADLPRKEFAAPTKELELVPLLSPEKQ
        LRDGTLE+EADLPRKEFAAPT+ELELVPLLSPEKQ
Subjt:  LRDGTLEVEADLPRKEFAAPTKELELVPLLSPEKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAACAGCGGTAGAGGGGCAAGGT
CACGACGTCTTAGCAGCGGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGCCCTACCGCCCGCACACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGT
GGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCC
ATGGAGGCAATGTATAACGAAATGGTGCTAGCTGCAGGTGCAGGGTCCCGATCTGAAAATCAAGCGACGCGCATGGACGTACGCGAGCAAAGGGGATCCCACCTC
GGCCCAGCCGAGGAAGAACGCCCCGAAGACAATGAGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTC
CGAAAAGGGCAGTCACCATCCCGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTA
AGGGGGGAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGAGAGAGTCGCCTTTCACCTCGGAC
GTCTTGGAAGCACCAATCCCTCTGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGATCCCAAGGACTATGTTGAGCTGAGAAGGGAGTTCCTC
GCCCAGTTCTCTTCTCGGCACTACGACAAAAAGACAGCGACCCATCTAGCCACCATCAGGCAGAAGGAGGGGGAGACGCTGCGGGAGTATGTCACCAGATTCCAG
GAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCC
CCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCAAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAAATCGGCCGGGGC
AGAAGTGGAAAGGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGC
CGACCGTACGAGCGCTTCACCCCATCCACGATTCCAATTTCCGAGATCTTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTT
CGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAAGAT
CTAATTCAAGACGGCTACTTCAAAAAGTTTGTGGGAAAGCCCGGGACCAGCTCAGCAGAGAAAAAGGAGGAGCGAAAGCGTTCAAGGACGCCACCTCGGCGCACC
GACCGACCTGCGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCTACAATGATGCCCTTGTGATTGCTCCCTTGATTGAT
CATGTGGTGGTTAGGAGAGTGCTGGTAGATAGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGA
AGCCCGACGCCGCTGGTTGGGTTCTCTGAAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCAAACTCGGATCACTCAAATG
GCCGAGTTCGTGGTAGTTGACGGTAGGTCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCTTCAACACTTCATCAAGTTTTG
AAGTATCCCACCCTCAATGGCGTGGGCACGGTCCGAGGAGAACAAATCGCTTCGAGGGAATGTTATGCCGCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGGA
ACTCTCAGGGATGGGACGCTCGAGGTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTAAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAG
CAGACCGACCTCGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAACCAGATCTGATGGAGATCGGCGCTCCAGAACCCTCATGGATGGAC
CCGATCACGGACTTCATTAGGGGTAACTCACCACGAGATCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGGCAGGATGGC
CAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACATTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAAGGC
CCATTTGAAATCAAGGACATAGTCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGATGTCCTCGCGCACCCGTGGAACGCGGAACACCTGAAGCGTTAT
TATCCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAACAGCGGTAGAGGGGCAAGGT
CACGACGTCTTAGCAGCGGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGCCCTACCGCCCGCACACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGT
GGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCC
ATGGAGGCAATGTATAACGAAATGGTGCTAGCTGCAGGTGCAGGGTCCCGATCTGAAAATCAAGCGACGCGCATGGACGTACGCGAGCAAAGGGGATCCCACCTC
GGCCCAGCCGAGGAAGAACGCCCCGAAGACAATGAGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTC
CGAAAAGGGCAGTCACCATCCCGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTA
AGGGGGGAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGAGAGAGTCGCCTTTCACCTCGGAC
GTCTTGGAAGCACCAATCCCTCTGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGATCCCAAGGACTATGTTGAGCTGAGAAGGGAGTTCCTC
GCCCAGTTCTCTTCTCGGCACTACGACAAAAAGACAGCGACCCATCTAGCCACCATCAGGCAGAAGGAGGGGGAGACGCTGCGGGAGTATGTCACCAGATTCCAG
GAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCC
CCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCAAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAAATCGGCCGGGGC
AGAAGTGGAAAGGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGC
CGACCGTACGAGCGCTTCACCCCATCCACGATTCCAATTTCCGAGATCTTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTT
CGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAAGAT
CTAATTCAAGACGGCTACTTCAAAAAGTTTGTGGGAAAGCCCGGGACCAGCTCAGCAGAGAAAAAGGAGGAGCGAAAGCGTTCAAGGACGCCACCTCGGCGCACC
GACCGACCTGCGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCTACAATGATGCCCTTGTGATTGCTCCCTTGATTGAT
CATGTGGTGGTTAGGAGAGTGCTGGTAGATAGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGA
AGCCCGACGCCGCTGGTTGGGTTCTCTGAAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCAAACTCGGATCACTCAAATG
GCCGAGTTCGTGGTAGTTGACGGTAGGTCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCTTCAACACTTCATCAAGTTTTG
AAGTATCCCACCCTCAATGGCGTGGGCACGGTCCGAGGAGAACAAATCGCTTCGAGGGAATGTTATGCCGCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGGA
ACTCTCAGGGATGGGACGCTCGAGGTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTAAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAG
CAGACCGACCTCGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAACCAGATCTGATGGAGATCGGCGCTCCAGAACCCTCATGGATGGAC
CCGATCACGGACTTCATTAGGGGTAACTCACCACGAGATCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGGCAGGATGGC
CAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACATTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAAGGC
CCATTTGAAATCAAGGACATAGTCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGATGTCCTCGCGCACCCGTGGAACGCGGAACACCTGAAGCGTTAT
TATCCCTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTTDRRTLAASDAHQREVGATAVEGQGHDVLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRS
MEAMYNEMVLAAGAGSRSENQATRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQL
RGELDAQVEALKAKCEQKDDSLNDGDLRESPFTSDVLEAPIPLKFKAPTVKPYDGTKDPKDYVELRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQ
EEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRS
RPYERFTPSTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRT
DRPAGPTCPITFDGADLEEVHLPYNDALVIAPLIDHVVVRRVLVDRGASANILSLPTYLALGWTRSQLKRSPTPLVGFSEESVIPEGCIDLPVTLGQDQTRITQM
AEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQIASRECYAAALKGSSVCALGTLRDGTLEVEADLPRKEFAAPTKELELVPLLSPEK
QTDLARSVPVEILDNPSILEPDLMEIGAPEPSWMDPITDFIRGNSPRDPKERRKLARRAARFVVRDGQDGQTLQCPRSTSGLSGRTLVLRRVQTHVGALDPAWEG
PFEIKDIVRPGTYVLADLKGDVLAHPWNAEHLKRYYP