; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g01450 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g01450
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:1186246..1190789
RNA-Seq ExpressionMoc07g01450
SyntenyMoc07g01450
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]1.0e-22279.55Show/hide
Query:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEAPLNDGDLGESPFTSNVLEAPIPLKFKAPTVKPYDGRRTP------------------
        +AESS NPATPAGVITR EFDQLRG+LDAQVEALKAKCEQKE PLNDGDLGESPFTS+VLEAPIP KFKAPTVKPYDG + P                  
Subjt:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEAPLNDGDLGESPFTSNVLEAPIPLKFKAPTVKPYDGRRTP------------------

Query:  -------RIMLRSLKASWISKRHQTQ-SNVVPFRSRLLAARDCGIEDCQPATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
               RI L      W  +      S     R   LA       D + ATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  -------RIMLRSLKASWISKRHQTQ-SNVVPFRSRLLAARDCGIEDCQPATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAEKGPTRSRPYERFTPTTVPISEILTNIE
        TVKLGEEAPATF EVLQKAKKVIDGQELLRTKTG+PERKIGRGRSGKDIE ADPKSKDKGSFSSGRAEYRRAE GPTRSRPYERFTPTT+PISEILTNIE
Subjt:  TVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAEKGPTRSRPYERFTPTTVPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRLRTPPRRTDRPAVINTIFGGQAG-
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKR RTPPRRTDRPAVINTIFGG +G 
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRLRTPPRRTDRPAVINTIFGGQAG-

Query:  -----------------------RPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRE
                               RPTCPITFD ADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFS E
Subjt:  -----------------------RPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRE

Query:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV
        SVIPEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]1.7e-24477.02Show/hide
Query:  SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEAPLNDGDLGESPFTSNVLEAPIPLKFKAPTVKPYDGRRTPRIMLRSLKASWISK
        SSNQQAESSHNPATP GVITR EFDQLRGKL+AQVEALKAKCEQKE PLNDGDLGESPFTS+VLE        APTVK YDG + P+  +   +      
Subjt:  SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEAPLNDGDLGESPFTSNVLEAPIPLKFKAPTVKPYDGRRTPRIMLRSLKASWISK

Query:  RHQTQSNVVPFRSRLLAARDCGIEDCQPATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKV
                             G+ D Q A+     R  +          FQE+QLKVA  SDDSAMCYFLTGLADEALTVKLG+EAPATF EVLQKAKKV
Subjt:  RHQTQSNVVPFRSRLLAARDCGIEDCQPATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKV

Query:  IDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAEKGPTRSRPYERFTPTTVPISEILTNIEESGMEKLLKRPEKLRGAPERRS
        IDGQELLRTKTG+PER I RGRSGKD EKAD KSKDKGSFSSGRAE+RRA  GPTRSRPYERFTPTT+PISEILTNIEESGMEKLLKRPEKLRGAPERR+
Subjt:  IDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAEKGPTRSRPYERFTPTTVPISEILTNIEESGMEKLLKRPEKLRGAPERRS

Query:  KDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRLRTPPRRTDRPAVINTIFGGQAG-----------------------
        KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK  RTP RR DRPAVINTIFGG +G                       
Subjt:  KDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRLRTPPRRTDRPAVINTIFGGQAG-----------------------

Query:  -RPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVIPEGCIDLPVTLGQDQTQVT
         RPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFSRESVIPEGCIDLPVTLG DQTQVT
Subjt:  -RPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVIPEGCIDLPVTLGQDQTQVT

Query:  QMAEFVVIDGRSAYNAIFGRPVIHSLRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIETLASGDGTLEFEADLPRKEFAAPTEELE
        QMAEFVVIDGRSAYNAIFGRP+IHS RAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCA+ETL S DGTLEF+A+LPR+EFAAPTEELE
Subjt:  QMAEFVVIDGRSAYNAIFGRPVIHSLRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIETLASGDGTLEFEADLPRKEFAAPTEELE

Query:  LVPLL
        LVPLL
Subjt:  LVPLL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]3.0e-20985.91Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAEKGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATF EVLQKAKKVIDGQELLRT       KIG+GRSGKD+E  DPKSKDKGSFS+GRAEYRRAE GPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAEKGPTRSRPYERFTP

Query:  TTVPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRLRTPPRRTDRP
        TT+PISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKR RTPPRRTDRP
Subjt:  TTVPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRLRTPPRRTDRP

Query:  AVINTIFGGQAG------------------------RPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGG +G                        RPTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGQAG------------------------RPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KKSPTPLVGFSRESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPVIHSLRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG
        KKSPTPLVGFS ESV+PEGCIDLPVTLGQDQT+VTQMAEFVV+DGRSAYNAIFGRP+IHS RAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAS LKG
Subjt:  KKSPTPLVGFSRESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPVIHSLRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG

Query:  SSVCAIETLASGDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQGQ
        +SVCA+ETL S DGTLEFEADLP +EFAAP EELELVPLLS EKQ Q
Subjt:  SSVCAIETLASGDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQGQ

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]2.8e-22358.64Show/hide
Query:  MVQPANSTNTANRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTFKATRGRGGTSKKGARGLAPAPTSENFDALQRGMEAM
        MVQPANSTNTA+RR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAHP                                        
Subjt:  MVQPANSTNTANRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTFKATRGRGGTSKKGARGLAPAPTSENFDALQRGMEAM

Query:  CTQMRSMEEMYNEMILAAGAGSRSENRMTHIDIREQRGSHLGPVEEEHPEDNESEGHTRRGGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPA
                                                                                         PS+        AESS+NP 
Subjt:  CTQMRSMEEMYNEMILAAGAGSRSENRMTHIDIREQRGSHLGPVEEEHPEDNESEGHTRRGGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPA

Query:  TPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEAPLNDGDLGESPFTSNVLEAPIPLKFKAPTVKPYDGRRTP---------------------------
        TP GVITR EFDQL+ K DAQVEALKA+CE+KE+  +DGDLGE  F+S++LEA IP KFK PT+KPYDG + P                           
Subjt:  TPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEAPLNDGDLGESPFTSNVLEAPIPLKFKAPTVKPYDGRRTP---------------------------

Query:  ------RIMLRSLKASWISKRHQTQSNVV-PFRSRLLAARDCGIEDCQPATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALT
              R+  R L A  IS   Q +   +  F SR          D +  THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LT
Subjt:  ------RIMLRSLKASWISKRHQTQSNVV-PFRSRLLAARDCGIEDCQPATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALT

Query:  VKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAEKGPTRSRPYERFTPTTVPISEILTNIE
        VKL EEAPATF EVLQK KKVIDGQELLRTKTG+PE+ I +GR+GKD  KAD KS+DKG S SS R +YRR+     +SRPYE +TPTT+PI EILTNIE
Subjt:  VKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAEKGPTRSRPYERFTPTTVPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRLRTPPRRTDRPAVIN---------
        E+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRLRTPPRR DRPAVIN         
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRLRTPPRRTDRPAVIN---------

Query:  --TIFGGQAGRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVIPEGCIDLPVT
           +   +  RPT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFS ES+  EGCIDLPV+
Subjt:  --TIFGGQAGRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVIPEGCIDLPVT

Query:  LGQDQTQVTQMAEFVVIDGRSAYNAIFGRPVIHSLRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIE
        + QD TQVTQMAEFVVIDGRSAYNAIFGRP+IHS RA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVCA+E
Subjt:  LGQDQTQVTQMAEFVVIDGRSAYNAIFGRPVIHSLRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIE

XP_022157676.1 uncharacterized protein LOC111024332 [Momordica charantia]1.2e-17879Show/hide
Query:  LADEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAEKGPTRSRPYERFTPTTVPISE
        +ADEALTVKLGEEAPATF EVLQKAKKVIDGQELLRTKTG+PERKIGRGRSGKD E+ADPKSKDKGSFSSGRAEYRRAE GPTRSRPYERFTPTT+PISE
Subjt:  LADEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAEKGPTRSRPYERFTPTTVPISE

Query:  ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRLRTPPRRTDRPAVINTIF
        ILTNIE+SGMEKLLKRPEKLRGAPERRSKDK                                       TSSAEKKEERKR RTPPRRTDRPAVINTIF
Subjt:  ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRLRTPPRRTDRPAVINTIF

Query:  GGQAG------------------------RPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL
        GG +G                         PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLK+SPTPL
Subjt:  GGQAG------------------------RPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL

Query:  VGFSRESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPVIHSLRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIE
        VGFS ESVIPEGCIDLPVTLGQDQT+VTQM EFVV+DGRS YNAIFGRP+IHS R IPSTLHQVLKYSTPNGVGTVRGEQT SRECYA+ALKGSSVCA+E
Subjt:  VGFSRESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPVIHSLRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIE

Query:  TLASGDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQ
        TL   DGTLE EADLPRKEFAAPTEELELVPLLSPEKQ
Subjt:  TLASGDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQ

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088135.1e-22379.55Show/hide
Query:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEAPLNDGDLGESPFTSNVLEAPIPLKFKAPTVKPYDGRRTP------------------
        +AESS NPATPAGVITR EFDQLRG+LDAQVEALKAKCEQKE PLNDGDLGESPFTS+VLEAPIP KFKAPTVKPYDG + P                  
Subjt:  QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEAPLNDGDLGESPFTSNVLEAPIPLKFKAPTVKPYDGRRTP------------------

Query:  -------RIMLRSLKASWISKRHQTQ-SNVVPFRSRLLAARDCGIEDCQPATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
               RI L      W  +      S     R   LA       D + ATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  -------RIMLRSLKASWISKRHQTQ-SNVVPFRSRLLAARDCGIEDCQPATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAEKGPTRSRPYERFTPTTVPISEILTNIE
        TVKLGEEAPATF EVLQKAKKVIDGQELLRTKTG+PERKIGRGRSGKDIE ADPKSKDKGSFSSGRAEYRRAE GPTRSRPYERFTPTT+PISEILTNIE
Subjt:  TVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAEKGPTRSRPYERFTPTTVPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRLRTPPRRTDRPAVINTIFGGQAG-
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERKR RTPPRRTDRPAVINTIFGG +G 
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRLRTPPRRTDRPAVINTIFGGQAG-

Query:  -----------------------RPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRE
                               RPTCPITFD ADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFS E
Subjt:  -----------------------RPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRE

Query:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV
        SVIPEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188238.0e-24577.02Show/hide
Query:  SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEAPLNDGDLGESPFTSNVLEAPIPLKFKAPTVKPYDGRRTPRIMLRSLKASWISK
        SSNQQAESSHNPATP GVITR EFDQLRGKL+AQVEALKAKCEQKE PLNDGDLGESPFTS+VLE        APTVK YDG + P+  +   +      
Subjt:  SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEAPLNDGDLGESPFTSNVLEAPIPLKFKAPTVKPYDGRRTPRIMLRSLKASWISK

Query:  RHQTQSNVVPFRSRLLAARDCGIEDCQPATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKV
                             G+ D Q A+     R  +          FQE+QLKVA  SDDSAMCYFLTGLADEALTVKLG+EAPATF EVLQKAKKV
Subjt:  RHQTQSNVVPFRSRLLAARDCGIEDCQPATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKV

Query:  IDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAEKGPTRSRPYERFTPTTVPISEILTNIEESGMEKLLKRPEKLRGAPERRS
        IDGQELLRTKTG+PER I RGRSGKD EKAD KSKDKGSFSSGRAE+RRA  GPTRSRPYERFTPTT+PISEILTNIEESGMEKLLKRPEKLRGAPERR+
Subjt:  IDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAEKGPTRSRPYERFTPTTVPISEILTNIEESGMEKLLKRPEKLRGAPERRS

Query:  KDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRLRTPPRRTDRPAVINTIFGGQAG-----------------------
        KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPRTSSAEKKEERK  RTP RR DRPAVINTIFGG +G                       
Subjt:  KDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRLRTPPRRTDRPAVINTIFGGQAG-----------------------

Query:  -RPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVIPEGCIDLPVTLGQDQTQVT
         RPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFSRESVIPEGCIDLPVTLG DQTQVT
Subjt:  -RPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVIPEGCIDLPVTLGQDQTQVT

Query:  QMAEFVVIDGRSAYNAIFGRPVIHSLRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIETLASGDGTLEFEADLPRKEFAAPTEELE
        QMAEFVVIDGRSAYNAIFGRP+IHS RAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCA+ETL S DGTLEF+A+LPR+EFAAPTEELE
Subjt:  QMAEFVVIDGRSAYNAIFGRPVIHSLRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIETLASGDGTLEFEADLPRKEFAAPTEELE

Query:  LVPLL
        LVPLL
Subjt:  LVPLL

A0A6J1DD03 uncharacterized protein LOC1110198991.4e-20985.91Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAEKGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATF EVLQKAKKVIDGQELLRT       KIG+GRSGKD+E  DPKSKDKGSFS+GRAEYRRAE GPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAEKGPTRSRPYERFTP

Query:  TTVPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRLRTPPRRTDRP
        TT+PISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKR RTPPRRTDRP
Subjt:  TTVPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRLRTPPRRTDRP

Query:  AVINTIFGGQAG------------------------RPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGG +G                        RPTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGQAG------------------------RPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KKSPTPLVGFSRESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPVIHSLRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG
        KKSPTPLVGFS ESV+PEGCIDLPVTLGQDQT+VTQMAEFVV+DGRSAYNAIFGRP+IHS RAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAS LKG
Subjt:  KKSPTPLVGFSRESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPVIHSLRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKG

Query:  SSVCAIETLASGDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQGQ
        +SVCA+ETL S DGTLEFEADLP +EFAAP EELELVPLLS EKQ Q
Subjt:  SSVCAIETLASGDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQGQ

A0A6J1DHB3 uncharacterized protein LOC1110204791.3e-22358.64Show/hide
Query:  MVQPANSTNTANRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTFKATRGRGGTSKKGARGLAPAPTSENFDALQRGMEAM
        MVQPANSTNTA+RR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAHP                                        
Subjt:  MVQPANSTNTANRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTFKATRGRGGTSKKGARGLAPAPTSENFDALQRGMEAM

Query:  CTQMRSMEEMYNEMILAAGAGSRSENRMTHIDIREQRGSHLGPVEEEHPEDNESEGHTRRGGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPA
                                                                                         PS+        AESS+NP 
Subjt:  CTQMRSMEEMYNEMILAAGAGSRSENRMTHIDIREQRGSHLGPVEEEHPEDNESEGHTRRGGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPA

Query:  TPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEAPLNDGDLGESPFTSNVLEAPIPLKFKAPTVKPYDGRRTP---------------------------
        TP GVITR EFDQL+ K DAQVEALKA+CE+KE+  +DGDLGE  F+S++LEA IP KFK PT+KPYDG + P                           
Subjt:  TPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEAPLNDGDLGESPFTSNVLEAPIPLKFKAPTVKPYDGRRTP---------------------------

Query:  ------RIMLRSLKASWISKRHQTQSNVV-PFRSRLLAARDCGIEDCQPATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALT
              R+  R L A  IS   Q +   +  F SR          D +  THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LT
Subjt:  ------RIMLRSLKASWISKRHQTQSNVV-PFRSRLLAARDCGIEDCQPATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALT

Query:  VKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAEKGPTRSRPYERFTPTTVPISEILTNIE
        VKL EEAPATF EVLQK KKVIDGQELLRTKTG+PE+ I +GR+GKD  KAD KS+DKG S SS R +YRR+     +SRPYE +TPTT+PI EILTNIE
Subjt:  VKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAEKGPTRSRPYERFTPTTVPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRLRTPPRRTDRPAVIN---------
        E+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRLRTPPRR DRPAVIN         
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRLRTPPRRTDRPAVIN---------

Query:  --TIFGGQAGRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVIPEGCIDLPVT
           +   +  RPT  I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFS ES+  EGCIDLPV+
Subjt:  --TIFGGQAGRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVIPEGCIDLPVT

Query:  LGQDQTQVTQMAEFVVIDGRSAYNAIFGRPVIHSLRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIE
        + QD TQVTQMAEFVVIDGRSAYNAIFGRP+IHS RA+PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVCA+E
Subjt:  LGQDQTQVTQMAEFVVIDGRSAYNAIFGRPVIHSLRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIE

A0A6J1DYW5 uncharacterized protein LOC1110243325.9e-17979Show/hide
Query:  LADEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAEKGPTRSRPYERFTPTTVPISE
        +ADEALTVKLGEEAPATF EVLQKAKKVIDGQELLRTKTG+PERKIGRGRSGKD E+ADPKSKDKGSFSSGRAEYRRAE GPTRSRPYERFTPTT+PISE
Subjt:  LADEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAEKGPTRSRPYERFTPTTVPISE

Query:  ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRLRTPPRRTDRPAVINTIF
        ILTNIE+SGMEKLLKRPEKLRGAPERRSKDK                                       TSSAEKKEERKR RTPPRRTDRPAVINTIF
Subjt:  ILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRLRTPPRRTDRPAVINTIF

Query:  GGQAG------------------------RPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL
        GG +G                         PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLK+SPTPL
Subjt:  GGQAG------------------------RPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPL

Query:  VGFSRESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPVIHSLRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIE
        VGFS ESVIPEGCIDLPVTLGQDQT+VTQM EFVV+DGRS YNAIFGRP+IHS R IPSTLHQVLKYSTPNGVGTVRGEQT SRECYA+ALKGSSVCA+E
Subjt:  VGFSRESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPVIHSLRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIE

Query:  TLASGDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQ
        TL   DGTLE EADLPRKEFAAPTEELELVPLLSPEKQ
Subjt:  TLASGDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAAATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTTCTACCACCTGCGCACCCCCCAAGGACATTTAAGGCCACCCGTGGCCGAGGTGGAACCT
CTAAGAAGGGCGCCCGGGGTCTAGCTCCGGCTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGGAATGGAGGCAATGTGCACACAAATGCGGTCCATGGAGGAAATG
TATAACGAGATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCACATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGA
ACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCGGGGGGGAGACCTCCGTGAACATCTCAACAGAAAGAGAGGCTCATCTCTCCGGAAAGGACAGTCACCATCTC
GCTCACACCGGAGCTCCAACCAGCAGGCCGAATCCTCTCACAATCCAGCAACTCCCGCAGGGGTGATCACAAGGGCGGAGTTCGACCAACTGAGGGGCAAGCTCGACGCT
CAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGCTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGAACGTTTTGGAAGCACCGATCCCTCT
AAAGTTCAAAGCTCCTACTGTAAAGCCTTATGATGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAA
ATGTCGTGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGAAGACTGCCAGCCAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTG
CGAGAATATGTCACCAGATTCCAGGAGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGT
GAAACTTGGAGAGGAGGCCCCGGCCACCTTCGTCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCAACCAGAACGAA
AGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCAGAGAAGGGA
CCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGGTTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAATTACTCAAGCGTCCTGA
GAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGG
ATCTAATTCAAGATGGTTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGGCAGAGAAAAAAGAAGAGCGGAAGCGTTTGAGGACGCCGCCCCGGCGCACTGAC
CGACCTGCGGTCATCAATACCATTTTCGGAGGGCAAGCGGGGAGGCCGACCTGCCCAATCACCTTCGACAGTGCAGACTTAGAGGAGGTCCACCTGCCCCACAATGATGC
ACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGATGGAGGCGCATCTGCTAACATCTTGTCCTTACCGACCTACCTCGCCCTGGGATGGA
CGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTAGAGAATCGGTCATCCCAGAAGGTTGCATCGACTTGCCGGTCACACTTGGGCAGGACCAAACT
CAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCGTCATCCACTCATTGCGGGCCATTCCCTCAACACTTCA
TCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCTCCGCGCTCAAAGGCTCATCGGTCTGCGCCA
TCGAAACTCTCGCCAGTGGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGCCCC
GAGAAGCAAGGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGTACCGACGCGGCTTTTCCC
TGCCTCTATTGAGATGCCTAACCCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAAATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTTCTACCACCTGCGCACCCCCCAAGGACATTTAAGGCCACCCGTGGCCGAGGTGGAACCT
CTAAGAAGGGCGCCCGGGGTCTAGCTCCGGCTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGGAATGGAGGCAATGTGCACACAAATGCGGTCCATGGAGGAAATG
TATAACGAGATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCACATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGA
ACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCGGGGGGGAGACCTCCGTGAACATCTCAACAGAAAGAGAGGCTCATCTCTCCGGAAAGGACAGTCACCATCTC
GCTCACACCGGAGCTCCAACCAGCAGGCCGAATCCTCTCACAATCCAGCAACTCCCGCAGGGGTGATCACAAGGGCGGAGTTCGACCAACTGAGGGGCAAGCTCGACGCT
CAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGCTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGAACGTTTTGGAAGCACCGATCCCTCT
AAAGTTCAAAGCTCCTACTGTAAAGCCTTATGATGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAA
ATGTCGTGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGAAGACTGCCAGCCAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTG
CGAGAATATGTCACCAGATTCCAGGAGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGT
GAAACTTGGAGAGGAGGCCCCGGCCACCTTCGTCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCAACCAGAACGAA
AGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCAGAGAAGGGA
CCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGGTTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAATTACTCAAGCGTCCTGA
GAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGG
ATCTAATTCAAGATGGTTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGGCAGAGAAAAAAGAAGAGCGGAAGCGTTTGAGGACGCCGCCCCGGCGCACTGAC
CGACCTGCGGTCATCAATACCATTTTCGGAGGGCAAGCGGGGAGGCCGACCTGCCCAATCACCTTCGACAGTGCAGACTTAGAGGAGGTCCACCTGCCCCACAATGATGC
ACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGATGGAGGCGCATCTGCTAACATCTTGTCCTTACCGACCTACCTCGCCCTGGGATGGA
CGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTAGAGAATCGGTCATCCCAGAAGGTTGCATCGACTTGCCGGTCACACTTGGGCAGGACCAAACT
CAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCGTCATCCACTCATTGCGGGCCATTCCCTCAACACTTCA
TCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCTCCGCGCTCAAAGGCTCATCGGTCTGCGCCA
TCGAAACTCTCGCCAGTGGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGCCCC
GAGAAGCAAGGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGTACCGACGCGGCTTTTCCC
TGCCTCTATTGAGATGCCTAACCCCTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTANRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTFKATRGRGGTSKKGARGLAPAPTSENFDALQRGMEAMCTQMRSMEEM
YNEMILAAGAGSRSENRMTHIDIREQRGSHLGPVEEEHPEDNESEGHTRRGGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDA
QVEALKAKCEQKEAPLNDGDLGESPFTSNVLEAPIPLKFKAPTVKPYDGRRTPRIMLRSLKASWISKRHQTQSNVVPFRSRLLAARDCGIEDCQPATHLATIRQKEGETL
REYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKVIDGQELLRTKTGQPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAEKG
PTRSRPYERFTPTTVPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRLRTPPRRTD
RPAVINTIFGGQAGRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSRESVIPEGCIDLPVTLGQDQT
QVTQMAEFVVIDGRSAYNAIFGRPVIHSLRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCAIETLASGDGTLEFEADLPRKEFAAPTEELELVPLLSP
EKQGQFTTRPQGAQKVGKASSSVRGPRWSIVPTRLFPASIEMPNP