; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g35280 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g35280
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr8:25875461..25880628
RNA-Seq ExpressionMoc08g35280
SyntenyMoc08g35280
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]5.1e-20976.52Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGES-----------------------------------------------
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGES                                               
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGES-----------------------------------------------

Query:  -------------------RLLAARDCGT----------------GDCQPATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
                           R L A    T                 D + ATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  -------------------RLLAARDCGT----------------GDCQPATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDRQELLRTKTGRPEFKIGRGRSGKD-ERADPKSKDKGSFSSGRPEYRRVENGPTRSRPYERFTPTTIPISKILTNIE
        TVKLGEEAPATFAEVLQKAKKVID QELLRTKTGRPE KIGRGRSGKD E ADPKSKDKGSFSSGR EYRR ENGPTRSRPYERFTPTTIPIS+ILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDRQELLRTKTGRPEFKIGRGRSGKD-ERADPKSKDKGSFSSGRPEYRRVENGPTRSRPYERFTPTTIPISKILTNIE

Query:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKSSRTPPRRTDRPAVINTIFGGPSGG
        +SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERK SRTPPRRTDRPAVINTIFGGPSGG
Subjt:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKSSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRNELARAARREVCIIKEQGPTCPITFDGADLEDVHLPHNDALVIALLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
        QSG KR ELARAARREVCII+EQ PTCPITFDGADLE+VHLPHNDALVIA LIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRNELARAARREVCIIKEQGPTCPITFDGADLEDVHLPHNDALVIALLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV
        SVIPEG IDLPVTLGQDQT+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]5.8e-24576.64Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESRLLA-------------ARDC--------GTGDCQPATHLATI
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGES   +             ++D         G  D Q A+     
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESRLLA-------------ARDC--------GTGDCQPATHLATI

Query:  RQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDRQELLRTKTGRPEFKIGRGRSGKDERADPKSKD
        R  +          FQE+QLKVA  SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVID QELLRTKTGRPE  I RGRSGKDE+AD KSKD
Subjt:  RQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDRQELLRTKTGRPEFKIGRGRSGKDERADPKSKD

Query:  KGSFSSGRPEYRRVENGPTRSRPYERFTPTTIPISKILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKK
        KGSFSSGR E+RR  NGPTRSRPYERFTPTTIPIS+ILTNIE+SGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKK
Subjt:  KGSFSSGRPEYRRVENGPTRSRPYERFTPTTIPISKILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKK

Query:  FVGKPRTSSAEKKEERKSSRTPPRRTDRPAVINTIFGGPSGGQSGHKRNELARAARREVCIIKEQGPTCPITFDGADLEDVHLPHNDALVIALLIDHVVV
        FVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKR ELARAARREVCII+EQ PTCPITFD ADLE+VHLPHNDALVIA LIDHVVV
Subjt:  FVGKPRTSSAEKKEERKSSRTPPRRTDRPAVINTIFGGPSGGQSGHKRNELARAARREVCIIKEQGPTCPITFDGADLEDVHLPHNDALVIALLIDHVVV

Query:  RRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQV
        RRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGFS ESVIPEGCIDLPVTLG DQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQV
Subjt:  RRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQV

Query:  LKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAFETL--KDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE-
        LKYSTPNGVG VRGEQ ASRECYA+ALKGSSVCA ETL  +DGTLEF+A+LPR+EFAAPTEELELVPLL  +      +E +L     +  +D+   +E 
Subjt:  LKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAFETL--KDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE-

Query:  -PDLMEIG
         P+ + +G
Subjt:  -PDLMEIG

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]4.1e-22290.36Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDRQELLRTKTGRPEFKIGRGRSGKD-ERADPKSKDKGSFSSGRPEYRRVENGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVID QELLRT       KIG+GRSGKD E  DPKSKDKGSFS+GR EYRR ENGPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDRQELLRTKTGRPEFKIGRGRSGKD-ERADPKSKDKGSFSSGRPEYRRVENGPTRSRPYERFTP

Query:  TTIPISKILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKSSRTPPRRTDRP
        TTIPIS+ILTNIE+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERK SRTPPRRTDRP
Subjt:  TTIPISKILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKSSRTPPRRTDRP

Query:  AVINTIFGGPSGGQSGHKRNELARAARREVCIIKEQGPTCPITFDGADLEDVHLPHNDALVIALLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPSGGQSGHKR +LARAARREVCII+EQ PTCPITFD ADL +VHLPHNDALVIA LIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGHKRNELARAARREVCIIKEQGPTCPITFDGADLEDVHLPHNDALVIALLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKG
        K+SPTPLVGFSGESV+PEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYA+ LKG
Subjt:  KRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKG

Query:  SSVCAFETL--KDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQL
        +SVCA ETL  +DGTLEFEADLP +EFAAP EELELVPLLS EKQ+
Subjt:  SSVCAFETL--KDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]3.2e-21161.24Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAALEGQGHDGLATEPLRRSARITRRSATGGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKRQSPSR
        MVQPANSTNT DRR LAA+  HQREVGA  +EGQGH+ L TEPL RSARIT       P    +P                                   
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAALEGQGHDGLATEPLRRSARITRRSATGGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKRQSPSR

Query:  SHRSSNQQAESSHNP--AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGE------------------------------------------
               +AESS+NP   G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE                                          
Subjt:  SHRSSNQQAESSHNP--AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGE------------------------------------------

Query:  ------------------------SRLLAARDCGT----------------GDCQPATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTG
                                 R L AR   T                 D +  THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTG
Subjt:  ------------------------SRLLAARDCGT----------------GDCQPATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTG

Query:  LADEALTVKLGEEAPATFAEVLQKAKKVIDRQELLRTKTGRPEFKIGRGRSGKDE-RADPKSKDKG-SFSSGRPEYRRVENGPTRSRPYERFTPTTIPIS
        LADE LTVKL EEAPATFAEVLQK KKVID QELLRTKTGRPE  I +GR+GKD+ +AD KS+DKG S SS R +YRR  +   +SRPYE +TPTTIPI 
Subjt:  LADEALTVKLGEEAPATFAEVLQKAKKVIDRQELLRTKTGRPEFKIGRGRSGKDE-RADPKSKDKG-SFSSGRPEYRRVENGPTRSRPYERFTPTTIPIS

Query:  KILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKSSRTPPRRTDRPAVINTI
        +ILTNIE++GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERK  RTPPRR DRPAVIN  
Subjt:  KILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKSSRTPPRRTDRPAVINTI

Query:  FGGPSGGQSGHKRNELARAARREVCIIKEQGPTCPITFDGADLEDVHLPHNDALVIALLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTP
                   K+ ELAR ARREVCII+EQ PT  I F+ ADLE VHLPHNDALVIA LID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SPTP
Subjt:  FGGPSGGQSGHKRNELARAARREVCIIKEQGPTCPITFDGADLEDVHLPHNDALVIALLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTP

Query:  LVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAF
        LVGFSGES+  EGCIDLPV++ QD T+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYA+  K SSVCA 
Subjt:  LVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAF

Query:  E--TLKD
        E  T++D
Subjt:  E--TLKD

XP_022157676.1 uncharacterized protein LOC111024332 [Momordica charantia]1.6e-20286.67Show/hide
Query:  LADEALTVKLGEEAPATFAEVLQKAKKVIDRQELLRTKTGRPEFKIGRGRSGKDERADPKSKDKGSFSSGRPEYRRVENGPTRSRPYERFTPTTIPISKI
        +ADEALTVKLGEEAPATFAEVLQKAKKVID QELLRTKTGRPE KIGRGRSGKDERADPKSKDKGSFSSGR EYRR ENGPTRSRPYERFTPTTIPIS+I
Subjt:  LADEALTVKLGEEAPATFAEVLQKAKKVIDRQELLRTKTGRPEFKIGRGRSGKDERADPKSKDKGSFSSGRPEYRRVENGPTRSRPYERFTPTTIPISKI

Query:  LTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKSSRTPPRRTDRPAVINTIFG
        LTNIEDSGMEKLLKRPEKLRGAPERRSKDK                                       TSSAEKKEERK SRTPPRRTDRPAVINTIFG
Subjt:  LTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKSSRTPPRRTDRPAVINTIFG

Query:  GPSGGQSGHKRNELARAARREVCIIKEQGPTCPITFDGADLEDVHLPHNDALVIALLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLV
        GPSGGQSGHKR ELAR ARREVCII+EQGPTCPITFDGADLE+VHLPHNDALVIA LIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLV
Subjt:  GPSGGQSGHKRNELARAARREVCIIKEQGPTCPITFDGADLEDVHLPHNDALVIALLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLV

Query:  GFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAFET
        GFSGESVIPEGCIDLPVTLGQDQTRVTQM EFVVVDGRS YNAIFGRPIIHSFR IPSTLHQVLKYSTPNGVGTVRGEQT SRECYAAALKGSSVCA ET
Subjt:  GFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAFET

Query:  LKDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQ
        L+DGTLE EADLPRKEFAAPTEELELVPLLSPEKQ
Subjt:  LKDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQ

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088132.5e-20976.52Show/hide
Query:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGES-----------------------------------------------
        +AESS N   PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGES                                               
Subjt:  QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGES-----------------------------------------------

Query:  -------------------RLLAARDCGT----------------GDCQPATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
                           R L A    T                 D + ATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  -------------------RLLAARDCGT----------------GDCQPATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDRQELLRTKTGRPEFKIGRGRSGKD-ERADPKSKDKGSFSSGRPEYRRVENGPTRSRPYERFTPTTIPISKILTNIE
        TVKLGEEAPATFAEVLQKAKKVID QELLRTKTGRPE KIGRGRSGKD E ADPKSKDKGSFSSGR EYRR ENGPTRSRPYERFTPTTIPIS+ILTNIE
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDRQELLRTKTGRPEFKIGRGRSGKD-ERADPKSKDKGSFSSGRPEYRRVENGPTRSRPYERFTPTTIPISKILTNIE

Query:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKSSRTPPRRTDRPAVINTIFGGPSGG
        +SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFVGKPRTSSAEKKEERK SRTPPRRTDRPAVINTIFGGPSGG
Subjt:  DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKSSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRNELARAARREVCIIKEQGPTCPITFDGADLEDVHLPHNDALVIALLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE
        QSG KR ELARAARREVCII+EQ PTCPITFDGADLE+VHLPHNDALVIA LIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLK+SPTPLVGFSGE
Subjt:  QSGHKRNELARAARREVCIIKEQGPTCPITFDGADLEDVHLPHNDALVIALLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV
        SVIPEG IDLPVTLGQDQT+VTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTRVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188232.8e-24576.64Show/hide
Query:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESRLLA-------------ARDC--------GTGDCQPATHLATI
        SSNQQAESSHNPA   G+ITREEFDQLRG+L+AQVEALKAKCEQK+  LNDGDLGES   +             ++D         G  D Q A+     
Subjt:  SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESRLLA-------------ARDC--------GTGDCQPATHLATI

Query:  RQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDRQELLRTKTGRPEFKIGRGRSGKDERADPKSKD
        R  +          FQE+QLKVA  SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVID QELLRTKTGRPE  I RGRSGKDE+AD KSKD
Subjt:  RQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDRQELLRTKTGRPEFKIGRGRSGKDERADPKSKD

Query:  KGSFSSGRPEYRRVENGPTRSRPYERFTPTTIPISKILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKK
        KGSFSSGR E+RR  NGPTRSRPYERFTPTTIPIS+ILTNIE+SGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKK
Subjt:  KGSFSSGRPEYRRVENGPTRSRPYERFTPTTIPISKILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKK

Query:  FVGKPRTSSAEKKEERKSSRTPPRRTDRPAVINTIFGGPSGGQSGHKRNELARAARREVCIIKEQGPTCPITFDGADLEDVHLPHNDALVIALLIDHVVV
        FVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKR ELARAARREVCII+EQ PTCPITFD ADLE+VHLPHNDALVIA LIDHVVV
Subjt:  FVGKPRTSSAEKKEERKSSRTPPRRTDRPAVINTIFGGPSGGQSGHKRNELARAARREVCIIKEQGPTCPITFDGADLEDVHLPHNDALVIALLIDHVVV

Query:  RRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQV
        RRVLVD G SANI+SL TYLALGWTRSQLK+S TPLVGFS ESVIPEGCIDLPVTLG DQT+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRAIPSTLHQV
Subjt:  RRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQV

Query:  LKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAFETL--KDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE-
        LKYSTPNGVG VRGEQ ASRECYA+ALKGSSVCA ETL  +DGTLEF+A+LPR+EFAAPTEELELVPLL  +      +E +L     +  +D+   +E 
Subjt:  LKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAFETL--KDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE-

Query:  -PDLMEIG
         P+ + +G
Subjt:  -PDLMEIG

A0A6J1DD03 uncharacterized protein LOC1110198992.0e-22290.36Show/hide
Query:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDRQELLRTKTGRPEFKIGRGRSGKD-ERADPKSKDKGSFSSGRPEYRRVENGPTRSRPYERFTP
        MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVID QELLRT       KIG+GRSGKD E  DPKSKDKGSFS+GR EYRR ENGPTRSRPYERFTP
Subjt:  MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDRQELLRTKTGRPEFKIGRGRSGKD-ERADPKSKDKGSFSSGRPEYRRVENGPTRSRPYERFTP

Query:  TTIPISKILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKSSRTPPRRTDRP
        TTIPIS+ILTNIE+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERK SRTPPRRTDRP
Subjt:  TTIPISKILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKSSRTPPRRTDRP

Query:  AVINTIFGGPSGGQSGHKRNELARAARREVCIIKEQGPTCPITFDGADLEDVHLPHNDALVIALLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
        AVINTIFGGPSGGQSGHKR +LARAARREVCII+EQ PTCPITFD ADL +VHLPHNDALVIA LIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Subjt:  AVINTIFGGPSGGQSGHKRNELARAARREVCIIKEQGPTCPITFDGADLEDVHLPHNDALVIALLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL

Query:  KRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKG
        K+SPTPLVGFSGESV+PEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYA+ LKG
Subjt:  KRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKG

Query:  SSVCAFETL--KDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQL
        +SVCA ETL  +DGTLEFEADLP +EFAAP EELELVPLLS EKQ+
Subjt:  SSVCAFETL--KDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQL

A0A6J1DHB3 uncharacterized protein LOC1110204791.6e-21161.24Show/hide
Query:  MVQPANSTNTTDRRTLAASDAHQREVGAAALEGQGHDGLATEPLRRSARITRRSATGGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKRQSPSR
        MVQPANSTNT DRR LAA+  HQREVGA  +EGQGH+ L TEPL RSARIT       P    +P                                   
Subjt:  MVQPANSTNTTDRRTLAASDAHQREVGAAALEGQGHDGLATEPLRRSARITRRSATGGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKRQSPSR

Query:  SHRSSNQQAESSHNP--AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGE------------------------------------------
               +AESS+NP   G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE                                          
Subjt:  SHRSSNQQAESSHNP--AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGE------------------------------------------

Query:  ------------------------SRLLAARDCGT----------------GDCQPATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTG
                                 R L AR   T                 D +  THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTG
Subjt:  ------------------------SRLLAARDCGT----------------GDCQPATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTG

Query:  LADEALTVKLGEEAPATFAEVLQKAKKVIDRQELLRTKTGRPEFKIGRGRSGKDE-RADPKSKDKG-SFSSGRPEYRRVENGPTRSRPYERFTPTTIPIS
        LADE LTVKL EEAPATFAEVLQK KKVID QELLRTKTGRPE  I +GR+GKD+ +AD KS+DKG S SS R +YRR  +   +SRPYE +TPTTIPI 
Subjt:  LADEALTVKLGEEAPATFAEVLQKAKKVIDRQELLRTKTGRPEFKIGRGRSGKDE-RADPKSKDKG-SFSSGRPEYRRVENGPTRSRPYERFTPTTIPIS

Query:  KILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKSSRTPPRRTDRPAVINTI
        +ILTNIE++GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERK  RTPPRR DRPAVIN  
Subjt:  KILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKSSRTPPRRTDRPAVINTI

Query:  FGGPSGGQSGHKRNELARAARREVCIIKEQGPTCPITFDGADLEDVHLPHNDALVIALLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTP
                   K+ ELAR ARREVCII+EQ PT  I F+ ADLE VHLPHNDALVIA LID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SPTP
Subjt:  FGGPSGGQSGHKRNELARAARREVCIIKEQGPTCPITFDGADLEDVHLPHNDALVIALLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTP

Query:  LVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAF
        LVGFSGES+  EGCIDLPV++ QD T+VTQMAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE   SRECYA+  K SSVCA 
Subjt:  LVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAF

Query:  E--TLKD
        E  T++D
Subjt:  E--TLKD

A0A6J1DYW5 uncharacterized protein LOC1110243327.8e-20386.67Show/hide
Query:  LADEALTVKLGEEAPATFAEVLQKAKKVIDRQELLRTKTGRPEFKIGRGRSGKDERADPKSKDKGSFSSGRPEYRRVENGPTRSRPYERFTPTTIPISKI
        +ADEALTVKLGEEAPATFAEVLQKAKKVID QELLRTKTGRPE KIGRGRSGKDERADPKSKDKGSFSSGR EYRR ENGPTRSRPYERFTPTTIPIS+I
Subjt:  LADEALTVKLGEEAPATFAEVLQKAKKVIDRQELLRTKTGRPEFKIGRGRSGKDERADPKSKDKGSFSSGRPEYRRVENGPTRSRPYERFTPTTIPISKI

Query:  LTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKSSRTPPRRTDRPAVINTIFG
        LTNIEDSGMEKLLKRPEKLRGAPERRSKDK                                       TSSAEKKEERK SRTPPRRTDRPAVINTIFG
Subjt:  LTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKSSRTPPRRTDRPAVINTIFG

Query:  GPSGGQSGHKRNELARAARREVCIIKEQGPTCPITFDGADLEDVHLPHNDALVIALLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLV
        GPSGGQSGHKR ELAR ARREVCII+EQGPTCPITFDGADLE+VHLPHNDALVIA LIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLV
Subjt:  GPSGGQSGHKRNELARAARREVCIIKEQGPTCPITFDGADLEDVHLPHNDALVIALLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLV

Query:  GFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAFET
        GFSGESVIPEGCIDLPVTLGQDQTRVTQM EFVVVDGRS YNAIFGRPIIHSFR IPSTLHQVLKYSTPNGVGTVRGEQT SRECYAAALKGSSVCA ET
Subjt:  GFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAFET

Query:  LKDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQ
        L+DGTLE EADLPRKEFAAPTEELELVPLLSPEKQ
Subjt:  LKDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTAGCTGCCAGTGATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGCTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACGGAACCTCTCCGCAGGTCGGCACGGATCACCCGCAGGTCGGCCACGGGTGGCCCTGCCGAGGAAGAACGTCCCGAAGACAACGAGAGTGAGGGGTACA
CTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAACGGCAGTCACCATCCCGCTCCCACAGGAGCTCCAACCAGCAAGCTGAA
TCCTCTCACAATCCTGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGA
TTCACTGAACGACGGCGACTTGGGAGAATCGCGCTTACTGGCAGCGCGTGATTGTGGTACCGGAGACTGCCAGCCAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGG
GTGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAACAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAA
GCCCTCACGGTGAAGCTTGGAGAGGAAGCCCCGGCCACCTTCGCCGAGGTACTCCAGAAGGCGAAGAAAGTCATCGACAGACAGGAGCTCCTCCGAACCAAAACCGGCCG
ACCTGAATTTAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGTGGTCGACCTGAGTATCGAAGGGTGG
AGAACGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTCCAAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAG
CGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCA
AATTGAGGATCTAATTCAAGACGGCTACTTCAAGAAGTTCGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGAGTTCAAGGACGCCACCCCGGC
GCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAATCCGGACATAAAAGAAATGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATC
ATCAAGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGATGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCTCTTGATTGATCATGT
GGTGGTCAGGAGAGTGTTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGC
CGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTA
GTTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGG
CGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCGCCGCACTCAAAGGCTCATCGGTCTGCGCCTTCGAAACTCTCAAGGATGGGACGCTCGAGT
TCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTC
GCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCACGAACTTCAT
TAGGGGTAACTCACCACAAGATCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGTGCATTGTACCGACGTGGCTTTTCCCTGCCTC
TATTGAAATGCCTAACCCCTGAAGAGGGCCTAGTAGAGCATTATGAGCCTACGACGAATGAGGATGGGCTGCTCCTCAACCTCGACTTGTTGAAGAAAGAAGAGCAATGG
CCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGGTCCAAACGCATGTGGGTGCCCTTGACCCGGCCCGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGG
GACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTAGCTGCCAGTGATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGCTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACGGAACCTCTCCGCAGGTCGGCACGGATCACCCGCAGGTCGGCCACGGGTGGCCCTGCCGAGGAAGAACGTCCCGAAGACAACGAGAGTGAGGGGTACA
CTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAACGGCAGTCACCATCCCGCTCCCACAGGAGCTCCAACCAGCAAGCTGAA
TCCTCTCACAATCCTGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGA
TTCACTGAACGACGGCGACTTGGGAGAATCGCGCTTACTGGCAGCGCGTGATTGTGGTACCGGAGACTGCCAGCCAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGG
GTGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAACAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAA
GCCCTCACGGTGAAGCTTGGAGAGGAAGCCCCGGCCACCTTCGCCGAGGTACTCCAGAAGGCGAAGAAAGTCATCGACAGACAGGAGCTCCTCCGAACCAAAACCGGCCG
ACCTGAATTTAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGTGGTCGACCTGAGTATCGAAGGGTGG
AGAACGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTCCAAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAG
CGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCA
AATTGAGGATCTAATTCAAGACGGCTACTTCAAGAAGTTCGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGAGTTCAAGGACGCCACCCCGGC
GCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAATCCGGACATAAAAGAAATGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATC
ATCAAGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGATGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCTCTTGATTGATCATGT
GGTGGTCAGGAGAGTGTTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGC
CGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTA
GTTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGG
CGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCGCCGCACTCAAAGGCTCATCGGTCTGCGCCTTCGAAACTCTCAAGGATGGGACGCTCGAGT
TCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTC
GCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCACGAACTTCAT
TAGGGGTAACTCACCACAAGATCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGTGCATTGTACCGACGTGGCTTTTCCCTGCCTC
TATTGAAATGCCTAACCCCTGAAGAGGGCCTAGTAGAGCATTATGAGCCTACGACGAATGAGGATGGGCTGCTCCTCAACCTCGACTTGTTGAAGAAAGAAGAGCAATGG
CCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGGTCCAAACGCATGTGGGTGCCCTTGACCCGGCCCGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGG
GACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTTDRRTLAASDAHQREVGAAALEGQGHDGLATEPLRRSARITRRSATGGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKRQSPSRSHRSSNQQAE
SSHNPAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESRLLAARDCGTGDCQPATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADE
ALTVKLGEEAPATFAEVLQKAKKVIDRQELLRTKTGRPEFKIGRGRSGKDERADPKSKDKGSFSSGRPEYRRVENGPTRSRPYERFTPTTIPISKILTNIEDSGMEKLLK
RPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKSSRTPPRRTDRPAVINTIFGGPSGGQSGHKRNELARAARREVCI
IKEQGPTCPITFDGADLEDVHLPHNDALVIALLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVV
VDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCAFETLKDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDL
ARSVPVEILDNPSILEPDLMEIGAPESSWMDPITNFIRGNSPQDPKERRKLARRAARFVVRDGALYRRGFSLPLLKCLTPEEGLVEHYEPTTNEDGLLLNLDLLKKEEQW
PSYAWRNIRAEWVQTHVGALDPAREGPFEIKGIVRPGTYILADLKGDVLAHPWNAEHLKRYYP