; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g11330 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g11330
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:8757213..8762816
RNA-Seq ExpressionMoc07g11330
SyntenyMoc07g11330
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]6.7e-25089.92Show/hide
Query:  QAESSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEFFEGLMDFQAASD
        +AESSRNP TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVE FE LMDFQAASD
Subjt:  QAESSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEFFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYQRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWY+RLPA SISTYSQLRREFLA F SRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYQRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIDRGRSGKDIEKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPTTIPISEILTNVE
        TVKLGEEAPATFAEVLQKAKKVIDG ELLRTKTGRPERKI RGRSGKDIE  DPKSKDKGSFSSGRAEYRRAENGPTRSRPYE FTPTTIPISEILTN+E
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIDRGRSGKDIEKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPTTIPISEILTNVE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT---------------------------TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT                           TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT---------------------------TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREPRPTCPITFGSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSGE
        QSG KRKELARAARREVCIIRE RPTCPITF   +LEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKS TPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREPRPTCPITFGSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSGE

Query:  SVIPEG
        SVIPEG
Subjt:  SVIPEG

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia]1.1e-19686.02Show/hide
Query:  KEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEFFEGLMDFQAASDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRREFLAQ
        K+  LNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVE FEGLMDF AASDAIKCRAFQIALTGSARLWY+RLPARSISTYSQLRREFLAQ
Subjt:  KEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEFFEGLMDFQAASDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRREFLAQ

Query:  FFSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKI
        F SR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAKKVIDG ELLRTKTGRP+RKI
Subjt:  FFSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKI

Query:  DRGRSGKDIEKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPTTIPISEILTNVEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT---
         RGRSGKD+E+ DPKSKDKGSFSSGRAEYRRAE+GPT+SRPYE FTPTTIPISEILTN+EESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT   
Subjt:  DRGRSGKDIEKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPTTIPISEILTNVEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT---

Query:  ------------------------TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREPRPTCPITFGSTNLEEVH
                                TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIRE  PTCPITF   + EEVH
Subjt:  ------------------------TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREPRPTCPITFGSTNLEEVH

Query:  LPHNDALVIAPLIDHVVVRRVL
        LPHNDA VIAPLIDHVVVRRVL
Subjt:  LPHNDALVIAPLIDHVVVRRVL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]1.7e-21669.57Show/hide
Query:  SSNQQAESSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEFFEGLMDFQ
        SSNQQAESS NP TP GVITREEFDQLRG+L+AQVEALKAKCEQKEGPLNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVE FEGLMDFQ
Subjt:  SSNQQAESSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEFFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIDRGRSGKDIEKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPTTIPISEIL
        DEALTVKLG+EAPATFAEVLQKAKKVIDG ELLRTKTGRPER IDRGRSGKD EK D KSKDKGSFSSGRAE+RRA NGPTRSRPYE FTPTTIPISEIL
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIDRGRSGKDIEKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPTTIPISEIL

Query:  TNVEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT---------------------------TSSAEKKEERKRSRTPPRRTDRPAVINTIFGG
        TN+EESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNT                           TSSAEKKEERK SRTP RR DRPAVINTIFGG
Subjt:  TNVEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT---------------------------TSSAEKKEERKRSRTPPRRTDRPAVINTIFGG

Query:  PSGGQSGHKRKELARAARREVCIIREPRPTCPITFGSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVG
        PSGGQSGHKRKELARAARREVCIIRE RPTCPITF S +LEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  PSGGQSGHKRKELARAARREVCIIREPRPTCPITFGSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVG

Query:  FSGESVIPE------------------------------------------------------------------GEQTASRECYASALKGSSVCALETL
        FS ESVIPE                                                                  GEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVIPE------------------------------------------------------------------GEQTASRECYASALKGSSVCALETL

Query:  PSRDGTLEFKADLPRREFAAPTEELELVPLL
         SRDGTLEFKA+LPRREFAAPTEELELVPLL
Subjt:  PSRDGTLEFKADLPRREFAAPTEELELVPLL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]2.3e-21056.01Show/hide
Query:  MVQPANSTNMADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKRGAQGPAPAPPSENFDALQREMEAMR
        MVQPANSTN ADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAHP+ SK                                    
Subjt:  MVQPANSTNMADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKRGAQGPAPAPPSENFDALQREMEAMR

Query:  TQMRSMEEMYNEMILAAGVGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPVT
                                                                                                   AESS NP+T
Subjt:  TQMRSMEEMYNEMILAAGVGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPVT

Query:  PAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEFFEGLMDFQAASDAIKCRAFQIA
        P GVITREEFDQL+ + DAQVEALKA+CE+KE   +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVE FE LMDFQAA+DAIKC AFQIA
Subjt:  PAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEFFEGLMDFQAASDAIKCRAFQIA

Query:  LTGSARLWYQRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA
        LTGSARLWY+RLPAR ISTYSQLR+EF++QF SRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPA
Subjt:  LTGSARLWYQRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA

Query:  TFAEVLQKAKKVIDGHELLRTKTGRPERKIDRGRSGKDIEKTDPKSKDKG-SFSSGRAEYRRAENGPTRSRPYECFTPTTIPISEILTNVEESGMEKLLK
        TFAEVLQK KKVIDG ELLRTKTGRPE+ ID+GR+GKD  K D KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTN+EE+GMEKLLK
Subjt:  TFAEVLQKAKKVIDGHELLRTKTGRPERKIDRGRSGKDIEKTDPKSKDKG-SFSSGRAEYRRAENGPTRSRPYECFTPTTIPISEILTNVEESGMEKLLK

Query:  RPEKLRGAPERRSKDKYCRFHREHGHNT---------------------------TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL
        RPEKLRG PE+R+ DKYCRFHR+HGHNT                           ++S EKKEERKR RTPPRR DRPAVIN             K+KEL
Subjt:  RPEKLRGAPERRSKDKYCRFHREHGHNT---------------------------TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL

Query:  ARAARREVCIIREPRPTCPITFGSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSGESVIPE----
        AR ARREVCIIRE RPT  I F   +LE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKS TPLVGFSGES+  E    
Subjt:  ARAARREVCIIREPRPTCPITFGSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSGESVIPE----

Query:  --------------------------------------------------------------GEQTASRECYASALKGSSVCALETLPSRD
                                                                      GE   SRECYAS  K SSVCALE    RD
Subjt:  --------------------------------------------------------------GEQTASRECYASALKGSSVCALETLPSRD

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia]1.1e-19486.26Show/hide
Query:  SSRNPV-TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEFFEGLMDFQAASDAI
        S+R PV +  G+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVE FEGLMDFQAASDAI
Subjt:  SSRNPV-TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEFFEGLMDFQAASDAI

Query:  KCRAFQIALTGSARLWYQRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTV
        KCRAFQIALTGSARLWY+RLP RSISTYSQLRREFLAQF SRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTV
Subjt:  KCRAFQIALTGSARLWYQRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTV

Query:  KLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIDRGRSGKDIEKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPTTIPISEILTNVEES
        KLGEEAPATFAEVLQKAKKVIDG ELLRTKTGRPERKI RGRSGKD+E+ DPKSKDKGSFSSGRAEYRRAENGPTRSRPYE FTPTTIPI EILTN+EES
Subjt:  KLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIDRGRSGKDIEKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPTTIPISEILTNVEES

Query:  GMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT---------------------------TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQS
        GMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT                           TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQ 
Subjt:  GMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT---------------------------TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQS

Query:  GHKRKELARAARREVCIIREPR
        GHKRKELARAARRE+   +E R
Subjt:  GHKRKELARAARREVCIIREPR

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088133.3e-25089.92Show/hide
Query:  QAESSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEFFEGLMDFQAASD
        +AESSRNP TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVE FE LMDFQAASD
Subjt:  QAESSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEFFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYQRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWY+RLPA SISTYSQLRREFLA F SRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYQRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIDRGRSGKDIEKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPTTIPISEILTNVE
        TVKLGEEAPATFAEVLQKAKKVIDG ELLRTKTGRPERKI RGRSGKDIE  DPKSKDKGSFSSGRAEYRRAENGPTRSRPYE FTPTTIPISEILTN+E
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIDRGRSGKDIEKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPTTIPISEILTNVE

Query:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT---------------------------TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
        ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT                           TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Subjt:  ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT---------------------------TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG

Query:  QSGHKRKELARAARREVCIIREPRPTCPITFGSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSGE
        QSG KRKELARAARREVCIIRE RPTCPITF   +LEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKS TPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREPRPTCPITFGSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSGE

Query:  SVIPEG
        SVIPEG
Subjt:  SVIPEG

A0A6J1D9E1 uncharacterized protein LOC1110188238.1e-21769.57Show/hide
Query:  SSNQQAESSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEFFEGLMDFQ
        SSNQQAESS NP TP GVITREEFDQLRG+L+AQVEALKAKCEQKEGPLNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVE FEGLMDFQ
Subjt:  SSNQQAESSRNPVTPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEFFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIDRGRSGKDIEKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPTTIPISEIL
        DEALTVKLG+EAPATFAEVLQKAKKVIDG ELLRTKTGRPER IDRGRSGKD EK D KSKDKGSFSSGRAE+RRA NGPTRSRPYE FTPTTIPISEIL
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIDRGRSGKDIEKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPTTIPISEIL

Query:  TNVEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT---------------------------TSSAEKKEERKRSRTPPRRTDRPAVINTIFGG
        TN+EESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNT                           TSSAEKKEERK SRTP RR DRPAVINTIFGG
Subjt:  TNVEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT---------------------------TSSAEKKEERKRSRTPPRRTDRPAVINTIFGG

Query:  PSGGQSGHKRKELARAARREVCIIREPRPTCPITFGSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVG
        PSGGQSGHKRKELARAARREVCIIRE RPTCPITF S +LEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  PSGGQSGHKRKELARAARREVCIIREPRPTCPITFGSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVG

Query:  FSGESVIPE------------------------------------------------------------------GEQTASRECYASALKGSSVCALETL
        FS ESVIPE                                                                  GEQ ASRECYASALKGSSVCALETL
Subjt:  FSGESVIPE------------------------------------------------------------------GEQTASRECYASALKGSSVCALETL

Query:  PSRDGTLEFKADLPRREFAAPTEELELVPLL
         SRDGTLEFKA+LPRREFAAPTEELELVPLL
Subjt:  PSRDGTLEFKADLPRREFAAPTEELELVPLL

A0A6J1D9W7 uncharacterized protein LOC1110187085.5e-19786.02Show/hide
Query:  KEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEFFEGLMDFQAASDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRREFLAQ
        K+  LNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVE FEGLMDF AASDAIKCRAFQIALTGSARLWY+RLPARSISTYSQLRREFLAQ
Subjt:  KEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEFFEGLMDFQAASDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRREFLAQ

Query:  FFSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKI
        F SR Y KKT THLATIRQKEG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAKKVIDG ELLRTKTGRP+RKI
Subjt:  FFSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKI

Query:  DRGRSGKDIEKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPTTIPISEILTNVEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT---
         RGRSGKD+E+ DPKSKDKGSFSSGRAEYRRAE+GPT+SRPYE FTPTTIPISEILTN+EESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT   
Subjt:  DRGRSGKDIEKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPTTIPISEILTNVEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT---

Query:  ------------------------TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREPRPTCPITFGSTNLEEVH
                                TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIRE  PTCPITF   + EEVH
Subjt:  ------------------------TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREPRPTCPITFGSTNLEEVH

Query:  LPHNDALVIAPLIDHVVVRRVL
        LPHNDA VIAPLIDHVVVRRVL
Subjt:  LPHNDALVIAPLIDHVVVRRVL

A0A6J1DHB3 uncharacterized protein LOC1110204791.1e-21056.01Show/hide
Query:  MVQPANSTNMADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKRGAQGPAPAPPSENFDALQREMEAMR
        MVQPANSTN ADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAHP+ SK                                    
Subjt:  MVQPANSTNMADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKRGAQGPAPAPPSENFDALQREMEAMR

Query:  TQMRSMEEMYNEMILAAGVGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPVT
                                                                                                   AESS NP+T
Subjt:  TQMRSMEEMYNEMILAAGVGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPVT

Query:  PAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEFFEGLMDFQAASDAIKCRAFQIA
        P GVITREEFDQL+ + DAQVEALKA+CE+KE   +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVE FE LMDFQAA+DAIKC AFQIA
Subjt:  PAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEFFEGLMDFQAASDAIKCRAFQIA

Query:  LTGSARLWYQRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA
        LTGSARLWY+RLPAR ISTYSQLR+EF++QF SRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPA
Subjt:  LTGSARLWYQRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPA

Query:  TFAEVLQKAKKVIDGHELLRTKTGRPERKIDRGRSGKDIEKTDPKSKDKG-SFSSGRAEYRRAENGPTRSRPYECFTPTTIPISEILTNVEESGMEKLLK
        TFAEVLQK KKVIDG ELLRTKTGRPE+ ID+GR+GKD  K D KS+DKG S SS R +YRR+ +   +SRPYE +TPTTIPI EILTN+EE+GMEKLLK
Subjt:  TFAEVLQKAKKVIDGHELLRTKTGRPERKIDRGRSGKDIEKTDPKSKDKG-SFSSGRAEYRRAENGPTRSRPYECFTPTTIPISEILTNVEESGMEKLLK

Query:  RPEKLRGAPERRSKDKYCRFHREHGHNT---------------------------TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL
        RPEKLRG PE+R+ DKYCRFHR+HGHNT                           ++S EKKEERKR RTPPRR DRPAVIN             K+KEL
Subjt:  RPEKLRGAPERRSKDKYCRFHREHGHNT---------------------------TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL

Query:  ARAARREVCIIREPRPTCPITFGSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSGESVIPE----
        AR ARREVCIIRE RPT  I F   +LE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKS TPLVGFSGES+  E    
Subjt:  ARAARREVCIIREPRPTCPITFGSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGFSGESVIPE----

Query:  --------------------------------------------------------------GEQTASRECYASALKGSSVCALETLPSRD
                                                                      GE   SRECYAS  K SSVCALE    RD
Subjt:  --------------------------------------------------------------GEQTASRECYASALKGSSVCALETLPSRD

A0A6J1DS95 uncharacterized protein LOC1110234215.1e-19586.26Show/hide
Query:  SSRNPV-TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEFFEGLMDFQAASDAI
        S+R PV +  G+ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVE FEGLMDFQAASDAI
Subjt:  SSRNPV-TPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEFFEGLMDFQAASDAI

Query:  KCRAFQIALTGSARLWYQRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTV
        KCRAFQIALTGSARLWY+RLP RSISTYSQLRREFLAQF SRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTV
Subjt:  KCRAFQIALTGSARLWYQRLPARSISTYSQLRREFLAQFFSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTV

Query:  KLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIDRGRSGKDIEKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPTTIPISEILTNVEES
        KLGEEAPATFAEVLQKAKKVIDG ELLRTKTGRPERKI RGRSGKD+E+ DPKSKDKGSFSSGRAEYRRAENGPTRSRPYE FTPTTIPI EILTN+EES
Subjt:  KLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIDRGRSGKDIEKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPTTIPISEILTNVEES

Query:  GMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT---------------------------TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQS
        GMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT                           TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQ 
Subjt:  GMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT---------------------------TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQS

Query:  GHKRKELARAARREVCIIREPR
        GHKRKELARAARRE+   +E R
Subjt:  GHKRKELARAARREVCIIREPR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCAAACTCGACCAATATGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACTCGTGGCCGAGGTGGGACCTCCA
AGAGGGGCGCCCAGGGTCCAGCCCCGGCTCCACCAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTAT
AACGAGATGATACTAGCTGCAGGCGTAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTTGAGGAGGAACA
TCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCT
CACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCGCAACCCAGTAACTCCTGCAGGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCTCAG
GTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAA
GTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAAGACCCCAAGGATTATGTTGAGTTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAAT
GCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCAGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAG
TTCTTTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAGTT
GAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCCCTCACCGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCG
AGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACATGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGACCGGGGCAGAAGTGGAAAAGATATAGAA
AAGACAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGGAGGGCGGAGAACGGACCTACCAGAAGCCGACCTTACGAGTGCTTCACCCC
GACCACGATTCCAATTTCCGAGATCCTAACGAACGTCGAGGAGTCTGGAATGGAAAAACTGCTCAAACGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGG
ACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGACCAGCTCGGCAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCT
GCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGCTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCCGAG
GCCGACCTGCCCCATCACCTTCGGCAGTACAAACTTAGAGGAGGTCCACCTGCCCCACAATGACGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGG
TGCTGGTAGATGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCTGACACCGCTGGTTGGGTTC
TCTGGAGAATCGGTCATCCCAGAGGGAGAACAGACCGCTTCAAGGGAGTGCTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCCCCAGTAGGGA
TGGGACGCTCGAGTTCAAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAATTAGCATCGGCGT
ACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAGTCCTCATGGATGGACCCG
ATTGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGTACCGACGCGG
CTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTAGTAGAGCATTACGAGCCTACGGCAAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAG
AAAGAAGAGAAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAACGTCCGCGTTCGACCTCGGACCTTTCAGGTCGGACATCTGGTCTTA
AGGAGGGTCCAAACCCATGTGGGTGCCCTTGATCCGACCTGGGAGGGCCCGTTTGAGATCAAGTGCATAGTCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGA
CGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATATGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACTCGTGGCCGAGGTGGGACCTCCA
AGAGGGGCGCCCAGGGTCCAGCCCCGGCTCCACCAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTAT
AACGAGATGATACTAGCTGCAGGCGTAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTTGAGGAGGAACA
TCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCT
CACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCGCAACCCAGTAACTCCTGCAGGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCTCAG
GTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAA
GTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAAGACCCCAAGGATTATGTTGAGTTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAAT
GCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCAGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAG
TTCTTTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAGTT
GAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCCCTCACCGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCG
AGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACATGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGACCGGGGCAGAAGTGGAAAAGATATAGAA
AAGACAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGGAGGGCGGAGAACGGACCTACCAGAAGCCGACCTTACGAGTGCTTCACCCC
GACCACGATTCCAATTTCCGAGATCCTAACGAACGTCGAGGAGTCTGGAATGGAAAAACTGCTCAAACGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGG
ACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGACCAGCTCGGCAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCT
GCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGCTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCCGAG
GCCGACCTGCCCCATCACCTTCGGCAGTACAAACTTAGAGGAGGTCCACCTGCCCCACAATGACGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGG
TGCTGGTAGATGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCTGACACCGCTGGTTGGGTTC
TCTGGAGAATCGGTCATCCCAGAGGGAGAACAGACCGCTTCAAGGGAGTGCTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCCCCAGTAGGGA
TGGGACGCTCGAGTTCAAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAATTAGCATCGGCGT
ACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAGTCCTCATGGATGGACCCG
ATTGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGTACCGACGCGG
CTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTAGTAGAGCATTACGAGCCTACGGCAAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAG
AAAGAAGAGAAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAACGTCCGCGTTCGACCTCGGACCTTTCAGGTCGGACATCTGGTCTTA
AGGAGGGTCCAAACCCATGTGGGTGCCCTTGATCCGACCTGGGAGGGCCCGTTTGAGATCAAGTGCATAGTCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGA
CGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNMADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKRGAQGPAPAPPSENFDALQREMEAMRTQMRSMEEMY
NEMILAAGVGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPVTPAGVITREEFDQLRGQLDAQ
VEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEFFEGLMDFQAASDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRREFLAQ
FFSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIDRGRSGKDIE
KTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPTTIPISEILTNVEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTTSSAEKKEERKRSRTPPRRTDRP
AVINTIFGGPSGGQSGHKRKELARAARREVCIIREPRPTCPITFGSTNLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSLTPLVGF
SGESVIPEGEQTASRECYASALKGSSVCALETLPSRDGTLEFKADLPRREFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSISEPDLMEIGAPESSWMDP
IADFIRGNSPQDPKERRKLARRAARFVVRGGALYRRGFSLPLLRCLTPEEGLVEHYEPTANEEELLLNLDLLEERREMAQLRLAEYQGRMARHYNVRVRPRTFQVGHLVL
RRVQTHVGALDPTWEGPFEIKCIVRPGTYVLADLKGDVLAHPWNAEHLKRYYP