; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g06570 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g06570
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr8:4838305..4843352
RNA-Seq ExpressionMoc08g06570
SyntenyMoc08g06570
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]1.0e-26489.96Show/hide
Query:  QAESSRNPVTPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESSRNP TPA VITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSRNPVTPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSAGLWYRRLPARSISTYSQLRREFLAQFSSRHYDQKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSA LWYRRLPA SISTYSQLRREFLA FSSRHYD+KTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSAGLWYRRLPARSISTYSQLRREFLAQFSSRHYDQKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TLKLGEEAPATFAE-------------------------IGRGRSGKDIEKADPKSKDKESFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        T+KLGEEAPATFAE                         IGRGRSGKDIE ADPKSKDK SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TLKLGEEAPATFAE-------------------------IGRGRSGKDIEKADPKSKDKESFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRIKDKYCRFHLEHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINIIFGGPSGA
        ESGMEKLLKRPEKLRGAPERR KDKYCRFH EHGHNTSD WELK QIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN IFGGPSG 
Subjt:  ESGMEKLLKRPEKLRGAPERRIKDKYCRFHLEHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINIIFGGPSGA

Query:  QSGHKRKELARAARREVCIIREQRPTCSITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRLQLKKSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQRPTC ITFDGADL+EVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTR QLKKSPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQRPTCSITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRLQLKKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTRITQMAEFV
        SVIPEG IDLPVTLGQDQT++TQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTRITQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]2.4e-26178.32Show/hide
Query:  MSSNQQAESSRNPVTPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDF
        MSSNQQAESS NP TP  VITREEFDQLRG+L+AQVEALKAKCEQKEGPLNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDF
Subjt:  MSSNQQAESSRNPVTPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDF

Query:  QAASDAIKCRAFQIALTGSAGLWYRRLPARSISTYSQLRREFLAQFSSRHYDQKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGL
        QAASDAIKCRAFQIALTGSA LW                                                     FQE+QLKVA  SDDSAMCYFLTGL
Subjt:  QAASDAIKCRAFQIALTGSAGLWYRRLPARSISTYSQLRREFLAQFSSRHYDQKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGL

Query:  ADEALTLKLGEEAPATFAE-------------------------IGRGRSGKDIEKADPKSKDKESFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEI
        ADEALT+KLG+EAPATFAE                         I RGRSGKD EKAD KSKDK SFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEI
Subjt:  ADEALTLKLGEEAPATFAE-------------------------IGRGRSGKDIEKADPKSKDKESFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEI

Query:  LTNIEESGMEKLLKRPEKLRGAPERRIKDKYCRFHLEHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINIIFG
        LTNIEESGMEKLLKRPEKLRGAPERR KDKYCRFH EH HNTSD WELK QIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVIN IFG
Subjt:  LTNIEESGMEKLLKRPEKLRGAPERRIKDKYCRFHLEHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINIIFG

Query:  GPSGAQSGHKRKELARAARREVCIIREQRPTCSITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRLQLKKSPTPLV
        GPSG QSGHKRKELARAARREVCIIREQRPTC ITFD ADL+EVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTR QLKKS TPLV
Subjt:  GPSGAQSGHKRKELARAARREVCIIREQRPTCSITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRLQLKKSPTPLV

Query:  GFSGESVIPEGCIDLPVTLGQDQTRITQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALET
        GFS ESVIPEGCIDLPVTLG DQT++TQMAEFVVIDGRSAYNAIFGRPIIHSFR IPSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCALET
Subjt:  GFSGESVIPEGCIDLPVTLGQDQTRITQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALET

Query:  LAGRDETLEFEADLPRREFAAPTEELELVPLL
        L  RD TLEF+A+LPRREFAAPTEELELVPLL
Subjt:  LAGRDETLEFEADLPRREFAAPTEELELVPLL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]4.0e-21688.61Show/hide
Query:  MCYFLTGLADEALTLKLGEEAPATFAE------------------IGRGRSGKDIEKADPKSKDKESFSSGRAEYRRAENGPTRSRPYERFTPTTIPISE
        MCYFLTGLADEALT+KL EEAPATFAE                  IG+GRSGKD+E  DPKSKDK SFS+GRAEYRRAENGPTRSRPYERFTPTTIPISE
Subjt:  MCYFLTGLADEALTLKLGEEAPATFAE------------------IGRGRSGKDIEKADPKSKDKESFSSGRAEYRRAENGPTRSRPYERFTPTTIPISE

Query:  ILTNIEESGMEKLLKRPEKLRGAPERRIKDKYCRFHLEHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINIIF
        ILTNIEESGMEKLLKRPEKLRGAPERR KDKYCRFH EHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN IF
Subjt:  ILTNIEESGMEKLLKRPEKLRGAPERRIKDKYCRFHLEHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINIIF

Query:  GGPSGAQSGHKRKELARAARREVCIIREQRPTCSITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRLQLKKSPTPL
        GGPSG QSGHKRK+LARAARREVCIIREQRPTC ITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTR QLKKSPTPL
Subjt:  GGPSGAQSGHKRKELARAARREVCIIREQRPTCSITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRLQLKKSPTPL

Query:  VGFSGESVIPEGCIDLPVTLGQDQTRITQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALE
        VGFSGESV+PEGCIDLPVTLGQDQTR+TQMAEFVV+DGRSAYNAIFGRPIIHSFR IPSTLHQVLKYSTPNGVGTVRGEQTASRECYAS LKG+SVCALE
Subjt:  VGFSGESVIPEGCIDLPVTLGQDQTRITQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALE

Query:  TLAGRDETLEFEADLPRREFAAPTEELELVPLLSPEKQL
        TL  RD TLEFEADLP REFAAP EELELVPLLS EKQ+
Subjt:  TLAGRDETLEFEADLPRREFAAPTEELELVPLLSPEKQL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]5.7e-24775.58Show/hide
Query:  QAESSRNPVTPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESS NP+TP  VITREEFDQL+ + DAQVEALKA+CE+KE   +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+D
Subjt:  QAESSRNPVTPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSAGLWYRRLPARSISTYSQLRREFLAQFSSRHYDQKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKC AFQIALTGSA LWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE L
Subjt:  AIKCRAFQIALTGSAGLWYRRLPARSISTYSQLRREFLAQFSSRHYDQKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TLKLGEEAPATFAE-------------------------IGRGRSGKDIEKADPKSKDK-ESFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNI
        T+KL EEAPATFAE                         I +GR+GKD  KAD KS+DK  S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNI
Subjt:  TLKLGEEAPATFAE-------------------------IGRGRSGKDIEKADPKSKDK-ESFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNI

Query:  EESGMEKLLKRPEKLRGAPERRIKDKYCRFHLEHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINIIFGGPSG
        EE+GMEKLLKRPEKLRG PE+R  DKYCRFH +HGHNTS+ WELK QIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN        
Subjt:  EESGMEKLLKRPEKLRGAPERRIKDKYCRFHLEHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINIIFGGPSG

Query:  AQSGHKRKELARAARREVCIIREQRPTCSITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRLQLKKSPTPLVGFSG
             K+KELAR ARREVCIIREQRPT SI F+ ADL+ VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTR QLKKSPTPLVGFSG
Subjt:  AQSGHKRKELARAARREVCIIREQRPTCSITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRLQLKKSPTPLVGFSG

Query:  ESVIPEGCIDLPVTLGQDQTRITQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGR
        ES+  EGCIDLPV++ QD T++TQMAEFVVIDGRSAYNAIFGRPIIHSFR +PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVCALE    R
Subjt:  ESVIPEGCIDLPVTLGQDQTRITQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGR

Query:  DE
        DE
Subjt:  DE

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]4.1e-20571.51Show/hide
Query:  MDFQAASDAIKCRAFQIALTGSAGLWYRRLPARSISTYSQLRREFLAQFSSRHYDQKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSA LWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSAGLWYRRLPARSISTYSQLRREFLAQFSSRHYDQKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTLKLGEEAPATFAEI----------------GRGRSGKDIE---------KADPKSKDKESFSS-GRAEYRRAENGPTRSRPYERFTPTTIP
        T LADE LT+KLGEEAP TF E+                  GR  K I+         KAD KS+DK S SS  R EYRR E+GP+RSRPYER+T +TIP
Subjt:  TGLADEALTLKLGEEAPATFAEI----------------GRGRSGKDIE---------KADPKSKDKESFSS-GRAEYRRAENGPTRSRPYERFTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGAPERRIKDKYCRFHLEHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIEESGMEKLLKRPEKLRG  E+R K+KYCRFH +HGHNT+ CWELK QIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGAPERRIKDKYCRFHLEHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN

Query:  IIFGGPSGAQSGHKRKELARAARREVCIIREQRPTCSITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRLQLKKSP
         IFGGP+G QSG+KRKELAR ARREVCIIRE +PTCSITF  ADL+ VHLPHNDALVIA LIDH +VRRVL+DG                          
Subjt:  IIFGGPSGAQSGHKRKELARAARREVCIIREQRPTCSITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRLQLKKSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDQTRITQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVC
                      GCIDLPVT+GQD T++TQMAEFVVIDGRSAYNAIFGRPIIHSFR +PSTLHQVLKYSTPN VG VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDQTRITQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVC

Query:  ALETLAGRDETLEFEADLP---RREFAAPTEELELVPLLSPEKQ
        ALE    R +  E EADLP   +R+F  PTEELELVPLLSPE+Q
Subjt:  ALETLAGRDETLEFEADLP---RREFAAPTEELELVPLLSPEKQ

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088135.0e-26589.96Show/hide
Query:  QAESSRNPVTPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESSRNP TPA VITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSRNPVTPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSAGLWYRRLPARSISTYSQLRREFLAQFSSRHYDQKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSA LWYRRLPA SISTYSQLRREFLA FSSRHYD+KTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSAGLWYRRLPARSISTYSQLRREFLAQFSSRHYDQKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TLKLGEEAPATFAE-------------------------IGRGRSGKDIEKADPKSKDKESFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
        T+KLGEEAPATFAE                         IGRGRSGKDIE ADPKSKDK SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Subjt:  TLKLGEEAPATFAE-------------------------IGRGRSGKDIEKADPKSKDKESFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE

Query:  ESGMEKLLKRPEKLRGAPERRIKDKYCRFHLEHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINIIFGGPSGA
        ESGMEKLLKRPEKLRGAPERR KDKYCRFH EHGHNTSD WELK QIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN IFGGPSG 
Subjt:  ESGMEKLLKRPEKLRGAPERRIKDKYCRFHLEHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINIIFGGPSGA

Query:  QSGHKRKELARAARREVCIIREQRPTCSITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRLQLKKSPTPLVGFSGE
        QSG KRKELARAARREVCIIREQRPTC ITFDGADL+EVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTR QLKKSPTPLVGFSGE
Subjt:  QSGHKRKELARAARREVCIIREQRPTCSITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRLQLKKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTRITQMAEFV
        SVIPEG IDLPVTLGQDQT++TQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTRITQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188231.1e-26178.32Show/hide
Query:  MSSNQQAESSRNPVTPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDF
        MSSNQQAESS NP TP  VITREEFDQLRG+L+AQVEALKAKCEQKEGPLNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDF
Subjt:  MSSNQQAESSRNPVTPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDF

Query:  QAASDAIKCRAFQIALTGSAGLWYRRLPARSISTYSQLRREFLAQFSSRHYDQKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGL
        QAASDAIKCRAFQIALTGSA LW                                                     FQE+QLKVA  SDDSAMCYFLTGL
Subjt:  QAASDAIKCRAFQIALTGSAGLWYRRLPARSISTYSQLRREFLAQFSSRHYDQKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGL

Query:  ADEALTLKLGEEAPATFAE-------------------------IGRGRSGKDIEKADPKSKDKESFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEI
        ADEALT+KLG+EAPATFAE                         I RGRSGKD EKAD KSKDK SFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEI
Subjt:  ADEALTLKLGEEAPATFAE-------------------------IGRGRSGKDIEKADPKSKDKESFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEI

Query:  LTNIEESGMEKLLKRPEKLRGAPERRIKDKYCRFHLEHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINIIFG
        LTNIEESGMEKLLKRPEKLRGAPERR KDKYCRFH EH HNTSD WELK QIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVIN IFG
Subjt:  LTNIEESGMEKLLKRPEKLRGAPERRIKDKYCRFHLEHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINIIFG

Query:  GPSGAQSGHKRKELARAARREVCIIREQRPTCSITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRLQLKKSPTPLV
        GPSG QSGHKRKELARAARREVCIIREQRPTC ITFD ADL+EVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTR QLKKS TPLV
Subjt:  GPSGAQSGHKRKELARAARREVCIIREQRPTCSITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRLQLKKSPTPLV

Query:  GFSGESVIPEGCIDLPVTLGQDQTRITQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALET
        GFS ESVIPEGCIDLPVTLG DQT++TQMAEFVVIDGRSAYNAIFGRPIIHSFR IPSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKGSSVCALET
Subjt:  GFSGESVIPEGCIDLPVTLGQDQTRITQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALET

Query:  LAGRDETLEFEADLPRREFAAPTEELELVPLL
        L  RD TLEF+A+LPRREFAAPTEELELVPLL
Subjt:  LAGRDETLEFEADLPRREFAAPTEELELVPLL

A0A6J1DD03 uncharacterized protein LOC1110198991.9e-21688.61Show/hide
Query:  MCYFLTGLADEALTLKLGEEAPATFAE------------------IGRGRSGKDIEKADPKSKDKESFSSGRAEYRRAENGPTRSRPYERFTPTTIPISE
        MCYFLTGLADEALT+KL EEAPATFAE                  IG+GRSGKD+E  DPKSKDK SFS+GRAEYRRAENGPTRSRPYERFTPTTIPISE
Subjt:  MCYFLTGLADEALTLKLGEEAPATFAE------------------IGRGRSGKDIEKADPKSKDKESFSSGRAEYRRAENGPTRSRPYERFTPTTIPISE

Query:  ILTNIEESGMEKLLKRPEKLRGAPERRIKDKYCRFHLEHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINIIF
        ILTNIEESGMEKLLKRPEKLRGAPERR KDKYCRFH EHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN IF
Subjt:  ILTNIEESGMEKLLKRPEKLRGAPERRIKDKYCRFHLEHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINIIF

Query:  GGPSGAQSGHKRKELARAARREVCIIREQRPTCSITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRLQLKKSPTPL
        GGPSG QSGHKRK+LARAARREVCIIREQRPTC ITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTR QLKKSPTPL
Subjt:  GGPSGAQSGHKRKELARAARREVCIIREQRPTCSITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRLQLKKSPTPL

Query:  VGFSGESVIPEGCIDLPVTLGQDQTRITQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALE
        VGFSGESV+PEGCIDLPVTLGQDQTR+TQMAEFVV+DGRSAYNAIFGRPIIHSFR IPSTLHQVLKYSTPNGVGTVRGEQTASRECYAS LKG+SVCALE
Subjt:  VGFSGESVIPEGCIDLPVTLGQDQTRITQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALE

Query:  TLAGRDETLEFEADLPRREFAAPTEELELVPLLSPEKQL
        TL  RD TLEFEADLP REFAAP EELELVPLLS EKQ+
Subjt:  TLAGRDETLEFEADLPRREFAAPTEELELVPLLSPEKQL

A0A6J1DHB3 uncharacterized protein LOC1110204792.7e-24775.58Show/hide
Query:  QAESSRNPVTPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESS NP+TP  VITREEFDQL+ + DAQVEALKA+CE+KE   +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+D
Subjt:  QAESSRNPVTPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSAGLWYRRLPARSISTYSQLRREFLAQFSSRHYDQKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKC AFQIALTGSA LWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE L
Subjt:  AIKCRAFQIALTGSAGLWYRRLPARSISTYSQLRREFLAQFSSRHYDQKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TLKLGEEAPATFAE-------------------------IGRGRSGKDIEKADPKSKDK-ESFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNI
        T+KL EEAPATFAE                         I +GR+GKD  KAD KS+DK  S SS R +YRR+ +   +SRPYE +TPTTIPI EILTNI
Subjt:  TLKLGEEAPATFAE-------------------------IGRGRSGKDIEKADPKSKDK-ESFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNI

Query:  EESGMEKLLKRPEKLRGAPERRIKDKYCRFHLEHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINIIFGGPSG
        EE+GMEKLLKRPEKLRG PE+R  DKYCRFH +HGHNTS+ WELK QIEDLIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN        
Subjt:  EESGMEKLLKRPEKLRGAPERRIKDKYCRFHLEHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINIIFGGPSG

Query:  AQSGHKRKELARAARREVCIIREQRPTCSITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRLQLKKSPTPLVGFSG
             K+KELAR ARREVCIIREQRPT SI F+ ADL+ VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTR QLKKSPTPLVGFSG
Subjt:  AQSGHKRKELARAARREVCIIREQRPTCSITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRLQLKKSPTPLVGFSG

Query:  ESVIPEGCIDLPVTLGQDQTRITQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGR
        ES+  EGCIDLPV++ QD T++TQMAEFVVIDGRSAYNAIFGRPIIHSFR +PSTLHQVLKYST NGVGTVRGE   SRECYAS  K SSVCALE    R
Subjt:  ESVIPEGCIDLPVTLGQDQTRITQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVCALETLAGR

Query:  DE
        DE
Subjt:  DE

A0A6J1DZB9 uncharacterized protein LOC1110249042.0e-20571.51Show/hide
Query:  MDFQAASDAIKCRAFQIALTGSAGLWYRRLPARSISTYSQLRREFLAQFSSRHYDQKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL
        MDFQAA+DAIKCRAFQIALTGSA LWYRRLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFL
Subjt:  MDFQAASDAIKCRAFQIALTGSAGLWYRRLPARSISTYSQLRREFLAQFSSRHYDQKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL

Query:  TGLADEALTLKLGEEAPATFAEI----------------GRGRSGKDIE---------KADPKSKDKESFSS-GRAEYRRAENGPTRSRPYERFTPTTIP
        T LADE LT+KLGEEAP TF E+                  GR  K I+         KAD KS+DK S SS  R EYRR E+GP+RSRPYER+T +TIP
Subjt:  TGLADEALTLKLGEEAPATFAEI----------------GRGRSGKDIE---------KADPKSKDKESFSS-GRAEYRRAENGPTRSRPYERFTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGAPERRIKDKYCRFHLEHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIEESGMEKLLKRPEKLRG  E+R K+KYCRFH +HGHNT+ CWELK QIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGAPERRIKDKYCRFHLEHGHNTSDCWELKCQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN

Query:  IIFGGPSGAQSGHKRKELARAARREVCIIREQRPTCSITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRLQLKKSP
         IFGGP+G QSG+KRKELAR ARREVCIIRE +PTCSITF  ADL+ VHLPHNDALVIA LIDH +VRRVL+DG                          
Subjt:  IIFGGPSGAQSGHKRKELARAARREVCIIREQRPTCSITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRLQLKKSP

Query:  TPLVGFSGESVIPEGCIDLPVTLGQDQTRITQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVC
                      GCIDLPVT+GQD T++TQMAEFVVIDGRSAYNAIFGRPIIHSFR +PSTLHQVLKYSTPN VG VRGEQ  SRECYASALKGS+VC
Subjt:  TPLVGFSGESVIPEGCIDLPVTLGQDQTRITQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGSSVC

Query:  ALETLAGRDETLEFEADLP---RREFAAPTEELELVPLLSPEKQ
        ALE    R +  E EADLP   +R+F  PTEELELVPLLSPE+Q
Subjt:  ALETLAGRDETLEFEADLP---RREFAAPTEELELVPLLSPEKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTCCAACCAGCAGGCTGAATCCTCTCGCAACCCAGTAACTCCTGCAAGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCTCAGGTGGA
GGCCTTAAAGGCCAAATGTGAGCAAAAGGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCA
AAGCTCCTACCGTGAAACCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGCCGC
GCCTTTCAGATCGCGCTTACTGGCAGCGCGGGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTC
TTCTCGACATTATGACCAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAATTGAAGG
TCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGCTGAAGCTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGATC
GGCCGGGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGAATCCTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTAC
CAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACAAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAACGTCCTGAGAAGC
TTCGGGGAGCCCCGGAGAGGCGCATCAAGGACAAGTATTGCCGCTTCCATTTGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGTGCCAAATTGAGGATCTA
ATTCAAGATGGCTACTTCAAGAAATTTGTGGGCAAGCCCAGGACCAGCTCGGCAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACC
TGCGGTCATCAATATCATTTTCGGAGGGCCAAGCGGGGCTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGAGAGCAAA
GGCCGACCTGCTCAATCACCTTCGACGGTGCAGACTTGAAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTAATTGATCATGTGGTGGTCAGGAGG
GTGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTTGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTT
CTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCGGATCACCCAAATGGCCGAGTTCGTGGTGATTGACGGTAGAT
CGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGTCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTC
CGAGGAGAACAGACCGCTTCAAGGGAGTGCTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGAGACGCTCGAGTTCGAGGC
CGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAATTAGCATCGGCGTACGAGATCGACCTGGCCAGGT
CGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCTAGATCTGATGGAGATCGGCGCTCCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGAGGC
AACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGTACCGACGCGGCTTTTCCCTGCCTCTATTGAG
ATGCCTAACCCCTGAAGAGGGCCTGAGGGTCCAAACCCATATGGGTGCCCTTGATCCGACCTGGGAGGGCCCGTTTGAGGTCAAGGGCATAGTCCGATCTGGGACGTACG
CGATCACATTCCATGATTCCGAGTTCGACCAGAAATTAAATGGGGGCCACGGACTCCCACGCGAAGAATTATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCTCCAACCAGCAGGCTGAATCCTCTCGCAACCCAGTAACTCCTGCAAGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCTCAGGTGGA
GGCCTTAAAGGCCAAATGTGAGCAAAAGGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCA
AAGCTCCTACCGTGAAACCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGCCGC
GCCTTTCAGATCGCGCTTACTGGCAGCGCGGGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTC
TTCTCGACATTATGACCAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAATTGAAGG
TCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGCTGAAGCTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGATC
GGCCGGGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGAATCCTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTAC
CAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACAAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAACGTCCTGAGAAGC
TTCGGGGAGCCCCGGAGAGGCGCATCAAGGACAAGTATTGCCGCTTCCATTTGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGTGCCAAATTGAGGATCTA
ATTCAAGATGGCTACTTCAAGAAATTTGTGGGCAAGCCCAGGACCAGCTCGGCAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACC
TGCGGTCATCAATATCATTTTCGGAGGGCCAAGCGGGGCTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGAGAGCAAA
GGCCGACCTGCTCAATCACCTTCGACGGTGCAGACTTGAAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTAATTGATCATGTGGTGGTCAGGAGG
GTGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTTGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTT
CTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCGGATCACCCAAATGGCCGAGTTCGTGGTGATTGACGGTAGAT
CGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGTCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTC
CGAGGAGAACAGACCGCTTCAAGGGAGTGCTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGAGACGCTCGAGTTCGAGGC
CGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAATTAGCATCGGCGTACGAGATCGACCTGGCCAGGT
CGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCTAGATCTGATGGAGATCGGCGCTCCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGAGGC
AACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGCATTGTACCGACGCGGCTTTTCCCTGCCTCTATTGAG
ATGCCTAACCCCTGAAGAGGGCCTGAGGGTCCAAACCCATATGGGTGCCCTTGATCCGACCTGGGAGGGCCCGTTTGAGGTCAAGGGCATAGTCCGATCTGGGACGTACG
CGATCACATTCCATGATTCCGAGTTCGACCAGAAATTAAATGGGGGCCACGGACTCCCACGCGAAGAATTATGA
Protein sequenceShow/hide protein sequence
MSSNQQAESSRNPVTPARVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCR
AFQIALTGSAGLWYRRLPARSISTYSQLRREFLAQFSSRHYDQKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTLKLGEEAPATFAEI
GRGRSGKDIEKADPKSKDKESFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRIKDKYCRFHLEHGHNTSDCWELKCQIEDL
IQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINIIFGGPSGAQSGHKRKELARAARREVCIIREQRPTCSITFDGADLKEVHLPHNDALVIAPLIDHVVVRR
VLVDGGASANILSLPTYLALGWTRLQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRITQMAEFVVIDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTV
RGEQTASRECYASALKGSSVCALETLAGRDETLEFEADLPRREFAAPTEELELVPLLSPEKQLASAYEIDLARSVPVEILDNPSISELDLMEIGAPESSWMDPIADFIRG
NSPQDPKERRKLARRAARFVVRGGALYRRGFSLPLLRCLTPEEGLRVQTHMGALDPTWEGPFEVKGIVRSGTYAITFHDSEFDQKLNGGHGLPREEL