; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g31240 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g31240
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr9:23653471..23659518
RNA-Seq ExpressionMoc09g31240
SyntenyMoc09g31240
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]2.8e-23984.28Show/hide
Query:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEHKEGPLNDDDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCE KEGPLND DLGES FTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEHKEGPLNDDDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPVRSISTYSQLRREFLAQFSSRHFDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLP  SISTYSQLRREFLA FSSRH+DKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPVRSISTYSQLRREFLAQFSSRHFDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTTIPIFEILTNIE
        TVKLGEEAP TFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIE ADPKSKDKGSFSSGRAEYRRAENGPT SRPYERFTPTTIPI EILTNIE
Subjt:  TVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTTIPIFEILTNIE

Query:  ESGMEKLLKHPEKLRGAPERRSKDKYCRFHRKHDHNTSDCW-----------------------TSSVEKKEERKRSRTPPQRTDRPAVINTIFRGPNG-
        ESGMEKLLK PEKLRGAPERRSKDKYCRFHR+H HNTSD W                       TSS EKKEERKRSRTPP+RTDRPAVINTIF GP+G 
Subjt:  ESGMEKLLKHPEKLRGAPERRSKDKYCRFHRKHDHNTSDCW-----------------------TSSVEKKEERKRSRTPPQRTDRPAVINTIFRGPNG-

Query:  ----------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
                                          DLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  ----------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV
        SVIPEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]5.8e-23773.85Show/hide
Query:  SSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEHKEGPLNDDDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ
        SSNQQAESS NPATP GVITREEFDQLRG+L+AQVEALKAKCE KEGPLND DLGES FTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEHKEGPLNDDDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPVRSISTYSQLRREFLAQFSSRHFDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPVRSISTYSQLRREFLAQFSSRHFDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTTIPIFEIL
        DEALTVKLG+EAP TFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKD EKAD KSKDKGSFSSGRAE+RRA NGPT SRPYERFTPTTIPI EIL
Subjt:  DEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTTIPIFEIL

Query:  TNIEESGMEKLLKHPEKLRGAPERRSKDKYCRFHRKHDHNTSDCW-----------------------TSSVEKKEERKRSRTPPQRTDRPAVINTIFRG
        TNIEESGMEKLLK PEKLRGAPERR+KDKYCRFHR+HDHNTSD W                       TSS EKKEERK SRTP +R DRPAVINTIF G
Subjt:  TNIEESGMEKLLKHPEKLRGAPERRSKDKYCRFHRKHDHNTSDCW-----------------------TSSVEKKEERKRSRTPPQRTDRPAVINTIFRG

Query:  PNG-----------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG
        P+G                                   DLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  PNG-----------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG

Query:  FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTIRGEQTASRECFASALKGSSVCALETL
        FS ESVIPEGCIDLPVTLG DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG +RGEQ ASREC+ASALKGSSVCALETL
Subjt:  FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTIRGEQTASRECFASALKGSSVCALETL

Query:  ASRDGTLEFKADLLRREFAAPTEELELVPLL
         SRDGTLEFKA+L RREFAAPTEELELVPLL
Subjt:  ASRDGTLEFKADLLRREFAAPTEELELVPLL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]3.3e-23274.15Show/hide
Query:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEHKEGPLNDDDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESS NP TP GVITREEFDQL+ + DAQVEALKA+CE KE   +D DLGE SF+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+D
Subjt:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEHKEGPLNDDDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPVRSISTYSQLRREFLAQFSSRHFDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKC AFQIALTGSARLWYRRLP R ISTYSQLR+EF++QFSSRH+D+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE L
Subjt:  AIKCRAFQIALTGSARLWYRRLPVRSISTYSQLRREFLAQFSSRHFDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAENGPTSSRPYERFTPTTIPIFEILTNI
        TVKL EEAP TFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD  KAD KS+DKG S SS R +YRR+ +    SRPYE +TPTTIPIFEILTNI
Subjt:  TVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAENGPTSSRPYERFTPTTIPIFEILTNI

Query:  EESGMEKLLKHPEKLRGAPERRSKDKYCRFHRKHDHNTSDCW-----------------------TSSVEKKEERKRSRTPPQRTDRPAVIN--------
        EE+GMEKLLK PEKLRG PE+R+ DKYCRFHR H HNTS+ W                       ++SVEKKEERKR RTPP+R DRPAVIN        
Subjt:  EESGMEKLLKHPEKLRGAPERRSKDKYCRFHRKHDHNTSDCW-----------------------TSSVEKKEERKRSRTPPQRTDRPAVIN--------

Query:  -----TIFRGP---------NGDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPV
              I R           + DLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGES+  EGCIDLPV
Subjt:  -----TIFRGP---------NGDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPV

Query:  TLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTIRGEQTASRECFASALKGSSVCALETLASRD
        ++ QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGT+RGE   SREC+AS  K SSVCALE    RD
Subjt:  TLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTIRGEQTASRECFASALKGSSVCALETLASRD

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]3.4e-18460.19Show/hide
Query:  EQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEH
        E+    + P   E   ++E   ++ +  DLR+HL  K+  +  + +   S SR   +SN +A+S   P  P  VI REEFD ++ + D QVEALKA+CE 
Subjt:  EQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEH

Query:  KEGPLNDDDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPVRSISTYSQLRREFLAQ
        KE P +DDDLGES FTSD++EAPIPPKFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+DAIKC AFQIALTGSARLW RRLP RSISTYSQLR+EF+ Q
Subjt:  KEGPLNDDDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPVRSISTYSQLRREFLAQ

Query:  FSSRHFDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FS RH+D+KTATHLATIRQKE                                   DE LTVKLGEEAP TFAEVLQ AKKVIDGQELLRTKT RPE++I
Subjt:  FSSRHFDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRSGKDIEKADPKSKDKGSFSSG-RAEYRRAENGPTSSRPYERFTPTTIPIFEILTNIEESGMEKLLKHPEKLRGAPERRSKDKYCRFHRKHDHNTSD
         + R  +   K D KSKDKGS SSG R EYRR+E+GP+ SRPYER        +E+   IE+   +   K   K  G P                     
Subjt:  GRGRSGKDIEKADPKSKDKGSFSSG-RAEYRRAENGPTSSRPYERFTPTTIPIFEILTNIEESGMEKLLKHPEKLRGAPERRSKDKYCRFHRKHDHNTSD

Query:  CWTSSVEKKEERKRSRTPPQRTDRPAVINTIFRGPNG-----------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVL
          ++SVEKKEERKRSRTPP+R DRPAVINTIF GP+G                                   DLE VHLPHNDALVIAPLIDHV+VRRVL
Subjt:  CWTSSVEKKEERKRSRTPPQRTDRPAVINTIFRGPNG-----------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVL

Query:  VDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYS
        VDGGASANILSLPTYLAL  TRSQLKKSPTPLVGFS ESV PEGCIDLPVT+GQD TQVTQMAEFVVIDGR AYNAIF RPIIHSF+A+PS LHQVLKYS
Subjt:  VDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYS

Query:  TPNGVGTIRGEQTASRECFASALKGSSVCALETLASRD
        TPNGVGT+RGEQ  SREC+ASALK SSVCALE   S+D
Subjt:  TPNGVGTIRGEQTASRECFASALKGSSVCALETLASRD

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia]6.1e-18688.11Show/hide
Query:  GVITREEFDQLRGQLDAQVEALKAKCEHKEGPLNDDDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
        G+ITREEFDQLRG+LDAQVEALKAKCE K+  LND DLGES FTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
Subjt:  GVITREEFDQLRGQLDAQVEALKAKCEHKEGPLNDDDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT

Query:  GSARLWYRRLPVRSISTYSQLRREFLAQFSSRHFDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRH+DKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAP TF
Subjt:  GSARLWYRRLPVRSISTYSQLRREFLAQFSSRHFDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTF

Query:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTTIPIFEILTNIEESGMEKLLKHPE
        AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD+E+ADPKSKDKGSFSSGRAEYRRAENGPT SRPYERFTPTTIPIFEILTNIEESGMEKLLK PE
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTTIPIFEILTNIEESGMEKLLKHPE

Query:  KLRGAPERRSKDKYCRFHRKHDHNTSDCW-----------------------TSSVEKKEERKRSRTPPQRTDRPAVINTIFRGPNG
        KLRGAPERRSKDKYCRFHR+H HNTSD W                       TSS EKKEERKRSRTPP+RTDRPAVINTIF GP+G
Subjt:  KLRGAPERRSKDKYCRFHRKHDHNTSDCW-----------------------TSSVEKKEERKRSRTPPQRTDRPAVINTIFRGPNG

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.4e-23984.28Show/hide
Query:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEHKEGPLNDDDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCE KEGPLND DLGES FTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEHKEGPLNDDDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPVRSISTYSQLRREFLAQFSSRHFDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+IALTGSARLWYRRLP  SISTYSQLRREFLA FSSRH+DKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIALTGSARLWYRRLPVRSISTYSQLRREFLAQFSSRHFDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTTIPIFEILTNIE
        TVKLGEEAP TFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIE ADPKSKDKGSFSSGRAEYRRAENGPT SRPYERFTPTTIPI EILTNIE
Subjt:  TVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTTIPIFEILTNIE

Query:  ESGMEKLLKHPEKLRGAPERRSKDKYCRFHRKHDHNTSDCW-----------------------TSSVEKKEERKRSRTPPQRTDRPAVINTIFRGPNG-
        ESGMEKLLK PEKLRGAPERRSKDKYCRFHR+H HNTSD W                       TSS EKKEERKRSRTPP+RTDRPAVINTIF GP+G 
Subjt:  ESGMEKLLKHPEKLRGAPERRSKDKYCRFHRKHDHNTSDCW-----------------------TSSVEKKEERKRSRTPPQRTDRPAVINTIFRGPNG-

Query:  ----------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
                                          DLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGE
Subjt:  ----------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGE

Query:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV
        SVIPEG IDLPVTLGQDQTQVTQMAEFV
Subjt:  SVIPEGCIDLPVTLGQDQTQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188232.8e-23773.85Show/hide
Query:  SSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEHKEGPLNDDDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ
        SSNQQAESS NPATP GVITREEFDQLRG+L+AQVEALKAKCE KEGPLND DLGES FTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQ
Subjt:  SSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEHKEGPLNDDDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AASDAIKCRAFQIALTGSARLWYRRLPVRSISTYSQLRREFLAQFSSRHFDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA
        AASDAIKCRAFQIALTGSARLW                                                     FQE+QLKVA  SDDSAMCYFLTGLA
Subjt:  AASDAIKCRAFQIALTGSARLWYRRLPVRSISTYSQLRREFLAQFSSRHFDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTTIPIFEIL
        DEALTVKLG+EAP TFAEVLQKAKKVIDGQELLRTKTGRPER I RGRSGKD EKAD KSKDKGSFSSGRAE+RRA NGPT SRPYERFTPTTIPI EIL
Subjt:  DEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTTIPIFEIL

Query:  TNIEESGMEKLLKHPEKLRGAPERRSKDKYCRFHRKHDHNTSDCW-----------------------TSSVEKKEERKRSRTPPQRTDRPAVINTIFRG
        TNIEESGMEKLLK PEKLRGAPERR+KDKYCRFHR+HDHNTSD W                       TSS EKKEERK SRTP +R DRPAVINTIF G
Subjt:  TNIEESGMEKLLKHPEKLRGAPERRSKDKYCRFHRKHDHNTSDCW-----------------------TSSVEKKEERKRSRTPPQRTDRPAVINTIFRG

Query:  PNG-----------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG
        P+G                                   DLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  PNG-----------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG

Query:  FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTIRGEQTASRECFASALKGSSVCALETL
        FS ESVIPEGCIDLPVTLG DQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG +RGEQ ASREC+ASALKGSSVCALETL
Subjt:  FSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTIRGEQTASRECFASALKGSSVCALETL

Query:  ASRDGTLEFKADLLRREFAAPTEELELVPLL
         SRDGTLEFKA+L RREFAAPTEELELVPLL
Subjt:  ASRDGTLEFKADLLRREFAAPTEELELVPLL

A0A6J1DHB3 uncharacterized protein LOC1110204791.6e-23274.15Show/hide
Query:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEHKEGPLNDDDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        +AESS NP TP GVITREEFDQL+ + DAQVEALKA+CE KE   +D DLGE SF+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+D
Subjt:  QAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEHKEGPLNDDDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIALTGSARLWYRRLPVRSISTYSQLRREFLAQFSSRHFDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL
        AIKC AFQIALTGSARLWYRRLP R ISTYSQLR+EF++QFSSRH+D+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE L
Subjt:  AIKCRAFQIALTGSARLWYRRLPVRSISTYSQLRREFLAQFSSRHFDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAENGPTSSRPYERFTPTTIPIFEILTNI
        TVKL EEAP TFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD  KAD KS+DKG S SS R +YRR+ +    SRPYE +TPTTIPIFEILTNI
Subjt:  TVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAENGPTSSRPYERFTPTTIPIFEILTNI

Query:  EESGMEKLLKHPEKLRGAPERRSKDKYCRFHRKHDHNTSDCW-----------------------TSSVEKKEERKRSRTPPQRTDRPAVIN--------
        EE+GMEKLLK PEKLRG PE+R+ DKYCRFHR H HNTS+ W                       ++SVEKKEERKR RTPP+R DRPAVIN        
Subjt:  EESGMEKLLKHPEKLRGAPERRSKDKYCRFHRKHDHNTSDCW-----------------------TSSVEKKEERKRSRTPPQRTDRPAVIN--------

Query:  -----TIFRGP---------NGDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPV
              I R           + DLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGES+  EGCIDLPV
Subjt:  -----TIFRGP---------NGDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPV

Query:  TLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTIRGEQTASRECFASALKGSSVCALETLASRD
        ++ QD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGT+RGE   SREC+AS  K SSVCALE    RD
Subjt:  TLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTIRGEQTASRECFASALKGSSVCALETLASRD

A0A6J1DPC9 uncharacterized protein LOC1110222801.6e-18460.19Show/hide
Query:  EQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEH
        E+    + P   E   ++E   ++ +  DLR+HL  K+  +  + +   S SR   +SN +A+S   P  P  VI REEFD ++ + D QVEALKA+CE 
Subjt:  EQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQ---SPSRSHRSSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEH

Query:  KEGPLNDDDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPVRSISTYSQLRREFLAQ
        KE P +DDDLGES FTSD++EAPIPPKFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+DAIKC AFQIALTGSARLW RRLP RSISTYSQLR+EF+ Q
Subjt:  KEGPLNDDDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPVRSISTYSQLRREFLAQ

Query:  FSSRHFDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FS RH+D+KTATHLATIRQKE                                   DE LTVKLGEEAP TFAEVLQ AKKVIDGQELLRTKT RPE++I
Subjt:  FSSRHFDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GRGRSGKDIEKADPKSKDKGSFSSG-RAEYRRAENGPTSSRPYERFTPTTIPIFEILTNIEESGMEKLLKHPEKLRGAPERRSKDKYCRFHRKHDHNTSD
         + R  +   K D KSKDKGS SSG R EYRR+E+GP+ SRPYER        +E+   IE+   +   K   K  G P                     
Subjt:  GRGRSGKDIEKADPKSKDKGSFSSG-RAEYRRAENGPTSSRPYERFTPTTIPIFEILTNIEESGMEKLLKHPEKLRGAPERRSKDKYCRFHRKHDHNTSD

Query:  CWTSSVEKKEERKRSRTPPQRTDRPAVINTIFRGPNG-----------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVL
          ++SVEKKEERKRSRTPP+R DRPAVINTIF GP+G                                   DLE VHLPHNDALVIAPLIDHV+VRRVL
Subjt:  CWTSSVEKKEERKRSRTPPQRTDRPAVINTIFRGPNG-----------------------------------DLEEVHLPHNDALVIAPLIDHVVVRRVL

Query:  VDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYS
        VDGGASANILSLPTYLAL  TRSQLKKSPTPLVGFS ESV PEGCIDLPVT+GQD TQVTQMAEFVVIDGR AYNAIF RPIIHSF+A+PS LHQVLKYS
Subjt:  VDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYS

Query:  TPNGVGTIRGEQTASRECFASALKGSSVCALETLASRD
        TPNGVGT+RGEQ  SREC+ASALK SSVCALE   S+D
Subjt:  TPNGVGTIRGEQTASRECFASALKGSSVCALETLASRD

A0A6J1DS95 uncharacterized protein LOC1110234213.0e-18688.11Show/hide
Query:  GVITREEFDQLRGQLDAQVEALKAKCEHKEGPLNDDDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
        G+ITREEFDQLRG+LDAQVEALKAKCE K+  LND DLGES FTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT
Subjt:  GVITREEFDQLRGQLDAQVEALKAKCEHKEGPLNDDDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALT

Query:  GSARLWYRRLPVRSISTYSQLRREFLAQFSSRHFDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTF
        GSARLWYRRLP RSISTYSQLRREFLAQFSSRH+DKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAP TF
Subjt:  GSARLWYRRLPVRSISTYSQLRREFLAQFSSRHFDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTF

Query:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTTIPIFEILTNIEESGMEKLLKHPE
        AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD+E+ADPKSKDKGSFSSGRAEYRRAENGPT SRPYERFTPTTIPIFEILTNIEESGMEKLLK PE
Subjt:  AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTSSRPYERFTPTTIPIFEILTNIEESGMEKLLKHPE

Query:  KLRGAPERRSKDKYCRFHRKHDHNTSDCW-----------------------TSSVEKKEERKRSRTPPQRTDRPAVINTIFRGPNG
        KLRGAPERRSKDKYCRFHR+H HNTSD W                       TSS EKKEERKRSRTPP+RTDRPAVINTIF GP+G
Subjt:  KLRGAPERRSKDKYCRFHRKHDHNTSDCW-----------------------TSSVEKKEERKRSRTPPQRTDRPAVINTIFRGPNG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAAGTATGAGGGCCGAGGTGAACCTGGCCGAGGTCCGATCTATCGGGAAGCTCGGTGAGGGCCGATGTGAGCTACTGTCCGCCAAGTATTCAGATCGGTCCGGAGG
CCGAGTTCGAGCTGCAATCAGAAATACACTGTTGACATCCAAGGCCACCCGTGGCCGAGGTGGAACCTCTAAGAAGGGCGCCGGGGGTCCAGCCCCGGCCCCGACAAGTG
AGAACTTGAACGCACTCCAGAGAGAAATGGAGGCAATGCGCACGCAAATGCGGTCCATGGAGGAAATGTATAACGAAATGATATTAGCTGCAGGCGCAGGGTCCCGATCT
GAGAACCGAGTGACGCGCGTTGGCATACGCGAGCAAAGGGGTTCCCACCTCGGCCCGGTCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAG
AGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCGCA
ACCCAGCAACTCCTGCAGGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCATAAAGAAGGT
CCACTAAACGATGACGACTTGGGAGAATCGTCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAA
GGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGAT
TGTGGTATCGGAGACTGCCAGTCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTTTGACAAAAAGACAGCGACCCAT
CTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTA
TTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAGCTTGGAGAGGAGGCCCCGACCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGG
AGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCC
AGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGTAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTTCGAGATCCTAACGAACATCGA
GGAGTCTGGAATGGAAAAACTACTCAAACATCCTGAGAAGCTTCGGGGAGCTCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGAAGCACGACCATAACA
CGTCGGACTGCTGGACCAGCTCGGTAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCAGCGCACTGACCGACCTGCGGTCATCAATACTATTTTCAGAGGG
CCAAACGGGGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCATC
TGCTAACATCCTGTCTTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAG
AGGGTTGCATCGACTTGCCGGTCACACTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGG
AGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGATCCGAGGAGAACAGACCGCTTCGAG
GGAGTGCTTTGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCAGTAGGGATGGGACGCTCGAGTTCAAGGCCGACCTGCTGAGGAGGGAGTTTG
CCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAAGCCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCTGAG
CCAGATCTGATGGAGATCGACGCTCCAGAGTTCTCATGGATGGACCCGATTGTGGACTTCATTAGGGGCAATTCACCACAAGACCCCAAGGAGCGTAGAAGGTTGGCAAG
GCAAGCAGCTCGGTTCGTGGTCCGAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTAAGTATGAGGGCCGAGGTGAACCTGGCCGAGGTCCGATCTATCGGGAAGCTCGGTGAGGGCCGATGTGAGCTACTGTCCGCCAAGTATTCAGATCGGTCCGGAGG
CCGAGTTCGAGCTGCAATCAGAAATACACTGTTGACATCCAAGGCCACCCGTGGCCGAGGTGGAACCTCTAAGAAGGGCGCCGGGGGTCCAGCCCCGGCCCCGACAAGTG
AGAACTTGAACGCACTCCAGAGAGAAATGGAGGCAATGCGCACGCAAATGCGGTCCATGGAGGAAATGTATAACGAAATGATATTAGCTGCAGGCGCAGGGTCCCGATCT
GAGAACCGAGTGACGCGCGTTGGCATACGCGAGCAAAGGGGTTCCCACCTCGGCCCGGTCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAG
AGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCGCA
ACCCAGCAACTCCTGCAGGAGTGATTACAAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCATAAAGAAGGT
CCACTAAACGATGACGACTTGGGAGAATCGTCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAA
GGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGAT
TGTGGTATCGGAGACTGCCAGTCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTTTGACAAAAAGACAGCGACCCAT
CTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTA
TTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAGCTTGGAGAGGAGGCCCCGACCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGG
AGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCC
AGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGTAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTTCGAGATCCTAACGAACATCGA
GGAGTCTGGAATGGAAAAACTACTCAAACATCCTGAGAAGCTTCGGGGAGCTCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGAAGCACGACCATAACA
CGTCGGACTGCTGGACCAGCTCGGTAGAGAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCAGCGCACTGACCGACCTGCGGTCATCAATACTATTTTCAGAGGG
CCAAACGGGGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCATC
TGCTAACATCCTGTCTTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAG
AGGGTTGCATCGACTTGCCGGTCACACTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGG
AGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGATCCGAGGAGAACAGACCGCTTCGAG
GGAGTGCTTTGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCAGTAGGGATGGGACGCTCGAGTTCAAGGCCGACCTGCTGAGGAGGGAGTTTG
CCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAAGCCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCTGAG
CCAGATCTGATGGAGATCGACGCTCCAGAGTTCTCATGGATGGACCCGATTGTGGACTTCATTAGGGGCAATTCACCACAAGACCCCAAGGAGCGTAGAAGGTTGGCAAG
GCAAGCAGCTCGGTTCGTGGTCCGAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MLSMRAEVNLAEVRSIGKLGEGRCELLSAKYSDRSGGRVRAAIRNTLLTSKATRGRGGTSKKGAGGPAPAPTSENLNALQREMEAMRTQMRSMEEMYNEMILAAGAGSRS
ENRVTRVGIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEHKEG
PLNDDDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPVRSISTYSQLRREFLAQFSSRHFDKKTATH
LATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFS
SGRAEYRRAENGPTSSRPYERFTPTTIPIFEILTNIEESGMEKLLKHPEKLRGAPERRSKDKYCRFHRKHDHNTSDCWTSSVEKKEERKRSRTPPQRTDRPAVINTIFRG
PNGDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFG
RPIIHSFRAIPSTLHQVLKYSTPNGVGTIRGEQTASRECFASALKGSSVCALETLASRDGTLEFKADLLRREFAAPTEELELVPLLSPEKQADLARSVPVEILDNPSISE
PDLMEIDAPEFSWMDPIVDFIRGNSPQDPKERRRLARQAARFVVRGDVLAHPWNAEHLKRYYP