; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh02G014870 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh02G014870
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionUnknown protein
Genome locationCma_Chr02:8496923..8499613
RNA-Seq ExpressionCmaCh02G014870
SyntenyCmaCh02G014870
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606125.1 hypothetical protein SDJN03_03442, partial [Cucurbita argyrosperma subsp. sororia]1.2e-19386.14Show/hide
Query:  MASLVDSSSWSLVAAKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVVRAFK
        MASLVDSSSWSLVAAKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLL+NDFKRTDSSVTK+LPPPTASTRIALPAVSTLQKASNAVVRAFK
Subjt:  MASLVDSSSWSLVAAKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVVRAFK

Query:  QFPLPRSISRKLILVSFWKKSNVTHSNTKRWKSFREFLDEKEP-PSSSDHDH---ADSTAIAIAGRNSICSCSNNSISWTESEFTSEMIPSSSSGNFDSC
        QFPL RSISRKLIL+SFWKKSN+  SN KRWKSFREFLDEKEP PSSSDHDH   ADS+ IA+A RNSICSCS NSISWTESEFTSEMIPSSSS NFDSC
Subjt:  QFPLPRSISRKLILVSFWKKSNVTHSNTKRWKSFREFLDEKEP-PSSSDHDH---ADSTAIAIAGRNSICSCSNNSISWTESEFTSEMIPSSSSGNFDSC

Query:  SEIDVAKDDKDSPGNLIPKTDGIALGKDSIEETT------AAYPEKAIKQLGNEEEKEQSSPVSVLDFGLKRLEKGAELEPVDLKKRFADIGNGPRELGL
        SEID AKDDKDSPGNLI KTDG+ALGKD IEETT      AAYPEKAIKQLGNEEEKEQ SPVSVLDFGLKRLEKGAELEPVDLKKRFAD       LGL
Subjt:  SEIDVAKDDKDSPGNLIPKTDGIALGKDSIEETT------AAYPEKAIKQLGNEEEKEQSSPVSVLDFGLKRLEKGAELEPVDLKKRFADIGNGPRELGL

Query:  IISTKEGQREQKAFELLKLVK---SSQCFTLKTENLVLDLIHEKLEEDESSGRSGHKRRGCGFEEEKVVKLIEGWMNGEGGEMRVMGWEVVEGRSLYIKD
        IISTKEGQREQKA EL++LVK    SQCFTLKTENLVLDLIHEKLEE+E    S    RGCGFEEEKVVKLIEGW++GEGGEMRVMGWE+ EGRSLYIKD
Subjt:  IISTKEGQREQKAFELLKLVK---SSQCFTLKTENLVLDLIHEKLEEDESSGRSGHKRRGCGFEEEKVVKLIEGWMNGEGGEMRVMGWEVVEGRSLYIKD

Query:  MEKAGKWRSLGGEKEELAAEVETEVWISLLHEL
        MEKAGKWRSLGGEKEELAAEVETE+WI LL +L
Subjt:  MEKAGKWRSLGGEKEELAAEVETEVWISLLHEL

KAG7036067.1 hypothetical protein SDJN02_02867, partial [Cucurbita argyrosperma subsp. argyrosperma]4.6e-19081.09Show/hide
Query:  MASLVDSSSWSLVAAKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVVRAFK
        MASLVDSSSWSLVAAKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLL+NDFKRTDSSVTK+LPPPTASTRIALPAVSTLQKASNAVVRAFK
Subjt:  MASLVDSSSWSLVAAKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVVRAFK

Query:  QFPLPRSISRKLILVSFWKKSNVTHSNTKRWKSFREFLDEKEP-PSSSDHDH---ADSTAIAIAGRNSICSCSNNSISWTESEFTSEMIPSSSSGNFDSC
        QFPL RSISRKLIL+SFWKKSN+  SN KRWKSFREFLDEKEP PSSSDHDH   ADS+ IA+A RNSICSCS NSISWTESEFTSEMIPSSSS NFDSC
Subjt:  QFPLPRSISRKLILVSFWKKSNVTHSNTKRWKSFREFLDEKEP-PSSSDHDH---ADSTAIAIAGRNSICSCSNNSISWTESEFTSEMIPSSSSGNFDSC

Query:  SEIDVAKDDKDSPGNLIPKTDGIALGKDSIEETT------AAYPEKAIK---------------------------QLGNEEEKEQSSPVSVLDFGLKRL
        SEID AKDDKDSPGNLI KTDG+ALGKD IEETT      AAYPEKAIK                           QLGNEEEKEQ SPVSVLDFGLKRL
Subjt:  SEIDVAKDDKDSPGNLIPKTDGIALGKDSIEETT------AAYPEKAIK---------------------------QLGNEEEKEQSSPVSVLDFGLKRL

Query:  EKGAELEPVDLKKRFADIGNGPRELGLIISTKEGQREQKAFELLKLVK---SSQCFTLKTENLVLDLIHEKLEEDESSGRSGHKRRGCGFEEEKVVKLIE
        EKGAELEPVDLKKRFAD+G+       IISTKEGQREQKA EL++LVK    SQCFTLKTENLVLDLIHEKLEE+E    S    RGCGFEEEKVVKLIE
Subjt:  EKGAELEPVDLKKRFADIGNGPRELGLIISTKEGQREQKAFELLKLVK---SSQCFTLKTENLVLDLIHEKLEEDESSGRSGHKRRGCGFEEEKVVKLIE

Query:  GWMNGEGGEMRVMGWEVVEGRSLYIKDMEKAGKWRSLGGEKEELAAEVETEVWISLLHEL
        GW++GEGGEMRVMGWE+ EGRSLYIKDMEKAGKWRSLGGEKEELAAEVETEVWI LLH+L
Subjt:  GWMNGEGGEMRVMGWEVVEGRSLYIKDMEKAGKWRSLGGEKEELAAEVETEVWISLLHEL

XP_022958448.1 uncharacterized protein LOC111459668 [Cucurbita moschata]2.8e-19586.9Show/hide
Query:  MASLVDSSSWSLVAAKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVVRAFK
        MASLVDSSSWSLVAAKPTSFKLKDFLLQDDFSSCSS+GFRSFPRSQCCTTTVRFLL+NDFKRTDSSVTK+LPPPTASTRIALPAVSTLQKASNAVVRAFK
Subjt:  MASLVDSSSWSLVAAKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVVRAFK

Query:  QFPLPRSISRKLILVSFWKKSNVTHSNTKRWKSFREFLDEKEPPSSSDHDH---ADSTAIAIAGRNSICSCSNNSISWTESEFTSEMIPSSSSGNFDSCS
        QFPL RSISRKLILVSFWKKSNV  SN KRWKSFREFLDEKEPPSSSDHDH   ADS+ IA+A RNSICSCS NSISWTESEFTSEMIPSSSSGNFDSCS
Subjt:  QFPLPRSISRKLILVSFWKKSNVTHSNTKRWKSFREFLDEKEPPSSSDHDH---ADSTAIAIAGRNSICSCSNNSISWTESEFTSEMIPSSSSGNFDSCS

Query:  EIDVAKDDKDSPG---NLIPKTDGIALGKDSIEETT------AAYPEKAIKQLGNEEEKEQSSPVSVLDFGLKRLEKGAELEPVDLKKRFADIGNGPREL
        EID AKDDKDSPG   NLI KTDG+ALGKD IEETT      AAYPEKAIKQLGNEEEKEQ SPVSVLDFGLKRLEKGAELEPVDLKKRFAD       L
Subjt:  EIDVAKDDKDSPG---NLIPKTDGIALGKDSIEETT------AAYPEKAIKQLGNEEEKEQSSPVSVLDFGLKRLEKGAELEPVDLKKRFADIGNGPREL

Query:  GLIISTKEGQREQKAFELLKLVKS---SQCFTLKTENLVLDLIHEKLEEDESSGRSGHKRRGCGFEEEKVVKLIEGWMNGEGGEMRVMGWEVVEGRSLYI
        GLIISTKEGQREQKA EL++LVKS   SQCFTLKTENLVLDLIHEKLEE+E    S    RGCGFEEEKVVKLIEGW++GEGGEMRVMGWE+ EGRSLYI
Subjt:  GLIISTKEGQREQKAFELLKLVKS---SQCFTLKTENLVLDLIHEKLEEDESSGRSGHKRRGCGFEEEKVVKLIEGWMNGEGGEMRVMGWEVVEGRSLYI

Query:  KDMEKAGKWRSLGGEKEELAAEVETEVWISLLHEL
        KDMEKAGKWRSLGGEKEELAAEVETEVWI LL EL
Subjt:  KDMEKAGKWRSLGGEKEELAAEVETEVWISLLHEL

XP_022996339.1 uncharacterized protein LOC111491600 [Cucurbita maxima]1.5e-233100Show/hide
Query:  MASLVDSSSWSLVAAKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVVRAFK
        MASLVDSSSWSLVAAKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVVRAFK
Subjt:  MASLVDSSSWSLVAAKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVVRAFK

Query:  QFPLPRSISRKLILVSFWKKSNVTHSNTKRWKSFREFLDEKEPPSSSDHDHADSTAIAIAGRNSICSCSNNSISWTESEFTSEMIPSSSSGNFDSCSEID
        QFPLPRSISRKLILVSFWKKSNVTHSNTKRWKSFREFLDEKEPPSSSDHDHADSTAIAIAGRNSICSCSNNSISWTESEFTSEMIPSSSSGNFDSCSEID
Subjt:  QFPLPRSISRKLILVSFWKKSNVTHSNTKRWKSFREFLDEKEPPSSSDHDHADSTAIAIAGRNSICSCSNNSISWTESEFTSEMIPSSSSGNFDSCSEID

Query:  VAKDDKDSPGNLIPKTDGIALGKDSIEETTAAYPEKAIKQLGNEEEKEQSSPVSVLDFGLKRLEKGAELEPVDLKKRFADIGNGPRELGLIISTKEGQRE
        VAKDDKDSPGNLIPKTDGIALGKDSIEETTAAYPEKAIKQLGNEEEKEQSSPVSVLDFGLKRLEKGAELEPVDLKKRFADIGNGPRELGLIISTKEGQRE
Subjt:  VAKDDKDSPGNLIPKTDGIALGKDSIEETTAAYPEKAIKQLGNEEEKEQSSPVSVLDFGLKRLEKGAELEPVDLKKRFADIGNGPRELGLIISTKEGQRE

Query:  QKAFELLKLVKSSQCFTLKTENLVLDLIHEKLEEDESSGRSGHKRRGCGFEEEKVVKLIEGWMNGEGGEMRVMGWEVVEGRSLYIKDMEKAGKWRSLGGE
        QKAFELLKLVKSSQCFTLKTENLVLDLIHEKLEEDESSGRSGHKRRGCGFEEEKVVKLIEGWMNGEGGEMRVMGWEVVEGRSLYIKDMEKAGKWRSLGGE
Subjt:  QKAFELLKLVKSSQCFTLKTENLVLDLIHEKLEEDESSGRSGHKRRGCGFEEEKVVKLIEGWMNGEGGEMRVMGWEVVEGRSLYIKDMEKAGKWRSLGGE

Query:  KEELAAEVETEVWISLLHELY
        KEELAAEVETEVWISLLHELY
Subjt:  KEELAAEVETEVWISLLHELY

XP_023532695.1 uncharacterized protein LOC111794789 [Cucurbita pepo subsp. pepo]5.3e-19486.87Show/hide
Query:  MASLVDSSSWSLVAAKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVVRAFK
        MASLVDSSSWSLVAAKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLL+NDFKRTDSSVTK+LPPPTASTRIALPAVSTLQKASNAVVRAFK
Subjt:  MASLVDSSSWSLVAAKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVVRAFK

Query:  QFPLPRSISRKLILVSFWKKSNVTHSNTKRWKSFREFLDEKEPP-SSSDHDH---ADSTAIAIAGRNSICSCSNNSISWTESEFTSEMIPSSSSGNFDSC
        QFPL RSISRKLIL+SFWKKSNV  SN KRWKSFREFLDEKEPP SSSDHDH   ADS+ IA+A RNSICSCS NSISWTESEFTSEMIPSSSSGNFDSC
Subjt:  QFPLPRSISRKLILVSFWKKSNVTHSNTKRWKSFREFLDEKEPP-SSSDHDH---ADSTAIAIAGRNSICSCSNNSISWTESEFTSEMIPSSSSGNFDSC

Query:  SEIDVAKDDKDSPGNLIPKTDGIALGKDSIEE------TTAAYPEKAIKQLGNEEEKEQSSPVSVLDFGLKRLEKGAELEPVDLKKRFADIGNGPRELGL
        SEID AKD+KDSPGNLI KTD IALGKD IEE      TTAAYPEKAIKQLGNEEEKEQSSPVSVLDFGLKRLEKGAELEPV+LKKRFAD       LGL
Subjt:  SEIDVAKDDKDSPGNLIPKTDGIALGKDSIEE------TTAAYPEKAIKQLGNEEEKEQSSPVSVLDFGLKRLEKGAELEPVDLKKRFADIGNGPRELGL

Query:  IISTKEGQREQKAFELLKLVKS---SQCFTLKTENLVLDLIHEKL-EEDESSGRSGHKRRGCGFEEEKVVKLIEGWMNGEGGEMRVMGWEVVEGRSLYIK
        IISTKEGQREQKA EL++LVKS   SQCF  KTENLVLDLIHEKL EEDESSG      RGCGFEEEKVVKLI+GW++GEGGEMRVMGWE+ EGRSLYIK
Subjt:  IISTKEGQREQKAFELLKLVKS---SQCFTLKTENLVLDLIHEKL-EEDESSGRSGHKRRGCGFEEEKVVKLIEGWMNGEGGEMRVMGWEVVEGRSLYIK

Query:  DMEKAGKWRSLGGEKEELAAEVETEVWISLLHEL
        DMEKAGKWRSLGGEKEELAAEVETEVWI LL EL
Subjt:  DMEKAGKWRSLGGEKEELAAEVETEVWISLLHEL

TrEMBL top hitse value%identityAlignment
A0A0A0KP06 Uncharacterized protein1.7e-11856.22Show/hide
Query:  MASLVDSSSWSLVA----AKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVV
        MAS  DSS+W+L++     KP S  LKD+LL DDFSSCSSNGFRSFPR QCC+TTVRFLL  D K  DSSVTK   P T S +IAL  +STLQ+AS+AV+
Subjt:  MASLVDSSSWSLVA----AKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVV

Query:  RAFKQFPL--------PRSISRKLILVSFWKKSNVTHSN-TKRWKSFREFLDEKEPPSSS--DHDHADS---TAIAIAGRNSICSCSNNSISWTESEFTS
        RAFKQFPL        PRSISRKLI  +F KKS++   N  KRWKSF+EFLDEKEPPSSS  + +H+DS   TAIA+AGRNSI SCS NSISWTESEFTS
Subjt:  RAFKQFPL--------PRSISRKLILVSFWKKSNVTHSN-TKRWKSFREFLDEKEPPSSS--DHDHADS---TAIAIAGRNSICSCSNNSISWTESEFTS

Query:  EMIPSSSSGNFDSCSEIDVAKDDKDSPGNLIPKTDGIALGKDSIEETTAA------------YPEKAIKQLGNEEEKEQSSPVSVLDFGL----------
        E+IPSS SGN +SCSE D  KDDKDSPGNLI K DG+  GKDS+EETT A            Y E  +KQ  NEEEKEQ SPVSVLDF            
Subjt:  EMIPSSSSGNFDSCSEIDVAKDDKDSPGNLIPKTDGIALGKDSIEETTAA------------YPEKAIKQLGNEEEKEQSSPVSVLDFGL----------

Query:  --------------------KRLEKGAELEPVDLKKRFADIG-NGPRELGLIISTKEGQREQKAFELLKLVKSSQCFTLKTENLVLDLIHEKLEEDESSG
                            KRLEKG ELEPVDLKKRF +I   G ++   +I+ KE Q E+KA E LKL+KS+   T  TENL+LD  H+KL+E E++ 
Subjt:  --------------------KRLEKGAELEPVDLKKRFADIG-NGPRELGLIISTKEGQREQKAFELLKLVKSSQCFTLKTENLVLDLIHEKLEEDESSG

Query:  RSGHKRRGCGFEEEKVVKLIEGWMNGEGGEMRVMG-WEVVEGRSLYIKDMEKAGKWRSLGGEKEELAAEVETEVWISLLHEL
         +        F++ +++K  + W++G  GE+ VMG WE+ E R+ YIKDME   KWRS GG+KEEL AE E EVWISLL++L
Subjt:  RSGHKRRGCGFEEEKVVKLIEGWMNGEGGEMRVMG-WEVVEGRSLYIKDMEKAGKWRSLGGEKEELAAEVETEVWISLLHEL

A0A6J1E8H2 uncharacterized protein LOC1114303531.9e-12558.65Show/hide
Query:  MASLVDSSSWSLVA----AKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVV
        MAS +DSS+WS+++     KPTSF LKD+LL DDFSSCSSNGFRSFPR QCC TTVRFLL  D K  DS++TK   P TAS +IAL  +STLQ+AS+AVV
Subjt:  MASLVDSSSWSLVA----AKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVV

Query:  RAFKQFPLP--------RSISRKLILVSFWKKSNVTHSNTKRWKSFREFLDEKEPPSSSDHDHADSTAIAIAGRNSICSCSNNSISWTESEFTSEMIPSS
        RAFK+FPLP        RS SRK+IL +FWKK +    NT+R KSF+EFLDEKEPP S   D A  TA+ + GRNSI SCS NSISWTESEFTSEMIPSS
Subjt:  RAFKQFPLP--------RSISRKLILVSFWKKSNVTHSNTKRWKSFREFLDEKEPPSSSDHDHADSTAIAIAGRNSICSCSNNSISWTESEFTSEMIPSS

Query:  SSGNFDSCSEIDVAKDDKDSPGNLIPKTDGIALGKDSIEETTAA-------------YPEKAIKQLGNEEEKEQSSPVSVLDFGL---------------
        SSGN +SCSE D  K DKDSPGNLI K DG+  GKDS+EETT A             + E  +K   NEEEKEQ SPVSVLDF                 
Subjt:  SSGNFDSCSEIDVAKDDKDSPGNLIPKTDGIALGKDSIEETTAA-------------YPEKAIKQLGNEEEKEQSSPVSVLDFGL---------------

Query:  --------------KRLEKGAELEPVDLKKRFADIGNGPRELGLIISTKEGQREQKAFELLKLVKSSQCFTLKTENLVLDLIHEKLEEDESSGRSGHKRR
                      KR E G E EP+DLKKRFADI  G +  G  IS KE QREQKAFELLKLVKS+   T  TENL+LD  HEKLEE+++  R+     
Subjt:  --------------KRLEKGAELEPVDLKKRFADIGNGPRELGLIISTKEGQREQKAFELLKLVKSSQCFTLKTENLVLDLIHEKLEEDESSGRSGHKRR

Query:  GCGFEEEKVVKLIEGWMNGEGGEMRVMGWEVVEGRSLYIKDMEKAGKWRSLGGEKEELAAEVETEVWISLLHEL
        G  F++ +V+K  E W+NG+ GE    GWE  EGR LYIKDMEKAGKWRSL GEKEELAAE E EVW+SL  EL
Subjt:  GCGFEEEKVVKLIEGWMNGEGGEMRVMGWEVVEGRSLYIKDMEKAGKWRSLGGEKEELAAEVETEVWISLLHEL

A0A6J1H339 uncharacterized protein LOC1114596681.4e-19586.9Show/hide
Query:  MASLVDSSSWSLVAAKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVVRAFK
        MASLVDSSSWSLVAAKPTSFKLKDFLLQDDFSSCSS+GFRSFPRSQCCTTTVRFLL+NDFKRTDSSVTK+LPPPTASTRIALPAVSTLQKASNAVVRAFK
Subjt:  MASLVDSSSWSLVAAKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVVRAFK

Query:  QFPLPRSISRKLILVSFWKKSNVTHSNTKRWKSFREFLDEKEPPSSSDHDH---ADSTAIAIAGRNSICSCSNNSISWTESEFTSEMIPSSSSGNFDSCS
        QFPL RSISRKLILVSFWKKSNV  SN KRWKSFREFLDEKEPPSSSDHDH   ADS+ IA+A RNSICSCS NSISWTESEFTSEMIPSSSSGNFDSCS
Subjt:  QFPLPRSISRKLILVSFWKKSNVTHSNTKRWKSFREFLDEKEPPSSSDHDH---ADSTAIAIAGRNSICSCSNNSISWTESEFTSEMIPSSSSGNFDSCS

Query:  EIDVAKDDKDSPG---NLIPKTDGIALGKDSIEETT------AAYPEKAIKQLGNEEEKEQSSPVSVLDFGLKRLEKGAELEPVDLKKRFADIGNGPREL
        EID AKDDKDSPG   NLI KTDG+ALGKD IEETT      AAYPEKAIKQLGNEEEKEQ SPVSVLDFGLKRLEKGAELEPVDLKKRFAD       L
Subjt:  EIDVAKDDKDSPG---NLIPKTDGIALGKDSIEETT------AAYPEKAIKQLGNEEEKEQSSPVSVLDFGLKRLEKGAELEPVDLKKRFADIGNGPREL

Query:  GLIISTKEGQREQKAFELLKLVKS---SQCFTLKTENLVLDLIHEKLEEDESSGRSGHKRRGCGFEEEKVVKLIEGWMNGEGGEMRVMGWEVVEGRSLYI
        GLIISTKEGQREQKA EL++LVKS   SQCFTLKTENLVLDLIHEKLEE+E    S    RGCGFEEEKVVKLIEGW++GEGGEMRVMGWE+ EGRSLYI
Subjt:  GLIISTKEGQREQKAFELLKLVKS---SQCFTLKTENLVLDLIHEKLEEDESSGRSGHKRRGCGFEEEKVVKLIEGWMNGEGGEMRVMGWEVVEGRSLYI

Query:  KDMEKAGKWRSLGGEKEELAAEVETEVWISLLHEL
        KDMEKAGKWRSLGGEKEELAAEVETEVWI LL EL
Subjt:  KDMEKAGKWRSLGGEKEELAAEVETEVWISLLHEL

A0A6J1HZ34 uncharacterized protein LOC1114689621.5e-12558.99Show/hide
Query:  MASLVDSSSWSLVA----AKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVV
        MAS +DSS+WS+++     KPTSF LKD+LL DDFSSCSSNGFRSFPR QCC TTVRFLL  D K  DSS+TK   P TAS +IAL  +STLQ+AS+AVV
Subjt:  MASLVDSSSWSLVA----AKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVV

Query:  RAFKQFPLP--------RSISRKLILVSFWKKSNVTHSNTKRWKSFREFLDEKEPPSSSDHDHADSTAIAIAGRNSICSCSNNSISWTESEFTSEMIPSS
        RAFK+FPLP        RS SRK+IL +FWKK +    NT+R KSF+EFLDEKEPP S   D A  TA+ + GRNSI SCS NSISWTESEFTSE IPSS
Subjt:  RAFKQFPLP--------RSISRKLILVSFWKKSNVTHSNTKRWKSFREFLDEKEPPSSSDHDHADSTAIAIAGRNSICSCSNNSISWTESEFTSEMIPSS

Query:  SSGNFDSCSEIDVAKDDKDSPGNLIPKTDGIALGKDSIEETTAA-------------YPEKAIKQLGNEEEKEQSSPVSVLDFGL---------------
        SSGN +SCSE D  K DKDSPGNLI K DG+  GKDS+EETT A             + E  +KQ  NEEEKEQ SPVSVLDF                 
Subjt:  SSGNFDSCSEIDVAKDDKDSPGNLIPKTDGIALGKDSIEETTAA-------------YPEKAIKQLGNEEEKEQSSPVSVLDFGL---------------

Query:  -------------KRLEKGAELEPVDLKKRFADIGNGPRELGLIISTKEGQREQKAFELLKLVKSSQCFTLKTENLVLDLIHEKLEEDESSGRSGHKRRG
                     KR E G E EP+DLKKRFADI  G +   L IS KE QREQKAFELLKLVKS+   T  TENL+LD  HEKLEE+++  R+     G
Subjt:  -------------KRLEKGAELEPVDLKKRFADIGNGPRELGLIISTKEGQREQKAFELLKLVKSSQCFTLKTENLVLDLIHEKLEEDESSGRSGHKRRG

Query:  CGFEEEKVVKLIEGWMNGEGGEMRVMGWEVVEGRSLYIKDMEKAGKWRSLGGEKEELAAEVETEVWISLLHEL
           ++ +V+K  E W+NG+ GE  V GWE  EGR LYIKDME AGKWRS+GGEKEELAAE E EVWISL  EL
Subjt:  CGFEEEKVVKLIEGWMNGEGGEMRVMGWEVVEGRSLYIKDMEKAGKWRSLGGEKEELAAEVETEVWISLLHEL

A0A6J1K8F7 uncharacterized protein LOC1114916007.3e-234100Show/hide
Query:  MASLVDSSSWSLVAAKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVVRAFK
        MASLVDSSSWSLVAAKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVVRAFK
Subjt:  MASLVDSSSWSLVAAKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVVRAFK

Query:  QFPLPRSISRKLILVSFWKKSNVTHSNTKRWKSFREFLDEKEPPSSSDHDHADSTAIAIAGRNSICSCSNNSISWTESEFTSEMIPSSSSGNFDSCSEID
        QFPLPRSISRKLILVSFWKKSNVTHSNTKRWKSFREFLDEKEPPSSSDHDHADSTAIAIAGRNSICSCSNNSISWTESEFTSEMIPSSSSGNFDSCSEID
Subjt:  QFPLPRSISRKLILVSFWKKSNVTHSNTKRWKSFREFLDEKEPPSSSDHDHADSTAIAIAGRNSICSCSNNSISWTESEFTSEMIPSSSSGNFDSCSEID

Query:  VAKDDKDSPGNLIPKTDGIALGKDSIEETTAAYPEKAIKQLGNEEEKEQSSPVSVLDFGLKRLEKGAELEPVDLKKRFADIGNGPRELGLIISTKEGQRE
        VAKDDKDSPGNLIPKTDGIALGKDSIEETTAAYPEKAIKQLGNEEEKEQSSPVSVLDFGLKRLEKGAELEPVDLKKRFADIGNGPRELGLIISTKEGQRE
Subjt:  VAKDDKDSPGNLIPKTDGIALGKDSIEETTAAYPEKAIKQLGNEEEKEQSSPVSVLDFGLKRLEKGAELEPVDLKKRFADIGNGPRELGLIISTKEGQRE

Query:  QKAFELLKLVKSSQCFTLKTENLVLDLIHEKLEEDESSGRSGHKRRGCGFEEEKVVKLIEGWMNGEGGEMRVMGWEVVEGRSLYIKDMEKAGKWRSLGGE
        QKAFELLKLVKSSQCFTLKTENLVLDLIHEKLEEDESSGRSGHKRRGCGFEEEKVVKLIEGWMNGEGGEMRVMGWEVVEGRSLYIKDMEKAGKWRSLGGE
Subjt:  QKAFELLKLVKSSQCFTLKTENLVLDLIHEKLEEDESSGRSGHKRRGCGFEEEKVVKLIEGWMNGEGGEMRVMGWEVVEGRSLYIKDMEKAGKWRSLGGE

Query:  KEELAAEVETEVWISLLHELY
        KEELAAEVETEVWISLLHELY
Subjt:  KEELAAEVETEVWISLLHELY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G00770.1 unknown protein4.0e-0621.73Show/hide
Query:  LKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVVRAFKQFPLPRSISRKLILVS-FWKK
        LKD LL+D  +SCSSNGF+S PR             N F          + P    +      ++ ++   +  +++     LPRS+SR+L   +    +
Subjt:  LKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVVRAFKQFPLPRSISRKLILVS-FWKK

Query:  SNVTHSNTK---RWKSFREFLDEKEPPSSSDHDHADSTAIAIAGRNSICSCSNNSISWTESEFTSEMIPSSSSGNFDSCSEIDVAKDDKDSPGNLIPKTD
        +++T    K   RW S ++  ++        +   ++T    +   S  SCS    SW++ +FTSE +PSS   N + C E    K++            
Subjt:  SNVTHSNTK---RWKSFREFLDEKEPPSSSDHDHADSTAIAIAGRNSICSCSNNSISWTESEFTSEMIPSSSSGNFDSCSEIDVAKDDKDSPGNLIPKTD

Query:  GIALGKDSIEETTAAYPEKAIKQLGNEEEKEQSSPVSVLDF-----------------------------GLKRLEKGAELEPVDLKKRFADIGNGPREL
           +G+DS      A  E   ++   + EKE +SPVSV +                               ++R E  A + P +L +  +       E 
Subjt:  GIALGKDSIEETTAAYPEKAIKQLGNEEEKEQSSPVSVLDF-----------------------------GLKRLEKGAELEPVDLKKRFADIGNGPREL

Query:  GLIISTK------------------EGQREQKAFELLKLVKSSQCFTLKTENLVLDLIHEKLEEDESSGRSGHKRRGCGFEEEKVVKLIEGWMNGEGGEM
        G    TK                    + E+KA +L   VK      +  E+L++D   ++L +  +S           FE + V +  +GW+ G+  E 
Subjt:  GLIISTK------------------EGQREQKAFELLKLVKSSQCFTLKTENLVLDLIHEKLEEDESSGRSGHKRRGCGFEEEKVVKLIEGWMNGEGGEM

Query:  RVMGWEVVEGRSLYIKDMEKAGKW--RSLGGEKEELAAEVETEVWISLLHE
         +      + R    +++E+   W  + +  E E +  ++E E++  L+ E
Subjt:  RVMGWEVVEGRSLYIKDMEKAGKW--RSLGGEKEELAAEVETEVWISLLHE

AT4G11780.1 unknown protein3.3e-2129.15Show/hide
Query:  MASLVDSSSWSLVAAKP--TSFKLKDFLLQDDFSSCSSNGFRSFPRSQ--CCTTTVRFLLHNDFKRT----------DSSVTKTLPPPTASTRIALPAVS
        MAS++ SS   L  +K       L+D+LL DD SSCSSNGF+SFPR Q    ++TVR LL  + KR+             +T+     T  T I+     
Subjt:  MASLVDSSSWSLVAAKP--TSFKLKDFLLQDDFSSCSSNGFRSFPRSQ--CCTTTVRFLLHNDFKRT----------DSSVTKTLPPPTASTRIALPAVS

Query:  TLQKASNAVVRAF--------KQFPLPRSISRKLILVSFWKKSNVTHS----------NTKRWKS--FREFLDEK----EPPSSSDHDHADSTAIA----
         + KAS A ++          KQ    RS S++L+ +SFW+K  V  S            + W+S  + E LD++       S++D     ST+ A    
Subjt:  TLQKASNAVVRAF--------KQFPLPRSISRKLILVSFWKKSNVTHS----------NTKRWKS--FREFLDEK----EPPSSSDHDHADSTAIA----

Query:  ----IAGRNSICSCS--NNSISWTESEFTSEMIPSSSSGNFDSCSEIDVAKDDKDSPGNLIPKTDGIALGKDSIEETTAAYPEKAIKQLGNEEEKEQSSP
            I+G +S        NS S      +S    SSS  + +  SEID  +D K+S G+ +   DG      S+    +    K        EEKEQ SP
Subjt:  ----IAGRNSICSCS--NNSISWTESEFTSEMIPSSSSGNFDSCSEIDVAKDDKDSPGNLIPKTDGIALGKDSIEETTAAYPEKAIKQLGNEEEKEQSSP

Query:  VSVLDFGLK-----------------------RLEKGAELEPVDLKKRFADIGNGPRELGL-IISTKEGQREQKAFELLKLVK--SSQCFTLKTENLVLD
        VS+L+   K                       RL     LEP+DL KR         E     + T+E + E +A  L  LVK    +   L    +  +
Subjt:  VSVLDFGLK-----------------------RLEKGAELEPVDLKKRFADIGNGPRELGL-IISTKEGQREQKAFELLKLVK--SSQCFTLKTENLVLD

Query:  LIHEKLEEDESSGRSGHKRRGCGFEEEKVVKLIEGWMNGEGGEMRVMGWEVVEGRSLYIKDMEKAGKWRSLGG-EKEELAAEVETEVWISLLHE
        L+ + L+ED    +          EE  +VK  E W+ G   EM  M WEV   R +Y+K+M    KW  + G E+E +  E+    + S + E
Subjt:  LIHEKLEEDESSGRSGHKRRGCGFEEEKVVKLIEGWMNGEGGEMRVMGWEVVEGRSLYIKDMEKAGKWRSLGG-EKEELAAEVETEVWISLLHE

AT4G23020.1 unknown protein1.3e-1728.32Show/hide
Query:  LKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVVRAFKQFPLPRSI---------SRKL
        L+DFLL DD SSCSSNGF+SFPR          + H++ + T                  L     + KAS A++ A K  P P S+          + L
Subjt:  LKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVVRAFKQFPLPRSI---------SRKL

Query:  ILVSFWKKSNVTHSNT----------------KRWKSFREFLDEKEPPSSSDHDHADSTAIAIAGRNSICSCSNNSISWTESEFTSE--MIPSSSSGNFD
           SFWKK +    N                 +R +SF EFL E +    SD  +  S     +G  ++   S +++    S F+SE   +  SSSG   
Subjt:  ILVSFWKKSNVTHSNT----------------KRWKSFREFLDEKEPPSSSDHDHADSTAIAIAGRNSICSCSNNSISWTESEFTSE--MIPSSSSGNFD

Query:  SCSEIDVAKDDKDSPGNLIPKTDGIALGKDSIEETTAAYPEKAIKQLGNEEEKEQSSPVSVLDFGL-----------------------KRLEKGAELEP
            + V     D  G+ +  +DG +L  D+ EE                EEKEQ SP+S+LD                          +RLE    LEP
Subjt:  SCSEIDVAKDDKDSPGNLIPKTDGIALGKDSIEETTAAYPEKAIKQLGNEEEKEQSSPVSVLDFGL-----------------------KRLEKGAELEP

Query:  VDLKKRFADIGNGPRELGLIISTKEGQREQKAFELLKLVKSSQCFTLKTENLVLDLIHEKLEEDESSGRSGHKRRGCGFEEEKVVKLIEGW-MNGEGGEM
        VDL+KR             II  +E Q E +A  L  LVK S+    + + L   ++   L +      +   R     +E+K+V+++E W M  +  E 
Subjt:  VDLKKRFADIGNGPRELGLIISTKEGQREQKAFELLKLVKSSQCFTLKTENLVLDLIHEKLEEDESSGRSGHKRRGCGFEEEKVVKLIEGW-MNGEGGEM

Query:  RV-MGWEVVEGRSLYIKDMEKAGKWRSLGG-EKEELAAEVETEVWISLLHEL
         + M W+V E R +Y+K+M    KW  + G EKE +  E+      SL+ EL
Subjt:  RV-MGWEVVEGRSLYIKDMEKAGKWRSLGG-EKEELAAEVETEVWISLLHEL

AT4G23020.2 unknown protein1.9e-1627.85Show/hide
Query:  LKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVVRAFKQFPLPRSI---------SRKL
        L+DFLL DD SSCSSNGF+SFPR          + H++ + T                  L     + KAS A++ A K  P P S+          + L
Subjt:  LKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVVRAFKQFPLPRSI---------SRKL

Query:  ILVSFWKKSNVTHSNT----------------KRWKSFREFLDEKEPPSSSDHDHADSTAIAIAGRNSICSCSNNSISWTESEFTSE--MIPSSSSGNFD
           SFWKK +    N                 +R +SF EFL E +    SD  +  S     +G  ++   S +++    S F+SE   +  SSSG   
Subjt:  ILVSFWKKSNVTHSNT----------------KRWKSFREFLDEKEPPSSSDHDHADSTAIAIAGRNSICSCSNNSISWTESEFTSE--MIPSSSSGNFD

Query:  SCSEIDVAKDDKDSPGNLIPKTDGIALGKDS-IEETTAAYPEKAIKQLGNE---EEKEQSSPVSVLDFGL-----------------------KRLEKGA
             D          +L   T+    G +S       + P   +  L  E   EEKEQ SP+S+LD                          +RLE   
Subjt:  SCSEIDVAKDDKDSPGNLIPKTDGIALGKDS-IEETTAAYPEKAIKQLGNE---EEKEQSSPVSVLDFGL-----------------------KRLEKGA

Query:  ELEPVDLKKRFADIGNGPRELGLIISTKEGQREQKAFELLKLVKSSQCFTLKTENLVLDLIHEKLEEDESSGRSGHKRRGCGFEEEKVVKLIEGW-MNGE
         LEPVDL+KR             II  +E Q E +A  L  LVK S+    + + L   ++   L +      +   R     +E+K+V+++E W M  +
Subjt:  ELEPVDLKKRFADIGNGPRELGLIISTKEGQREQKAFELLKLVKSSQCFTLKTENLVLDLIHEKLEEDESSGRSGHKRRGCGFEEEKVVKLIEGW-MNGE

Query:  GGEMRV-MGWEVVEGRSLYIKDMEKAGKWRSLGG-EKEELAAEVETEVWISLLHEL
          E  + M W+V E R +Y+K+M    KW  + G EKE +  E+      SL+ EL
Subjt:  GGEMRV-MGWEVVEGRSLYIKDMEKAGKWRSLGG-EKEELAAEVETEVWISLLHEL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCGTTGGTGGATTCTTCTAGTTGGAGCCTCGTCGCCGCGAAACCTACGTCTTTCAAGCTCAAGGATTTTCTTCTTCAAGACGATTTCAGCTCTTGCTCCTCTAA
TGGCTTCCGATCGTTTCCACGAAGCCAATGCTGCACCACAACCGTCCGATTTCTCCTCCACAACGATTTCAAACGGACAGATTCTTCCGTAACTAAAACATTACCACCTC
CAACCGCCTCCACGAGAATCGCTCTCCCCGCGGTCTCCACTTTGCAAAAGGCGTCCAACGCTGTTGTCAGAGCATTTAAGCAGTTTCCGTTGCCCAGGAGTATTTCGCGG
AAACTGATTCTCGTATCCTTTTGGAAGAAATCGAATGTCACTCATTCCAACACCAAACGCTGGAAATCGTTTCGGGAATTTCTCGATGAGAAAGAACCGCCGTCGTCGTC
TGACCACGATCACGCCGATTCCACGGCCATTGCTATCGCTGGAAGAAACTCGATCTGTAGCTGTAGTAATAATAGCATCAGTTGGACGGAGAGCGAATTTACATCGGAGA
TGATTCCGTCGTCTTCGAGCGGTAATTTCGACAGTTGCAGCGAAATCGACGTCGCCAAGGACGATAAGGATTCGCCTGGTAATCTCATACCCAAAACAGATGGCATAGCG
TTGGGAAAAGATTCCATTGAAGAAACAACCGCCGCCTACCCGGAGAAAGCCATTAAGCAATTGGGAAATGAAGAAGAAAAGGAGCAGTCGAGTCCAGTTTCAGTCCTGGA
TTTTGGTTTGAAGCGATTGGAGAAGGGAGCCGAATTGGAACCTGTAGACTTGAAGAAGCGATTCGCTGATATAGGCAATGGCCCTCGGGAGTTGGGTTTAATAATATCCA
CAAAAGAAGGGCAGAGGGAACAGAAGGCATTCGAGCTTCTAAAGCTCGTAAAATCATCGCAGTGCTTCACATTGAAAACAGAGAATCTGGTGCTGGATTTGATCCATGAA
AAGCTGGAAGAAGATGAATCAAGTGGAAGAAGTGGGCATAAAAGAAGAGGGTGTGGTTTTGAAGAAGAAAAGGTTGTGAAATTAATAGAAGGTTGGATGAATGGAGAGGG
CGGAGAAATGAGGGTGATGGGATGGGAGGTGGTGGAAGGACGGAGTTTGTACATTAAGGATATGGAGAAGGCGGGAAAGTGGAGGAGTTTGGGCGGAGAAAAGGAAGAAT
TGGCGGCGGAGGTTGAAACCGAGGTTTGGATCTCTTTGCTTCATGAGCTATATTAG
mRNA sequenceShow/hide mRNA sequence
GCACGTCCCTCTGATTTCTGCTCCTCTGTTCTCTCTCTCTCTCTGTTTCAAAATCATTTTCGCAATTTGAATTCCTCTGTTTCATTTGATTTTGAAATGGCGTCGTTGGT
GGATTCTTCTAGTTGGAGCCTCGTCGCCGCGAAACCTACGTCTTTCAAGCTCAAGGATTTTCTTCTTCAAGACGATTTCAGCTCTTGCTCCTCTAATGGCTTCCGATCGT
TTCCACGAAGCCAATGCTGCACCACAACCGTCCGATTTCTCCTCCACAACGATTTCAAACGGACAGATTCTTCCGTAACTAAAACATTACCACCTCCAACCGCCTCCACG
AGAATCGCTCTCCCCGCGGTCTCCACTTTGCAAAAGGCGTCCAACGCTGTTGTCAGAGCATTTAAGCAGTTTCCGTTGCCCAGGAGTATTTCGCGGAAACTGATTCTCGT
ATCCTTTTGGAAGAAATCGAATGTCACTCATTCCAACACCAAACGCTGGAAATCGTTTCGGGAATTTCTCGATGAGAAAGAACCGCCGTCGTCGTCTGACCACGATCACG
CCGATTCCACGGCCATTGCTATCGCTGGAAGAAACTCGATCTGTAGCTGTAGTAATAATAGCATCAGTTGGACGGAGAGCGAATTTACATCGGAGATGATTCCGTCGTCT
TCGAGCGGTAATTTCGACAGTTGCAGCGAAATCGACGTCGCCAAGGACGATAAGGATTCGCCTGGTAATCTCATACCCAAAACAGATGGCATAGCGTTGGGAAAAGATTC
CATTGAAGAAACAACCGCCGCCTACCCGGAGAAAGCCATTAAGCAATTGGGAAATGAAGAAGAAAAGGAGCAGTCGAGTCCAGTTTCAGTCCTGGATTTTGGTTTGAAGC
GATTGGAGAAGGGAGCCGAATTGGAACCTGTAGACTTGAAGAAGCGATTCGCTGATATAGGCAATGGCCCTCGGGAGTTGGGTTTAATAATATCCACAAAAGAAGGGCAG
AGGGAACAGAAGGCATTCGAGCTTCTAAAGCTCGTAAAATCATCGCAGTGCTTCACATTGAAAACAGAGAATCTGGTGCTGGATTTGATCCATGAAAAGCTGGAAGAAGA
TGAATCAAGTGGAAGAAGTGGGCATAAAAGAAGAGGGTGTGGTTTTGAAGAAGAAAAGGTTGTGAAATTAATAGAAGGTTGGATGAATGGAGAGGGCGGAGAAATGAGGG
TGATGGGATGGGAGGTGGTGGAAGGACGGAGTTTGTACATTAAGGATATGGAGAAGGCGGGAAAGTGGAGGAGTTTGGGCGGAGAAAAGGAAGAATTGGCGGCGGAGGTT
GAAACCGAGGTTTGGATCTCTTTGCTTCATGAGCTATATTAG
Protein sequenceShow/hide protein sequence
MASLVDSSSWSLVAAKPTSFKLKDFLLQDDFSSCSSNGFRSFPRSQCCTTTVRFLLHNDFKRTDSSVTKTLPPPTASTRIALPAVSTLQKASNAVVRAFKQFPLPRSISR
KLILVSFWKKSNVTHSNTKRWKSFREFLDEKEPPSSSDHDHADSTAIAIAGRNSICSCSNNSISWTESEFTSEMIPSSSSGNFDSCSEIDVAKDDKDSPGNLIPKTDGIA
LGKDSIEETTAAYPEKAIKQLGNEEEKEQSSPVSVLDFGLKRLEKGAELEPVDLKKRFADIGNGPRELGLIISTKEGQREQKAFELLKLVKSSQCFTLKTENLVLDLIHE
KLEEDESSGRSGHKRRGCGFEEEKVVKLIEGWMNGEGGEMRVMGWEVVEGRSLYIKDMEKAGKWRSLGGEKEELAAEVETEVWISLLHELY