; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg014049 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg014049
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein Ycf2-like
Genome locationscaffold3:42839466..42843154
RNA-Seq ExpressionSpg014049
SyntenySpg014049
Gene Ontology termsNA
InterPro domainsIPR015410 - Domain of unknown function DUF1985


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047596.1 protein Ycf2-like [Cucumis melo var. makuwa]1.5e-8833.85Show/hide
Query:  TRASDRLKAAGVTPGRKPRKQTSPITLGSEQDSGDAMSTSVSVAKGSGEKTKGVKRDRGDGGSGKKVTP-TKKTKVHEWTKKTNDEIEKKP--TGARSNK
        TR SDRL+AAG+T  RK                  ++ T +     S E+   ++    +G  GK+ +P T K +V   TKK    ++KK       S K
Subjt:  TRASDRLKAAGVTPGRKPRKQTSPITLGSEQDSGDAMSTSVSVAKGSGEKTKGVKRDRGDGGSGKKVTP-TKKTKVHEWTKKTNDEIEKKP--TGARSNK

Query:  KKTRAKQTNDTDKASPVTPEIAL-ETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREK-KKTTIAKKEPPKKQKGGKKGKKLKTMVEGDTVRVDDDYLM
        +  R K      K   V  +IA+ E S  +      DT   S            + REK KK    +KE   ++   +KGK      + +    D  YLM
Subjt:  KKTRAKQTNDTDKASPVTPEIAL-ETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREK-KKTTIAKKEPPKKQKGGKKGKKLKTMVEGDTVRVDDDYLM

Query:  SPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLP
           +R++ LKINL  ++ +++ I   LGDR  + FR   FGH L+ S    SSQLLLHLIQ  CKPK TS+L F IGG++L+FGLREFALITGL C  +P
Subjt:  SPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLP

Query:  QIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKA
         I+ + +    R K  YF N + V R+ LN++FN    G + D +KMA+LY LESFL+P+QE   ++ +H++MV+D E+F  YPWGR AF LL  +M++ 
Subjt:  QIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKA

Query:  SVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRR
          S+G  GI MGG ++ ILAWAYEVIP LS PP  +  RI N VPRIIN   + QP+W++L  K+F S +LEV P+  T  E+ M +F PF++ E    +
Subjt:  SVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRR

Query:  LAGDEQQVGDDVRIPPNFSV--GAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNSKFKMQGDVGTGIDPTTK----------DG
         A DE +   +     + S+  G PS TS+++V+ K  ++I     R+ + +  L++ L+ +   +   N++F+       GI  TTK          + 
Subjt:  LAGDEQQVGDDVRIPPNFSV--GAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNSKFKMQGDVGTGIDPTTK----------DG

Query:  DVEEKEENDEKDDQDDHELE---KNPSHRREDDDGGPTGG--KQQQGPTT----PGPTTLVQTETRVDGEGTGDGGTKKTGGGEGT--KACDDADETINK
         +++ EE+ E+DD +D  L+   +    +R+DD+     G   + QG ++     G + + ++ET             K G  E +  KA ++ DE IN+
Subjt:  DVEEKEENDEKDDQDDHELE---KNPSHRREDDDGGPTGG--KQQQGPTT----PGPTTLVQTETRVDGEGTGDGGTKKTGGGEGT--KACDDADETINK

Query:  AILSIDEA----KVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLG-----PRWS--------------------------AKVNGTITRGGAKRSDCVE--
         I SIDE+    ++ +K  R R G+  +   P  + R   P  G     P+ +                            +N T    G KRS+  E  
Subjt:  AILSIDEA----KVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLG-----PRWS--------------------------AKVNGTITRGGAKRSDCVE--

Query:  EVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQ
        EV  + +TGI++DA+RG   +  ++        PSFDLHLSQ
Subjt:  EVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQ

TYK09852.1 protein Ycf2-like [Cucumis melo var. makuwa]8.9e-7035.03Show/hide
Query:  CKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEK
        CKPK TS+L F IG ++L+FGLREFALITGL C  +P I+ + ++   R K  YF N + V R+ LN++FN    G + D +KMA+LY LESFL+P+QE 
Subjt:  CKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEK

Query:  VHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSVGIGM-GGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELH
        + ++ +H++MV+D E+F  YPWGR AF LL  +M++A  S+G  GI M GG ++ ILAWAYEVIP LS PP  +A RI N VPRIINW  + QP+W++L 
Subjt:  VHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSVGIGM-GGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELH

Query:  VKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRRLAGDEQQVGDDVRIPPNFSV--GAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREI
         K+F S +LEV P+  T  E+ M +F PF + E    + A DE +   +     + S+  G PS TS+++V+ K  ++I     R+ + +  L++ L+ +
Subjt:  VKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRRLAGDEQQVGDDVRIPPNFSV--GAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREI

Query:  HKLDDPPNSKFKMQGDVGTGIDPTTKDGDVEEKEENDEKDDQDDHELE-KNPSHRREDDDGGPTGGKQQQGPTTPGPTTLVQTETRVDGEGTGDGGTKKT
           +   N++F+  G           +  +++ EE+ E+DD +D  L+  N +   + DD     GK+            +  E++ +     DGG  K 
Subjt:  HKLDDPPNSKFKMQGDVGTGIDPTTKDGDVEEKEENDEKDDQDDHELE-KNPSHRREDDDGGPTGGKQQQGPTTPGPTTLVQTETRVDGEGTGDGGTKKT

Query:  GGGEGT-----------KACDDADETINKAILSIDEA----KVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLG-----PRWS------------------
           E             KA ++ DE IN+ I SIDE+    ++ +K  R R G+  +   P  + R   P  G     P+ +                  
Subjt:  GGGEGT-----------KACDDADETINKAILSIDEA----KVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLG-----PRWS------------------

Query:  --------AKVNGTITRGGAKRSDCVE--EVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQ
                  +N T    G KRS+  E  EV  + +TGI++DA+RG   +  ++        PSFDLHLSQ
Subjt:  --------AKVNGTITRGGAKRSDCVE--EVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQ

XP_038883715.1 uncharacterized protein LOC120074618 isoform X1 [Benincasa hispida]1.8e-6233.61Show/hide
Query:  DKASPVTPEIALETSEDTAKNDT------EDTESNSVTNDNSSSDDVGEEREKKKTTIAKKE-PPKKQKGGKKGKK----LKTMVEGDTVRVDDDY--LM
        +K SP T E   +  +  ++  T       + E++ +  D  S  +VG ++ K  +  +K++   K++K  K+GKK           D V V  +Y  L+
Subjt:  DKASPVTPEIALETSEDTAKNDT------EDTESNSVTNDNSSSDDVGEEREKKKTTIAKKE-PPKKQKGGKKGKK----LKTMVEGDTVRVDDDY--LM

Query:  SPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLP
          S  S   +INL  + +++  I   L +R    F+ +CFG  LD    K SSQL  HL++ QC     +EL+F + G+I KFG++EF+LITGLNCG LP
Subjt:  SPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLP

Query:  QIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKA
        +ID  ++Q   +F   YF  ++ ++R  L+ VF  M  G   D+VKMA+LY LE F+L +Q +  I  E+ L+V+D+E F  YPWGR ++ +   ++ KA
Subjt:  QIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKA

Query:  SVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRR
          S  +  IG+GG  +A+L WAYE IP L       A R+    PR+ NW  +  PEW++L  K+FQS S +V PL  T TEM+M Y  PF   + +  +
Subjt:  SVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRR

Query:  LAGD-EQQVGDDVRIPPNFSV----GAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNSKFKMQGDVGTGIDP
        +    +Q+   D R   N       G PS+ +     +     +  K+  +  +LG+LV      H +D+  N   K+  +V    DP
Subjt:  LAGD-EQQVGDDVRIPPNFSV----GAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNSKFKMQGDVGTGIDP

XP_038883716.1 uncharacterized protein LOC120074618 isoform X2 [Benincasa hispida]3.6e-6333.68Show/hide
Query:  DKASPVTPEIALETSEDTAKNDT------EDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEPPKKQKGGKKGKK----LKTMVEGDTVRVDDDY--LMS
        +K SP T E   +  +  ++  T       + E++ +  D  S  +VG ++ K  +  +K    K++K  K+GKK           D V V  +Y  L+ 
Subjt:  DKASPVTPEIALETSEDTAKNDT------EDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEPPKKQKGGKKGKK----LKTMVEGDTVRVDDDY--LMS

Query:  PSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQ
         S  S   +INL  + +++  I   L +R    F+ +CFG  LD    K SSQL  HL++ QC     +EL+F + G+I KFG++EF+LITGLNCG LP+
Subjt:  PSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQ

Query:  IDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKAS
        ID  ++Q   +F   YF  ++ ++R  L+ VF  M  G   D+VKMA+LY LE F+L +Q +  I  E+ L+V+D+E F  YPWGR ++ +   ++ KA 
Subjt:  IDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKAS

Query:  VSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRRL
         S  +  IG+GG  +A+L WAYE IP L       A R+    PR+ NW  +  PEW++L  K+FQS S +V PL  T TEM+M Y  PF   + +  ++
Subjt:  VSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRRL

Query:  AGD-EQQVGDDVRIPPNFSV----GAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNSKFKMQGDVGTGIDP
            +Q+   D R   N       G PS+ +     +     +  K+  +  +LG+LV      H +D+  N   K+  +V    DP
Subjt:  AGD-EQQVGDDVRIPPNFSV----GAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNSKFKMQGDVGTGIDP

XP_038883719.1 uncharacterized protein LOC120074618 isoform X5 [Benincasa hispida]3.6e-6333.95Show/hide
Query:  DKASPVTPEIALETSEDTAKNDT------EDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEPPKKQKGGKKGKK---LKTMVEGDTVRVDDDY--LMSP
        +K SP T E   +  +  ++  T       + E++ +  D  S  +VG ++ K  +  +K    K++K  K+GKK     T  E D    D +Y  L+  
Subjt:  DKASPVTPEIALETSEDTAKNDT------EDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEPPKKQKGGKKGKK---LKTMVEGDTVRVDDDY--LMSP

Query:  SKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQI
        S  S   +INL  + +++  I   L +R    F+ +CFG  LD    K SSQL  HL++ QC     +EL+F + G+I KFG++EF+LITGLNCG LP+I
Subjt:  SKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQI

Query:  DTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASV
        D  ++Q   +F   YF  ++ ++R  L+ VF  M  G   D+VKMA+LY LE F+L +Q +  I  E+ L+V+D+E F  YPWGR ++ +   ++ KA  
Subjt:  DTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASV

Query:  SRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRRLA
        S  +  IG+GG  +A+L WAYE IP L       A R+    PR+ NW  +  PEW++L  K+FQS S +V PL  T TEM+M Y  PF   + +  ++ 
Subjt:  SRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRRLA

Query:  GD-EQQVGDDVRIPPNFSV----GAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNSKFKMQGDVGTGIDP
           +Q+   D R   N       G PS+ +     +     +  K+  +  +LG+LV      H +D+  N   K+  +V    DP
Subjt:  GD-EQQVGDDVRIPPNFSV----GAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNSKFKMQGDVGTGIDP

TrEMBL top hitse value%identityAlignment
A0A1S3B065 uncharacterized protein LOC103484737 isoform X48.7e-6336.92Show/hide
Query:  EKKPTGARSNKKKTRAKQTNDTDKASPVTPEIALETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEP---PKKQKGGKKGKK----LK
        ++ P   ++ +KK + K  +              E +E+T++ DT+      V           E R+KKK     K+     K++K GKKG K      
Subjt:  EKKPTGARSNKKKTRAKQTNDTDKASPVTPEIALETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEP---PKKQKGGKKGKK----LK

Query:  TMVEGDTVRVDDDY--LMSPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILK
             D V V  +Y  L+  S  S   +INL  + +++  I   L +R    F+ +CFG+ LD    K SSQL  HLI+ QC  K   EL+F + G+I K
Subjt:  TMVEGDTVRVDDDY--LMSPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILK

Query:  FGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTT
        FG+++FALITGLNCG LP ID  ++Q   +F   YF  ++ +RR  L+ VF  M  G   D+VKMA+LY LE F+L +Q +  I  E+ L+++D+E F +
Subjt:  FGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTT

Query:  YPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTE
        YPWGR ++ +   ++ KA  S  +  IG+GG  +A+  WAYE IP L+     +A RI    PR+ NW  +  PEW++L  K+FQS + +V PL  T+TE
Subjt:  YPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTE

Query:  MQMSYFQPF
        M+MSY  PF
Subjt:  MQMSYFQPF

A0A1S3B0L9 uncharacterized protein LOC103484737 isoform X58.7e-6336.92Show/hide
Query:  EKKPTGARSNKKKTRAKQTNDTDKASPVTPEIALETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEP---PKKQKGGKKGKK----LK
        ++ P   ++ +KK + K  +              E +E+T++ DT+      V           E R+KKK     K+     K++K GKKG K      
Subjt:  EKKPTGARSNKKKTRAKQTNDTDKASPVTPEIALETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEP---PKKQKGGKKGKK----LK

Query:  TMVEGDTVRVDDDY--LMSPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILK
             D V V  +Y  L+  S  S   +INL  + +++  I   L +R    F+ +CFG+ LD    K SSQL  HLI+ QC  K   EL+F + G+I K
Subjt:  TMVEGDTVRVDDDY--LMSPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILK

Query:  FGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTT
        FG+++FALITGLNCG LP ID  ++Q   +F   YF  ++ +RR  L+ VF  M  G   D+VKMA+LY LE F+L +Q +  I  E+ L+++D+E F +
Subjt:  FGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTT

Query:  YPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTE
        YPWGR ++ +   ++ KA  S  +  IG+GG  +A+  WAYE IP L+     +A RI    PR+ NW  +  PEW++L  K+FQS + +V PL  T+TE
Subjt:  YPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTE

Query:  MQMSYFQPF
        M+MSY  PF
Subjt:  MQMSYFQPF

A0A1S3B181 uncharacterized protein LOC103484737 isoform X78.7e-6336.92Show/hide
Query:  EKKPTGARSNKKKTRAKQTNDTDKASPVTPEIALETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEP---PKKQKGGKKGKK----LK
        ++ P   ++ +KK + K  +              E +E+T++ DT+      V           E R+KKK     K+     K++K GKKG K      
Subjt:  EKKPTGARSNKKKTRAKQTNDTDKASPVTPEIALETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEP---PKKQKGGKKGKK----LK

Query:  TMVEGDTVRVDDDY--LMSPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILK
             D V V  +Y  L+  S  S   +INL  + +++  I   L +R    F+ +CFG+ LD    K SSQL  HLI+ QC  K   EL+F + G+I K
Subjt:  TMVEGDTVRVDDDY--LMSPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILK

Query:  FGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTT
        FG+++FALITGLNCG LP ID  ++Q   +F   YF  ++ +RR  L+ VF  M  G   D+VKMA+LY LE F+L +Q +  I  E+ L+++D+E F +
Subjt:  FGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTT

Query:  YPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTE
        YPWGR ++ +   ++ KA  S  +  IG+GG  +A+  WAYE IP L+     +A RI    PR+ NW  +  PEW++L  K+FQS + +V PL  T+TE
Subjt:  YPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTE

Query:  MQMSYFQPF
        M+MSY  PF
Subjt:  MQMSYFQPF

A0A5A7U047 Protein Ycf2-like7.1e-8933.85Show/hide
Query:  TRASDRLKAAGVTPGRKPRKQTSPITLGSEQDSGDAMSTSVSVAKGSGEKTKGVKRDRGDGGSGKKVTP-TKKTKVHEWTKKTNDEIEKKP--TGARSNK
        TR SDRL+AAG+T  RK                  ++ T +     S E+   ++    +G  GK+ +P T K +V   TKK    ++KK       S K
Subjt:  TRASDRLKAAGVTPGRKPRKQTSPITLGSEQDSGDAMSTSVSVAKGSGEKTKGVKRDRGDGGSGKKVTP-TKKTKVHEWTKKTNDEIEKKP--TGARSNK

Query:  KKTRAKQTNDTDKASPVTPEIAL-ETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREK-KKTTIAKKEPPKKQKGGKKGKKLKTMVEGDTVRVDDDYLM
        +  R K      K   V  +IA+ E S  +      DT   S            + REK KK    +KE   ++   +KGK      + +    D  YLM
Subjt:  KKTRAKQTNDTDKASPVTPEIAL-ETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREK-KKTTIAKKEPPKKQKGGKKGKKLKTMVEGDTVRVDDDYLM

Query:  SPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLP
           +R++ LKINL  ++ +++ I   LGDR  + FR   FGH L+ S    SSQLLLHLIQ  CKPK TS+L F IGG++L+FGLREFALITGL C  +P
Subjt:  SPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLP

Query:  QIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKA
         I+ + +    R K  YF N + V R+ LN++FN    G + D +KMA+LY LESFL+P+QE   ++ +H++MV+D E+F  YPWGR AF LL  +M++ 
Subjt:  QIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKA

Query:  SVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRR
          S+G  GI MGG ++ ILAWAYEVIP LS PP  +  RI N VPRIIN   + QP+W++L  K+F S +LEV P+  T  E+ M +F PF++ E    +
Subjt:  SVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRR

Query:  LAGDEQQVGDDVRIPPNFSV--GAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNSKFKMQGDVGTGIDPTTK----------DG
         A DE +   +     + S+  G PS TS+++V+ K  ++I     R+ + +  L++ L+ +   +   N++F+       GI  TTK          + 
Subjt:  LAGDEQQVGDDVRIPPNFSV--GAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNSKFKMQGDVGTGIDPTTK----------DG

Query:  DVEEKEENDEKDDQDDHELE---KNPSHRREDDDGGPTGG--KQQQGPTT----PGPTTLVQTETRVDGEGTGDGGTKKTGGGEGT--KACDDADETINK
         +++ EE+ E+DD +D  L+   +    +R+DD+     G   + QG ++     G + + ++ET             K G  E +  KA ++ DE IN+
Subjt:  DVEEKEENDEKDDQDDHELE---KNPSHRREDDDGGPTGG--KQQQGPTT----PGPTTLVQTETRVDGEGTGDGGTKKTGGGEGT--KACDDADETINK

Query:  AILSIDEA----KVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLG-----PRWS--------------------------AKVNGTITRGGAKRSDCVE--
         I SIDE+    ++ +K  R R G+  +   P  + R   P  G     P+ +                            +N T    G KRS+  E  
Subjt:  AILSIDEA----KVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLG-----PRWS--------------------------AKVNGTITRGGAKRSDCVE--

Query:  EVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQ
        EV  + +TGI++DA+RG   +  ++        PSFDLHLSQ
Subjt:  EVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQ

A0A5D3CEX9 Protein Ycf2-like4.3e-7035.03Show/hide
Query:  CKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEK
        CKPK TS+L F IG ++L+FGLREFALITGL C  +P I+ + ++   R K  YF N + V R+ LN++FN    G + D +KMA+LY LESFL+P+QE 
Subjt:  CKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEK

Query:  VHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSVGIGM-GGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELH
        + ++ +H++MV+D E+F  YPWGR AF LL  +M++A  S+G  GI M GG ++ ILAWAYEVIP LS PP  +A RI N VPRIINW  + QP+W++L 
Subjt:  VHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSVGIGM-GGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELH

Query:  VKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRRLAGDEQQVGDDVRIPPNFSV--GAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREI
         K+F S +LEV P+  T  E+ M +F PF + E    + A DE +   +     + S+  G PS TS+++V+ K  ++I     R+ + +  L++ L+ +
Subjt:  VKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRRLAGDEQQVGDDVRIPPNFSV--GAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREI

Query:  HKLDDPPNSKFKMQGDVGTGIDPTTKDGDVEEKEENDEKDDQDDHELE-KNPSHRREDDDGGPTGGKQQQGPTTPGPTTLVQTETRVDGEGTGDGGTKKT
           +   N++F+  G           +  +++ EE+ E+DD +D  L+  N +   + DD     GK+            +  E++ +     DGG  K 
Subjt:  HKLDDPPNSKFKMQGDVGTGIDPTTKDGDVEEKEENDEKDDQDDHELE-KNPSHRREDDDGGPTGGKQQQGPTTPGPTTLVQTETRVDGEGTGDGGTKKT

Query:  GGGEGT-----------KACDDADETINKAILSIDEA----KVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLG-----PRWS------------------
           E             KA ++ DE IN+ I SIDE+    ++ +K  R R G+  +   P  + R   P  G     P+ +                  
Subjt:  GGGEGT-----------KACDDADETINKAILSIDEA----KVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLG-----PRWS------------------

Query:  --------AKVNGTITRGGAKRSDCVE--EVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQ
                  +N T    G KRS+  E  EV  + +TGI++DA+RG   +  ++        PSFDLHLSQ
Subjt:  --------AKVNGTITRGGAKRSDCVE--EVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G31150.1 Domain of unknown function (DUF1985)1.3e-1328.11Show/hide
Query:  KINLCCRTEIMDTI-NTILGDRCRDAFRNTCFGHLLDFSFKKTS-SQLLLH-LIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQIDTDR
        ++N+  R E + TI N + G    +  +++ FG L +F   + S S  L+H L+  Q   K+  EL+F  GG  ++F +REF ++TGL CG LP  D  +
Subjt:  KINLCCRTEIMDTI-NTILGDRCRDAFRNTCFGHLLDFSFKKTS-SQLLLH-LIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQIDTDR

Query:  LQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMA-QLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAF
            S++   +       R  T+  V   ++    +   K+   L  +   ++   ++  +  + V M+ D + F  YPWGR AF
Subjt:  LQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMA-QLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAF

AT1G36970.1 Domain of unknown function (DUF1985)1.4e-0731.39Show/hide
Query:  VDDDYLMSP----SKRSKALKINLCCRTEIMDTINTIL-GDRCRDAFRNTCFGHLLDFSFKKTS-SQLLLH-LIQHQCKPKRTSELYFKIGGKILKFGLR
        +D+D  + P    + R    ++N+  R +I+  I  +L G +  +   ++CFG L      + S S  L+H L+  Q   K+  EL    GG+ L+F L 
Subjt:  VDDDYLMSP----SKRSKALKINLCCRTEIMDTINTIL-GDRCRDAFRNTCFGHLLDFSFKKTS-SQLLLH-LIQHQCKPKRTSELYFKIGGKILKFGLR

Query:  EFALITGLNCGPLP-QIDTDRLQDSSRFKDEYFANDE
        EF  +TGL CG  P + D D      + K E F +DE
Subjt:  EFALITGLNCGPLP-QIDTDRLQDSSRFKDEYFANDE

AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases4.3e-0630Show/hide
Query:  VKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSR-GSVGIGMGGLVYAILAWAYEVIPALSAPP
        ++ A L  ++ FLLP      I ++H  M ED + F +YPWGR +F ++ + + +  V +  +  + + GL+YA+     E +PA+   P
Subjt:  VKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSR-GSVGIGMGGLVYAILAWAYEVIPALSAPP

AT4G08430.1 Ulp1 protease family protein7.4e-0626.74Show/hide
Query:  ELYFKIGGKILKFGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVH-----
        E++  I  + ++F L EF  ITGLNC    + DT      + +KD  F N+ GV   ++  +F  ++   E       +   +   L  +   VH     
Subjt:  ELYFKIGGKILKFGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVH-----

Query:  --IEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINW
          +       V D   F  YPWGR AF  L   +        S  I   G V A+L W YE +P +      + K     VP +++W
Subjt:  --IEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINW

AT5G45570.1 Ulp1 protease family protein5.1e-0725.11Show/hide
Query:  CRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKK--TSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQIDTDRLQDSSR
        C    +  I   LG    D  + T  G  + F+      ++Q +   + +Q +     E++  I  + ++F L EF  ITGLNC    + DT      + 
Subjt:  CRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKK--TSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQIDTDRLQDSSR

Query:  FKDEYFANDEGVRRKT------LNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGS
        +KD  F N+ GV          L  VF   K       + + +L  L   +        +       V D   F  YPWGR AF  L+  +        S
Subjt:  FKDEYFANDEGVRRKT------LNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGS

Query:  VGIGMGGLVYAILAWAYEVIPAL
          I   G V  +L W YE +P +
Subjt:  VGIGMGGLVYAILAWAYEVIPAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCAAGGATAAGAACCACCCAACTAGAGCGAGTGACCGTTTGAAGGCTGCAGGAGTAACCCCAGGAAGAAAACCCCGTAAACAAACATCCCCAATCACATTGGGGAG
CGAACAGGATTCTGGAGACGCCATGAGTACATCAGTTTCAGTCGCTAAGGGATCTGGCGAGAAGACGAAAGGGGTAAAAAGGGACAGAGGCGACGGAGGTTCGGGCAAAA
AAGTAACTCCAACAAAGAAAACAAAAGTTCACGAATGGACCAAGAAGACCAACGATGAGATTGAGAAGAAACCCACTGGGGCACGAAGCAATAAGAAGAAAACGCGAGCG
AAACAGACAAATGACACAGATAAGGCGAGCCCTGTGACACCAGAGATTGCCCTTGAAACAAGCGAGGACACAGCTAAAAATGACACAGAAGACACCGAATCTAATAGTGT
GACGAATGACAACTCCTCGAGTGATGACGTAGGGGAAGAACGAGAGAAAAAGAAGACAACAATTGCTAAAAAGGAACCTCCTAAAAAACAGAAGGGTGGAAAAAAAGGAA
AAAAGCTGAAGACCATGGTTGAAGGTGACACTGTCCGAGTGGACGACGATTACCTTATGTCGCCATCAAAAAGAAGTAAGGCCCTAAAGATTAACCTATGTTGCAGAACA
GAAATAATGGACACCATCAACACCATCTTAGGAGATAGGTGTAGAGACGCTTTCAGAAACACGTGCTTTGGCCACCTGCTTGACTTCTCGTTCAAAAAGACGTCTTCCCA
GTTACTATTGCACCTGATCCAGCATCAATGCAAACCCAAACGGACGTCGGAACTTTACTTCAAGATTGGGGGGAAAATCTTAAAGTTTGGTCTACGGGAGTTCGCACTAA
TTACGGGACTAAATTGTGGCCCATTGCCACAAATTGACACAGACAGGCTACAAGATTCATCCAGGTTCAAGGATGAGTATTTTGCCAACGACGAAGGTGTCAGAAGAAAG
ACACTTAATATAGTATTCAACGCAATGAAGCATGGTGTCGAGGCAGACCTCGTAAAGATGGCGCAGTTGTATTGTTTGGAGAGCTTTTTGTTACCTAGGCAAGAAAAGGT
GCACATTGAAGAGGAACATGTCCTAATGGTTGAAGACCAAGAATTGTTCACCACCTACCCTTGGGGGCGCGCCGCCTTCACACTATTGACAAGCTACATGCATAAAGCAT
CCGTTAGTAGGGGCAGTGTTGGTATTGGAATGGGCGGATTAGTGTATGCCATCCTTGCATGGGCATACGAAGTGATACCTGCATTGAGCGCGCCACCGACCAACTACGCA
AAACGGATCAGAAATACAGTCCCCCGCATCATAAATTGGGAGGTAGAAGCTCAACCCGAATGGAGAGAACTGCACGTCAAGATATTCCAATCCTCATCGTTGGAGGTTGT
ACCATTGGACCCAACTGACACGGAAATGCAGATGTCGTACTTCCAACCTTTCTTGCAAGATGAGTTGGCTTCTCGGCGATTGGCCGGCGACGAACAACAAGTAGGCGACG
ATGTTCGAATCCCGCCGAACTTCTCAGTAGGGGCACCCTCAATGACCAGCCAGATGGATGTGATGGAAAAACGCCATCAAGAAATAATTGGAAAGCTTGACAGAGTTTAC
TCTATGCTAGGAGCCTTGGTGGACACTTTGAGGGAGATACACAAGCTTGACGACCCCCCAAACTCAAAATTCAAGATGCAAGGAGATGTTGGGACTGGTATTGACCCTAC
AACAAAAGACGGTGATGTGGAGGAAAAAGAAGAAAATGATGAAAAAGATGATCAAGATGACCACGAATTAGAGAAAAATCCTTCTCATCGAAGGGAAGACGACGATGGAG
GACCAACAGGTGGGAAACAGCAACAGGGCCCGACCACCCCCGGACCGACAACCCTTGTACAGACTGAAACTCGTGTAGATGGCGAAGGCACGGGAGATGGCGGGACAAAG
AAAACAGGAGGTGGTGAAGGCACCAAGGCCTGTGATGATGCCGACGAGACAATAAACAAGGCTATACTGTCAATAGATGAGGCCAAGGTGATTGAGAAGTTTAATAGGGA
CCGCAAGGGTAAAGCGGTTATGGTGGAAGGACCTCATACCATACCAAGAACCACAGTTCCACAACTTGGCCCCCGATGGTCTGCAAAGGTTAACGGTACCATAACCCGGG
GGGGAGCCAAGCGCTCAGATTGTGTGGAAGAAGTGGGGAGCCTGCAAGCCACAGGAATTTATGTGGACGCGATGAGGGGCACGTGGACAAAAGAATCGAGGGAATCCCTA
CCGCCAGAATTCTTCCAGCCGTCTTTTGATCTTCATCTCAGTCAGGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCCAAGGATAAGAACCACCCAACTAGAGCGAGTGACCGTTTGAAGGCTGCAGGAGTAACCCCAGGAAGAAAACCCCGTAAACAAACATCCCCAATCACATTGGGGAG
CGAACAGGATTCTGGAGACGCCATGAGTACATCAGTTTCAGTCGCTAAGGGATCTGGCGAGAAGACGAAAGGGGTAAAAAGGGACAGAGGCGACGGAGGTTCGGGCAAAA
AAGTAACTCCAACAAAGAAAACAAAAGTTCACGAATGGACCAAGAAGACCAACGATGAGATTGAGAAGAAACCCACTGGGGCACGAAGCAATAAGAAGAAAACGCGAGCG
AAACAGACAAATGACACAGATAAGGCGAGCCCTGTGACACCAGAGATTGCCCTTGAAACAAGCGAGGACACAGCTAAAAATGACACAGAAGACACCGAATCTAATAGTGT
GACGAATGACAACTCCTCGAGTGATGACGTAGGGGAAGAACGAGAGAAAAAGAAGACAACAATTGCTAAAAAGGAACCTCCTAAAAAACAGAAGGGTGGAAAAAAAGGAA
AAAAGCTGAAGACCATGGTTGAAGGTGACACTGTCCGAGTGGACGACGATTACCTTATGTCGCCATCAAAAAGAAGTAAGGCCCTAAAGATTAACCTATGTTGCAGAACA
GAAATAATGGACACCATCAACACCATCTTAGGAGATAGGTGTAGAGACGCTTTCAGAAACACGTGCTTTGGCCACCTGCTTGACTTCTCGTTCAAAAAGACGTCTTCCCA
GTTACTATTGCACCTGATCCAGCATCAATGCAAACCCAAACGGACGTCGGAACTTTACTTCAAGATTGGGGGGAAAATCTTAAAGTTTGGTCTACGGGAGTTCGCACTAA
TTACGGGACTAAATTGTGGCCCATTGCCACAAATTGACACAGACAGGCTACAAGATTCATCCAGGTTCAAGGATGAGTATTTTGCCAACGACGAAGGTGTCAGAAGAAAG
ACACTTAATATAGTATTCAACGCAATGAAGCATGGTGTCGAGGCAGACCTCGTAAAGATGGCGCAGTTGTATTGTTTGGAGAGCTTTTTGTTACCTAGGCAAGAAAAGGT
GCACATTGAAGAGGAACATGTCCTAATGGTTGAAGACCAAGAATTGTTCACCACCTACCCTTGGGGGCGCGCCGCCTTCACACTATTGACAAGCTACATGCATAAAGCAT
CCGTTAGTAGGGGCAGTGTTGGTATTGGAATGGGCGGATTAGTGTATGCCATCCTTGCATGGGCATACGAAGTGATACCTGCATTGAGCGCGCCACCGACCAACTACGCA
AAACGGATCAGAAATACAGTCCCCCGCATCATAAATTGGGAGGTAGAAGCTCAACCCGAATGGAGAGAACTGCACGTCAAGATATTCCAATCCTCATCGTTGGAGGTTGT
ACCATTGGACCCAACTGACACGGAAATGCAGATGTCGTACTTCCAACCTTTCTTGCAAGATGAGTTGGCTTCTCGGCGATTGGCCGGCGACGAACAACAAGTAGGCGACG
ATGTTCGAATCCCGCCGAACTTCTCAGTAGGGGCACCCTCAATGACCAGCCAGATGGATGTGATGGAAAAACGCCATCAAGAAATAATTGGAAAGCTTGACAGAGTTTAC
TCTATGCTAGGAGCCTTGGTGGACACTTTGAGGGAGATACACAAGCTTGACGACCCCCCAAACTCAAAATTCAAGATGCAAGGAGATGTTGGGACTGGTATTGACCCTAC
AACAAAAGACGGTGATGTGGAGGAAAAAGAAGAAAATGATGAAAAAGATGATCAAGATGACCACGAATTAGAGAAAAATCCTTCTCATCGAAGGGAAGACGACGATGGAG
GACCAACAGGTGGGAAACAGCAACAGGGCCCGACCACCCCCGGACCGACAACCCTTGTACAGACTGAAACTCGTGTAGATGGCGAAGGCACGGGAGATGGCGGGACAAAG
AAAACAGGAGGTGGTGAAGGCACCAAGGCCTGTGATGATGCCGACGAGACAATAAACAAGGCTATACTGTCAATAGATGAGGCCAAGGTGATTGAGAAGTTTAATAGGGA
CCGCAAGGGTAAAGCGGTTATGGTGGAAGGACCTCATACCATACCAAGAACCACAGTTCCACAACTTGGCCCCCGATGGTCTGCAAAGGTTAACGGTACCATAACCCGGG
GGGGAGCCAAGCGCTCAGATTGTGTGGAAGAAGTGGGGAGCCTGCAAGCCACAGGAATTTATGTGGACGCGATGAGGGGCACGTGGACAAAAGAATCGAGGGAATCCCTA
CCGCCAGAATTCTTCCAGCCGTCTTTTGATCTTCATCTCAGTCAGGGTTAA
Protein sequenceShow/hide protein sequence
MPKDKNHPTRASDRLKAAGVTPGRKPRKQTSPITLGSEQDSGDAMSTSVSVAKGSGEKTKGVKRDRGDGGSGKKVTPTKKTKVHEWTKKTNDEIEKKPTGARSNKKKTRA
KQTNDTDKASPVTPEIALETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEPPKKQKGGKKGKKLKTMVEGDTVRVDDDYLMSPSKRSKALKINLCCRT
EIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRK
TLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYA
KRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRRLAGDEQQVGDDVRIPPNFSVGAPSMTSQMDVMEKRHQEIIGKLDRVY
SMLGALVDTLREIHKLDDPPNSKFKMQGDVGTGIDPTTKDGDVEEKEENDEKDDQDDHELEKNPSHRREDDDGGPTGGKQQQGPTTPGPTTLVQTETRVDGEGTGDGGTK
KTGGGEGTKACDDADETINKAILSIDEAKVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLGPRWSAKVNGTITRGGAKRSDCVEEVGSLQATGIYVDAMRGTWTKESRESL
PPEFFQPSFDLHLSQG