; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg011708 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg011708
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein Ycf2-like
Genome locationscaffold1:4141217..4144887
RNA-Seq ExpressionSpg011708
SyntenySpg011708
Gene Ontology termsNA
InterPro domainsIPR015410 - Domain of unknown function DUF1985


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047596.1 protein Ycf2-like [Cucumis melo var. makuwa]1.2e-8433.37Show/hide
Query:  TRASDRLKAAGVTPGRKPRKQTSPITLGSEQDSGDAMSTSVSVAKGSGEKTKGVKRDRGDGGSGKKVTP-TKKTKVHERTKKTNDEIEKKP--TGARSNK
        TR SDRL+AAG+T  RK                  ++ T +     S E+   ++    +G  GK+ +P T K +V   TKK    ++KK       S K
Subjt:  TRASDRLKAAGVTPGRKPRKQTSPITLGSEQDSGDAMSTSVSVAKGSGEKTKGVKRDRGDGGSGKKVTP-TKKTKVHERTKKTNDEIEKKP--TGARSNK

Query:  KKTRAKQTNDTDKASPVTPEIAL-ETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREK-KKTTIAKKEPPKKQKGGKKGKKLKTMVEGDTVRVDDDYLM
        +  R K      K   V  +IA+ E S  +      DT   S            + REK KK    +KE   ++   +KGK      + +    D  YLM
Subjt:  KKTRAKQTNDTDKASPVTPEIAL-ETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREK-KKTTIAKKEPPKKQKGGKKGKKLKTMVEGDTVRVDDDYLM

Query:  SPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLP
           +R++ LKINL  ++ +++ I   LGDR  + FR   FGH L+ S    SSQLLLHLIQ  CKPK TS+L F IGG++L+FGLREFALITGL C  +P
Subjt:  SPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLP

Query:  QINTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKA
         IN + +    R K  YF N + V R+ LN++FN    G + D +KMA+LY LESFL+P+QE   ++ +H++MV+D E+F  YPWGR AF LL  +M++ 
Subjt:  QINTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKA

Query:  SVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSS--------------MSYFQPFLQDELASRR
          S+G  GI MGG ++ ILAWAYEVIP LS PP  +  RI N VPRIIN   + QP+W++L  K+F S +              M +F PF++ E    +
Subjt:  SVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSS--------------MSYFQPFLQDELASRR

Query:  LAGDDVRIPPNF--------SVGAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNSKFKMQGDVGTGIDPTTK----------DG
         A D++R   NF        + G PS TS+++V+ K  ++I     R+ + +  L++ L+ +   +   N++F+       GI  TTK          + 
Subjt:  LAGDDVRIPPNF--------SVGAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNSKFKMQGDVGTGIDPTTK----------DG

Query:  DVEEKEENDEKDDQDDHELE---KNPSHRREDDDGGPTGG--KQQQGPTT----PGPTTLVQTETRVDGEGTGDGGTKKTGGGEGT--KACDDADETINK
         +++ EE+ E+DD +D  L+   +    +R+DD+     G   + QG ++     G + + ++ET             K G  E +  KA ++ DE IN+
Subjt:  DVEEKEENDEKDDQDDHELE---KNPSHRREDDDGGPTGG--KQQQGPTT----PGPTTLVQTETRVDGEGTGDGGTKKTGGGEGT--KACDDADETINK

Query:  AILSIDEA----KVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLG-----PRWS--------------------------AKVNGTITRGGAKRSDCVE--
         I SIDE+    ++ +K  R R G+  +   P  + R   P  G     P+ +                            +N T    G KRS+  E  
Subjt:  AILSIDEA----KVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLG-----PRWS--------------------------AKVNGTITRGGAKRSDCVE--

Query:  EVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQ
        EV  + +TGI++DA+RG   +  ++        PSFDLHLSQ
Subjt:  EVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQ

KAA0051382.1 protein Ycf2-like [Cucumis melo var. makuwa]4.2e-5631.41Show/hide
Query:  CKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQINTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEK
        CKPK TS+L F IGG++L+FGLREFALITGL C  +P IN D ++   R K  YF N + V R+ LN++FN      + D +KMA+LY LESFL+P+QE 
Subjt:  CKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQINTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEK

Query:  VHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHV
        + ++ +H++MV+D E+F  YPWGR AF LL  +M++   S+G  GI + G ++ ILAWAYEV P LS P   +A RI N VPRIINW  + QP+W++L  
Subjt:  VHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHV

Query:  KIFQSSS--------------MSYFQPFLQDELASRRLAGDDVRIPPNFSVGAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNS
        K+F S +              M +F PF++ E    + A D++R   N S    S++    +  +    I+ K+      +  ++  +R           
Subjt:  KIFQSSS--------------MSYFQPFLQDELASRRLAGDDVRIPPNFSVGAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNS

Query:  KFKMQGDVGTGIDPTTKDGDVEEKEENDEKDDQDDHELE---KNPSHRREDDDGGPTGGKQQQGPTTPGPTTLVQTETRVDGEGTGDGGTKK--------
          +  G           +  +++ EE+ E+DD +D  L+   +    +R+DD+      K  +G         +  E+R +     DGG  K        
Subjt:  KFKMQGDVGTGIDPTTKDGDVEEKEENDEKDDQDDHELE---KNPSHRREDDDGGPTGGKQQQGPTTPGPTTLVQTETRVDGEGTGDGGTKK--------

Query:  TGGGEGT---KACDDADETINKAILSIDEAKVIEKFNRDRKGK--------------AVMVEGPHTIPRTTVPQLGP-----------------------
        T G + +   KA ++  E IN+ I  IDE+ + +K  +  +G+                 +     + R   P  G                        
Subjt:  TGGGEGT---KACDDADETINKAILSIDEAKVIEKFNRDRKGK--------------AVMVEGPHTIPRTTVPQLGP-----------------------

Query:  ---RWSAKV-----NGTITRGGAKRSDCVE--EVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLS
           R  A+V     N T    G KRS+  E  EV  + +TGI++DA+R       ++        PSFDLHLS
Subjt:  ---RWSAKV-----NGTITRGGAKRSDCVE--EVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLS

TYK09852.1 protein Ycf2-like [Cucumis melo var. makuwa]4.5e-6634.33Show/hide
Query:  CKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQINTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEK
        CKPK TS+L F IG ++L+FGLREFALITGL C  +P IN + ++   R K  YF N + V R+ LN++FN    G + D +KMA+LY LESFL+P+QE 
Subjt:  CKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQINTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEK

Query:  VHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSVGIGM-GGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELH
        + ++ +H++MV+D E+F  YPWGR AF LL  +M++A  S+G  GI M GG ++ ILAWAYEVIP LS PP  +A RI N VPRIINW  + QP+W++L 
Subjt:  VHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSVGIGM-GGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELH

Query:  VKIFQSSS--------------MSYFQPFLQDELASRRLAGDDVRIPPNF--------SVGAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREI
         K+F S +              M +F PF + E    + A D++R   NF        + G PS TS+++V+ K  ++I     R+ + +  L++ L+ +
Subjt:  VKIFQSSS--------------MSYFQPFLQDELASRRLAGDDVRIPPNF--------SVGAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREI

Query:  HKLDDPPNSKFKMQGDVGTGIDPTTKDGDVEEKEENDEKDDQDDHELE-KNPSHRREDDDGGPTGGKQQQGPTTPGPTTLVQTETRVDGEGTGDGGTKKT
           +   N++F+  G           +  +++ EE+ E+DD +D  L+  N +   + DD     GK+            +  E++ +     DGG  K 
Subjt:  HKLDDPPNSKFKMQGDVGTGIDPTTKDGDVEEKEENDEKDDQDDHELE-KNPSHRREDDDGGPTGGKQQQGPTTPGPTTLVQTETRVDGEGTGDGGTKKT

Query:  GGGEGT-----------KACDDADETINKAILSIDEA----KVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLG-----PRWS------------------
           E             KA ++ DE IN+ I SIDE+    ++ +K  R R G+  +   P  + R   P  G     P+ +                  
Subjt:  GGGEGT-----------KACDDADETINKAILSIDEA----KVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLG-----PRWS------------------

Query:  --------AKVNGTITRGGAKRSDCVE--EVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQ
                  +N T    G KRS+  E  EV  + +TGI++DA+RG   +  ++        PSFDLHLSQ
Subjt:  --------AKVNGTITRGGAKRSDCVE--EVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQ

XP_024031030.1 uncharacterized protein LOC21394043 [Morus notabilis]1.2e-5533.26Show/hide
Query:  VRVDDDYLMSPSKR-SKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFAL
        V ++   L+ P K+ +   KINL  + +++D +N  L  R ++ FR  CFGHLLDF  KK  SQL+ HLI  QC   + +EL+F I G I+KFG++EFAL
Subjt:  VRVDDDYLMSPSKR-SKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFAL

Query:  ITGLNCGPLPQINTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAF
        ITGLNC   P I   +L +S+  K ++F   + V+R  LN VF A + G + D+VK+A+LYCLES L+P++ + +I+  H+ MV++ ELF  YPWGR ++
Subjt:  ITGLNCGPLPQINTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAF

Query:  TLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQS----------SSMSYFQPFLQD
         +  +Y+ ++  S+ +   G+GG  YA++ WAYE IP L     N AKRI N +PRIINWE + QP +RE+  ++F S          S     QPF+  
Subjt:  TLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQS----------SSMSYFQPFLQD

Query:  ELASRRLAG------------DDVRIPP------------NFSVG---APSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLRE-----IHKLDDPPN
            ++               +DV + P            N   G     S+ S+++ MEK   E+   ++ +Y+ML  +  T+ +     + K  +   
Subjt:  ELASRRLAG------------DDVRIPP------------NFSVG---APSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLRE-----IHKLDDPPN

Query:  SKFKMQGDVGTGIDPTTKDGDVEEKEENDEKDDQDDHELEKNPSHRREDDDGGPTGGKQQ
         + +   D     D   K G+ +E++++ EK D+ + ++ ++ + + E       GG+++
Subjt:  SKFKMQGDVGTGIDPTTKDGDVEEKEENDEKDDQDDHELEKNPSHRREDDDGGPTGGKQQ

XP_031743195.1 uncharacterized protein LOC101221625 isoform X8 [Cucumis sativus]9.3e-5636.12Show/hide
Query:  DKASPVTPEIALETSEDTAKN---DTEDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEPPKKQKGGKKGKKLKTMVEGDTVRVDDD---YLMSPSKRSK
        +K SP + E + +  +  + +     +  E++ +  D  S   V   + KK    +K E  K++K GK+GK  K+ + G +   DD+    L+  S  + 
Subjt:  DKASPVTPEIALETSEDTAKN---DTEDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEPPKKQKGGKKGKKLKTMVEGDTVRVDDD---YLMSPSKRSK

Query:  ALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQINTDRL
          +INL  + +++  I   L +R    F+ +CFG+ LD    K SSQL  HLI+ QC  K  +EL+F + G+I KFG+++FALITGLNCG LP I+  ++
Subjt:  ALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQINTDRL

Query:  QDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSV
        Q   +F   YF  ++ +RR  L+ VF  M  G   D+VKMA+LY LE F+L +Q +  I  E+ L+++D++ F +YPWGR ++ +   ++ K+  S  + 
Subjt:  QDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSV

Query:  GIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSMSYFQPFL
         IG+GG  YA+L WAYE IP L+      A RI    PR+ NW     PEW++L  K+FQS +    QP +
Subjt:  GIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSMSYFQPFL

TrEMBL top hitse value%identityAlignment
A0A5A7U047 Protein Ycf2-like6.1e-8533.37Show/hide
Query:  TRASDRLKAAGVTPGRKPRKQTSPITLGSEQDSGDAMSTSVSVAKGSGEKTKGVKRDRGDGGSGKKVTP-TKKTKVHERTKKTNDEIEKKP--TGARSNK
        TR SDRL+AAG+T  RK                  ++ T +     S E+   ++    +G  GK+ +P T K +V   TKK    ++KK       S K
Subjt:  TRASDRLKAAGVTPGRKPRKQTSPITLGSEQDSGDAMSTSVSVAKGSGEKTKGVKRDRGDGGSGKKVTP-TKKTKVHERTKKTNDEIEKKP--TGARSNK

Query:  KKTRAKQTNDTDKASPVTPEIAL-ETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREK-KKTTIAKKEPPKKQKGGKKGKKLKTMVEGDTVRVDDDYLM
        +  R K      K   V  +IA+ E S  +      DT   S            + REK KK    +KE   ++   +KGK      + +    D  YLM
Subjt:  KKTRAKQTNDTDKASPVTPEIAL-ETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREK-KKTTIAKKEPPKKQKGGKKGKKLKTMVEGDTVRVDDDYLM

Query:  SPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLP
           +R++ LKINL  ++ +++ I   LGDR  + FR   FGH L+ S    SSQLLLHLIQ  CKPK TS+L F IGG++L+FGLREFALITGL C  +P
Subjt:  SPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLP

Query:  QINTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKA
         IN + +    R K  YF N + V R+ LN++FN    G + D +KMA+LY LESFL+P+QE   ++ +H++MV+D E+F  YPWGR AF LL  +M++ 
Subjt:  QINTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKA

Query:  SVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSS--------------MSYFQPFLQDELASRR
          S+G  GI MGG ++ ILAWAYEVIP LS PP  +  RI N VPRIIN   + QP+W++L  K+F S +              M +F PF++ E    +
Subjt:  SVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSS--------------MSYFQPFLQDELASRR

Query:  LAGDDVRIPPNF--------SVGAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNSKFKMQGDVGTGIDPTTK----------DG
         A D++R   NF        + G PS TS+++V+ K  ++I     R+ + +  L++ L+ +   +   N++F+       GI  TTK          + 
Subjt:  LAGDDVRIPPNF--------SVGAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNSKFKMQGDVGTGIDPTTK----------DG

Query:  DVEEKEENDEKDDQDDHELE---KNPSHRREDDDGGPTGG--KQQQGPTT----PGPTTLVQTETRVDGEGTGDGGTKKTGGGEGT--KACDDADETINK
         +++ EE+ E+DD +D  L+   +    +R+DD+     G   + QG ++     G + + ++ET             K G  E +  KA ++ DE IN+
Subjt:  DVEEKEENDEKDDQDDHELE---KNPSHRREDDDGGPTGG--KQQQGPTT----PGPTTLVQTETRVDGEGTGDGGTKKTGGGEGT--KACDDADETINK

Query:  AILSIDEA----KVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLG-----PRWS--------------------------AKVNGTITRGGAKRSDCVE--
         I SIDE+    ++ +K  R R G+  +   P  + R   P  G     P+ +                            +N T    G KRS+  E  
Subjt:  AILSIDEA----KVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLG-----PRWS--------------------------AKVNGTITRGGAKRSDCVE--

Query:  EVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQ
        EV  + +TGI++DA+RG   +  ++        PSFDLHLSQ
Subjt:  EVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQ

A0A5A7U6E1 Protein Ycf2-like2.0e-5631.41Show/hide
Query:  CKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQINTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEK
        CKPK TS+L F IGG++L+FGLREFALITGL C  +P IN D ++   R K  YF N + V R+ LN++FN      + D +KMA+LY LESFL+P+QE 
Subjt:  CKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQINTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEK

Query:  VHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHV
        + ++ +H++MV+D E+F  YPWGR AF LL  +M++   S+G  GI + G ++ ILAWAYEV P LS P   +A RI N VPRIINW  + QP+W++L  
Subjt:  VHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHV

Query:  KIFQSSS--------------MSYFQPFLQDELASRRLAGDDVRIPPNFSVGAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNS
        K+F S +              M +F PF++ E    + A D++R   N S    S++    +  +    I+ K+      +  ++  +R           
Subjt:  KIFQSSS--------------MSYFQPFLQDELASRRLAGDDVRIPPNFSVGAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNS

Query:  KFKMQGDVGTGIDPTTKDGDVEEKEENDEKDDQDDHELE---KNPSHRREDDDGGPTGGKQQQGPTTPGPTTLVQTETRVDGEGTGDGGTKK--------
          +  G           +  +++ EE+ E+DD +D  L+   +    +R+DD+      K  +G         +  E+R +     DGG  K        
Subjt:  KFKMQGDVGTGIDPTTKDGDVEEKEENDEKDDQDDHELE---KNPSHRREDDDGGPTGGKQQQGPTTPGPTTLVQTETRVDGEGTGDGGTKK--------

Query:  TGGGEGT---KACDDADETINKAILSIDEAKVIEKFNRDRKGK--------------AVMVEGPHTIPRTTVPQLGP-----------------------
        T G + +   KA ++  E IN+ I  IDE+ + +K  +  +G+                 +     + R   P  G                        
Subjt:  TGGGEGT---KACDDADETINKAILSIDEAKVIEKFNRDRKGK--------------AVMVEGPHTIPRTTVPQLGP-----------------------

Query:  ---RWSAKV-----NGTITRGGAKRSDCVE--EVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLS
           R  A+V     N T    G KRS+  E  EV  + +TGI++DA+R       ++        PSFDLHLS
Subjt:  ---RWSAKV-----NGTITRGGAKRSDCVE--EVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLS

A0A5D3CEX9 Protein Ycf2-like2.2e-6634.33Show/hide
Query:  CKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQINTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEK
        CKPK TS+L F IG ++L+FGLREFALITGL C  +P IN + ++   R K  YF N + V R+ LN++FN    G + D +KMA+LY LESFL+P+QE 
Subjt:  CKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQINTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEK

Query:  VHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSVGIGM-GGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELH
        + ++ +H++MV+D E+F  YPWGR AF LL  +M++A  S+G  GI M GG ++ ILAWAYEVIP LS PP  +A RI N VPRIINW  + QP+W++L 
Subjt:  VHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSVGIGM-GGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELH

Query:  VKIFQSSS--------------MSYFQPFLQDELASRRLAGDDVRIPPNF--------SVGAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREI
         K+F S +              M +F PF + E    + A D++R   NF        + G PS TS+++V+ K  ++I     R+ + +  L++ L+ +
Subjt:  VKIFQSSS--------------MSYFQPFLQDELASRRLAGDDVRIPPNF--------SVGAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREI

Query:  HKLDDPPNSKFKMQGDVGTGIDPTTKDGDVEEKEENDEKDDQDDHELE-KNPSHRREDDDGGPTGGKQQQGPTTPGPTTLVQTETRVDGEGTGDGGTKKT
           +   N++F+  G           +  +++ EE+ E+DD +D  L+  N +   + DD     GK+            +  E++ +     DGG  K 
Subjt:  HKLDDPPNSKFKMQGDVGTGIDPTTKDGDVEEKEENDEKDDQDDHELE-KNPSHRREDDDGGPTGGKQQQGPTTPGPTTLVQTETRVDGEGTGDGGTKKT

Query:  GGGEGT-----------KACDDADETINKAILSIDEA----KVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLG-----PRWS------------------
           E             KA ++ DE IN+ I SIDE+    ++ +K  R R G+  +   P  + R   P  G     P+ +                  
Subjt:  GGGEGT-----------KACDDADETINKAILSIDEA----KVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLG-----PRWS------------------

Query:  --------AKVNGTITRGGAKRSDCVE--EVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQ
                  +N T    G KRS+  E  EV  + +TGI++DA+RG   +  ++        PSFDLHLSQ
Subjt:  --------AKVNGTITRGGAKRSDCVE--EVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQ

A0A5D3CNI7 TF-B3 domain-containing protein1.7e-5535.31Show/hide
Query:  EKKPTGARSNKKKTRAKQTNDTDKASPVTPEIALETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEP---PKKQKGGKKGKK----LK
        ++ P   ++ +KK + K  +              E +E+T++ DT+      V           E R+KKK     K+     K++K GKKG K      
Subjt:  EKKPTGARSNKKKTRAKQTNDTDKASPVTPEIALETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEP---PKKQKGGKKGKK----LK

Query:  TMVEGDTVRVDDDY--LMSPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILK
             D V V  +Y  L+  S  S   +INL  + +++  I   L +R    F+ +CFG+ LD    K SSQL  HLI+ QC  K   EL+F + G+I K
Subjt:  TMVEGDTVRVDDDY--LMSPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILK

Query:  FGLREFALITGLNCGPLPQINTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTT
        FG+++FALITGLNCG LP I+  ++Q   +F   YF  ++ +RR  L+ VF  M  G   D+VKMA+LY LE F+L +Q +  I  E+ L+++D+E F +
Subjt:  FGLREFALITGLNCGPLPQINTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTT

Query:  YPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSS----------MS
        YPWGR ++ +   ++ KA  S  +  IG+GG  +A+  WAYE IP L+     +A RI    PR+ NW  +  PEW++L  K+FQS +          MS
Subjt:  YPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSS----------MS

Query:  YFQPF
        Y  PF
Subjt:  YFQPF

W9SF50 DUF1985 domain-containing protein5.9e-5633.26Show/hide
Query:  VRVDDDYLMSPSKR-SKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFAL
        V ++   L+ P K+ +   KINL  + +++D +N  L  R ++ FR  CFGHLLDF  KK  SQL+ HLI  QC   + +EL+F I G I+KFG++EFAL
Subjt:  VRVDDDYLMSPSKR-SKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFAL

Query:  ITGLNCGPLPQINTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAF
        ITGLNC   P I   +L +S+  K ++F   + V+R  LN VF A + G + D+VK+A+LYCLES L+P++ + +I+  H+ MV++ ELF  YPWGR ++
Subjt:  ITGLNCGPLPQINTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAF

Query:  TLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQS----------SSMSYFQPFLQD
         +  +Y+ ++  S+ +   G+GG  YA++ WAYE IP L     N AKRI N +PRIINWE + QP +RE+  ++F S          S     QPF+  
Subjt:  TLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQS----------SSMSYFQPFLQD

Query:  ELASRRLAG------------DDVRIPP------------NFSVG---APSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLRE-----IHKLDDPPN
            ++               +DV + P            N   G     S+ S+++ MEK   E+   ++ +Y+ML  +  T+ +     + K  +   
Subjt:  ELASRRLAG------------DDVRIPP------------NFSVG---APSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLRE-----IHKLDDPPN

Query:  SKFKMQGDVGTGIDPTTKDGDVEEKEENDEKDDQDDHELEKNPSHRREDDDGGPTGGKQQ
         + +   D     D   K G+ +E++++ EK D+ + ++ ++ + + E       GG+++
Subjt:  SKFKMQGDVGTGIDPTTKDGDVEEKEENDEKDDQDDHELEKNPSHRREDDDGGPTGGKQQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G31150.1 Domain of unknown function (DUF1985)4.7e-1327.57Show/hide
Query:  KINLCCRTEIMDTI-NTILGDRCRDAFRNTCFGHLLDFSFKKTS-SQLLLH-LIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQINTDR
        ++N+  R E + TI N + G    +  +++ FG L +F   + S S  L+H L+  Q   K+  EL+F  GG  ++F +REF ++TGL CG LP  +  +
Subjt:  KINLCCRTEIMDTI-NTILGDRCRDAFRNTCFGHLLDFSFKKTS-SQLLLH-LIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQINTDR

Query:  LQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMA-QLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAF
            S++   +       R  T+  V   ++    +   K+   L  +   ++   ++  +  + V M+ D + F  YPWGR AF
Subjt:  LQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMA-QLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAF

AT1G36970.1 Domain of unknown function (DUF1985)5.0e-0730.66Show/hide
Query:  VDDDYLMSP----SKRSKALKINLCCRTEIMDTINTIL-GDRCRDAFRNTCFGHLLDFSFKKTS-SQLLLH-LIQHQCKPKRTSELYFKIGGKILKFGLR
        +D+D  + P    + R    ++N+  R +I+  I  +L G +  +   ++CFG L      + S S  L+H L+  Q   K+  EL    GG+ L+F L 
Subjt:  VDDDYLMSP----SKRSKALKINLCCRTEIMDTINTIL-GDRCRDAFRNTCFGHLLDFSFKKTS-SQLLLH-LIQHQCKPKRTSELYFKIGGKILKFGLR

Query:  EFALITGLNCGPLP-QINTDRLQDSSRFKDEYFANDE
        EF  +TGL CG  P + + D      + K E F +DE
Subjt:  EFALITGLNCGPLP-QINTDRLQDSSRFKDEYFANDE

AT2G06420.1 Domain of unknown function (DUF1985)1.6e-0525.83Show/hide
Query:  KRTSELYFKIGGKILKFGLREFALITGLNCGPLPQINTD-RLQDSSRFKDEYFAN----DEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLL---
        KR  E +F + G  +++G+ E ALI+G NC     I+   +++++  FK ++F N     E VR K + +V    +     + ++M  LY L + ++   
Subjt:  KRTSELYFKIGGKILKFGLREFALITGLNCGPLPQINTD-RLQDSSRFKDEYFAN----DEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLL---

Query:  -PRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFT-LLTSYMHKASVSRGSV
            +   ++E  +  V D      + WGR +F  +L +  H  +   GSV
Subjt:  -PRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFT-LLTSYMHKASVSRGSV

AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases4.2e-0630Show/hide
Query:  VKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSR-GSVGIGMGGLVYAILAWAYEVIPALSAPP
        ++ A L  ++ FLLP      I ++H  M ED + F +YPWGR +F ++ + + +  V +  +  + + GL+YA+     E +PA+   P
Subjt:  VKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSR-GSVGIGMGGLVYAILAWAYEVIPALSAPP

AT5G45570.1 Ulp1 protease family protein1.9e-0624.66Show/hide
Query:  CRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKK--TSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQINTDRLQDSSR
        C    +  I   LG    D  + T  G  + F+      ++Q +   + +Q +     E++  I  + ++F L EF  ITGLNC    + +T      + 
Subjt:  CRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKK--TSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQINTDRLQDSSR

Query:  FKDEYFANDEGVRRKT------LNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGS
        +KD  F N+ GV          L  VF   K       + + +L  L   +        +       V D   F  YPWGR AF  L+  +        S
Subjt:  FKDEYFANDEGVRRKT------LNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGS

Query:  VGIGMGGLVYAILAWAYEVIPAL
          I   G V  +L W YE +P +
Subjt:  VGIGMGGLVYAILAWAYEVIPAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCAAGGATAAGAACCACCCAACTAGAGCGAGTGACCGTTTGAAGGCTGCAGGAGTAACCCCAGGAAGAAAACCCCGTAAACAAACATCCCCAATCACATTGGGGAG
CGAACAGGATTCTGGAGACGCCATGAGTACATCAGTTTCAGTCGCTAAGGGATCTGGCGAGAAGACGAAAGGGGTAAAAAGGGACAGAGGCGACGGAGGTTCGGGCAAAA
AAGTAACTCCAACAAAGAAAACAAAAGTTCACGAACGGACCAAGAAGACCAACGATGAGATTGAGAAGAAACCCACTGGGGCACGAAGCAATAAGAAGAAAACGCGAGCG
AAACAGACAAATGACACAGATAAGGCGAGCCCTGTGACACCAGAGATTGCCCTTGAAACAAGCGAGGACACAGCTAAAAATGACACCGAAGACACCGAATCTAATAGTGT
GACGAATGACAACTCCTCGAGTGATGACGTAGGGGAAGAACGAGAGAAAAAGAAGACAACAATTGCTAAAAAGGAACCTCCTAAAAAACAGAAGGGTGGAAAAAAAGGAA
AAAAGCTGAAGACCATGGTTGAAGGTGACACTGTCCGAGTGGACGACGATTACCTTATGTCGCCATCAAAAAGAAGTAAGGCCCTAAAGATTAACCTATGTTGCAGAACA
GAAATAATGGACACCATCAACACCATCTTAGGAGATAGGTGTAGAGACGCTTTCAGAAACACGTGCTTTGGCCACCTGCTTGACTTCTCGTTCAAAAAGACGTCTTCCCA
GTTACTATTGCACCTGATCCAGCATCAATGCAAACCCAAACGGACATCGGAACTTTACTTCAAGATTGGGGGGAAAATCTTAAAGTTTGGTCTACGGGAGTTCGCACTAA
TTACGGGACTAAATTGTGGCCCATTGCCACAAATTAACACAGACAGGCTACAAGATTCATCCAGGTTCAAGGATGAGTATTTTGCCAACGACGAAGGTGTCAGAAGAAAG
ACACTTAATATAGTATTCAACGCAATGAAGCATGGTGTCGAGGCAGACCTCGTAAAGATGGCGCAGTTGTATTGTTTGGAGAGCTTTTTGTTACCTAGGCAAGAAAAGGT
GCACATTGAAGAGGAACATGTCCTAATGGTTGAAGACCAAGAATTGTTCACCACCTACCCTTGGGGGCGCGCCGCCTTCACACTATTGACAAGCTACATGCATAAAGCAT
CCGTTAGTAGGGGCAGTGTTGGTATTGGAATGGGCGGATTAGTGTATGCCATCCTTGCATGGGCATACGAAGTGATACCTGCATTGAGCGCGCCACCGACCAACTACGCA
AAACGGATCAGAAATACAGTCCCCCGCATCATAAATTGGGAGGTAGAAGCTCAACCCGAATGGAGAGAACTGCACGTCAAGATATTCCAATCCTCATCGATGTCGTACTT
CCAACCTTTCTTGCAAGATGAGTTGGCTTCTCGGCGATTGGCCGGCGACGATGTTCGAATCCCGCCGAACTTCTCAGTAGGGGCACCCTCAATGACCAGCCAGATGGATG
TGATGGAAAAACGCCATCAAGAAATAATTGGAAAGCTTGACAGAGTTTACTCTATGCTAGGAGCCTTGGTGGACACTTTGAGGGAGATACACAAGCTTGACGACCCCCCA
AACTCAAAATTCAAGATGCAAGGAGATGTTGGGACTGGTATTGACCCTACAACAAAAGACGGTGATGTGGAGGAAAAAGAAGAAAATGATGAAAAAGATGATCAAGATGA
CCACGAATTAGAGAAAAATCCTTCTCATCGAAGGGAAGACGACGATGGAGGACCAACAGGTGGGAAACAGCAACAGGGTCCGACCACCCCCGGACCGACAACCCTTGTAC
AGACTGAAACTCGTGTAGATGGCGAAGGCACGGGAGATGGCGGGACAAAGAAAACAGGAGGTGGTGAAGGCACCAAGGCCTGTGATGATGCCGACGAGACAATAAACAAG
GCTATACTGTCAATAGATGAGGCCAAGGTGATTGAGAAGTTTAATAGGGACCGCAAGGGTAAAGCGGTTATGGTGGAAGGACCTCATACCATACCAAGAACCACAGTTCC
ACAACTTGGCCCCCGATGGTCTGCAAAGGTTAACGGTACCATAACCCGGGGGGGAGCCAAGCGCTCAGATTGTGTGGAAGAAGTGGGGAGCCTGCAAGCCACAGGAATTT
ATGTGGACGCGATGAGGGGCACGTGGACAAAAGAATCGAGGGAATCCCTACCGCCAGAATTCTTCCAGCCGTCTTTTGATCTTCATCTCAGTCAGGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCCAAGGATAAGAACCACCCAACTAGAGCGAGTGACCGTTTGAAGGCTGCAGGAGTAACCCCAGGAAGAAAACCCCGTAAACAAACATCCCCAATCACATTGGGGAG
CGAACAGGATTCTGGAGACGCCATGAGTACATCAGTTTCAGTCGCTAAGGGATCTGGCGAGAAGACGAAAGGGGTAAAAAGGGACAGAGGCGACGGAGGTTCGGGCAAAA
AAGTAACTCCAACAAAGAAAACAAAAGTTCACGAACGGACCAAGAAGACCAACGATGAGATTGAGAAGAAACCCACTGGGGCACGAAGCAATAAGAAGAAAACGCGAGCG
AAACAGACAAATGACACAGATAAGGCGAGCCCTGTGACACCAGAGATTGCCCTTGAAACAAGCGAGGACACAGCTAAAAATGACACCGAAGACACCGAATCTAATAGTGT
GACGAATGACAACTCCTCGAGTGATGACGTAGGGGAAGAACGAGAGAAAAAGAAGACAACAATTGCTAAAAAGGAACCTCCTAAAAAACAGAAGGGTGGAAAAAAAGGAA
AAAAGCTGAAGACCATGGTTGAAGGTGACACTGTCCGAGTGGACGACGATTACCTTATGTCGCCATCAAAAAGAAGTAAGGCCCTAAAGATTAACCTATGTTGCAGAACA
GAAATAATGGACACCATCAACACCATCTTAGGAGATAGGTGTAGAGACGCTTTCAGAAACACGTGCTTTGGCCACCTGCTTGACTTCTCGTTCAAAAAGACGTCTTCCCA
GTTACTATTGCACCTGATCCAGCATCAATGCAAACCCAAACGGACATCGGAACTTTACTTCAAGATTGGGGGGAAAATCTTAAAGTTTGGTCTACGGGAGTTCGCACTAA
TTACGGGACTAAATTGTGGCCCATTGCCACAAATTAACACAGACAGGCTACAAGATTCATCCAGGTTCAAGGATGAGTATTTTGCCAACGACGAAGGTGTCAGAAGAAAG
ACACTTAATATAGTATTCAACGCAATGAAGCATGGTGTCGAGGCAGACCTCGTAAAGATGGCGCAGTTGTATTGTTTGGAGAGCTTTTTGTTACCTAGGCAAGAAAAGGT
GCACATTGAAGAGGAACATGTCCTAATGGTTGAAGACCAAGAATTGTTCACCACCTACCCTTGGGGGCGCGCCGCCTTCACACTATTGACAAGCTACATGCATAAAGCAT
CCGTTAGTAGGGGCAGTGTTGGTATTGGAATGGGCGGATTAGTGTATGCCATCCTTGCATGGGCATACGAAGTGATACCTGCATTGAGCGCGCCACCGACCAACTACGCA
AAACGGATCAGAAATACAGTCCCCCGCATCATAAATTGGGAGGTAGAAGCTCAACCCGAATGGAGAGAACTGCACGTCAAGATATTCCAATCCTCATCGATGTCGTACTT
CCAACCTTTCTTGCAAGATGAGTTGGCTTCTCGGCGATTGGCCGGCGACGATGTTCGAATCCCGCCGAACTTCTCAGTAGGGGCACCCTCAATGACCAGCCAGATGGATG
TGATGGAAAAACGCCATCAAGAAATAATTGGAAAGCTTGACAGAGTTTACTCTATGCTAGGAGCCTTGGTGGACACTTTGAGGGAGATACACAAGCTTGACGACCCCCCA
AACTCAAAATTCAAGATGCAAGGAGATGTTGGGACTGGTATTGACCCTACAACAAAAGACGGTGATGTGGAGGAAAAAGAAGAAAATGATGAAAAAGATGATCAAGATGA
CCACGAATTAGAGAAAAATCCTTCTCATCGAAGGGAAGACGACGATGGAGGACCAACAGGTGGGAAACAGCAACAGGGTCCGACCACCCCCGGACCGACAACCCTTGTAC
AGACTGAAACTCGTGTAGATGGCGAAGGCACGGGAGATGGCGGGACAAAGAAAACAGGAGGTGGTGAAGGCACCAAGGCCTGTGATGATGCCGACGAGACAATAAACAAG
GCTATACTGTCAATAGATGAGGCCAAGGTGATTGAGAAGTTTAATAGGGACCGCAAGGGTAAAGCGGTTATGGTGGAAGGACCTCATACCATACCAAGAACCACAGTTCC
ACAACTTGGCCCCCGATGGTCTGCAAAGGTTAACGGTACCATAACCCGGGGGGGAGCCAAGCGCTCAGATTGTGTGGAAGAAGTGGGGAGCCTGCAAGCCACAGGAATTT
ATGTGGACGCGATGAGGGGCACGTGGACAAAAGAATCGAGGGAATCCCTACCGCCAGAATTCTTCCAGCCGTCTTTTGATCTTCATCTCAGTCAGGGTTAA
Protein sequenceShow/hide protein sequence
MPKDKNHPTRASDRLKAAGVTPGRKPRKQTSPITLGSEQDSGDAMSTSVSVAKGSGEKTKGVKRDRGDGGSGKKVTPTKKTKVHERTKKTNDEIEKKPTGARSNKKKTRA
KQTNDTDKASPVTPEIALETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEPPKKQKGGKKGKKLKTMVEGDTVRVDDDYLMSPSKRSKALKINLCCRT
EIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQINTDRLQDSSRFKDEYFANDEGVRRK
TLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYA
KRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSMSYFQPFLQDELASRRLAGDDVRIPPNFSVGAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPP
NSKFKMQGDVGTGIDPTTKDGDVEEKEENDEKDDQDDHELEKNPSHRREDDDGGPTGGKQQQGPTTPGPTTLVQTETRVDGEGTGDGGTKKTGGGEGTKACDDADETINK
AILSIDEAKVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLGPRWSAKVNGTITRGGAKRSDCVEEVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQG