; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg039384 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg039384
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein Ycf2-like
Genome locationscaffold10:42054388..42058076
RNA-Seq ExpressionSpg039384
SyntenySpg039384
Gene Ontology termsNA
InterPro domainsIPR015410 - Domain of unknown function DUF1985


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047596.1 protein Ycf2-like [Cucumis melo var. makuwa]6.1e-8733.41Show/hide
Query:  TRASDHLKAAGVTPGRKPRKQTSPITLGSEQDSGDAMSTSVSVAKGSGEKTKGVKRDRGDGGSGKKVTP-TKKTKVHERTKKTNDEIEKKP--TGARSNK
        TR SD L+AAG+T  RK                  ++ T +     S E+   ++    +G  GK+ +P T K +V   TKK    ++KK       S K
Subjt:  TRASDHLKAAGVTPGRKPRKQTSPITLGSEQDSGDAMSTSVSVAKGSGEKTKGVKRDRGDGGSGKKVTP-TKKTKVHERTKKTNDEIEKKP--TGARSNK

Query:  KKTRAKQTNDTDKASPVTPEIAL-ETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREK-KKTTIAKKEPPKKQKGGKKGKKLKTMVEGDTVRVDDDYLM
        +  R K      K   V  +IA+ E S  +      DT   S            + REK KK    +KE   ++   +KGK      + +    D  YLM
Subjt:  KKTRAKQTNDTDKASPVTPEIAL-ETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREK-KKTTIAKKEPPKKQKGGKKGKKLKTMVEGDTVRVDDDYLM

Query:  SPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLP
           +R++ LKINL  ++ +++ I   LGDR  + FR   FGH L+ S    SSQLLLHLIQ  CKPK TS+L F IGG++L+FGLREFALITGL C  +P
Subjt:  SPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLP

Query:  QIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKA
         I+ + +    R K  YF N + V R+ LN++FN    G + D +KMA+LY LESFL+P+QE   ++ +H++MV+D E+F  YPWGR AF LL  +M++ 
Subjt:  QIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKA

Query:  SVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRR
          S+G  GI MGG ++ ILAWAYEVIP LS PP  +  RI N VPRIIN   + QP+W++L  K+F S +LEV P+  T  E+ M +F PF++ E    +
Subjt:  SVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRR

Query:  LAGDEQQVGDDVRIPPNFSV-RAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNSKFKMQGDVGTSIDPTTK----------DGD
         A DE +   +     + S+ R    TS+++V+ K  ++I     R+ + +  L++ L+ +   +   N++F+        I  TTK          +  
Subjt:  LAGDEQQVGDDVRIPPNFSV-RAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNSKFKMQGDVGTSIDPTTK----------DGD

Query:  VEEKEENDEKDDQDDHELE---KNPSHRREDDDGGPTGG--KQQQGPTT----PGPTTLVQTETRVDGEGTGDGGTKKTGGGEGT--KACDDADETINKA
        +++ EE+ E+DD +D  L+   +    +R+DD+     G   + QG ++     G + + ++ET             K G  E +  KA ++ DE IN+ 
Subjt:  VEEKEENDEKDDQDDHELE---KNPSHRREDDDGGPTGG--KQQQGPTT----PGPTTLVQTETRVDGEGTGDGGTKKTGGGEGT--KACDDADETINKA

Query:  ILSIDEA----KVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLG-----PRWS--------------------------AKVNGTITRGGAKRSDCVE--E
        I SIDE+    ++ +K  R R G+  +   P  + R   P  G     P+ +                            +N T    G KRS+  E  E
Subjt:  ILSIDEA----KVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLG-----PRWS--------------------------AKVNGTITRGGAKRSDCVE--E

Query:  VGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQ
        V  + +TGI++DA+RG   +  ++        PSFDLHLSQ
Subjt:  VGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQ

TYK09852.1 protein Ycf2-like [Cucumis melo var. makuwa]2.0e-6934.74Show/hide
Query:  CKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEK
        CKPK TS+L F IG ++L+FGLREFALITGL C  +P I+ + ++   R K  YF N + V R+ LN++FN    G + D +KMA+LY LESFL+P+QE 
Subjt:  CKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEK

Query:  VHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSVGIGM-GGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELH
        + ++ +H++MV+D E+F  YPWGR AF LL  +M++A  S+G  GI M GG ++ ILAWAYEVIP LS PP  +A RI N VPRIINW  + QP+W++L 
Subjt:  VHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSVGIGM-GGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELH

Query:  VKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRRLAGDEQQVGDDVRIPPNFSV-RAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIH
         K+F S +LEV P+  T  E+ M +F PF + E    + A DE +   +     + S+ R    TS+++V+ K  ++I     R+ + +  L++ L+ + 
Subjt:  VKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRRLAGDEQQVGDDVRIPPNFSV-RAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIH

Query:  KLDDPPNSKFKMQGDVGTSIDPTTKDGDVEEKEENDEKDDQDDHELE-KNPSHRREDDDGGPTGGKQQQGPTTPGPTTLVQTETRVDGEGTGDGGTKKTG
          +   N++F+  G           +  +++ EE+ E+DD +D  L+  N +   + DD     GK+            +  E++ +     DGG  K  
Subjt:  KLDDPPNSKFKMQGDVGTSIDPTTKDGDVEEKEENDEKDDQDDHELE-KNPSHRREDDDGGPTGGKQQQGPTTPGPTTLVQTETRVDGEGTGDGGTKKTG

Query:  GGEGT-----------KACDDADETINKAILSIDEA----KVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLG-----PRWS-------------------
          E             KA ++ DE IN+ I SIDE+    ++ +K  R R G+  +   P  + R   P  G     P+ +                   
Subjt:  GGEGT-----------KACDDADETINKAILSIDEA----KVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLG-----PRWS-------------------

Query:  -------AKVNGTITRGGAKRSDCVE--EVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQ
                 +N T    G KRS+  E  EV  + +TGI++DA+RG   +  ++        PSFDLHLSQ
Subjt:  -------AKVNGTITRGGAKRSDCVE--EVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQ

XP_031743195.1 uncharacterized protein LOC101221625 isoform X8 [Cucumis sativus]8.1e-6333.82Show/hide
Query:  DKASPVTPEIALETSEDTAKN---DTEDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEPPKKQKGGKKGKKLKTMVEGDTVRVDDD---YLMSPSKRSK
        +K SP + E + +  +  + +     +  E++ +  D  S   V   + KK    +K E  K++K GK+GK  K+ + G +   DD+    L+  S  + 
Subjt:  DKASPVTPEIALETSEDTAKN---DTEDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEPPKKQKGGKKGKKLKTMVEGDTVRVDDD---YLMSPSKRSK

Query:  ALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQIDTDRL
          +INL  + +++  I   L +R    F+ +CFG+ LD    K SSQL  HLI+ QC  K  +EL+F + G+I KFG+++FALITGLNCG LP ID  ++
Subjt:  ALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQIDTDRL

Query:  QDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSV
        Q   +F   YF  ++ +RR  L+ VF  M  G   D+VKMA+LY LE F+L +Q +  I  E+ L+++D++ F +YPWGR ++ +   ++ K+  S  + 
Subjt:  QDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSV

Query:  GIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELAS-RRLAGDEQ
         IG+GG  YA+L WAYE IP L+      A RI    PR+ NW     PEW++L  K+FQS + +V PL  T TEM+M Y  PF   + ++ + ++  +Q
Subjt:  GIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELAS-RRLAGDEQ

Query:  QVGDDVRIPPNFSVRAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNSKFKMQGDVGTSID
        +   D R   N         SQ    +     +  K+  +  +LG+LV      H +D+  +   KM G    + D
Subjt:  QVGDDVRIPPNFSVRAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNSKFKMQGDVGTSID

XP_038883716.1 uncharacterized protein LOC120074618 isoform X2 [Benincasa hispida]1.8e-6233.54Show/hide
Query:  DKASPVTPEIALETSEDTAKNDT------EDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEPPKKQKGGKKGKK----LKTMVEGDTVRVDDDY--LMS
        +K SP T E   +  +  ++  T       + E++ +  D  S  +VG ++ K  +  +K    K++K  K+GKK           D V V  +Y  L+ 
Subjt:  DKASPVTPEIALETSEDTAKNDT------EDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEPPKKQKGGKKGKK----LKTMVEGDTVRVDDDY--LMS

Query:  PSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQ
         S  S   +INL  + +++  I   L +R    F+ +CFG  LD    K SSQL  HL++ QC     +EL+F + G+I KFG++EF+LITGLNCG LP+
Subjt:  PSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQ

Query:  IDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKAS
        ID  ++Q   +F   YF  ++ ++R  L+ VF  M  G   D+VKMA+LY LE F+L +Q +  I  E+ L+V+D+E F  YPWGR ++ +   ++ KA 
Subjt:  IDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKAS

Query:  VSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRRL
         S  +  IG+GG  +A+L WAYE IP L       A R+    PR+ NW  +  PEW++L  K+FQS S +V PL  T TEM+M Y  PF   + +  ++
Subjt:  VSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRRL

Query:  AGD-EQQVGDDVRIPPN---FSVRAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNSKFKMQGDVGTSIDP
            +Q+   D R   N    + + P       V       +  K+  +  +LG+LV      H +D+  N   K+  +V  + DP
Subjt:  AGD-EQQVGDDVRIPPN---FSVRAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNSKFKMQGDVGTSIDP

XP_038883719.1 uncharacterized protein LOC120074618 isoform X5 [Benincasa hispida]2.4e-6233.81Show/hide
Query:  DKASPVTPEIALETSEDTAKNDT------EDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEPPKKQKGGKKGKK---LKTMVEGDTVRVDDDY--LMSP
        +K SP T E   +  +  ++  T       + E++ +  D  S  +VG ++ K  +  +K    K++K  K+GKK     T  E D    D +Y  L+  
Subjt:  DKASPVTPEIALETSEDTAKNDT------EDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEPPKKQKGGKKGKK---LKTMVEGDTVRVDDDY--LMSP

Query:  SKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQI
        S  S   +INL  + +++  I   L +R    F+ +CFG  LD    K SSQL  HL++ QC     +EL+F + G+I KFG++EF+LITGLNCG LP+I
Subjt:  SKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQI

Query:  DTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASV
        D  ++Q   +F   YF  ++ ++R  L+ VF  M  G   D+VKMA+LY LE F+L +Q +  I  E+ L+V+D+E F  YPWGR ++ +   ++ KA  
Subjt:  DTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASV

Query:  SRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRRLA
        S  +  IG+GG  +A+L WAYE IP L       A R+    PR+ NW  +  PEW++L  K+FQS S +V PL  T TEM+M Y  PF   + +  ++ 
Subjt:  SRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRRLA

Query:  GD-EQQVGDDVRIPPN---FSVRAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNSKFKMQGDVGTSIDP
           +Q+   D R   N    + + P       V       +  K+  +  +LG+LV      H +D+  N   K+  +V  + DP
Subjt:  GD-EQQVGDDVRIPPN---FSVRAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNSKFKMQGDVGTSIDP

TrEMBL top hitse value%identityAlignment
A0A1S3B065 uncharacterized protein LOC103484737 isoform X41.1e-6236.92Show/hide
Query:  EKKPTGARSNKKKTRAKQTNDTDKASPVTPEIALETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEP---PKKQKGGKKGKK----LK
        ++ P   ++ +KK + K  +              E +E+T++ DT+      V           E R+KKK     K+     K++K GKKG K      
Subjt:  EKKPTGARSNKKKTRAKQTNDTDKASPVTPEIALETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEP---PKKQKGGKKGKK----LK

Query:  TMVEGDTVRVDDDY--LMSPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILK
             D V V  +Y  L+  S  S   +INL  + +++  I   L +R    F+ +CFG+ LD    K SSQL  HLI+ QC  K   EL+F + G+I K
Subjt:  TMVEGDTVRVDDDY--LMSPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILK

Query:  FGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTT
        FG+++FALITGLNCG LP ID  ++Q   +F   YF  ++ +RR  L+ VF  M  G   D+VKMA+LY LE F+L +Q +  I  E+ L+++D+E F +
Subjt:  FGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTT

Query:  YPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTE
        YPWGR ++ +   ++ KA  S  +  IG+GG  +A+  WAYE IP L+     +A RI    PR+ NW  +  PEW++L  K+FQS + +V PL  T+TE
Subjt:  YPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTE

Query:  MQMSYFQPF
        M+MSY  PF
Subjt:  MQMSYFQPF

A0A1S3B0L9 uncharacterized protein LOC103484737 isoform X51.1e-6236.92Show/hide
Query:  EKKPTGARSNKKKTRAKQTNDTDKASPVTPEIALETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEP---PKKQKGGKKGKK----LK
        ++ P   ++ +KK + K  +              E +E+T++ DT+      V           E R+KKK     K+     K++K GKKG K      
Subjt:  EKKPTGARSNKKKTRAKQTNDTDKASPVTPEIALETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEP---PKKQKGGKKGKK----LK

Query:  TMVEGDTVRVDDDY--LMSPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILK
             D V V  +Y  L+  S  S   +INL  + +++  I   L +R    F+ +CFG+ LD    K SSQL  HLI+ QC  K   EL+F + G+I K
Subjt:  TMVEGDTVRVDDDY--LMSPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILK

Query:  FGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTT
        FG+++FALITGLNCG LP ID  ++Q   +F   YF  ++ +RR  L+ VF  M  G   D+VKMA+LY LE F+L +Q +  I  E+ L+++D+E F +
Subjt:  FGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTT

Query:  YPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTE
        YPWGR ++ +   ++ KA  S  +  IG+GG  +A+  WAYE IP L+     +A RI    PR+ NW  +  PEW++L  K+FQS + +V PL  T+TE
Subjt:  YPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTE

Query:  MQMSYFQPF
        M+MSY  PF
Subjt:  MQMSYFQPF

A0A1S4DTS6 uncharacterized protein LOC103484737 isoform X31.1e-6236.92Show/hide
Query:  EKKPTGARSNKKKTRAKQTNDTDKASPVTPEIALETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEP---PKKQKGGKKGKK----LK
        ++ P   ++ +KK + K  +              E +E+T++ DT+      V           E R+KKK     K+     K++K GKKG K      
Subjt:  EKKPTGARSNKKKTRAKQTNDTDKASPVTPEIALETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEP---PKKQKGGKKGKK----LK

Query:  TMVEGDTVRVDDDY--LMSPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILK
             D V V  +Y  L+  S  S   +INL  + +++  I   L +R    F+ +CFG+ LD    K SSQL  HLI+ QC  K   EL+F + G+I K
Subjt:  TMVEGDTVRVDDDY--LMSPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILK

Query:  FGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTT
        FG+++FALITGLNCG LP ID  ++Q   +F   YF  ++ +RR  L+ VF  M  G   D+VKMA+LY LE F+L +Q +  I  E+ L+++D+E F +
Subjt:  FGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTT

Query:  YPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTE
        YPWGR ++ +   ++ KA  S  +  IG+GG  +A+  WAYE IP L+     +A RI    PR+ NW  +  PEW++L  K+FQS + +V PL  T+TE
Subjt:  YPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTE

Query:  MQMSYFQPF
        M+MSY  PF
Subjt:  MQMSYFQPF

A0A5A7U047 Protein Ycf2-like3.0e-8733.41Show/hide
Query:  TRASDHLKAAGVTPGRKPRKQTSPITLGSEQDSGDAMSTSVSVAKGSGEKTKGVKRDRGDGGSGKKVTP-TKKTKVHERTKKTNDEIEKKP--TGARSNK
        TR SD L+AAG+T  RK                  ++ T +     S E+   ++    +G  GK+ +P T K +V   TKK    ++KK       S K
Subjt:  TRASDHLKAAGVTPGRKPRKQTSPITLGSEQDSGDAMSTSVSVAKGSGEKTKGVKRDRGDGGSGKKVTP-TKKTKVHERTKKTNDEIEKKP--TGARSNK

Query:  KKTRAKQTNDTDKASPVTPEIAL-ETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREK-KKTTIAKKEPPKKQKGGKKGKKLKTMVEGDTVRVDDDYLM
        +  R K      K   V  +IA+ E S  +      DT   S            + REK KK    +KE   ++   +KGK      + +    D  YLM
Subjt:  KKTRAKQTNDTDKASPVTPEIAL-ETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREK-KKTTIAKKEPPKKQKGGKKGKKLKTMVEGDTVRVDDDYLM

Query:  SPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLP
           +R++ LKINL  ++ +++ I   LGDR  + FR   FGH L+ S    SSQLLLHLIQ  CKPK TS+L F IGG++L+FGLREFALITGL C  +P
Subjt:  SPSKRSKALKINLCCRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLP

Query:  QIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKA
         I+ + +    R K  YF N + V R+ LN++FN    G + D +KMA+LY LESFL+P+QE   ++ +H++MV+D E+F  YPWGR AF LL  +M++ 
Subjt:  QIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKA

Query:  SVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRR
          S+G  GI MGG ++ ILAWAYEVIP LS PP  +  RI N VPRIIN   + QP+W++L  K+F S +LEV P+  T  E+ M +F PF++ E    +
Subjt:  SVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRR

Query:  LAGDEQQVGDDVRIPPNFSV-RAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNSKFKMQGDVGTSIDPTTK----------DGD
         A DE +   +     + S+ R    TS+++V+ K  ++I     R+ + +  L++ L+ +   +   N++F+        I  TTK          +  
Subjt:  LAGDEQQVGDDVRIPPNFSV-RAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIHKLDDPPNSKFKMQGDVGTSIDPTTK----------DGD

Query:  VEEKEENDEKDDQDDHELE---KNPSHRREDDDGGPTGG--KQQQGPTT----PGPTTLVQTETRVDGEGTGDGGTKKTGGGEGT--KACDDADETINKA
        +++ EE+ E+DD +D  L+   +    +R+DD+     G   + QG ++     G + + ++ET             K G  E +  KA ++ DE IN+ 
Subjt:  VEEKEENDEKDDQDDHELE---KNPSHRREDDDGGPTGG--KQQQGPTT----PGPTTLVQTETRVDGEGTGDGGTKKTGGGEGT--KACDDADETINKA

Query:  ILSIDEA----KVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLG-----PRWS--------------------------AKVNGTITRGGAKRSDCVE--E
        I SIDE+    ++ +K  R R G+  +   P  + R   P  G     P+ +                            +N T    G KRS+  E  E
Subjt:  ILSIDEA----KVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLG-----PRWS--------------------------AKVNGTITRGGAKRSDCVE--E

Query:  VGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQ
        V  + +TGI++DA+RG   +  ++        PSFDLHLSQ
Subjt:  VGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQ

A0A5D3CEX9 Protein Ycf2-like9.6e-7034.74Show/hide
Query:  CKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEK
        CKPK TS+L F IG ++L+FGLREFALITGL C  +P I+ + ++   R K  YF N + V R+ LN++FN    G + D +KMA+LY LESFL+P+QE 
Subjt:  CKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEK

Query:  VHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSVGIGM-GGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELH
        + ++ +H++MV+D E+F  YPWGR AF LL  +M++A  S+G  GI M GG ++ ILAWAYEVIP LS PP  +A RI N VPRIINW  + QP+W++L 
Subjt:  VHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSVGIGM-GGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINWEVEAQPEWRELH

Query:  VKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRRLAGDEQQVGDDVRIPPNFSV-RAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIH
         K+F S +LEV P+  T  E+ M +F PF + E    + A DE +   +     + S+ R    TS+++V+ K  ++I     R+ + +  L++ L+ + 
Subjt:  VKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRRLAGDEQQVGDDVRIPPNFSV-RAPSMTSQMDVMEKRHQEIIGKLDRVYSMLGALVDTLREIH

Query:  KLDDPPNSKFKMQGDVGTSIDPTTKDGDVEEKEENDEKDDQDDHELE-KNPSHRREDDDGGPTGGKQQQGPTTPGPTTLVQTETRVDGEGTGDGGTKKTG
          +   N++F+  G           +  +++ EE+ E+DD +D  L+  N +   + DD     GK+            +  E++ +     DGG  K  
Subjt:  KLDDPPNSKFKMQGDVGTSIDPTTKDGDVEEKEENDEKDDQDDHELE-KNPSHRREDDDGGPTGGKQQQGPTTPGPTTLVQTETRVDGEGTGDGGTKKTG

Query:  GGEGT-----------KACDDADETINKAILSIDEA----KVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLG-----PRWS-------------------
          E             KA ++ DE IN+ I SIDE+    ++ +K  R R G+  +   P  + R   P  G     P+ +                   
Subjt:  GGEGT-----------KACDDADETINKAILSIDEA----KVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLG-----PRWS-------------------

Query:  -------AKVNGTITRGGAKRSDCVE--EVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQ
                 +N T    G KRS+  E  EV  + +TGI++DA+RG   +  ++        PSFDLHLSQ
Subjt:  -------AKVNGTITRGGAKRSDCVE--EVGSLQATGIYVDAMRGTWTKESRESLPPEFFQPSFDLHLSQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G31150.1 Domain of unknown function (DUF1985)1.3e-1328.11Show/hide
Query:  KINLCCRTEIMDTI-NTILGDRCRDAFRNTCFGHLLDFSFKKTS-SQLLLH-LIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQIDTDR
        ++N+  R E + TI N + G    +  +++ FG L +F   + S S  L+H L+  Q   K+  EL+F  GG  ++F +REF ++TGL CG LP  D  +
Subjt:  KINLCCRTEIMDTI-NTILGDRCRDAFRNTCFGHLLDFSFKKTS-SQLLLH-LIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQIDTDR

Query:  LQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMA-QLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAF
            S++   +       R  T+  V   ++    +   K+   L  +   ++   ++  +  + V M+ D + F  YPWGR AF
Subjt:  LQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMA-QLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAF

AT1G36970.1 Domain of unknown function (DUF1985)1.4e-0731.39Show/hide
Query:  VDDDYLMSP----SKRSKALKINLCCRTEIMDTINTIL-GDRCRDAFRNTCFGHLLDFSFKKTS-SQLLLH-LIQHQCKPKRTSELYFKIGGKILKFGLR
        +D+D  + P    + R    ++N+  R +I+  I  +L G +  +   ++CFG L      + S S  L+H L+  Q   K+  EL    GG+ L+F L 
Subjt:  VDDDYLMSP----SKRSKALKINLCCRTEIMDTINTIL-GDRCRDAFRNTCFGHLLDFSFKKTS-SQLLLH-LIQHQCKPKRTSELYFKIGGKILKFGLR

Query:  EFALITGLNCGPLP-QIDTDRLQDSSRFKDEYFANDE
        EF  +TGL CG  P + D D      + K E F +DE
Subjt:  EFALITGLNCGPLP-QIDTDRLQDSSRFKDEYFANDE

AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases4.3e-0630Show/hide
Query:  VKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSR-GSVGIGMGGLVYAILAWAYEVIPALSAPP
        ++ A L  ++ FLLP      I ++H  M ED + F +YPWGR +F ++ + + +  V +  +  + + GL+YA+     E +PA+   P
Subjt:  VKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSR-GSVGIGMGGLVYAILAWAYEVIPALSAPP

AT4G08430.1 Ulp1 protease family protein7.4e-0626.74Show/hide
Query:  ELYFKIGGKILKFGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVH-----
        E++  I  + ++F L EF  ITGLNC    + DT      + +KD  F N+ GV   ++  +F  ++   E       +   +   L  +   VH     
Subjt:  ELYFKIGGKILKFGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRKTLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVH-----

Query:  --IEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINW
          +       V D   F  YPWGR AF  L   +        S  I   G V A+L W YE +P +      + K     VP +++W
Subjt:  --IEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYAKRIRNTVPRIINW

AT5G45570.1 Ulp1 protease family protein5.1e-0725.11Show/hide
Query:  CRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKK--TSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQIDTDRLQDSSR
        C    +  I   LG    D  + T  G  + F+      ++Q +   + +Q +     E++  I  + ++F L EF  ITGLNC    + DT      + 
Subjt:  CRTEIMDTINTILGDRCRDAFRNTCFGHLLDFSFKK--TSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQIDTDRLQDSSR

Query:  FKDEYFANDEGVRRKT------LNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGS
        +KD  F N+ GV          L  VF   K       + + +L  L   +        +       V D   F  YPWGR AF  L+  +        S
Subjt:  FKDEYFANDEGVRRKT------LNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGS

Query:  VGIGMGGLVYAILAWAYEVIPAL
          I   G V  +L W YE +P +
Subjt:  VGIGMGGLVYAILAWAYEVIPAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCAAGGATAAGAACCACCCAACTAGAGCGAGTGACCATTTGAAGGCTGCAGGAGTAACCCCAGGAAGAAAACCCCGTAAACAAACATCCCCAATCACATTGGGGAG
CGAACAGGATTCTGGAGACGCCATGAGTACATCAGTTTCAGTCGCTAAGGGATCTGGCGAGAAGACGAAAGGGGTAAAAAGGGACAGAGGCGACGGAGGTTCGGGCAAAA
AAGTAACTCCAACAAAGAAAACAAAAGTTCACGAACGGACCAAGAAGACCAACGATGAGATTGAGAAGAAACCCACTGGGGCACGAAGCAATAAGAAGAAAACGCGAGCG
AAACAGACAAATGACACAGATAAGGCGAGCCCTGTGACACCAGAGATTGCCCTTGAAACAAGCGAGGACACAGCTAAAAATGACACCGAAGACACCGAATCTAATAGTGT
GACGAATGACAACTCCTCGAGTGATGACGTAGGGGAAGAACGAGAGAAAAAGAAGACAACAATTGCTAAAAAGGAACCTCCTAAAAAACAGAAGGGTGGAAAAAAAGGAA
AAAAGCTGAAGACCATGGTTGAAGGTGACACTGTCCGAGTGGACGACGATTACCTTATGTCGCCATCAAAAAGAAGTAAGGCCCTAAAGATTAACCTATGTTGCAGAACA
GAAATAATGGACACCATCAACACCATCTTAGGAGATAGGTGTAGAGACGCTTTCAGAAACACGTGCTTTGGCCACCTGCTTGACTTCTCGTTCAAAAAGACGTCTTCCCA
GTTACTATTGCACCTGATCCAGCATCAATGCAAACCCAAACGGACGTCGGAACTTTACTTCAAGATTGGGGGGAAAATCTTAAAGTTTGGTCTACGGGAGTTCGCACTAA
TTACGGGACTAAATTGTGGCCCATTGCCACAAATTGACACAGACAGACTACAAGATTCATCCAGGTTCAAGGATGAGTATTTTGCCAACGACGAAGGTGTCAGAAGAAAG
ACACTTAATATAGTATTCAACGCAATGAAGCATGGTGTCGAGGCAGACCTCGTAAAGATGGCGCAGTTGTATTGTTTGGAGAGCTTTTTGTTACCTAGGCAAGAAAAGGT
GCACATTGAAGAGGAACATGTCCTAATGGTTGAAGACCAAGAATTGTTCACCACCTACCCTTGGGGGCGCGCCGCCTTCACACTATTGACAAGCTACATGCATAAAGCAT
CCGTTAGTAGGGGCAGTGTTGGTATTGGAATGGGCGGATTAGTGTATGCCATCCTTGCATGGGCATACGAAGTGATACCTGCATTGAGCGCGCCACCGACCAACTACGCA
AAACGGATCAGAAATACAGTCCCCCGCATCATAAATTGGGAGGTAGAAGCTCAACCCGAATGGAGAGAACTGCACGTCAAGATATTCCAATCCTCATCGTTGGAGGTTGT
ACCATTGGACCCAACTGACACGGAAATGCAGATGTCGTACTTCCAACCTTTCTTGCAAGATGAGTTGGCTTCTCGGCGATTGGCCGGCGACGAACAACAAGTAGGCGACG
ATGTTCGAATCCCGCCGAACTTCTCAGTAAGGGCACCCTCAATGACCAGCCAGATGGATGTGATGGAAAAACGCCATCAAGAAATAATTGGAAAGCTTGACAGAGTTTAC
TCTATGCTAGGAGCCTTGGTGGACACTTTGAGGGAGATACACAAGCTTGACGACCCCCCAAACTCAAAATTCAAGATGCAAGGAGATGTTGGGACTAGTATTGACCCTAC
AACAAAAGACGGTGATGTGGAGGAAAAAGAAGAAAATGATGAAAAAGATGATCAAGATGACCACGAATTAGAGAAAAATCCTTCTCATCGAAGGGAAGACGACGATGGAG
GACCAACAGGTGGGAAACAGCAACAGGGCCCGACCACCCCCGGACCGACAACCCTTGTACAGACTGAAACTCGTGTAGATGGCGAAGGCACGGGAGATGGCGGGACAAAG
AAAACAGGAGGTGGTGAAGGCACCAAGGCCTGTGATGATGCCGACGAGACAATAAACAAGGCTATACTGTCAATAGATGAGGCCAAGGTGATTGAGAAGTTTAATAGGGA
CCGCAAGGGTAAAGCGGTTATGGTGGAAGGACCTCATACCATACCAAGAACCACAGTTCCACAACTTGGCCCCCGATGGTCTGCAAAGGTTAACGGTACCATAACCCGGG
GGGGAGCCAAGCGCTCAGATTGTGTGGAAGAAGTGGGGAGCCTGCAAGCCACAGGAATTTATGTGGACGCGATGAGGGGCACGTGGACAAAAGAATCGAGGGAATCCCTA
CCGCCAGAATTCTTCCAGCCGTCTTTTGATCTTCATCTCAGTCAGGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCCAAGGATAAGAACCACCCAACTAGAGCGAGTGACCATTTGAAGGCTGCAGGAGTAACCCCAGGAAGAAAACCCCGTAAACAAACATCCCCAATCACATTGGGGAG
CGAACAGGATTCTGGAGACGCCATGAGTACATCAGTTTCAGTCGCTAAGGGATCTGGCGAGAAGACGAAAGGGGTAAAAAGGGACAGAGGCGACGGAGGTTCGGGCAAAA
AAGTAACTCCAACAAAGAAAACAAAAGTTCACGAACGGACCAAGAAGACCAACGATGAGATTGAGAAGAAACCCACTGGGGCACGAAGCAATAAGAAGAAAACGCGAGCG
AAACAGACAAATGACACAGATAAGGCGAGCCCTGTGACACCAGAGATTGCCCTTGAAACAAGCGAGGACACAGCTAAAAATGACACCGAAGACACCGAATCTAATAGTGT
GACGAATGACAACTCCTCGAGTGATGACGTAGGGGAAGAACGAGAGAAAAAGAAGACAACAATTGCTAAAAAGGAACCTCCTAAAAAACAGAAGGGTGGAAAAAAAGGAA
AAAAGCTGAAGACCATGGTTGAAGGTGACACTGTCCGAGTGGACGACGATTACCTTATGTCGCCATCAAAAAGAAGTAAGGCCCTAAAGATTAACCTATGTTGCAGAACA
GAAATAATGGACACCATCAACACCATCTTAGGAGATAGGTGTAGAGACGCTTTCAGAAACACGTGCTTTGGCCACCTGCTTGACTTCTCGTTCAAAAAGACGTCTTCCCA
GTTACTATTGCACCTGATCCAGCATCAATGCAAACCCAAACGGACGTCGGAACTTTACTTCAAGATTGGGGGGAAAATCTTAAAGTTTGGTCTACGGGAGTTCGCACTAA
TTACGGGACTAAATTGTGGCCCATTGCCACAAATTGACACAGACAGACTACAAGATTCATCCAGGTTCAAGGATGAGTATTTTGCCAACGACGAAGGTGTCAGAAGAAAG
ACACTTAATATAGTATTCAACGCAATGAAGCATGGTGTCGAGGCAGACCTCGTAAAGATGGCGCAGTTGTATTGTTTGGAGAGCTTTTTGTTACCTAGGCAAGAAAAGGT
GCACATTGAAGAGGAACATGTCCTAATGGTTGAAGACCAAGAATTGTTCACCACCTACCCTTGGGGGCGCGCCGCCTTCACACTATTGACAAGCTACATGCATAAAGCAT
CCGTTAGTAGGGGCAGTGTTGGTATTGGAATGGGCGGATTAGTGTATGCCATCCTTGCATGGGCATACGAAGTGATACCTGCATTGAGCGCGCCACCGACCAACTACGCA
AAACGGATCAGAAATACAGTCCCCCGCATCATAAATTGGGAGGTAGAAGCTCAACCCGAATGGAGAGAACTGCACGTCAAGATATTCCAATCCTCATCGTTGGAGGTTGT
ACCATTGGACCCAACTGACACGGAAATGCAGATGTCGTACTTCCAACCTTTCTTGCAAGATGAGTTGGCTTCTCGGCGATTGGCCGGCGACGAACAACAAGTAGGCGACG
ATGTTCGAATCCCGCCGAACTTCTCAGTAAGGGCACCCTCAATGACCAGCCAGATGGATGTGATGGAAAAACGCCATCAAGAAATAATTGGAAAGCTTGACAGAGTTTAC
TCTATGCTAGGAGCCTTGGTGGACACTTTGAGGGAGATACACAAGCTTGACGACCCCCCAAACTCAAAATTCAAGATGCAAGGAGATGTTGGGACTAGTATTGACCCTAC
AACAAAAGACGGTGATGTGGAGGAAAAAGAAGAAAATGATGAAAAAGATGATCAAGATGACCACGAATTAGAGAAAAATCCTTCTCATCGAAGGGAAGACGACGATGGAG
GACCAACAGGTGGGAAACAGCAACAGGGCCCGACCACCCCCGGACCGACAACCCTTGTACAGACTGAAACTCGTGTAGATGGCGAAGGCACGGGAGATGGCGGGACAAAG
AAAACAGGAGGTGGTGAAGGCACCAAGGCCTGTGATGATGCCGACGAGACAATAAACAAGGCTATACTGTCAATAGATGAGGCCAAGGTGATTGAGAAGTTTAATAGGGA
CCGCAAGGGTAAAGCGGTTATGGTGGAAGGACCTCATACCATACCAAGAACCACAGTTCCACAACTTGGCCCCCGATGGTCTGCAAAGGTTAACGGTACCATAACCCGGG
GGGGAGCCAAGCGCTCAGATTGTGTGGAAGAAGTGGGGAGCCTGCAAGCCACAGGAATTTATGTGGACGCGATGAGGGGCACGTGGACAAAAGAATCGAGGGAATCCCTA
CCGCCAGAATTCTTCCAGCCGTCTTTTGATCTTCATCTCAGTCAGGGTTAA
Protein sequenceShow/hide protein sequence
MPKDKNHPTRASDHLKAAGVTPGRKPRKQTSPITLGSEQDSGDAMSTSVSVAKGSGEKTKGVKRDRGDGGSGKKVTPTKKTKVHERTKKTNDEIEKKPTGARSNKKKTRA
KQTNDTDKASPVTPEIALETSEDTAKNDTEDTESNSVTNDNSSSDDVGEEREKKKTTIAKKEPPKKQKGGKKGKKLKTMVEGDTVRVDDDYLMSPSKRSKALKINLCCRT
EIMDTINTILGDRCRDAFRNTCFGHLLDFSFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQIDTDRLQDSSRFKDEYFANDEGVRRK
TLNIVFNAMKHGVEADLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRAAFTLLTSYMHKASVSRGSVGIGMGGLVYAILAWAYEVIPALSAPPTNYA
KRIRNTVPRIINWEVEAQPEWRELHVKIFQSSSLEVVPLDPTDTEMQMSYFQPFLQDELASRRLAGDEQQVGDDVRIPPNFSVRAPSMTSQMDVMEKRHQEIIGKLDRVY
SMLGALVDTLREIHKLDDPPNSKFKMQGDVGTSIDPTTKDGDVEEKEENDEKDDQDDHELEKNPSHRREDDDGGPTGGKQQQGPTTPGPTTLVQTETRVDGEGTGDGGTK
KTGGGEGTKACDDADETINKAILSIDEAKVIEKFNRDRKGKAVMVEGPHTIPRTTVPQLGPRWSAKVNGTITRGGAKRSDCVEEVGSLQATGIYVDAMRGTWTKESRESL
PPEFFQPSFDLHLSQG