; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0003016 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0003016
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein Ycf2-like
Genome locationchr4:47409194..47413416
RNA-Seq ExpressionLag0003016
SyntenyLag0003016
Gene Ontology termsNA
InterPro domainsIPR015410 - Domain of unknown function DUF1985


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047596.1 protein Ycf2-like [Cucumis melo var. makuwa]1.7e-9133.81Show/hide
Query:  VSTRASDRLKAAGVTAGRKP-PEQTSPITLGSEQDSEEAMSTTVSVAKGSSEKTKGVKRDRDDGGPSKKVTPSKKTKVRDWTKKTNDEIEKPTETRSNKK
        + TR SDRL+AAG+T  RK  P     ++  SE+  E+ M      A+GS             GG  +    SKK +VR  TKK    ++K    +S   
Subjt:  VSTRASDRLKAAGVTAGRKP-PEQTSPITLGSEQDSEEAMSTTVSVAKGSSEKTKGVKRDRDDGGPSKKVTPSKKTKVRDWTKKTNDEIEKPTETRSNKK

Query:  TKRSKQTKNTDKASHVTPEVVPETSEDTTKHDTEDTESDSVTNDNSTSDEGEEQGKKKASLAKKEAPKKKKGGKKGKKLKTMVEEGDTVRVDDDYFMSPS
         KR  + K   +        V E S  +T     DT   S         +  E+GKK  ++ K+E  ++ +  +KGK      +  D+V     Y M   
Subjt:  TKRSKQTKNTDKASHVTPEVVPETSEDTTKHDTEDTESDSVTNDNSTSDEGEEQGKKKASLAKKEAPKKKKGGKKGKKLKTMVEEGDTVRVDDDYFMSPS

Query:  KRSKALKINLCCRTEIMDTINNILGDRCREAFRNTCFGHLLDFTFKKTSSQLLLHLIQHQCKPKRTPELYFKIGGKILKFGLREFALITGLNCGPLPQLD
        +R++ LKINL  ++ +++ I   LGDR    FR   FGH L+ +    SSQLLLHLIQ  CKPK T +L F IGG++L+FGLREFALITGL C  +P ++
Subjt:  KRSKALKINLCCRTEIMDTINNILGDRCREAFRNTCFGHLLDFTFKKTSSQLLLHLIQHQCKPKRTPELYFKIGGKILKFGLREFALITGLNCGPLPQLD

Query:  KDKLQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRVAFTLLTTYMQKASVS
         + +    R K  YF + + V R+ LN++FN    G + D +KMA+LY LESFL+P+QE   ++ +H++MV+D E+F  YPWGRVAF LL  +M +   S
Subjt:  KDKLQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRVAFTLLTTYMQKASVS

Query:  RGSVGIGMGGFVYAILAWAYEVIPALSAPPTNYARRIRNTVPRIINWEVEAQPEWRELHAKIFQSPSLKVVPLDPTDTKMQMPYFQPFLQDELASRRLAG
        +G  GI MGGF++ ILAWAYEVIP LS PP  +  RI N VPRIIN   + QP+W++L  K+F SP+L+V P+  T  ++ MP+F PF++ E    + A 
Subjt:  RGSVGIGMGGFVYAILAWAYEVIPALSAPPTNYARRIRNTVPRIINWEVEAQPEWRELHAKIFQSPSLKVVPLDPTDTKMQMPYFQPFLQDELASRRLAG

Query:  DNQQVEGDVRIPPNFSIG-APPMISQMDVMEKHHQEIIGKLDKVYSVLGALVDTLREIHELANPPNSKFKMPGDVGTGIDPTTK----------DDDVEG
        D  +   +     + S+    P  S+++V+ K  ++I     ++ + +  L++ L+ + EL    N++F+       GI  TTK          ++ ++ 
Subjt:  DNQQVEGDVRIPPNFSIG-APPMISQMDVMEKHHQEIIGKLDKVYSVLGALVDTLREIHELANPPNSKFKMPGDVGTGIDPTTK----------DDDVEG

Query:  KEETDEKDEQDDHGLE---KNPSHRREDDDGGPTGG--KQQQGSTT----PGPTTLVQTETRVDGEGTGDGRTKKTGGGEGT--KACDDVDETINKAILS
         EE  E+D+ +D  L+   +    +R+DD+     G   + QG ++     G + + ++ET             K G  E +  KA ++ DE IN+ I S
Subjt:  KEETDEKDEQDDHGLE---KNPSHRREDDDGGPTGG--KQQQGSTT----PGPTTLVQTETRVDGEGTGDGRTKKTGGGEGT--KACDDVDETINKAILS

Query:  IDEA----KVIEKFNRDRKGKAVMMGGPHTIPRTTIPQLGP-RWSAKAKGIKIKEPTTPLIQN--------RTPLREVNGTITRGGAKRSDCVE-GVGSL
        IDE+    ++ +K  R R G+  +   P  + R   P  G  +   +   I  K P+   + N        R  L+ +N T    G KRS+  E    + 
Subjt:  IDEA----KVIEKFNRDRKGKAVMMGGPHTIPRTTIPQLGP-RWSAKAKGIKIKEPTTPLIQN--------RTPLREVNGTITRGGAKRSDCVE-GVGSL

Query:  QATGIYVDAMRGTWTKESMETLPPEFFQPSFDLHLSQ
         +TGI++DA+RG   +   +        PSFDLHLSQ
Subjt:  QATGIYVDAMRGTWTKESMETLPPEFFQPSFDLHLSQ

KGN48800.2 hypothetical protein Csa_003918 [Cucumis sativus]3.0e-6438.76Show/hide
Query:  VRDWTKKTNDEIEKPTETRSNKKTKRSKQTKNTDKASHVTPEVVPETSEDTTKHDTEDTESDSVTNDNSTSDEGEEQGKKKASLAKK--EAPKKKKGGKK
        ++D    T +E + P     ++K K+ K       AS    E   ETSE        DT+ DS     +       Q KK A  +KK  E  K+KK GK+
Subjt:  VRDWTKKTNDEIEKPTETRSNKKTKRSKQTKNTDKASHVTPEVVPETSEDTTKHDTEDTESDSVTNDNSTSDEGEEQGKKKASLAKK--EAPKKKKGGKK

Query:  GKK---LKTMVEEGDTVRVDDDY--FMSPSKRSKALKINLCCRTEIMDTINNILGDRCREAFRNTCFGHLLDFTFKKTSSQLLLHLIQHQCKPKRTPELY
        GKK     T  E  D V V  +Y   +  S  +   +INL  + +++  I N L +R  + F+ +CFG+ LD    K SSQL  HLI+ QC  K   EL+
Subjt:  GKK---LKTMVEEGDTVRVDDDY--FMSPSKRSKALKINLCCRTEIMDTINNILGDRCREAFRNTCFGHLLDFTFKKTSSQLLLHLIQHQCKPKRTPELY

Query:  FKIGGKILKFGLREFALITGLNCGPLPQLDKDKLQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMAQLYCLESFLLPRQEKVHIEEEHVLM
        F + G+I KFG+++FALITGLNCG LP +D  K+Q   +F   YF  ++ +RR  L+ VF  M  G   D+VKMA+LY LE F+L +Q +  I  E+ L+
Subjt:  FKIGGKILKFGLREFALITGLNCGPLPQLDKDKLQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMAQLYCLESFLLPRQEKVHIEEEHVLM

Query:  VEDQELFTTYPWGRVAFTLLTTYMQKASVSRGSVGIGMGGFVYAILAWAYEVIPALSAPPTNYARRIRNTVPRIINWEVEAQPEWRELHAKIFQSPSLKV
        ++D++ F +YPWGR+++ +   +++K+  S  +  IG+GGF YA+L WAYE IP L+      A RI    PR+ NW     PEW++L  K+FQS +  V
Subjt:  VEDQELFTTYPWGRVAFTLLTTYMQKASVSRGSVGIGMGGFVYAILAWAYEVIPALSAPPTNYARRIRNTVPRIINWEVEAQPEWRELHAKIFQSPSLKV

Query:  VPLDPTDTKMQMPYFQPF
         PL  T T+M+MPY  PF
Subjt:  VPLDPTDTKMQMPYFQPF

TYK09852.1 protein Ycf2-like [Cucumis melo var. makuwa]3.0e-7235.21Show/hide
Query:  CKPKRTPELYFKIGGKILKFGLREFALITGLNCGPLPQLDKDKLQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMAQLYCLESFLLPRQEK
        CKPK T +L F IG ++L+FGLREFALITGL C  +P ++ + ++   R K  YF + + V R+ LN++FN    G + D +KMA+LY LESFL+P+QE 
Subjt:  CKPKRTPELYFKIGGKILKFGLREFALITGLNCGPLPQLDKDKLQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMAQLYCLESFLLPRQEK

Query:  VHIEEEHVLMVEDQELFTTYPWGRVAFTLLTTYMQKASVSRGSVGIGM-GGFVYAILAWAYEVIPALSAPPTNYARRIRNTVPRIINWEVEAQPEWRELH
        + ++ +H++MV+D E+F  YPWGRVAF LL  +M +A  S+G  GI M GGF++ ILAWAYEVIP LS PP  +A RI N VPRIINW  + QP+W++L 
Subjt:  VHIEEEHVLMVEDQELFTTYPWGRVAFTLLTTYMQKASVSRGSVGIGM-GGFVYAILAWAYEVIPALSAPPTNYARRIRNTVPRIINWEVEAQPEWRELH

Query:  AKIFQSPSLKVVPLDPTDTKMQMPYFQPFLQDELASRRLAGDNQQVEGDVRIPPNFSIG-APPMISQMDVMEKHHQEIIGKLDKVYSVLGALVDTLREIH
         K+F SP+L+V P+  T  ++ MP+F PF + E    + A D  +   +     + S+    P  S+++V+ K  ++I     ++ + +  L++ L+ + 
Subjt:  AKIFQSPSLKVVPLDPTDTKMQMPYFQPFLQDELASRRLAGDNQQVEGDVRIPPNFSIG-APPMISQMDVMEKHHQEIIGKLDKVYSVLGALVDTLREIH

Query:  ELANPPNSKFKMPGDVGTGIDPTTKDDDVEGKEETDEKDEQDDHGLEKNPSHR----REDDDGGPTGGK---QQQGSTT----PGPTTLVQTETRVDGEG
        EL    N++F+  G           ++ ++  EE  E+D+ +D  L  + S+R    + DDD    G +   + QG ++     G + + ++ET      
Subjt:  ELANPPNSKFKMPGDVGTGIDPTTKDDDVEGKEETDEKDEQDDHGLEKNPSHR----REDDDGGPTGGK---QQQGSTT----PGPTTLVQTETRVDGEG

Query:  TGDGRTKKTGGGEGTKACDDVDETINKAILSIDEA----KVIEKFNRDRKGKAVMMGGPHTIPRTTIPQLGP-RWSAKAKGIKIKEPTTPLIQN------
         GD  + +       KA ++ DE IN+ I SIDE+    ++ +K  R R G+  +   P  + R   P  G  +   +   I  K P+   + N      
Subjt:  TGDGRTKKTGGGEGTKACDDVDETINKAILSIDEA----KVIEKFNRDRKGKAVMMGGPHTIPRTTIPQLGP-RWSAKAKGIKIKEPTTPLIQN------

Query:  --RTPLREVNGTITRGGAKRSDCVE-GVGSLQATGIYVDAMRGTWTKESMETLPPEFFQPSFDLHLSQ
          R  L+ +N T    G KRS+  E    +  +TGI++DA+RG   +   +        PSFDLHLSQ
Subjt:  --RTPLREVNGTITRGGAKRSDCVE-GVGSLQATGIYVDAMRGTWTKESMETLPPEFFQPSFDLHLSQ

XP_031743199.1 uncharacterized protein LOC101221625 isoform X11 [Cucumis sativus]3.0e-6438.76Show/hide
Query:  VRDWTKKTNDEIEKPTETRSNKKTKRSKQTKNTDKASHVTPEVVPETSEDTTKHDTEDTESDSVTNDNSTSDEGEEQGKKKASLAKK--EAPKKKKGGKK
        ++D    T +E + P     ++K K+ K       AS    E   ETSE        DT+ DS     +       Q KK A  +KK  E  K+KK GK+
Subjt:  VRDWTKKTNDEIEKPTETRSNKKTKRSKQTKNTDKASHVTPEVVPETSEDTTKHDTEDTESDSVTNDNSTSDEGEEQGKKKASLAKK--EAPKKKKGGKK

Query:  GKK---LKTMVEEGDTVRVDDDY--FMSPSKRSKALKINLCCRTEIMDTINNILGDRCREAFRNTCFGHLLDFTFKKTSSQLLLHLIQHQCKPKRTPELY
        GKK     T  E  D V V  +Y   +  S  +   +INL  + +++  I N L +R  + F+ +CFG+ LD    K SSQL  HLI+ QC  K   EL+
Subjt:  GKK---LKTMVEEGDTVRVDDDY--FMSPSKRSKALKINLCCRTEIMDTINNILGDRCREAFRNTCFGHLLDFTFKKTSSQLLLHLIQHQCKPKRTPELY

Query:  FKIGGKILKFGLREFALITGLNCGPLPQLDKDKLQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMAQLYCLESFLLPRQEKVHIEEEHVLM
        F + G+I KFG+++FALITGLNCG LP +D  K+Q   +F   YF  ++ +RR  L+ VF  M  G   D+VKMA+LY LE F+L +Q +  I  E+ L+
Subjt:  FKIGGKILKFGLREFALITGLNCGPLPQLDKDKLQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMAQLYCLESFLLPRQEKVHIEEEHVLM

Query:  VEDQELFTTYPWGRVAFTLLTTYMQKASVSRGSVGIGMGGFVYAILAWAYEVIPALSAPPTNYARRIRNTVPRIINWEVEAQPEWRELHAKIFQSPSLKV
        ++D++ F +YPWGR+++ +   +++K+  S  +  IG+GGF YA+L WAYE IP L+      A RI    PR+ NW     PEW++L  K+FQS +  V
Subjt:  VEDQELFTTYPWGRVAFTLLTTYMQKASVSRGSVGIGMGGFVYAILAWAYEVIPALSAPPTNYARRIRNTVPRIINWEVEAQPEWRELHAKIFQSPSLKV

Query:  VPLDPTDTKMQMPYFQPF
         PL  T T+M+MPY  PF
Subjt:  VPLDPTDTKMQMPYFQPF

XP_038883716.1 uncharacterized protein LOC120074618 isoform X2 [Benincasa hispida]1.0e-6434.26Show/hide
Query:  EIEKPTETRSNKKTKRSKQTKNTDKASHVTPEVVPETSEDTTKHDTEDTESDSVTNDNSTSDEGEEQGKKKASLAKKEAPKKKKGGKKGKK---LKTMVE
        E   PT   + +K K+ K  +               TS +  +H+ E +E D  T+ +S  + G +Q K  +  +K    K+KK  K+GKK     T  E
Subjt:  EIEKPTETRSNKKTKRSKQTKNTDKASHVTPEVVPETSEDTTKHDTEDTESDSVTNDNSTSDEGEEQGKKKASLAKKEAPKKKKGGKKGKK---LKTMVE

Query:  EGDTVRVDDDY--FMSPSKRSKALKINLCCRTEIMDTINNILGDRCREAFRNTCFGHLLDFTFKKTSSQLLLHLIQHQCKPKRTPELYFKIGGKILKFGL
          D V V  +Y   +  S  S   +INL  + +++  I N L +R  + F+ +CFG  LD    K SSQL  HL++ QC      EL+F + G+I KFG+
Subjt:  EGDTVRVDDDY--FMSPSKRSKALKINLCCRTEIMDTINNILGDRCREAFRNTCFGHLLDFTFKKTSSQLLLHLIQHQCKPKRTPELYFKIGGKILKFGL

Query:  REFALITGLNCGPLPQLDKDKLQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPW
        +EF+LITGLNCG LP++D  K+Q   +F   YF  ++ ++R  L+ VF  M  G   D+VKMA+LY LE F+L +Q +  I  E+ L+V+D+E F  YPW
Subjt:  REFALITGLNCGPLPQLDKDKLQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPW

Query:  GRVAFTLLTTYMQKASVSRGSVGIGMGGFVYAILAWAYEVIPALSAPPTNYARRIRNTVPRIINWEVEAQPEWRELHAKIFQSPSLKVVPLDPTDTKMQM
        GR+++ +   +++KA  S  +  IG+GG  +A+L WAYE IP L       A R+    PR+ NW  +  PEW++L  K+FQS S  V PL  T T+M+M
Subjt:  GRVAFTLLTTYMQKASVSRGSVGIGMGGFVYAILAWAYEVIPALSAPPTNYARRIRNTVPRIINWEVEAQPEWRELHAKIFQSPSLKVVPLDPTDTKMQM

Query:  PYFQPFLQDELASRRLAGD-NQQVEGDVRIPPN---FSIGAPPMISQMDVMEKHHQEIIGKLDKVYSVLGALVDTLREIHELANPPNSKFKMPGDVGTGI
        PY  PF   + +  ++    +Q+   D R   N    +   PP +    V       +  K+  +  +LG+LV      H++ N  N   K+  +V    
Subjt:  PYFQPFLQDELASRRLAGD-NQQVEGDVRIPPN---FSIGAPPMISQMDVMEKHHQEIIGKLDKVYSVLGALVDTLREIHELANPPNSKFKMPGDVGTGI

Query:  DP
        DP
Subjt:  DP

TrEMBL top hitse value%identityAlignment
A0A0A0KI50 TF-B3 domain-containing protein1.4e-6438.76Show/hide
Query:  VRDWTKKTNDEIEKPTETRSNKKTKRSKQTKNTDKASHVTPEVVPETSEDTTKHDTEDTESDSVTNDNSTSDEGEEQGKKKASLAKK--EAPKKKKGGKK
        ++D    T +E + P     ++K K+ K       AS    E   ETSE        DT+ DS     +       Q KK A  +KK  E  K+KK GK+
Subjt:  VRDWTKKTNDEIEKPTETRSNKKTKRSKQTKNTDKASHVTPEVVPETSEDTTKHDTEDTESDSVTNDNSTSDEGEEQGKKKASLAKK--EAPKKKKGGKK

Query:  GKK---LKTMVEEGDTVRVDDDY--FMSPSKRSKALKINLCCRTEIMDTINNILGDRCREAFRNTCFGHLLDFTFKKTSSQLLLHLIQHQCKPKRTPELY
        GKK     T  E  D V V  +Y   +  S  +   +INL  + +++  I N L +R  + F+ +CFG+ LD    K SSQL  HLI+ QC  K   EL+
Subjt:  GKK---LKTMVEEGDTVRVDDDY--FMSPSKRSKALKINLCCRTEIMDTINNILGDRCREAFRNTCFGHLLDFTFKKTSSQLLLHLIQHQCKPKRTPELY

Query:  FKIGGKILKFGLREFALITGLNCGPLPQLDKDKLQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMAQLYCLESFLLPRQEKVHIEEEHVLM
        F + G+I KFG+++FALITGLNCG LP +D  K+Q   +F   YF  ++ +RR  L+ VF  M  G   D+VKMA+LY LE F+L +Q +  I  E+ L+
Subjt:  FKIGGKILKFGLREFALITGLNCGPLPQLDKDKLQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMAQLYCLESFLLPRQEKVHIEEEHVLM

Query:  VEDQELFTTYPWGRVAFTLLTTYMQKASVSRGSVGIGMGGFVYAILAWAYEVIPALSAPPTNYARRIRNTVPRIINWEVEAQPEWRELHAKIFQSPSLKV
        ++D++ F +YPWGR+++ +   +++K+  S  +  IG+GGF YA+L WAYE IP L+      A RI    PR+ NW     PEW++L  K+FQS +  V
Subjt:  VEDQELFTTYPWGRVAFTLLTTYMQKASVSRGSVGIGMGGFVYAILAWAYEVIPALSAPPTNYARRIRNTVPRIINWEVEAQPEWRELHAKIFQSPSLKV

Query:  VPLDPTDTKMQMPYFQPF
         PL  T T+M+MPY  PF
Subjt:  VPLDPTDTKMQMPYFQPF

A0A5A7U047 Protein Ycf2-like8.1e-9233.81Show/hide
Query:  VSTRASDRLKAAGVTAGRKP-PEQTSPITLGSEQDSEEAMSTTVSVAKGSSEKTKGVKRDRDDGGPSKKVTPSKKTKVRDWTKKTNDEIEKPTETRSNKK
        + TR SDRL+AAG+T  RK  P     ++  SE+  E+ M      A+GS             GG  +    SKK +VR  TKK    ++K    +S   
Subjt:  VSTRASDRLKAAGVTAGRKP-PEQTSPITLGSEQDSEEAMSTTVSVAKGSSEKTKGVKRDRDDGGPSKKVTPSKKTKVRDWTKKTNDEIEKPTETRSNKK

Query:  TKRSKQTKNTDKASHVTPEVVPETSEDTTKHDTEDTESDSVTNDNSTSDEGEEQGKKKASLAKKEAPKKKKGGKKGKKLKTMVEEGDTVRVDDDYFMSPS
         KR  + K   +        V E S  +T     DT   S         +  E+GKK  ++ K+E  ++ +  +KGK      +  D+V     Y M   
Subjt:  TKRSKQTKNTDKASHVTPEVVPETSEDTTKHDTEDTESDSVTNDNSTSDEGEEQGKKKASLAKKEAPKKKKGGKKGKKLKTMVEEGDTVRVDDDYFMSPS

Query:  KRSKALKINLCCRTEIMDTINNILGDRCREAFRNTCFGHLLDFTFKKTSSQLLLHLIQHQCKPKRTPELYFKIGGKILKFGLREFALITGLNCGPLPQLD
        +R++ LKINL  ++ +++ I   LGDR    FR   FGH L+ +    SSQLLLHLIQ  CKPK T +L F IGG++L+FGLREFALITGL C  +P ++
Subjt:  KRSKALKINLCCRTEIMDTINNILGDRCREAFRNTCFGHLLDFTFKKTSSQLLLHLIQHQCKPKRTPELYFKIGGKILKFGLREFALITGLNCGPLPQLD

Query:  KDKLQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRVAFTLLTTYMQKASVS
         + +    R K  YF + + V R+ LN++FN    G + D +KMA+LY LESFL+P+QE   ++ +H++MV+D E+F  YPWGRVAF LL  +M +   S
Subjt:  KDKLQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRVAFTLLTTYMQKASVS

Query:  RGSVGIGMGGFVYAILAWAYEVIPALSAPPTNYARRIRNTVPRIINWEVEAQPEWRELHAKIFQSPSLKVVPLDPTDTKMQMPYFQPFLQDELASRRLAG
        +G  GI MGGF++ ILAWAYEVIP LS PP  +  RI N VPRIIN   + QP+W++L  K+F SP+L+V P+  T  ++ MP+F PF++ E    + A 
Subjt:  RGSVGIGMGGFVYAILAWAYEVIPALSAPPTNYARRIRNTVPRIINWEVEAQPEWRELHAKIFQSPSLKVVPLDPTDTKMQMPYFQPFLQDELASRRLAG

Query:  DNQQVEGDVRIPPNFSIG-APPMISQMDVMEKHHQEIIGKLDKVYSVLGALVDTLREIHELANPPNSKFKMPGDVGTGIDPTTK----------DDDVEG
        D  +   +     + S+    P  S+++V+ K  ++I     ++ + +  L++ L+ + EL    N++F+       GI  TTK          ++ ++ 
Subjt:  DNQQVEGDVRIPPNFSIG-APPMISQMDVMEKHHQEIIGKLDKVYSVLGALVDTLREIHELANPPNSKFKMPGDVGTGIDPTTK----------DDDVEG

Query:  KEETDEKDEQDDHGLE---KNPSHRREDDDGGPTGG--KQQQGSTT----PGPTTLVQTETRVDGEGTGDGRTKKTGGGEGT--KACDDVDETINKAILS
         EE  E+D+ +D  L+   +    +R+DD+     G   + QG ++     G + + ++ET             K G  E +  KA ++ DE IN+ I S
Subjt:  KEETDEKDEQDDHGLE---KNPSHRREDDDGGPTGG--KQQQGSTT----PGPTTLVQTETRVDGEGTGDGRTKKTGGGEGT--KACDDVDETINKAILS

Query:  IDEA----KVIEKFNRDRKGKAVMMGGPHTIPRTTIPQLGP-RWSAKAKGIKIKEPTTPLIQN--------RTPLREVNGTITRGGAKRSDCVE-GVGSL
        IDE+    ++ +K  R R G+  +   P  + R   P  G  +   +   I  K P+   + N        R  L+ +N T    G KRS+  E    + 
Subjt:  IDEA----KVIEKFNRDRKGKAVMMGGPHTIPRTTIPQLGP-RWSAKAKGIKIKEPTTPLIQN--------RTPLREVNGTITRGGAKRSDCVE-GVGSL

Query:  QATGIYVDAMRGTWTKESMETLPPEFFQPSFDLHLSQ
         +TGI++DA+RG   +   +        PSFDLHLSQ
Subjt:  QATGIYVDAMRGTWTKESMETLPPEFFQPSFDLHLSQ

A0A5A7U6E1 Protein Ycf2-like3.2e-6431.48Show/hide
Query:  CKPKRTPELYFKIGGKILKFGLREFALITGLNCGPLPQLDKDKLQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMAQLYCLESFLLPRQEK
        CKPK T +L F IGG++L+FGLREFALITGL C  +P ++ D ++   R K  YF + + V R+ LN++FN      + D +KMA+LY LESFL+P+QE 
Subjt:  CKPKRTPELYFKIGGKILKFGLREFALITGLNCGPLPQLDKDKLQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMAQLYCLESFLLPRQEK

Query:  VHIEEEHVLMVEDQELFTTYPWGRVAFTLLTTYMQKASVSRGSVGIGMGGFVYAILAWAYEVIPALSAPPTNYARRIRNTVPRIINWEVEAQPEWRELHA
        + ++ +H++MV+D E+F  YPWGRVAF LL  +M +   S+G  GI + GF++ ILAWAYEV P LS P   +A RI N VPRIINW  + QP+W++L  
Subjt:  VHIEEEHVLMVEDQELFTTYPWGRVAFTLLTTYMQKASVSRGSVGIGMGGFVYAILAWAYEVIPALSAPPTNYARRIRNTVPRIINWEVEAQPEWRELHA

Query:  KIFQSPSLKVVPLDPTDTKMQMPYFQPFLQDELASRRLAGDNQQVEGDVRIPPNFSIG-APPMISQMDVMEKHHQEIIGKLDKVYSVLGALVDTLREIHE
        K+F SP+L+V P+  T  ++ MP+F PF++ E    + A D  +   +     + S+    P  S+++++ K     I K  +      +     +  HE
Subjt:  KIFQSPSLKVVPLDPTDTKMQMPYFQPFLQDELASRRLAGDNQQVEGDVRIPPNFSIG-APPMISQMDVMEKHHQEIIGKLDKVYSVLGALVDTLREIHE

Query:  LANPPNSKFKMPGDVGTGIDPTTKDDDVEG----------KEETDEKDEQDDHGLEKNPSHRREDDDGGPTGGKQQQGSTTPGPTTLVQTETRVDGEGTG
               +  +       ++   ++DDVE            E+ D+ +++D  GL       R +   G  GG+ +   +   PT              G
Subjt:  LANPPNSKFKMPGDVGTGIDPTTKDDDVEG----------KEETDEKDEQDDHGLEKNPSHRREDDDGGPTGGKQQQGSTTPGPTTLVQTETRVDGEGTG

Query:  DGRTKKTGGGEGTKACDDVDETINKAILSIDEAKVIEKFNRDRKGK--------------AVMMGGPHTIPRTTIPQLG-----PRWSAKAKGIKIKEPT
        D  + +       KA ++  E IN+ I  IDE+ + +K  +  +G+                 +     + R   P  G     P+ +     +   +  
Subjt:  DGRTKKTGGGEGTKACDDVDETINKAILSIDEAKVIEKFNRDRKGK--------------AVMMGGPHTIPRTTIPQLG-----PRWSAKAKGIKIKEPT

Query:  TPLIQNRTP----LREVNGTITRGGAKRSDCVE-GVGSLQATGIYVDAMRGTWTKESMETLPPEFFQPSFDLHLS
        + + +NR P    L+ +N T    G KRS+  E    +  +TGI++DA+R        +        PSFDLHLS
Subjt:  TPLIQNRTP----LREVNGTITRGGAKRSDCVE-GVGSLQATGIYVDAMRGTWTKESMETLPPEFFQPSFDLHLS

A0A5D3CEX9 Protein Ycf2-like1.4e-7235.21Show/hide
Query:  CKPKRTPELYFKIGGKILKFGLREFALITGLNCGPLPQLDKDKLQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMAQLYCLESFLLPRQEK
        CKPK T +L F IG ++L+FGLREFALITGL C  +P ++ + ++   R K  YF + + V R+ LN++FN    G + D +KMA+LY LESFL+P+QE 
Subjt:  CKPKRTPELYFKIGGKILKFGLREFALITGLNCGPLPQLDKDKLQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMAQLYCLESFLLPRQEK

Query:  VHIEEEHVLMVEDQELFTTYPWGRVAFTLLTTYMQKASVSRGSVGIGM-GGFVYAILAWAYEVIPALSAPPTNYARRIRNTVPRIINWEVEAQPEWRELH
        + ++ +H++MV+D E+F  YPWGRVAF LL  +M +A  S+G  GI M GGF++ ILAWAYEVIP LS PP  +A RI N VPRIINW  + QP+W++L 
Subjt:  VHIEEEHVLMVEDQELFTTYPWGRVAFTLLTTYMQKASVSRGSVGIGM-GGFVYAILAWAYEVIPALSAPPTNYARRIRNTVPRIINWEVEAQPEWRELH

Query:  AKIFQSPSLKVVPLDPTDTKMQMPYFQPFLQDELASRRLAGDNQQVEGDVRIPPNFSIG-APPMISQMDVMEKHHQEIIGKLDKVYSVLGALVDTLREIH
         K+F SP+L+V P+  T  ++ MP+F PF + E    + A D  +   +     + S+    P  S+++V+ K  ++I     ++ + +  L++ L+ + 
Subjt:  AKIFQSPSLKVVPLDPTDTKMQMPYFQPFLQDELASRRLAGDNQQVEGDVRIPPNFSIG-APPMISQMDVMEKHHQEIIGKLDKVYSVLGALVDTLREIH

Query:  ELANPPNSKFKMPGDVGTGIDPTTKDDDVEGKEETDEKDEQDDHGLEKNPSHR----REDDDGGPTGGK---QQQGSTT----PGPTTLVQTETRVDGEG
        EL    N++F+  G           ++ ++  EE  E+D+ +D  L  + S+R    + DDD    G +   + QG ++     G + + ++ET      
Subjt:  ELANPPNSKFKMPGDVGTGIDPTTKDDDVEGKEETDEKDEQDDHGLEKNPSHR----REDDDGGPTGGK---QQQGSTT----PGPTTLVQTETRVDGEG

Query:  TGDGRTKKTGGGEGTKACDDVDETINKAILSIDEA----KVIEKFNRDRKGKAVMMGGPHTIPRTTIPQLGP-RWSAKAKGIKIKEPTTPLIQN------
         GD  + +       KA ++ DE IN+ I SIDE+    ++ +K  R R G+  +   P  + R   P  G  +   +   I  K P+   + N      
Subjt:  TGDGRTKKTGGGEGTKACDDVDETINKAILSIDEA----KVIEKFNRDRKGKAVMMGGPHTIPRTTIPQLGP-RWSAKAKGIKIKEPTTPLIQN------

Query:  --RTPLREVNGTITRGGAKRSDCVE-GVGSLQATGIYVDAMRGTWTKESMETLPPEFFQPSFDLHLSQ
          R  L+ +N T    G KRS+  E    +  +TGI++DA+RG   +   +        PSFDLHLSQ
Subjt:  --RTPLREVNGTITRGGAKRSDCVE-GVGSLQATGIYVDAMRGTWTKESMETLPPEFFQPSFDLHLSQ

A0A5D3DKA6 Protein Ycf2-like5.5e-6432.46Show/hide
Query:  CKPKRTPELYFKIGGKILKFGLREFALITGLNCGPLPQLDKDKLQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMAQLYCLESFLLPRQEK
        CKPK T +L F IGG++L+FGLREFALITGL C  +P ++ D ++   R K  YF + + V R+ LN++FN      + D +KMA+LY LESFL+P+QE 
Subjt:  CKPKRTPELYFKIGGKILKFGLREFALITGLNCGPLPQLDKDKLQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMAQLYCLESFLLPRQEK

Query:  VHIEEEHVLMVEDQELFTTYPWGRVAFTLLTTYMQKASVSRGSVGIGMGGFVYAILAWAYEVIPALSAPPTNYARRIRNTVPRIINWEVEAQPEWRELHA
        + ++ +H++MV+D E+F  YPWGRVAF LL  +M +   S+G  GI + GF++ ILAWAYEV P LS P   +A RI N VPRIINW  + QP+W++L  
Subjt:  VHIEEEHVLMVEDQELFTTYPWGRVAFTLLTTYMQKASVSRGSVGIGMGGFVYAILAWAYEVIPALSAPPTNYARRIRNTVPRIINWEVEAQPEWRELHA

Query:  KIFQSPSLKVVPLDPTDTKMQMPYFQPFLQDELASRRLAGDNQQVEGDVRIPPNFSIG-APPMISQMDVMEKHHQEIIGK-LDKVYSVLGALVDTLREIH
        K+F SP+L+V P+  T  ++ MP+F PF++ E    + A D  +   +     + S+    P  S+++++ K       + LDKV        D   +  
Subjt:  KIFQSPSLKVVPLDPTDTKMQMPYFQPFLQDELASRRLAGDNQQVEGDVRIPPNFSIG-APPMISQMDVMEKHHQEIIGK-LDKVYSVLGALVDTLREIH

Query:  ELANPPNSKFKMPGDVGTGIDPTTKDDDVEGKEETDEKDEQDDHGLEKNPSHRREDDDGGPTGGKQQQGSTTPGPTTLVQTETRVDGEGTGDGRTKKTGG
        E  N  +S         T ++    D+D +GK   DE   +   G            DGG +  K ++  T P                 GD  + +   
Subjt:  ELANPPNSKFKMPGDVGTGIDPTTKDDDVEGKEETDEKDEQDDHGLEKNPSHRREDDDGGPTGGKQQQGSTTPGPTTLVQTETRVDGEGTGDGRTKKTGG

Query:  GEGTKACDDVDETINKAILSIDEAKVIEKFNRDRK------------------GKAVMMGGPHTIPRTTIPQLG-----PRWSAKAKGIKIKEPTTPLIQ
            KA ++  E IN+ I  IDE+ + +K  +  +                  G+  +   P  + R   P  G     P+ +     +   +  + + +
Subjt:  GEGTKACDDVDETINKAILSIDEAKVIEKFNRDRK------------------GKAVMMGGPHTIPRTTIPQLG-----PRWSAKAKGIKIKEPTTPLIQ

Query:  NRTP----LREVNGTITRGGAKRSDCVE-GVGSLQATGIYVDAMRGTWTKESMETLPPEFFQPSFDLHLS
        NR P    L+ +N T    G KRS+  E    +  +TGI++DA+R        +        PSFDLHLS
Subjt:  NRTP----LREVNGTITRGGAKRSDCVE-GVGSLQATGIYVDAMRGTWTKESMETLPPEFFQPSFDLHLS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G31150.1 Domain of unknown function (DUF1985)7.9e-1529.73Show/hide
Query:  KINLCCRTEIMDTINNIL-GDRCREAFRNTCFGHLLDFTFKKTS-SQLLLH-LIQHQCKPKRTPELYFKIGGKILKFGLREFALITGLNCGPLPQLDKDK
        ++N+  R E + TI N+L G    E  +++ FG L +F   + S S  L+H L+  Q   K+  EL+F  GG  ++F +REF ++TGL CG LP  D+ K
Subjt:  KINLCCRTEIMDTINNIL-GDRCREAFRNTCFGHLLDFTFKKTS-SQLLLH-LIQHQCKPKRTPELYFKIGGKILKFGLREFALITGLNCGPLPQLDKDK

Query:  LQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMA-QLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRVAF
            S++   +       R  T+  V   ++    +   K+   L  +   ++   ++  +  + V M+ D + F  YPWGR AF
Subjt:  LQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMA-QLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRVAF

AT1G36970.1 Domain of unknown function (DUF1985)1.4e-0832.85Show/hide
Query:  VDDDYFMSP----SKRSKALKINLCCRTEIMDTINNIL-GDRCREAFRNTCFGHLLDFTFKKTS-SQLLLH-LIQHQCKPKRTPELYFKIGGKILKFGLR
        +D+D  + P    + R    ++N+  R +I+  I ++L G +  E   ++CFG L      + S S  L+H L+  Q   K+  EL    GG+ L+F L 
Subjt:  VDDDYFMSP----SKRSKALKINLCCRTEIMDTINNIL-GDRCREAFRNTCFGHLLDFTFKKTS-SQLLLH-LIQHQCKPKRTPELYFKIGGKILKFGLR

Query:  EFALITGLNCGPLP-QLDKDKLQDSSRFKDEYFADDE
        EF  +TGL CG  P + D D      + K E F DDE
Subjt:  EFALITGLNCGPLP-QLDKDKLQDSSRFKDEYFADDE

AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases8.0e-0730.39Show/hide
Query:  MKHGVETD---LVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRVAFTLLTTYMQKASVSR-GSVGIGMGGFVYAILAWAYEVIPALSA
        +K  V TD    ++ A L  ++ FLLP      I ++H  M ED + F +YPWGR++F ++ T +++  V +  +  + + G +YA+     E +PA+  
Subjt:  MKHGVETD---LVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRVAFTLLTTYMQKASVSR-GSVGIGMGGFVYAILAWAYEVIPALSA

Query:  PP
         P
Subjt:  PP

AT5G45570.1 Ulp1 protease family protein6.1e-0723.5Show/hide
Query:  CRTEIMDTINNILGDRCREAFRNTCFGHLLDFTFKK--TSSQLLLHLIQHQCKPKRTPELYFKIGGKILKFGLREFALITGLNCGPLPQLDKDKLQDSSR
        C    +  I + LG    +  + T  G  + FT      ++Q +   + +Q +     E++  I  + ++F L EF  ITGLNC    + D  +      
Subjt:  CRTEIMDTINNILGDRCREAFRNTCFGHLLDFTFKK--TSSQLLLHLIQHQCKPKRTPELYFKIGGKILKFGLREFALITGLNCGPLPQLDKDKLQDSSR

Query:  FKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRVAFTLLTTYMQKASVSRGSVGIGMG
        + +   +   G     L  VF   K       + + +L  L   +        +       V D   F  YPWGRVAF  L+  ++       S  I   
Subjt:  FKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRVAFTLLTTYMQKASVSRGSVGIGMG

Query:  GFVYAILAWAYEVIPAL
        G V  +L W YE +P +
Subjt:  GFVYAILAWAYEVIPAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATATCCAGGCTTGCCCCTCTGGTTTAACAAATCGTGCATGCGCCTTCGTGAATACCATTTTCCAAAGAAAAACCCCCCTTCGAACAACCTGCTTCCTCTCCGTTGT
TCGTGCTCCCTTTCCGACGATCGTTCGACGCTCCCTCTCCGACGTCGTTCGCAGCCCGCTCTCAGACGCCATCCGAAGCTCCCCCATCCGAAGCTCCCCCTCCGTTCAAC
CCCCCACCTTCGACCGCTCAGCCTCCCTCGCACAAACCGTTCATGCATTTGATCCCGCCGCCGTTCTTGCGCCTGGTCCCACCGCCGAACGTTTGCATTCCACCCATCTG
TGCTCGCCGTACAGCCCCCCTCAGAGCGTAGAACATTTGCGAGGTATGCCCAAGGATAAGCTCGTGTCAACTAGAGCGAGCGATCGTTTGAAGGCTGCGGGAGTAACGGC
AGGAAGAAAACCCCCGGAACAAACGTCCCCAATCACATTGGGGAGCGAACAGGACTCTGAAGAAGCCATGAGTACAACAGTTTCAGTCGCTAAGGGATCCAGCGAAAAGA
CGAAAGGGGTAAAAAGGGACAGAGACGATGGAGGTCCGAGCAAAAAAGTAACTCCATCAAAGAAAACAAAAGTTCGCGACTGGACCAAGAAGACCAACGATGAGATTGAG
AAACCCACTGAGACACGAAGCAATAAGAAGACAAAGCGGTCGAAACAGACAAAAAACACAGATAAGGCGAGCCATGTGACACCAGAAGTTGTCCCTGAAACAAGCGAGGA
CACCACTAAACATGACACTGAAGACACCGAATCTGATAGTGTGACGAATGACAACTCCACGAGTGATGAAGGGGAAGAACAAGGGAAAAAGAAGGCATCACTTGCTAAAA
AGGAAGCTCCTAAAAAAAAGAAGGGTGGAAAAAAGGGAAAAAAGCTGAAGACCATGGTTGAAGAAGGTGACACCGTCCGAGTGGACGATGATTACTTTATGTCACCATCG
AAAAGAAGTAAGGCCCTAAAGATTAACCTATGTTGCAGAACAGAAATAATGGACACCATCAACAACATCTTAGGAGATAGGTGCAGAGAAGCTTTCAGAAACACGTGCTT
CGGCCACCTGCTTGACTTTACGTTCAAAAAGACGTCTTCCCAGTTACTATTGCACCTGATCCAGCATCAGTGCAAACCCAAACGGACGCCAGAACTTTACTTCAAGATTG
GAGGGAAAATCTTAAAGTTTGGCCTACGGGAGTTCGCATTAATTACGGGACTAAATTGTGGCCCATTGCCACAACTTGACAAAGACAAGCTACAAGATTCTTCCAGGTTC
AAGGATGAGTATTTTGCTGATGACGAGGGTGTTAGAAGAAAGACACTTAATATAGTATTCAACGCAATGAAGCATGGTGTCGAGACAGACCTCGTAAAGATGGCGCAGTT
GTATTGTTTGGAGAGTTTTTTGTTACCTAGGCAAGAAAAGGTGCACATTGAAGAGGAACATGTCCTAATGGTTGAAGACCAAGAATTGTTCACCACCTACCCTTGGGGGC
GCGTCGCCTTCACACTATTGACAACCTACATGCAGAAGGCATCCGTTAGTAGGGGCAGCGTTGGTATTGGAATGGGCGGGTTCGTATATGCCATCCTTGCATGGGCATAC
GAAGTGATACCCGCATTGAGCGCCCCACCGACCAACTACGCAAGACGGATCAGAAATACAGTCCCCCGCATCATAAATTGGGAGGTCGAAGCTCAACCCGAATGGAGAGA
ACTACATGCCAAGATATTCCAATCCCCATCGCTGAAGGTGGTACCATTGGACCCAACCGACACGAAAATGCAGATGCCGTACTTCCAACCTTTCTTGCAAGATGAATTGG
CTTCTCGACGATTGGCAGGAGACAATCAACAAGTAGAAGGCGATGTTCGAATCCCACCGAACTTCTCAATAGGGGCACCCCCAATGATCAGCCAGATGGATGTGATGGAA
AAACACCATCAAGAAATAATTGGTAAGCTCGACAAAGTTTACTCTGTGCTAGGAGCCTTGGTGGATACTTTGAGGGAGATACACGAGCTTGCCAACCCCCCAAACTCAAA
ATTCAAGATGCCCGGAGATGTTGGGACTGGTATTGACCCTACAACAAAAGACGATGATGTGGAGGGAAAAGAAGAAACTGATGAAAAAGATGAGCAAGATGACCATGGAT
TAGAGAAAAATCCTTCTCATCGAAGGGAAGACGACGATGGAGGACCAACAGGTGGGAAACAGCAACAGGGGTCGACCACCCCCGGACCAACAACCCTTGTACAGACTGAA
ACTCGTGTAGATGGCGAAGGCACGGGAGATGGCAGGACAAAGAAAACAGGAGGTGGTGAAGGCACAAAGGCCTGTGATGATGTCGACGAGACAATAAACAAGGCTATACT
GTCAATAGATGAGGCCAAGGTGATTGAAAAGTTTAATAGGGACCGCAAGGGTAAAGCGGTTATGATGGGAGGACCTCATACCATACCAAGAACCACAATCCCGCAACTTG
GCCCCCGATGGTCTGCAAAGGCCAAGGGAATAAAAATTAAGGAACCTACCACTCCGCTCATTCAAAATAGGACACCCCTCCGTGAGGTCAACGGGACCATAACTCGGGGG
GGAGCCAAGCGCTCAGATTGTGTGGAAGGAGTGGGGAGCCTGCAAGCCACAGGAATTTATGTGGACGCGATGAGGGGCACGTGGACAAAAGAATCGATGGAAACCCTACC
GCCAGAATTCTTCCAGCCGTCTTTTGATCTTCATCTCAGTCAGGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATATCCAGGCTTGCCCCTCTGGTTTAACAAATCGTGCATGCGCCTTCGTGAATACCATTTTCCAAAGAAAAACCCCCCTTCGAACAACCTGCTTCCTCTCCGTTGT
TCGTGCTCCCTTTCCGACGATCGTTCGACGCTCCCTCTCCGACGTCGTTCGCAGCCCGCTCTCAGACGCCATCCGAAGCTCCCCCATCCGAAGCTCCCCCTCCGTTCAAC
CCCCCACCTTCGACCGCTCAGCCTCCCTCGCACAAACCGTTCATGCATTTGATCCCGCCGCCGTTCTTGCGCCTGGTCCCACCGCCGAACGTTTGCATTCCACCCATCTG
TGCTCGCCGTACAGCCCCCCTCAGAGCGTAGAACATTTGCGAGGTATGCCCAAGGATAAGCTCGTGTCAACTAGAGCGAGCGATCGTTTGAAGGCTGCGGGAGTAACGGC
AGGAAGAAAACCCCCGGAACAAACGTCCCCAATCACATTGGGGAGCGAACAGGACTCTGAAGAAGCCATGAGTACAACAGTTTCAGTCGCTAAGGGATCCAGCGAAAAGA
CGAAAGGGGTAAAAAGGGACAGAGACGATGGAGGTCCGAGCAAAAAAGTAACTCCATCAAAGAAAACAAAAGTTCGCGACTGGACCAAGAAGACCAACGATGAGATTGAG
AAACCCACTGAGACACGAAGCAATAAGAAGACAAAGCGGTCGAAACAGACAAAAAACACAGATAAGGCGAGCCATGTGACACCAGAAGTTGTCCCTGAAACAAGCGAGGA
CACCACTAAACATGACACTGAAGACACCGAATCTGATAGTGTGACGAATGACAACTCCACGAGTGATGAAGGGGAAGAACAAGGGAAAAAGAAGGCATCACTTGCTAAAA
AGGAAGCTCCTAAAAAAAAGAAGGGTGGAAAAAAGGGAAAAAAGCTGAAGACCATGGTTGAAGAAGGTGACACCGTCCGAGTGGACGATGATTACTTTATGTCACCATCG
AAAAGAAGTAAGGCCCTAAAGATTAACCTATGTTGCAGAACAGAAATAATGGACACCATCAACAACATCTTAGGAGATAGGTGCAGAGAAGCTTTCAGAAACACGTGCTT
CGGCCACCTGCTTGACTTTACGTTCAAAAAGACGTCTTCCCAGTTACTATTGCACCTGATCCAGCATCAGTGCAAACCCAAACGGACGCCAGAACTTTACTTCAAGATTG
GAGGGAAAATCTTAAAGTTTGGCCTACGGGAGTTCGCATTAATTACGGGACTAAATTGTGGCCCATTGCCACAACTTGACAAAGACAAGCTACAAGATTCTTCCAGGTTC
AAGGATGAGTATTTTGCTGATGACGAGGGTGTTAGAAGAAAGACACTTAATATAGTATTCAACGCAATGAAGCATGGTGTCGAGACAGACCTCGTAAAGATGGCGCAGTT
GTATTGTTTGGAGAGTTTTTTGTTACCTAGGCAAGAAAAGGTGCACATTGAAGAGGAACATGTCCTAATGGTTGAAGACCAAGAATTGTTCACCACCTACCCTTGGGGGC
GCGTCGCCTTCACACTATTGACAACCTACATGCAGAAGGCATCCGTTAGTAGGGGCAGCGTTGGTATTGGAATGGGCGGGTTCGTATATGCCATCCTTGCATGGGCATAC
GAAGTGATACCCGCATTGAGCGCCCCACCGACCAACTACGCAAGACGGATCAGAAATACAGTCCCCCGCATCATAAATTGGGAGGTCGAAGCTCAACCCGAATGGAGAGA
ACTACATGCCAAGATATTCCAATCCCCATCGCTGAAGGTGGTACCATTGGACCCAACCGACACGAAAATGCAGATGCCGTACTTCCAACCTTTCTTGCAAGATGAATTGG
CTTCTCGACGATTGGCAGGAGACAATCAACAAGTAGAAGGCGATGTTCGAATCCCACCGAACTTCTCAATAGGGGCACCCCCAATGATCAGCCAGATGGATGTGATGGAA
AAACACCATCAAGAAATAATTGGTAAGCTCGACAAAGTTTACTCTGTGCTAGGAGCCTTGGTGGATACTTTGAGGGAGATACACGAGCTTGCCAACCCCCCAAACTCAAA
ATTCAAGATGCCCGGAGATGTTGGGACTGGTATTGACCCTACAACAAAAGACGATGATGTGGAGGGAAAAGAAGAAACTGATGAAAAAGATGAGCAAGATGACCATGGAT
TAGAGAAAAATCCTTCTCATCGAAGGGAAGACGACGATGGAGGACCAACAGGTGGGAAACAGCAACAGGGGTCGACCACCCCCGGACCAACAACCCTTGTACAGACTGAA
ACTCGTGTAGATGGCGAAGGCACGGGAGATGGCAGGACAAAGAAAACAGGAGGTGGTGAAGGCACAAAGGCCTGTGATGATGTCGACGAGACAATAAACAAGGCTATACT
GTCAATAGATGAGGCCAAGGTGATTGAAAAGTTTAATAGGGACCGCAAGGGTAAAGCGGTTATGATGGGAGGACCTCATACCATACCAAGAACCACAATCCCGCAACTTG
GCCCCCGATGGTCTGCAAAGGCCAAGGGAATAAAAATTAAGGAACCTACCACTCCGCTCATTCAAAATAGGACACCCCTCCGTGAGGTCAACGGGACCATAACTCGGGGG
GGAGCCAAGCGCTCAGATTGTGTGGAAGGAGTGGGGAGCCTGCAAGCCACAGGAATTTATGTGGACGCGATGAGGGGCACGTGGACAAAAGAATCGATGGAAACCCTACC
GCCAGAATTCTTCCAGCCGTCTTTTGATCTTCATCTCAGTCAGGGTTAA
Protein sequenceShow/hide protein sequence
MNIQACPSGLTNRACAFVNTIFQRKTPLRTTCFLSVVRAPFPTIVRRSLSDVVRSPLSDAIRSSPIRSSPSVQPPTFDRSASLAQTVHAFDPAAVLAPGPTAERLHSTHL
CSPYSPPQSVEHLRGMPKDKLVSTRASDRLKAAGVTAGRKPPEQTSPITLGSEQDSEEAMSTTVSVAKGSSEKTKGVKRDRDDGGPSKKVTPSKKTKVRDWTKKTNDEIE
KPTETRSNKKTKRSKQTKNTDKASHVTPEVVPETSEDTTKHDTEDTESDSVTNDNSTSDEGEEQGKKKASLAKKEAPKKKKGGKKGKKLKTMVEEGDTVRVDDDYFMSPS
KRSKALKINLCCRTEIMDTINNILGDRCREAFRNTCFGHLLDFTFKKTSSQLLLHLIQHQCKPKRTPELYFKIGGKILKFGLREFALITGLNCGPLPQLDKDKLQDSSRF
KDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRVAFTLLTTYMQKASVSRGSVGIGMGGFVYAILAWAY
EVIPALSAPPTNYARRIRNTVPRIINWEVEAQPEWRELHAKIFQSPSLKVVPLDPTDTKMQMPYFQPFLQDELASRRLAGDNQQVEGDVRIPPNFSIGAPPMISQMDVME
KHHQEIIGKLDKVYSVLGALVDTLREIHELANPPNSKFKMPGDVGTGIDPTTKDDDVEGKEETDEKDEQDDHGLEKNPSHRREDDDGGPTGGKQQQGSTTPGPTTLVQTE
TRVDGEGTGDGRTKKTGGGEGTKACDDVDETINKAILSIDEAKVIEKFNRDRKGKAVMMGGPHTIPRTTIPQLGPRWSAKAKGIKIKEPTTPLIQNRTPLREVNGTITRG
GAKRSDCVEGVGSLQATGIYVDAMRGTWTKESMETLPPEFFQPSFDLHLSQG