CuGenDBv2

Sed0016939 (gene) of Chayote v1 genome

Gene ID: Sed0016939
Organism: Sechium edule (Chayote v1)
Description: Plant protein of unknown function (DUF247)
Genome location: LG09:3149381..3154305
RNA-Seq Expression: Sed0016939
Synteny: Sed0016939
Gene Ontology terms: NA
InterPro domains: IPR004158 - Protein of unknown function DUF247, plant
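
The genome location field follows a `seqid:start..end` pattern. A minimal sketch of how it could be split into its parts is shown here, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention). The function name `parse_location` is illustrative and not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return seqid, start, end


seqid, start, end = parse_location("LG09:3149381..3154305")
# Span of the gene model on the linkage group, assuming inclusive coordinates.
span = end - start + 1
print(seqid, start, end, span)
```

Under the inclusive assumption the gene model spans 4925 bp on LG09.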


Homology
GenBank top hits (e value, % identity, alignment)
KAG7025238.1 UPF0481 protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 5.7e-84 | % identity: 47.59
Query:  SIYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTL-------LEELRGSYNTIWDESNEGIERPLAHLMIWDKSTKGIERLLKLMVV
        SIY+IP F+ +  PKA+ P +VSLGPY+HGK HL  ME+ KL    +        +E +    +TI DE  E  ++      + +   +   + L+LM+V
Subjt:  SIYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTL-------LEELRGSYNTIWDESNEGIERPLAHLMIWDKSTKGIERLLKLMVV

Query:  DGCFVIEFLNNCPPWLQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKSRITLQTVHLHFSHHKEPISEYVE------AREYLHILDMYRAVLLYPR-SE
        DGCF++ FL NCP  L N+  +IK+DML+LENQLPM LL+KL SIA+++ +            K+ +S ++        ++ LHIL+MY+  LL+P    
Subjt:  DGCFVIEFLNNCPPWLQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKSRITLQTVHLHFSHHKEPISEYVE------AREYLHILDMYRAVLLYPR-SE

Query:  SDSRNFIDQSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFER--STLKLPCVVLDYSTESALLNVMAFEKLHIETHHKVTSFIAFMDNLIDVDQDVA
        +D    ID SDPE Q I  AT+LREAGI FK S T S TDV F+     L LP +++D +TES LLNVMAFEKLH+E   KVTSF+  M NLID ++DVA
Subjt:  SDSRNFIDQSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFER--STLKLPCVVLDYSTESALLNVMAFEKLHIETHHKVTSFIAFMDNLIDVDQDVA

Query:  LLASKGIIHNALGDDRAAANLFNLLAKGITT-YSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILMLQAIYQVCDYY
        +LA + ++ NA+G+D+ AA LFN L  G       HMA V+  VN+HCN+ WN  CA+LKH YFQ+PWT IS+ AAIFGF IL+LQAIYQ  DYY
Subjt:  LLASKGIIHNALGDDRAAANLFNLLAKGITT-YSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILMLQAIYQVCDYY

XP_022148888.1 UPF0481 protein At3g47200-like [Momordica charantia] | e value: 2.6e-84 | % identity: 47.45
Query:  SIYEIPNFIKEVQPKA-FVPTLVSLGPYHHGKPHLGSMEMAK------------------LDYVMTLLEELRGSYNTIWDESNEGIERPLAHLMIWDKST
        SIY+IP+F+++VQPKA F P LVS GPYHHG+ HL  ME+ K                  ++ V ++LE+++G Y+ +  E              W K  
Subjt:  SIYEIPNFIKEVQPKA-FVPTLVSLGPYHHGKPHLGSMEMAK------------------LDYVMTLLEELRGSYNTIWDESNEGIERPLAHLMIWDKST

Query:  KGIERLLKLMVVDGCFVIE-FLNNCPPWLQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKSRI-TLQTVHLHFSHHKEPISEYVEAREYLHILDMYRAV
         G  + L+LMV+DGCF++E  L +   WL NMR +I RDML+LENQLPMKLLE+L S+AN S +  ++++   F    + + + ++  EYLHIL+MY   
Subjt:  KGIERLLKLMVVDGCFVIE-FLNNCPPWLQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKSRI-TLQTVHLHFSHHKEPISEYVEAREYLHILDMYRAV

Query:  LLYP------RSESDSRNFIDQSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFE--RSTLKLPCVVLDYSTESALLNVMAFEKLHIETHHKVTSFIA
        LL P      R +  S+    + D E Q I  AT+L EAGI F+ S++KS TDV F+  R  LKLP +V+D  TES  LNVMAFEKLH E   KVT FI 
Subjt:  LLYP------RSESDSRNFIDQSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFE--RSTLKLPCVVLDYSTESALLNVMAFEKLHIETHHKVTSFIA

Query:  FMDNLIDVDQDVALLASKGIIHNALGDDRAAANLFNLLAKGIT---TYSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILML
         M+NLIDVDQDVALLAS  II NALG+D+ AA LF  LA+G         H+  V  MV +HC K  + WCASLKH YFQ+PW  +S+ AA  GF IL+L
Subjt:  FMDNLIDVDQDVALLASKGIIHNALGDDRAAANLFNLLAKGIT---TYSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILML

Query:  QAIYQVCDYYK
        QA+YQ+CDYY+
Subjt:  QAIYQVCDYYK

XP_023513986.1 UPF0481 protein At3g47200-like isoform X1 [Cucurbita pepo subsp. pepo] | e value: 9.8e-84 | % identity: 47.59
Query:  SIYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTL-------LEELRGSYNTIWDESNEGIERPLAHLMIWDKSTKGIERLLKLMVV
        SIY+IP F+ +  PKA+ P +VSLGPY+HGK HL  ME+ KL    +        +E +    +TI DE  E  ++      + ++ T+   + L+LM+V
Subjt:  SIYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTL-------LEELRGSYNTIWDESNEGIERPLAHLMIWDKSTKGIERLLKLMVV

Query:  DGCFVIEFLNNCPPWLQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKSRITLQTVHLHFSHHKEPISEYVE------AREYLHILDMYRAVLLYPR-SE
        DGCF++ FL NCP  L N+  +IK+DML+LENQLPM LLEKL SIA ++      V L     ++ +S ++        ++ LHIL+MY+  LL+P    
Subjt:  DGCFVIEFLNNCPPWLQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKSRITLQTVHLHFSHHKEPISEYVE------AREYLHILDMYRAVLLYPR-SE

Query:  SDSRNFIDQSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFER--STLKLPCVVLDYSTESALLNVMAFEKLHIETHHKVTSFIAFMDNLIDVDQDVA
        +D    +D SDPE Q I  AT+LREAGI FK S+T S TDV F+     L LP +++D  TES LLNVMAFEKLH++   +VTSF+  M NLID ++DVA
Subjt:  SDSRNFIDQSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFER--STLKLPCVVLDYSTESALLNVMAFEKLHIETHHKVTSFIAFMDNLIDVDQDVA

Query:  LLASKGIIHNALGDDRAAANLFNLLAKGITT-YSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILMLQAIYQVCDYY
        +LA + ++ NA+G+D+ AA LFN L  G       HMA V+  VN+HCN+ WN  CA+LKH YFQ+PWT IS+ AAIFGF IL+LQAIYQ  DYY
Subjt:  LLASKGIIHNALGDDRAAANLFNLLAKGITT-YSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILMLQAIYQVCDYY

XP_023513987.1 UPF0481 protein At3g47200-like isoform X2 [Cucurbita pepo subsp. pepo] | e value: 7.5e-84 | % identity: 48.46
Query:  SIYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTL-------LEELRGSYNTIWDESNEGIERPLAHLMIWDKSTKGIERLLKLMVV
        SIY+IP F+ +  PKA+ P +VSLGPY+HGK HL  ME+ KL    +        +E +    +TI DE  E  ++      + ++ T+   + L+LM+V
Subjt:  SIYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTL-------LEELRGSYNTIWDESNEGIERPLAHLMIWDKSTKGIERLLKLMVV

Query:  DGCFVIEFLNNCPPWLQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKSRITLQTVHLHFSHHKE-PISEYVEAREYLHILDMYRAVLLYPR-SESDSRN
        DGCF++ FL NCP  L N+  +IK+DML+LENQLPM LLEKL SIA ++    Q +    S+    P +E +  ++ LHIL+MY+  LL+P    +D   
Subjt:  DGCFVIEFLNNCPPWLQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKSRITLQTVHLHFSHHKE-PISEYVEAREYLHILDMYRAVLLYPR-SESDSRN

Query:  FIDQSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFER--STLKLPCVVLDYSTESALLNVMAFEKLHIETHHKVTSFIAFMDNLIDVDQDVALLASK
         +D SDPE Q I  AT+LREAGI FK S+T S TDV F+     L LP +++D  TES LLNVMAFEKLH++   +VTSF+  M NLID ++DVA+LA +
Subjt:  FIDQSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFER--STLKLPCVVLDYSTESALLNVMAFEKLHIETHHKVTSFIAFMDNLIDVDQDVALLASK

Query:  GIIHNALGDDRAAANLFNLLAKGITT-YSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILMLQAIYQVCDYY
         ++ NA+G+D+ AA LFN L  G       HMA V+  VN+HCN+ WN  CA+LKH YFQ+PWT IS+ AAIFGF IL+LQAIYQ  DYY
Subjt:  GIIHNALGDDRAAANLFNLLAKGITT-YSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILMLQAIYQVCDYY

XP_038875622.1 UPF0481 protein At3g47200-like [Benincasa hispida] | e value: 5.7e-92 | % identity: 50.52
Query:  SIYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTLLEELRGSYNTIWDESNEGIERPL-AHLMIWDKSTKGIERLLKLMVVDGCFVI
        SIY+IP F++++Q KAF P LVSLGPYHHGK HL SME  K       + +   +  +I +  +  +E    A+  + +K  +   + L++M+VDGCF++
Subjt:  SIYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTLLEELRGSYNTIWDESNEGIERPL-AHLMIWDKSTKGIERLLKLMVVDGCFVI

Query:  EFLNNCPPWLQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKSRITLQTVHLHFSHHKEPISEYVEAREYLHILDMYRAVLLYPRSESDSRN-----FID
        +F   CP  L  MR +IKRDML+LENQLPM+LL++L      +    Q++            E V  RE LHILDMYRA LLYP  +   R+        
Subjt:  EFLNNCPPWLQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKSRITLQTVHLHFSHHKEPISEYVEAREYLHILDMYRAVLLYPRSESDSRN-----FID

Query:  QSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSF--ERSTLKLPCVVLDYSTESALLNVMAFEKLHIETHHKVTSFIAFMDNLIDVDQDVALLASKGII
        QSDPE Q I  ATQL +AGI FK S TK+ TDVSF  ++  L+LP +V+D  TE+ LLNVMAFEKL++E   KVTSF+  M+NLIDVD+DVALLAS  I+
Subjt:  QSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSF--ERSTLKLPCVVLDYSTESALLNVMAFEKLHIETHHKVTSFIAFMDNLIDVDQDVALLASKGII

Query:  HNALGDDRAAANLFNLLAKGIT-TYSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILMLQAIYQVCDYYK
         NALG+D +AA+LF+LL KG       H+  V+  VNKHC+ SWN WCASLKH YFQNPW  IS+FAA+FGFAIL++QAIYQ+ DY++
Subjt:  HNALGDDRAAANLFNLLAKGIT-TYSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILMLQAIYQVCDYYK

TrEMBL top hits (e value, % identity, alignment)
A0A6J1D5C0 UPF0481 protein At3g47200-like | e value: 1.2e-84 | % identity: 47.45
Query:  SIYEIPNFIKEVQPKA-FVPTLVSLGPYHHGKPHLGSMEMAK------------------LDYVMTLLEELRGSYNTIWDESNEGIERPLAHLMIWDKST
        SIY+IP+F+++VQPKA F P LVS GPYHHG+ HL  ME+ K                  ++ V ++LE+++G Y+ +  E              W K  
Subjt:  SIYEIPNFIKEVQPKA-FVPTLVSLGPYHHGKPHLGSMEMAK------------------LDYVMTLLEELRGSYNTIWDESNEGIERPLAHLMIWDKST

Query:  KGIERLLKLMVVDGCFVIE-FLNNCPPWLQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKSRI-TLQTVHLHFSHHKEPISEYVEAREYLHILDMYRAV
         G  + L+LMV+DGCF++E  L +   WL NMR +I RDML+LENQLPMKLLE+L S+AN S +  ++++   F    + + + ++  EYLHIL+MY   
Subjt:  KGIERLLKLMVVDGCFVIE-FLNNCPPWLQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKSRI-TLQTVHLHFSHHKEPISEYVEAREYLHILDMYRAV

Query:  LLYP------RSESDSRNFIDQSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFE--RSTLKLPCVVLDYSTESALLNVMAFEKLHIETHHKVTSFIA
        LL P      R +  S+    + D E Q I  AT+L EAGI F+ S++KS TDV F+  R  LKLP +V+D  TES  LNVMAFEKLH E   KVT FI 
Subjt:  LLYP------RSESDSRNFIDQSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFE--RSTLKLPCVVLDYSTESALLNVMAFEKLHIETHHKVTSFIA

Query:  FMDNLIDVDQDVALLASKGIIHNALGDDRAAANLFNLLAKGIT---TYSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILML
         M+NLIDVDQDVALLAS  II NALG+D+ AA LF  LA+G         H+  V  MV +HC K  + WCASLKH YFQ+PW  +S+ AA  GF IL+L
Subjt:  FMDNLIDVDQDVALLASKGIIHNALGDDRAAANLFNLLAKGIT---TYSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILML

Query:  QAIYQVCDYYK
        QA+YQ+CDYY+
Subjt:  QAIYQVCDYYK

A0A6J1H6V9 UPF0481 protein At3g47200-like | e value: 1.2e-79 | % identity: 45.39
Query:  KHSIYEIPNFIKEVQP----KAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTL------LEELRGSYNTIWDESNEGIERPLAHLMIWDKSTKGIERLL
        KHSIY++P F+++       KAF P +VS GPYHHGK HL  ME  K     TL      +E +    +TI D+  E  +R      + ++ T+   + L
Subjt:  KHSIYEIPNFIKEVQP----KAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTL------LEELRGSYNTIWDESNEGIERPLAHLMIWDKSTKGIERLL

Query:  KLMVVDGCFVIEFLNNCPPWLQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKSRITLQTVHLHFSHHKEPISEYV----------EAREYLHILDMYRA
        KLM+VDGCF++    +CP  L NM+ +I+ + L+LENQLP+KLL KL SIA         + +     + P+ E +             E+LHILD+Y+A
Subjt:  KLMVVDGCFVIEFLNNCPPWLQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKSRITLQTVHLHFSHHKEPISEYV----------EAREYLHILDMYRA

Query:  VLLYPRSESDSRNFIDQ---SDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFE--RSTLKLPCVVLDYSTESALLNVMAFEKLHI-----ETHHKVTS
         LL P    D   + ++   S  EFQ I  AT+L EAGI FKMS+TKS  DV F+     L LP +++D +TES  LNVMAFEKLH+     E    +TS
Subjt:  VLLYPRSESDSRNFIDQ---SDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFE--RSTLKLPCVVLDYSTESALLNVMAFEKLHI-----ETHHKVTS

Query:  FIAFMDNLIDVDQDVALLASKGIIHNALGDDRAAANLFNLLAKGITT-YSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILM
        F+  M NLID ++DVALL+SKG + NALG+DR AA LF+ L KG+    + HM  V+ M+N +C + WN  CA+LKH YFQNPWT IS+ AAIFGF IL+
Subjt:  FIAFMDNLIDVDQDVALLASKGIIHNALGDDRAAANLFNLLAKGITT-YSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILM

Query:  LQAIYQVCDYYK
        LQAIYQ+ DYYK
Subjt:  LQAIYQVCDYYK

A0A6J1HB25 UPF0481 protein At3g47200-like | e value: 3.1e-83 | % identity: 47.59
Query:  SIYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTL-------LEELRGSYNTIWDESNEGIERPLAHLMIWDKSTKGIERLLKLMVV
        SIY+IP F+ +  PKA+ P +VSLGPY+HGK HL  ME+ KL    +        +E +    +TI DE  E  ++      + +   +   + L+LM+V
Subjt:  SIYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTL-------LEELRGSYNTIWDESNEGIERPLAHLMIWDKSTKGIERLLKLMVV

Query:  DGCFVIEFLNNCPPWLQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKSRITLQTVHLHFSHHKEPISEYVE------AREYLHILDMYRAVLLYPR-SE
        DGCF++ FL NCP  L N+  +IK+DML+LENQLPM LLEKL SIA+++      V L     K+ +S ++        ++ LHIL+MY+  LL+P    
Subjt:  DGCFVIEFLNNCPPWLQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKSRITLQTVHLHFSHHKEPISEYVE------AREYLHILDMYRAVLLYPR-SE

Query:  SDSRNFIDQSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFER--STLKLPCVVLDYSTESALLNVMAFEKLHIETHHKVTSFIAFMDNLIDVDQDVA
        +D    +  SDPE Q I  AT+LREAGI FK S+T S TDV F+     L LP +++D +TES LLNVMAFEKLH++   KVTSF+  M NLID ++DVA
Subjt:  SDSRNFIDQSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFER--STLKLPCVVLDYSTESALLNVMAFEKLHIETHHKVTSFIAFMDNLIDVDQDVA

Query:  LLASKGIIHNALGDDRAAANLFNLLAKGITT-YSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILMLQAIYQVCDYY
        +LA + ++ NA+G+D+ AA LFN L  G       HMA V+  VN+HCN+ WN  CA+LKH YFQ+PWT IS+ AAIFGF IL+LQAIYQ  DYY
Subjt:  LLASKGIIHNALGDDRAAANLFNLLAKGITT-YSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILMLQAIYQVCDYY

A0A6J1KVQ6 UPF0481 protein At3g47200-like isoform X2 | e value: 1.4e-83 | % identity: 48.46
Query:  SIYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTL-------LEELRGSYNTIWDESNEGIERPLAHLMIWDKSTKGIERLLKLMVV
        SIY+IP F+ +  PKA+ P +VSLGPY+HGK HL  ME+ KL    +        +E +    +TI DE  E  +R      + ++ T+   + L+LM+V
Subjt:  SIYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTL-------LEELRGSYNTIWDESNEGIERPLAHLMIWDKSTKGIERLLKLMVV

Query:  DGCFVIEFLNNCPPWLQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKS-RITLQTVHLHFSHHKEPISEYVEAREYLHILDMYRAVLLYPR-SESDSRN
        DGCF++ FL +CP  L N+  +IK+DML+LENQLPM LLEKL SIA ++ ++      L       P +E +  ++ LHIL+MY+  LLYP     D   
Subjt:  DGCFVIEFLNNCPPWLQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKS-RITLQTVHLHFSHHKEPISEYVEAREYLHILDMYRAVLLYPR-SESDSRN

Query:  FIDQSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFE--RSTLKLPCVVLDYSTESALLNVMAFEKLHIETHHKVTSFIAFMDNLIDVDQDVALLASK
         +D SDPE Q I  AT+L EAGI FK S+T+S  DV F+  R  L LP +++D  TES +LNVMAFEKLH++   KVTSF+  M NLID ++DVA+LA +
Subjt:  FIDQSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFE--RSTLKLPCVVLDYSTESALLNVMAFEKLHIETHHKVTSFIAFMDNLIDVDQDVALLASK

Query:  GIIHNALGDDRAAANLFNLLAKGITT-YSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILMLQAIYQVCDYY
         I+ NA+G+D+ AA LF+ L  G       HMA V+ MVN HCN+ WN  CA+LKH YFQ+PWT IS+ AAIFGF IL+LQAIYQ  DYY
Subjt:  GIIHNALGDDRAAANLFNLLAKGITT-YSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILMLQAIYQVCDYY

A0A6J1KYV8 UPF0481 protein At3g47200-like isoform X1 | e value: 6.2e-84 | % identity: 47.85
Query:  SIYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTL-------LEELRGSYNTIWDESNEGIERPLAHLMIWDKSTKGIERLLKLMVV
        SIY+IP F+ +  PKA+ P +VSLGPY+HGK HL  ME+ KL    +        +E +    +TI DE  E  +R      + ++ T+   + L+LM+V
Subjt:  SIYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTL-------LEELRGSYNTIWDESNEGIERPLAHLMIWDKSTKGIERLLKLMVV

Query:  DGCFVIEFLNNCPPWLQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKSRITLQTVHLHFSHHKEPISEYVE------AREYLHILDMYRAVLLYPR-SE
        DGCF++ FL +CP  L N+  +IK+DML+LENQLPM LLEKL SIA ++      V L     K+ + +++        ++ LHIL+MY+  LLYP    
Subjt:  DGCFVIEFLNNCPPWLQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKSRITLQTVHLHFSHHKEPISEYVE------AREYLHILDMYRAVLLYPR-SE

Query:  SDSRNFIDQSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFE--RSTLKLPCVVLDYSTESALLNVMAFEKLHIETHHKVTSFIAFMDNLIDVDQDVA
         D    +D SDPE Q I  AT+L EAGI FK S+T+S  DV F+  R  L LP +++D  TES +LNVMAFEKLH++   KVTSF+  M NLID ++DVA
Subjt:  SDSRNFIDQSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFE--RSTLKLPCVVLDYSTESALLNVMAFEKLHIETHHKVTSFIAFMDNLIDVDQDVA

Query:  LLASKGIIHNALGDDRAAANLFNLLAKGITT-YSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILMLQAIYQVCDYY
        +LA + I+ NA+G+D+ AA LF+ L  G       HMA V+ MVN HCN+ WN  CA+LKH YFQ+PWT IS+ AAIFGF IL+LQAIYQ  DYY
Subjt:  LLASKGIIHNALGDDRAAANLFNLLAKGITT-YSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILMLQAIYQVCDYY

SwissProt top hits (e value, % identity, alignment)
P0C897 Putative UPF0481 protein At3g02645 | e value: 2.0e-10 | % identity: 25.65
Query:  SESDSRNFIDQSDP---EFQTIRHATQLREAGIIFKMSETKSFTDVSFERST--LKLPCVVLDYSTESALLNVMAFEKLHIETHHKVTSFIAFMDNLIDV
        +  +S + +D   P   E  TI   + L +AG+ FK +   + + V+F+ ++    LP + LD +TE+ L N++A+E  +       T +   ++ +ID 
Subjt:  SESDSRNFIDQSDP---EFQTIRHATQLREAGIIFKMSETKSFTDVSFERST--LKLPCVVLDYSTESALLNVMAFEKLHIETHHKVTSFIAFMDNLIDV

Query:  DQDVALLASKGIIHNALGDDRAAANLFNLLAKGI-TTYSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILMLQ
        ++DV LL  +G++ + L  D+ AA ++N ++K +  T    + K    VN++    W      L   Y    W  ++  AA+    ++ LQ
Subjt:  DQDVALLASKGIIHNALGDDRAAANLFNLLAKGI-TTYSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILMLQ

Q9SD53 UPF0481 protein At3g47200 | e value: 5.6e-34 | % identity: 28.2
Query:  IYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTLLEELRGSYNTIWDESNEGIERPLAHLMIWDKSTKGIERLLK-------LMVVD
        I+ +P     + PKA+ P +VS+GPYH+G+ HL  ++  K   +   L+E +       D     + + +  L   DK  K     LK       +MV+D
Subjt:  IYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTLLEELRGSYNTIWDESNEGIERPLAHLMIWDKSTKGIERLLK-------LMVVD

Query:  GCFVIEF---------LNNCP----PWLQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKSRITLQTVHLHFSHHKEPI----SEYVEAREY--LHILDM
        GCF++           L+  P    PWL    S I+ D+L+LENQ+P  +L+ L  + +K  ++     + F   K PI    S + + R Y   H+LD+
Subjt:  GCFVIEF---------LNNCP----PWLQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKSRITLQTVHLHFSHHKEPI----SEYVEAREY--LHILDM

Query:  YRAVLLYPRSESD---------------SRNFIDQSDPEFQTIRHATQLREAGIIFKMSETK--SFTDVSFERSTLKLPCVVLDYSTESALLNVMAFEKL
         R   L   SESD               S N           I  A +LR  GI F++  +K  S  +V  +++ L++P +  D    S  LN +AFE+ 
Subjt:  YRAVLLYPRSESD---------------SRNFIDQSDPEFQTIRHATQLREAGIIFKMSETK--SFTDVSFERSTLKLPCVVLDYSTESALLNVMAFEKL

Query:  HIETHHKVTSFIAFMDNLIDVDQDVALLAS-KGIIHNALGDDRAAANLFNLLAKGIT--TYSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTIS
        + ++ +++T++I FM  L++ ++DV  L + K II N  G +   +  F  ++K +     + ++  V   VN++  K +N   A  +HT+F++PWT +S
Subjt:  HIETHHKVTSFIAFMDNLIDVDQDVALLAS-KGIIHNALGDDRAAANLFNLLAKGIT--TYSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTIS

Query:  IFAAIFGFAILMLQAIYQVCDY
          A +F   + MLQ+   +  Y
Subjt:  IFAAIFGFAILMLQAIYQVCDY

Arabidopsis top hits (e value, % identity, alignment)
AT3G50120.1 Plant protein of unknown function (DUF247) | e value: 2.3e-51 | % identity: 32.4
Query:  KHSIYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTLLEELRGSYNTIWDESNEGIERPLAHLMIWDKSTKGIERLLKLMVVDGCFV
        K  IY +P +++E   K++ P  VSLGPYHHGK  L SM+  K   V  +L+          D   E  E+  A        +      ++++V+DGCFV
Subjt:  KHSIYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTLLEELRGSYNTIWDESNEGIERPLAHLMIWDKSTKGIERLLKLMVVDGCFV

Query:  IE-------------FLNNCPPW-LQNMRSEIKRDMLMLENQLPM----KLLEKLCSIANKSRITLQTVHLHFS---HHKEPISEYVEAR----------
        +E             +  N P + ++     I+RDM+MLENQLP+    +LLE      N++ +  Q     F       EP+++  +++          
Subjt:  IE-------------FLNNCPPW-LQNMRSEIKRDMLMLENQLPM----KLLEKLCSIANKSRITLQTVHLHFS---HHKEPISEYVEAR----------

Query:  -------EYLHILDMYRAVLL--YPRSESD------SRNFIDQSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFERSTLKLPCVVLDYSTESALLNV
                 LH LD++R  LL   P+ E        SRN         Q I   T+L+EAGI F+  +T  F D+ F+   L++P +++   T+S  LN+
Subjt:  -------EYLHILDMYRAVLL--YPRSESD------SRNFIDQSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFERSTLKLPCVVLDYSTESALLNV

Query:  MAFEKLHIETHHKVTSFIAFMDNLIDVDQDVALLASKGIIHNALGDDRAAANLFNLLAKGIT--TYSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNP
        +AFE+ HI++ + +TS+I FMDNLID  +DV+ L   GII + LG D   A+LFN L + +   T   +++++++ VN++ +  WN W A+LKH YF NP
Subjt:  MAFEKLHIETHHKVTSFIAFMDNLIDVDQDVALLASKGIIHNALGDDRAAANLFNLLAKGIT--TYSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNP

Query:  WTTISIFAAIFGFAILMLQAIYQVCDYYK
        W  +S  AA+    +   Q+ Y V  YYK
Subjt:  WTTISIFAAIFGFAILMLQAIYQVCDYYK

AT3G50140.1 Plant protein of unknown function (DUF247) | e value: 2.3e-46 | % identity: 29.14
Query:  KHSIYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTLLEELRGSYNTIWDESNEGIERPLAHLMIWDKSTKGIERLLKLMVVDGCFV
        K  IY +P  +K+    ++ P  VSLGPYHHG  HL  M+  K   V  +++  +       D   E  ER  A             +  +++V+DGCFV
Subjt:  KHSIYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTLLEELRGSYNTIWDESNEGIERPLAHLMIWDKSTKGIERLLKLMVVDGCFV

Query:  IEFL-------------NNCPPW-LQNMRSEIKRDMLMLENQLPMKLLEKLCSI------------------------ANKSRITLQTVHLHFSHHKEPI
        ++                N P + ++     I+RDMLMLENQLP+ +L +L  +                           S   ++    + +    PI
Subjt:  IEFL-------------NNCPPW-LQNMRSEIKRDMLMLENQLPMKLLEKLCSI------------------------ANKSRITLQTVHLHFSHHKEPI

Query:  SEYVEAREYLHILDMYRAVLLYPRSESD--------SRNFIDQSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFERSTLKLPCVVLDYSTESALLNV
        ++  + +E LH LD++R  LL P  + D        SR  +     + Q +   T+LREAGI FK  ++  F D+ F+   L++P +++   T+S   N+
Subjt:  SEYVEAREYLHILDMYRAVLLYPRSESD--------SRNFIDQSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFERSTLKLPCVVLDYSTESALLNV

Query:  MAFEKLHIETHHKVTSFIAFMDNLIDVDQDVALLASKGIIHNALGDDRAAANLFNLLAKGIT--TYSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNP
        +A+E+ HI++ + +TS+I FMDNLID  +D+  L    II + LG+D   A++FN L + +     + ++++++  V+++ N+ WN   A+LKH YF NP
Subjt:  MAFEKLHIETHHKVTSFIAFMDNLIDVDQDVALLASKGIIHNALGDDRAAANLFNLLAKGIT--TYSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNP

Query:  WTTISIFAAIFGFAILMLQAIYQVCDYYK
        W   S FAA+    + + Q+ +    Y+K
Subjt:  WTTISIFAAIFGFAILMLQAIYQVCDYYK

AT3G50150.1 Plant protein of unknown function (DUF247) | e value: 1.1e-48 | % identity: 30.16
Query:  LPVKDVSEYLTHFHSHRNQIFKHSIYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTLLEELRGSYNTIWDESNEGIERPLAHLMIW
        + +KD  E    + +  N   K  IY +P +++E   K+++P  VS+GPYHHGK HL  ME  K   V  ++   + +     D   E +E         
Subjt:  LPVKDVSEYLTHFHSHRNQIFKHSIYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTLLEELRGSYNTIWDESNEGIERPLAHLMIW

Query:  DKSTKGIERLLKLMVVDGCFVIE-------------FLNNCPPWL-QNMRSEIKRDMLMLENQLPMKLLEKLCSI----ANKSRITLQTVHLHFSHHKEP
            K      +++V+DGCFV+E             +  N P +  + +   I+RDM+MLENQLP+ +L++L  +     N++ I  + V + F     P
Subjt:  DKSTKGIERLLKLMVVDGCFVIE-------------FLNNCPPWL-QNMRSEIKRDMLMLENQLPMKLLEKLCSI----ANKSRITLQTVHLHFSHHKEP

Query:  ISEYVEAREY----------------LHILDMY-RAVLLYPRSESDSRNFIDQS--DPEFQTIRHATQLREAGIIFKMSETKSFTDVSFERSTLKLPCVV
         SE +   E                 LH LD++ R+++    + +    + D S  + + Q I   T+LR AG+ F   ET    D+ F+   LK+P ++
Subjt:  ISEYVEAREY----------------LHILDMY-RAVLLYPRSESDSRNFIDQS--DPEFQTIRHATQLREAGIIFKMSETKSFTDVSFERSTLKLPCVV

Query:  LDYSTESALLNVMAFEKLHIETHHKVTSFIAFMDNLIDVDQDVALLASKGIIHNALGDDRAAANLFNLLAKGI--TTYSWHMAKVNMMVNKHCNKSWNNW
        +   T+S   N++AFE+ H ++ + +TS+I FMDNLI+  QDV+ L   GII + LG D   A+LFN L K +       ++++++  VN++ ++ WN+ 
Subjt:  LDYSTESALLNVMAFEKLHIETHHKVTSFIAFMDNLIDVDQDVALLASKGIIHNALGDDRAAANLFNLLAKGI--TTYSWHMAKVNMMVNKHCNKSWNNW

Query:  CASLKHTYFQNPWTTISIFAAIFGFAILMLQAIYQVCDYYK
         A+L+  YF NPW   S  AA+    +   Q+ + V  YYK
Subjt:  CASLKHTYFQNPWTTISIFAAIFGFAILMLQAIYQVCDYYK

AT3G50160.1 Plant protein of unknown function (DUF247) | e value: 2.6e-50 | % identity: 32.58
Query:  IYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTLLEELRGSYNTIWDESNEGIERPLAHLMIWDKSTKGIERLLKLMVVDGCFVIE-
        IY +P +++E   K+++P +VS+GPYHHG  HL  ME  K   V  ++   +       D   E  E+  A               ++++V+DG F+IE 
Subjt:  IYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTLLEELRGSYNTIWDESNEGIERPLAHLMIWDKSTKGIERLLKLMVVDGCFVIE-

Query:  ------------FLNNCPPW-LQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKSRITLQTVHLH--FSHHKEPISEYVEAREYLHILDMYRAVLLYPRS
                    +  N P + ++ +   I+RDM+MLENQLP  +L+ L  +     +    V L   F     P  E +     LH LD+ R  LL    
Subjt:  ------------FLNNCPPW-LQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKSRITLQTVHLH--FSHHKEPISEYVEAREYLHILDMYRAVLLYPRS

Query:  ESDSRNFIDQSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFERSTLKLPCVVLDYSTESALLNVMAFEKLHIETHHKVTSFIAFMDNLIDVDQDVAL
         SD    +    P+ Q I   T+LR AG+ F   ET  F D+ F+   LK+P +++   T+S  LN++AFE+ HI++  K+TS+I FMDNLI+  +DV+ 
Subjt:  ESDSRNFIDQSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFERSTLKLPCVVLDYSTESALLNVMAFEKLHIETHHKVTSFIAFMDNLIDVDQDVAL

Query:  LASKGIIHNALGDDRAAANLFNLLAKGI--TTYSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILMLQAIYQVCDYYK
        L   GII N LG D   ++LFN L K +       +++ +   VN +  + WN   A+L+H YF NPW   S  AA+        Q+ + V  Y+K
Subjt:  LASKGIIHNALGDDRAAANLFNLLAKGI--TTYSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILMLQAIYQVCDYYK

AT3G50170.1 Plant protein of unknown function (DUF247) | e value: 4.9e-49 | % identity: 30.61
Query:  KHSIYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTLLEELRGSYNTIWDESNEGIERPLAHLMIWDKSTKGIERLLKLMVVDGCFV
        K  IY +P++++E   K++ P  VSLGPYHHGK  L  ME  K   +  +L+ L+       +   E  E+  A        +       +++V+DGCFV
Subjt:  KHSIYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTLLEELRGSYNTIWDESNEGIERPLAHLMIWDKSTKGIERLLKLMVVDGCFV

Query:  IE-------------FLNNCPPW-LQNMRSEIKRDMLMLENQLPMKLLEKLCSI----ANKSRITLQTVHLHFSHHKEPISEYVEAREY-----------
        +E             +  N P + ++ +   I+RDM+MLENQLP+ +L++L  +     N++ I    V + F     P  E +   +            
Subjt:  IE-------------FLNNCPPW-LQNMRSEIKRDMLMLENQLPMKLLEKLCSI----ANKSRITLQTVHLHFSHHKEPISEYVEAREY-----------

Query:  --------LHILDMYRAVLLYPRSESDSRNFIDQ--------SDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFERSTLKLPCVVLDYSTESALLNVM
                LH LD++R  LL      ++R+ + +           + Q +   T+LREAG+ F+  +T  F D+ F+   L++P +++   T+S   N++
Subjt:  --------LHILDMYRAVLLYPRSESDSRNFIDQ--------SDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFERSTLKLPCVVLDYSTESALLNVM

Query:  AFEKLHIETHHKVTSFIAFMDNLIDVDQDVALLASKGIIHNALGDDRAAANLFNLLAKGIT--TYSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPW
        AFE+ HIE+ + +TS+I FMDNLI+  +DV+ L   GII + LG D   A+LFN L + +       H+++++  VN++ N+ WN   A+L H YF NPW
Subjt:  AFEKLHIETHHKVTSFIAFMDNLIDVDQDVALLASKGIIHNALGDDRAAANLFNLLAKGIT--TYSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPW

Query:  TTISIFAAIFGFAILMLQAIYQVCDYYK
           S  AA+    + + Q+ Y V  YYK
Subjt:  TTISIFAAIFGFAILMLQAIYQVCDYYK
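
In BLAST-style alignments such as those listed for each hit, the middle line marks an identical residue with its letter, a conservative substitution with `+`, and a mismatch or gap with a space. The sketch below counts these for one alignment segment; the helper name `midline_stats` is hypothetical, and the percentages reported on the page are presumably computed over each full alignment, so a single segment will not necessarily reproduce them.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identical, positive, length) for one alignment segment.

    `query` is the aligned query row (gaps as '-') and `midline` is the
    match row printed beneath it; trailing spaces lost in copying are restored.
    """
    length = len(query)
    midline = midline.ljust(length)[:length]
    identical = sum(ch.isalpha() for ch in midline)
    positive = identical + midline.count("+")
    return identical, positive, length


# Final segment of the Momordica charantia (XP_022148888.1) alignment.
identical, positive, length = midline_stats("QAIYQVCDYYK", "QA+YQ+CDYY+")
print(f"identity {100 * identical / length:.1f}%, positives {100 * positive / length:.1f}%")
```

For this 11-residue segment, 8 positions are identical and all 11 are identical or conservative.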


Sequences
CDS sequence
ATGTCAACCCAGGAATTGGTGGAGCGATTGCCGGTCAAAGATGTAAGTGAATATCTCACTCATTTCCATTCACATAGAAACCAAATTTTCAAGCATTCAATTTACGAAAT
ACCAAACTTCATCAAAGAAGTTCAACCCAAAGCTTTCGTGCCGACTCTGGTGTCGCTCGGGCCATACCACCACGGAAAGCCACACCTCGGTTCGATGGAGATGGCAAAAT
TGGACTATGTGATGACATTGTTGGAAGAACTGCGGGGATCGTACAACACGATTTGGGATGAGAGCAATGAAGGAATAGAAAGACCCTTGGCGCACTTGATGATTTGGGAT
AAGAGCACAAAAGGAATAGAGAGATTGTTGAAGCTCATGGTTGTGGATGGTTGTTTTGTTATTGAATTCTTGAACAATTGTCCTCCATGGCTTCAAAATATGAGATCTGA
AATCAAGCGCGACATGTTAATGCTTGAGAATCAGCTGCCCATGAAGCTTCTTGAGAAGCTATGTTCCATAGCTAACAAGAGCCGAATAACCCTTCAGACAGTCCATTTGC
ATTTTAGCCATCACAAGGAGCCCATCTCAGAATATGTGGAAGCAAGAGAATACTTGCACATTTTAGACATGTACAGGGCAGTATTATTGTATCCAAGGAGTGAGAGTGAT
AGTCGAAACTTCATCGACCAATCTGACCCAGAGTTTCAAACCATACGACACGCAACACAGCTTCGCGAAGCTGGGATCATTTTCAAGATGAGCGAAACCAAGAGCTTCAC
CGACGTATCGTTCGAACGCAGCACGTTGAAGCTCCCGTGTGTGGTATTGGACTATAGCACGGAGTCGGCTTTATTAAACGTGATGGCATTTGAGAAACTCCACATCGAAA
CTCACCACAAAGTCACATCGTTCATAGCCTTCATGGACAATCTCATAGACGTGGACCAAGATGTGGCATTGTTAGCCTCCAAAGGAATCATCCACAATGCGCTCGGCGAC
GACCGAGCCGCGGCTAACTTGTTCAATCTACTGGCTAAAGGAATTACTACATATAGCTGGCACATGGCTAAGGTAAACATGATGGTGAATAAACATTGCAACAAATCGTG
GAATAATTGGTGTGCAAGTCTCAAACATACCTATTTTCAAAACCCATGGACAACCATCTCTATCTTTGCTGCTATTTTTGGTTTTGCCATCCTAATGCTCCAAGCCATCT
ACCAAGTCTGTGATTATTACAAGGTTTCGTTTTGA
mRNA sequence
ATGTCAACCCAGGAATTGGTGGAGCGATTGCCGGTCAAAGATGTAAGTGAATATCTCACTCATTTCCATTCACATAGAAACCAAATTTTCAAGCATTCAATTTACGAAAT
ACCAAACTTCATCAAAGAAGTTCAACCCAAAGCTTTCGTGCCGACTCTGGTGTCGCTCGGGCCATACCACCACGGAAAGCCACACCTCGGTTCGATGGAGATGGCAAAAT
TGGACTATGTGATGACATTGTTGGAAGAACTGCGGGGATCGTACAACACGATTTGGGATGAGAGCAATGAAGGAATAGAAAGACCCTTGGCGCACTTGATGATTTGGGAT
AAGAGCACAAAAGGAATAGAGAGATTGTTGAAGCTCATGGTTGTGGATGGTTGTTTTGTTATTGAATTCTTGAACAATTGTCCTCCATGGCTTCAAAATATGAGATCTGA
AATCAAGCGCGACATGTTAATGCTTGAGAATCAGCTGCCCATGAAGCTTCTTGAGAAGCTATGTTCCATAGCTAACAAGAGCCGAATAACCCTTCAGACAGTCCATTTGC
ATTTTAGCCATCACAAGGAGCCCATCTCAGAATATGTGGAAGCAAGAGAATACTTGCACATTTTAGACATGTACAGGGCAGTATTATTGTATCCAAGGAGTGAGAGTGAT
AGTCGAAACTTCATCGACCAATCTGACCCAGAGTTTCAAACCATACGACACGCAACACAGCTTCGCGAAGCTGGGATCATTTTCAAGATGAGCGAAACCAAGAGCTTCAC
CGACGTATCGTTCGAACGCAGCACGTTGAAGCTCCCGTGTGTGGTATTGGACTATAGCACGGAGTCGGCTTTATTAAACGTGATGGCATTTGAGAAACTCCACATCGAAA
CTCACCACAAAGTCACATCGTTCATAGCCTTCATGGACAATCTCATAGACGTGGACCAAGATGTGGCATTGTTAGCCTCCAAAGGAATCATCCACAATGCGCTCGGCGAC
GACCGAGCCGCGGCTAACTTGTTCAATCTACTGGCTAAAGGAATTACTACATATAGCTGGCACATGGCTAAGGTAAACATGATGGTGAATAAACATTGCAACAAATCGTG
GAATAATTGGTGTGCAAGTCTCAAACATACCTATTTTCAAAACCCATGGACAACCATCTCTATCTTTGCTGCTATTTTTGGTTTTGCCATCCTAATGCTCCAAGCCATCT
ACCAAGTCTGTGATTATTACAAGGTTTCGTTTTGA
Protein sequence
MSTQELVERLPVKDVSEYLTHFHSHRNQIFKHSIYEIPNFIKEVQPKAFVPTLVSLGPYHHGKPHLGSMEMAKLDYVMTLLEELRGSYNTIWDESNEGIERPLAHLMIWD
KSTKGIERLLKLMVVDGCFVIEFLNNCPPWLQNMRSEIKRDMLMLENQLPMKLLEKLCSIANKSRITLQTVHLHFSHHKEPISEYVEAREYLHILDMYRAVLLYPRSESD
SRNFIDQSDPEFQTIRHATQLREAGIIFKMSETKSFTDVSFERSTLKLPCVVLDYSTESALLNVMAFEKLHIETHHKVTSFIAFMDNLIDVDQDVALLASKGIIHNALGD
DRAAANLFNLLAKGITTYSWHMAKVNMMVNKHCNKSWNNWCASLKHTYFQNPWTTISIFAAIFGFAILMLQAIYQVCDYYKVSF
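
The protein sequence corresponds to a codon-by-codon translation of the CDS. A minimal sketch of that check follows, assuming the standard nuclear genetic code (NCBI translation table 1); the `translate` helper is illustrative rather than the database's own tooling, and only the first and last codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code in NCBI order: bases T, C, A, G with the third
# codon position varying fastest; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight and last six codons of the CDS listed above.
print(translate("ATGTCAACCCAGGAATTGGTGGAG"))
print(translate("TACAAGGTTTCGTTTTGA"))
```

The two fragments translate to `MSTQELVE` and `YKVSF`, matching the start and end of the listed protein sequence.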