CuGenDBv2

Sed0013590 (gene) of Chayote v1 genome

Gene ID: Sed0013590
Organism: Sechium edule (Chayote v1)
Description: Plant protein of unknown function (DUF247)
Genome location: LG09:3200253..3203857
RNA-Seq Expression: Sed0013590
Synteny: Sed0013590
Gene Ontology terms: NA
InterPro domains: IPR004158 - Protein of unknown function DUF247, plant
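The genome location field follows a `sequence:start..end` pattern. As a minimal sketch, the snippet below parses such a string and derives the span length. The names `GenomeLocation` and `parse_location` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'LG09:3200253..3203857'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return GenomeLocation(seqid, int(start), int(end))


loc = parse_location("LG09:3200253..3203857")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 3605 bp on LG09.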


Homology
GenBank top hits (e-value, % identity, alignment)
KAG7025238.1 UPF0481 protein, partial [Cucurbita argyrosperma subsp. argyrosperma] (e-value: 1.6e-97; identity: 51.37%)
Query:  SIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKSAK
        SIY+IP F+ +T PKA+EP +VSLGPY+HGK HL  MEL KL+LFH F   C L+VE IV  V  +L+EL  SYD + ++  +                 
Subjt:  SIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKSAK

Query:  GIERFLKLMILDGCFVIEFLENCPPWLQNMTYEITRDMLMLENQLPMKLLDKLCSMANRKRITITVVQTHFIHNKKQSISKDVEAQRHLHILDMYRTLLL
           +FL+LMI+DGCF++ FL NCP  L N++ +I +DML+LENQLPM LL KL S+A+R  +   V  +++++  +  + KD      LHIL+MY+  LL
Subjt:  GIERFLKLMILDGCFVIEFLENCPPWLQNMTYEITRDMLMLENQLPMKLLDKLCSMANRKRITITVVQTHFIHNKKQSISKDVEAQRHLHILDMYRTLLL

Query:  YP--KSGDGQNFVNQSNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFE--GSTLRLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMNNIID
        +P     D    ++ S+PE Q+IP AT+LR AGIKFKRS T S TDV F+  G  L LP ++VDD TES LLNVMAFEKLH+E    VTSFV  M+N+ID
Subjt:  YP--KSGDGQNFVNQSNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFE--GSTLRLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMNNIID

Query:  EDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGITT-YTVHMGEVNKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIYQIGDY
        ++RDVA+LA + ++ N +G+D+ AA LFN L +G       HM  V+K VNEHCN+ WN+ CA+L H YFQSPWTIIS+ A IFGF ILILQAIYQ  DY
Subjt:  EDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGITT-YTVHMGEVNKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIYQIGDY

Query:  Y
        Y
Subjt:  Y

XP_022960454.1 UPF0481 protein At3g47200-like [Cucurbita moschata] (e-value: 3.6e-97; identity: 51.11%)
Query:  SIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKSAK
        SIY+IP F+ +T PKA+EP +VSLGPY+HGK HL  MEL KL+LFH F   C L+VE IV  V  +L+EL  SYD + ++                K   
Subjt:  SIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKSAK

Query:  GIERFLKLMILDGCFVIEFLENCPPWLQNMTYEITRDMLMLENQLPMKLLDKLCSMANRK----RITITVVQTHFIHNKKQSISKDVEAQRHLHILDMYR
        G  +FL+LMI+DGCF++ FL NCP  L N++ +I +DML+LENQLPM LL+KL S+A+R     +  +  + +++++  +  + KD      LHIL+MY+
Subjt:  GIERFLKLMILDGCFVIEFLENCPPWLQNMTYEITRDMLMLENQLPMKLLDKLCSMANRK----RITITVVQTHFIHNKKQSISKDVEAQRHLHILDMYR

Query:  TLLLYP--KSGDGQNFVNQSNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFE--GSTLRLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMN
          LL+P     D    +  S+PE Q+IP AT+LR AGIKFKRSKT S TDV F+  G  L LP ++VDD TES LLNVMAFEKLH++    VTSFV  M+
Subjt:  TLLLYP--KSGDGQNFVNQSNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFE--GSTLRLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMN

Query:  NIIDEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGITT-YTVHMGEVNKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIYQ
        N+ID++RDVA+LA + ++ N +G+D+ AA LFN L +G       HM  V+K VNEHCN+ WN+ CA+L H YFQSPWTIIS+ A IFGF ILILQAIYQ
Subjt:  NIIDEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGITT-YTVHMGEVNKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIYQ

Query:  IGDYY
          DYY
Subjt:  IGDYY

XP_023513986.1 UPF0481 protein At3g47200-like isoform X1 [Cucurbita pepo subsp. pepo] (e-value: 1.6e-97; identity: 51.11%)
Query:  SIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKSAK
        SIY+IP F+ +T PKA+EP +VSLGPY+HGK HL  MEL KL+LF+ F   C L+VE IV+ V  +L+EL  SYD + +E T+                 
Subjt:  SIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKSAK

Query:  GIERFLKLMILDGCFVIEFLENCPPWLQNMTYEITRDMLMLENQLPMKLLDKLCSMANRK----RITITVVQTHFIHNKKQSISKDVEAQRHLHILDMYR
           +FL+LMI+DGCF++ FL NCP  L N++ +I +DML+LENQLPM LL+KL S+A R     +  +  + +++++  +  + KD      LHIL+MY+
Subjt:  GIERFLKLMILDGCFVIEFLENCPPWLQNMTYEITRDMLMLENQLPMKLLDKLCSMANRK----RITITVVQTHFIHNKKQSISKDVEAQRHLHILDMYR

Query:  TLLLYP--KSGDGQNFVNQSNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFE--GSTLRLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMN
          LL+P     D    ++ S+PE Q+IP AT+LR AGIKFKRSKTDS TDV F+  G  L LP ++VDD TES LLNVMAFEKLH++    VTSFV  M+
Subjt:  TLLLYP--KSGDGQNFVNQSNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFE--GSTLRLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMN

Query:  NIIDEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGITT-YTVHMGEVNKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIYQ
        N+ID++RDVA+LA + ++ N +G+D+ AA LFN L +G       HM  V+K VNEHCN+ WN+ CA+L H YFQSPWTIIS+ A IFGF ILILQAIYQ
Subjt:  NIIDEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGITT-YTVHMGEVNKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIYQ

Query:  IGDYY
          DYY
Subjt:  IGDYY

XP_023513987.1 UPF0481 protein At3g47200-like isoform X2 [Cucurbita pepo subsp. pepo] (e-value: 2.8e-97; identity: 51.24%)
Query:  SIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKSAK
        SIY+IP F+ +T PKA+EP +VSLGPY+HGK HL  MEL KL+LF+ F   C L+VE IV+ V  +L+EL  SYD + +E T+                 
Subjt:  SIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKSAK

Query:  GIERFLKLMILDGCFVIEFLENCPPWLQNMTYEITRDMLMLENQLPMKLLDKLCSMANRK---RITITVVQTHFIHNKKQSISKDVEAQRHLHILDMYRT
           +FL+LMI+DGCF++ FL NCP  L N++ +I +DML+LENQLPM LL+KL S+A R       +  + +++++  +  + KD      LHIL+MY+ 
Subjt:  GIERFLKLMILDGCFVIEFLENCPPWLQNMTYEITRDMLMLENQLPMKLLDKLCSMANRK---RITITVVQTHFIHNKKQSISKDVEAQRHLHILDMYRT

Query:  LLLYP--KSGDGQNFVNQSNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFE--GSTLRLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMNN
         LL+P     D    ++ S+PE Q+IP AT+LR AGIKFKRSKTDS TDV F+  G  L LP ++VDD TES LLNVMAFEKLH++    VTSFV  M+N
Subjt:  LLLYP--KSGDGQNFVNQSNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFE--GSTLRLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMNN

Query:  IIDEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGITT-YTVHMGEVNKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIYQI
        +ID++RDVA+LA + ++ N +G+D+ AA LFN L +G       HM  V+K VNEHCN+ WN+ CA+L H YFQSPWTIIS+ A IFGF ILILQAIYQ 
Subjt:  IIDEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGITT-YTVHMGEVNKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIYQI

Query:  GDYY
         DYY
Subjt:  GDYY

XP_038875622.1 UPF0481 protein At3g47200-like [Benincasa hispida] (e-value: 2.8e-97; identity: 50.99%)
Query:  SIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDE-STDGIERSSSHAMILDKSA
        SIY+IP F++K   KAFEP LVSLGPYHHGK HL SME  K + F +F+   GL +E IVES+   LE L G+YD + ++   DG               
Subjt:  SIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDE-STDGIERSSSHAMILDKSA

Query:  KGIERFLKLMILDGCFVIEFLENCPPWLQNMTYEITRDMLMLENQLPMKLLDKLCSMANRKRITITVVQTHFIHNKKQSISKDVEAQRHLHILDMYRTLL
            +FL++MI+DGCF+++F + CP  L  M ++I RDML+LENQLPM+LL +L    N             +  +   I+++    R LHILDMYR  L
Subjt:  KGIERFLKLMILDGCFVIEFLENCPPWLQNMTYEITRDMLMLENQLPMKLLDKLCSMANRKRITITVVQTHFIHNKKQSISKDVEAQRHLHILDMYRTLL

Query:  LYP------KSGDGQNFVNQSNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSF--EGSTLRLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFM
        LYP      +SG  +    QS+PE QIIP ATQL  AGIKFK S T + TDVSF  +   L LP +VVDD TE+ LLNVMAFEKL++E    VTSFV  M
Subjt:  LYP------KSGDGQNFVNQSNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSF--EGSTLRLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFM

Query:  NNIIDEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGIT-TYTVHMGEVNKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIY
        NN+ID DRDVALLAS  I+ N LG+D +AA+LF+LL  G       H+ +V+  VN+HC+ SWN+WCASL H YFQ+PW IIS+FA +FGFAILI+QAIY
Subjt:  NNIIDEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGIT-TYTVHMGEVNKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIY

Query:  QIGDYY
        QI DY+
Subjt:  QIGDYY

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1D5C0 UPF0481 protein At3g47200-like (e-value: 1.4e-86; identity: 48.43%)
Query:  SIYEIPNFIKKTLPKA-FEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKSA
        SIY+IP+F+++  PKA FEP LVS GPYHHG+ HL  MEL K + F  F    GL VE IVE V  +LE+++G YD +  E                K  
Subjt:  SIYEIPNFIKKTLPKA-FEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKSA

Query:  KGIERFLKLMILDGCFVIE-FLENCPPWLQNMTYEITRDMLMLENQLPMKLLDKLCSMANR---KRITITVVQTHFIHNKKQSISKDVEAQRHLHILDMY
         G  +FL+LM+LDGCF++E  L +   WL NM  +I RDML+LENQLPMKLL++L SMAN    K +   V     I +K +    D     +LHIL+MY
Subjt:  KGIERFLKLMILDGCFVIE-FLENCPPWLQNMTYEITRDMLMLENQLPMKLLDKLCSMANR---KRITITVVQTHFIHNKKQSISKDVEAQRHLHILDMY

Query:  RTLLLYPK-------SGDGQNFVNQSNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFE--GSTLRLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTS
           LL PK        G  +    + + E+QII  AT+L  AGI+F+RS++ S TDV F+     L+LP +VVDD TES  LNVMAFEKLH E    VT 
Subjt:  RTLLLYPK-------SGDGQNFVNQSNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFE--GSTLRLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTS

Query:  FVAFMNNIIDEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGIT---TYTVHMGEVNKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFGFAI
        F+  MNN+ID D+DVALLAS  II+N LG+D+ AA LF  L+ G         H+  V +MV EHC K  +KWCASL H YFQ PW I+S+ A   GF I
Subjt:  FVAFMNNIIDEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGIT---TYTVHMGEVNKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFGFAI

Query:  LILQAIYQIGDYY
        LILQA+YQI DYY
Subjt:  LILQAIYQIGDYY

A0A6J1H6V9 UPF0481 protein At3g47200-like (e-value: 3.6e-82; identity: 46.76%)
Query:  KHSIYEIPNFIKKT----LPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMI
        KHSIY++P F+++T      KAF+P +VS GPYHHGK HL  ME  K + F+  L T GL VE IV  V  +L++L  SYD + +E T            
Subjt:  KHSIYEIPNFIKKT----LPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMI

Query:  LDKSAKGIERFLKLMILDGCFVIEFLENCPPWLQNMTYEITRDMLMLENQLPMKLLDKLCSMANRKRITITVVQ---THFIHNKKQSISKDVEAQRHLHI
             +   +FLKLMI+DGCF++    +CP  L NM  +I  + L+LENQLP+KLL KL S+A    I+    +      I N      KD+     LHI
Subjt:  LDKSAKGIERFLKLMILDGCFVIEFLENCPPWLQNMTYEITRDMLMLENQLPMKLLDKLCSMANRKRITITVVQ---THFIHNKKQSISKDVEAQRHLHI

Query:  LDMYRTLLLYPKSGDGQNFVNQ---SNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFE--GSTLRLPCVVVDDGTESALLNVMAFEKLHI-----EIH
        LD+Y+  LL P   D   +  +   S  E Q+IP AT+L  AGIKFK S+T S  DV F+     L LP ++VDD TES  LNVMAFEKLH+     E  
Subjt:  LDMYRTLLLYPKSGDGQNFVNQ---SNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFE--GSTLRLPCVVVDDGTESALLNVMAFEKLHI-----EIH

Query:  HLVTSFVAFMNNIIDEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGITT-YTVHMGEVNKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFG
         L+TSFV  M+N+ID++RDVALL+SKG + N LG+DR AA LF+ L  G+      HM  V+KM+N++C + WN+ CA+L H YFQ+PWT+IS+ A IFG
Subjt:  HLVTSFVAFMNNIIDEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGITT-YTVHMGEVNKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFG

Query:  FAILILQAIYQIGDYYE
        F ILILQAIYQ+ DYY+
Subjt:  FAILILQAIYQIGDYYE

A0A6J1HB25 UPF0481 protein At3g47200-like (e-value: 1.8e-97; identity: 51.11%)
Query:  SIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKSAK
        SIY+IP F+ +T PKA+EP +VSLGPY+HGK HL  MEL KL+LFH F   C L+VE IV  V  +L+EL  SYD + ++                K   
Subjt:  SIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKSAK

Query:  GIERFLKLMILDGCFVIEFLENCPPWLQNMTYEITRDMLMLENQLPMKLLDKLCSMANRK----RITITVVQTHFIHNKKQSISKDVEAQRHLHILDMYR
        G  +FL+LMI+DGCF++ FL NCP  L N++ +I +DML+LENQLPM LL+KL S+A+R     +  +  + +++++  +  + KD      LHIL+MY+
Subjt:  GIERFLKLMILDGCFVIEFLENCPPWLQNMTYEITRDMLMLENQLPMKLLDKLCSMANRK----RITITVVQTHFIHNKKQSISKDVEAQRHLHILDMYR

Query:  TLLLYP--KSGDGQNFVNQSNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFE--GSTLRLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMN
          LL+P     D    +  S+PE Q+IP AT+LR AGIKFKRSKT S TDV F+  G  L LP ++VDD TES LLNVMAFEKLH++    VTSFV  M+
Subjt:  TLLLYP--KSGDGQNFVNQSNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFE--GSTLRLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMN

Query:  NIIDEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGITT-YTVHMGEVNKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIYQ
        N+ID++RDVA+LA + ++ N +G+D+ AA LFN L +G       HM  V+K VNEHCN+ WN+ CA+L H YFQSPWTIIS+ A IFGF ILILQAIYQ
Subjt:  NIIDEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGITT-YTVHMGEVNKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIYQ

Query:  IGDYY
          DYY
Subjt:  IGDYY

A0A6J1KVQ6 UPF0481 protein At3g47200-like isoform X2 (e-value: 5.3e-94; identity: 51%)
Query:  SIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKSAK
        SIY+IP F+ +T PKA+EP +VSLGPY+HGK HL  MEL KL+LFH F   C L+VE IV  V  +L+EL  SYD + +E T                 +
Subjt:  SIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKSAK

Query:  GIERFLKLMILDGCFVIEFLENCPPWLQNMTYEITRDMLMLENQLPMKLLDKLCSMANRKRITITVVQ-THFIHNKKQSISKDVEAQRHLHILDMYRTLL
           +FL+LMI+DGCF++ FL +CP  L N++ +I +DML+LENQLPM LL+KL S+A R    + + Q    +  K  SI ++   +  LHIL+MY+  L
Subjt:  GIERFLKLMILDGCFVIEFLENCPPWLQNMTYEITRDMLMLENQLPMKLLDKLCSMANRKRITITVVQ-THFIHNKKQSISKDVEAQRHLHILDMYRTLL

Query:  LYP--KSGDGQNFVNQSNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFE--GSTLRLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMNNII
        LYP     D    ++ S+PE Q+IP AT+L  AGIKFKRSKT+S  DV F+     L LP ++VDD TES +LNVMAFEKLH++    VTSFV  M+N+I
Subjt:  LYP--KSGDGQNFVNQSNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFE--GSTLRLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMNNII

Query:  DEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGITT-YTVHMGEVNKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIYQIGD
        D++RDVA+LA + I+ N +G+D+ AA LF+ L +G       HM  V+KMVN HCN+ WN+ CA+L H YFQSPWTIIS+ A IFGF ILILQAIYQ  D
Subjt:  DEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGITT-YTVHMGEVNKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIYQIGD

Query:  YY
        YY
Subjt:  YY

A0A6J1KYV8 UPF0481 protein At3g47200-like isoform X1 (e-value: 8.2e-95; identity: 50.87%)
Query:  SIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKSAK
        SIY+IP F+ +T PKA+EP +VSLGPY+HGK HL  MEL KL+LFH F   C L+VE IV  V  +L+EL  SYD + +E T                 +
Subjt:  SIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKSAK

Query:  GIERFLKLMILDGCFVIEFLENCPPWLQNMTYEITRDMLMLENQLPMKLLDKLCSMANRKRITITVVQTHFIHNKKQSISKDVEAQRHLHILDMYRTLLL
           +FL+LMI+DGCF++ FL +CP  L N++ +I +DML+LENQLPM LL+KL S+A R  + +       +  K  SI ++   +  LHIL+MY+  LL
Subjt:  GIERFLKLMILDGCFVIEFLENCPPWLQNMTYEITRDMLMLENQLPMKLLDKLCSMANRKRITITVVQTHFIHNKKQSISKDVEAQRHLHILDMYRTLLL

Query:  YP--KSGDGQNFVNQSNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFE--GSTLRLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMNNIID
        YP     D    ++ S+PE Q+IP AT+L  AGIKFKRSKT+S  DV F+     L LP ++VDD TES +LNVMAFEKLH++    VTSFV  M+N+ID
Subjt:  YP--KSGDGQNFVNQSNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFE--GSTLRLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMNNIID

Query:  EDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGITT-YTVHMGEVNKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIYQIGDY
        ++RDVA+LA + I+ N +G+D+ AA LF+ L +G       HM  V+KMVN HCN+ WN+ CA+L H YFQSPWTIIS+ A IFGF ILILQAIYQ  DY
Subjt:  EDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGITT-YTVHMGEVNKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIYQIGDY

Query:  Y
        Y
Subjt:  Y

SwissProt top hits (e-value, % identity, alignment)
P0C897 Putative UPF0481 protein At3g02645 (e-value: 2.2e-12; identity: 27.06%)
Query:  IPHATQLRGAGIKFKRSKTDSFTDVSFEGST--LRLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMNNIIDEDRDVALLASKGIIVNGLGDDR
        IP  + L  AG++FK +   + + V+F+ ++    LP + +D  TE+ L N++A+E  +     + T +   +N IID + DV LL  +G++V+ L  D+
Subjt:  IPHATQLRGAGIKFKRSKTDSFTDVSFEGST--LRLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMNNIIDEDRDVALLASKGIIVNGLGDDR

Query:  AAANLFNLLSNGI-TTYTVHMGEVNKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFGFAILILQ
         AA ++N +S  +  T    + +  + VN +    W      L   Y    W I++  A +    ++ LQ
Subjt:  AAANLFNLLSNGI-TTYTVHMGEVNKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFGFAILILQ

Q9SD53 UPF0481 protein At3g47200 (e-value: 4.5e-34; identity: 27.98%)
Query:  IYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTC---GLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKS
        I+ +P       PKA++P +VS+GPYH+G+ HL  ++  K  L   FL       +E   +V++V+ L +++R SY           E  + H ++    
Subjt:  IYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTC---GLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKS

Query:  AKGIERFLKLMILDGCFV----------IEFLEN---CPPWLQNMTYEITRDMLMLENQLPMKLLDKLCSMANRKRIT--ITVVQTHFIHN---KKQSIS
                 +M+LDGCF+          IE  E+     PWL +    I  D+L+LENQ+P  +L  L  + ++  ++  +  +  HF  N   K+ S  
Subjt:  AKGIERFLKLMILDGCFV----------IEFLEN---CPPWLQNMTYEITRDMLMLENQLPMKLLDKLCSMANRKRIT--ITVVQTHFIHN---KKQSIS

Query:  KDVEAQRHLHILDMYRTLLLYPKS----------------GDGQNFVNQSNPELQIIPHATQLRGAGIKF--KRSKTDSFTDVSFEGSTLRLPCVVVDDG
        +     +  H+LD+ R   L   S                G   N  +  +  + +I  A +LR  GIKF  +RSK DS  +V  + + L++P +  D  
Subjt:  KDVEAQRHLHILDMYRTLLLYPKS----------------GDGQNFVNQSNPELQIIPHATQLRGAGIKF--KRSKTDSFTDVSFEGSTLRLPCVVVDDG

Query:  TESALLNVMAFEKLHIEIHHLVTSFVAFMNNIIDEDRDVALLAS-KGIIVNGLGDDRAAANLFNLLSNGIT--TYTVHMGEVNKMVNEHCNKSWNKWCAS
          S  LN +AFE+ + +  + +T+++ FM  +++ + DV  L + K II N  G +   +  F  +S  +     T ++  V K VNE+  K +N   A 
Subjt:  TESALLNVMAFEKLHIEIHHLVTSFVAFMNNIIDEDRDVALLAS-KGIIVNGLGDDRAAANLFNLLSNGIT--TYTVHMGEVNKMVNEHCNKSWNKWCAS

Query:  LTHTYFQSPWTIISVFATIFGFAILILQAIYQIGDY
          HT+F+SPWT +S  A +F   + +LQ+   I  Y
Subjt:  LTHTYFQSPWTIISVFATIFGFAILILQAIYQIGDY

Arabidopsis top hits (e-value, % identity, alignment)
AT3G50120.1 Plant protein of unknown function (DUF247) (e-value: 2.1e-50; identity: 29.98%)
Query:  KHSIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKS
        K  IY +P ++++   K++ P  VSLGPYHHGK  L SM+  K    ++ L      ++  ++++ +L E+ R  Y+     S++               
Subjt:  KHSIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKS

Query:  AKGIERFLKLMILDGCFVIE-------------FLENCPPW-LQNMTYEITRDMLMLENQLPMKLLDKLCSMANRKRITITVVQTHFIH-----------
              F+++++LDGCFV+E             +  N P + ++   + I RDM+MLENQLP+ +L++L  +    R    +V    I            
Subjt:  AKGIERFLKLMILDGCFVIE-------------FLENCPPW-LQNMTYEITRDMLMLENQLPMKLLDKLCSMANRKRITITVVQTHFIH-----------

Query:  -------NKKQSISKD-----VEAQRHLHILDMYRTLLL--YPKSGD-------GQNFVNQSNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFEGSTL
                 + S+++D           LH LD++R  LL   PK           +N         Q+I   T+L+ AGIKF+R KTD F D+ F+   L
Subjt:  -------NKKQSISKD-----VEAQRHLHILDMYRTLLL--YPKSGD-------GQNFVNQSNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFEGSTL

Query:  RLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMNNIIDEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGIT--TYTVHMGEVNKMVNEHCN
         +P +++ DGT+S  LN++AFE+ HI+  + +TS++ FM+N+ID   DV+ L   GII + LG D   A+LFN L   +   T   ++  ++  VN + +
Subjt:  RLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMNNIIDEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGIT--TYTVHMGEVNKMVNEHCN

Query:  KSWNKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIYQIGDYYE
          WN W A+L H YF +PW I+S  A +    +   Q+ Y +  YY+
Subjt:  KSWNKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIYQIGDYYE

AT3G50130.1 Plant protein of unknown function (DUF247) (e-value: 5.7e-48; identity: 27.99%)
Query:  KHSIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKS
        K  IY +P ++++   K++ P  VSLGP+HHG  HL  M+  K    +  +     ++E  ++++ +L +  R  Y+   D S++               
Subjt:  KHSIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKS

Query:  AKGIERFLKLMILDGCFVIEFLENCPPWLQNMTYE--------------ITRDMLMLENQLPMKLLDKLCSMANRKRITITVVQTHFIH-----------
             +F ++++LDGCFV+E           + Y+              I RDM+MLENQLP+ +L++L  +   KR    +V    +            
Subjt:  AKGIERFLKLMILDGCFVIEFLENCPPWLQNMTYE--------------ITRDMLMLENQLPMKLLDKLCSMANRKRITITVVQTHFIH-----------

Query:  --NKKQSISKDV-------EAQRHLHILDMYRTLLLYPKSGDGQNFVNQ---------SNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFEGSTLRLP
              S+ +D        + +  LH LD++R  LL P S                     + Q+I   T+LR AGIKF+  KTD F D+ F+   L +P
Subjt:  --NKKQSISKDV-------EAQRHLHILDMYRTLLLYPKSGDGQNFVNQ---------SNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFEGSTLRLP

Query:  CVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMNNIIDEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGIT--TYTVHMGEVNKMVNEHCNKSW
         +++ DGT+S   N++AFE+ HI+  + +TS++ FM+N+ID   DV  L   GII + LG+D   A+LFN L   +       ++ +++  V+ + ++ W
Subjt:  CVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMNNIIDEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGIT--TYTVHMGEVNKMVNEHCNKSW

Query:  NKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIYQIGDYY
        N   A L H YF +PW   S FA +    + + Q+ +    Y+
Subjt:  NKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIYQIGDYY

AT3G50150.1 Plant protein of unknown function (DUF247) (e-value: 1.3e-49; identity: 27.85%)
Query:  VKDVSEYLTNFQPRKNQCFKHSIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICD
        +KD  E   ++    N   K  IY +P ++++   K++ P  VS+GPYHHGK HL  ME  K    +  +      +E  ++++ +L EE R  Y    D
Subjt:  VKDVSEYLTNFQPRKNQCFKHSIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICD

Query:  ESTDGIERSSSHAMILDKSAKGIERFLKLMILDGCFVIEFLENCPPWLQNMTY--------------EITRDMLMLENQLPMKLLDKLCSMAN-------
                            K    F ++++LDGCFV+E  +      Q + Y               I RDM+MLENQLP+ +LD+L  +         
Subjt:  ESTDGIERSSSHAMILDKSAKGIERFLKLMILDGCFVIEFLENCPPWLQNMTY--------------EITRDMLMLENQLPMKLLDKLCSMAN-------

Query:  -----RKRITITVVQTHFIHNK------KQSISKDVEAQRHLHILDMYRTLLLYPKSGDGQNF----VNQSNPELQIIPHATQLRGAGIKFKRSKTDSFT
               R   T++ T  +  K       Q  S ++     LH LD++   L+       Q      ++    + Q+I   T+LRGAG+ F R +T    
Subjt:  -----RKRITITVVQTHFIHNK------KQSISKDVEAQRHLHILDMYRTLLLYPKSGDGQNF----VNQSNPELQIIPHATQLRGAGIKFKRSKTDSFT

Query:  DVSFEGSTLRLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMNNIIDEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGI--TTYTVHMGEV
        D+ F+   L++P +++ DGT+S   N++AFE+ H +  + +TS++ FM+N+I+  +DV+ L   GII + LG D   A+LFN L   +       ++ ++
Subjt:  DVSFEGSTLRLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMNNIIDEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGI--TTYTVHMGEV

Query:  NKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIYQIGDYYE
        ++ VN + ++ WN   A+L   YF +PW   S  A +    +   Q+ + +  YY+
Subjt:  NKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIYQIGDYYE

AT3G50170.1 Plant protein of unknown function (DUF247) (e-value: 1.2e-50; identity: 30%)
Query:  KHSIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKS
        K  IY +P+++++   K++ P  VSLGPYHHGK  L  ME  K    ++ L      +E    ++ +L E+ R      C E    + R+          
Subjt:  KHSIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKS

Query:  AKGIERFLKLMILDGCFVIE-------------FLENCPPW-LQNMTYEITRDMLMLENQLPMKLLDKLCSM----ANRKRITITVVQTHF---------
              F ++++LDGCFV+E             +  N P + ++ + + I RDM+MLENQLP+ +LD+L  +     N+  I   V    F         
Subjt:  AKGIERFLKLMILDGCFVIE-------------FLENCPPW-LQNMTYEITRDMLMLENQLPMKLLDKLCSM----ANRKRITITVVQTHF---------

Query:  IHNKKQS-----ISKDVEA---QRHLHILDMYRTLLLYPKSGDG---------QNFVNQSNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFEGSTLRL
        +    QS     + K ++    +  LH LD++R  LL                +N       + Q++   T+LR AG+KF++ KTD F D+ F+   L +
Subjt:  IHNKKQS-----ISKDVEA---QRHLHILDMYRTLLLYPKSGDG---------QNFVNQSNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFEGSTLRL

Query:  PCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMNNIIDEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGIT--TYTVHMGEVNKMVNEHCNKS
        P +++ DGT+S   N++AFE+ HIE  + +TS++ FM+N+I+   DV+ L   GII + LG D   A+LFN L   +       H+  ++  VN + N+ 
Subjt:  PCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMNNIIDEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGIT--TYTVHMGEVNKMVNEHCNKS

Query:  WNKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIYQIGDYYELRLKI
        WN   A+LTH YF +PW   S  A +    + + Q+ Y +  YY+   K+
Subjt:  WNKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIYQIGDYYELRLKI

AT3G50180.1 Plant protein of unknown function (DUF247) (e-value: 1.8e-49; identity: 28.74%)
Query:  KHSIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKS
        K  IY++P+++     K++ P  VSLGPYHHG+    SME  K    +  L      +E  ++++++L E+ R  Y+                +++L  +
Subjt:  KHSIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMICDESTDGIERSSSHAMILDKS

Query:  AKGIERFLKLMILDGCFVIEFLENCPPWLQNMTYE--------------ITRDMLMLENQLPMKLLDKLCSM---ANRKRITITVVQTHFI------HNK
              F ++++LDGCF++E L+        + Y+              I RDM+MLENQLP+ +L++L  +      +   + +V   FI         
Subjt:  AKGIERFLKLMILDGCFVIEFLENCPPWLQNMTYE--------------ITRDMLMLENQLPMKLLDKLCSM---ANRKRITITVVQTHFI------HNK

Query:  KQSISKDVEAQRHLHILDMYRTLLLYPKSGDGQNFVNQSNPELQ-IIPHATQLRGAGIKFKRSKTDSFTDVSFEGSTLRLPCVVVDDGTESALLNVMAFE
         ++      +   LH LD++   LL+P+S    N+   ++  LQ +IP  T+LR AG KFK +KTD F D+ F    L +P +++ DGT+S  LN++AFE
Subjt:  KQSISKDVEAQRHLHILDMYRTLLLYPKSGDGQNFVNQSNPELQ-IIPHATQLRGAGIKFKRSKTDSFTDVSFEGSTLRLPCVVVDDGTESALLNVMAFE

Query:  KLHIEIHHLVTSFVAFMNNIIDEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGIT--TYTVHMG----EVNKMVNEHCNKSWNKWCASLTHTYFQSP
        + HIE  + +TS++ FM+N+ID   D++ L   GII + LG +   A++FN L   +   T  +++     EV++   ++ ++  N    +L   Y  +P
Subjt:  KLHIEIHHLVTSFVAFMNNIIDEDRDVALLASKGIIVNGLGDDRAAANLFNLLSNGIT--TYTVHMG----EVNKMVNEHCNKSWNKWCASLTHTYFQSP

Query:  WTIISVFATIFGFAILILQAIYQIGDYY
        W  +S FA +    +   Q+ +    Y+
Subjt:  WTIISVFATIFGFAILILQAIYQIGDYY


Sequences
CDS sequence
ATGTCAATCCAGGAATTTGCAGATCAATTGCTGGTCAAAGATGTGAGTGAATATCTCACTAATTTCCAGCCAAGAAAAAACCAATGTTTCAAACATTCAATTTATGAAAT
ACCAAACTTCATCAAAAAAACCCTACCCAAAGCTTTCGAGCCAACGCTCGTGTCGCTCGGGCCATACCACCACGGAAAGCCACACCTCGGTTCGATGGAGCTAGCGAAAT
TGGAATTGTTTCACCAATTTCTTGGCACTTGTGGGCTCGAAGTCGAGTTCATTGTTGAAAGTGTGATGAAGTTGTTGGAAGAACTGCGAGGATCGTACGACATGATTTGC
GATGAGAGCACGGACGGAATAGAGAGATCGTCGTCACACGCCATGATTTTGGATAAGAGCGCGAAAGGAATAGAGCGATTTTTGAAGCTCATGATTTTGGATGGATGTTT
TGTTATTGAGTTCTTGGAAAATTGTCCTCCATGGCTTCAAAATATGACATATGAAATCACGCGGGACATGTTAATGCTTGAGAATCAGCTGCCCATGAAGCTTCTTGACA
AACTATGTTCCATGGCTAATAGGAAACGAATAACCATTACGGTAGTCCAAACACATTTTATCCATAACAAAAAGCAATCAATCTCAAAAGATGTGGAAGCACAACGACAT
TTGCACATTTTAGACATGTACCGGACACTATTATTGTATCCTAAGAGCGGTGATGGTCAAAATTTTGTCAACCAATCTAACCCGGAGCTTCAAATCATACCACACGCAAC
ACAGCTTCGCGGAGCCGGGATCAAATTCAAAAGGAGCAAAACCGACAGCTTTACCGACGTGTCGTTCGAAGGCAGCACATTGAGGCTCCCATGCGTGGTAGTGGACGATG
GCACGGAGTCAGCTTTGTTAAACGTGATGGCATTTGAGAAACTCCACATTGAAATTCACCACTTAGTCACATCATTCGTAGCCTTCATGAACAATATCATAGACGAGGAC
CGAGATGTTGCGTTGCTAGCCTCGAAAGGAATCATCGTCAATGGGCTTGGCGACGATCGAGCAGCGGCTAACTTGTTCAATCTACTGTCTAACGGAATTACTACATATAC
AGTCCACATGGGTGAGGTGAACAAGATGGTGAATGAGCATTGCAACAAGTCATGGAATAAGTGGTGTGCAAGTCTCACACATACCTATTTTCAAAGTCCATGGACAATCA
TCTCTGTCTTTGCTACTATTTTTGGTTTTGCTATCCTAATCCTCCAAGCCATCTACCAAATCGGTGATTATTATGAGCTTCGCCTCAAAATTAATTAG
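A CDS listing like the one above can be sanity-checked before it is translated or compared with the protein sequence. The sketch below is illustrative only: `cds_problems` is a hypothetical helper, and it assumes the standard genetic code (stop codons TAA, TAG, TGA) and a sequence written 5' to 3' on the coding strand.

```python
from typing import List

# Stop codons of the standard genetic code (assumed here; not stated in the record).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_problems(seq: str) -> List[str]:
    """Return a list of structural problems found in a CDS; empty if none."""
    # Remove line breaks and other whitespace, normalise case.
    seq = "".join(seq.split()).upper()
    problems = []
    if len(seq) % 3 != 0:
        problems.append("length is not a multiple of 3")
    # Split into complete codons, ignoring any trailing partial codon.
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    if not codons or codons[0] != "ATG":
        problems.append("does not start with ATG")
    if not codons or codons[-1] not in STOP_CODONS:
        problems.append("does not end with a stop codon")
    if any(codon in STOP_CODONS for codon in codons[1:-1]):
        problems.append("internal stop codon")
    return problems


# Toy input: the first nine bases of the CDS above followed by its final codon.
print(cds_problems("ATGTCAATCTAG"))
```

To check the full record, join the CDS lines into one string and pass it to `cds_problems`; an empty list means none of these four checks failed.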
mRNA sequence
CAAATAGAGAGACTAAACAAAAATGTCAATCCAGGAATTTGCAGATCAATTGCTGGTCAAAGATGTGAGTGAATATCTCACTAATTTCCAGCCAAGAAAAAACCAATGTT
TCAAACATTCAATTTATGAAATACCAAACTTCATCAAAAAAACCCTACCCAAAGCTTTCGAGCCAACGCTCGTGTCGCTCGGGCCATACCACCACGGAAAGCCACACCTC
GGTTCGATGGAGCTAGCGAAATTGGAATTGTTTCACCAATTTCTTGGCACTTGTGGGCTCGAAGTCGAGTTCATTGTTGAAAGTGTGATGAAGTTGTTGGAAGAACTGCG
AGGATCGTACGACATGATTTGCGATGAGAGCACGGACGGAATAGAGAGATCGTCGTCACACGCCATGATTTTGGATAAGAGCGCGAAAGGAATAGAGCGATTTTTGAAGC
TCATGATTTTGGATGGATGTTTTGTTATTGAGTTCTTGGAAAATTGTCCTCCATGGCTTCAAAATATGACATATGAAATCACGCGGGACATGTTAATGCTTGAGAATCAG
CTGCCCATGAAGCTTCTTGACAAACTATGTTCCATGGCTAATAGGAAACGAATAACCATTACGGTAGTCCAAACACATTTTATCCATAACAAAAAGCAATCAATCTCAAA
AGATGTGGAAGCACAACGACATTTGCACATTTTAGACATGTACCGGACACTATTATTGTATCCTAAGAGCGGTGATGGTCAAAATTTTGTCAACCAATCTAACCCGGAGC
TTCAAATCATACCACACGCAACACAGCTTCGCGGAGCCGGGATCAAATTCAAAAGGAGCAAAACCGACAGCTTTACCGACGTGTCGTTCGAAGGCAGCACATTGAGGCTC
CCATGCGTGGTAGTGGACGATGGCACGGAGTCAGCTTTGTTAAACGTGATGGCATTTGAGAAACTCCACATTGAAATTCACCACTTAGTCACATCATTCGTAGCCTTCAT
GAACAATATCATAGACGAGGACCGAGATGTTGCGTTGCTAGCCTCGAAAGGAATCATCGTCAATGGGCTTGGCGACGATCGAGCAGCGGCTAACTTGTTCAATCTACTGT
CTAACGGAATTACTACATATACAGTCCACATGGGTGAGGTGAACAAGATGGTGAATGAGCATTGCAACAAGTCATGGAATAAGTGGTGTGCAAGTCTCACACATACCTAT
TTTCAAAGTCCATGGACAATCATCTCTGTCTTTGCTACTATTTTTGGTTTTGCTATCCTAATCCTCCAAGCCATCTACCAAATCGGTGATTATTATGAGCTTCGCCTCAA
AATTAATTAGTAGTGAAAATCGTAATTCGTGTATATTAATGTTGATGTGAGATCTTCAATCATGGTGCTCTAGAATAATGACTTCTTAAGTTGG
Protein sequence
MSIQEFADQLLVKDVSEYLTNFQPRKNQCFKHSIYEIPNFIKKTLPKAFEPTLVSLGPYHHGKPHLGSMELAKLELFHQFLGTCGLEVEFIVESVMKLLEELRGSYDMIC
DESTDGIERSSSHAMILDKSAKGIERFLKLMILDGCFVIEFLENCPPWLQNMTYEITRDMLMLENQLPMKLLDKLCSMANRKRITITVVQTHFIHNKKQSISKDVEAQRH
LHILDMYRTLLLYPKSGDGQNFVNQSNPELQIIPHATQLRGAGIKFKRSKTDSFTDVSFEGSTLRLPCVVVDDGTESALLNVMAFEKLHIEIHHLVTSFVAFMNNIIDED
RDVALLASKGIIVNGLGDDRAAANLFNLLSNGITTYTVHMGEVNKMVNEHCNKSWNKWCASLTHTYFQSPWTIISVFATIFGFAILILQAIYQIGDYYELRLKIN