; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0025652 (gene) of Chayote v1 genome

Gene IDSed0025652
OrganismSechium edule (Chayote v1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationLG09:3187519..3190309
RNA-Seq ExpressionSed0025652
SyntenySed0025652
Gene Ontology termsNA
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592831.1 UPF0481 protein, partial [Cucurbita argyrosperma subsp. sororia]4.6e-9249.26Show/hide
Query:  SIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKCAK
        SIY+IP F+ +    A+EP +VSLGPY+HGK HL  ME+ KL LF  F   C L+VE +V  V T+L+EL  SY  + ++ KE                 
Subjt:  SIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKCAK

Query:  GIERLLKLMVVDGCFVIQFLDNCPRWLQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTVHLHFRDHKEPNSKYV------LAEEYLHILGMY
           + L+LM+VDGCF++ FL NCP  L N+  +IK+D+L+LENQLPM LL KL SIA+++      V L  +D K+  S ++      + ++ LHIL MY
Subjt:  GIERLLKLMVVDGCFVIQFLDNCPRWLQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTVHLHFRDHKEPNSKYV------LAEEYLHILGMY

Query:  RAVLLHPRSD-TDSQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFE--GSTLKLPCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFM
        +  LLHP  D TD    ID SDP+ Q I  AT+LRE+GI FK S T S TDV F+  G  L LP ++++ +T+S LLNVMAFEKLH+E   KVTSF+  M
Subjt:  RAVLLHPRSD-TDSQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFE--GSTLKLPCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFM

Query:  DNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGITT-YSRHMADVNKMVNKHCKKSWNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIY
         NLID  +DVA+LA + ++ NA+G+D+ AA LFN L  G       HMA V+K VN+HC + WNE CA+LKH YFQ+PWTIIS+ AAIFGF +L+ Q+IY
Subjt:  DNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGITT-YSRHMADVNKMVNKHCKKSWNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIY

Query:  QFCDYY
        QF DYY
Subjt:  QFCDYY

KAG7025238.1 UPF0481 protein, partial [Cucurbita argyrosperma subsp. argyrosperma]3.5e-9249.01Show/hide
Query:  SIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKCAK
        SIY+IP F+ +  P A+EP +VSLGPY+HGK HL  ME+ KL LF  F   C L+VE +V  V T+L+EL  SY  + ++ KE                 
Subjt:  SIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKCAK

Query:  GIERLLKLMVVDGCFVIQFLDNCPRWLQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTVHLHFRDHKEPNSKYV------LAEEYLHILGMY
           + L+LM+VDGCF++ FL NCP  L N+  +IK+D+L+LENQLPM LL KL SIA+++            D K+  S ++      + ++ LHIL MY
Subjt:  GIERLLKLMVVDGCFVIQFLDNCPRWLQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTVHLHFRDHKEPNSKYV------LAEEYLHILGMY

Query:  RAVLLHPRSD-TDSQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFE--GSTLKLPCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFM
        +  LLHP  D TD    ID SDP+ Q I  AT+LRE+GI FK S T S TDV F+  G  L LP ++++ +T+S LLNVMAFEKLH+E   KVTSF+  M
Subjt:  RAVLLHPRSD-TDSQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFE--GSTLKLPCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFM

Query:  DNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGITT-YSRHMADVNKMVNKHCKKSWNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIY
         NLID  +DVA+LA + ++ NA+G+D+ AA LFN L  G       HMA V+K VN+HC + WNE CA+LKH YFQ+PWTIIS+ AAIFGF +L+ Q+IY
Subjt:  DNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGITT-YSRHMADVNKMVNKHCKKSWNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIY

Query:  QFCDYY
        QF DYY
Subjt:  QFCDYY

XP_022960454.1 UPF0481 protein At3g47200-like [Cucurbita moschata]1.7e-9148.77Show/hide
Query:  SIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKCAK
        SIY+IP F+ +  P A+EP +VSLGPY+HGK HL  ME+ KL LF  F   C L+VE +V  V T+L+EL  SY  + ++ KE                 
Subjt:  SIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKCAK

Query:  GIERLLKLMVVDGCFVIQFLDNCPRWLQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTVHLHFRDHKEPNSKYV------LAEEYLHILGMY
           + L+LM+VDGCF++ FL NCP  L N+  +IK+D+L+LENQLPM LL+KL SIA+++      V L  +D K+  S ++      + ++ LHIL MY
Subjt:  GIERLLKLMVVDGCFVIQFLDNCPRWLQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTVHLHFRDHKEPNSKYV------LAEEYLHILGMY

Query:  RAVLLHPRSD-TDSQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFE--GSTLKLPCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFM
        +  LLHP  D TD    +  SDP+ Q I  AT+LRE+GI FK S+T S TDV F+  G  L LP ++++ +T+S LLNVMAFEKLH++   KVTSF+  M
Subjt:  RAVLLHPRSD-TDSQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFE--GSTLKLPCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFM

Query:  DNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGITT-YSRHMADVNKMVNKHCKKSWNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIY
         NLID  +DVA+LA + ++ NA+G+D+ AA LFN L  G       HMA V+K VN+HC + WNE CA+LKH YFQ+PWTIIS+ AAIFGF +L+ Q+IY
Subjt:  DNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGITT-YSRHMADVNKMVNKHCKKSWNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIY

Query:  QFCDYY
        QF DYY
Subjt:  QFCDYY

XP_023513986.1 UPF0481 protein At3g47200-like isoform X1 [Cucurbita pepo subsp. pepo]1.0e-9148.66Show/hide
Query:  SIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKCAK
        SIY+IP F+ +  P A+EP +VSLGPY+HGK HL  ME+ KL LF  F   C L+VE +V+ V T+L+EL  SY  +    +E  E P            
Subjt:  SIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKCAK

Query:  GIERLLKLMVVDGCFVIQFLDNCPRWLQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTVHLHFRDHKEPNSKYV------LAEEYLHILGMY
           + L+LM+VDGCF++ FL NCP  L N+  +IK+D+L+LENQLPM LL+KL SIA ++      V L  +D ++  S ++      + ++ LHIL MY
Subjt:  GIERLLKLMVVDGCFVIQFLDNCPRWLQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTVHLHFRDHKEPNSKYV------LAEEYLHILGMY

Query:  RAVLLHPRSD-TDSQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFE--GSTLKLPCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFM
        +  LLHP  D TD    +D SDP+ Q I  AT+LRE+GI FK S+T S TDV F+  G  L LP ++++  T+S LLNVMAFEKLH++   +VTSF+  M
Subjt:  RAVLLHPRSD-TDSQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFE--GSTLKLPCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFM

Query:  DNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGITT-YSRHMADVNKMVNKHCKKSWNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIY
         NLID  +DVA+LA + ++ NA+G+D+ AA LFN L  G       HMA V+K VN+HC + WNE CA+LKH YFQ+PWTIIS+ AAIFGF +L+ Q+IY
Subjt:  DNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGITT-YSRHMADVNKMVNKHCKKSWNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIY

Query:  QFCDYYKVP
        QF DYY  P
Subjt:  QFCDYYKVP

XP_038875622.1 UPF0481 protein At3g47200-like [Benincasa hispida]1.3e-9448.64Show/hide
Query:  SIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKCAK
        SIY+IP F++++Q  AFEP LVSLGPYHHGK HL SME  K   FR+F++  GL +E +VE +   LE L G+Y                   + +K  +
Subjt:  SIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKCAK

Query:  GIERLLKLMVVDGCFVIQFLDNCPRWLQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTVHLHFRDHKEPNSKYVLAEEYLHILGMYRAVLLH
           + L++M+VDGCF++ F   CP  L  MR +IKRD+L+LENQLPM+LL +L      +    Q++          N + V+  E LHIL MYRA LL+
Subjt:  GIERLLKLMVVDGCFVIQFLDNCPRWLQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTVHLHFRDHKEPNSKYVLAEEYLHILGMYRAVLLH

Query:  P---RSDTDSQNFI--DQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSF--EGSTLKLPCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFMDN
        P   R D   +  +   QSDP+ Q I  ATQL ++GI FK S TK+ TDVSF  +   L+LP ++++  T++ LLNVMAFEKL++E   KVTSF+  M+N
Subjt:  P---RSDTDSQNFI--DQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSF--EGSTLKLPCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFMDN

Query:  LIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGIT-TYSRHMADVNKMVNKHCKKSWNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIYQF
        LID+++DVALLAS  I+ NALG+D +AADLF+LL KG       H+ DV+  VNKHC  SWN+WCASLKH YFQNPW IIS+FAA+FGFA+L+ Q+IYQ 
Subjt:  LIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGIT-TYSRHMADVNKMVNKHCKKSWNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIYQF

Query:  CDYYK
         DY++
Subjt:  CDYYK

TrEMBL top hitse value%identityAlignment
A0A6J1D5C0 UPF0481 protein At3g47200-like1.2e-8545.89Show/hide
Query:  ERLAVKDVSEYLTQFLSHRKQILKH--SIYEIPNFIKEVQPNA-FEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRG
        E   V  V E L Q L     +     SIY+IP+F+++VQP A FEP LVS GPYHHG+ HL  ME+ K   F+ F    GL VE +VERV ++LE+++G
Subjt:  ERLAVKDVSEYLTQFLSHRKQILKH--SIYEIPNFIKEVQPNA-FEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRG

Query:  SYSMIWDESKEGIERPLAHLMISDKCAKGIERLLKLMVVDGCFVIQ-FLDNCPRWLQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRI-TIQTVHLH
         Y  +  E K           + D  AK     L+LMV+DGCF+++  L +   WL NMR +I RD+L+LENQLPMKLL++L S+AN S +  ++++   
Subjt:  SYSMIWDESKEGIERPLAHLMISDKCAKGIERLLKLMVVDGCFVIQ-FLDNCPRWLQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRI-TIQTVHLH

Query:  FRDHKEPNSKYVLAEEYLHILGMYRAVLLHP------RSDTDSQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFE--GSTLKLPCVLLNYS
        F    +   +  L  EYLHIL MY   LL P      R    S+    + D + Q I  AT+L E+GI F+ S++KS TDV F+     LKLP ++++  
Subjt:  FRDHKEPNSKYVLAEEYLHILGMYRAVLLHP------RSDTDSQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFE--GSTLKLPCVLLNYS

Query:  TKSALLNVMAFEKLHIETHHKVTSFIAFMDNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGIT---TYSRHMADVNKMVNKHCKKSWNEWCAS
        T+S  LNVMAFEKLH E   KVT FI  M+NLID++QDVALLAS  II NALG+D+ AA+LF  LA+G         H+  V +MV +HCKK  ++WCAS
Subjt:  TKSALLNVMAFEKLHIETHHKVTSFIAFMDNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGIT---TYSRHMADVNKMVNKHCKKSWNEWCAS

Query:  LKHTYFQNPWTIISVFAAIFGFALLMQQSIYQFCDYYK
        LKH YFQ+PW I+S+ AA  GF +L+ Q++YQ CDYY+
Subjt:  LKHTYFQNPWTIISVFAAIFGFALLMQQSIYQFCDYYK

A0A6J1H6V9 UPF0481 protein At3g47200-like6.5e-8444.73Show/hide
Query:  KHSIYEIPNFIKEVQP----NAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMI
        KHSIY++P F+++        AF+P +VS GPYHHGK HL  ME  K   F   L T GL VE +V  V T+L++L  SY  + +E              
Subjt:  KHSIYEIPNFIKEVQP----NAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMI

Query:  SDKCAKGIERLLKLMVVDGCFVIQFLDNCPRWLQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTVHLHFRDHKEPNSKYV----------LA
             +   + LKLM+VDGCF+++   +CP  L NM+ +I+ + L+LENQLP+KLL KL SIA         + +   D + P  + +          L 
Subjt:  SDKCAKGIERLLKLMVVDGCFVIQFLDNCPRWLQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTVHLHFRDHKEPNSKYV----------LA

Query:  EEYLHILGMYRAVLLHPRSD-TDSQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFE--GSTLKLPCVLLNYSTKSALLNVMAFEKLHI---
        +E+LHIL +Y+A LL P  D T      + S  +FQ I  AT+L E+GI FKMS+TKS  DV F+     L LP ++++ +T+S  LNVMAFEKLH+   
Subjt:  EEYLHILGMYRAVLLHPRSD-TDSQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFE--GSTLKLPCVLLNYSTKSALLNVMAFEKLHI---

Query:  --ETHHKVTSFIAFMDNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGITT-YSRHMADVNKMVNKHCKKSWNEWCASLKHTYFQNPWTIISVF
          E    +TSF+  M NLID  +DVALL+SKG + NALG+DR AA+LF+ L KG+    +RHM  V+KM+N +CK+ WNE CA+LKH YFQNPWT+IS+ 
Subjt:  --ETHHKVTSFIAFMDNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGITT-YSRHMADVNKMVNKHCKKSWNEWCASLKHTYFQNPWTIISVF

Query:  AAIFGFALLMQQSIYQFCDYYKVPLQK
        AAIFGF +L+ Q+IYQ  DYYK   +K
Subjt:  AAIFGFALLMQQSIYQFCDYYKVPLQK

A0A6J1HB25 UPF0481 protein At3g47200-like8.4e-9248.77Show/hide
Query:  SIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKCAK
        SIY+IP F+ +  P A+EP +VSLGPY+HGK HL  ME+ KL LF  F   C L+VE +V  V T+L+EL  SY  + ++ KE                 
Subjt:  SIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKCAK

Query:  GIERLLKLMVVDGCFVIQFLDNCPRWLQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTVHLHFRDHKEPNSKYV------LAEEYLHILGMY
           + L+LM+VDGCF++ FL NCP  L N+  +IK+D+L+LENQLPM LL+KL SIA+++      V L  +D K+  S ++      + ++ LHIL MY
Subjt:  GIERLLKLMVVDGCFVIQFLDNCPRWLQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTVHLHFRDHKEPNSKYV------LAEEYLHILGMY

Query:  RAVLLHPRSD-TDSQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFE--GSTLKLPCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFM
        +  LLHP  D TD    +  SDP+ Q I  AT+LRE+GI FK S+T S TDV F+  G  L LP ++++ +T+S LLNVMAFEKLH++   KVTSF+  M
Subjt:  RAVLLHPRSD-TDSQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFE--GSTLKLPCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFM

Query:  DNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGITT-YSRHMADVNKMVNKHCKKSWNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIY
         NLID  +DVA+LA + ++ NA+G+D+ AA LFN L  G       HMA V+K VN+HC + WNE CA+LKH YFQ+PWTIIS+ AAIFGF +L+ Q+IY
Subjt:  DNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGITT-YSRHMADVNKMVNKHCKKSWNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIY

Query:  QFCDYY
        QF DYY
Subjt:  QFCDYY

A0A6J1KVQ6 UPF0481 protein At3g47200-like isoform X25.7e-8847.19Show/hide
Query:  SIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKCAK
        SIY+IP F+ +  P A+EP +VSLGPY+HGK HL  ME+ KL LF  F   C L+VE +V  V T+L+EL  SY  + +E                   +
Subjt:  SIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKCAK

Query:  GIERLLKLMVVDGCFVIQFLDNCPRWLQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTVHLHFRDHKEPNSKYV------LAEEYLHILGMY
           + L+LM+VDGCF++ FL +CP  L N+  +IK+D+L+LENQLPM LL+KL SIA ++      V L  +D K+   K++      + ++ LHIL MY
Subjt:  GIERLLKLMVVDGCFVIQFLDNCPRWLQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTVHLHFRDHKEPNSKYV------LAEEYLHILGMY

Query:  RAVLLHPRSD-TDSQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFE--GSTLKLPCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFM
        +  LL+P  D  D    +D SDP+ Q I  AT+L E+GI FK S+T+S  DV F+     L LP ++++  T+S +LNVMAFEKLH++   KVTSF+  M
Subjt:  RAVLLHPRSD-TDSQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFE--GSTLKLPCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFM

Query:  DNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGITT-YSRHMADVNKMVNKHCKKSWNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIY
         NLID  +DVA+LA + I+ NA+G+D+ AA LF+ L  G       HMA V+KMVN HC + WNE CA+LKH YFQ+PWTIIS+ AAIFGF +L+ Q+IY
Subjt:  DNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGITT-YSRHMADVNKMVNKHCKKSWNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIY

Query:  QFCDYYKVP
        QF DYY  P
Subjt:  QFCDYYKVP

A0A6J1KYV8 UPF0481 protein At3g47200-like isoform X15.1e-8947.19Show/hide
Query:  SIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKCAK
        SIY+IP F+ +  P A+EP +VSLGPY+HGK HL  ME+ KL LF  F   C L+VE +V  V T+L+EL  SY  + +E                   +
Subjt:  SIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKCAK

Query:  GIERLLKLMVVDGCFVIQFLDNCPRWLQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTVHLHFRDHKEPNSKYV------LAEEYLHILGMY
           + L+LM+VDGCF++ FL +CP  L N+  +IK+D+L+LENQLPM LL+KL SIA ++      V L  +D K+   K++      + ++ LHIL MY
Subjt:  GIERLLKLMVVDGCFVIQFLDNCPRWLQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTVHLHFRDHKEPNSKYV------LAEEYLHILGMY

Query:  RAVLLHPRSD-TDSQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFE--GSTLKLPCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFM
        +  LL+P  D  D    +D SDP+ Q I  AT+L E+GI FK S+T+S  DV F+     L LP ++++  T+S +LNVMAFEKLH++   KVTSF+  M
Subjt:  RAVLLHPRSD-TDSQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFE--GSTLKLPCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFM

Query:  DNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGITT-YSRHMADVNKMVNKHCKKSWNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIY
         NLID  +DVA+LA + I+ NA+G+D+ AA LF+ L  G       HMA V+KMVN HC + WNE CA+LKH YFQ+PWTIIS+ AAIFGF +L+ Q+IY
Subjt:  DNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGITT-YSRHMADVNKMVNKHCKKSWNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIY

Query:  QFCDYYKVP
        QF DYY  P
Subjt:  QFCDYYKVP

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026451.0e-1421.44Show/hide
Query:  SIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTC-GLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKCA
        SI+ +P  +    P+++ P  VS+GPYH  KP L  ME  KL + R+  +         +VE++ ++  ++R  Y                H  I     
Subjt:  SIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTC-GLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKCA

Query:  KGIERLLKLMVVDGCFVIQFLDNCP-RWLQNM-----RSEIKRDILMLENQLPMKLL---------------DKLCSIANKSRITIQTVHLHFRDHKEPN
           E LL +M VD  F+I+FL     R ++ +      +EI RDI+M+ENQ+P+ +L               D L S+       +  + + F D +   
Subjt:  KGIERLLKLMVVDGCFVIQFLDNCP-RWLQNM-----RSEIKRDILMLENQLPMKLL---------------DKLCSIANKSRITIQTVHLHFRDHKEPN

Query:  SKYVLAEEYLHILGMYRAVLL-------------HPRSDTDSQN----FIDQSDPDFQ------------------------------------------
        +++   +E  HIL     +++               R+D +  N    F+D+    F+                                          
Subjt:  SKYVLAEEYLHILGMYRAVLL-------------HPRSDTDSQN----FIDQSDPDFQ------------------------------------------

Query:  -----------------------TIRHATQLRESGIDFKMSETKSFTDVSFEGST--LKLPCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFMDNL
                               TI   + L ++G+ FK +   + + V+F+ ++    LP + L+ +T++ L N++A+E  +       T +   ++ +
Subjt:  -----------------------TIRHATQLRESGIDFKMSETKSFTDVSFEGST--LKLPCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFMDNL

Query:  IDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGITTYSRHMADVN-KMVNKHCKKSWNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIYQF
        ID  +DV LL  +G++ + L  D+ AA+++N ++K +        D   + VN++    W      L   Y    W I++  AA+    LLM  S+  F
Subjt:  IDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGITTYSRHMADVN-KMVNKHCKKSWNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIYQF

Q9SD53 UPF0481 protein At3g472009.1e-3527.71Show/hide
Query:  IYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTC---GLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKC
        I+ +P     + P A++P +VS+GPYH+G+ HL  ++  K  L + FLD      +E   +V+ V+ L +++R SYS   +E K G +            
Subjt:  IYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTC---GLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKC

Query:  AKGIERLLKLMVVDGCFVIQF---------LDNCPRW-LQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTVHLHFRDHKEP----NSKYVLA
              L+ +MV+DGCF++           L   P + +  + S I+ D+L+LENQ+P  +L  L  + +K  ++     + F   K P     S +   
Subjt:  AKGIERLLKLMVVDGCFVIQF---------LDNCPRW-LQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTVHLHFRDHKEP----NSKYVLA

Query:  EEY--LHILGMYRAVLLHPRSDTD---------------SQNFIDQSDPDFQTIRHATQLRESGIDFKMSETK--SFTDVSFEGSTLKLPCVLLNYSTKS
          Y   H+L + R   L   S++D               S N           I  A +LR  GI F++  +K  S  +V  + + L++P +  +    S
Subjt:  EEY--LHILGMYRAVLLHPRSDTD---------------SQNFIDQSDPDFQTIRHATQLRESGIDFKMSETK--SFTDVSFEGSTLKLPCVLLNYSTKS

Query:  ALLNVMAFEKLHIETHHKVTSFIAFMDNLIDMNQDVALLAS-KGIIENALGDDRAAADLFNLLAKGIT--TYSRHMADVNKMVNKHCKKSWNEWCASLKH
          LN +AFE+ + ++ +++T++I FM  L++  +DV  L + K IIEN  G +   ++ F  ++K +     + ++ +V K VN++ KK +N   A  +H
Subjt:  ALLNVMAFEKLHIETHHKVTSFIAFMDNLIDMNQDVALLAS-KGIIENALGDDRAAADLFNLLAKGIT--TYSRHMADVNKMVNKHCKKSWNEWCASLKH

Query:  TYFQNPWTIISVFAAIFGFALLMQQSIYQFCDY
        T+F++PWT +S  A +F   L M QS      Y
Subjt:  TYFQNPWTIISVFAAIFGFALLMQQSIYQFCDY

Arabidopsis top hitse value%identityAlignment
AT3G50120.1 Plant protein of unknown function (DUF247)8.4e-5231.33Show/hide
Query:  KHSIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKC
        K  IY +P +++E    ++ P  VSLGPYHHGK  L SM+  K     + L      ++  ++ +  L E+ R  Y           E PL         
Subjt:  KHSIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKC

Query:  AKGIERLLKLMVVDGCFVIQ-------------FLDNCPRW-LQNMRSEIKRDILMLENQLPMKLLDKLCSI----ANKSRITIQTVHLHFRDHKEPNSK
        +      ++++V+DGCFV++             +  N P + ++     I+RD++MLENQLP+ +L++L  +     N++ +  Q + + F D   P  +
Subjt:  AKGIERLLKLMVVDGCFVIQ-------------FLDNCPRW-LQNMRSEIKRDILMLENQLPMKLLDKLCSI----ANKSRITIQTVHLHFRDHKEPNSK

Query:  YV-----------LAEE----------YLHILGMYRAVLLHPRSDTD--------SQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFEGST
         +           LA +           LH L ++R  LL      +        S+N         Q I   T+L+E+GI F+  +T  F D+ F+   
Subjt:  YV-----------LAEE----------YLHILGMYRAVLLHPRSDTD--------SQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFEGST

Query:  LKLPCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFMDNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGIT--TYSRHMADVNKMVNKHC
        L++P +L++  TKS  LN++AFE+ HI++ + +TS+I FMDNLID ++DV+ L   GIIE+ LG D   ADLFN L + +   T   +++ ++  VN++ 
Subjt:  LKLPCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFMDNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGIT--TYSRHMADVNKMVNKHC

Query:  KKSWNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIYQFCDYYKVP
           WN W A+LKH YF NPW I+S  AA+    L   QS Y    YYK P
Subjt:  KKSWNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIYQFCDYYKVP

AT3G50140.1 Plant protein of unknown function (DUF247)5.6e-4829.53Show/hide
Query:  KHSIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKC
        K  IY +P  +K+   N++ P  VSLGPYHHG  HL  M+  K       +      +E  ++ +  L E  R  Y           E P+   + S+K 
Subjt:  KHSIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKC

Query:  AKGIERLLKLMVVDGCFVIQFL-------------DNCPRW-LQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTV-----------------
                +++V+DGCFV+                 N P + ++     I+RD+LMLENQLP+ +L++L  +   ++     V                 
Subjt:  AKGIERLLKLMVVDGCFVIQFL-------------DNCPRW-LQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTV-----------------

Query:  HLHFRDHKEPNSKYV-----LAEEYLHILGMYRAVLLHPRSDTD--------SQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFEGSTLKL
             + +E N+K+        +E LH L ++R  LL P    D        S+  +       Q +   T+LRE+GI FK  ++  F D+ F+   L++
Subjt:  HLHFRDHKEPNSKYV-----LAEEYLHILGMYRAVLLHPRSDTD--------SQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFEGSTLKL

Query:  PCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFMDNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGIT--TYSRHMADVNKMVNKHCKKS
        P +L++  TKS   N++A+E+ HI++ + +TS+I FMDNLID  +D+  L    IIE+ LG+D   AD+FN L + +     + ++++++  V+++  + 
Subjt:  PCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFMDNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGIT--TYSRHMADVNKMVNKHCKKS

Query:  WNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIYQFCDYYKVP
        WN   A+LKH YF NPW   S FAA+    L + QS +    Y+K P
Subjt:  WNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIYQFCDYYKVP

AT3G50150.1 Plant protein of unknown function (DUF247)2.3e-4930.52Show/hide
Query:  KHSIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKC
        K  IY +P +++E    ++ P  VS+GPYHHGK HL  ME  K       +      +E  ++ +  L EE R  Y    D                   
Subjt:  KHSIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKC

Query:  AKGIERLLKLMVVDGCFVIQFLDNCPRWLQ--------------NMRSEIKRDILMLENQLPMKLLDKLCSI----ANKSRITIQTVHLHFRDHKEPNSK
         K      +++V+DGCFV++      +  Q               +   I+RD++MLENQLP+ +LD+L  +     N++ I +  V + F     P S+
Subjt:  AKGIERLLKLMVVDGCFVIQFLDNCPRWLQ--------------NMRSEIKRDILMLENQLPMKLLDKLCSI----ANKSRITIQTVHLHFRDHKEPNSK

Query:  YVLAEEY----------------LHILGMYRAVLLHPRSDTDSQN--FIDQS--DPDFQTIRHATQLRESGIDFKMSETKSFTDVSFEGSTLKLPCVLLN
         +   E                 LH L ++   L+   S+T +Q   + D S  +   Q I   T+LR +G++F   ET    D+ F+   LK+P +L++
Subjt:  YVLAEEY----------------LHILGMYRAVLLHPRSDTDSQN--FIDQS--DPDFQTIRHATQLRESGIDFKMSETKSFTDVSFEGSTLKLPCVLLN

Query:  YSTKSALLNVMAFEKLHIETHHKVTSFIAFMDNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGITTYSR--HMADVNKMVNKHCKKSWNEWCA
          TKS   N++AFE+ H ++ + +TS+I FMDNLI+ +QDV+ L   GIIE+ LG D   ADLFN L K +    +  +++ +++ VN++  + WN   A
Subjt:  YSTKSALLNVMAFEKLHIETHHKVTSFIAFMDNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGITTYSR--HMADVNKMVNKHCKKSWNEWCA

Query:  SLKHTYFQNPWTIISVFAAIFGFALLMQQSIYQFCDYYK
        +L+  YF NPW   S  AA+    L   QS +    YYK
Subjt:  SLKHTYFQNPWTIISVFAAIFGFALLMQQSIYQFCDYYK

AT3G50160.1 Plant protein of unknown function (DUF247)4.2e-5131.65Show/hide
Query:  IYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKCAKG
        IY +P +++E    ++ P +VS+GPYHHG  HL  ME  K       +     ++E  ++ +  L E+ R  Y     +    + R              
Subjt:  IYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKCAKG

Query:  IERLLKLMVVDGCFVIQ-------------FLDNCPRW-LQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTVHLHFRDHKEP--NSKYVLAE
            ++++V+DG F+I+             +  N P + ++ +   I+RD++MLENQLP  +L  L  +     +    V L F+   +P   ++ VL E
Subjt:  IERLLKLMVVDGCFVIQ-------------FLDNCPRW-LQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTVHLHFRDHKEP--NSKYVLAE

Query:  E-YLHILGMYRAVLLHPRSDTDSQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFEGSTLKLPCVLLNYSTKSALLNVMAFEKLHIETHHKV
        E  LH L + R  LL     +D    +    P  Q I   T+LR +G++F   ET  F D+ F+   LK+P +L++  TKS  LN++AFE+ HI++  K+
Subjt:  E-YLHILGMYRAVLLHPRSDTDSQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFEGSTLKLPCVLLNYSTKSALLNVMAFEKLHIETHHKV

Query:  TSFIAFMDNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGI--TTYSRHMADVNKMVNKHCKKSWNEWCASLKHTYFQNPWTIISVFAAIFGFA
        TS+I FMDNLI+ ++DV+ L   GIIEN LG D   +DLFN L K +       +++ +   VN + ++ WN   A+L+H YF NPW   S  AA+    
Subjt:  TSFIAFMDNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGI--TTYSRHMADVNKMVNKHCKKSWNEWCASLKHTYFQNPWTIISVFAAIFGFA

Query:  LLMQQSIYQFCDYYKVP
            QS +    Y+K P
Subjt:  LLMQQSIYQFCDYYKVP

AT3G50170.1 Plant protein of unknown function (DUF247)6.7e-4929.82Show/hide
Query:  KHSIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKC
        K  IY +P++++E    ++ P  VSLGPYHHGK  L  ME  K     + L      +E     +  L E+ R  Y           E P+         
Subjt:  KHSIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIWDESKEGIERPLAHLMISDKC

Query:  AKGIERLLKLMVVDGCFVIQ-------------FLDNCPRW-LQNMRSEIKRDILMLENQLPMKLLDKLCSI----ANKSRITIQTVHLHFRDHKEPNSK
        +       +++V+DGCFV++             +  N P + ++ +   I+RD++MLENQLP+ +LD+L  +     N++ I +  V + F D   P  +
Subjt:  AKGIERLLKLMVVDGCFVIQ-------------FLDNCPRW-LQNMRSEIKRDILMLENQLPMKLLDKLCSI----ANKSRITIQTVHLHFRDHKEPNSK

Query:  YVLAEEY-------------------LHILGMYRAVLLHPRSDTDSQNFIDQ--------SDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFEGSTLK
         +   +                    LH L ++R  LL      ++++ + +             Q +   T+LRE+G+ F+  +T  F D+ F+   L+
Subjt:  YVLAEEY-------------------LHILGMYRAVLLHPRSDTDSQNFIDQ--------SDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFEGSTLK

Query:  LPCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFMDNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGIT--TYSRHMADVNKMVNKHCKK
        +P +L++  TKS   N++AFE+ HIE+ + +TS+I FMDNLI+ ++DV+ L   GIIE+ LG D   ADLFN L + +       H++ ++  VN++  +
Subjt:  LPCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFMDNLIDMNQDVALLASKGIIENALGDDRAAADLFNLLAKGIT--TYSRHMADVNKMVNKHCKK

Query:  SWNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIYQFCDYYK
         WN   A+L H YF NPW   S  AA+    L + QS Y    YYK
Subjt:  SWNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIYQFCDYYK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATCCCAGAAATTGGTGGAGCGATTGGCGGTCAAAGATGTAAGTGAATATCTCACTCAATTCCTTTCACATAGAAAGCAAATTTTAAAGCATTCAATTTACGAAAT
ACCAAACTTCATCAAAGAAGTTCAACCGAATGCTTTCGAGCCGACGTTGGTGTCGCTCGGGCCATACCACCATGGAAAGCCACACCTCGGTTCGATGGAGATGGCAAAAT
TGGACTTGTTTCGCCAATTTCTCGACACTTGTGGGCTCGAGGTTGAGTTCGTTGTCGAACGTGTGATGACATTGTTGGAAGAACTGCGAGGATCGTACAGCATGATTTGG
GATGAGAGCAAGGAAGGAATAGAGAGACCTTTGGCGCACTTGATGATTTCGGATAAGTGCGCGAAAGGAATAGAGAGATTGTTGAAGCTCATGGTTGTGGATGGTTGTTT
TGTTATTCAATTCTTGGACAATTGTCCTCGATGGCTTCAAAATATGAGATCTGAAATCAAGCGCGACATCCTGATGCTCGAGAATCAGCTGCCCATGAAGCTTCTTGACA
AGCTATGTTCCATAGCTAACAAGAGCCGAATAACCATTCAGACAGTCCATTTGCATTTTAGAGATCATAAGGAGCCCAACTCAAAATATGTGCTAGCGGAAGAATACTTG
CACATTTTAGGCATGTACAGGGCGGTATTATTGCATCCAAGGAGTGACACTGATAGTCAAAACTTCATCGACCAATCTGACCCGGATTTTCAAACCATACGACACGCAAC
ACAGCTTCGCGAATCTGGGATCGATTTCAAAATGAGCGAAACCAAGAGCTTCACTGACGTGTCATTCGAAGGCAGCACGTTGAAGCTCCCGTGCGTGTTATTGAACTACA
GCACAAAGTCGGCTTTATTAAACGTGATGGCATTTGAGAAACTCCACATTGAAACTCACCACAAAGTCACATCGTTCATAGCCTTCATGGACAATCTCATAGACATGAAC
CAAGATGTCGCGTTGTTAGCCTCCAAGGGAATCATTGAGAATGCGCTCGGTGACGACCGAGCCGCAGCTGACTTGTTCAATCTACTGGCTAAAGGAATTACTACATATAG
CAGGCACATGGCTGACGTAAACAAGATGGTGAATAAACATTGCAAGAAGTCATGGAATGAGTGGTGTGCAAGTCTCAAACATACCTATTTTCAAAACCCATGGACAATCA
TCTCTGTCTTTGCTGCTATTTTTGGCTTTGCCCTCCTAATGCAACAATCCATCTACCAATTCTGTGATTACTACAAGGTTCCGTTGCAAAAATAA
mRNA sequenceShow/hide mRNA sequence
TGTGGAGAGACTAAACAATAATGTCATCCCAGAAATTGGTGGAGCGATTGGCGGTCAAAGATGTAAGTGAATATCTCACTCAATTCCTTTCACATAGAAAGCAAATTTTA
AAGCATTCAATTTACGAAATACCAAACTTCATCAAAGAAGTTCAACCGAATGCTTTCGAGCCGACGTTGGTGTCGCTCGGGCCATACCACCATGGAAAGCCACACCTCGG
TTCGATGGAGATGGCAAAATTGGACTTGTTTCGCCAATTTCTCGACACTTGTGGGCTCGAGGTTGAGTTCGTTGTCGAACGTGTGATGACATTGTTGGAAGAACTGCGAG
GATCGTACAGCATGATTTGGGATGAGAGCAAGGAAGGAATAGAGAGACCTTTGGCGCACTTGATGATTTCGGATAAGTGCGCGAAAGGAATAGAGAGATTGTTGAAGCTC
ATGGTTGTGGATGGTTGTTTTGTTATTCAATTCTTGGACAATTGTCCTCGATGGCTTCAAAATATGAGATCTGAAATCAAGCGCGACATCCTGATGCTCGAGAATCAGCT
GCCCATGAAGCTTCTTGACAAGCTATGTTCCATAGCTAACAAGAGCCGAATAACCATTCAGACAGTCCATTTGCATTTTAGAGATCATAAGGAGCCCAACTCAAAATATG
TGCTAGCGGAAGAATACTTGCACATTTTAGGCATGTACAGGGCGGTATTATTGCATCCAAGGAGTGACACTGATAGTCAAAACTTCATCGACCAATCTGACCCGGATTTT
CAAACCATACGACACGCAACACAGCTTCGCGAATCTGGGATCGATTTCAAAATGAGCGAAACCAAGAGCTTCACTGACGTGTCATTCGAAGGCAGCACGTTGAAGCTCCC
GTGCGTGTTATTGAACTACAGCACAAAGTCGGCTTTATTAAACGTGATGGCATTTGAGAAACTCCACATTGAAACTCACCACAAAGTCACATCGTTCATAGCCTTCATGG
ACAATCTCATAGACATGAACCAAGATGTCGCGTTGTTAGCCTCCAAGGGAATCATTGAGAATGCGCTCGGTGACGACCGAGCCGCAGCTGACTTGTTCAATCTACTGGCT
AAAGGAATTACTACATATAGCAGGCACATGGCTGACGTAAACAAGATGGTGAATAAACATTGCAAGAAGTCATGGAATGAGTGGTGTGCAAGTCTCAAACATACCTATTT
TCAAAACCCATGGACAATCATCTCTGTCTTTGCTGCTATTTTTGGCTTTGCCCTCCTAATGCAACAATCCATCTACCAATTCTGTGATTACTACAAGGTTCCGTTGCAAA
AATAATTAATAGCGAAAAAGAATAATTCGTGTATTTTATAAATGTTGATGCGAAATCTTCGATTATGATGTTCTAAG
Protein sequenceShow/hide protein sequence
MSSQKLVERLAVKDVSEYLTQFLSHRKQILKHSIYEIPNFIKEVQPNAFEPTLVSLGPYHHGKPHLGSMEMAKLDLFRQFLDTCGLEVEFVVERVMTLLEELRGSYSMIW
DESKEGIERPLAHLMISDKCAKGIERLLKLMVVDGCFVIQFLDNCPRWLQNMRSEIKRDILMLENQLPMKLLDKLCSIANKSRITIQTVHLHFRDHKEPNSKYVLAEEYL
HILGMYRAVLLHPRSDTDSQNFIDQSDPDFQTIRHATQLRESGIDFKMSETKSFTDVSFEGSTLKLPCVLLNYSTKSALLNVMAFEKLHIETHHKVTSFIAFMDNLIDMN
QDVALLASKGIIENALGDDRAAADLFNLLAKGITTYSRHMADVNKMVNKHCKKSWNEWCASLKHTYFQNPWTIISVFAAIFGFALLMQQSIYQFCDYYKVPLQK