; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg034000 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg034000
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationscaffold13:36603279..36604532
RNA-Seq ExpressionSpg034000
SyntenySpg034000
Gene Ontology termsNA
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022131634.1 UPF0481 protein At3g47200-like [Momordica charantia]3.4e-9248.91Show/hide
Query:  ECSIYRVPRRLFKINPIAYTPQVISIGPFHHGRE-HLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKMMLVDGCFLL
        ECSIYRVP+RL  +N  AYTPQVISIGPFHH  + +L+  ++HKLQALD +L R+ M+VE ++ I +NWE  AR CY EPI MN+D FV M+L+DGCF++
Subjt:  ECSIYRVPRRLFKINPIAYTPQVISIGPFHHGRE-HLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKMMLVDGCFLL

Query:  EFMILLHYNFSPPPQILTLYGPFYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIEGISFMGLVQFFLECACRISHHSQIVC---KRNAYHLV
         F+IL + N+            FYE M   +  D+ MLENQLP FVL+ L++ +   KD  I+  S + L++ F     R  ++ +I C     N  HLV
Subjt:  EFMILLHYNFSPPPQILTLYGPFYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIEGISFMGLVQFFLECACRISHHSQIVC---KRNAYHLV

Query:  HLLSDYFRFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEHYHSSISRSRYIINYVS
         LLS Y  FL   D ++  + ++    + P +TEL EAGVTIKK  +    MD+SFKNGVLEI  +DI+D FE  +RNLMAFEHY ++ +  RY I Y  
Subjt:  HLLSDYFRFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEHYHSSISRSRYIINYVS

Query:  FLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWAAISFVAATFLILLTFI
        FLD +ISTEKD  LLV A I+ N IGGS++EVS+LFNDL K V+I     Y  ++ K L  HCK    R  A+L+RDYFN+PWA IS VAAT++I+LT +
Subjt:  FLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWAAISFVAATFLILLTFI

Query:  QTMYSHLSYLK
        QT+++ +S  K
Subjt:  QTMYSHLSYLK

XP_022132066.1 UPF0481 protein At3g47200-like [Momordica charantia]6.1e-9449.15Show/hide
Query:  ECSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKMMLVDGCFLLE
        ECSIYRV +RL  IN +AYTPQ ISIGPFHHG++  MAME+ KL+ LD +LRR+ M +ED   I + WE  AR+CYAE I+M  D+FVKMMLVDG FL+E
Subjt:  ECSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKMMLVDGCFLLE

Query:  FMILLHYNFSPPPQILTLYGPFYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIEGISFMGLVQFFLECACR-----ISHHSQIVCKRNAYHL
        F I +HY ++   Q    Y   ++ + + +  D+I+LENQLP F+LE           C ++  S       F    CR         S  +  +   HL
Subjt:  FMILLHYNFSPPPQILTLYGPFYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIEGISFMGLVQFFLECACR-----ISHHSQIVCKRNAYHL

Query:  VHLLSDYFRFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPW-MDVSFKNGVLEISDIDINDQFEIRLRNLMAFEHYHSSISRSRYIINY
        V  LS Y+   +     +  +  ++ S  PPT TELWEAGV  +K T++K   MD+ FK+GVL I  ++I+D FE  +RNL+A+EHYH      R +I Y
Subjt:  VHLLSDYFRFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPW-MDVSFKNGVLEISDIDINDQFEIRLRNLMAFEHYHSSISRSRYIINY

Query:  VSFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWAAISFVAATFLILLT
        V FLD LISTE+D SLLV A I+TN IGG+NE+VSKLFNDLCK + IS    YY +++  L K+C+T  +R MASLRRDYFNTPWA ISF+AATFL+LLT
Subjt:  VSFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWAAISFVAATFLILLT

Query:  FIQTMYSHLSYLK
         +Q +YS +SY K
Subjt:  FIQTMYSHLSYLK

XP_022158989.1 UPF0481 protein At3g47200-like isoform X1 [Momordica charantia]5.4e-9849.88Show/hide
Query:  MRRELPLNTTECSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKM
        M +ELP    EC+I+RVPRRL K N  AY PQ+ISIGPFHHGR+ LM ME+HKL+ LD +LRR N  +E  + I R+WE  AR CYAEPINM+ D FVKM
Subjt:  MRRELPLNTTECSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKM

Query:  MLVDGCFLLEFMILLHYNFSPPPQILTLYGP-FYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIE-GISFMGLVQFFLECACRIS------H
        MLVDGCF++E M+++    S   +  T + P  +  M   L  D+IMLENQLP FVL+ LF+Q       ++E G+SF+ L   F      I        
Subjt:  MLVDGCFLLEFMILLHYNFSPPPQILTLYGP-FYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIE-GISFMGLVQFFLECACRIS------H

Query:  HSQIVCKRNAYHLVHLLSDYF--RFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEH
        H  ++      HLV  LS Y+     S      +    +K    PPT+TELWEAG+  KK  + K  MD+SFK+ VL+I  ++I D FE  +RNLMAFE 
Subjt:  HSQIVCKRNAYHLVHLLSDYF--RFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEH

Query:  YHSSISRSRYIINYVSFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWA
        YH+     +Y I Y  FL+GLIS E+D SLLV A I+TNCIGG+N+EVS LFNDLCK V +      + ++ +AL +HC    N+ MASLRRDYFNTPWA
Subjt:  YHSSISRSRYIINYVSFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWA

Query:  AISFVAATFLILLTFIQTMYSHLSYLK
         ISFVAA FLILLTF+QT++S +S  K
Subjt:  AISFVAATFLILLTFIQTMYSHLSYLK

XP_022158990.1 UPF0481 protein At3g47200-like isoform X2 [Momordica charantia]5.4e-9849.88Show/hide
Query:  MRRELPLNTTECSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKM
        M +ELP    EC+I+RVPRRL K N  AY PQ+ISIGPFHHGR+ LM ME+HKL+ LD +LRR N  +E  + I R+WE  AR CYAEPINM+ D FVKM
Subjt:  MRRELPLNTTECSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKM

Query:  MLVDGCFLLEFMILLHYNFSPPPQILTLYGP-FYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIE-GISFMGLVQFFLECACRIS------H
        MLVDGCF++E M+++    S   +  T + P  +  M   L  D+IMLENQLP FVL+ LF+Q       ++E G+SF+ L   F      I        
Subjt:  MLVDGCFLLEFMILLHYNFSPPPQILTLYGP-FYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIE-GISFMGLVQFFLECACRIS------H

Query:  HSQIVCKRNAYHLVHLLSDYF--RFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEH
        H  ++      HLV  LS Y+     S      +    +K    PPT+TELWEAG+  KK  + K  MD+SFK+ VL+I  ++I D FE  +RNLMAFE 
Subjt:  HSQIVCKRNAYHLVHLLSDYF--RFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEH

Query:  YHSSISRSRYIINYVSFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWA
        YH+     +Y I Y  FL+GLIS E+D SLLV A I+TNCIGG+N+EVS LFNDLCK V +      + ++ +AL +HC    N+ MASLRRDYFNTPWA
Subjt:  YHSSISRSRYIINYVSFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWA

Query:  AISFVAATFLILLTFIQTMYSHLSYLK
         ISFVAA FLILLTF+QT++S +S  K
Subjt:  AISFVAATFLILLTFIQTMYSHLSYLK

XP_022158992.1 UPF0481 protein At3g47200-like isoform X3 [Momordica charantia]5.4e-9849.88Show/hide
Query:  MRRELPLNTTECSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKM
        M +ELP    EC+I+RVPRRL K N  AY PQ+ISIGPFHHGR+ LM ME+HKL+ LD +LRR N  +E  + I R+WE  AR CYAEPINM+ D FVKM
Subjt:  MRRELPLNTTECSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKM

Query:  MLVDGCFLLEFMILLHYNFSPPPQILTLYGP-FYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIE-GISFMGLVQFFLECACRIS------H
        MLVDGCF++E M+++    S   +  T + P  +  M   L  D+IMLENQLP FVL+ LF+Q       ++E G+SF+ L   F      I        
Subjt:  MLVDGCFLLEFMILLHYNFSPPPQILTLYGP-FYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIE-GISFMGLVQFFLECACRIS------H

Query:  HSQIVCKRNAYHLVHLLSDYF--RFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEH
        H  ++      HLV  LS Y+     S      +    +K    PPT+TELWEAG+  KK  + K  MD+SFK+ VL+I  ++I D FE  +RNLMAFE 
Subjt:  HSQIVCKRNAYHLVHLLSDYF--RFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEH

Query:  YHSSISRSRYIINYVSFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWA
        YH+     +Y I Y  FL+GLIS E+D SLLV A I+TNCIGG+N+EVS LFNDLCK V +      + ++ +AL +HC    N+ MASLRRDYFNTPWA
Subjt:  YHSSISRSRYIINYVSFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWA

Query:  AISFVAATFLILLTFIQTMYSHLSYLK
         ISFVAA FLILLTF+QT++S +S  K
Subjt:  AISFVAATFLILLTFIQTMYSHLSYLK

TrEMBL top hitse value%identityAlignment
A0A6J1BQT6 UPF0481 protein At3g47200-like1.6e-9248.91Show/hide
Query:  ECSIYRVPRRLFKINPIAYTPQVISIGPFHHGRE-HLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKMMLVDGCFLL
        ECSIYRVP+RL  +N  AYTPQVISIGPFHH  + +L+  ++HKLQALD +L R+ M+VE ++ I +NWE  AR CY EPI MN+D FV M+L+DGCF++
Subjt:  ECSIYRVPRRLFKINPIAYTPQVISIGPFHHGRE-HLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKMMLVDGCFLL

Query:  EFMILLHYNFSPPPQILTLYGPFYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIEGISFMGLVQFFLECACRISHHSQIVC---KRNAYHLV
         F+IL + N+            FYE M   +  D+ MLENQLP FVL+ L++ +   KD  I+  S + L++ F     R  ++ +I C     N  HLV
Subjt:  EFMILLHYNFSPPPQILTLYGPFYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIEGISFMGLVQFFLECACRISHHSQIVC---KRNAYHLV

Query:  HLLSDYFRFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEHYHSSISRSRYIINYVS
         LLS Y  FL   D ++  + ++    + P +TEL EAGVTIKK  +    MD+SFKNGVLEI  +DI+D FE  +RNLMAFEHY ++ +  RY I Y  
Subjt:  HLLSDYFRFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEHYHSSISRSRYIINYVS

Query:  FLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWAAISFVAATFLILLTFI
        FLD +ISTEKD  LLV A I+ N IGGS++EVS+LFNDL K V+I     Y  ++ K L  HCK    R  A+L+RDYFN+PWA IS VAAT++I+LT +
Subjt:  FLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWAAISFVAATFLILLTFI

Query:  QTMYSHLSYLK
        QT+++ +S  K
Subjt:  QTMYSHLSYLK

A0A6J1BR71 UPF0481 protein At3g47200-like3.0e-9449.15Show/hide
Query:  ECSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKMMLVDGCFLLE
        ECSIYRV +RL  IN +AYTPQ ISIGPFHHG++  MAME+ KL+ LD +LRR+ M +ED   I + WE  AR+CYAE I+M  D+FVKMMLVDG FL+E
Subjt:  ECSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKMMLVDGCFLLE

Query:  FMILLHYNFSPPPQILTLYGPFYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIEGISFMGLVQFFLECACR-----ISHHSQIVCKRNAYHL
        F I +HY ++   Q    Y   ++ + + +  D+I+LENQLP F+LE           C ++  S       F    CR         S  +  +   HL
Subjt:  FMILLHYNFSPPPQILTLYGPFYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIEGISFMGLVQFFLECACR-----ISHHSQIVCKRNAYHL

Query:  VHLLSDYFRFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPW-MDVSFKNGVLEISDIDINDQFEIRLRNLMAFEHYHSSISRSRYIINY
        V  LS Y+   +     +  +  ++ S  PPT TELWEAGV  +K T++K   MD+ FK+GVL I  ++I+D FE  +RNL+A+EHYH      R +I Y
Subjt:  VHLLSDYFRFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPW-MDVSFKNGVLEISDIDINDQFEIRLRNLMAFEHYHSSISRSRYIINY

Query:  VSFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWAAISFVAATFLILLT
        V FLD LISTE+D SLLV A I+TN IGG+NE+VSKLFNDLCK + IS    YY +++  L K+C+T  +R MASLRRDYFNTPWA ISF+AATFL+LLT
Subjt:  VSFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWAAISFVAATFLILLT

Query:  FIQTMYSHLSYLK
         +Q +YS +SY K
Subjt:  FIQTMYSHLSYLK

A0A6J1DXD6 UPF0481 protein At3g47200-like isoform X22.6e-9849.88Show/hide
Query:  MRRELPLNTTECSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKM
        M +ELP    EC+I+RVPRRL K N  AY PQ+ISIGPFHHGR+ LM ME+HKL+ LD +LRR N  +E  + I R+WE  AR CYAEPINM+ D FVKM
Subjt:  MRRELPLNTTECSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKM

Query:  MLVDGCFLLEFMILLHYNFSPPPQILTLYGP-FYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIE-GISFMGLVQFFLECACRIS------H
        MLVDGCF++E M+++    S   +  T + P  +  M   L  D+IMLENQLP FVL+ LF+Q       ++E G+SF+ L   F      I        
Subjt:  MLVDGCFLLEFMILLHYNFSPPPQILTLYGP-FYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIE-GISFMGLVQFFLECACRIS------H

Query:  HSQIVCKRNAYHLVHLLSDYF--RFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEH
        H  ++      HLV  LS Y+     S      +    +K    PPT+TELWEAG+  KK  + K  MD+SFK+ VL+I  ++I D FE  +RNLMAFE 
Subjt:  HSQIVCKRNAYHLVHLLSDYF--RFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEH

Query:  YHSSISRSRYIINYVSFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWA
        YH+     +Y I Y  FL+GLIS E+D SLLV A I+TNCIGG+N+EVS LFNDLCK V +      + ++ +AL +HC    N+ MASLRRDYFNTPWA
Subjt:  YHSSISRSRYIINYVSFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWA

Query:  AISFVAATFLILLTFIQTMYSHLSYLK
         ISFVAA FLILLTF+QT++S +S  K
Subjt:  AISFVAATFLILLTFIQTMYSHLSYLK

A0A6J1DYL4 UPF0481 protein At3g47200-like isoform X32.6e-9849.88Show/hide
Query:  MRRELPLNTTECSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKM
        M +ELP    EC+I+RVPRRL K N  AY PQ+ISIGPFHHGR+ LM ME+HKL+ LD +LRR N  +E  + I R+WE  AR CYAEPINM+ D FVKM
Subjt:  MRRELPLNTTECSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKM

Query:  MLVDGCFLLEFMILLHYNFSPPPQILTLYGP-FYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIE-GISFMGLVQFFLECACRIS------H
        MLVDGCF++E M+++    S   +  T + P  +  M   L  D+IMLENQLP FVL+ LF+Q       ++E G+SF+ L   F      I        
Subjt:  MLVDGCFLLEFMILLHYNFSPPPQILTLYGP-FYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIE-GISFMGLVQFFLECACRIS------H

Query:  HSQIVCKRNAYHLVHLLSDYF--RFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEH
        H  ++      HLV  LS Y+     S      +    +K    PPT+TELWEAG+  KK  + K  MD+SFK+ VL+I  ++I D FE  +RNLMAFE 
Subjt:  HSQIVCKRNAYHLVHLLSDYF--RFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEH

Query:  YHSSISRSRYIINYVSFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWA
        YH+     +Y I Y  FL+GLIS E+D SLLV A I+TNCIGG+N+EVS LFNDLCK V +      + ++ +AL +HC    N+ MASLRRDYFNTPWA
Subjt:  YHSSISRSRYIINYVSFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWA

Query:  AISFVAATFLILLTFIQTMYSHLSYLK
         ISFVAA FLILLTF+QT++S +S  K
Subjt:  AISFVAATFLILLTFIQTMYSHLSYLK

A0A6J1E120 UPF0481 protein At3g47200-like isoform X12.6e-9849.88Show/hide
Query:  MRRELPLNTTECSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKM
        M +ELP    EC+I+RVPRRL K N  AY PQ+ISIGPFHHGR+ LM ME+HKL+ LD +LRR N  +E  + I R+WE  AR CYAEPINM+ D FVKM
Subjt:  MRRELPLNTTECSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKM

Query:  MLVDGCFLLEFMILLHYNFSPPPQILTLYGP-FYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIE-GISFMGLVQFFLECACRIS------H
        MLVDGCF++E M+++    S   +  T + P  +  M   L  D+IMLENQLP FVL+ LF+Q       ++E G+SF+ L   F      I        
Subjt:  MLVDGCFLLEFMILLHYNFSPPPQILTLYGP-FYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIE-GISFMGLVQFFLECACRIS------H

Query:  HSQIVCKRNAYHLVHLLSDYF--RFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEH
        H  ++      HLV  LS Y+     S      +    +K    PPT+TELWEAG+  KK  + K  MD+SFK+ VL+I  ++I D FE  +RNLMAFE 
Subjt:  HSQIVCKRNAYHLVHLLSDYF--RFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEH

Query:  YHSSISRSRYIINYVSFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWA
        YH+     +Y I Y  FL+GLIS E+D SLLV A I+TNCIGG+N+EVS LFNDLCK V +      + ++ +AL +HC    N+ MASLRRDYFNTPWA
Subjt:  YHSSISRSRYIINYVSFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWA

Query:  AISFVAATFLILLTFIQTMYSHLSYLK
         ISFVAA FLILLTF+QT++S +S  K
Subjt:  AISFVAATFLILLTFIQTMYSHLSYLK

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026452.5e-1333.12Show/hide
Query:  ELPLNTTECSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRIN-MSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKMML
        E  L     SI+ VP+ L   +P +YTP  +SIGP+H  +  L  MER+KL        + N     D++   ++ E   R CY + I  N +  + +M 
Subjt:  ELPLNTTECSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRIN-MSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKMML

Query:  VDGCFLLEFMILLHYNFSPPPQILTLYGPFYEGMELHLCYDIIMLENQLPLFVL
        VD  FL+EF+ +  Y+F     ++   G         +  DI+M+ENQ+PLFVL
Subjt:  VDGCFLLEFMILLHYNFSPPPQILTLYGPFYEGMELHLCYDIIMLENQLPLFVL

Q9SD53 UPF0481 protein At3g472006.1e-3627.53Show/hide
Query:  CSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRI-NMSVEDIIAIDR--NWERDARRCYAEPINMNHDHFVKMMLVDGCFL
        C I+RVP     +NP AY P+V+SIGP+H+G +HL  +++HK + L +FL       VE+ + +    + E   R+ Y+E +   HD  + MM++DGCF+
Subjt:  CSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRI-NMSVEDIIAIDR--NWERDARRCYAEPINMNHDHFVKMMLVDGCFL

Query:  LEFMILLHYNFSPPPQILTLYGPFYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIEGISFMGLVQFFLECACRISHHSQIVCKRNAYHLVHL
        L   +++  N       +         ++     D+++LENQ+P FVL+ L+   +      +  I+F     FF     +   + +      A HL+ L
Subjt:  LEFMILLHYNFSPPPQILTLYGPFYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIEGISFMGLVQFFLECACRISHHSQIVCKRNAYHLVHL

Query:  LSDYF---RFLSDK-----------DGRENSESEQKGSWIPPTLT--ELWEAGVTIK-KVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEHY
        + + F      SDK           +G+  +        +P  L+   L   G+  + + +KE   ++V  K   L+I  +  +        N +AFE +
Subjt:  LSDYF---RFLSDK-----------DGRENSESEQKGSWIPPTLT--ELWEAGVTIK-KVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEHY

Query:  HSSISRSRYIINYVSFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWAA
        ++  S    I  Y+ F+  L++ E+D + L N +++     GSN EVS+ F  + K V       Y  N+ K + ++ K   N   A  R  +F +PW  
Subjt:  HSSISRSRYIINYVSFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWAA

Query:  ISFVAATFLILLTFIQTMYSHLSYL
        +S  A  F+ILLT +Q+  + LSYL
Subjt:  ISFVAATFLILLTFIQTMYSHLSYL

Arabidopsis top hitse value%identityAlignment
AT2G36430.1 Plant protein of unknown function (DUF247)7.1e-4830.9Show/hide
Query:  CSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRI-NMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKMMLVDGCFLLE
        CSI+RVP+ +   N   Y P+V+SIGP+H G+  L  +E HK + L++ L R  N+++ED +   +N E  AR CY+E I+M+ + F +MM++DGCFLLE
Subjt:  CSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRI-NMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKMMLVDGCFLLE

Query:  FM----ILLHYNFSPPPQILTLYGPFYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKD----CAIEGISFMGLVQFFLECACRISHHSQIVCKRNA
               L+ +  + P   +    PF+         D + LENQ+P FVLE LFN  R   +     +++ ++F     FF     R         +  A
Subjt:  FM----ILLHYNFSPPPQILTLYGPFYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKD----CAIEGISFMGLVQFFLECACRISHHSQIVCKRNA

Query:  YHLVHLLSDYFRFLSD--KDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEHYHSSISRSRY
         HL+ LL   F   S+       N   E+  S I  ++++L  AG+ ++++   + ++ V F++G +E+  I ++D     L N +A+E  H  ++ S +
Subjt:  YHLVHLLSDYFRFLSD--KDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEHYHSSISRSRY

Query:  IINYVSFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWAAISFVAATFL
           Y + LD L +T KD   L +  I+ N   G++ E++K  N L + V      CY  ++ + + ++ K+  +   A+ +  YFN+PW+ +S +AA  L
Subjt:  IINYVSFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWAAISFVAATFL

Query:  ILLTFIQTMYS
        ++L+ IQT+Y+
Subjt:  ILLTFIQTMYS

AT3G50120.1 Plant protein of unknown function (DUF247)1.4e-4829.89Show/hide
Query:  IYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKMMLVDGCFLLEFMI
        IYRVP  L + +  +Y PQ +S+GP+HHG++ L +M+RHK +A++  L+R N  ++  I   R  E  AR CY  P++++ + F++M+++DGCF+LE   
Subjt:  IYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKMMLVDGCFLLEFMI

Query:  -----LLHYNFSPPPQILTLYGPFYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIEGISFMGLVQFF-----------------LECACRIS
                  ++    +  + G  +      +  D++MLENQLPLFVL  L       ++    G+     ++FF                 LE +    
Subjt:  -----LLHYNFSPPPQILTLYGPFYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIEGISFMGLVQFF-----------------LECACRIS

Query:  HHSQIVCKRNAYHLVHLLSDYF---------RFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRL
                    H + +              R    +  R    ++++   +   +TEL EAG+  ++   ++ W D+ FKNG LEI  + I+D  +   
Subjt:  HHSQIVCKRNAYHLVHLLSDYF---------RFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRL

Query:  RNLMAFEHYHSSISRSRYIINYVSFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRR
         NL+AFE  H  I  S  I +Y+ F+D LI + +D S L    I+ + + GS+ EV+ LFN LC+ V   + + Y   ++  + ++     N W A+L+ 
Subjt:  RNLMAFEHYHSSISRSRYIINYVSFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRR

Query:  DYFNTPWAAISFVAATFLILLTFIQTMYSHLSYLK
         YFN PWA +SF AA  L++LTF Q+ Y+  +Y K
Subjt:  DYFNTPWAAISFVAATFLILLTFIQTMYSHLSYLK

AT3G50150.1 Plant protein of unknown function (DUF247)1.9e-4832.7Show/hide
Query:  IYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINM-NHDHFVKMMLVDGCFLLEFM
        IYRVP  L + +  +Y PQ +SIGP+HHG+ HL  MERHK +A+++ + R   ++E  I   +  E +AR CY  PI+M N + F +M+++DGCF+LE  
Subjt:  IYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINM-NHDHFVKMMLVDGCFLLEFM

Query:  ILLHYNFSPPPQILTLYGPFYEGMELHLCYDIIMLENQLPLFVLEDLF-------NQLRDCKDCAIEGISFMGLVQFFLECACRISHHSQIVCKRNAYHL
              F               G+   +  D+IMLENQLPLFVL+ L        NQ     + A+     +      L  + R     +   +      
Subjt:  ILLHYNFSPPPQILTLYGPFYEGMELHLCYDIIMLENQLPLFVLEDLF-------NQLRDCKDCAIEGISFMGLVQFFLECACRISHHSQIVCKRNAYHL

Query:  VHLLSDYFRFLSDKDGRENSESE-------QKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEHYHSSISRS
        +H L  + R L       N  +        +K   +   +TEL  AGV   +    + W D+ FKNG L+I  + I+D  +    NL+AFE  H+  S +
Subjt:  VHLLSDYFRFLSDKDGRENSESE-------QKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEHYHSSISRS

Query:  RYIINYVSFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWAAISFVAAT
          I +Y+ F+D LI++ +D S L +  I+ + + GS+ EV+ LFN LCK V     + Y   +++ + ++     N   A+LR+ YFN PWA  SF AA 
Subjt:  RYIINYVSFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWAAISFVAAT

Query:  FLILLTFIQTMYSHLSYLK
         L+ LTF Q+ ++  +Y K
Subjt:  FLILLTFIQTMYSHLSYLK

AT3G50160.1 Plant protein of unknown function (DUF247)9.9e-5032.52Show/hide
Query:  IYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKMMLVDGCFLLEFMI
        IYRVP  L + +  +Y PQ++SIGP+HHG +HLM MERHK +A+++ + R    +E  I   +  E  AR CY  PINMN + F++M+++DG F++E   
Subjt:  IYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKMMLVDGCFLLEFMI

Query:  -----LLHYNFSPPPQILTLYGPFYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIEGISFMGLVQFFLECACRISHHSQIVCKRNAYHLVHL
                  ++P   +  +      G+   +  D++MLENQLP  VL+ L  QL+  +   ++ ++    VQ F      +    +++ +    H + +
Subjt:  -----LLHYNFSPPPQILTLYGPFYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIEGISFMGLVQFFLECACRISHHSQIVCKRNAYHLVHL

Query:  LSDYFRFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEHYHSSISRSRYIINYVSFL
        L       S     + S   ++   +   +TEL  AGV   +      W D+ FKNG L+I  + I+D  +    NL+AFE  H  I  S+ I +Y+ F+
Subjt:  LSDYFRFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEHYHSSISRSRYIINYVSFL

Query:  DGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWAAISFVAATFLILLTFIQT
        D LI++ +D S L +  I+ N + GS+ EVS LFN L K V    ++ Y   +   +  + +   N   A+LR  YFN PWA  SF+AA  L++ TF Q+
Subjt:  DGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWAAISFVAATFLILLTFIQT

Query:  MYSHLSYLK
         ++  +Y K
Subjt:  MYSHLSYLK

AT4G31980.1 unknown protein1.4e-5936.19Show/hide
Query:  TTECSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKMMLVDGCFL
        +T+C IY+VP +L ++NP AYTP+++S GP H G+E L AME  K + L  F+ R N S+ED++ + R WE++AR CYAE + ++ D FV+M++VDG FL
Subjt:  TTECSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKMMLVDGCFL

Query:  LEFMILLHYNFSPPPQILTLYGPFYEG--MELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIEGISFMGLVQFFLECACRISHHSQIVCKRNAYHLV
        +E ++  HY     P++       +    M   +C D+I++ENQLP FV++++F  L +        I  +   + F     RI     I       H V
Subjt:  LEFMILLHYNFSPPPQILTLYGPFYEG--MELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIEGISFMGLVQFFLECACRISHHSQIVCKRNAYHLV

Query:  HLL-SDYFRFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEHYHSSISRSRYIINYV
         LL S Y      K      + +       P  TEL  AGV  K        +D+SF +GVL+I  I ++D  E   +N++ FE    S   ++  ++Y+
Subjt:  HLL-SDYFRFLSDKDGRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEHYHSSISRSRYIINYV

Query:  SFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWAAISFVAATFLILLTF
          L   I +  DA LL+++ I+ N +G S  +VS LFN + K V I     Y+  +++ L+ +C T  NRW A LRRDYF+ PWA  S  AA  L+LLTF
Subjt:  SFLDGLISTEKDASLLVNAEILTNCIGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWAAISFVAATFLILLTF

Query:  IQTMYSHLS
        IQ++ S L+
Subjt:  IQTMYSHLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTCGCGAATTGCCTCTCAACACTACGGAATGCAGCATCTATCGGGTCCCCAGACGGCTATTCAAGATAAATCCTATAGCCTATACGCCTCAAGTCATTTCCATTGG
TCCTTTTCACCATGGTCGAGAGCATTTGATGGCCATGGAACGACATAAGCTTCAAGCTCTCGATATCTTCCTACGTCGAATAAATATGAGTGTTGAGGATATCATTGCAA
TTGATCGAAATTGGGAAAGGGATGCTCGTCGTTGCTATGCAGAACCCATAAACATGAACCACGACCATTTTGTGAAAATGATGCTTGTGGACGGTTGTTTCCTACTGGAA
TTTATGATACTGTTGCATTACAACTTCAGTCCACCACCTCAAATTCTAACGTTATATGGCCCATTCTATGAGGGTATGGAGCTTCATTTATGTTACGATATTATAATGCT
TGAGAATCAACTTCCTCTCTTCGTTCTCGAAGATCTATTTAACCAACTTAGAGACTGCAAAGACTGTGCCATCGAAGGAATCTCCTTTATGGGACTTGTACAATTTTTTC
TCGAATGTGCGTGTCGTATATCACACCACTCCCAAATTGTCTGCAAAAGAAATGCATATCACCTGGTCCATTTGTTGAGTGACTACTTCAGATTCTTAAGTGATAAAGAT
GGGAGGGAAAATAGTGAATCGGAGCAGAAGGGTTCTTGGATTCCCCCAACTTTAACTGAGCTCTGGGAGGCTGGTGTCACCATCAAGAAAGTAACAAAAGAGAAACCCTG
GATGGACGTAAGTTTCAAAAATGGGGTTCTAGAAATCTCAGATATCGACATTAACGATCAATTTGAAATTCGTTTAAGAAATCTAATGGCGTTTGAGCATTACCACTCAA
GTATAAGTCGTTCAAGGTATATAATCAACTATGTCTCATTTCTAGATGGCTTGATAAGCACGGAGAAAGACGCAAGTTTACTTGTGAATGCAGAAATCCTAACCAACTGT
ATTGGTGGCAGTAATGAAGAAGTTTCAAAACTGTTTAATGATTTATGTAAAGGAGTAACAATCTCAAGTCATAACTGTTACTACTACAATATGGCCAAAGCTTTAAGAAA
GCATTGCAAGACAATGAAGAATCGGTGGATGGCTTCATTGAGACGCGACTATTTTAATACACCATGGGCTGCTATCTCCTTTGTTGCAGCAACTTTTCTCATTCTTCTCA
CTTTCATTCAAACCATGTACTCTCATCTATCGTATTTAAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCGTCGCGAATTGCCTCTCAACACTACGGAATGCAGCATCTATCGGGTCCCCAGACGGCTATTCAAGATAAATCCTATAGCCTATACGCCTCAAGTCATTTCCATTGG
TCCTTTTCACCATGGTCGAGAGCATTTGATGGCCATGGAACGACATAAGCTTCAAGCTCTCGATATCTTCCTACGTCGAATAAATATGAGTGTTGAGGATATCATTGCAA
TTGATCGAAATTGGGAAAGGGATGCTCGTCGTTGCTATGCAGAACCCATAAACATGAACCACGACCATTTTGTGAAAATGATGCTTGTGGACGGTTGTTTCCTACTGGAA
TTTATGATACTGTTGCATTACAACTTCAGTCCACCACCTCAAATTCTAACGTTATATGGCCCATTCTATGAGGGTATGGAGCTTCATTTATGTTACGATATTATAATGCT
TGAGAATCAACTTCCTCTCTTCGTTCTCGAAGATCTATTTAACCAACTTAGAGACTGCAAAGACTGTGCCATCGAAGGAATCTCCTTTATGGGACTTGTACAATTTTTTC
TCGAATGTGCGTGTCGTATATCACACCACTCCCAAATTGTCTGCAAAAGAAATGCATATCACCTGGTCCATTTGTTGAGTGACTACTTCAGATTCTTAAGTGATAAAGAT
GGGAGGGAAAATAGTGAATCGGAGCAGAAGGGTTCTTGGATTCCCCCAACTTTAACTGAGCTCTGGGAGGCTGGTGTCACCATCAAGAAAGTAACAAAAGAGAAACCCTG
GATGGACGTAAGTTTCAAAAATGGGGTTCTAGAAATCTCAGATATCGACATTAACGATCAATTTGAAATTCGTTTAAGAAATCTAATGGCGTTTGAGCATTACCACTCAA
GTATAAGTCGTTCAAGGTATATAATCAACTATGTCTCATTTCTAGATGGCTTGATAAGCACGGAGAAAGACGCAAGTTTACTTGTGAATGCAGAAATCCTAACCAACTGT
ATTGGTGGCAGTAATGAAGAAGTTTCAAAACTGTTTAATGATTTATGTAAAGGAGTAACAATCTCAAGTCATAACTGTTACTACTACAATATGGCCAAAGCTTTAAGAAA
GCATTGCAAGACAATGAAGAATCGGTGGATGGCTTCATTGAGACGCGACTATTTTAATACACCATGGGCTGCTATCTCCTTTGTTGCAGCAACTTTTCTCATTCTTCTCA
CTTTCATTCAAACCATGTACTCTCATCTATCGTATTTAAAGTAA
Protein sequenceShow/hide protein sequence
MRRELPLNTTECSIYRVPRRLFKINPIAYTPQVISIGPFHHGREHLMAMERHKLQALDIFLRRINMSVEDIIAIDRNWERDARRCYAEPINMNHDHFVKMMLVDGCFLLE
FMILLHYNFSPPPQILTLYGPFYEGMELHLCYDIIMLENQLPLFVLEDLFNQLRDCKDCAIEGISFMGLVQFFLECACRISHHSQIVCKRNAYHLVHLLSDYFRFLSDKD
GRENSESEQKGSWIPPTLTELWEAGVTIKKVTKEKPWMDVSFKNGVLEISDIDINDQFEIRLRNLMAFEHYHSSISRSRYIINYVSFLDGLISTEKDASLLVNAEILTNC
IGGSNEEVSKLFNDLCKGVTISSHNCYYYNMAKALRKHCKTMKNRWMASLRRDYFNTPWAAISFVAATFLILLTFIQTMYSHLSYLK