; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028145 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028145
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationchr8:14460078..14461421
RNA-Seq ExpressionLag0028145
SyntenyLag0028145
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8651212.1 hypothetical protein Csa_001883 [Cucumis sativus]2.0e-11454.21Show/hide
Query:  KPDDVANAEVALRCYNLVNVIGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNGGIKGVVRAALT
        KP+D  + EV+     +  +I  +LQ LP +T EC IYRVS+RL+NI+P  YEP+++SIGPFHHGR+ LK MEQFKLQFL                    
Subjt:  KPDDVANAEVALRCYNLVNVIGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNGGIKGVVRAALT

Query:  LETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPING----VTLVEV
                     + MN+  FV M+LVDGCFVVEFL+A      QT   +  L+ +AMNI+LYHDLI+LENQLPFFVLQGL   I   N       LV +
Subjt:  LETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPING----VTLVEV

Query:  VEKFFANIFMKDH-KIRRNI---SHVNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGVVLEKATSAN---NIMDISFEGGVLKIPP
        V  FF   FMK + KI +NI   +  NI+HL+DFLGFYY+  +T + +   Q  ++ L LPP TTEL +AGV+LEKA + N   NIM ISFEGGVLKIPP
Subjt:  VEKFFANIFMKDH-KIRRNI---SHVNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGVVLEKATSAN---NIMDISFEGGVLKIPP

Query:  FEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGDFYYKDISMALHEHCKAR
        FEIHDLFEI+MRNL+AFENFQ G+  +S A HY+LFLGALIS EKDSSLL+KKGI++NLIGGSDEEVSN+FN+IGK V  +G F Y   S  L +HC A+
Subjt:  FEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGDFYYKDISMALHEHCKAR

Query:  WNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFS
         N+WMA L+RDYFNTPW   SFI A +  ++TLL+T F+
Subjt:  WNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFS

XP_008443397.1 PREDICTED: LOW QUALITY PROTEIN: UPF0481 protein At3g47200-like [Cucumis melo]3.3e-13058.71Show/hide
Query:  MHQHNTKPDDVANAEVALRCYNLVNVIGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNG---GI
        MHQ N       N+ VA     + ++I  +LQ LP IT EC IYRVS+RL+NIHP  YEP+++SIGPFHHGR+DLK MEQFKL+FL  YLSR +      
Subjt:  MHQHNTKPDDVANAEVALRCYNLVNVIGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNG---GI

Query:  KGVVRAALTLETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLR--SLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPIN
        + VV+AAL  ETKAR CYED  ++MN+  FV M+LVDGCF+VEFLVA+YGE  QTQ   R   L+ +AMNI+LYHDLIMLENQLPFFV+QGLF  I   N
Subjt:  KGVVRAALTLETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLR--SLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPIN

Query:  G-----VTLVEVVEKFFANIFMKDHK-IRRNI---SHVNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGVVLEKA--TSANNIMDI
              + LV +V  FF   F+K H+ I  NI    + +I HL+DFLGFYY   +  +   N    N+ L LPP TTEL +AGV+LEKA  TS  NIM  
Subjt:  G-----VTLVEVVEKFFANIFMKDHK-IRRNI---SHVNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGVVLEKA--TSANNIMDI

Query:  SFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGDFYYKDI
        SFEGGVLKIPPFEIHDLFEI+MRNL+AFENFQ G+  +S A HY+ FLGALIS EKDSSLL+KKGI++NLIGGSD EVSN+FN+IGK V  +G FYY   
Subjt:  SFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGDFYYKDI

Query:  SMALHEHCKARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTI
        S  L +HC AR NRWMA L+RDY NTPWA +S +  T++ ++TLL+TI
Subjt:  SMALHEHCKARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTI

XP_022158989.1 UPF0481 protein At3g47200-like isoform X1 [Momordica charantia]8.9e-10750.34Show/hide
Query:  HQHNTKPDDVANAEVALRCYNLVNVIGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNGGIKGVV
        + H T  D+V ++            I K LQ+LPP+  EC+I+RV RRLL  + +AY P+++SIGPFHHGRQDL  MEQ KL+FL  YL RTN GI+  V
Subjt:  HQHNTKPDDVANAEVALRCYNLVNVIGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNGGIKGVV

Query:  RAALTLETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPINGVTLVE
            + ET ARNCY +  +NM++D FVKMMLVDGCF+VE ++ V     +T+     L+F AM   LY DLIMLENQLPFFVLQGLF+  S   G++ ++
Subjt:  RAALTLETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPINGVTLVE

Query:  VVEKFFA-NIFMKDHKIRR----NISHVNIKHLIDFLGFYY--AIPSTSLASNNIQPQNKWLLLPPPTTELCDAGVVLEKATSANNIMDISFEGGVLKIP
        +   F+     +K   +       IS   + HL+DFL FYY  A  S S  S+++    K    PP  TEL +AG+V +KA  A +IMDISF+  VL+IP
Subjt:  VVEKFFA-NIFMKDHKIRR----NISHVNIKHLIDFLGFYY--AIPSTSLASNNIQPQNKWLLLPPPTTELCDAGVVLEKATSANNIMDISFEGGVLKIP

Query:  PFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGDF-YYKDISMALHEHCK
        P EI D+FE  +RNLMAFE +   ++   YA  Y LFL  LIS E+D SLLVK  I+TN IGG+++EVS +FND+ KDV+++GD   +  I+ ALHEHC 
Subjt:  PFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGDF-YYKDISMALHEHCK

Query:  ARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFSGV
        ARWN+ MASLRRDYFNTPWAFISF+AA  +I+LT L+T+FS +
Subjt:  ARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFSGV

XP_022158990.1 UPF0481 protein At3g47200-like isoform X2 [Momordica charantia]8.9e-10750.34Show/hide
Query:  HQHNTKPDDVANAEVALRCYNLVNVIGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNGGIKGVV
        + H T  D+V ++            I K LQ+LPP+  EC+I+RV RRLL  + +AY P+++SIGPFHHGRQDL  MEQ KL+FL  YL RTN GI+  V
Subjt:  HQHNTKPDDVANAEVALRCYNLVNVIGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNGGIKGVV

Query:  RAALTLETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPINGVTLVE
            + ET ARNCY +  +NM++D FVKMMLVDGCF+VE ++ V     +T+     L+F AM   LY DLIMLENQLPFFVLQGLF+  S   G++ ++
Subjt:  RAALTLETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPINGVTLVE

Query:  VVEKFFA-NIFMKDHKIRR----NISHVNIKHLIDFLGFYY--AIPSTSLASNNIQPQNKWLLLPPPTTELCDAGVVLEKATSANNIMDISFEGGVLKIP
        +   F+     +K   +       IS   + HL+DFL FYY  A  S S  S+++    K    PP  TEL +AG+V +KA  A +IMDISF+  VL+IP
Subjt:  VVEKFFA-NIFMKDHKIRR----NISHVNIKHLIDFLGFYY--AIPSTSLASNNIQPQNKWLLLPPPTTELCDAGVVLEKATSANNIMDISFEGGVLKIP

Query:  PFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGDF-YYKDISMALHEHCK
        P EI D+FE  +RNLMAFE +   ++   YA  Y LFL  LIS E+D SLLVK  I+TN IGG+++EVS +FND+ KDV+++GD   +  I+ ALHEHC 
Subjt:  PFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGDF-YYKDISMALHEHCK

Query:  ARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFSGV
        ARWN+ MASLRRDYFNTPWAFISF+AA  +I+LT L+T+FS +
Subjt:  ARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFSGV

XP_038904513.1 UPF0481 protein At3g47200-like [Benincasa hispida]4.7e-14060.66Show/hide
Query:  MDFDHIEMHQHNTKPDDVANAEVALRCYNLVNVIGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRT
        M+    +MHQ N       NAE+  R YN+  +I  EL++LP +T EC I+RVS+RLLNIH  AYEP+++SIGPFHHGR+DLK MEQFKLQFLR +++R 
Subjt:  MDFDHIEMHQHNTKPDDVANAEVALRCYNLVNVIGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRT

Query:  NG---GIKGVVRAALTL-ETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRS----LIFRAMNIHLYHDLIMLENQLPFFVLQ
        N      K VV  AL   ET+ARNCYED A NMN+  FV+MMLVDGCF+VEFLV+VYG  PQTQ    S    L+F+AMNI+LYHDLIMLENQLPFFVLQ
Subjt:  NG---GIKGVVRAALTL-ETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRS----LIFRAMNIHLYHDLIMLENQLPFFVLQ

Query:  GLFNII--SPINGVTLVEVVEKFFANIFMKDH--KIRRNISHVNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGVVLEKATSANNI
         LF++I     N  TLV+++ KFF + FMK +    +      NI+HL+ FL FYY+  +  +   N    NK LLLPP  TEL +AGV+LEK  S +NI
Subjt:  GLFNII--SPINGVTLVEVVEKFFANIFMKDH--KIRRNISHVNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGVVLEKATSANNI

Query:  MDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGDFYY
        ++++F+ GVLKIPPFEIH LFEI MRNLMAFENFQ  N  QSYA HYVLFLGALIS EKDSSLL+KKGI+TNLIGGSDEEVSN+FN+IGK V  QG FYY
Subjt:  MDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGDFYY

Query:  KDISMALHEHCKARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFSGV
        +D+S  LH+HCK R NRWMASLRRDY NTPWA IS +AA  +     L+TIFSG+
Subjt:  KDISMALHEHCKARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFSGV

TrEMBL top hitse value%identityAlignment
A0A0A0LC32 Uncharacterized protein4.9e-12757.86Show/hide
Query:  KPDDVANAEVALRCYNLVNVIGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNGGIKGVVRAALT
        KP+D  + EV+     +  +I  +LQ LP +T EC IYRVS+RL+NI+P  YEP+++SIGPFHHGR+ LK MEQFKLQFL  YLSR +       R  L+
Subjt:  KPDDVANAEVALRCYNLVNVIGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNGGIKGVVRAALT

Query:  LETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPING----VTLVEV
         ETKAR CYED A++MN+  FV M+LVDGCFVVEFL+A      QT   +  L+ +AMNI+LYHDLI+LENQLPFFVLQGL   I   N       LV +
Subjt:  LETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPING----VTLVEV

Query:  VEKFFANIFMKDH-KIRRNI---SHVNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGVVLEKATSAN---NIMDISFEGGVLKIPP
        V  FF   FMK + KI +NI   +  NI+HL+DFLGFYY+  +T + +   Q  ++ L LPP TTEL +AGV+LEKA + N   NIM ISFEGGVLKIPP
Subjt:  VEKFFANIFMKDH-KIRRNI---SHVNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGVVLEKATSAN---NIMDISFEGGVLKIPP

Query:  FEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGDFYYKDISMALHEHCKAR
        FEIHDLFEI+MRNL+AFENFQ G+  +S A HY+LFLGALIS EKDSSLL+KKGI++NLIGGSDEEVSN+FN+IGK V  +G F Y   S  L +HC A+
Subjt:  FEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGDFYYKDISMALHEHCKAR

Query:  WNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFS
         N+WMA L+RDYFNTPW   SFI A +  ++TLL+T F+
Subjt:  WNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFS

A0A1S3B8P8 LOW QUALITY PROTEIN: UPF0481 protein At3g47200-like1.6e-13058.71Show/hide
Query:  MHQHNTKPDDVANAEVALRCYNLVNVIGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNG---GI
        MHQ N       N+ VA     + ++I  +LQ LP IT EC IYRVS+RL+NIHP  YEP+++SIGPFHHGR+DLK MEQFKL+FL  YLSR +      
Subjt:  MHQHNTKPDDVANAEVALRCYNLVNVIGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNG---GI

Query:  KGVVRAALTLETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLR--SLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPIN
        + VV+AAL  ETKAR CYED  ++MN+  FV M+LVDGCF+VEFLVA+YGE  QTQ   R   L+ +AMNI+LYHDLIMLENQLPFFV+QGLF  I   N
Subjt:  KGVVRAALTLETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLR--SLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPIN

Query:  G-----VTLVEVVEKFFANIFMKDHK-IRRNI---SHVNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGVVLEKA--TSANNIMDI
              + LV +V  FF   F+K H+ I  NI    + +I HL+DFLGFYY   +  +   N    N+ L LPP TTEL +AGV+LEKA  TS  NIM  
Subjt:  G-----VTLVEVVEKFFANIFMKDHK-IRRNI---SHVNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGVVLEKA--TSANNIMDI

Query:  SFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGDFYYKDI
        SFEGGVLKIPPFEIHDLFEI+MRNL+AFENFQ G+  +S A HY+ FLGALIS EKDSSLL+KKGI++NLIGGSD EVSN+FN+IGK V  +G FYY   
Subjt:  SFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGDFYYKDI

Query:  SMALHEHCKARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTI
        S  L +HC AR NRWMA L+RDY NTPWA +S +  T++ ++TLL+TI
Subjt:  SMALHEHCKARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTI

A0A6J1DXD6 UPF0481 protein At3g47200-like isoform X24.3e-10750.34Show/hide
Query:  HQHNTKPDDVANAEVALRCYNLVNVIGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNGGIKGVV
        + H T  D+V ++            I K LQ+LPP+  EC+I+RV RRLL  + +AY P+++SIGPFHHGRQDL  MEQ KL+FL  YL RTN GI+  V
Subjt:  HQHNTKPDDVANAEVALRCYNLVNVIGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNGGIKGVV

Query:  RAALTLETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPINGVTLVE
            + ET ARNCY +  +NM++D FVKMMLVDGCF+VE ++ V     +T+     L+F AM   LY DLIMLENQLPFFVLQGLF+  S   G++ ++
Subjt:  RAALTLETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPINGVTLVE

Query:  VVEKFFA-NIFMKDHKIRR----NISHVNIKHLIDFLGFYY--AIPSTSLASNNIQPQNKWLLLPPPTTELCDAGVVLEKATSANNIMDISFEGGVLKIP
        +   F+     +K   +       IS   + HL+DFL FYY  A  S S  S+++    K    PP  TEL +AG+V +KA  A +IMDISF+  VL+IP
Subjt:  VVEKFFA-NIFMKDHKIRR----NISHVNIKHLIDFLGFYY--AIPSTSLASNNIQPQNKWLLLPPPTTELCDAGVVLEKATSANNIMDISFEGGVLKIP

Query:  PFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGDF-YYKDISMALHEHCK
        P EI D+FE  +RNLMAFE +   ++   YA  Y LFL  LIS E+D SLLVK  I+TN IGG+++EVS +FND+ KDV+++GD   +  I+ ALHEHC 
Subjt:  PFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGDF-YYKDISMALHEHCK

Query:  ARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFSGV
        ARWN+ MASLRRDYFNTPWAFISF+AA  +I+LT L+T+FS +
Subjt:  ARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFSGV

A0A6J1DYL4 UPF0481 protein At3g47200-like isoform X35.6e-10752.39Show/hide
Query:  IGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNGGIKGVVRAALTLETKARNCYEDLAMNMNNDR
        I K LQ+LPP+  EC+I+RV RRLL  + +AY P+++SIGPFHHGRQDL  MEQ KL+FL  YL RTN GI+  V    + ET ARNCY +  +NM++D 
Subjt:  IGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNGGIKGVVRAALTLETKARNCYEDLAMNMNNDR

Query:  FVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPINGVTLVEVVEKFFA-NIFMKDHKIRR----NI
        FVKMMLVDGCF+VE ++ V     +T+     L+F AM   LY DLIMLENQLPFFVLQGLF+  S   G++ +++   F+     +K   +       I
Subjt:  FVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPINGVTLVEVVEKFFA-NIFMKDHKIRR----NI

Query:  SHVNIKHLIDFLGFYY--AIPSTSLASNNIQPQNKWLLLPPPTTELCDAGVVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGN
        S   + HL+DFL FYY  A  S S  S+++    K    PP  TEL +AG+V +KA  A +IMDISF+  VL+IPP EI D+FE  +RNLMAFE +   +
Subjt:  SHVNIKHLIDFLGFYY--AIPSTSLASNNIQPQNKWLLLPPPTTELCDAGVVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGN

Query:  HIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGDF-YYKDISMALHEHCKARWNRWMASLRRDYFNTPWAFISFI
        +   YA  Y LFL  LIS E+D SLLVK  I+TN IGG+++EVS +FND+ KDV+++GD   +  I+ ALHEHC ARWN+ MASLRRDYFNTPWAFISF+
Subjt:  HIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGDF-YYKDISMALHEHCKARWNRWMASLRRDYFNTPWAFISFI

Query:  AATVIIVLTLLRTIFSGV
        AA  +I+LT L+T+FS +
Subjt:  AATVIIVLTLLRTIFSGV

A0A6J1E120 UPF0481 protein At3g47200-like isoform X14.3e-10750.34Show/hide
Query:  HQHNTKPDDVANAEVALRCYNLVNVIGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNGGIKGVV
        + H T  D+V ++            I K LQ+LPP+  EC+I+RV RRLL  + +AY P+++SIGPFHHGRQDL  MEQ KL+FL  YL RTN GI+  V
Subjt:  HQHNTKPDDVANAEVALRCYNLVNVIGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNGGIKGVV

Query:  RAALTLETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPINGVTLVE
            + ET ARNCY +  +NM++D FVKMMLVDGCF+VE ++ V     +T+     L+F AM   LY DLIMLENQLPFFVLQGLF+  S   G++ ++
Subjt:  RAALTLETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPINGVTLVE

Query:  VVEKFFA-NIFMKDHKIRR----NISHVNIKHLIDFLGFYY--AIPSTSLASNNIQPQNKWLLLPPPTTELCDAGVVLEKATSANNIMDISFEGGVLKIP
        +   F+     +K   +       IS   + HL+DFL FYY  A  S S  S+++    K    PP  TEL +AG+V +KA  A +IMDISF+  VL+IP
Subjt:  VVEKFFA-NIFMKDHKIRR----NISHVNIKHLIDFLGFYY--AIPSTSLASNNIQPQNKWLLLPPPTTELCDAGVVLEKATSANNIMDISFEGGVLKIP

Query:  PFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGDF-YYKDISMALHEHCK
        P EI D+FE  +RNLMAFE +   ++   YA  Y LFL  LIS E+D SLLVK  I+TN IGG+++EVS +FND+ KDV+++GD   +  I+ ALHEHC 
Subjt:  PFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGDF-YYKDISMALHEHCK

Query:  ARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFSGV
        ARWN+ MASLRRDYFNTPWAFISF+AA  +I+LT L+T+FS +
Subjt:  ARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFSGV

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026454.5e-1330.91Show/hide
Query:  NLVNVIGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNG-GIKGVVRAALTLETKARNCYEDLAM
        N+   +  EL++        SI+ V + L+  HP +Y P  +SIGP+H  + +L  ME++KL   R   ++ N      +V    ++E K R CY    +
Subjt:  NLVNVIGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNG-GIKGVVRAALTLETKARNCYEDLAM

Query:  NMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQ
          N +  + +M VD  F++EFL  +Y  F + +    +LI R  +  +  D++M+ENQ+P FVL+
Subjt:  NMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQ

Q9SD53 UPF0481 protein At3g472001.2e-3728.19Show/hide
Query:  CSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYL---SRTNGGIKGVVRAALTLETKARNCYEDLAMNMNNDRFVKMMLVDGCF
        C I+RV    + ++PKAY+P+V+SIGP+H+G + L+ ++Q K + L+ +L    + +     +V+A + LE K R  Y +     ++  F  MM++DGCF
Subjt:  CSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYL---SRTNGGIKGVVRAALTLETKARNCYEDLAMNMNNDRFVKMMLVDGCF

Query:  VVEFLVAVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPINGVT--LVEVVEKFFANIFMKDHKIRRNISHVNIKHLIDFLGF
        ++   + + G    ++  + S+ +   +I    DL++LENQ+PFFVLQ L+  +    GV+  L  +   FF N   K+        +   KHL+D +  
Subjt:  VVEFLVAVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPINGVT--LVEVVEKFFANIFMKDHKIRRNISHVNIKHLIDFLGF

Query:  YYAIPSTS----LASNNIQPQ-------------NKWLLLPPPTTELCDAGVVLE-KATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQV
         + +P+TS     +S ++Q Q             +K + L      L   G+    + +  ++I+++  +   L+IP             N +AFE F  
Subjt:  YYAIPSTS----LASNNIQPQ-------------NKWLLLPPPTTELCDAGVVLE-KATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQV

Query:  GNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGD-FYYKDISMALHEHCKARWNRWMASLRRDYFNTPWAFIS
         +   +  T Y++F+G L++ E+D + L    ++     GS+ EVS  F  I KDVV + D  Y  ++   ++E+ K  +N   A  R  +F +PW F+S
Subjt:  GNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGD-FYYKDISMALHEHCKARWNRWMASLRRDYFNTPWAFIS

Query:  FIAATVIIVLTLLRT
          A   +I+LT+L++
Subjt:  FIAATVIIVLTLLRT

Arabidopsis top hitse value%identityAlignment
AT2G36430.1 Plant protein of unknown function (DUF247)7.3e-5129.28Show/hide
Query:  KPDDVANAEVALRCYNLVNVIGKELQQLPPI------TPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRT-NGGIKG
        +P   +  E  L+ +  +  + K+L++ P +       P CSI+RV + +++ + + YEPRV+SIGP+H G+  LK +E+ K ++L   L+RT N  ++ 
Subjt:  KPDDVANAEVALRCYNLVNVIGKELQQLPPI------TPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRT-NGGIKG

Query:  VVRAALTLETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRSLIFRAMNI-HLYHDLIMLENQLPFFVLQGLFNIISPIN---
         +++   +E  AR CY +  ++M+++ F +MM++DGCF++E    V    P    D   L+  A  +   Y D + LENQ+PFFVL+ LFN+    N   
Subjt:  VVRAALTLETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRSLIFRAMNI-HLYHDLIMLENQLPFFVLQGLFNIISPIN---

Query:  -GVTLVEVVEKFFANIFMKDHKIRRNISHVNIKHLIDFLGFYYAIPSTSL---ASNNIQPQNKWLLLPPPTTELCDAGVVLEKATSANNIMDISFEGGVL
           +L  +   FF N+  +  +       +  KHL+D L   + IP + L    + N   +     +    ++L  AG+ L +   A + + + F  G +
Subjt:  -GVTLVEVVEKFFANIFMKDHKIRRNISHVNIKHLIDFLGFYYAIPSTSL---ASNNIQPQNKWLLLPPPTTELCDAGVVLEKATSANNIMDISFEGGVL

Query:  KIPPFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQ-GDFYYKDISMALHE
        ++P   + D     + N +A+E   V      + T Y   L  L +T KD   L  + I+ N   G+D E++   N +G+DV       Y KD+   ++E
Subjt:  KIPPFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQ-GDFYYKDISMALHE

Query:  HCKARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFS
        + K+ W+   A+ +  YFN+PW+F+S +AA V++VL++++TI++
Subjt:  HCKARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFS

AT3G50160.1 Plant protein of unknown function (DUF247)1.1e-4933.08Show/hide
Query:  IYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNGGIKGVVRAALTLETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFL
        IYRV   L     K+Y P+++SIGP+HHG + L  ME+ K + +   ++R    I+  + A   LE KAR CY+   +NMN + F++M+++DG F++E  
Subjt:  IYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNGGIKGVVRAALTLETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFL

Query:  VAVYGEFPQTQIDLRSLIF--RAMNIHLYHDLIMLENQLPFFVLQGLFNIISPINGVTLVEVVEKFFANIFMKDHKIRRNISHVNIKHLIDFL--GFYYA
              F +        +F  R +   +  D++MLENQLP+ VL+GL  +  P     L +V  + F   F      R  ++     H +D L  G   +
Subjt:  VAVYGEFPQTQIDLRSLIF--RAMNIHLYHDLIMLENQLPFFVLQGLFNIISPINGVTLVEVVEKFFANIFMKDHKIRRNISHVNIKHLIDFL--GFYYA

Query:  IPST--SLASNNIQPQNKWLLLPPPTTELCDAGVVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALI
          ++   ++  N QPQ     L    TEL +AGV   +     +  DI F+ G LKIP   IHD  +    NL+AFE   + +      T Y++F+  LI
Subjt:  IPST--SLASNNIQPQNKWLLLPPPTTELCDAGVVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALI

Query:  STEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQ-GDFYYKDISMALHEHCKARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFS
        ++ +D S L   GI+ N + GSD EVS++FN +GK+V+    D Y   ++  ++ + + +WN   A+LR  YFN PWA+ SFIAA  +++ T  ++ F+
Subjt:  STEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQ-GDFYYKDISMALHEHCKARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFS

AT3G50180.1 Plant protein of unknown function (DUF247)6.2e-5033.82Show/hide
Query:  IYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNGGIKGVVRAALTLETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFL
        IY+V   L     K+Y P+ +S+GP+HHGRQ  ++ME  K + +   L RTN GI+  + A + LE KAR CYE  ++ ++++ F +M+L+DGCF++E L
Subjt:  IYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNGGIKGVVRAALTLETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFL

Query:  VAVYGEFPQTQIDLRSLIF--RAMNIHLYHDLIMLENQLPFFVLQGLFNIISPINGVT-LVEVVEKFFANIF-----MKDHKIRRNISHVNIKHLIDFLG
          V   F +   D    +F  R     +  D+IMLENQLP FVL  L  +       T LVE+V +FF  +      + ++   R +S+  + H +D   
Subjt:  VAVYGEFPQTQIDLRSLIF--RAMNIHLYHDLIMLENQLPFFVLQGLFNIISPINGVT-LVEVVEKFFANIF-----MKDHKIRRNISHVNIKHLIDFLG

Query:  FYYAIPSTSLASNNIQPQNKWLLLPPPT-TELCDAGVVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGNHIQSY--ATHYVLF
             P +S  +N  +  +K L    PT TEL DAG    K    +   DI F  G L+IP   IHD  +    NL+AFE      HI+S    T Y++F
Subjt:  FYYAIPSTSLASNNIQPQNKWLLLPPPT-TELCDAGVVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGNHIQSY--ATHYVLF

Query:  LGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQ-GDFYYKDISMALHEHCKARWNRWMASLRR----DYFNTPWAFISFIAATVIIVL
        +  LI + +D S L   GI+ + + GS+ EV+++FN + ++VV    D Y   + + +H   K  ++R + SL+      Y + PWA++SF AA ++++L
Subjt:  LGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQ-GDFYYKDISMALHEHCKARWNRWMASLRR----DYFNTPWAFISFIAATVIIVL

Query:  TLLRTIFSGVLHPN
        T  ++ F+   + N
Subjt:  TLLRTIFSGVLHPN

AT4G31980.1 unknown protein3.4e-6434.29Show/hide
Query:  LVNVIGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNGGIKGVVRAALTLETKARNCYEDLAMNM
        LV+ I  +L  L  ++ +C IY+V  +L  ++P AY PR++S GP H G+++L+AME  K ++L S++ RTN  ++ +VR A T E  AR+CY +  + +
Subjt:  LVNVIGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNGGIKGVVRAALTLETKARNCYEDLAMNM

Query:  NNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRSLIFRAMNI-HLYHDLIMLENQLPFFVLQGLFNIISPI---NGVTLVEVVEKFFANIFMK--DHK
        ++D FV+M++VDG F+VE L+     +P+ + +   +   +M I  +  D+I++ENQLPFFV++ +F ++         +++++ ++ F+    +  D K
Subjt:  NNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRSLIFRAMNI-HLYHDLIMLENQLPFFVLQGLFNIISPI---NGVTLVEVVEKFFANIFMK--DHK

Query:  IRRNISHVNIKHLIDFLGFYYAIPS--TSLASNNIQPQNKWLLLPPPTTELCDAGVVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFEN
                  +H +D L   Y +P     L    ++  N      P  TEL  AGV  + A +++ ++DISF  GVLKIP   + DL E   +N++ FE 
Subjt:  IRRNISHVNIKHLIDFLGFYYAIPS--TSLASNNIQPQNKWLLLPPPTTELCDAGVVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFEN

Query:  FQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGDFYYKDISMALHEHCKARWNRWMASLRRDYFNTPWAF
         +  N        Y++ LG  I +  D+ LL+  GI+ N +G S  +VSN+FN I K+V+    FY+  +S  L  +C   WNRW A LRRDYF+ PWA 
Subjt:  FQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGDFYYKDISMALHEHCKARWNRWMASLRRDYFNTPWAF

Query:  ISFIAATVIIVLTLLRTIFS
         S  AA ++++LT ++++ S
Subjt:  ISFIAATVIIVLTLLRTIFS

AT5G11290.1 Plant protein of unknown function (DUF247)6.0e-5334.44Show/hide
Query:  MEQFKLQFLRSYLSRTNGGIKGVVRAALTLETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLEN
        ME  KL++L+S++ RT   ++ +VR A T E +AR CY +  + +++D +VKM++VD  F+VE L+    +  +  +D R    + M + + HD+++LEN
Subjt:  MEQFKLQFLRSYLSRTNGGIKGVVRAALTLETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLEN

Query:  QLPFFVLQGLFNIIS---PINGVTLVEVVEKFFANIFMKDHKIRRNISHVNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGVVLEK
        QLP+FV++G+F ++          L  ++   F   +M      R+IS   I H +D L   +     S    +++  +  L       E+ +AGV L+ 
Subjt:  QLPFFVLQGLFNIIS---PINGVTLVEVVEKFFANIFMKDHKIRRNISHVNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGVVLEK

Query:  ATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVV
        A +    +DISF  GVL IP  +I+D+ E   RN++ FE     + + +Y  HY+ FL   I +  D+ L +  GI+ N  G + E+VS +FN I K+  
Subjt:  ATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVV

Query:  IQGDFYYKDISMALHEHCKARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFS
          G FYYK +   L  HC A WN+W A+LRRDYF+ PW+  S +AA V+++LT ++ I S
Subjt:  IQGDFYYKDISMALHEHCKARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTCGATCACATCGAAATGCATCAACATAACACTAAACCCGACGATGTCGCCAACGCCGAGGTAGCACTACGTTGTTATAACTTGGTGAATGTTATCGGAAAAGA
GCTTCAACAGCTACCTCCTATTACGCCAGAATGCAGCATCTATCGAGTTTCTAGACGACTCCTCAACATTCATCCTAAGGCTTATGAGCCTCGAGTCCTTTCCATCGGCC
CTTTTCACCACGGTCGACAGGATTTGAAGGCAATGGAACAATTTAAACTACAATTTCTCCGTAGCTATCTATCTCGCACAAATGGAGGAATAAAGGGCGTTGTTAGAGCC
GCTTTGACTTTAGAGACTAAAGCTCGCAATTGCTATGAAGATTTGGCCATGAACATGAACAACGACCGCTTTGTGAAAATGATGCTTGTAGATGGCTGTTTCGTGGTGGA
GTTTTTGGTAGCGGTTTATGGTGAATTCCCTCAAACTCAAATTGACTTGCGTTCTTTAATCTTCAGAGCTATGAACATCCATTTATATCATGACTTGATCATGCTTGAGA
ACCAACTTCCTTTCTTTGTTCTTCAAGGTCTTTTTAACATTATTTCACCGATCAATGGCGTCACCTTGGTTGAAGTTGTAGAAAAGTTCTTTGCAAATATATTTATGAAA
GATCATAAGATTCGTAGAAACATTTCCCATGTAAACATAAAGCACTTGATCGATTTCTTAGGTTTTTACTATGCGATCCCCTCAACTAGTTTAGCAAGTAACAACATTCA
GCCCCAAAATAAGTGGCTGTTGCTTCCCCCACCTACAACTGAGCTTTGTGACGCTGGAGTCGTTTTAGAGAAAGCAACATCAGCAAACAACATTATGGACATAAGCTTTG
AAGGTGGTGTTCTTAAAATCCCACCTTTTGAAATTCATGATCTCTTTGAAATCTCTATGCGAAACCTAATGGCTTTTGAGAATTTTCAAGTTGGAAATCACATTCAGAGC
TATGCAACCCATTATGTTTTGTTTCTGGGTGCCTTAATAAGCACCGAGAAAGACTCGAGTTTACTTGTGAAGAAGGGAATTGTGACCAACCTTATTGGTGGCAGTGATGA
AGAAGTTTCGAATATTTTTAACGATATTGGTAAAGATGTGGTCATTCAAGGGGATTTTTACTACAAAGATATAAGCATGGCTTTACATGAGCATTGCAAGGCACGATGGA
ATCGGTGGATGGCTTCACTGAGACGTGACTATTTCAATACGCCATGGGCTTTTATCTCCTTCATTGCTGCTACTGTCATCATTGTCCTCACTTTACTGCGAACCATATTT
TCTGGTGTATTGCATCCCAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATTTCGATCACATCGAAATGCATCAACATAACACTAAACCCGACGATGTCGCCAACGCCGAGGTAGCACTACGTTGTTATAACTTGGTGAATGTTATCGGAAAAGA
GCTTCAACAGCTACCTCCTATTACGCCAGAATGCAGCATCTATCGAGTTTCTAGACGACTCCTCAACATTCATCCTAAGGCTTATGAGCCTCGAGTCCTTTCCATCGGCC
CTTTTCACCACGGTCGACAGGATTTGAAGGCAATGGAACAATTTAAACTACAATTTCTCCGTAGCTATCTATCTCGCACAAATGGAGGAATAAAGGGCGTTGTTAGAGCC
GCTTTGACTTTAGAGACTAAAGCTCGCAATTGCTATGAAGATTTGGCCATGAACATGAACAACGACCGCTTTGTGAAAATGATGCTTGTAGATGGCTGTTTCGTGGTGGA
GTTTTTGGTAGCGGTTTATGGTGAATTCCCTCAAACTCAAATTGACTTGCGTTCTTTAATCTTCAGAGCTATGAACATCCATTTATATCATGACTTGATCATGCTTGAGA
ACCAACTTCCTTTCTTTGTTCTTCAAGGTCTTTTTAACATTATTTCACCGATCAATGGCGTCACCTTGGTTGAAGTTGTAGAAAAGTTCTTTGCAAATATATTTATGAAA
GATCATAAGATTCGTAGAAACATTTCCCATGTAAACATAAAGCACTTGATCGATTTCTTAGGTTTTTACTATGCGATCCCCTCAACTAGTTTAGCAAGTAACAACATTCA
GCCCCAAAATAAGTGGCTGTTGCTTCCCCCACCTACAACTGAGCTTTGTGACGCTGGAGTCGTTTTAGAGAAAGCAACATCAGCAAACAACATTATGGACATAAGCTTTG
AAGGTGGTGTTCTTAAAATCCCACCTTTTGAAATTCATGATCTCTTTGAAATCTCTATGCGAAACCTAATGGCTTTTGAGAATTTTCAAGTTGGAAATCACATTCAGAGC
TATGCAACCCATTATGTTTTGTTTCTGGGTGCCTTAATAAGCACCGAGAAAGACTCGAGTTTACTTGTGAAGAAGGGAATTGTGACCAACCTTATTGGTGGCAGTGATGA
AGAAGTTTCGAATATTTTTAACGATATTGGTAAAGATGTGGTCATTCAAGGGGATTTTTACTACAAAGATATAAGCATGGCTTTACATGAGCATTGCAAGGCACGATGGA
ATCGGTGGATGGCTTCACTGAGACGTGACTATTTCAATACGCCATGGGCTTTTATCTCCTTCATTGCTGCTACTGTCATCATTGTCCTCACTTTACTGCGAACCATATTT
TCTGGTGTATTGCATCCCAACTGA
Protein sequenceShow/hide protein sequence
MDFDHIEMHQHNTKPDDVANAEVALRCYNLVNVIGKELQQLPPITPECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMEQFKLQFLRSYLSRTNGGIKGVVRA
ALTLETKARNCYEDLAMNMNNDRFVKMMLVDGCFVVEFLVAVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPINGVTLVEVVEKFFANIFMK
DHKIRRNISHVNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGVVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGNHIQS
YATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNIFNDIGKDVVIQGDFYYKDISMALHEHCKARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIF
SGVLHPN