CuGenDBv2

Spg015552 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg015552
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Plant protein of unknown function (DUF247)
Genome location: scaffold10:14673708..14675030
RNA-Seq Expression: Spg015552
Synteny: Spg015552
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: IPR004158 - Protein of unknown function DUF247, plant
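
As a quick sanity check on the genome location above, the length of the interval can be computed from the coordinate string. The sketch below assumes the coordinates are 1-based and inclusive (an assumption; the page does not state its convention). `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tooling.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)

def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1

# Location string copied from the record above.
scaffold, start, end = parse_location("scaffold10:14673708..14675030")
length = span_length(start, end)
print(scaffold, start, end, length)
```

Under the inclusive-coordinate assumption the interval spans 1323 bp, which is a whole number of codons (441).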


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAE8651212.1 hypothetical protein Csa_001883 [Cucumis sativus] (e value: 1.3e-115; % identity: 53.76)
Query:  KPDDVANAEVALRCYNLVNVIGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANEGIQGVARAALT
        KP+D  + EV+     +  +I  +LQ LP +T EC IYRVS+RL+NI+P  YEP+++SIGPFHHGR+ LK ME+FKLQFL                    
Subjt:  KPDDVANAEVALRCYNLVNVIGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANEGIQGVARAALT

Query:  WETKARNCYEDLAMNMNNDHFVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPTNR----VTLVEV
                     + MN+  FV M+LVDGCFVVEFL+       QT   +  L+ +AMNI+LYHDLI+LENQLPFFVLQGL   I   N       LV +
Subjt:  WETKARNCYEDLAMNMNNDHFVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPTNR----VTLVEV

Query:  VEKFFANIFMKDH-KIPRNI---SHGNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGIVLEKATSAN---NIMDISFEGGVLKIPP
        V  FF   FMK + KIP+NI   +  NI+HL+DFLGFYY+  +T + +   Q  ++ L LPP TTEL +AG++LEKA + N   NIM ISFEGGVLKIPP
Subjt:  VEKFFANIFMKDH-KIPRNI---SHGNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGIVLEKATSAN---NIMDISFEGGVLKIPP

Query:  FEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGDFYYKDISMDLHEHCKAR
        FEIHDLFEI+MRNL+AFENFQ G+  +S A HY+LFLGALIS EKDSSLL+KKGI++NLIGGSDEEVSN+FN+IGK V  +G F Y   S +L +HC A+
Subjt:  FEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGDFYYKDISMDLHEHCKAR

Query:  WNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFS
         N+WMA L+RDYFNTPW   SFI A +  ++TLL+T F+
Subjt:  WNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFS

XP_008443397.1 PREDICTED: LOW QUALITY PROTEIN: UPF0481 protein At3g47200-like [Cucumis melo] (e value: 2.7e-132; % identity: 58.94)
Query:  NAEVALRCYNLVNVIGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANE---GIQGVARAALTWET
        N+ VA     + ++I  +LQ LP IT EC IYRVS+RL+NIHP  YEP+++SIGPFHHGR+DLK ME+FKL+FL  YLSR +      + V +AAL WET
Subjt:  NAEVALRCYNLVNVIGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANE---GIQGVARAALTWET

Query:  KARNCYEDLAMNMNNDHFVKMMLVDGCFVVEFLVTVYGEFPQTQIDLR--SLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPTNR-----VTLVEV
        KAR CYED  ++MN+  FV M+LVDGCF+VEFLV +YGE  QTQ   R   L+ +AMNI+LYHDLIMLENQLPFFV+QGLF  I   N      + LV +
Subjt:  KARNCYEDLAMNMNNDHFVKMMLVDGCFVVEFLVTVYGEFPQTQIDLR--SLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPTNR-----VTLVEV

Query:  VEKFFANIFMKDHK-IPRNI---SHGNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGIVLEKA--TSANNIMDISFEGGVLKIPPF
        V  FF   F+K H+ IP NI    + +I HL+DFLGFYY   +  +   N    N+ L LPP TTEL +AG++LEKA  TS  NIM  SFEGGVLKIPPF
Subjt:  VEKFFANIFMKDHK-IPRNI---SHGNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGIVLEKA--TSANNIMDISFEGGVLKIPPF

Query:  EIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGDFYYKDISMDLHEHCKARW
        EIHDLFEI+MRNL+AFENFQ G+  +S A HY+ FLGALIS EKDSSLL+KKGI++NLIGGSD EVSN+FN+IGK V  +G FYY   S +L +HC AR 
Subjt:  EIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGDFYYKDISMDLHEHCKARW

Query:  NRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTI
        NRWMA L+RDY NTPWA +S +  T++ ++TLL+TI
Subjt:  NRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTI

XP_022158990.1 UPF0481 protein At3g47200-like isoform X2 [Momordica charantia] (e value: 3.9e-107; % identity: 52.15)
Query:  IGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANEGIQGVARAALTWETKARNCYEDLAMNMNNDH
        I K LQ+LPP+  EC+I+RV RRLL  + +AY P+++SIGPFHHGRQDL  ME+ KL+FL  YL R N GI+       +WET ARNCY +  +NM++D 
Subjt:  IGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANEGIQGVARAALTWETKARNCYEDLAMNMNNDH

Query:  FVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPTNRVTLVEVVEKFFAN---IFMKDHKIPRN--I
        FVKMMLVDGCF+VE ++ V     +T+     L+F AM   LY DLIMLENQLPFFVLQGLF+  S    ++ +++   F+     I  +  ++P    I
Subjt:  FVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPTNRVTLVEVVEKFFAN---IFMKDHKIPRN--I

Query:  SHGNIKHLIDFLGFYY--AIPSTSLASNNIQPQNKWLLLPPPTTELCDAGIVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGN
        S   + HL+DFL FYY  A  S S  S+++    K    PP  TEL +AGIV +KA  A +IMDISF+  VL+IPP EI D+FE  +RNLMAFE +   +
Subjt:  SHGNIKHLIDFLGFYY--AIPSTSLASNNIQPQNKWLLLPPPTTELCDAGIVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGN

Query:  HIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGDF-YYKDISMDLHEHCKARWNRWMASLRRDYFNTPWAFISFI
        +   YA  Y LFL  LIS E+D SLLVK  I+TN IGG+++EVS LFND+ KDV+++GD   +  I+  LHEHC ARWN+ MASLRRDYFNTPWAFISF+
Subjt:  HIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGDF-YYKDISMDLHEHCKARWNRWMASLRRDYFNTPWAFISFI

Query:  AATVIIVLTLLRTIFSGV
        AA  +I+LT L+T+FS +
Subjt:  AATVIIVLTLLRTIFSGV

XP_022158992.1 UPF0481 protein At3g47200-like isoform X3 [Momordica charantia] (e value: 3.9e-107; % identity: 52.15)
Query:  IGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANEGIQGVARAALTWETKARNCYEDLAMNMNNDH
        I K LQ+LPP+  EC+I+RV RRLL  + +AY P+++SIGPFHHGRQDL  ME+ KL+FL  YL R N GI+       +WET ARNCY +  +NM++D 
Subjt:  IGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANEGIQGVARAALTWETKARNCYEDLAMNMNNDH

Query:  FVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPTNRVTLVEVVEKFFAN---IFMKDHKIPRN--I
        FVKMMLVDGCF+VE ++ V     +T+     L+F AM   LY DLIMLENQLPFFVLQGLF+  S    ++ +++   F+     I  +  ++P    I
Subjt:  FVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPTNRVTLVEVVEKFFAN---IFMKDHKIPRN--I

Query:  SHGNIKHLIDFLGFYY--AIPSTSLASNNIQPQNKWLLLPPPTTELCDAGIVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGN
        S   + HL+DFL FYY  A  S S  S+++    K    PP  TEL +AGIV +KA  A +IMDISF+  VL+IPP EI D+FE  +RNLMAFE +   +
Subjt:  SHGNIKHLIDFLGFYY--AIPSTSLASNNIQPQNKWLLLPPPTTELCDAGIVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGN

Query:  HIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGDF-YYKDISMDLHEHCKARWNRWMASLRRDYFNTPWAFISFI
        +   YA  Y LFL  LIS E+D SLLVK  I+TN IGG+++EVS LFND+ KDV+++GD   +  I+  LHEHC ARWN+ MASLRRDYFNTPWAFISF+
Subjt:  HIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGDF-YYKDISMDLHEHCKARWNRWMASLRRDYFNTPWAFISFI

Query:  AATVIIVLTLLRTIFSGV
        AA  +I+LT L+T+FS +
Subjt:  AATVIIVLTLLRTIFSGV

XP_038904513.1 UPF0481 protein At3g47200-like [Benincasa hispida] (e value: 1.1e-141; % identity: 61.93)
Query:  NAEVALRCYNLVNVIGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANEGIQG----VARAALTWE
        NAE+  R YN+  +I  EL++LP +T EC I+RVS+RLLNIH  AYEP+++SIGPFHHGR+DLK ME+FKLQFLR +++R N         V  A + WE
Subjt:  NAEVALRCYNLVNVIGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANEGIQG----VARAALTWE

Query:  TKARNCYEDLAMNMNNDHFVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRS----LIFRAMNIHLYHDLIMLENQLPFFVLQGLFNII--SPTNRVTLVEV
        T+ARNCYED A NMN+  FV+MMLVDGCF+VEFLV+VYG  PQTQ    S    L+F+AMNI+LYHDLIMLENQLPFFVLQ LF++I     N  TLV++
Subjt:  TKARNCYEDLAMNMNNDHFVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRS----LIFRAMNIHLYHDLIMLENQLPFFVLQGLFNII--SPTNRVTLVEV

Query:  VEKFFANIFMKDH-KIPRN-ISHGNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGIVLEKATSANNIMDISFEGGVLKIPPFEIHD
        + KFF + FMK + + P+N     NI+HL+ FL FYY+  +  +   N    NK LLLPP  TEL +AG++LEK  S +NI++++F+ GVLKIPPFEIH 
Subjt:  VEKFFANIFMKDH-KIPRN-ISHGNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGIVLEKATSANNIMDISFEGGVLKIPPFEIHD

Query:  LFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGDFYYKDISMDLHEHCKARWNRWM
        LFEI MRNLMAFENFQ  N  QSYA HYVLFLGALIS EKDSSLL+KKGI+TNLIGGSDEEVSN+FN+IGK V  QG FYY+D+S DLH+HCK R NRWM
Subjt:  LFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGDFYYKDISMDLHEHCKARWNRWM

Query:  ASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFSGV
        ASLRRDY NTPWA IS +AA  +     L+TIFSG+
Subjt:  ASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFSGV
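
The match line printed between each Query/Subjt pair can be used to recount identities by hand. The sketch below assumes the usual BLAST-style convention, in which a letter on the match line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. `midline_stats` is a hypothetical helper name, and the example covers only the first ten columns of one alignment, so its percentage is not comparable to the whole-alignment % identity values listed with each hit.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one match line.

    Letters count as identical residues, '+' as conservative
    substitutions, and spaces as mismatches or gaps.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)

# First ten alignment columns of the KAE8651212.1 hit, copied from the page.
midline = "KP+D  + EV"
identities, positives, columns = midline_stats(midline)
percent_identity = 100.0 * identities / columns
print(identities, positives, columns, percent_identity)
```

When applying this to full alignment rows, take the column count from the Query line rather than from the match line, since trailing spaces on a match line may have been lost when the page was saved as text.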

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0LC32 Uncharacterized protein (e value: 1.5e-128; % identity: 57.4)
Query:  KPDDVANAEVALRCYNLVNVIGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANEGIQGVARAALT
        KP+D  + EV+     +  +I  +LQ LP +T EC IYRVS+RL+NI+P  YEP+++SIGPFHHGR+ LK ME+FKLQFL  YLSR       ++R  L+
Subjt:  KPDDVANAEVALRCYNLVNVIGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANEGIQGVARAALT

Query:  WETKARNCYEDLAMNMNNDHFVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPTNR----VTLVEV
        +ETKAR CYED A++MN+  FV M+LVDGCFVVEFL+       QT   +  L+ +AMNI+LYHDLI+LENQLPFFVLQGL   I   N       LV +
Subjt:  WETKARNCYEDLAMNMNNDHFVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPTNR----VTLVEV

Query:  VEKFFANIFMKDH-KIPRNI---SHGNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGIVLEKATSAN---NIMDISFEGGVLKIPP
        V  FF   FMK + KIP+NI   +  NI+HL+DFLGFYY+  +T + +   Q  ++ L LPP TTEL +AG++LEKA + N   NIM ISFEGGVLKIPP
Subjt:  VEKFFANIFMKDH-KIPRNI---SHGNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGIVLEKATSAN---NIMDISFEGGVLKIPP

Query:  FEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGDFYYKDISMDLHEHCKAR
        FEIHDLFEI+MRNL+AFENFQ G+  +S A HY+LFLGALIS EKDSSLL+KKGI++NLIGGSDEEVSN+FN+IGK V  +G F Y   S +L +HC A+
Subjt:  FEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGDFYYKDISMDLHEHCKAR

Query:  WNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFS
         N+WMA L+RDYFNTPW   SFI A +  ++TLL+T F+
Subjt:  WNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFS

A0A1S3B8P8 LOW QUALITY PROTEIN: UPF0481 protein At3g47200-like (e value: 1.3e-132; % identity: 58.94)
Query:  NAEVALRCYNLVNVIGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANE---GIQGVARAALTWET
        N+ VA     + ++I  +LQ LP IT EC IYRVS+RL+NIHP  YEP+++SIGPFHHGR+DLK ME+FKL+FL  YLSR +      + V +AAL WET
Subjt:  NAEVALRCYNLVNVIGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANE---GIQGVARAALTWET

Query:  KARNCYEDLAMNMNNDHFVKMMLVDGCFVVEFLVTVYGEFPQTQIDLR--SLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPTNR-----VTLVEV
        KAR CYED  ++MN+  FV M+LVDGCF+VEFLV +YGE  QTQ   R   L+ +AMNI+LYHDLIMLENQLPFFV+QGLF  I   N      + LV +
Subjt:  KARNCYEDLAMNMNNDHFVKMMLVDGCFVVEFLVTVYGEFPQTQIDLR--SLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPTNR-----VTLVEV

Query:  VEKFFANIFMKDHK-IPRNI---SHGNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGIVLEKA--TSANNIMDISFEGGVLKIPPF
        V  FF   F+K H+ IP NI    + +I HL+DFLGFYY   +  +   N    N+ L LPP TTEL +AG++LEKA  TS  NIM  SFEGGVLKIPPF
Subjt:  VEKFFANIFMKDHK-IPRNI---SHGNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGIVLEKA--TSANNIMDISFEGGVLKIPPF

Query:  EIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGDFYYKDISMDLHEHCKARW
        EIHDLFEI+MRNL+AFENFQ G+  +S A HY+ FLGALIS EKDSSLL+KKGI++NLIGGSD EVSN+FN+IGK V  +G FYY   S +L +HC AR 
Subjt:  EIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGDFYYKDISMDLHEHCKARW

Query:  NRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTI
        NRWMA L+RDY NTPWA +S +  T++ ++TLL+TI
Subjt:  NRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTI

A0A6J1DXD6 UPF0481 protein At3g47200-like isoform X2 (e value: 1.9e-107; % identity: 52.15)
Query:  IGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANEGIQGVARAALTWETKARNCYEDLAMNMNNDH
        I K LQ+LPP+  EC+I+RV RRLL  + +AY P+++SIGPFHHGRQDL  ME+ KL+FL  YL R N GI+       +WET ARNCY +  +NM++D 
Subjt:  IGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANEGIQGVARAALTWETKARNCYEDLAMNMNNDH

Query:  FVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPTNRVTLVEVVEKFFAN---IFMKDHKIPRN--I
        FVKMMLVDGCF+VE ++ V     +T+     L+F AM   LY DLIMLENQLPFFVLQGLF+  S    ++ +++   F+     I  +  ++P    I
Subjt:  FVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPTNRVTLVEVVEKFFAN---IFMKDHKIPRN--I

Query:  SHGNIKHLIDFLGFYY--AIPSTSLASNNIQPQNKWLLLPPPTTELCDAGIVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGN
        S   + HL+DFL FYY  A  S S  S+++    K    PP  TEL +AGIV +KA  A +IMDISF+  VL+IPP EI D+FE  +RNLMAFE +   +
Subjt:  SHGNIKHLIDFLGFYY--AIPSTSLASNNIQPQNKWLLLPPPTTELCDAGIVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGN

Query:  HIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGDF-YYKDISMDLHEHCKARWNRWMASLRRDYFNTPWAFISFI
        +   YA  Y LFL  LIS E+D SLLVK  I+TN IGG+++EVS LFND+ KDV+++GD   +  I+  LHEHC ARWN+ MASLRRDYFNTPWAFISF+
Subjt:  HIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGDF-YYKDISMDLHEHCKARWNRWMASLRRDYFNTPWAFISFI

Query:  AATVIIVLTLLRTIFSGV
        AA  +I+LT L+T+FS +
Subjt:  AATVIIVLTLLRTIFSGV

A0A6J1DYL4 UPF0481 protein At3g47200-like isoform X3 (e value: 1.9e-107; % identity: 52.15)
Query:  IGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANEGIQGVARAALTWETKARNCYEDLAMNMNNDH
        I K LQ+LPP+  EC+I+RV RRLL  + +AY P+++SIGPFHHGRQDL  ME+ KL+FL  YL R N GI+       +WET ARNCY +  +NM++D 
Subjt:  IGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANEGIQGVARAALTWETKARNCYEDLAMNMNNDH

Query:  FVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPTNRVTLVEVVEKFFAN---IFMKDHKIPRN--I
        FVKMMLVDGCF+VE ++ V     +T+     L+F AM   LY DLIMLENQLPFFVLQGLF+  S    ++ +++   F+     I  +  ++P    I
Subjt:  FVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPTNRVTLVEVVEKFFAN---IFMKDHKIPRN--I

Query:  SHGNIKHLIDFLGFYY--AIPSTSLASNNIQPQNKWLLLPPPTTELCDAGIVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGN
        S   + HL+DFL FYY  A  S S  S+++    K    PP  TEL +AGIV +KA  A +IMDISF+  VL+IPP EI D+FE  +RNLMAFE +   +
Subjt:  SHGNIKHLIDFLGFYY--AIPSTSLASNNIQPQNKWLLLPPPTTELCDAGIVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGN

Query:  HIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGDF-YYKDISMDLHEHCKARWNRWMASLRRDYFNTPWAFISFI
        +   YA  Y LFL  LIS E+D SLLVK  I+TN IGG+++EVS LFND+ KDV+++GD   +  I+  LHEHC ARWN+ MASLRRDYFNTPWAFISF+
Subjt:  HIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGDF-YYKDISMDLHEHCKARWNRWMASLRRDYFNTPWAFISFI

Query:  AATVIIVLTLLRTIFSGV
        AA  +I+LT L+T+FS +
Subjt:  AATVIIVLTLLRTIFSGV

A0A6J1E120 UPF0481 protein At3g47200-like isoform X1 (e value: 1.9e-107; % identity: 52.15)
Query:  IGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANEGIQGVARAALTWETKARNCYEDLAMNMNNDH
        I K LQ+LPP+  EC+I+RV RRLL  + +AY P+++SIGPFHHGRQDL  ME+ KL+FL  YL R N GI+       +WET ARNCY +  +NM++D 
Subjt:  IGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANEGIQGVARAALTWETKARNCYEDLAMNMNNDH

Query:  FVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPTNRVTLVEVVEKFFAN---IFMKDHKIPRN--I
        FVKMMLVDGCF+VE ++ V     +T+     L+F AM   LY DLIMLENQLPFFVLQGLF+  S    ++ +++   F+     I  +  ++P    I
Subjt:  FVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPTNRVTLVEVVEKFFAN---IFMKDHKIPRN--I

Query:  SHGNIKHLIDFLGFYY--AIPSTSLASNNIQPQNKWLLLPPPTTELCDAGIVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGN
        S   + HL+DFL FYY  A  S S  S+++    K    PP  TEL +AGIV +KA  A +IMDISF+  VL+IPP EI D+FE  +RNLMAFE +   +
Subjt:  SHGNIKHLIDFLGFYY--AIPSTSLASNNIQPQNKWLLLPPPTTELCDAGIVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGN

Query:  HIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGDF-YYKDISMDLHEHCKARWNRWMASLRRDYFNTPWAFISFI
        +   YA  Y LFL  LIS E+D SLLVK  I+TN IGG+++EVS LFND+ KDV+++GD   +  I+  LHEHC ARWN+ MASLRRDYFNTPWAFISF+
Subjt:  HIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGDF-YYKDISMDLHEHCKARWNRWMASLRRDYFNTPWAFISFI

Query:  AATVIIVLTLLRTIFSGV
        AA  +I+LT L+T+FS +
Subjt:  AATVIIVLTLLRTIFSGV

SwissProt top hits (columns: e value, % identity, alignment)
P0C897 Putative UPF0481 protein At3g02645 (e value: 5.8e-13; % identity: 30.91)
Query:  NLVNVIGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANE-GIQGVARAALTWETKARNCYEDLAM
        N+   +  EL++        SI+ V + L+  HP +Y P  +SIGP+H  + +L  MER+KL   R   ++ N      +     + E K R CY    +
Subjt:  NLVNVIGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANE-GIQGVARAALTWETKARNCYEDLAM

Query:  NMNNDHFVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQ
          N +  + +M VD  F++EFL  +Y  F + +    +LI R  +  +  D++M+ENQ+P FVL+
Subjt:  NMNNDHFVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQ

Q9SD53 UPF0481 protein At3g47200 (e value: 7.6e-37; % identity: 27.75)
Query:  CSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANE---GIQGVARAALTWETKARNCYEDLAMNMNNDHFVKMMLVDGCF
        C I+RV    + ++PKAY+P+V+SIGP+H+G + L+ +++ K + L+ +L  A +       + +A +  E K R  Y +  +   +D  + MM++DGCF
Subjt:  CSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANE---GIQGVARAALTWETKARNCYEDLAMNMNNDHFVKMMLVDGCF

Query:  VVEFLVTVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLF-----NIISPTNRVTLVEVVEKFFANIFMKDHKIPRNISHGNIKHLIDF
        ++   + + G    ++  + S+ +   +I    DL++LENQ+PFFVLQ L+      + S  NR+        FF N   K+        +   KHL+D 
Subjt:  VVEFLVTVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLF-----NIISPTNRVTLVEVVEKFFANIFMKDHKIPRNISHGNIKHLIDF

Query:  LGFYYAIPSTS----LASNNIQPQ-------------NKWLLLPPPTTELCDAGIVLE-KATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFEN
        +   + +P+TS     +S ++Q Q             +K + L      L   GI    + +  ++I+++  +   L+IP             N +AFE 
Subjt:  LGFYYAIPSTS----LASNNIQPQ-------------NKWLLLPPPTTELCDAGIVLE-KATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFEN

Query:  FQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGD-FYYKDISMDLHEHCKARWNRWMASLRRDYFNTPWA
        F   +   +  T Y++F+G L++ E+D + L    ++     GS+ EVS  F  I KDVV + D  Y  ++   ++E+ K  +N   A  R  +F +PW 
Subjt:  FQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGD-FYYKDISMDLHEHCKARWNRWMASLRRDYFNTPWA

Query:  FISFIAATVIIVLTLLRT
        F+S  A   +I+LT+L++
Subjt:  FISFIAATVIIVLTLLRT

Arabidopsis top hits (columns: e value, % identity, alignment)
AT2G36430.1 Plant protein of unknown function (DUF247) (e value: 3.0e-49; % identity: 29.05)
Query:  KPDDVANAEVALRCYNLVNVIGKELQQLPPITAE------CSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRA-NEGIQG
        +P   +  E  L+ +  +  + K+L++ P + +       CSI+RV + +++ + + YEPRV+SIGP+H G+  LK +E  K ++L   L+R  N  ++ 
Subjt:  KPDDVANAEVALRCYNLVNVIGKELQQLPPITAE------CSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRA-NEGIQG

Query:  VARAALTWETKARNCYEDLAMNMNNDHFVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRSLIFRAMNI-HLYHDLIMLENQLPFFVLQGLFNIISPTN---
          ++    E  AR CY +  ++M+++ F +MM++DGCF++E    V    P    D   L+  A  +   Y D + LENQ+PFFVL+ LFN+    N   
Subjt:  VARAALTWETKARNCYEDLAMNMNNDHFVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRSLIFRAMNI-HLYHDLIMLENQLPFFVLQGLFNIISPTN---

Query:  -RVTLVEVVEKFFANIFMKDHKIPRNISHGNIKHLIDFLGFYYAIPSTSL---ASNNIQPQNKWLLLPPPTTELCDAGIVLEKATSANNIMDISFEGGVL
           +L  +   FF N+  +  +          KHL+D L   + IP + L    + N   +     +    ++L  AGI L +   A + + + F  G +
Subjt:  -RVTLVEVVEKFFANIFMKDHKIPRNISHGNIKHLIDFLGFYYAIPSTSL---ASNNIQPQNKWLLLPPPTTELCDAGIVLEKATSANNIMDISFEGGVL

Query:  KIPPFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQ-GDFYYKDISMDLHE
        ++P   + D     + N +A+E   V      + T Y   L  L +T KD   L  + I+ N   G+D E++   N +G+DV       Y KD+  +++E
Subjt:  KIPPFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQ-GDFYYKDISMDLHE

Query:  HCKARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFS
        + K+ W+   A+ +  YFN+PW+F+S +AA V++VL++++TI++
Subjt:  HCKARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFS

AT3G50160.1 Plant protein of unknown function (DUF247) (e value: 8.0e-50; % identity: 33.08)
Query:  IYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANEGIQGVARAALTWETKARNCYEDLAMNMNNDHFVKMMLVDGCFVVEFL
        IYRV   L     K+Y P+++SIGP+HHG + L  MER K + +   ++RA   I+    A    E KAR CY+   +NMN + F++M+++DG F++E  
Subjt:  IYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANEGIQGVARAALTWETKARNCYEDLAMNMNNDHFVKMMLVDGCFVVEFL

Query:  VTVYGEFPQTQIDLRSLIF--RAMNIHLYHDLIMLENQLPFFVLQGLFNIISPTNRVTLVEVVEKFFANIFMKDHKIPRNISHGNIKHLIDFL--GFYYA
              F +        +F  R +   +  D++MLENQLP+ VL+GL  +  P     L +V  + F   F         ++     H +D L  G   +
Subjt:  VTVYGEFPQTQIDLRSLIF--RAMNIHLYHDLIMLENQLPFFVLQGLFNIISPTNRVTLVEVVEKFFANIFMKDHKIPRNISHGNIKHLIDFL--GFYYA

Query:  IPST--SLASNNIQPQNKWLLLPPPTTELCDAGIVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALI
          ++   ++  N QPQ     L    TEL +AG+   +     +  DI F+ G LKIP   IHD  +    NL+AFE   + +      T Y++F+  LI
Subjt:  IPST--SLASNNIQPQNKWLLLPPPTTELCDAGIVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALI

Query:  STEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQ-GDFYYKDISMDLHEHCKARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFS
        ++ +D S L   GI+ N + GSD EVS+LFN +GK+V+    D Y   ++ +++ + + +WN   A+LR  YFN PWA+ SFIAA  +++ T  ++ F+
Subjt:  STEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQ-GDFYYKDISMDLHEHCKARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFS

AT3G50180.1 Plant protein of unknown function (DUF247) (e value: 5.9e-53; % identity: 34.06)
Query:  IYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANEGIQGVARAALTWETKARNCYEDLAMNMNNDHFVKMMLVDGCFVVEFL
        IY+V   L     K+Y P+ +S+GP+HHGRQ  ++ME  K + +   L R N+GI+    A +  E KAR CYE  ++ ++++ F +M+L+DGCF++E L
Subjt:  IYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANEGIQGVARAALTWETKARNCYEDLAMNMNNDHFVKMMLVDGCFVVEFL

Query:  VTVYGEFPQTQIDLRSLIF--RAMNIHLYHDLIMLENQLPFFVLQGLFNIISPT-NRVTLVEVVEKFFANIF-----MKDHKIPRNISHGNIKHLIDFLG
          V   F +   D    +F  R     +  D+IMLENQLP FVL  L  +   T N+  LVE+V +FF  +      + ++  PR +S+G + H +D   
Subjt:  VTVYGEFPQTQIDLRSLIF--RAMNIHLYHDLIMLENQLPFFVLQGLFNIISPT-NRVTLVEVVEKFFANIF-----MKDHKIPRNISHGNIKHLIDFLG

Query:  FYYAIPSTSLASNNIQPQNKWLLLPPPT-TELCDAGIVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGNHIQSY--ATHYVLF
             P +S  +N  +  +K L    PT TEL DAG    K    +   DI F  G L+IP   IHD  +    NL+AFE      HI+S    T Y++F
Subjt:  FYYAIPSTSLASNNIQPQNKWLLLPPPT-TELCDAGIVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGNHIQSY--ATHYVLF

Query:  LGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQ-GDFYYKDISMDLHEHCKARWNRWMASLRR----DYFNTPWAFISFIAATVIIVL
        +  LI + +D S L   GI+ + + GS+ EV+++FN + ++VV    D Y   + +++H   K  ++R + SL+      Y + PWA++SF AA ++++L
Subjt:  LGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQ-GDFYYKDISMDLHEHCKARWNRWMASLRR----DYFNTPWAFISFIAATVIIVL

Query:  TLLRTIFSGVLHHN
        T  ++ F+   + N
Subjt:  TLLRTIFSGVLHHN

AT4G31980.1 unknown protein (e value: 1.4e-65; % identity: 34.05)
Query:  LVNVIGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANEGIQGVARAALTWETKARNCYEDLAMNM
        LV+ I  +L  L  ++ +C IY+V  +L  ++P AY PR++S GP H G+++L+AME  K ++L S++ R N  ++ + R A TWE  AR+CY +  + +
Subjt:  LVNVIGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANEGIQGVARAALTWETKARNCYEDLAMNM

Query:  NNDHFVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRSLIFRAMNI-HLYHDLIMLENQLPFFVLQGLFNII---SPTNRVTLVEVVEKFFANIFMK--DHK
        ++D FV+M++VDG F+VE L+     +P+ + +   +   +M I  +  D+I++ENQLPFFV++ +F ++         +++++ ++ F+    +  D K
Subjt:  NNDHFVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRSLIFRAMNI-HLYHDLIMLENQLPFFVLQGLFNII---SPTNRVTLVEVVEKFFANIFMK--DHK

Query:  IPRNISHGNIKHLIDFLGFYYAIPS--TSLASNNIQPQNKWLLLPPPTTELCDAGIVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFEN
                  +H +D L   Y +P     L    ++  N      P  TEL  AG+  + A +++ ++DISF  GVLKIP   + DL E   +N++ FE 
Subjt:  IPRNISHGNIKHLIDFLGFYYAIPS--TSLASNNIQPQNKWLLLPPPTTELCDAGIVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFEN

Query:  FQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGDFYYKDISMDLHEHCKARWNRWMASLRRDYFNTPWAF
         +  N        Y++ LG  I +  D+ LL+  GI+ N +G S  +VSNLFN I K+V+    FY+  +S +L  +C   WNRW A LRRDYF+ PWA 
Subjt:  FQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGDFYYKDISMDLHEHCKARWNRWMASLRRDYFNTPWAF

Query:  ISFIAATVIIVLTLLRTIFS
         S  AA ++++LT ++++ S
Subjt:  ISFIAATVIIVLTLLRTIFS

AT5G11290.1 Plant protein of unknown function (DUF247) (e value: 4.8e-55; % identity: 34.17)
Query:  MERFKLQFLRSYLSRANEGIQGVARAALTWETKARNCYEDLAMNMNNDHFVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLEN
        ME  KL++L+S++ R    ++ + R A TWE +AR CY +  + +++D +VKM++VD  F+VE L+    +  +  +D R    + M + + HD+++LEN
Subjt:  MERFKLQFLRSYLSRANEGIQGVARAALTWETKARNCYEDLAMNMNNDHFVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLEN

Query:  QLPFFVLQGLFNIIS---PTNRVTLVEVVEKFFANIFMKDHKIPRNISHGNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGIVLEK
        QLP+FV++G+F ++          L  ++   F   +M      R+IS   I H +D L   +     S    +++  +  L       E+ +AG+ L+ 
Subjt:  QLPFFVLQGLFNIIS---PTNRVTLVEVVEKFFANIFMKDHKIPRNISHGNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGIVLEK

Query:  ATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVV
        A +    +DISF  GVL IP  +I+D+ E   RN++ FE     + + +Y  HY+ FL   I +  D+ L +  GI+ N  G + E+VS LFN I K+  
Subjt:  ATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVLFLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVV

Query:  IQGDFYYKDISMDLHEHCKARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFS
          G FYYK +  +L  HC A WN+W A+LRRDYF+ PW+  S +AA V+++LT ++ I S
Subjt:  IQGDFYYKDISMDLHEHCKARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFS


Sequences
CDS sequence
ATGCAGCAACATAACACTAAACCTGACGATGTCGCCAACGCCGAGGTAGCACTACGTTGTTATAACTTGGTGAATGTTATCGGAAAAGAGCTTCAACAACTACCTCCTAT
TACGGCAGAATGCAGCATCTATCGAGTTTCTAGACGACTCCTCAACATTCATCCTAAGGCTTATGAGCCTCGAGTCCTTTCCATCGGCCCTTTTCACCACGGTCGACAGG
ATTTGAAGGCAATGGAACGATTTAAACTACAATTTCTCCGTAGCTATCTATCTCGTGCAAATGAAGGAATCCAGGGCGTTGCTAGAGCCGCTTTGACTTGGGAGACTAAA
GCTCGCAATTGCTATGAAGATTTGGCCATGAACATGAACAACGACCACTTTGTGAAAATGATGCTTGTAGATGGCTGTTTCGTGGTGGAGTTTTTGGTAACGGTTTATGG
TGAATTCCCTCAAACTCAAATTGACTTACGTTCTTTAATCTTCAGAGCTATGAACATCCATTTATATCATGACTTGATCATGCTTGAGAACCAACTTCCTTTCTTTGTTC
TTCAAGGTCTTTTTAACATTATTTCACCGACCAATCGCGTCACCTTGGTTGAAGTTGTAGAAAAGTTCTTTGCGAATATATTTATGAAAGATCATAAGATTCCTAGAAAC
ATCTCCCATGGAAACATAAAGCACTTGATCGATTTCTTAGGTTTTTACTATGCGATCCCCTCAACTAGTTTAGCAAGTAACAACATTCAGCCCCAAAATAAATGGCTGTT
GCTTCCCCCACCTACAACTGAGCTTTGTGACGCTGGAATCGTTTTAGAGAAAGCAACATCAGCAAACAACATTATGGACATAAGCTTTGAAGGTGGAGTTCTTAAAATCC
CACCTTTTGAAATTCATGATCTCTTTGAAATCTCTATGCGAAACCTAATGGCATTTGAGAATTTTCAAGTTGGAAATCACATTCAGAGCTATGCAACCCATTATGTTTTG
TTTCTAGGTGCCTTAATAAGTACCGAGAAAGACTCGAGTTTACTTGTGAAGAAGGGAATTGTGACCAACCTTATTGGTGGCAGTGATGAGGAAGTTTCGAATCTTTTTAA
CGATATTGGTAAAGATGTGGTGATTCAAGGGGATTTTTACTACAAAGATATAAGCATGGATTTACATGAGCATTGCAAGGCACGATGGAATCGGTGGATGGCTTCACTGA
GACGTGACTATTTCAATACGCCATGGGCTTTTATCTCGTTCATTGCTGCTACTGTGATCATTGTCCTCACTTTACTGCGAACCATATTTTCTGGTGTATTGCATCACAAC
TGA
mRNA sequence
ATGCAGCAACATAACACTAAACCTGACGATGTCGCCAACGCCGAGGTAGCACTACGTTGTTATAACTTGGTGAATGTTATCGGAAAAGAGCTTCAACAACTACCTCCTAT
TACGGCAGAATGCAGCATCTATCGAGTTTCTAGACGACTCCTCAACATTCATCCTAAGGCTTATGAGCCTCGAGTCCTTTCCATCGGCCCTTTTCACCACGGTCGACAGG
ATTTGAAGGCAATGGAACGATTTAAACTACAATTTCTCCGTAGCTATCTATCTCGTGCAAATGAAGGAATCCAGGGCGTTGCTAGAGCCGCTTTGACTTGGGAGACTAAA
GCTCGCAATTGCTATGAAGATTTGGCCATGAACATGAACAACGACCACTTTGTGAAAATGATGCTTGTAGATGGCTGTTTCGTGGTGGAGTTTTTGGTAACGGTTTATGG
TGAATTCCCTCAAACTCAAATTGACTTACGTTCTTTAATCTTCAGAGCTATGAACATCCATTTATATCATGACTTGATCATGCTTGAGAACCAACTTCCTTTCTTTGTTC
TTCAAGGTCTTTTTAACATTATTTCACCGACCAATCGCGTCACCTTGGTTGAAGTTGTAGAAAAGTTCTTTGCGAATATATTTATGAAAGATCATAAGATTCCTAGAAAC
ATCTCCCATGGAAACATAAAGCACTTGATCGATTTCTTAGGTTTTTACTATGCGATCCCCTCAACTAGTTTAGCAAGTAACAACATTCAGCCCCAAAATAAATGGCTGTT
GCTTCCCCCACCTACAACTGAGCTTTGTGACGCTGGAATCGTTTTAGAGAAAGCAACATCAGCAAACAACATTATGGACATAAGCTTTGAAGGTGGAGTTCTTAAAATCC
CACCTTTTGAAATTCATGATCTCTTTGAAATCTCTATGCGAAACCTAATGGCATTTGAGAATTTTCAAGTTGGAAATCACATTCAGAGCTATGCAACCCATTATGTTTTG
TTTCTAGGTGCCTTAATAAGTACCGAGAAAGACTCGAGTTTACTTGTGAAGAAGGGAATTGTGACCAACCTTATTGGTGGCAGTGATGAGGAAGTTTCGAATCTTTTTAA
CGATATTGGTAAAGATGTGGTGATTCAAGGGGATTTTTACTACAAAGATATAAGCATGGATTTACATGAGCATTGCAAGGCACGATGGAATCGGTGGATGGCTTCACTGA
GACGTGACTATTTCAATACGCCATGGGCTTTTATCTCGTTCATTGCTGCTACTGTGATCATTGTCCTCACTTTACTGCGAACCATATTTTCTGGTGTATTGCATCACAAC
TGA
Protein sequence
MQQHNTKPDDVANAEVALRCYNLVNVIGKELQQLPPITAECSIYRVSRRLLNIHPKAYEPRVLSIGPFHHGRQDLKAMERFKLQFLRSYLSRANEGIQGVARAALTWETK
ARNCYEDLAMNMNNDHFVKMMLVDGCFVVEFLVTVYGEFPQTQIDLRSLIFRAMNIHLYHDLIMLENQLPFFVLQGLFNIISPTNRVTLVEVVEKFFANIFMKDHKIPRN
ISHGNIKHLIDFLGFYYAIPSTSLASNNIQPQNKWLLLPPPTTELCDAGIVLEKATSANNIMDISFEGGVLKIPPFEIHDLFEISMRNLMAFENFQVGNHIQSYATHYVL
FLGALISTEKDSSLLVKKGIVTNLIGGSDEEVSNLFNDIGKDVVIQGDFYYKDISMDLHEHCKARWNRWMASLRRDYFNTPWAFISFIAATVIIVLTLLRTIFSGVLHHN
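
The relationship between the CDS and protein sequences listed above can be spot-checked by translating the CDS in frame. The sketch below assumes the standard nuclear genetic code; `translate` is a hypothetical helper name, and only the first 30 and last 12 nucleotides of the CDS are used so that the example stays short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 12 nt of the CDS sequence shown on this page.
cds_start = "ATGCAGCAACATAACACTAAACCTGACGAT"
cds_end = "CATCACAACTGA"
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MQQHNTKPDD` and `HHN`, which agree with the first ten and last three residues of the protein sequence on this page.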