; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10010725 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10010725
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationChr06:25235238..25238079
RNA-Seq ExpressionHG10010725
SyntenyHG10010725
Gene Ontology termsNA
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7025238.1 UPF0481 protein, partial [Cucurbita argyrosperma subsp. argyrosperma]1.2e-9749.87Show/hide
Query:  IGMEEISSQKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELLLGSYDKLEDEWKQNPAM
        + ME   S ++  SIYK+P FM    PKA+EP +VS GPY++GK HL  ME  K K F  F  +  L  E+I   V  IL+ L+ SYDKLE++WK++PA 
Subjt:  IGMEEISSQKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELLLGSYDKLEDEWKQNPAM

Query:  FLKLMIVDGCFLLDFFEECPHSLSSMSWDIRRDMLLLENQLPNILLLHIYNLAHKKNDIEGLRSLICKSMFIEPVDAVIWNEVHILAMYRASLLYPPVNT
        FL+LMIVDGCF+L F   CP SL ++S DI++DMLLLENQLP +LL  +Y++A    D   L+ L+   + I P + V+ + +HIL MY+ SLL+PP++ 
Subjt:  FLKLMIVDGCFLLDFFEECPHSLSSMSWDIRRDMLLLENQLPNILLLHIYNLAHKKNDIEGLRSLICKSMFIEPVDAVIWNEVHILAMYRASLLYPPVNT

Query:  IYCGGLPKNYNLSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVPGSQVTSYVILMDNLINVDK
                  + S+P C++IP AT+LR+A IKFK S T  L DV FD   GVL LP L+VDD +E++LLNVMAFEK H   G +VTS+VILM NLI+ ++
Subjt:  IYCGGLPKNYNLSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVPGSQVTSYVILMDNLINVDK

Query:  DVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMDLDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAATFGFALIILQVAYQIADY
        DVA+L  +++LAN +G+D+ AA LFN LG GAAM LD H+  VH  VN+H ++ WN+  A+LK +Y Q  W +IS+ AA FGF ++ILQ  YQ  DY
Subjt:  DVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMDLDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAATFGFALIILQVAYQIADY

XP_022960454.1 UPF0481 protein At3g47200-like [Cucurbita moschata]4.5e-9748.32Show/hide
Query:  MMRDKRLLSEVEESS--IGMEEISSQKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELL
        ++R K  L ++ +S   + ME   S ++  SIYK+P FM    PKA+EP +VS GPY++GK HL  ME  K K F  F A+  L  E+I   V  IL+ L
Subjt:  MMRDKRLLSEVEESS--IGMEEISSQKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELL

Query:  LGSYDKLEDEWKQNPAMFLKLMIVDGCFLLDFFEECPHSLSSMSWDIRRDMLLLENQLPNILLLHIYNLAHKKNDI--EGLRSLICKSMFIEPVDAVIWN
        + SYDKLE++WK++P  FL+LMIVDGCF+L F   CP SL ++S DI++DMLLLENQLP +LL  +Y++A +   +  + L+ L+   + I P + V+ +
Subjt:  LGSYDKLEDEWKQNPAMFLKLMIVDGCFLLDFFEECPHSLSSMSWDIRRDMLLLENQLPNILLLHIYNLAHKKNDI--EGLRSLICKSMFIEPVDAVIWN

Query:  EVHILAMYRASLLYPPVNTIYCGGLPKNYNLSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVP
         +HIL MY+ SLL+PP++             S+P C++IP AT+LR+A IKFK S T  L DV FD   GVL LP L+VDD +E++LLNVMAFEK H   
Subjt:  EVHILAMYRASLLYPPVNTIYCGGLPKNYNLSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVP

Query:  GSQVTSYVILMDNLINVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMDLDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAATF
        G +VTS+VILM NLI+ ++DVA+L  +++LAN +G+D+ AA LFN LG GAAM LD H+  VH  VN+H ++ WN+  A+LK +Y Q  W +IS+ AA F
Subjt:  GSQVTSYVILMDNLINVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMDLDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAATF

Query:  GFALIILQVAYQIADY
        GF ++ILQ  YQ  DY
Subjt:  GFALIILQVAYQIADY

XP_023004238.1 UPF0481 protein At3g47200-like isoform X1 [Cucurbita maxima]8.3e-9948.56Show/hide
Query:  MMRDKRLLSEVEESS--IGMEEISSQKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELL
        ++R K  L ++ +S   + ME   S ++  SIYK+P FM    PKA+EP +VS GPY++GK HL  ME  K K F  F  +  L  E+I   V  IL+ L
Subjt:  MMRDKRLLSEVEESS--IGMEEISSQKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELL

Query:  LGSYDKLEDEWKQNPAMFLKLMIVDGCFLLDFFEECPHSLSSMSWDIRRDMLLLENQLPNILLLHIYNLAHKKNDI--EGLRSLICKSMFIEPVDAVIWN
        + SYD+LE+EW Q+P  FL+LMIVDGCF+L F   CP+SL ++S DI++DMLLLENQLP +LL  +Y++A +   +  +  + L+CK + I P + V+ +
Subjt:  LGSYDKLEDEWKQNPAMFLKLMIVDGCFLLDFFEECPHSLSSMSWDIRRDMLLLENQLPNILLLHIYNLAHKKNDI--EGLRSLICKSMFIEPVDAVIWN

Query:  EVHILAMYRASLLYPPVNTIYCGGLPKNYNLSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVP
         +HIL MY+ SLLYPP++           + S+P C++IP AT+L +A IKFK S T+ L DV FD  +GVL LP L+VDD +E+++LNVMAFEK H   
Subjt:  EVHILAMYRASLLYPPVNTIYCGGLPKNYNLSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVP

Query:  GSQVTSYVILMDNLINVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMDLDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAATF
        G +VTS+VILM NLI+ ++DVA+L  +KILAN +G+D+ AA LF+ LG GAAM LD+H+  VH  VN H ++ WN+  A+LK DY Q  W +IS+ AA F
Subjt:  GSQVTSYVILMDNLINVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMDLDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAATF

Query:  GFALIILQVAYQIADY
        GF ++ILQ  YQ  DY
Subjt:  GFALIILQVAYQIADY

XP_023004239.1 UPF0481 protein At3g47200-like isoform X2 [Cucurbita maxima]6.3e-9948.67Show/hide
Query:  MMRDKRLLSEVEESS--IGMEEISSQKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELL
        ++R K  L ++ +S   + ME   S ++  SIYK+P FM    PKA+EP +VS GPY++GK HL  ME  K K F  F  +  L  E+I   V  IL+ L
Subjt:  MMRDKRLLSEVEESS--IGMEEISSQKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELL

Query:  LGSYDKLEDEWKQNPAMFLKLMIVDGCFLLDFFEECPHSLSSMSWDIRRDMLLLENQLPNILLLHIYNLAHKKNDI-EGLRSLICKSMFIEPVDAVIWNE
        + SYD+LE+EW Q+P  FL+LMIVDGCF+L F   CP+SL ++S DI++DMLLLENQLP +LL  +Y++A +   + +  + L+CK + I P + V+ + 
Subjt:  LGSYDKLEDEWKQNPAMFLKLMIVDGCFLLDFFEECPHSLSSMSWDIRRDMLLLENQLPNILLLHIYNLAHKKNDI-EGLRSLICKSMFIEPVDAVIWNE

Query:  VHILAMYRASLLYPPVNTIYCGGLPKNYNLSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVPG
        +HIL MY+ SLLYPP++           + S+P C++IP AT+L +A IKFK S T+ L DV FD  +GVL LP L+VDD +E+++LNVMAFEK H   G
Subjt:  VHILAMYRASLLYPPVNTIYCGGLPKNYNLSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVPG

Query:  SQVTSYVILMDNLINVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMDLDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAATFG
         +VTS+VILM NLI+ ++DVA+L  +KILAN +G+D+ AA LF+ LG GAAM LD+H+  VH  VN H ++ WN+  A+LK DY Q  W +IS+ AA FG
Subjt:  SQVTSYVILMDNLINVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMDLDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAATFG

Query:  FALIILQVAYQIADY
        F ++ILQ  YQ  DY
Subjt:  FALIILQVAYQIADY

XP_038875622.1 UPF0481 protein At3g47200-like [Benincasa hispida]3.5e-13463.04Show/hide
Query:  QKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELLLGSYDKLEDEWKQNPAMFLKLMIVD
        +  + SIYK+PEFMR +Q KAFEP LVS GPYH+GK HLV ME  K+K F +F   Y L+ E+I ES+ N LE L G+YDKL+++WKQ+ A FL++MIVD
Subjt:  QKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELLLGSYDKLEDEWKQNPAMFLKLMIVD

Query:  GCFLLDFFEECPHSLSSMSWDIRRDMLLLENQLPNILLLHIY-NLAHKKNDIEGLRSLICKSMFIEPVDAVIWNEVHILAMYRASLLYPPVNTIYCGGLP
        GCF+LDFF+ECP+SLS+M WDI+RDMLLLENQLP  LL  +Y N+ +   + + LRSLIC+ M I   +AV+  E+HIL MYRASLLYPPV+     G  
Subjt:  GCFLLDFFEECPHSLSSMSWDIRRDMLLLENQLPNILLLHIY-NLAHKKNDIEGLRSLICKSMFIEPVDAVIWNEVHILAMYRASLLYPPVNTIYCGGLP

Query:  K-NYNLSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVPGSQVTSYVILMDNLINVDKDVALLV
        K  Y  S+P C+IIPQAT+L DA IKFK SPTK L DVSF+  QGVL+LPY+VVDD +ET+LLNVMAFEK +   GS+VTS+VILM+NLI+VD+DVALL 
Subjt:  K-NYNLSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVPGSQVTSYVILMDNLINVDKDVALLV

Query:  SKKILANTLGDDQSAAYLFNLLGKGAAMDLDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAATFGFALIILQVAYQIADYNRGN
        S KILAN LG+D+SAA LF+LLGKGAAMDLD+HI DVH  VNKH S SWN+  ASLK DY Q+ WA+IS+ AA FGFA++I+Q  YQI DY+RGN
Subjt:  SKKILANTLGDDQSAAYLFNLLGKGAAMDLDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAATFGFALIILQVAYQIADYNRGN

TrEMBL top hitse value%identityAlignment
A0A5A7TA99 UPF0481 protein1.4e-9147.55Show/hide
Query:  KRLLSEVEESSIGMEEISSQKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYN--LHFETIAESVCNILELLLGSY
        KRL+S  E  S        Q IK SIYK+P+FM+ +QPKA+EP LVSFGPYH+G  HL  ME+ K K F       N    +E+IA  V NILE L  +Y
Subjt:  KRLLSEVEESSIGMEEISSQKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYN--LHFETIAESVCNILELLLGSY

Query:  DKLEDE-WKQNP---AMFLKLMIVDGCFLLDFF--EECPHSLSSMSWDIRRDMLLLENQLPNILLLHIYNLAHKKNDIEGLRSLICKSMFIEPVDAVIWN
        D L+ E W++N    A F+++MI+D CF+L FF  ++   SL ++  DI+RD+LLLENQLP  LL  +Y +   K+  + L SLICK  F++  +  +  
Subjt:  DKLEDE-WKQNP---AMFLKLMIVDGCFLLDFF--EECPHSLSSMSWDIRRDMLLLENQLPNILLLHIYNLAHKKNDIEGLRSLICKSMFIEPVDAVIWN

Query:  EVHILAMYRASLLYP-PVNTIYCG-----------GLPKNYNLSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLL
          HIL MY+  LL P PV  +  G           G  K  + S  N +IIPQAT L DA IKF+ S T+ L D+ F    GVL+LP+L VDD +ET LL
Subjt:  EVHILAMYRASLLYP-PVNTIYCG-----------GLPKNYNLSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLL

Query:  NVMAFEKFHGVPGSQVTSYVILMDNLINVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMDLDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQH
        NVMAFEK HG   S VTS+V+LM+NLI++DKDV LL   KI+ N LG+D+ AA LF LLGKG A+DL+++I  VH  VNKH     N+  A+L+ +Y Q+
Subjt:  NVMAFEKFHGVPGSQVTSYVILMDNLINVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMDLDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQH

Query:  KWALISVLAATFGFALIILQVAYQIADYN
         WA+IS++ A FGF ++I+Q  YQI DY+
Subjt:  KWALISVLAATFGFALIILQVAYQIADYN

A0A6J1H6V9 UPF0481 protein At3g47200-like6.8e-9148.65Show/hide
Query:  ISSQKIKASIYKLPEFMRAVQP----KAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELLLGSYDKLEDEWKQNPAMF
        + S   K SIYKLP FMR        KAF+P +VSFGPYH+GK HL  MER K K F+    +  L  E I   V  IL+ LL SYD+LE+EW Q+P  F
Subjt:  ISSQKIKASIYKLPEFMRAVQP----KAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELLLGSYDKLEDEWKQNPAMF

Query:  LKLMIVDGCFLLDFFEECPHSLSSMSWDIRRDMLLLENQLPNILLLHIYNLA-----HKKNDIEGLRSLICKSMFIEPVDAVIWNEVHILAMYRASLLYP
        LKLMIVDGCF+L  F +CP SL +M  DIR + LLLENQLP  LL  ++++A      + +D   L+ LIC  + I   D ++   +HIL +Y+ASLL P
Subjt:  LKLMIVDGCFLLDFFEECPHSLSSMSWDIRRDMLLLENQLPNILLLHIYNLA-----HKKNDIEGLRSLICKSMFIEPVDAVIWNEVHILAMYRASLLYP

Query:  PVN-TIYCGGLPKNYNLSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVPGSQ-----VTSYVI
        P++ T +C    +   LS    ++IP AT+L +A IKFK S TK L DV FD  +GVL LP L+VDD +E++ LNVMAFEK H    +Q     +TS+VI
Subjt:  PVN-TIYCGGLPKNYNLSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVPGSQ-----VTSYVI

Query:  LMDNLINVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMDLDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAATFGFALIILQV
        LM NLI+ ++DVALL SK  LAN LG+D+ AA LF+ LGKG AM  + H++ VH  +N +  + WN+  A+LK  Y Q+ W +IS+ AA FGF ++ILQ 
Subjt:  LMDNLINVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMDLDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAATFGFALIILQV

Query:  AYQIADY
         YQ+ DY
Subjt:  AYQIADY

A0A6J1HB25 UPF0481 protein At3g47200-like2.2e-9748.32Show/hide
Query:  MMRDKRLLSEVEESS--IGMEEISSQKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELL
        ++R K  L ++ +S   + ME   S ++  SIYK+P FM    PKA+EP +VS GPY++GK HL  ME  K K F  F A+  L  E+I   V  IL+ L
Subjt:  MMRDKRLLSEVEESS--IGMEEISSQKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELL

Query:  LGSYDKLEDEWKQNPAMFLKLMIVDGCFLLDFFEECPHSLSSMSWDIRRDMLLLENQLPNILLLHIYNLAHKKNDI--EGLRSLICKSMFIEPVDAVIWN
        + SYDKLE++WK++P  FL+LMIVDGCF+L F   CP SL ++S DI++DMLLLENQLP +LL  +Y++A +   +  + L+ L+   + I P + V+ +
Subjt:  LGSYDKLEDEWKQNPAMFLKLMIVDGCFLLDFFEECPHSLSSMSWDIRRDMLLLENQLPNILLLHIYNLAHKKNDI--EGLRSLICKSMFIEPVDAVIWN

Query:  EVHILAMYRASLLYPPVNTIYCGGLPKNYNLSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVP
         +HIL MY+ SLL+PP++             S+P C++IP AT+LR+A IKFK S T  L DV FD   GVL LP L+VDD +E++LLNVMAFEK H   
Subjt:  EVHILAMYRASLLYPPVNTIYCGGLPKNYNLSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVP

Query:  GSQVTSYVILMDNLINVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMDLDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAATF
        G +VTS+VILM NLI+ ++DVA+L  +++LAN +G+D+ AA LFN LG GAAM LD H+  VH  VN+H ++ WN+  A+LK +Y Q  W +IS+ AA F
Subjt:  GSQVTSYVILMDNLINVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMDLDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAATF

Query:  GFALIILQVAYQIADY
        GF ++ILQ  YQ  DY
Subjt:  GFALIILQVAYQIADY

A0A6J1KVQ6 UPF0481 protein At3g47200-like isoform X23.1e-9948.67Show/hide
Query:  MMRDKRLLSEVEESS--IGMEEISSQKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELL
        ++R K  L ++ +S   + ME   S ++  SIYK+P FM    PKA+EP +VS GPY++GK HL  ME  K K F  F  +  L  E+I   V  IL+ L
Subjt:  MMRDKRLLSEVEESS--IGMEEISSQKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELL

Query:  LGSYDKLEDEWKQNPAMFLKLMIVDGCFLLDFFEECPHSLSSMSWDIRRDMLLLENQLPNILLLHIYNLAHKKNDI-EGLRSLICKSMFIEPVDAVIWNE
        + SYD+LE+EW Q+P  FL+LMIVDGCF+L F   CP+SL ++S DI++DMLLLENQLP +LL  +Y++A +   + +  + L+CK + I P + V+ + 
Subjt:  LGSYDKLEDEWKQNPAMFLKLMIVDGCFLLDFFEECPHSLSSMSWDIRRDMLLLENQLPNILLLHIYNLAHKKNDI-EGLRSLICKSMFIEPVDAVIWNE

Query:  VHILAMYRASLLYPPVNTIYCGGLPKNYNLSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVPG
        +HIL MY+ SLLYPP++           + S+P C++IP AT+L +A IKFK S T+ L DV FD  +GVL LP L+VDD +E+++LNVMAFEK H   G
Subjt:  VHILAMYRASLLYPPVNTIYCGGLPKNYNLSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVPG

Query:  SQVTSYVILMDNLINVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMDLDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAATFG
         +VTS+VILM NLI+ ++DVA+L  +KILAN +G+D+ AA LF+ LG GAAM LD+H+  VH  VN H ++ WN+  A+LK DY Q  W +IS+ AA FG
Subjt:  SQVTSYVILMDNLINVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMDLDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAATFG

Query:  FALIILQVAYQIADY
        F ++ILQ  YQ  DY
Subjt:  FALIILQVAYQIADY

A0A6J1KYV8 UPF0481 protein At3g47200-like isoform X14.0e-9948.56Show/hide
Query:  MMRDKRLLSEVEESS--IGMEEISSQKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELL
        ++R K  L ++ +S   + ME   S ++  SIYK+P FM    PKA+EP +VS GPY++GK HL  ME  K K F  F  +  L  E+I   V  IL+ L
Subjt:  MMRDKRLLSEVEESS--IGMEEISSQKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELL

Query:  LGSYDKLEDEWKQNPAMFLKLMIVDGCFLLDFFEECPHSLSSMSWDIRRDMLLLENQLPNILLLHIYNLAHKKNDI--EGLRSLICKSMFIEPVDAVIWN
        + SYD+LE+EW Q+P  FL+LMIVDGCF+L F   CP+SL ++S DI++DMLLLENQLP +LL  +Y++A +   +  +  + L+CK + I P + V+ +
Subjt:  LGSYDKLEDEWKQNPAMFLKLMIVDGCFLLDFFEECPHSLSSMSWDIRRDMLLLENQLPNILLLHIYNLAHKKNDI--EGLRSLICKSMFIEPVDAVIWN

Query:  EVHILAMYRASLLYPPVNTIYCGGLPKNYNLSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVP
         +HIL MY+ SLLYPP++           + S+P C++IP AT+L +A IKFK S T+ L DV FD  +GVL LP L+VDD +E+++LNVMAFEK H   
Subjt:  EVHILAMYRASLLYPPVNTIYCGGLPKNYNLSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVP

Query:  GSQVTSYVILMDNLINVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMDLDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAATF
        G +VTS+VILM NLI+ ++DVA+L  +KILAN +G+D+ AA LF+ LG GAAM LD+H+  VH  VN H ++ WN+  A+LK DY Q  W +IS+ AA F
Subjt:  GSQVTSYVILMDNLINVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMDLDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAATF

Query:  GFALIILQVAYQIADY
        GF ++ILQ  YQ  DY
Subjt:  GFALIILQVAYQIADY

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026456.7e-1922.45Show/hide
Query:  MEEISSQKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYN-LHFETIAESVCNILELLLGSYDKLEDEWKQNPAMF
        +EE   +++  SI+ +P+ +    P ++ P  VS GPYH  KP L EMER K     K   +YN   F  + E + ++   +   Y K       N    
Subjt:  MEEISSQKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYN-LHFETIAESVCNILELLLGSYDKLEDEWKQNPAMF

Query:  LKLMIVDGCFLLDF-----FEECPHSLSSMSW-DIRRDMLLLENQLPNILLLHI--YNLAHKKNDIEGLRSL-------------------ICKSMFIE-
        L +M VD  FL++F     F +    ++ +   +I RD++++ENQ+P  +L     + L   ++  + L S+                   I K+ F E 
Subjt:  LKLMIVDGCFLLDF-----FEECPHSLSSMSW-DIRRDMLLLENQLPNILLLHI--YNLAHKKNDIEGLRSL-------------------ICKSMFIE-

Query:  -------------------------------------------------------PVDAVI---WNEVHILAMYRASLLYPPVNTIYCGGLPKN------
                                                               P D ++   W  +  L  + A  L    +  Y     +N      
Subjt:  -------------------------------------------------------PVDAVI---WNEVHILAMYRASLLYPPVNTIYCGGLPKN------

Query:  ------YNLSNPNC---RIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVPGSQVTSYVILMDNLINVDK
               ++  P       IP  ++L  A ++FK +    +  V+FD N G   LP + +D  +ET L N++A+E  +       T Y  L++ +I+ ++
Subjt:  ------YNLSNPNC---RIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVPGSQVTSYVILMDNLINVDK

Query:  DVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMDLDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAATFGFALIILQV
        DV LL  + +L + L  DQ AA ++N + K   +     +      VN++Y+  W      L   Y+   W +++ LAA     L+ LQ+
Subjt:  DVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMDLDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAATFGFALIILQV

Q9SD53 UPF0481 protein At3g472008.7e-3526.93Show/hide
Query:  SSQKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKF---TAKYNLHFETIAESVCNILELLLGSYDKLEDEWKQNPAMFLK
        S+ K    I+++PE   A+ PKA++P +VS GPYHYG+ HL  ++++K +    F     K ++    + ++V ++ + +  SY   E+    +  MF  
Subjt:  SSQKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKF---TAKYNLHFETIAESVCNILELLLGSYDKLEDEWKQNPAMFLK

Query:  LMIVDGCFLLDFF-------EECPHSLSSMSW---DIRRDMLLLENQLPNILLLHIYNLAHKKNDIEGLRSLICKSMFIEPVD--AVIWNE------VHI
        +M++DGCF+L  F       E     + S+ W    I+ D+LLLENQ+P  +L  +Y     K  +    + I    F  P+D     W +       H+
Subjt:  LMIVDGCFLLDFF-------EECPHSLSSMSW---DIRRDMLLLENQLPNILLLHIYNLAHKKNDIEGLRSLICKSMFIEPVD--AVIWNE------VHI

Query:  LAMYRASLL----------YPPVNTIYCGGLPKNYNLSNPNCRIIP---QATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVM
        L + R + L           P V      G  K+ N+ + + + +P    A  LR   IKF+   +K    ++    +  L++P L  D +  +  LN +
Subjt:  LAMYRASLL----------YPPVNTIYCGGLPKNYNLSNPNCRIIP---QATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVM

Query:  AFEKFHGVPGSQVTSYVILMDNLINVDKDVALLVSKK-ILANTLGDDQSAAYLFNLLGKGAAMDLD-NHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHK
        AFE+F+    +++T+Y++ M  L+N ++DV  L + K I+ N  G +   +  F  + K    ++D +++ +V   VN++  K +N LWA  +  + +  
Subjt:  AFEKFHGVPGSQVTSYVILMDNLINVDKDVALLVSKK-ILANTLGDDQSAAYLFNLLGKGAAMDLD-NHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHK

Query:  WALISVLAATFGFALIILQVAYQIADY
        W  +S  A  F   L +LQ    I  Y
Subjt:  WALISVLAATFGFALIILQVAYQIADY

Arabidopsis top hitse value%identityAlignment
AT3G50130.1 Plant protein of unknown function (DUF247)1.5e-4227.73Show/hide
Query:  EEISSQKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELLLGSYDKLEDEWKQNPAMFLK
        E+ ++   K  IY++P++++    K++ P  VS GP+H+G  HL+ M+R+K +  +   A+     E   +++  + +     Y+   D    +   F +
Subjt:  EEISSQKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELLLGSYDKLEDEWKQNPAMFLK

Query:  LMIVDGCFLLDFFEECPHSLSSMSWD--------------IRRDMLLLENQLPNILLLHIYNLAHKKNDIEGLRSLIC----------------------
        ++++DGCF+L+ F       S + +D              I+RDM++LENQLP  +L  +  +   K    GL S +                       
Subjt:  LMIVDGCFLLDFFEECPHSLSSMSWD--------------IRRDMLLLENQLPNILLLHIYNLAHKKNDIEGLRSLIC----------------------

Query:  KSMFIEPVDAVIWNEVHILAMYRASLLYPPVNTIYCGGLPKNYNL---------SNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYL
        +  F  P+      E+H L ++R +LL P  N       P+   +              ++I   TELR+A IKF+   T    D+ F    G L++P L
Subjt:  KSMFIEPVDAVIWNEVHILAMYRASLLYPPVNTIYCGGLPKNYNL---------SNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYL

Query:  VVDDYSETSLLNVMAFEKFHGVPGSQVTSYVILMDNLINVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMDLDN-HIIDVHNAVNKHYSKSWNK
        ++ D +++   N++AFE+ H    + +TSY+I MDNLI+  +DV  L    I+ + LG+D   A LFN L +  A D  N ++  + N V+++YS+ WN 
Subjt:  VVDDYSETSLLNVMAFEKFHGVPGSQVTSYVILMDNLINVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMDLDN-HIIDVHNAVNKHYSKSWNK

Query:  LWASLKPDYLQHKWALISVLAATFGFALIILQVAYQIADY
        L A LK  Y  + WA  S  AA     L + Q  +    Y
Subjt:  LWASLKPDYLQHKWALISVLAATFGFALIILQVAYQIADY

AT3G50150.1 Plant protein of unknown function (DUF247)1.8e-4328.67Show/hide
Query:  KASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELLLGSYDKLEDEWKQNPAMFLKLMIVDGCF
        K  IY++P +++    K++ P  VS GPYH+GK HL  MER+K +  +   A+   + E   +++  + E     Y    D   +N   F +++++DGCF
Subjt:  KASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELLLGSYDKLEDEWKQNPAMFLKLMIVDGCF

Query:  LLDFFEECPHSLSSMSW--------------DIRRDMLLLENQLPNILLLHIYNLAHKKNDIEGLRSLICKSMF--IEPVDAVIWNE-------------
        +L+ F+        + +               I+RDM++LENQLP  +L  +  L     +  G+ + +    F  + P   V+                
Subjt:  LLDFFEECPHSLSSMSW--------------DIRRDMLLLENQLPNILLLHIYNLAHKKNDIEGLRSLICKSMF--IEPVDAVIWNE-------------

Query:  -----VHILAMYRASLLYPPVNTIYCGGLP-KNYNLSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEK
             +H L ++  SL+     T    G P ++ ++     ++I   TELR A + F    T  L D+ F    G LK+P L++ D +++   N++AFE+
Subjt:  -----VHILAMYRASLLYPPVNTIYCGGLP-KNYNLSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEK

Query:  FHGVPGSQVTSYVILMDNLINVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMD-LDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALIS
         H    + +TSY+I MDNLIN  +DV+ L    I+ + LG D   A LFN L K    D  D ++  +   VN++YS+ WN L A+L+  Y  + WA  S
Subjt:  FHGVPGSQVTSYVILMDNLINVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMD-LDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALIS

Query:  VLAATFGFALIILQVAYQIADY
          AA     L   Q  + +  Y
Subjt:  VLAATFGFALIILQVAYQIADY

AT3G50160.1 Plant protein of unknown function (DUF247)4.4e-4229.61Show/hide
Query:  IYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELLLGSYDKLEDEWKQNPAMFLKLMIVDGCFLLD
        IY++P +++    K++ P +VS GPYH+G  HL+ MER+K +  +   A+     E   +++  + E     Y   +     N   F++++++DG F+++
Subjt:  IYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELLLGSYDKLEDEWKQNPAMFLKLMIVDGCFLLD

Query:  FFEECPHSLSSMSW--------------DIRRDMLLLENQLPNILLLHIYNLAHKKNDIEGLRSLICKSMF--IEPVDAVIWNE--VHILAMYRASLLYP
         F+        + +               IRRDM++LENQLP  +L  +  L  + + ++ +   + +  F  + P   V+  E  +H L + R  LL  
Subjt:  FFEECPHSLSSMSW--------------DIRRDMLLLENQLPNILLLHIYNLAHKKNDIEGLRSLICKSMF--IEPVDAVIWNE--VHILAMYRASLLYP

Query:  PVNTIYCGGLPKNYNLSNPN-CRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVPGSQVTSYVILMDNL
               G   ++ ++ N    ++I   TELR+A ++F    T    D+ F    G LK+P L++ D +++  LN++AFE+ H     ++TSY+I MDNL
Subjt:  PVNTIYCGGLPKNYNLSNPN-CRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVPGSQVTSYVILMDNL

Query:  INVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMD-LDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAA
        IN  +DV+ L    I+ N LG D   + LFN LGK    D  D ++  +   VN +Y + WN L A+L+  Y  + WA  S +AA
Subjt:  INVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMD-LDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAA

AT3G50170.1 Plant protein of unknown function (DUF247)3.1e-4328.47Show/hide
Query:  KASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELLLGSYDKLEDEWKQNPAMFLKLMIVDGCF
        K  IY++P +++    K++ P  VS GPYH+GK  L  MER+K +  +K   +     E    ++  + E     Y   E     +   F +++++DGCF
Subjt:  KASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELLLGSYDKLEDEWKQNPAMFLKLMIVDGCF

Query:  LLDFFEECPHSLSSMSW--------------DIRRDMLLLENQLPNILLLHIYNLAHKKNDIEGLRSLICKSMF----------IEPVDAVIWN------
        +L+ F       + + +               I+RDM++LENQLP  +L  +  L     +  G+ + +    F           +P  + + N      
Subjt:  LLDFFEECPHSLSSMSW--------------DIRRDMLLLENQLPNILLLHIYNLAHKKNDIEGLRSLICKSMF----------IEPVDAVIWN------

Query:  -------EVHILAMYRASLLY--PPVNT-IYCGGLPKNYN-LSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLN
               E+H L ++R SLL   P  NT      L +N   +     +++   TELR+A +KF+   T    D+ F    G L++P L++ D +++   N
Subjt:  -------EVHILAMYRASLLY--PPVNT-IYCGGLPKNYN-LSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLN

Query:  VMAFEKFHGVPGSQVTSYVILMDNLINVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMD-LDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQH
        ++AFE+ H    + +TSY+I MDNLIN  +DV+ L    I+ + LG D   A LFN L +    D  D+H+  +   VN++Y++ WN L A+L   Y  +
Subjt:  VMAFEKFHGVPGSQVTSYVILMDNLINVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMD-LDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQH

Query:  KWALISVLAATFGFALIILQVAYQIADYNRGN
         WA  S  AA     L + Q  Y +  Y + N
Subjt:  KWALISVLAATFGFALIILQVAYQIADYNRGN

AT3G50180.1 Plant protein of unknown function (DUF247)3.1e-4329.02Show/hide
Query:  KASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELLLGSYDKLEDEWKQNPAMFLKLMIVDGCF
        K  IYK+P ++     K++ P  VS GPYH+G+     ME +K +  +    + N   E   +++  + E     Y   E     +   F +++++DGCF
Subjt:  KASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELLLGSYDKLEDEWKQNPAMFLKLMIVDGCF

Query:  LLDFFEECPHSLSSMSWD--------------IRRDMLLLENQLPNILLLHIYNLAHKKNDIEGLRSLICKSMFI------------EPVDAVIWNEVHI
        +L+  +        + +D              I+RDM++LENQLP  +L  +  L     +  GL  L+ +  FI             P   V   E+H 
Subjt:  LLDFFEECPHSLSSMSWD--------------IRRDMLLLENQLPNILLLHIYNLAHKKNDIEGLRSLICKSMFI------------EPVDAVIWNEVHI

Query:  LAMYRASLLYPPVNTIYCGGLPKNYNLSNPNC-RIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVPGSQ
        L ++  SLL+P        G      +++ +  R+IP  TELRDA  KFK + T    D+ F  + G L++P L++ D +++  LN++AFE+ H    + 
Subjt:  LAMYRASLLYPPVNTIYCGGLPKNYNLSNPNC-RIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVPGSQ

Query:  VTSYVILMDNLINVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMD-----LDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAA
        +TSY+I MDNLI+  +D++ L    I+ ++LG +   A +FN L +    D     L   +I+VH    ++YS+  N L  +L   YL + WA +S  AA
Subjt:  VTSYVILMDNLINVDKDVALLVSKKILANTLGDDQSAAYLFNLLGKGAAMD-----LDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAA

Query:  TFGFALIILQ
             L   Q
Subjt:  TFGFALIILQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGAGAGACAAGAGACTTCTGAGTGAAGTGGAGGAAAGTAGTATTGGGATGGAGGAGATATCATCGCAAAAGATCAAAGCTTCAATCTACAAATTACCAGAATTCAT
GAGAGCAGTTCAACCAAAAGCTTTCGAGCCATGTTTGGTGTCTTTTGGGCCATACCATTATGGGAAACCCCATTTGGTTGAAATGGAAAGGAATAAAAAGAAAGGGTTTT
CAAAATTTACAGCTAAATACAATCTGCATTTTGAGACTATTGCGGAGAGTGTGTGTAACATTTTGGAACTTCTTTTGGGATCGTACGATAAGCTCGAGGATGAATGGAAG
CAAAATCCTGCAATGTTCTTGAAACTCATGATTGTGGATGGCTGTTTCCTGCTGGATTTCTTCGAAGAATGTCCCCACTCGCTAAGCTCTATGAGTTGGGATATAAGGCG
GGACATGCTGCTGCTCGAGAATCAGCTACCCAACATCCTACTTCTACACATCTATAACTTAGCACACAAAAAGAATGATATTGAAGGTCTGAGATCGCTTATTTGCAAGT
CAATGTTCATCGAACCTGTGGATGCGGTAATATGGAACGAGGTGCATATTTTGGCGATGTATAGAGCGTCATTATTGTATCCTCCGGTAAACACTATATATTGTGGTGGG
TTGCCAAAAAATTATAACCTATCCAACCCAAATTGTCGCATCATTCCACAAGCAACAGAGCTTCGTGATGCGAGAATAAAATTCAAGTGTAGCCCTACAAAAGGTCTTGG
AGACGTGAGTTTTGACCGAAATCAAGGCGTGTTGAAACTGCCATATTTAGTGGTGGATGACTATAGTGAAACAAGTTTATTAAATGTGATGGCATTTGAGAAATTCCACG
GGGTACCTGGAAGTCAAGTCACATCTTATGTGATCTTGATGGACAATTTGATCAACGTGGATAAAGATGTGGCGTTGCTGGTGTCTAAGAAGATATTAGCCAATACGCTT
GGGGATGATCAAAGTGCGGCGTATTTATTTAATTTGCTTGGGAAAGGGGCGGCTATGGATTTGGATAACCACATAATAGATGTGCATAACGCAGTGAATAAGCATTACAG
CAAGTCATGGAATAAATTGTGGGCAAGTCTGAAACCTGATTACCTTCAACACAAATGGGCCCTTATCTCTGTCCTTGCCGCTACTTTCGGTTTCGCCCTCATTATTCTTC
AGGTCGCCTACCAAATTGCTGATTACAACAGAGGGAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGATGAGAGACAAGAGACTTCTGAGTGAAGTGGAGGAAAGTAGTATTGGGATGGAGGAGATATCATCGCAAAAGATCAAAGCTTCAATCTACAAATTACCAGAATTCAT
GAGAGCAGTTCAACCAAAAGCTTTCGAGCCATGTTTGGTGTCTTTTGGGCCATACCATTATGGGAAACCCCATTTGGTTGAAATGGAAAGGAATAAAAAGAAAGGGTTTT
CAAAATTTACAGCTAAATACAATCTGCATTTTGAGACTATTGCGGAGAGTGTGTGTAACATTTTGGAACTTCTTTTGGGATCGTACGATAAGCTCGAGGATGAATGGAAG
CAAAATCCTGCAATGTTCTTGAAACTCATGATTGTGGATGGCTGTTTCCTGCTGGATTTCTTCGAAGAATGTCCCCACTCGCTAAGCTCTATGAGTTGGGATATAAGGCG
GGACATGCTGCTGCTCGAGAATCAGCTACCCAACATCCTACTTCTACACATCTATAACTTAGCACACAAAAAGAATGATATTGAAGGTCTGAGATCGCTTATTTGCAAGT
CAATGTTCATCGAACCTGTGGATGCGGTAATATGGAACGAGGTGCATATTTTGGCGATGTATAGAGCGTCATTATTGTATCCTCCGGTAAACACTATATATTGTGGTGGG
TTGCCAAAAAATTATAACCTATCCAACCCAAATTGTCGCATCATTCCACAAGCAACAGAGCTTCGTGATGCGAGAATAAAATTCAAGTGTAGCCCTACAAAAGGTCTTGG
AGACGTGAGTTTTGACCGAAATCAAGGCGTGTTGAAACTGCCATATTTAGTGGTGGATGACTATAGTGAAACAAGTTTATTAAATGTGATGGCATTTGAGAAATTCCACG
GGGTACCTGGAAGTCAAGTCACATCTTATGTGATCTTGATGGACAATTTGATCAACGTGGATAAAGATGTGGCGTTGCTGGTGTCTAAGAAGATATTAGCCAATACGCTT
GGGGATGATCAAAGTGCGGCGTATTTATTTAATTTGCTTGGGAAAGGGGCGGCTATGGATTTGGATAACCACATAATAGATGTGCATAACGCAGTGAATAAGCATTACAG
CAAGTCATGGAATAAATTGTGGGCAAGTCTGAAACCTGATTACCTTCAACACAAATGGGCCCTTATCTCTGTCCTTGCCGCTACTTTCGGTTTCGCCCTCATTATTCTTC
AGGTCGCCTACCAAATTGCTGATTACAACAGAGGGAATTAA
Protein sequenceShow/hide protein sequence
MMRDKRLLSEVEESSIGMEEISSQKIKASIYKLPEFMRAVQPKAFEPCLVSFGPYHYGKPHLVEMERNKKKGFSKFTAKYNLHFETIAESVCNILELLLGSYDKLEDEWK
QNPAMFLKLMIVDGCFLLDFFEECPHSLSSMSWDIRRDMLLLENQLPNILLLHIYNLAHKKNDIEGLRSLICKSMFIEPVDAVIWNEVHILAMYRASLLYPPVNTIYCGG
LPKNYNLSNPNCRIIPQATELRDARIKFKCSPTKGLGDVSFDRNQGVLKLPYLVVDDYSETSLLNVMAFEKFHGVPGSQVTSYVILMDNLINVDKDVALLVSKKILANTL
GDDQSAAYLFNLLGKGAAMDLDNHIIDVHNAVNKHYSKSWNKLWASLKPDYLQHKWALISVLAATFGFALIILQVAYQIADYNRGN