CuGenDBv2

HG10007018 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10007018
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: Chr10:399659..402875
RNA-Seq Expression: HG10007018
Synteny: HG10007018
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
    IPR011990 - Tetratricopeptide-like helical domain superfamily
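The genome location in this record is written as `chromosome:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not say), the string can be parsed and the span measured as follows. `parse_location` and `span_length` are hypothetical helper names used for illustration only; they are not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

def span_length(start, end):
    # Assumption: 1-based inclusive coordinates, so both endpoints are counted.
    return end - start + 1

chrom, start, end = parse_location("Chr10:399659..402875")
print(chrom, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` needs to change.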


Homology
GenBank top hits (e-value, % identity, alignment)
XP_004135765.1 pentatricopeptide repeat-containing protein At5g48910 isoform X1 [Cucumis sativus] (e-value: 2.2e-178; identity: 92.01%)
Query:  MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFI
        MSSISTSH P   KPVDFS EK IPTSKLPQKTVLKLFDSKSITSLQYL+Q+HGLVLRSGHFQDHYVSGAL+KCYANPHF NFDFALKVFSSIPNPNVFI
Subjt:  MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFI

Query:  WNIVIKGCLENNKAFKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCW
        WNIVIKGCLENNK FKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGI  D+HIKSAGI MYASFG LEDARK+F SGESDVVCW
Subjt:  WNIVIKGCLENNKAFKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCW

Query:  NTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSV
        NTMIDGYLKCG LEAAKGLFAQMP++NIGSWNVMINGLAKGG LGDARK+FDEMSERDEISWSSMVDGYIS G YKEALEIFQQMQREETRPGRFILSSV
Subjt:  NTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSV

Query:  LAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM
        LAACS+ GAIDQGRWVHAYLKRNSIKLDAVLGTALLDM
Subjt:  LAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM
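In these alignments the unlabeled middle line marks each column: a letter means the residues are identical, `+` means a conservative substitution, and a space means a mismatch. As a rough sketch of how identities could be counted from one alignment block, the helper below compares a query line with its middle line once the `Query:` label and the matching leading spaces have been stripped. `count_identities` is a hypothetical name, and the result covers only the block passed in, so it will not necessarily equal the % identity reported for the whole hit.

```python
def count_identities(query, midline):
    """Return (identical positions, aligned length) for one alignment block."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A column is an identity when the midline repeats the query residue.
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    return identical, len(query)

# Last block of the alignment above, labels and leading padding removed.
query = "LAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM"
midline = "LAACS+ GAIDQGRWVHAYLKRNSIKLDAVLGTALLDM"
identical, length = count_identities(query, midline)
print(f"{identical}/{length} identical ({100 * identical / length:.2f}%)")
```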

XP_016900941.1 PREDICTED: pentatricopeptide repeat-containing protein At5g48910-like isoform X1 [Cucumis melo] (e-value: 6.5e-183; identity: 94.38%)
Query:  MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFI
        MSSISTSH P   KPVD S EKKIPTSKLPQKTVLKLFDSKSITSLQYL QVHGLVLRSGHFQDHYVSGAL+KCYANPHF NFDFALKVFSSIPNPNVFI
Subjt:  MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFI

Query:  WNIVIKGCLENNKAFKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCW
        WNIVIKGCLENNK FKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGI  DMHIKSAGIQMYASFG LEDARKLF SGESDVVCW
Subjt:  WNIVIKGCLENNKAFKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCW

Query:  NTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSV
        NTMIDGYLKCGDLEAAKGLFAQMP+RNIGSWNVMINGLAKGGKLGDARK+FDEMSERDEISWSSMVDGYIS GCYKEALEIFQQMQREETRPGRFILSSV
Subjt:  NTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSV

Query:  LAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM
        LAACS+ GAIDQGRWVHAYLKRNSIKLDAVLGTALLDM
Subjt:  LAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM

XP_022989356.1 pentatricopeptide repeat-containing protein At5g66520-like isoform X1 [Cucurbita maxima] (e-value: 1.0e-167; identity: 86.09%)
Query:  MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFI
        MSSIST+  P  L+P  F  E+K+PTSKLPQKTVLKLFDSKSITSL+YLSQVHGL+LRSGHFQDHYVSGAL+KCYANPHF NFDFALKVFSSIPNPNVFI
Subjt:  MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFI

Query:  WNIVIKGCLENNKAFKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCW
        WNIVIKGCLENN  FKAIYFYGRMVIDARPNKF+YPTLFKACSVAQAV EG QIH HVVKHG   DMHIKSAGIQMY SFG  EDARKL D+GESDVVCW
Subjt:  WNIVIKGCLENNKAFKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCW

Query:  NTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSV
        NT+IDGYLKCG+LEAAKGLF QMP RNIGSWNVMI+G AKGGKLGDARKVFDEM +RDEI+WSSMVDGYIS GCYKEALEIFQ MQR+E  PGRFIL SV
Subjt:  NTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSV

Query:  LAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM
        LAACSS GAIDQGRWVHAYL+RNSIKLDAVLGTALLDM
Subjt:  LAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM

XP_023529861.1 pentatricopeptide repeat-containing protein At5g48910-like isoform X1 [Cucurbita pepo subsp. pepo] (e-value: 7.7e-168; identity: 86.39%)
Query:  MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFI
        MSSIST+  P  LKP  F  E+K+PTSKLPQKTVLKLFDSKSITSL+YLSQVHGL+LRSGHFQDHYVSGALVKCYANPHF NFDFALKVFSSIPNPNVFI
Subjt:  MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFI

Query:  WNIVIKGCLENNKAFKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCW
        WNIVIKGCLENN  FKAIYFYGRMVIDARPNKF+YPTLFKACSVAQAV EGRQIH HVVKHG   DMHIKSAGIQMY SFG  EDARKL D+GESDVVCW
Subjt:  WNIVIKGCLENNKAFKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCW

Query:  NTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSV
        NT+IDGYLKCG+LEAAKGLF QMP  NIGSWNVMI+G AKGGKLGDARKVFDEM +RDEI+WSSMVDGYIS GCYKEALEIFQ MQR+   PGRFIL SV
Subjt:  NTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSV

Query:  LAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM
        LAACSS GAIDQGRWVHAYL+RNSIKLDAVLGTALLDM
Subjt:  LAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM

XP_038878241.1 pentatricopeptide repeat-containing protein At5g48910-like isoform X1 [Benincasa hispida] (e-value: 1.8e-180; identity: 92.6%)
Query:  MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFI
        MSSISTSH P   KPVDFS EK+IP SKL QKTVLKLFDSKSI SL YLSQVHGLVLRSGHFQDHYVSGALVKCYA+PHF NFDFALKVFSSIPNPNVFI
Subjt:  MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFI

Query:  WNIVIKGCLENNKAFKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCW
        WNIVIKGCLENNK F+AIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIC DMHIKSAGIQMYASFGGLEDA+KLFDSGESD+VCW
Subjt:  WNIVIKGCLENNKAFKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCW

Query:  NTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSV
        NTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYIS G YKEALEIFQQMQ+EE RPG+FILSSV
Subjt:  NTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSV

Query:  LAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM
        LAACS+ GAIDQGRWVH YLKRNSIKLDAVLGTALLDM
Subjt:  LAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LYD3 DYW_deaminase domain-containing protein (e-value: 1.0e-178; identity: 92.01%)
Query:  MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFI
        MSSISTSH P   KPVDFS EK IPTSKLPQKTVLKLFDSKSITSLQYL+Q+HGLVLRSGHFQDHYVSGAL+KCYANPHF NFDFALKVFSSIPNPNVFI
Subjt:  MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFI

Query:  WNIVIKGCLENNKAFKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCW
        WNIVIKGCLENNK FKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGI  D+HIKSAGI MYASFG LEDARK+F SGESDVVCW
Subjt:  WNIVIKGCLENNKAFKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCW

Query:  NTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSV
        NTMIDGYLKCG LEAAKGLFAQMP++NIGSWNVMINGLAKGG LGDARK+FDEMSERDEISWSSMVDGYIS G YKEALEIFQQMQREETRPGRFILSSV
Subjt:  NTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSV

Query:  LAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM
        LAACS+ GAIDQGRWVHAYLKRNSIKLDAVLGTALLDM
Subjt:  LAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM

A0A1S4DY81 pentatricopeptide repeat-containing protein At5g48910-like isoform X1 (e-value: 3.1e-183; identity: 94.38%)
Query:  MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFI
        MSSISTSH P   KPVD S EKKIPTSKLPQKTVLKLFDSKSITSLQYL QVHGLVLRSGHFQDHYVSGAL+KCYANPHF NFDFALKVFSSIPNPNVFI
Subjt:  MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFI

Query:  WNIVIKGCLENNKAFKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCW
        WNIVIKGCLENNK FKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGI  DMHIKSAGIQMYASFG LEDARKLF SGESDVVCW
Subjt:  WNIVIKGCLENNKAFKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCW

Query:  NTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSV
        NTMIDGYLKCGDLEAAKGLFAQMP+RNIGSWNVMINGLAKGGKLGDARK+FDEMSERDEISWSSMVDGYIS GCYKEALEIFQQMQREETRPGRFILSSV
Subjt:  NTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSV

Query:  LAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM
        LAACS+ GAIDQGRWVHAYLKRNSIKLDAVLGTALLDM
Subjt:  LAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM

A0A5A7UBJ6 Pentatricopeptide repeat-containing protein (e-value: 3.1e-183; identity: 94.38%)
Query:  MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFI
        MSSISTSH P   KPVD S EKKIPTSKLPQKTVLKLFDSKSITSLQYL QVHGLVLRSGHFQDHYVSGAL+KCYANPHF NFDFALKVFSSIPNPNVFI
Subjt:  MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFI

Query:  WNIVIKGCLENNKAFKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCW
        WNIVIKGCLENNK FKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGI  DMHIKSAGIQMYASFG LEDARKLF SGESDVVCW
Subjt:  WNIVIKGCLENNKAFKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCW

Query:  NTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSV
        NTMIDGYLKCGDLEAAKGLFAQMP+RNIGSWNVMINGLAKGGKLGDARK+FDEMSERDEISWSSMVDGYIS GCYKEALEIFQQMQREETRPGRFILSSV
Subjt:  NTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSV

Query:  LAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM
        LAACS+ GAIDQGRWVHAYLKRNSIKLDAVLGTALLDM
Subjt:  LAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM

A0A6J1EZS2 pentatricopeptide repeat-containing protein At5g66520-like isoform X1 (e-value: 4.6e-166; identity: 85.21%)
Query:  MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFI
        M SIST+  P  LKP  F  E+K+PTSKLPQKTVLKLFDSKSITSL+YLSQVHGL+LRSGHFQDHYVSGALVKCYANPHF NFDFALKVFS IPNPNVFI
Subjt:  MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFI

Query:  WNIVIKGCLENNKAFKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCW
        WNIVIKGCLEN+  FKAIYFYGRMVIDARPNKF+YPTLF ACSVAQAV EG QIH HVVKHG   DMHIKSAGIQMY SFG  EDARKL D+GESDVVCW
Subjt:  WNIVIKGCLENNKAFKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCW

Query:  NTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSV
        NT+IDGYLKCG+LEAAKGLF QMP RNIGSWNVMI+G AKGGKLGDARK+FDEM +RDEI+WSSMVDGYIS GCYKEALEIFQ MQR+E  PGRFIL SV
Subjt:  NTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSV

Query:  LAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM
        LAACSS GAIDQGRWVHAYL+RNSIKLDAVLGTALLDM
Subjt:  LAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM

A0A6J1JJU5 pentatricopeptide repeat-containing protein At5g66520-like isoform X1 (e-value: 4.9e-168; identity: 86.09%)
Query:  MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFI
        MSSIST+  P  L+P  F  E+K+PTSKLPQKTVLKLFDSKSITSL+YLSQVHGL+LRSGHFQDHYVSGAL+KCYANPHF NFDFALKVFSSIPNPNVFI
Subjt:  MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFI

Query:  WNIVIKGCLENNKAFKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCW
        WNIVIKGCLENN  FKAIYFYGRMVIDARPNKF+YPTLFKACSVAQAV EG QIH HVVKHG   DMHIKSAGIQMY SFG  EDARKL D+GESDVVCW
Subjt:  WNIVIKGCLENNKAFKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCW

Query:  NTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSV
        NT+IDGYLKCG+LEAAKGLF QMP RNIGSWNVMI+G AKGGKLGDARKVFDEM +RDEI+WSSMVDGYIS GCYKEALEIFQ MQR+E  PGRFIL SV
Subjt:  NTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSV

Query:  LAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM
        LAACSS GAIDQGRWVHAYL+RNSIKLDAVLGTALLDM
Subjt:  LAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM

SwissProt top hits (e-value, % identity, alignment)
O49399 Pentatricopeptide repeat-containing protein At4g18840 (e-value: 1.8e-55; identity: 37.29%)
Query:  SKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYA-NPHFGNFDFALKVFSSIPNPNVFIWNIVIKGCLENNKAFKAIYFYGRMVI-DARPNKFTYPT
        +KS+T +Q   Q H  +L++G F D + +  LV   A NP      +A  + + I +PN F  N VI+    ++    A+  +  M++    P+K+++  
Subjt:  SKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYA-NPHFGNFDFALKVFSSIPNPNVFIWNIVIKGCLENNKAFKAIYFYGRMVI-DARPNKFTYPT

Query:  LFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFD-SGESDVVCWNTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMIN
        + KAC+     +EGRQIHG  +K G+  D+ +++  + +Y   G  E ARK+ D     D V WN+++  YL+ G ++ A+ LF +M  RN+ SWN MI+
Subjt:  LFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFD-SGESDVVCWNTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMIN

Query:  GLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREET-RPGRFILSSVLAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTAL
        G A  G + +A++VFD M  RD +SW++MV  Y  VGCY E LE+F +M  + T +P  F L SVL+AC+S G++ QG WVH Y+ ++ I+++  L TAL
Subjt:  GLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREET-RPGRFILSSVLAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTAL

Query:  LDM
        +DM
Subjt:  LDM

Q9FI80 Pentatricopeptide repeat-containing protein At5g48910 (e-value: 1.5e-49; identity: 35.76%)
Query:  SLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANP--HFGNFDFALKVFSSIPNPNVFIWNIVIKGCLEN--NKAFKAIYFYGRMVID--ARPNKFTYPTL
        +++ LSQ+H + ++SG  +D   +  +++  A    H  + D+A K+F+ +P  N F WN +I+G  E+  +KA  AI  +  M+ D    PN+FT+P++
Subjt:  SLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANP--HFGNFDFALKVFSSIPNPNVFIWNIVIKGCLEN--NKAFKAIYFYGRMVID--ARPNKFTYPTL

Query:  FKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSG--ESDVVCWNTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMIN
         KAC+    +QEG+QIHG  +K+G  GD  + S  ++MY   G ++DAR LF     E D+V    M D   + G+               I  WNVMI+
Subjt:  FKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSG--ESDVVCWNTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMIN

Query:  GLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSVLAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALL
        G  + G    AR +FD+M +R  +SW++M+ GY   G +K+A+E+F++M++ + RP    L SVL A S  G+++ G W+H Y + + I++D VLG+AL+
Subjt:  GLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSVLAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALL

Query:  DM
        DM
Subjt:  DM

Q9FJY7 Pentatricopeptide repeat-containing protein At5g66520 (e-value: 2.7e-46; identity: 32.53%)
Query:  LSQVHGLVLRSGHFQDHY-VSGALVKCYANPHFGNFDFALKVFSSIPNPNVFIWNIVIKGCLENNKAFKAIYFYGRMVIDARP-NKFTYPTLFKACSVAQ
        L Q+H  +L++G  QD Y ++  L  C ++       +A  VF     P+ F+WN++I+G   +++  +++  Y RM+  + P N +T+P+L KACS   
Subjt:  LSQVHGLVLRSGHFQDHY-VSGALVKCYANPHFGNFDFALKVFSSIPNPNVFIWNIVIKGCLENNKAFKAIYFYGRMVIDARP-NKFTYPTLFKACSVAQ

Query:  AVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCWNTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGD
        A +E  QIH  + K G                               E+DV   N++I+ Y   G+ + A  LF ++P  +  SWN +I G  K GK+  
Subjt:  AVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCWNTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGD

Query:  ARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSVLAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM
        A  +F +M+E++ ISW++M+ GY+     KEAL++F +MQ  +  P    L++ L+AC+  GA++QG+W+H+YL +  I++D+VLG  L+DM
Subjt:  ARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSVLAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic (e-value: 6.3e-48; identity: 29.5%)
Query:  MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSI----TSLQYLSQVHGLVLRSGHFQDHYVSGALVK-CYANPHFGNFDFALKVFSSIPN
        M S S    P S  P  F     +P+S  P    ++   S S+     +LQ L  +H  +++ G    +Y    L++ C  +PHF    +A+ VF +I  
Subjt:  MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSI----TSLQYLSQVHGLVLRSGHFQDHYVSGALVK-CYANPHFGNFDFALKVFSSIPN

Query:  PNVFIWNIVIKGCLENNKAFKAIYFYGRMV-IDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFD-SG
        PN+ IWN + +G   ++    A+  Y  M+ +   PN +T+P + K+C+ ++A +EG+QIHGHV+K G   D+++ ++ I MY   G LEDA K+FD S 
Subjt:  PNVFIWNIVIKGCLENNKAFKAIYFYGRMV-IDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFD-SG

Query:  ESDVVCWNTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSE-------------------------------------
          DVV +  +I GY   G +E A+ LF ++P++++ SWN MI+G A+ G   +A ++F +M +                                     
Subjt:  ESDVVCWNTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSE-------------------------------------

Query:  ---------------------------------RDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSVLAACSSTGAIDQGRWVHAYLKR-
                                         +D ISW++++ GY  +  YKEAL +FQ+M R    P    + S+L AC+  GAID GRW+H Y+ + 
Subjt:  ---------------------------------RDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSVLAACSSTGAIDQGRWVHAYLKR-

Query:  -NSIKLDAVLGTALLDM
           +   + L T+L+DM
Subjt:  -NSIKLDAVLGTALLDM

Q9SJZ3 Pentatricopeptide repeat-containing protein At2g22410, mitochondrial (e-value: 5.2e-50; identity: 30.43%)
Query:  LQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFIWNIVIKGCLENNKAFKAIYFYGRMV----IDARPNKFTYPTLFKA
        L +L Q+   ++ +G   D + S  L+   A       D+++K+   I NPN+F WN+ I+G  E+    ++   Y +M+     ++RP+ FTYP LFK 
Subjt:  LQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFIWNIVIKGCLENNKAFKAIYFYGRMV----IDARPNKFTYPTLFKA

Query:  CSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFD-SGESDVVCWNTMIDGY---------------------------------
        C+  +    G  I GHV+K  +    H+ +A I M+AS G +E+ARK+FD S   D+V WN +I+GY                                 
Subjt:  CSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFD-SGESDVVCWNTMIDGY---------------------------------

Query:  -------------------------------------LKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYI
                                              KCGD+  A+ +F  +  R I SW  MI+G A+ G L  +RK+FD+M E+D + W++M+ G +
Subjt:  -------------------------------------LKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYI

Query:  SVGCYKEALEIFQQMQREETRPGRFILSSVLAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM
             ++AL +FQ+MQ   T+P    +   L+ACS  GA+D G W+H Y+++ S+ L+  LGT+L+DM
Subjt:  SVGCYKEALEIFQQMQREETRPGRFILSSVLAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM

Arabidopsis top hits (e-value, % identity, alignment)
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e-value: 4.5e-49; identity: 29.5%)
Query:  MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSI----TSLQYLSQVHGLVLRSGHFQDHYVSGALVK-CYANPHFGNFDFALKVFSSIPN
        M S S    P S  P  F     +P+S  P    ++   S S+     +LQ L  +H  +++ G    +Y    L++ C  +PHF    +A+ VF +I  
Subjt:  MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSI----TSLQYLSQVHGLVLRSGHFQDHYVSGALVK-CYANPHFGNFDFALKVFSSIPN

Query:  PNVFIWNIVIKGCLENNKAFKAIYFYGRMV-IDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFD-SG
        PN+ IWN + +G   ++    A+  Y  M+ +   PN +T+P + K+C+ ++A +EG+QIHGHV+K G   D+++ ++ I MY   G LEDA K+FD S 
Subjt:  PNVFIWNIVIKGCLENNKAFKAIYFYGRMV-IDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFD-SG

Query:  ESDVVCWNTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSE-------------------------------------
          DVV +  +I GY   G +E A+ LF ++P++++ SWN MI+G A+ G   +A ++F +M +                                     
Subjt:  ESDVVCWNTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSE-------------------------------------

Query:  ---------------------------------RDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSVLAACSSTGAIDQGRWVHAYLKR-
                                         +D ISW++++ GY  +  YKEAL +FQ+M R    P    + S+L AC+  GAID GRW+H Y+ + 
Subjt:  ---------------------------------RDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSVLAACSSTGAIDQGRWVHAYLKR-

Query:  -NSIKLDAVLGTALLDM
           +   + L T+L+DM
Subjt:  -NSIKLDAVLGTALLDM

AT2G22410.1 SLOW GROWTH 1 (e-value: 3.7e-51; identity: 30.43%)
Query:  LQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFIWNIVIKGCLENNKAFKAIYFYGRMV----IDARPNKFTYPTLFKA
        L +L Q+   ++ +G   D + S  L+   A       D+++K+   I NPN+F WN+ I+G  E+    ++   Y +M+     ++RP+ FTYP LFK 
Subjt:  LQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFIWNIVIKGCLENNKAFKAIYFYGRMV----IDARPNKFTYPTLFKA

Query:  CSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFD-SGESDVVCWNTMIDGY---------------------------------
        C+  +    G  I GHV+K  +    H+ +A I M+AS G +E+ARK+FD S   D+V WN +I+GY                                 
Subjt:  CSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFD-SGESDVVCWNTMIDGY---------------------------------

Query:  -------------------------------------LKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYI
                                              KCGD+  A+ +F  +  R I SW  MI+G A+ G L  +RK+FD+M E+D + W++M+ G +
Subjt:  -------------------------------------LKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYI

Query:  SVGCYKEALEIFQQMQREETRPGRFILSSVLAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM
             ++AL +FQ+MQ   T+P    +   L+ACS  GA+D G W+H Y+++ S+ L+  LGT+L+DM
Subjt:  SVGCYKEALEIFQQMQREETRPGRFILSSVLAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM

AT4G18840.1 Pentatricopeptide repeat (PPR-like) superfamily protein (e-value: 1.3e-56; identity: 37.29%)
Query:  SKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYA-NPHFGNFDFALKVFSSIPNPNVFIWNIVIKGCLENNKAFKAIYFYGRMVI-DARPNKFTYPT
        +KS+T +Q   Q H  +L++G F D + +  LV   A NP      +A  + + I +PN F  N VI+    ++    A+  +  M++    P+K+++  
Subjt:  SKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYA-NPHFGNFDFALKVFSSIPNPNVFIWNIVIKGCLENNKAFKAIYFYGRMVI-DARPNKFTYPT

Query:  LFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFD-SGESDVVCWNTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMIN
        + KAC+     +EGRQIHG  +K G+  D+ +++  + +Y   G  E ARK+ D     D V WN+++  YL+ G ++ A+ LF +M  RN+ SWN MI+
Subjt:  LFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFD-SGESDVVCWNTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMIN

Query:  GLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREET-RPGRFILSSVLAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTAL
        G A  G + +A++VFD M  RD +SW++MV  Y  VGCY E LE+F +M  + T +P  F L SVL+AC+S G++ QG WVH Y+ ++ I+++  L TAL
Subjt:  GLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREET-RPGRFILSSVLAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTAL

Query:  LDM
        +DM
Subjt:  LDM

AT5G48910.1 Pentatricopeptide repeat (PPR) superfamily protein (e-value: 1.1e-50; identity: 35.76%)
Query:  SLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANP--HFGNFDFALKVFSSIPNPNVFIWNIVIKGCLEN--NKAFKAIYFYGRMVID--ARPNKFTYPTL
        +++ LSQ+H + ++SG  +D   +  +++  A    H  + D+A K+F+ +P  N F WN +I+G  E+  +KA  AI  +  M+ D    PN+FT+P++
Subjt:  SLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANP--HFGNFDFALKVFSSIPNPNVFIWNIVIKGCLEN--NKAFKAIYFYGRMVID--ARPNKFTYPTL

Query:  FKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSG--ESDVVCWNTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMIN
         KAC+    +QEG+QIHG  +K+G  GD  + S  ++MY   G ++DAR LF     E D+V    M D   + G+               I  WNVMI+
Subjt:  FKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSG--ESDVVCWNTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMIN

Query:  GLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSVLAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALL
        G  + G    AR +FD+M +R  +SW++M+ GY   G +K+A+E+F++M++ + RP    L SVL A S  G+++ G W+H Y + + I++D VLG+AL+
Subjt:  GLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSVLAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALL

Query:  DM
        DM
Subjt:  DM

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e-value: 1.9e-47; identity: 32.53%)
Query:  LSQVHGLVLRSGHFQDHY-VSGALVKCYANPHFGNFDFALKVFSSIPNPNVFIWNIVIKGCLENNKAFKAIYFYGRMVIDARP-NKFTYPTLFKACSVAQ
        L Q+H  +L++G  QD Y ++  L  C ++       +A  VF     P+ F+WN++I+G   +++  +++  Y RM+  + P N +T+P+L KACS   
Subjt:  LSQVHGLVLRSGHFQDHY-VSGALVKCYANPHFGNFDFALKVFSSIPNPNVFIWNIVIKGCLENNKAFKAIYFYGRMVIDARP-NKFTYPTLFKACSVAQ

Query:  AVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCWNTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGD
        A +E  QIH  + K G                               E+DV   N++I+ Y   G+ + A  LF ++P  +  SWN +I G  K GK+  
Subjt:  AVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCWNTMIDGYLKCGDLEAAKGLFAQMPIRNIGSWNVMINGLAKGGKLGD

Query:  ARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSVLAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM
        A  +F +M+E++ ISW++M+ GY+     KEAL++F +MQ  +  P    L++ L+AC+  GA++QG+W+H+YL +  I++D+VLG  L+DM
Subjt:  ARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSVLAACSSTGAIDQGRWVHAYLKRNSIKLDAVLGTALLDM


Sequences
CDS sequence
ATGAGCTCAATCTCCACCTCACACCGCCCTTTTTCCTTGAAACCAGTAGATTTCTCGACAGAGAAGAAGATTCCCACATCAAAACTTCCACAGAAAACAGTTTTGAAGCT
TTTCGACTCAAAATCCATCACTTCTTTGCAATATCTCAGCCAAGTTCATGGACTTGTATTGCGTAGTGGCCATTTCCAAGACCATTACGTCTCTGGCGCGTTGGTGAAGT
GTTATGCAAATCCCCATTTCGGCAATTTCGATTTCGCTTTGAAGGTATTCTCCTCAATTCCAAATCCCAACGTTTTCATTTGGAATATTGTGATTAAAGGGTGTTTAGAG
AACAACAAAGCATTTAAAGCTATTTACTTCTATGGTAGGATGGTTATTGATGCTAGGCCCAATAAATTCACATACCCAACTCTGTTTAAAGCTTGTTCTGTGGCACAAGC
TGTTCAAGAAGGGAGACAAATCCATGGCCATGTGGTGAAACATGGCATTTGTGGTGATATGCATATCAAAAGTGCTGGAATTCAAATGTATGCCTCTTTTGGTGGCTTAG
AGGATGCAAGGAAACTGTTTGATAGTGGGGAATCTGATGTTGTCTGTTGGAATACAATGATTGATGGGTACCTGAAATGTGGGGATCTGGAAGCTGCTAAAGGGTTGTTT
GCTCAAATGCCAATCAGAAACATTGGCTCATGGAATGTGATGATCAATGGTTTAGCTAAGGGTGGGAAGTTGGGAGATGCAAGGAAGGTGTTTGATGAAATGAGTGAAAG
AGATGAAATTTCTTGGAGTTCTATGGTAGATGGTTACATATCAGTAGGTTGTTACAAGGAAGCACTAGAGATTTTCCAGCAAATGCAAAGAGAGGAGACCAGGCCTGGAA
GGTTCATTTTGTCCAGTGTTCTAGCTGCTTGCTCCAGTACTGGAGCCATTGATCAAGGGAGGTGGGTTCATGCTTATCTCAAGAGGAACTCCATTAAATTGGATGCGGTG
TTGGGGACTGCCTTATTGGATATGTTGATCATTTGGCTTCAGCACATAGTTCTGCCATTTTCAATCCCAACTTTGTATGAAGTAGAAAGAAGCACCAATTCAACCTTCAT
CAATCAGTACTCAAAATTAAATCCTAGAATGAAGGAATTACAATACCTGTAA
mRNA sequence
ATGAGCTCAATCTCCACCTCACACCGCCCTTTTTCCTTGAAACCAGTAGATTTCTCGACAGAGAAGAAGATTCCCACATCAAAACTTCCACAGAAAACAGTTTTGAAGCT
TTTCGACTCAAAATCCATCACTTCTTTGCAATATCTCAGCCAAGTTCATGGACTTGTATTGCGTAGTGGCCATTTCCAAGACCATTACGTCTCTGGCGCGTTGGTGAAGT
GTTATGCAAATCCCCATTTCGGCAATTTCGATTTCGCTTTGAAGGTATTCTCCTCAATTCCAAATCCCAACGTTTTCATTTGGAATATTGTGATTAAAGGGTGTTTAGAG
AACAACAAAGCATTTAAAGCTATTTACTTCTATGGTAGGATGGTTATTGATGCTAGGCCCAATAAATTCACATACCCAACTCTGTTTAAAGCTTGTTCTGTGGCACAAGC
TGTTCAAGAAGGGAGACAAATCCATGGCCATGTGGTGAAACATGGCATTTGTGGTGATATGCATATCAAAAGTGCTGGAATTCAAATGTATGCCTCTTTTGGTGGCTTAG
AGGATGCAAGGAAACTGTTTGATAGTGGGGAATCTGATGTTGTCTGTTGGAATACAATGATTGATGGGTACCTGAAATGTGGGGATCTGGAAGCTGCTAAAGGGTTGTTT
GCTCAAATGCCAATCAGAAACATTGGCTCATGGAATGTGATGATCAATGGTTTAGCTAAGGGTGGGAAGTTGGGAGATGCAAGGAAGGTGTTTGATGAAATGAGTGAAAG
AGATGAAATTTCTTGGAGTTCTATGGTAGATGGTTACATATCAGTAGGTTGTTACAAGGAAGCACTAGAGATTTTCCAGCAAATGCAAAGAGAGGAGACCAGGCCTGGAA
GGTTCATTTTGTCCAGTGTTCTAGCTGCTTGCTCCAGTACTGGAGCCATTGATCAAGGGAGGTGGGTTCATGCTTATCTCAAGAGGAACTCCATTAAATTGGATGCGGTG
TTGGGGACTGCCTTATTGGATATGTTGATCATTTGGCTTCAGCACATAGTTCTGCCATTTTCAATCCCAACTTTGTATGAAGTAGAAAGAAGCACCAATTCAACCTTCAT
CAATCAGTACTCAAAATTAAATCCTAGAATGAAGGAATTACAATACCTGTAA
Protein sequence
MSSISTSHRPFSLKPVDFSTEKKIPTSKLPQKTVLKLFDSKSITSLQYLSQVHGLVLRSGHFQDHYVSGALVKCYANPHFGNFDFALKVFSSIPNPNVFIWNIVIKGCLE
NNKAFKAIYFYGRMVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGICGDMHIKSAGIQMYASFGGLEDARKLFDSGESDVVCWNTMIDGYLKCGDLEAAKGLF
AQMPIRNIGSWNVMINGLAKGGKLGDARKVFDEMSERDEISWSSMVDGYISVGCYKEALEIFQQMQREETRPGRFILSSVLAACSSTGAIDQGRWVHAYLKRNSIKLDAV
LGTALLDMLIIWLQHIVLPFSIPTLYEVERSTNSTFINQYSKLNPRMKELQYL
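The CDS and protein records above can be cross-checked by translating the nucleotide sequence. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly; `translate` is a hypothetical helper written for illustration, and unknown codons are reported as `X`.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 24 bases of the CDS listed above.
print(translate("ATGAGCTCAATCTCCACCTCACAC"))
```

To check the whole record, join the CDS lines into one string before calling `translate` and compare the result with the joined protein lines.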