; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg23501 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg23501
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCarg_Chr14:12782515..12783068
RNA-Seq ExpressionCarg23501
SyntenyCarg23501
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049341.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]1.8e-4969.93Show/hide
Query:  RERYVGTSALIH--------LFARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETE
        R+ +V  S LIH        LFARKVFDGMLER+VVSWTSLICGY  TE S EAVALF QMIEAGV+P SVTMVC ISACA+LKDL LAK++H YI+E+E
Subjt:  RERYVGTSALIH--------LFARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETE

Query:  TRLNTRMVNVIVDMYKQCGEAGAAK-LYDECVDKNLVLCDTIMSNFVCHGVRN
          LNT MVN +VDM+ +CGE GAAK LYDECVDKNLVLC+TIMSN+  HG+ N
Subjt:  TRLNTRMVNVIVDMYKQCGEAGAAK-LYDECVDKNLVLCDTIMSNFVCHGVRN

KAG6582317.1 Serine/threonine-protein kinase STY13, partial [Cucurbita argyrosperma subsp. sororia]7.9e-8599.36Show/hide
Query:  SSAPWSSYEDWFRERYVGTSALIHLFARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYI
        SSAPWSSYEDWFRERYVGTSALIHLFARKVFDGMLERSVVSWTSLICGYPWTES WEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYI
Subjt:  SSAPWSSYEDWFRERYVGTSALIHLFARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYI

Query:  DETETRLNTRMVNVIVDMYKQCGEAGAAKLYDECVDKNLVLCDTIMSNFVCHGVRN
        DETETRLNTRMVNVIVDMYKQCGEAGAAKLYDECVDKNLVLCDTIMSNFVCHGVRN
Subjt:  DETETRLNTRMVNVIVDMYKQCGEAGAAKLYDECVDKNLVLCDTIMSNFVCHGVRN

KAG7018728.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]2.5e-91100Show/hide
Query:  MTQSIPESFTSSAPWSSYEDWFRERYVGTSALIHLFARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDL
        MTQSIPESFTSSAPWSSYEDWFRERYVGTSALIHLFARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDL
Subjt:  MTQSIPESFTSSAPWSSYEDWFRERYVGTSALIHLFARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDL

Query:  TLAKKIHGYIDETETRLNTRMVNVIVDMYKQCGEAGAAKLYDECVDKNLVLCDTIMSNFVCHGVRN
        TLAKKIHGYIDETETRLNTRMVNVIVDMYKQCGEAGAAKLYDECVDKNLVLCDTIMSNFVCHGVRN
Subjt:  TLAKKIHGYIDETETRLNTRMVNVIVDMYKQCGEAGAAKLYDECVDKNLVLCDTIMSNFVCHGVRN

XP_008438644.1 PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At3g22690 [Cucumis melo]1.8e-4969.93Show/hide
Query:  RERYVGTSALIH--------LFARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETE
        R+ +V  S LIH        LFARKVFDGMLER+VVSWTSLICGY  TE S EAVALF QMIEAGV+P SVTMVC ISACA+LKDL LAK++H YI+E+E
Subjt:  RERYVGTSALIH--------LFARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETE

Query:  TRLNTRMVNVIVDMYKQCGEAGAAK-LYDECVDKNLVLCDTIMSNFVCHGVRN
          LNT MVN +VDM+ +CGE GAAK LYDECVDKNLVLC+TIMSN+  HG+ N
Subjt:  TRLNTRMVNVIVDMYKQCGEAGAAK-LYDECVDKNLVLCDTIMSNFVCHGVRN

XP_011650966.1 pentatricopeptide repeat-containing protein At3g22690 [Cucumis sativus]5.9e-4874.44Show/hide
Query:  LFARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTRMVNVIVDMYKQCGE
        LFARKVFDGMLER+VVSWTSLICGY  T+   EAVALF QMIEAGVKP SVTMVC ISACA+LKDL LAK++H YI+E+E  LNT MVN + DM+ +CGE
Subjt:  LFARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTRMVNVIVDMYKQCGE

Query:  AGAAK-LYDECVDKNLVLCDTIMSNFVCHGVRN
         GAAK LYDECVDKNLVLC+TIMSN+  HG+ N
Subjt:  AGAAK-LYDECVDKNLVLCDTIMSNFVCHGVRN

TrEMBL top hitse value%identityAlignment
A0A0A0LA65 DYW_deaminase domain-containing protein2.9e-4874.44Show/hide
Query:  LFARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTRMVNVIVDMYKQCGE
        LFARKVFDGMLER+VVSWTSLICGY  T+   EAVALF QMIEAGVKP SVTMVC ISACA+LKDL LAK++H YI+E+E  LNT MVN + DM+ +CGE
Subjt:  LFARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTRMVNVIVDMYKQCGE

Query:  AGAAK-LYDECVDKNLVLCDTIMSNFVCHGVRN
         GAAK LYDECVDKNLVLC+TIMSN+  HG+ N
Subjt:  AGAAK-LYDECVDKNLVLCDTIMSNFVCHGVRN

A0A1S3AXK0 LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At3g226908.9e-5069.93Show/hide
Query:  RERYVGTSALIH--------LFARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETE
        R+ +V  S LIH        LFARKVFDGMLER+VVSWTSLICGY  TE S EAVALF QMIEAGV+P SVTMVC ISACA+LKDL LAK++H YI+E+E
Subjt:  RERYVGTSALIH--------LFARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETE

Query:  TRLNTRMVNVIVDMYKQCGEAGAAK-LYDECVDKNLVLCDTIMSNFVCHGVRN
          LNT MVN +VDM+ +CGE GAAK LYDECVDKNLVLC+TIMSN+  HG+ N
Subjt:  TRLNTRMVNVIVDMYKQCGEAGAAK-LYDECVDKNLVLCDTIMSNFVCHGVRN

A0A5D3D302 Pentatricopeptide repeat-containing protein8.9e-5069.93Show/hide
Query:  RERYVGTSALIH--------LFARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETE
        R+ +V  S LIH        LFARKVFDGMLER+VVSWTSLICGY  TE S EAVALF QMIEAGV+P SVTMVC ISACA+LKDL LAK++H YI+E+E
Subjt:  RERYVGTSALIH--------LFARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETE

Query:  TRLNTRMVNVIVDMYKQCGEAGAAK-LYDECVDKNLVLCDTIMSNFVCHGVRN
          LNT MVN +VDM+ +CGE GAAK LYDECVDKNLVLC+TIMSN+  HG+ N
Subjt:  TRLNTRMVNVIVDMYKQCGEAGAAK-LYDECVDKNLVLCDTIMSNFVCHGVRN

A0A6J1CCK6 pentatricopeptide repeat-containing protein At3g226904.1e-4770.55Show/hide
Query:  SALIHL--------FARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTRM
        ++LIHL        FARKVFDGMLER+VVSWTSLICGY  T+SS EAVALF QMIEAGV P SVTMVC ISACA+LKD+ LAK+IH YI+E+E  LNT+M
Subjt:  SALIHL--------FARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTRM

Query:  VNVIVDMYKQCGEAGAAK-LYDECVDKNLVLCDTIMSNFVCHGVRN
        VN +VDMY + GE GAAK LY+ CVDKNLVLC+TIMSNF  HG+ N
Subjt:  VNVIVDMYKQCGEAGAAK-LYDECVDKNLVLCDTIMSNFVCHGVRN

A0A6J1GGL0 pentatricopeptide repeat-containing protein At3g226901.2e-4670.14Show/hide
Query:  SALIHLF--------ARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTRM
        ++LIHL+        ARKVFD M ER+VVSWTSLICGYP T+SS EAVALF QMIEAGV+P SVTMVC ISACA+LKDL LA KIH YI E+E  LNT M
Subjt:  SALIHLF--------ARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTRM

Query:  VNVIVDMYKQCGEAGAAK-LYDECVDKNLVLCDTIMSNFVCHGV
        VN +VDMY +CGE GAA+ LY+ECVDKNLVLC+TIMSN   HG+
Subjt:  VNVIVDMYKQCGEAGAAK-LYDECVDKNLVLCDTIMSNFVCHGV

SwissProt top hitse value%identityAlignment
A8MQA3 Pentatricopeptide repeat-containing protein At4g210657.1e-2037.06Show/hide
Query:  SALIHLFAR--------KVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTRM
        ++L+HL+A         KVFD M E+ +V+W S+I G+       EA+AL+ +M   G+KP+  T+V  +SACA++  LTL K++H Y+ +     N   
Subjt:  SALIHLFAR--------KVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTRM

Query:  VNVIVDMYKQCGEAGAAK-LYDECVDKNLVLCDTIMSNFVCHG
         NV++D+Y +CG    AK L+DE VDKN V   +++     +G
Subjt:  VNVIVDMYKQCGEAGAAK-LYDECVDKNLVLCDTIMSNFVCHG

P93011 Pentatricopeptide repeat-containing protein At2g337605.1e-1834.38Show/hide
Query:  ARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTRMVNVIVDMYKQCGEAG
        AR+VFD M E+S+V+W SL+ G+     + EA+ +F QM E+G +P+S T V  +SACA+   ++L   +H YI      LN ++   ++++Y +CG+ G
Subjt:  ARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTRMVNVIVDMYKQCGEAG

Query:  AAK-LYDECVDKNLVLCDTIMSNFVCHG
         A+ ++D+  + N+     ++S +  HG
Subjt:  AAK-LYDECVDKNLVLCDTIMSNFVCHG

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic3.3e-1737.5Show/hide
Query:  ARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTRMVNVIVDMYKQCGEAG
        AR++FDGMLER+VVSW S+I  Y   E+  EA+ +F +M++ GVKP  V+++  + ACA+L DL   + IH    E     N  +VN ++ MY +C E  
Subjt:  ARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTRMVNVIVDMYKQCGEAG

Query:  -AAKLYDECVDKNLVLCDTIMSNFVCHG
         AA ++ +   + LV  + ++  F  +G
Subjt:  -AAKLYDECVDKNLVLCDTIMSNFVCHG

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g226903.0e-2644.83Show/hide
Query:  SALIHLF--------ARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMI-EAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTR
        ++L+H +        ARKVFD M ER+VVSWTS+ICGY   + + +AV LF +M+ +  V P SVTMVC ISACA+L+DL   +K++ +I  +   +N  
Subjt:  SALIHLF--------ARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMI-EAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTR

Query:  MVNVIVDMYKQCGEAGAAK-LYDECVDKNLVLCDTIMSNFVCHGV
        MV+ +VDMY +C     AK L+DE    NL LC+ + SN+V  G+
Subjt:  MVNVIVDMYKQCGEAGAAK-LYDECVDKNLVLCDTIMSNFVCHGV

Q9SR82 Putative pentatricopeptide repeat-containing protein At3g088207.3e-1736.29Show/hide
Query:  ARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTRMVNVIVDMYKQCGEAG
        A K+FD + +RSVV+WT+L  GY  +    EA+ LF +M+E GVKP+S  +V  +SAC  + DL   + I  Y++E E + N+ +   +V++Y +CG+  
Subjt:  ARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTRMVNVIVDMYKQCGEAG

Query:  AAK-LYDECVDKNLVLCDTIMSNF
         A+ ++D  V+K++V   T++  +
Subjt:  AAK-LYDECVDKNLVLCDTIMSNF

Arabidopsis top hitse value%identityAlignment
AT2G33760.1 Pentatricopeptide repeat (PPR) superfamily protein3.6e-1934.38Show/hide
Query:  ARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTRMVNVIVDMYKQCGEAG
        AR+VFD M E+S+V+W SL+ G+     + EA+ +F QM E+G +P+S T V  +SACA+   ++L   +H YI      LN ++   ++++Y +CG+ G
Subjt:  ARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTRMVNVIVDMYKQCGEAG

Query:  AAK-LYDECVDKNLVLCDTIMSNFVCHG
         A+ ++D+  + N+     ++S +  HG
Subjt:  AAK-LYDECVDKNLVLCDTIMSNFVCHG

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)2.1e-2744.83Show/hide
Query:  SALIHLF--------ARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMI-EAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTR
        ++L+H +        ARKVFD M ER+VVSWTS+ICGY   + + +AV LF +M+ +  V P SVTMVC ISACA+L+DL   +K++ +I  +   +N  
Subjt:  SALIHLF--------ARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMI-EAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTR

Query:  MVNVIVDMYKQCGEAGAAK-LYDECVDKNLVLCDTIMSNFVCHGV
        MV+ +VDMY +C     AK L+DE    NL LC+ + SN+V  G+
Subjt:  MVNVIVDMYKQCGEAGAAK-LYDECVDKNLVLCDTIMSNFVCHGV

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification2.1e-2744.83Show/hide
Query:  SALIHLF--------ARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMI-EAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTR
        ++L+H +        ARKVFD M ER+VVSWTS+ICGY   + + +AV LF +M+ +  V P SVTMVC ISACA+L+DL   +K++ +I  +   +N  
Subjt:  SALIHLF--------ARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMI-EAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTR

Query:  MVNVIVDMYKQCGEAGAAK-LYDECVDKNLVLCDTIMSNFVCHGV
        MV+ +VDMY +C     AK L+DE    NL LC+ + SN+V  G+
Subjt:  MVNVIVDMYKQCGEAGAAK-LYDECVDKNLVLCDTIMSNFVCHGV

AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.0e-2137.06Show/hide
Query:  SALIHLFAR--------KVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTRM
        ++L+HL+A         KVFD M E+ +V+W S+I G+       EA+AL+ +M   G+KP+  T+V  +SACA++  LTL K++H Y+ +     N   
Subjt:  SALIHLFAR--------KVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTRM

Query:  VNVIVDMYKQCGEAGAAK-LYDECVDKNLVLCDTIMSNFVCHG
         NV++D+Y +CG    AK L+DE VDKN V   +++     +G
Subjt:  VNVIVDMYKQCGEAGAAK-LYDECVDKNLVLCDTIMSNFVCHG

AT4G21065.2 Tetratricopeptide repeat (TPR)-like superfamily protein5.0e-2137.06Show/hide
Query:  SALIHLFAR--------KVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTRM
        ++L+HL+A         KVFD M E+ +V+W S+I G+       EA+AL+ +M   G+KP+  T+V  +SACA++  LTL K++H Y+ +     N   
Subjt:  SALIHLFAR--------KVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYIDETETRLNTRM

Query:  VNVIVDMYKQCGEAGAAK-LYDECVDKNLVLCDTIMSNFVCHG
         NV++D+Y +CG    AK L+DE VDKN V   +++     +G
Subjt:  VNVIVDMYKQCGEAGAAK-LYDECVDKNLVLCDTIMSNFVCHG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTCAAAGCATACCAGAAAGCTTCACGAGTTCGGCTCCATGGAGCTCTTATGAAGATTGGTTTAGAGAGAGATATGTTGGTACTAGTGCTCTAATACATCTGTTTGC
TCGTAAGGTGTTTGATGGAATGCTTGAGAGAAGTGTTGTTTCGTGGACCTCCTTGATTTGTGGTTATCCTTGGACAGAATCTTCCTGGGAAGCTGTGGCTTTGTTTCTCC
AAATGATCGAGGCAGGTGTTAAACCCGAATCTGTCACAATGGTGTGTGGGATATCGGCTTGCGCCGAGTTGAAAGATCTTACACTGGCCAAGAAAATTCATGGTTACATT
GATGAGACAGAAACGAGGCTTAATACTCGTATGGTGAATGTTATTGTGGATATGTACAAGCAATGTGGAGAAGCTGGTGCTGCAAAGCTATATGATGAATGTGTGGATAA
GAATTTGGTTTTATGTGACACAATCATGTCAAACTTTGTGTGCCATGGTGTGCGGAACTAA
mRNA sequenceShow/hide mRNA sequence
ATGACTCAAAGCATACCAGAAAGCTTCACGAGTTCGGCTCCATGGAGCTCTTATGAAGATTGGTTTAGAGAGAGATATGTTGGTACTAGTGCTCTAATACATCTGTTTGC
TCGTAAGGTGTTTGATGGAATGCTTGAGAGAAGTGTTGTTTCGTGGACCTCCTTGATTTGTGGTTATCCTTGGACAGAATCTTCCTGGGAAGCTGTGGCTTTGTTTCTCC
AAATGATCGAGGCAGGTGTTAAACCCGAATCTGTCACAATGGTGTGTGGGATATCGGCTTGCGCCGAGTTGAAAGATCTTACACTGGCCAAGAAAATTCATGGTTACATT
GATGAGACAGAAACGAGGCTTAATACTCGTATGGTGAATGTTATTGTGGATATGTACAAGCAATGTGGAGAAGCTGGTGCTGCAAAGCTATATGATGAATGTGTGGATAA
GAATTTGGTTTTATGTGACACAATCATGTCAAACTTTGTGTGCCATGGTGTGCGGAACTAA
Protein sequenceShow/hide protein sequence
MTQSIPESFTSSAPWSSYEDWFRERYVGTSALIHLFARKVFDGMLERSVVSWTSLICGYPWTESSWEAVALFLQMIEAGVKPESVTMVCGISACAELKDLTLAKKIHGYI
DETETRLNTRMVNVIVDMYKQCGEAGAAKLYDECVDKNLVLCDTIMSNFVCHGVRN