CuGenDBv2

Carg00544 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg00544
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Pentatricopeptide repeat-containing protein
Genome location: Carg_Chr03:5544615..5546890
RNA-Seq Expression: Carg00544
Synteny: Carg00544
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits | e value | % identity
KAG6603868.1 hypothetical protein SDJN03_04477, partial [Cucurbita argyrosperma subsp. sororia] | 6.1e-124 | 85.14
Query:  RGGIGRWGNCRHRFVQLSPPRSLTNRLVLRRRLRFCPASTAPYIKNEHQNFARKPLRVRAPATLESRNSRRCSNRLVLQMPRSSTAPSSPPRNLIGSIDV
        RGGIGRWGNCRHRFVQLSPPRSLTNRLVLRRRLRFCPASTAPYIKNEHQNFARKPLRVRAPATLESRNSRRCSNRLVLQMPRSSTAPSSPPRNLIGSIDV
Subjt:  RGGIGRWGNCRHRFVQLSPPRSLTNRLVLRRRLRFCPASTAPYIKNEHQNFARKPLRVRAPATLESRNSRRCSNRLVLQMPRSSTAPSSPPRNLIGSIDV

Query:  SYLTLHLREWYCTDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIESPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVT
        SYLTLHLREWYCTDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSV                           ALSTVYGFIDEMRSSCGAKPDLVT
Subjt:  SYLTLHLREWYCTDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIESPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVT

Query:  CTILIDNVCNSKDLREATRLVSVLAK--------------EYTEVTEEGEGAKPCYITGLDQQPQQAFGWLVNKVV
        CTILIDNVCNSKDLREATRLVSVLAK              EYTEVTEEGEGAKPCYITGLDQQPQQAFGWLVNKVV
Subjt:  CTILIDNVCNSKDLREATRLVSVLAK--------------EYTEVTEEGEGAKPCYITGLDQQPQQAFGWLVNKVV

KAG7026711.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | 2.4e-43 | 77.42
Query:  TDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCTI
        TDSSL SVRQILNFMV  GFNPD+ TTDIAVRSLCS GL+DEA+E         SP DSYT+NHLVKQLC S++LSTVY FI+EMRSSCGA PDLVT TI
Subjt:  TDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCTI

Query:  LIDNVCNSKDLREATRLVSVLAKE
        LIDNVCN K+LREATRLVSVLAKE
Subjt:  LIDNVCNSKDLREATRLVSVLAKE

KAG7034046.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | 1.3e-147 | 100
Query:  RGGIGRWGNCRHRFVQLSPPRSLTNRLVLRRRLRFCPASTAPYIKNEHQNFARKPLRVRAPATLESRNSRRCSNRLVLQMPRSSTAPSSPPRNLIGSIDV
        RGGIGRWGNCRHRFVQLSPPRSLTNRLVLRRRLRFCPASTAPYIKNEHQNFARKPLRVRAPATLESRNSRRCSNRLVLQMPRSSTAPSSPPRNLIGSIDV
Subjt:  RGGIGRWGNCRHRFVQLSPPRSLTNRLVLRRRLRFCPASTAPYIKNEHQNFARKPLRVRAPATLESRNSRRCSNRLVLQMPRSSTAPSSPPRNLIGSIDV

Query:  SYLTLHLREWYCTDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIESPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVT
        SYLTLHLREWYCTDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIESPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVT
Subjt:  SYLTLHLREWYCTDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIESPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVT

Query:  CTILIDNVCNSKDLREATRLVSVLAKEYTEVTEEGEGAKPCYITGLDQQPQQAFGWLVNKVV
        CTILIDNVCNSKDLREATRLVSVLAKEYTEVTEEGEGAKPCYITGLDQQPQQAFGWLVNKVV
Subjt:  CTILIDNVCNSKDLREATRLVSVLAKEYTEVTEEGEGAKPCYITGLDQQPQQAFGWLVNKVV

XP_022926982.1 pentatricopeptide repeat-containing protein At2g17670 [Cucurbita moschata] | 1.1e-43 | 78.23
Query:  TDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCTI
        TDSSL SVRQILNFMV  GFNPDK TTDIAVRSLCS GL+DEA+E         SP DSYT+NHLVKQLC S++LSTVY FI+EMRSSCGA PDLVT TI
Subjt:  TDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCTI

Query:  LIDNVCNSKDLREATRLVSVLAKE
        LIDNVCN K+LREATRLVSVLAKE
Subjt:  LIDNVCNSKDLREATRLVSVLAKE

XP_023003882.1 pentatricopeptide repeat-containing protein At2g17670-like [Cucurbita maxima] | 3.1e-43 | 77.42
Query:  TDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCTI
        TDSSL SVRQILNFMV +GFNPDK T DIAVRSLCS GL+DEA+E         SP DSYT+NHLVKQLC S++LSTVYGFI EMRSSCGA PDLVT TI
Subjt:  TDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCTI

Query:  LIDNVCNSKDLREATRLVSVLAKE
        LIDNVCN K+LREATRLVSVLA+E
Subjt:  LIDNVCNSKDLREATRLVSVLAKE

TrEMBL top hits | e value | % identity
A0A0A0KHF8 Uncharacterized protein | 3.1e-41 | 75.61
Query:  DSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCTIL
        DS+L SV+QILNFMV NGFNPDKVT D+AVRSLCSVGLVDEA+E         +P D YT+NHLVKQLC S+ALSTVY FI EMRSSCGAKPDLVT TIL
Subjt:  DSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCTIL

Query:  IDNVCNSKDLREATRLVSVLAKE
        IDNVCNS +LREA RLVS+L KE
Subjt:  IDNVCNSKDLREATRLVSVLAKE

A0A1S4DT84 pentatricopeptide repeat-containing protein At2g17670 | 2.2e-42 | 77.24
Query:  DSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCTIL
        DSSL SVR+ILNFMV NGFNPDKVT D+AVRSLCSVGLVDEA+E         +PLD YT+NHLVKQLC S+ALSTVY FI EMRSSCGAKPDLVT TIL
Subjt:  DSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCTIL

Query:  IDNVCNSKDLREATRLVSVLAKE
        IDNVCNS +LREA RLVS+L KE
Subjt:  IDNVCNSKDLREATRLVSVLAKE

A0A6J1BW58 pentatricopeptide repeat-containing protein At2g17670 | 9.7e-43 | 77.05
Query:  SSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCTILI
        SSL SV+QILNFMV+NGFNPDKVTTDIAVRSLCS GL+DEA+E         SP DS+T+NHLVKQLC S+ALSTVYGFIDEMRSS G+KPDLVT TILI
Subjt:  SSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCTILI

Query:  DNVCNSKDLREATRLVSVLAKE
        DNVCN K+LREATRL+SVL +E
Subjt:  DNVCNSKDLREATRLVSVLAKE

A0A6J1EGP9 pentatricopeptide repeat-containing protein At2g17670 | 5.2e-44 | 78.23
Query:  TDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCTI
        TDSSL SVRQILNFMV  GFNPDK TTDIAVRSLCS GL+DEA+E         SP DSYT+NHLVKQLC S++LSTVY FI+EMRSSCGA PDLVT TI
Subjt:  TDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCTI

Query:  LIDNVCNSKDLREATRLVSVLAKE
        LIDNVCN K+LREATRLVSVLAKE
Subjt:  LIDNVCNSKDLREATRLVSVLAKE

A0A6J1KT16 pentatricopeptide repeat-containing protein At2g17670-like | 1.5e-43 | 77.42
Query:  TDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCTI
        TDSSL SVRQILNFMV +GFNPDK T DIAVRSLCS GL+DEA+E         SP DSYT+NHLVKQLC S++LSTVYGFI EMRSSCGA PDLVT TI
Subjt:  TDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCTI

Query:  LIDNVCNSKDLREATRLVSVLAKE
        LIDNVCN K+LREATRLVSVLA+E
Subjt:  LIDNVCNSKDLREATRLVSVLAKE

SwissProt top hits | e value | % identity
Q3ECK2 Pentatricopeptide repeat-containing protein At1g62680, mitochondrial | 5.2e-09 | 29.37
Query:  YCTDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLC-------SVGLVDEAIESPL--DSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTC
        +C    +     IL  M+  G+ PD+VT    V   C       +V LVD+ +E     D   +N ++  LC +K ++  + F  E+    G +P++VT 
Subjt:  YCTDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLC-------SVGLVDEAIESPL--DSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTC

Query:  TILIDNVCNSKDLREATRLVSVLAKE
        T L++ +CNS    +A RL+S + K+
Subjt:  TILIDNVCNSKDLREATRLVSVLAKE

Q84J71 Pentatricopeptide repeat-containing protein At2g17670 | 2.8e-31 | 60
Query:  DSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCTIL
        DSS+ +V ++LN MVNNG  PD+VTTDIAVRSLC  G VDEA +         SP D+YT+N L+K LC  K L  VY F+DEMR     KPDLV+ TIL
Subjt:  DSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCTIL

Query:  IDNVCNSKDLREATRLVSVL
        IDNVCNSK+LREA  LVS L
Subjt:  IDNVCNSKDLREATRLVSVL

Q9CA58 Putative pentatricopeptide repeat-containing protein At1g74580 | 5.2e-09 | 28.35
Query:  YCTDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIES---------PLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTC
        Y T   +++  +IL+ M++NG +PD  T +  +  LC     ++ +E+           + +TFN L++ LC  + L    G ++EM++     PD VT 
Subjt:  YCTDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIES---------PLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTC

Query:  TILIDNVCNSKDLREATRLVSVLAKEY
          LID  C + DL  A  L   + + Y
Subjt:  TILIDNVCNSKDLREATRLVSVLAKEY

Q9LPX2 Pentatricopeptide repeat-containing protein At1g12775, mitochondrial | 6.8e-09 | 25.37
Query:  LREWYCTDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPD
        L + +C ++ L+   Q+++ M++ G +PD +T +I +   C    +D+ +E            ++ T+N LV+  C S  L        EM S    +PD
Subjt:  LREWYCTDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPD

Query:  LVTCTILIDNVCNSKDLREATRLVSVLAKEYTEV
        +V+  IL+D +C++ +L +A  +   + K   E+
Subjt:  LVTCTILIDNVCNSKDLREATRLVSVLAKEYTEV

Q9SHK2 Pentatricopeptide repeat-containing protein At1g06580 | 3.4e-08 | 27.91
Query:  CTDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIESPL---------DSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCT
        C    LD  +++LN +V+ GF P+ VT +  +   C    VD+ ++            D++T+N L +  C +   S     +  M  SCG  PD+ T  
Subjt:  CTDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIESPL---------DSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCT

Query:  ILIDNVCNSKDLREATRLVSVLAKEYTEV
        IL+D +C+   + +A   +  L K  T V
Subjt:  ILIDNVCNSKDLREATRLVSVLAKEYTEV

Arabidopsis top hits | e value | % identity
AT1G12775.1 Pentatricopeptide repeat (PPR) superfamily protein | 4.8e-10 | 25.37
Query:  LREWYCTDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPD
        L + +C ++ L+   Q+++ M++ G +PD +T +I +   C    +D+ +E            ++ T+N LV+  C S  L        EM S    +PD
Subjt:  LREWYCTDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPD

Query:  LVTCTILIDNVCNSKDLREATRLVSVLAKEYTEV
        +V+  IL+D +C++ +L +A  +   + K   E+
Subjt:  LVTCTILIDNVCNSKDLREATRLVSVLAKEYTEV

AT1G62680.1 Pentatricopeptide repeat (PPR) superfamily protein | 3.7e-10 | 29.37
Query:  YCTDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLC-------SVGLVDEAIESPL--DSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTC
        +C    +     IL  M+  G+ PD+VT    V   C       +V LVD+ +E     D   +N ++  LC +K ++  + F  E+    G +P++VT 
Subjt:  YCTDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLC-------SVGLVDEAIESPL--DSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTC

Query:  TILIDNVCNSKDLREATRLVSVLAKE
        T L++ +CNS    +A RL+S + K+
Subjt:  TILIDNVCNSKDLREATRLVSVLAKE

AT1G74580.1 Pentatricopeptide repeat (PPR) superfamily protein | 3.7e-10 | 28.35
Query:  YCTDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIES---------PLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTC
        Y T   +++  +IL+ M++NG +PD  T +  +  LC     ++ +E+           + +TFN L++ LC  + L    G ++EM++     PD VT 
Subjt:  YCTDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIES---------PLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTC

Query:  TILIDNVCNSKDLREATRLVSVLAKEY
          LID  C + DL  A  L   + + Y
Subjt:  TILIDNVCNSKDLREATRLVSVLAKEY

AT2G17670.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 2.0e-32 | 60
Query:  DSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCTIL
        DSS+ +V ++LN MVNNG  PD+VTTDIAVRSLC  G VDEA +         SP D+YT+N L+K LC  K L  VY F+DEMR     KPDLV+ TIL
Subjt:  DSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCTIL

Query:  IDNVCNSKDLREATRLVSVL
        IDNVCNSK+LREA  LVS L
Subjt:  IDNVCNSKDLREATRLVSVL

AT2G17670.2 Tetratricopeptide repeat (TPR)-like superfamily protein | 2.0e-32 | 60
Query:  DSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCTIL
        DSS+ +V ++LN MVNNG  PD+VTTDIAVRSLC  G VDEA +         SP D+YT+N L+K LC  K L  VY F+DEMR     KPDLV+ TIL
Subjt:  DSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIE---------SPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCTIL

Query:  IDNVCNSKDLREATRLVSVL
        IDNVCNSK+LREA  LVS L
Subjt:  IDNVCNSKDLREATRLVSVL


Sequences
CDS sequence
CGCGGGGGAATTGGAAGATGGGGAAATTGTCGCCATCGTTTCGTTCAGCTCTCTCCACCACGGTCGTTAACCAACCGCCTCGTCCTCCGGCGGCGCCTCCGCTTTTGTCC
GGCGAGTACCGCTCCTTATATTAAAAATGAACACCAAAACTTCGCCAGAAAACCTCTTCGCGTCAGAGCTCCGGCCACCCTGGAAAGCCGAAACTCCCGACGGTGTTCAA
ATCGGCTAGTCTTGCAGATGCCAAGAAGCTCCACAGCTCCTTCGTCACCACCACGAAACCTCATCGGATCGATCGACGTTTCATATCTTACTCTCCACCTCCGGGAATGG
TACTGTACTGATTCGTCTCTCGATTCGGTTCGGCAAATTCTCAATTTCATGGTTAACAATGGCTTCAATCCTGACAAGGTAACCACGGATATTGCTGTGCGTTCGCTTTG
TTCGGTAGGTTTGGTTGATGAAGCTATAGAGTCGCCTCTTGATTCTTATACATTTAATCATCTCGTTAAACAACTTTGCATGTCCAAAGCTCTGTCTACTGTTTATGGTT
TTATTGATGAAATGCGTAGTAGCTGTGGCGCGAAGCCAGATCTTGTTACTTGTACTATCTTGATAGATAATGTGTGCAATAGCAAGGATCTACGTGAGGCGACACGGTTG
GTTAGTGTGCTGGCTAAGGAGTACACTGAAGTCACTGAAGAAGGCGAGGGAGCAAAGCCATGCTATATAACTGGCCTGGATCAACAACCTCAACAAGCATTTGGCTGGCT
GGTGAACAAAGTGGTTTAG
mRNA sequence
CGCGGGGGAATTGGAAGATGGGGAAATTGTCGCCATCGTTTCGTTCAGCTCTCTCCACCACGGTCGTTAACCAACCGCCTCGTCCTCCGGCGGCGCCTCCGCTTTTGTCC
GGCGAGTACCGCTCCTTATATTAAAAATGAACACCAAAACTTCGCCAGAAAACCTCTTCGCGTCAGAGCTCCGGCCACCCTGGAAAGCCGAAACTCCCGACGGTGTTCAA
ATCGGCTAGTCTTGCAGATGCCAAGAAGCTCCACAGCTCCTTCGTCACCACCACGAAACCTCATCGGATCGATCGACGTTTCATATCTTACTCTCCACCTCCGGGAATGG
TACTGTACTGATTCGTCTCTCGATTCGGTTCGGCAAATTCTCAATTTCATGGTTAACAATGGCTTCAATCCTGACAAGGTAACCACGGATATTGCTGTGCGTTCGCTTTG
TTCGGTAGGTTTGGTTGATGAAGCTATAGAGTCGCCTCTTGATTCTTATACATTTAATCATCTCGTTAAACAACTTTGCATGTCCAAAGCTCTGTCTACTGTTTATGGTT
TTATTGATGAAATGCGTAGTAGCTGTGGCGCGAAGCCAGATCTTGTTACTTGTACTATCTTGATAGATAATGTGTGCAATAGCAAGGATCTACGTGAGGCGACACGGTTG
GTTAGTGTGCTGGCTAAGGAGTACACTGAAGTCACTGAAGAAGGCGAGGGAGCAAAGCCATGCTATATAACTGGCCTGGATCAACAACCTCAACAAGCATTTGGCTGGCT
GGTGAACAAAGTGGTTTAG
Protein sequence
RGGIGRWGNCRHRFVQLSPPRSLTNRLVLRRRLRFCPASTAPYIKNEHQNFARKPLRVRAPATLESRNSRRCSNRLVLQMPRSSTAPSSPPRNLIGSIDVSYLTLHLREW
YCTDSSLDSVRQILNFMVNNGFNPDKVTTDIAVRSLCSVGLVDEAIESPLDSYTFNHLVKQLCMSKALSTVYGFIDEMRSSCGAKPDLVTCTILIDNVCNSKDLREATRL
VSVLAKEYTEVTEEGEGAKPCYITGLDQQPQQAFGWLVNKVV
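
The protein sequence above can be cross-checked against the CDS by translating it with the standard genetic code. The following is a minimal sketch, not part of the CuGenDB record: the names `CODON_TABLE`, `translate`, `cds_start`, and `cds_end` are illustrative, and the two fragments are copied from the first and last bases of the CDS listed on this page. It assumes the CDS is in frame from its first base, which is consistent with the protein beginning with R (codon CGC) rather than M.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 42 and last 24 bases of the Carg00544 CDS as listed in this record.
cds_start = "CGCGGGGGAATTGGAAGATGGGGAAATTGTCGCCATCGTTTC"
cds_end = "TGGCTGGTGAACAAAGTGGTTTAG"

print(translate(cds_start))  # RGGIGRWGNCRHRF
print(translate(cds_end))    # WLVNKVV
```

Passing the full CDS (with or without line breaks) to `translate` should reproduce the full protein sequence in the same way; the fragments are used here only to keep the example short.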