CuGenDBv2

Sgr029503 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr029503
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic
Genome location: tig00153403:1420592..1432894
RNA-Seq Expression: Sgr029503
Synteny: Sgr029503
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR003425 - CCB3/YggT
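
The genome location string above can be split into a contig name and coordinates for downstream use. The following is a minimal sketch, not part of the database itself. It assumes the coordinates are 1-based and inclusive (the record does not say so), and the names `Location` and `parse_location` are hypothetical.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A contig name with start and end coordinates."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string of the form 'contig:start..end'."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    contig, start, end = match.groups()
    return Location(contig, int(start), int(end))


loc = parse_location("tig00153403:1420592..1432894")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the gene spans 12303 positions on contig tig00153403.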


Homology
GenBank top hits    e value    % identity
KAG6575811.1 Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]    3.7e-88    80.27
Query:  MAAACCSPLTAIRVIGSFFLSPLSIFNPSIIGSSPLIPNYGNSSVSRGFKHHPQRNPNCRFQALQCSSSLLGTVTSSKIHLSLAYATPPLKPAAVYEAAR
        MAA  CS L+AIRVIG              +GSSPLIPN+GNSS  RGF +HPQ NPNCRFQA +CSSSLLG+ TSSKIHL L YATPPLKP     AAR
Subjt:  MAAACCSPLTAIRVIGSFFLSPLSIFNPSIIGSSPLIPNYGNSSVSRGFKHHPQRNPNCRFQALQCSSSLLGTVTSSKIHLSLAYATPPLKPAAVYEAAR

Query:  TIPFALQDASMAASDFMTNMALADLDPATAKLAIGFLGPFLSAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFG
        TIPFALQDAS+AASDFM N++LADLDP TAKLAIGFLGPFLSAFSFLFI RIVMSWYPKLP+GKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFG
Subjt:  TIPFALQDASMAASDFMTNMALADLDPATAKLAIGFLGPFLSAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFG

Query:  LVSFLNEILLGPQGLLVLLSQQI
        LVSFLNEILLGPQGLLVLLSQQ+
Subjt:  LVSFLNEILLGPQGLLVLLSQQI

XP_004136016.2 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X1 [Cucumis sativus]    4.1e-87    78.03
Query:  MAAACCSPLTAIRVIGSFFLSPLSIFNPSIIGSSPLIPNYGNSSVSRGFKHHPQRNPNCRFQALQCSSSLLGTVTSSKIHLSLAYATPPLKPAAVYEAAR
        MAA  CS L++IRVIG              IGSSPL PN+GNS+  RGFK+HPQRNPNC+FQA++CSSSLLG+ TSSK  LSLAY  PPLKPAA YEAAR
Subjt:  MAAACCSPLTAIRVIGSFFLSPLSIFNPSIIGSSPLIPNYGNSSVSRGFKHHPQRNPNCRFQALQCSSSLLGTVTSSKIHLSLAYATPPLKPAAVYEAAR

Query:  TIPFALQDASMAASDFMTNMALADLDPATAKLAIGFLGPFLSAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFG
        TIPF LQDASMAASDF+ +M LADLDP TAKLAI FLGP LS FSFLFI RIVMSWYPKLP+GKFPYVIAYAPTEPLL+ATRKVIPPLGGVDVTPVVWFG
Subjt:  TIPFALQDASMAASDFMTNMALADLDPATAKLAIGFLGPFLSAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFG

Query:  LVSFLNEILLGPQGLLVLLSQQI
        L+SFLNEILLGPQGLLVLLSQQ+
Subjt:  LVSFLNEILLGPQGLLVLLSQQI

XP_022991242.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X1 [Cucurbita maxima]    5.5e-84    78.92
Query:  MAAACCSPLTAIRVIGSFFLSPLSIFNPSIIGSSPLIPNYGNSSVSRGFKHHPQRNPNCRFQALQCSSSLLGTVTSSKIHLSLAYATPPLKPAAVYEAAR
        MAA  CS L+AIRVIG              +GSS LIPN+GNSS+ RGF +HPQ NPNCRFQA +CSSSLLG+ TSSKI L L  ATP LKP     AAR
Subjt:  MAAACCSPLTAIRVIGSFFLSPLSIFNPSIIGSSPLIPNYGNSSVSRGFKHHPQRNPNCRFQALQCSSSLLGTVTSSKIHLSLAYATPPLKPAAVYEAAR

Query:  TIPFALQDASMAASDFMTNMALADLDPATAKLAIGFLGPFLSAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFG
        TIPFALQDASMAASDF  N+ALADLDP TAKLAIGFLGPFLSAFSFLFI RIVMSWYPKLP+GKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFG
Subjt:  TIPFALQDASMAASDFMTNMALADLDPATAKLAIGFLGPFLSAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFG

Query:  LVSFLNEILLGPQGLLVLLSQQI
        LVSFLNEILLGPQGLLVLLSQQ+
Subjt:  LVSFLNEILLGPQGLLVLLSQQI

XP_023548987.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo]    2.8e-88    80.72
Query:  MAAACCSPLTAIRVIGSFFLSPLSIFNPSIIGSSPLIPNYGNSSVSRGFKHHPQRNPNCRFQALQCSSSLLGTVTSSKIHLSLAYATPPLKPAAVYEAAR
        MAA  CS L+AIRVIG              +GSSPLIPN+GNSS  RGF +HPQ NPNCRFQA +CSSSLLG+ TSSKIHL L YATPPLKP     AAR
Subjt:  MAAACCSPLTAIRVIGSFFLSPLSIFNPSIIGSSPLIPNYGNSSVSRGFKHHPQRNPNCRFQALQCSSSLLGTVTSSKIHLSLAYATPPLKPAAVYEAAR

Query:  TIPFALQDASMAASDFMTNMALADLDPATAKLAIGFLGPFLSAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFG
        TIPFALQDASMAASDFM N+ALADLDP TAKLAIG LGPFLSAFSFLFI RIVMSWYPKLP+GKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFG
Subjt:  TIPFALQDASMAASDFMTNMALADLDPATAKLAIGFLGPFLSAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFG

Query:  LVSFLNEILLGPQGLLVLLSQQI
        LVSFLNEILLGPQGLLVLLSQQ+
Subjt:  LVSFLNEILLGPQGLLVLLSQQI

XP_038896182.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X1 [Benincasa hispida]    1.4e-90    81.9
Query:  AACCSPLTAIRVIGSFFLSPLSIFNPSIIGSSPLIPNYGNSSVSRGFKHHPQRNPNCRFQALQCSSSLLGTVTSSKIHLSLAYATPPLKPAAVYEAARTI
        AA CS L+ IRV                IGSSPLIPNYGNSS SRGFK+HPQRNPNCRFQA +CSSS+L + T+SK+HLSLAYAT PLKPAA YEAARTI
Subjt:  AACCSPLTAIRVIGSFFLSPLSIFNPSIIGSSPLIPNYGNSSVSRGFKHHPQRNPNCRFQALQCSSSLLGTVTSSKIHLSLAYATPPLKPAAVYEAARTI

Query:  PFALQDASMAASDFMTNMALADLDPATAKLAIGFLGPFLSAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLV
        PFALQDASM+ASDFM N+ALADLDP  AKLAIGFLGPFLSAFSFLFI RIVMSWYPKLP+GKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLV
Subjt:  PFALQDASMAASDFMTNMALADLDPATAKLAIGFLGPFLSAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLV

Query:  SFLNEILLGPQGLLVLLSQQI
        SFLNEILLGPQGLLVLLSQQ+
Subjt:  SFLNEILLGPQGLLVLLSQQI

TrEMBL top hits    e value    % identity
A0A0A0K9A7 Uncharacterized protein    2.0e-87    78.03
Query:  MAAACCSPLTAIRVIGSFFLSPLSIFNPSIIGSSPLIPNYGNSSVSRGFKHHPQRNPNCRFQALQCSSSLLGTVTSSKIHLSLAYATPPLKPAAVYEAAR
        MAA  CS L++IRVIG              IGSSPL PN+GNS+  RGFK+HPQRNPNC+FQA++CSSSLLG+ TSSK  LSLAY  PPLKPAA YEAAR
Subjt:  MAAACCSPLTAIRVIGSFFLSPLSIFNPSIIGSSPLIPNYGNSSVSRGFKHHPQRNPNCRFQALQCSSSLLGTVTSSKIHLSLAYATPPLKPAAVYEAAR

Query:  TIPFALQDASMAASDFMTNMALADLDPATAKLAIGFLGPFLSAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFG
        TIPF LQDASMAASDF+ +M LADLDP TAKLAI FLGP LS FSFLFI RIVMSWYPKLP+GKFPYVIAYAPTEPLL+ATRKVIPPLGGVDVTPVVWFG
Subjt:  TIPFALQDASMAASDFMTNMALADLDPATAKLAIGFLGPFLSAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFG

Query:  LVSFLNEILLGPQGLLVLLSQQI
        L+SFLNEILLGPQGLLVLLSQQ+
Subjt:  LVSFLNEILLGPQGLLVLLSQQI

A0A6J1CTL8 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X1    1.8e-80    88.95
Query:  SSVSRGFKHHPQRNPNCRFQALQCSSSLLG-TVTSSKIHLSLAYATPPLKPAAVYEAARTIPFALQDASMAASDFMTNMALADLDPATAKLAIGFLGPFL
        +S+ RGFKHHPQRNPN RFQALQCSSSLLG TVTSSK+ L LA ATPPLKPAA +E  RT PFALQDASMAASDF  NMALADLDPATAKLAIGFLGPFL
Subjt:  SSVSRGFKHHPQRNPNCRFQALQCSSSLLG-TVTSSKIHLSLAYATPPLKPAAVYEAARTIPFALQDASMAASDFMTNMALADLDPATAKLAIGFLGPFL

Query:  SAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ
        SAFSFLFI+RIVMSWYPKLP+GKFPYV+AYAPTEPLLI TRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ
Subjt:  SAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ

A0A6J1CUI3 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X2    1.8e-80    88.95
Query:  SSVSRGFKHHPQRNPNCRFQALQCSSSLLG-TVTSSKIHLSLAYATPPLKPAAVYEAARTIPFALQDASMAASDFMTNMALADLDPATAKLAIGFLGPFL
        +S+ RGFKHHPQRNPN RFQALQCSSSLLG TVTSSK+ L LA ATPPLKPAA +E  RT PFALQDASMAASDF  NMALADLDPATAKLAIGFLGPFL
Subjt:  SSVSRGFKHHPQRNPNCRFQALQCSSSLLG-TVTSSKIHLSLAYATPPLKPAAVYEAARTIPFALQDASMAASDFMTNMALADLDPATAKLAIGFLGPFL

Query:  SAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ
        SAFSFLFI+RIVMSWYPKLP+GKFPYV+AYAPTEPLLI TRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ
Subjt:  SAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ

A0A6J1CUU2 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X3    1.1e-80    88.46
Query:  SSVSRGFKHHPQRNPNCRFQALQCSSSLLG-TVTSSKIHLSLAYATPPLKPAAVYEAARTIPFALQDASMAASDFMTNMALADLDPATAKLAIGFLGPFL
        +S+ RGFKHHPQRNPN RFQALQCSSSLLG TVTSSK+ L LA ATPPLKPAA +E  RT PFALQDASMAASDF  NMALADLDPATAKLAIGFLGPFL
Subjt:  SSVSRGFKHHPQRNPNCRFQALQCSSSLLG-TVTSSKIHLSLAYATPPLKPAAVYEAARTIPFALQDASMAASDFMTNMALADLDPATAKLAIGFLGPFL

Query:  SAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQI
        SAFSFLFI+RIVMSWYPKLP+GKFPYV+AYAPTEPLLI TRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ+
Subjt:  SAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQI

A0A6J1JL88 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X1    2.7e-84    78.92
Query:  MAAACCSPLTAIRVIGSFFLSPLSIFNPSIIGSSPLIPNYGNSSVSRGFKHHPQRNPNCRFQALQCSSSLLGTVTSSKIHLSLAYATPPLKPAAVYEAAR
        MAA  CS L+AIRVIG              +GSS LIPN+GNSS+ RGF +HPQ NPNCRFQA +CSSSLLG+ TSSKI L L  ATP LKP     AAR
Subjt:  MAAACCSPLTAIRVIGSFFLSPLSIFNPSIIGSSPLIPNYGNSSVSRGFKHHPQRNPNCRFQALQCSSSLLGTVTSSKIHLSLAYATPPLKPAAVYEAAR

Query:  TIPFALQDASMAASDFMTNMALADLDPATAKLAIGFLGPFLSAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFG
        TIPFALQDASMAASDF  N+ALADLDP TAKLAIGFLGPFLSAFSFLFI RIVMSWYPKLP+GKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFG
Subjt:  TIPFALQDASMAASDFMTNMALADLDPATAKLAIGFLGPFLSAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFG

Query:  LVSFLNEILLGPQGLLVLLSQQI
        LVSFLNEILLGPQGLLVLLSQQ+
Subjt:  LVSFLNEILLGPQGLLVLLSQQI

SwissProt top hits    e value    % identity
Q8RWM7 Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic    3.4e-44    74.22
Query:  VYEAARTIPFALQDASMAASDFMTNMALADLDPATAKLAIGFLGPFLSAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVT
        + EAA T     Q  S+  S+ + N++LADLDP TAKLAIG LGP LSAF FLFILRIVMSWYPKLP+ KFPYV+AYAPTEP+L+ TRKVIPPL GVDVT
Subjt:  VYEAARTIPFALQDASMAASDFMTNMALADLDPATAKLAIGFLGPFLSAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVT

Query:  PVVWFGLVSFLNEILLGPQGLLVLLSQQ
        PVVWFGLVSFL+EIL+GPQGLLVL+SQQ
Subjt:  PVVWFGLVSFLNEILLGPQGLLVLLSQQ

Arabidopsis top hits    e value    % identity
AT5G36120.1 cofactor assembly, complex C (B6F)    2.4e-45    74.22
Query:  VYEAARTIPFALQDASMAASDFMTNMALADLDPATAKLAIGFLGPFLSAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVT
        + EAA T     Q  S+  S+ + N++LADLDP TAKLAIG LGP LSAF FLFILRIVMSWYPKLP+ KFPYV+AYAPTEP+L+ TRKVIPPL GVDVT
Subjt:  VYEAARTIPFALQDASMAASDFMTNMALADLDPATAKLAIGFLGPFLSAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVT

Query:  PVVWFGLVSFLNEILLGPQGLLVLLSQQ
        PVVWFGLVSFL+EIL+GPQGLLVL+SQQ
Subjt:  PVVWFGLVSFLNEILLGPQGLLVLLSQQ


Sequences
CDS sequence
ATGGCCGCCGCCTGCTGCTCCCCTCTCACCGCCATTCGAGTAATAGGTTCTTTCTTTCTTTCTCCTCTTTCAATTTTCAATCCTAGTATAATAGGATCGTCCCCT
TTGATTCCTAACTATGGAAATTCAAGCGTCTCTAGAGGCTTCAAGCACCATCCTCAAAGAAATCCAAACTGCAGATTCCAGGCACTCCAATGTAGCTCATCTTTG
TTGGGTACTGTTACCTCTTCCAAGATTCATCTGTCATTAGCCTATGCCACCCCTCCATTAAAGCCAGCTGCTGTATATGAAGCTGCAAGGACTATCCCCTTTGCC
CTGCAAGATGCATCGATGGCTGCCTCGGATTTCATGACAAACATGGCCCTGGCCGACCTCGACCCAGCAACAGCAAAGCTCGCTATCGGCTTTCTGGGGCCATTT
CTCTCTGCATTTTCGTTTCTGTTTATCTTGAGAATAGTGATGTCTTGGTATCCAAAGTTGCCTCTGGGAAAGTTTCCATATGTTATAGCCTATGCCCCCACTGAA
CCACTTCTAATTGCAACAAGGAAGGTGATCCCCCCTCTCGGCGGAGTTGACGTAACGCCAGTCGTCTGGTTCGGATTGGTTAGTTTCCTCAACGAGATATTGCTT
GGTCCCCAAGGGCTGCTTGTCCTCCTTTCTCAACAGATTTTCAGTAACATTAACGAATCTGGGTTTGAACTCAAGTTAATTGACTGGTTGAGTCGGGCATTTCTT
CAAACATTTGGTGGGACAATTTGGACATTAACTTCGCAAGAGATTCAAGTAGAGAAAGTAGACTACCGAAAGAACTATATCGGATCAAAAGAGAAGAACAGGGCA
AACAGTGGCTGGTCGAAGTGCATGAAATCAGAAAAGTACCTCTTTTCCGCGAAAATGGAGGAAAATTTTGGGAGACGCTCTGCAGAAAAATTCTATCCACAGGCT
AAAATCAGGGAGGTGCGTTTGATTTATTTACAGCGACTTGTCATTTTTCAGTTCTCTTATCAGCAACCGGCTGCTGCAGAATATAAGGAGATTGACAGTTCTCCA
AGGGAGAGAGCTCTAGGAAAGCTCTGCACTCCCTCAGCTCCCTCTTGTCTTCTACCTCTGAAGAAGACAGTGTTGCCAAGAAGTAGCCATTTGCATCAGTTGATT
CACTCAGAAAGGAGTGTAGAAGAAAGGTCCAACGGAATACAACTTCATCTTGTTGTTCGCGAGAATGCGAAAAGAACGAAGCAGAGCACCCGCGACGCCATTGTT
GATGTTGGTGACAATGTCGCCAAGAAGTACCCATGCTCGTCCGTTGGAAAGCTAGAGAAGGAGAAGGGAGCCGCTTCGTTGCCTTTCTCGTTCAAAGCTAAGCAT
CGGAGGAAGAAACGGCGCCGGCGGCCAGCAGCCACAGCAGCAGCACCGGAGTTGCAGAGAGTTGCCTTGCGAGAGCCATCTGATCTGTATAACTTTGGAGCTGCG
AAGAATTATTGTTGGTTTGGGGGTTTTATAGAGAGATTGAATACTTGGACGCACGACGCCGGAGGAGGGCATGATGATGATGCCGACAGAAACAGCTAA
mRNA sequence
ATGGCCGCCGCCTGCTGCTCCCCTCTCACCGCCATTCGAGTAATAGGTTCTTTCTTTCTTTCTCCTCTTTCAATTTTCAATCCTAGTATAATAGGATCGTCCCCT
TTGATTCCTAACTATGGAAATTCAAGCGTCTCTAGAGGCTTCAAGCACCATCCTCAAAGAAATCCAAACTGCAGATTCCAGGCACTCCAATGTAGCTCATCTTTG
TTGGGTACTGTTACCTCTTCCAAGATTCATCTGTCATTAGCCTATGCCACCCCTCCATTAAAGCCAGCTGCTGTATATGAAGCTGCAAGGACTATCCCCTTTGCC
CTGCAAGATGCATCGATGGCTGCCTCGGATTTCATGACAAACATGGCCCTGGCCGACCTCGACCCAGCAACAGCAAAGCTCGCTATCGGCTTTCTGGGGCCATTT
CTCTCTGCATTTTCGTTTCTGTTTATCTTGAGAATAGTGATGTCTTGGTATCCAAAGTTGCCTCTGGGAAAGTTTCCATATGTTATAGCCTATGCCCCCACTGAA
CCACTTCTAATTGCAACAAGGAAGGTGATCCCCCCTCTCGGCGGAGTTGACGTAACGCCAGTCGTCTGGTTCGGATTGGTTAGTTTCCTCAACGAGATATTGCTT
GGTCCCCAAGGGCTGCTTGTCCTCCTTTCTCAACAGATTTTCAGTAACATTAACGAATCTGGGTTTGAACTCAAGTTAATTGACTGGTTGAGTCGGGCATTTCTT
CAAACATTTGGTGGGACAATTTGGACATTAACTTCGCAAGAGATTCAAGTAGAGAAAGTAGACTACCGAAAGAACTATATCGGATCAAAAGAGAAGAACAGGGCA
AACAGTGGCTGGTCGAAGTGCATGAAATCAGAAAAGTACCTCTTTTCCGCGAAAATGGAGGAAAATTTTGGGAGACGCTCTGCAGAAAAATTCTATCCACAGGCT
AAAATCAGGGAGGTGCGTTTGATTTATTTACAGCGACTTGTCATTTTTCAGTTCTCTTATCAGCAACCGGCTGCTGCAGAATATAAGGAGATTGACAGTTCTCCA
AGGGAGAGAGCTCTAGGAAAGCTCTGCACTCCCTCAGCTCCCTCTTGTCTTCTACCTCTGAAGAAGACAGTGTTGCCAAGAAGTAGCCATTTGCATCAGTTGATT
CACTCAGAAAGGAGTGTAGAAGAAAGGTCCAACGGAATACAACTTCATCTTGTTGTTCGCGAGAATGCGAAAAGAACGAAGCAGAGCACCCGCGACGCCATTGTT
GATGTTGGTGACAATGTCGCCAAGAAGTACCCATGCTCGTCCGTTGGAAAGCTAGAGAAGGAGAAGGGAGCCGCTTCGTTGCCTTTCTCGTTCAAAGCTAAGCAT
CGGAGGAAGAAACGGCGCCGGCGGCCAGCAGCCACAGCAGCAGCACCGGAGTTGCAGAGAGTTGCCTTGCGAGAGCCATCTGATCTGTATAACTTTGGAGCTGCG
AAGAATTATTGTTGGTTTGGGGGTTTTATAGAGAGATTGAATACTTGGACGCACGACGCCGGAGGAGGGCATGATGATGATGCCGACAGAAACAGCTAA
Protein sequence
MAAACCSPLTAIRVIGSFFLSPLSIFNPSIIGSSPLIPNYGNSSVSRGFKHHPQRNPNCRFQALQCSSSLLGTVTSSKIHLSLAYATPPLKPAAVYEAARTIPFA
LQDASMAASDFMTNMALADLDPATAKLAIGFLGPFLSAFSFLFILRIVMSWYPKLPLGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILL
GPQGLLVLLSQQIFSNINESGFELKLIDWLSRAFLQTFGGTIWTLTSQEIQVEKVDYRKNYIGSKEKNRANSGWSKCMKSEKYLFSAKMEENFGRRSAEKFYPQA
KIREVRLIYLQRLVIFQFSYQQPAAAEYKEIDSSPRERALGKLCTPSAPSCLLPLKKTVLPRSSHLHQLIHSERSVEERSNGIQLHLVVRENAKRTKQSTRDAIV
DVGDNVAKKYPCSSVGKLEKEKGAASLPFSFKAKHRRKKRRRRPAATAAAPELQRVALREPSDLYNFGAAKNYCWFGGFIERLNTWTHDAGGGHDDDADRNS
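
The listed protein sequence can be checked against the CDS by translating it with the standard genetic code. The following is a minimal sketch under that assumption (the record does not state which translation table was used). The function name `translate` is hypothetical, and only short in-frame excerpts from the start and end of the CDS are used here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS listed above.
print(translate("ATGGCCGCCGCCTGCTGCTCCCCTCTCACC"))
print(translate("GACAGAAACAGCTAA"))
```

The two excerpts translate to `MAAACCSPLT` and `DRNS`, which match the first and last residues of the protein sequence. Passing the full CDS text to `translate` allows the same comparison over the whole record.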