CuGenDBv2

Lsi02G024170 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi02G024170
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic
Genome location: chr02:30791004..30792809
RNA-Seq Expression: Lsi02G024170
Synteny: Lsi02G024170
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR003425 - CCB3/YggT
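
The genome location field above follows a `chromosome:start..end` pattern. A minimal sketch of reading it programmatically is given here; the names `Location` and `parse_location` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval as written on the gene page."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Split a 'chrom:start..end' string into its three parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("chr02:30791004..30792809")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the locus spans 1806 bp, which is longer than the 654-nt CDS listed at the end of this page.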


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6575811.1 Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]; e-value: 1.4e-89; identity: 84.65%
Query:  AAAACSSFSSIRFLLNPCDLGIGSSPLIPNYGNSSFSRGFKYHSQRNPNCRGQAIKCSSSLLDSFTCSKIRLSLAYATPPLKPAAAYEAARTIPFALQDA
        AA ACSS S+IR       +G+GSSPLIPN+GNSSF RGF YH Q NPNCR QA KCSSSLL SFT SKI L L YATPPLKP     AARTIPFALQDA
Subjt:  AAAACSSFSSIRFLLNPCDLGIGSSPLIPNYGNSSFSRGFKYHSQRNPNCRGQAIKCSSSLLDSFTCSKIRLSLAYATPPLKPAAAYEAARTIPFALQDA

Query:  SMSASDFMNNVSLADLDPGMAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEIL
        S++ASDFMNNVSLADLDPG AKLAIG LGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEIL
Subjt:  SMSASDFMNNVSLADLDPGMAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEIL

Query:  LGPQGLLVLLSQQVS
        LGPQGLLVLLSQQVS
Subjt:  LGPQGLLVLLSQQVS

XP_004136016.2 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X1 [Cucumis sativus]; e-value: 8.1e-90; identity: 83.26%
Query:  AAAACSSFSSIRFLLNPCDLGIGSSPLIPNYGNSSFSRGFKYHSQRNPNCRGQAIKCSSSLLDSFTCSKIRLSLAYATPPLKPAAAYEAARTIPFALQDA
        AA ACSS SSIR       +GIGSSPL PN+GNS+F RGFKYH QRNPNC+ QAIKCSSSLL SFT SK  LSLAY  PPLKPAAAYEAARTIPF LQDA
Subjt:  AAAACSSFSSIRFLLNPCDLGIGSSPLIPNYGNSSFSRGFKYHSQRNPNCRGQAIKCSSSLLDSFTCSKIRLSLAYATPPLKPAAAYEAARTIPFALQDA

Query:  SMSASDFMNNVSLADLDPGMAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEIL
        SM+ASDF+N+++LADLDPG AKLAI  LGP LS FSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLL+ATRKVIPPLGGVDVTPVVWFGL+SFLNEIL
Subjt:  SMSASDFMNNVSLADLDPGMAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEIL

Query:  LGPQGLLVLLSQQVS
        LGPQGLLVLLSQQVS
Subjt:  LGPQGLLVLLSQQVS

XP_023548987.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo]; e-value: 3.7e-90; identity: 85.12%
Query:  AAAACSSFSSIRFLLNPCDLGIGSSPLIPNYGNSSFSRGFKYHSQRNPNCRGQAIKCSSSLLDSFTCSKIRLSLAYATPPLKPAAAYEAARTIPFALQDA
        AA ACSS S+IR       +G+GSSPLIPN+GNSSF RGF YH Q NPNCR QA KCSSSLL SFT SKI L L YATPPLKP     AARTIPFALQDA
Subjt:  AAAACSSFSSIRFLLNPCDLGIGSSPLIPNYGNSSFSRGFKYHSQRNPNCRGQAIKCSSSLLDSFTCSKIRLSLAYATPPLKPAAAYEAARTIPFALQDA

Query:  SMSASDFMNNVSLADLDPGMAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEIL
        SM+ASDFMNNV+LADLDPG AKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEIL
Subjt:  SMSASDFMNNVSLADLDPGMAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEIL

Query:  LGPQGLLVLLSQQVS
        LGPQGLLVLLSQQVS
Subjt:  LGPQGLLVLLSQQVS

XP_038896182.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X1 [Benincasa hispida]; e-value: 9.0e-97; identity: 90.19%
Query:  AAACSSFSSIRFLLNPCDLGIGSSPLIPNYGNSSFSRGFKYHSQRNPNCRGQAIKCSSSLLDSFTCSKIRLSLAYATPPLKPAAAYEAARTIPFALQDAS
        AAACSS S IR         IGSSPLIPNYGNSSFSRGFKYH QRNPNCR QA KCSSS+LDSFT SK+ LSLAYAT PLKPAAAYEAARTIPFALQDAS
Subjt:  AAACSSFSSIRFLLNPCDLGIGSSPLIPNYGNSSFSRGFKYHSQRNPNCRGQAIKCSSSLLDSFTCSKIRLSLAYATPPLKPAAAYEAARTIPFALQDAS

Query:  MSASDFMNNVSLADLDPGMAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILL
        MSASDFMNNV+LADLDPGMAKLAIG LGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILL
Subjt:  MSASDFMNNVSLADLDPGMAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILL

Query:  GPQGLLVLLSQQVS
        GPQGLLVLLSQQVS
Subjt:  GPQGLLVLLSQQVS

XP_038896183.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X2 [Benincasa hispida]; e-value: 2.1e-85; identity: 82.71%
Query:  AAACSSFSSIRFLLNPCDLGIGSSPLIPNYGNSSFSRGFKYHSQRNPNCRGQAIKCSSSLLDSFTCSKIRLSLAYATPPLKPAAAYEAARTIPFALQDAS
        AAACSS S IR                         RGFKYH QRNPNCR QA KCSSS+LDSFT SK+ LSLAYAT PLKPAAAYEAARTIPFALQDAS
Subjt:  AAACSSFSSIRFLLNPCDLGIGSSPLIPNYGNSSFSRGFKYHSQRNPNCRGQAIKCSSSLLDSFTCSKIRLSLAYATPPLKPAAAYEAARTIPFALQDAS

Query:  MSASDFMNNVSLADLDPGMAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILL
        MSASDFMNNV+LADLDPGMAKLAIG LGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILL
Subjt:  MSASDFMNNVSLADLDPGMAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILL

Query:  GPQGLLVLLSQQVS
        GPQGLLVLLSQQVS
Subjt:  GPQGLLVLLSQQVS

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0K9A7 Uncharacterized protein; e-value: 3.9e-90; identity: 83.26%
Query:  AAAACSSFSSIRFLLNPCDLGIGSSPLIPNYGNSSFSRGFKYHSQRNPNCRGQAIKCSSSLLDSFTCSKIRLSLAYATPPLKPAAAYEAARTIPFALQDA
        AA ACSS SSIR       +GIGSSPL PN+GNS+F RGFKYH QRNPNC+ QAIKCSSSLL SFT SK  LSLAY  PPLKPAAAYEAARTIPF LQDA
Subjt:  AAAACSSFSSIRFLLNPCDLGIGSSPLIPNYGNSSFSRGFKYHSQRNPNCRGQAIKCSSSLLDSFTCSKIRLSLAYATPPLKPAAAYEAARTIPFALQDA

Query:  SMSASDFMNNVSLADLDPGMAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEIL
        SM+ASDF+N+++LADLDPG AKLAI  LGP LS FSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLL+ATRKVIPPLGGVDVTPVVWFGL+SFLNEIL
Subjt:  SMSASDFMNNVSLADLDPGMAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEIL

Query:  LGPQGLLVLLSQQVS
        LGPQGLLVLLSQQVS
Subjt:  LGPQGLLVLLSQQVS

A0A1S3BSI7 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic; e-value: 4.8e-80; identity: 78.34%
Query:  MAAAAACSSFSSIRFLLNPCDLGIGSSPLIPNYGNSSFSRGFKYHSQRNPNCRGQAIKCSSSLLDSFTCSKIRLSLAYATPPLKPAAAYEAARTIPFALQ
        MAA AACSS SSIR                         RGFKYH QRNPN + QAIKCSSSLL SFT SKI  SLAY  PPLKPAAAYEAARTIPFALQ
Subjt:  MAAAAACSSFSSIRFLLNPCDLGIGSSPLIPNYGNSSFSRGFKYHSQRNPNCRGQAIKCSSSLLDSFTCSKIRLSLAYATPPLKPAAAYEAARTIPFALQ

Query:  DASMSASDFMNNVSLADLDPGMAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNE
        DASM+ASDF+N+++LADLDPG AKLAI  LGP LS FSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNE
Subjt:  DASMSASDFMNNVSLADLDPGMAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNE

Query:  ILLGPQGLLVLLSQQVS
        ILLGPQGLLVLLSQQVS
Subjt:  ILLGPQGLLVLLSQQVS

A0A5A7VQJ0 Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3; e-value: 2.0e-78; identity: 88.76%
Query:  RGFKYHSQRNPNCRGQAIKCSSSLLDSFTCSKIRLSLAYATPPLKPAAAYEAARTIPFALQDASMSASDFMNNVSLADLDPGMAKLAIGLLGPFLSAFSF
        RGFKYH QRNPN + QAIKCSSSLL SFT SKI  SLAY  PPLKPAAAYEAARTIPFALQDASM+ASDF+N+++LADLDPG AKLAI  LGP LS FSF
Subjt:  RGFKYHSQRNPNCRGQAIKCSSSLLDSFTCSKIRLSLAYATPPLKPAAAYEAARTIPFALQDASMSASDFMNNVSLADLDPGMAKLAIGLLGPFLSAFSF

Query:  LFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
        LFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
Subjt:  LFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS

A0A6J1GQG0 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic; e-value: 1.2e-78; identity: 78.14%
Query:  AAAACSSFSSIRFLLNPCDLGIGSSPLIPNYGNSSFSRGFKYHSQRNPNCRGQAIKCSSSLLDSFTCSKIRLSLAYATPPLKPAAAYEAARTIPFALQDA
        AA ACSS S+IR                         RGF YH Q NPNCR QA KCSSSLL SFT SKI L L YATPPLKP     AARTIPFALQDA
Subjt:  AAAACSSFSSIRFLLNPCDLGIGSSPLIPNYGNSSFSRGFKYHSQRNPNCRGQAIKCSSSLLDSFTCSKIRLSLAYATPPLKPAAAYEAARTIPFALQDA

Query:  SMSASDFMNNVSLADLDPGMAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEIL
        S++ASDFMNNVSLADLDPG AKLAIG LGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEIL
Subjt:  SMSASDFMNNVSLADLDPGMAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEIL

Query:  LGPQGLLVLLSQQVS
        LGPQGLLVLLSQQVS
Subjt:  LGPQGLLVLLSQQVS

A0A6J1JL88 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X1; e-value: 5.0e-85; identity: 82.33%
Query:  AAAACSSFSSIRFLLNPCDLGIGSSPLIPNYGNSSFSRGFKYHSQRNPNCRGQAIKCSSSLLDSFTCSKIRLSLAYATPPLKPAAAYEAARTIPFALQDA
        AA ACSS S+IR       +G+GSS LIPN+GNSS  RGF YH Q NPNCR QA KCSSSLL SFT SKI L L  ATP LKP     AARTIPFALQDA
Subjt:  AAAACSSFSSIRFLLNPCDLGIGSSPLIPNYGNSSFSRGFKYHSQRNPNCRGQAIKCSSSLLDSFTCSKIRLSLAYATPPLKPAAAYEAARTIPFALQDA

Query:  SMSASDFMNNVSLADLDPGMAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEIL
        SM+ASDF NNV+LADLDPG AKLAIG LGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEIL
Subjt:  SMSASDFMNNVSLADLDPGMAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEIL

Query:  LGPQGLLVLLSQQVS
        LGPQGLLVLLSQQVS
Subjt:  LGPQGLLVLLSQQVS

SwissProt top hits (e-value, % identity, alignment)
Q8RWM7 Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic; e-value: 1.3e-45; identity: 76.98%
Query:  EAARTIPFALQDASMSASDFMNNVSLADLDPGMAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPV
        EAA T     Q  S+S S  + N+SLADLDPG AKLAIG+LGP LSAF FLFI RIVMSWYPKLPV KFPYV+AYAPTEP+L+ TRKVIPPL GVDVTPV
Subjt:  EAARTIPFALQDASMSASDFMNNVSLADLDPGMAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPV

Query:  VWFGLVSFLNEILLGPQGLLVLLSQQ
        VWFGLVSFL+EIL+GPQGLLVL+SQQ
Subjt:  VWFGLVSFLNEILLGPQGLLVLLSQQ

Arabidopsis top hits (e-value, % identity, alignment)
AT3G07430.1 YGGT family protein; e-value: 2.5e-04; identity: 27.93%
Query:  SMSASDFMNNVSLADLDPG-----MAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPL-GGVDVTPVVWFGLVS
        S+S +  +   SL D  PG     +  +A+G +  +L  +S + + R+++SW+P +P  + P        +P L   R +IPP+   +DV+P++ F ++ 
Subjt:  SMSASDFMNNVSLADLDPG-----MAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPL-GGVDVTPVVWFGLVS

Query:  FLNEILLGPQG
         L  I+ G  G
Subjt:  FLNEILLGPQG

AT4G27990.1 YGGT family protein; e-value: 9.5e-04; identity: 29.11%
Query:  LGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPL-GGVDVTPVVWFGLVSFLNEILLGPQG
        L  +L  +S + + R+++SW+P +P  + P        +P L   R +IPP+   +DV+P++ F ++  L  IL   +G
Subjt:  LGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPL-GGVDVTPVVWFGLVSFLNEILLGPQG

AT5G21920.1 YGGT family protein; e-value: 5.6e-04; identity: 33.71%
Query:  MNNVSLADLDPG--MAKLAI--GLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPT--EPLLIATRKVIPPLGGVDVTPVVWF
        M+N   A + PG  +A L +  GL+  FL+ ++ + + R+V++W+P  P    P ++    T  +P L   R  IPPLGG+D++P++ F
Subjt:  MNNVSLADLDPG--MAKLAI--GLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPT--EPLLIATRKVIPPLGGVDVTPVVWF

AT5G36120.1 cofactor assembly, complex C (B6F); e-value: 9.1e-47; identity: 76.98%
Query:  EAARTIPFALQDASMSASDFMNNVSLADLDPGMAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPV
        EAA T     Q  S+S S  + N+SLADLDPG AKLAIG+LGP LSAF FLFI RIVMSWYPKLPV KFPYV+AYAPTEP+L+ TRKVIPPL GVDVTPV
Subjt:  EAARTIPFALQDASMSASDFMNNVSLADLDPGMAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPV

Query:  VWFGLVSFLNEILLGPQGLLVLLSQQ
        VWFGLVSFL+EIL+GPQGLLVL+SQQ
Subjt:  VWFGLVSFLNEILLGPQGLLVLLSQQ


Sequences
CDS sequence
ATGGCCGCCGCCGCCGCCTGCTCCTCTTTCAGCTCCATTCGATTTTTACTTAATCCTTGTGATTTAGGTATAGGATCGTCCCCTTTGATTCCCAACTATGGAAATTCAAG
CTTCTCTAGAGGCTTCAAGTACCATTCTCAAAGGAATCCAAACTGCAGAGGCCAAGCAATCAAATGTAGCTCATCTTTGTTGGATTCTTTTACCTGTTCCAAGATTCGTC
TGTCACTGGCCTATGCCACCCCTCCATTAAAGCCAGCTGCTGCATATGAAGCTGCAAGGACTATCCCCTTTGCCTTGCAAGATGCATCTATGTCTGCCTCTGATTTCATG
AACAATGTGTCCTTGGCCGACCTCGATCCAGGAATGGCAAAGCTTGCGATCGGCCTTCTGGGGCCATTTCTCTCGGCATTTTCGTTTTTGTTTATTGCGAGAATAGTAAT
GTCTTGGTATCCCAAGTTGCCTGTGGGGAAGTTTCCATATGTTATAGCTTATGCCCCCACTGAACCACTTCTAATTGCAACAAGGAAGGTGATCCCTCCTCTCGGCGGAG
TTGATGTAACGCCAGTCGTCTGGTTCGGATTGGTTAGTTTTCTCAACGAGATATTGCTTGGTCCCCAAGGGCTGCTTGTCCTCCTTTCTCAACAGGTCAGCTGA
mRNA sequence
ATGGCCGCCGCCGCCGCCTGCTCCTCTTTCAGCTCCATTCGATTTTTACTTAATCCTTGTGATTTAGGTATAGGATCGTCCCCTTTGATTCCCAACTATGGAAATTCAAG
CTTCTCTAGAGGCTTCAAGTACCATTCTCAAAGGAATCCAAACTGCAGAGGCCAAGCAATCAAATGTAGCTCATCTTTGTTGGATTCTTTTACCTGTTCCAAGATTCGTC
TGTCACTGGCCTATGCCACCCCTCCATTAAAGCCAGCTGCTGCATATGAAGCTGCAAGGACTATCCCCTTTGCCTTGCAAGATGCATCTATGTCTGCCTCTGATTTCATG
AACAATGTGTCCTTGGCCGACCTCGATCCAGGAATGGCAAAGCTTGCGATCGGCCTTCTGGGGCCATTTCTCTCGGCATTTTCGTTTTTGTTTATTGCGAGAATAGTAAT
GTCTTGGTATCCCAAGTTGCCTGTGGGGAAGTTTCCATATGTTATAGCTTATGCCCCCACTGAACCACTTCTAATTGCAACAAGGAAGGTGATCCCTCCTCTCGGCGGAG
TTGATGTAACGCCAGTCGTCTGGTTCGGATTGGTTAGTTTTCTCAACGAGATATTGCTTGGTCCCCAAGGGCTGCTTGTCCTCCTTTCTCAACAGGTCAGCTGA
Protein sequence
MAAAAACSSFSSIRFLLNPCDLGIGSSPLIPNYGNSSFSRGFKYHSQRNPNCRGQAIKCSSSLLDSFTCSKIRLSLAYATPPLKPAAAYEAARTIPFALQDASMSASDFM
NNVSLADLDPGMAKLAIGLLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
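
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below is one way to do that, assuming the standard nuclear genetic code (NCBI translation table 1); the function name `translate` is hypothetical and is not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# by base in the order T, C, A, G; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS to protein, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so a wrapped sequence can be
    pasted in directly. Codons with ambiguous bases (e.g. N) raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGGCCGCCGCCGCCGCCTGCTCC"))  # MAAAAACS
```

Passing the full 654-nt CDS to `translate` should give a 217-residue protein for comparison with the listed protein sequence.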