CuGenDBv2

Lag0002571 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0002571
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic
Genome location: chr4:43901987..43903398
RNA-Seq Expression: Lag0002571
Synteny: Lag0002571
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR003425 - CCB3/YggT
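
The genome location field can be split into its parts with a short script. This is a minimal sketch, not part of the database: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end, span).

    Assumption: start and end are 1-based, inclusive coordinates.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    return chrom, start, end, end - start + 1

chrom, start, end, span = parse_location("chr4:43901987..43903398")
print(chrom, start, end, span)
```

Under that assumption the gene spans 1412 bp on chr4.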


Homology
GenBank top hits | e value | % identity | Alignment
KAA0067975.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3 [Cucumis melo var. makuwa] | 1.3e-84 | 86.98
Query:  FPKFVDSLPCFLGQKGFKYHPQRNPNCRFQAIQCSSSLLSSFPSSKIHLSLAYANPPLKPAAAYEAARTIPFALQDASIATSDFMNNVALADLDPATAKL
        F K +D LPCFLGQ+GFKYHPQRNPN +FQAI+CSSSLL SF SSKI  SLAY  PPLKPAAAYEAARTIPFALQDAS+A SDF+N++ LADLDP TAKL
Subjt:  FPKFVDSLPCFLGQKGFKYHPQRNPNCRFQAIQCSSSLLSSFPSSKIHLSLAYANPPLKPAAAYEAARTIPFALQDASIATSDFMNNVALADLDPATAKL

Query:  AISFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
        AISFLGP LS FSFLFI RIVMSWYPKLPVGKFPYVIAYAPTEPLLI TRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
Subjt:  AISFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS

TYK18148.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3 [Cucumis melo var. makuwa] | 3.8e-84 | 86.46
Query:  FPKFVDSLPCFLGQKGFKYHPQRNPNCRFQAIQCSSSLLSSFPSSKIHLSLAYANPPLKPAAAYEAARTIPFALQDASIATSDFMNNVALADLDPATAKL
        F K +D LPCFLGQ+GFKYHPQRNPN +FQAI+CSSSLL SF SSKI  SLAY  PPLKPAAAYEAARTIPFAL+DAS+A SDF+N++ LADLDP TAKL
Subjt:  FPKFVDSLPCFLGQKGFKYHPQRNPNCRFQAIQCSSSLLSSFPSSKIHLSLAYANPPLKPAAAYEAARTIPFALQDASIATSDFMNNVALADLDPATAKL

Query:  AISFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
        AISFLGP LS FSFLFI RIVMSWYPKLPVGKFPYVIAYAPTEPLLI TRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
Subjt:  AISFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS

XP_004136017.2 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X3 [Cucumis sativus] | 4.8e-79 | 86.59
Query:  QKGFKYHPQRNPNCRFQAIQCSSSLLSSFPSSKIHLSLAYANPPLKPAAAYEAARTIPFALQDASIATSDFMNNVALADLDPATAKLAISFLGPFLSAFS
        ++GFKYHPQRNPNC+FQAI+CSSSLL SF SSK  LSLAY  PPLKPAAAYEAARTIPF LQDAS+A SDF+N++ LADLDP TAKLAISFLGP LS FS
Subjt:  QKGFKYHPQRNPNCRFQAIQCSSSLLSSFPSSKIHLSLAYANPPLKPAAAYEAARTIPFALQDASIATSDFMNNVALADLDPATAKLAISFLGPFLSAFS

Query:  FLFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
        FLFI RIVMSWYPKLPVGKFPYVIAYAPTEPLL+ TRKVIPPLGGVDVTPVVWFGL+SFLNEILLGPQGLLVLLSQQVS
Subjt:  FLFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS

XP_038896182.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X1 [Benincasa hispida] | 3.6e-82 | 89.89
Query:  KGFKYHPQRNPNCRFQAIQCSSSLLSSFPSSKIHLSLAYANPPLKPAAAYEAARTIPFALQDASIATSDFMNNVALADLDPATAKLAISFLGPFLSAFSF
        +GFKYHPQRNPNCRFQA +CSSS+L SF +SK+HLSLAYA  PLKPAAAYEAARTIPFALQDAS++ SDFMNNVALADLDP  AKLAI FLGPFLSAFSF
Subjt:  KGFKYHPQRNPNCRFQAIQCSSSLLSSFPSSKIHLSLAYANPPLKPAAAYEAARTIPFALQDASIATSDFMNNVALADLDPATAKLAISFLGPFLSAFSF

Query:  LFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
        LFI RIVMSWYPKLPVGKFPYVIAYAPTEPLLI TRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
Subjt:  LFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS

XP_038896183.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X2 [Benincasa hispida] | 1.2e-82 | 87.63
Query:  SLPCFLGQKGFKYHPQRNPNCRFQAIQCSSSLLSSFPSSKIHLSLAYANPPLKPAAAYEAARTIPFALQDASIATSDFMNNVALADLDPATAKLAISFLG
        SL C   ++GFKYHPQRNPNCRFQA +CSSS+L SF +SK+HLSLAYA  PLKPAAAYEAARTIPFALQDAS++ SDFMNNVALADLDP  AKLAI FLG
Subjt:  SLPCFLGQKGFKYHPQRNPNCRFQAIQCSSSLLSSFPSSKIHLSLAYANPPLKPAAAYEAARTIPFALQDASIATSDFMNNVALADLDPATAKLAISFLG

Query:  PFLSAFSFLFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
        PFLSAFSFLFI RIVMSWYPKLPVGKFPYVIAYAPTEPLLI TRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
Subjt:  PFLSAFSFLFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS

TrEMBL top hits | e value | % identity | Alignment
A0A0A0K9A7 Uncharacterized protein | 3.0e-79 | 87.08
Query:  KGFKYHPQRNPNCRFQAIQCSSSLLSSFPSSKIHLSLAYANPPLKPAAAYEAARTIPFALQDASIATSDFMNNVALADLDPATAKLAISFLGPFLSAFSF
        +GFKYHPQRNPNC+FQAI+CSSSLL SF SSK  LSLAY  PPLKPAAAYEAARTIPF LQDAS+A SDF+N++ LADLDP TAKLAISFLGP LS FSF
Subjt:  KGFKYHPQRNPNCRFQAIQCSSSLLSSFPSSKIHLSLAYANPPLKPAAAYEAARTIPFALQDASIATSDFMNNVALADLDPATAKLAISFLGPFLSAFSF

Query:  LFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
        LFI RIVMSWYPKLPVGKFPYVIAYAPTEPLL+ TRKVIPPLGGVDVTPVVWFGL+SFLNEILLGPQGLLVLLSQQVS
Subjt:  LFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS

A0A1S3BSI7 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic | 1.2e-78 | 87.71
Query:  QKGFKYHPQRNPNCRFQAIQCSSSLLSSFPSSKIHLSLAYANPPLKPAAAYEAARTIPFALQDASIATSDFMNNVALADLDPATAKLAISFLGPFLSAFS
        ++GFKYHPQRNPN +FQAI+CSSSLL SF SSKI  SLAY  PPLKPAAAYEAARTIPFALQDAS+A SDF+N++ LADLDP TAKLAISFLGP LS FS
Subjt:  QKGFKYHPQRNPNCRFQAIQCSSSLLSSFPSSKIHLSLAYANPPLKPAAAYEAARTIPFALQDASIATSDFMNNVALADLDPATAKLAISFLGPFLSAFS

Query:  FLFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
        FLFI RIVMSWYPKLPVGKFPYVIAYAPTEPLLI TRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
Subjt:  FLFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS

A0A5A7VQJ0 Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3 | 6.3e-85 | 86.98
Query:  FPKFVDSLPCFLGQKGFKYHPQRNPNCRFQAIQCSSSLLSSFPSSKIHLSLAYANPPLKPAAAYEAARTIPFALQDASIATSDFMNNVALADLDPATAKL
        F K +D LPCFLGQ+GFKYHPQRNPN +FQAI+CSSSLL SF SSKI  SLAY  PPLKPAAAYEAARTIPFALQDAS+A SDF+N++ LADLDP TAKL
Subjt:  FPKFVDSLPCFLGQKGFKYHPQRNPNCRFQAIQCSSSLLSSFPSSKIHLSLAYANPPLKPAAAYEAARTIPFALQDASIATSDFMNNVALADLDPATAKL

Query:  AISFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
        AISFLGP LS FSFLFI RIVMSWYPKLPVGKFPYVIAYAPTEPLLI TRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
Subjt:  AISFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS

A0A5D3D3A7 Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3 | 1.8e-84 | 86.46
Query:  FPKFVDSLPCFLGQKGFKYHPQRNPNCRFQAIQCSSSLLSSFPSSKIHLSLAYANPPLKPAAAYEAARTIPFALQDASIATSDFMNNVALADLDPATAKL
        F K +D LPCFLGQ+GFKYHPQRNPN +FQAI+CSSSLL SF SSKI  SLAY  PPLKPAAAYEAARTIPFAL+DAS+A SDF+N++ LADLDP TAKL
Subjt:  FPKFVDSLPCFLGQKGFKYHPQRNPNCRFQAIQCSSSLLSSFPSSKIHLSLAYANPPLKPAAAYEAARTIPFALQDASIATSDFMNNVALADLDPATAKL

Query:  AISFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
        AISFLGP LS FSFLFI RIVMSWYPKLPVGKFPYVIAYAPTEPLLI TRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
Subjt:  AISFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS

A0A6J1GQG0 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic | 1.2e-78 | 87.15
Query:  QKGFKYHPQRNPNCRFQAIQCSSSLLSSFPSSKIHLSLAYANPPLKPAAAYEAARTIPFALQDASIATSDFMNNVALADLDPATAKLAISFLGPFLSAFS
        ++GF YHPQ NPNCRFQA +CSSSLL SF SSKIHL L YA PPLKP     AARTIPFALQDAS+A SDFMNNV+LADLDP TAKLAI FLGPFLSAFS
Subjt:  QKGFKYHPQRNPNCRFQAIQCSSSLLSSFPSSKIHLSLAYANPPLKPAAAYEAARTIPFALQDASIATSDFMNNVALADLDPATAKLAISFLGPFLSAFS

Query:  FLFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
        FLFI RIVMSWYPKLPVGKFPYVIAYAPTEPLLI TRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
Subjt:  FLFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS

SwissProt top hits | e value | % identity | Alignment
Q8RWM7 Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic | 4.0e-44 | 75.4
Query:  EAARTIPFALQDASIATSDFMNNVALADLDPATAKLAISFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPLGGVDVTPV
        EAA T     Q  SI  S+ + N++LADLDP TAKLAI  LGP LSAF FLFI+RIVMSWYPKLPV KFPYV+AYAPTEP+L+ TRKVIPPL GVDVTPV
Subjt:  EAARTIPFALQDASIATSDFMNNVALADLDPATAKLAISFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPLGGVDVTPV

Query:  VWFGLVSFLNEILLGPQGLLVLLSQQ
        VWFGLVSFL+EIL+GPQGLLVL+SQQ
Subjt:  VWFGLVSFLNEILLGPQGLLVLLSQQ

Arabidopsis top hits | e value | % identity | Alignment
AT3G07430.1 YGGT family protein | 9.2e-04 | 30.26
Query:  FLSAFSFLFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPL-GGVDVTPVVWFGLVSFLNEILLGPQG
        +L  +S + +VR+++SW+P +P  + P        +P L + R +IPP+   +DV+P++ F ++  L  I+ G  G
Subjt:  FLSAFSFLFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPL-GGVDVTPVVWFGLVSFLNEILLGPQG

AT4G27990.1 YGGT family protein | 5.4e-04 | 30.38
Query:  LGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPL-GGVDVTPVVWFGLVSFLNEILLGPQG
        L  +L  +S + +VR+++SW+P +P  + P        +P L + R +IPP+   +DV+P++ F ++  L  IL   +G
Subjt:  LGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPL-GGVDVTPVVWFGLVSFLNEILLGPQG

AT5G21920.1 YGGT family protein | 4.1e-04 | 35.48
Query:  FLSAFSFLFIVRIVMSWYPKLPVGKFPYVIAYAPT--EPLLIVTRKVIPPLGGVDVTPVVWF
        FL+ ++ + +VR+V++W+P  P    P ++    T  +P L + R  IPPLGG+D++P++ F
Subjt:  FLSAFSFLFIVRIVMSWYPKLPVGKFPYVIAYAPT--EPLLIVTRKVIPPLGGVDVTPVVWF

AT5G21920.2 YGGT family protein | 3.2e-04 | 29.1
Query:  SLLSSFPSSKIHL-SLAYANPPLKPAAAYEAARTIPFALQDASIATSDFMNNVALADLDPATAKLAISFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPY
        SL+S+    K  L SLA  NP  + A        + FA+ +   A        A+   D     +  + L  FL+ ++ + +VR+V++W+P  P    P 
Subjt:  SLLSSFPSSKIHL-SLAYANPPLKPAAAYEAARTIPFALQDASIATSDFMNNVALADLDPATAKLAISFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPY

Query:  VIAYAPT--EPLLIVTRKVIPPLGGVDVTPVVWF
        ++    T  +P L + R  IPPLGG+D++P++ F
Subjt:  VIAYAPT--EPLLIVTRKVIPPLGGVDVTPVVWF

AT5G36120.1 cofactor assembly, complex C (B6F) | 2.8e-45 | 75.4
Query:  EAARTIPFALQDASIATSDFMNNVALADLDPATAKLAISFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPLGGVDVTPV
        EAA T     Q  SI  S+ + N++LADLDP TAKLAI  LGP LSAF FLFI+RIVMSWYPKLPV KFPYV+AYAPTEP+L+ TRKVIPPL GVDVTPV
Subjt:  EAARTIPFALQDASIATSDFMNNVALADLDPATAKLAISFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVIAYAPTEPLLIVTRKVIPPLGGVDVTPV

Query:  VWFGLVSFLNEILLGPQGLLVLLSQQ
        VWFGLVSFL+EIL+GPQGLLVL+SQQ
Subjt:  VWFGLVSFLNEILLGPQGLLVLLSQQ


Sequences
CDS sequence
ATGTATTCACCTTGTACACCTCTCTGTTTTGCCTCCTCTGCTCTCGATTTTGTGGTTTCTGTTCCTTTCTTTTTCTTCTTCTTCTTCTTTTACGAGTTTTCTGTGTTGTT
TTTGACTGGAGGGTTGTGGTTTGTTCAGTTGGACAAGGGTCTTTTAAGAGGATTTGACAATGCTGGTGTAATTGTGGCTGAAATGTTTCTTGTTATCTATTTTGGTTTCT
GTCCAATCCTTGCTGTAAATTGCTTCACATTCCCCAAATTTGTAGATTCATTACCTTGTTTCCTTGGACAGAAAGGCTTCAAGTACCATCCTCAAAGAAATCCAAACTGC
AGATTCCAAGCAATCCAATGTAGCTCATCTTTGTTGAGTTCTTTTCCCTCTTCCAAGATTCATCTGTCATTGGCCTATGCCAACCCTCCATTAAAGCCAGCTGCTGCATA
TGAAGCTGCAAGGACTATCCCCTTTGCCTTGCAAGATGCATCAATTGCTACCTCTGATTTCATGAACAATGTGGCCCTGGCTGACCTCGACCCAGCAACAGCAAAGCTCG
CAATTAGCTTTCTGGGGCCGTTTCTCTCGGCGTTTTCGTTTTTGTTTATTGTGAGAATAGTAATGTCCTGGTATCCAAAGTTGCCTGTGGGTAAGTTTCCATATGTTATA
GCTTATGCCCCCACTGAACCACTTCTAATTGTAACAAGGAAGGTGATCCCCCCTCTCGGCGGAGTTGACGTAACGCCAGTCGTCTGGTTCGGATTGGTTAGTTTCCTCAA
CGAGATATTGCTCGGTCCCCAAGGGCTGCTTGTCCTCCTTTCTCAGCAGGTCAGCTGA
mRNA sequence
ATGTATTCACCTTGTACACCTCTCTGTTTTGCCTCCTCTGCTCTCGATTTTGTGGTTTCTGTTCCTTTCTTTTTCTTCTTCTTCTTCTTTTACGAGTTTTCTGTGTTGTT
TTTGACTGGAGGGTTGTGGTTTGTTCAGTTGGACAAGGGTCTTTTAAGAGGATTTGACAATGCTGGTGTAATTGTGGCTGAAATGTTTCTTGTTATCTATTTTGGTTTCT
GTCCAATCCTTGCTGTAAATTGCTTCACATTCCCCAAATTTGTAGATTCATTACCTTGTTTCCTTGGACAGAAAGGCTTCAAGTACCATCCTCAAAGAAATCCAAACTGC
AGATTCCAAGCAATCCAATGTAGCTCATCTTTGTTGAGTTCTTTTCCCTCTTCCAAGATTCATCTGTCATTGGCCTATGCCAACCCTCCATTAAAGCCAGCTGCTGCATA
TGAAGCTGCAAGGACTATCCCCTTTGCCTTGCAAGATGCATCAATTGCTACCTCTGATTTCATGAACAATGTGGCCCTGGCTGACCTCGACCCAGCAACAGCAAAGCTCG
CAATTAGCTTTCTGGGGCCGTTTCTCTCGGCGTTTTCGTTTTTGTTTATTGTGAGAATAGTAATGTCCTGGTATCCAAAGTTGCCTGTGGGTAAGTTTCCATATGTTATA
GCTTATGCCCCCACTGAACCACTTCTAATTGTAACAAGGAAGGTGATCCCCCCTCTCGGCGGAGTTGACGTAACGCCAGTCGTCTGGTTCGGATTGGTTAGTTTCCTCAA
CGAGATATTGCTCGGTCCCCAAGGGCTGCTTGTCCTCCTTTCTCAGCAGGTCAGCTGA
Protein sequence
MYSPCTPLCFASSALDFVVSVPFFFFFFFFYEFSVLFLTGGLWFVQLDKGLLRGFDNAGVIVAEMFLVIYFGFCPILAVNCFTFPKFVDSLPCFLGQKGFKYHPQRNPNC
RFQAIQCSSSLLSSFPSSKIHLSLAYANPPLKPAAAYEAARTIPFALQDASIATSDFMNNVALADLDPATAKLAISFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVI
AYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
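
The protein sequence can be checked against the CDS by translating the CDS codon by codon. The sketch below is illustrative only and is not the pipeline the database used. It assumes the standard nuclear genetic code, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS to protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First six codons and last five codons of the CDS listed above.
print(translate("ATGTATTCACCTTGTACA"))  # MYSPCT
print(translate("CAGCAGGTCAGCTGA"))     # QQVS
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the protein sequence after joining its lines gives a quick consistency check of the record.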