CuGenDBv2

Moc09g06850 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g06850
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic
Genome location: chr9:5391227..5392742
RNA-Seq Expression: Moc09g06850
Synteny: Moc09g06850
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR003425 - CCB3/YggT
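
The genome location field uses a `chromosome:start..end` notation. As a minimal sketch, the snippet below parses that string and computes the span of the locus. The function names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive on both ends, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


chrom, start, end = parse_location("chr9:5391227..5392742")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the locus spans 1516 bp; with half-open coordinates the figure would be one less.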


Homology
GenBank top hits (e value, % identity, alignment)
XP_008451549.1 PREDICTED: protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic [Cucumis melo] | e value: 3.4e-79 | % identity: 83.77
Query:  MAAAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAK
        MAA A CSSL+SIRRGFK+HPQRNPNY+FQA++CSSSLLGS  TSSK+   LA   PPLKPAAA+E  RT PFALQDASMAASDF  +M LADLDP TAK
Subjt:  MAAAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAK

Query:  LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ
        LAI FLGP LS FSFLFI RIVMSWYPKLPVGKFPYV+AYAPTEPLLI TRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ
Subjt:  LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ

XP_022144899.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X1 [Momordica charantia] | e value: 6.6e-99 | % identity: 99.49
Query:  MAAAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAK
        MAAAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAK
Subjt:  MAAAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAK

Query:  LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLS-QQEKNY
        LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLS QQEKNY
Subjt:  LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLS-QQEKNY

XP_022144908.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X2 [Momordica charantia] | e value: 2.7e-100 | % identity: 100
Query:  MAAAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAK
        MAAAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAK
Subjt:  MAAAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAK

Query:  LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQEKNY
        LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQEKNY
Subjt:  LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQEKNY

XP_022144916.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X3 [Momordica charantia] | e value: 2.1e-97 | % identity: 100
Query:  MAAAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAK
        MAAAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAK
Subjt:  MAAAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAK

Query:  LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ
        LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ
Subjt:  LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ

XP_038896183.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X2 [Benincasa hispida] | e value: 2.4e-77 | % identity: 84.04
Query:  AAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAKLAI
        AA CSSL+ IRRGFK+HPQRNPN RFQA +CSSS+L S  T+SK+ L LA AT PLKPAAA+E  RT PFALQDASM+ASDF  N+ALADLDP  AKLAI
Subjt:  AAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAKLAI

Query:  GFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ
        GFLGPFLSAFSFLFI RIVMSWYPKLPVGKFPYV+AYAPTEPLLI TRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ
Subjt:  GFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BSI7 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic | e value: 1.6e-79 | % identity: 83.77
Query:  MAAAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAK
        MAA A CSSL+SIRRGFK+HPQRNPNY+FQA++CSSSLLGS  TSSK+   LA   PPLKPAAA+E  RT PFALQDASMAASDF  +M LADLDP TAK
Subjt:  MAAAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAK

Query:  LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ
        LAI FLGP LS FSFLFI RIVMSWYPKLPVGKFPYV+AYAPTEPLLI TRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ
Subjt:  LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ

A0A6J1CTL8 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X1 | e value: 3.2e-99 | % identity: 99.49
Query:  MAAAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAK
        MAAAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAK
Subjt:  MAAAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAK

Query:  LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLS-QQEKNY
        LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLS QQEKNY
Subjt:  LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLS-QQEKNY

A0A6J1CUI3 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X2 | e value: 1.3e-100 | % identity: 100
Query:  MAAAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAK
        MAAAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAK
Subjt:  MAAAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAK

Query:  LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQEKNY
        LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQEKNY
Subjt:  LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQEKNY

A0A6J1CUU2 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X3 | e value: 1.0e-97 | % identity: 100
Query:  MAAAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAK
        MAAAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAK
Subjt:  MAAAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAK

Query:  LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ
        LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ
Subjt:  LAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ

A0A6J1JVN7 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X2 | e value: 5.9e-77 | % identity: 84.66
Query:  AAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAKLA
        AA  CSSL++IRRGF +HPQ NPN RFQA +CSSSLLGS  TSSK+ L L SATP LKPAA     RT PFALQDASMAASDFT N+ALADLDP TAKLA
Subjt:  AAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAKLA

Query:  IGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ
        IGFLGPFLSAFSFLFI RIVMSWYPKLPVGKFPYV+AYAPTEPLLI TRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ
Subjt:  IGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQ

SwissProt top hits (e value, % identity, alignment)
Q8RWM7 Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic | e value: 5.7e-45 | % identity: 73.64
Query:  EGTRTTPFALQDASMAASDFTKNMALADLDPATAKLAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPV
        E   TT    Q  S+  S+  +N++LADLDP TAKLAIG LGP LSAF FLFI+RIVMSWYPKLPV KFPYV+AYAPTEP+L+ TRKVIPPL GVDVTPV
Subjt:  EGTRTTPFALQDASMAASDFTKNMALADLDPATAKLAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPV

Query:  VWFGLVSFLNEILLGPQGLLVLLSQQEKN
        VWFGLVSFL+EIL+GPQGLLVL+SQQ+ N
Subjt:  VWFGLVSFLNEILLGPQGLLVLLSQQEKN

Arabidopsis top hits (e value, % identity, alignment)
AT3G07430.1 YGGT family protein | e value: 5.0e-04 | % identity: 25.48
Query:  SSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTP-------FALQDASMAASDFTKNMALADLDPATAKLAIGFLGPFLSAFSFLFIVRIVMSWYP
        SS+L GST + + +A L  + T  L    +     ++P       F+L  A        ++     L+     +A+G +  +L  +S + +VR+++SW+P
Subjt:  SSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTP-------FALQDASMAASDFTKNMALADLDPATAKLAIGFLGPFLSAFSFLFIVRIVMSWYP

Query:  KLPVGKFPYVVAYAPTEPLLIVTRKVIPPL-GGVDVTPVVWFGLVSFLNEILLGPQG
         +P  + P        +P L + R +IPP+   +DV+P++ F ++  L  I+ G  G
Subjt:  KLPVGKFPYVVAYAPTEPLLIVTRKVIPPL-GGVDVTPVVWFGLVSFLNEILLGPQG

AT4G27990.1 YGGT family protein | e value: 2.2e-04 | % identity: 30.38
Query:  LGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPL-GGVDVTPVVWFGLVSFLNEILLGPQG
        L  +L  +S + +VR+++SW+P +P  + P        +P L + R +IPP+   +DV+P++ F ++  L  IL   +G
Subjt:  LGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPL-GGVDVTPVVWFGLVSFLNEILLGPQG

AT5G21920.1 YGGT family protein | e value: 2.2e-04 | % identity: 37.1
Query:  FLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPT--EPLLIVTRKVIPPLGGVDVTPVVWF
        FL+ ++ + +VR+V++W+P  P    P +V    T  +P L + R  IPPLGG+D++P++ F
Subjt:  FLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPT--EPLLIVTRKVIPPLGGVDVTPVVWF

AT5G21920.2 YGGT family protein | e value: 3.8e-04 | % identity: 29.85
Query:  SLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAKLAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPY
        SL+ +T         LAS  P  + A     T    FA+ +   AA        L     A   +A G +  FL+ ++ + +VR+V++W+P  P    P 
Subjt:  SLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAKLAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPY

Query:  VVAYAPT--EPLLIVTRKVIPPLGGVDVTPVVWF
        +V    T  +P L + R  IPPLGG+D++P++ F
Subjt:  VVAYAPT--EPLLIVTRKVIPPLGGVDVTPVVWF

AT5G36120.1 cofactor assembly, complex C (B6F) | e value: 4.1e-46 | % identity: 73.64
Query:  EGTRTTPFALQDASMAASDFTKNMALADLDPATAKLAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPV
        E   TT    Q  S+  S+  +N++LADLDP TAKLAIG LGP LSAF FLFI+RIVMSWYPKLPV KFPYV+AYAPTEP+L+ TRKVIPPL GVDVTPV
Subjt:  EGTRTTPFALQDASMAASDFTKNMALADLDPATAKLAIGFLGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPV

Query:  VWFGLVSFLNEILLGPQGLLVLLSQQEKN
        VWFGLVSFL+EIL+GPQGLLVL+SQQ+ N
Subjt:  VWFGLVSFLNEILLGPQGLLVLLSQQEKN


Sequences
CDS sequence
ATGGCCGCCGCCGCCTTCTGCTCCTCTCTCACCTCCATTCGAAGAGGCTTTAAGCACCATCCTCAAAGAAATCCAAACTACAGATTCCAGGCGCTCCAGTGTAGC
TCATCCTTATTGGGTTCTACTGTTACCTCTTCCAAGGTTGCTCTGTTATTAGCCTCTGCCACCCCTCCATTAAAGCCAGCTGCTGCATTTGAAGGTACAAGAACT
ACCCCCTTTGCCCTGCAAGATGCATCAATGGCTGCCTCTGATTTCACGAAAAACATGGCCCTGGCCGACCTCGACCCAGCAACGGCAAAGCTCGCAATCGGCTTT
CTGGGTCCGTTTCTCTCGGCATTTTCGTTTTTGTTTATTGTGAGAATAGTGATGTCTTGGTATCCAAAGTTACCTGTGGGGAAGTTTCCATATGTTGTAGCTTAT
GCTCCAACTGAACCACTTCTAATTGTGACAAGGAAGGTGATTCCCCCTCTTGGCGGAGTTGACGTAACGCCTGTTGTCTGGTTCGGATTGGTTAGTTTCCTCAAC
GAGATATTGCTCGGTCCCCAAGGACTGCTTGTTCTTCTTTCTCAACAGGAGAAGAATTACTAA
mRNA sequence
ATGGCCGCCGCCGCCTTCTGCTCCTCTCTCACCTCCATTCGAAGAGGCTTTAAGCACCATCCTCAAAGAAATCCAAACTACAGATTCCAGGCGCTCCAGTGTAGC
TCATCCTTATTGGGTTCTACTGTTACCTCTTCCAAGGTTGCTCTGTTATTAGCCTCTGCCACCCCTCCATTAAAGCCAGCTGCTGCATTTGAAGGTACAAGAACT
ACCCCCTTTGCCCTGCAAGATGCATCAATGGCTGCCTCTGATTTCACGAAAAACATGGCCCTGGCCGACCTCGACCCAGCAACGGCAAAGCTCGCAATCGGCTTT
CTGGGTCCGTTTCTCTCGGCATTTTCGTTTTTGTTTATTGTGAGAATAGTGATGTCTTGGTATCCAAAGTTACCTGTGGGGAAGTTTCCATATGTTGTAGCTTAT
GCTCCAACTGAACCACTTCTAATTGTGACAAGGAAGGTGATTCCCCCTCTTGGCGGAGTTGACGTAACGCCTGTTGTCTGGTTCGGATTGGTTAGTTTCCTCAAC
GAGATATTGCTCGGTCCCCAAGGACTGCTTGTTCTTCTTTCTCAACAGGAGAAGAATTACTAA
Protein sequence
MAAAAFCSSLTSIRRGFKHHPQRNPNYRFQALQCSSSLLGSTVTSSKVALLLASATPPLKPAAAFEGTRTTPFALQDASMAASDFTKNMALADLDPATAKLAIGF
LGPFLSAFSFLFIVRIVMSWYPKLPVGKFPYVVAYAPTEPLLIVTRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQEKNY