; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g36080 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g36080
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionBAHD acyltransferase At3g29680-like
Genome locationchr8:26624994..26626005
RNA-Seq ExpressionMoc08g36080
SyntenyMoc08g36080
Gene Ontology termsGO:0016740 - transferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]1.6e-8244.47Show/hide
Query:  EVGAHEALPASFADRVDDPEARMGGTFDVTARFRVEPSSSG--------------------------------RTIDYAAKAFVASIQSALAVKAELDGR
        EVGA   LPA FADRVDDP ARMGGT DVTARFR+EPSSSG                                R IDYAA+AFVASIQSALAVKAELDGR
Subjt:  EVGAHEALPASFADRVDDPEARMGGTFDVTARFRVEPSSSG--------------------------------RTIDYAAKAFVASIQSALAVKAELDGR

Query:  EALAAREKEEFSAALEAASSTMKDELQKAHSEVEILKAEV------------------------------------------------------------
        E LAAREKEEFSAALEAASSTMKDEL KAHSEVE LKAEV                                                            
Subjt:  EALAAREKEEFSAALEAASSTMKDELQKAHSEVEILKAEV------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------EAKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLDALEAKDEELKHATAELETAKERLSNGVLLEES
                                EAKAELLK+E++R KA LRAAHAIT+GLEKEKFQLLKEKDDML ALE KD  +    AEL+  KERL+NG LLE +
Subjt:  ------------------------EAKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLDALEAKDEELKHATAELETAKERLSNGVLLEES

Query:  FRQHPDFDGFSKDFSDAGFKFLMKGIAFDMPDLQIDLSGLKKRYAEQWASGPSGTPGPQALVDKYVRVLDFDYSDLEED--------QVGTTQEGAP
        FRQHPDFDGF+KDFSDAGFKFLMKGIA D+P L++DL  LKKRYAE+WASGP+GT GP +LVDKYVR LD DYSDL+ED        +VGTTQEG P
Subjt:  FRQHPDFDGFSKDFSDAGFKFLMKGIAFDMPDLQIDLSGLKKRYAEQWASGPSGTPGPQALVDKYVRVLDFDYSDLEED--------QVGTTQEGAP

XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia]2.8e-7968.25Show/hide
Query:  ARFRVEPSS-SGRTIDYAAKAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELQKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAA
        +RF  +P S   RTID AA+AF+ASI SA+ VKAELDGREAL A+E+E  S  LEAA +T+K EL KA  EV+IL+AEV+AK +LLKKE ++ KA LRAA
Subjt:  ARFRVEPSS-SGRTIDYAAKAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELQKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAA

Query:  HAITRGLEKEKFQLLKEKDDMLDALEAKDEELKHATAELETAKERLSNGVLLEESFRQHPDFDGFSKDFSDAGFKFLMKGIAFDMPDLQIDLSGLKKRYA
        HAIT+GLEKEKFQLLKEKDD+   LE KD  +   T EL+  KERL++G LLEESFRQHP+FDGF+KDFSDAGFKFLMKGIA DMP LQIDLS LKKRY+
Subjt:  HAITRGLEKEKFQLLKEKDDMLDALEAKDEELKHATAELETAKERLSNGVLLEESFRQHPDFDGFSKDFSDAGFKFLMKGIAFDMPDLQIDLSGLKKRYA

Query:  EQWASGPSGTPGPQALVDKYVRVLDFDYSDLEED--------QVGTTQEGAP
        E WASGP+GTPGPQ+LVDKYVR LD DYSD+EE+        +VGTTQE AP
Subjt:  EQWASGPSGTPGPQALVDKYVRVLDFDYSDLEED--------QVGTTQEGAP

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]5.2e-10588.98Show/hide
Query:  RTIDYAAKAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELQKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITRGLEKEKF
        RTIDYAA+AFVASIQSALAVKAELDGRE LAAREKEEFSAALE ASSTMKDEL KAHSEVE LKAEVE++AELLKKEEDRR+AQLRAAHAITRGLE+EKF
Subjt:  RTIDYAAKAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELQKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITRGLEKEKF

Query:  QLLKEKDDMLDALEAKDEELKHATAELETAKERLSNGVLLEESFRQHPDFDGFSKDFSDAGFKFLMKGIAFDMPDLQIDLSGLKKRYAEQWASGPSGTPG
        QLLKEKDDML ALEAKD+EL+HATAELETAKERLSNGVLLEE+FRQHPDFDGF+KDFSDAGFKFLMKGIA DMPDLQIDLSGLK+RYAE+WASGP GTPG
Subjt:  QLLKEKDDMLDALEAKDEELKHATAELETAKERLSNGVLLEESFRQHPDFDGFSKDFSDAGFKFLMKGIAFDMPDLQIDLSGLKKRYAEQWASGPSGTPG

Query:  PQALVDKYVRVLDFDYSDLEEDQVGTTQEGAPQAGS
        PQALVD+YVR LD DYSD EEDQVG+TQEGA   GS
Subjt:  PQALVDKYVRVLDFDYSDLEEDQVGTTQEGAPQAGS

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]1.3e-8464.6Show/hide
Query:  MGGTFDVTARFRVEPSSSG--------------------------------RTIDYAAKAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTM
        MGGTFDV  RFR+EPSSSG                                RTID AA+AFVASI SA+ VKAELDGREALAA+E+E  SAALEAA +T+
Subjt:  MGGTFDVTARFRVEPSSSG--------------------------------RTIDYAAKAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTM

Query:  KDELQKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLDALEAKDEELKHATAELETAKERLSNGVLLEESFRQHPD
        K EL KA  EV IL+AEV+AKAELLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+   LE KD  +   TAEL+  KERL+NG LLEESFRQH D
Subjt:  KDELQKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLDALEAKDEELKHATAELETAKERLSNGVLLEESFRQHPD

Query:  FDGFSKDFSDAGFKFLMKGIAFDMPDLQIDLSGLKKRYAEQWASGPSGTPGPQALVDKYVRVLDFDYSDLEED--------QVGTTQEGAP
        FDGF+KDFSDAGFKFLMKGIA DMP LQIDLS LKK+Y+E+WASGP+GTPGPQ+LV KYVR LD DYSD+EE+        ++GTTQE  P
Subjt:  FDGFSKDFSDAGFKFLMKGIAFDMPDLQIDLSGLKKRYAEQWASGPSGTPGPQALVDKYVRVLDFDYSDLEED--------QVGTTQEGAP

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.2e-8861.76Show/hide
Query:  EVGAHEALPASFADRVDDPEARMGGTFDVTARFRVEPSSSG--------------------------------RTIDYAAKAFVASIQSALAVKAELDGR
        E GA   LP S AD VDDPEARM GT +V  RF +EPSSSG                                RTID  A+AF+ASI  A+ VKAELDGR
Subjt:  EVGAHEALPASFADRVDDPEARMGGTFDVTARFRVEPSSSG--------------------------------RTIDYAAKAFVASIQSALAVKAELDGR

Query:  EALAAREKEEFSAALEAASSTMKDELQKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLDALEAKDEELKHATAEL
        EALAA+E+E   AALEAA +T+K EL KA  EV+IL+AEV+AK +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+   LE KD  +   T EL
Subjt:  EALAAREKEEFSAALEAASSTMKDELQKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLDALEAKDEELKHATAEL

Query:  ETAKERLSNGVLLEESFRQHPDFDGFSKDFSDAGFKFLMKGIAFDMPDLQIDLSGLKKRYAEQWASGPSGTPGPQALVDKYVRVLDFDYSDLEED-----
        +  KERL+NG LLEESFRQHPDFDGF+KDFSDAGFKFLMKGIA DMP LQIDL+GLKK+Y+E+WASGP+GTP PQ+LVDKYVR LD DYSD+EE+     
Subjt:  ETAKERLSNGVLLEESFRQHPDFDGFSKDFSDAGFKFLMKGIAFDMPDLQIDLSGLKKRYAEQWASGPSGTPGPQALVDKYVRVLDFDYSDLEED-----

Query:  ---QVGTTQEGAP--QAGS
           +VGTTQE  P  Q GS
Subjt:  ---QVGTTQEGAP--QAGS

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124677.8e-8344.47Show/hide
Query:  EVGAHEALPASFADRVDDPEARMGGTFDVTARFRVEPSSSG--------------------------------RTIDYAAKAFVASIQSALAVKAELDGR
        EVGA   LPA FADRVDDP ARMGGT DVTARFR+EPSSSG                                R IDYAA+AFVASIQSALAVKAELDGR
Subjt:  EVGAHEALPASFADRVDDPEARMGGTFDVTARFRVEPSSSG--------------------------------RTIDYAAKAFVASIQSALAVKAELDGR

Query:  EALAAREKEEFSAALEAASSTMKDELQKAHSEVEILKAEV------------------------------------------------------------
        E LAAREKEEFSAALEAASSTMKDEL KAHSEVE LKAEV                                                            
Subjt:  EALAAREKEEFSAALEAASSTMKDELQKAHSEVEILKAEV------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------EAKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLDALEAKDEELKHATAELETAKERLSNGVLLEES
                                EAKAELLK+E++R KA LRAAHAIT+GLEKEKFQLLKEKDDML ALE KD  +    AEL+  KERL+NG LLE +
Subjt:  ------------------------EAKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLDALEAKDEELKHATAELETAKERLSNGVLLEES

Query:  FRQHPDFDGFSKDFSDAGFKFLMKGIAFDMPDLQIDLSGLKKRYAEQWASGPSGTPGPQALVDKYVRVLDFDYSDLEED--------QVGTTQEGAP
        FRQHPDFDGF+KDFSDAGFKFLMKGIA D+P L++DL  LKKRYAE+WASGP+GT GP +LVDKYVR LD DYSDL+ED        +VGTTQEG P
Subjt:  FRQHPDFDGFSKDFSDAGFKFLMKGIAFDMPDLQIDLSGLKKRYAEQWASGPSGTPGPQALVDKYVRVLDFDYSDLEED--------QVGTTQEGAP

A0A6J1D1N9 uncharacterized protein LOC1110161931.4e-7968.25Show/hide
Query:  ARFRVEPSS-SGRTIDYAAKAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELQKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAA
        +RF  +P S   RTID AA+AF+ASI SA+ VKAELDGREAL A+E+E  S  LEAA +T+K EL KA  EV+IL+AEV+AK +LLKKE ++ KA LRAA
Subjt:  ARFRVEPSS-SGRTIDYAAKAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELQKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAA

Query:  HAITRGLEKEKFQLLKEKDDMLDALEAKDEELKHATAELETAKERLSNGVLLEESFRQHPDFDGFSKDFSDAGFKFLMKGIAFDMPDLQIDLSGLKKRYA
        HAIT+GLEKEKFQLLKEKDD+   LE KD  +   T EL+  KERL++G LLEESFRQHP+FDGF+KDFSDAGFKFLMKGIA DMP LQIDLS LKKRY+
Subjt:  HAITRGLEKEKFQLLKEKDDMLDALEAKDEELKHATAELETAKERLSNGVLLEESFRQHPDFDGFSKDFSDAGFKFLMKGIAFDMPDLQIDLSGLKKRYA

Query:  EQWASGPSGTPGPQALVDKYVRVLDFDYSDLEED--------QVGTTQEGAP
        E WASGP+GTPGPQ+LVDKYVR LD DYSD+EE+        +VGTTQE AP
Subjt:  EQWASGPSGTPGPQALVDKYVRVLDFDYSDLEED--------QVGTTQEGAP

A0A6J1D971 uncharacterized protein LOC1110185382.5e-10588.98Show/hide
Query:  RTIDYAAKAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELQKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITRGLEKEKF
        RTIDYAA+AFVASIQSALAVKAELDGRE LAAREKEEFSAALE ASSTMKDEL KAHSEVE LKAEVE++AELLKKEEDRR+AQLRAAHAITRGLE+EKF
Subjt:  RTIDYAAKAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELQKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITRGLEKEKF

Query:  QLLKEKDDMLDALEAKDEELKHATAELETAKERLSNGVLLEESFRQHPDFDGFSKDFSDAGFKFLMKGIAFDMPDLQIDLSGLKKRYAEQWASGPSGTPG
        QLLKEKDDML ALEAKD+EL+HATAELETAKERLSNGVLLEE+FRQHPDFDGF+KDFSDAGFKFLMKGIA DMPDLQIDLSGLK+RYAE+WASGP GTPG
Subjt:  QLLKEKDDMLDALEAKDEELKHATAELETAKERLSNGVLLEESFRQHPDFDGFSKDFSDAGFKFLMKGIAFDMPDLQIDLSGLKKRYAEQWASGPSGTPG

Query:  PQALVDKYVRVLDFDYSDLEEDQVGTTQEGAPQAGS
        PQALVD+YVR LD DYSD EEDQVG+TQEGA   GS
Subjt:  PQALVDKYVRVLDFDYSDLEEDQVGTTQEGAPQAGS

A0A6J1DF31 uncharacterized protein LOC1110199096.4e-8564.6Show/hide
Query:  MGGTFDVTARFRVEPSSSG--------------------------------RTIDYAAKAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTM
        MGGTFDV  RFR+EPSSSG                                RTID AA+AFVASI SA+ VKAELDGREALAA+E+E  SAALEAA +T+
Subjt:  MGGTFDVTARFRVEPSSSG--------------------------------RTIDYAAKAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTM

Query:  KDELQKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLDALEAKDEELKHATAELETAKERLSNGVLLEESFRQHPD
        K EL KA  EV IL+AEV+AKAELLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+   LE KD  +   TAEL+  KERL+NG LLEESFRQH D
Subjt:  KDELQKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLDALEAKDEELKHATAELETAKERLSNGVLLEESFRQHPD

Query:  FDGFSKDFSDAGFKFLMKGIAFDMPDLQIDLSGLKKRYAEQWASGPSGTPGPQALVDKYVRVLDFDYSDLEED--------QVGTTQEGAP
        FDGF+KDFSDAGFKFLMKGIA DMP LQIDLS LKK+Y+E+WASGP+GTPGPQ+LV KYVR LD DYSD+EE+        ++GTTQE  P
Subjt:  FDGFSKDFSDAGFKFLMKGIAFDMPDLQIDLSGLKKRYAEQWASGPSGTPGPQALVDKYVRVLDFDYSDLEED--------QVGTTQEGAP

A0A6J1DZB3 uncharacterized protein LOC1110256655.6e-8961.76Show/hide
Query:  EVGAHEALPASFADRVDDPEARMGGTFDVTARFRVEPSSSG--------------------------------RTIDYAAKAFVASIQSALAVKAELDGR
        E GA   LP S AD VDDPEARM GT +V  RF +EPSSSG                                RTID  A+AF+ASI  A+ VKAELDGR
Subjt:  EVGAHEALPASFADRVDDPEARMGGTFDVTARFRVEPSSSG--------------------------------RTIDYAAKAFVASIQSALAVKAELDGR

Query:  EALAAREKEEFSAALEAASSTMKDELQKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLDALEAKDEELKHATAEL
        EALAA+E+E   AALEAA +T+K EL KA  EV+IL+AEV+AK +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+   LE KD  +   T EL
Subjt:  EALAAREKEEFSAALEAASSTMKDELQKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLDALEAKDEELKHATAEL

Query:  ETAKERLSNGVLLEESFRQHPDFDGFSKDFSDAGFKFLMKGIAFDMPDLQIDLSGLKKRYAEQWASGPSGTPGPQALVDKYVRVLDFDYSDLEED-----
        +  KERL+NG LLEESFRQHPDFDGF+KDFSDAGFKFLMKGIA DMP LQIDL+GLKK+Y+E+WASGP+GTP PQ+LVDKYVR LD DYSD+EE+     
Subjt:  ETAKERLSNGVLLEESFRQHPDFDGFSKDFSDAGFKFLMKGIAFDMPDLQIDLSGLKKRYAEQWASGPSGTPGPQALVDKYVRVLDFDYSDLEED-----

Query:  ---QVGTTQEGAP--QAGS
           +VGTTQE  P  Q GS
Subjt:  ---QVGTTQEGAP--QAGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGTCGGAGCTCATGAGGCCCTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTTCGACGTGACAGCACGGTTCAGAGTCGA
GCCGTCAAGTTCTGGGAGGACCATCGACTACGCCGCTAAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAG
CGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTAGAGGCTGCCTCTTCCACCATGAAGGATGAGCTGCAGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCTGAGGTGGAG
GCCAAGGCCGAGCTGCTGAAGAAAGAGGAGGACAGACGCAAAGCCCAGCTCCGAGCTGCCCATGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGA
GAAGGACGACATGCTCGACGCGCTTGAAGCGAAGGATGAGGAGCTGAAACATGCGACTGCCGAGCTGGAGACGGCGAAAGAGCGTCTCAGCAATGGAGTCCTATTGGAGG
AATCGTTTAGGCAACATCCTGACTTCGATGGATTTTCCAAAGACTTCTCTGATGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTTCGACATGCCTGACCTTCAGATC
GATCTCAGTGGTCTGAAGAAGAGGTATGCCGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTATGTCAGAGTTCTGGACTTTGA
CTACTCCGATCTGGAAGAGGACCAGGTCGGCACCACTCAGGAGGGCGCTCCTCAGGCAGGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGGTCGGAGCTCATGAGGCCCTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTTCGACGTGACAGCACGGTTCAGAGTCGA
GCCGTCAAGTTCTGGGAGGACCATCGACTACGCCGCTAAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAG
CGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTAGAGGCTGCCTCTTCCACCATGAAGGATGAGCTGCAGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCTGAGGTGGAG
GCCAAGGCCGAGCTGCTGAAGAAAGAGGAGGACAGACGCAAAGCCCAGCTCCGAGCTGCCCATGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGA
GAAGGACGACATGCTCGACGCGCTTGAAGCGAAGGATGAGGAGCTGAAACATGCGACTGCCGAGCTGGAGACGGCGAAAGAGCGTCTCAGCAATGGAGTCCTATTGGAGG
AATCGTTTAGGCAACATCCTGACTTCGATGGATTTTCCAAAGACTTCTCTGATGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTTCGACATGCCTGACCTTCAGATC
GATCTCAGTGGTCTGAAGAAGAGGTATGCCGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTATGTCAGAGTTCTGGACTTTGA
CTACTCCGATCTGGAAGAGGACCAGGTCGGCACCACTCAGGAGGGCGCTCCTCAGGCAGGCTCTTAG
Protein sequenceShow/hide protein sequence
MEVGAHEALPASFADRVDDPEARMGGTFDVTARFRVEPSSSGRTIDYAAKAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELQKAHSEVEILKAEVE
AKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLDALEAKDEELKHATAELETAKERLSNGVLLEESFRQHPDFDGFSKDFSDAGFKFLMKGIAFDMPDLQI
DLSGLKKRYAEQWASGPSGTPGPQALVDKYVRVLDFDYSDLEEDQVGTTQEGAPQAGS