; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g34710 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g34710
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionBAHD acyltransferase At3g29680-like
Genome locationchr1:24580295..24581196
RNA-Seq ExpressionMoc01g34710
SyntenyMoc01g34710
Gene Ontology termsGO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia]7.6e-9368.44Show/hide
Query:  RFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHS
        RFR+E SSSGV+DQVSRISA  LDRCLRRAS+FVSD GSVLQRTID AAEAF+ASI SA+ VKAELDGRE L A+E+E  S  LEAA +T+K ELLKA  
Subjt:  RFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHS

Query:  EVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KDKDDMLQALDAKDKELEHATAELETTKERLSNGVLLKESFRQHPDFDGFAKDFS
        EV+IL+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE       K+KDD+ Q L+ KD  +   T EL+  KERL++G LL+ESFRQHP+FDGFAKDFS
Subjt:  EVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KDKDDMLQALDAKDKELEHATAELETTKERLSNGVLLKESFRQHPDFDGFAKDFS

Query:  DEGFKFLMKGIASDMPDLQIDLSGLKRRYAENWASGPGGTPGPQALVDQYVRDLDSDYSDLEED--------QVGTTQEGAP
        D GFKFLMKGIA+DMP LQIDLS LK+RY+ENWASGP GTPGPQ+LVD+YVR+LDSDYSD+EE+        +VGTTQE AP
Subjt:  DEGFKFLMKGIASDMPDLQIDLSGLKRRYAENWASGPGGTPGPQALVDQYVRDLDSDYSDLEED--------QVGTTQEGAP

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]3.4e-12589.64Show/hide
Query:  MARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKA
        +A+ R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS  GSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKDELLKA
Subjt:  MARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKA

Query:  HSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KDKDDMLQALDAKDKELEHATAELETTKERLSNGVLLKESFRQHPDFDGFAKD
        HSEVE LKAEVESQAELLKKEEDRR+AQLRAAHAITRGLE       K+KDDMLQAL+AKDKELEHATAELET KERLSNGVLL+E+FRQHPDFDGFAKD
Subjt:  HSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KDKDDMLQALDAKDKELEHATAELETTKERLSNGVLLKESFRQHPDFDGFAKD

Query:  FSDEGFKFLMKGIASDMPDLQIDLSGLKRRYAENWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGTTQEGAPQADS
        FSD GFKFLMKGIASDMPDLQIDLSGLKRRYAE WASGPGGTPGPQALVDQYVRDLDSDYSD EEDQVG+TQEGA    S
Subjt:  FSDEGFKFLMKGIASDMPDLQIDLSGLKRRYAENWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGTTQEGAPQADS

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]6.9e-9469.86Show/hide
Query:  RFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHS
        RFR+EPSSSGV+DQVSRISA  LDRCL+RASKFVSD GSVLQRTID AAEAFVASI SA+ VKAELDGRE LAA+E+E  SAALEAA +T+K ELLKA  
Subjt:  RFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHS

Query:  EVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KDKDDMLQALDAKDKELEHATAELETTKERLSNGVLLKESFRQHPDFDGFAKDFS
        EV IL+AEV+++AELLKKE ++ KA LRAAHAIT+GLE       K+KDD+ Q L+ KD  +   TAEL+  KERL+NG LL+ESFRQH DFDGFAKDFS
Subjt:  EVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KDKDDMLQALDAKDKELEHATAELETTKERLSNGVLLKESFRQHPDFDGFAKDFS

Query:  DEGFKFLMKGIASDMPDLQIDLSGLKRRYAENWASGPGGTPGPQALVDQYVRDLDSDYSDLEED--------QVGTTQEGAP
        D GFKFLMKGIA+DMP LQIDLS LK++Y+E WASGP GTPGPQ+LV +YVR+LDSDYSD+EE+        ++GTTQE  P
Subjt:  DEGFKFLMKGIASDMPDLQIDLSGLKRRYAENWASGPGGTPGPQALVDQYVRDLDSDYSDLEED--------QVGTTQEGAP

XP_022158203.1 uncharacterized protein LOC111024740 [Momordica charantia]2.1e-9577.37Show/hide
Query:  EPSSSGVRDQVSRISAASLDRCLRRASKFVSDLGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEI
        EPSSSGVRDQVSRISAASLDRCLRRASKFVSD GSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAA  TMKDELLKAHSEVE 
Subjt:  EPSSSGVRDQVSRISAASLDRCLRRASKFVSDLGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEI

Query:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KDKDDMLQALDAKDKELEHATAELETTKERLSNGVLLKESFRQHPDFDGFAKDFSDEGF
        LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE       K+KDDMLQAL+ KDKELEHATAELET KERLSN                          
Subjt:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KDKDDMLQALDAKDKELEHATAELETTKERLSNGVLLKESFRQHPDFDGFAKDFSDEGF

Query:  KFLMKGIASDMPDLQIDLSGLKRRYAENWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGTTQEGAPQADS
                      +IDLSGLKRRYAE WASGPGGTPGPQALVDQYVRDLDSDYSD +EDQVG+TQEGAP A S
Subjt:  KFLMKGIASDMPDLQIDLSGLKRRYAENWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGTTQEGAPQADS

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]3.2e-9167.73Show/hide
Query:  RFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHS
        RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSD GSVLQRTID  AEAF+ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  
Subjt:  RFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHS

Query:  EVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KDKDDMLQALDAKDKELEHATAELETTKERLSNGVLLKESFRQHPDFDGFAKDFS
        EV+IL+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE       K+KDD+ Q L+ KD  +   T EL+  KERL+NG LL+ESFRQHPDFDGFAKDFS
Subjt:  EVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KDKDDMLQALDAKDKELEHATAELETTKERLSNGVLLKESFRQHPDFDGFAKDFS

Query:  DEGFKFLMKGIASDMPDLQIDLSGLKRRYAENWASGPGGTPGPQALVDQYVRDLDSDYSDLEED--------QVGTTQEGAP
        D GFKFLMKGIA+DMP LQIDL+GLK++Y+E WASGP GTP PQ+LVD+YVR+LDSDYSD+EE+        +VGTTQE  P
Subjt:  DEGFKFLMKGIASDMPDLQIDLSGLKRRYAENWASGPGGTPGPQALVDQYVRDLDSDYSDLEED--------QVGTTQEGAP

TrEMBL top hitse value%identityAlignment
A0A6J1D1N9 uncharacterized protein LOC1110161933.7e-9368.44Show/hide
Query:  RFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHS
        RFR+E SSSGV+DQVSRISA  LDRCLRRAS+FVSD GSVLQRTID AAEAF+ASI SA+ VKAELDGRE L A+E+E  S  LEAA +T+K ELLKA  
Subjt:  RFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHS

Query:  EVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KDKDDMLQALDAKDKELEHATAELETTKERLSNGVLLKESFRQHPDFDGFAKDFS
        EV+IL+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE       K+KDD+ Q L+ KD  +   T EL+  KERL++G LL+ESFRQHP+FDGFAKDFS
Subjt:  EVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KDKDDMLQALDAKDKELEHATAELETTKERLSNGVLLKESFRQHPDFDGFAKDFS

Query:  DEGFKFLMKGIASDMPDLQIDLSGLKRRYAENWASGPGGTPGPQALVDQYVRDLDSDYSDLEED--------QVGTTQEGAP
        D GFKFLMKGIA+DMP LQIDLS LK+RY+ENWASGP GTPGPQ+LVD+YVR+LDSDYSD+EE+        +VGTTQE AP
Subjt:  DEGFKFLMKGIASDMPDLQIDLSGLKRRYAENWASGPGGTPGPQALVDQYVRDLDSDYSDLEED--------QVGTTQEGAP

A0A6J1D971 uncharacterized protein LOC1110185381.6e-12589.64Show/hide
Query:  MARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKA
        +A+ R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS  GSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKDELLKA
Subjt:  MARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKA

Query:  HSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KDKDDMLQALDAKDKELEHATAELETTKERLSNGVLLKESFRQHPDFDGFAKD
        HSEVE LKAEVESQAELLKKEEDRR+AQLRAAHAITRGLE       K+KDDMLQAL+AKDKELEHATAELET KERLSNGVLL+E+FRQHPDFDGFAKD
Subjt:  HSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KDKDDMLQALDAKDKELEHATAELETTKERLSNGVLLKESFRQHPDFDGFAKD

Query:  FSDEGFKFLMKGIASDMPDLQIDLSGLKRRYAENWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGTTQEGAPQADS
        FSD GFKFLMKGIASDMPDLQIDLSGLKRRYAE WASGPGGTPGPQALVDQYVRDLDSDYSD EEDQVG+TQEGA    S
Subjt:  FSDEGFKFLMKGIASDMPDLQIDLSGLKRRYAENWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGTTQEGAPQADS

A0A6J1DF31 uncharacterized protein LOC1110199093.3e-9469.86Show/hide
Query:  RFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHS
        RFR+EPSSSGV+DQVSRISA  LDRCL+RASKFVSD GSVLQRTID AAEAFVASI SA+ VKAELDGRE LAA+E+E  SAALEAA +T+K ELLKA  
Subjt:  RFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHS

Query:  EVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KDKDDMLQALDAKDKELEHATAELETTKERLSNGVLLKESFRQHPDFDGFAKDFS
        EV IL+AEV+++AELLKKE ++ KA LRAAHAIT+GLE       K+KDD+ Q L+ KD  +   TAEL+  KERL+NG LL+ESFRQH DFDGFAKDFS
Subjt:  EVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KDKDDMLQALDAKDKELEHATAELETTKERLSNGVLLKESFRQHPDFDGFAKDFS

Query:  DEGFKFLMKGIASDMPDLQIDLSGLKRRYAENWASGPGGTPGPQALVDQYVRDLDSDYSDLEED--------QVGTTQEGAP
        D GFKFLMKGIA+DMP LQIDLS LK++Y+E WASGP GTPGPQ+LV +YVR+LDSDYSD+EE+        ++GTTQE  P
Subjt:  DEGFKFLMKGIASDMPDLQIDLSGLKRRYAENWASGPGGTPGPQALVDQYVRDLDSDYSDLEED--------QVGTTQEGAP

A0A6J1DVF6 uncharacterized protein LOC1110247401.0e-9577.37Show/hide
Query:  EPSSSGVRDQVSRISAASLDRCLRRASKFVSDLGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEI
        EPSSSGVRDQVSRISAASLDRCLRRASKFVSD GSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAA  TMKDELLKAHSEVE 
Subjt:  EPSSSGVRDQVSRISAASLDRCLRRASKFVSDLGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEI

Query:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KDKDDMLQALDAKDKELEHATAELETTKERLSNGVLLKESFRQHPDFDGFAKDFSDEGF
        LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE       K+KDDMLQAL+ KDKELEHATAELET KERLSN                          
Subjt:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KDKDDMLQALDAKDKELEHATAELETTKERLSNGVLLKESFRQHPDFDGFAKDFSDEGF

Query:  KFLMKGIASDMPDLQIDLSGLKRRYAENWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGTTQEGAPQADS
                      +IDLSGLKRRYAE WASGPGGTPGPQALVDQYVRDLDSDYSD +EDQVG+TQEGAP A S
Subjt:  KFLMKGIASDMPDLQIDLSGLKRRYAENWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGTTQEGAPQADS

A0A6J1DZB3 uncharacterized protein LOC1110256651.5e-9167.73Show/hide
Query:  RFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHS
        RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSD GSVLQRTID  AEAF+ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  
Subjt:  RFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHS

Query:  EVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KDKDDMLQALDAKDKELEHATAELETTKERLSNGVLLKESFRQHPDFDGFAKDFS
        EV+IL+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE       K+KDD+ Q L+ KD  +   T EL+  KERL+NG LL+ESFRQHPDFDGFAKDFS
Subjt:  EVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KDKDDMLQALDAKDKELEHATAELETTKERLSNGVLLKESFRQHPDFDGFAKDFS

Query:  DEGFKFLMKGIASDMPDLQIDLSGLKRRYAENWASGPGGTPGPQALVDQYVRDLDSDYSDLEED--------QVGTTQEGAP
        D GFKFLMKGIA+DMP LQIDL+GLK++Y+E WASGP GTP PQ+LVD+YVR+LDSDYSD+EE+        +VGTTQE  P
Subjt:  DEGFKFLMKGIASDMPDLQIDLSGLKRRYAENWASGPGGTPGPQALVDQYVRDLDSDYSDLEED--------QVGTTQEGAP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACGGTTCAGAGTTGAGCCGTCAAGTTCCGGGGTGAGGGATCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTAAGAAGGGCGTCCAAATTTGTGAG
CGACCTTGGGTCCGTTCTGCAGAGGACCATCGATTACGCCGCCGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTT
TGGCAGCAAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGATTTTGAAGGCCGAG
GTGGAGTCCCAGGCCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAAGGCCCAACTCCGAGCTGCCCACGCTATCACCAGGGGCCTGGAGAAGGACAAGGACGACATGCT
CCAGGCACTTGATGCGAAGGATAAGGAGCTGGAGCATGCGACTGCCGAGCTGGAGACGACGAAGGAGCGCCTCAGCAATGGAGTCCTATTGAAGGAATCGTTTAGGCAAC
ATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGAGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTG
AAAAGGAGGTATGCCGAGAATTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGA
AGAGGACCAGGTCGGCACCACACAGGAGGGCGCTCCTCAAGCGGACTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCACGGTTCAGAGTTGAGCCGTCAAGTTCCGGGGTGAGGGATCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTAAGAAGGGCGTCCAAATTTGTGAG
CGACCTTGGGTCCGTTCTGCAGAGGACCATCGATTACGCCGCCGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTT
TGGCAGCAAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGATTTTGAAGGCCGAG
GTGGAGTCCCAGGCCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAAGGCCCAACTCCGAGCTGCCCACGCTATCACCAGGGGCCTGGAGAAGGACAAGGACGACATGCT
CCAGGCACTTGATGCGAAGGATAAGGAGCTGGAGCATGCGACTGCCGAGCTGGAGACGACGAAGGAGCGCCTCAGCAATGGAGTCCTATTGAAGGAATCGTTTAGGCAAC
ATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGAGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTG
AAAAGGAGGTATGCCGAGAATTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGA
AGAGGACCAGGTCGGCACCACACAGGAGGGCGCTCCTCAAGCGGACTCTTAG
Protein sequenceShow/hide protein sequence
MARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAE
VESQAELLKKEEDRRKAQLRAAHAITRGLEKDKDDMLQALDAKDKELEHATAELETTKERLSNGVLLKESFRQHPDFDGFAKDFSDEGFKFLMKGIASDMPDLQIDLSGL
KRRYAENWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGTTQEGAPQADS