; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g22720 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g22720
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionS-adenosyl-L-methionine-dependent methyltransferase
Genome locationchr7:16757504..16758058
RNA-Seq ExpressionMoc07g22720
SyntenyMoc07g22720
Gene Ontology termsGO:0016740 - transferase activity (molecular function)
InterPro domainsIPR009902 - Protein of unknown function DUF1442


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039995.1 uncharacterized protein E6C27_scaffold122G002780 [Cucumis melo var. makuwa]5.7e-7679.89Show/hide
Query:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVVGEAAMA--AVVGVDFLVVDCKRRDFARVLRFT
        MAAGWNAKLI+HACS   +S+VTT+GLAVAARHTGGRYVCAVADERSKSEYV+ LQ+AGV  P E++VGEA MA  AVVGVDFLVVDCKR+DF RVLR  
Subjt:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVVGEAAMA--AVVGVDFLVVDCKRRDFARVLRFT

Query:  KVSNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGGGS---ISTSKPSRWMRHVDERSGEEHVYRERI
        KVS KGAIL+C+NTW RSF KLGYC LLP+GTRVV+SVSLPVGQGLSIIH+GSS GG     ISTSKPSRWM+HVDERSGEEHVYRERI
Subjt:  KVSNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGGGS---ISTSKPSRWMRHVDERSGEEHVYRERI

XP_008460114.1 PREDICTED: uncharacterized protein LOC103499021 [Cucumis melo]1.4e-7478.84Show/hide
Query:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVVGEAAMA--AVVGVDFLVVDCKRRDFARVLRFT
        MAAGWNAKLI+HACS   +S+VTT+GLAVAARHTGGRYVCAVADERSKS YV+ LQ+AGV  P E++VGEA MA  AVVGVDFLVVDCKR+DF RVLR  
Subjt:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVVGEAAMA--AVVGVDFLVVDCKRRDFARVLRFT

Query:  KVSNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGGGS---ISTSKPSRWMRHVDERSGEEHVYRERI
        KVS KGAIL+C+NTW RSF KLGYC LLP+GTR V+SVSLPVGQGLSIIH+GSS GG     ISTSKPSRWM+HVDERSGEEHVYRERI
Subjt:  KVSNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGGGS---ISTSKPSRWMRHVDERSGEEHVYRERI

XP_022151163.1 uncharacterized protein LOC111019154 [Momordica charantia]6.4e-96100Show/hide
Query:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVVGEAAMAAVVGVDFLVVDCKRRDFARVLRFTKV
        MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVVGEAAMAAVVGVDFLVVDCKRRDFARVLRFTKV
Subjt:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVVGEAAMAAVVGVDFLVVDCKRRDFARVLRFTKV

Query:  SNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGGGSISTSKPSRWMRHVDERSGEEHVYRERI
        SNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGGGSISTSKPSRWMRHVDERSGEEHVYRERI
Subjt:  SNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGGGSISTSKPSRWMRHVDERSGEEHVYRERI

XP_023000186.1 uncharacterized protein LOC111494475 [Cucurbita maxima]1.2e-7378.65Show/hide
Query:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVV--GEAAMAAVVGVDFLVVDCKRRDFARVLRFT
        MAAGWNAKLIVHACS   +S VTT+GLAVAARHTGGR+VC +ADERSKSEYV+ LQ+AGVL PAE+VV   EAAMA VVGVDFLVVDCKRRDFARVLRF 
Subjt:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVV--GEAAMAAVVGVDFLVVDCKRRDFARVLRFT

Query:  KVSNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGG------GSISTSKPSRWMRHVDERSGEEHVYRERI
         VS+KGAILVCKNTW R+F K+ YC LLPRG RVV+SVSLPVGQGLSIIHVGSS+GG      GSIS SK SRWMRH+DERSGEEHVYR+ I
Subjt:  KVSNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGG------GSISTSKPSRWMRHVDERSGEEHVYRERI

XP_038875680.1 uncharacterized protein LOC120068075 [Benincasa hispida]3.7e-8386.02Show/hide
Query:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVVGEA--AMAAVVGVDFLVVDCKRRDFARVLRFT
        MAAGWNAKLIVHACSA  +S+VTTVGLAVAARH+GGRYVCAVADERSKSEYV+ LQ+AGV PP E+VVGEA  AMAAVVGVDFLVVDCKRRDFARVLRF 
Subjt:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVVGEA--AMAAVVGVDFLVVDCKRRDFARVLRFT

Query:  KVSNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGGGSISTSKPSRWMRHVDERSGEEHVYRERI
        KV+++GAIL+CKNTWSRSF KLGYCGLLP+GTRVV+SVSLPVGQGLSIIHVGSS+G G+ISTSKPSRWM+HVDERSGEEHVYR+RI
Subjt:  KVSNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGGGSISTSKPSRWMRHVDERSGEEHVYRERI

TrEMBL top hitse value%identityAlignment
A0A1S3CBT0 uncharacterized protein LOC1034990216.8e-7578.84Show/hide
Query:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVVGEAAMA--AVVGVDFLVVDCKRRDFARVLRFT
        MAAGWNAKLI+HACS   +S+VTT+GLAVAARHTGGRYVCAVADERSKS YV+ LQ+AGV  P E++VGEA MA  AVVGVDFLVVDCKR+DF RVLR  
Subjt:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVVGEAAMA--AVVGVDFLVVDCKRRDFARVLRFT

Query:  KVSNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGGGS---ISTSKPSRWMRHVDERSGEEHVYRERI
        KVS KGAIL+C+NTW RSF KLGYC LLP+GTR V+SVSLPVGQGLSIIH+GSS GG     ISTSKPSRWM+HVDERSGEEHVYRERI
Subjt:  KVSNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGGGS---ISTSKPSRWMRHVDERSGEEHVYRERI

A0A5A7TEV7 Uncharacterized protein2.7e-7679.89Show/hide
Query:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVVGEAAMA--AVVGVDFLVVDCKRRDFARVLRFT
        MAAGWNAKLI+HACS   +S+VTT+GLAVAARHTGGRYVCAVADERSKSEYV+ LQ+AGV  P E++VGEA MA  AVVGVDFLVVDCKR+DF RVLR  
Subjt:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVVGEAAMA--AVVGVDFLVVDCKRRDFARVLRFT

Query:  KVSNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGGGS---ISTSKPSRWMRHVDERSGEEHVYRERI
        KVS KGAIL+C+NTW RSF KLGYC LLP+GTRVV+SVSLPVGQGLSIIH+GSS GG     ISTSKPSRWM+HVDERSGEEHVYRERI
Subjt:  KVSNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGGGS---ISTSKPSRWMRHVDERSGEEHVYRERI

A0A6J1DBF8 uncharacterized protein LOC1110191543.1e-96100Show/hide
Query:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVVGEAAMAAVVGVDFLVVDCKRRDFARVLRFTKV
        MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVVGEAAMAAVVGVDFLVVDCKRRDFARVLRFTKV
Subjt:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVVGEAAMAAVVGVDFLVVDCKRRDFARVLRFTKV

Query:  SNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGGGSISTSKPSRWMRHVDERSGEEHVYRERI
        SNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGGGSISTSKPSRWMRHVDERSGEEHVYRERI
Subjt:  SNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGGGSISTSKPSRWMRHVDERSGEEHVYRERI

A0A6J1HJV1 uncharacterized protein LOC1114642281.6e-7177.89Show/hide
Query:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVV--GEAAMAAVVGVDFLVVDCKRRDFARVLRFT
        MAAGWNAKLIVHACS   +S VTT+GL+VAARHTGGR+VC +ADERSKSEYV+ LQ+AGV  PAE+VV   EAAMAAVVGVDFLVVDCKRRDFARVLRF 
Subjt:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVV--GEAAMAAVVGVDFLVVDCKRRDFARVLRFT

Query:  KVSNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGG----GSISTSKPSRWMRHVDERSGEEHVYRERI
         VS+KGAILVCKNTW RSF K+ Y  LLPRG RVV+SVSLPVGQGLSIIH GSS+GG     SIS SK SRWMRH+DERSGEEHVYR+ I
Subjt:  KVSNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGG----GSISTSKPSRWMRHVDERSGEEHVYRERI

A0A6J1KF65 uncharacterized protein LOC1114944755.7e-7478.65Show/hide
Query:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVV--GEAAMAAVVGVDFLVVDCKRRDFARVLRFT
        MAAGWNAKLIVHACS   +S VTT+GLAVAARHTGGR+VC +ADERSKSEYV+ LQ+AGVL PAE+VV   EAAMA VVGVDFLVVDCKRRDFARVLRF 
Subjt:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVV--GEAAMAAVVGVDFLVVDCKRRDFARVLRFT

Query:  KVSNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGG------GSISTSKPSRWMRHVDERSGEEHVYRERI
         VS+KGAILVCKNTW R+F K+ YC LLPRG RVV+SVSLPVGQGLSIIHVGSS+GG      GSIS SK SRWMRH+DERSGEEHVYR+ I
Subjt:  KVSNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGG------GSISTSKPSRWMRHVDERSGEEHVYRERI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G12320.1 Protein of unknown function (DUF1442)4.0e-2740.64Show/hide
Query:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQD-AGVLPPAEVVVGE---AAMAAVVGVDFLVVDCKRRDF-ARVL
        MAAGWN KLIV   S G A + +++GL VA++H   +++C V + RS+S Y++ +Q+ +  L   E +V E    AM  + GVDFLVVD + ++F A  L
Subjt:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQD-AGVLPPAEVVVGE---AAMAAVVGVDFLVVDCKRRDF-ARVL

Query:  RFTKVSNKGAILVCKNTWSRSFSKLGYCGL--LPRGTRVVKSVSLPVGQGLSIIHVGSSDGGGSISTSKPSRWMRHVDERSGEEHVY
        +     N+GA++VC+N         GY  L  + R  +VV++V+LPV  G+ I HV + + G S +  +  RW+ HVD+RSGEEHV+
Subjt:  RFTKVSNKGAILVCKNTWSRSFSKLGYCGL--LPRGTRVVKSVSLPVGQGLSIIHVGSSDGGGSISTSKPSRWMRHVDERSGEEHVY

AT1G62840.1 Protein of unknown function (DUF1442)1.8e-2738.74Show/hide
Query:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVVG-------EAAMAAVVGVDFLVVDCKRRDF-A
        MAAGWNA LIV   S G   ++ +VGL +A+RHT GR++C V + RS++ Y++ + +       E ++        E  M  + G+DFLVVD  ++DF A
Subjt:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVVG-------EAAMAAVVGVDFLVVDCKRRDF-A

Query:  RVLRFTKVSNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSS-DGGGSISTSKPSRWMRHVDERSGEEHVYRE
         VLR     ++GA++VC++ + RS S   +         VV++V+LPV  GL I HV ++   G S + S   +W++H D+RSGEEHV R+
Subjt:  RVLRFTKVSNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSS-DGGGSISTSKPSRWMRHVDERSGEEHVYRE

AT2G45360.1 Protein of unknown function (DUF1442)1.2e-4456.83Show/hide
Query:  AAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVVGEA---AMAAVVGVDFLVVDCKRRDFARVLRFT
        AAGWNA+LIV   S G     T+VGLAVAA HTGGR+VC V DE+SK EYV  ++  G +    VVVGE+    M    GVDFLVVD KRR+F R LRF 
Subjt:  AAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVVGEA---AMAAVVGVDFLVVDCKRRDFARVLRFT

Query:  KVSNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGGGSISTSKPSRWMRHVDERSGEEHVYR
        K+SNKGA+LVCKN   R+ S   +  +L RGTRVV+SV LPVG GL I+HVG++  G   S +  SRW+RHVD  SGEEH++R
Subjt:  KVSNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGGGSISTSKPSRWMRHVDERSGEEHVYR

AT3G60780.1 Protein of unknown function (DUF1442)1.4e-4051.37Show/hide
Query:  AAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVV---GEAAMAAVVGVDFLVVDCKRRDFARVLRFT
        AAGWN +LIV   S G     T+VGLAVAA HT GR+VC V DE S+SEY   ++ A      EV+V    E  +  + GVDF+VVD KR +F   L   
Subjt:  AAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVV---GEAAMAAVVGVDFLVVDCKRRDFARVLRFT

Query:  KVSNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGGGSISTSKPSRWMRHVDERSGEEHVYR
        K S  GA+LVCKN   +S     + GLL RGTRVV+SV LPVG+GL I+HVG+S GGG+     PSRW++H+D RSGEEH+++
Subjt:  KVSNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGGGSISTSKPSRWMRHVDERSGEEHVYR

AT5G62280.1 Protein of unknown function (DUF1442)1.1e-0527.27Show/hide
Query:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEV-----VVGEAAMAAVVG-----VDFLVVDCKRRD
        +AAG +A+ I  AC AG A++   V L  AA  T G+ VC +          E +    +L P+E+     VVGE+    ++       DF++VDC   +
Subjt:  MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEV-----VVGEAAMAAVVG-----VDFLVVDCKRRD

Query:  FARVL----------RFTKVSNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSD-----GGGSISTSKPSRWMRHVDERSG
           ++            T   +  A++V  N +SR            R +   K+  LP+G+GL +  V  +              + SRW+  VD+ +G
Subjt:  FARVL----------RFTKVSNKGAILVCKNTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSD-----GGGSISTSKPSRWMRHVDERSG

Query:  EEHVYRERI
        EEHV+R R+
Subjt:  EEHVYRERI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCCGGCTGGAACGCCAAACTCATCGTACACGCTTGCTCCGCCGGCGTCGCCTCCTCCGTCACCACCGTCGGCCTCGCCGTGGCCGCGCGGCACACCGGCGGGCG
GTACGTTTGTGCGGTGGCGGACGAGCGGTCGAAATCGGAGTACGTGGAGTGCTTGCAAGATGCCGGCGTTTTGCCGCCGGCGGAGGTTGTGGTCGGAGAGGCGGCGATGG
CGGCGGTGGTCGGAGTGGATTTTCTGGTGGTGGATTGTAAGCGGAGGGATTTCGCTAGGGTTTTGAGGTTCACCAAGGTGAGCAACAAAGGGGCAATTTTGGTATGTAAA
AATACGTGGTCTAGAAGCTTCTCGAAGCTCGGTTACTGTGGGCTCCTCCCGAGGGGGACACGTGTCGTCAAATCAGTGAGTTTACCGGTGGGCCAGGGGTTGAGTATAAT
TCATGTGGGGTCGTCGGATGGTGGCGGCTCGATCTCGACGAGCAAGCCTAGCCGTTGGATGAGGCACGTCGATGAACGATCCGGTGAAGAACACGTGTACCGGGAGCGAA
TCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGCCGGCTGGAACGCCAAACTCATCGTACACGCTTGCTCCGCCGGCGTCGCCTCCTCCGTCACCACCGTCGGCCTCGCCGTGGCCGCGCGGCACACCGGCGGGCG
GTACGTTTGTGCGGTGGCGGACGAGCGGTCGAAATCGGAGTACGTGGAGTGCTTGCAAGATGCCGGCGTTTTGCCGCCGGCGGAGGTTGTGGTCGGAGAGGCGGCGATGG
CGGCGGTGGTCGGAGTGGATTTTCTGGTGGTGGATTGTAAGCGGAGGGATTTCGCTAGGGTTTTGAGGTTCACCAAGGTGAGCAACAAAGGGGCAATTTTGGTATGTAAA
AATACGTGGTCTAGAAGCTTCTCGAAGCTCGGTTACTGTGGGCTCCTCCCGAGGGGGACACGTGTCGTCAAATCAGTGAGTTTACCGGTGGGCCAGGGGTTGAGTATAAT
TCATGTGGGGTCGTCGGATGGTGGCGGCTCGATCTCGACGAGCAAGCCTAGCCGTTGGATGAGGCACGTCGATGAACGATCCGGTGAAGAACACGTGTACCGGGAGCGAA
TCTGA
Protein sequenceShow/hide protein sequence
MAAGWNAKLIVHACSAGVASSVTTVGLAVAARHTGGRYVCAVADERSKSEYVECLQDAGVLPPAEVVVGEAAMAAVVGVDFLVVDCKRRDFARVLRFTKVSNKGAILVCK
NTWSRSFSKLGYCGLLPRGTRVVKSVSLPVGQGLSIIHVGSSDGGGSISTSKPSRWMRHVDERSGEEHVYRERI