; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g01960 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g01960
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRibonuclease H
Genome locationchr4:1250004..1253903
RNA-Seq ExpressionMoc04g01960
SyntenyMoc04g01960
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]1.6e-12291.94Show/hide
Query:  PPRRTDRPAVINTIFGGPSGGQFGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLA
        P RR DRPAVINTIFGGPSGGQ GHKRKELARAARREVCIIREQRPTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLA
Subjt:  PPRRTDRPAVINTIFGGPSGGQFGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLA

Query:  LGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQATQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSNPNGVGTVRGEQTASRE
        LGWTRSQLKKS TPLVGFS ESVIPEGCIDLPVTLG DQTQ TQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYS PNGVG VRGEQ ASRE
Subjt:  LGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQATQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSNPNGVGTVRGEQTASRE

Query:  CYASALKGSSVCALETLASRDGTLEFEADLLRREFAAPTEELELVPLL
        CYASALKGSSVCALETL SRDGTLEF+A+L RREFAAPTEELELVPLL
Subjt:  CYASALKGSSVCALETLASRDGTLEFEADLLRREFAAPTEELELVPLL

XP_022150861.1 uncharacterized protein LOC111018906 [Momordica charantia]1.2e-10982.17Show/hide
Query:  MPPRRTDRPAVINTIFGGPSGGQFGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYL
        MPPRR DR AVINTIF GPSGGQ G+KRKELAR ARREVCIIREQ+PTC I+F  ADLE VHLPHNDALVIAPLIDHV+VRRVLVDGGAS NILSL TYL
Subjt:  MPPRRTDRPAVINTIFGGPSGGQFGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYL

Query:  ALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQATQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSNPNGVGTVRGEQTASR
        ALGWTRSQLKKSPTPLVGFSGE V PEGCIDLPV +GQD TQ TQMAEFVVI GRSAYNAIFGRPIIHSFRA+ STLHQVLKYS PNGVGTVRGEQ  SR
Subjt:  ALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQATQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSNPNGVGTVRGEQTASR

Query:  ECYASALKGSSVCALETLASRDGTLEFEADLL---RREFAAPTEELELVPLLSPEKQL
        ECYASALKGSSV ALE  AS D   + EA+LL   +REF+APTE+LELVPLLSP++Q+
Subjt:  ECYASALKGSSVCALETLASRDGTLEFEADLL---RREFAAPTEELELVPLLSPEKQL

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]6.1e-13092.91Show/hide
Query:  PPRRTDRPAVINTIFGGPSGGQFGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLA
        PPRRTDRPAVINTIFGGPSGGQ GHKRK+LARAARREVCIIREQRPTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLA
Subjt:  PPRRTDRPAVINTIFGGPSGGQFGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLA

Query:  LGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQATQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSNPNGVGTVRGEQTASRE
        LGWTRSQLKKSPTPLVGFSGESV+PEGCIDLPVTLGQDQT+ TQMAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYS PNGVGTVRGEQTASRE
Subjt:  LGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQATQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSNPNGVGTVRGEQTASRE

Query:  CYASALKGSSVCALETLASRDGTLEFEADLLRREFAAPTEELELVPLLSPEKQL
        CYAS LKG+SVCALETL SRDGTLEFEADL  REFAAP EELELVPLLS EKQ+
Subjt:  CYASALKGSSVCALETLASRDGTLEFEADLLRREFAAPTEELELVPLLSPEKQL

XP_022155866.1 uncharacterized protein LOC111022880 [Momordica charantia]7.0e-12690.94Show/hide
Query:  PPRRTDRPAVINTIFGGPSGGQFGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLA
        PPRRTDRPAVINTIFGGPSGGQ GHKRKELARAARREVCIIREQ PTCPITFDGADLEEVHLPHNDAL+IA LIDHVVVRRVLV+GGASANILSLPTYLA
Subjt:  PPRRTDRPAVINTIFGGPSGGQFGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLA

Query:  LGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQATQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSNPNGVGTVRGEQTASRE
        LGWTRSQL++SPTPLVGFSGESVIPEGCIDLPVTLGQ+QT+ TQMAEFVV+DGRS YNAIFGRPIIHSFRA+PSTLHQVLKY  PNGVGTVRGEQ ASRE
Subjt:  LGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQATQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSNPNGVGTVRGEQTASRE

Query:  CYASALKGSSVCALETLASRDGTLEFEADLLRREFAAPTEELELVPLLSPEKQL
        CYA+ALKG SVCALETL  RDGTLEFEA+L R+EFAAPTEELELVPLLSPEKQL
Subjt:  CYASALKGSSVCALETLASRDGTLEFEADLLRREFAAPTEELELVPLLSPEKQL

XP_022157676.1 uncharacterized protein LOC111024332 [Momordica charantia]3.3e-12892.49Show/hide
Query:  PPRRTDRPAVINTIFGGPSGGQFGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLA
        PPRRTDRPAVINTIFGGPSGGQ GHKRKELAR ARREVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLA
Subjt:  PPRRTDRPAVINTIFGGPSGGQFGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLA

Query:  LGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQATQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSNPNGVGTVRGEQTASRE
        LGWTRSQLK+SPTPLVGFSGESVIPEGCIDLPVTLGQDQT+ TQM EFVV+DGRS YNAIFGRPIIHSFR +PSTLHQVLKYS PNGVGTVRGEQT SRE
Subjt:  LGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQATQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSNPNGVGTVRGEQTASRE

Query:  CYASALKGSSVCALETLASRDGTLEFEADLLRREFAAPTEELELVPLLSPEKQ
        CYA+ALKGSSVCALETL  RDGTLE EADL R+EFAAPTEELELVPLLSPEKQ
Subjt:  CYASALKGSSVCALETLASRDGTLEFEADLLRREFAAPTEELELVPLLSPEKQ

TrEMBL top hitse value%identityAlignment
A0A6J1D9E1 uncharacterized protein LOC1110188237.8e-12391.94Show/hide
Query:  PPRRTDRPAVINTIFGGPSGGQFGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLA
        P RR DRPAVINTIFGGPSGGQ GHKRKELARAARREVCIIREQRPTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TYLA
Subjt:  PPRRTDRPAVINTIFGGPSGGQFGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLA

Query:  LGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQATQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSNPNGVGTVRGEQTASRE
        LGWTRSQLKKS TPLVGFS ESVIPEGCIDLPVTLG DQTQ TQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYS PNGVG VRGEQ ASRE
Subjt:  LGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQATQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSNPNGVGTVRGEQTASRE

Query:  CYASALKGSSVCALETLASRDGTLEFEADLLRREFAAPTEELELVPLL
        CYASALKGSSVCALETL SRDGTLEF+A+L RREFAAPTEELELVPLL
Subjt:  CYASALKGSSVCALETLASRDGTLEFEADLLRREFAAPTEELELVPLL

A0A6J1DCR3 uncharacterized protein LOC1110189065.8e-11082.17Show/hide
Query:  MPPRRTDRPAVINTIFGGPSGGQFGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYL
        MPPRR DR AVINTIF GPSGGQ G+KRKELAR ARREVCIIREQ+PTC I+F  ADLE VHLPHNDALVIAPLIDHV+VRRVLVDGGAS NILSL TYL
Subjt:  MPPRRTDRPAVINTIFGGPSGGQFGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYL

Query:  ALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQATQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSNPNGVGTVRGEQTASR
        ALGWTRSQLKKSPTPLVGFSGE V PEGCIDLPV +GQD TQ TQMAEFVVI GRSAYNAIFGRPIIHSFRA+ STLHQVLKYS PNGVGTVRGEQ  SR
Subjt:  ALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQATQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSNPNGVGTVRGEQTASR

Query:  ECYASALKGSSVCALETLASRDGTLEFEADLL---RREFAAPTEELELVPLLSPEKQL
        ECYASALKGSSV ALE  AS D   + EA+LL   +REF+APTE+LELVPLLSP++Q+
Subjt:  ECYASALKGSSVCALETLASRDGTLEFEADLL---RREFAAPTEELELVPLLSPEKQL

A0A6J1DD03 uncharacterized protein LOC1110198992.9e-13092.91Show/hide
Query:  PPRRTDRPAVINTIFGGPSGGQFGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLA
        PPRRTDRPAVINTIFGGPSGGQ GHKRK+LARAARREVCIIREQRPTCPITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLA
Subjt:  PPRRTDRPAVINTIFGGPSGGQFGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLA

Query:  LGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQATQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSNPNGVGTVRGEQTASRE
        LGWTRSQLKKSPTPLVGFSGESV+PEGCIDLPVTLGQDQT+ TQMAEFVV+DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYS PNGVGTVRGEQTASRE
Subjt:  LGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQATQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSNPNGVGTVRGEQTASRE

Query:  CYASALKGSSVCALETLASRDGTLEFEADLLRREFAAPTEELELVPLLSPEKQL
        CYAS LKG+SVCALETL SRDGTLEFEADL  REFAAP EELELVPLLS EKQ+
Subjt:  CYASALKGSSVCALETLASRDGTLEFEADLLRREFAAPTEELELVPLLSPEKQL

A0A6J1DT04 uncharacterized protein LOC1110228803.4e-12690.94Show/hide
Query:  PPRRTDRPAVINTIFGGPSGGQFGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLA
        PPRRTDRPAVINTIFGGPSGGQ GHKRKELARAARREVCIIREQ PTCPITFDGADLEEVHLPHNDAL+IA LIDHVVVRRVLV+GGASANILSLPTYLA
Subjt:  PPRRTDRPAVINTIFGGPSGGQFGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLA

Query:  LGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQATQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSNPNGVGTVRGEQTASRE
        LGWTRSQL++SPTPLVGFSGESVIPEGCIDLPVTLGQ+QT+ TQMAEFVV+DGRS YNAIFGRPIIHSFRA+PSTLHQVLKY  PNGVGTVRGEQ ASRE
Subjt:  LGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQATQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSNPNGVGTVRGEQTASRE

Query:  CYASALKGSSVCALETLASRDGTLEFEADLLRREFAAPTEELELVPLLSPEKQL
        CYA+ALKG SVCALETL  RDGTLEFEA+L R+EFAAPTEELELVPLLSPEKQL
Subjt:  CYASALKGSSVCALETLASRDGTLEFEADLLRREFAAPTEELELVPLLSPEKQL

A0A6J1DYW5 uncharacterized protein LOC1110243321.6e-12892.49Show/hide
Query:  PPRRTDRPAVINTIFGGPSGGQFGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLA
        PPRRTDRPAVINTIFGGPSGGQ GHKRKELAR ARREVCIIREQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLA
Subjt:  PPRRTDRPAVINTIFGGPSGGQFGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLA

Query:  LGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQATQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSNPNGVGTVRGEQTASRE
        LGWTRSQLK+SPTPLVGFSGESVIPEGCIDLPVTLGQDQT+ TQM EFVV+DGRS YNAIFGRPIIHSFR +PSTLHQVLKYS PNGVGTVRGEQT SRE
Subjt:  LGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQATQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSNPNGVGTVRGEQTASRE

Query:  CYASALKGSSVCALETLASRDGTLEFEADLLRREFAAPTEELELVPLLSPEKQ
        CYA+ALKGSSVCALETL  RDGTLE EADL R+EFAAPTEELELVPLLSPEKQ
Subjt:  CYASALKGSSVCALETLASRDGTLEFEADLLRREFAAPTEELELVPLLSPEKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTTCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCG
CGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCTATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCT
TGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAG
AAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCAGGCCACCCAAATGGC
CGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCCTTCCCTCGACACTTCATCAAGTTTTGAAGTATT
CCAACCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCGCCAGT
AGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCTGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAATTAGCATC
AGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGAACTCAGAGCCAGATCTGATGGAGATCGACGCTCCAGAATCTTCATGGATGG
ACCCGATTGTGGACTTTATTAGGGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAGGAGGGTCCAAACCCATGTGGGTGCCCTTGACCCGACTTGGGAGGGGCCGTTT
GAGGTCAAGGGCATAGTCCGATCTGGGACGTACATATTGGCCGATCGAAAGGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTTCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCG
CGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCTATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCTCCCT
TGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAG
AAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCAGGCCACCCAAATGGC
CGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCCTTCCCTCGACACTTCATCAAGTTTTGAAGTATT
CCAACCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCGCCAGT
AGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCTGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAATTAGCATC
AGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGAACTCAGAGCCAGATCTGATGGAGATCGACGCTCCAGAATCTTCATGGATGG
ACCCGATTGTGGACTTTATTAGGGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAGGAGGGTCCAAACCCATGTGGGTGCCCTTGACCCGACTTGGGAGGGGCCGTTT
GAGGTCAAGGGCATAGTCCGATCTGGGACGTACATATTGGCCGATCGAAAGGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequenceShow/hide protein sequence
MPPRRTDRPAVINTIFGGPSGGQFGHKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLK
KSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQATQMAEFVVIDGRSAYNAIFGRPIIHSFRALPSTLHQVLKYSNPNGVGTVRGEQTASRECYASALKGSSVCALETLAS
RDGTLEFEADLLRREFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSNSEPDLMEIDAPESSWMDPIVDFIRGNSPQDPKERRRRVQTHVGALDPTWEGPF
EVKGIVRSGTYILADRKGDVLAHPWNAEHLKRYYP