CuGenDBv2

Moc09g29340 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g29340
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: DUF4219 domain-containing protein
Genome location: chr9:22085729..22089399
RNA-Seq Expression: Moc09g29340
Synteny: Moc09g29340
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
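The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, the following Python snippet splits such a string into its parts and computes the locus span. The function names `parse_location` and `span_length` are hypothetical helpers, not part of CuGenDB, and the span calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr9:22085729..22089399")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the locus spans more base pairs than the CDS listed further down, which would be consistent with introns or untranslated regions inside the gene model.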


Homology
GenBank top hits (e value, % identity, alignment)
DAD45748.1 TPA_asm: hypothetical protein HUJ06_003978 [Nelumbo nucifera] (e value: 2.2e-22; % identity: 27.84)
Query:  MNGG---NNICVDKLTNDNYCFWRLCAKE-----------VEDE-------------------KCGKALFALRTSIGKEYIERVHDEKFRKQVWDVLERL
        MNGG   +NICVDKL  +NY +W+LC +              DE                   KCGKALFALRT I KEYIE VHD K  KQ+W+ LE L
Subjt:  MNGG---NNICVDKLTNDNYCFWRLCAKE-----------VEDE-------------------KCGKALFALRTSIGKEYIERVHDEKFRKQVWDVLERL

Query:  FNSKEH---------------------------------------------DEVEDVLYVKDKGKDNSYSKRSSNDSNHSKTEGWSIANIKRCFRY----
        F  K                                                +VED L++  + K N  SK   NDS  ++ EG S  N K C+R     
Subjt:  FNSKEH---------------------------------------------DEVEDVLYVKDKGKDNSYSKRSSNDSNHSKTEGWSIANIKRCFRY----

Query:  ----------------------------------------------------KHIMEREYTNIS--------W-------------------------KK
                                                             H+     T+++        W                         K+
Subjt:  ----------------------------------------------------KHIMEREYTNIS--------W-------------------------KK

Query:  INCDADNSLHPVFEEGYVDV-EN----------------GAPK-------------------------------FADIFLPGKMKDSLYVLSASDAYVEN
            ADNSLHPV  EG  +V EN                G  K                                ADI   GK K+SLYVLSA+DAYV+ 
Subjt:  INCDADNSLHPVFEEGYVDV-EN----------------GAPK-------------------------------FADIFLPGKMKDSLYVLSASDAYVEN

Query:  TGQNDSVAVWHTQLGHVDYQKLQRISTKKFL
        TGQN S A+WH +LGH+ YQ LQ+IS++K L
Subjt:  TGQNDSVAVWHTQLGHVDYQKLQRISTKKFL

KAA8538375.1 hypothetical protein F0562_028079 [Nyssa sinensis] (e value: 6.3e-22; % identity: 34.52)
Query:  MNGGNNICVDKLTNDNYCFWRLCAKEV------------------EDE------------KCGKALFALRTSIGKEYIERVHDEKFRKQVWDVLERLFNS
        MN G+++ VDKL  +NY +W+LC +                    ED             KCGKALFALRTSI +EYI+ V D K           L N 
Subjt:  MNGGNNICVDKLTNDNYCFWRLCAKEV------------------EDE------------KCGKALFALRTSIGKEYIERVHDEKFRKQVWDVLERLFNS

Query:  KEHDEVEDVLYVKDKGKDNSYSKRSSNDSNHSKTEGWSIANIKRCFRYKHIMEREYTNISWKKINCD-----ADNSLHPVFEEGYVDVENGAPKF-ADIF
          H      L  +D+      +  ++++S H   +  +    K       +  ++  ++   K N       AD+  + +F    V + +      AD+ 
Subjt:  KEHDEVEDVLYVKDKGKDNSYSKRSSNDSNHSKTEGWSIANIKRCFRYKHIMEREYTNISWKKINCD-----ADNSLHPVFEEGYVDVENGAPKF-ADIF

Query:  LPGKMKDSLYVLSASDAYVENTGQNDSVAVWHTQLGHVDYQKLQRISTKKFL
          GK KDSLYVLS SDAYVE TGQN SV +WH +LGHV YQ LQ+ISTKK L
Subjt:  LPGKMKDSLYVLSASDAYVENTGQNDSVAVWHTQLGHVDYQKLQRISTKKFL

KAA8540328.1 hypothetical protein F0562_024753 [Nyssa sinensis] (e value: 1.7e-22; % identity: 25.25)
Query:  MNGGNNICVDKLTNDNYCFWRLCAKEV------------------EDE------------KCGKALFALRTSIGKEYIERVHDEKFRKQVWDVLERLF--
        MNGG+++ +DKL  +NY +W+LC +                    ED             KCGKALFALRTSI +EYI+ V D K  KQVW+ LERLF  
Subjt:  MNGGNNICVDKLTNDNYCFWRLCAKEV------------------EDE------------KCGKALFALRTSIGKEYIERVHDEKFRKQVWDVLERLF--

Query:  ---------------------------------------------------------------------------------------------------N
                                                                                                           N
Subjt:  ---------------------------------------------------------------------------------------------------N

Query:  SKEHDEVEDVLYVKDKGKDNSYSKRSSNDSNHSKTEGWSIANIKRCFR----------------------YKHIMER---------------EYTNISW-
         +   +VED LY KDK K NS+SK SS D+  SKTEG S  N + C+R                        HI +                E+  + W 
Subjt:  SKEHDEVEDVLYVKDKGKDNSYSKRSSNDSNHSKTEGWSIANIKRCFR----------------------YKHIMER---------------EYTNISW-

Query:  --------------------------------------------------------------KKINCDADNSLHPVFEEGYVDVEN--------------
                                                                      K+    ADNSLHPV +EG  +V+               
Subjt:  --------------------------------------------------------------KKINCDADNSLHPVFEEGYVDVEN--------------

Query:  ---GAPK-------------------------------FADIFLPGKMKDSLYVLSASDAYVENTGQNDSVAVWHTQLGHVDYQKLQRISTKKFL
           G  K                                AD+   GK KDSLYVLSASDAYVE  GQN SV +WH +LGHV YQ L +ISTKK L
Subjt:  ---GAPK-------------------------------FADIFLPGKMKDSLYVLSASDAYVENTGQNDSVAVWHTQLGHVDYQKLQRISTKKFL

KAA8549858.1 hypothetical protein F0562_001542 [Nyssa sinensis] (e value: 1.7e-19; % identity: 24.65)
Query:  MNGGNNICVDKLTNDNYCFWRLCAKE------------------VEDE------------KCGKALFALRTSIGKEYIERVHDEKFRKQVWDVLERLF--
        MNGG+++ VDKL  +NY + +LC +                   +ED             K GKALFALRTSI +EYI+ V D K  KQVW  LERLF  
Subjt:  MNGGNNICVDKLTNDNYCFWRLCAKE------------------VEDE------------KCGKALFALRTSIGKEYIERVHDEKFRKQVWDVLERLF--

Query:  ---------------------------------------------------------------------------------------------------N
                                                                                                           N
Subjt:  ---------------------------------------------------------------------------------------------------N

Query:  SKEHDEVEDVLYVKDKGKDNSYSKRSSNDSNHSKTEGWSIANIK----------------------RCFRYKHIMER---------------EYTNISW-
         +   +VED LY KDK K NS+SK SS DS  SKT+G S  N K                      RC +  HI +                ++  + W 
Subjt:  SKEHDEVEDVLYVKDKGKDNSYSKRSSNDSNHSKTEGWSIANIK----------------------RCFRYKHIMER---------------EYTNISW-

Query:  --------------------------------------------------------------KKINCDADNSLHPVFEEGYVDVENGAPKF---------
                                                                      K+    ADNSLHP+ +EG  +V+               
Subjt:  --------------------------------------------------------------KKINCDADNSLHPVFEEGYVDVENGAPKF---------

Query:  ---------------------------------------ADIFLPGKMKDSLYVLSASDAYVENTGQNDSVAVWHTQLGHVDYQKLQRISTKKFL
                                               AD+   GK KDSLYVLSASDAYVE TGQN S+ +WH +LGHV YQ LQ+ISTKK L
Subjt:  ---------------------------------------ADIFLPGKMKDSLYVLSASDAYVENTGQNDSVAVWHTQLGHVDYQKLQRISTKKFL

KAG6391252.1 hypothetical protein SASPL_149005 [Salvia splendens] (e value: 1.3e-19; % identity: 29.47)
Query:  MNGGNNICVDKLTNDNYCFWRLCAKEV------------EDE-----------------KCGKALFALRTSIGKEYIERVH-DEK---------------
        MNGG++I VDKL  + Y +W+LC +              ED+                 KCGKALFALRTSI K+YIE V  DEK               
Subjt:  MNGGNNICVDKLTNDNYCFWRLCAKEV------------EDE-----------------KCGKALFALRTSIGKEYIERVH-DEK---------------

Query:  FRKQVWDVLERLF----------------------------NSKEHDEVEDVLYVKDKGKDNSYSKR--SSNDSNHSKTEGWSIANIKRCFRYKHIMERE
         R++    +  ++                            N++    VE+VLY KDKGK    SK   +   + +   +  +  NI   + Y+ I++  
Subjt:  FRKQVWDVLERLF----------------------------NSKEHDEVEDVLYVKDKGKDNSYSKR--SSNDSNHSKTEGWSIANIKRCFRYKHIMERE

Query:  YTN--------------ISWKKINCDADNSLHPVFEEGYVDVENGAPKFADIFLPGKMKDSL--------------YVLSASDAYVENTGQNDSVAVWHT
         ++               + KK+   ADNSLHPV  EG   +E          +PG  K+                YVLSA++AY E   Q DS  +WH 
Subjt:  YTN--------------ISWKKINCDADNSLHPVFEEGYVDVENGAPKFADIFLPGKMKDSL--------------YVLSASDAYVENTGQNDSVAVWHT

Query:  QLGHVDYQKLQRISTKKFL
        +L HV +Q LQ+IS KK L
Subjt:  QLGHVDYQKLQRISTKKFL

TrEMBL top hits (e value, % identity, alignment)
A0A5J5ATF7 gag_pre-integrs domain-containing protein (e value: 6.0e-18; % identity: 26.81)
Query:  MNGGNNICVDKLTNDNYCFWRLCAKEVEDE-----KCGKALFALRTSIGKEYIERVHDEKFRKQVWDVLERLF---------------------------
        MNGG++   D ++ D+    +   + VE +     KCGKALFALRTSI +EYIE V D K  KQVW+ LERLF                           
Subjt:  MNGGNNICVDKLTNDNYCFWRLCAKEVEDE-----KCGKALFALRTSIGKEYIERVHDEKFRKQVWDVLERLF---------------------------

Query:  --------------------------------------------------------------------------NSKEHDEVEDVLYVKDKGKDNSYSKR
                                                                                  N +   +VED  Y KDK K NS+SK 
Subjt:  --------------------------------------------------------------------------NSKEHDEVEDVLYVKDKGKDNSYSKR

Query:  SSNDSNHSKTEGWSIANIKRCFRYKHIME-------------REYTNIS------W-------------------------KKINCDADNSLHPVFEEG-
        SS  +   +T  +     ++C   + I +               Y N S      W                         K+    A NSLHP  +EG 
Subjt:  SSNDSNHSKTEGWSIANIKRCFRYKHIME-------------REYTNIS------W-------------------------KKINCDADNSLHPVFEEG-

Query:  ----------------------------------------------YVDVENGAPKF-ADIFLPGKMKDSLYVLSASDAYVENTGQNDSVAVWHTQLGHV
                                                      YV + +    F  D+   GK KDSLYVLSASDAYVE TGQN SV +WH +LGHV
Subjt:  ----------------------------------------------YVDVENGAPKF-ADIFLPGKMKDSLYVLSASDAYVENTGQNDSVAVWHTQLGHV

Query:  DYQKLQRISTKKFL
         YQ LQ+ISTKK L
Subjt:  DYQKLQRISTKKFL

A0A5J5B552 Uncharacterized protein (e value: 3.0e-22; % identity: 34.52)
Query:  MNGGNNICVDKLTNDNYCFWRLCAKEV------------------EDE------------KCGKALFALRTSIGKEYIERVHDEKFRKQVWDVLERLFNS
        MN G+++ VDKL  +NY +W+LC +                    ED             KCGKALFALRTSI +EYI+ V D K           L N 
Subjt:  MNGGNNICVDKLTNDNYCFWRLCAKEV------------------EDE------------KCGKALFALRTSIGKEYIERVHDEKFRKQVWDVLERLFNS

Query:  KEHDEVEDVLYVKDKGKDNSYSKRSSNDSNHSKTEGWSIANIKRCFRYKHIMEREYTNISWKKINCD-----ADNSLHPVFEEGYVDVENGAPKF-ADIF
          H      L  +D+      +  ++++S H   +  +    K       +  ++  ++   K N       AD+  + +F    V + +      AD+ 
Subjt:  KEHDEVEDVLYVKDKGKDNSYSKRSSNDSNHSKTEGWSIANIKRCFRYKHIMEREYTNISWKKINCD-----ADNSLHPVFEEGYVDVENGAPKF-ADIF

Query:  LPGKMKDSLYVLSASDAYVENTGQNDSVAVWHTQLGHVDYQKLQRISTKKFL
          GK KDSLYVLS SDAYVE TGQN SV +WH +LGHV YQ LQ+ISTKK L
Subjt:  LPGKMKDSLYVLSASDAYVENTGQNDSVAVWHTQLGHVDYQKLQRISTKKFL

A0A5J5B8Q2 DUF4219 domain-containing protein (e value: 5.0e-17; % identity: 32.89)
Query:  MNGGNNICVDKLTNDNYCFWRLCAKEV------------------EDE------------KCGKALFALRTSIGKEYIERVHDEKFRKQVWDVLERLFNS
        MNGG+++  DKL ++NY +W+LC +                    ED             KCGKALFALRTSI +EYIE + D K  KQVW+ LER+   
Subjt:  MNGGNNICVDKLTNDNYCFWRLCAKEV------------------EDE------------KCGKALFALRTSIGKEYIERVHDEKFRKQVWDVLERLFNS

Query:  KEHDEVEDVLYVKDKGKDNSYSKRSSNDSNHSKTEGWSIANIKRCFRYKHIMEREYTNISWKKINCDADNSLHPVFEEGYVDVENGAPKF-ADIFLPGKM
            E            + + +K  SN    S  + + +  +K+               +   ++  AD+  + +F    V + +      AD+   GK 
Subjt:  KEHDEVEDVLYVKDKGKDNSYSKRSSNDSNHSKTEGWSIANIKRCFRYKHIMEREYTNISWKKINCDADNSLHPVFEEGYVDVENGAPKF-ADIFLPGKM

Query:  KDSLYVLSASDAYVENTGQNDSVAV
        KDSLYVLS SDAYVE TGQN SV +
Subjt:  KDSLYVLSASDAYVENTGQNDSVAV

A0A5J5BCB3 Uncharacterized protein (e value: 8.0e-23; % identity: 25.25)
Query:  MNGGNNICVDKLTNDNYCFWRLCAKEV------------------EDE------------KCGKALFALRTSIGKEYIERVHDEKFRKQVWDVLERLF--
        MNGG+++ +DKL  +NY +W+LC +                    ED             KCGKALFALRTSI +EYI+ V D K  KQVW+ LERLF  
Subjt:  MNGGNNICVDKLTNDNYCFWRLCAKEV------------------EDE------------KCGKALFALRTSIGKEYIERVHDEKFRKQVWDVLERLF--

Query:  ---------------------------------------------------------------------------------------------------N
                                                                                                           N
Subjt:  ---------------------------------------------------------------------------------------------------N

Query:  SKEHDEVEDVLYVKDKGKDNSYSKRSSNDSNHSKTEGWSIANIKRCFR----------------------YKHIMER---------------EYTNISW-
         +   +VED LY KDK K NS+SK SS D+  SKTEG S  N + C+R                        HI +                E+  + W 
Subjt:  SKEHDEVEDVLYVKDKGKDNSYSKRSSNDSNHSKTEGWSIANIKRCFR----------------------YKHIMER---------------EYTNISW-

Query:  --------------------------------------------------------------KKINCDADNSLHPVFEEGYVDVEN--------------
                                                                      K+    ADNSLHPV +EG  +V+               
Subjt:  --------------------------------------------------------------KKINCDADNSLHPVFEEGYVDVEN--------------

Query:  ---GAPK-------------------------------FADIFLPGKMKDSLYVLSASDAYVENTGQNDSVAVWHTQLGHVDYQKLQRISTKKFL
           G  K                                AD+   GK KDSLYVLSASDAYVE  GQN SV +WH +LGHV YQ L +ISTKK L
Subjt:  ---GAPK-------------------------------FADIFLPGKMKDSLYVLSASDAYVENTGQNDSVAVWHTQLGHVDYQKLQRISTKKFL

A0A5J5C3K7 Uncharacterized protein (e value: 8.3e-20; % identity: 24.65)
Query:  MNGGNNICVDKLTNDNYCFWRLCAKE------------------VEDE------------KCGKALFALRTSIGKEYIERVHDEKFRKQVWDVLERLF--
        MNGG+++ VDKL  +NY + +LC +                   +ED             K GKALFALRTSI +EYI+ V D K  KQVW  LERLF  
Subjt:  MNGGNNICVDKLTNDNYCFWRLCAKE------------------VEDE------------KCGKALFALRTSIGKEYIERVHDEKFRKQVWDVLERLF--

Query:  ---------------------------------------------------------------------------------------------------N
                                                                                                           N
Subjt:  ---------------------------------------------------------------------------------------------------N

Query:  SKEHDEVEDVLYVKDKGKDNSYSKRSSNDSNHSKTEGWSIANIK----------------------RCFRYKHIMER---------------EYTNISW-
         +   +VED LY KDK K NS+SK SS DS  SKT+G S  N K                      RC +  HI +                ++  + W 
Subjt:  SKEHDEVEDVLYVKDKGKDNSYSKRSSNDSNHSKTEGWSIANIK----------------------RCFRYKHIMER---------------EYTNISW-

Query:  --------------------------------------------------------------KKINCDADNSLHPVFEEGYVDVENGAPKF---------
                                                                      K+    ADNSLHP+ +EG  +V+               
Subjt:  --------------------------------------------------------------KKINCDADNSLHPVFEEGYVDVENGAPKF---------

Query:  ---------------------------------------ADIFLPGKMKDSLYVLSASDAYVENTGQNDSVAVWHTQLGHVDYQKLQRISTKKFL
                                               AD+   GK KDSLYVLSASDAYVE TGQN S+ +WH +LGHV YQ LQ+ISTKK L
Subjt:  ---------------------------------------ADIFLPGKMKDSLYVLSASDAYVENTGQNDSVAVWHTQLGHVDYQKLQRISTKKFL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAACGGTGGAAACAATATTTGTGTAGACAAGCTTACCAATGACAACTATTGCTTTTGGAGGCTATGTGCGAAAGAAGTGGAAGATGAAAAGTGTGGCAAAGCATTATT
TGCTTTGCGAACTTCTATTGGTAAGGAGTATATCGAGCGTGTTCATGACGAAAAGTTTCGAAAACAAGTGTGGGATGTACTTGAAAGGTTGTTCAACTCAAAAGAACATG
ATGAGGTGGAAGATGTTCTTTATGTAAAAGACAAAGGAAAAGACAATTCGTATTCCAAGCGTTCTTCAAATGACAGCAACCATTCCAAGACTGAAGGGTGGTCCATAGCT
AATATAAAAAGATGTTTTAGGTACAAACATATCATGGAAAGAGAATATACAAACATATCATGGAAAAAGATTAATTGCGACGCTGATAATTCCTTACATCCTGTTTTTGA
AGAAGGATATGTTGATGTTGAGAATGGTGCTCCAAAATTTGCTGATATTTTTTTACCTGGAAAGATGAAAGATTCCCTTTATGTTTTATCTGCAAGTGATGCATACGTTG
AAAATACAGGTCAGAATGATAGTGTAGCAGTTTGGCACACTCAATTGGGCCATGTTGATTATCAAAAGTTACAACGAATTTCTACAAAGAAGTTTTTGGGTGACAAGCTT
ACCGGTGACAAATATAGCTATTGGAGGCTATGTATGGAAGCTTTTTTACAAGGGCATGATTTGTGGAATCTTATCTCTGGTGAAGATAGTGTAATTCCATATGATAATCC
ACAAAATACTGAGGAAGCCTTGGTGAAACAAGTGCTTGGTGATAACAAGCGGTCTTCTCATGAGGTAGAAGGTATTCTTTGTACAAGAGACAAAATAGAAAGAAGTTGTA
TTTTTGAATCTTCTACAAATGATAGCAAGCATTCTGAAAATGAAAGGCTGTCCAAAGGTGAAGTTCATCGAAGAGTTTCTTTCGGTCAGTTGAAAGAATTTGCATGGAGC
AAAAGCTTTGAGGGAGATGCTTTGCAACATGAGAATTTGAGTACCGTTGTCTACTGTTCTCGAAAACTTAAGCTTGAGGACCTTACTTTGGATTGGCCTACTCAAATGTA
TTACCTTGAAATAATTGGGATAACAACTCAGTATAGGATATTGGGATCGGATGAAAGCTCTCTCGCTTTAGTTGCCTCCGACTACTATGATTATCTGGTAATGTTGGCAA
CCTTAGTTATGGGTCATACTGCAGAGCCAATGAGTTTATGGGTACTGTAG
mRNA sequence
ATGAACGGTGGAAACAATATTTGTGTAGACAAGCTTACCAATGACAACTATTGCTTTTGGAGGCTATGTGCGAAAGAAGTGGAAGATGAAAAGTGTGGCAAAGCATTATT
TGCTTTGCGAACTTCTATTGGTAAGGAGTATATCGAGCGTGTTCATGACGAAAAGTTTCGAAAACAAGTGTGGGATGTACTTGAAAGGTTGTTCAACTCAAAAGAACATG
ATGAGGTGGAAGATGTTCTTTATGTAAAAGACAAAGGAAAAGACAATTCGTATTCCAAGCGTTCTTCAAATGACAGCAACCATTCCAAGACTGAAGGGTGGTCCATAGCT
AATATAAAAAGATGTTTTAGGTACAAACATATCATGGAAAGAGAATATACAAACATATCATGGAAAAAGATTAATTGCGACGCTGATAATTCCTTACATCCTGTTTTTGA
AGAAGGATATGTTGATGTTGAGAATGGTGCTCCAAAATTTGCTGATATTTTTTTACCTGGAAAGATGAAAGATTCCCTTTATGTTTTATCTGCAAGTGATGCATACGTTG
AAAATACAGGTCAGAATGATAGTGTAGCAGTTTGGCACACTCAATTGGGCCATGTTGATTATCAAAAGTTACAACGAATTTCTACAAAGAAGTTTTTGGGTGACAAGCTT
ACCGGTGACAAATATAGCTATTGGAGGCTATGTATGGAAGCTTTTTTACAAGGGCATGATTTGTGGAATCTTATCTCTGGTGAAGATAGTGTAATTCCATATGATAATCC
ACAAAATACTGAGGAAGCCTTGGTGAAACAAGTGCTTGGTGATAACAAGCGGTCTTCTCATGAGGTAGAAGGTATTCTTTGTACAAGAGACAAAATAGAAAGAAGTTGTA
TTTTTGAATCTTCTACAAATGATAGCAAGCATTCTGAAAATGAAAGGCTGTCCAAAGGTGAAGTTCATCGAAGAGTTTCTTTCGGTCAGTTGAAAGAATTTGCATGGAGC
AAAAGCTTTGAGGGAGATGCTTTGCAACATGAGAATTTGAGTACCGTTGTCTACTGTTCTCGAAAACTTAAGCTTGAGGACCTTACTTTGGATTGGCCTACTCAAATGTA
TTACCTTGAAATAATTGGGATAACAACTCAGTATAGGATATTGGGATCGGATGAAAGCTCTCTCGCTTTAGTTGCCTCCGACTACTATGATTATCTGGTAATGTTGGCAA
CCTTAGTTATGGGTCATACTGCAGAGCCAATGAGTTTATGGGTACTGTAG
Protein sequence
MNGGNNICVDKLTNDNYCFWRLCAKEVEDEKCGKALFALRTSIGKEYIERVHDEKFRKQVWDVLERLFNSKEHDEVEDVLYVKDKGKDNSYSKRSSNDSNHSKTEGWSIA
NIKRCFRYKHIMEREYTNISWKKINCDADNSLHPVFEEGYVDVENGAPKFADIFLPGKMKDSLYVLSASDAYVENTGQNDSVAVWHTQLGHVDYQKLQRISTKKFLGDKL
TGDKYSYWRLCMEAFLQGHDLWNLISGEDSVIPYDNPQNTEEALVKQVLGDNKRSSHEVEGILCTRDKIERSCIFESSTNDSKHSENERLSKGEVHRRVSFGQLKEFAWS
KSFEGDALQHENLSTVVYCSRKLKLEDLTLDWPTQMYYLEIIGITTQYRILGSDESSLALVASDYYDYLVMLATLVMGHTAEPMSLWVL
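
The protein sequence above should correspond to a codon-by-codon translation of the CDS sequence. The following Python sketch shows one way to check this, assuming the standard nuclear genetic code applies. The names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB; translation stops at the first stop codon, and any codon not in the table (for example one containing N) is reported as X.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Remove line breaks and other whitespace from a wrapped sequence.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed on this page.
print(translate("ATGAACGGTGGAAACAATATTTGTGTAGAC"))
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence listed here is a quick consistency check on the gene model.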