; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021812 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021812
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionHomeobox domain-containing protein
Genome locationscaffold2:4407796..4408964
RNA-Seq ExpressionSpg021812
SyntenySpg021812
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003677 - DNA binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600669.1 Homeobox-leucine zipper protein ATHB-6, partial [Cucurbita argyrosperma subsp. sororia]5.2e-6867.24Show/hide
Query:  NLKLSYETLHQDNQALLKEVQLQITRTQFFLFTNIGNAMILMETVKLCLHIRELKAKLQEDNSESNLSMEEEMVVPADSENSLIEQVKPEIADQFTVPPA
        NLKL+YETL  DNQALLKE+Q                               ELK KLQEDNSESNLS+EEE  VPADSEN+LIEQ+KPEI DQF+VP A
Subjt:  NLKLSYETLHQDNQALLKEVQLQITRTQFFLFTNIGNAMILMETVKLCLHIRELKAKLQEDNSESNLSMEEEMVVPADSENSLIEQVKPEIADQFTVPPA

Query:  IESQGFNYESFDNNGGEGEEAPTEKASLFRDFKDGSSDSDSSAILNEDYSPTAAI-----LNHHHHHFMT-AASPSPSAAVKLNCVTTTLSYLQYQKGYR
         ESQ FNY S  NNGGEGEE      SLF DFKDGSSDSDSSAILNEDY PT AI     L H+H HFMT AAS SPS  VKLNC TT L+YLQYQKGY+
Subjt:  IESQGFNYESFDNNGGEGEEAPTEKASLFRDFKDGSSDSDSSAILNEDYSPTAAI-----LNHHHHHFMT-AASPSPSAAVKLNCVTTTLSYLQYQKGYR

Query:  -QTQMFPKMEEHNFFSGEEACNFFSDEQAPTL
         QTQMFPKMEEHNFFSGEE CNFFSDEQAPTL
Subjt:  -QTQMFPKMEEHNFFSGEEACNFFSDEQAPTL

KAG7031308.1 Homeobox-leucine zipper protein ATHB-6 [Cucurbita argyrosperma subsp. argyrosperma]6.1e-6967.67Show/hide
Query:  NLKLSYETLHQDNQALLKEVQLQITRTQFFLFTNIGNAMILMETVKLCLHIRELKAKLQEDNSESNLSMEEEMVVPADSENSLIEQVKPEIADQFTVPPA
        NLKL+YETL  DNQALLKE+Q                               ELK KLQEDNSESNLS+EEE  VPADSEN+LIEQ+KPEI DQF+VP A
Subjt:  NLKLSYETLHQDNQALLKEVQLQITRTQFFLFTNIGNAMILMETVKLCLHIRELKAKLQEDNSESNLSMEEEMVVPADSENSLIEQVKPEIADQFTVPPA

Query:  IESQGFNYESFDNNGGEGEEAPTEKASLFRDFKDGSSDSDSSAILNEDYSPTAAI-----LNHHHHHFMT-AASPSPSAAVKLNCVTTTLSYLQYQKGYR
         ESQ FNY S  NNGGEGEE      SLF DFKDGSSDSDSSAILNEDY PT AI     L H+H HFMT AASPSPS  VKLNC TT L+YLQYQKGY+
Subjt:  IESQGFNYESFDNNGGEGEEAPTEKASLFRDFKDGSSDSDSSAILNEDYSPTAAI-----LNHHHHHFMT-AASPSPSAAVKLNCVTTTLSYLQYQKGYR

Query:  -QTQMFPKMEEHNFFSGEEACNFFSDEQAPTL
         QTQMFPKMEEHNFFSGEE CNFFSDEQAPTL
Subjt:  -QTQMFPKMEEHNFFSGEEACNFFSDEQAPTL

XP_022941752.1 homeobox-leucine zipper protein ATHB-6-like [Cucurbita moschata]3.0e-6867.24Show/hide
Query:  NLKLSYETLHQDNQALLKEVQLQITRTQFFLFTNIGNAMILMETVKLCLHIRELKAKLQEDNSESNLSMEEEMVVPADSENSLIEQVKPEIADQFTVPPA
        NLKL+YETL  DNQALLKE+Q                               ELK KLQEDNSESNLS+EEE  VPADSEN+LIEQ+KPEI DQF+VP A
Subjt:  NLKLSYETLHQDNQALLKEVQLQITRTQFFLFTNIGNAMILMETVKLCLHIRELKAKLQEDNSESNLSMEEEMVVPADSENSLIEQVKPEIADQFTVPPA

Query:  IESQGFNYESFDNNGGEGEEAPTEKASLFRDFKDGSSDSDSSAILNEDYSPTAAI-----LNHHHHHFMT-AASPSPSAAVKLNCVTTTLSYLQYQKGY-
         ESQ FNY S  NNGGEGEE      SLF DFKDGSSDSDSSAILNEDY PT AI     L H+H HFMT AASPSPS  VKLNC TT L+YLQYQKGY 
Subjt:  IESQGFNYESFDNNGGEGEEAPTEKASLFRDFKDGSSDSDSSAILNEDYSPTAAI-----LNHHHHHFMT-AASPSPSAAVKLNCVTTTLSYLQYQKGY-

Query:  RQTQMFPKMEEHNFFSGEEACNFFSDEQAPTL
        +Q+QMFPKMEEHNFFSGEE CNFFSDEQAPTL
Subjt:  RQTQMFPKMEEHNFFSGEEACNFFSDEQAPTL

XP_022999792.1 homeobox-leucine zipper protein ATHB-6-like [Cucurbita maxima]9.5e-7068.1Show/hide
Query:  NLKLSYETLHQDNQALLKEVQLQITRTQFFLFTNIGNAMILMETVKLCLHIRELKAKLQEDNSESNLSMEEEMVVPADSENSLIEQVKPEIADQFTVPPA
        NLKL+YETL  DNQALLKE+Q                               ELK KLQEDNS+SNLS+EEEM VPADSEN+LIEQ+KPEI DQF+VP A
Subjt:  NLKLSYETLHQDNQALLKEVQLQITRTQFFLFTNIGNAMILMETVKLCLHIRELKAKLQEDNSESNLSMEEEMVVPADSENSLIEQVKPEIADQFTVPPA

Query:  IESQGFNYESFDNNGGEGEEAPTEKASLFRDFKDGSSDSDSSAILNEDYSPTAAI-----LNHHHHHFMT-AASPSPSAAVKLNCVTTTLSYLQYQKGY-
         ESQ FNYES  NNGGEGEE      SLF DFKDGSSDSDSSAILNEDY PT AI     L H+H HFMT AASPSPS  VKLNC TT L+YLQYQKGY 
Subjt:  IESQGFNYESFDNNGGEGEEAPTEKASLFRDFKDGSSDSDSSAILNEDYSPTAAI-----LNHHHHHFMT-AASPSPSAAVKLNCVTTTLSYLQYQKGY-

Query:  RQTQMFPKMEEHNFFSGEEACNFFSDEQAPTL
        +QTQMFPKMEEHNFFSGEE CNFFSDEQAPTL
Subjt:  RQTQMFPKMEEHNFFSGEEACNFFSDEQAPTL

XP_023547219.1 homeobox-leucine zipper protein ATHB-6-like [Cucurbita pepo subsp. pepo]2.1e-6967.67Show/hide
Query:  NLKLSYETLHQDNQALLKEVQLQITRTQFFLFTNIGNAMILMETVKLCLHIRELKAKLQEDNSESNLSMEEEMVVPADSENSLIEQVKPEIADQFTVPPA
        NLKL+YETL  DNQALLKE+Q                               ELK KLQEDNSESNLS+EEEM VPADSEN+LIEQ+KPEI DQF+VP A
Subjt:  NLKLSYETLHQDNQALLKEVQLQITRTQFFLFTNIGNAMILMETVKLCLHIRELKAKLQEDNSESNLSMEEEMVVPADSENSLIEQVKPEIADQFTVPPA

Query:  IESQGFNYESFDNNGGEGEEAPTEKASLFRDFKDGSSDSDSSAILNEDYSPTAAI-----LNHHHHHFMT-AASPSPSAAVKLNCVTTTLSYLQYQKGY-
         E+Q FNYES  +NGGEGEE      SLF DFKDGSSDSDSSAILNEDY PT AI     L H+H HFMT AASPSPS  VKLNC TT L+YLQYQKGY 
Subjt:  IESQGFNYESFDNNGGEGEEAPTEKASLFRDFKDGSSDSDSSAILNEDYSPTAAI-----LNHHHHHFMT-AASPSPSAAVKLNCVTTTLSYLQYQKGY-

Query:  RQTQMFPKMEEHNFFSGEEACNFFSDEQAPTL
        +QTQMFPKMEEHNFFSGEE CNFFSDEQAPTL
Subjt:  RQTQMFPKMEEHNFFSGEEACNFFSDEQAPTL

TrEMBL top hitse value%identityAlignment
A0A6J1C6W4 homeobox-leucine zipper protein ATHB-6-like5.4e-6362.93Show/hide
Query:  LKLSYETLHQDNQALLKEVQLQITRTQFFLFTNIGNAMILMETVKLCLHIRELKAKLQEDNSESNLSMEEEMVV-PADSENSLIEQVKPEIADQFTVPPA
        LKL+YETL QDN ALLKE                               IRELK+KLQEDNSESN+S+EEEMV+  ADSEN+LIE+ +PE  D F+VPPA
Subjt:  LKLSYETLHQDNQALLKEVQLQITRTQFFLFTNIGNAMILMETVKLCLHIRELKAKLQEDNSESNLSMEEEMVV-PADSENSLIEQVKPEIADQFTVPPA

Query:  IESQGFNYESFDNNGGEGEEAPTEKASLFRDFKDGSSDSDSSAILNEDYSPTAAI-----LNHHHHHFMTAA-SPSPSAAVKLNCVTTTLSYLQYQKGYR
         E +  NYESF+NNGGEGEE PTE+ASLF DFKDGSSDSDSSAILNEDYS TAAI     L + H+HFM A+ SPSPSAAVK NC T  L+Y Q+QK Y+
Subjt:  IESQGFNYESFDNNGGEGEEAPTEKASLFRDFKDGSSDSDSSAILNEDYSPTAAI-----LNHHHHHFMTAA-SPSPSAAVKLNCVTTTLSYLQYQKGYR

Query:  QTQMFPKMEEHNFFSGEE-ACNFFSDEQAPTL
        QTQ++PKMEEHNFF+GEE  CNFFS+EQAP+L
Subjt:  QTQMFPKMEEHNFFSGEE-ACNFFSDEQAPTL

A0A6J1ENE6 homeobox-leucine zipper protein ATHB-6-like1.6e-5157.87Show/hide
Query:  NLKLSYETLHQDNQALLKEVQLQITRTQFFLFTNIGNAMILMETVKLCLHIRELKAKLQEDNSESNLSMEEEMVVPADSENSLIEQVKPEIADQFTVPPA
        NLKLS+E L  DNQALLKE                               IRELKAK+QEDNS        EM+VPADSEN+LIEQ KPEI D F+VPPA
Subjt:  NLKLSYETLHQDNQALLKEVQLQITRTQFFLFTNIGNAMILMETVKLCLHIRELKAKLQEDNSESNLSMEEEMVVPADSENSLIEQVKPEIADQFTVPPA

Query:  IESQGFNYESFDNNGGEGEEAPTEKASLFRDFKDGSSDSDSSAILNEDYSPTAAILN----HHHHHFMTAASP--SPS---AAVKLNCVTTTLSYLQYQK
                 SF+NNGGEG+E PT         KDGSSDSDSSAILNEDYSPTA + +     +++HFMT A P  SPS   A VKLN  TT L+YLQ+QK
Subjt:  IESQGFNYESFDNNGGEGEEAPTEKASLFRDFKDGSSDSDSSAILNEDYSPTAAILN----HHHHHFMTAASP--SPS---AAVKLNCVTTTLSYLQYQK

Query:  GYRQTQ-MFPKMEEHNFFSGEEACNFFSDEQAPTL
        GY+QTQ MFPKMEEHNFF GEEACNFFSDEQAPTL
Subjt:  GYRQTQ-MFPKMEEHNFFSGEEACNFFSDEQAPTL

A0A6J1FNC9 homeobox-leucine zipper protein ATHB-6-like1.5e-6867.24Show/hide
Query:  NLKLSYETLHQDNQALLKEVQLQITRTQFFLFTNIGNAMILMETVKLCLHIRELKAKLQEDNSESNLSMEEEMVVPADSENSLIEQVKPEIADQFTVPPA
        NLKL+YETL  DNQALLKE+Q                               ELK KLQEDNSESNLS+EEE  VPADSEN+LIEQ+KPEI DQF+VP A
Subjt:  NLKLSYETLHQDNQALLKEVQLQITRTQFFLFTNIGNAMILMETVKLCLHIRELKAKLQEDNSESNLSMEEEMVVPADSENSLIEQVKPEIADQFTVPPA

Query:  IESQGFNYESFDNNGGEGEEAPTEKASLFRDFKDGSSDSDSSAILNEDYSPTAAI-----LNHHHHHFMT-AASPSPSAAVKLNCVTTTLSYLQYQKGY-
         ESQ FNY S  NNGGEGEE      SLF DFKDGSSDSDSSAILNEDY PT AI     L H+H HFMT AASPSPS  VKLNC TT L+YLQYQKGY 
Subjt:  IESQGFNYESFDNNGGEGEEAPTEKASLFRDFKDGSSDSDSSAILNEDYSPTAAI-----LNHHHHHFMT-AASPSPSAAVKLNCVTTTLSYLQYQKGY-

Query:  RQTQMFPKMEEHNFFSGEEACNFFSDEQAPTL
        +Q+QMFPKMEEHNFFSGEE CNFFSDEQAPTL
Subjt:  RQTQMFPKMEEHNFFSGEEACNFFSDEQAPTL

A0A6J1JAW1 homeobox-leucine zipper protein ATHB-6-like7.3e-5257.45Show/hide
Query:  NLKLSYETLHQDNQALLKEVQLQITRTQFFLFTNIGNAMILMETVKLCLHIRELKAKLQEDNSESNLSMEEEMVVPADSENSLIEQVKPEIADQFTVPPA
        NLKLS+E L  DNQALLKE                               IRELKAK+QEDNS        EM+ PADSEN+LIEQ KPEI D F+VPPA
Subjt:  NLKLSYETLHQDNQALLKEVQLQITRTQFFLFTNIGNAMILMETVKLCLHIRELKAKLQEDNSESNLSMEEEMVVPADSENSLIEQVKPEIADQFTVPPA

Query:  IESQGFNYESFDNNGGEGEEAPTEKASLFRDFKDGSSDSDSSAILNEDYSPTAAILN----HHHHHFMTAASP--SPS---AAVKLNCVTTTLSYLQYQK
                 SF+NNGGEG+E PT         KDGSSDSDSSAILNEDYSPTA + +     +++HFMT   P  SPS   A VKLNC TT L+YLQ+QK
Subjt:  IESQGFNYESFDNNGGEGEEAPTEKASLFRDFKDGSSDSDSSAILNEDYSPTAAILN----HHHHHFMTAASP--SPS---AAVKLNCVTTTLSYLQYQK

Query:  GYRQTQ-MFPKMEEHNFFSGEEACNFFSDEQAPTL
        GY+QTQ MFPKMEEHNFF GEEACNFFSDEQAPTL
Subjt:  GYRQTQ-MFPKMEEHNFFSGEEACNFFSDEQAPTL

A0A6J1KBS4 homeobox-leucine zipper protein ATHB-6-like4.6e-7068.1Show/hide
Query:  NLKLSYETLHQDNQALLKEVQLQITRTQFFLFTNIGNAMILMETVKLCLHIRELKAKLQEDNSESNLSMEEEMVVPADSENSLIEQVKPEIADQFTVPPA
        NLKL+YETL  DNQALLKE+Q                               ELK KLQEDNS+SNLS+EEEM VPADSEN+LIEQ+KPEI DQF+VP A
Subjt:  NLKLSYETLHQDNQALLKEVQLQITRTQFFLFTNIGNAMILMETVKLCLHIRELKAKLQEDNSESNLSMEEEMVVPADSENSLIEQVKPEIADQFTVPPA

Query:  IESQGFNYESFDNNGGEGEEAPTEKASLFRDFKDGSSDSDSSAILNEDYSPTAAI-----LNHHHHHFMT-AASPSPSAAVKLNCVTTTLSYLQYQKGY-
         ESQ FNYES  NNGGEGEE      SLF DFKDGSSDSDSSAILNEDY PT AI     L H+H HFMT AASPSPS  VKLNC TT L+YLQYQKGY 
Subjt:  IESQGFNYESFDNNGGEGEEAPTEKASLFRDFKDGSSDSDSSAILNEDYSPTAAI-----LNHHHHHFMT-AASPSPSAAVKLNCVTTTLSYLQYQKGY-

Query:  RQTQMFPKMEEHNFFSGEEACNFFSDEQAPTL
        +QTQMFPKMEEHNFFSGEE CNFFSDEQAPTL
Subjt:  RQTQMFPKMEEHNFFSGEEACNFFSDEQAPTL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G22430.1 homeobox protein 68.7e-0527.04Show/hide
Query:  NLKLSYETLHQDNQALLKEVQLQITRTQFFLFTNIGNAMILMETVKLCLHIRELKAKL-----QEDNSESNLSMEEEMVVPADSENSLIEQVKPEIADQF
        +L+ ++++L +DN++LL+E                               I +LK KL     +E+  E+N ++  E  +    E   + +   +I +  
Subjt:  NLKLSYETLHQDNQALLKEVQLQITRTQFFLFTNIGNAMILMETVKLCLHIRELKAKL-----QEDNSESNLSMEEEMVVPADSENSLIEQVKPEIADQF

Query:  TVPPAI--ESQGFNYESFDNNGGEGEEAPTEKASLFRDFKDGSSD-SDSSAILNEDYSPTAAILNHHHHHFMTAASPSPSAAVKLNCVTTTLSYLQYQKG
        + PP     S G NY SF +     +  P + A+       GSSD SDSSA+LNE+ S             +T A+P               ++ Q+ K 
Subjt:  TVPPAI--ESQGFNYESFDNNGGEGEEAPTEKASLFRDFKDGSSD-SDSSAILNEDYSPTAAILNHHHHHFMTAASPSPSAAVKLNCVTTTLSYLQYQKG

Query:  YRQTQMFPKMEEHNFFSGEEACNFFSDEQAPTL
          QT+     +  +F SGEEAC FFSDEQ P+L
Subjt:  YRQTQMFPKMEEHNFFSGEEACNFFSDEQAPTL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGTGCCGGCCGATTCTGATAATTCTCTGATTGAACAAGTAAAGCCGGAAATTGCTGATCAGTTCTCTGTTCCTCCGGCGATTGAATCCCAAGACTTCAATTACGA
GAGCTTCGACAACAATGGCGGAGAAGGGGAAGAGGCGCCGACGGAAGAGGCGACATTGTTCCGCGATTTCAAAGATGGGTCATCCAATAGCGATTCGAGCGCAATTTTGA
ACGAAGACTACTGCCCCACGGCGGCCATTCTCCGATTGAACAATGAATCCCAAGACTTCAATTACGAGAGTTTCAACAACAATGGCGGAGAAGGGGAAGAGGCGCCGACG
GAAGAGGCGACATTGTTCCGCGATTTCAAAGATGGGTCATCCGATAGCGATTCGAGCGCAATTATGAACGAAGATTACAGCCCCACGGCGGCCATTTATTCACCGGGGGT
GCTGCACAATCTCAAACTCAGTTATGAAACTCTCCATCAGGACAATCAAGCTCTTCTCAAAGAGGTACAATTACAGATCACACGAACCCAGTTTTTTCTTTTCACGAACA
TCGGGAATGCGATGATATTGATGGAGACTGTAAAATTGTGTTTGCATATTCGGGAACTGAAAGCGAAGCTTCAAGAAGATAACTCTGAGAGCAATCTTTCGATGGAGGAA
GAGATGGTGGTGCCGGCCGATTCTGAGAATTCTCTGATTGAACAAGTAAAGCCGGAAATTGCCGATCAGTTCACTGTTCCTCCGGCGATTGAATCCCAAGGCTTCAATTA
CGAGAGCTTCGACAACAATGGCGGAGAAGGGGAAGAGGCGCCGACGGAAAAGGCGTCATTGTTCCGCGATTTCAAAGATGGGTCATCCGATAGCGATTCGAGCGCAATTT
TGAACGAAGACTACAGCCCCACGGCGGCCATTTTGAACCACCACCATCACCACTTCATGACAGCGGCATCTCCGTCTCCGTCCGCCGCCGTGAAACTGAACTGCGTAACG
ACGACGCTGAGTTACTTGCAGTATCAGAAGGGGTATCGACAAACCCAGATGTTTCCGAAAATGGAGGAGCATAATTTCTTCAGCGGAGAGGAGGCTTGTAACTTCTTCTC
CGATGAGCAAGCTCCGACTCTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGGTGCCGGCCGATTCTGATAATTCTCTGATTGAACAAGTAAAGCCGGAAATTGCTGATCAGTTCTCTGTTCCTCCGGCGATTGAATCCCAAGACTTCAATTACGA
GAGCTTCGACAACAATGGCGGAGAAGGGGAAGAGGCGCCGACGGAAGAGGCGACATTGTTCCGCGATTTCAAAGATGGGTCATCCAATAGCGATTCGAGCGCAATTTTGA
ACGAAGACTACTGCCCCACGGCGGCCATTCTCCGATTGAACAATGAATCCCAAGACTTCAATTACGAGAGTTTCAACAACAATGGCGGAGAAGGGGAAGAGGCGCCGACG
GAAGAGGCGACATTGTTCCGCGATTTCAAAGATGGGTCATCCGATAGCGATTCGAGCGCAATTATGAACGAAGATTACAGCCCCACGGCGGCCATTTATTCACCGGGGGT
GCTGCACAATCTCAAACTCAGTTATGAAACTCTCCATCAGGACAATCAAGCTCTTCTCAAAGAGGTACAATTACAGATCACACGAACCCAGTTTTTTCTTTTCACGAACA
TCGGGAATGCGATGATATTGATGGAGACTGTAAAATTGTGTTTGCATATTCGGGAACTGAAAGCGAAGCTTCAAGAAGATAACTCTGAGAGCAATCTTTCGATGGAGGAA
GAGATGGTGGTGCCGGCCGATTCTGAGAATTCTCTGATTGAACAAGTAAAGCCGGAAATTGCCGATCAGTTCACTGTTCCTCCGGCGATTGAATCCCAAGGCTTCAATTA
CGAGAGCTTCGACAACAATGGCGGAGAAGGGGAAGAGGCGCCGACGGAAAAGGCGTCATTGTTCCGCGATTTCAAAGATGGGTCATCCGATAGCGATTCGAGCGCAATTT
TGAACGAAGACTACAGCCCCACGGCGGCCATTTTGAACCACCACCATCACCACTTCATGACAGCGGCATCTCCGTCTCCGTCCGCCGCCGTGAAACTGAACTGCGTAACG
ACGACGCTGAGTTACTTGCAGTATCAGAAGGGGTATCGACAAACCCAGATGTTTCCGAAAATGGAGGAGCATAATTTCTTCAGCGGAGAGGAGGCTTGTAACTTCTTCTC
CGATGAGCAAGCTCCGACTCTGTAG
Protein sequenceShow/hide protein sequence
MVVPADSDNSLIEQVKPEIADQFSVPPAIESQDFNYESFDNNGGEGEEAPTEEATLFRDFKDGSSNSDSSAILNEDYCPTAAILRLNNESQDFNYESFNNNGGEGEEAPT
EEATLFRDFKDGSSDSDSSAIMNEDYSPTAAIYSPGVLHNLKLSYETLHQDNQALLKEVQLQITRTQFFLFTNIGNAMILMETVKLCLHIRELKAKLQEDNSESNLSMEE
EMVVPADSENSLIEQVKPEIADQFTVPPAIESQGFNYESFDNNGGEGEEAPTEKASLFRDFKDGSSDSDSSAILNEDYSPTAAILNHHHHHFMTAASPSPSAAVKLNCVT
TTLSYLQYQKGYRQTQMFPKMEEHNFFSGEEACNFFSDEQAPTL