; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg025264 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg025264
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUPF0587 protein C1orf123 homolog
Genome locationscaffold13:40660963..40663601
RNA-Seq ExpressionSpg025264
SyntenySpg025264
Gene Ontology termsGO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR008584 - CXXC motif containing zinc binding protein, eukaryotic


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585732.1 CXXC motif containing zinc binding protein, partial [Cucurbita argyrosperma subsp. sororia]9.6e-9195.18Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFVFGPGWKVESIEGTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK
        SPLMLFDCRGYEPVGF+FGPGWK ESIEGTKFED+DLSGGE+AEYDEKGECPVMISNL+ATFE VK
Subjt:  SPLMLFDCRGYEPVGFVFGPGWKVESIEGTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK

XP_004142917.1 CXXC motif containing zinc binding protein [Cucumis sativus]2.1e-8589.16Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGEVSQKETCVTL ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFVFGPGWKVESIEGTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK
        SPLMLFDCRGY+P+GF+FGPGWKVESIEGTKFED+DL+GGE+AEYDEKGECPVMIS+L+A FE++K
Subjt:  SPLMLFDCRGYEPVGFVFGPGWKVESIEGTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK

XP_008444495.1 PREDICTED: UPF0587 protein C1orf123 homolog [Cucumis melo]9.9e-8892.17Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTL+ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFVFGPGWKVESIEGTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK
        SPLMLFDCRGYEP+GFVFGPGWKVESIEGTKFED+DL+GGEFAEYDEKGECPVMISNL A FE++K
Subjt:  SPLMLFDCRGYEPVGFVFGPGWKVESIEGTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK

XP_022952030.1 UPF0587 protein C1orf123 homolog [Cucurbita moschata]9.6e-9195.18Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFVFGPGWKVESIEGTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK
        SPLMLFDCRGYEPVGF+FGPGWK ESIEGTKFED+DLSGGE+AEYDEKGECPVMISNL+ATFE VK
Subjt:  SPLMLFDCRGYEPVGFVFGPGWKVESIEGTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK

XP_038884333.1 CXXC motif containing zinc binding protein isoform X1 [Benincasa hispida]1.2e-8893.98Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRGQPLTQETSE GKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFVFGPGWKVESIEGTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK
        SPLMLFDCRGYEP+ FVFGPGWK ESIEGTKFED+DLS GEFAEYDEKGECPVMIS L+ATFE+VK
Subjt:  SPLMLFDCRGYEPVGFVFGPGWKVESIEGTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK

TrEMBL top hitse value%identityAlignment
A0A0A0LN37 Uncharacterized protein1.0e-8589.16Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGEVSQKETCVTL ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFVFGPGWKVESIEGTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK
        SPLMLFDCRGY+P+GF+FGPGWKVESIEGTKFED+DL+GGE+AEYDEKGECPVMIS+L+A FE++K
Subjt:  SPLMLFDCRGYEPVGFVFGPGWKVESIEGTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK

A0A1S3BAF1 UPF0587 protein C1orf123 homolog4.8e-8892.17Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTL+ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFVFGPGWKVESIEGTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK
        SPLMLFDCRGYEP+GFVFGPGWKVESIEGTKFED+DL+GGEFAEYDEKGECPVMISNL A FE++K
Subjt:  SPLMLFDCRGYEPVGFVFGPGWKVESIEGTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK

A0A5A7V299 UPF0587 protein C1orf123-like protein4.8e-8892.17Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTL+ETIPLQAGKGTTNLVQKCKFCGR+GTITMIPGRG+PLTQE SESG  
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFVFGPGWKVESIEGTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK
        SPLMLFDCRGYEP+GFVFGPGWKVESIEGTKFED+DL+GGEFAEYDEKGECPVMISNL A FE++K
Subjt:  SPLMLFDCRGYEPVGFVFGPGWKVESIEGTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK

A0A6J1GKL1 UPF0587 protein C1orf123 homolog4.7e-9195.18Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFVFGPGWKVESIEGTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK
        SPLMLFDCRGYEPVGF+FGPGWK ESIEGTKFED+DLSGGE+AEYDEKGECPVMISNL+ATFE VK
Subjt:  SPLMLFDCRGYEPVGFVFGPGWKVESIEGTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK

A0A6J1KKK0 UPF0587 protein C1orf123 homolog4.7e-9195.18Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVNFLLKIKAELENLTNLQPQDGCDDPNF YLFKVKCGRCGE+SQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFVFGPGWKVESIEGTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK
        SPLMLFDCRGYEPVGF+FGPGWK ESIEGTKFED+DLSGGE+AEYDEKGECPVMISNL+ATFE VK
Subjt:  SPLMLFDCRGYEPVGFVFGPGWKVESIEGTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK

SwissProt top hitse value%identityAlignment
Q32P66 CXXC motif containing zinc binding protein3.3e-2537.04Show/hide
Query:  LKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKSSPLML
        L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L +++ L+ G+G+ ++VQKCK C R+ +I ++    +    E +E  K+  ++ 
Subjt:  LKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKSSPLML

Query:  FDCRGYEPVGFVFGPGWKVESIE-GTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK
        F+CRG EPV F    G+  E +E GT F D++L   ++ +YDEK +  V I   + T + VK
Subjt:  FDCRGYEPVGFVFGPGWKVESIE-GTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK

Q3B8G0 CXXC motif containing zinc binding protein3.2e-2838.65Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MV F L+ KA LENLT L+P       +F +  K+KCG CGEVS K   +TL +++PL+ G+G+ ++VQ+CK C R+ +I ++     P   E SE+ K+
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFVFGPGWKVESIE-GTKFEDVDLSGGEFAEYDEKGECPVMISNLQATF
          ++ F+CRG EP+ F    G+  E  E GT F +++L   ++ +YDEK +  V I  ++  F
Subjt:  SPLMLFDCRGYEPVGFVFGPGWKVESIE-GTKFEDVDLSGGEFAEYDEKGECPVMISNLQATF

Q498R7 CXXC motif containing zinc binding protein3.3e-2536.53Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L +++ L+ G+G+ ++VQKCK C R+ +I ++    +    E +E  K+
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFVFGPGWKVESIE-GTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK
          ++ F+CRG EPV F    G+  E +E GT F D++L   ++ +YDEK +  V I   + T + VK
Subjt:  SPLMLFDCRGYEPVGFVFGPGWKVESIE-GTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK

Q8BHG2 CXXC motif containing zinc binding protein4.3e-2535.93Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S+K   + L +++ L+ G+G+ ++VQKCK C R+ +I ++    +    E +E  K+
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFVFGPGWKVESIE-GTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK
          ++ F+CRG EPV F    G+  + +E GT F D++L   ++ +YDEK +  V I   + T + VK
Subjt:  SPLMLFDCRGYEPVGFVFGPGWKVESIE-GTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK

Q9NWV4 CXXC motif containing zinc binding protein2.3e-2637.13Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        M    L++KA LEN+TNL+P       +F +  K+KCG CGE+S K   + L +++ L+ G+G+ ++VQKCK C R+ +I ++    +P   E +E+ K+
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFVFGPGWKVESIE-GTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK
          ++ F+CRG EPV F    G+  E +E GT F D++L   ++ +YDEK +  V I   + T + VK
Subjt:  SPLMLFDCRGYEPVGFVFGPGWKVESIE-GTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK

Arabidopsis top hitse value%identityAlignment
AT4G32930.1 unknown protein1.6e-6767.66Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS
        MVN++LKI A+LENLTNLQP  GCDD NF YLFK+KC RCGEV+ KETCVTLNET     G+GT +LVQKCKFCGR+G +TMIPG+G+PLT E SE+G+ 
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKS

Query:  SPLMLFDCRGYEPVGFVFGPGWKVESIEGTKFEDVDLSGG-EFAEYDEKGECPVMISNLQATFEMVK
        +PLM+FDCRGYEP+ F FG  WK ++  GTKF+++DLS G EF EYDEKGECPVMISN +A+F + K
Subjt:  SPLMLFDCRGYEPVGFVFGPGWKVESIEGTKFEDVDLSGG-EFAEYDEKGECPVMISNLQATFEMVK

AT4G32930.2 unknown protein2.5e-6564.57Show/hide
Query:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQK--------CKFCGRDGTITMIPGRGQPLTQ
        MVN++LKI A+LENLTNLQP  GCDD NF YLFK+KC RCGEV+ KETCVTLNET     G+GT +LVQK        CKFCGR+G +TMIPG+G+PLT 
Subjt:  MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQK--------CKFCGRDGTITMIPGRGQPLTQ

Query:  ETSESGKSSPLMLFDCRGYEPVGFVFGPGWKVESIEGTKFEDVDLSGG-EFAEYDEKGECPVMISNLQATFEMVK
        E SE+G+ +PLM+FDCRGYEP+ F FG  WK ++  GTKF+++DLS G EF EYDEKGECPVMISN +A+F + K
Subjt:  ETSESGKSSPLMLFDCRGYEPVGFVFGPGWKVESIEGTKFEDVDLSGG-EFAEYDEKGECPVMISNLQATFEMVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGAACTTCTTGCTTAAGATCAAAGCGGAGCTCGAGAACCTCACGAATCTTCAGCCCCAAGATGGTTGCGACGACCCCAACTTCACTTACCTTTTCAAAGTGAAATG
CGGGAGATGCGGGGAGGTGAGCCAGAAAGAAACGTGTGTGACCTTGAATGAAACTATTCCTCTCCAAGCGGGTAAAGGGACTACTAATCTCGTTCAAAAGTGCAAGTTTT
GTGGGAGGGATGGAACAATTACAATGATTCCCGGGCGAGGCCAACCATTGACTCAGGAAACGAGTGAATCAGGCAAGTCATCTCCCTTGATGTTATTTGACTGCAGAGGT
TATGAGCCTGTGGGCTTCGTATTTGGACCTGGATGGAAAGTAGAATCTATAGAGGGGACTAAATTTGAGGATGTTGACTTGTCTGGAGGTGAGTTTGCAGAGTATGATGA
GAAGGGAGAGTGCCCTGTAATGATTTCCAATCTACAGGCCACATTTGAAATGGTAAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGAACTTCTTGCTTAAGATCAAAGCGGAGCTCGAGAACCTCACGAATCTTCAGCCCCAAGATGGTTGCGACGACCCCAACTTCACTTACCTTTTCAAAGTGAAATG
CGGGAGATGCGGGGAGGTGAGCCAGAAAGAAACGTGTGTGACCTTGAATGAAACTATTCCTCTCCAAGCGGGTAAAGGGACTACTAATCTCGTTCAAAAGTGCAAGTTTT
GTGGGAGGGATGGAACAATTACAATGATTCCCGGGCGAGGCCAACCATTGACTCAGGAAACGAGTGAATCAGGCAAGTCATCTCCCTTGATGTTATTTGACTGCAGAGGT
TATGAGCCTGTGGGCTTCGTATTTGGACCTGGATGGAAAGTAGAATCTATAGAGGGGACTAAATTTGAGGATGTTGACTTGTCTGGAGGTGAGTTTGCAGAGTATGATGA
GAAGGGAGAGTGCCCTGTAATGATTTCCAATCTACAGGCCACATTTGAAATGGTAAAGTAG
Protein sequenceShow/hide protein sequence
MVNFLLKIKAELENLTNLQPQDGCDDPNFTYLFKVKCGRCGEVSQKETCVTLNETIPLQAGKGTTNLVQKCKFCGRDGTITMIPGRGQPLTQETSESGKSSPLMLFDCRG
YEPVGFVFGPGWKVESIEGTKFEDVDLSGGEFAEYDEKGECPVMISNLQATFEMVK