; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr018577 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr018577
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionChromo domain-containing protein
Genome locationtig00153206:449344..450156
RNA-Seq ExpressionSgr018577
SyntenySgr018577
Gene Ontology termsGO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8544487.1 hypothetical protein F0562_022473 [Nyssa sinensis]2.7e-6651.25Show/hide
Query:  MDQRVEQIEKAIETLSNGQKEIMVHFSEILGQIQNQIQHLSHQGILEVGENSAAPAHR--------ISARQDPRTLGDLLPKTVKIDFPRFDGREDPTSW
        MDQRVE++E  + +L+ GQ++I+   +E+  ++ +Q    S +   E GENS AP           I  RQ+  +     PK VK+DFPRF+G ED TSW
Subjt:  MDQRVEQIEKAIETLSNGQKEIMVHFSEILGQIQNQIQHLSHQGILEVGENSAAPAHR--------ISARQDPRTLGDLLPKTVKIDFPRFDGREDPTSW

Query:  ICRAEQYFEIHGIPVGDRVTLATFHLEGDAQLWFQLLKQEANHVSWEEFKESLLNRYGPNQFLDFFGELTKLQQKVTVVEYQTTFEKLLGKVRHLPQNHQ
        +CR EQ+F+ H  P  +RV LA+FHLEGDAQLW+QLLKQ+   ++W+EF + L  RYGP+QF DFFGEL KLQQ  +V +YQT FEKLL K  HLPQN Q
Subjt:  ICRAEQYFEIHGIPVGDRVTLATFHLEGDAQLWFQLLKQEANHVSWEEFKESLLNRYGPNQFLDFFGELTKLQQKVTVVEYQTTFEKLLGKVRHLPQNHQ

Query:  VSCFISGLKDSIRVDVQASRPTTLTIAISLARLYEARDASSRRTIN--PVQKTWVATRPIPTNNGTSATQIKKMTTEEL
        VSCFISGLKD+IR DV + RPTTL+ AI LARLYEAR+AS RRT +   ++K ++  +   T+N T    +++M+T EL
Subjt:  VSCFISGLKDSIRVDVQASRPTTLTIAISLARLYEARDASSRRTIN--PVQKTWVATRPIPTNNGTSATQIKKMTTEEL

XP_038985806.1 uncharacterized protein LOC103721475 isoform X1 [Phoenix dactylifera]9.8e-7756.57Show/hide
Query:  MDQRVEQIEKAIETLSNGQKEIMVHFSEILGQIQNQIQHLSHQGILEVGENSAAPAHRIS----ARQDPRTLGDLLPKTVKIDFPRFDGREDPTSWICRA
        MDQRVE++EK++E+LS GQ+EI     E+ G +  ++  L  Q   EV ENS       S      Q   ++   LPKTV++DFP F+G EDPTSW+CRA
Subjt:  MDQRVEQIEKAIETLSNGQKEIMVHFSEILGQIQNQIQHLSHQGILEVGENSAAPAHRIS----ARQDPRTLGDLLPKTVKIDFPRFDGREDPTSWICRA

Query:  EQYFEIHGIPVGDRVTLATFHLEGDAQLWFQLLKQEANHVSWEEFKESLLNRYGPNQFLDFFGELTKLQQKVTVVEYQTTFEKLLGKVRHLPQNHQVSCF
        EQ+F+IH IP  DRV+LA+FHLEG+AQLW+QLLKQE   ++WE+FKE L +RYGPNQF DFFGELTKL+Q  T+ +YQT+FEKLL K   +PQN QVSCF
Subjt:  EQYFEIHGIPVGDRVTLATFHLEGDAQLWFQLLKQEANHVSWEEFKESLLNRYGPNQFLDFFGELTKLQQKVTVVEYQTTFEKLLGKVRHLPQNHQVSCF

Query:  ISGLKDSIRVDVQASRPTTLTIAISLARLYEARDASSRRTINPVQKTWVATRPIPTNNGTSATQIKKMTTEELN
        +SGL D+IR DVQA+RPTTL+ AI LARLYEARD S R+  + V K    +R I     +S+  +KKMTTEELN
Subjt:  ISGLKDSIRVDVQASRPTTLTIAISLARLYEARDASSRRTINPVQKTWVATRPIPTNNGTSATQIKKMTTEELN

XP_038985807.1 uncharacterized protein LOC103721475 isoform X2 [Phoenix dactylifera]9.8e-7756.57Show/hide
Query:  MDQRVEQIEKAIETLSNGQKEIMVHFSEILGQIQNQIQHLSHQGILEVGENSAAPAHRIS----ARQDPRTLGDLLPKTVKIDFPRFDGREDPTSWICRA
        MDQRVE++EK++E+LS GQ+EI     E+ G +  ++  L  Q   EV ENS       S      Q   ++   LPKTV++DFP F+G EDPTSW+CRA
Subjt:  MDQRVEQIEKAIETLSNGQKEIMVHFSEILGQIQNQIQHLSHQGILEVGENSAAPAHRIS----ARQDPRTLGDLLPKTVKIDFPRFDGREDPTSWICRA

Query:  EQYFEIHGIPVGDRVTLATFHLEGDAQLWFQLLKQEANHVSWEEFKESLLNRYGPNQFLDFFGELTKLQQKVTVVEYQTTFEKLLGKVRHLPQNHQVSCF
        EQ+F+IH IP  DRV+LA+FHLEG+AQLW+QLLKQE   ++WE+FKE L +RYGPNQF DFFGELTKL+Q  T+ +YQT+FEKLL K   +PQN QVSCF
Subjt:  EQYFEIHGIPVGDRVTLATFHLEGDAQLWFQLLKQEANHVSWEEFKESLLNRYGPNQFLDFFGELTKLQQKVTVVEYQTTFEKLLGKVRHLPQNHQVSCF

Query:  ISGLKDSIRVDVQASRPTTLTIAISLARLYEARDASSRRTINPVQKTWVATRPIPTNNGTSATQIKKMTTEELN
        +SGL D+IR DVQA+RPTTL+ AI LARLYEARD S R+  + V K    +R I     +S+  +KKMTTEELN
Subjt:  ISGLKDSIRVDVQASRPTTLTIAISLARLYEARDASSRRTINPVQKTWVATRPIPTNNGTSATQIKKMTTEELN

XP_038985808.1 uncharacterized protein LOC103721475 isoform X3 [Phoenix dactylifera]9.8e-7756.57Show/hide
Query:  MDQRVEQIEKAIETLSNGQKEIMVHFSEILGQIQNQIQHLSHQGILEVGENSAAPAHRIS----ARQDPRTLGDLLPKTVKIDFPRFDGREDPTSWICRA
        MDQRVE++EK++E+LS GQ+EI     E+ G +  ++  L  Q   EV ENS       S      Q   ++   LPKTV++DFP F+G EDPTSW+CRA
Subjt:  MDQRVEQIEKAIETLSNGQKEIMVHFSEILGQIQNQIQHLSHQGILEVGENSAAPAHRIS----ARQDPRTLGDLLPKTVKIDFPRFDGREDPTSWICRA

Query:  EQYFEIHGIPVGDRVTLATFHLEGDAQLWFQLLKQEANHVSWEEFKESLLNRYGPNQFLDFFGELTKLQQKVTVVEYQTTFEKLLGKVRHLPQNHQVSCF
        EQ+F+IH IP  DRV+LA+FHLEG+AQLW+QLLKQE   ++WE+FKE L +RYGPNQF DFFGELTKL+Q  T+ +YQT+FEKLL K   +PQN QVSCF
Subjt:  EQYFEIHGIPVGDRVTLATFHLEGDAQLWFQLLKQEANHVSWEEFKESLLNRYGPNQFLDFFGELTKLQQKVTVVEYQTTFEKLLGKVRHLPQNHQVSCF

Query:  ISGLKDSIRVDVQASRPTTLTIAISLARLYEARDASSRRTINPVQKTWVATRPIPTNNGTSATQIKKMTTEELN
        +SGL D+IR DVQA+RPTTL+ AI LARLYEARD S R+  + V K    +R I     +S+  +KKMTTEELN
Subjt:  ISGLKDSIRVDVQASRPTTLTIAISLARLYEARDASSRRTINPVQKTWVATRPIPTNNGTSATQIKKMTTEELN

XP_038985809.1 uncharacterized protein LOC103721475 isoform X4 [Phoenix dactylifera]9.8e-7756.57Show/hide
Query:  MDQRVEQIEKAIETLSNGQKEIMVHFSEILGQIQNQIQHLSHQGILEVGENSAAPAHRIS----ARQDPRTLGDLLPKTVKIDFPRFDGREDPTSWICRA
        MDQRVE++EK++E+LS GQ+EI     E+ G +  ++  L  Q   EV ENS       S      Q   ++   LPKTV++DFP F+G EDPTSW+CRA
Subjt:  MDQRVEQIEKAIETLSNGQKEIMVHFSEILGQIQNQIQHLSHQGILEVGENSAAPAHRIS----ARQDPRTLGDLLPKTVKIDFPRFDGREDPTSWICRA

Query:  EQYFEIHGIPVGDRVTLATFHLEGDAQLWFQLLKQEANHVSWEEFKESLLNRYGPNQFLDFFGELTKLQQKVTVVEYQTTFEKLLGKVRHLPQNHQVSCF
        EQ+F+IH IP  DRV+LA+FHLEG+AQLW+QLLKQE   ++WE+FKE L +RYGPNQF DFFGELTKL+Q  T+ +YQT+FEKLL K   +PQN QVSCF
Subjt:  EQYFEIHGIPVGDRVTLATFHLEGDAQLWFQLLKQEANHVSWEEFKESLLNRYGPNQFLDFFGELTKLQQKVTVVEYQTTFEKLLGKVRHLPQNHQVSCF

Query:  ISGLKDSIRVDVQASRPTTLTIAISLARLYEARDASSRRTINPVQKTWVATRPIPTNNGTSATQIKKMTTEELN
        +SGL D+IR DVQA+RPTTL+ AI LARLYEARD S R+  + V K    +R I     +S+  +KKMTTEELN
Subjt:  ISGLKDSIRVDVQASRPTTLTIAISLARLYEARDASSRRTINPVQKTWVATRPIPTNNGTSATQIKKMTTEELN

TrEMBL top hitse value%identityAlignment
A0A5B7BP57 Reverse transcriptase domain-containing protein (Fragment)4.5e-6750.9Show/hide
Query:  MDQRVEQIEKAIETLSNGQKEIMVHFSEILGQIQNQIQHLSHQGILEVGENSAAPAHRISAR-------QDPRTLGDLLPKTVKIDFPRFDGREDPTSWI
        MD RVE++E+ + +L  GQ++I+   +E+  ++    +  + Q + EVGENS   +     R       Q   +     PK VK+DFPRF+G EDPTSW+
Subjt:  MDQRVEQIEKAIETLSNGQKEIMVHFSEILGQIQNQIQHLSHQGILEVGENSAAPAHRISAR-------QDPRTLGDLLPKTVKIDFPRFDGREDPTSWI

Query:  CRAEQYFEIHGIPVGDRVTLATFHLEGDAQLWFQLLKQEANHVSWEEFKESLLNRYGPNQFLDFFGELTKLQQKVTVVEYQTTFEKLLGKVRHLPQNHQV
        CRA+Q+F+ H  P G+RV LA+FHLEGDAQLW+QLLKQE   +SWE F+E L +RYGP QF DFFGELTKLQQ  +V EYQT FEKLL K   L Q  QV
Subjt:  CRAEQYFEIHGIPVGDRVTLATFHLEGDAQLWFQLLKQEANHVSWEEFKESLLNRYGPNQFLDFFGELTKLQQKVTVVEYQTTFEKLLGKVRHLPQNHQV

Query:  SCFISGLKDSIRVDVQASRPTTLTIAISLARLYEARDASSRRTIN-PVQKTWVATRPIPTNNGTSATQIKKMTTEEL
        SCF+SGLK+SI+ DV A RPT+L+ AISLARLYEAR+ S RR IN  V+K   +++ +  N  T    ++KM+  E+
Subjt:  SCFISGLKDSIRVDVQASRPTTLTIAISLARLYEARDASSRRTIN-PVQKTWVATRPIPTNNGTSATQIKKMTTEEL

A0A5J4ZIY9 Integrase catalytic domain-containing protein2.5e-6550.9Show/hide
Query:  MDQRVEQIEKAIETLSNGQKEIMVHFSEILGQIQNQIQHLSHQGILEVGENS----AAPAHRISARQDPRTLG---DLLPKTVKIDFPRFDGREDPTSWI
        MD RVE++E+ + +L  GQ++++   +E+  ++       + Q + E  E+S     A   R +     +  G      PK VK+DFPRF+G EDPTSW+
Subjt:  MDQRVEQIEKAIETLSNGQKEIMVHFSEILGQIQNQIQHLSHQGILEVGENS----AAPAHRISARQDPRTLG---DLLPKTVKIDFPRFDGREDPTSWI

Query:  CRAEQYFEIHGIPVGDRVTLATFHLEGDAQLWFQLLKQEANHVSWEEFKESLLNRYGPNQFLDFFGELTKLQQKVTVVEYQTTFEKLLGKVRHLPQNHQV
        CRA+Q+F+ H  P G+RV LA+FHLEGDAQLW+QLLKQE   +SWE FKE L +RYGP QF DFFGELTKLQQ  +V EYQT FEKLL K   L Q  QV
Subjt:  CRAEQYFEIHGIPVGDRVTLATFHLEGDAQLWFQLLKQEANHVSWEEFKESLLNRYGPNQFLDFFGELTKLQQKVTVVEYQTTFEKLLGKVRHLPQNHQV

Query:  SCFISGLKDSIRVDVQASRPTTLTIAISLARLYEARDASSRRTIN-PVQKTWVATRPIPTNNGTSATQIKKMTTEEL
        SCF+SGLK+SI+VDV A RP +L+ AISLARLYEAR+ S RR IN  V+K     R   TN  T    ++KM+  E+
Subjt:  SCFISGLKDSIRVDVQASRPTTLTIAISLARLYEARDASSRRTIN-PVQKTWVATRPIPTNNGTSATQIKKMTTEEL

A0A5J5A901 Chromo domain-containing protein6.5e-6650Show/hide
Query:  MDQRVEQIEKAIETLSNGQKEIMVHFSEILGQIQNQIQHLSHQGILEVGENSAAPAHRISAR-------QDPRTLGDLLPKTVKIDFPRFDGREDPTSWI
        MDQRVE++E  + +L+ GQ++I+   +E+  ++ +Q   +      E GENS AP      R       +   +     PK  K+DFPRF+G ED TSW+
Subjt:  MDQRVEQIEKAIETLSNGQKEIMVHFSEILGQIQNQIQHLSHQGILEVGENSAAPAHRISAR-------QDPRTLGDLLPKTVKIDFPRFDGREDPTSWI

Query:  CRAEQYFEIHGIPVGDRVTLATFHLEGDAQLWFQLLKQEANHVSWEEFKESLLNRYGPNQFLDFFGELTKLQQKVTVVEYQTTFEKLLGKVRHLPQNHQV
        CR EQ+F+ H  P  +RV LA+FHLEGDAQLW+QLLKQ+   ++W+EF + L  RYGP+QF DFFGEL KLQQ  +V +YQT FEKLL K  HLPQN QV
Subjt:  CRAEQYFEIHGIPVGDRVTLATFHLEGDAQLWFQLLKQEANHVSWEEFKESLLNRYGPNQFLDFFGELTKLQQKVTVVEYQTTFEKLLGKVRHLPQNHQV

Query:  SCFISGLKDSIRVDVQASRPTTLTIAISLARLYEARDASSRRTIN--PVQKTWVATRPIPTNNGTSATQIKKMTTEEL
        SCFISGLKD+I+ DV + RPTTL+ AI LARLYEAR+AS RRTI+   ++K ++  +   T+N T    +++M+T EL
Subjt:  SCFISGLKDSIRVDVQASRPTTLTIAISLARLYEARDASSRRTIN--PVQKTWVATRPIPTNNGTSATQIKKMTTEEL

A0A5J5BRX2 Chromo domain-containing protein1.3e-6651.25Show/hide
Query:  MDQRVEQIEKAIETLSNGQKEIMVHFSEILGQIQNQIQHLSHQGILEVGENSAAPAHR--------ISARQDPRTLGDLLPKTVKIDFPRFDGREDPTSW
        MDQRVE++E  + +L+ GQ++I+   +E+  ++ +Q    S +   E GENS AP           I  RQ+  +     PK VK+DFPRF+G ED TSW
Subjt:  MDQRVEQIEKAIETLSNGQKEIMVHFSEILGQIQNQIQHLSHQGILEVGENSAAPAHR--------ISARQDPRTLGDLLPKTVKIDFPRFDGREDPTSW

Query:  ICRAEQYFEIHGIPVGDRVTLATFHLEGDAQLWFQLLKQEANHVSWEEFKESLLNRYGPNQFLDFFGELTKLQQKVTVVEYQTTFEKLLGKVRHLPQNHQ
        +CR EQ+F+ H  P  +RV LA+FHLEGDAQLW+QLLKQ+   ++W+EF + L  RYGP+QF DFFGEL KLQQ  +V +YQT FEKLL K  HLPQN Q
Subjt:  ICRAEQYFEIHGIPVGDRVTLATFHLEGDAQLWFQLLKQEANHVSWEEFKESLLNRYGPNQFLDFFGELTKLQQKVTVVEYQTTFEKLLGKVRHLPQNHQ

Query:  VSCFISGLKDSIRVDVQASRPTTLTIAISLARLYEARDASSRRTIN--PVQKTWVATRPIPTNNGTSATQIKKMTTEEL
        VSCFISGLKD+IR DV + RPTTL+ AI LARLYEAR+AS RRT +   ++K ++  +   T+N T    +++M+T EL
Subjt:  VSCFISGLKDSIRVDVQASRPTTLTIAISLARLYEARDASSRRTIN--PVQKTWVATRPIPTNNGTSATQIKKMTTEEL

A0A5J5C5K3 Uncharacterized protein7.1e-6550.54Show/hide
Query:  MDQRVEQIEKAIETLSNGQKEIMVHFSEILGQIQNQIQHLSHQGILEVGENS----AAPAHRISARQDPRTLG---DLLPKTVKIDFPRFDGREDPTSWI
        MD RVE++E+ + +L  GQ++++   +E+  ++       + Q + E  E+S     A   R +     +  G      PK VK+DFPRF+G EDPTSW+
Subjt:  MDQRVEQIEKAIETLSNGQKEIMVHFSEILGQIQNQIQHLSHQGILEVGENS----AAPAHRISARQDPRTLG---DLLPKTVKIDFPRFDGREDPTSWI

Query:  CRAEQYFEIHGIPVGDRVTLATFHLEGDAQLWFQLLKQEANHVSWEEFKESLLNRYGPNQFLDFFGELTKLQQKVTVVEYQTTFEKLLGKVRHLPQNHQV
        CRA+Q+F+ H  P G+RV LA+FHLEGDAQLW+QLLKQE   +SWE FKE L +RYGP QF DFFGELTKLQQ  +V EYQT FEKLL K   L Q  QV
Subjt:  CRAEQYFEIHGIPVGDRVTLATFHLEGDAQLWFQLLKQEANHVSWEEFKESLLNRYGPNQFLDFFGELTKLQQKVTVVEYQTTFEKLLGKVRHLPQNHQV

Query:  SCFISGLKDSIRVDVQASRPTTLTIAISLARLYEARDASSRRTIN-PVQKTWVATRPIPTNNGTSATQIKKMTTEEL
        SCF+SGLK+SI+ DV A RP +L+ AISLARLYEAR+ S RR IN  V+K     R   TN  T    ++KM+  E+
Subjt:  SCFISGLKDSIRVDVQASRPTTLTIAISLARLYEARDASSRRTIN-PVQKTWVATRPIPTNNGTSATQIKKMTTEEL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G44713.1 unknown protein7.4e-0629.55Show/hide
Query:  FPRFDG-REDPTSWICRAEQYFEIHGIPVGDRVTLATFHLEGDAQLWFQLLKQEANHVSWEEFKESLLNRYGPNQFLDFFGELTKLQQ
        FP F+G   +  SWI   E +F        +++ LA   +EG+A+ WF   ++     SWE  ++ L+ R+G  + L+    L K  Q
Subjt:  FPRFDG-REDPTSWICRAEQYFEIHGIPVGDRVTLATFHLEGDAQLWFQLLKQEANHVSWEEFKESLLNRYGPNQFLDFFGELTKLQQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCAGAGGGTAGAACAAATAGAGAAGGCTATTGAAACCTTATCCAATGGCCAAAAGGAGATCATGGTGCACTTTTCAGAGATATTGGGTCAAATTCAAAATCAAAT
ACAACATCTCTCCCACCAAGGCATTCTAGAGGTAGGAGAAAATTCAGCTGCACCAGCTCATAGAATATCTGCAAGGCAAGATCCCAGGACCTTAGGTGACTTACTTCCAA
AGACAGTCAAGATAGATTTTCCAAGGTTTGACGGGAGAGAAGACCCCACTAGTTGGATATGCCGAGCAGAACAATATTTTGAAATCCACGGCATACCAGTGGGTGATCGA
GTCACCCTCGCAACATTTCATTTGGAAGGAGATGCTCAATTATGGTTCCAATTGTTGAAGCAGGAAGCAAACCATGTGTCTTGGGAAGAATTCAAGGAAAGCTTGCTAAA
TAGGTACGGACCTAATCAATTTTTGGATTTCTTTGGTGAATTAACCAAATTACAGCAAAAGGTGACGGTTGTAGAATACCAAACAACCTTTGAGAAATTGCTGGGCAAGG
TTAGACATCTTCCTCAAAACCACCAAGTTAGTTGTTTCATCAGTGGGTTGAAGGACTCCATTCGAGTCGACGTGCAAGCTAGCCGCCCAACAACTTTAACCATTGCCATA
AGTTTGGCTAGACTATATGAAGCTAGAGATGCTTCCAGTAGAAGAACAATCAACCCAGTCCAGAAGACTTGGGTGGCCACCCGACCAATTCCCACAAATAATGGTACTTC
AGCTACACAAATTAAGAAGATGACTACTGAAGAGCTTAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATCAGAGGGTAGAACAAATAGAGAAGGCTATTGAAACCTTATCCAATGGCCAAAAGGAGATCATGGTGCACTTTTCAGAGATATTGGGTCAAATTCAAAATCAAAT
ACAACATCTCTCCCACCAAGGCATTCTAGAGGTAGGAGAAAATTCAGCTGCACCAGCTCATAGAATATCTGCAAGGCAAGATCCCAGGACCTTAGGTGACTTACTTCCAA
AGACAGTCAAGATAGATTTTCCAAGGTTTGACGGGAGAGAAGACCCCACTAGTTGGATATGCCGAGCAGAACAATATTTTGAAATCCACGGCATACCAGTGGGTGATCGA
GTCACCCTCGCAACATTTCATTTGGAAGGAGATGCTCAATTATGGTTCCAATTGTTGAAGCAGGAAGCAAACCATGTGTCTTGGGAAGAATTCAAGGAAAGCTTGCTAAA
TAGGTACGGACCTAATCAATTTTTGGATTTCTTTGGTGAATTAACCAAATTACAGCAAAAGGTGACGGTTGTAGAATACCAAACAACCTTTGAGAAATTGCTGGGCAAGG
TTAGACATCTTCCTCAAAACCACCAAGTTAGTTGTTTCATCAGTGGGTTGAAGGACTCCATTCGAGTCGACGTGCAAGCTAGCCGCCCAACAACTTTAACCATTGCCATA
AGTTTGGCTAGACTATATGAAGCTAGAGATGCTTCCAGTAGAAGAACAATCAACCCAGTCCAGAAGACTTGGGTGGCCACCCGACCAATTCCCACAAATAATGGTACTTC
AGCTACACAAATTAAGAAGATGACTACTGAAGAGCTTAACTAG
Protein sequenceShow/hide protein sequence
MDQRVEQIEKAIETLSNGQKEIMVHFSEILGQIQNQIQHLSHQGILEVGENSAAPAHRISARQDPRTLGDLLPKTVKIDFPRFDGREDPTSWICRAEQYFEIHGIPVGDR
VTLATFHLEGDAQLWFQLLKQEANHVSWEEFKESLLNRYGPNQFLDFFGELTKLQQKVTVVEYQTTFEKLLGKVRHLPQNHQVSCFISGLKDSIRVDVQASRPTTLTIAI
SLARLYEARDASSRRTINPVQKTWVATRPIPTNNGTSATQIKKMTTEELN