; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg030558 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg030558
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRT_RNaseH_2 domain-containing protein
Genome locationscaffold6:31601555..31604038
RNA-Seq ExpressionSpg030558
SyntenySpg030558
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054837.1 hypothetical protein E6C27_scaffold406G00150 [Cucumis melo var. makuwa]1.8e-1131.55Show/hide
Query:  EEKSHFAEEWG---FSCKLPQTMTVQISHHKWELVCENFPNYAPTLVHDFYRAQWQEEINEVLV--RGHTDDDFLNAVLTAVARPGAQWDGHGSAR-TLW
        EE+ HF  E G   F  +L   +   I    W+            +V  FY  +   E +  +V  R    D  +   L  VA    +WD     +  L+
Subjt:  EEKSHFAEEWG---FSCKLPQTMTVQISHHKWELVCENFPNYAPTLVHDFYRAQWQEEINEVLV--RGHTDDDFLNAVLTAVARPGAQWDGHGSAR-TLW

Query:  AQLLGFEATTWLHWIKNRIMPTSHDATLSIPRVVLVYCIMKGITLDLRRIISEALCQTDAKPKGAMFCPSLVTRLC--RASSLLQAP
           L  EA+ WL +IK ++MPT HD T+S  R++L+YCIM+ I +D+  II + +      P+GA   P L+ RLC    S L Q+P
Subjt:  AQLLGFEATTWLHWIKNRIMPTSHDATLSIPRVVLVYCIMKGITLDLRRIISEALCQTDAKPKGAMFCPSLVTRLC--RASSLLQAP

MCH80834.1 hypothetical protein [Trifolium medium]7.5e-1026.67Show/hide
Query:  YAPTLVHDFYRAQWQEEINEVLVRGHTDDDFLNAVLTAVARPGAQWDGHGSARTLWAQLLGF--EATTWLHWIKNRIMPTSHDATLSIPRVVLVYCIMKG
        Y+   +++ Y  +  E+  + L+   +D++ L AV+  +  PG +W    +      + +    E   W   +K+ I+PT+H+ T++  R+VL++CIM+ 
Subjt:  YAPTLVHDFYRAQWQEEINEVLVRGHTDDDFLNAVLTAVARPGAQWDGHGSARTLWAQLLGF--EATTWLHWIKNRIMPTSHDATLSIPRVVLVYCIMKG

Query:  ITLDLRRIISEALCQTDAKPKGAMFCPSLVTRLCRASSLLQ-APDEEIQP
        I +++ RII + +     + +G ++ P L+T LC+A  + + A DE   P
Subjt:  ITLDLRRIISEALCQTDAKPKGAMFCPSLVTRLCRASSLLQ-APDEEIQP

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]7.5e-1026.78Show/hide
Query:  ISHHKWELVCENFPNYAPTLVHDFY----------------RAQWQEE-INEVLVRGHTDDDF-----------LNAVLTAVARPGAQWD-GHGSARTLW
        I+ H W+  C +  +    LV +FY                +  W EE IN V   G   D+            L  VL  VA  GA+W+     A T  
Subjt:  ISHHKWELVCENFPNYAPTLVHDFY----------------RAQWQEE-INEVLVRGHTDDDF-----------LNAVLTAVARPGAQWD-GHGSARTLW

Query:  AQLLGFEATTWLHWIKNRIMPTSHDATLSIPRVVLVYCIMKGITLDLRRIISEALCQTDAKPKGAMFCPSLVTRLCRASSLLQAPDEEIQPMWKDFDDHW
           L   A  W H++K+ ++PT+H  T+S  R++L++ ++ G ++++ R+I   +    A+  GA+F PSL+TRLCR +      +EE      + D   
Subjt:  AQLLGFEATTWLHWIKNRIMPTSHDATLSIPRVVLVYCIMKGITLDLRRIISEALCQTDAKPKGAMFCPSLVTRLCRASSLLQAPDEEIQPMWKDFDDHW

Query:  WTDITCTHNVRLQAEAQRQDPASQRQDPASQRHVPASQA
                     A   ++ P    Q P+S R   AS +
Subjt:  WTDITCTHNVRLQAEAQRQDPASQRQDPASQRHVPASQA

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]1.5e-1027.69Show/hide
Query:  ERPAADPSKKRKGPPTAS--KGKEKVREEKSHFAEE--WGFSCKLPQTMTVQ--ISHHKWELVCENFPNYAPTLVHDFYRAQWQEEINEVLVRG------
        ER A + SK  K    A+  + +E ++       +E  W  S  L Q   +   I  H W+L C +  +    LV +FY      + + V +RG      
Subjt:  ERPAADPSKKRKGPPTAS--KGKEKVREEKSHFAEE--WGFSCKLPQTMTVQ--ISHHKWELVCENFPNYAPTLVHDFYRAQWQEEINEVLVRG------

Query:  ----------------HTD--DDF----LNAVLTAVARPGAQWD-GHGSARTLWAQLLGFEATTWLHWIKNRIMPTSHDATLSIPRVVLVYCIMKGITLD
                        H++  +D     L  VL  VA  GA+W+     A T     L   A  W H++K+R++PT+H  T+S   V L+Y ++ G +++
Subjt:  ----------------HTD--DDF----LNAVLTAVARPGAQWD-GHGSARTLWAQLLGFEATTWLHWIKNRIMPTSHDATLSIPRVVLVYCIMKGITLD

Query:  LRRIISEALCQTDAKPKGAMFCPSLVTRLCRASSLLQAPDEE
        + R+I   +C   A+  GA+F PSL+T +CR +      +EE
Subjt:  LRRIISEALCQTDAKPKGAMFCPSLVTRLCRASSLLQAPDEE

XP_024971944.1 uncharacterized protein LOC112510826 [Cynara cardunculus var. scolymus]6.8e-1125.52Show/hide
Query:  VARPGAQWDGHGS--ARTLWAQLLGFEATTWLHWIKNRIMPTSHDATLSIPRVVLVYCIMKGITLDLRRIISEALCQTDAKPKGAMFCPSLVTRLCRASS
        + + G +WD H +   RT  A  L   A  W+++I+  + PT+HD+++S+ +++L+YC++ G T+++ +++ +A+     + +G +F PSL+ +L   + 
Subjt:  VARPGAQWDGHGS--ARTLWAQLLGFEATTWLHWIKNRIMPTSHDATLSIPRVVLVYCIMKGITLDLRRIISEALCQTDAKPKGAMFCPSLVTRLCRASS

Query:  LLQAPDEEIQPMWKDFDDHWWTDITCTHNVRLQAEAQRQDPASQR
        + +  D+ +    K+ ++ W  D+     +R ++E  R+ P   R
Subjt:  LLQAPDEEIQPMWKDFDDHWWTDITCTHNVRLQAEAQRQDPASQR

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)3.7e-1026.78Show/hide
Query:  ISHHKWELVCENFPNYAPTLVHDFY----------------RAQWQEE-INEVLVRGHTDDDF-----------LNAVLTAVARPGAQWD-GHGSARTLW
        I+ H W+  C +  +    LV +FY                +  W EE IN V   G   D+            L  VL  VA  GA+W+     A T  
Subjt:  ISHHKWELVCENFPNYAPTLVHDFY----------------RAQWQEE-INEVLVRGHTDDDF-----------LNAVLTAVARPGAQWD-GHGSARTLW

Query:  AQLLGFEATTWLHWIKNRIMPTSHDATLSIPRVVLVYCIMKGITLDLRRIISEALCQTDAKPKGAMFCPSLVTRLCRASSLLQAPDEEIQPMWKDFDDHW
           L   A  W H++K+ ++PT+H  T+S  R++L++ ++ G ++++ R+I   +    A+  GA+F PSL+TRLCR +      +EE      + D   
Subjt:  AQLLGFEATTWLHWIKNRIMPTSHDATLSIPRVVLVYCIMKGITLDLRRIISEALCQTDAKPKGAMFCPSLVTRLCRASSLLQAPDEEIQPMWKDFDDHW

Query:  WTDITCTHNVRLQAEAQRQDPASQRQDPASQRHVPASQA
                     A   ++ P    Q P+S R   AS +
Subjt:  WTDITCTHNVRLQAEAQRQDPASQRQDPASQRHVPASQA

A0A2P5DAQ2 Uncharacterized protein7.4e-1127.69Show/hide
Query:  ERPAADPSKKRKGPPTAS--KGKEKVREEKSHFAEE--WGFSCKLPQTMTVQ--ISHHKWELVCENFPNYAPTLVHDFYRAQWQEEINEVLVRG------
        ER A + SK  K    A+  + +E ++       +E  W  S  L Q   +   I  H W+L C +  +    LV +FY      + + V +RG      
Subjt:  ERPAADPSKKRKGPPTAS--KGKEKVREEKSHFAEE--WGFSCKLPQTMTVQ--ISHHKWELVCENFPNYAPTLVHDFYRAQWQEEINEVLVRG------

Query:  ----------------HTD--DDF----LNAVLTAVARPGAQWD-GHGSARTLWAQLLGFEATTWLHWIKNRIMPTSHDATLSIPRVVLVYCIMKGITLD
                        H++  +D     L  VL  VA  GA+W+     A T     L   A  W H++K+R++PT+H  T+S   V L+Y ++ G +++
Subjt:  ----------------HTD--DDF----LNAVLTAVARPGAQWD-GHGSARTLWAQLLGFEATTWLHWIKNRIMPTSHDATLSIPRVVLVYCIMKGITLD

Query:  LRRIISEALCQTDAKPKGAMFCPSLVTRLCRASSLLQAPDEE
        + R+I   +C   A+  GA+F PSL+T +CR +      +EE
Subjt:  LRRIISEALCQTDAKPKGAMFCPSLVTRLCRASSLLQAPDEE

A0A392M2J7 Uncharacterized protein (Fragment)3.7e-1026.67Show/hide
Query:  YAPTLVHDFYRAQWQEEINEVLVRGHTDDDFLNAVLTAVARPGAQWDGHGSARTLWAQLLGF--EATTWLHWIKNRIMPTSHDATLSIPRVVLVYCIMKG
        Y+   +++ Y  +  E+  + L+   +D++ L AV+  +  PG +W    +      + +    E   W   +K+ I+PT+H+ T++  R+VL++CIM+ 
Subjt:  YAPTLVHDFYRAQWQEEINEVLVRGHTDDDFLNAVLTAVARPGAQWDGHGSARTLWAQLLGF--EATTWLHWIKNRIMPTSHDATLSIPRVVLVYCIMKG

Query:  ITLDLRRIISEALCQTDAKPKGAMFCPSLVTRLCRASSLLQ-APDEEIQP
        I +++ RII + +     + +G ++ P L+T LC+A  + + A DE   P
Subjt:  ITLDLRRIISEALCQTDAKPKGAMFCPSLVTRLCRASSLLQ-APDEEIQP

A0A392MF28 Uncharacterized protein (Fragment)4.8e-1023.19Show/hide
Query:  QTMTVQISHHKWELVCENFPNYAPTLVHDFYR--AQWQEEINEVLVRGHT----------------------------DDDFLNAVLTAVARPGAQWDGH
        Q + + I+HHKW+    +  NY   +V +FY       +++ EV+VRG                               D+ L  ++ ++   G+ W+  
Subjt:  QTMTVQISHHKWELVCENFPNYAPTLVHDFYR--AQWQEEINEVLVRGHT----------------------------DDDFLNAVLTAVARPGAQWDGH

Query:  GSAR--TLWAQLLGFEATTWLHWIKNRIMPTSHDATLSIPRVVLVYCIMKGITLDLRRIISEALCQTDAKPKGAMFCPSLVTRLCRASSLLQAPDEEIQP
        G     T+    L      W  +IK+ +MPTSH+  ++  R+VL++CI     +++ +IIS+ + +      G ++ P L+T LCR   +++  ++++  
Subjt:  GSAR--TLWAQLLGFEATTWLHWIKNRIMPTSHDATLSIPRVVLVYCIMKGITLDLRRIISEALCQTDAKPKGAMFCPSLVTRLCRASSLLQAPDEEIQP

Query:  MWKDFDD
            FD+
Subjt:  MWKDFDD

A0A5D3DVQ6 Uncharacterized protein8.7e-1231.55Show/hide
Query:  EEKSHFAEEWG---FSCKLPQTMTVQISHHKWELVCENFPNYAPTLVHDFYRAQWQEEINEVLV--RGHTDDDFLNAVLTAVARPGAQWDGHGSAR-TLW
        EE+ HF  E G   F  +L   +   I    W+            +V  FY  +   E +  +V  R    D  +   L  VA    +WD     +  L+
Subjt:  EEKSHFAEEWG---FSCKLPQTMTVQISHHKWELVCENFPNYAPTLVHDFYRAQWQEEINEVLV--RGHTDDDFLNAVLTAVARPGAQWDGHGSAR-TLW

Query:  AQLLGFEATTWLHWIKNRIMPTSHDATLSIPRVVLVYCIMKGITLDLRRIISEALCQTDAKPKGAMFCPSLVTRLC--RASSLLQAP
           L  EA+ WL +IK ++MPT HD T+S  R++L+YCIM+ I +D+  II + +      P+GA   P L+ RLC    S L Q+P
Subjt:  AQLLGFEATTWLHWIKNRIMPTSHDATLSIPRVVLVYCIMKGITLDLRRIISEALCQTDAKPKGAMFCPSLVTRLC--RASSLLQAP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAATCATAGTCTACCTACCTGCGTGGGATGTGAAGGGGCATCATTACCCCAAACCAGAAAGCACTTTGCGACAAGTTGAAGGCCCAAACAGAGACATTGTCATTTC
CTCTGACGAGGAAAATGACAATCCCATCCAAGGCGAATTCATGGAGGAAGAAGAGCCCGAAGTAATCGACGGGGAGCGGCCCGTTGCAAGTTCCCAAAGGCCGCCAGCCA
CCGTGCCATCCACCTCCGGAGCCAATGCCACGGATGTACTCAACGTAGAACCCCTGGCCACATCCTCTTACAACATCCAGCAAGTAAACCTCCAGCCAGACCAAAACCTC
CCCGACCCACGAAACCTAACCCCATTCTCTTCCCTTTTTCCGTCCACATCCTTCCTTCCACCACCAGATTCTTCGGCTACCCTCGCCTCAAACCCTTCCCCATCATTTTC
TTCCACTCCTACAGTCGTTTCAAATCCCCTCCCACCTTCCACCTCCACAAATCCCACTTCACCGCAATTTTCTGAGCGTGAAGTGGCCGCATGGGACTCCCTGCTGGGGG
AGTTGGGCGAAGAAGGCGAAGCAATTCACCGAAGAGAAGAAGAAGCAAAAGTTGCGGTAGAGGAAGCTGCCCGCAAGGCCGTCGCAGAAGAAGAAACGACTAGAGAGAAA
GAACGCGCCAGCAAAGCGGCGCAAGATCGGGCTGCGGAAGAAGTCGCCGTAGCAAATCTGGCAGCAATTGCGGCAACCCTTGGCTCTCTAACGCATGGGTCAGAATCAAC
AGATTCAGAAGACGACCTACCCCTGACCCATCGCCGCAACGTCCAACCTGCTGGGGTAACCATCCAAGAGCCCTCAGAGAGACCTGCGGCAGACCCAAGCAAAAAGAGAA
AGGGGCCCCCAACCGCATCAAAAGGAAAAGAAAAAGTAAGAGAAGAGAAGAGCCACTTCGCAGAAGAATGGGGGTTTAGCTGCAAGCTCCCGCAGACTATGACGGTCCAA
ATCTCCCACCATAAATGGGAGTTAGTCTGCGAGAACTTTCCTAACTACGCCCCTACACTGGTCCATGATTTCTACAGAGCGCAGTGGCAGGAAGAGATTAATGAAGTGTT
GGTGCGTGGCCACACAGACGACGATTTCCTGAACGCAGTGCTAACTGCGGTGGCCAGACCAGGTGCTCAGTGGGATGGGCATGGCTCTGCGAGGACCCTCTGGGCGCAAC
TGTTAGGTTTTGAGGCCACAACTTGGTTGCATTGGATAAAGAACCGCATCATGCCCACATCGCATGATGCCACGTTAAGCATCCCAAGGGTGGTCCTGGTCTACTGTATA
ATGAAGGGGATCACACTTGACTTGCGCAGAATCATCTCAGAAGCCCTGTGCCAAACTGATGCAAAACCAAAAGGCGCAATGTTCTGCCCTAGCCTTGTTACGCGCCTCTG
CCGCGCATCCAGCCTGCTGCAAGCACCAGACGAAGAAATCCAACCTATGTGGAAGGACTTTGATGATCACTGGTGGACGGACATAACATGCACTCACAACGTGCGTCTGC
AGGCAGAGGCTCAACGCCAAGACCCTGCATCTCAGCGCCAAGACCCTGCATCTCAGCGCCATGTCCCCGCATCCCAGGCAACAGGAACTGCCCGTGCGCCTGAAGCACCA
AAGAAGAAAAGGCAAAAACAATCCTCCTCTGGCCGCAAGGAATACTCAGCCAGATCCACCGGCGCAGGAAAACACTATCTGCGCCCCAAATCAGGAACCCTTGCCGCCAC
CATCGCCACCGCCGTTGCCACCTGTCATCAGCCACCACCTTCTATCCAGCACGAAGAGGTCGTGCATCAATTTGAGGTCTGCGAGACCCTGTCGCCTGACTCTCCCCACA
GACAAGAAAGCCCCGCGCCAACCTCCCAGCCGCAGTGCACACCATCTGCCAGCACAAGCCATGCTACGCCTGTTTTGCAACTTGACGTGGAAACGACCCTCATCTTCTCG
GAATTAAGGCAGATGGTCCACGAAGTCGCGGAGCCACTGCGCAAGCAGCAAGATGAATTATCTCAGCGAGTTAATGCATTAGTACTATTCTTGCTGCATTGGACTGATTC
TTTCGGTCGTCTGCCTGTCACGCCAAACCTTTTCGTACGGCCGCCTGCGCAACCTAGACCTGATGACGCAGACCCATCTCAAATACGCACCCTGATGGCTCCACCTGCAC
CGAGACAGCCAAGGCCACCGCCTCCACCACCGCCGCAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGATAATCATAGTCTACCTACCTGCGTGGGATGTGAAGGGGCATCATTACCCCAAACCAGAAAGCACTTTGCGACAAGTTGAAGGCCCAAACAGAGACATTGTCATTTC
CTCTGACGAGGAAAATGACAATCCCATCCAAGGCGAATTCATGGAGGAAGAAGAGCCCGAAGTAATCGACGGGGAGCGGCCCGTTGCAAGTTCCCAAAGGCCGCCAGCCA
CCGTGCCATCCACCTCCGGAGCCAATGCCACGGATGTACTCAACGTAGAACCCCTGGCCACATCCTCTTACAACATCCAGCAAGTAAACCTCCAGCCAGACCAAAACCTC
CCCGACCCACGAAACCTAACCCCATTCTCTTCCCTTTTTCCGTCCACATCCTTCCTTCCACCACCAGATTCTTCGGCTACCCTCGCCTCAAACCCTTCCCCATCATTTTC
TTCCACTCCTACAGTCGTTTCAAATCCCCTCCCACCTTCCACCTCCACAAATCCCACTTCACCGCAATTTTCTGAGCGTGAAGTGGCCGCATGGGACTCCCTGCTGGGGG
AGTTGGGCGAAGAAGGCGAAGCAATTCACCGAAGAGAAGAAGAAGCAAAAGTTGCGGTAGAGGAAGCTGCCCGCAAGGCCGTCGCAGAAGAAGAAACGACTAGAGAGAAA
GAACGCGCCAGCAAAGCGGCGCAAGATCGGGCTGCGGAAGAAGTCGCCGTAGCAAATCTGGCAGCAATTGCGGCAACCCTTGGCTCTCTAACGCATGGGTCAGAATCAAC
AGATTCAGAAGACGACCTACCCCTGACCCATCGCCGCAACGTCCAACCTGCTGGGGTAACCATCCAAGAGCCCTCAGAGAGACCTGCGGCAGACCCAAGCAAAAAGAGAA
AGGGGCCCCCAACCGCATCAAAAGGAAAAGAAAAAGTAAGAGAAGAGAAGAGCCACTTCGCAGAAGAATGGGGGTTTAGCTGCAAGCTCCCGCAGACTATGACGGTCCAA
ATCTCCCACCATAAATGGGAGTTAGTCTGCGAGAACTTTCCTAACTACGCCCCTACACTGGTCCATGATTTCTACAGAGCGCAGTGGCAGGAAGAGATTAATGAAGTGTT
GGTGCGTGGCCACACAGACGACGATTTCCTGAACGCAGTGCTAACTGCGGTGGCCAGACCAGGTGCTCAGTGGGATGGGCATGGCTCTGCGAGGACCCTCTGGGCGCAAC
TGTTAGGTTTTGAGGCCACAACTTGGTTGCATTGGATAAAGAACCGCATCATGCCCACATCGCATGATGCCACGTTAAGCATCCCAAGGGTGGTCCTGGTCTACTGTATA
ATGAAGGGGATCACACTTGACTTGCGCAGAATCATCTCAGAAGCCCTGTGCCAAACTGATGCAAAACCAAAAGGCGCAATGTTCTGCCCTAGCCTTGTTACGCGCCTCTG
CCGCGCATCCAGCCTGCTGCAAGCACCAGACGAAGAAATCCAACCTATGTGGAAGGACTTTGATGATCACTGGTGGACGGACATAACATGCACTCACAACGTGCGTCTGC
AGGCAGAGGCTCAACGCCAAGACCCTGCATCTCAGCGCCAAGACCCTGCATCTCAGCGCCATGTCCCCGCATCCCAGGCAACAGGAACTGCCCGTGCGCCTGAAGCACCA
AAGAAGAAAAGGCAAAAACAATCCTCCTCTGGCCGCAAGGAATACTCAGCCAGATCCACCGGCGCAGGAAAACACTATCTGCGCCCCAAATCAGGAACCCTTGCCGCCAC
CATCGCCACCGCCGTTGCCACCTGTCATCAGCCACCACCTTCTATCCAGCACGAAGAGGTCGTGCATCAATTTGAGGTCTGCGAGACCCTGTCGCCTGACTCTCCCCACA
GACAAGAAAGCCCCGCGCCAACCTCCCAGCCGCAGTGCACACCATCTGCCAGCACAAGCCATGCTACGCCTGTTTTGCAACTTGACGTGGAAACGACCCTCATCTTCTCG
GAATTAAGGCAGATGGTCCACGAAGTCGCGGAGCCACTGCGCAAGCAGCAAGATGAATTATCTCAGCGAGTTAATGCATTAGTACTATTCTTGCTGCATTGGACTGATTC
TTTCGGTCGTCTGCCTGTCACGCCAAACCTTTTCGTACGGCCGCCTGCGCAACCTAGACCTGATGACGCAGACCCATCTCAAATACGCACCCTGATGGCTCCACCTGCAC
CGAGACAGCCAAGGCCACCGCCTCCACCACCGCCGCAGTAG
Protein sequenceShow/hide protein sequence
MIIIVYLPAWDVKGHHYPKPESTLRQVEGPNRDIVISSDEENDNPIQGEFMEEEEPEVIDGERPVASSQRPPATVPSTSGANATDVLNVEPLATSSYNIQQVNLQPDQNL
PDPRNLTPFSSLFPSTSFLPPPDSSATLASNPSPSFSSTPTVVSNPLPPSTSTNPTSPQFSEREVAAWDSLLGELGEEGEAIHRREEEAKVAVEEAARKAVAEEETTREK
ERASKAAQDRAAEEVAVANLAAIAATLGSLTHGSESTDSEDDLPLTHRRNVQPAGVTIQEPSERPAADPSKKRKGPPTASKGKEKVREEKSHFAEEWGFSCKLPQTMTVQ
ISHHKWELVCENFPNYAPTLVHDFYRAQWQEEINEVLVRGHTDDDFLNAVLTAVARPGAQWDGHGSARTLWAQLLGFEATTWLHWIKNRIMPTSHDATLSIPRVVLVYCI
MKGITLDLRRIISEALCQTDAKPKGAMFCPSLVTRLCRASSLLQAPDEEIQPMWKDFDDHWWTDITCTHNVRLQAEAQRQDPASQRQDPASQRHVPASQATGTARAPEAP
KKKRQKQSSSGRKEYSARSTGAGKHYLRPKSGTLAATIATAVATCHQPPPSIQHEEVVHQFEVCETLSPDSPHRQESPAPTSQPQCTPSASTSHATPVLQLDVETTLIFS
ELRQMVHEVAEPLRKQQDELSQRVNALVLFLLHWTDSFGRLPVTPNLFVRPPAQPRPDDADPSQIRTLMAPPAPRQPRPPPPPPPQ