; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg027241 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg027241
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRT_RNaseH_2 domain-containing protein
Genome locationscaffold8:6258768..6261686
RNA-Seq ExpressionSpg027241
SyntenySpg027241
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus]2.1e-1627.71Show/hide
Query:  FINDLARAKYLKMLKRDFLFERGF------NDDLPHFLRAGITNHGWDQFCAKPESVNTNIVREFYANIDEQEGFQAMVRGVAVDWSPGAINSLFNLQ--
        F+++ A+  Y  +  R   FE GF      N +L   +   +T H W +F   P  VN  IV+EFY+NI E      MVRG+++ ++P AIN  F LQ  
Subjt:  FINDLARAKYLKMLKRDFLFERGF------NDDLPHFLRAGITNHGWDQFCAKPESVNTNIVREFYANIDEQEGFQAMVRGVAVDWSPGAINSLFNLQ--

Query:  DFPHAGFNKMV---------------VAPFNDQLNATVRKVVSRD-----------------------------RVLLVFAILRSLSIDVSKIISNEIYN
        D  +  F + V                  +N Q     RK V RD                             R+LL+ +IL   +ID+ KII    + 
Subjt:  DFPHAGFNKMV---------------VAPFNDQLNATVRKVVSRD-----------------------------RVLLVFAILRSLSIDVSKIISNEIYN

Query:  CWRKKVGKLFFPNTITMLCSRPGVPTSPEDVILLDKGIIDKPNLARLQRTQEARQGGLVCGIHQILEQLQLLASRQEYAERQAQT-------------YW
        C +++   L FPN IT LC +  V     D IL     ++K  +  L   +EA+         ++     + AS  +  +   +T             Y+
Subjt:  CWRKKVGKLFFPNTITMLCSRPGVPTSPEDVILLDKGIIDKPNLARLQRTQEARQGGLVCGIHQILEQLQLLASRQEYAERQAQT-------------YW

Query:  TYAKRRDATLRRAL
         YAKRRDA L  AL
Subjt:  TYAKRRDATLRRAL

KAE8712804.1 hypothetical protein F3Y22_tig00110223pilonHSYRG00028 [Hibiscus syriacus]5.7e-1430.12Show/hide
Query:  ARAKYLKMLKRDFLFERGF------NDDLPHFLRAGITNHGWDQFCAKPESVNTNIVREFYANIDEQEGFQAMVRGVAVDWSPGAINSLFNLQ-------
        A  +Y  +  R   FE GF      + DL   +   +T H W +F   P +VN  +V+EFY+NI E      MV G+++ ++P AIN  F LQ       
Subjt:  ARAKYLKMLKRDFLFERGF------NDDLPHFLRAGITNHGWDQFCAKPESVNTNIVREFYANIDEQEGFQAMVRGVAVDWSPGAINSLFNLQ-------

Query:  --------DFPHA----GFNKMVVAPFNDQLNATVRKVVSRDRVLLVFAILRSLSIDVSKIISNEIYNCWRKKVGKLFFPNTITMLCSRPGVPTSPEDVI
                 FP       F K  + P +   N T    VS  R++L+ +I+  L+ID+ KII  + + C ++    L FPN IT L  +  V     D I
Subjt:  --------DFPHA----GFNKMVVAPFNDQLNATVRKVVSRDRVLLVFAILRSLSIDVSKIISNEIYNCWRKKVGKLFFPNTITMLCSRPGVPTSPEDVI

Query:  LLDKGIIDKPN-----LARLQRTQEARQGGLVCGIHQILEQLQLLASRQ
        L     ++K         R  + + ARQ  L      +  QLQ   SRQ
Subjt:  LLDKGIIDKPN-----LARLQRTQEARQGGLVCGIHQILEQLQLLASRQ

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]3.3e-1732.02Show/hide
Query:  RFINDLARAKYLKMLK-RDFLFERGFNDD-------LPHFLRAGITNHGWDQFCAKPESVNTNIVREFYANIDEQEGFQAMVRGVAVDWSPGAINSLFNL
        +F  + A  +Y   ++ R    E+GF  D       LP F+   IT H W QFCA PE     +VREFYAN+ +       VRGV V WS  AIN++F L
Subjt:  RFINDLARAKYLKMLK-RDFLFERGFNDD-------LPHFLRAGITNHGWDQFCAKPESVNTNIVREFYANIDEQEGFQAMVRGVAVDWSPGAINSLFNL

Query:  QD--FPHAGFNKMV-----------VAPFNDQLNATVR-------------------------------KVVSRDRVLLVFAILRSLSIDVSKIISNEIY
         D    H+ F + +           VA    + N + +                               K VS+DR+LL+ ++L   SI+V ++I +EI 
Subjt:  QD--FPHAGFNKMV-----------VAPFNDQLNATVR-------------------------------KVVSRDRVLLVFAILRSLSIDVSKIISNEIY

Query:  NCWRKKVGKLFFPNTITMLCSRPGVPTSPEDVILLDKGIIDKPNLARLQRTQE
         C  +K G LFFP+ IT LC     P    +  L + G ID   +AR+  TQE
Subjt:  NCWRKKVGKLFFPNTITMLCSRPGVPTSPEDVILLDKGIIDKPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]3.6e-2430.84Show/hide
Query:  RFINDLARAKYLKMLK-RDFLFERGFNDD-------LPHFLRAGITNHGWDQFCAKPESVNTNIVREFYANIDEQEGFQAMVRGVAVDWSPGAINSLFNL
        +F  + A  +Y   ++ R    E+GF  D       LP F+   IT H W QFCA PE     +VREFYAN+ + E     VRGV V WS  AIN++F L
Subjt:  RFINDLARAKYLKMLK-RDFLFERGFNDD-------LPHFLRAGITNHGWDQFCAKPESVNTNIVREFYANIDEQEGFQAMVRGVAVDWSPGAINSLFNL

Query:  QD--FPHAGFNKMV-----------VAPFNDQLNATVR-------------------------------KVVSRDRVLLVFAILRSLSIDVSKIISNEIY
         D    H+ F + +           VA    + N + +                               K VS+DR+LL+ ++L   SI+V ++I +EI 
Subjt:  QD--FPHAGFNKMV-----------VAPFNDQLNATVR-------------------------------KVVSRDRVLLVFAILRSLSIDVSKIISNEIY

Query:  NCWRKKVGKLFFPNTITMLCSRPGVPTSPEDVILLDKGIIDKPNLARL------QRTQEARQGGLVCGIH-----QILEQLQLLASR------QEY----
         C  +K G LFFP+ IT LC     P    +  L + G ID   +AR+      + TQ+                 IL+QL+ L  R      Q+Y    
Subjt:  NCWRKKVGKLFFPNTITMLCSRPGVPTSPEDVILLDKGIIDKPNLARL------QRTQEARQGGLVCGIH-----QILEQLQLLASR------QEY----

Query:  ----AERQAQTYWTYAKRRDATLRRALQYNFSKP
              +Q Q +W Y+K RD  L++ALQ NF++P
Subjt:  ----AERQAQTYWTYAKRRDATLRRALQYNFSKP

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]2.3e-1830.77Show/hide
Query:  IVREFYANIDEQEGFQAMVRGVAVDWSPGAINSLFNLQD--FPHAGFNKMVVAP------------------------------------------FNDQ
        +VREFYAN+ + E     VRGV V WS  AIN++F L D    H+ F + +  P                                           +  
Subjt:  IVREFYANIDEQEGFQAMVRGVAVDWSPGAINSLFNLQD--FPHAGFNKMVVAP------------------------------------------FNDQ

Query:  LNATVRKVVSRDRVLLVFAILRSLSIDVSKIISNEIYNCWRKKVGKLFFPNTITMLCSRPGVPTSPEDVILLDKGIIDKPNLARL--------------Q
        L  T  K+VS+DR+LL+ ++L   SI+V ++I +EI  C  +K G LFFP+ IT LC     P    +  L + G ID   +AR+               
Subjt:  LNATVRKVVSRDRVLLVFAILRSLSIDVSKIISNEIYNCWRKKVGKLFFPNTITMLCSRPGVPTSPEDVILLDKGIIDKPNLARL--------------Q

Query:  RTQEARQGGLVCGIHQILEQLQLLASRQEYAERQAQTYWTYAKRRDATLRRALQYNFSKP
        R   A        + Q L+ L+   S+QE+  +Q Q +W Y+K RD  L++ALQ NF++P
Subjt:  RTQEARQGGLVCGIHQILEQLQLLASRQEYAERQAQTYWTYAKRRDATLRRALQYNFSKP

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)1.6e-1732.02Show/hide
Query:  RFINDLARAKYLKMLK-RDFLFERGFNDD-------LPHFLRAGITNHGWDQFCAKPESVNTNIVREFYANIDEQEGFQAMVRGVAVDWSPGAINSLFNL
        +F  + A  +Y   ++ R    E+GF  D       LP F+   IT H W QFCA PE     +VREFYAN+ +       VRGV V WS  AIN++F L
Subjt:  RFINDLARAKYLKMLK-RDFLFERGFNDD-------LPHFLRAGITNHGWDQFCAKPESVNTNIVREFYANIDEQEGFQAMVRGVAVDWSPGAINSLFNL

Query:  QD--FPHAGFNKMV-----------VAPFNDQLNATVR-------------------------------KVVSRDRVLLVFAILRSLSIDVSKIISNEIY
         D    H+ F + +           VA    + N + +                               K VS+DR+LL+ ++L   SI+V ++I +EI 
Subjt:  QD--FPHAGFNKMV-----------VAPFNDQLNATVR-------------------------------KVVSRDRVLLVFAILRSLSIDVSKIISNEIY

Query:  NCWRKKVGKLFFPNTITMLCSRPGVPTSPEDVILLDKGIIDKPNLARLQRTQE
         C  +K G LFFP+ IT LC     P    +  L + G ID   +AR+  TQE
Subjt:  NCWRKKVGKLFFPNTITMLCSRPGVPTSPEDVILLDKGIIDKPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)1.7e-2430.84Show/hide
Query:  RFINDLARAKYLKMLK-RDFLFERGFNDD-------LPHFLRAGITNHGWDQFCAKPESVNTNIVREFYANIDEQEGFQAMVRGVAVDWSPGAINSLFNL
        +F  + A  +Y   ++ R    E+GF  D       LP F+   IT H W QFCA PE     +VREFYAN+ + E     VRGV V WS  AIN++F L
Subjt:  RFINDLARAKYLKMLK-RDFLFERGFNDD-------LPHFLRAGITNHGWDQFCAKPESVNTNIVREFYANIDEQEGFQAMVRGVAVDWSPGAINSLFNL

Query:  QD--FPHAGFNKMV-----------VAPFNDQLNATVR-------------------------------KVVSRDRVLLVFAILRSLSIDVSKIISNEIY
         D    H+ F + +           VA    + N + +                               K VS+DR+LL+ ++L   SI+V ++I +EI 
Subjt:  QD--FPHAGFNKMV-----------VAPFNDQLNATVR-------------------------------KVVSRDRVLLVFAILRSLSIDVSKIISNEIY

Query:  NCWRKKVGKLFFPNTITMLCSRPGVPTSPEDVILLDKGIIDKPNLARL------QRTQEARQGGLVCGIH-----QILEQLQLLASR------QEY----
         C  +K G LFFP+ IT LC     P    +  L + G ID   +AR+      + TQ+                 IL+QL+ L  R      Q+Y    
Subjt:  NCWRKKVGKLFFPNTITMLCSRPGVPTSPEDVILLDKGIIDKPNLARL------QRTQEARQGGLVCGIH-----QILEQLQLLASR------QEY----

Query:  ----AERQAQTYWTYAKRRDATLRRALQYNFSKP
              +Q Q +W Y+K RD  L++ALQ NF++P
Subjt:  ----AERQAQTYWTYAKRRDATLRRALQYNFSKP

A0A2P5DXM3 Uncharacterized protein1.1e-1830.77Show/hide
Query:  IVREFYANIDEQEGFQAMVRGVAVDWSPGAINSLFNLQD--FPHAGFNKMVVAP------------------------------------------FNDQ
        +VREFYAN+ + E     VRGV V WS  AIN++F L D    H+ F + +  P                                           +  
Subjt:  IVREFYANIDEQEGFQAMVRGVAVDWSPGAINSLFNLQD--FPHAGFNKMVVAP------------------------------------------FNDQ

Query:  LNATVRKVVSRDRVLLVFAILRSLSIDVSKIISNEIYNCWRKKVGKLFFPNTITMLCSRPGVPTSPEDVILLDKGIIDKPNLARL--------------Q
        L  T  K+VS+DR+LL+ ++L   SI+V ++I +EI  C  +K G LFFP+ IT LC     P    +  L + G ID   +AR+               
Subjt:  LNATVRKVVSRDRVLLVFAILRSLSIDVSKIISNEIYNCWRKKVGKLFFPNTITMLCSRPGVPTSPEDVILLDKGIIDKPNLARL--------------Q

Query:  RTQEARQGGLVCGIHQILEQLQLLASRQEYAERQAQTYWTYAKRRDATLRRALQYNFSKP
        R   A        + Q L+ L+   S+QE+  +Q Q +W Y+K RD  L++ALQ NF++P
Subjt:  RTQEARQGGLVCGIHQILEQLQLLASRQEYAERQAQTYWTYAKRRDATLRRALQYNFSKP

A0A6A2ZUE4 Uncharacterized protein1.0e-1627.71Show/hide
Query:  FINDLARAKYLKMLKRDFLFERGF------NDDLPHFLRAGITNHGWDQFCAKPESVNTNIVREFYANIDEQEGFQAMVRGVAVDWSPGAINSLFNLQ--
        F+++ A+  Y  +  R   FE GF      N +L   +   +T H W +F   P  VN  IV+EFY+NI E      MVRG+++ ++P AIN  F LQ  
Subjt:  FINDLARAKYLKMLKRDFLFERGF------NDDLPHFLRAGITNHGWDQFCAKPESVNTNIVREFYANIDEQEGFQAMVRGVAVDWSPGAINSLFNLQ--

Query:  DFPHAGFNKMV---------------VAPFNDQLNATVRKVVSRD-----------------------------RVLLVFAILRSLSIDVSKIISNEIYN
        D  +  F + V                  +N Q     RK V RD                             R+LL+ +IL   +ID+ KII    + 
Subjt:  DFPHAGFNKMV---------------VAPFNDQLNATVRKVVSRD-----------------------------RVLLVFAILRSLSIDVSKIISNEIYN

Query:  CWRKKVGKLFFPNTITMLCSRPGVPTSPEDVILLDKGIIDKPNLARLQRTQEARQGGLVCGIHQILEQLQLLASRQEYAERQAQT-------------YW
        C +++   L FPN IT LC +  V     D IL     ++K  +  L   +EA+         ++     + AS  +  +   +T             Y+
Subjt:  CWRKKVGKLFFPNTITMLCSRPGVPTSPEDVILLDKGIIDKPNLARLQRTQEARQGGLVCGIHQILEQLQLLASRQEYAERQAQT-------------YW

Query:  TYAKRRDATLRRAL
         YAKRRDA L  AL
Subjt:  TYAKRRDATLRRAL

A0A6A3B6J9 RT_RNaseH_2 domain-containing protein2.8e-1430.12Show/hide
Query:  ARAKYLKMLKRDFLFERGF------NDDLPHFLRAGITNHGWDQFCAKPESVNTNIVREFYANIDEQEGFQAMVRGVAVDWSPGAINSLFNLQ-------
        A  +Y  +  R   FE GF      + DL   +   +T H W +F   P +VN  +V+EFY+NI E      MV G+++ ++P AIN  F LQ       
Subjt:  ARAKYLKMLKRDFLFERGF------NDDLPHFLRAGITNHGWDQFCAKPESVNTNIVREFYANIDEQEGFQAMVRGVAVDWSPGAINSLFNLQ-------

Query:  --------DFPHA----GFNKMVVAPFNDQLNATVRKVVSRDRVLLVFAILRSLSIDVSKIISNEIYNCWRKKVGKLFFPNTITMLCSRPGVPTSPEDVI
                 FP       F K  + P +   N T    VS  R++L+ +I+  L+ID+ KII  + + C ++    L FPN IT L  +  V     D I
Subjt:  --------DFPHA----GFNKMVVAPFNDQLNATVRKVVSRDRVLLVFAILRSLSIDVSKIISNEIYNCWRKKVGKLFFPNTITMLCSRPGVPTSPEDVI

Query:  LLDKGIIDKPN-----LARLQRTQEARQGGLVCGIHQILEQLQLLASRQ
        L     ++K         R  + + ARQ  L      +  QLQ   SRQ
Subjt:  LLDKGIIDKPN-----LARLQRTQEARQGGLVCGIHQILEQLQLLASRQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTGTGTCCGGTTGTCCCTTACAAGTGGTGTCAGAGCCATGCCCCGGGAGTAGCCGTGTTGGGTGGAATCCTCGGATGCCGAACAAAGAAGTTGTGAGCCTTAATAG
TCATGGTGGAGGTCCGTGGATGAGATTACTCTGGGTAGGGGGTATACTCCGGAGAAGAGGAGTCATCCTAGATAAGAGGATTACTCTGGGTAGGGGTCCCCTACAAGTGG
TATCAGAGCTCCGTTCGATCCAAAGTTTGGAGCGAGCTGAGGTTACGGAAGTAGTGAGAAAATTGATTGAAGACATTGCTGAGGAAGTGGTTGAGGAAGAACAACCAAAA
GACCCTGAGGAAAAGAAAGATGTTGAACAAGGAGACCAAACAGTTGAAAATCCGCAAGAGGAGCAAGAAAAGCGAGTGGAAGATGTACAAAGACAAGGTAATGGTCAGGA
GCAACATACTCAAGAGGTCATGCCGGAGATTCCACGTCGTCGCCGCTGCAAGCAAAAGGCGGGACGAAATAAGGTGATCAAGACAGACACTCCATCTCCGTCAACGACAG
AGTCTAAGAAGGAAAATTCTGAAAAAAATGAGCAAGAAAAAGAAGAGGCTGAGAAGAAAACGGAAGAAGAAGCCTTGACGAAGCAACAAGAGGACAAGGGCAAGGGAGTT
GTTGAAGCGCTGGCAAGGACAGAAGAGACTGACGTTGAGGAACCGAGTCTGCCGTATGCGCGCTTCATTAATGATCTTGCCAGAGCAAAATATCTGAAAATGTTGAAGAG
GGACTTCTTGTTTGAAAGGGGATTCAATGATGACTTGCCTCACTTCCTACGTGCTGGGATTACAAACCACGGATGGGATCAGTTTTGCGCGAAGCCAGAATCGGTGAATA
CAAATATTGTTCGTGAATTTTATGCGAACATAGATGAGCAAGAGGGATTCCAAGCCATGGTCCGAGGGGTAGCTGTCGATTGGAGCCCAGGGGCAATAAATTCTCTCTTC
AACCTCCAGGATTTTCCACATGCCGGATTTAATAAGATGGTGGTGGCACCATTTAACGATCAATTGAACGCGACTGTTCGAAAGGTTGTATCACGAGATAGAGTGTTGCT
GGTATTTGCCATTCTGCGATCCTTGAGCATAGATGTTAGTAAGATTATTTCTAATGAAATCTACAATTGTTGGCGTAAGAAGGTGGGAAAGTTGTTCTTCCCAAACACGA
TTACCATGTTGTGCAGCAGGCCAGGGGTGCCCACGAGTCCAGAGGATGTTATTCTACTTGATAAAGGGATTATAGACAAGCCCAATTTGGCTCGGCTTCAGCGCACACAG
GAGGCTCGCCAAGGTGGGCTAGTGTGCGGCATCCACCAGATTTTAGAGCAACTTCAACTTTTGGCCAGTAGGCAAGAGTATGCTGAAAGGCAGGCTCAGACCTACTGGAC
CTATGCTAAGAGACGGGATGCCACCCTAAGGAGGGCCCTGCAATACAATTTTTCAAAACCATACTAG
mRNA sequenceShow/hide mRNA sequence
ATGTGTGTGTCCGGTTGTCCCTTACAAGTGGTGTCAGAGCCATGCCCCGGGAGTAGCCGTGTTGGGTGGAATCCTCGGATGCCGAACAAAGAAGTTGTGAGCCTTAATAG
TCATGGTGGAGGTCCGTGGATGAGATTACTCTGGGTAGGGGGTATACTCCGGAGAAGAGGAGTCATCCTAGATAAGAGGATTACTCTGGGTAGGGGTCCCCTACAAGTGG
TATCAGAGCTCCGTTCGATCCAAAGTTTGGAGCGAGCTGAGGTTACGGAAGTAGTGAGAAAATTGATTGAAGACATTGCTGAGGAAGTGGTTGAGGAAGAACAACCAAAA
GACCCTGAGGAAAAGAAAGATGTTGAACAAGGAGACCAAACAGTTGAAAATCCGCAAGAGGAGCAAGAAAAGCGAGTGGAAGATGTACAAAGACAAGGTAATGGTCAGGA
GCAACATACTCAAGAGGTCATGCCGGAGATTCCACGTCGTCGCCGCTGCAAGCAAAAGGCGGGACGAAATAAGGTGATCAAGACAGACACTCCATCTCCGTCAACGACAG
AGTCTAAGAAGGAAAATTCTGAAAAAAATGAGCAAGAAAAAGAAGAGGCTGAGAAGAAAACGGAAGAAGAAGCCTTGACGAAGCAACAAGAGGACAAGGGCAAGGGAGTT
GTTGAAGCGCTGGCAAGGACAGAAGAGACTGACGTTGAGGAACCGAGTCTGCCGTATGCGCGCTTCATTAATGATCTTGCCAGAGCAAAATATCTGAAAATGTTGAAGAG
GGACTTCTTGTTTGAAAGGGGATTCAATGATGACTTGCCTCACTTCCTACGTGCTGGGATTACAAACCACGGATGGGATCAGTTTTGCGCGAAGCCAGAATCGGTGAATA
CAAATATTGTTCGTGAATTTTATGCGAACATAGATGAGCAAGAGGGATTCCAAGCCATGGTCCGAGGGGTAGCTGTCGATTGGAGCCCAGGGGCAATAAATTCTCTCTTC
AACCTCCAGGATTTTCCACATGCCGGATTTAATAAGATGGTGGTGGCACCATTTAACGATCAATTGAACGCGACTGTTCGAAAGGTTGTATCACGAGATAGAGTGTTGCT
GGTATTTGCCATTCTGCGATCCTTGAGCATAGATGTTAGTAAGATTATTTCTAATGAAATCTACAATTGTTGGCGTAAGAAGGTGGGAAAGTTGTTCTTCCCAAACACGA
TTACCATGTTGTGCAGCAGGCCAGGGGTGCCCACGAGTCCAGAGGATGTTATTCTACTTGATAAAGGGATTATAGACAAGCCCAATTTGGCTCGGCTTCAGCGCACACAG
GAGGCTCGCCAAGGTGGGCTAGTGTGCGGCATCCACCAGATTTTAGAGCAACTTCAACTTTTGGCCAGTAGGCAAGAGTATGCTGAAAGGCAGGCTCAGACCTACTGGAC
CTATGCTAAGAGACGGGATGCCACCCTAAGGAGGGCCCTGCAATACAATTTTTCAAAACCATACTAG
Protein sequenceShow/hide protein sequence
MCVSGCPLQVVSEPCPGSSRVGWNPRMPNKEVVSLNSHGGGPWMRLLWVGGILRRRGVILDKRITLGRGPLQVVSELRSIQSLERAEVTEVVRKLIEDIAEEVVEEEQPK
DPEEKKDVEQGDQTVENPQEEQEKRVEDVQRQGNGQEQHTQEVMPEIPRRRRCKQKAGRNKVIKTDTPSPSTTESKKENSEKNEQEKEEAEKKTEEEALTKQQEDKGKGV
VEALARTEETDVEEPSLPYARFINDLARAKYLKMLKRDFLFERGFNDDLPHFLRAGITNHGWDQFCAKPESVNTNIVREFYANIDEQEGFQAMVRGVAVDWSPGAINSLF
NLQDFPHAGFNKMVVAPFNDQLNATVRKVVSRDRVLLVFAILRSLSIDVSKIISNEIYNCWRKKVGKLFFPNTITMLCSRPGVPTSPEDVILLDKGIIDKPNLARLQRTQ
EARQGGLVCGIHQILEQLQLLASRQEYAERQAQTYWTYAKRRDATLRRALQYNFSKPY