; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000757 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000757
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr4:15292520..15297782
RNA-Seq ExpressionLag0000757
SyntenyLag0000757
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.8e-3155.09Show/hide
Query:  MTAKEIFSKNF---HKKEKKNFATSYCIDV----------EEVDNSKKGEQRTSVFDRIKPSTTRPSIFQRMSMAATKEENQRSTSTSTRPSAFQRLSVS
        +T  E F ++F   H +E     T +   +          EEVDNS + +QRTSVFDRIKP TTR S+FQR+SMA  +EENQ  TST  R SAF+RLS+S
Subjt:  MTAKEIFSKNF---HKKEKKNFATSYCIDV----------EEVDNSKKGEQRTSVFDRIKPSTTRPSIFQRMSMAATKEENQRSTSTSTRPSAFQRLSVS

Query:  TLKKSRSSTSVFDRLKVTNDQPKRKMNNSELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKV
        T KK R STS FDRLK+TNDQ +R+M + + K F E N D K+HS +PSRMKRK SV INTEGSL V
Subjt:  TLKKSRSSTSVFDRLKVTNDQPKRKMNNSELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKV

KAA0050734.1 gag protease polyprotein [Cucumis melo var. makuwa]9.0e-3165.35Show/hide
Query:  EEVDNSKKGEQRTSVFDRIKPSTTRPSIFQRMSMAATKEENQRSTSTSTRPSAFQRLSVSTLKKSRSSTSVFDRLKVTNDQPKRKMNNSELKLFDEVNSD
        EEVDNS + +QRTS+FDRIKP TTR  +FQR+SMA  +EENQ  TST  R SAF+RLS+ST KK R STS FDRLK+TNDQ +R+M + + K F E N D
Subjt:  EEVDNSKKGEQRTSVFDRIKPSTTRPSIFQRMSMAATKEENQRSTSTSTRPSAFQRLSVSTLKKSRSSTSVFDRLKVTNDQPKRKMNNSELKLFDEVNSD

Query:  KKLHSSIPSRMKRKFSVLINTEGSLKV
         K+HS +PSRMKRK SV INTEGSL V
Subjt:  KKLHSSIPSRMKRKFSVLINTEGSLKV

KAA0055462.1 retrotransposon gag protein [Cucumis melo var. makuwa]9.0e-3165.35Show/hide
Query:  EEVDNSKKGEQRTSVFDRIKPSTTRPSIFQRMSMAATKEENQRSTSTSTRPSAFQRLSVSTLKKSRSSTSVFDRLKVTNDQPKRKMNNSELKLFDEVNSD
        EEVDNS + +QRTSVFDRIKP TTR S+FQR+SMA  +E+NQ  TST  R SAF+RLS+ST KK R STS FDRLK+TNDQ +R+M + + K F E N D
Subjt:  EEVDNSKKGEQRTSVFDRIKPSTTRPSIFQRMSMAATKEENQRSTSTSTRPSAFQRLSVSTLKKSRSSTSVFDRLKVTNDQPKRKMNNSELKLFDEVNSD

Query:  KKLHSSIPSRMKRKFSVLINTEGSLKV
         K+H+ +PSRMKRK SV INTEGSL V
Subjt:  KKLHSSIPSRMKRKFSVLINTEGSLKV

TYK08944.1 retrotransposon gag protein [Cucumis melo var. makuwa]9.0e-3165.35Show/hide
Query:  EEVDNSKKGEQRTSVFDRIKPSTTRPSIFQRMSMAATKEENQRSTSTSTRPSAFQRLSVSTLKKSRSSTSVFDRLKVTNDQPKRKMNNSELKLFDEVNSD
        EEVDNS + +QRTSVFDRIKP TTR S+FQR+SMA  +E+NQ  TST  R SAF+RLS+ST KK R STS FDRLK+TNDQ +R+M + + K F E N D
Subjt:  EEVDNSKKGEQRTSVFDRIKPSTTRPSIFQRMSMAATKEENQRSTSTSTRPSAFQRLSVSTLKKSRSSTSVFDRLKVTNDQPKRKMNNSELKLFDEVNSD

Query:  KKLHSSIPSRMKRKFSVLINTEGSLKV
         K+H+ +PSRMKRK SV INTEGSL V
Subjt:  KKLHSSIPSRMKRKFSVLINTEGSLKV

TYK16519.1 retrotransposon gag protein [Cucumis melo var. makuwa]6.9e-3166.14Show/hide
Query:  EEVDNSKKGEQRTSVFDRIKPSTTRPSIFQRMSMAATKEENQRSTSTSTRPSAFQRLSVSTLKKSRSSTSVFDRLKVTNDQPKRKMNNSELKLFDEVNSD
        EEVDNS + +QRTSVFDRIKP TTR  +FQR+SMA  +EENQ  TST  R SAF+RLS+ST KK R STS FDRLK+TN+Q KR+M + + K F E N D
Subjt:  EEVDNSKKGEQRTSVFDRIKPSTTRPSIFQRMSMAATKEENQRSTSTSTRPSAFQRLSVSTLKKSRSSTSVFDRLKVTNDQPKRKMNNSELKLFDEVNSD

Query:  KKLHSSIPSRMKRKFSVLINTEGSLKV
         K+HS +PSRMKRK SV INTEGSL V
Subjt:  KKLHSSIPSRMKRKFSVLINTEGSLKV

TrEMBL top hitse value%identityAlignment
A0A5A7TQ06 Retrotransposon gag protein8.8e-3255.09Show/hide
Query:  MTAKEIFSKNF---HKKEKKNFATSYCIDV----------EEVDNSKKGEQRTSVFDRIKPSTTRPSIFQRMSMAATKEENQRSTSTSTRPSAFQRLSVS
        +T  E F ++F   H +E     T +   +          EEVDNS + +QRTSVFDRIKP TTR S+FQR+SMA  +EENQ  TST  R SAF+RLS+S
Subjt:  MTAKEIFSKNF---HKKEKKNFATSYCIDV----------EEVDNSKKGEQRTSVFDRIKPSTTRPSIFQRMSMAATKEENQRSTSTSTRPSAFQRLSVS

Query:  TLKKSRSSTSVFDRLKVTNDQPKRKMNNSELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKV
        T KK R STS FDRLK+TNDQ +R+M + + K F E N D K+HS +PSRMKRK SV INTEGSL V
Subjt:  TLKKSRSSTSVFDRLKVTNDQPKRKMNNSELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKV

A0A5A7UI09 Retrotransposon gag protein4.4e-3165.35Show/hide
Query:  EEVDNSKKGEQRTSVFDRIKPSTTRPSIFQRMSMAATKEENQRSTSTSTRPSAFQRLSVSTLKKSRSSTSVFDRLKVTNDQPKRKMNNSELKLFDEVNSD
        EEVDNS + +QRTSVFDRIKP TTR S+FQR+SMA  +E+NQ  TST  R SAF+RLS+ST KK R STS FDRLK+TNDQ +R+M + + K F E N D
Subjt:  EEVDNSKKGEQRTSVFDRIKPSTTRPSIFQRMSMAATKEENQRSTSTSTRPSAFQRLSVSTLKKSRSSTSVFDRLKVTNDQPKRKMNNSELKLFDEVNSD

Query:  KKLHSSIPSRMKRKFSVLINTEGSLKV
         K+H+ +PSRMKRK SV INTEGSL V
Subjt:  KKLHSSIPSRMKRKFSVLINTEGSLKV

A0A5D3BBF9 Gag protease polyprotein4.4e-3165.35Show/hide
Query:  EEVDNSKKGEQRTSVFDRIKPSTTRPSIFQRMSMAATKEENQRSTSTSTRPSAFQRLSVSTLKKSRSSTSVFDRLKVTNDQPKRKMNNSELKLFDEVNSD
        EEVDNS + +QRTS+FDRIKP TTR  +FQR+SMA  +EENQ  TST  R SAF+RLS+ST KK R STS FDRLK+TNDQ +R+M + + K F E N D
Subjt:  EEVDNSKKGEQRTSVFDRIKPSTTRPSIFQRMSMAATKEENQRSTSTSTRPSAFQRLSVSTLKKSRSSTSVFDRLKVTNDQPKRKMNNSELKLFDEVNSD

Query:  KKLHSSIPSRMKRKFSVLINTEGSLKV
         K+HS +PSRMKRK SV INTEGSL V
Subjt:  KKLHSSIPSRMKRKFSVLINTEGSLKV

A0A5D3CCI8 Retrotransposon gag protein4.4e-3165.35Show/hide
Query:  EEVDNSKKGEQRTSVFDRIKPSTTRPSIFQRMSMAATKEENQRSTSTSTRPSAFQRLSVSTLKKSRSSTSVFDRLKVTNDQPKRKMNNSELKLFDEVNSD
        EEVDNS + +QRTSVFDRIKP TTR S+FQR+SMA  +E+NQ  TST  R SAF+RLS+ST KK R STS FDRLK+TNDQ +R+M + + K F E N D
Subjt:  EEVDNSKKGEQRTSVFDRIKPSTTRPSIFQRMSMAATKEENQRSTSTSTRPSAFQRLSVSTLKKSRSSTSVFDRLKVTNDQPKRKMNNSELKLFDEVNSD

Query:  KKLHSSIPSRMKRKFSVLINTEGSLKV
         K+H+ +PSRMKRK SV INTEGSL V
Subjt:  KKLHSSIPSRMKRKFSVLINTEGSLKV

A0A5D3D209 Retrotransposon gag protein3.4e-3166.14Show/hide
Query:  EEVDNSKKGEQRTSVFDRIKPSTTRPSIFQRMSMAATKEENQRSTSTSTRPSAFQRLSVSTLKKSRSSTSVFDRLKVTNDQPKRKMNNSELKLFDEVNSD
        EEVDNS + +QRTSVFDRIKP TTR  +FQR+SMA  +EENQ  TST  R SAF+RLS+ST KK R STS FDRLK+TN+Q KR+M + + K F E N D
Subjt:  EEVDNSKKGEQRTSVFDRIKPSTTRPSIFQRMSMAATKEENQRSTSTSTRPSAFQRLSVSTLKKSRSSTSVFDRLKVTNDQPKRKMNNSELKLFDEVNSD

Query:  KKLHSSIPSRMKRKFSVLINTEGSLKV
         K+HS +PSRMKRK SV INTEGSL V
Subjt:  KKLHSSIPSRMKRKFSVLINTEGSLKV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGCGAAGGAGATCTTCTCCAAAAATTTCCACAAGAAGGAAAAAAAGAACTTTGCAACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCCAAGAAGGGTGA
ACAAAGGACATCCGTCTTTGATCGCATCAAGCCTTCAACTACTCGTCCTTCGATATTCCAAAGAATGAGTATGGCCGCGACAAAAGAAGAAAATCAACGTTCGACGTCCA
CCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATTAAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAA
AGAAAGATGAACAACTCGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGCATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAA
TACGGAAGGTTCCTTGAAGGTTCCCACATTGCGCTGTTGTGCTGCTTCCTTCTCCAAGTTCGAAGGTTCTGACGCTGTGCTGCTACCTTCCTCCAAGTTCGAAGGTTTTC
ATGCGCTTTGTTGCAGTTCCTTCTCTCCAAGTTCAAAAGGTTCTCACGCATTCGGCTACAGTTCATTTCCTCCAAATTTGAAGGTGTTCTCACGCGCGCCGCTGCAGTTC
CTTCTCTCCAAGTTCGAAGGTGTTCTCATGCGCGCCGCTGCAGTTCCTTCTCTCCAAACTCGAAGGTGTTCTCACGCGCGCCACTGCAGTTCATTCTCTCCAAGTTTGAA
GGTTCTCTCAAGTGCTTCGCTGCAGTTCCTTCCTCCCAAATTTGAAGGTTCTCACGACGCTCCGCTACAGTTCCTTCTCTCTCCAAATTCGAAGGTTCTCACGCGCTCCG
CTGCAGTTCCTTCGCTTTCGCTGCAATTCCTTCTCTCCGAGTTCGAAGGTTCTCACGACGTTTCGTTGCAGTTCCTTCCTCCCAAATTGAAGGTTCTCACGACGCTCCGC
TGTAGTTCCTTCTCTCCAAATTTGAAGGTGTTCTCACGCGCGCCGCTGCAGTTCCTTCTCTCCAAGTTCGAAGGTGTTCTCGCGCGCTTCGCTGCAGTTCCTTCCTCCAA
AATTCGAAGGTTCTCTCACGCACTTCGTTACAGTTCCTTCTCTCCAAGTATGAAGGTTCTCTCCTCCAAGTCGAAGGTTCTCACGTTGCTTCACTGCAGTTCCTTCCTCC
AAGTTCGAAGTTCCTTCCTCCCTAAGTTCGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCTCTTCGCTGCACAATTCCTTCCTCAAGTTCGAAGGTTCTCATGCGC
TTCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTC
GAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGTTCCTTCCTC
CAAGTTTGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTACT
TCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGTGCTTCTCTCCACCCCTCTTTT
TGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACTGTTGCTCCTTTTCAAATGTTTGGCGGCGGTTGACGTCTTCGTTCCGTTTCATCTTCAAATGT
TGGCGAGTCTGGTGATCACCCCTTCAAGATACTACAGTCATCAAAGTGACTGGTCTAGACAGGCGGTGAAGTCACTGCAATTGAATCTGATGACGACCGTTGTAGGCGAG
TCTGGTGATGAAGTCACTGCAAGTGAATCTGATGACGACCGTTGTAGGCGAGTCGAGTCTGGTGACCACCCTTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGG
TCTAGCAGGAGTGCATGAAGGCGAATCTGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTGACTACCCCTGCAGAG
ATCAACAAGCCAACCGACCGATCAAGAAGATCAGCAAGTCAGCAGGTCGATCATCTAAGAAGAGATCAACAAGCCAACTGACCGATCAAGAAGATCAACAAGTCAGTAGG
CCGATCATCCAAGAAGATCAACAAGCAGACCGATCGATCAACAGGATCAATAAGTCAACAGGCCGATCATCCAAAAGGATCAACAAGCTAACAAGCCGATCCAACAGATC
ATCAAGCCAACAAGCCGATCCAACAGATCATCAAGTTAACAGACTGATCATCCAAGAAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGAT
CATCCAAGAAGATCAACAAGCCAACTGATCGATCAAGAGGATCAATAAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGACTGCGAAGGAGATCTTCTCCAAAAATTTCCACAAGAAGGAAAAAAAGAACTTTGCAACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCCAAGAAGGGTGA
ACAAAGGACATCCGTCTTTGATCGCATCAAGCCTTCAACTACTCGTCCTTCGATATTCCAAAGAATGAGTATGGCCGCGACAAAAGAAGAAAATCAACGTTCGACGTCCA
CCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATTAAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAA
AGAAAGATGAACAACTCGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGCATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAA
TACGGAAGGTTCCTTGAAGGTTCCCACATTGCGCTGTTGTGCTGCTTCCTTCTCCAAGTTCGAAGGTTCTGACGCTGTGCTGCTACCTTCCTCCAAGTTCGAAGGTTTTC
ATGCGCTTTGTTGCAGTTCCTTCTCTCCAAGTTCAAAAGGTTCTCACGCATTCGGCTACAGTTCATTTCCTCCAAATTTGAAGGTGTTCTCACGCGCGCCGCTGCAGTTC
CTTCTCTCCAAGTTCGAAGGTGTTCTCATGCGCGCCGCTGCAGTTCCTTCTCTCCAAACTCGAAGGTGTTCTCACGCGCGCCACTGCAGTTCATTCTCTCCAAGTTTGAA
GGTTCTCTCAAGTGCTTCGCTGCAGTTCCTTCCTCCCAAATTTGAAGGTTCTCACGACGCTCCGCTACAGTTCCTTCTCTCTCCAAATTCGAAGGTTCTCACGCGCTCCG
CTGCAGTTCCTTCGCTTTCGCTGCAATTCCTTCTCTCCGAGTTCGAAGGTTCTCACGACGTTTCGTTGCAGTTCCTTCCTCCCAAATTGAAGGTTCTCACGACGCTCCGC
TGTAGTTCCTTCTCTCCAAATTTGAAGGTGTTCTCACGCGCGCCGCTGCAGTTCCTTCTCTCCAAGTTCGAAGGTGTTCTCGCGCGCTTCGCTGCAGTTCCTTCCTCCAA
AATTCGAAGGTTCTCTCACGCACTTCGTTACAGTTCCTTCTCTCCAAGTATGAAGGTTCTCTCCTCCAAGTCGAAGGTTCTCACGTTGCTTCACTGCAGTTCCTTCCTCC
AAGTTCGAAGTTCCTTCCTCCCTAAGTTCGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCTCTTCGCTGCACAATTCCTTCCTCAAGTTCGAAGGTTCTCATGCGC
TTCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTC
GAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGTTCCTTCCTC
CAAGTTTGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTACT
TCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGTGCTTCTCTCCACCCCTCTTTT
TGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACTGTTGCTCCTTTTCAAATGTTTGGCGGCGGTTGACGTCTTCGTTCCGTTTCATCTTCAAATGT
TGGCGAGTCTGGTGATCACCCCTTCAAGATACTACAGTCATCAAAGTGACTGGTCTAGACAGGCGGTGAAGTCACTGCAATTGAATCTGATGACGACCGTTGTAGGCGAG
TCTGGTGATGAAGTCACTGCAAGTGAATCTGATGACGACCGTTGTAGGCGAGTCGAGTCTGGTGACCACCCTTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGG
TCTAGCAGGAGTGCATGAAGGCGAATCTGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTGACTACCCCTGCAGAG
ATCAACAAGCCAACCGACCGATCAAGAAGATCAGCAAGTCAGCAGGTCGATCATCTAAGAAGAGATCAACAAGCCAACTGACCGATCAAGAAGATCAACAAGTCAGTAGG
CCGATCATCCAAGAAGATCAACAAGCAGACCGATCGATCAACAGGATCAATAAGTCAACAGGCCGATCATCCAAAAGGATCAACAAGCTAACAAGCCGATCCAACAGATC
ATCAAGCCAACAAGCCGATCCAACAGATCATCAAGTTAACAGACTGATCATCCAAGAAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGAT
CATCCAAGAAGATCAACAAGCCAACTGATCGATCAAGAGGATCAATAAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAGCTAA
Protein sequenceShow/hide protein sequence
MTAKEIFSKNFHKKEKKNFATSYCIDVEEVDNSKKGEQRTSVFDRIKPSTTRPSIFQRMSMAATKEENQRSTSTSTRPSAFQRLSVSTLKKSRSSTSVFDRLKVTNDQPK
RKMNNSELKLFDEVNSDKKLHSSIPSRMKRKFSVLINTEGSLKVPTLRCCAASFSKFEGSDAVLLPSSKFEGFHALCCSSFSPSSKGSHAFGYSSFPPNLKVFSRAPLQF
LLSKFEGVLMRAAAVPSLQTRRCSHARHCSSFSPSLKVLSSASLQFLPPKFEGSHDAPLQFLLSPNSKVLTRSAAVPSLSLQFLLSEFEGSHDVSLQFLPPKLKVLTTLR
CSSFSPNLKVFSRAPLQFLLSKFEGVLARFAAVPSSKIRRFSHALRYSSFSPSMKVLSSKSKVLTLLHCSSFLQVRSSFLPKFEVPSSKFEGSHALRCTIPSSSSKVLMR
FVATFLQVRRFSHALLQLLPPSSKVPSRASLAPSPSSKALLSTAPSPSSKALLSVATFLQVRRFSHALLQFLPPSLKVPSRASLAPSPSSKALLSVATSPSSKALLSTAT
SPSSKALLSTAPSPSSKALLSTAPSPSSKVLLSTPLFEGSPLRFSFSKFEGSLLLLFKCLAAVDVFVPFHLQMLASLVITPSRYYSHQSDWSRQAVKSLQLNLMTTVVGE
SGDEVTASESDDDRCRRVESGDHPCRLLRSPNKMGTGLAGVHEGESGYSDHPIKWGLGLAGVHEGESGDYPCRDQQANRPIKKISKSAGRSSKKRSTSQLTDQEDQQVSR
PIIQEDQQADRSINRINKSTGRSSKRINKLTSRSNRSSSQQADPTDHQVNRLIIQEDQQANRPIKKINKSAGRSSKKINKPTDRSRGSISQQADHPRGSTS