; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025551 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025551
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr10:14967391..14972280
RNA-Seq ExpressionLag0025551
SyntenyLag0025551
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036008.1 Retrotransposon gag protein [Cucumis melo var. makuwa]6.1e-2868.18Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKVKPNL
        +SMA  EEENQC MST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M + + K F E N D K+HSR+PSRMKRK SV INTEGSL VKP  
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKVKPNL

Query:  IILTNPANEG
        II TNP NEG
Subjt:  IILTNPANEG

KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa]3.6e-2865.22Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKVKPNL
        +SMA  EEENQC  ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HSR+PSRMKRK SV INTEGSL VKP  
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKVKPNL

Query:  IILTNPANEGSDQAM
        II TNP NEG ++ +
Subjt:  IILTNPANEGSDQAM

KAA0050734.1 gag protease polyprotein [Cucumis melo var. makuwa]8.0e-2865.22Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKVKPNL
        +SMA  EEENQC  ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HSR+PSRMKRK SV INTEGSL VKP  
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKVKPNL

Query:  IILTNPANEGSDQAM
        II TNP NEG  + +
Subjt:  IILTNPANEGSDQAM

KAA0050736.1 retrotransposon gag protein [Cucumis melo var. makuwa]4.7e-2861.16Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKVKPNL
        +SMA  EEENQC MST TR SAF+RLS+S SKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K++SR+PSR+KRK S+ INTEGSL VKP  
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKVKPNL

Query:  IILTNPANEGSDQAMTKIRAF
        II TNP NEG ++ + + + F
Subjt:  IILTNPANEGSDQAMTKIRAF

TYK18884.1 gag protease polyprotein [Cucumis melo var. makuwa]8.0e-2864.35Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKVKPNL
        +SMA  EEENQC  ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HSR+PSRMKRK S+ INT+GSL VKP L
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKVKPNL

Query:  IILTNPANEGSDQAM
        II TNP NEG ++ +
Subjt:  IILTNPANEGSDQAM

TrEMBL top hitse value%identityAlignment
A0A5A7SZJ7 Retrotransposon gag protein3.0e-2868.18Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKVKPNL
        +SMA  EEENQC MST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M + + K F E N D K+HSR+PSRMKRK SV INTEGSL VKP  
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKVKPNL

Query:  IILTNPANEG
        II TNP NEG
Subjt:  IILTNPANEG

A0A5A7TQ06 Retrotransposon gag protein1.7e-2865.22Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKVKPNL
        +SMA  EEENQC  ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HSR+PSRMKRK SV INTEGSL VKP  
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKVKPNL

Query:  IILTNPANEGSDQAM
        II TNP NEG ++ +
Subjt:  IILTNPANEGSDQAM

A0A5A7U974 Retrotransposon gag protein2.3e-2861.16Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKVKPNL
        +SMA  EEENQC MST TR SAF+RLS+S SKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K++SR+PSR+KRK S+ INTEGSL VKP  
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKVKPNL

Query:  IILTNPANEGSDQAMTKIRAF
        II TNP NEG ++ + + + F
Subjt:  IILTNPANEGSDQAMTKIRAF

A0A5D3BBF9 Gag protease polyprotein3.9e-2865.22Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKVKPNL
        +SMA  EEENQC  ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HSR+PSRMKRK SV INTEGSL VKP  
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKVKPNL

Query:  IILTNPANEGSDQAM
        II TNP NEG  + +
Subjt:  IILTNPANEGSDQAM

A0A5D3D5Q0 Gag protease polyprotein3.9e-2864.35Show/hide
Query:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKVKPNL
        +SMA  EEENQC  ST  R SAF+RLS+STSKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HSR+PSRMKRK S+ INT+GSL VKP L
Subjt:  MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKVKPNL

Query:  IILTNPANEGSDQAM
        II TNP NEG ++ +
Subjt:  IILTNPANEGSDQAM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCAAAGAAAAGTCGATCTTC
AACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATA
GTAGAATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGGTGAAGCCAAATCTCATTATCTTGACCAATCCTGCAAATGAAGGA
TCTGATCAAGCCATGACAAAGATAAGAGCTTTTAAATGTAAAAGCTCCTTATCGCAAGAGCCTAAACTGCATGATGATCCTAGCCCACACGAGCTTAAAAGGTTCTTCGC
TGCAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCTAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCC
AAGTTCGAGGGTTCTCAGTTGCACAACTGCTACGTTGTTCCTCCTCCAAGTGCGAAGGATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGT
TCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTGCGTTGCAGTTCCTTCTTTCCAAGGTCGAAGGTTCTCACTCGCTGCGTTGCAGT
TCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTTGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCT
CCGCTGCTGCAGTTCATTCTTCCAAGTTCGAAGGTCGAAGGTTCTCACGCGCTGCGTTGCAATTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCAATTCCTT
CCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTTTCACGCGCTTCGTTGCATTTCCTTCCCCCAAGTTCGAAGGTTCTCACGC
GCTTCGCTGCAATTCCTTCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCACGCACTTTGCTGCAGTTCCTTCTCCCAAATT
CGAAGGTTCTCACGCATTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACATCCCTTCGGAGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCG
GCAGTTCCTTCCTCCAAGTTCGAAGGTCCTCACGCGGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAGGTTCTCACACGCTTCGCTGGAGTTCCTTCCTCCAAGTTCGAG
GGTTCTCACGCGCTTCGCTGCAGTTCCTCCTCCAAGTTCGAGGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAGGTTCTCACGCGCTTCGCTGCAGTTC
CTTCCCCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTCGAAGGTTCT
CACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGTTCCTTCCCCCAAGTTTGAAGGTTCTCACGCGCTTCGCTGCAATTCCTTCCTCCAAGTTCAAAGGTTCTCACG
CACTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCACAAGTTCGAAGGTTCTCACGCGCTGCTGCAGCTCCTTCCTCCAA
GTTCGAAGGTTCCCTCACACGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTACTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACCTTC
CTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGTTCCTTCCTCCAAGTTCGAATGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGGAGGCGCTTCTCTC
CACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACTTCTCCAAGTTCAAAGGTGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCA
CTGCTCCTTCTCCAAGTTCGAAGGTGCTTCTCTCCACCCCTCTTTTTGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACCGTTGCTCCTTTTCAAA
TGTTTGGCGGCGGTTGACGTCCTCGTTCCGCTTCATCTTCAAATGTTGGTAGTTGACGGCGTCCGATGCGCTTCATCTTCAAATGTTGGCAGAAACTACAGTCATCAAAG
TGACTGGTCTAGACAAGTGGTGAAGTCATTGCAATTGAATCTGATGACGACCGTTGTAGGCGAGTCGGGTCTGGTGACCACCCCTGCAGGTTACTCAGATCACCCAGTAA
AATGGGGACTGGAGTGCATCACTGTAGGCAAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAGTAAAATGGGGACTGGTCTAGCAGGAGTGAAATCATTGCAA
GCGAATTTGAGGCAGAGAGGGCAGAGTCCAGAGCATTCTCCCAAGATCCATAGTCGTCAGAATCCAAAGAGTCCAGAGAATTCAGAGATCCGAGATTCAAAATTCAAAGG
ATTCAAGACTCAGAAGATTAGAAGACTAGAAGACTCTAAAAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGGATC
AACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAACAGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTATGGCCGCGACAGAGGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCAAAGAAAAGTCGATCTTC
AACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATA
GTAGAATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGGTGAAGCCAAATCTCATTATCTTGACCAATCCTGCAAATGAAGGA
TCTGATCAAGCCATGACAAAGATAAGAGCTTTTAAATGTAAAAGCTCCTTATCGCAAGAGCCTAAACTGCATGATGATCCTAGCCCACACGAGCTTAAAAGGTTCTTCGC
TGCAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCTAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCC
AAGTTCGAGGGTTCTCAGTTGCACAACTGCTACGTTGTTCCTCCTCCAAGTGCGAAGGATCTTATGTGGTGCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGT
TCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTGCGTTGCAGTTCCTTCTTTCCAAGGTCGAAGGTTCTCACTCGCTGCGTTGCAGT
TCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTTGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCT
CCGCTGCTGCAGTTCATTCTTCCAAGTTCGAAGGTCGAAGGTTCTCACGCGCTGCGTTGCAATTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCAATTCCTT
CCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGGTTTTCACGCGCTTCGTTGCATTTCCTTCCCCCAAGTTCGAAGGTTCTCACGC
GCTTCGCTGCAATTCCTTCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCACGCACTTTGCTGCAGTTCCTTCTCCCAAATT
CGAAGGTTCTCACGCATTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACATCCCTTCGGAGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCG
GCAGTTCCTTCCTCCAAGTTCGAAGGTCCTCACGCGGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAGGTTCTCACACGCTTCGCTGGAGTTCCTTCCTCCAAGTTCGAG
GGTTCTCACGCGCTTCGCTGCAGTTCCTCCTCCAAGTTCGAGGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAGGTTCTCACGCGCTTCGCTGCAGTTC
CTTCCCCCAAGTTTGAAGGTTCTCACATCGCTTCGCTGCGATCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCAAGTTCGAAGGTTCT
CACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGTTCCTTCCCCCAAGTTTGAAGGTTCTCACGCGCTTCGCTGCAATTCCTTCCTCCAAGTTCAAAGGTTCTCACG
CACTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCACAAGTTCGAAGGTTCTCACGCGCTGCTGCAGCTCCTTCCTCCAA
GTTCGAAGGTTCCCTCACACGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTACTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACCTTC
CTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGTTCCTTCCTCCAAGTTCGAATGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGGAGGCGCTTCTCTC
CACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACTTCTCCAAGTTCAAAGGTGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCA
CTGCTCCTTCTCCAAGTTCGAAGGTGCTTCTCTCCACCCCTCTTTTTGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACCGTTGCTCCTTTTCAAA
TGTTTGGCGGCGGTTGACGTCCTCGTTCCGCTTCATCTTCAAATGTTGGTAGTTGACGGCGTCCGATGCGCTTCATCTTCAAATGTTGGCAGAAACTACAGTCATCAAAG
TGACTGGTCTAGACAAGTGGTGAAGTCATTGCAATTGAATCTGATGACGACCGTTGTAGGCGAGTCGGGTCTGGTGACCACCCCTGCAGGTTACTCAGATCACCCAGTAA
AATGGGGACTGGAGTGCATCACTGTAGGCAAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAGTAAAATGGGGACTGGTCTAGCAGGAGTGAAATCATTGCAA
GCGAATTTGAGGCAGAGAGGGCAGAGTCCAGAGCATTCTCCCAAGATCCATAGTCGTCAGAATCCAAAGAGTCCAGAGAATTCAGAGATCCGAGATTCAAAATTCAAAGG
ATTCAAGACTCAGAAGATTAGAAGACTAGAAGACTCTAAAAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGGATC
AACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAACAGGCTGA
Protein sequenceShow/hide protein sequence
MSMAATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKVKPNLIILTNPANEG
SDQAMTKIRAFKCKSSLSQEPKLHDDPSPHELKRFFAAVPSLQVRGSLHCTLLRCSFSKFEGSSLYPAALFLLQVRGFSVAQLLRCSSSKCEGSYVVRCCIVPSSLKFDG
SHAALLEFLLPKFEGSHALRCSSFFPRSKVLTRCVAVLSPQVRRFTHFAAVPSPKFEGSHALRCSSFLQVRRFSAAAVHSSKFEGRRFSRAALQFFLPKFEGSRTSLQFL
PPSSKVLTRFVQFLPPNSKVFTRFVAFPSPKFEGSHALRCNSFPKFEGSHALRAVPSSKFEGSRTLLQFLLPNSKVLTHFAAVPSSKFEGSHIPSEQFLPPSSKVLTRFA
AVPSSKFEGPHAASLQFLPPSSRFSHASLEFLPPSSRVLTRFAAVPPPSSRVLTRFAAVPSSKFEVLTRFAAVPSPKFEGSHIASLRSFLQVRRFSRASLCNSFPKFEGS
HALRAVPSSKFEVPSPKFEGSHALRCNSFLQVQRFSRTSLQFLPPSSKVLTRFAAVPSHKFEGSHALLQLLPPSSKVPSHASLAPSPSSKALLSTAPSPSSKALLSVATF
LQVRRFSHALLQFLPPSSNVPSRASLAPSPSSEALLSTAPSPSSKALLSVATSPSSKVLLSTAPSPSSKALLSTAPSPSSKVLLSTPLFEGSPLRFSFSKFEGSPLLLFK
CLAAVDVLVPLHLQMLVVDGVRCASSSNVGRNYSHQSDWSRQVVKSLQLNLMTTVVGESGLVTTPAGYSDHPVKWGLECITVGKSGDYPCRLLRSPSKMGTGLAGVKSLQ
ANLRQRGQSPEHSPKIHSRQNPKSPENSEIRDSKFKGFKTQKIRRLEDSKKRSTSQPTDQEDQQVSRPIIQEDQQANKPIQQIIKPTG