CuGenDBv2

Lag0026093 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0026093
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotransposon gag protein
Genome location: chr10:28998119..29002121
RNA-Seq Expression: Lag0026093
Synteny: Lag0026093
Gene Ontology terms: NA
InterPro domains: NA
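
The genome location above follows a chromosome:start..end pattern. As a minimal sketch, the string can be split into its parts and the span length computed in Python. The function names `parse_location` and `span_length` are hypothetical, and the inclusive coordinate convention is an assumption that this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    # Assumption: both coordinates are inclusive, so the span covers
    # end - start + 1 positions.
    return abs(end - start) + 1


chrom, start, end = parse_location("chr10:28998119..29002121")
print(chrom, start, end, span_length(start, end))
# prints: chr10 28998119 29002121 4003
```

Under that assumption the gene spans 4003 bp on chr10.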


Homology
GenBank top hits (e value, % identity, alignment)
KAA0035697.1 retrotransposon gag protein [Cucumis melo var. makuwa] | e value: 4.2e-19 | % identity: 40.1
Query:  KVDDSKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEIFSKN----------------FHKKEKE-----------------NFATSYWIDVEE
        KVDD  YCKYH VI HPVE+CFVLK+LILKLA+E KI+LD+DEI   N                + ++EKE                 N+A+S     +E
Subjt:  KVDDSKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEIFSKN----------------FHKKEKE-----------------NFATSYWIDVEE

Query:  VDNSKKGEQRTSV---LIASSLQILVLRIPKNEYGRDKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSTSVFDRLKVKPNLIILTNPAMKDMIKTMTKIR
        V NS + +Q TSV   +  S+ +  + +  +      +EENQ  T T T+ SAF+RL +S SKK R STS FDRLK       +TN   +  +KT+ K +
Subjt:  VDNSKKGEQRTSV---LIASSLQILVLRIPKNEYGRDKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSTSVFDRLKVKPNLIILTNPAMKDMIKTMTKIR

Query:  AF
        +F
Subjt:  AF

KAA0065608.1 retrotransposon gag protein [Cucumis melo var. makuwa] | e value: 1.5e-21 | % identity: 37.05
Query:  KVDDSKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEI--------------------FSKNFHKKEKENFA------TSYWIDV-------EE
        KVDD  YCKYHRVI H +E+CFVLK+LILKLA + KIELD+DE+                    + ++F +   E         T+  ++V       EE
Subjt:  KVDDSKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEI--------------------FSKNFHKKEKENFA------TSYWIDV-------EE

Query:  VDNSKKGEQRTSVLIASSLQILVLR---IPKNEYGRDKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSTSVFDRLKVKPNLIILTNPAMKDMIKTMTKIR
        VDNS + +QRT V     ++ L  R     +      +EE QC TST+TR S F+RLS+STSKK R STS FDRLK       +TN   +  +K++ K +
Subjt:  VDNSKKGEQRTSVLIASSLQILVLR---IPKNEYGRDKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSTSVFDRLKVKPNLIILTNPAMKDMIKTMTKIR

Query:  AFKCKSSLSQEPKLHD-APSPHELKSSFSPSTKVLFSKFEGSYIAAMQFLL
         F  K+    + K+H   PS  + K S   +T       EGS     +F++
Subjt:  AFKCKSSLSQEPKLHD-APSPHELKSSFSPSTKVLFSKFEGSYIAAMQFLL

KAA0065984.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | e value: 3.9e-25 | % identity: 46.59
Query:  KVDDSKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLD--------------------EIFSKNFHKKEKENFA------TSYWIDV-------EE
        KVDD  YCKYHRVI HPVE+CFVLK+LILKLA+E KI+LD+D                    E   ++F +   E         T+  ++V       EE
Subjt:  KVDDSKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLD--------------------EIFSKNFHKKEKENFA------TSYWIDV-------EE

Query:  VDNSKKGEQRTSVL-IASSLQILVLRIPKNEYGRDKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSTSVFDRLKVK
        VDNS + +QRTSV      L    L   +      +EENQC TST+TR SAF+RLS+STSKK R STS FDRLK+K
Subjt:  VDNSKKGEQRTSVL-IASSLQILVLRIPKNEYGRDKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSTSVFDRLKVK

KAA0066166.1 Retrotransposon gag protein [Cucumis melo var. makuwa] | e value: 3.8e-20 | % identity: 41.27
Query:  KVDDSKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEIFSKNF------------------HKKEKENFATSYWIDV----------EEVDNSK
        KVDD  YCKYHRVI HPVE+CF+LK +ILKLA+E KIELD+ E+   N                   H +E       + + +          EEVDNS 
Subjt:  KVDDSKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEIFSKNF------------------HKKEKENFATSYWIDV----------EEVDNSK

Query:  KGEQRTSVLIASSLQILVLRIPKNEYGRDKEE-NQCSTSTFTRPSAFQRLSVSTSKKSRSSTSVFDRLKVKPNLIILTNPAMKDMIKTM
        + +QRT V       I    I +      KEE NQC  ST T+ SAF+RLS+STSK+ R  TS FDRLK       +TN   +  +KT+
Subjt:  KGEQRTSVLIASSLQILVLRIPKNEYGRDKEE-NQCSTSTFTRPSAFQRLSVSTSKKSRSSTSVFDRLKVKPNLIILTNPAMKDMIKTM

TYK15207.1 Retrotransposon gag protein [Cucumis melo var. makuwa] | e value: 1.7e-20 | % identity: 41.27
Query:  KVDDSKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEIFSKNF------------------HKKEKENFATSYWIDV----------EEVDNSK
        KVDD  YCKYHRVI HPVE+CF+LK++ILKLA+E KIELD+ E+   N                   H +E       + + +          EEVDNS 
Subjt:  KVDDSKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEIFSKNF------------------HKKEKENFATSYWIDV----------EEVDNSK

Query:  KGEQRTSVLIASSLQILVLRIPKNEYGRDKEE-NQCSTSTFTRPSAFQRLSVSTSKKSRSSTSVFDRLKVKPNLIILTNPAMKDMIKTM
        + +QRT V       I    I +      KEE NQC  ST T+ SAF+RLS+STSK+ R  TS FDRLK       +TN   +  +KT+
Subjt:  KGEQRTSVLIASSLQILVLRIPKNEYGRDKEE-NQCSTSTFTRPSAFQRLSVSTSKKSRSSTSVFDRLKVKPNLIILTNPAMKDMIKTM

TrEMBL top hits (e value, % identity, alignment)
A0A5A7VFA5 Ty3-gypsy retrotransposon protein | e value: 1.9e-25 | % identity: 46.59
Query:  KVDDSKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLD--------------------EIFSKNFHKKEKENFA------TSYWIDV-------EE
        KVDD  YCKYHRVI HPVE+CFVLK+LILKLA+E KI+LD+D                    E   ++F +   E         T+  ++V       EE
Subjt:  KVDDSKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLD--------------------EIFSKNFHKKEKENFA------TSYWIDV-------EE

Query:  VDNSKKGEQRTSVL-IASSLQILVLRIPKNEYGRDKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSTSVFDRLKVK
        VDNS + +QRTSV      L    L   +      +EENQC TST+TR SAF+RLS+STSKK R STS FDRLK+K
Subjt:  VDNSKKGEQRTSVL-IASSLQILVLRIPKNEYGRDKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSTSVFDRLKVK

A0A5A7VII4 Retrotransposon gag protein | e value: 1.8e-20 | % identity: 41.27
Query:  KVDDSKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEIFSKNF------------------HKKEKENFATSYWIDV----------EEVDNSK
        KVDD  YCKYHRVI HPVE+CF+LK +ILKLA+E KIELD+ E+   N                   H +E       + + +          EEVDNS 
Subjt:  KVDDSKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEIFSKNF------------------HKKEKENFATSYWIDV----------EEVDNSK

Query:  KGEQRTSVLIASSLQILVLRIPKNEYGRDKEE-NQCSTSTFTRPSAFQRLSVSTSKKSRSSTSVFDRLKVKPNLIILTNPAMKDMIKTM
        + +QRT V       I    I +      KEE NQC  ST T+ SAF+RLS+STSK+ R  TS FDRLK       +TN   +  +KT+
Subjt:  KGEQRTSVLIASSLQILVLRIPKNEYGRDKEE-NQCSTSTFTRPSAFQRLSVSTSKKSRSSTSVFDRLKVKPNLIILTNPAMKDMIKTM

A0A5D3CA53 Retrotransposon gag protein | e value: 7.5e-22 | % identity: 37.05
Query:  KVDDSKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEI--------------------FSKNFHKKEKENFA------TSYWIDV-------EE
        KVDD  YCKYHRVI H +E+CFVLK+LILKLA + KIELD+DE+                    + ++F +   E         T+  ++V       EE
Subjt:  KVDDSKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEI--------------------FSKNFHKKEKENFA------TSYWIDV-------EE

Query:  VDNSKKGEQRTSVLIASSLQILVLR---IPKNEYGRDKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSTSVFDRLKVKPNLIILTNPAMKDMIKTMTKIR
        VDNS + +QRT V     ++ L  R     +      +EE QC TST+TR S F+RLS+STSKK R STS FDRLK       +TN   +  +K++ K +
Subjt:  VDNSKKGEQRTSVLIASSLQILVLR---IPKNEYGRDKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSTSVFDRLKVKPNLIILTNPAMKDMIKTMTKIR

Query:  AFKCKSSLSQEPKLHD-APSPHELKSSFSPSTKVLFSKFEGSYIAAMQFLL
         F  K+    + K+H   PS  + K S   +T       EGS     +F++
Subjt:  AFKCKSSLSQEPKLHD-APSPHELKSSFSPSTKVLFSKFEGSYIAAMQFLL

A0A5D3CTF5 Retrotransposon gag protein | e value: 8.3e-21 | % identity: 41.27
Query:  KVDDSKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEIFSKNF------------------HKKEKENFATSYWIDV----------EEVDNSK
        KVDD  YCKYHRVI HPVE+CF+LK++ILKLA+E KIELD+ E+   N                   H +E       + + +          EEVDNS 
Subjt:  KVDDSKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEIFSKNF------------------HKKEKENFATSYWIDV----------EEVDNSK

Query:  KGEQRTSVLIASSLQILVLRIPKNEYGRDKEE-NQCSTSTFTRPSAFQRLSVSTSKKSRSSTSVFDRLKVKPNLIILTNPAMKDMIKTM
        + +QRT V       I    I +      KEE NQC  ST T+ SAF+RLS+STSK+ R  TS FDRLK       +TN   +  +KT+
Subjt:  KGEQRTSVLIASSLQILVLRIPKNEYGRDKEE-NQCSTSTFTRPSAFQRLSVSTSKKSRSSTSVFDRLKVKPNLIILTNPAMKDMIKTM

A0A5D3E4T1 Retrotransposon gag protein | e value: 2.0e-19 | % identity: 40.1
Query:  KVDDSKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEIFSKN----------------FHKKEKE-----------------NFATSYWIDVEE
        KVDD  YCKYH VI HPVE+CFVLK+LILKLA+E KI+LD+DEI   N                + ++EKE                 N+A+S     +E
Subjt:  KVDDSKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEIFSKN----------------FHKKEKE-----------------NFATSYWIDVEE

Query:  VDNSKKGEQRTSV---LIASSLQILVLRIPKNEYGRDKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSTSVFDRLKVKPNLIILTNPAMKDMIKTMTKIR
        V NS + +Q TSV   +  S+ +  + +  +      +EENQ  T T T+ SAF+RL +S SKK R STS FDRLK       +TN   +  +KT+ K +
Subjt:  VDNSKKGEQRTSV---LIASSLQILVLRIPKNEYGRDKEENQCSTSTFTRPSAFQRLSVSTSKKSRSSTSVFDRLKVKPNLIILTNPAMKDMIKTMTKIR

Query:  AF
        +F
Subjt:  AF

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGAGAAAGTCGATGATTCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTTCTAAAGGACTTAATTTTAAAGCTGGCTAAGGAAGG
CAAAATCGAGCTCGACCTTGATGAAATCTTCTCCAAAAATTTCCACAAGAAGGAAAAAGAGAACTTTGCAACTTCCTACTGGATCGACGTAGAAGAAGTTGACAATTCCA
AGAAGGGTGAACAAAGGACATCCGTCTTGATCGCATCAAGCCTCCAAATACTCGTCCTTCGTATTCCAAAGAATGAGTATGGCCGCGACAAAGAAGAAAATCAATGTTCG
ACGTCCACCTTCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGCCTCAAAGTGAAACCAAATCT
CATTATCTTGACCAATCCTGCAATGAAGGATATGATCAAGACCATGACAAAGATAAGAGCTTTTAAATGTAAAAGCTCCTTATCGCAAGAGCCTAAACTGCATGATGCTC
CTAGCCCACACGAGCTTAAAAGTTCCTTTTCTCCAAGTACGAAGGTGCTCTTCTCCAAGTTCGAAGGCTCTTACATTGCTGCGATGCAGTTCCTTCTCTCCAAGTTCAAA
GGTTCTCACGCGTTCCGCTACAATTCCTTCTCTCCAAGTACGAAGGTTCTCTCCTCCAAGTCGAAGGTTCTCACGTTGCTTCACTGCAGTTCCTTCCTCCAAGTTCGAAG
GTTCTCACGTTGCTTCGCCGTAGTTCCTTCTCTCACGGTACGAAGGTTCTCTCCTCCAAGTCGAAGGTTCTCTCACGTTGCTTCGCTGAGTTCCTTCCTCCAAGTTCGAA
GGTTTTCATGCGCTTTGCTGCAGTTCCTTCTCTCCAAGTTCAAAGGTTCTCACGCATTTCGCTACAGTTCATTTCCTCCAAGTTCGAAGGTTCTCACGTTGCTTCGCCGT
AGTTCCTTCTCTACAAGTACGAAGTTCTTCCTCCAAGTCGAAGGTTCTCTCACGTTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTTTCATGCGCTTTGTTGCA
GTTCCTTCTCTCCAAGTTCAAAGGTTCTCACGGATTTCGCTACAGTTCATTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGGTGAAGTTCTTCCTACAAGTCTGAAGG
TTCTCACGTGCTTCGGTGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTTACGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGCTGCAGTTT
CTTCTCCCTAAGTTCGAAGGTTCTCACGCGCTTCGATGCAGTTCCTTCCTCCCTAAGTTCGAAGGTTCTCACTCGATTCGCTGCAGTTCCTTCCTCCAAATTCGAAGGTT
TCGAAGGTTCTCACGGATGTGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCCTCACGCGCT
TTCTCTTCGCTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCATTCTCCAGTTCGAAGCGCTTCTCTCGCTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTC
TCCGTTGCTACTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTC
CACTACTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGCTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGCTGCTCCTACTCCAAGTTCGAAGGTGCTTCTCTCAC
GACTCCTTCTCCAAGTTCGAAGGCGCTTCTCTATACTACTCCTTCTCCAAGTTCGAAGGTGTTTCTCTCCACCCCTCCTTTTGAAGTTGACGGCGTCCGTTGCACTTCAT
CTTCAAATGTTGGCAGTTGACGGCGTCCGCTTCGCTTCATCTTCAAAAATTGACTGTGATGAAGTCACTGCAAGTGAATCTGATGACGACCGTTGTAGGCGAGTCGAGTA
TGGTGACCACCCTTGCAGGTTACTCAAATCACCCAATAAAATGGGGACTGGTCTAGCAGGAGTGCATCACTGTAGGCAAATCTGGAGTGCATCACTGAAGGCGAATCTGG
TGACTACCCCTGCAGGCGAATCTGGTGACTACCCTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGAGTGCATCACTGAAGGCGAATCTGGTGACTACCCCTGC
AGGTTACTCAGATCACCCAATAAAATGAGGACTGGTCTAGCAGGAGTGCATCACTGA
mRNA sequence
ATGGAGAAAGTCGATGATTCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTTCTAAAGGACTTAATTTTAAAGCTGGCTAAGGAAGG
CAAAATCGAGCTCGACCTTGATGAAATCTTCTCCAAAAATTTCCACAAGAAGGAAAAAGAGAACTTTGCAACTTCCTACTGGATCGACGTAGAAGAAGTTGACAATTCCA
AGAAGGGTGAACAAAGGACATCCGTCTTGATCGCATCAAGCCTCCAAATACTCGTCCTTCGTATTCCAAAGAATGAGTATGGCCGCGACAAAGAAGAAAATCAATGTTCG
ACGTCCACCTTCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGCCTCAAAGTGAAACCAAATCT
CATTATCTTGACCAATCCTGCAATGAAGGATATGATCAAGACCATGACAAAGATAAGAGCTTTTAAATGTAAAAGCTCCTTATCGCAAGAGCCTAAACTGCATGATGCTC
CTAGCCCACACGAGCTTAAAAGTTCCTTTTCTCCAAGTACGAAGGTGCTCTTCTCCAAGTTCGAAGGCTCTTACATTGCTGCGATGCAGTTCCTTCTCTCCAAGTTCAAA
GGTTCTCACGCGTTCCGCTACAATTCCTTCTCTCCAAGTACGAAGGTTCTCTCCTCCAAGTCGAAGGTTCTCACGTTGCTTCACTGCAGTTCCTTCCTCCAAGTTCGAAG
GTTCTCACGTTGCTTCGCCGTAGTTCCTTCTCTCACGGTACGAAGGTTCTCTCCTCCAAGTCGAAGGTTCTCTCACGTTGCTTCGCTGAGTTCCTTCCTCCAAGTTCGAA
GGTTTTCATGCGCTTTGCTGCAGTTCCTTCTCTCCAAGTTCAAAGGTTCTCACGCATTTCGCTACAGTTCATTTCCTCCAAGTTCGAAGGTTCTCACGTTGCTTCGCCGT
AGTTCCTTCTCTACAAGTACGAAGTTCTTCCTCCAAGTCGAAGGTTCTCTCACGTTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTTTCATGCGCTTTGTTGCA
GTTCCTTCTCTCCAAGTTCAAAGGTTCTCACGGATTTCGCTACAGTTCATTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGGTGAAGTTCTTCCTACAAGTCTGAAGG
TTCTCACGTGCTTCGGTGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTTACGTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCGCTGCAGTTT
CTTCTCCCTAAGTTCGAAGGTTCTCACGCGCTTCGATGCAGTTCCTTCCTCCCTAAGTTCGAAGGTTCTCACTCGATTCGCTGCAGTTCCTTCCTCCAAATTCGAAGGTT
TCGAAGGTTCTCACGGATGTGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCCTCACGCGCT
TTCTCTTCGCTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCATTCTCCAGTTCGAAGCGCTTCTCTCGCTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTC
TCCGTTGCTACTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTC
CACTACTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGCTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGCTGCTCCTACTCCAAGTTCGAAGGTGCTTCTCTCAC
GACTCCTTCTCCAAGTTCGAAGGCGCTTCTCTATACTACTCCTTCTCCAAGTTCGAAGGTGTTTCTCTCCACCCCTCCTTTTGAAGTTGACGGCGTCCGTTGCACTTCAT
CTTCAAATGTTGGCAGTTGACGGCGTCCGCTTCGCTTCATCTTCAAAAATTGACTGTGATGAAGTCACTGCAAGTGAATCTGATGACGACCGTTGTAGGCGAGTCGAGTA
TGGTGACCACCCTTGCAGGTTACTCAAATCACCCAATAAAATGGGGACTGGTCTAGCAGGAGTGCATCACTGTAGGCAAATCTGGAGTGCATCACTGAAGGCGAATCTGG
TGACTACCCCTGCAGGCGAATCTGGTGACTACCCTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGAGTGCATCACTGAAGGCGAATCTGGTGACTACCCCTGC
AGGTTACTCAGATCACCCAATAAAATGAGGACTGGTCTAGCAGGAGTGCATCACTGA
Protein sequence
MEKVDDSKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEIFSKNFHKKEKENFATSYWIDVEEVDNSKKGEQRTSVLIASSLQILVLRIPKNEYGRDKEENQCS
TSTFTRPSAFQRLSVSTSKKSRSSTSVFDRLKVKPNLIILTNPAMKDMIKTMTKIRAFKCKSSLSQEPKLHDAPSPHELKSSFSPSTKVLFSKFEGSYIAAMQFLLSKFK
GSHAFRYNSFSPSTKVLSSKSKVLTLLHCSSFLQVRRFSRCFAVVPSLTVRRFSPPSRRFSHVASLSSFLQVRRFSCALLQFLLSKFKGSHAFRYSSFPPSSKVLTLLRR
SSFSTSTKFFLQVEGSLTLLRCSSFLQVRRFSCALLQFLLSKFKGSHGFRYSSFPPSSKVLTRFGEVLPTSLKVLTCFGEVPSSKFEGSYVLRCSSFLQVQRFSRASLQF
LLPKFEGSHALRCSSFLPKFEGSHSIRCSSFLQIRRFRRFSRMCFVAVPSSKFEGSLTRCCSSFLQVRRFLTRFLFAAPSPSSKALLSTAHSPVRSASLAAPSPSSKALL
SVATSPSSKALLSVATSPSSKALLSTAPSPSSKALLSTTPSPSSKALLSAAPSPSSKALLSAAPTPSSKVLLSRLLLQVRRRFSILLLLQVRRCFSPPLLLKLTASVALH
LQMLAVDGVRFASSSKIDCDEVTASESDDDRCRRVEYGDHPCRLLKSPNKMGTGLAGVHHCRQIWSASLKANLVTTPAGESGDYPAGYSDHPIKWGLECITEGESGDYPC
RLLRSPNKMRTGLAGVHH