; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg018252 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg018252
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRetrotransposon gag protein
Genome locationscaffold3:11810191..11824258
RNA-Seq ExpressionSpg018252
SyntenySpg018252
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0041727.1 retrotransposon gag protein [Cucumis melo var. makuwa]2.7e-3047.37Show/hide
Query:  QRSKKFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDV-------EEVDNSKKGEQRTSVFDRIKPSTTRPSVFQRMSMAATKEENQCSVSTS
        ++ + F QPR+ +T  E  S+ F +   E         T+  ++V       EEVDNS + +QRTSVFDRIK  TTR SVFQR+SMA  +EENQC   T 
Subjt:  QRSKKFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDV-------EEVDNSKKGEQRTSVFDRIKPSTTRPSVFQRMSMAATKEENQCSVSTS

Query:  TRPSTFQRLSVSTLRKSQSSTSVFDRLKVADDQPKRKMNNLKVKLFNEVSSDEKLQSIVASRMKRKFSVLINTEGSLKFLLSKLEGSYTTLLHCSFFNFE
        TR S F+RLS+STL+K + STS FDRLK+ +DQ +RKM +LK K F+  + D+K+ S V SRMKRK SV IN E S  FL          L HC FF F+
Subjt:  TRPSTFQRLSVSTLRKSQSSTSVFDRLKVADDQPKRKMNNLKVKLFNEVSSDEKLQSIVASRMKRKFSVLINTEGSLKFLLSKLEGSYTTLLHCSFFNFE

Query:  GSHVALLRC
             L RC
Subjt:  GSHVALLRC

KAA0041771.1 retrotransposon gag protein [Cucumis melo var. makuwa]6.1e-3050.57Show/hide
Query:  RSKKFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDV-------EEVDNSKKGEQRTSVFDRIKPSTTRPSVFQRMSMAATKEENQCSVSTST
        + K F QPR+ +T  E   ++F +   E         T+  ++V       EE+DNS + +QRT VFDRIKP TTR SVFQR+SMA  KEENQC  ST  
Subjt:  RSKKFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDV-------EEVDNSKKGEQRTSVFDRIKPSTTRPSVFQRMSMAATKEENQCSVSTST

Query:  RPSTFQRLSVSTLRKSQSSTSVFDRLKVADDQPKRKMNNLKVKLFNEVSSDEKLQSIVASRMKRKFSVLINTEGSL
        R S F+RLS+ST +K +  TS FDRLK+ +DQ +R+M  LK K F+E + D+K+ S V+SRMKRKFSV INTE SL
Subjt:  RPSTFQRLSVSTLRKSQSSTSVFDRLKVADDQPKRKMNNLKVKLFNEVSSDEKLQSIVASRMKRKFSVLINTEGSL

KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa]5.5e-3145.66Show/hide
Query:  EEKDSQIAQLKSQIENQH----IAESSQTQRSKKF-------------SQPRQPVTAKELFSKTF---HKKEKENFATSYCIDV----------EEVDNS
        +E+ +   Q KS    +H    I+   + +R+KK               QPRQ +T  E F ++F   H +E     T +   +          EEVDNS
Subjt:  EEKDSQIAQLKSQIENQH----IAESSQTQRSKKF-------------SQPRQPVTAKELFSKTF---HKKEKENFATSYCIDV----------EEVDNS

Query:  KKGEQRTSVFDRIKPSTTRPSVFQRMSMAATKEENQCSVSTSTRPSTFQRLSVSTLRKSQSSTSVFDRLKVADDQPKRKMNNLKVKLFNEVSSDEKLQSI
         + +QRTSVFDRIKP TTR SVFQR+SMA  +EENQC  ST  R S F+RLS+ST +K + STS FDRLK+ +DQ +R+M +LK K F+E + D+K+ S 
Subjt:  KKGEQRTSVFDRIKPSTTRPSVFQRMSMAATKEENQCSVSTSTRPSTFQRLSVSTLRKSQSSTSVFDRLKVADDQPKRKMNNLKVKLFNEVSSDEKLQSI

Query:  VASRMKRKFSVLINTEGSL
        V SRMKRK SV INTEGSL
Subjt:  VASRMKRKFSVLINTEGSL

KAA0063719.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]4.7e-3043.29Show/hide
Query:  VTVMMTETRTMEERMTEMQEHINTLMKAIEEKDSQIAQLKSQIENQHIAESSQTQRSKKFSQPRQPVTAKELFSKTF---HKKEKENFATSYCIDV----
        V  MM  T T + +   + ++IN       +   ++ +L + I+           + + F QPRQ +T  E   ++F   H KE       +   +    
Subjt:  VTVMMTETRTMEERMTEMQEHINTLMKAIEEKDSQIAQLKSQIENQHIAESSQTQRSKKFSQPRQPVTAKELFSKTF---HKKEKENFATSYCIDV----

Query:  ------EEVDNSKKGEQRTSVFDRIKPSTTRPSVFQRMSMAATKEENQCSVSTSTRPSTFQRLSVSTLRKSQSSTSVFDRLKVADDQPKRKMNNLKVKLF
              +EVDNS + +QRTSVFDRIKP TTR SVFQR+SMA  +EENQC  ST  R STF+RLS+ST +K + STS FDRLK+ +DQ +R+M  LK K F
Subjt:  ------EEVDNSKKGEQRTSVFDRIKPSTTRPSVFQRMSMAATKEENQCSVSTSTRPSTFQRLSVSTLRKSQSSTSVFDRLKVADDQPKRKMNNLKVKLF

Query:  NEVSSDEKLQSIVASRMKRKFSVLINTEGSL
        +E + D+K+ S V SRMKRK SV INTEGSL
Subjt:  NEVSSDEKLQSIVASRMKRKFSVLINTEGSL

TYK00108.1 retrotransposon gag protein [Cucumis melo var. makuwa]3.6e-3045.21Show/hide
Query:  EEKDSQIAQLKSQIENQH----IAESSQTQRSKK-------------FSQPRQPVTAKELFSKTFHKKEKENF-------ATSYC------IDVEEVDNS
        +E+ +   Q KS    +H    I+   + +R+KK             F QPRQ +T  E F ++F +   E          TS+          EEVDNS
Subjt:  EEKDSQIAQLKSQIENQH----IAESSQTQRSKK-------------FSQPRQPVTAKELFSKTFHKKEKENF-------ATSYC------IDVEEVDNS

Query:  KKGEQRTSVFDRIKPSTTRPSVFQRMSMAATKEENQCSVSTSTRPSTFQRLSVSTLRKSQSSTSVFDRLKVADDQPKRKMNNLKVKLFNEVSSDEKLQSI
         + +QRTSVFDRIKP TTR SVFQR+SMA  +E+NQC  ST TR S F+RLS+ST +K + STS FDR+K+ +DQ +R+M +LK K F+E + D+K+ S 
Subjt:  KKGEQRTSVFDRIKPSTTRPSVFQRMSMAATKEENQCSVSTSTRPSTFQRLSVSTLRKSQSSTSVFDRLKVADDQPKRKMNNLKVKLFNEVSSDEKLQSI

Query:  VASRMKRKFSVLINTEGSL
        V SRMKRK  V INTEGSL
Subjt:  VASRMKRKFSVLINTEGSL

TrEMBL top hitse value%identityAlignment
A0A5A7TJ64 Retrotransposon gag protein1.3e-3047.37Show/hide
Query:  QRSKKFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDV-------EEVDNSKKGEQRTSVFDRIKPSTTRPSVFQRMSMAATKEENQCSVSTS
        ++ + F QPR+ +T  E  S+ F +   E         T+  ++V       EEVDNS + +QRTSVFDRIK  TTR SVFQR+SMA  +EENQC   T 
Subjt:  QRSKKFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDV-------EEVDNSKKGEQRTSVFDRIKPSTTRPSVFQRMSMAATKEENQCSVSTS

Query:  TRPSTFQRLSVSTLRKSQSSTSVFDRLKVADDQPKRKMNNLKVKLFNEVSSDEKLQSIVASRMKRKFSVLINTEGSLKFLLSKLEGSYTTLLHCSFFNFE
        TR S F+RLS+STL+K + STS FDRLK+ +DQ +RKM +LK K F+  + D+K+ S V SRMKRK SV IN E S  FL          L HC FF F+
Subjt:  TRPSTFQRLSVSTLRKSQSSTSVFDRLKVADDQPKRKMNNLKVKLFNEVSSDEKLQSIVASRMKRKFSVLINTEGSLKFLLSKLEGSYTTLLHCSFFNFE

Query:  GSHVALLRC
             L RC
Subjt:  GSHVALLRC

A0A5A7TQ06 Retrotransposon gag protein2.7e-3145.66Show/hide
Query:  EEKDSQIAQLKSQIENQH----IAESSQTQRSKKF-------------SQPRQPVTAKELFSKTF---HKKEKENFATSYCIDV----------EEVDNS
        +E+ +   Q KS    +H    I+   + +R+KK               QPRQ +T  E F ++F   H +E     T +   +          EEVDNS
Subjt:  EEKDSQIAQLKSQIENQH----IAESSQTQRSKKF-------------SQPRQPVTAKELFSKTF---HKKEKENFATSYCIDV----------EEVDNS

Query:  KKGEQRTSVFDRIKPSTTRPSVFQRMSMAATKEENQCSVSTSTRPSTFQRLSVSTLRKSQSSTSVFDRLKVADDQPKRKMNNLKVKLFNEVSSDEKLQSI
         + +QRTSVFDRIKP TTR SVFQR+SMA  +EENQC  ST  R S F+RLS+ST +K + STS FDRLK+ +DQ +R+M +LK K F+E + D+K+ S 
Subjt:  KKGEQRTSVFDRIKPSTTRPSVFQRMSMAATKEENQCSVSTSTRPSTFQRLSVSTLRKSQSSTSVFDRLKVADDQPKRKMNNLKVKLFNEVSSDEKLQSI

Query:  VASRMKRKFSVLINTEGSL
        V SRMKRK SV INTEGSL
Subjt:  VASRMKRKFSVLINTEGSL

A0A5A7VDY3 Ty3-gypsy retrotransposon protein2.3e-3043.29Show/hide
Query:  VTVMMTETRTMEERMTEMQEHINTLMKAIEEKDSQIAQLKSQIENQHIAESSQTQRSKKFSQPRQPVTAKELFSKTF---HKKEKENFATSYCIDV----
        V  MM  T T + +   + ++IN       +   ++ +L + I+           + + F QPRQ +T  E   ++F   H KE       +   +    
Subjt:  VTVMMTETRTMEERMTEMQEHINTLMKAIEEKDSQIAQLKSQIENQHIAESSQTQRSKKFSQPRQPVTAKELFSKTF---HKKEKENFATSYCIDV----

Query:  ------EEVDNSKKGEQRTSVFDRIKPSTTRPSVFQRMSMAATKEENQCSVSTSTRPSTFQRLSVSTLRKSQSSTSVFDRLKVADDQPKRKMNNLKVKLF
              +EVDNS + +QRTSVFDRIKP TTR SVFQR+SMA  +EENQC  ST  R STF+RLS+ST +K + STS FDRLK+ +DQ +R+M  LK K F
Subjt:  ------EEVDNSKKGEQRTSVFDRIKPSTTRPSVFQRMSMAATKEENQCSVSTSTRPSTFQRLSVSTLRKSQSSTSVFDRLKVADDQPKRKMNNLKVKLF

Query:  NEVSSDEKLQSIVASRMKRKFSVLINTEGSL
        +E + D+K+ S V SRMKRK SV INTEGSL
Subjt:  NEVSSDEKLQSIVASRMKRKFSVLINTEGSL

A0A5D3BBF9 Gag protease polyprotein3.0e-3050Show/hide
Query:  RSKKFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDV-------EEVDNSKKGEQRTSVFDRIKPSTTRPSVFQRMSMAATKEENQCSVSTST
        + + F QPR+ +T  E   ++F +   E         T+  ++V       EEVDNS + +QRTS+FDRIKP TTR  VFQR+SMA  +EENQC  ST  
Subjt:  RSKKFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDV-------EEVDNSKKGEQRTSVFDRIKPSTTRPSVFQRMSMAATKEENQCSVSTST

Query:  RPSTFQRLSVSTLRKSQSSTSVFDRLKVADDQPKRKMNNLKVKLFNEVSSDEKLQSIVASRMKRKFSVLINTEGSL
        R S F+RLS+ST +K + STS FDRLK+ +DQ +R+M +LK K F+E + D+K+ S V SRMKRK SV INTEGSL
Subjt:  RPSTFQRLSVSTLRKSQSSTSVFDRLKVADDQPKRKMNNLKVKLFNEVSSDEKLQSIVASRMKRKFSVLINTEGSL

A0A5D3BLW3 Retrotransposon gag protein1.7e-3045.21Show/hide
Query:  EEKDSQIAQLKSQIENQH----IAESSQTQRSKK-------------FSQPRQPVTAKELFSKTFHKKEKENF-------ATSYC------IDVEEVDNS
        +E+ +   Q KS    +H    I+   + +R+KK             F QPRQ +T  E F ++F +   E          TS+          EEVDNS
Subjt:  EEKDSQIAQLKSQIENQH----IAESSQTQRSKK-------------FSQPRQPVTAKELFSKTFHKKEKENF-------ATSYC------IDVEEVDNS

Query:  KKGEQRTSVFDRIKPSTTRPSVFQRMSMAATKEENQCSVSTSTRPSTFQRLSVSTLRKSQSSTSVFDRLKVADDQPKRKMNNLKVKLFNEVSSDEKLQSI
         + +QRTSVFDRIKP TTR SVFQR+SMA  +E+NQC  ST TR S F+RLS+ST +K + STS FDR+K+ +DQ +R+M +LK K F+E + D+K+ S 
Subjt:  KKGEQRTSVFDRIKPSTTRPSVFQRMSMAATKEENQCSVSTSTRPSTFQRLSVSTLRKSQSSTSVFDRLKVADDQPKRKMNNLKVKLFNEVSSDEKLQSI

Query:  VASRMKRKFSVLINTEGSL
        V SRMKRK  V INTEGSL
Subjt:  VASRMKRKFSVLINTEGSL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGCCAAATCTACTTGTGTAATGTGACTAACGCATTCTTAGAAACTCTTCCCTTATCGCATAGTGCAAAACAACAATGGAAAGCTTGTTGCTGTAGCAATGGAAAGCT
TGTTGCTCTAGCAAGGGATGTTTGTTGCTGTAGCAATGGAAAGCTTGTTGCTCTAGCAAGGGATGCTTGTTGCTGTAGCAATGGAAGCTTGATGGGATGCTTGTTGATGC
TACAACAAGTTTCTATCCAAGCAATACATGCTTTGGCTCAAAGCAACACATGTTTTGGCTCCAAGCAACTCATGTCGCTCCTTCCCTGTGGATTCGACCCTGGAATACTC
CAGTTGCAAGAAGATACAGCTTCTATCGTTGCAGGCCAAGAAACAACCTTGCAGGGGGCATATACTAATGACAAGTTTCTTGTTAAGTATAACCCTTTGTTTGAACCTGA
TTCTGACGTAGTGACTGTCATGATGACTGAGACAAGAACTATGGAAGAAAGAATGACTGAGATGCAGGAACACATCAACACCTTGATGAAGGCGATTGAAGAAAAAGATT
CTCAAATCGCGCAACTAAAGAGCCAAATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAAGAAGTAAAAAGTTTTCTCAACCTCGACAACCGGTGACTGCGAAG
GAACTCTTCTCCAAAACTTTCCACAAAAAGGAAAAAGAAAACTTTGCAACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCCAAGAAGGGTGAACAAAGGACTTC
CGTCTTCGATCGCATCAAGCCTTCAACTACTCGTCCTTCAGTATTCCAAAGAATGAGTATGGCTGCGACAAAAGAAGAAAATCAATGTTCGGTGTCCACCTCCACTCGAC
CTTCAACTTTCCAAAGGCTAAGTGTCTCCACATTGAGGAAAAGTCAGTCTTCAACATCTGTCTTTGATCGCCTCAAAGTAGCAGACGATCAACCTAAAAGAAAGATGAAC
AACTTGAAGGTGAAACTTTTCAATGAAGTAAGCAGTGACGAGAAACTTCAAAGTATTGTCGCGTCACGTATGAAAAGGAAGTTCTCTGTTCTCATAAATACAGAAGGTTC
CTTGAAGTTCCTTCTCTCCAAGTTAGAGGGTTCTTACACGACGCTGCTTCATTGTTCCTTCTTCAACTTCGAAGGTTCTCACGTTGCGCTGTTGCGCTGCTTCCTTCTCC
AAGTTCAAAAGTTCGCACGCTGTGCTGCTGCGCGCTACTCCATTCTCTCTCCAAGTTCGAAGGTTCTCATACTGCGCTGCTACGCTGCTTCCTTCTCCAAATTCGAAGGT
TCTCAGGCTACGCTGCTGCGCTACCTCCATGTTTTAAGTTCGAAGGTTCCCACGTTGCACTGTTGTGCTGCTTCCTTCTCCAAGTTCGAAGGTTTTGACGTTGCGCTGCT
GCGCTATTTCATTCACCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCTCTTCAAGTTCGAAGGTTCTCCCACGCTTCGCTGTAATTCCTTCTCTCCAAGTT
CGAAGGTGCTCACACGTTTCGATACAGTTCCTTCTCTCCAAGTTAGAGGGTTCTTACGCGGCGTTGCTTCGTTGTTCCTTCTTCAATCCTTCCTCCAAGTTCGAAGGTTC
TCACGCTGCTTCGTTGCAGTTCCTTCCTCCAAGATCGATGGTTCTCATGCCACGCTGCTTCGTTGTTCCTTCTCCAAGTTCAAAGGTTTTCACGTTATACATTGCGTAGT
TCCATTTCCAAGTTCGAAAGTTATCTGTTGCTTCGTTGTTGTTTCTTCTCTTCAACTTCGAAGGTTCTCATGCGCTTCGCTGAGGTTTCTTCTCTCCAAATTCGAAGGTT
CTCACGTGCTTCGCGGAAGTTCCTTCTCTCCAAGTTTGAAGGTTCTCATGCGCTTCGCTAAGGTTTCTTCTCTCCAAGTTCGAAGGTTCTCACGCTGTTTCGCTGATGTT
CCTTCTTCAAGTTTGAAACTTTTCTCCAAGTTTGAAGATTCTCATGCACTTCGCTGCAGTTCCTTCTCTCCAAGTTTGAAGGTTCTCACACGCTTTGCTACAGTTCCTTA
TCCAAGTTCAAAGGTTCTGATGCTACGCTGCTTCTTTCTCCAAGTTCGAAGTTTCTCCACTGAACACTCCTGCGTTGCTCCTTCTCCAAGTCCGAAGGTTCTGACGTTGC
GCTGTTGTGCTGCTTCCTTCTGTTCTCACTCTACGTTGTTCGAAGGTTCTTCCTCTTACACTGCATTGCATCGTTGTTCCTTCAATAAGTTCGAAGGTTCTTATGTTGAA
GCAGTAAAGGAGCCCAACAGTAAAGGAGCCCAAGTGGAAGTGACATCATGTGTCCTTGGGCCGGGGAACGTCATAGCAACAAAAGTCCAAAGACGTCAATTGTCCTTCAG
TAAAGGAGCCCAAGTGGGAGTGACATCATGTGTCCTTGGACCGGGGAACGTCATAGCAACAAAAGTTCAAAGACACGAAGAGCACGGGCGTGAGGAGGAAATGTGGAGGC
GATTTTGGTGGAGGGTTTGTGTTGATTTGAACTTAAATGCTGAACCCGCCAGAGCTGTATGTACTGAGTCTGATCTGAGGGCTGGCGAGCTTGATCTCAGATGGAGGAAG
GTTGAATCTTCAACTTGTATCTTGATGAAGAAACGGTTGATTCTTTGGAGTCTTCCAGTCGTTGGATCTTCTGAGTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGCCAAATCTACTTGTGTAATGTGACTAACGCATTCTTAGAAACTCTTCCCTTATCGCATAGTGCAAAACAACAATGGAAAGCTTGTTGCTGTAGCAATGGAAAGCT
TGTTGCTCTAGCAAGGGATGTTTGTTGCTGTAGCAATGGAAAGCTTGTTGCTCTAGCAAGGGATGCTTGTTGCTGTAGCAATGGAAGCTTGATGGGATGCTTGTTGATGC
TACAACAAGTTTCTATCCAAGCAATACATGCTTTGGCTCAAAGCAACACATGTTTTGGCTCCAAGCAACTCATGTCGCTCCTTCCCTGTGGATTCGACCCTGGAATACTC
CAGTTGCAAGAAGATACAGCTTCTATCGTTGCAGGCCAAGAAACAACCTTGCAGGGGGCATATACTAATGACAAGTTTCTTGTTAAGTATAACCCTTTGTTTGAACCTGA
TTCTGACGTAGTGACTGTCATGATGACTGAGACAAGAACTATGGAAGAAAGAATGACTGAGATGCAGGAACACATCAACACCTTGATGAAGGCGATTGAAGAAAAAGATT
CTCAAATCGCGCAACTAAAGAGCCAAATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAAGAAGTAAAAAGTTTTCTCAACCTCGACAACCGGTGACTGCGAAG
GAACTCTTCTCCAAAACTTTCCACAAAAAGGAAAAAGAAAACTTTGCAACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCCAAGAAGGGTGAACAAAGGACTTC
CGTCTTCGATCGCATCAAGCCTTCAACTACTCGTCCTTCAGTATTCCAAAGAATGAGTATGGCTGCGACAAAAGAAGAAAATCAATGTTCGGTGTCCACCTCCACTCGAC
CTTCAACTTTCCAAAGGCTAAGTGTCTCCACATTGAGGAAAAGTCAGTCTTCAACATCTGTCTTTGATCGCCTCAAAGTAGCAGACGATCAACCTAAAAGAAAGATGAAC
AACTTGAAGGTGAAACTTTTCAATGAAGTAAGCAGTGACGAGAAACTTCAAAGTATTGTCGCGTCACGTATGAAAAGGAAGTTCTCTGTTCTCATAAATACAGAAGGTTC
CTTGAAGTTCCTTCTCTCCAAGTTAGAGGGTTCTTACACGACGCTGCTTCATTGTTCCTTCTTCAACTTCGAAGGTTCTCACGTTGCGCTGTTGCGCTGCTTCCTTCTCC
AAGTTCAAAAGTTCGCACGCTGTGCTGCTGCGCGCTACTCCATTCTCTCTCCAAGTTCGAAGGTTCTCATACTGCGCTGCTACGCTGCTTCCTTCTCCAAATTCGAAGGT
TCTCAGGCTACGCTGCTGCGCTACCTCCATGTTTTAAGTTCGAAGGTTCCCACGTTGCACTGTTGTGCTGCTTCCTTCTCCAAGTTCGAAGGTTTTGACGTTGCGCTGCT
GCGCTATTTCATTCACCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCTCTTCAAGTTCGAAGGTTCTCCCACGCTTCGCTGTAATTCCTTCTCTCCAAGTT
CGAAGGTGCTCACACGTTTCGATACAGTTCCTTCTCTCCAAGTTAGAGGGTTCTTACGCGGCGTTGCTTCGTTGTTCCTTCTTCAATCCTTCCTCCAAGTTCGAAGGTTC
TCACGCTGCTTCGTTGCAGTTCCTTCCTCCAAGATCGATGGTTCTCATGCCACGCTGCTTCGTTGTTCCTTCTCCAAGTTCAAAGGTTTTCACGTTATACATTGCGTAGT
TCCATTTCCAAGTTCGAAAGTTATCTGTTGCTTCGTTGTTGTTTCTTCTCTTCAACTTCGAAGGTTCTCATGCGCTTCGCTGAGGTTTCTTCTCTCCAAATTCGAAGGTT
CTCACGTGCTTCGCGGAAGTTCCTTCTCTCCAAGTTTGAAGGTTCTCATGCGCTTCGCTAAGGTTTCTTCTCTCCAAGTTCGAAGGTTCTCACGCTGTTTCGCTGATGTT
CCTTCTTCAAGTTTGAAACTTTTCTCCAAGTTTGAAGATTCTCATGCACTTCGCTGCAGTTCCTTCTCTCCAAGTTTGAAGGTTCTCACACGCTTTGCTACAGTTCCTTA
TCCAAGTTCAAAGGTTCTGATGCTACGCTGCTTCTTTCTCCAAGTTCGAAGTTTCTCCACTGAACACTCCTGCGTTGCTCCTTCTCCAAGTCCGAAGGTTCTGACGTTGC
GCTGTTGTGCTGCTTCCTTCTGTTCTCACTCTACGTTGTTCGAAGGTTCTTCCTCTTACACTGCATTGCATCGTTGTTCCTTCAATAAGTTCGAAGGTTCTTATGTTGAA
GCAGTAAAGGAGCCCAACAGTAAAGGAGCCCAAGTGGAAGTGACATCATGTGTCCTTGGGCCGGGGAACGTCATAGCAACAAAAGTCCAAAGACGTCAATTGTCCTTCAG
TAAAGGAGCCCAAGTGGGAGTGACATCATGTGTCCTTGGACCGGGGAACGTCATAGCAACAAAAGTTCAAAGACACGAAGAGCACGGGCGTGAGGAGGAAATGTGGAGGC
GATTTTGGTGGAGGGTTTGTGTTGATTTGAACTTAAATGCTGAACCCGCCAGAGCTGTATGTACTGAGTCTGATCTGAGGGCTGGCGAGCTTGATCTCAGATGGAGGAAG
GTTGAATCTTCAACTTGTATCTTGATGAAGAAACGGTTGATTCTTTGGAGTCTTCCAGTCGTTGGATCTTCTGAGTCTTGA
Protein sequenceShow/hide protein sequence
MRQIYLCNVTNAFLETLPLSHSAKQQWKACCCSNGKLVALARDVCCCSNGKLVALARDACCCSNGSLMGCLLMLQQVSIQAIHALAQSNTCFGSKQLMSLLPCGFDPGIL
QLQEDTASIVAGQETTLQGAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTMEERMTEMQEHINTLMKAIEEKDSQIAQLKSQIENQHIAESSQTQRSKKFSQPRQPVTAK
ELFSKTFHKKEKENFATSYCIDVEEVDNSKKGEQRTSVFDRIKPSTTRPSVFQRMSMAATKEENQCSVSTSTRPSTFQRLSVSTLRKSQSSTSVFDRLKVADDQPKRKMN
NLKVKLFNEVSSDEKLQSIVASRMKRKFSVLINTEGSLKFLLSKLEGSYTTLLHCSFFNFEGSHVALLRCFLLQVQKFARCAAARYSILSPSSKVLILRCYAASFSKFEG
SQATLLRYLHVLSSKVPTLHCCAASFSKFEGFDVALLRYFIHQVRRFSRASLQFLLFKFEGSPTLRCNSFSPSSKVLTRFDTVPSLQVRGFLRGVASLFLLQSFLQVRRF
SRCFVAVPSSKIDGSHATLLRCSFSKFKGFHVIHCVVPFPSSKVICCFVVVSSLQLRRFSCASLRFLLSKFEGSHVLRGSSFSPSLKVLMRFAKVSSLQVRRFSRCFADV
PSSSLKLFSKFEDSHALRCSSFSPSLKVLTRFATVPYPSSKVLMLRCFFLQVRSFSTEHSCVAPSPSPKVLTLRCCAASFCSHSTLFEGSSSYTALHRCSFNKFEGSYVE
AVKEPNSKGAQVEVTSCVLGPGNVIATKVQRRQLSFSKGAQVGVTSCVLGPGNVIATKVQRHEEHGREEEMWRRFWWRVCVDLNLNAEPARAVCTESDLRAGELDLRWRK
VESSTCILMKKRLILWSLPVVGSSES