; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg026927 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg026927
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRetrotransposon gag protein
Genome locationscaffold13:24308306..24314092
RNA-Seq ExpressionSpg026927
SyntenySpg026927
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040811.1 retrotransposon gag protein [Cucumis melo var. makuwa]3.6e-3051.41Show/hide
Query:  RSKEFSQPRQPVTAKELFSKTF---HKKEKENFATSYCIDV----------DEVDNSKKDEQRTSVFDCIKPSTTRPSVFQRMSMATTEEENQCVVSTST
        + ++F QPRQ +T  E F ++F   H KE       +   +          +EVDNS + +QRTSVFD IKP TTR SVFQR+S+ T EEENQC  ST T
Subjt:  RSKEFSQPRQPVTAKELFSKTF---HKKEKENFATSYCIDV----------DEVDNSKKDEQRTSVFDCIKPSTTRPSVFQRMSMATTEEENQCVVSTST

Query:  QPSAFQRLSVSTLRKSQSSTSVFDHLKVADNQPQRKMDNLKVKLFDEVSCDKKLQSIVPSRMKRKFSVLINTEAGSL
        + SAF+ LS+ST +K + STS FD LK+ ++Q QR+M +LKVK F E + D K+ S VPSRMKRK SV INTE GSL
Subjt:  QPSAFQRLSVSTLRKSQSSTSVFDHLKVADNQPQRKMDNLKVKLFDEVSCDKKLQSIVPSRMKRKFSVLINTEAGSL

KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa]8.1e-3051.41Show/hide
Query:  RSKEFSQPRQPVTAKELFSKTF---HKKEKENFATSYCIDV----------DEVDNSKKDEQRTSVFDCIKPSTTRPSVFQRMSMATTEEENQCVVSTST
        + ++  QPRQ +T  E F ++F   H +E     T +   +          +EVDNS + +QRTSVFD IKP TTR SVFQR+SMAT EEENQC  ST  
Subjt:  RSKEFSQPRQPVTAKELFSKTF---HKKEKENFATSYCIDV----------DEVDNSKKDEQRTSVFDCIKPSTTRPSVFQRMSMATTEEENQCVVSTST

Query:  QPSAFQRLSVSTLRKSQSSTSVFDHLKVADNQPQRKMDNLKVKLFDEVSCDKKLQSIVPSRMKRKFSVLINTEAGSL
        + SAF+RLS+ST +K + STS FD LK+ ++Q QR+M +LK K F E + D K+ S VPSRMKRK SV INTE GSL
Subjt:  QPSAFQRLSVSTLRKSQSSTSVFDHLKVADNQPQRKMDNLKVKLFDEVSCDKKLQSIVPSRMKRKFSVLINTEAGSL

KAA0050736.1 retrotransposon gag protein [Cucumis melo var. makuwa]4.7e-3050.56Show/hide
Query:  QRSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDVD-------EVDNSKKDEQRTSVFDCIKPSTTRPSVFQRMSMATTEEENQCVVSTS
        ++ ++F QPR+ +T  E F ++F +   E         T+  ++VD       EVDNS + +QRTSVFD IKP TTR SVFQR+SMAT EEENQC +ST 
Subjt:  QRSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDVD-------EVDNSKKDEQRTSVFDCIKPSTTRPSVFQRMSMATTEEENQCVVSTS

Query:  TQPSAFQRLSVSTLRKSQSSTSVFDHLKVADNQPQRKMDNLKVKLFDEVSCDKKLQSIVPSRMKRKFSVLINTEAGSL
        T+ SAF+RLS+S  +K + STS FD LK+ ++Q QR+M +LK K F E + D K+ S VPSR+KRK S+ INTE GSL
Subjt:  TQPSAFQRLSVSTLRKSQSSTSVFDHLKVADNQPQRKMDNLKVKLFDEVSCDKKLQSIVPSRMKRKFSVLINTEAGSL

TYK08944.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.1e-2950.85Show/hide
Query:  RSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDVD-------EVDNSKKDEQRTSVFDCIKPSTTRPSVFQRMSMATTEEENQCVVSTST
        + ++F QPR+ +T  E  S++F +   E         T+  ++VD       EVDNS + +QRTSVFD IKP TTR SVFQR+SMAT EE+NQC  ST  
Subjt:  RSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDVD-------EVDNSKKDEQRTSVFDCIKPSTTRPSVFQRMSMATTEEENQCVVSTST

Query:  QPSAFQRLSVSTLRKSQSSTSVFDHLKVADNQPQRKMDNLKVKLFDEVSCDKKLQSIVPSRMKRKFSVLINTEAGSL
        + SAF+RLS+ST +K + STS FD LK+ ++Q QR+M +LK K F E + D K+ + VPSRMKRK SV INTE GSL
Subjt:  QPSAFQRLSVSTLRKSQSSTSVFDHLKVADNQPQRKMDNLKVKLFDEVSCDKKLQSIVPSRMKRKFSVLINTEAGSL

TYK30263.1 gag protease polyprotein [Cucumis melo var. makuwa]1.1e-2949.71Show/hide
Query:  RSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDVD------EVDNSKKDEQRTSVFDCIKPSTTRPSVFQRMSMATTEEENQCVVSTSTQ
        + ++F QPR  +T  E   + F +   E         T+  ++VD      E+DNS + +QRT VFDCIKP TTR   FQR+SMAT EEENQC  ST T+
Subjt:  RSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDVD------EVDNSKKDEQRTSVFDCIKPSTTRPSVFQRMSMATTEEENQCVVSTSTQ

Query:  PSAFQRLSVSTLRKSQSSTSVFDHLKVADNQPQRKMDNLKVKLFDEVSCDKKLQSIVPSRMKRKFSVLINTEA
         SAF+RLS+ST +K + STS FD  K+ +NQ QR++ +LK KLF E + D K+ S VPSRMKRK SV INTE+
Subjt:  PSAFQRLSVSTLRKSQSSTSVFDHLKVADNQPQRKMDNLKVKLFDEVSCDKKLQSIVPSRMKRKFSVLINTEA

TrEMBL top hitse value%identityAlignment
A0A5A7TGM1 Retrotransposon gag protein1.8e-3051.41Show/hide
Query:  RSKEFSQPRQPVTAKELFSKTF---HKKEKENFATSYCIDV----------DEVDNSKKDEQRTSVFDCIKPSTTRPSVFQRMSMATTEEENQCVVSTST
        + ++F QPRQ +T  E F ++F   H KE       +   +          +EVDNS + +QRTSVFD IKP TTR SVFQR+S+ T EEENQC  ST T
Subjt:  RSKEFSQPRQPVTAKELFSKTF---HKKEKENFATSYCIDV----------DEVDNSKKDEQRTSVFDCIKPSTTRPSVFQRMSMATTEEENQCVVSTST

Query:  QPSAFQRLSVSTLRKSQSSTSVFDHLKVADNQPQRKMDNLKVKLFDEVSCDKKLQSIVPSRMKRKFSVLINTEAGSL
        + SAF+ LS+ST +K + STS FD LK+ ++Q QR+M +LKVK F E + D K+ S VPSRMKRK SV INTE GSL
Subjt:  QPSAFQRLSVSTLRKSQSSTSVFDHLKVADNQPQRKMDNLKVKLFDEVSCDKKLQSIVPSRMKRKFSVLINTEAGSL

A0A5A7TQ06 Retrotransposon gag protein3.9e-3051.41Show/hide
Query:  RSKEFSQPRQPVTAKELFSKTF---HKKEKENFATSYCIDV----------DEVDNSKKDEQRTSVFDCIKPSTTRPSVFQRMSMATTEEENQCVVSTST
        + ++  QPRQ +T  E F ++F   H +E     T +   +          +EVDNS + +QRTSVFD IKP TTR SVFQR+SMAT EEENQC  ST  
Subjt:  RSKEFSQPRQPVTAKELFSKTF---HKKEKENFATSYCIDV----------DEVDNSKKDEQRTSVFDCIKPSTTRPSVFQRMSMATTEEENQCVVSTST

Query:  QPSAFQRLSVSTLRKSQSSTSVFDHLKVADNQPQRKMDNLKVKLFDEVSCDKKLQSIVPSRMKRKFSVLINTEAGSL
        + SAF+RLS+ST +K + STS FD LK+ ++Q QR+M +LK K F E + D K+ S VPSRMKRK SV INTE GSL
Subjt:  QPSAFQRLSVSTLRKSQSSTSVFDHLKVADNQPQRKMDNLKVKLFDEVSCDKKLQSIVPSRMKRKFSVLINTEAGSL

A0A5A7U974 Retrotransposon gag protein2.3e-3050.56Show/hide
Query:  QRSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDVD-------EVDNSKKDEQRTSVFDCIKPSTTRPSVFQRMSMATTEEENQCVVSTS
        ++ ++F QPR+ +T  E F ++F +   E         T+  ++VD       EVDNS + +QRTSVFD IKP TTR SVFQR+SMAT EEENQC +ST 
Subjt:  QRSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDVD-------EVDNSKKDEQRTSVFDCIKPSTTRPSVFQRMSMATTEEENQCVVSTS

Query:  TQPSAFQRLSVSTLRKSQSSTSVFDHLKVADNQPQRKMDNLKVKLFDEVSCDKKLQSIVPSRMKRKFSVLINTEAGSL
        T+ SAF+RLS+S  +K + STS FD LK+ ++Q QR+M +LK K F E + D K+ S VPSR+KRK S+ INTE GSL
Subjt:  TQPSAFQRLSVSTLRKSQSSTSVFDHLKVADNQPQRKMDNLKVKLFDEVSCDKKLQSIVPSRMKRKFSVLINTEAGSL

A0A5D3CCI8 Retrotransposon gag protein5.1e-3050.85Show/hide
Query:  RSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDVD-------EVDNSKKDEQRTSVFDCIKPSTTRPSVFQRMSMATTEEENQCVVSTST
        + ++F QPR+ +T  E  S++F +   E         T+  ++VD       EVDNS + +QRTSVFD IKP TTR SVFQR+SMAT EE+NQC  ST  
Subjt:  RSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDVD-------EVDNSKKDEQRTSVFDCIKPSTTRPSVFQRMSMATTEEENQCVVSTST

Query:  QPSAFQRLSVSTLRKSQSSTSVFDHLKVADNQPQRKMDNLKVKLFDEVSCDKKLQSIVPSRMKRKFSVLINTEAGSL
        + SAF+RLS+ST +K + STS FD LK+ ++Q QR+M +LK K F E + D K+ + VPSRMKRK SV INTE GSL
Subjt:  QPSAFQRLSVSTLRKSQSSTSVFDHLKVADNQPQRKMDNLKVKLFDEVSCDKKLQSIVPSRMKRKFSVLINTEAGSL

A0A5D3E2D7 Gag protease polyprotein5.1e-3049.71Show/hide
Query:  RSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDVD------EVDNSKKDEQRTSVFDCIKPSTTRPSVFQRMSMATTEEENQCVVSTSTQ
        + ++F QPR  +T  E   + F +   E         T+  ++VD      E+DNS + +QRT VFDCIKP TTR   FQR+SMAT EEENQC  ST T+
Subjt:  RSKEFSQPRQPVTAKELFSKTFHKKEKENFA------TSYCIDVD------EVDNSKKDEQRTSVFDCIKPSTTRPSVFQRMSMATTEEENQCVVSTSTQ

Query:  PSAFQRLSVSTLRKSQSSTSVFDHLKVADNQPQRKMDNLKVKLFDEVSCDKKLQSIVPSRMKRKFSVLINTEA
         SAF+RLS+ST +K + STS FD  K+ +NQ QR++ +LK KLF E + D K+ S VPSRMKRK SV INTE+
Subjt:  PSAFQRLSVSTLRKSQSSTSVFDHLKVADNQPQRKMDNLKVKLFDEVSCDKKLQSIVPSRMKRKFSVLINTEA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGCAAGAAGATACAGCTTCTATCAATGCAGGCCAAGAAACAACCTTGCAGGGGGCATATACTAATGACAAGTTTCTTGTTAAGTATAACCCTTTGTTTGAACATGA
TTCTGACGTAGTGACTGTCATGATGACTGAAACAAGAACTATGGAAGAAAGAATGGCTGAGATGCAGGAACACATCAACACCTTGATGAAGGCGATTGAAGAAAAAGATT
CTCAAATCGCGCAACTAAAGAGCCAAATTGAGAACCAATATATCGCTGAATCAAGTCAAACCCAAAGAAGTAAAGAGTTTTCTCAACCTCGACAACCGGTGACTGCGAAG
GAACTCTTCTCCAAAACTTTCCACAAAAAGGAAAAAGAAAACTTTGCAACTTCCTACTGCATCGACGTAGATGAAGTTGACAATTCCAAGAAGGATGAACAAAGGACTTC
CGTCTTCGATTGTATTAAGCCTTCAACTACTCGACCTTCAGTATTCCAAAGAATGAGTATGGCCACGACAGAAGAAGAAAATCAATGTGTGGTGTCCACCTCCACTCAAC
CTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATTGAGGAAAAGTCAGTCTTCAACATCTGTCTTTGATCACCTCAAAGTAGCAGACAATCAACCTCAAAGAAAGATGGAC
AACTTAAAGGTGAAATTGTTCGATGAAGTAAGCTGTGACAAGAAGCTTCAAAGTATTGTCCCGTCACGTATGAAAAGGAAGTTTTCTGTTCTCATAAATACAGAAGCTGG
TTCCTTGAAGTTCGGTGTTCCATTCACCTTAAGTTTGTTGCTTTCTCTTCTCTCCAAGTTCGAGGGTTCTTACGATGTACGCTACTCTGTGTTGTTCTCTCTTCCTCCAA
GTTCGAAGGTTCTCACGTTGCTTCACTGCAGTTTCCTTCCTCCAAGTTCGAAGGTTGTCACGTTGCTTCGCTGCAGTTCCTTCTCTCCAAATTTGAAGGTTCTCACGCGC
TTCGCTTTCATTCTCTCCAAATTTGAAGGTTCTCATGGAAGTTTGAAGGTTTTCACGCGCTTCGCTGCAGTTCCTTCTCTCCAAGTTCGAAGGTTCTCATGCGCTTCGCT
GCAGTTCCTTCTCTCCAAGTTCGAAGGTTCTCACCCGCTTCGCTGCAATTCCTTCTCTTCAAGTTCGAAGGTTCTCTCATTGCTCCCTGCAGTTTCCTTCCTCCAAGTTC
GAAGGCTCCTCCAAGTTCGAGGGTTCTTACATGGCACGTTACGGCGTTGTTCCTTCTCCAAGTTTGAGGGTTCTTACGTTGTACGCTACTATGTTGTTCCTTCTTCAAGT
TTGAAGGTTATTCACTTCAAGCTCCTGTGTTGTTCCTTCTCCAAGTTTGAAGGTTCTCATACTTCGCTGCTATGCTGCTTCCTTCTCCAAGTTCGAAGGTTCTTAGGCTA
TGCTTCTGCGCTACTTCCTTCTTTAAGGTCGAAGGTTCCCACATTGCGTTGTTGTGTTGTTTCCTTCTCCAAGTTTGAAGGTTCTGACGCTGCGCTGCTTCCTTCACCAA
GTTTGAAGGTTCTCACGCTGCGCTGTTTCTCTGTTCCTTCTCCAAGTTCGAAGGTCCTCATGCACTACACTACTGTTCCTTCTCCAAGTTCGAAGTTTTTCACGTGTTCG
AAGGTTGTTACGCTACAACTCCAGTTCCTTCCTCCAATTTCGAAGGTTCTGACGCTGCGCTGCTTCCTTCACCAAGTTCGAAGTTTGAAGGTTCCCACATTGCGCTGTTG
TGCTATTTCCTTCTCAGAGATTGAAGGTTCTGACGTTGGACTGCTTCCTTCACCAAGTTCGAAGGTTCTCACGTTGCACTGTTTCGCTGTTCCTTCTCCAACTCCAGTTC
CTTCCTCCAAGTTCGAAGGGGCTCTCACGCTGTTTCGCTGTAGTTCCTTCCTCCTGCAACAGTTCTTTCTTCTAAGTTCGAAGGTTATCACGCAGCTTCGATGCAGTTCC
TTCCTCCAAGTTCGAAGGGGTTCTCATGCAGCGCCTTCCTCCAAGTTCGAAGGTTCTCACGCTGCTTCTTCCTCACGTTTCTTCGCAACGGTTCCTTCCTCCGAAGGTTC
TAACGCTGCTTCTACTCCTACGAAGGTTCTAACGCTGCTTCTGCTTCCTCACGTTGCTTCACTGCAGTTCCTTCATCTTGCAGTTCCTTGTTCTCACGCTACGTTGTTCG
AAGGTCCTTCCTCTTACACTGCATTGCATCACTGTTCCTTCAACAAGTTCGAAGGTTCTTACGTTGAAGCAGTAAAGGAGCCCAACAGGAAAAGAGCCCAAGTGGGAATG
ACATCATGTGTCCTTGGGCCGAGGGACGTCGTAGCAACAAAAGTCCAAGGAACATGTCCCTGTACTCATGCTGAAAGACGTGACGGCGACAAAAGTCCAAGGAACATGTC
CTCAAAAGTCCAAGGAACATGTCCCCAAAAGTCCAAGGAACATGTCCTTGTACTGTTGGGTCGCTACCGCAAGCGCATGGTATCGCAAAAAGCCTTGAGAGGGGCTCCAA
AGGAGCCTAAGAAAAGGAGGAAAGGAATTGATTCCTTGATGGATTGTTGGATGATAAATGAGGCACCAAGAGGTGCCTTTTATAGGCCTGGAATGGGGGTAGCGTCGCGA
CGCTGCAACGCACGTGTCTTGTTGCCCAAGAAATGGGTAGCGTCGCGACGCTATCTACACAGCGTCTCGACACTGACCCAATTTCCAGATTTTTCCAGCTTTGATTTGGG
CTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGCAAGAAGATACAGCTTCTATCAATGCAGGCCAAGAAACAACCTTGCAGGGGGCATATACTAATGACAAGTTTCTTGTTAAGTATAACCCTTTGTTTGAACATGA
TTCTGACGTAGTGACTGTCATGATGACTGAAACAAGAACTATGGAAGAAAGAATGGCTGAGATGCAGGAACACATCAACACCTTGATGAAGGCGATTGAAGAAAAAGATT
CTCAAATCGCGCAACTAAAGAGCCAAATTGAGAACCAATATATCGCTGAATCAAGTCAAACCCAAAGAAGTAAAGAGTTTTCTCAACCTCGACAACCGGTGACTGCGAAG
GAACTCTTCTCCAAAACTTTCCACAAAAAGGAAAAAGAAAACTTTGCAACTTCCTACTGCATCGACGTAGATGAAGTTGACAATTCCAAGAAGGATGAACAAAGGACTTC
CGTCTTCGATTGTATTAAGCCTTCAACTACTCGACCTTCAGTATTCCAAAGAATGAGTATGGCCACGACAGAAGAAGAAAATCAATGTGTGGTGTCCACCTCCACTCAAC
CTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATTGAGGAAAAGTCAGTCTTCAACATCTGTCTTTGATCACCTCAAAGTAGCAGACAATCAACCTCAAAGAAAGATGGAC
AACTTAAAGGTGAAATTGTTCGATGAAGTAAGCTGTGACAAGAAGCTTCAAAGTATTGTCCCGTCACGTATGAAAAGGAAGTTTTCTGTTCTCATAAATACAGAAGCTGG
TTCCTTGAAGTTCGGTGTTCCATTCACCTTAAGTTTGTTGCTTTCTCTTCTCTCCAAGTTCGAGGGTTCTTACGATGTACGCTACTCTGTGTTGTTCTCTCTTCCTCCAA
GTTCGAAGGTTCTCACGTTGCTTCACTGCAGTTTCCTTCCTCCAAGTTCGAAGGTTGTCACGTTGCTTCGCTGCAGTTCCTTCTCTCCAAATTTGAAGGTTCTCACGCGC
TTCGCTTTCATTCTCTCCAAATTTGAAGGTTCTCATGGAAGTTTGAAGGTTTTCACGCGCTTCGCTGCAGTTCCTTCTCTCCAAGTTCGAAGGTTCTCATGCGCTTCGCT
GCAGTTCCTTCTCTCCAAGTTCGAAGGTTCTCACCCGCTTCGCTGCAATTCCTTCTCTTCAAGTTCGAAGGTTCTCTCATTGCTCCCTGCAGTTTCCTTCCTCCAAGTTC
GAAGGCTCCTCCAAGTTCGAGGGTTCTTACATGGCACGTTACGGCGTTGTTCCTTCTCCAAGTTTGAGGGTTCTTACGTTGTACGCTACTATGTTGTTCCTTCTTCAAGT
TTGAAGGTTATTCACTTCAAGCTCCTGTGTTGTTCCTTCTCCAAGTTTGAAGGTTCTCATACTTCGCTGCTATGCTGCTTCCTTCTCCAAGTTCGAAGGTTCTTAGGCTA
TGCTTCTGCGCTACTTCCTTCTTTAAGGTCGAAGGTTCCCACATTGCGTTGTTGTGTTGTTTCCTTCTCCAAGTTTGAAGGTTCTGACGCTGCGCTGCTTCCTTCACCAA
GTTTGAAGGTTCTCACGCTGCGCTGTTTCTCTGTTCCTTCTCCAAGTTCGAAGGTCCTCATGCACTACACTACTGTTCCTTCTCCAAGTTCGAAGTTTTTCACGTGTTCG
AAGGTTGTTACGCTACAACTCCAGTTCCTTCCTCCAATTTCGAAGGTTCTGACGCTGCGCTGCTTCCTTCACCAAGTTCGAAGTTTGAAGGTTCCCACATTGCGCTGTTG
TGCTATTTCCTTCTCAGAGATTGAAGGTTCTGACGTTGGACTGCTTCCTTCACCAAGTTCGAAGGTTCTCACGTTGCACTGTTTCGCTGTTCCTTCTCCAACTCCAGTTC
CTTCCTCCAAGTTCGAAGGGGCTCTCACGCTGTTTCGCTGTAGTTCCTTCCTCCTGCAACAGTTCTTTCTTCTAAGTTCGAAGGTTATCACGCAGCTTCGATGCAGTTCC
TTCCTCCAAGTTCGAAGGGGTTCTCATGCAGCGCCTTCCTCCAAGTTCGAAGGTTCTCACGCTGCTTCTTCCTCACGTTTCTTCGCAACGGTTCCTTCCTCCGAAGGTTC
TAACGCTGCTTCTACTCCTACGAAGGTTCTAACGCTGCTTCTGCTTCCTCACGTTGCTTCACTGCAGTTCCTTCATCTTGCAGTTCCTTGTTCTCACGCTACGTTGTTCG
AAGGTCCTTCCTCTTACACTGCATTGCATCACTGTTCCTTCAACAAGTTCGAAGGTTCTTACGTTGAAGCAGTAAAGGAGCCCAACAGGAAAAGAGCCCAAGTGGGAATG
ACATCATGTGTCCTTGGGCCGAGGGACGTCGTAGCAACAAAAGTCCAAGGAACATGTCCCTGTACTCATGCTGAAAGACGTGACGGCGACAAAAGTCCAAGGAACATGTC
CTCAAAAGTCCAAGGAACATGTCCCCAAAAGTCCAAGGAACATGTCCTTGTACTGTTGGGTCGCTACCGCAAGCGCATGGTATCGCAAAAAGCCTTGAGAGGGGCTCCAA
AGGAGCCTAAGAAAAGGAGGAAAGGAATTGATTCCTTGATGGATTGTTGGATGATAAATGAGGCACCAAGAGGTGCCTTTTATAGGCCTGGAATGGGGGTAGCGTCGCGA
CGCTGCAACGCACGTGTCTTGTTGCCCAAGAAATGGGTAGCGTCGCGACGCTATCTACACAGCGTCTCGACACTGACCCAATTTCCAGATTTTTCCAGCTTTGATTTGGG
CTGA
Protein sequenceShow/hide protein sequence
MLQEDTASINAGQETTLQGAYTNDKFLVKYNPLFEHDSDVVTVMMTETRTMEERMAEMQEHINTLMKAIEEKDSQIAQLKSQIENQYIAESSQTQRSKEFSQPRQPVTAK
ELFSKTFHKKEKENFATSYCIDVDEVDNSKKDEQRTSVFDCIKPSTTRPSVFQRMSMATTEEENQCVVSTSTQPSAFQRLSVSTLRKSQSSTSVFDHLKVADNQPQRKMD
NLKVKLFDEVSCDKKLQSIVPSRMKRKFSVLINTEAGSLKFGVPFTLSLLLSLLSKFEGSYDVRYSVLFSLPPSSKVLTLLHCSFLPPSSKVVTLLRCSSFSPNLKVLTR
FAFILSKFEGSHGSLKVFTRFAAVPSLQVRRFSCASLQFLLSKFEGSHPLRCNSFSSSSKVLSLLPAVSFLQVRRLLQVRGFLHGTLRRCSFSKFEGSYVVRYYVVPSSS
LKVIHFKLLCCSFSKFEGSHTSLLCCFLLQVRRFLGYASALLPSLRSKVPTLRCCVVSFSKFEGSDAALLPSPSLKVLTLRCFSVPSPSSKVLMHYTTVPSPSSKFFTCS
KVVTLQLQFLPPISKVLTLRCFLHQVRSLKVPTLRCCAISFSEIEGSDVGLLPSPSSKVLTLHCFAVPSPTPVPSSKFEGALTLFRCSSFLLQQFFLLSSKVITQLRCSS
FLQVRRGSHAAPSSKFEGSHAASSSRFFATVPSSEGSNAASTPTKVLTLLLLPHVASLQFLHLAVPCSHATLFEGPSSYTALHHCSFNKFEGSYVEAVKEPNRKRAQVGM
TSCVLGPRDVVATKVQGTCPCTHAERRDGDKSPRNMSSKVQGTCPQKSKEHVLVLLGRYRKRMVSQKALRGAPKEPKKRRKGIDSLMDCWMINEAPRGAFYRPGMGVASR
RCNARVLLPKKWVASRRYLHSVSTLTQFPDFSSFDLG