CuGenDBv2

Spg036965 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg036965
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Retrotransposon gag protein
Genome location: scaffold5:25949802..25965075
RNA-Seq Expression: Spg036965
Synteny: Spg036965
Gene Ontology terms: NA
InterPro domains: NA
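The genome location above follows a scaffold:start..end pattern. As a minimal sketch, the snippet below splits such a string into its parts and computes the span of the locus. The names `Locus` and `parse_location` are hypothetical helpers introduced here for illustration, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval parsed from a 'scaffold:start..end' string."""
    scaffold: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive,
        # so both end points count toward the length.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a location such as 'scaffold5:25949802..25965075'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    scaffold, start, end = match.groups()
    return Locus(scaffold, int(start), int(end))


locus = parse_location("scaffold5:25949802..25965075")
print(locus.scaffold, locus.start, locus.end, locus.length)
# → scaffold5 25949802 25965075 15274
```

Under the inclusive-coordinate assumption the locus spans 15,274 bp; if the coordinates were half-open instead, the span would be one base shorter.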


Homology
GenBank top hits (e value, %identity, alignment)
KAA0044978.1 retrotransposon gag protein [Cucumis melo var. makuwa] | e value: 4.5e-31 | %identity: 65.6
Query:  EEVDNFKNGEQRTSVFDRTKPSTTRPSVFQRMSMAATKEESQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNSD
        EEVDN    +QRTSVFDR KP TTR SVFQR+SMA  +EE+QC TST+ R SAF+RLS+STSKK +PSTS FD LK+T+DQ +R+M +L+ K F E N D
Subjt:  EEVDNFKNGEQRTSVFDRTKPSTTRPSVFQRMSMAATKEESQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNSD

Query:  KKLQSSIPSRMKRKFSVLINTEGSL
         K+ S +PSRMKRK SV INTEGSL
Subjt:  KKLQSSIPSRMKRKFSVLINTEGSL

KAA0050734.1 gag protease polyprotein [Cucumis melo var. makuwa] | e value: 2.9e-30 | %identity: 64
Query:  EEVDNFKNGEQRTSVFDRTKPSTTRPSVFQRMSMAATKEESQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNSD
        EEVDN    +QRTS+FDR KP TTR  VFQR+SMA  +EE+QC TST+ R SAF+RLS+STSKK +PSTS FD LK+T+DQ +R+M +L+ K F E N D
Subjt:  EEVDNFKNGEQRTSVFDRTKPSTTRPSVFQRMSMAATKEESQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNSD

Query:  KKLQSSIPSRMKRKFSVLINTEGSL
         K+ S +PSRMKRK SV INTEGSL
Subjt:  KKLQSSIPSRMKRKFSVLINTEGSL

KAA0052018.1 retrotransposon gag protein [Cucumis melo var. makuwa] | e value: 6.5e-30 | %identity: 60.87
Query:  EKENFATSYCIDVEEVDNFKNGEQRTSVFDRTKPSTTRPSVFQRMSMAATKEESQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDCLKVTSDQPKRKMD
        E +N   SY    EE+DN    +QRTSVFDR KP TTR SVFQR+SMA  KEE+QC TST+ R SAF+RLS+STSKK +PSTS FD LK+T+DQ +R+M 
Subjt:  EKENFATSYCIDVEEVDNFKNGEQRTSVFDRTKPSTTRPSVFQRMSMAATKEESQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDCLKVTSDQPKRKMD

Query:  NLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTEGSL
         L+ K F E N D K+ S +PS MKRK SV INT+GSL
Subjt:  NLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTEGSL

KAA0055462.1 retrotransposon gag protein [Cucumis melo var. makuwa] | e value: 3.8e-30 | %identity: 60.87
Query:  EKENFATSYCIDVEEVDNFKNGEQRTSVFDRTKPSTTRPSVFQRMSMAATKEESQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDCLKVTSDQPKRKMD
        E +N   SY    EEVDN    +QRTSVFDR KP TTR SVFQR+SMA  +E++QC TST+ R SAF+RLS+STSKK +PSTS FD LK+T+DQ +R+M 
Subjt:  EKENFATSYCIDVEEVDNFKNGEQRTSVFDRTKPSTTRPSVFQRMSMAATKEESQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDCLKVTSDQPKRKMD

Query:  NLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTEGSL
        +L+ K F E N D K+ + +PSRMKRK SV INTEGSL
Subjt:  NLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTEGSL

TYK08944.1 retrotransposon gag protein [Cucumis melo var. makuwa] | e value: 3.8e-30 | %identity: 60.87
Query:  EKENFATSYCIDVEEVDNFKNGEQRTSVFDRTKPSTTRPSVFQRMSMAATKEESQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDCLKVTSDQPKRKMD
        E +N   SY    EEVDN    +QRTSVFDR KP TTR SVFQR+SMA  +E++QC TST+ R SAF+RLS+STSKK +PSTS FD LK+T+DQ +R+M 
Subjt:  EKENFATSYCIDVEEVDNFKNGEQRTSVFDRTKPSTTRPSVFQRMSMAATKEESQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDCLKVTSDQPKRKMD

Query:  NLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTEGSL
        +L+ K F E N D K+ + +PSRMKRK SV INTEGSL
Subjt:  NLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTEGSL

TrEMBL top hits (e value, %identity, alignment)
A0A5A7TQ06 Retrotransposon gag protein | e value: 2.2e-31 | %identity: 65.6
Query:  EEVDNFKNGEQRTSVFDRTKPSTTRPSVFQRMSMAATKEESQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNSD
        EEVDN    +QRTSVFDR KP TTR SVFQR+SMA  +EE+QC TST+ R SAF+RLS+STSKK +PSTS FD LK+T+DQ +R+M +L+ K F E N D
Subjt:  EEVDNFKNGEQRTSVFDRTKPSTTRPSVFQRMSMAATKEESQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNSD

Query:  KKLQSSIPSRMKRKFSVLINTEGSL
         K+ S +PSRMKRK SV INTEGSL
Subjt:  KKLQSSIPSRMKRKFSVLINTEGSL

A0A5A7U9V3 Retrotransposon gag protein | e value: 3.1e-30 | %identity: 60.87
Query:  EKENFATSYCIDVEEVDNFKNGEQRTSVFDRTKPSTTRPSVFQRMSMAATKEESQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDCLKVTSDQPKRKMD
        E +N   SY    EE+DN    +QRTSVFDR KP TTR SVFQR+SMA  KEE+QC TST+ R SAF+RLS+STSKK +PSTS FD LK+T+DQ +R+M 
Subjt:  EKENFATSYCIDVEEVDNFKNGEQRTSVFDRTKPSTTRPSVFQRMSMAATKEESQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDCLKVTSDQPKRKMD

Query:  NLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTEGSL
         L+ K F E N D K+ S +PS MKRK SV INT+GSL
Subjt:  NLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTEGSL

A0A5A7UI09 Retrotransposon gag protein | e value: 1.8e-30 | %identity: 60.87
Query:  EKENFATSYCIDVEEVDNFKNGEQRTSVFDRTKPSTTRPSVFQRMSMAATKEESQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDCLKVTSDQPKRKMD
        E +N   SY    EEVDN    +QRTSVFDR KP TTR SVFQR+SMA  +E++QC TST+ R SAF+RLS+STSKK +PSTS FD LK+T+DQ +R+M 
Subjt:  EKENFATSYCIDVEEVDNFKNGEQRTSVFDRTKPSTTRPSVFQRMSMAATKEESQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDCLKVTSDQPKRKMD

Query:  NLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTEGSL
        +L+ K F E N D K+ + +PSRMKRK SV INTEGSL
Subjt:  NLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTEGSL

A0A5D3BBF9 Gag protease polyprotein | e value: 1.4e-30 | %identity: 64
Query:  EEVDNFKNGEQRTSVFDRTKPSTTRPSVFQRMSMAATKEESQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNSD
        EEVDN    +QRTS+FDR KP TTR  VFQR+SMA  +EE+QC TST+ R SAF+RLS+STSKK +PSTS FD LK+T+DQ +R+M +L+ K F E N D
Subjt:  EEVDNFKNGEQRTSVFDRTKPSTTRPSVFQRMSMAATKEESQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNSD

Query:  KKLQSSIPSRMKRKFSVLINTEGSL
         K+ S +PSRMKRK SV INTEGSL
Subjt:  KKLQSSIPSRMKRKFSVLINTEGSL

A0A5D3CCI8 Retrotransposon gag protein | e value: 1.8e-30 | %identity: 60.87
Query:  EKENFATSYCIDVEEVDNFKNGEQRTSVFDRTKPSTTRPSVFQRMSMAATKEESQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDCLKVTSDQPKRKMD
        E +N   SY    EEVDN    +QRTSVFDR KP TTR SVFQR+SMA  +E++QC TST+ R SAF+RLS+STSKK +PSTS FD LK+T+DQ +R+M 
Subjt:  EKENFATSYCIDVEEVDNFKNGEQRTSVFDRTKPSTTRPSVFQRMSMAATKEESQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDCLKVTSDQPKRKMD

Query:  NLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTEGSL
        +L+ K F E N D K+ + +PSRMKRK SV INTEGSL
Subjt:  NLEVKLFDEVNSDKKLQSSIPSRMKRKFSVLINTEGSL

SwissProt top hits (e value, %identity, alignment)
No hits found

Arabidopsis top hits (e value, %identity, alignment)
No hits found

Sequences
CDS sequence
ATGAAAATAGGTGAAATCGCCGCTGAAATTCGAGGCCAAAACCGAGAACGAGCTGCTGTTTCTGCCCAGAATACGCTTCCAGACGCAACAGCGTCGGGACGCTACTCCAA
CAGCATCTCGACACTGTCCCGATTCCGCTGGCGCTTAGGTATCGCGAGGATAAGATTTGGCCGGTTCGACGATGTCAGCGGTCCGGTTCGGCTGGGCTGGGACCAATTCG
GTCCGGTTCAGCTGAATTTCAGCTTGGTTCGTGATTTTGAAGACCGGTTCGAGGACGCCACCATTCATTTTGGAGGGGAGATTGAGCCAGAGGCAGAGAGGGTAGAGTCC
AGAGCATTCTCCCAAGATCCAGAGTCGTCAGAATCAAAAGACTCCAGAGAATTCAGAGATCCGAGATTTAGAATTCAAAGGATTCAAGACTCAGGAGATTTGGAGACAGA
GTCAGAGAACTCAGAGTCTAGAGCATTCTGCCAGATTCCAGAGTCGGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAAC
AGGCCGATCCAAGAGATCAACAAGCCAATCGACCGATCAAGAAGATCAACAAGTCAGCAGGTCGATCATCCAAGAAGATCAACAAGCCAACCGATCGAACAGATCATCAA
GCCAACAGGCCGATCCAAGAGATCATCAAGTCAGCAGGCCAAGGAACAACCTTGCAGGGGGCATATACTAATGACAAATTTCTTGTTAAGTATAACCCTTTGCTTGAACC
TGATTCTGACGTAGTGACTGTCATGATGACTGAGACAAGAACTATGGAAGAAAGAATGACTGAGATGCAGGAACACATCAACAACTTGATGAAGGCGATTGAAGAAAAAG
ATTCTCAAATCGCGCAACTAAAGAGCCAAATTGAGAATCAACATATCGCCGAATCAAGTCAAACCCAAGAAAAGGAAAACTTTGCAACTTCCTACTGCATCGACGTAGAA
GAAGTTGACAATTTCAAGAATGGTGAACAAAGGACATCCGTCTTTGATCGCACCAAGCCTTCAACTACTCGTCCTTCGGTATTCCAAAGAATGAGTATGGCCGCGACAAA
AGAAGAAAGTCAATGTTCGACGTCCACCTTCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTCGACATCTGTTTTTGATTGCC
TCAAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAACTTGGAGGTAAAACTTTTCGATGAAGTAAACAGCGACAAGAAACTTCAAAGTAGTATCCCGTCACGTATG
AAGAGGAAGTTCTCTGTTCTCATAAATACAGAAGGTTCCTTGAAGTTCGTTGTTCCCTCTTCTTCAAGTCAAAGGTTCTCACGTGCTTTGCGAGTTCGAAGGTTCTCACG
CACTTCGTTGGAGTTCCTTCTCTTCAAGTTCGAAGGTTCTCACGTTTTTCGCTGCAGTTCCTTCTCTCCAAGTTCGAAGGTTCTTACGTTGTACGCTACTGCGTTGTTCA
TTCTCCAAGTTCGAAGGTTCTCGGATTTCGCTCTTGCGTTGCTCCTTCTCCAAGTCCGAAGGTTTATGTTGTTATGCTGCTTCGATGTTGTTCCTTCTCCAAAGTTCGAA
GGTTCCCACTCTGCGTTGTTTCGCAGTTCCTTCTCCAAGTTCAAAGGTTCACGCGGTGATGCTTTGTTGTTCCTCCCCAAGTTCAAAGGTTCACGCGGTGATGCTTTGTT
GTTCCTCCCGAAGTTCGAAGGTTCTGACACTGCACTGCTCCCTTCTCCAAGTTCGAAGGTTCTGACATTGCATCGCTGCGCTGCTTCCTTCACCAAGTTTGAAGGTTCCC
ACTCTGCGCTGTTTCGTTGTTCCTTCTCCAAGTTTGAAGGTTCGAAGGTTCTCATGCGCTTCGCTGCAGTCCCTTCCTCCAAGTCTGAAGGTTCTCTCACGCGCTTCGCT
GTAGTTCCTTCCTCCAAGTTCGGAGGTTATCACGCGCTTTGCTGCAGTTCCTTCCTCCAAGTCAGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTCCGAA
GGTTCTCACGCGCTTTGCTGCCGTCCCTTCCTCCAAGTCCGAAGGTTCTCACGCGCTTTGTTGCAGTTCCTTCCTCCAAGTCCGAAGGTTCTCACGCGCTTTGCTGTCGT
TCCTTCCTCCAAGTCCGAAGGTTCTCAAGCGCTTCGCTGCAAAGCCTTCCTCCAAGTCCGAAGGTTCTCACGCGCTTTGCTGCAGTTCCTTCCTCCAAGTCCGAAGGTTC
TCACGCGCTTTGCTGCCGTTCCTTCCTCCAAGTCCGAAGGTTCTCACGCGCTTTGCTGCCGTTCGTTCCTCCAAGTCCGAAGGGGTTCTCACGCGCTTACGTTGCAATTC
CTTCCTCCAAGTCCGAAGCGATAATGCAACAGTGGAGTTAATCGGGTGCTCGGGACGCGAAAAGATGCAAAGGAAGGAAAAGAATCAAAAGGAAAAAAAGTCAAAATTCG
GTCAAAAGGTGACTAGCGTCGAGACGCTAGCATTCCTTATTCGGATAGGCGCGAATTCATCGCAGCGTCTCGACGCTGCGACCTTAGCGTCTCGACGCTACGGCCAACCA
GAGAAAACTCAGGAAATGGTTGATTGGAGCCATAATGCAATAGTGGAGTTAATCGGGTGCTCGGGACGCGAAAAGATGCAAAGGAAGGAAAAGAATCAAAAGGAAAAAAA
GTCAAAATTCGGTCAAAAGGTGACTAGCGTCGAGACGCTAGCCCTTAGGCGTCTCGACGCTAGCATTCCTTATTCGGATAGGCGCGAATTCATCGCAGCGTCGACACGCT
GCGACCTTAGCGTCTCGACGCTACCGGATATCAAGTTGGTTAATTTGCATGAGAATCAATTATACTCGGGAGAGGAAAAATCCACACCAGTTCTCACGCGCTTCGCTGCA
GTTTTCGTCCTCCAAGTCTTAAGGTTCTCACGCGCTTACGATGCAGTTCCTTCTCCAAGTCCGAAGGTTGTCACGCGCTTACGCTGCTGCAGTTCCTTCCCCCACAAGTC
CAAAGGTTCTCACGCGCTCCGCTCCTTCCTTCCTCCAAGTTTGAAGGTTCTCACACACTTCGCTGCTGCAGTTTCTCCCCAAGCGAGTCTGGTGATCACCTCTGCAGGAA
ACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAGTCACTGCAATTGAATCTGATGACGACCGTCGTAGGCGAGTCTGGTGACCACCCCTGCAGGCTACTCAGA
TCACCCAATAAAATGGGGACTGGTCTAGCAGGAGTGCATCACTGTTACTCAGATCACCCAATAAAATGGGGACTGGAGTGCATCACTGTAGGCAAATCTGGTGACTACCC
CTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGTCTAGCAGGAGTGCATCACTGTAGGAGTGCATCACTGTCGCCAAATCTGGTGACTACTCCTGCAGGTCGAT
CATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAACAGGCCGATCCAAGAGATCAACAAGCCAATCGACCGATCAAGAAGATCAACAAGTCA
GCAGGTCGATCATCCAAGAAGATCAACAAGCCAACCGATCGAACAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAAGTCAGCAGGCCGATCATCCAAGAGGAT
CAACAAGCTAACAAGCCGATCCAAGAGATCATCAACCTAG
mRNA sequence
ATGAAAATAGGTGAAATCGCCGCTGAAATTCGAGGCCAAAACCGAGAACGAGCTGCTGTTTCTGCCCAGAATACGCTTCCAGACGCAACAGCGTCGGGACGCTACTCCAA
CAGCATCTCGACACTGTCCCGATTCCGCTGGCGCTTAGGTATCGCGAGGATAAGATTTGGCCGGTTCGACGATGTCAGCGGTCCGGTTCGGCTGGGCTGGGACCAATTCG
GTCCGGTTCAGCTGAATTTCAGCTTGGTTCGTGATTTTGAAGACCGGTTCGAGGACGCCACCATTCATTTTGGAGGGGAGATTGAGCCAGAGGCAGAGAGGGTAGAGTCC
AGAGCATTCTCCCAAGATCCAGAGTCGTCAGAATCAAAAGACTCCAGAGAATTCAGAGATCCGAGATTTAGAATTCAAAGGATTCAAGACTCAGGAGATTTGGAGACAGA
GTCAGAGAACTCAGAGTCTAGAGCATTCTGCCAGATTCCAGAGTCGGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAAC
AGGCCGATCCAAGAGATCAACAAGCCAATCGACCGATCAAGAAGATCAACAAGTCAGCAGGTCGATCATCCAAGAAGATCAACAAGCCAACCGATCGAACAGATCATCAA
GCCAACAGGCCGATCCAAGAGATCATCAAGTCAGCAGGCCAAGGAACAACCTTGCAGGGGGCATATACTAATGACAAATTTCTTGTTAAGTATAACCCTTTGCTTGAACC
TGATTCTGACGTAGTGACTGTCATGATGACTGAGACAAGAACTATGGAAGAAAGAATGACTGAGATGCAGGAACACATCAACAACTTGATGAAGGCGATTGAAGAAAAAG
ATTCTCAAATCGCGCAACTAAAGAGCCAAATTGAGAATCAACATATCGCCGAATCAAGTCAAACCCAAGAAAAGGAAAACTTTGCAACTTCCTACTGCATCGACGTAGAA
GAAGTTGACAATTTCAAGAATGGTGAACAAAGGACATCCGTCTTTGATCGCACCAAGCCTTCAACTACTCGTCCTTCGGTATTCCAAAGAATGAGTATGGCCGCGACAAA
AGAAGAAAGTCAATGTTCGACGTCCACCTTCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCAACCTTCGACATCTGTTTTTGATTGCC
TCAAAGTAACAAGCGATCAACCTAAAAGAAAGATGGATAACTTGGAGGTAAAACTTTTCGATGAAGTAAACAGCGACAAGAAACTTCAAAGTAGTATCCCGTCACGTATG
AAGAGGAAGTTCTCTGTTCTCATAAATACAGAAGGTTCCTTGAAGTTCGTTGTTCCCTCTTCTTCAAGTCAAAGGTTCTCACGTGCTTTGCGAGTTCGAAGGTTCTCACG
CACTTCGTTGGAGTTCCTTCTCTTCAAGTTCGAAGGTTCTCACGTTTTTCGCTGCAGTTCCTTCTCTCCAAGTTCGAAGGTTCTTACGTTGTACGCTACTGCGTTGTTCA
TTCTCCAAGTTCGAAGGTTCTCGGATTTCGCTCTTGCGTTGCTCCTTCTCCAAGTCCGAAGGTTTATGTTGTTATGCTGCTTCGATGTTGTTCCTTCTCCAAAGTTCGAA
GGTTCCCACTCTGCGTTGTTTCGCAGTTCCTTCTCCAAGTTCAAAGGTTCACGCGGTGATGCTTTGTTGTTCCTCCCCAAGTTCAAAGGTTCACGCGGTGATGCTTTGTT
GTTCCTCCCGAAGTTCGAAGGTTCTGACACTGCACTGCTCCCTTCTCCAAGTTCGAAGGTTCTGACATTGCATCGCTGCGCTGCTTCCTTCACCAAGTTTGAAGGTTCCC
ACTCTGCGCTGTTTCGTTGTTCCTTCTCCAAGTTTGAAGGTTCGAAGGTTCTCATGCGCTTCGCTGCAGTCCCTTCCTCCAAGTCTGAAGGTTCTCTCACGCGCTTCGCT
GTAGTTCCTTCCTCCAAGTTCGGAGGTTATCACGCGCTTTGCTGCAGTTCCTTCCTCCAAGTCAGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTCCGAA
GGTTCTCACGCGCTTTGCTGCCGTCCCTTCCTCCAAGTCCGAAGGTTCTCACGCGCTTTGTTGCAGTTCCTTCCTCCAAGTCCGAAGGTTCTCACGCGCTTTGCTGTCGT
TCCTTCCTCCAAGTCCGAAGGTTCTCAAGCGCTTCGCTGCAAAGCCTTCCTCCAAGTCCGAAGGTTCTCACGCGCTTTGCTGCAGTTCCTTCCTCCAAGTCCGAAGGTTC
TCACGCGCTTTGCTGCCGTTCCTTCCTCCAAGTCCGAAGGTTCTCACGCGCTTTGCTGCCGTTCGTTCCTCCAAGTCCGAAGGGGTTCTCACGCGCTTACGTTGCAATTC
CTTCCTCCAAGTCCGAAGCGATAATGCAACAGTGGAGTTAATCGGGTGCTCGGGACGCGAAAAGATGCAAAGGAAGGAAAAGAATCAAAAGGAAAAAAAGTCAAAATTCG
GTCAAAAGGTGACTAGCGTCGAGACGCTAGCATTCCTTATTCGGATAGGCGCGAATTCATCGCAGCGTCTCGACGCTGCGACCTTAGCGTCTCGACGCTACGGCCAACCA
GAGAAAACTCAGGAAATGGTTGATTGGAGCCATAATGCAATAGTGGAGTTAATCGGGTGCTCGGGACGCGAAAAGATGCAAAGGAAGGAAAAGAATCAAAAGGAAAAAAA
GTCAAAATTCGGTCAAAAGGTGACTAGCGTCGAGACGCTAGCCCTTAGGCGTCTCGACGCTAGCATTCCTTATTCGGATAGGCGCGAATTCATCGCAGCGTCGACACGCT
GCGACCTTAGCGTCTCGACGCTACCGGATATCAAGTTGGTTAATTTGCATGAGAATCAATTATACTCGGGAGAGGAAAAATCCACACCAGTTCTCACGCGCTTCGCTGCA
GTTTTCGTCCTCCAAGTCTTAAGGTTCTCACGCGCTTACGATGCAGTTCCTTCTCCAAGTCCGAAGGTTGTCACGCGCTTACGCTGCTGCAGTTCCTTCCCCCACAAGTC
CAAAGGTTCTCACGCGCTCCGCTCCTTCCTTCCTCCAAGTTTGAAGGTTCTCACACACTTCGCTGCTGCAGTTTCTCCCCAAGCGAGTCTGGTGATCACCTCTGCAGGAA
ACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAGTCACTGCAATTGAATCTGATGACGACCGTCGTAGGCGAGTCTGGTGACCACCCCTGCAGGCTACTCAGA
TCACCCAATAAAATGGGGACTGGTCTAGCAGGAGTGCATCACTGTTACTCAGATCACCCAATAAAATGGGGACTGGAGTGCATCACTGTAGGCAAATCTGGTGACTACCC
CTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGTCTAGCAGGAGTGCATCACTGTAGGAGTGCATCACTGTCGCCAAATCTGGTGACTACTCCTGCAGGTCGAT
CATCCAAGAGGATCAACAAGCTAACAAGCCGATCCAACAGATCATCAAGCCAACAGGCCGATCCAAGAGATCAACAAGCCAATCGACCGATCAAGAAGATCAACAAGTCA
GCAGGTCGATCATCCAAGAAGATCAACAAGCCAACCGATCGAACAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAAGTCAGCAGGCCGATCATCCAAGAGGAT
CAACAAGCTAACAAGCCGATCCAAGAGATCATCAACCTAG
Protein sequence
MKIGEIAAEIRGQNRERAAVSAQNTLPDATASGRYSNSISTLSRFRWRLGIARIRFGRFDDVSGPVRLGWDQFGPVQLNFSLVRDFEDRFEDATIHFGGEIEPEAERVES
RAFSQDPESSESKDSREFRDPRFRIQRIQDSGDLETESENSESRAFCQIPESAGRSSKRINKLTSRSNRSSSQQADPRDQQANRPIKKINKSAGRSSKKINKPTDRTDHQ
ANRPIQEIIKSAGQGTTLQGAYTNDKFLVKYNPLLEPDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIAQLKSQIENQHIAESSQTQEKENFATSYCIDVE
EVDNFKNGEQRTSVFDRTKPSTTRPSVFQRMSMAATKEESQCSTSTFTRPSAFQRLSVSTSKKSQPSTSVFDCLKVTSDQPKRKMDNLEVKLFDEVNSDKKLQSSIPSRM
KRKFSVLINTEGSLKFVVPSSSSQRFSRALRVRRFSRTSLEFLLFKFEGSHVFRCSSFSPSSKVLTLYATALFILQVRRFSDFALALLLLQVRRFMLLCCFDVVPSPKFE
GSHSALFRSSFSKFKGSRGDALLFLPKFKGSRGDALLFLPKFEGSDTALLPSPSSKVLTLHRCAASFTKFEGSHSALFRCSFSKFEGSKVLMRFAAVPSSKSEGSLTRFA
VVPSSKFGGYHALCCSSFLQVRRFSRASLQFLPPSPKVLTRFAAVPSSKSEGSHALCCSSFLQVRRFSRALLSFLPPSPKVLKRFAAKPSSKSEGSHALCCSSFLQVRRF
SRALLPFLPPSPKVLTRFAAVRSSKSEGVLTRLRCNSFLQVRSDNATVELIGCSGREKMQRKEKNQKEKKSKFGQKVTSVETLAFLIRIGANSSQRLDAATLASRRYGQP
EKTQEMVDWSHNAIVELIGCSGREKMQRKEKNQKEKKSKFGQKVTSVETLALRRLDASIPYSDRREFIAASTRCDLSVSTLPDIKLVNLHENQLYSGEEKSTPVLTRFAA
VFVLQVLRFSRAYDAVPSPSPKVVTRLRCCSSFPHKSKGSHALRSFLPPSLKVLTHFAAAVSPQASLVITSAGNYSHQSDWSRQVVKSLQLNLMTTVVGESGDHPCRLLR
SPNKMGTGLAGVHHCYSDHPIKWGLECITVGKSGDYPCRLLRSPNKMGTGLAGVHHCRSASLSPNLVTTPAGRSSKRINKLTSRSNRSSSQQADPRDQQANRPIKKINKS
AGRSSKKINKPTDRTDHQANRPIQEIIKSAGRSSKRINKLTSRSKRSST