CuGenDBv2

Spg004004 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg004004
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: scaffold4:1940976..1945931
RNA-Seq Expression: Spg004004
Synteny: Spg004004
Gene Ontology terms:
GO:0006281 - DNA repair (biological process)
GO:0004518 - nuclease activity (molecular function)
InterPro domains:
IPR004808 - AP endonuclease 1
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
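
The genome location of this record follows a `scaffold:start..end` pattern. As a minimal sketch, the string can be split into its parts and the genomic span computed. The function names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tooling, and the coordinates are assumed to be 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


scaffold, start, end = parse_location("scaffold4:1940976..1945931")
print(scaffold, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 4956 bp on scaffold4.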


Homology
GenBank top hits (e value, % identity, alignment)
KAA0038339.1 uncharacterized protein E6C27_scaffold270G002130 [Cucumis melo var. makuwa]; e value: 3.6e-30; % identity: 36.9
Query:  PISTPCM-----DESDVASDVSLSSEEFESLTMCQTNDVVIEDSYAET-LGLLFQEDEQKIESPISSLKV---FNPSSIAIPDNFSSLIISWNTRGLGDR
        P  TP +      +S V S +S+SSEE +       N+ +++    ET L  LFQ DE   E  +    +       S  IP++  S+++     G+   
Subjt:  PISTPCM-----DESDVASDVSLSSEEFESLTMCQTNDVVIEDSYAET-LGLLFQEDEQKIESPISSLKV---FNPSSIAIPDNFSSLIISWNTRGLGDR

Query:  SKRVALKKFIHHHIPDLVLIQETKTVSFDINLIKSLWSSNDIYWINVESFGRSGGLLIMWDDSKLKALQFIKGGYGPNDYRERKYLWNELRSLFSYVDEP
        +KR+A ++F+  H P++VLIQE+K   FDI+ IKSLWS  DI W  VES G SGG+L +WD S L  ++ +K             +   L+         
Subjt:  SKRVALKKFIHHHIPDLVLIQETKTVSFDINLIKSLWSSNDIYWINVESFGRSGGLLIMWDDSKLKALQFIKGGYGPNDYRERKYLWNELRSLFSYVDEP

Query:  WCIEGDFNITRWVHERIPLRRQTKGMRIFNKVIDELELRELPLANGKFTWSK
          + GDFNITRW HER P  R TK +R FN VI   +L E+ L+NG+FTWS+
Subjt:  WCIEGDFNITRWVHERIPLRRQTKGMRIFNKVIDELELRELPLANGKFTWSK

KAA0063088.1 uncharacterized protein E6C27_scaffold623G00050 [Cucumis melo var. makuwa]; e value: 2.8e-30; % identity: 40
Query:  DVASDVSLSSEEFESLTMC-------QTNDVVIE----DSYAETLGLLFQEDEQKIESPISSLKVFNPSSIAIPDNFSSLIISWNTRGLGDRSKRVALKK
        D  S  S SSE+ E L +          N + +E    DS A+  G+ F  D      P  S    +    +  D  S L      + L D SKR+ALK+
Subjt:  DVASDVSLSSEEFESLTMC-------QTNDVVIE----DSYAETLGLLFQEDEQKIESPISSLKVFNPSSIAIPDNFSSLIISWNTRGLGDRSKRVALKK

Query:  FIHHHIPDLVLIQETKTVSFDINLIKSLWSSNDIYWINVESFGRSGGLLIMWDDSKLKALQFIKGG--------------------YGPNDYRERKYLWN
        F+     D+VLIQE+K   FDI  IKSLWSS D  W   E FG SGG+L +WD SKLK ++ +KGG                    YGPND++ER+ +W 
Subjt:  FIHHHIPDLVLIQETKTVSFDINLIKSLWSSNDIYWINVESFGRSGGLLIMWDDSKLKALQFIKGG--------------------YGPNDYRERKYLWN

Query:  ELRSLFSYVDEPWCIEGDFNITRWVHERIPLRRQT
        EL SL +Y  + WCI GD NI RW HER P RR T
Subjt:  ELRSLFSYVDEPWCIEGDFNITRWVHERIPLRRQT

TYJ98683.1 hypothetical protein E5676_scaffold429G00120 [Cucumis melo var. makuwa]; e value: 1.6e-33; % identity: 48.8
Query:  PD-LVLIQETKTVSFDINLIKSLWSSNDIYWINVESFGRSGGLLIMWDDSKLKALQFIKGG--------------------YGPNDYRERKYLWNELRSL
        PD LV+    +    DI LIKSLWSS DI W  VESFGR GG+L MWD SK+K ++ +KGG                    YGP DY ER+++W  L SL
Subjt:  PD-LVLIQETKTVSFDINLIKSLWSSNDIYWINVESFGRSGGLLIMWDDSKLKALQFIKGG--------------------YGPNDYRERKYLWNELRSL

Query:  FSYVDEPWCIEGDFNITRWVHERIPLRRQTKGMRIFNKVIDELELRELPLANGKFTWSKPVDGDSL
          Y    WCI G  NITRW HE  PL +QT+GMR FN  ID L + ELPL NG+ TWS+  +G S+
Subjt:  FSYVDEPWCIEGDFNITRWVHERIPLRRQTKGMRIFNKVIDELELRELPLANGKFTWSKPVDGDSL

XP_038876676.1 uncharacterized protein LOC120069076 [Benincasa hispida]; e value: 8.6e-40; % identity: 48.13
Query:  IISWNTRGLGDRSKRVALKKFIHHHIPDLVLIQETKTVSFDINLIKSLWSSNDIYWINVESFGRSGGLLIMWDDSKL--------------------KAL
        I++W+TRGLGD SKR+ LK+F+    PD+VLIQETK    + + IKSLWSS ++    VE+ G+SGGLL +WDDSK+                    K +
Subjt:  IISWNTRGLGDRSKRVALKKFIHHHIPDLVLIQETKTVSFDINLIKSLWSSNDIYWINVESFGRSGGLLIMWDDSKL--------------------KAL

Query:  QFIKGGYGPNDYRERKYLWNELRSLFSYVDEPWCIEGDFNITRWVHERIPLRRQTKGMRIFNKVIDELELRELPLANGKFTWSKPVD
         +I   YGP DY+ER+ LW EL SL   +D+PWCI GDFN  R  HER P+ + T+ M  FNK I    L E+PL+NG+FTWSK  D
Subjt:  QFIKGGYGPNDYRERKYLWNELRSLFSYVDEPWCIEGDFNITRWVHERIPLRRQTKGMRIFNKVIDELELRELPLANGKFTWSKPVD

XP_038904899.1 uncharacterized protein LOC120091119 isoform X2 [Benincasa hispida]; e value: 2.1e-30; % identity: 47.18
Query:  MKFDGKWKFIGNLHLKIENWSCQNHFHLEVIEGYGGWISVKNLPLPLGNRSTFEVICQHFGGLVSISSQTLNLLECSKARIEVRKNLCGFIPAEIAVTDK
        +   GKW+  G+ HLK E W+   H     + GYGGWIS+KNLPL    + TFE I ++FGGL SI+ + LNL++   A I+V++NLCGF+PA I V+++
Subjt:  MKFDGKWKFIGNLHLKIENWSCQNHFHLEVIEGYGGWISVKNLPLPLGNRSTFEVICQHFGGLVSISSQTLNLLECSKARIEVRKNLCGFIPAEIAVTDK

Query:  KHGNFSLRFGDISSLDAPISIPGNLSLSDFVNEIDLKRVHHV
        K G+  L FGDIS+ + P  + G+L  SDF N IDL R++ V
Subjt:  KHGNFSLRFGDISSLDAPISIPGNLSLSDFVNEIDLKRVHHV

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TAF5 Uncharacterized protein; e value: 1.8e-30; % identity: 36.9
Query:  PISTPCM-----DESDVASDVSLSSEEFESLTMCQTNDVVIEDSYAET-LGLLFQEDEQKIESPISSLKV---FNPSSIAIPDNFSSLIISWNTRGLGDR
        P  TP +      +S V S +S+SSEE +       N+ +++    ET L  LFQ DE   E  +    +       S  IP++  S+++     G+   
Subjt:  PISTPCM-----DESDVASDVSLSSEEFESLTMCQTNDVVIEDSYAET-LGLLFQEDEQKIESPISSLKV---FNPSSIAIPDNFSSLIISWNTRGLGDR

Query:  SKRVALKKFIHHHIPDLVLIQETKTVSFDINLIKSLWSSNDIYWINVESFGRSGGLLIMWDDSKLKALQFIKGGYGPNDYRERKYLWNELRSLFSYVDEP
        +KR+A ++F+  H P++VLIQE+K   FDI+ IKSLWS  DI W  VES G SGG+L +WD S L  ++ +K             +   L+         
Subjt:  SKRVALKKFIHHHIPDLVLIQETKTVSFDINLIKSLWSSNDIYWINVESFGRSGGLLIMWDDSKLKALQFIKGGYGPNDYRERKYLWNELRSLFSYVDEP

Query:  WCIEGDFNITRWVHERIPLRRQTKGMRIFNKVIDELELRELPLANGKFTWSK
          + GDFNITRW HER P  R TK +R FN VI   +L E+ L+NG+FTWS+
Subjt:  WCIEGDFNITRWVHERIPLRRQTKGMRIFNKVIDELELRELPLANGKFTWSK

A0A5A7U128 Uncharacterized protein; e value: 1.1e-29; % identity: 32.7
Query:  WVRQEKEVVDLKLEEFCVVSKMFAHNSWREVKQDLEDYFQSKVLLNPFMADKALVKLDDSFSE--MKFDGKWKFIGNLHLKIENWSCQNHFHLEVIEGYG
        WV +  EV+    E   +++K+FA +  R++++ LE+YFQ+K+++NP   + AL+ LD+   +  +  +GKW+ +G+ +LK E W    +     ++GYG
Subjt:  WVRQEKEVVDLKLEEFCVVSKMFAHNSWREVKQDLEDYFQSKVLLNPFMADKALVKLDDSFSE--MKFDGKWKFIGNLHLKIENWSCQNHFHLEVIEGYG

Query:  GWISVKNLPLPLGNRSTFEVICQHFGGLVSISSQTLNLLECSKARIEVRKNLCGFIPAEIAVTDKKHGNFSLRFGDISSLDAPISIPGNLSLSDFVNEID
        GW+ +KNL    G                            S+ARI+V+ NLCGF+P+ I + D K GN  L FGD   L+ P      + +SDF   I 
Subjt:  GWISVKNLPLPLGNRSTFEVICQHFGGLVSISSQTLNLLECSKARIEVRKNLCGFIPAEIAVTDKKHGNFSLRFGDISSLDAPISIPGNLSLSDFVNEID

Query:  LKRVHHVMEDE
        L R+  V++DE
Subjt:  LKRVHHVMEDE

A0A5A7V639 Uncharacterized protein; e value: 1.3e-30; % identity: 40
Query:  DVASDVSLSSEEFESLTMC-------QTNDVVIE----DSYAETLGLLFQEDEQKIESPISSLKVFNPSSIAIPDNFSSLIISWNTRGLGDRSKRVALKK
        D  S  S SSE+ E L +          N + +E    DS A+  G+ F  D      P  S    +    +  D  S L      + L D SKR+ALK+
Subjt:  DVASDVSLSSEEFESLTMC-------QTNDVVIE----DSYAETLGLLFQEDEQKIESPISSLKVFNPSSIAIPDNFSSLIISWNTRGLGDRSKRVALKK

Query:  FIHHHIPDLVLIQETKTVSFDINLIKSLWSSNDIYWINVESFGRSGGLLIMWDDSKLKALQFIKGG--------------------YGPNDYRERKYLWN
        F+     D+VLIQE+K   FDI  IKSLWSS D  W   E FG SGG+L +WD SKLK ++ +KGG                    YGPND++ER+ +W 
Subjt:  FIHHHIPDLVLIQETKTVSFDINLIKSLWSSNDIYWINVESFGRSGGLLIMWDDSKLKALQFIKGG--------------------YGPNDYRERKYLWN

Query:  ELRSLFSYVDEPWCIEGDFNITRWVHERIPLRRQT
        EL SL +Y  + WCI GD NI RW HER P RR T
Subjt:  ELRSLFSYVDEPWCIEGDFNITRWVHERIPLRRQT

A0A5D3BDI9 Uncharacterized protein; e value: 3.9e-30; % identity: 36.51
Query:  PISTPCM-----DESDVASDVSLSSEEFESLTMCQTNDVVIEDSYAET-LGLLFQEDEQKIESPISSLKV---FNPSSIAIPDNFSSLIISWNTRGLGDR
        P  TP +      +S V S +S+SSEE +       N+ +++    ET L  LFQ DE   E  +    +       S  IP++  S+++     G+   
Subjt:  PISTPCM-----DESDVASDVSLSSEEFESLTMCQTNDVVIEDSYAET-LGLLFQEDEQKIESPISSLKV---FNPSSIAIPDNFSSLIISWNTRGLGDR

Query:  SKRVALKKFIHHHIPDLVLIQETKTVSFDINLIKSLWSSNDIYWINVESFGRSGGLLIMWDDSKLKALQFIKGGYGPNDYRERKYLWNELRSLFSYVDEP
        +KR+A ++F+  H P++VLIQE+K   FDI+ IKSLWS  D+ W  VES G SGG+L +WD S L  ++ +K             +   L+         
Subjt:  SKRVALKKFIHHHIPDLVLIQETKTVSFDINLIKSLWSSNDIYWINVESFGRSGGLLIMWDDSKLKALQFIKGGYGPNDYRERKYLWNELRSLFSYVDEP

Query:  WCIEGDFNITRWVHERIPLRRQTKGMRIFNKVIDELELRELPLANGKFTWSK
          + GDFNITRW HER P  R TK +R FN VI   +L E+ L+NG+FTWS+
Subjt:  WCIEGDFNITRWVHERIPLRRQTKGMRIFNKVIDELELRELPLANGKFTWSK

A0A5D3BHE3 Uncharacterized protein; e value: 7.6e-34; % identity: 48.8
Query:  PD-LVLIQETKTVSFDINLIKSLWSSNDIYWINVESFGRSGGLLIMWDDSKLKALQFIKGG--------------------YGPNDYRERKYLWNELRSL
        PD LV+    +    DI LIKSLWSS DI W  VESFGR GG+L MWD SK+K ++ +KGG                    YGP DY ER+++W  L SL
Subjt:  PD-LVLIQETKTVSFDINLIKSLWSSNDIYWINVESFGRSGGLLIMWDDSKLKALQFIKGG--------------------YGPNDYRERKYLWNELRSL

Query:  FSYVDEPWCIEGDFNITRWVHERIPLRRQTKGMRIFNKVIDELELRELPLANGKFTWSKPVDGDSL
          Y    WCI G  NITRW HE  PL +QT+GMR FN  ID L + ELPL NG+ TWS+  +G S+
Subjt:  FSYVDEPWCIEGDFNITRWVHERIPLRRQTKGMRIFNKVIDELELRELPLANGKFTWSKPVDGDSL

SwissProt top hits (e value, % identity, alignment)
Q9FGN7 Sister chromatid cohesion protein SCC4; e value: 1.7e-06; % identity: 67.5
Query:  IPEELMKLGITDGVREVSLQHSAIWMAGVYLMLLMQLLEN
        I +EL+KLGITD VRE  L+H+AIWM+ V+LML MQ LEN
Subjt:  IPEELMKLGITDGVREVSLQHSAIWMAGVYLMLLMQLLEN

Arabidopsis top hits (e value, % identity, alignment)
AT5G51340.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e value: 1.2e-07; % identity: 67.5
Query:  IPEELMKLGITDGVREVSLQHSAIWMAGVYLMLLMQLLEN
        I +EL+KLGITD VRE  L+H+AIWM+ V+LML MQ LEN
Subjt:  IPEELMKLGITDGVREVSLQHSAIWMAGVYLMLLMQLLEN
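
The % identity values listed with each hit can be cross-checked against the alignment text. The sketch below assumes the BLAST-style convention for the middle line of each alignment block: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. It also assumes identity is reported over the full aligned length. The function name `percent_identity` is hypothetical.

```python
def percent_identity(midline, aligned_length):
    """Percent of aligned columns whose midline character is a letter.

    ASSUMPTION: letters in the midline mark identical residues, as in
    BLAST pairwise output; '+' and spaces are not counted as identities.
    The aligned length is passed explicitly because trailing spaces in
    the midline may be lost when the page is copied as text.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Query and midline copied from the AT5G51340.1 alignment above
query = "IPEELMKLGITDGVREVSLQHSAIWMAGVYLMLLMQLLEN"
midline = "I +EL+KLGITD VRE  L+H+AIWM+ V+LML MQ LEN"
print(percent_identity(midline, len(query)))
```

For this 40-column alignment the midline holds 27 letters, so the sketch gives 67.5, which agrees with the value listed for the hit. Alignments that span several blocks would need their midlines concatenated first.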


Sequences
CDS sequence
ATGACTGAAGACAAACCAAGTAGCTTGTCGAGGAAACACCAAAGTCAACGGAAGACACCAAAGAAATCGACGACGGAAGAAATCATATCGAAAGGAGAGACACCAGAAGA
AAAGAACGCCAAAGAGATGAAGAATCTAACGACTAAGTATCTTCAATCCTGGAAAAGAGTGCAAGTTCCTGATGGTTTTGCTAAGAAAGGCAGGTCAATTTTTTGGGAAA
TGGTAAGAGATTTTCTTATGGAATATGTGGAAATTAAGTCTGCTAAAAATGTTTCAAAGAAGTTTGAGATTGAAGAGTCATCCATATCGGCTGTTAATATTAGTAAGAAT
TTGGATAGAAGATATGCAGAAGTGGTGTGGTCAAAATCAGGGCATCCCCATCATGGTTCCCATTCTAAGAAGCTGCCAGAATTTTCTTCATTTTGGGTCAGACAAGAAAA
AGAAGTGGTGGACTTAAAGTTAGAAGAATTTTGTGTGGTTTCCAAAATGTTTGCACATAACTCTTGGAGGGAAGTAAAACAAGATTTGGAAGATTATTTTCAGTCTAAGG
TTTTACTCAACCCTTTTATGGCTGACAAAGCATTGGTGAAATTGGACGATAGTTTTTCTGAGATGAAATTTGATGGCAAGTGGAAATTCATTGGAAACCTTCATTTGAAA
ATTGAAAATTGGTCATGTCAAAACCATTTTCATCTAGAAGTTATTGAAGGTTATGGAGGTTGGATTTCTGTGAAGAACTTACCTCTGCCTCTTGGGAATCGTTCAACTTT
TGAAGTGATTTGTCAACATTTTGGTGGATTGGTGAGTATTTCTTCTCAAACGCTTAACCTTTTGGAGTGTTCAAAAGCTCGAATTGAAGTAAGAAAGAATCTTTGTGGAT
TCATCCCTGCTGAGATTGCTGTTACAGATAAAAAGCACGGAAATTTTTCTCTTCGTTTTGGTGATATCTCCTCTTTAGATGCCCCTATTTCTATCCCTGGGAATTTATCT
TTGAGTGATTTTGTAAATGAAATTGATTTAAAAAGAGTTCATCACGTTATGGAAGACGAAAGGTTTGCTCTTCATCAAGATGATTTTGATTCTTGTGTTGACCAAGAGTT
GAATCATTCAGCATTGAATGATCAGAATGTATTCACCAAGGGTATTCTTTCTCAAGATCAAAATGCTTCTACCAAGTGTCCAATTTCTACCCCTTGTATGGATGAATCTG
ATGTGGCTTCTGATGTCAGCTTAAGTAGTGAAGAATTTGAATCACTTACTATGTGTCAAACTAATGATGTTGTTATTGAAGACTCTTATGCTGAAACTTTGGGATTATTA
TTCCAGGAAGATGAGCAGAAAATTGAATCTCCTATTTCTAGTCTAAAAGTTTTCAACCCTTCATCTATAGCTATTCCGGATAATTTCTCTTCTTTGATCATTTCATGGAA
TACAAGAGGTCTTGGGGATCGTTCTAAAAGAGTTGCTTTAAAAAAGTTTATTCATCATCATATTCCAGATCTGGTTTTAATTCAAGAAACCAAGACAGTTTCATTTGATA
TTAATTTGATCAAATCATTATGGAGCTCCAATGATATATATTGGATTAATGTGGAATCTTTCGGCCGTTCAGGTGGTTTGCTGATTATGTGGGATGACAGTAAGTTGAAA
GCTCTGCAATTTATTAAAGGAGGGTATGGTCCAAATGATTATAGAGAAAGGAAGTATCTATGGAATGAACTTCGGTCTTTATTTTCTTATGTTGATGAACCATGGTGTAT
TGAAGGTGATTTTAATATAACCCGTTGGGTTCACGAGAGAATTCCATTACGAAGACAAACAAAAGGAATGAGGATCTTTAATAAGGTTATTGATGAGCTTGAACTCCGGG
AGTTACCCTTAGCCAATGGAAAATTCACTTGGTCCAAACCAGTAGATGGGGATTCTCTTATGTCTTTCAGAGATATTGAAGCTGAAATTCTTGGTTATTTTGCGTCACTT
TATACGAAGATACCGGAGGAGTTGATGAAGCTTGGGATAACTGATGGTGTAAGAGAAGTCAGTTTGCAACACTCTGCCATTTGGATGGCTGGCGTTTATTTAATGCTCCT
TATGCAGCTTCTTGAAAACTGTTAG
mRNA sequence
ATGACTGAAGACAAACCAAGTAGCTTGTCGAGGAAACACCAAAGTCAACGGAAGACACCAAAGAAATCGACGACGGAAGAAATCATATCGAAAGGAGAGACACCAGAAGA
AAAGAACGCCAAAGAGATGAAGAATCTAACGACTAAGTATCTTCAATCCTGGAAAAGAGTGCAAGTTCCTGATGGTTTTGCTAAGAAAGGCAGGTCAATTTTTTGGGAAA
TGGTAAGAGATTTTCTTATGGAATATGTGGAAATTAAGTCTGCTAAAAATGTTTCAAAGAAGTTTGAGATTGAAGAGTCATCCATATCGGCTGTTAATATTAGTAAGAAT
TTGGATAGAAGATATGCAGAAGTGGTGTGGTCAAAATCAGGGCATCCCCATCATGGTTCCCATTCTAAGAAGCTGCCAGAATTTTCTTCATTTTGGGTCAGACAAGAAAA
AGAAGTGGTGGACTTAAAGTTAGAAGAATTTTGTGTGGTTTCCAAAATGTTTGCACATAACTCTTGGAGGGAAGTAAAACAAGATTTGGAAGATTATTTTCAGTCTAAGG
TTTTACTCAACCCTTTTATGGCTGACAAAGCATTGGTGAAATTGGACGATAGTTTTTCTGAGATGAAATTTGATGGCAAGTGGAAATTCATTGGAAACCTTCATTTGAAA
ATTGAAAATTGGTCATGTCAAAACCATTTTCATCTAGAAGTTATTGAAGGTTATGGAGGTTGGATTTCTGTGAAGAACTTACCTCTGCCTCTTGGGAATCGTTCAACTTT
TGAAGTGATTTGTCAACATTTTGGTGGATTGGTGAGTATTTCTTCTCAAACGCTTAACCTTTTGGAGTGTTCAAAAGCTCGAATTGAAGTAAGAAAGAATCTTTGTGGAT
TCATCCCTGCTGAGATTGCTGTTACAGATAAAAAGCACGGAAATTTTTCTCTTCGTTTTGGTGATATCTCCTCTTTAGATGCCCCTATTTCTATCCCTGGGAATTTATCT
TTGAGTGATTTTGTAAATGAAATTGATTTAAAAAGAGTTCATCACGTTATGGAAGACGAAAGGTTTGCTCTTCATCAAGATGATTTTGATTCTTGTGTTGACCAAGAGTT
GAATCATTCAGCATTGAATGATCAGAATGTATTCACCAAGGGTATTCTTTCTCAAGATCAAAATGCTTCTACCAAGTGTCCAATTTCTACCCCTTGTATGGATGAATCTG
ATGTGGCTTCTGATGTCAGCTTAAGTAGTGAAGAATTTGAATCACTTACTATGTGTCAAACTAATGATGTTGTTATTGAAGACTCTTATGCTGAAACTTTGGGATTATTA
TTCCAGGAAGATGAGCAGAAAATTGAATCTCCTATTTCTAGTCTAAAAGTTTTCAACCCTTCATCTATAGCTATTCCGGATAATTTCTCTTCTTTGATCATTTCATGGAA
TACAAGAGGTCTTGGGGATCGTTCTAAAAGAGTTGCTTTAAAAAAGTTTATTCATCATCATATTCCAGATCTGGTTTTAATTCAAGAAACCAAGACAGTTTCATTTGATA
TTAATTTGATCAAATCATTATGGAGCTCCAATGATATATATTGGATTAATGTGGAATCTTTCGGCCGTTCAGGTGGTTTGCTGATTATGTGGGATGACAGTAAGTTGAAA
GCTCTGCAATTTATTAAAGGAGGGTATGGTCCAAATGATTATAGAGAAAGGAAGTATCTATGGAATGAACTTCGGTCTTTATTTTCTTATGTTGATGAACCATGGTGTAT
TGAAGGTGATTTTAATATAACCCGTTGGGTTCACGAGAGAATTCCATTACGAAGACAAACAAAAGGAATGAGGATCTTTAATAAGGTTATTGATGAGCTTGAACTCCGGG
AGTTACCCTTAGCCAATGGAAAATTCACTTGGTCCAAACCAGTAGATGGGGATTCTCTTATGTCTTTCAGAGATATTGAAGCTGAAATTCTTGGTTATTTTGCGTCACTT
TATACGAAGATACCGGAGGAGTTGATGAAGCTTGGGATAACTGATGGTGTAAGAGAAGTCAGTTTGCAACACTCTGCCATTTGGATGGCTGGCGTTTATTTAATGCTCCT
TATGCAGCTTCTTGAAAACTGTTAG
Protein sequence
MTEDKPSSLSRKHQSQRKTPKKSTTEEIISKGETPEEKNAKEMKNLTTKYLQSWKRVQVPDGFAKKGRSIFWEMVRDFLMEYVEIKSAKNVSKKFEIEESSISAVNISKN
LDRRYAEVVWSKSGHPHHGSHSKKLPEFSSFWVRQEKEVVDLKLEEFCVVSKMFAHNSWREVKQDLEDYFQSKVLLNPFMADKALVKLDDSFSEMKFDGKWKFIGNLHLK
IENWSCQNHFHLEVIEGYGGWISVKNLPLPLGNRSTFEVICQHFGGLVSISSQTLNLLECSKARIEVRKNLCGFIPAEIAVTDKKHGNFSLRFGDISSLDAPISIPGNLS
LSDFVNEIDLKRVHHVMEDERFALHQDDFDSCVDQELNHSALNDQNVFTKGILSQDQNASTKCPISTPCMDESDVASDVSLSSEEFESLTMCQTNDVVIEDSYAETLGLL
FQEDEQKIESPISSLKVFNPSSIAIPDNFSSLIISWNTRGLGDRSKRVALKKFIHHHIPDLVLIQETKTVSFDINLIKSLWSSNDIYWINVESFGRSGGLLIMWDDSKLK
ALQFIKGGYGPNDYRERKYLWNELRSLFSYVDEPWCIEGDFNITRWVHERIPLRRQTKGMRIFNKVIDELELRELPLANGKFTWSKPVDGDSLMSFRDIEAEILGYFASL
YTKIPEELMKLGITDGVREVSLQHSAIWMAGVYLMLLMQLLENC
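
The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene; the function name `translate` is hypothetical. It stops at the first stop codon and does not handle ambiguous bases such as N.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# T, C, A, G at each position; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS (line breaks allowed) up to the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above
print(translate("ATGACTGAAGACAAACCAAGTAGC"))
```

The first eight codons give `MTEDKPSS`, the start of the protein sequence listed above. Passing the full CDS text, line breaks included, should reproduce the whole protein if the standard-code assumption holds.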