; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G06155 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G06155
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationClcChr07:10896561..10903318
RNA-Seq ExpressionClc07G06155
SyntenyClc07G06155
Gene Ontology termsNA
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]1.2e-5360.59Show/hide
Query:  LRPHHTKLDPKSLKCIFLGYSSVQKEYRCYCPTLNKYLVSPNVTFLEDMPFSPSPSSSCQGQDDDLFIYEIISPTSS-----TPSSSVTSFSSTRYSSLL
        +RPHHTKLDPKSLKCIFLGYS VQK YRCYCPTL +YLVSP+V F ED PF+ SPSS CQG+DD+LFIYE+ SPT S     +PS  + S   +R     
Subjt:  LRPHHTKLDPKSLKCIFLGYSSVQKEYRCYCPTLNKYLVSPNVTFLEDMPFSPSPSSSCQGQDDDLFIYEIISPTSS-----TPSSSVTSFSSTRYSSLL

Query:  QATSTTTFKHMSYTNDYPKPSDDLPIALRKGKRTCTYPISSFVSHHQLSSPTYSFSTSFDSTSIPNS----------GSTMIDEMTTLDDNGTWDLVSHP
           S +    M  ++  P PSDDLPIALRKGKR CTYP+SSF+S+HQLS  TY+F TS +STSIPNS           + MI+EMT LDDNGTWDLVS P
Subjt:  QATSTTTFKHMSYTNDYPKPSDDLPIALRKGKRTCTYPISSFVSHHQLSSPTYSFSTSFDSTSIPNS----------GSTMIDEMTTLDDNGTWDLVSHP

Query:  RDK
          K
Subjt:  RDK

XP_031744754.1 uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus]1.2e-5360.59Show/hide
Query:  LRPHHTKLDPKSLKCIFLGYSSVQKEYRCYCPTLNKYLVSPNVTFLEDMPFSPSPSSSCQGQDDDLFIYEIISPTSS-----TPSSSVTSFSSTRYSSLL
        +RPHHTKLDPKSLKCIFLGYS VQK YRCYCPTL +YLVSP+V F ED PF+ SPSS CQG+DD+LFIYE+ SPT S     +PS  + S   +R     
Subjt:  LRPHHTKLDPKSLKCIFLGYSSVQKEYRCYCPTLNKYLVSPNVTFLEDMPFSPSPSSSCQGQDDDLFIYEIISPTSS-----TPSSSVTSFSSTRYSSLL

Query:  QATSTTTFKHMSYTNDYPKPSDDLPIALRKGKRTCTYPISSFVSHHQLSSPTYSFSTSFDSTSIPNS----------GSTMIDEMTTLDDNGTWDLVSHP
           S +    M  ++  P PSDDLPIALRKGKR CTYP+SSF+S+HQLS  TY+F TS +STSIPNS           + MI+EMT LDDNGTWDLVS P
Subjt:  QATSTTTFKHMSYTNDYPKPSDDLPIALRKGKRTCTYPISSFVSHHQLSSPTYSFSTSFDSTSIPNS----------GSTMIDEMTTLDDNGTWDLVSHP

Query:  RDK
          K
Subjt:  RDK

XP_031744755.1 uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus]1.2e-5360.59Show/hide
Query:  LRPHHTKLDPKSLKCIFLGYSSVQKEYRCYCPTLNKYLVSPNVTFLEDMPFSPSPSSSCQGQDDDLFIYEIISPTSS-----TPSSSVTSFSSTRYSSLL
        +RPHHTKLDPKSLKCIFLGYS VQK YRCYCPTL +YLVSP+V F ED PF+ SPSS CQG+DD+LFIYE+ SPT S     +PS  + S   +R     
Subjt:  LRPHHTKLDPKSLKCIFLGYSSVQKEYRCYCPTLNKYLVSPNVTFLEDMPFSPSPSSSCQGQDDDLFIYEIISPTSS-----TPSSSVTSFSSTRYSSLL

Query:  QATSTTTFKHMSYTNDYPKPSDDLPIALRKGKRTCTYPISSFVSHHQLSSPTYSFSTSFDSTSIPNS----------GSTMIDEMTTLDDNGTWDLVSHP
           S +    M  ++  P PSDDLPIALRKGKR CTYP+SSF+S+HQLS  TY+F TS +STSIPNS           + MI+EMT LDDNGTWDLVS P
Subjt:  QATSTTTFKHMSYTNDYPKPSDDLPIALRKGKRTCTYPISSFVSHHQLSSPTYSFSTSFDSTSIPNS----------GSTMIDEMTTLDDNGTWDLVSHP

Query:  RDK
          K
Subjt:  RDK

XP_031744756.1 uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus]1.2e-5360.59Show/hide
Query:  LRPHHTKLDPKSLKCIFLGYSSVQKEYRCYCPTLNKYLVSPNVTFLEDMPFSPSPSSSCQGQDDDLFIYEIISPTSS-----TPSSSVTSFSSTRYSSLL
        +RPHHTKLDPKSLKCIFLGYS VQK YRCYCPTL +YLVSP+V F ED PF+ SPSS CQG+DD+LFIYE+ SPT S     +PS  + S   +R     
Subjt:  LRPHHTKLDPKSLKCIFLGYSSVQKEYRCYCPTLNKYLVSPNVTFLEDMPFSPSPSSSCQGQDDDLFIYEIISPTSS-----TPSSSVTSFSSTRYSSLL

Query:  QATSTTTFKHMSYTNDYPKPSDDLPIALRKGKRTCTYPISSFVSHHQLSSPTYSFSTSFDSTSIPNS----------GSTMIDEMTTLDDNGTWDLVSHP
           S +    M  ++  P PSDDLPIALRKGKR CTYP+SSF+S+HQLS  TY+F TS +STSIPNS           + MI+EMT LDDNGTWDLVS P
Subjt:  QATSTTTFKHMSYTNDYPKPSDDLPIALRKGKRTCTYPISSFVSHHQLSSPTYSFSTSFDSTSIPNS----------GSTMIDEMTTLDDNGTWDLVSHP

Query:  RDK
          K
Subjt:  RDK

XP_031744758.1 uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus]1.2e-5360.59Show/hide
Query:  LRPHHTKLDPKSLKCIFLGYSSVQKEYRCYCPTLNKYLVSPNVTFLEDMPFSPSPSSSCQGQDDDLFIYEIISPTSS-----TPSSSVTSFSSTRYSSLL
        +RPHHTKLDPKSLKCIFLGYS VQK YRCYCPTL +YLVSP+V F ED PF+ SPSS CQG+DD+LFIYE+ SPT S     +PS  + S   +R     
Subjt:  LRPHHTKLDPKSLKCIFLGYSSVQKEYRCYCPTLNKYLVSPNVTFLEDMPFSPSPSSSCQGQDDDLFIYEIISPTSS-----TPSSSVTSFSSTRYSSLL

Query:  QATSTTTFKHMSYTNDYPKPSDDLPIALRKGKRTCTYPISSFVSHHQLSSPTYSFSTSFDSTSIPNS----------GSTMIDEMTTLDDNGTWDLVSHP
           S +    M  ++  P PSDDLPIALRKGKR CTYP+SSF+S+HQLS  TY+F TS +STSIPNS           + MI+EMT LDDNGTWDLVS P
Subjt:  QATSTTTFKHMSYTNDYPKPSDDLPIALRKGKRTCTYPISSFVSHHQLSSPTYSFSTSFDSTSIPNS----------GSTMIDEMTTLDDNGTWDLVSHP

Query:  RDK
          K
Subjt:  RDK

TrEMBL top hitse value%identityAlignment
A0A5A7UAV0 Putative Polyprotein1.8e-4760.21Show/hide
Query:  PHHTKLDPKSLKCIFLGYSSVQKEYRCYCPTLNKYLVSPNVTFLEDMPFSPSPSSSCQGQDDDLFIYEIISPTSSTPSSSVTSFSSTRYSSLLQATSTTT
        P HTKLD KSLKCIFLGYS VQK Y CYCPTL +YLVS +V F ED+PF+ SPSS  QG+DD+ FIYEI  PT + PS  + S   +R    L   S + 
Subjt:  PHHTKLDPKSLKCIFLGYSSVQKEYRCYCPTLNKYLVSPNVTFLEDMPFSPSPSSSCQGQDDDLFIYEIISPTSSTPSSSVTSFSSTRYSSLLQATSTTT

Query:  FKHMSYTNDYPKPSDDLPIALRKGKRTCTYPISSFVSHHQLSSPTYSFSTSFDSTSIPNS----------GSTMIDEMTTLDDNGTWDLVS
           M  ++    PSDDLPIALRKGKR CTYPISSFV +HQLS+PTY+F TS DSTSIPN+           + MI+EMT LDDNGTWDLVS
Subjt:  FKHMSYTNDYPKPSDDLPIALRKGKRTCTYPISSFVSHHQLSSPTYSFSTSFDSTSIPNS----------GSTMIDEMTTLDDNGTWDLVS

A0A5A7V878 DUF4283 domain-containing protein2.8e-3232.28Show/hide
Query:  GWILRCVVWPTYRGRFFIHIQEAPLQQDWTAFLKMLNDFIQKSKQVKKKSVRSSMPMDVTIENLLPISKKLQMSYAETSMMTEMFSPKIQSPVSAIEKQS
        GW+LRC VWP   GRF++H+   P QQ W +F  M+ DF+  S    K+ +R                                        V  +E  +
Subjt:  GWILRCVVWPTYRGRFFIHIQEAPLQQDWTAFLKMLNDFIQKSKQVKKKSVRSSMPMDVTIENLLPISKKLQMSYAETSMMTEMFSPKIQSPVSAIEKQS

Query:  LQQDIAKFLEEFYQALSQFEDKALIKVVQRKVEDLIETPRKWLDYRKFHLPFEKWNTLQYSQPTLIKGFGGWLSIKNLPLEYWKRDTFEAIGTYFGGLDR
          Q + +F          F+ K+    V +  E L E  +  LD                    ++KG+GGW+SIKNLPL+YW  D ++AIG +FGG + 
Subjt:  LQQDIAKFLEEFYQALSQFEDKALIKVVQRKVEDLIETPRKWLDYRKFHLPFEKWNTLQYSQPTLIKGFGGWLSIKNLPLEYWKRDTFEAIGTYFGGLDR

Query:  IAIETLNLLNCSEAKIKVKKNLCGFMPATIEIKNEIRGNIFGTLNQLKLL---AMFIKTIFIKDFSNPTDQVRLKQVAKYEENKS
        I+++T+NL+NCSEAKIKV +NLCGF+PA +E+++  R NIF     +K+L    +  + +FI   +N  D +R+ QV   E  +S
Subjt:  IAIETLNLLNCSEAKIKVKKNLCGFMPATIEIKNEIRGNIFGTLNQLKLL---AMFIKTIFIKDFSNPTDQVRLKQVAKYEENKS

A0A5D3BHE3 Uncharacterized protein1.9e-4452.12Show/hide
Query:  FSSKDVGWIFVEAYGKSGGMLTIAPGHSKLSVIEVIKGGYSLSVKCRTICRKVCWVTNIYDPTSYKGRRFIWPEFFSLSAYCIEAWCLGGDFNVTRTIQE
        +SSKD+GW  VE++G+ GG+LT+    SK+ V+E +KGGYSLS+   T C+K CW+TN+Y P  Y+ RRF+W    SLS YC  AWC+GG  N+TR   E
Subjt:  FSSKDVGWIFVEAYGKSGGMLTIAPGHSKLSVIEVIKGGYSLSVKCRTICRKVCWVTNIYDPTSYKGRRFIWPEFFSLSAYCIEAWCLGGDFNVTRTIQE

Query:  RFLVGSMTRGMKKFNKFIANANLLEILLSNGRFTWSREGRVVSRSLLDKFLVSNEWDEAFEDTRV
         F +   TRGM++FN  I + N+ E+ L NGR TWSREG  +SRSLLD F +  EWDE  E++RV
Subjt:  RFLVGSMTRGMKKFNKFIANANLLEILLSNGRFTWSREGRVVSRSLLDKFLVSNEWDEAFEDTRV

A0A5D3C8E2 Putative Polyprotein1.8e-4760.21Show/hide
Query:  PHHTKLDPKSLKCIFLGYSSVQKEYRCYCPTLNKYLVSPNVTFLEDMPFSPSPSSSCQGQDDDLFIYEIISPTSSTPSSSVTSFSSTRYSSLLQATSTTT
        P HTKLD KSLKCIFLGYS VQK Y CYCPTL +YLVS +V F ED+PF+ SPSS  QG+DD+ FIYEI  PT + PS  + S   +R    L   S + 
Subjt:  PHHTKLDPKSLKCIFLGYSSVQKEYRCYCPTLNKYLVSPNVTFLEDMPFSPSPSSSCQGQDDDLFIYEIISPTSSTPSSSVTSFSSTRYSSLLQATSTTT

Query:  FKHMSYTNDYPKPSDDLPIALRKGKRTCTYPISSFVSHHQLSSPTYSFSTSFDSTSIPNS----------GSTMIDEMTTLDDNGTWDLVS
           M  ++    PSDDLPIALRKGKR CTYPISSFV +HQLS+PTY+F TS DSTSIPN+           + MI+EMT LDDNGTWDLVS
Subjt:  FKHMSYTNDYPKPSDDLPIALRKGKRTCTYPISSFVSHHQLSSPTYSFSTSFDSTSIPNS----------GSTMIDEMTTLDDNGTWDLVS

A0A5D3E3A5 Ulp1-like peptidase9.7e-3336.99Show/hide
Query:  RDGFGAMRLAKFKSNLGWILRCVVWPTYRGRFFIHIQEAPLQQDWTAFLKMLNDFIQKSK-QVKKKSVRSSM---PMDVTIENLLPISKKLQMSYAETSM
        +D  GA RL KFKS  GWILRCV         F+H+     +Q W +         Q ++ +   +S+ SS+       +   LL  S K+      T +
Subjt:  RDGFGAMRLAKFKSNLGWILRCVVWPTYRGRFFIHIQEAPLQQDWTAFLKMLNDFIQKSK-QVKKKSVRSSM---PMDVTIENLLPISKKLQMSYAETSM

Query:  MTEMFSPKIQSPVSAIEKQSLQQDIAKFLEEFYQALSQ----FEDKALIKVVQRKVEDLIETPRKWLDYRKFHLPFEKWNTLQYSQPTLIKGFGGWLSIK
        +T   S +I               I  FLE+ ++        F+++A  K+   K+E+L+ T  KW +Y KFHL FEKW+ + +S+P+ IKGF GWLSIK
Subjt:  MTEMFSPKIQSPVSAIEKQSLQQDIAKFLEEFYQALSQ----FEDKALIKVVQRKVEDLIETPRKWLDYRKFHLPFEKWNTLQYSQPTLIKGFGGWLSIK

Query:  NLPLEYWKRDTFEAIGTYFGGLDRIAIETLNLLNCSEAKIKVKKNL
        NLPL+ W+R  FE IG +FGGL+  A++TLNL+ C++A+I+V+KN+
Subjt:  NLPLEYWKRDTFEAIGTYFGGLDRIAIETLNLLNCSEAKIKVKKNL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGCTCCTTTCAAGAAAAAGAGAGTGTTTGACAAGAAGAATGATCAGGTAGAACAGAATCTGACCAGAGCTACTGATCGAAGAAGAGAAGGGGGAGGGGGGAGAGG
AGAGAGGAGAGAGCATCCTTACCTTTTTTCCCCTTTTTTTCTCCGTCCTCATCATACCAAGTTAGATCCCAAATCCTTGAAATGCATATTCTTAGGTTATTCGAGTGTTC
AAAAGGAATATCGTTGTTATTGTCCTACTCTTAACAAATATCTTGTCTCTCCTAATGTTACGTTTCTCGAAGATATGCCCTTTAGTCCATCGCCGTCAAGTTCGTGTCAA
GGGCAGGATGATGATCTTTTTATCTATGAGATTATCTCTCCCACATCATCTACTCCATCTTCCTCTGTCACTTCCTTCTCGTCCACCCGTTACTCGAGTTTACTCCAGGC
GACTTCCACAACTACCTTCAAGCACATGTCCTACACCAATGACTATCCAAAACCAAGTGATGATCTTCCCATCGCCCTTCGGAAAGGTAAACGCACTTGTACTTACCCTA
TTTCTTCATTTGTTTCACATCATCAGTTGTCCTCGCCCACATATTCTTTTAGTACATCCTTTGATTCCACCTCTATTCCTAACTCTGGTAGTACAATGATTGACGAGATG
ACTACTTTAGATGATAATGGTACTTGGGATTTAGTCTCTCATCCTAGAGATAAACCGAGCCGAACCACCAAATTTAGAAAGAAATTTGAAGAAGCCGGACTGAACCACTT
CGGTCCATCGCTTGACTATCTTCAACCTATGAATAATGGGTTGCAAGCCTCACCATCTCACTTGTTTATGGATTCGGCTGCTTTGGATGTTGAAAAGACAGATTGGAAAG
AGAAAGATAAAGGAAAGAATAAAGGCTTTTCAACTCCAACATTTCCCTTCTACTCTAATTCCCCGGTCATCCCCCAACGTCAAGCCTTTTCTCCGGCCGGTCATCCCCCG
ACGTCAAGCCTCTTCTCCGGCAAACGCGATGGATTTGGAGCAATGAGACTAGCTAAATTCAAATCCAATTTGGGATGGATTCTGAGATGTGTGGTTTGGCCGACATACAG
AGGTCGCTTCTTTATACATATCCAAGAGGCGCCTTTGCAACAAGATTGGACAGCCTTTCTGAAAATGCTTAACGACTTCATTCAAAAAAGCAAGCAAGTTAAAAAGAAGT
CGGTAAGGAGCTCGATGCCAATGGATGTCACAATTGAGAACCTCCTCCCTATCTCAAAAAAACTACAGATGAGCTATGCAGAGACGAGCATGATGACTGAGATGTTTTCT
CCAAAGATCCAATCCCCTGTTTCAGCAATCGAAAAACAGAGCCTACAACAAGATATTGCCAAATTCCTAGAAGAATTTTATCAAGCTTTATCTCAGTTTGAAGATAAAGC
TTTAATCAAAGTTGTTCAAAGAAAAGTAGAAGACTTGATAGAAACTCCGAGAAAATGGCTTGACTACCGAAAGTTTCATTTACCTTTTGAAAAATGGAATACTCTTCAAT
ACAGTCAGCCAACTCTCATAAAAGGATTCGGAGGATGGTTATCAATCAAAAACCTCCCTTTGGAATATTGGAAAAGAGACACTTTTGAGGCCATTGGAACATATTTTGGA
GGTCTAGACCGTATAGCTATTGAGACACTTAATCTTCTAAATTGTTCCGAAGCAAAAATTAAAGTTAAAAAGAATTTATGTGGTTTTATGCCCGCAACAATTGAGATAAA
GAATGAAATTAGAGGAAATATCTTTGGGACATTGAATCAGTTGAAGCTCCTAGCAATGTTTATAAAGACTATATTCATTAAAGATTTCTCTAATCCTACTGATCAAGTTC
GATTGAAGCAAGTTGCAAAATATGAAGAGAATAAGTCGAGCATCACAAGAAATCCCTTCAATGCCATTAAAAAGCAATCCATTCATCCAAAGAAGCATTCACCGAAAGTA
GGCCAAAGCTCGTCGGAAAATCCAGAGAAGGAGAAAGAAGGATCGAAAAAAGAAAATGTCTCTGAGAAAAAACTTTCAAATGTCAACGCACCTAAGTTAAATGCGTTCGA
GGAAATTGAAAAGCCAGATCTTGTCGTGCGGAAAGAAGTTGAAAGGTCGAAAGTAATGAGGAGAGAGAGTAATGTCGGCCGCCTTTCATTTGGACCTAGCAGTAATCCCA
CACCCGTTTTCAATTCCAACATTATTAAGAATGACAAGTCATCAAAGTCCCCTGCTATTCCTCAAATGAAAGATCAAGTAATGGTAGGGTCCACAATTGAAAGAGCTTCA
ACTAATCATGACTTCCAGTTTGCCATTATTGAATCGGATGAGCCTTTCGAAGCAGCACCTGAAATTCCTCTTCAACTTTCACAATCCAAGCTGCCAGATAACTATACTCT
AAGAAGTCTTTTGTGCCAGCGCTCAAAGGAGCCCGTCCTCATGAAGATTATAACATGGAATACTAGAGGACTTGAAGATCATTCAAAGCAATTGCCCCTTAAATGTTTAC
TCAAGAAGACTAATCCAGATTTCAGTTCGAAAGATGTTGGTTGGATATTTGTCGAAGCATATGGCAAATCAGGAGGGATGTTAACCATTGCACCCGGACATAGCAAATTA
TCAGTTATTGAAGTTATTAAAGGGGGTTATTCACTATCAGTAAAGTGTAGGACTATTTGCAGAAAGGTTTGTTGGGTGACAAATATCTATGATCCCACTAGCTACAAAGG
AAGAAGATTTATTTGGCCTGAATTTTTCTCCCTCTCAGCTTATTGCATAGAAGCTTGGTGTTTGGGTGGAGACTTCAATGTTACCAGAACAATTCAAGAACGTTTCCTAG
TAGGAAGCATGACAAGAGGCATGAAGAAGTTCAATAAGTTCATAGCAAATGCTAATTTACTGGAAATACTATTATCCAATGGACGTTTTACTTGGTCAAGAGAAGGAAGG
GTGGTTTCTCGCTCATTGCTTGACAAATTTCTTGTATCAAATGAATGGGATGAGGCATTTGAAGACACAAGAGTGTTATCAAGGTTGGGTAGACTCATCATATCTTCAAA
ACTGCGAACTTTGAAAAGAACCCTAAAAGATTGGCATGCTGATTTTGAAGGAAAGCAAAGAAGACAGGAAGAGAAACTTTTAGAAGAGCATTATTGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTGCTCCTTTCAAGAAAAAGAGAGTGTTTGACAAGAAGAATGATCAGGTAGAACAGAATCTGACCAGAGCTACTGATCGAAGAAGAGAAGGGGGAGGGGGGAGAGG
AGAGAGGAGAGAGCATCCTTACCTTTTTTCCCCTTTTTTTCTCCGTCCTCATCATACCAAGTTAGATCCCAAATCCTTGAAATGCATATTCTTAGGTTATTCGAGTGTTC
AAAAGGAATATCGTTGTTATTGTCCTACTCTTAACAAATATCTTGTCTCTCCTAATGTTACGTTTCTCGAAGATATGCCCTTTAGTCCATCGCCGTCAAGTTCGTGTCAA
GGGCAGGATGATGATCTTTTTATCTATGAGATTATCTCTCCCACATCATCTACTCCATCTTCCTCTGTCACTTCCTTCTCGTCCACCCGTTACTCGAGTTTACTCCAGGC
GACTTCCACAACTACCTTCAAGCACATGTCCTACACCAATGACTATCCAAAACCAAGTGATGATCTTCCCATCGCCCTTCGGAAAGGTAAACGCACTTGTACTTACCCTA
TTTCTTCATTTGTTTCACATCATCAGTTGTCCTCGCCCACATATTCTTTTAGTACATCCTTTGATTCCACCTCTATTCCTAACTCTGGTAGTACAATGATTGACGAGATG
ACTACTTTAGATGATAATGGTACTTGGGATTTAGTCTCTCATCCTAGAGATAAACCGAGCCGAACCACCAAATTTAGAAAGAAATTTGAAGAAGCCGGACTGAACCACTT
CGGTCCATCGCTTGACTATCTTCAACCTATGAATAATGGGTTGCAAGCCTCACCATCTCACTTGTTTATGGATTCGGCTGCTTTGGATGTTGAAAAGACAGATTGGAAAG
AGAAAGATAAAGGAAAGAATAAAGGCTTTTCAACTCCAACATTTCCCTTCTACTCTAATTCCCCGGTCATCCCCCAACGTCAAGCCTTTTCTCCGGCCGGTCATCCCCCG
ACGTCAAGCCTCTTCTCCGGCAAACGCGATGGATTTGGAGCAATGAGACTAGCTAAATTCAAATCCAATTTGGGATGGATTCTGAGATGTGTGGTTTGGCCGACATACAG
AGGTCGCTTCTTTATACATATCCAAGAGGCGCCTTTGCAACAAGATTGGACAGCCTTTCTGAAAATGCTTAACGACTTCATTCAAAAAAGCAAGCAAGTTAAAAAGAAGT
CGGTAAGGAGCTCGATGCCAATGGATGTCACAATTGAGAACCTCCTCCCTATCTCAAAAAAACTACAGATGAGCTATGCAGAGACGAGCATGATGACTGAGATGTTTTCT
CCAAAGATCCAATCCCCTGTTTCAGCAATCGAAAAACAGAGCCTACAACAAGATATTGCCAAATTCCTAGAAGAATTTTATCAAGCTTTATCTCAGTTTGAAGATAAAGC
TTTAATCAAAGTTGTTCAAAGAAAAGTAGAAGACTTGATAGAAACTCCGAGAAAATGGCTTGACTACCGAAAGTTTCATTTACCTTTTGAAAAATGGAATACTCTTCAAT
ACAGTCAGCCAACTCTCATAAAAGGATTCGGAGGATGGTTATCAATCAAAAACCTCCCTTTGGAATATTGGAAAAGAGACACTTTTGAGGCCATTGGAACATATTTTGGA
GGTCTAGACCGTATAGCTATTGAGACACTTAATCTTCTAAATTGTTCCGAAGCAAAAATTAAAGTTAAAAAGAATTTATGTGGTTTTATGCCCGCAACAATTGAGATAAA
GAATGAAATTAGAGGAAATATCTTTGGGACATTGAATCAGTTGAAGCTCCTAGCAATGTTTATAAAGACTATATTCATTAAAGATTTCTCTAATCCTACTGATCAAGTTC
GATTGAAGCAAGTTGCAAAATATGAAGAGAATAAGTCGAGCATCACAAGAAATCCCTTCAATGCCATTAAAAAGCAATCCATTCATCCAAAGAAGCATTCACCGAAAGTA
GGCCAAAGCTCGTCGGAAAATCCAGAGAAGGAGAAAGAAGGATCGAAAAAAGAAAATGTCTCTGAGAAAAAACTTTCAAATGTCAACGCACCTAAGTTAAATGCGTTCGA
GGAAATTGAAAAGCCAGATCTTGTCGTGCGGAAAGAAGTTGAAAGGTCGAAAGTAATGAGGAGAGAGAGTAATGTCGGCCGCCTTTCATTTGGACCTAGCAGTAATCCCA
CACCCGTTTTCAATTCCAACATTATTAAGAATGACAAGTCATCAAAGTCCCCTGCTATTCCTCAAATGAAAGATCAAGTAATGGTAGGGTCCACAATTGAAAGAGCTTCA
ACTAATCATGACTTCCAGTTTGCCATTATTGAATCGGATGAGCCTTTCGAAGCAGCACCTGAAATTCCTCTTCAACTTTCACAATCCAAGCTGCCAGATAACTATACTCT
AAGAAGTCTTTTGTGCCAGCGCTCAAAGGAGCCCGTCCTCATGAAGATTATAACATGGAATACTAGAGGACTTGAAGATCATTCAAAGCAATTGCCCCTTAAATGTTTAC
TCAAGAAGACTAATCCAGATTTCAGTTCGAAAGATGTTGGTTGGATATTTGTCGAAGCATATGGCAAATCAGGAGGGATGTTAACCATTGCACCCGGACATAGCAAATTA
TCAGTTATTGAAGTTATTAAAGGGGGTTATTCACTATCAGTAAAGTGTAGGACTATTTGCAGAAAGGTTTGTTGGGTGACAAATATCTATGATCCCACTAGCTACAAAGG
AAGAAGATTTATTTGGCCTGAATTTTTCTCCCTCTCAGCTTATTGCATAGAAGCTTGGTGTTTGGGTGGAGACTTCAATGTTACCAGAACAATTCAAGAACGTTTCCTAG
TAGGAAGCATGACAAGAGGCATGAAGAAGTTCAATAAGTTCATAGCAAATGCTAATTTACTGGAAATACTATTATCCAATGGACGTTTTACTTGGTCAAGAGAAGGAAGG
GTGGTTTCTCGCTCATTGCTTGACAAATTTCTTGTATCAAATGAATGGGATGAGGCATTTGAAGACACAAGAGTGTTATCAAGGTTGGGTAGACTCATCATATCTTCAAA
ACTGCGAACTTTGAAAAGAACCCTAAAAGATTGGCATGCTGATTTTGAAGGAAAGCAAAGAAGACAGGAAGAGAAACTTTTAGAAGAGCATTATTGTTAA
Protein sequenceShow/hide protein sequence
MSAPFKKKRVFDKKNDQVEQNLTRATDRRREGGGGRGERREHPYLFSPFFLRPHHTKLDPKSLKCIFLGYSSVQKEYRCYCPTLNKYLVSPNVTFLEDMPFSPSPSSSCQ
GQDDDLFIYEIISPTSSTPSSSVTSFSSTRYSSLLQATSTTTFKHMSYTNDYPKPSDDLPIALRKGKRTCTYPISSFVSHHQLSSPTYSFSTSFDSTSIPNSGSTMIDEM
TTLDDNGTWDLVSHPRDKPSRTTKFRKKFEEAGLNHFGPSLDYLQPMNNGLQASPSHLFMDSAALDVEKTDWKEKDKGKNKGFSTPTFPFYSNSPVIPQRQAFSPAGHPP
TSSLFSGKRDGFGAMRLAKFKSNLGWILRCVVWPTYRGRFFIHIQEAPLQQDWTAFLKMLNDFIQKSKQVKKKSVRSSMPMDVTIENLLPISKKLQMSYAETSMMTEMFS
PKIQSPVSAIEKQSLQQDIAKFLEEFYQALSQFEDKALIKVVQRKVEDLIETPRKWLDYRKFHLPFEKWNTLQYSQPTLIKGFGGWLSIKNLPLEYWKRDTFEAIGTYFG
GLDRIAIETLNLLNCSEAKIKVKKNLCGFMPATIEIKNEIRGNIFGTLNQLKLLAMFIKTIFIKDFSNPTDQVRLKQVAKYEENKSSITRNPFNAIKKQSIHPKKHSPKV
GQSSSENPEKEKEGSKKENVSEKKLSNVNAPKLNAFEEIEKPDLVVRKEVERSKVMRRESNVGRLSFGPSSNPTPVFNSNIIKNDKSSKSPAIPQMKDQVMVGSTIERAS
TNHDFQFAIIESDEPFEAAPEIPLQLSQSKLPDNYTLRSLLCQRSKEPVLMKIITWNTRGLEDHSKQLPLKCLLKKTNPDFSSKDVGWIFVEAYGKSGGMLTIAPGHSKL
SVIEVIKGGYSLSVKCRTICRKVCWVTNIYDPTSYKGRRFIWPEFFSLSAYCIEAWCLGGDFNVTRTIQERFLVGSMTRGMKKFNKFIANANLLEILLSNGRFTWSREGR
VVSRSLLDKFLVSNEWDEAFEDTRVLSRLGRLIISSKLRTLKRTLKDWHADFEGKQRRQEEKLLEEHYC