; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G14370 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G14370
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr7:12893766..12895995
RNA-Seq ExpressionCSPI07G14370
SyntenyCSPI07G14370
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]7.8e-8868.36Show/hide
Query:  SSVLNGEISHRVLFSAKNLFPIAPKIFGCICFVRDVRPHHTKLDPKSLK-IHL--------------------------------FISSPSSSCQGDDDN
        SSVLNGEI +RVLF  K+LFPIAPKIFGC+CFVRDVRPHHTKLDPKSLK I L                                F SSPSS CQG+DDN
Subjt:  SSVLNGEISHRVLFSAKNLFPIAPKIFGCICFVRDVRPHHTKLDPKSLK-IHL--------------------------------FISSPSSSCQGDDDN

Query:  LFIYEVTSPTPSSSTDTPPFRPLISRVYSRRLPPQASDSCPPSMPPSSCNPRPSDDLPIALRKGKSKCTYPVSSFVFYHHLSPLTYAFVTSLDYTFVPNF
        LFIYEVTSPTPS STD  P RPLIS+VYSRR PPQ SDSCPPSM PSSC+P PSDDLPIALRKGK KCTYPVSSF+ YH LSP TYAF+TSL+ T +PN 
Subjt:  LFIYEVTSPTPSSSTDTPPFRPLISRVYSRRLPPQASDSCPPSMPPSSCNPRPSDDLPIALRKGKSKCTYPVSSFVFYHHLSPLTYAFVTSLDYTFVPNF

Query:  VHEALSHPNWRNAMIEEMTALENN----------GKKTIGCNWVLAVKMNPDGTVA
        VHEALSHP W+NAMIEEMTAL++N          GKK IGC WV AVKMNPDGTVA
Subjt:  VHEALSHPNWRNAMIEEMTALENN----------GKKTIGCNWVLAVKMNPDGTVA

XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]3.6e-3271.68Show/hide
Query:  LGSTDMDDHMTEDPPKDAKQKKDW---------------------------SVKELLEFLDFLYSGKEQVHKMFEVCMQFFRAEQKVEFVTSYFMRLKKI
        L STDMDDHMTEDPPKDAKQKKDW                           SVKELLEFLDFLYSGKEQVH+MFEVCMQFFRAEQK E VTSYFMRLKKI
Subjt:  LGSTDMDDHMTEDPPKDAKQKKDW---------------------------SVKELLEFLDFLYSGKEQVHKMFEVCMQFFRAEQKVEFVTSYFMRLKKI

Query:  TAELGLLLPFSPD
         AELGLLLPFSPD
Subjt:  TAELGLLLPFSPD

XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]7.8e-8868.36Show/hide
Query:  SSVLNGEISHRVLFSAKNLFPIAPKIFGCICFVRDVRPHHTKLDPKSLK-IHL--------------------------------FISSPSSSCQGDDDN
        SSVLNGEI +RVLF  K+LFPIAPKIFGC+CFVRDVRPHHTKLDPKSLK I L                                F SSPSS CQG+DDN
Subjt:  SSVLNGEISHRVLFSAKNLFPIAPKIFGCICFVRDVRPHHTKLDPKSLK-IHL--------------------------------FISSPSSSCQGDDDN

Query:  LFIYEVTSPTPSSSTDTPPFRPLISRVYSRRLPPQASDSCPPSMPPSSCNPRPSDDLPIALRKGKSKCTYPVSSFVFYHHLSPLTYAFVTSLDYTFVPNF
        LFIYEVTSPTPS STD  P RPLIS+VYSRR PPQ SDSCPPSM PSSC+P PSDDLPIALRKGK KCTYPVSSF+ YH LSP TYAF+TSL+ T +PN 
Subjt:  LFIYEVTSPTPSSSTDTPPFRPLISRVYSRRLPPQASDSCPPSMPPSSCNPRPSDDLPIALRKGKSKCTYPVSSFVFYHHLSPLTYAFVTSLDYTFVPNF

Query:  VHEALSHPNWRNAMIEEMTALENN----------GKKTIGCNWVLAVKMNPDGTVA
        VHEALSHP W+NAMIEEMTAL++N          GKK IGC WV AVKMNPDGTVA
Subjt:  VHEALSHPNWRNAMIEEMTALENN----------GKKTIGCNWVLAVKMNPDGTVA

XP_031744754.1 uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus]7.8e-8868.36Show/hide
Query:  SSVLNGEISHRVLFSAKNLFPIAPKIFGCICFVRDVRPHHTKLDPKSLK-IHL--------------------------------FISSPSSSCQGDDDN
        SSVLNGEI +RVLF  K+LFPIAPKIFGC+CFVRDVRPHHTKLDPKSLK I L                                F SSPSS CQG+DDN
Subjt:  SSVLNGEISHRVLFSAKNLFPIAPKIFGCICFVRDVRPHHTKLDPKSLK-IHL--------------------------------FISSPSSSCQGDDDN

Query:  LFIYEVTSPTPSSSTDTPPFRPLISRVYSRRLPPQASDSCPPSMPPSSCNPRPSDDLPIALRKGKSKCTYPVSSFVFYHHLSPLTYAFVTSLDYTFVPNF
        LFIYEVTSPTPS STD  P RPLIS+VYSRR PPQ SDSCPPSM PSSC+P PSDDLPIALRKGK KCTYPVSSF+ YH LSP TYAF+TSL+ T +PN 
Subjt:  LFIYEVTSPTPSSSTDTPPFRPLISRVYSRRLPPQASDSCPPSMPPSSCNPRPSDDLPIALRKGKSKCTYPVSSFVFYHHLSPLTYAFVTSLDYTFVPNF

Query:  VHEALSHPNWRNAMIEEMTALENN----------GKKTIGCNWVLAVKMNPDGTVA
        VHEALSHP W+NAMIEEMTAL++N          GKK IGC WV AVKMNPDGTVA
Subjt:  VHEALSHPNWRNAMIEEMTALENN----------GKKTIGCNWVLAVKMNPDGTVA

XP_031744756.1 uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus]7.8e-8868.36Show/hide
Query:  SSVLNGEISHRVLFSAKNLFPIAPKIFGCICFVRDVRPHHTKLDPKSLK-IHL--------------------------------FISSPSSSCQGDDDN
        SSVLNGEI +RVLF  K+LFPIAPKIFGC+CFVRDVRPHHTKLDPKSLK I L                                F SSPSS CQG+DDN
Subjt:  SSVLNGEISHRVLFSAKNLFPIAPKIFGCICFVRDVRPHHTKLDPKSLK-IHL--------------------------------FISSPSSSCQGDDDN

Query:  LFIYEVTSPTPSSSTDTPPFRPLISRVYSRRLPPQASDSCPPSMPPSSCNPRPSDDLPIALRKGKSKCTYPVSSFVFYHHLSPLTYAFVTSLDYTFVPNF
        LFIYEVTSPTPS STD  P RPLIS+VYSRR PPQ SDSCPPSM PSSC+P PSDDLPIALRKGK KCTYPVSSF+ YH LSP TYAF+TSL+ T +PN 
Subjt:  LFIYEVTSPTPSSSTDTPPFRPLISRVYSRRLPPQASDSCPPSMPPSSCNPRPSDDLPIALRKGKSKCTYPVSSFVFYHHLSPLTYAFVTSLDYTFVPNF

Query:  VHEALSHPNWRNAMIEEMTALENN----------GKKTIGCNWVLAVKMNPDGTVA
        VHEALSHP W+NAMIEEMTAL++N          GKK IGC WV AVKMNPDGTVA
Subjt:  VHEALSHPNWRNAMIEEMTALENN----------GKKTIGCNWVLAVKMNPDGTVA

XP_031744758.1 uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus]7.8e-8868.36Show/hide
Query:  SSVLNGEISHRVLFSAKNLFPIAPKIFGCICFVRDVRPHHTKLDPKSLK-IHL--------------------------------FISSPSSSCQGDDDN
        SSVLNGEI +RVLF  K+LFPIAPKIFGC+CFVRDVRPHHTKLDPKSLK I L                                F SSPSS CQG+DDN
Subjt:  SSVLNGEISHRVLFSAKNLFPIAPKIFGCICFVRDVRPHHTKLDPKSLK-IHL--------------------------------FISSPSSSCQGDDDN

Query:  LFIYEVTSPTPSSSTDTPPFRPLISRVYSRRLPPQASDSCPPSMPPSSCNPRPSDDLPIALRKGKSKCTYPVSSFVFYHHLSPLTYAFVTSLDYTFVPNF
        LFIYEVTSPTPS STD  P RPLIS+VYSRR PPQ SDSCPPSM PSSC+P PSDDLPIALRKGK KCTYPVSSF+ YH LSP TYAF+TSL+ T +PN 
Subjt:  LFIYEVTSPTPSSSTDTPPFRPLISRVYSRRLPPQASDSCPPSMPPSSCNPRPSDDLPIALRKGKSKCTYPVSSFVFYHHLSPLTYAFVTSLDYTFVPNF

Query:  VHEALSHPNWRNAMIEEMTALENN----------GKKTIGCNWVLAVKMNPDGTVA
        VHEALSHP W+NAMIEEMTAL++N          GKK IGC WV AVKMNPDGTVA
Subjt:  VHEALSHPNWRNAMIEEMTALENN----------GKKTIGCNWVLAVKMNPDGTVA

TrEMBL top hitse value%identityAlignment
A0A5A7UAV0 Putative Polyprotein8.2e-7539.4Show/hide
Query:  MDDHMTEDPPKDAKQKKDWSVKELLEFLDFLYS--GKEQVHKMFEVCMQFFRAEQKVEFVTSYFMRLKKITAELGLL-----------------------
        MDDHMTED P+DAK+KKD  + +   +L    S   KEQVH+MFEVCMQFFRAEQK EFVT+YFM LKKITAE  +L                       
Subjt:  MDDHMTEDPPKDAKQKKDWSVKELLEFLDFLYS--GKEQVHKMFEVCMQFFRAEQKVEFVTSYFMRLKKITAELGLL-----------------------

Query:  --------------------------------------------------------------------------------LPFSP----------DNTSS
                                                                                         PFSP           +TSS
Subjt:  --------------------------------------------------------------------------------LPFSP----------DNTSS

Query:  V-------------------------------------------------------------------------------LNGEISHRVLFSAKNLFPIA
        V                                                                               LNGEI +RVLF  K+LFPIA
Subjt:  V-------------------------------------------------------------------------------LNGEISHRVLFSAKNLFPIA

Query:  PKIFGCICFVRDVRPHHTKLDPKSLK-IHL--------------------------------FISSPSSSCQGDDDNLFIYEVTSPTPSSSTDTPPFRPL
        PKIFGC+CFVRD+ P HTKLD KSLK I L                                F SSPSS  QG+DDN FIYE+T P     TD PP R L
Subjt:  PKIFGCICFVRDVRPHHTKLDPKSLK-IHL--------------------------------FISSPSSSCQGDDDNLFIYEVTSPTPSSSTDTPPFRPL

Query:  ISRVYSRRLPPQASDSCPPSMPPSSCNPRPSDDLPIALRKGKSKCTYPVSSFVFYHHLSPLTYAFVTSLDYTFVPNFVHEALSHPNWRNAMIEEMTALEN
         SRVYSRR   Q SDSCP SMPPSS +  PSDDLPIALRKGK KCTYP+SSFVFYH LS  TYAF+TS D T +PN VHEALSH  W+NAMIEEMT L++
Subjt:  ISRVYSRRLPPQASDSCPPSMPPSSCNPRPSDDLPIALRKGKSKCTYPVSSFVFYHHLSPLTYAFVTSLDYTFVPNFVHEALSHPNWRNAMIEEMTALEN

Query:  N----------GKKTIGCNWVLAVKMNPDGTVA
        N          GKK IGC WV +VK+NPDG VA
Subjt:  N----------GKKTIGCNWVLAVKMNPDGTVA

A0A5A7VIR8 Putative mitochondrial protein5.9e-6552.17Show/hide
Query:  YSGLLKIEGYAIVLGSTDMDDHMTEDPPKDAKQKKDWSVKELLEFLDFLYS--GKEQVHKMFEVCMQFFRAEQKVEFVTSYFMRLKKITAELGLLLPFSP
        +SGL +++ Y +VL           D PKDAK+KKDW   +   +L    S   KEQ H MFEVCMQFFRAEQK E VTSYFMRL+KI AEL LLL F+P
Subjt:  YSGLLKIEGYAIVLGSTDMDDHMTEDPPKDAKQKKDWSVKELLEFLDFLYS--GKEQVHKMFEVCMQFFRAEQKVEFVTSYFMRLKKITAELGLLLPFSP

Query:  D-------------------------------------------------------------NTSSVLNGEISHRVLFSAKNLFPIAPKIFGCICFVRDV
                                                                        SSVLNGEIS+RVLF  K+LFPI PKIFGC+CFVRDV
Subjt:  D-------------------------------------------------------------NTSSVLNGEISHRVLFSAKNLFPIAPKIFGCICFVRDV

Query:  RPHHTKLDPKSLK-IHLFISS-PSSSCQGDDDNLFIYEVTSPTPSSSTDTPPFRPLISRVYSRRLPPQASDSCPPSMPPSSCNPRPSDDLPIALRKGKS
        R HHTKLDPKSLK I+L  S     SCQG+DDNLFIY++TSPTPSSSTD PP RPL  RVYSRR   Q SDSCPPSMP SSC+  PSDDL IAL KGK+
Subjt:  RPHHTKLDPKSLK-IHLFISS-PSSSCQGDDDNLFIYEVTSPTPSSSTDTPPFRPLISRVYSRRLPPQASDSCPPSMPPSSCNPRPSDDLPIALRKGKS

A0A5D3BGS5 Sterol 3-beta-glucosyltransferase UGT80A2 isoform X35.9e-6552.17Show/hide
Query:  YSGLLKIEGYAIVLGSTDMDDHMTEDPPKDAKQKKDWSVKELLEFLDFLYS--GKEQVHKMFEVCMQFFRAEQKVEFVTSYFMRLKKITAELGLLLPFSP
        +SGL +++ Y +VL           D PKDAK+KKDW   +   +L    S   KEQ H MFEVCMQFFRAEQK E VTSYFMRL+KI AEL LLL F+P
Subjt:  YSGLLKIEGYAIVLGSTDMDDHMTEDPPKDAKQKKDWSVKELLEFLDFLYS--GKEQVHKMFEVCMQFFRAEQKVEFVTSYFMRLKKITAELGLLLPFSP

Query:  D-------------------------------------------------------------NTSSVLNGEISHRVLFSAKNLFPIAPKIFGCICFVRDV
                                                                        SSVLNGEIS+RVLF  K+LFPI PKIFGC+CFVRDV
Subjt:  D-------------------------------------------------------------NTSSVLNGEISHRVLFSAKNLFPIAPKIFGCICFVRDV

Query:  RPHHTKLDPKSLK-IHLFISS-PSSSCQGDDDNLFIYEVTSPTPSSSTDTPPFRPLISRVYSRRLPPQASDSCPPSMPPSSCNPRPSDDLPIALRKGKS
        R HHTKLDPKSLK I+L  S     SCQG+DDNLFIY++TSPTPSSSTD PP RPL  RVYSRR   Q SDSCPPSMP SSC+  PSDDL IAL KGK+
Subjt:  RPHHTKLDPKSLK-IHLFISS-PSSSCQGDDDNLFIYEVTSPTPSSSTDTPPFRPLISRVYSRRLPPQASDSCPPSMPPSSCNPRPSDDLPIALRKGKS

A0A5D3C8E2 Putative Polyprotein8.2e-7539.4Show/hide
Query:  MDDHMTEDPPKDAKQKKDWSVKELLEFLDFLYS--GKEQVHKMFEVCMQFFRAEQKVEFVTSYFMRLKKITAELGLL-----------------------
        MDDHMTED P+DAK+KKD  + +   +L    S   KEQVH+MFEVCMQFFRAEQK EFVT+YFM LKKITAE  +L                       
Subjt:  MDDHMTEDPPKDAKQKKDWSVKELLEFLDFLYS--GKEQVHKMFEVCMQFFRAEQKVEFVTSYFMRLKKITAELGLL-----------------------

Query:  --------------------------------------------------------------------------------LPFSP----------DNTSS
                                                                                         PFSP           +TSS
Subjt:  --------------------------------------------------------------------------------LPFSP----------DNTSS

Query:  V-------------------------------------------------------------------------------LNGEISHRVLFSAKNLFPIA
        V                                                                               LNGEI +RVLF  K+LFPIA
Subjt:  V-------------------------------------------------------------------------------LNGEISHRVLFSAKNLFPIA

Query:  PKIFGCICFVRDVRPHHTKLDPKSLK-IHL--------------------------------FISSPSSSCQGDDDNLFIYEVTSPTPSSSTDTPPFRPL
        PKIFGC+CFVRD+ P HTKLD KSLK I L                                F SSPSS  QG+DDN FIYE+T P     TD PP R L
Subjt:  PKIFGCICFVRDVRPHHTKLDPKSLK-IHL--------------------------------FISSPSSSCQGDDDNLFIYEVTSPTPSSSTDTPPFRPL

Query:  ISRVYSRRLPPQASDSCPPSMPPSSCNPRPSDDLPIALRKGKSKCTYPVSSFVFYHHLSPLTYAFVTSLDYTFVPNFVHEALSHPNWRNAMIEEMTALEN
         SRVYSRR   Q SDSCP SMPPSS +  PSDDLPIALRKGK KCTYP+SSFVFYH LS  TYAF+TS D T +PN VHEALSH  W+NAMIEEMT L++
Subjt:  ISRVYSRRLPPQASDSCPPSMPPSSCNPRPSDDLPIALRKGKSKCTYPVSSFVFYHHLSPLTYAFVTSLDYTFVPNFVHEALSHPNWRNAMIEEMTALEN

Query:  N----------GKKTIGCNWVLAVKMNPDGTVA
        N          GKK IGC WV +VK+NPDG VA
Subjt:  N----------GKKTIGCNWVLAVKMNPDGTVA

A0A5D3E5M8 Copia protein1.7e-5149.21Show/hide
Query:  SSVLNGEISHRVLFSAKNLFPIAPKIFGCICFVRDVRPHHTKLDPKSLKIHL--------------------------------FISSPSSSCQGDDDNL
        SS+LNGEI +RVLF  K+LFPI PKIFGC+C VRDV PHHTKLDPKSLK                                   F SSPSSSC+G+DDNL
Subjt:  SSVLNGEISHRVLFSAKNLFPIAPKIFGCICFVRDVRPHHTKLDPKSLKIHL--------------------------------FISSPSSSCQGDDDNL

Query:  FIYEVTSPTPSSSTDTPPFRPLISRVYSRRLPPQASDSCPPSMPPSSCNPRPSDDLPIALRKGKSKCTYPVSSFVFYHHLSPLTYAFVTSLDYTFVPNFV
        FIYE+T P     T+ PP RPL SRVYS + P Q SDSCP SMPPSSC+  PSDDLPIALRK                                      
Subjt:  FIYEVTSPTPSSSTDTPPFRPLISRVYSRRLPPQASDSCPPSMPPSSCNPRPSDDLPIALRKGKSKCTYPVSSFVFYHHLSPLTYAFVTSLDYTFVPNFV

Query:  HEALSHPNWRNAMIEEMTALENN----------GKKTIGCNWVLAVKMNPDGTV
          ALSHP WRNAMIEEMTAL++N          GKK IGC WV ++K+N +GTV
Subjt:  HEALSHPNWRNAMIEEMTALENN----------GKKTIGCNWVLAVKMNPDGTV

A0A5D3E5M8 Copia protein2.0e-2066.67Show/hide
Query:  MDDHMTEDPPKDAKQKKDWSVKELLEFLDFLYS--GKEQVHKMFEVCMQFFRAEQKVEFVTSYFMRLKKITAELGLLLPFSPDNTSSVLN
        MDDHMTED P+DAK KKDW   +   +L    S   KEQVH+MFEVCMQF RAEQK E VT+YFMRLKKITAEL LLLPFSPD  + +L+
Subjt:  MDDHMTEDPPKDAKQKKDWSVKELLEFLDFLYS--GKEQVHKMFEVCMQFFRAEQKVEFVTSYFMRLKKITAELGLLLPFSPDNTSSVLN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 89.5e-0732.56Show/hide
Query:  YPVSSFVFYHHLSPLTYAFVTSLDYTFVPNFVHEALSHPNWRNAMIEEMTALENN----------GKKTIGCNWVLAVKMNPDGTV
        + +S F+ Y  +SPL ++F+  +     P+  +EA     W  AM +E+ A+E             KK IGC WV  +K N DGT+
Subjt:  YPVSSFVFYHHLSPLTYAFVTSLDYTFVPNFVHEALSHPNWRNAMIEEMTALENN----------GKKTIGCNWVLAVKMNPDGTV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTCGAATGGTCATTGGCTTCCTCCTCCATTAGGCTTAAGTTTAGGTTACCTTTCGCTGACCGTTTGTCGACACCATACTGTAAGAAGTGCCACCTCCGTCTGTGC
AAGTTGTCTTTGCCGATCATTGGGAAACACTGCGCGTGAGCTTCACGAGCCATCGGAATATCCTTGCACATCACCCACGTGCCAGCGCTATTCGGGTTTGCTGAAAATTG
AAGGCTACGCCATTGTTTTAGGAAGTACTGATATGGATGATCACATGACCGAAGATCCCCCAAAAGATGCAAAGCAGAAGAAGGATTGGTCTGTTAAAGAACTTCTGGAA
TTTTTAGATTTTCTATATTCAGGTAAAGAGCAAGTACATAAAATGTTTGAGGTTTGTATGCAATTTTTTCGTGCTGAACAAAAAGTTGAGTTTGTCACCAGCTACTTTAT
GCGACTTAAGAAGATAACTGCTGAGCTTGGCTTGTTGTTACCTTTTAGTCCTGATAATACTTCCTCTGTTCTTAATGGTGAGATTTCCCACCGTGTTCTTTTTTCTGCCA
AAAATTTGTTTCCTATTGCTCCTAAGATATTTGGTTGTATCTGTTTTGTTCGTGACGTTCGCCCTCATCATACTAAGTTAGATCCCAAATCCTTGAAGATACACCTTTTT
ATTTCATCACCATCGAGTTCATGTCAAGGGGATGATGACAATCTTTTTATATATGAGGTTACCTCTCCCACACCATCCTCTTCCACTGATACGCCTCCTTTCCGCCCACT
GATTTCTCGAGTCTACTCCCGACGACTTCCACCACAAGCTTCAGACTCATGTCCTCCATCAATGCCTCCTTCATCATGCAATCCAAGGCCAAGTGATGATCTTCCCATTG
CTCTTCGCAAAGGTAAAAGCAAGTGTACTTACCCTGTTTCTTCCTTTGTTTTCTATCACCACTTGTCTCCCCTCACATATGCTTTTGTTACGTCTCTTGACTACACATTT
GTTCCTAACTTTGTTCATGAAGCTTTGTCTCATCCTAACTGGCGAAATGCAATGATTGAGGAGATGACTGCTTTAGAGAATAATGGAAAGAAGACCATTGGTTGTAATTG
GGTGCTTGCTGTCAAGATGAATCCTGATGGAACAGTGGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTTCGAATGGTCATTGGCTTCCTCCTCCATTAGGCTTAAGTTTAGGTTACCTTTCGCTGACCGTTTGTCGACACCATACTGTAAGAAGTGCCACCTCCGTCTGTGC
AAGTTGTCTTTGCCGATCATTGGGAAACACTGCGCGTGAGCTTCACGAGCCATCGGAATATCCTTGCACATCACCCACGTGCCAGCGCTATTCGGGTTTGCTGAAAATTG
AAGGCTACGCCATTGTTTTAGGAAGTACTGATATGGATGATCACATGACCGAAGATCCCCCAAAAGATGCAAAGCAGAAGAAGGATTGGTCTGTTAAAGAACTTCTGGAA
TTTTTAGATTTTCTATATTCAGGTAAAGAGCAAGTACATAAAATGTTTGAGGTTTGTATGCAATTTTTTCGTGCTGAACAAAAAGTTGAGTTTGTCACCAGCTACTTTAT
GCGACTTAAGAAGATAACTGCTGAGCTTGGCTTGTTGTTACCTTTTAGTCCTGATAATACTTCCTCTGTTCTTAATGGTGAGATTTCCCACCGTGTTCTTTTTTCTGCCA
AAAATTTGTTTCCTATTGCTCCTAAGATATTTGGTTGTATCTGTTTTGTTCGTGACGTTCGCCCTCATCATACTAAGTTAGATCCCAAATCCTTGAAGATACACCTTTTT
ATTTCATCACCATCGAGTTCATGTCAAGGGGATGATGACAATCTTTTTATATATGAGGTTACCTCTCCCACACCATCCTCTTCCACTGATACGCCTCCTTTCCGCCCACT
GATTTCTCGAGTCTACTCCCGACGACTTCCACCACAAGCTTCAGACTCATGTCCTCCATCAATGCCTCCTTCATCATGCAATCCAAGGCCAAGTGATGATCTTCCCATTG
CTCTTCGCAAAGGTAAAAGCAAGTGTACTTACCCTGTTTCTTCCTTTGTTTTCTATCACCACTTGTCTCCCCTCACATATGCTTTTGTTACGTCTCTTGACTACACATTT
GTTCCTAACTTTGTTCATGAAGCTTTGTCTCATCCTAACTGGCGAAATGCAATGATTGAGGAGATGACTGCTTTAGAGAATAATGGAAAGAAGACCATTGGTTGTAATTG
GGTGCTTGCTGTCAAGATGAATCCTGATGGAACAGTGGCTTGA
Protein sequenceShow/hide protein sequence
MVSNGHWLPPPLGLSLGYLSLTVCRHHTVRSATSVCASCLCRSLGNTARELHEPSEYPCTSPTCQRYSGLLKIEGYAIVLGSTDMDDHMTEDPPKDAKQKKDWSVKELLE
FLDFLYSGKEQVHKMFEVCMQFFRAEQKVEFVTSYFMRLKKITAELGLLLPFSPDNTSSVLNGEISHRVLFSAKNLFPIAPKIFGCICFVRDVRPHHTKLDPKSLKIHLF
ISSPSSSCQGDDDNLFIYEVTSPTPSSSTDTPPFRPLISRVYSRRLPPQASDSCPPSMPPSSCNPRPSDDLPIALRKGKSKCTYPVSSFVFYHHLSPLTYAFVTSLDYTF
VPNFVHEALSHPNWRNAMIEEMTALENNGKKTIGCNWVLAVKMNPDGTVA