; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc02g0045681 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc02g0045681
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCMiso1.1chr02:9724014..9724601
RNA-Seq ExpressionCmc02g0045681
SyntenyCmc02g0045681
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038222.1 Copia protein [Cucumis melo var. makuwa]1.3e-107100Show/hide
Query:  VDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIHQLLSE
        VDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIHQLLSE
Subjt:  VDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIHQLLSE

Query:  TCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA
        TCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA
Subjt:  TCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA

TYK30615.1 Copia protein [Cucumis melo var. makuwa]3.0e-10799.47Show/hide
Query:  VDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIHQLLSE
        VDHWAVVEQILCYSKAAPGRGILY+DHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIHQLLSE
Subjt:  VDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIHQLLSE

Query:  TCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA
        TCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA
Subjt:  TCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA

XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]8.6e-9989.23Show/hide
Query:  MSSPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIH
        MSSPTVDHWA VEQILCY KAAPGRGILY+DHG+TRVECFSDADWAGSR+D+RSTSGYCVFVG NLV WKSKKQNV+SRSSAESEYRAM QSVC IVWIH
Subjt:  MSSPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIH

Query:  QLLSETCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA
        QLLSE  FSITVPAKLWCDNQ ALHIASNPVFHE+TK++EVDCHFIREKIQDGLVSTGYVKTGE+LGDILTKA+NG RISYLCNKL MIDIFAPA
Subjt:  QLLSETCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA

XP_031744754.1 uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus]8.6e-9989.23Show/hide
Query:  MSSPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIH
        MSSPTVDHWA VEQILCY KAAPGRGILY+DHG+TRVECFSDADWAGSR+D+RSTSGYCVFVG NLV WKSKKQNV+SRSSAESEYRAM QSVC IVWIH
Subjt:  MSSPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIH

Query:  QLLSETCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA
        QLLSE  FSITVPAKLWCDNQ ALHIASNPVFHE+TK++EVDCHFIREKIQDGLVSTGYVKTGE+LGDILTKA+NG RISYLCNKL MIDIFAPA
Subjt:  QLLSETCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA

XP_031744758.1 uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus]8.6e-9989.23Show/hide
Query:  MSSPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIH
        MSSPTVDHWA VEQILCY KAAPGRGILY+DHG+TRVECFSDADWAGSR+D+RSTSGYCVFVG NLV WKSKKQNV+SRSSAESEYRAM QSVC IVWIH
Subjt:  MSSPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIH

Query:  QLLSETCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA
        QLLSE  FSITVPAKLWCDNQ ALHIASNPVFHE+TK++EVDCHFIREKIQDGLVSTGYVKTGE+LGDILTKA+NG RISYLCNKL MIDIFAPA
Subjt:  QLLSETCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA

TrEMBL top hitse value%identityAlignment
A0A5A7T406 Copia protein6.4e-108100Show/hide
Query:  VDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIHQLLSE
        VDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIHQLLSE
Subjt:  VDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIHQLLSE

Query:  TCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA
        TCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA
Subjt:  TCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA

A0A5A7UHS1 Retrovirus-related Pol polyprotein from transposon TNT 1-943.4e-9385.13Show/hide
Query:  MSSPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIH
        MS PTVDHWA VEQILCY KAA GRGILY+DHG+T+V+CFSDADW GSR+D+RS SGYCVFVG NLV WKSKKQNV+S SSA+SEYRAM QSVC IVWIH
Subjt:  MSSPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIH

Query:  QLLSETCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA
        QLLSE  FSITVP KLWCDNQVALHIASNPVFHEQTK++EVDCHFIREKIQDGL+STGYVKTGE+LGDILTK VNGARISYL  KLDMIDIFAPA
Subjt:  QLLSETCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA

A0A5D3DZU1 Retrovirus-related Pol polyprotein from transposon TNT 1-943.4e-9385.13Show/hide
Query:  MSSPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIH
        MS PTVDHWA VEQILCY KAA GRGILY+DHG+T+V+CFSDADW GSR+D+RS SGYCVFVG NLV WKSKKQNV+S SSA+SEYRAM QSVC IVWIH
Subjt:  MSSPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIH

Query:  QLLSETCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA
        QLLSE  FSITVP KLWCDNQVALHIASNPVFHEQTK++EVDCHFIREKIQDGL+STGYVKTGE+LGDILTK VNGARISYL  KLDMIDIFAPA
Subjt:  QLLSETCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA

A0A5D3E5M8 Copia protein1.4e-10799.47Show/hide
Query:  VDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIHQLLSE
        VDHWAVVEQILCYSKAAPGRGILY+DHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIHQLLSE
Subjt:  VDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIHQLLSE

Query:  TCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA
        TCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA
Subjt:  TCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA

A0A6P6VCZ7 uncharacterized protein LOC1137194882.1e-8774.87Show/hide
Query:  MSSPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIH
        MSSPTVDHW  VEQ+L Y K APGRGILY +HG+TR+ECFSD+DWAG ++D+RSTSGYCVFVG NLV WKSKKQNV+SRSSAE+EYRAM +SVC ++W++
Subjt:  MSSPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIH

Query:  QLLSETCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA
        QLLSE    ++VPAKLWCDNQ ALHIASNPVFHE+TK++E+DCHF+REKIQ GL++TGYVKTGE+LGDI TKA+NG RI YLCNKL MI+I+APA
Subjt:  QLLSETCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.0e-2732.07Show/hide
Query:  WAVVEQILCYSKAAPGRGILYRDH--GYTRVECFSDADWAGSRKDKRSTSGYCVFVGE-NLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIHQLLSE
        W  ++++L Y K      ++++ +     ++  + D+DWAGS  D++ST+GY   + + NL+ W +K+QN ++ SS E+EY A+ ++V   +W+  LL+ 
Subjt:  WAVVEQILCYSKAAPGRGILYRDH--GYTRVECFSDADWAGSRKDKRSTSGYCVFVGE-NLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIHQLLSE

Query:  TCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMI
            +  P K++ DNQ  + IA+NP  H++ K++++  HF RE++Q+ ++   Y+ T  +L DI TK +  AR   L +KL ++
Subjt:  TCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.9e-2028.49Show/hide
Query:  MSSPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIH
        + +P  +HW  V+ IL Y +   G  + +       ++ ++DAD AG   +++S++GY        + W+SK Q  ++ S+ E+EY A T++   ++W+ 
Subjt:  MSSPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIH

Query:  QLLSETCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKL
        + L E          ++CD+Q A+ ++ N ++H +TK+++V  H+IRE + D  +    + T E   D+LTK V   +   LC +L
Subjt:  QLLSETCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKL

P92519 Uncharacterized mitochondrial protein AtMg008104.3e-1635.71Show/hide
Query:  MSSPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVW
        M  PT+  + +++++L Y K     G+    +    V+ F D+DWAG    +RST+G+C F+G N++ W +K+Q  +SRSS E+EYRA+  +   + W
Subjt:  MSSPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.0e-3536.56Show/hide
Query:  MSSPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIH
        M  PT +H   +++IL Y    P  GI  +      +  +SDADWAG + D  ST+GY V++G + + W SKKQ  + RSS E+EYR++  +   + WI 
Subjt:  MSSPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIH

Query:  QLLSETCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKL
         LL+E    +T P  ++CDN  A ++ +NPVFH + K++ +D HFIR ++Q G +   +V T ++L D LTK ++        +K+
Subjt:  QLLSETCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.8e-3636.13Show/hide
Query:  MSSPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIH
        M  PT DHW  ++++L Y    P  GI  +      +  +SDADWAG   D  ST+GY V++G + + W SKKQ  + RSS E+EYR++  +   + WI 
Subjt:  MSSPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIH

Query:  QLLSETCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDI
         LL+E    ++ P  ++CDN  A ++ +NPVFH + K++ +D HFIR ++Q G +   +V T ++L D LTK ++         K+ +I +
Subjt:  QLLSETCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.4e-3543.54Show/hide
Query:  SPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIHQL
        +P + H   V +IL Y K   G+G+ Y      +++ FSDA +   +  +RST+GYC+F+G +L+ WKSKKQ V+S+SSAE+EYRA++ +   ++W+ Q 
Subjt:  SPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIHQL

Query:  LSETCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREK
          E    ++ P  L+CDN  A+HIA+N VFHE+TK++E DCH +RE+
Subjt:  LSETCFSITVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREK

ATMG00810.1 DNA/RNA polymerases superfamily protein3.0e-1735.71Show/hide
Query:  MSSPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVW
        M  PT+  + +++++L Y K     G+    +    V+ F D+DWAG    +RST+G+C F+G N++ W +K+Q  +SRSS E+EYRA+  +   + W
Subjt:  MSSPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATCCCCTACAGTGGATCATTGGGCTGTAGTAGAGCAAATTCTATGTTATTCAAAAGCTGCTCCTGGACGTGGGATCTTATACAGAGATCATGGATATACGAGAGT
TGAATGTTTTTCTGATGCTGATTGGGCGGGATCTCGAAAGGATAAGAGATCAACCTCTGGATATTGTGTCTTTGTAGGCGAAAACTTGGTACCATGGAAGAGTAAGAAAC
AAAATGTTATTTCTCGTTCGAGTGCTGAGTCAGAATATAGAGCTATGACACAATCTGTGTGCGCAATAGTATGGATTCACCAACTATTATCTGAGACATGCTTCAGTATT
ACAGTGCCAGCTAAATTATGGTGTGATAATCAAGTTGCACTTCACATTGCATCTAATCCAGTATTTCATGAACAAACTAAAAATGTTGAGGTGGATTGTCACTTCATTCG
TGAGAAAATCCAAGATGGGTTGGTGTCCACCGGATATGTGAAGACCGGAGAAAAATTGGGAGATATTTTGACCAAAGCTGTAAATGGAGCAAGGATAAGCTATCTGTGTA
ACAAGCTAGACATGATCGACATATTTGCTCCAGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCATCCCCTACAGTGGATCATTGGGCTGTAGTAGAGCAAATTCTATGTTATTCAAAAGCTGCTCCTGGACGTGGGATCTTATACAGAGATCATGGATATACGAGAGT
TGAATGTTTTTCTGATGCTGATTGGGCGGGATCTCGAAAGGATAAGAGATCAACCTCTGGATATTGTGTCTTTGTAGGCGAAAACTTGGTACCATGGAAGAGTAAGAAAC
AAAATGTTATTTCTCGTTCGAGTGCTGAGTCAGAATATAGAGCTATGACACAATCTGTGTGCGCAATAGTATGGATTCACCAACTATTATCTGAGACATGCTTCAGTATT
ACAGTGCCAGCTAAATTATGGTGTGATAATCAAGTTGCACTTCACATTGCATCTAATCCAGTATTTCATGAACAAACTAAAAATGTTGAGGTGGATTGTCACTTCATTCG
TGAGAAAATCCAAGATGGGTTGGTGTCCACCGGATATGTGAAGACCGGAGAAAAATTGGGAGATATTTTGACCAAAGCTGTAAATGGAGCAAGGATAAGCTATCTGTGTA
ACAAGCTAGACATGATCGACATATTTGCTCCAGCTTGA
Protein sequenceShow/hide protein sequence
MSSPTVDHWAVVEQILCYSKAAPGRGILYRDHGYTRVECFSDADWAGSRKDKRSTSGYCVFVGENLVPWKSKKQNVISRSSAESEYRAMTQSVCAIVWIHQLLSETCFSI
TVPAKLWCDNQVALHIASNPVFHEQTKNVEVDCHFIREKIQDGLVSTGYVKTGEKLGDILTKAVNGARISYLCNKLDMIDIFAPA