; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0063901 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0063901
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationCMiso1.1chr03:4423803..4424432
RNA-Seq ExpressionCmc03g0063901
SyntenyCmc03g0063901
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ABI34306.1 Polyprotein, putative [Solanum demissum]1.4e-6258.37Show/hide
Query:  MFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKFVIRVTKPLELIHSNLCEFDGILTRNSKRY
        MFKLN+E+NK ++S YML+S N WHARLCH+N R +  MS L LIP +   +FEKC  CS+AKITK  H  V R T  LEL+H+++CE  GILTR   RY
Subjt:  MFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKFVIRVTKPLELIHSNLCEFDGILTRNSKRY

Query:  VVTFIDDCSEYTFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELVVAILLE
         +TFIDD S++T++YL KNKSDA+E F+ ++ E+ENQF ++IKR RSDRG EY+S  FN F  S GII++TT PYSP  NG AERKNRTL EL  A+L+E
Subjt:  VVTFIDDCSEYTFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELVVAILLE

Query:  SGAAPSWWG
        S A  ++WG
Subjt:  SGAAPSWWG

CAN66576.1 hypothetical protein VITISV_016964 [Vitis vinifera]1.5e-4846.4Show/hide
Query:  MFKLN--------LEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKFVIRVTKPLELIHSNLCEFDGI
        +FK+N        +  NKI SSAY+L S N+WH RL HVN   +  +  L+ +PK ++    KC  C ++K+TK+    V R T+PL+L HS++C+   +
Subjt:  MFKLN--------LEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKFVIRVTKPLELIHSNLCEFDGI

Query:  LTRNSKRYVVTFIDDCSEYTFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTE
         TR  K+Y +TFIDDC+ Y ++YL K+K +A EMF+ +  E+ENQ +K+IK  RSDRG EY+S  F EF    GII++T APYSP+ NG AERKNRTL E
Subjt:  LTRNSKRYVVTFIDDCSEYTFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTE

Query:  LVVAILLESGAAPSW-----WG
        ++ A+LL SG  P +     WG
Subjt:  LVVAILLESGAAPSW-----WG

KAA0034938.1 putative Polyprotein [Cucumis melo var. makuwa]7.1e-10792.34Show/hide
Query:  MFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKFVIRVTKPLELIHSNLCEFDGILTRNSKRY
        MFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITK SHKFV RVTKPLELIHS+LCEFDG LTRNSKRY
Subjt:  MFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKFVIRVTKPLELIHSNLCEFDGILTRNSKRY

Query:  VVTFIDDCSEYTFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELVVAILLE
        VVTFIDDCS+YTFIYL KNKSDAYEMF+VFVTEIENQFNKRIKR RSDRGTEYDS+AFNEFYNSKGII++TT PYSP+MNGK ERKNRTLTEL VAILLE
Subjt:  VVTFIDDCSEYTFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELVVAILLE

Query:  SGAAPSWWG
        S AAPSWWG
Subjt:  SGAAPSWWG

KAG5527251.1 hypothetical protein RHGRI_028223 [Rhododendron griersonianum]2.0e-4848.82Show/hide
Query:  MFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHD--FEKCACCSQAKITKISHKFVIRVTKPLELIHSNLCEFDGILTRNSK
        MFKL++  N  ASS Y++ SF++WH+RL H+N R I NMSR  LI   + HD   +KC  C++AK+ K S   V R ++ L+LIHS++CE +G+LTR  K
Subjt:  MFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHD--FEKCACCSQAKITKISHKFVIRVTKPLELIHSNLCEFDGILTRNSK

Query:  RYVVTFIDDCSEYTFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELVVAIL
        RY  TF+DD S+YTF+YL + K + +  F+ +  E+ENQ NK+IK  RSDRG EY    F++F    GII++ TAPYSP+ NG AERKNRTLTE+   ++
Subjt:  RYVVTFIDDCSEYTFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELVVAIL

Query:  LESGAAPSWWG
        + + A    WG
Subjt:  LESGAAPSWWG

RVW26252.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]5.1e-4946.54Show/hide
Query:  MFKLN--------LEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKFVIRVTKPLELIHSNLCEFDGI
        +FK+N        +  NKI SSAY+L S N+WH RL HVN   +  +  L+ +PK ++    KC  C ++K+TK+    V R T+PL+LIH+++C+   +
Subjt:  MFKLN--------LEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKFVIRVTKPLELIHSNLCEFDGI

Query:  LTRNSKRYVVTFIDDCSEYTFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTE
         TR  K+Y +TFIDDC+ Y ++YL ++K +A EMF+ +  E+ENQ +K+IK  RSDRG EY+S+ F EF    GII++TTAPYSP+ NG AE KNRTL E
Subjt:  LTRNSKRYVVTFIDDCSEYTFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTE

Query:  LVVAILLESGAAPSWWG
        ++ A+LL SG   + WG
Subjt:  LVVAILLESGAAPSWWG

TrEMBL top hitse value%identityAlignment
A0A2N9F8T0 Integrase catalytic domain-containing protein1.0e-5048.84Show/hide
Query:  MFKLN------LEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKFVIRVTKPLELIHSNLCEFDGILT
        +FK+N      +  NK  SSAY+L S NVWH RL HVN R +  +  LNL+PK  +    KC  C +AK+T+ S   + R + PLELIHS++C+   + T
Subjt:  MFKLN------LEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKFVIRVTKPLELIHSNLCEFDGILT

Query:  RNSKRYVVTFIDDCSEYTFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELV
        R  ++Y VTFIDDCS Y ++YL ++K +A E F  +  E+ENQ NK+IK  RSDRG EY+S  F EF +  GI+++TTAPYSP+ NG AERKNRTL E++
Subjt:  RNSKRYVVTFIDDCSEYTFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELV

Query:  VAILLESGAAPSWWG
         A+L+ SG   + WG
Subjt:  VAILLESGAAPSWWG

A0A5D3DCJ1 Putative Polyprotein3.4e-10792.34Show/hide
Query:  MFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKFVIRVTKPLELIHSNLCEFDGILTRNSKRY
        MFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITK SHKFV RVTKPLELIHS+LCEFDG LTRNSKRY
Subjt:  MFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKFVIRVTKPLELIHSNLCEFDGILTRNSKRY

Query:  VVTFIDDCSEYTFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELVVAILLE
        VVTFIDDCS+YTFIYL KNKSDAYEMF+VFVTEIENQFNKRIKR RSDRGTEYDS+AFNEFYNSKGII++TT PYSP+MNGK ERKNRTLTEL VAILLE
Subjt:  VVTFIDDCSEYTFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELVVAILLE

Query:  SGAAPSWWG
        S AAPSWWG
Subjt:  SGAAPSWWG

A0A7N2L531 Uncharacterized protein1.3e-6965.71Show/hide
Query:  MFKLNLEINKIA-SSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKFVIRVTKPLELIHSNLCEFDGILTRNSKR
        MFKLN+E NK + SS YML+S N WHARLCH+N R +  MS L LIP+LS  DFEKC  CSQAKITK  HK V+R T+ LELIHS+LCEF+GILTR   R
Subjt:  MFKLNLEINKIA-SSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKFVIRVTKPLELIHSNLCEFDGILTRNSKR

Query:  YVVTFIDDCSEYTFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELVVAILL
        Y++TFIDD S+YT IYL KNKSDA+E F+ F+ E+ENQF ++IKR RSDRG EY+S AFN F  S GII++TTAPYSP  NG AERKNRTL EL  A+L+
Subjt:  YVVTFIDDCSEYTFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELVVAILL

Query:  ESGAAPSWWG
        ESGA   +WG
Subjt:  ESGAAPSWWG

A0A7N2R9F3 Uncharacterized protein2.2e-6965.24Show/hide
Query:  MFKLNLEINKIA-SSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKFVIRVTKPLELIHSNLCEFDGILTRNSKR
        MFKLN+E NK + SS YML+S N WHARLCH+N R +  MS L LIP+LS  DFEKC  CSQAKITK  HK V+R T+ LELIHS+LCEF+GILTR   R
Subjt:  MFKLNLEINKIA-SSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKFVIRVTKPLELIHSNLCEFDGILTRNSKR

Query:  YVVTFIDDCSEYTFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELVVAILL
        Y++TFIDD S+YT IYL KNKSDA+E F+ F+ E+ENQF ++IKR RSDRG EY+S AFN F  S GII++TTAPYSP  NG  ERKNRTL EL  A+L+
Subjt:  YVVTFIDDCSEYTFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELVVAILL

Query:  ESGAAPSWWG
        ESGA   +WG
Subjt:  ESGAAPSWWG

Q0KIN7 Polyprotein, putative6.7e-6358.37Show/hide
Query:  MFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKFVIRVTKPLELIHSNLCEFDGILTRNSKRY
        MFKLN+E+NK ++S YML+S N WHARLCH+N R +  MS L LIP +   +FEKC  CS+AKITK  H  V R T  LEL+H+++CE  GILTR   RY
Subjt:  MFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKFVIRVTKPLELIHSNLCEFDGILTRNSKRY

Query:  VVTFIDDCSEYTFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELVVAILLE
         +TFIDD S++T++YL KNKSDA+E F+ ++ E+ENQF ++IKR RSDRG EY+S  FN F  S GII++TT PYSP  NG AERKNRTL EL  A+L+E
Subjt:  VVTFIDDCSEYTFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELVVAILLE

Query:  SGAAPSWWG
        S A  ++WG
Subjt:  SGAAPSWWG

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.3e-2333.17Show/hide
Query:  SFNVWHARLCHVN---------KRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKFVIRVTKPLELIHSNLCEFDGILTRNSKRYVVTFIDDCSE
        +F +WH R  H++         K + S+ S LN + +LS    E C    QA++     K    + +PL ++HS++C     +T + K Y V F+D  + 
Subjt:  SFNVWHARLCHVN---------KRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKFVIRVTKPLELIHSNLCEFDGILTRNSKRYVVTFIDDCSE

Query:  YTFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELVVAILLESGAAPSWWG
        Y   YL K KSD + MF+ FV + E  FN ++     D G EY S    +F   KGI Y  T P++P++NG +ER  RT+TE    ++  +    S+WG
Subjt:  YTFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELVVAILLESGAAPSWWG

P0C2J7 Transposon Ty4-H Gag-Pol polyprotein5.4e-0928.7Show/hide
Query:  NSKRYVVTFIDDCSEY--TFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTEL
        ++KRY++  +D+ + Y  T  +  KN           +  +E QF+++++   SDRGTE+ +    E++ SKGI +  T+      NG+AER  RT+   
Subjt:  NSKRYVVTFIDDCSEY--TFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTEL

Query:  VVAILLESGAAPSWW
           +L +S     +W
Subjt:  VVAILLESGAAPSWW

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-949.2e-2532.46Show/hide
Query:  SFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKFVI-RVTKPLELIHSNLCEFDGILTRNSKRYVVTFIDDCSEYTFIYLFK
        S ++WH R+ H++++ +  +++ +LI        + C  C   K  ++S +    R    L+L++S++C    I +    +Y VTFIDD S   ++Y+ K
Subjt:  SFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKFVI-RVTKPLELIHSNLCEFDGILTRNSKRYVVTFIDDCSEYTFIYLFK

Query:  NKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELVVAILLESGAAPSWWG
         K   +++F+ F   +E +  +++KR RSD G EY S  F E+ +S GI ++ T P +P+ NG AER NRT+ E V ++L  +    S+WG
Subjt:  NKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELVVAILLESGAAPSWWG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.9e-2235.26Show/hide
Query:  WHARLCH----VNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKIS-HKFVIRVTKPLELIHSNLCEFDGILTRNSKRYVVTFIDDCSEYTFIYLFK
        WHARL H    +   +ISN S   L P    H F  C+ C   K  K+   +  I  T+PLE I+S++     IL+ ++ RY V F+D  + YT++Y  K
Subjt:  WHARLCH----VNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKIS-HKFVIRVTKPLELIHSNLCEFDGILTRNSKRYVVTFIDDCSEYTFIYLFK

Query:  NKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELVVAILLESGAAPSWW
         KS   E F  F   +EN+F  RI  F SD G E+  +A  E+++  GI + T+ P++P+ NG +ERK+R + E  + +L  +    ++W
Subjt:  NKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELVVAILLESGAAPSWW

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.1e-2031.58Show/hide
Query:  WHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKF-----VIRVTKPLELIHSNLCEFDGILTRNSKRYVVTFIDDCSEYTFIYLFK
        WH+RL H +  +++++   + +P   L+   K   CS   I K SHK       I  +KPLE I+S++     IL+ ++ RY V F+D  + YT++Y  K
Subjt:  WHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKF-----VIRVTKPLELIHSNLCEFDGILTRNSKRYVVTFIDDCSEYTFIYLFK

Query:  NKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELVVAILLESGAAPSWW
         KS   + F +F + +EN+F  RI    SD G E+  +   ++ +  GI + T+ P++P+ NG +ERK+R + E+ + +L  +    ++W
Subjt:  NKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELVVAILLESGAAPSWW

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCAAATTGAATCTAGAAATTAATAAGATTGCATCTTCTGCTTACATGTTGACTTCTTTTAATGTTTGGCATGCTAGACTTTGTCATGTTAATAAAAGATTAATTAG
TAACATGAGTAGGTTAAATCTTATACCTAAGTTATCTCTGCATGATTTTGAGAAATGTGCATGTTGTAGTCAAGCTAAGATAACTAAAATCTCGCATAAGTTTGTAATTA
GAGTAACAAAGCCTTTAGAATTAATTCATTCTAACTTATGTGAATTTGATGGCATTTTAACTAGAAACAGTAAAAGGTATGTAGTTACCTTTATAGATGACTGTTCTGAA
TACACTTTTATTTATCTGTTTAAAAATAAAAGTGATGCTTATGAAATGTTCGAAGTCTTTGTAACTGAAATAGAGAACCAATTTAACAAAAGAATTAAGAGATTTCGTAG
TGATAGAGGAACTGAATATGATTCAATTGCTTTTAATGAATTTTATAACTCAAAAGGAATAATATATAAAACTACTGCGCCTTATTCTCCTAAAATGAATGGAAAAGCAG
AAAGAAAGAATAGAACTCTAACTGAGTTAGTAGTTGCTATCTTACTTGAGTCAGGAGCAGCACCATCTTGGTGGGGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTCAAATTGAATCTAGAAATTAATAAGATTGCATCTTCTGCTTACATGTTGACTTCTTTTAATGTTTGGCATGCTAGACTTTGTCATGTTAATAAAAGATTAATTAG
TAACATGAGTAGGTTAAATCTTATACCTAAGTTATCTCTGCATGATTTTGAGAAATGTGCATGTTGTAGTCAAGCTAAGATAACTAAAATCTCGCATAAGTTTGTAATTA
GAGTAACAAAGCCTTTAGAATTAATTCATTCTAACTTATGTGAATTTGATGGCATTTTAACTAGAAACAGTAAAAGGTATGTAGTTACCTTTATAGATGACTGTTCTGAA
TACACTTTTATTTATCTGTTTAAAAATAAAAGTGATGCTTATGAAATGTTCGAAGTCTTTGTAACTGAAATAGAGAACCAATTTAACAAAAGAATTAAGAGATTTCGTAG
TGATAGAGGAACTGAATATGATTCAATTGCTTTTAATGAATTTTATAACTCAAAAGGAATAATATATAAAACTACTGCGCCTTATTCTCCTAAAATGAATGGAAAAGCAG
AAAGAAAGAATAGAACTCTAACTGAGTTAGTAGTTGCTATCTTACTTGAGTCAGGAGCAGCACCATCTTGGTGGGGGTGA
Protein sequenceShow/hide protein sequence
MFKLNLEINKIASSAYMLTSFNVWHARLCHVNKRLISNMSRLNLIPKLSLHDFEKCACCSQAKITKISHKFVIRVTKPLELIHSNLCEFDGILTRNSKRYVVTFIDDCSE
YTFIYLFKNKSDAYEMFEVFVTEIENQFNKRIKRFRSDRGTEYDSIAFNEFYNSKGIIYKTTAPYSPKMNGKAERKNRTLTELVVAILLESGAAPSWWG