; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G04892 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G04892
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionzf-RVT domain-containing protein
Genome locationClcChr08:15052229..15054575
RNA-Seq ExpressionClc08G04892
SyntenyClc08G04892
Gene Ontology termsGO:0009987 - cellular process (biological process)
GO:0016740 - transferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032570.1 protein SAND [Cucumis melo var. makuwa]7.2e-4150.26Show/hide
Query:  FFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRSVHGSTPFKWHTAGKDDSGLRSPWISISRSWLKIEELA
        FFWEGN   KLN+LVKWE+ ++SQ DG LGL  L+ RN+ALLAKW WRF+ E +SLW  V+RS+H S  F WHT+GK+ S LRSPWISISR   K+E LA
Subjt:  FFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRSVHGSTPFKWHTAGKDDSGLRSPWISISRSWLKIEELA

Query:  YSNLG-----------------MVAELPFDY-LELP---SISEHWDSSSSSWSIFFRRLLKEEEIADFQNLLGIISNKKVTEYFGRADMKALW
           +G                 +  + P  Y + LP   S++ +WDSSSSSWSI FR LLKEEEI+DFQ LL  +  ++ T      D K  W
Subjt:  YSNLG-----------------MVAELPFDY-LELP---SISEHWDSSSSSWSIFFRRLLKEEEIADFQNLLGIISNKKVTEYFGRADMKALW

KAA0035739.1 hypothetical protein E6C27_scaffold403G00100 [Cucumis melo var. makuwa]4.5e-5938.5Show/hide
Query:  MRNFFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRSVHGSTPFKWHTAGKDDSGLRSPWISISRSWLKIE
        MRNFF EG+   K+N LV W   S   KDGGLGLGG++  N ALLAKWGWR+ KE+++LW  VIRS+HG   F W T GK  + LRSPW++I+R W  ++
Subjt:  MRNFFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRSVHGSTPFKWHTAGKDDSGLRSPWISISRSWLKIE

Query:  ELAYSNLGMVAELPF---DYLELPSISEH---------WDSSS---SSWSIFFRRLLKEEEIADFQNLLGIISNKKVT----------EYFGRADMK---
         LA  NLG    + F    ++    I E          W   S    SW + FRR L++EEI +FQ+LL ++S +KV           E  GR   K   
Subjt:  ELAYSNLGMVAELPF---DYLELPSISEH---------WDSSS---SSWSIFFRRLLKEEEIADFQNLLGIISNKKVT----------EYFGRADMK---

Query:  ---------------ALWKSKSPRQVNITIWIMLNGYLNFSSVMQRKLAAHCLSPHICPLCM-ADKELQHL-----------------------------
                       A+ +S SPR++NI IWIM+  ++  S ++Q+K   + +SP ICPLC+ A K L H+                             
Subjt:  ---------------ALWKSKSPRQVNITIWIMLNGYLNFSSVMQRKLAAHCLSPHICPLCM-ADKELQHL-----------------------------

Query:  ENVLQILVGPTFKKGPKILWSNAVKAILFEIWFERNQRVFHNKATPRLDCFEIAHLNATSW
         +V+Q+L G    K P+I+W    KA+L EIW ERNQR+FH+KA  R +    A LNA +W
Subjt:  ENVLQILVGPTFKKGPKILWSNAVKAILFEIWFERNQRVFHNKATPRLDCFEIAHLNATSW

KAB2635258.1 hypothetical protein D8674_025792 [Pyrus ussuriensis x Pyrus communis]7.9e-4035.53Show/hide
Query:  MRNFFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRSVHGSTPFKWHTAGKDDSGLRSPWISIS---RSWL
        M+ F WEG + GK NHLVKWEI  +S+++GGLG+G LRN+N ALLAKW WRF KE NSLW  VIRS +G     W+         RSPW  IS   +S+L
Subjt:  MRNFFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRSVHGSTPFKWHTAGKDDSGLRSPWISIS---RSWL

Query:  KIEELAYSNLGMVAELPFDYLELPSISEHW--------------------DSSSSSWSIFFRRLLKEEEIADFQNLLGIISNKK----VTEYFGRADMKA
        +  +    N   V      +LE   + E +                      +S SW+  FRR L E EI +   LL  + N +    + ++F  +    
Subjt:  KIEELAYSNLGMVAELPFDYLELPSISEHW--------------------DSSSSSWSIFFRRLLKEEEIADFQNLLGIISNKK----VTEYFGRADMKA

Query:  LWKSKSPRQVNITIWIMLNGYLNFSSVMQRKLAAHCLSPHICPLCMADKE-----LQHLENVLQILV---GPTFKKGPKILWSNAVKAILFEIWFERNQR
        +WKSK P +V + +W++  G LN    +QR+    C+SPH C LC A +E       H    +Q+          +  K LW   V A+ + IW ERN+R
Subjt:  LWKSKSPRQVNITIWIMLNGYLNFSSVMQRKLAAHCLSPHICPLCMADKE-----LQHLENVLQILV---GPTFKKGPKILWSNAVKAILFEIWFERNQR

Query:  VFHN
        +F +
Subjt:  VFHN

TYK14440.1 uncharacterized protein E5676_scaffold186G00990 [Cucumis melo var. makuwa]3.1e-4443.4Show/hide
Query:  MRNFFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRSVHGSTPFKWHTAGKDDSGLRSPWISISRSWLKIE
        +RNFFWEGN G K+NH V W+  + S  DG LGLGG+RN+++ALLAKWGWR+MKE+ +LW  V+RS+HG   F W T  K  + LRSPW+ ISR W K+E
Subjt:  MRNFFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRSVHGSTPFKWHTAGKDDSGLRSPWISISRSWLKIE

Query:  ELAYSNLG-----------MVAELPFD-------YLELPSISEHWDSSSSSWSIFFRRLLKEEEI---ADFQNLLGIISNKKVTEYFGRADMKALWKSKS
         LA   LG              E+P +        L   S++ HWDS ++SWSI FRRLL    +     F + L I   +K       A  KALWK+ S
Subjt:  ELAYSNLG-----------MVAELPFD-------YLELPSISEHWDSSSSSWSIFFRRLLKEEEI---ADFQNLLGIISNKKVTEYFGRADMKALWKSKS

Query:  PRQVNITIWIMLNGYLNFSSVMQRKLAAHCLSPHI
        P       W+M    LN   +MQ KL+  CL P +
Subjt:  PRQVNITIWIMLNGYLNFSSVMQRKLAAHCLSPHI

TYK21876.1 hypothetical protein E5676_scaffold494G00090 [Cucumis melo var. makuwa]4.5e-5938.23Show/hide
Query:  MRNFFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRSVHGSTPFKWHTAGKDDSGLRSPWISISRSWLKIE
        MRNFF EG+   K+N LV W   S   KDGGLGLGG++  N ALLAKWGWR+ KE+++LW  VIRS+HG   F W T GK  + LRS W++I+R W  ++
Subjt:  MRNFFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRSVHGSTPFKWHTAGKDDSGLRSPWISISRSWLKIE

Query:  ELAYSNLGMVAELPF---DYLELPSISEHWDSSSS------------SWSIFFRRLLKEEEIADFQNLLGIISNKKVT----------EYFGRADMK---
         LA  NLG    + F    ++    I E +    S            SW + FRR L++EEI +FQ+LL ++S +KV           E  GR   K   
Subjt:  ELAYSNLGMVAELPF---DYLELPSISEHWDSSSS------------SWSIFFRRLLKEEEIADFQNLLGIISNKKVT----------EYFGRADMK---

Query:  ---------------ALWKSKSPRQVNITIWIMLNGYLNFSSVMQRKLAAHCLSPHICPLCM-ADKELQHL-----------------------------
                       A+ +S SPR++NI IWIM+  ++N S ++Q+K   + +SP ICPLC+ A K L H+                             
Subjt:  ---------------ALWKSKSPRQVNITIWIMLNGYLNFSSVMQRKLAAHCLSPHICPLCM-ADKELQHL-----------------------------

Query:  ENVLQILVGPTFKKGPKILWSNAVKAILFEIWFERNQRVFHNKATPRLDCFEIAHLNATSW
         +V+Q+L G    K P+I+W    KA+L EIW ERNQR+FH+KA  R +    A LNA +W
Subjt:  ENVLQILVGPTFKKGPKILWSNAVKAILFEIWFERNQRVFHNKATPRLDCFEIAHLNATSW

TrEMBL top hitse value%identityAlignment
A0A5A7SP09 Vacuolar fusion protein MON1 homolog3.5e-4150.26Show/hide
Query:  FFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRSVHGSTPFKWHTAGKDDSGLRSPWISISRSWLKIEELA
        FFWEGN   KLN+LVKWE+ ++SQ DG LGL  L+ RN+ALLAKW WRF+ E +SLW  V+RS+H S  F WHT+GK+ S LRSPWISISR   K+E LA
Subjt:  FFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRSVHGSTPFKWHTAGKDDSGLRSPWISISRSWLKIEELA

Query:  YSNLG-----------------MVAELPFDY-LELP---SISEHWDSSSSSWSIFFRRLLKEEEIADFQNLLGIISNKKVTEYFGRADMKALW
           +G                 +  + P  Y + LP   S++ +WDSSSSSWSI FR LLKEEEI+DFQ LL  +  ++ T      D K  W
Subjt:  YSNLG-----------------MVAELPFDY-LELP---SISEHWDSSSSSWSIFFRRLLKEEEIADFQNLLGIISNKKVTEYFGRADMKALW

A0A5A7T2Y0 zf-RVT domain-containing protein2.2e-5938.5Show/hide
Query:  MRNFFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRSVHGSTPFKWHTAGKDDSGLRSPWISISRSWLKIE
        MRNFF EG+   K+N LV W   S   KDGGLGLGG++  N ALLAKWGWR+ KE+++LW  VIRS+HG   F W T GK  + LRSPW++I+R W  ++
Subjt:  MRNFFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRSVHGSTPFKWHTAGKDDSGLRSPWISISRSWLKIE

Query:  ELAYSNLGMVAELPF---DYLELPSISEH---------WDSSS---SSWSIFFRRLLKEEEIADFQNLLGIISNKKVT----------EYFGRADMK---
         LA  NLG    + F    ++    I E          W   S    SW + FRR L++EEI +FQ+LL ++S +KV           E  GR   K   
Subjt:  ELAYSNLGMVAELPF---DYLELPSISEH---------WDSSS---SSWSIFFRRLLKEEEIADFQNLLGIISNKKVT----------EYFGRADMK---

Query:  ---------------ALWKSKSPRQVNITIWIMLNGYLNFSSVMQRKLAAHCLSPHICPLCM-ADKELQHL-----------------------------
                       A+ +S SPR++NI IWIM+  ++  S ++Q+K   + +SP ICPLC+ A K L H+                             
Subjt:  ---------------ALWKSKSPRQVNITIWIMLNGYLNFSSVMQRKLAAHCLSPHICPLCM-ADKELQHL-----------------------------

Query:  ENVLQILVGPTFKKGPKILWSNAVKAILFEIWFERNQRVFHNKATPRLDCFEIAHLNATSW
         +V+Q+L G    K P+I+W    KA+L EIW ERNQR+FH+KA  R +    A LNA +W
Subjt:  ENVLQILVGPTFKKGPKILWSNAVKAILFEIWFERNQRVFHNKATPRLDCFEIAHLNATSW

A0A5D3CSP2 Uncharacterized protein1.5e-4443.4Show/hide
Query:  MRNFFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRSVHGSTPFKWHTAGKDDSGLRSPWISISRSWLKIE
        +RNFFWEGN G K+NH V W+  + S  DG LGLGG+RN+++ALLAKWGWR+MKE+ +LW  V+RS+HG   F W T  K  + LRSPW+ ISR W K+E
Subjt:  MRNFFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRSVHGSTPFKWHTAGKDDSGLRSPWISISRSWLKIE

Query:  ELAYSNLG-----------MVAELPFD-------YLELPSISEHWDSSSSSWSIFFRRLLKEEEI---ADFQNLLGIISNKKVTEYFGRADMKALWKSKS
         LA   LG              E+P +        L   S++ HWDS ++SWSI FRRLL    +     F + L I   +K       A  KALWK+ S
Subjt:  ELAYSNLG-----------MVAELPFD-------YLELPSISEHWDSSSSSWSIFFRRLLKEEEI---ADFQNLLGIISNKKVTEYFGRADMKALWKSKS

Query:  PRQVNITIWIMLNGYLNFSSVMQRKLAAHCLSPHI
        P       W+M    LN   +MQ KL+  CL P +
Subjt:  PRQVNITIWIMLNGYLNFSSVMQRKLAAHCLSPHI

A0A5D3DE60 zf-RVT domain-containing protein2.2e-5938.23Show/hide
Query:  MRNFFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRSVHGSTPFKWHTAGKDDSGLRSPWISISRSWLKIE
        MRNFF EG+   K+N LV W   S   KDGGLGLGG++  N ALLAKWGWR+ KE+++LW  VIRS+HG   F W T GK  + LRS W++I+R W  ++
Subjt:  MRNFFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRSVHGSTPFKWHTAGKDDSGLRSPWISISRSWLKIE

Query:  ELAYSNLGMVAELPF---DYLELPSISEHWDSSSS------------SWSIFFRRLLKEEEIADFQNLLGIISNKKVT----------EYFGRADMK---
         LA  NLG    + F    ++    I E +    S            SW + FRR L++EEI +FQ+LL ++S +KV           E  GR   K   
Subjt:  ELAYSNLGMVAELPF---DYLELPSISEHWDSSSS------------SWSIFFRRLLKEEEIADFQNLLGIISNKKVT----------EYFGRADMK---

Query:  ---------------ALWKSKSPRQVNITIWIMLNGYLNFSSVMQRKLAAHCLSPHICPLCM-ADKELQHL-----------------------------
                       A+ +S SPR++NI IWIM+  ++N S ++Q+K   + +SP ICPLC+ A K L H+                             
Subjt:  ---------------ALWKSKSPRQVNITIWIMLNGYLNFSSVMQRKLAAHCLSPHICPLCM-ADKELQHL-----------------------------

Query:  ENVLQILVGPTFKKGPKILWSNAVKAILFEIWFERNQRVFHNKATPRLDCFEIAHLNATSW
         +V+Q+L G    K P+I+W    KA+L EIW ERNQR+FH+KA  R +    A LNA +W
Subjt:  ENVLQILVGPTFKKGPKILWSNAVKAILFEIWFERNQRVFHNKATPRLDCFEIAHLNATSW

A0A5N5I637 zf-RVT domain-containing protein3.8e-4035.53Show/hide
Query:  MRNFFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRSVHGSTPFKWHTAGKDDSGLRSPWISIS---RSWL
        M+ F WEG + GK NHLVKWEI  +S+++GGLG+G LRN+N ALLAKW WRF KE NSLW  VIRS +G     W+         RSPW  IS   +S+L
Subjt:  MRNFFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRSVHGSTPFKWHTAGKDDSGLRSPWISIS---RSWL

Query:  KIEELAYSNLGMVAELPFDYLELPSISEHW--------------------DSSSSSWSIFFRRLLKEEEIADFQNLLGIISNKK----VTEYFGRADMKA
        +  +    N   V      +LE   + E +                      +S SW+  FRR L E EI +   LL  + N +    + ++F  +    
Subjt:  KIEELAYSNLGMVAELPFDYLELPSISEHW--------------------DSSSSSWSIFFRRLLKEEEIADFQNLLGIISNKK----VTEYFGRADMKA

Query:  LWKSKSPRQVNITIWIMLNGYLNFSSVMQRKLAAHCLSPHICPLCMADKE-----LQHLENVLQILV---GPTFKKGPKILWSNAVKAILFEIWFERNQR
        +WKSK P +V + +W++  G LN    +QR+    C+SPH C LC A +E       H    +Q+          +  K LW   V A+ + IW ERN+R
Subjt:  LWKSKSPRQVNITIWIMLNGYLNFSSVMQRKLAAHCLSPHICPLCMADKE-----LQHLENVLQILV---GPTFKKGPKILWSNAVKAILFEIWFERNQR

Query:  VFHN
        +F +
Subjt:  VFHN

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657501.1e-1037.76Show/hide
Query:  RNFFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRSVHGSTPFKWHTAGKDDSGLRSPWISISRSWLKI
        R F W      K  HLVKW      +K+GGLG+   ++ N AL++K GWR ++E NSLW LV++        K+H     DS    P  S S +W  I
Subjt:  RNFFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRSVHGSTPFKWHTAGKDDSGLRSPWISISRSWLKI

Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein1.2e-0433.78Show/hide
Query:  MRNFFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRS--VHGSTP
        + +F+W   +  K  H   W+  S  + +GG+G   +   NLALL K  WR +    SL A V +S   H S P
Subjt:  MRNFFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRS--VHGSTP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAAACTTCTTTTGGGAGGGGAATAAGGGAGGAAAACTGAATCATTTAGTTAAATGGGAGATTGCCTCTAGATCTCAAAAAGATGGAGGCCTTGGTTTGGGGGGTTT
GAGAAATAGGAACTTGGCATTGCTAGCTAAATGGGGTTGGAGATTTATGAAGGAGGACAATTCTCTTTGGGCTCTGGTAATAAGAAGTGTCCATGGAAGTACTCCTTTTA
AATGGCACACGGCTGGCAAGGATGATTCTGGTCTCCGTAGCCCTTGGATAAGTATATCTAGATCTTGGTTGAAAATTGAAGAGCTGGCGTATTCAAACTTGGGAATGGTG
GCAGAATTGCCTTTTGATTATTTAGAATTGCCCTCAATCTCAGAACATTGGGACTCCTCTTCTTCATCATGGTCCATTTTCTTCCGTCGGCTTCTAAAAGAAGAGGAGAT
TGCAGATTTTCAGAACCTTCTTGGAATCATTTCTAATAAAAAAGTTACTGAATATTTCGGACGAGCGGATATGAAAGCCTTGTGGAAGTCCAAAAGCCCTCGTCAGGTGA
ACATCACGATATGGATTATGTTAAATGGCTACTTAAATTTCTCCTCAGTTATGCAAAGGAAACTTGCAGCCCATTGCTTGTCTCCTCATATTTGCCCACTGTGTATGGCT
GATAAGGAGTTACAACATCTTGAAAATGTCCTGCAGATTCTGGTTGGTCCAACATTTAAGAAGGGTCCCAAGATCCTATGGAGTAATGCGGTTAAAGCTATACTTTTTGA
AATTTGGTTTGAAAGAAACCAACGAGTCTTCCATAATAAAGCAACCCCTAGGTTAGATTGCTTTGAGATTGCACATCTCAACGCTACCTCTTGGACCGTTGGAGCTTATT
TTGGTGGTCTTGAGAATATTGCAACCGAAACAAATCTTTTGAACTGCTTGGAAGCTAAGATTCAAGATGGTCTCCATCTTAACTTATTAAAACCCAATTGGACATTCATG
GCATCATCACAGAATATGACGAATATATCTAGGAATCCATTTGCGGCCTTAGTAAAGGATGATTCTCAGGCATGCTTCCCGATTGCCGATAATTCTTTGGACCTAAAGAA
GAAGGTAATCCAGCCTGATGTGAACGTAACAGAGGCATTCAAAGAGAAATTTATTGAGGAAAGAGAAAAAGGCAGCTGCTTGACTAATGGGCTCAATGACTTGCTTAGAC
GTGATTCATTAACTAATGAAAGTGTCAAAAACTCTCATATTGAGACCCTTATCATTCCTTCCAAGGATACTCAACGAACACCTTTTGGCGGTGCTTCAAGCTCCTTGATC
AACAACTTAGCGTCGGTTGAAGTAATGGAACCGTTGGTATTCTCTCTAAATGATGTCTTCTTTCCTAAGAGTAAAAAAGGCAGCGTGCACAACAACCATCAAGGTCTCCC
CCACATTTCTCCTAAGGATATGGTACTTGCTTCCTCCTTCGGTGCACCTTCTCAACCCCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGAAACTTCTTTTGGGAGGGGAATAAGGGAGGAAAACTGAATCATTTAGTTAAATGGGAGATTGCCTCTAGATCTCAAAAAGATGGAGGCCTTGGTTTGGGGGGTTT
GAGAAATAGGAACTTGGCATTGCTAGCTAAATGGGGTTGGAGATTTATGAAGGAGGACAATTCTCTTTGGGCTCTGGTAATAAGAAGTGTCCATGGAAGTACTCCTTTTA
AATGGCACACGGCTGGCAAGGATGATTCTGGTCTCCGTAGCCCTTGGATAAGTATATCTAGATCTTGGTTGAAAATTGAAGAGCTGGCGTATTCAAACTTGGGAATGGTG
GCAGAATTGCCTTTTGATTATTTAGAATTGCCCTCAATCTCAGAACATTGGGACTCCTCTTCTTCATCATGGTCCATTTTCTTCCGTCGGCTTCTAAAAGAAGAGGAGAT
TGCAGATTTTCAGAACCTTCTTGGAATCATTTCTAATAAAAAAGTTACTGAATATTTCGGACGAGCGGATATGAAAGCCTTGTGGAAGTCCAAAAGCCCTCGTCAGGTGA
ACATCACGATATGGATTATGTTAAATGGCTACTTAAATTTCTCCTCAGTTATGCAAAGGAAACTTGCAGCCCATTGCTTGTCTCCTCATATTTGCCCACTGTGTATGGCT
GATAAGGAGTTACAACATCTTGAAAATGTCCTGCAGATTCTGGTTGGTCCAACATTTAAGAAGGGTCCCAAGATCCTATGGAGTAATGCGGTTAAAGCTATACTTTTTGA
AATTTGGTTTGAAAGAAACCAACGAGTCTTCCATAATAAAGCAACCCCTAGGTTAGATTGCTTTGAGATTGCACATCTCAACGCTACCTCTTGGACCGTTGGAGCTTATT
TTGGTGGTCTTGAGAATATTGCAACCGAAACAAATCTTTTGAACTGCTTGGAAGCTAAGATTCAAGATGGTCTCCATCTTAACTTATTAAAACCCAATTGGACATTCATG
GCATCATCACAGAATATGACGAATATATCTAGGAATCCATTTGCGGCCTTAGTAAAGGATGATTCTCAGGCATGCTTCCCGATTGCCGATAATTCTTTGGACCTAAAGAA
GAAGGTAATCCAGCCTGATGTGAACGTAACAGAGGCATTCAAAGAGAAATTTATTGAGGAAAGAGAAAAAGGCAGCTGCTTGACTAATGGGCTCAATGACTTGCTTAGAC
GTGATTCATTAACTAATGAAAGTGTCAAAAACTCTCATATTGAGACCCTTATCATTCCTTCCAAGGATACTCAACGAACACCTTTTGGCGGTGCTTCAAGCTCCTTGATC
AACAACTTAGCGTCGGTTGAAGTAATGGAACCGTTGGTATTCTCTCTAAATGATGTCTTCTTTCCTAAGAGTAAAAAAGGCAGCGTGCACAACAACCATCAAGGTCTCCC
CCACATTTCTCCTAAGGATATGGTACTTGCTTCCTCCTTCGGTGCACCTTCTCAACCCCCTTAA
Protein sequenceShow/hide protein sequence
MRNFFWEGNKGGKLNHLVKWEIASRSQKDGGLGLGGLRNRNLALLAKWGWRFMKEDNSLWALVIRSVHGSTPFKWHTAGKDDSGLRSPWISISRSWLKIEELAYSNLGMV
AELPFDYLELPSISEHWDSSSSSWSIFFRRLLKEEEIADFQNLLGIISNKKVTEYFGRADMKALWKSKSPRQVNITIWIMLNGYLNFSSVMQRKLAAHCLSPHICPLCMA
DKELQHLENVLQILVGPTFKKGPKILWSNAVKAILFEIWFERNQRVFHNKATPRLDCFEIAHLNATSWTVGAYFGGLENIATETNLLNCLEAKIQDGLHLNLLKPNWTFM
ASSQNMTNISRNPFAALVKDDSQACFPIADNSLDLKKKVIQPDVNVTEAFKEKFIEEREKGSCLTNGLNDLLRRDSLTNESVKNSHIETLIIPSKDTQRTPFGGASSSLI
NNLASVEVMEPLVFSLNDVFFPKSKKGSVHNNHQGLPHISPKDMVLASSFGAPSQPP