; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC05G079600 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC05G079600
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionDNA-directed RNA polymerase II subunit 1-like
Genome locationCiama_Chr05:711399..713130
RNA-Seq ExpressionCaUC05G079600
SyntenyCaUC05G079600
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048864.1 WW domain-binding protein 11-like [Cucumis melo var. makuwa]9.7e-8469.4Show/hide
Query:  IKKYSPAAAQPASVPEPEILPLAPTTQTLQPFEPN-LPAAAAPPSSPVREPTPRISSPAKKAASPTTSPKYEATVVSVASLPLKPAQSPPVSPPSKSSDP
        I + +PAAA+    PEPEILPLAPT QTLQ FEP   P AAAPPSS   EPT     P+ + + P  SPKYEATV+  AS PLKPA+SPPVSPP KS++P
Subjt:  IKKYSPAAAQPASVPEPEILPLAPTTQTLQPFEPN-LPAAAAPPSSPVREPTPRISSPAKKAASPTTSPKYEATVVSVASLPLKPAQSPPVSPPSKSSDP

Query:  RQSISPNSYKKPHKPTTPPLSPLALPKSVDVTTIQSGIKPEVEQKTGPYKKPDRQTEYGSGKMSLNQQAEAINLTGHNIGAVMEISQFSDKRLGGEVIRK
        R SISPNSY++P KPTTP LS L LPKS DVTTI S IKPEVEQKTGP KK D Q EY SG    + QA+AINL G NIGAVM+I+QFSDK  GGEV RK
Subjt:  RQSISPNSYKKPHKPTTPPLSPLALPKSVDVTTIQSGIKPEVEQKTGPYKKPDRQTEYGSGKMSLNQQAEAINLTGHNIGAVMEISQFSDKRLGGEVIRK

Query:  IETESGVRHENDEEKTRRATGFPMTPILNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGKSKDNRATVD
        IETE+GV+ ENDEEK+RR T FPMTPI NSNFQEVNNSV+YNSSC+ RDPGLHLDFSGK KD  ATV+
Subjt:  IETESGVRHENDEEKTRRATGFPMTPILNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGKSKDNRATVD

XP_008437853.1 PREDICTED: WW domain-binding protein 11-like [Cucumis melo]6.5e-7270.35Show/hide
Query:  PPSSPVREPTPRISSPAKKAASPTTSPKYEATVVSVASLPLKPAQSPPVSPPSKSSDPRQSISPNSYKKPHKPTTPPLSPLALPKSVDVTTIQSGIKPEV
        PPSS   EPT     P+ + + P  SPKYEATV+  AS PLKPA+SPPVSPP KS++PR SISPNSY++P KPTTP LS L LPKS DVTTI S IKPEV
Subjt:  PPSSPVREPTPRISSPAKKAASPTTSPKYEATVVSVASLPLKPAQSPPVSPPSKSSDPRQSISPNSYKKPHKPTTPPLSPLALPKSVDVTTIQSGIKPEV

Query:  EQKTGPYKKPDRQTEYGSGKMSLNQQAEAINLTGHNIGAVMEISQFSDKRLGGEVIRKIETESGVRHENDEEKTRRATGFPMTPILNSNFQEVNNSVMYN
        EQKTGP KK D Q EY SG    + QA+AINL G NIGAVM+I+QFSDK  GGEV RKIETE+GV+ ENDEEK+RR T FPMTPI NSNFQEVNNSV+YN
Subjt:  EQKTGPYKKPDRQTEYGSGKMSLNQQAEAINLTGHNIGAVMEISQFSDKRLGGEVIRKIETESGVRHENDEEKTRRATGFPMTPILNSNFQEVNNSVMYN

Query:  SSCSGRDPGLHLDFSGKSKDNRATVD
        SSC+ RDPGLHLDFSGK KD  ATV+
Subjt:  SSCSGRDPGLHLDFSGKSKDNRATVD

XP_011650663.1 gibberellin-regulated protein 14 [Cucumis sativus]4.5e-8970.44Show/hide
Query:  IKKYSPAAAQPASVPEPEILPLAPTTQTLQPFEPNLPAAAAPPSSPVREPTPRISSPAKKAASPTTSPKYEATVVSVASLPLKPAQSPPVSPPSKSSDPR
        + + +P A Q    PEPEILPLAPT QTLQPFEP  PAAAAPPSS   EPTPRIS PA        SPKYEATV+  AS PLKPA+SPPVSPP KS DPR
Subjt:  IKKYSPAAAQPASVPEPEILPLAPTTQTLQPFEPNLPAAAAPPSSPVREPTPRISSPAKKAASPTTSPKYEATVVSVASLPLKPAQSPPVSPPSKSSDPR

Query:  QSISPNSYKKPHKPTTPPLSPLALPKSVDVTTIQSGIKPEVEQKTGPYKKPDRQTEYGSGKMSLNQQAEAINLTGHNIGAVMEISQFSDKRLGGEVIRKI
         SISPNSY++P KPT P LS LALPKS DVTT+ S IKPEVEQKT  +KK DRQ +  S K   + QA+AINLTG NIGAVM+I+QFSDK  GGEV RKI
Subjt:  QSISPNSYKKPHKPTTPPLSPLALPKSVDVTTIQSGIKPEVEQKTGPYKKPDRQTEYGSGKMSLNQQAEAINLTGHNIGAVMEISQFSDKRLGGEVIRKI

Query:  ETESGVRHENDEEKTRRATGFPMTPILNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGKSKDNRATVDGGAKSIY
        ET++GV+ END EK+RR T FPMTPI NSNFQEVNNSVMYNSSCSGRDPGLHLDFSG+ KD  ATVDG  KS Y
Subjt:  ETESGVRHENDEEKTRRATGFPMTPILNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGKSKDNRATVDGGAKSIY

XP_022147458.1 vegetative cell wall protein gp1-like [Momordica charantia]4.7e-4649.1Show/hide
Query:  AAAQPASVPEPEILPLAPTTQTLQPFEPNLPAAAAPPSSPVREPTPRISSPAKKAASPTTSPKYEATVVSVASLPLKPAQSPPVSPPSKSSD--PRQSIS
        A   PA+VP     P  PTTQTLQPFEPNLP +A+P SSP++      +SP ++   P+ SPKY A    V S P K A+SPP SP +K SD    Q+  
Subjt:  AAAQPASVPEPEILPLAPTTQTLQPFEPNLPAAAAPPSSPVREPTPRISSPAKKAASPTTSPKYEATVVSVASLPLKPAQSPPVSPPSKSSD--PRQSIS

Query:  PNSYKKPHKPTTPPLSPLALPKSVDVTTIQSGIKPEVEQKTGPYK-------KPDRQTEYGSGKMSLNQQAEAINLTGHNIGAVMEISQFSDKRLGGEVI
        P S  K  +  TPPLSPLALP++ +   + S + PEVE+K+  Y        K  R +E+GSGK     QAE INLTGHN+GAVMEI+Q S K   GE+I
Subjt:  PNSYKKPHKPTTPPLSPLALPKSVDVTTIQSGIKPEVEQKTGPYK-------KPDRQTEYGSGKMSLNQQAEAINLTGHNIGAVMEISQFSDKRLGGEVI

Query:  RKIETES---GVRHENDEEKTRRATGFPMTPILNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGKSKDNRATVDGGAKSI
        +K E+E+   G  H NDE KT      P T  +NSNFQ VNNSV+Y+SSCS RDPGLHL FS        T DGG  +I
Subjt:  RKIETES---GVRHENDEEKTRRATGFPMTPILNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGKSKDNRATVDGGAKSI

XP_038879417.1 proline-rich receptor-like protein kinase PERK8 [Benincasa hispida]1.5e-3945.71Show/hide
Query:  AAAQPASVPEPEILPLAPTTQTLQPFEPNLPAAAAPPSSPVREPTPRISSPAKKAASPTTSPKYEATVVSVASLP-LKPAQSPPVSPPSKSSDPR--QSI
        AA QP + P+PEI P APT+   QP EP  PA    P+SP+R+    I+SP KKA SP  SPKY  ++  V S P  K  +SPP SP +K  + R  ++ 
Subjt:  AAAQPASVPEPEILPLAPTTQTLQPFEPNLPAAAAPPSSPVREPTPRISSPAKKAASPTTSPKYEATVVSVASLP-LKPAQSPPVSPPSKSSDPR--QSI

Query:  SPNSYKKPHKPTTPPLSPLALPK----SVDVTTIQSGIKPEVEQKTGPYK-------KPDRQTEYGSGKMSLNQQ--AEAINLTGHNIGAVMEISQFSD-
         P S  K  +  TPPLSPL LP+    S D TT     +P VE K   Y        K DR +EYGSGK    QQ  AE INL GHN+GAVMEI++ SD 
Subjt:  SPNSYKKPHKPTTPPLSPLALPK----SVDVTTIQSGIKPEVEQKTGPYK-------KPDRQTEYGSGKMSLNQQ--AEAINLTGHNIGAVMEISQFSD-

Query:  KRLGGEVIRKIETESGVRHENDEEKTRRATGF-PMTPILNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGKSKDNRATVDG
         RLGGE ++  ET+ G  H   +EK + A    P+T  +N+NFQ +NNS++Y+SSC+  DPGLHL        + ATV G
Subjt:  KRLGGEVIRKIETESGVRHENDEEKTRRATGF-PMTPILNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGKSKDNRATVDG

TrEMBL top hitse value%identityAlignment
A0A0A0L8G8 Uncharacterized protein2.2e-8970.44Show/hide
Query:  IKKYSPAAAQPASVPEPEILPLAPTTQTLQPFEPNLPAAAAPPSSPVREPTPRISSPAKKAASPTTSPKYEATVVSVASLPLKPAQSPPVSPPSKSSDPR
        + + +P A Q    PEPEILPLAPT QTLQPFEP  PAAAAPPSS   EPTPRIS PA        SPKYEATV+  AS PLKPA+SPPVSPP KS DPR
Subjt:  IKKYSPAAAQPASVPEPEILPLAPTTQTLQPFEPNLPAAAAPPSSPVREPTPRISSPAKKAASPTTSPKYEATVVSVASLPLKPAQSPPVSPPSKSSDPR

Query:  QSISPNSYKKPHKPTTPPLSPLALPKSVDVTTIQSGIKPEVEQKTGPYKKPDRQTEYGSGKMSLNQQAEAINLTGHNIGAVMEISQFSDKRLGGEVIRKI
         SISPNSY++P KPT P LS LALPKS DVTT+ S IKPEVEQKT  +KK DRQ +  S K   + QA+AINLTG NIGAVM+I+QFSDK  GGEV RKI
Subjt:  QSISPNSYKKPHKPTTPPLSPLALPKSVDVTTIQSGIKPEVEQKTGPYKKPDRQTEYGSGKMSLNQQAEAINLTGHNIGAVMEISQFSDKRLGGEVIRKI

Query:  ETESGVRHENDEEKTRRATGFPMTPILNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGKSKDNRATVDGGAKSIY
        ET++GV+ END EK+RR T FPMTPI NSNFQEVNNSVMYNSSCSGRDPGLHLDFSG+ KD  ATVDG  KS Y
Subjt:  ETESGVRHENDEEKTRRATGFPMTPILNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGKSKDNRATVDGGAKSIY

A0A1S3AVL7 WW domain-binding protein 11-like3.2e-7270.35Show/hide
Query:  PPSSPVREPTPRISSPAKKAASPTTSPKYEATVVSVASLPLKPAQSPPVSPPSKSSDPRQSISPNSYKKPHKPTTPPLSPLALPKSVDVTTIQSGIKPEV
        PPSS   EPT     P+ + + P  SPKYEATV+  AS PLKPA+SPPVSPP KS++PR SISPNSY++P KPTTP LS L LPKS DVTTI S IKPEV
Subjt:  PPSSPVREPTPRISSPAKKAASPTTSPKYEATVVSVASLPLKPAQSPPVSPPSKSSDPRQSISPNSYKKPHKPTTPPLSPLALPKSVDVTTIQSGIKPEV

Query:  EQKTGPYKKPDRQTEYGSGKMSLNQQAEAINLTGHNIGAVMEISQFSDKRLGGEVIRKIETESGVRHENDEEKTRRATGFPMTPILNSNFQEVNNSVMYN
        EQKTGP KK D Q EY SG    + QA+AINL G NIGAVM+I+QFSDK  GGEV RKIETE+GV+ ENDEEK+RR T FPMTPI NSNFQEVNNSV+YN
Subjt:  EQKTGPYKKPDRQTEYGSGKMSLNQQAEAINLTGHNIGAVMEISQFSDKRLGGEVIRKIETESGVRHENDEEKTRRATGFPMTPILNSNFQEVNNSVMYN

Query:  SSCSGRDPGLHLDFSGKSKDNRATVD
        SSC+ RDPGLHLDFSGK KD  ATV+
Subjt:  SSCSGRDPGLHLDFSGKSKDNRATVD

A0A5D3DB96 WW domain-binding protein 11-like4.7e-8469.4Show/hide
Query:  IKKYSPAAAQPASVPEPEILPLAPTTQTLQPFEPN-LPAAAAPPSSPVREPTPRISSPAKKAASPTTSPKYEATVVSVASLPLKPAQSPPVSPPSKSSDP
        I + +PAAA+    PEPEILPLAPT QTLQ FEP   P AAAPPSS   EPT     P+ + + P  SPKYEATV+  AS PLKPA+SPPVSPP KS++P
Subjt:  IKKYSPAAAQPASVPEPEILPLAPTTQTLQPFEPN-LPAAAAPPSSPVREPTPRISSPAKKAASPTTSPKYEATVVSVASLPLKPAQSPPVSPPSKSSDP

Query:  RQSISPNSYKKPHKPTTPPLSPLALPKSVDVTTIQSGIKPEVEQKTGPYKKPDRQTEYGSGKMSLNQQAEAINLTGHNIGAVMEISQFSDKRLGGEVIRK
        R SISPNSY++P KPTTP LS L LPKS DVTTI S IKPEVEQKTGP KK D Q EY SG    + QA+AINL G NIGAVM+I+QFSDK  GGEV RK
Subjt:  RQSISPNSYKKPHKPTTPPLSPLALPKSVDVTTIQSGIKPEVEQKTGPYKKPDRQTEYGSGKMSLNQQAEAINLTGHNIGAVMEISQFSDKRLGGEVIRK

Query:  IETESGVRHENDEEKTRRATGFPMTPILNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGKSKDNRATVD
        IETE+GV+ ENDEEK+RR T FPMTPI NSNFQEVNNSV+YNSSC+ RDPGLHLDFSGK KD  ATV+
Subjt:  IETESGVRHENDEEKTRRATGFPMTPILNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGKSKDNRATVD

A0A6J1D1D2 vegetative cell wall protein gp1-like2.3e-4649.1Show/hide
Query:  AAAQPASVPEPEILPLAPTTQTLQPFEPNLPAAAAPPSSPVREPTPRISSPAKKAASPTTSPKYEATVVSVASLPLKPAQSPPVSPPSKSSD--PRQSIS
        A   PA+VP     P  PTTQTLQPFEPNLP +A+P SSP++      +SP ++   P+ SPKY A    V S P K A+SPP SP +K SD    Q+  
Subjt:  AAAQPASVPEPEILPLAPTTQTLQPFEPNLPAAAAPPSSPVREPTPRISSPAKKAASPTTSPKYEATVVSVASLPLKPAQSPPVSPPSKSSD--PRQSIS

Query:  PNSYKKPHKPTTPPLSPLALPKSVDVTTIQSGIKPEVEQKTGPYK-------KPDRQTEYGSGKMSLNQQAEAINLTGHNIGAVMEISQFSDKRLGGEVI
        P S  K  +  TPPLSPLALP++ +   + S + PEVE+K+  Y        K  R +E+GSGK     QAE INLTGHN+GAVMEI+Q S K   GE+I
Subjt:  PNSYKKPHKPTTPPLSPLALPKSVDVTTIQSGIKPEVEQKTGPYK-------KPDRQTEYGSGKMSLNQQAEAINLTGHNIGAVMEISQFSDKRLGGEVI

Query:  RKIETES---GVRHENDEEKTRRATGFPMTPILNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGKSKDNRATVDGGAKSI
        +K E+E+   G  H NDE KT      P T  +NSNFQ VNNSV+Y+SSCS RDPGLHL FS        T DGG  +I
Subjt:  RKIETES---GVRHENDEEKTRRATGFPMTPILNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGKSKDNRATVDGGAKSI

A0A6J1GHE5 sulfated surface glycoprotein 185-like4.9e-3342.37Show/hide
Query:  PNKQIKKYSPAAAQPASVPEPEILPLAPTTQTLQPFEPNLPAAAAPPSSPVREPTPRISSPAKKAASPTTSPKYEATVVSVAS-LPLKPAQSPPVSPPSK
        P   + +  P+ A P         P+ PTT         LPAA   P +       +   PA+K  SP  SPKY  +V  V S  P K   SPPVSP  K
Subjt:  PNKQIKKYSPAAAQPASVPEPEILPLAPTTQTLQPFEPNLPAAAAPPSSPVREPTPRISSPAKKAASPTTSPKYEATVVSVAS-LPLKPAQSPPVSPPSK

Query:  SSDPRQSISP--NSYKKPHKPTTPPLSPLALP----KSVDVTTIQSGIKPEVEQKTGPYK-------KPDRQTEYGSGKMSLNQQA-EAINLTGHNIGAV
          D     SP  +  +       PP  PLALP     +V+ TT Q  I+PEVE+K+  Y        K DR +EYGSGK    Q+A E+INL GHN+GAV
Subjt:  SSDPRQSISP--NSYKKPHKPTTPPLSPLALP----KSVDVTTIQSGIKPEVEQKTGPYK-------KPDRQTEYGSGKMSLNQQA-EAINLTGHNIGAV

Query:  MEISQFS-DKRLGGEVIRKIETESGVRHENDE-------EKTRRATGFPMTPILNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGKSKDNRATVDG
        MEI + S   RLGGE +RK ETE G R  N+E       +K ++    PMT  +NSNFQ VNNSV+Y+SSCS RDPGLHL F+  +  + A VDG
Subjt:  MEISQFS-DKRLGGEVIRKIETESGVRHENDE-------EKTRRATGFPMTPILNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGKSKDNRATVDG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G46630.1 unknown protein1.3e-0627.49Show/hide
Query:  PVPNKQIKKYSPAAAQ-PASVPEPEILPLAPTTQTLQPFEPNLPAAA--APPSSPVREPTP--RISSPAKKAASPTTSPKYEATVVSVASLPLKPAQSPP
        P P +Q +  SP   Q P S P  +  PL P  Q   P  P    +   +PPS  +  PTP    + P     S  TSP     V    +LP +   SPP
Subjt:  PVPNKQIKKYSPAAAQ-PASVPEPEILPLAPTTQTLQPFEPNLPAAA--APPSSPVREPTP--RISSPAKKAASPTTSPKYEATVVSVASLPLKPAQSPP

Query:  VSPPSKSSDPRQSI---SPNSYKKPHKPTTP-PLSPLALPKSV---DVTTIQSGI----KPEVEQKTGPYKK------------------PDRQTEYGSG
            S  S   +S+   SP+  +   K  +P  LSP +LP S+   +  T Q  I    K     +T  + +                     Q   G+ 
Subjt:  VSPPSKSSDPRQSI---SPNSYKKPHKPTTP-PLSPLALPKSV---DVTTIQSGI----KPEVEQKTGPYKK------------------PDRQTEYGSG

Query:  KMSLNQQ-----------AEAINLTGHNIGAVMEI--SQFSDKRLG------------GEVIRKIETESGVRHENDEEKTRRA--------TGFPMTPIL
           +++Q              I + G N GAVMEI  S   +K  G            GE  R++++ S    +  E K +          +  PM   +
Subjt:  KMSLNQQ-----------AEAINLTGHNIGAVMEI--SQFSDKRLG------------GEVIRKIETESGVRHENDEEKTRRA--------TGFPMTPIL

Query:  NSNFQEVNNSVMYNSSCSGRDPGLHLDFSGK
        NSN Q +NNS++YNS+ S  DPG+HL  S K
Subjt:  NSNFQEVNNSVMYNSSCSGRDPGLHLDFSGK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGGACCTTTTCTTTCCACTTTCCATCTCCCAATTCTTGCCTCCTGCAATTCTTCTTCCCTTCAGAGTCTGCAGTCTCACTGGTACGGAGAAATGTTGCAG
ATTCCAGTACCTAACAAACAAATCAAGAAATATTCTCCGGCTGCCGCGCAGCCTGCTTCAGTACCAGAGCCTGAGATTCTGCCATTAGCTCCAACCACCCAAACT
CTGCAACCTTTCGAACCCAACCTGCCGGCGGCGGCTGCTCCGCCATCTTCCCCTGTAAGAGAACCCACTCCTCGCATATCCTCTCCGGCGAAGAAAGCGGCCTCG
CCTACCACATCGCCAAAATACGAAGCTACCGTCGTAAGTGTGGCCAGCCTGCCGCTGAAACCTGCACAATCACCGCCGGTCTCCCCTCCAAGCAAATCTTCTGAT
CCGAGGCAGTCTATTAGCCCTAATTCGTACAAAAAACCCCACAAGCCAACCACGCCACCACTTTCACCTCTCGCTCTGCCCAAATCTGTAGATGTGACGACGATT
CAATCCGGAATCAAGCCGGAGGTGGAGCAGAAAACCGGTCCGTACAAGAAGCCGGATCGGCAGACAGAGTACGGATCCGGTAAGATGTCGCTGAACCAGCAGGCC
GAGGCTATAAACCTCACCGGACATAACATAGGCGCGGTAATGGAAATTAGCCAATTCTCTGACAAACGTTTAGGCGGAGAAGTCATCAGGAAGATCGAAACAGAA
AGCGGCGTCCGGCATGAAAATGACGAGGAGAAAACTCGCAGAGCAACCGGATTCCCGATGACGCCAATCCTGAACAGCAATTTTCAAGAAGTGAACAATTCAGTT
ATGTATAATTCGTCTTGCAGTGGCCGTGATCCGGGGCTGCATCTTGATTTCTCCGGCAAGTCGAAGGATAATAGAGCCACTGTGGACGGCGGCGCGAAATCTATA
TACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTGGACCTTTTCTTTCCACTTTCCATCTCCCAATTCTTGCCTCCTGCAATTCTTCTTCCCTTCAGAGTCTGCAGTCTCACTGGTACGGAGAAATGTTGCAG
ATTCCAGTACCTAACAAACAAATCAAGAAATATTCTCCGGCTGCCGCGCAGCCTGCTTCAGTACCAGAGCCTGAGATTCTGCCATTAGCTCCAACCACCCAAACT
CTGCAACCTTTCGAACCCAACCTGCCGGCGGCGGCTGCTCCGCCATCTTCCCCTGTAAGAGAACCCACTCCTCGCATATCCTCTCCGGCGAAGAAAGCGGCCTCG
CCTACCACATCGCCAAAATACGAAGCTACCGTCGTAAGTGTGGCCAGCCTGCCGCTGAAACCTGCACAATCACCGCCGGTCTCCCCTCCAAGCAAATCTTCTGAT
CCGAGGCAGTCTATTAGCCCTAATTCGTACAAAAAACCCCACAAGCCAACCACGCCACCACTTTCACCTCTCGCTCTGCCCAAATCTGTAGATGTGACGACGATT
CAATCCGGAATCAAGCCGGAGGTGGAGCAGAAAACCGGTCCGTACAAGAAGCCGGATCGGCAGACAGAGTACGGATCCGGTAAGATGTCGCTGAACCAGCAGGCC
GAGGCTATAAACCTCACCGGACATAACATAGGCGCGGTAATGGAAATTAGCCAATTCTCTGACAAACGTTTAGGCGGAGAAGTCATCAGGAAGATCGAAACAGAA
AGCGGCGTCCGGCATGAAAATGACGAGGAGAAAACTCGCAGAGCAACCGGATTCCCGATGACGCCAATCCTGAACAGCAATTTTCAAGAAGTGAACAATTCAGTT
ATGTATAATTCGTCTTGCAGTGGCCGTGATCCGGGGCTGCATCTTGATTTCTCCGGCAAGTCGAAGGATAATAGAGCCACTGTGGACGGCGGCGCGAAATCTATA
TACTGA
Protein sequenceShow/hide protein sequence
MSGPFLSTFHLPILASCNSSSLQSLQSHWYGEMLQIPVPNKQIKKYSPAAAQPASVPEPEILPLAPTTQTLQPFEPNLPAAAAPPSSPVREPTPRISSPAKKAAS
PTTSPKYEATVVSVASLPLKPAQSPPVSPPSKSSDPRQSISPNSYKKPHKPTTPPLSPLALPKSVDVTTIQSGIKPEVEQKTGPYKKPDRQTEYGSGKMSLNQQA
EAINLTGHNIGAVMEISQFSDKRLGGEVIRKIETESGVRHENDEEKTRRATGFPMTPILNSNFQEVNNSVMYNSSCSGRDPGLHLDFSGKSKDNRATVDGGAKSI
Y