; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc05G17545 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc05G17545
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrotransposon protein
Genome locationClcChr05:26341792..26345585
RNA-Seq ExpressionClc05G17545
SyntenyClc05G17545
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035620.1 retrotransposon protein [Cucumis melo var. makuwa]4.9e-6653.15Show/hide
Query:  MMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDVWL-----------DNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRW
        ++C ++TRMD+R F IL  LLRT+ GL   + VDVEEMVA+FLHI+AHDV             + +    N+VL A++RL+E+LL+K +PV   CTD RW
Subjt:  MMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDVWL-----------DNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRW

Query:  RWFQNCLGAFDGTHIKVNVSAADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPRG--------------------
        RWF+NCLGA DGT+IKVNV A+DR RYRTR GE+ATNVL VC+   +F++V  GWEGS ADSR+LRD +SRP  LKVP+G                    
Subjt:  RWFQNCLGAFDGTHIKVNVSAADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPRG--------------------

Query:  -------------NPPTNLREIFNMIHSSAWNVIERAFGMLKGRWAILRGKSYY
                     N P+  +E FNM HSSA NVIERAFG+LKGRWAILRGKSYY
Subjt:  -------------NPPTNLREIFNMIHSSAWNVIERAFGMLKGRWAILRGKSYY

KAA0047510.1 retrotransposon protein [Cucumis melo var. makuwa]4.4e-6753.54Show/hide
Query:  MMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDVWL-----------DNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRW
        ++C E+TRMD+R F IL  LLRT+ GL   + VDVEEMVA+FLHI+AHDV             + +    N+VL A++RL+E+LL+K +PV   CTD RW
Subjt:  MMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDVWL-----------DNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRW

Query:  RWFQNCLGAFDGTHIKVNVSAADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPRG--------------------
        RWF+NCLGA DGT+IKVNV A+DR RYRTR GE+ATNVL VC++  +F++V  GWEGS ADSR+LRD +SRP GLKVP+G                    
Subjt:  RWFQNCLGAFDGTHIKVNVSAADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPRG--------------------

Query:  -------------NPPTNLREIFNMIHSSAWNVIERAFGMLKGRWAILRGKSYY
                     N P+  +E FNM H SA NVIERAFG+LKGRWAILRGKSYY
Subjt:  -------------NPPTNLREIFNMIHSSAWNVIERAFGMLKGRWAILRGKSYY

KAA0058874.1 retrotransposon protein [Cucumis melo var. makuwa]4.4e-6756.54Show/hide
Query:  MMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDVWL-----------DNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRW
        ++C ++TRMD+R F IL  LLR + GL   + VDVEEMVA+FLH++AHDV             + V    NIVL A+LRLYE+L+++  PVT NC D RW
Subjt:  MMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDVWL-----------DNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRW

Query:  RWFQNCLGAFDGTHIKVNVSAADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPRG------------NPPTNLRE
        + F+NCLGA DGT+IKVNV A DRP +RTR GEI TNVL VC+   +F++V  GWEGS ADSR+LR+ +SR  GL+VP+G            N PTN +E
Subjt:  RWFQNCLGAFDGTHIKVNVSAADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPRG------------NPPTNLRE

Query:  IFNMIHSSAWNVIERAFGMLKGRWAILRGKSYYSKQL
         FNM HSSA NVIERAFG+LKGRWAILRGKSYY  Q+
Subjt:  IFNMIHSSAWNVIERAFGMLKGRWAILRGKSYYSKQL

KAA0062547.1 retrotransposon protein [Cucumis melo var. makuwa]8.4e-6657.83Show/hide
Query:  MMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDV------WL-----DNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRW
        ++C ++T MD+R F IL  LLR + GL   + VDVEEMVA+FLH++AHDV      W      + V    NIVL A+ RLYE+L+++  PVT NC D RW
Subjt:  MMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDV------WL-----DNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRW

Query:  RWFQNCLGAFDGTHIKVNVSAADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPR-----GNPPTNLREIFNMIHS
        + F+NCLGA DGT+IKVNV A DRP +RTR GEIATNVL VC+   +F++V  GWEGS ADSR+LRD +SR  GL+VP+      N PTN +E FNM HS
Subjt:  RWFQNCLGAFDGTHIKVNVSAADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPR-----GNPPTNLREIFNMIHS

Query:  SAWNVIERAFGMLKGRWAILRGKSYYSKQL
        SA NVIERAFG+LKGRWAILRGK YY  Q+
Subjt:  SAWNVIERAFGMLKGRWAILRGKSYYSKQL

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]3.3e-7050.85Show/hide
Query:  YSKQLLNVVQANGLEGYLNGTVPVPSKYLDAQNMQLNLEFSTWEMYNSFIMCWIYSSFSEEKMGEIVSLDTAANIWNSLR--------------------
        +  QLLN V ANGL GYL+GT+  P ++LD   +Q N  +  WE YN  +MCWIYSS SEEKMGE+VSL+T  +IW+SL                     
Subjt:  YSKQLLNVVQANGLEGYLNGTVPVPSKYLDAQNMQLNLEFSTWEMYNSFIMCWIYSSFSEEKMGEIVSLDTAANIWNSLR--------------------

Query:  --------SQYLSQIKEVADKFSAIGELISYRDHLAHILDSLGSEYNAFVTYIQNCSDNPSIEDVRSLLLAYEAPLEKQNVVDQLNVARANFSKLSLQHN
                SQYL++IKE+ADKF+A+GE +SYRDHLAH+LD LGSEYNAFVT I N +D+PS+EDVRSLLLAYEA L+KQN VDQLN+A+AN   LSLQHN
Subjt:  --------SQYLSQIKEVADKFSAIGELISYRDHLAHILDSLGSEYNAFVTYIQNCSDNPSIEDVRSLLLAYEAPLEKQNVVDQLNVARANFSKLSLQHN

Query:  SKR-NSSWSFPNPSSSASLRPFSLVFNLPAFNQPNPNIP------TSVLGRPQFFPKWPPKPFSSKPQCQICHKFGYTAPNCHHLASLAHQSIPP
        SKR    +SFPN                  +    PN P       S+LG+PQ   KWPPKP SSK QCQIC K G++A  C+H  ++A+ +  P
Subjt:  SKR-NSSWSFPNPSSSASLRPFSLVFNLPAFNQPNPNIP------TSVLGRPQFFPKWPPKPFSSKPQCQICHKFGYTAPNCHHLASLAHQSIPP

TrEMBL top hitse value%identityAlignment
A0A5A7TWH8 Retrotransposon protein2.1e-6753.54Show/hide
Query:  MMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDVWL-----------DNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRW
        ++C E+TRMD+R F IL  LLRT+ GL   + VDVEEMVA+FLHI+AHDV             + +    N+VL A++RL+E+LL+K +PV   CTD RW
Subjt:  MMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDVWL-----------DNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRW

Query:  RWFQNCLGAFDGTHIKVNVSAADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPRG--------------------
        RWF+NCLGA DGT+IKVNV A+DR RYRTR GE+ATNVL VC++  +F++V  GWEGS ADSR+LRD +SRP GLKVP+G                    
Subjt:  RWFQNCLGAFDGTHIKVNVSAADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPRG--------------------

Query:  -------------NPPTNLREIFNMIHSSAWNVIERAFGMLKGRWAILRGKSYY
                     N P+  +E FNM H SA NVIERAFG+LKGRWAILRGKSYY
Subjt:  -------------NPPTNLREIFNMIHSSAWNVIERAFGMLKGRWAILRGKSYY

A0A5A7UUT3 Retrotransposon protein2.1e-6756.54Show/hide
Query:  MMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDVWL-----------DNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRW
        ++C ++TRMD+R F IL  LLR + GL   + VDVEEMVA+FLH++AHDV             + V    NIVL A+LRLYE+L+++  PVT NC D RW
Subjt:  MMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDVWL-----------DNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRW

Query:  RWFQNCLGAFDGTHIKVNVSAADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPRG------------NPPTNLRE
        + F+NCLGA DGT+IKVNV A DRP +RTR GEI TNVL VC+   +F++V  GWEGS ADSR+LR+ +SR  GL+VP+G            N PTN +E
Subjt:  RWFQNCLGAFDGTHIKVNVSAADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPRG------------NPPTNLRE

Query:  IFNMIHSSAWNVIERAFGMLKGRWAILRGKSYYSKQL
         FNM HSSA NVIERAFG+LKGRWAILRGKSYY  Q+
Subjt:  IFNMIHSSAWNVIERAFGMLKGRWAILRGKSYYSKQL

A0A5A7V6H4 Retrotransposon protein4.0e-6657.83Show/hide
Query:  MMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDV------WL-----DNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRW
        ++C ++T MD+R F IL  LLR + GL   + VDVEEMVA+FLH++AHDV      W      + V    NIVL A+ RLYE+L+++  PVT NC D RW
Subjt:  MMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDV------WL-----DNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRW

Query:  RWFQNCLGAFDGTHIKVNVSAADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPR-----GNPPTNLREIFNMIHS
        + F+NCLGA DGT+IKVNV A DRP +RTR GEIATNVL VC+   +F++V  GWEGS ADSR+LRD +SR  GL+VP+      N PTN +E FNM HS
Subjt:  RWFQNCLGAFDGTHIKVNVSAADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPR-----GNPPTNLREIFNMIHS

Query:  SAWNVIERAFGMLKGRWAILRGKSYYSKQL
        SA NVIERAFG+LKGRWAILRGK YY  Q+
Subjt:  SAWNVIERAFGMLKGRWAILRGKSYYSKQL

A0A5D3BDX0 Retrotransposon protein2.4e-6653.15Show/hide
Query:  MMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDVWL-----------DNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRW
        ++C ++TRMD+R F IL  LLRT+ GL   + VDVEEMVA+FLHI+AHDV             + +    N+VL A++RL+E+LL+K +PV   CTD RW
Subjt:  MMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDVWL-----------DNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTDDRW

Query:  RWFQNCLGAFDGTHIKVNVSAADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPRG--------------------
        RWF+NCLGA DGT+IKVNV A+DR RYRTR GE+ATNVL VC+   +F++V  GWEGS ADSR+LRD +SRP  LKVP+G                    
Subjt:  RWFQNCLGAFDGTHIKVNVSAADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPRG--------------------

Query:  -------------NPPTNLREIFNMIHSSAWNVIERAFGMLKGRWAILRGKSYY
                     N P+  +E FNM HSSA NVIERAFG+LKGRWAILRGKSYY
Subjt:  -------------NPPTNLREIFNMIHSSAWNVIERAFGMLKGRWAILRGKSYY

A0A6J1DQX7 uncharacterized protein LOC1110223151.6e-7050.85Show/hide
Query:  YSKQLLNVVQANGLEGYLNGTVPVPSKYLDAQNMQLNLEFSTWEMYNSFIMCWIYSSFSEEKMGEIVSLDTAANIWNSLR--------------------
        +  QLLN V ANGL GYL+GT+  P ++LD   +Q N  +  WE YN  +MCWIYSS SEEKMGE+VSL+T  +IW+SL                     
Subjt:  YSKQLLNVVQANGLEGYLNGTVPVPSKYLDAQNMQLNLEFSTWEMYNSFIMCWIYSSFSEEKMGEIVSLDTAANIWNSLR--------------------

Query:  --------SQYLSQIKEVADKFSAIGELISYRDHLAHILDSLGSEYNAFVTYIQNCSDNPSIEDVRSLLLAYEAPLEKQNVVDQLNVARANFSKLSLQHN
                SQYL++IKE+ADKF+A+GE +SYRDHLAH+LD LGSEYNAFVT I N +D+PS+EDVRSLLLAYEA L+KQN VDQLN+A+AN   LSLQHN
Subjt:  --------SQYLSQIKEVADKFSAIGELISYRDHLAHILDSLGSEYNAFVTYIQNCSDNPSIEDVRSLLLAYEAPLEKQNVVDQLNVARANFSKLSLQHN

Query:  SKR-NSSWSFPNPSSSASLRPFSLVFNLPAFNQPNPNIP------TSVLGRPQFFPKWPPKPFSSKPQCQICHKFGYTAPNCHHLASLAHQSIPP
        SKR    +SFPN                  +    PN P       S+LG+PQ   KWPPKP SSK QCQIC K G++A  C+H  ++A+ +  P
Subjt:  SKR-NSSWSFPNPSSSASLRPFSLVFNLPAFNQPNPNIP------TSVLGRPQFFPKWPPKPFSSKPQCQICHKFGYTAPNCHHLASLAHQSIPP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43722.1 unknown protein6.1e-1427.56Show/hide
Query:  CHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAH-----DVWL------DNVQALQNIVLNAILRLYEDLLR-----KLEPVTKNCTD
        C +  RM    FT L  +L+T   L+P   + +EE VA+FL I  H     DV L      + VQ     VL A   L  D +R     +L  + +    
Subjt:  CHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAH-----DVWL------DNVQALQNIVLNAILRLYEDLLR-----KLEPVTKNCTD

Query:  DR--WRWFQNCLGAFDGTHIKVNVSAADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSR----PI-----------------G
        D+  W +F   +GA DGTH+ V V    +  Y  R    + N++ +C+    F +++ G  GS  D+ VL+         P+                 G
Subjt:  DR--WRWFQNCLGAFDGTHIKVNVSAADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSR----PI-----------------G

Query:  LKVP-----------------RGNPPTNLREIFNMIHSSAWNVIERAFGMLKGR
        L  P                  G  P N  E+FN  H+S  +VIER F + K +
Subjt:  LKVP-----------------RGNPPTNLREIFNMIHSSAWNVIERAFGMLKGR

AT4G10890.1 unknown protein2.6e-0437.04Show/hide
Query:  RGNPPTNLREIFNMIHSSAWNVIERAFGMLKGRWAILRGKSYYSKQLLNVVQAN
        RG PP  ++E+FN  H    +VI+R FG+ K +W IL          +NV + N
Subjt:  RGNPPTNLREIFNMIHSSAWNVIERAFGMLKGRWAILRGKSYYSKQLLNVVQAN

AT5G28950.1 unknown protein5.5e-1547.14Show/hide
Query:  WFQNCLGAFDGTHIKVNVSAADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSR
        +F++C+GA D THI   VS    P +R R G+I+ N+L  CN   EF++V +GWEGS  DS+VL D ++R
Subjt:  WFQNCLGAFDGTHIKVNVSAADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSR

AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)8.5e-0834.48Show/hide
Query:  FIFVFTGWEGSVADSRVLRDVV----------SRPIGLKVP-RG------------NPPTNLREIFNMIHSSAWNVIERAFGMLKGRWAILRGKS--YYS
        FI+V +GWEGS  DSRVL D +          +  +    P RG              P    E+FN+ H S  NVIER FG+ K R+AI +      Y 
Subjt:  FIFVFTGWEGSVADSRVLRDVV----------SRPIGLKVP-RG------------NPPTNLREIFNMIHSSAWNVIERAFGMLKGRWAILRGKS--YYS

Query:  KQLLNVVQANGLEGYL
        KQ   V+    L  +L
Subjt:  KQLLNVVQANGLEGYL

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)5.1e-0521.14Show/hide
Query:  YLDAQNMQLNLEFSTWEMYNSFIMCWIYSSFSEEKMGEIVSLD-TAANIWNSLRS----------------------------QYLSQIKEVADKFSAIG
        ++D  +    +    W+  +  +  WIY + ++  +  I+ +  TA ++W SL +                            +Y  ++K ++D  + + 
Subjt:  YLDAQNMQLNLEFSTWEMYNSFIMCWIYSSFSEEKMGEIVSLD-TAANIWNSLRS----------------------------QYLSQIKEVADKFSAIG

Query:  ELISYRDHLAHILDSLGSEYNAFVTYIQNCSDNPSIEDVRSLLLAYEAPLEKQNVVDQLNVARANFSKLSLQHNS
          IS R  + H+L+ L  +Y+  +  I++ S  PS  + RS+LL  E+ L             +N SK SL H +
Subjt:  ELISYRDHLAHILDSLGSEYNAFVTYIQNCSDNPSIEDVRSLLLAYEAPLEKQNVVDQLNVARANFSKLSLQHNS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGACATGGACATTCAACTTGACGAGTTAATCGCAATACTGACTGCTGTATATGCGGCCACCGTTACAATGATGTGTCATGAGAACACTCGTATGGATAAGAGAAT
GTTCACGATTCTATTTCAATTACTTAGGACGATGGGTGGGCTAAGGCCAATAAAATATGTAGACGTGGAGGAAATGGTGGCCATCTTCTTGCACATAGTCGCACACGATG
TCTGGCTAGACAATGTCCAGGCACTTCAAAATATTGTGCTAAATGCAATATTGAGATTGTACGAAGATCTGTTGCGAAAACTAGAGCCAGTCACTAAGAATTGTACAGAT
GACAGATGGCGCTGGTTTCAGAATTGCTTAGGTGCATTTGACGGAACCCACATAAAGGTAAATGTCAGTGCAGCTGATCGACCACGCTATAGAACTAGAATGGGCGAGAT
CGCAACGAATGTCCTTATTGTATGCAATCAAAGTTGCGAGTTCATATTCGTTTTCACTGGATGGGAAGGATCAGTTGCTGACTCGAGAGTTCTGCGAGATGTAGTGTCCA
GGCCAATTGGATTGAAAGTCCCAAGGGGGAACCCGCCAACAAACCTTAGAGAGATTTTCAATATGATACATTCATCTGCCTGGAACGTCATTGAGAGAGCGTTCGGGATG
TTGAAAGGTCGATGGGCAATCCTCCGAGGAAAATCATACTACTCGAAGCAGCTGCTGAATGTTGTTCAAGCCAATGGACTGGAAGGTTATCTTAATGGTACGGTTCCTGT
TCCTTCAAAATATTTGGATGCTCAAAATATGCAGCTGAATCTAGAGTTTTCGACATGGGAAATGTATAATAGTTTCATTATGTGTTGGATTTACTCTTCATTTTCTGAAG
AAAAGATGGGAGAAATTGTTAGTTTAGATACTGCTGCTAATATATGGAATTCGTTAAGAAGTCAATATTTGTCTCAGATTAAAGAGGTTGCTGATAAGTTTAGTGCCATT
GGAGAACTTATATCTTATAGAGATCATTTAGCTCATATTCTAGATAGTCTAGGAAGTGAATATAATGCTTTTGTCACTTATATTCAAAATTGCTCTGATAATCCTTCTAT
TGAAGATGTGAGAAGTTTATTGTTAGCTTATGAAGCCCCTTTGGAGAAACAAAATGTTGTTGATCAATTGAATGTTGCCCGAGCTAATTTTAGCAAGCTTTCTCTTCAAC
ACAATAGCAAGCGGAATTCGTCTTGGTCCTTTCCAAACCCATCTTCCTCTGCTTCTCTAAGACCCTTTTCCCTTGTTTTTAATCTCCCCGCCTTCAATCAACCAAATCCA
AACATACCGACGAGTGTTCTTGGTCGTCCTCAATTTTTCCCAAAATGGCCTCCAAAACCTTTTTCTTCTAAACCTCAATGCCAAATCTGTCACAAATTTGGTTATACCGC
TCCTAACTGTCATCATCTTGCCTCTTTGGCCCACCAATCCATACCTCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATGACATGGACATTCAACTTGACGAGTTAATCGCAATACTGACTGCTGTATATGCGGCCACCGTTACAATGATGTGTCATGAGAACACTCGTATGGATAAGAGAAT
GTTCACGATTCTATTTCAATTACTTAGGACGATGGGTGGGCTAAGGCCAATAAAATATGTAGACGTGGAGGAAATGGTGGCCATCTTCTTGCACATAGTCGCACACGATG
TCTGGCTAGACAATGTCCAGGCACTTCAAAATATTGTGCTAAATGCAATATTGAGATTGTACGAAGATCTGTTGCGAAAACTAGAGCCAGTCACTAAGAATTGTACAGAT
GACAGATGGCGCTGGTTTCAGAATTGCTTAGGTGCATTTGACGGAACCCACATAAAGGTAAATGTCAGTGCAGCTGATCGACCACGCTATAGAACTAGAATGGGCGAGAT
CGCAACGAATGTCCTTATTGTATGCAATCAAAGTTGCGAGTTCATATTCGTTTTCACTGGATGGGAAGGATCAGTTGCTGACTCGAGAGTTCTGCGAGATGTAGTGTCCA
GGCCAATTGGATTGAAAGTCCCAAGGGGGAACCCGCCAACAAACCTTAGAGAGATTTTCAATATGATACATTCATCTGCCTGGAACGTCATTGAGAGAGCGTTCGGGATG
TTGAAAGGTCGATGGGCAATCCTCCGAGGAAAATCATACTACTCGAAGCAGCTGCTGAATGTTGTTCAAGCCAATGGACTGGAAGGTTATCTTAATGGTACGGTTCCTGT
TCCTTCAAAATATTTGGATGCTCAAAATATGCAGCTGAATCTAGAGTTTTCGACATGGGAAATGTATAATAGTTTCATTATGTGTTGGATTTACTCTTCATTTTCTGAAG
AAAAGATGGGAGAAATTGTTAGTTTAGATACTGCTGCTAATATATGGAATTCGTTAAGAAGTCAATATTTGTCTCAGATTAAAGAGGTTGCTGATAAGTTTAGTGCCATT
GGAGAACTTATATCTTATAGAGATCATTTAGCTCATATTCTAGATAGTCTAGGAAGTGAATATAATGCTTTTGTCACTTATATTCAAAATTGCTCTGATAATCCTTCTAT
TGAAGATGTGAGAAGTTTATTGTTAGCTTATGAAGCCCCTTTGGAGAAACAAAATGTTGTTGATCAATTGAATGTTGCCCGAGCTAATTTTAGCAAGCTTTCTCTTCAAC
ACAATAGCAAGCGGAATTCGTCTTGGTCCTTTCCAAACCCATCTTCCTCTGCTTCTCTAAGACCCTTTTCCCTTGTTTTTAATCTCCCCGCCTTCAATCAACCAAATCCA
AACATACCGACGAGTGTTCTTGGTCGTCCTCAATTTTTCCCAAAATGGCCTCCAAAACCTTTTTCTTCTAAACCTCAATGCCAAATCTGTCACAAATTTGGTTATACCGC
TCCTAACTGTCATCATCTTGCCTCTTTGGCCCACCAATCCATACCTCCTTAG
Protein sequenceShow/hide protein sequence
MDDMDIQLDELIAILTAVYAATVTMMCHENTRMDKRMFTILFQLLRTMGGLRPIKYVDVEEMVAIFLHIVAHDVWLDNVQALQNIVLNAILRLYEDLLRKLEPVTKNCTD
DRWRWFQNCLGAFDGTHIKVNVSAADRPRYRTRMGEIATNVLIVCNQSCEFIFVFTGWEGSVADSRVLRDVVSRPIGLKVPRGNPPTNLREIFNMIHSSAWNVIERAFGM
LKGRWAILRGKSYYSKQLLNVVQANGLEGYLNGTVPVPSKYLDAQNMQLNLEFSTWEMYNSFIMCWIYSSFSEEKMGEIVSLDTAANIWNSLRSQYLSQIKEVADKFSAI
GELISYRDHLAHILDSLGSEYNAFVTYIQNCSDNPSIEDVRSLLLAYEAPLEKQNVVDQLNVARANFSKLSLQHNSKRNSSWSFPNPSSSASLRPFSLVFNLPAFNQPNP
NIPTSVLGRPQFFPKWPPKPFSSKPQCQICHKFGYTAPNCHHLASLAHQSIPP