CuGenDBv2

CmaCh20G010870 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene ID: CmaCh20G010870
Organism: Cucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Description: Reverse transcriptase
Genome location: Cma_Chr20:8360665..8362044
RNA-Seq Expression: CmaCh20G010870
Synteny: CmaCh20G010870
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
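The genome location above can be split into a chromosome name and a coordinate pair. The sketch below is only an illustration: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state explicitly. Under that assumption the span is 1380 bp, which would correspond to 460 codons (459 residues plus a stop codon).

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the length
    is end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, length = parse_location("Cma_Chr20:8360665..8362044")
print(chrom, start, end, length)  # Cma_Chr20 8360665 8362044 1380
```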


Homology
GenBank top hits (e value, % identity, alignment)
XP_022972954.1 uncharacterized protein LOC111471473 [Cucurbita maxima]; e value 1.2e-127; identity 81.34%
Query:  MGNRLDGLPIVKLMFRVISLEVKVSPTSRPRPSGSPDSSIAHKEGHGEEFDVLQNTMMSLFNGLADEFRTTIDDIQEKMCAMSTRIEVTMKAVENVSAGQ
        MGNRLDGLPI +LMFRV SLE +V+PTS P+PSGSPDSS+AHKEG GEEFDVLQNTMMSLFNGLADEFRTTIDDIQE+M +M TRIEVTMKAVENV+AGQ
Subjt:  MGNRLDGLPIVKLMFRVISLEVKVSPTSRPRPSGSPDSSIAHKEGHGEEFDVLQNTMMSLFNGLADEFRTTIDDIQEKMCAMSTRIEVTMKAVENVSAGQ

Query:  TNTGSNKLKLLDPRPFKGNRDAKELENFIFDVEQYFKATLACTDDIKVTVASMYLIDNAKLWWRTKVQDIENELCTIDSWEDLKRELRDQFLPENVEHLA
         NTGSNKL+  DPR FK NRDAKELENFIFDVEQYFKAT ACTDD KVTVASMYLID+AKLWWRTKVQDIE+ L TIDSWEDLK+ELRD+FLPEN  HLA
Subjt:  TNTGSNKLKLLDPRPFKGNRDAKELENFIFDVEQYFKATLACTDDIKVTVASMYLIDNAKLWWRTKVQDIENELCTIDSWEDLKRELRDQFLPENVEHLA

Query:  MEKLIALKKTGSRRDFVRQFSTLMLDIRGTSEKDKVFFFINGLQPWVKTKVHEKKIQDLATAISSSERLLDYGNEAGYQRKHRP
        MEKL+ALK TG  RD+VRQFSTLMLDI GT EKDK+FFFINGLQPW KTK+HE K+Q LA A++ +ERLLDYGNEAG QR+  P
Subjt:  MEKLIALKKTGSRRDFVRQFSTLMLDIRGTSEKDKVFFFINGLQPWVKTKVHEKKIQDLATAISSSERLLDYGNEAGYQRKHRP

XP_022975176.1 uncharacterized protein LOC111474215 [Cucurbita maxima]; e value 1.4e-131; identity 83.1%
Query:  MGNRLDGLPIVKLMFRVISLEVKVSPTSRPRPSGSPDSSIAHKEGHGEEFDVLQNTMMSLFNGLADEFRTTIDDIQEKMCAMSTRIEVTMKAVENVSAGQ
        MGNRLDGLPI +LMF+V SLE +V+PTS P+PSGSPDSS+AHKEG GEEFDVLQNTMMSLFNGLADEFRTTIDDIQE+M +M TRIEVTMKAVENV+AGQ
Subjt:  MGNRLDGLPIVKLMFRVISLEVKVSPTSRPRPSGSPDSSIAHKEGHGEEFDVLQNTMMSLFNGLADEFRTTIDDIQEKMCAMSTRIEVTMKAVENVSAGQ

Query:  TNTGSNKLKLLDPRPFKGNRDAKELENFIFDVEQYFKATLACTDDIKVTVASMYLIDNAKLWWRTKVQDIENELCTIDSWEDLKRELRDQFLPENVEHLA
        TNTGSNKL+  +PR FKGNRDAKELENFIFDVEQYFKAT ACTDD KVTVASMYL D+AKLWWRTKVQDIE+ LCTIDSWEDLK+ELRDQFLPEN  HLA
Subjt:  TNTGSNKLKLLDPRPFKGNRDAKELENFIFDVEQYFKATLACTDDIKVTVASMYLIDNAKLWWRTKVQDIENELCTIDSWEDLKRELRDQFLPENVEHLA

Query:  MEKLIALKKTGSRRDFVRQFSTLMLDIRGTSEKDKVFFFINGLQPWVKTKVHEKKIQDLATAISSSERLLDYGNEAGYQRKHRP
        MEKL+ALK TGS RD+VRQFSTLMLDIRGTSEKDKVFFFINGLQPW KTK+HE K+Q LA A++ +ERLLDYGNEAG QR+  P
Subjt:  MEKLIALKKTGSRRDFVRQFSTLMLDIRGTSEKDKVFFFINGLQPWVKTKVHEKKIQDLATAISSSERLLDYGNEAGYQRKHRP

XP_022975516.1 uncharacterized protein LOC111474945, partial [Cucurbita maxima]; e value 2.4e-131; identity 82.75%
Query:  MGNRLDGLPIVKLMFRVISLEVKVSPTSRPRPSGSPDSSIAHKEGHGEEFDVLQNTMMSLFNGLADEFRTTIDDIQEKMCAMSTRIEVTMKAVENVSAGQ
        MGNRLDGLPI +LMFRV SLE +V+PTS P+PSGSPDSS+AHKEG GEEFDVLQNTMMSLFNGLADEFRTTIDDIQE+M +M TRIEVTMKAVENV+AGQ
Subjt:  MGNRLDGLPIVKLMFRVISLEVKVSPTSRPRPSGSPDSSIAHKEGHGEEFDVLQNTMMSLFNGLADEFRTTIDDIQEKMCAMSTRIEVTMKAVENVSAGQ

Query:  TNTGSNKLKLLDPRPFKGNRDAKELENFIFDVEQYFKATLACTDDIKVTVASMYLIDNAKLWWRTKVQDIENELCTIDSWEDLKRELRDQFLPENVEHLA
        TNTGSNKL+  +PR FKGN+DAKELENFIFDVEQYFKAT  C DD KVTVASMYL D+AKLWWRTKVQDIE+ LCTIDSWEDLK+ELRDQFLPEN EHLA
Subjt:  TNTGSNKLKLLDPRPFKGNRDAKELENFIFDVEQYFKATLACTDDIKVTVASMYLIDNAKLWWRTKVQDIENELCTIDSWEDLKRELRDQFLPENVEHLA

Query:  MEKLIALKKTGSRRDFVRQFSTLMLDIRGTSEKDKVFFFINGLQPWVKTKVHEKKIQDLATAISSSERLLDYGNEAGYQRKHRP
        MEKL+ALK TGS RD+VRQFSTLMLDIRGTSEKDKVFFFINGLQPW KTK+HE K+Q LA A++ +ERLLDYGNEAG QR+  P
Subjt:  MEKLIALKKTGSRRDFVRQFSTLMLDIRGTSEKDKVFFFINGLQPWVKTKVHEKKIQDLATAISSSERLLDYGNEAGYQRKHRP

XP_022975706.1 uncharacterized protein LOC111475733, partial [Cucurbita maxima]; e value 4.1e-131; identity 83.1%
Query:  MGNRLDGLPIVKLMFRVISLEVKVSPTSRPRPSGSPDSSIAHKEGHGEEFDVLQNTMMSLFNGLADEFRTTIDDIQEKMCAMSTRIEVTMKAVENVSAGQ
        MGNRLDGLPI +LMFRV SLE +V+PTS P+PSGSPDSS+AHKEG GEEFDVLQNTMMSLFNGLADEFRTTIDDIQE+M +M TRIEVTMKAVENV+AGQ
Subjt:  MGNRLDGLPIVKLMFRVISLEVKVSPTSRPRPSGSPDSSIAHKEGHGEEFDVLQNTMMSLFNGLADEFRTTIDDIQEKMCAMSTRIEVTMKAVENVSAGQ

Query:  TNTGSNKLKLLDPRPFKGNRDAKELENFIFDVEQYFKATLACTDDIKVTVASMYLIDNAKLWWRTKVQDIENELCTIDSWEDLKRELRDQFLPENVEHLA
        TNTGS+KL+  DPR FKGNRDAKELENFIFDVEQYFKAT ACTDD KVTVASMYL D+AKLWWRTKVQDIE+ LCTIDSWEDLK+ELRDQFLPEN  HLA
Subjt:  TNTGSNKLKLLDPRPFKGNRDAKELENFIFDVEQYFKATLACTDDIKVTVASMYLIDNAKLWWRTKVQDIENELCTIDSWEDLKRELRDQFLPENVEHLA

Query:  MEKLIALKKTGSRRDFVRQFSTLMLDIRGTSEKDKVFFFINGLQPWVKTKVHEKKIQDLATAISSSERLLDYGNEAGYQRKHRP
        MEKL+ALK TG  RD+VRQFSTLMLDIRGTSEKDKVFFFINGLQPW KTK+HE K+Q LA A++  ERLLDYGNEAG QR+  P
Subjt:  MEKLIALKKTGSRRDFVRQFSTLMLDIRGTSEKDKVFFFINGLQPWVKTKVHEKKIQDLATAISSSERLLDYGNEAGYQRKHRP

XP_023537907.1 uncharacterized protein LOC111798805 [Cucurbita pepo subsp. pepo]; e value 2.7e-127; identity 78.87%
Query:  MGNRLDGLPIVKLMFRVISLEVKVSPTSRPRPSGSPDSSIAHKEGHGEEFDVLQNTMMSLFNGLADEFRTTIDDIQEKMCAMSTRIEVTMKAVENVSAGQ
        MGNRLDGLPI +L+FRV SLE +V+PTS P+PS SPDSS+AHKEG GEEFD+LQNTMMSLFNGLADEFR+T+DD+QE+M AMSTRIEVTMKAVE V+AGQ
Subjt:  MGNRLDGLPIVKLMFRVISLEVKVSPTSRPRPSGSPDSSIAHKEGHGEEFDVLQNTMMSLFNGLADEFRTTIDDIQEKMCAMSTRIEVTMKAVENVSAGQ

Query:  TNTGSNKLKLLDPRPFKGNRDAKELENFIFDVEQYFKATLACTDDIKVTVASMYLIDNAKLWWRTKVQDIENELCTIDSWEDLKRELRDQFLPENVEHLA
        T+TGSNKL+  DPR FKGNRDAKELENFIFDVEQYFKAT ACTDD KVTVA+MYL+D+AKLWWRTKVQDIE+ LCTIDSWEDLKRELR+QFLPEN  H+A
Subjt:  TNTGSNKLKLLDPRPFKGNRDAKELENFIFDVEQYFKATLACTDDIKVTVASMYLIDNAKLWWRTKVQDIENELCTIDSWEDLKRELRDQFLPENVEHLA

Query:  MEKLIALKKTGSRRDFVRQFSTLMLDIRGTSEKDKVFFFINGLQPWVKTKVHEKKIQDLATAISSSERLLDYGNEAGYQRKHRP
        MEK++ALK TG+ RD+VRQFSTLMLDIRGT+EKDKVFFFINGLQPW KTK+HE ++Q LA A++ +ERL+D GNEAG QR+  P
Subjt:  MEKLIALKKTGSRRDFVRQFSTLMLDIRGTSEKDKVFFFINGLQPWVKTKVHEKKIQDLATAISSSERLLDYGNEAGYQRKHRP

TrEMBL top hits (e value, % identity, alignment)
A0A6J1EG61 uncharacterized protein LOC111434028; e value 7.0e-105; identity 87.05%
Query:  MSLFNGLADEFRTTIDDIQEKMCAMSTRIEVTMKAVENVSAGQTNTGSNKLKLLDPRPFKGNRDAKELENFIFDVEQYFKATLACTDDIKVTVASMYLID
        MSLFNGLADEFRTTIDDIQEKM  MST+IEVTMK VEN+S GQTNTGSNKLK  DPRPFKGNRDAKEL+NFIFDVE YFKATLACTDDIKVTVASMYLID
Subjt:  MSLFNGLADEFRTTIDDIQEKMCAMSTRIEVTMKAVENVSAGQTNTGSNKLKLLDPRPFKGNRDAKELENFIFDVEQYFKATLACTDDIKVTVASMYLID

Query:  NAKLWWRTKVQDIENELCTIDSWEDLKRELRDQFLPENVEHLAMEKLIALKKTGSRRDFVRQFSTLMLDIRGTSEKDKVFFFINGLQPWVKTKVHEKKIQ
        +AKLWWR KVQDIEN LCTIDSWEDLKRELRDQFLPENVEHLAMEKLIALK+T S +D+VRQFSTLMLDIRGTSEKDKV FFINGLQPW KTK+HEKK+Q
Subjt:  NAKLWWRTKVQDIENELCTIDSWEDLKRELRDQFLPENVEHLAMEKLIALKKTGSRRDFVRQFSTLMLDIRGTSEKDKVFFFINGLQPWVKTKVHEKKIQ

Query:  DLATAISSSERLLDYGNEAGYQRK
        DLAT I+S+ERL DYG+ A YQRK
Subjt:  DLATAISSSERLLDYGNEAGYQRK

A0A6J1ID35 uncharacterized protein LOC111471473; e value 5.9e-128; identity 81.34%
Query:  MGNRLDGLPIVKLMFRVISLEVKVSPTSRPRPSGSPDSSIAHKEGHGEEFDVLQNTMMSLFNGLADEFRTTIDDIQEKMCAMSTRIEVTMKAVENVSAGQ
        MGNRLDGLPI +LMFRV SLE +V+PTS P+PSGSPDSS+AHKEG GEEFDVLQNTMMSLFNGLADEFRTTIDDIQE+M +M TRIEVTMKAVENV+AGQ
Subjt:  MGNRLDGLPIVKLMFRVISLEVKVSPTSRPRPSGSPDSSIAHKEGHGEEFDVLQNTMMSLFNGLADEFRTTIDDIQEKMCAMSTRIEVTMKAVENVSAGQ

Query:  TNTGSNKLKLLDPRPFKGNRDAKELENFIFDVEQYFKATLACTDDIKVTVASMYLIDNAKLWWRTKVQDIENELCTIDSWEDLKRELRDQFLPENVEHLA
         NTGSNKL+  DPR FK NRDAKELENFIFDVEQYFKAT ACTDD KVTVASMYLID+AKLWWRTKVQDIE+ L TIDSWEDLK+ELRD+FLPEN  HLA
Subjt:  TNTGSNKLKLLDPRPFKGNRDAKELENFIFDVEQYFKATLACTDDIKVTVASMYLIDNAKLWWRTKVQDIENELCTIDSWEDLKRELRDQFLPENVEHLA

Query:  MEKLIALKKTGSRRDFVRQFSTLMLDIRGTSEKDKVFFFINGLQPWVKTKVHEKKIQDLATAISSSERLLDYGNEAGYQRKHRP
        MEKL+ALK TG  RD+VRQFSTLMLDI GT EKDK+FFFINGLQPW KTK+HE K+Q LA A++ +ERLLDYGNEAG QR+  P
Subjt:  MEKLIALKKTGSRRDFVRQFSTLMLDIRGTSEKDKVFFFINGLQPWVKTKVHEKKIQDLATAISSSERLLDYGNEAGYQRKHRP

A0A6J1IDF7 uncharacterized protein LOC111474215; e value 6.7e-132; identity 83.1%
Query:  MGNRLDGLPIVKLMFRVISLEVKVSPTSRPRPSGSPDSSIAHKEGHGEEFDVLQNTMMSLFNGLADEFRTTIDDIQEKMCAMSTRIEVTMKAVENVSAGQ
        MGNRLDGLPI +LMF+V SLE +V+PTS P+PSGSPDSS+AHKEG GEEFDVLQNTMMSLFNGLADEFRTTIDDIQE+M +M TRIEVTMKAVENV+AGQ
Subjt:  MGNRLDGLPIVKLMFRVISLEVKVSPTSRPRPSGSPDSSIAHKEGHGEEFDVLQNTMMSLFNGLADEFRTTIDDIQEKMCAMSTRIEVTMKAVENVSAGQ

Query:  TNTGSNKLKLLDPRPFKGNRDAKELENFIFDVEQYFKATLACTDDIKVTVASMYLIDNAKLWWRTKVQDIENELCTIDSWEDLKRELRDQFLPENVEHLA
        TNTGSNKL+  +PR FKGNRDAKELENFIFDVEQYFKAT ACTDD KVTVASMYL D+AKLWWRTKVQDIE+ LCTIDSWEDLK+ELRDQFLPEN  HLA
Subjt:  TNTGSNKLKLLDPRPFKGNRDAKELENFIFDVEQYFKATLACTDDIKVTVASMYLIDNAKLWWRTKVQDIENELCTIDSWEDLKRELRDQFLPENVEHLA

Query:  MEKLIALKKTGSRRDFVRQFSTLMLDIRGTSEKDKVFFFINGLQPWVKTKVHEKKIQDLATAISSSERLLDYGNEAGYQRKHRP
        MEKL+ALK TGS RD+VRQFSTLMLDIRGTSEKDKVFFFINGLQPW KTK+HE K+Q LA A++ +ERLLDYGNEAG QR+  P
Subjt:  MEKLIALKKTGSRRDFVRQFSTLMLDIRGTSEKDKVFFFINGLQPWVKTKVHEKKIQDLATAISSSERLLDYGNEAGYQRKHRP

A0A6J1IEF9 uncharacterized protein LOC111474945; e value 1.2e-131; identity 82.75%
Query:  MGNRLDGLPIVKLMFRVISLEVKVSPTSRPRPSGSPDSSIAHKEGHGEEFDVLQNTMMSLFNGLADEFRTTIDDIQEKMCAMSTRIEVTMKAVENVSAGQ
        MGNRLDGLPI +LMFRV SLE +V+PTS P+PSGSPDSS+AHKEG GEEFDVLQNTMMSLFNGLADEFRTTIDDIQE+M +M TRIEVTMKAVENV+AGQ
Subjt:  MGNRLDGLPIVKLMFRVISLEVKVSPTSRPRPSGSPDSSIAHKEGHGEEFDVLQNTMMSLFNGLADEFRTTIDDIQEKMCAMSTRIEVTMKAVENVSAGQ

Query:  TNTGSNKLKLLDPRPFKGNRDAKELENFIFDVEQYFKATLACTDDIKVTVASMYLIDNAKLWWRTKVQDIENELCTIDSWEDLKRELRDQFLPENVEHLA
        TNTGSNKL+  +PR FKGN+DAKELENFIFDVEQYFKAT  C DD KVTVASMYL D+AKLWWRTKVQDIE+ LCTIDSWEDLK+ELRDQFLPEN EHLA
Subjt:  TNTGSNKLKLLDPRPFKGNRDAKELENFIFDVEQYFKATLACTDDIKVTVASMYLIDNAKLWWRTKVQDIENELCTIDSWEDLKRELRDQFLPENVEHLA

Query:  MEKLIALKKTGSRRDFVRQFSTLMLDIRGTSEKDKVFFFINGLQPWVKTKVHEKKIQDLATAISSSERLLDYGNEAGYQRKHRP
        MEKL+ALK TGS RD+VRQFSTLMLDIRGTSEKDKVFFFINGLQPW KTK+HE K+Q LA A++ +ERLLDYGNEAG QR+  P
Subjt:  MEKLIALKKTGSRRDFVRQFSTLMLDIRGTSEKDKVFFFINGLQPWVKTKVHEKKIQDLATAISSSERLLDYGNEAGYQRKHRP

A0A6J1IEY4 uncharacterized protein LOC111475733; e value 2.0e-131; identity 83.1%
Query:  MGNRLDGLPIVKLMFRVISLEVKVSPTSRPRPSGSPDSSIAHKEGHGEEFDVLQNTMMSLFNGLADEFRTTIDDIQEKMCAMSTRIEVTMKAVENVSAGQ
        MGNRLDGLPI +LMFRV SLE +V+PTS P+PSGSPDSS+AHKEG GEEFDVLQNTMMSLFNGLADEFRTTIDDIQE+M +M TRIEVTMKAVENV+AGQ
Subjt:  MGNRLDGLPIVKLMFRVISLEVKVSPTSRPRPSGSPDSSIAHKEGHGEEFDVLQNTMMSLFNGLADEFRTTIDDIQEKMCAMSTRIEVTMKAVENVSAGQ

Query:  TNTGSNKLKLLDPRPFKGNRDAKELENFIFDVEQYFKATLACTDDIKVTVASMYLIDNAKLWWRTKVQDIENELCTIDSWEDLKRELRDQFLPENVEHLA
        TNTGS+KL+  DPR FKGNRDAKELENFIFDVEQYFKAT ACTDD KVTVASMYL D+AKLWWRTKVQDIE+ LCTIDSWEDLK+ELRDQFLPEN  HLA
Subjt:  TNTGSNKLKLLDPRPFKGNRDAKELENFIFDVEQYFKATLACTDDIKVTVASMYLIDNAKLWWRTKVQDIENELCTIDSWEDLKRELRDQFLPENVEHLA

Query:  MEKLIALKKTGSRRDFVRQFSTLMLDIRGTSEKDKVFFFINGLQPWVKTKVHEKKIQDLATAISSSERLLDYGNEAGYQRKHRP
        MEKL+ALK TG  RD+VRQFSTLMLDIRGTSEKDKVFFFINGLQPW KTK+HE K+Q LA A++  ERLLDYGNEAG QR+  P
Subjt:  MEKLIALKKTGSRRDFVRQFSTLMLDIRGTSEKDKVFFFINGLQPWVKTKVHEKKIQDLATAISSSERLLDYGNEAGYQRKHRP

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found
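The % identity values in the hit lists can be related to the alignment blocks, assuming the usual BLAST-style middle line in which a letter marks an identical residue, "+" a conservative substitution, and a space a mismatch or gap. The sketch below is illustrative only; the function name `identity_from_midline` is hypothetical. It is applied to the final middle-line segment of the A0A6J1EG61 alignment, so the figure it prints describes that 24-column segment, not the whole hit.

```python
def identity_from_midline(midline):
    """Return (identical, columns, percent) for one BLAST-style middle line.

    Assumption: letters mark identical residues; '+' and spaces mark
    non-identical columns. Trailing spaces are alignment columns too,
    so the string must not be stripped before it is passed in.
    """
    columns = len(midline)
    identical = sum(1 for ch in midline if ch.isalpha())
    percent = 100.0 * identical / columns if columns else 0.0
    return identical, columns, percent


# Final middle-line segment of the A0A6J1EG61 alignment shown above.
segment = "DLAT I+S+ERL DYG+ A YQRK"
identical, columns, percent = identity_from_midline(segment)
print(identical, columns, round(percent, 2))  # 17 24 70.83
```

To reproduce a reported whole-hit value, the middle lines of all blocks of that hit would need to be concatenated first, keeping their leading and trailing spaces.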

Sequences
CDS sequence
ATGGGTAACCGCCTGGATGGGTTGCCAATCGTAAAACTGATGTTTCGAGTGATCTCACTCGAAGTAAAAGTTTCTCCTACGAGCAGACCAAGACCGTCTGGTAGC
CCCGATAGCTCTATCGCTCACAAGGAGGGACATGGCGAAGAGTTTGACGTGCTACAAAATACAATGATGAGTTTGTTCAATGGATTAGCTGACGAATTCAGAACA
ACGATCGATGACATCCAAGAAAAGATGTGCGCCATGAGCACCCGAATTGAGGTGACCATGAAAGCCGTGGAGAACGTCTCGGCTGGGCAAACTAATACAGGATCC
AACAAACTGAAGCTCCTAGACCCTAGACCTTTCAAAGGGAATCGGGATGCCAAAGAGTTGGAGAACTTCATCTTTGATGTCGAACAGTACTTCAAAGCCACACTG
GCTTGTACCGACGACATAAAGGTGACAGTAGCCTCGATGTATCTCATAGACAATGCCAAACTTTGGTGGCGTACGAAGGTGCAAGACATCGAGAATGAATTGTGC
ACCATAGACTCGTGGGAAGATCTCAAGAGAGAGTTGAGGGACCAATTCCTCCCCGAAAACGTAGAACATCTAGCAATGGAAAAACTAATAGCCCTAAAAAAAACT
GGAAGCAGAAGGGACTTTGTCAGACAATTTTCGACCCTGATGCTAGATATTAGAGGCACATCAGAGAAGGACAAGGTATTCTTCTTTATAAATGGGTTACAACCA
TGGGTCAAAACAAAAGTACACGAGAAAAAGATCCAAGACCTAGCTACCGCAATTTCCAGCAGCGAGAGACTCCTAGACTATGGGAACGAAGCGGGTTACCAAAGA
AAACACAGGCCCCAAACACTAGGGGCAAAACATATAAGCTGCCAGGTCATCGAAATGGAAGCCCCAACAGGCCAAACGGAAATAACGACAGACCAAGCGGGTGGA
CAGATAAACCTCCTCAAAACAACCAAGCTGGGACATCTCGAGGACCTTACCCTCAAAGGAACCACCCGACGATACCTTTACAATGCATATAGTGTAAAGGCCCCC
ACAAAGTGTCTTACTGTCCTCATCGAGCCTCTCTCACGCACTCCAAGTGTCCATTCAAGAGAGCAACGACACAAGAGTCAAGACTATGCTAGACAAGCAGGAAGA
TCAAGACAACCCCCGAATGGGCGCACTCAAATTCTTGTCAGCCCTCCAATGAAAGGTCGACCTGAAGTAGATAATAGAGAAAGGGCTCATGTTTGTGGATGCGAC
AATAAACTCTCGATCGAGCAAGAGCACCTTGATAGGCTCAGGAGCGATTCACAATTTCATCGCCGATCAAGAAGCACGAAGATTGGGACTCACCATAGAAATAGA
CCCTGGAAAAATTAA
mRNA sequence
ATGGGTAACCGCCTGGATGGGTTGCCAATCGTAAAACTGATGTTTCGAGTGATCTCACTCGAAGTAAAAGTTTCTCCTACGAGCAGACCAAGACCGTCTGGTAGC
CCCGATAGCTCTATCGCTCACAAGGAGGGACATGGCGAAGAGTTTGACGTGCTACAAAATACAATGATGAGTTTGTTCAATGGATTAGCTGACGAATTCAGAACA
ACGATCGATGACATCCAAGAAAAGATGTGCGCCATGAGCACCCGAATTGAGGTGACCATGAAAGCCGTGGAGAACGTCTCGGCTGGGCAAACTAATACAGGATCC
AACAAACTGAAGCTCCTAGACCCTAGACCTTTCAAAGGGAATCGGGATGCCAAAGAGTTGGAGAACTTCATCTTTGATGTCGAACAGTACTTCAAAGCCACACTG
GCTTGTACCGACGACATAAAGGTGACAGTAGCCTCGATGTATCTCATAGACAATGCCAAACTTTGGTGGCGTACGAAGGTGCAAGACATCGAGAATGAATTGTGC
ACCATAGACTCGTGGGAAGATCTCAAGAGAGAGTTGAGGGACCAATTCCTCCCCGAAAACGTAGAACATCTAGCAATGGAAAAACTAATAGCCCTAAAAAAAACT
GGAAGCAGAAGGGACTTTGTCAGACAATTTTCGACCCTGATGCTAGATATTAGAGGCACATCAGAGAAGGACAAGGTATTCTTCTTTATAAATGGGTTACAACCA
TGGGTCAAAACAAAAGTACACGAGAAAAAGATCCAAGACCTAGCTACCGCAATTTCCAGCAGCGAGAGACTCCTAGACTATGGGAACGAAGCGGGTTACCAAAGA
AAACACAGGCCCCAAACACTAGGGGCAAAACATATAAGCTGCCAGGTCATCGAAATGGAAGCCCCAACAGGCCAAACGGAAATAACGACAGACCAAGCGGGTGGA
CAGATAAACCTCCTCAAAACAACCAAGCTGGGACATCTCGAGGACCTTACCCTCAAAGGAACCACCCGACGATACCTTTACAATGCATATAGTGTAAAGGCCCCC
ACAAAGTGTCTTACTGTCCTCATCGAGCCTCTCTCACGCACTCCAAGTGTCCATTCAAGAGAGCAACGACACAAGAGTCAAGACTATGCTAGACAAGCAGGAAGA
TCAAGACAACCCCCGAATGGGCGCACTCAAATTCTTGTCAGCCCTCCAATGAAAGGTCGACCTGAAGTAGATAATAGAGAAAGGGCTCATGTTTGTGGATGCGAC
AATAAACTCTCGATCGAGCAAGAGCACCTTGATAGGCTCAGGAGCGATTCACAATTTCATCGCCGATCAAGAAGCACGAAGATTGGGACTCACCATAGAAATAGA
CCCTGGAAAAATTAA
Protein sequence
MGNRLDGLPIVKLMFRVISLEVKVSPTSRPRPSGSPDSSIAHKEGHGEEFDVLQNTMMSLFNGLADEFRTTIDDIQEKMCAMSTRIEVTMKAVENVSAGQTNTGS
NKLKLLDPRPFKGNRDAKELENFIFDVEQYFKATLACTDDIKVTVASMYLIDNAKLWWRTKVQDIENELCTIDSWEDLKRELRDQFLPENVEHLAMEKLIALKKT
GSRRDFVRQFSTLMLDIRGTSEKDKVFFFINGLQPWVKTKVHEKKIQDLATAISSSERLLDYGNEAGYQRKHRPQTLGAKHISCQVIEMEAPTGQTEITTDQAGG
QINLLKTTKLGHLEDLTLKGTTRRYLYNAYSVKAPTKCLTVLIEPLSRTPSVHSREQRHKSQDYARQAGRSRQPPNGRTQILVSPPMKGRPEVDNRERAHVCGCD
NKLSIEQEHLDRLRSDSQFHRRSRSTKIGTHHRNRPWKN
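The relationship between the CDS and the protein sequence can be checked by translating the CDS. The sketch below is a minimal illustration that assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the names `CODON_TABLE` and `translate` are hypothetical and not part of CuGenDB. It is applied here only to the first and last codons of the CDS, which should give the start and end of the protein listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first; codons containing
    characters other than A, C, G, T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGGTAACCGCCTGGATGGG"))  # MGNRLDG (start of the protein)
print(translate("CCCTGGAAAAATTAA"))        # PWKN (end of the protein)
```

Passing the full CDS text, line breaks included, to `translate` should return the complete protein sequence if the assumption about the genetic code holds.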