CuGenDBv2

Tan0003560 (gene) of Snake gourd v1 genome

Gene ID: Tan0003560
Organism: Trichosanthes anguina (Snake gourd v1)
Description: B box-type domain-containing protein
Genome location: LG06:58036858..58037855
RNA-Seq Expression: Tan0003560
Synteny: Tan0003560
Gene Ontology terms: GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR006734 - PLATZ transcription factor
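
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the location string can be parsed and the span measured as follows. The helper names `parse_location` and `span_length` are hypothetical and are not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG06:58036858..58037855")
print(seqid, start, end, span_length(start, end))
```

Under that coordinate assumption the gene span is 998 bp, which is longer than the 690-nt CDS listed in the Sequences section.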


Homology
GenBank top hits (e value, % identity, alignment)
KAG6593617.1 hypothetical protein SDJN03_13093, partial [Cucurbita argyrosperma subsp. sororia]  e value: 3.0e-90  % identity: 89.64
Query:  RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQGGVSRHLLECNFLPLPEPAW
        RRYVYHDVIRLDDA KLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVK QGGVSRHLLECNFLPLPEP W
Subjt:  RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQGGVSRHLLECNFLPLPEPAW

Query:  DDGFIMTPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLT---AAYRTACRPVCCSATVSETSGSLMSRRKGTPQRAPLY
        DDG IMTPDSVLEP GSTRTSSSSDGG       T+ASTATTDFVKKKRSSLT   AAYRT CRP CCS TVSETSGSLMSRRK TPQRAPLY
Subjt:  DDGFIMTPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLT---AAYRTACRPVCCSATVSETSGSLMSRRKGTPQRAPLY

KAG7025963.1 hypothetical protein SDJN02_12461, partial [Cucurbita argyrosperma subsp. argyrosperma]  e value: 3.0e-90  % identity: 89.64
Query:  RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQGGVSRHLLECNFLPLPEPAW
        RRYVYHDVIRLDDA KLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVK QGGVSRHLLECNFLPLPEP W
Subjt:  RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQGGVSRHLLECNFLPLPEPAW

Query:  DDGFIMTPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLT---AAYRTACRPVCCSATVSETSGSLMSRRKGTPQRAPLY
        DDG IMTPDSVLEP GSTRTSSSSDGG       T+ASTATTDFVKKKRSSLT   AAYRT CRP CCS TVSETSGSLMSRRK TPQRAPLY
Subjt:  DDGFIMTPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLT---AAYRTACRPVCCSATVSETSGSLMSRRKGTPQRAPLY

XP_022964357.1 uncharacterized protein LOC111464390 [Cucurbita moschata]  e value: 3.0e-90  % identity: 89.64
Query:  RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQGGVSRHLLECNFLPLPEPAW
        RRYVYHDVIRLDDA KLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVK QGGVSRHLLECNFLPLPEP W
Subjt:  RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQGGVSRHLLECNFLPLPEPAW

Query:  DDGFIMTPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLT---AAYRTACRPVCCSATVSETSGSLMSRRKGTPQRAPLY
        DDG IMTPDSVLEP GSTRTSSSSDGG       T+ASTATTDFVKKKRSSLT   AAYRT CRP CCS TVSETSGSLMSRRK TPQRAPLY
Subjt:  DDGFIMTPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLT---AAYRTACRPVCCSATVSETSGSLMSRRKGTPQRAPLY

XP_023000095.1 uncharacterized protein LOC111494393 [Cucurbita maxima]  e value: 1.5e-89  % identity: 89.64
Query:  RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQGGVSRHLLECNFLPLPEPAW
        RRYVYHDVIRLDDA KLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPY FCSLSCKIDYLVK QGGVSRHLLECNFLPLPEPAW
Subjt:  RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQGGVSRHLLECNFLPLPEPAW

Query:  DDGFIMTPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLT---AAYRTACRPVCCSATVSETSGSLMSRRKGTPQRAPLY
        DDG IMTPDSVLEP GSTRTSSSSDGG       T+ASTATTDFVKKKRSSLT   AAYRT CRP  CS TVSETSGSLMSRRKGTPQRAPLY
Subjt:  DDGFIMTPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLT---AAYRTACRPVCCSATVSETSGSLMSRRKGTPQRAPLY

XP_023514505.1 uncharacterized protein LOC111778763 [Cucurbita pepo subsp. pepo]  e value: 3.0e-90  % identity: 89.64
Query:  RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQGGVSRHLLECNFLPLPEPAW
        RRYVYHDVIRLDDA KLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVK QGGVSRHLLECNFLPLPEP W
Subjt:  RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQGGVSRHLLECNFLPLPEPAW

Query:  DDGFIMTPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLT---AAYRTACRPVCCSATVSETSGSLMSRRKGTPQRAPLY
        DDG IMTPDSVLEP GSTRTSSSSDGG       T+ASTATTDFVKKKRSSLT   AAYRT CRP CCS TVSETSGSLMSRRK TPQRAPLY
Subjt:  DDGFIMTPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLT---AAYRTACRPVCCSATVSETSGSLMSRRKGTPQRAPLY

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KCM1 B box-type domain-containing protein  e value: 3.0e-88  % identity: 80.28
Query:  ALTACPLTALTNSS------RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQ
        +L  CP    ++SS      RRYVYHDVIRLDDA KLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCS+SCKIDYLVKTQ
Subjt:  ALTACPLTALTNSS------RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQ

Query:  GGVSRHLLECNFLPLPEPAWDDGFIMTPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLT---AAYRTACRPVCCSATVSETSGSLM
        GG+S+HLLECNFLPLPEP WDDG +MTPDSVLEP GS R+SS SD  GGG E +T+ STATT+FVKKKRSSLT   AAYR ACRPVC   T+SETSGSLM
Subjt:  GGVSRHLLECNFLPLPEPAWDDGFIMTPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLT---AAYRTACRPVCCSATVSETSGSLM

Query:  SRRKGTPQRAPLY
        SRRKGTPQRAPLY
Subjt:  SRRKGTPQRAPLY

A0A1S3C8E3 uncharacterized protein LOC103497663  e value: 7.8e-89  % identity: 81.52
Query:  ALTACPLTALTNSS------RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQ
        +L  CP    ++SS      RRYVYHDVIRLDDA KLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCS+SCKIDYLVKTQ
Subjt:  ALTACPLTALTNSS------RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQ

Query:  GGVSRHLLECNFLPLPEPAWDDGFIMTPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLT-AAYRTACRPVCCSATVSETSGSLMSR
        GG+S++LLECNFLPLPEP WDDG +MTPDSVLEP GSTRTSS SD  GGG E +T+ STATT+FVKKKRSSLT AAYR ACRPVC   T+SETSGSLMSR
Subjt:  GGVSRHLLECNFLPLPEPAWDDGFIMTPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLT-AAYRTACRPVCCSATVSETSGSLMSR

Query:  RKGTPQRAPLY
        RKGTPQRAPLY
Subjt:  RKGTPQRAPLY

A0A6J1H7K0 uncharacterized protein LOC111460295  e value: 7.9e-81  % identity: 77.14
Query:  ALTACPLTALTNSS------RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQ
        +L  CP    ++SS      RRYVYHDVIRLDDA KLIDCAFVQSYTTNSAKVVFL QRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYL+KT+
Subjt:  ALTACPLTALTNSS------RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQ

Query:  GGVSRHLLECNFLPLPEPAWDDGFIMTPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLTAAYRTACRPVCCSATVSETSGSLMSRR
        GGVSR+LLECNF      AWDDG +MTPDSVLE  GS +TSS SDGGG       VASTATT+FVKKKRSSLTAAYR ACRP C   T+SETSGSL+SRR
Subjt:  GGVSRHLLECNFLPLPEPAWDDGFIMTPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLTAAYRTACRPVCCSATVSETSGSLMSRR

Query:  KGTPQRAPLY
        KGTPQRAPLY
Subjt:  KGTPQRAPLY

A0A6J1HHL3 uncharacterized protein LOC111464390  e value: 1.4e-90  % identity: 89.64
Query:  RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQGGVSRHLLECNFLPLPEPAW
        RRYVYHDVIRLDDA KLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVK QGGVSRHLLECNFLPLPEP W
Subjt:  RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQGGVSRHLLECNFLPLPEPAW

Query:  DDGFIMTPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLT---AAYRTACRPVCCSATVSETSGSLMSRRKGTPQRAPLY
        DDG IMTPDSVLEP GSTRTSSSSDGG       T+ASTATTDFVKKKRSSLT   AAYRT CRP CCS TVSETSGSLMSRRK TPQRAPLY
Subjt:  DDGFIMTPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLT---AAYRTACRPVCCSATVSETSGSLMSRRKGTPQRAPLY

A0A6J1KHE0 uncharacterized protein LOC111494393  e value: 7.1e-90  % identity: 89.64
Query:  RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQGGVSRHLLECNFLPLPEPAW
        RRYVYHDVIRLDDA KLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPY FCSLSCKIDYLVK QGGVSRHLLECNFLPLPEPAW
Subjt:  RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQGGVSRHLLECNFLPLPEPAW

Query:  DDGFIMTPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLT---AAYRTACRPVCCSATVSETSGSLMSRRKGTPQRAPLY
        DDG IMTPDSVLEP GSTRTSSSSDGG       T+ASTATTDFVKKKRSSLT   AAYRT CRP  CS TVSETSGSLMSRRKGTPQRAPLY
Subjt:  DDGFIMTPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLT---AAYRTACRPVCCSATVSETSGSLMSRRKGTPQRAPLY

SwissProt top hits (e value, % identity, alignment)
Q1G3Q4 Protein RGF1 INDUCIBLE TRANSCRIPTION FACTOR 1  e value: 2.2e-32  % identity: 41.41
Query:  RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQGGVSRHLLECNFLPL-----
        RRYVYHDV+RL+D  KLIDC+ VQ+YT NSAKVVF+K+RPQ R F+G +GN C++CDRSLQ+PY+ CSL CK+D+++K    ++  L  C+ L L     
Subjt:  RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQGGVSRHLLECNFLPL-----

Query:  -PEPAWDDGFIM---TPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLTAAYRTACRPVCCSATVSETSGSLMSRRKGTPQRAPL
         P+    D  +    TP S +     + + SS+      A      +  TT  V+KKR+      ++A      S    + S + ++RRKG PQR+PL
Subjt:  -PEPAWDDGFIM---TPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLTAAYRTACRPVCCSATVSETSGSLMSRRKGTPQRAPL

Arabidopsis top hits (e value, % identity, alignment)
AT1G21000.1 PLATZ transcription factor family protein  e value: 2.5e-15  % identity: 46.74
Query:  RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQGGVSRHLLECNF
        RR  YH+V+R+++  K ID A VQ+Y  NSAK+VFL +RPQ R  +G + N C  C RSL D + FCSL CK+       GG+ R  L   F
Subjt:  RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQGGVSRHLLECNF

AT1G21000.2 PLATZ transcription factor family protein  e value: 2.5e-15  % identity: 46.74
Query:  RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQGGVSRHLLECNF
        RR  YH+V+R+++  K ID A VQ+Y  NSAK+VFL +RPQ R  +G + N C  C RSL D + FCSL CK+       GG+ R  L   F
Subjt:  RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQGGVSRHLLECNF

AT1G31040.1 PLATZ transcription factor family protein  e value: 1.1e-23  % identity: 50.91
Query:  TKKTSSVSTVPSVFALTACPLTALTNSS------RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLF
        T++ S  +    +  L+ CP    ++ S      RRYVYHDV+RL D  KLIDC++VQ YT N AKV+FL QR Q+R     S N+C TCDR LQ+P+ F
Subjt:  TKKTSSVSTVPSVFALTACPLTALTNSS------RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLF

Query:  CSLSCKIDYL
        CSLSCK+DYL
Subjt:  CSLSCKIDYL

AT2G12646.1 PLATZ transcription factor family protein  e value: 1.6e-33  % identity: 41.41
Query:  RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQGGVSRHLLECNFLPL-----
        RRYVYHDV+RL+D  KLIDC+ VQ+YT NSAKVVF+K+RPQ R F+G +GN C++CDRSLQ+PY+ CSL CK+D+++K    ++  L  C+ L L     
Subjt:  RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQGGVSRHLLECNFLPL-----

Query:  -PEPAWDDGFIM---TPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLTAAYRTACRPVCCSATVSETSGSLMSRRKGTPQRAPL
         P+    D  +    TP S +     + + SS+      A      +  TT  V+KKR+      ++A      S    + S + ++RRKG PQR+PL
Subjt:  -PEPAWDDGFIM---TPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLTAAYRTACRPVCCSATVSETSGSLMSRRKGTPQRAPL

AT3G60670.1 PLATZ transcription factor family protein  e value: 1.9e-47  % identity: 51.64
Query:  LTACPLTALTNSS------RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQG
        LT CP    +++S      RRYVY DV+R++D +KL+DC+ +Q YTTNS+KVVF+ +RPQ+R FRG SGN+C TCDRSLQ PYLFC LSCKI  ++  Q 
Subjt:  LTACPLTALTNSS------RRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSCKIDYLVKTQG

Query:  GVSRHLLECNFLPLPEPAWDDGFIMTPDSVLEPTGSTRTSSSSDGGGGGAEI--RTVASTATTDFVKKKRSSLTAAYRTACRPV--CCSATVSETSGSLM
        G+S  L  CN L L     D+    TP S LEPTGS RTSS S G  G      + +A TATT+ V+KKRSSL+    T CR V    S T +E   + +
Subjt:  GVSRHLLECNFLPLPEPAWDDGFIMTPDSVLEPTGSTRTSSSSDGGGGGAEI--RTVASTATTDFVKKKRSSLTAAYRTACRPV--CCSATVSETSGSLM

Query:  SRRKGTPQRAPLY
        +RRK  PQRAPLY
Subjt:  SRRKGTPQRAPLY


Sequences
CDS sequence
ATGCCTGTGTGCTTCATGAAGACGCCAAAAAGAACGAAAAAAACATCTTCTGTTTCGACTGTTCCCTCGGTATTTGCCCTCACTGCTTGTCCTCTCACCGCTCTCACAAA
CTCCTCCAGGCGATATGTTTACCATGATGTGATTCGACTAGACGATGCTGCCAAGCTAATTGACTGTGCTTTTGTGCAATCGTACACTACGAATAGTGCGAAAGTGGTGT
TTCTGAAACAGAGGCCACAGACCAGAAACTTCAGAGGCTCCTCCGGCAACCTCTGCAGCACCTGCGACAGAAGCCTTCAAGACCCTTATCTCTTTTGCTCTCTCTCTTGC
AAGATTGATTATCTGGTAAAGACGCAAGGTGGGGTTTCGAGGCATCTTTTGGAGTGCAATTTCTTGCCATTGCCCGAACCGGCTTGGGACGACGGCTTCATTATGACGCC
TGACTCCGTTCTCGAGCCGACCGGTTCAACCCGAACCTCATCCAGTTCCGACGGCGGCGGCGGAGGGGCGGAGATTAGGACTGTGGCTTCGACGGCCACCACGGACTTTG
TGAAGAAGAAGAGAAGCAGCTTGACGGCGGCGTATCGGACAGCGTGCCGGCCGGTTTGCTGTTCGGCGACGGTTTCCGAGACGTCTGGGAGTTTGATGAGCCGGAGAAAA
GGTACCCCGCAGCGGGCCCCACTCTATTGA
mRNA sequence
ATGCCTGTGTGCTTCATGAAGACGCCAAAAAGAACGAAAAAAACATCTTCTGTTTCGACTGTTCCCTCGGTATTTGCCCTCACTGCTTGTCCTCTCACCGCTCTCACAAA
CTCCTCCAGGCGATATGTTTACCATGATGTGATTCGACTAGACGATGCTGCCAAGCTAATTGACTGTGCTTTTGTGCAATCGTACACTACGAATAGTGCGAAAGTGGTGT
TTCTGAAACAGAGGCCACAGACCAGAAACTTCAGAGGCTCCTCCGGCAACCTCTGCAGCACCTGCGACAGAAGCCTTCAAGACCCTTATCTCTTTTGCTCTCTCTCTTGC
AAGATTGATTATCTGGTAAAGACGCAAGGTGGGGTTTCGAGGCATCTTTTGGAGTGCAATTTCTTGCCATTGCCCGAACCGGCTTGGGACGACGGCTTCATTATGACGCC
TGACTCCGTTCTCGAGCCGACCGGTTCAACCCGAACCTCATCCAGTTCCGACGGCGGCGGCGGAGGGGCGGAGATTAGGACTGTGGCTTCGACGGCCACCACGGACTTTG
TGAAGAAGAAGAGAAGCAGCTTGACGGCGGCGTATCGGACAGCGTGCCGGCCGGTTTGCTGTTCGGCGACGGTTTCCGAGACGTCTGGGAGTTTGATGAGCCGGAGAAAA
GGTACCCCGCAGCGGGCCCCACTCTATTGA
Protein sequence
MPVCFMKTPKRTKKTSSVSTVPSVFALTACPLTALTNSSRRYVYHDVIRLDDAAKLIDCAFVQSYTTNSAKVVFLKQRPQTRNFRGSSGNLCSTCDRSLQDPYLFCSLSC
KIDYLVKTQGGVSRHLLECNFLPLPEPAWDDGFIMTPDSVLEPTGSTRTSSSSDGGGGGAEIRTVASTATTDFVKKKRSSLTAAYRTACRPVCCSATVSETSGSLMSRRK
GTPQRAPLY
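
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; `translate` is a hypothetical helper, not a CuGenDB tool, and only the first 21 nucleotides of the CDS are used in the example call.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the CDS sequence shown above.
print(translate("ATGCCTGTGTGCTTCATGAAG"))  # prints MPVCFMK
```

Passing the full 690-nt CDS to the same function should give a 229-residue protein that can be compared with the protein sequence shown on this page.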