; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020132 (gene) of Snake gourd v1 genome

Gene IDTan0020132
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionVQ motif-containing protein 22-like
Genome locationLG01:106783002..106784159
RNA-Seq ExpressionTan0020132
SyntenyTan0020132
Gene Ontology termsNA
InterPro domainsIPR008889 - VQ
IPR039609 - VQ motif-containing protein 15/22


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608003.1 VQ motif-containing protein 22, partial [Cucurbita argyrosperma subsp. sororia]4.8e-6774.64Show/hide
Query:  MADQWLHMYHQNHLSNNQQLPFHLSDHDQ---DNNSGLVSDSTVVTAAV----TPPASSGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQQ
        MADQW+HMYHQN   NNQQL F+LSDH Q   D+NSG+VSDSTVVTAAV    T P SSGL PDGGRV KPVRKR+RASRRTPTTL NTDAANFRAMVQQ
Subjt:  MADQWLHMYHQNHLSNNQQLPFHLSDHDQ---DNNSGLVSDSTVVTAAV----TPPASSGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQQ

Query:  FTGGPSNANNASSNFAAFGFPGNSNTVFDPTVAAYHLAAPPQQPPIFQFQNQ-----PLQQPPFMFSLGNSAPGSEAFFQQTQG---HNSNVNYMGLFDG
        FTGGPS   N +SNF+ FGFP NSN VFDPTVAAYHL A    PP+FQFQNQ     PLQQPPFMFSLGNSA  SE FFQQ  G   ++SNVNYMGLFDG
Subjt:  FTGGPSNANNASSNFAAFGFPGNSNTVFDPTVAAYHLAAPPQQPPIFQFQNQ-----PLQQPPFMFSLGNSAPGSEAFFQQTQG---HNSNVNYMGLFDG

Query:  SSPAAQNPR
        SSP AQNPR
Subjt:  SSPAAQNPR

XP_022940097.1 VQ motif-containing protein 22-like [Cucurbita moschata]8.2e-6773.81Show/hide
Query:  MADQWLHMYHQNHLSNNQQLPFHLSDHDQ---DNNSGLVSDSTVVTAAV----TPPASSGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQQ
        MADQW+HMYHQN   NNQQL F+LSDH Q   D+NSG+VSDSTVVTAA+    T P SSGL PDGGRV KPVRKR+RASRRTPTTL NTDAANFRAMVQQ
Subjt:  MADQWLHMYHQNHLSNNQQLPFHLSDHDQ---DNNSGLVSDSTVVTAAV----TPPASSGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQQ

Query:  FTGGPSNANNASSNFAAFGFPGNSNTVFDPTVAAYHLAAPPQQPPIFQFQNQ------PLQQPPFMFSLGNSAPGSEAFFQQTQG---HNSNVNYMGLFD
        FTGGPS   N +SNF+ FGFP NSN VFDPTVAAYHL A    PP+FQFQNQ      PLQQPPFMFSLGNSA  SE FFQQ  G   ++SNVNYMGLFD
Subjt:  FTGGPSNANNASSNFAAFGFPGNSNTVFDPTVAAYHLAAPPQQPPIFQFQNQ------PLQQPPFMFSLGNSAPGSEAFFQQTQG---HNSNVNYMGLFD

Query:  GSSPAAQNPR
        GSSP AQNPR
Subjt:  GSSPAAQNPR

XP_022981901.1 VQ motif-containing protein 22-like [Cucurbita maxima]2.6e-6572.73Show/hide
Query:  MADQWLHMYHQNHLSNNQQLPFHLSDHDQ---DNNSGLVSDSTVVTAAV----TPPASSGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQQ
        MADQW+HMYHQ++LSNNQQL F+LSDH Q   D+NSG+VSDSTVVTAAV    T P SSGL PDGGRV KPVRKR+RASRRTPTTL NTDAANFRAMVQQ
Subjt:  MADQWLHMYHQNHLSNNQQLPFHLSDHDQ---DNNSGLVSDSTVVTAAV----TPPASSGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQQ

Query:  FTGGPSNANNASSNFAAFGFPGNSNTVFDPTVAAYHLAAPPQQPPIFQFQNQ-----PLQQPPFMFSLGNSAPGSEAFFQQTQG---HNSNVNYMGLFDG
        FTGGPS   N +SNF+ FGF  NSN VFDP+VAAY+L A    PP+FQFQNQ     PLQQ PFMFSLGNSA  SE FFQQ  G   ++SNVNY GLFDG
Subjt:  FTGGPSNANNASSNFAAFGFPGNSNTVFDPTVAAYHLAAPPQQPPIFQFQNQ-----PLQQPPFMFSLGNSAPGSEAFFQQTQG---HNSNVNYMGLFDG

Query:  SSPAAQNPR
        SSP AQNPR
Subjt:  SSPAAQNPR

XP_023525782.1 VQ motif-containing protein 22-like [Cucurbita pepo subsp. pepo]3.1e-6674.16Show/hide
Query:  MADQWLHMYHQNHLSNNQQLPFHLSDHDQ---DNNSGLVSDSTVVTAAV----TPPASSGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQQ
        MADQW+ MYHQN   NNQQL F+LSDH Q   D+NSG+VSDSTVVTAAV    T P SSGL PDGGRV KPVRKR+RASRRTPTTL NTDAANFRAMVQQ
Subjt:  MADQWLHMYHQNHLSNNQQLPFHLSDHDQ---DNNSGLVSDSTVVTAAV----TPPASSGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQQ

Query:  FTGGPSNANNASSNFAAFGFPGNSNTVFDPTVAAYHLAAPPQQPPIFQFQNQ-----PLQQPPFMFSLGNSAPGSEAFFQQTQG---HNSNVNYMGLFDG
        FTGGPS   N +SNF+ FGFP NSN VFDPTVAAYHL A    PP+FQFQNQ     PLQQPPFMFSLGNSA  SE FFQQ  G   ++SNVNYMGLFDG
Subjt:  FTGGPSNANNASSNFAAFGFPGNSNTVFDPTVAAYHLAAPPQQPPIFQFQNQ-----PLQQPPFMFSLGNSAPGSEAFFQQTQG---HNSNVNYMGLFDG

Query:  SSPAAQNPR
        SSP AQNPR
Subjt:  SSPAAQNPR

XP_038898930.1 VQ motif-containing protein 22-like [Benincasa hispida]6.3e-6771.43Show/hide
Query:  MADQWLHMYHQNHLSNNQQLPFHLSDHDQ---DNNSGLVSDSTVVTAAV----TPPASSGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQQ
        M+++WLHMYHQN+LSN  QLPFHLSDHDQ   +NNS  VSDSTVVTAAV    TPPAS+GL+PDG R VKPVRKRSRASRRTPTT+LNTDAANFRAMVQQ
Subjt:  MADQWLHMYHQNHLSNNQQLPFHLSDHDQ---DNNSGLVSDSTVVTAAV----TPPASSGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQQ

Query:  FTGGPSNANNASSNFAAFGFPGNSNTVFDPTVAAYHLAAPPQQPPIFQFQNQ--PLQQPPFMFSLGNSAPGSEAFFQQTQG------HNSNVNYMGLFDG
        FTGGPSNANN   NF  FGFPGN   +FDP  AAYHL  PPQQP +FQFQ+Q  PLQQPPFMFSL NSA G+ AFFQQ  G      +++N+NYM +FDG
Subjt:  FTGGPSNANNASSNFAAFGFPGNSNTVFDPTVAAYHLAAPPQQPPIFQFQNQ--PLQQPPFMFSLGNSAPGSEAFFQQTQG------HNSNVNYMGLFDG

Query:  SSPAAQNPRP
        SSP  QN RP
Subjt:  SSPAAQNPRP

TrEMBL top hitse value%identityAlignment
A0A0A0L4D1 VQ domain-containing protein1.9e-5367.51Show/hide
Query:  MADQWLHMYHQNHLSNNQQLPFHLSDHD---QDNNSGLVSDSTVVTAAV----TPPA-SSGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQ
        MA+QWL+MY+QN+LSN  QLPFHLS HD    +NNS +VSDSTVVT AV    TPPA SSGL+PDG RV KPVRKRSRASRRTPTTLLNTDAANFRAMVQ
Subjt:  MADQWLHMYHQNHLSNNQQLPFHLSDHD---QDNNSGLVSDSTVVTAAV----TPPA-SSGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQ

Query:  QFTGGPSNANNASSNFAAFGFPGNSNTVFDP-TVAAYHLAAPPQQPPIFQFQNQPLQQPPFMFSLGNSAPGSEAFFQQTQG-----HNSNVNYMGLF
        QFTGGPSNANN   NFA FGFP NS T+FDP + AAY +  P QQ  + QFQNQP   PP MFSL NS  G +AF+QQ  G     +N+N+NY G F
Subjt:  QFTGGPSNANNASSNFAAFGFPGNSNTVFDP-TVAAYHLAAPPQQPPIFQFQNQPLQQPPFMFSLGNSAPGSEAFFQQTQG-----HNSNVNYMGLF

A0A1S3CM56 VQ motif-containing protein 22-like4.3e-5367.16Show/hide
Query:  MADQWLHMYHQNHLSNNQQLPFHLSDHD---QDNNSGLVSDSTVVTAAV----TPPA-SSGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQ
        MA+QWL+MY+QN+L N  QLPFHLSDHD    +NNS +VSDSTVVT AV    TPPA SSGL+PDG RV KPVRKRSRASRRTPTTLLNTDAANFRAMVQ
Subjt:  MADQWLHMYHQNHLSNNQQLPFHLSDHD---QDNNSGLVSDSTVVTAAV----TPPA-SSGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQ

Query:  QFTGGPSNANNASSNFAAFGFPGNSNTVFDP-TVAAYHLAAPP--QQPPIFQFQNQPLQQPPFMFSLGNSAPGSEAFFQQTQG-------HNSNVNYMGL
        QFTGGPSNAN+   NFA FGFP NS T++DP + AAY +  PP  QQ  +FQFQNQP   PP MFSL NS  G +AFFQQ  G       +N+N NYMG 
Subjt:  QFTGGPSNANNASSNFAAFGFPGNSNTVFDP-TVAAYHLAAPP--QQPPIFQFQNQPLQQPPFMFSLGNSAPGSEAFFQQTQG-------HNSNVNYMGL

Query:  F
        F
Subjt:  F

A0A5A7V6Y2 VQ motif-containing protein 22-like4.3e-5367.16Show/hide
Query:  MADQWLHMYHQNHLSNNQQLPFHLSDHD---QDNNSGLVSDSTVVTAAV----TPPA-SSGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQ
        MA+QWL+MY+QN+L N  QLPFHLSDHD    +NNS +VSDSTVVT AV    TPPA SSGL+PDG RV KPVRKRSRASRRTPTTLLNTDAANFRAMVQ
Subjt:  MADQWLHMYHQNHLSNNQQLPFHLSDHD---QDNNSGLVSDSTVVTAAV----TPPA-SSGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQ

Query:  QFTGGPSNANNASSNFAAFGFPGNSNTVFDP-TVAAYHLAAPP--QQPPIFQFQNQPLQQPPFMFSLGNSAPGSEAFFQQTQG-------HNSNVNYMGL
        QFTGGPSNAN+   NFA FGFP NS T++DP + AAY +  PP  QQ  +FQFQNQP   PP MFSL NS  G +AFFQQ  G       +N+N NYMG 
Subjt:  QFTGGPSNANNASSNFAAFGFPGNSNTVFDP-TVAAYHLAAPP--QQPPIFQFQNQPLQQPPFMFSLGNSAPGSEAFFQQTQG-------HNSNVNYMGL

Query:  F
        F
Subjt:  F

A0A6J1FIN8 VQ motif-containing protein 22-like4.0e-6773.81Show/hide
Query:  MADQWLHMYHQNHLSNNQQLPFHLSDHDQ---DNNSGLVSDSTVVTAAV----TPPASSGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQQ
        MADQW+HMYHQN   NNQQL F+LSDH Q   D+NSG+VSDSTVVTAA+    T P SSGL PDGGRV KPVRKR+RASRRTPTTL NTDAANFRAMVQQ
Subjt:  MADQWLHMYHQNHLSNNQQLPFHLSDHDQ---DNNSGLVSDSTVVTAAV----TPPASSGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQQ

Query:  FTGGPSNANNASSNFAAFGFPGNSNTVFDPTVAAYHLAAPPQQPPIFQFQNQ------PLQQPPFMFSLGNSAPGSEAFFQQTQG---HNSNVNYMGLFD
        FTGGPS   N +SNF+ FGFP NSN VFDPTVAAYHL A    PP+FQFQNQ      PLQQPPFMFSLGNSA  SE FFQQ  G   ++SNVNYMGLFD
Subjt:  FTGGPSNANNASSNFAAFGFPGNSNTVFDPTVAAYHLAAPPQQPPIFQFQNQ------PLQQPPFMFSLGNSAPGSEAFFQQTQG---HNSNVNYMGLFD

Query:  GSSPAAQNPR
        GSSP AQNPR
Subjt:  GSSPAAQNPR

A0A6J1IXU5 VQ motif-containing protein 22-like1.3e-6572.73Show/hide
Query:  MADQWLHMYHQNHLSNNQQLPFHLSDHDQ---DNNSGLVSDSTVVTAAV----TPPASSGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQQ
        MADQW+HMYHQ++LSNNQQL F+LSDH Q   D+NSG+VSDSTVVTAAV    T P SSGL PDGGRV KPVRKR+RASRRTPTTL NTDAANFRAMVQQ
Subjt:  MADQWLHMYHQNHLSNNQQLPFHLSDHDQ---DNNSGLVSDSTVVTAAV----TPPASSGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQQ

Query:  FTGGPSNANNASSNFAAFGFPGNSNTVFDPTVAAYHLAAPPQQPPIFQFQNQ-----PLQQPPFMFSLGNSAPGSEAFFQQTQG---HNSNVNYMGLFDG
        FTGGPS   N +SNF+ FGF  NSN VFDP+VAAY+L A    PP+FQFQNQ     PLQQ PFMFSLGNSA  SE FFQQ  G   ++SNVNY GLFDG
Subjt:  FTGGPSNANNASSNFAAFGFPGNSNTVFDPTVAAYHLAAPPQQPPIFQFQNQ-----PLQQPPFMFSLGNSAPGSEAFFQQTQG---HNSNVNYMGLFDG

Query:  SSPAAQNPR
        SSP AQNPR
Subjt:  SSPAAQNPR

SwissProt top hitse value%identityAlignment
Q9LIE6 VQ motif-containing protein 223.9e-1943.82Show/hide
Query:  DNNSGLVSDSTVVTAAVTPPAS-------SGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQQFTGGPSNANNASSNFAAFGFPGNSNTVFD
        +NN    + ST  + AVT   +       S LSP+ GRV KP R+RSRASRRTPTTLLNTD +NFRAMVQQ+TGGPS          AFG  GN+ + F 
Subjt:  DNNSGLVSDSTVVTAAVTPPAS-------SGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQQFTGGPSNANNASSNFAAFGFPGNSNTVFD

Query:  PTVAAYHLAAPPQQPP-IFQFQ-NQPLQQP--PFMFSLGNSAPGSEAFFQQTQGHNSNVNYMGLF---DGSSPAAQNP
         T ++   A   QQ P  + FQ + PLQ P  P+MFSL N  P        +  +N N    G+F   DGS      P
Subjt:  PTVAAYHLAAPPQQPP-IFQFQ-NQPLQQP--PFMFSLGNSAPGSEAFFQQTQGHNSNVNYMGLF---DGSSPAAQNP

Arabidopsis top hitse value%identityAlignment
AT1G35830.1 VQ motif-containing protein1.6e-0756.6Show/hide
Query:  PPASSGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQQFTGGPSN
        PP SS  +P         RKR+RASRR PTT+L TD +NFRAMVQ+FTG P++
Subjt:  PPASSGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQQFTGGPSN

AT3G22160.1 VQ motif-containing protein2.8e-2043.82Show/hide
Query:  DNNSGLVSDSTVVTAAVTPPAS-------SGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQQFTGGPSNANNASSNFAAFGFPGNSNTVFD
        +NN    + ST  + AVT   +       S LSP+ GRV KP R+RSRASRRTPTTLLNTD +NFRAMVQQ+TGGPS          AFG  GN+ + F 
Subjt:  DNNSGLVSDSTVVTAAVTPPAS-------SGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQQFTGGPSNANNASSNFAAFGFPGNSNTVFD

Query:  PTVAAYHLAAPPQQPP-IFQFQ-NQPLQQP--PFMFSLGNSAPGSEAFFQQTQGHNSNVNYMGLF---DGSSPAAQNP
         T ++   A   QQ P  + FQ + PLQ P  P+MFSL N  P        +  +N N    G+F   DGS      P
Subjt:  PTVAAYHLAAPPQQPP-IFQFQ-NQPLQQP--PFMFSLGNSAPGSEAFFQQTQGHNSNVNYMGLF---DGSSPAAQNP

AT4G15120.1 VQ motif-containing protein1.3e-1752.38Show/hide
Query:  STVVTAAVT-PPASSGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQQFTGGPSNANNASSNFAAFGFPGNSNTVFDPTVAAYHLAAPPQQP
        STV T   T   A S LSPD  RV KP R+RSRASRRTPTTL NTD ANFRAMVQQFTGGPS     SS  + F     S T  DPT     +++ P Q 
Subjt:  STVVTAAVT-PPASSGLSPDGGRVVKPVRKRSRASRRTPTTLLNTDAANFRAMVQQFTGGPSNANNASSNFAAFGFPGNSNTVFDPTVAAYHLAAPPQQP

Query:  PIFQFQ---NQPLQQP-PFMFSLGNS
           Q Q   N+ +QQ  P+MFS  N+
Subjt:  PIFQFQ---NQPLQQP-PFMFSLGNS

AT4G39720.1 VQ motif-containing protein4.1e-0832.54Show/hide
Query:  YHQNHLSN---NQQLPFHLSDHDQDNNSGLV-----SDSTVVTAAVTP-----------PASSGLSPDGG-RVVKPVRKRSRASRRTPTTLLNTDAANFR
        +H NH+S+    QQ   +L   D +NN+ L+     +++T +     P            A+S L P     V+K  +KRSRASRR PTT+L TD +NFR
Subjt:  YHQNHLSN---NQQLPFHLSDHDQDNNSGLV-----SDSTVVTAAVTP-----------PASSGLSPDGG-RVVKPVRKRSRASRRTPTTLLNTDAANFR

Query:  AMVQQFTGGPSNANNASSNFAAFGFPGNSNTVFDPTVAAYHLAAPPQQPPIFQFQNQPLQQPPFMFSLGNSAP---GSEAFFQQTQGHNSNVNYMGLFDG
        AMVQ+FTG P     A   F       N+N++ + T     L      P  +   N  L+  PF   L  + P   GS+   QQ Q  N+    M L   
Subjt:  AMVQQFTGGPSNANNASSNFAAFGFPGNSNTVFDPTVAAYHLAAPPQQPPIFQFQNQPLQQPPFMFSLGNSAP---GSEAFFQQTQGHNSNVNYMGLFDG

Query:  SSPAAQNPR
              NPR
Subjt:  SSPAAQNPR

AT5G65170.1 VQ motif-containing protein7.8e-0738.37Show/hide
Query:  RKRSRASRRTPTTLLNTDAANFRAMVQQFTGGPS------NANNASSNFAAFGFPGNSNT---------VFDPTVAAYHLAAPPQQ
        +KRSR SRR PTT+L TD +NFRAMVQ+FTG PS      +++   S F  FG   +S++         +  P+   +H   P  +
Subjt:  RKRSRASRRTPTTLLNTDAANFRAMVQQFTGGPS------NANNASSNFAAFGFPGNSNT---------VFDPTVAAYHLAAPPQQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCAATTTCCCCCTTGAAACCTTTGAAAATTACACACACACCCCCCATGAATCTATATATATACATTTTCCCTCTCGCCTAAACCACCTTCTAATTCTTCTTTCTTC
CTTTCTTTCTAATTCTTACTTCCAAATGGCCGATCAATGGCTTCATATGTATCACCAAAACCACCTCTCAAACAATCAACAACTCCCCTTTCACCTTTCCGATCACGATC
AAGACAACAATTCCGGGTTGGTTTCTGATTCTACCGTCGTCACTGCCGCCGTCACCCCACCGGCCTCATCCGGCTTGAGCCCAGATGGGGGACGAGTCGTCAAGCCCGTC
CGCAAGCGGTCCAGAGCCTCCCGACGGACCCCAACCACTTTACTCAACACGGACGCTGCCAATTTCCGAGCCATGGTCCAGCAATTCACCGGTGGGCCCTCCAACGCCAA
TAACGCCTCCTCCAATTTTGCCGCATTCGGCTTCCCCGGTAATTCCAACACCGTTTTCGACCCCACCGTCGCCGCCTACCACCTCGCGGCTCCGCCGCAGCAGCCGCCCA
TATTCCAATTCCAAAACCAACCGCTGCAGCAACCGCCGTTTATGTTCTCGTTGGGAAATTCGGCGCCCGGGAGTGAGGCGTTTTTTCAACAAACGCAGGGTCATAATAGT
AATGTTAATTACATGGGGCTTTTTGATGGGTCTTCTCCGGCGGCTCAGAATCCTCGGCCGGCGTGA
mRNA sequenceShow/hide mRNA sequence
CTCCTTTTACGACCCCCAATTAAATCCCAATGGTCAATTTCCCCCTTGAAACCTTTGAAAATTACACACACACCCCCCATGAATCTATATATATACATTTTCCCTCTCGC
CTAAACCACCTTCTAATTCTTCTTTCTTCCTTTCTTTCTAATTCTTACTTCCAAATGGCCGATCAATGGCTTCATATGTATCACCAAAACCACCTCTCAAACAATCAACA
ACTCCCCTTTCACCTTTCCGATCACGATCAAGACAACAATTCCGGGTTGGTTTCTGATTCTACCGTCGTCACTGCCGCCGTCACCCCACCGGCCTCATCCGGCTTGAGCC
CAGATGGGGGACGAGTCGTCAAGCCCGTCCGCAAGCGGTCCAGAGCCTCCCGACGGACCCCAACCACTTTACTCAACACGGACGCTGCCAATTTCCGAGCCATGGTCCAG
CAATTCACCGGTGGGCCCTCCAACGCCAATAACGCCTCCTCCAATTTTGCCGCATTCGGCTTCCCCGGTAATTCCAACACCGTTTTCGACCCCACCGTCGCCGCCTACCA
CCTCGCGGCTCCGCCGCAGCAGCCGCCCATATTCCAATTCCAAAACCAACCGCTGCAGCAACCGCCGTTTATGTTCTCGTTGGGAAATTCGGCGCCCGGGAGTGAGGCGT
TTTTTCAACAAACGCAGGGTCATAATAGTAATGTTAATTACATGGGGCTTTTTGATGGGTCTTCTCCGGCGGCTCAGAATCCTCGGCCGGCGTGACTCCGGCGCGGCGGC
GGAATTCGATGGTAGTTTCTTTTTTCTTTTTCTTCTTTTTTTTTTTTTTTTTTTTTTGGGAAATGGGGTTTAAAAGAAGAAGTTAGGCACCGAAGTCGCCTACTGACAAA
TTCTTCAAAGATGACATTTTTTATTTTGACCACGGTGTACATGTGTCGGAAAGTTGGTGCGTGCGGCACGTCTCTGCTTCGCACGTGTGAACTCTTTTCTTTTTTCGTTT
TCTTTTTCTTTTTTGAGTTCCACGTGTGAACTCTTTTCAATCGCCACGTTCCATTTCCACCCACCGCATGCCGTGTTTGGAACCCCTTTGTTAATTATTTAGCGTTATTT
TTTATTATTATTGTTATAATGTATTTTTATTTTTTATTTTCTGACTCCACTCTCCGAC
Protein sequenceShow/hide protein sequence
MVNFPLETFENYTHTPHESIYIHFPSRLNHLLILLSSFLSNSYFQMADQWLHMYHQNHLSNNQQLPFHLSDHDQDNNSGLVSDSTVVTAAVTPPASSGLSPDGGRVVKPV
RKRSRASRRTPTTLLNTDAANFRAMVQQFTGGPSNANNASSNFAAFGFPGNSNTVFDPTVAAYHLAAPPQQPPIFQFQNQPLQQPPFMFSLGNSAPGSEAFFQQTQGHNS
NVNYMGLFDGSSPAAQNPRPA