; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018542 (gene) of Snake gourd v1 genome

Gene IDTan0018542
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG04:7269734..7272302
RNA-Seq ExpressionTan0018542
SyntenyTan0018542
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601286.1 hypothetical protein SDJN03_06519, partial [Cucurbita argyrosperma subsp. sororia]6.1e-9192.78Show/hide
Query:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFL
        MNHCAILSNAFS HEEMR  VPGPISDLRDQ+VCPKPRRL N KVTV GHADSSLRWNLSHQVEQIDMA GPDLLDFLLT+GGCSVDQSFTQLASSPPFL
Subjt:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFL

Query:  CGSPPSRVANPLIQDARFGDEKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA
        CGSPPSRVANPLIQDARFGDEKFIPFAPIASP+GQLSPST ASRKG RVRASFGNKPTVRIEGFDC DRDRQNCSIPAFA
Subjt:  CGSPPSRVANPLIQDARFGDEKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA

XP_022150691.1 uncharacterized protein LOC111018762 [Momordica charantia]6.1e-9192.82Show/hide
Query:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFL
        MNHCAILSN FSGHEEMRTSVPGPISDLRDQ+VCPKPRRL NLKVTVNGHAD+SLRWNL HQVEQIDMAAGPDLLDFLLTK GCSVDQSFTQLASSPPFL
Subjt:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFL

Query:  CGSPPSRVANPLIQDARFGDEKFIPFAPI-ASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA
        CGSPPSRVANPLIQDARFGDEKFIPFAPI ASP+ QLSPST ASRKG RVRA+FGNKP VRIEGFDCLDRDRQNCSIPAFA
Subjt:  CGSPPSRVANPLIQDARFGDEKFIPFAPI-ASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA

XP_022957437.1 uncharacterized protein LOC111458833 [Cucurbita moschata]6.1e-9192.78Show/hide
Query:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFL
        MNHCAILSNAFS HEEMR  VPGPISDLRDQ+VCPKPRRL N KVTV GHADSSLRWNLSHQVEQIDMA GPDLLDFLLT+GGCSVDQSFTQLASSPPFL
Subjt:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFL

Query:  CGSPPSRVANPLIQDARFGDEKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA
        CGSPPSRVANPLIQDARFGDEKFIPFAPIASP+GQLSPST ASRKG RVRASFGNKPTVRIEGFDC DRDRQNCSIPAFA
Subjt:  CGSPPSRVANPLIQDARFGDEKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA

XP_022986983.1 uncharacterized protein LOC111484540 [Cucurbita maxima]4.0e-9092.22Show/hide
Query:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFL
        MNHCAILSNAFS HEEMR  VPGPISDLRDQ+VCPKPRRL N KVTV GHADSSLRWNLSHQVEQIDMA GPDLLDFLLT+GGCSVDQSFTQLASSPPFL
Subjt:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFL

Query:  CGSPPSRVANPLIQDARFGDEKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA
        CGSPPSRVANPLIQDARFGDEKFIP APIASP+GQLSPST ASRKG RVRASFGNKPTVRIEGFDC DRDRQNCSIPAFA
Subjt:  CGSPPSRVANPLIQDARFGDEKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA

XP_023517111.1 uncharacterized protein LOC111780963 [Cucurbita pepo subsp. pepo]5.2e-9091.67Show/hide
Query:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFL
        MNHCAILSNAFS HEEMR  VPGP SDLRDQ+VCPKPRRL N KVTV GHADSSLRWNLSHQVEQIDM  GPDLLDFLLT+GGCSVDQSFTQLASSPPFL
Subjt:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFL

Query:  CGSPPSRVANPLIQDARFGDEKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA
        CGSPPSRVANPLIQDARFGDEKFIPFAPIASP+GQLSPST ASRKG RVRASFGNKPTVRIEGFDC DRDRQNCSIPAFA
Subjt:  CGSPPSRVANPLIQDARFGDEKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA

TrEMBL top hitse value%identityAlignment
A0A1S3BG95 uncharacterized protein LOC1034892879.8e-8789.44Show/hide
Query:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFL
        MNHCAILSNAFSGHEEMRTSVP PISD RDQ+VCPKPRRL     TVN H+D+SLRWNLSHQVE IDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFL
Subjt:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFL

Query:  CGSPPSRVANPLIQDARFGDEKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA
        CGSPPSRVANPLIQDARF +EKFIPF PIASP+GQLSPST +SRKG RVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA
Subjt:  CGSPPSRVANPLIQDARFGDEKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA

A0A5D3CB96 Uncharacterized protein9.8e-8789.44Show/hide
Query:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFL
        MNHCAILSNAFSGHEEMRTSVP PISD RDQ+VCPKPRRL     TVN H+D+SLRWNLSHQVE IDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFL
Subjt:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFL

Query:  CGSPPSRVANPLIQDARFGDEKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA
        CGSPPSRVANPLIQDARF +EKFIPF PIASP+GQLSPST +SRKG RVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA
Subjt:  CGSPPSRVANPLIQDARFGDEKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA

A0A6J1DC97 uncharacterized protein LOC1110187623.0e-9192.82Show/hide
Query:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFL
        MNHCAILSN FSGHEEMRTSVPGPISDLRDQ+VCPKPRRL NLKVTVNGHAD+SLRWNL HQVEQIDMAAGPDLLDFLLTK GCSVDQSFTQLASSPPFL
Subjt:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFL

Query:  CGSPPSRVANPLIQDARFGDEKFIPFAPI-ASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA
        CGSPPSRVANPLIQDARFGDEKFIPFAPI ASP+ QLSPST ASRKG RVRA+FGNKP VRIEGFDCLDRDRQNCSIPAFA
Subjt:  CGSPPSRVANPLIQDARFGDEKFIPFAPI-ASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA

A0A6J1GZ47 uncharacterized protein LOC1114588333.0e-9192.78Show/hide
Query:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFL
        MNHCAILSNAFS HEEMR  VPGPISDLRDQ+VCPKPRRL N KVTV GHADSSLRWNLSHQVEQIDMA GPDLLDFLLT+GGCSVDQSFTQLASSPPFL
Subjt:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFL

Query:  CGSPPSRVANPLIQDARFGDEKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA
        CGSPPSRVANPLIQDARFGDEKFIPFAPIASP+GQLSPST ASRKG RVRASFGNKPTVRIEGFDC DRDRQNCSIPAFA
Subjt:  CGSPPSRVANPLIQDARFGDEKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA

A0A6J1JI50 uncharacterized protein LOC1114845401.9e-9092.22Show/hide
Query:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFL
        MNHCAILSNAFS HEEMR  VPGPISDLRDQ+VCPKPRRL N KVTV GHADSSLRWNLSHQVEQIDMA GPDLLDFLLT+GGCSVDQSFTQLASSPPFL
Subjt:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFL

Query:  CGSPPSRVANPLIQDARFGDEKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA
        CGSPPSRVANPLIQDARFGDEKFIP APIASP+GQLSPST ASRKG RVRASFGNKPTVRIEGFDC DRDRQNCSIPAFA
Subjt:  CGSPPSRVANPLIQDARFGDEKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13390.1 unknown protein1.1e-3245.95Show/hide
Query:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTK-GGCSVDQSFTQLASSPP-
        MN C I  NAF   EEMR +    +SD RD V+CPKPRR+  L    N H+  SLRW L+HQ+E  +  +G ++LDF+LTK GG   +Q  T+   +PP 
Subjt:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTK-GGCSVDQSFTQLASSPP-

Query:  FLCGSPPSRVANPLIQDARFGDEKFIPFAPIASPAGQLSPSTAAS-RKGNRVRA--SFGNKPTVRIEGFDCLDRDRQNCSIPAFA
        F  GSPPSRV+NPL +D+ F +E  +  +P  S      P   +S R G+ V A  SFGN P VR+ GFDC DR   N SI   A
Subjt:  FLCGSPPSRVANPLIQDARFGDEKFIPFAPIASPAGQLSPSTAAS-RKGNRVRA--SFGNKPTVRIEGFDCLDRDRQNCSIPAFA

AT1G13390.2 unknown protein1.1e-3245.95Show/hide
Query:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTK-GGCSVDQSFTQLASSPP-
        MN C I  NAF   EEMR +    +SD RD V+CPKPRR+  L    N H+  SLRW L+HQ+E  +  +G ++LDF+LTK GG   +Q  T+   +PP 
Subjt:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTK-GGCSVDQSFTQLASSPP-

Query:  FLCGSPPSRVANPLIQDARFGDEKFIPFAPIASPAGQLSPSTAAS-RKGNRVRA--SFGNKPTVRIEGFDCLDRDRQNCSIPAFA
        F  GSPPSRV+NPL +D+ F +E  +  +P  S      P   +S R G+ V A  SFGN P VR+ GFDC DR   N SI   A
Subjt:  FLCGSPPSRVANPLIQDARFGDEKFIPFAPIASPAGQLSPSTAAS-RKGNRVRA--SFGNKPTVRIEGFDCLDRDRQNCSIPAFA

AT1G68490.1 unknown protein4.0e-4050.27Show/hide
Query:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRR--LRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSP-
        MNH A+  NAF+   ++R+S    +   +  VVCPKPRR  LRN     + H   SLR   SHQ+E  +  A  D+LD +LTK G   +Q   Q+  SP 
Subjt:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRR--LRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSP-

Query:  PFLCGSPPSRVANPLIQDARFGDEKFIPFAPIASPAG---QLSPSTAASRKGN-RVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA
        PFLCGSPPSRVANPL QDARF DE     + I    G     SPS+++ RKG   VR +FGN P VR+EGFDCLDRD +NCSIPA A
Subjt:  PFLCGSPPSRVANPLIQDARFGDEKFIPFAPIASPAG---QLSPSTAASRKGN-RVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA

AT3G02555.1 unknown protein1.4e-3248.62Show/hide
Query:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFL
        MNHC++  NAF   EE R  VP   S   D VVCPKPRR  N+           L ++LS   +  D  AG DLLD    K   S        + SPPF 
Subjt:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFL

Query:  CGSPPSRVANPLIQDARFGDEKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKP-TVRIEGFDCLDRDRQNCSIPAFA
         GSPPSR ANPL QDARFGDEK    +P  SP   L PS +  + G   R  FG KP TVR+EGFDCL+RDR N SIPA A
Subjt:  CGSPPSRVANPLIQDARFGDEKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKP-TVRIEGFDCLDRDRQNCSIPAFA

AT5G16110.1 unknown protein1.2e-3145.55Show/hide
Query:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQI-DMAAGPDLLDFLLTK-GGCSVDQSFTQLASSPP
        MNHC +  NAF   EEM         D +D VVCPKPRR+  L      +    LR ++S     + D  AG +LL+ +  K    ++ Q    L+SSPP
Subjt:  MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQI-DMAAGPDLLDFLLTK-GGCSVDQSFTQLASSPP

Query:  FLCGSPPSRVANPLIQDARFGDEKFIPFAPIA------SPAGQLSPSTAASRKGNR--VRASFG-NKPTVRIEGFDCLDRDRQNCSIPAFA
        +  GSPPSR ANPL QDARF DEK  P +P +      S  G  SPS+++S   +R  VR  FG N P VR+EGFDCL+RDRQN SIPA A
Subjt:  FLCGSPPSRVANPLIQDARFGDEKFIPFAPIA------SPAGQLSPSTAASRKGNR--VRASFG-NKPTVRIEGFDCLDRDRQNCSIPAFA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCACTGCGCCATTCTGTCAAACGCCTTCTCGGGCCACGAGGAGATGAGAACCTCTGTTCCGGGTCCCATTTCTGACCTCAGAGATCAAGTTGTTTGCCCTAAACC
ACGGCGTCTTAGGAATCTTAAGGTCACCGTCAACGGCCACGCCGATAGCTCTCTCCGGTGGAATCTCAGTCACCAAGTAGAGCAAATTGACATGGCAGCCGGACCGGATC
TGCTTGACTTCCTCCTGACAAAAGGTGGTTGCAGCGTGGACCAATCGTTTACGCAGTTGGCTTCGTCGCCCCCTTTTTTATGTGGGTCTCCGCCGAGCAGAGTAGCCAAC
CCATTGATTCAGGACGCCCGATTTGGGGATGAAAAATTCATCCCCTTTGCACCGATTGCTTCACCGGCGGGTCAGTTGTCGCCCTCTACAGCTGCTTCCAGGAAAGGAAA
CCGCGTAAGGGCGAGTTTTGGGAACAAACCAACGGTGAGGATTGAGGGTTTCGATTGCCTTGACAGGGATAGGCAGAATTGCAGCATCCCTGCCTTCGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATATTAAAAAATGGAAATATAAATTTAAAAACCCAAATTCCGAAGTGAAGACTCGGCCCGAGCGAGTTGGGAACCGTCGCTGGTCTTCGTCTTCCTCTCCTTCATCCGCC
TTATCTCTCTATTCTCTCTCTCTCTGTTCTTATTTTTTGTTCTGCTATTCTGATTGCTTTGTCTCCCGCTTTGTAAACAGCCTTTTCTCCTTTTTACTCTTTTTCTCTTT
ATATTTCTGCGTAACAGTTTCCCCGTTTACGCCCACCACATCCAAGGTTTTGCTTCTTTCATTTTTGTATATAATACTGCTCAACTTTTTCGAAGTCTCATACTCAACTG
CTGCCGCTTAATTTTGAGCTTTATTTGTCCCTTCAAATAACCGAAGACGAATATGAATCACTGCGCCATTCTGTCAAACGCCTTCTCGGGCCACGAGGAGATGAGAACCT
CTGTTCCGGGTCCCATTTCTGACCTCAGAGATCAAGTTGTTTGCCCTAAACCACGGCGTCTTAGGAATCTTAAGGTCACCGTCAACGGCCACGCCGATAGCTCTCTCCGG
TGGAATCTCAGTCACCAAGTAGAGCAAATTGACATGGCAGCCGGACCGGATCTGCTTGACTTCCTCCTGACAAAAGGTGGTTGCAGCGTGGACCAATCGTTTACGCAGTT
GGCTTCGTCGCCCCCTTTTTTATGTGGGTCTCCGCCGAGCAGAGTAGCCAACCCATTGATTCAGGACGCCCGATTTGGGGATGAAAAATTCATCCCCTTTGCACCGATTG
CTTCACCGGCGGGTCAGTTGTCGCCCTCTACAGCTGCTTCCAGGAAAGGAAACCGCGTAAGGGCGAGTTTTGGGAACAAACCAACGGTGAGGATTGAGGGTTTCGATTGC
CTTGACAGGGATAGGCAGAATTGCAGCATCCCTGCCTTCGCTTAGAAACCCATCTCAATCAATATCAAATCCATAGAGTAATACAAGACAAGCATATTATAGTTTCAAGA
GAAGCAAGATTCCCAATTACAAAACAATATCGAGGGTCTCTCAAGAGAAAGAAAAGGATACACAATTTGCCCTTTTTTTTGTGAATACGTTTGAAGTGTTCTTTGTAAAT
GTAAATAAAAATCAATGCCCATGTATACTAAAGCTGTGTATTTTTTAGGCTTTTTGAGTCGATGGAGAGCAAACGATGCGTGGATGGTCAACCTCTTGTATTATATTTTT
GTAATTAGAAGTCTTAAGGTTGAGAGAATTTTCATTTGAAGTTGAAAGTAAGAGAGAAAGAGAGAGAGAGAGGCAATCTGTACATATTGCAGAACTTGAAATGGAGTCTG
TAGAAGTGATCAACATGTGAATAATTGTAGAACAATTTCTTATTGTGAAGTTTTGTTCATCTTTCTTTCTGGGTTTGAATTTGGGTTTGGAAGTTTGCTTGCTTAAGCAT
AACGCTAGCGATGGATCCATTAGATTTGCTTTTTCAGGGCATCTTCTTCATGGAATCGCCGACCTCTCGGCTTTTGCTAAAGCAACTTTGCTGGTGGAATTAATACCTCT
TTCTTTGTCTCTCCTCTCTGTTCAAAGCTACTTTGTTGCAGAGCTTGTCTCTTTATTGGAGTGTTGATTTCTGAATTCTGACACAATTTTAT
Protein sequenceShow/hide protein sequence
MNHCAILSNAFSGHEEMRTSVPGPISDLRDQVVCPKPRRLRNLKVTVNGHADSSLRWNLSHQVEQIDMAAGPDLLDFLLTKGGCSVDQSFTQLASSPPFLCGSPPSRVAN
PLIQDARFGDEKFIPFAPIASPAGQLSPSTAASRKGNRVRASFGNKPTVRIEGFDCLDRDRQNCSIPAFA