; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006438 (gene) of Snake gourd v1 genome

Gene IDTan0006438
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG10:6931197..6935711
RNA-Seq ExpressionTan0006438
SyntenyTan0006438
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138115.1 uncharacterized protein LOC111009363 isoform X1 [Momordica charantia]1.3e-11886.33Show/hide
Query:  MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGALVGGGVTWAGAWRLNKFIRLNLSGGAAALLGLWRFSRSLNACVDHILALDGSRMQKEL
        MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGAL GGGVTWAG WRLNKFIRLNLSGGAAAL GLWRFSRSLN+CVDHILALDGSRMQKEL
Subjt:  MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGALVGGGVTWAGAWRLNKFIRLNLSGGAAALLGLWRFSRSLNACVDHILALDGSRMQKEL

Query:  ANIVVTKYHNDPRTMQHISKHFYHEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQKTHENDPKNNVHGNSHHDSSNRDPNSNQSDSYGATDDRENAVEFKP
        ANIVVTKYHNDPRTMQHISKHFY+EKVFDDSTLDRP+IRWRYRNFFSDDVAH Q+TH+ND KNN+HGNSHH SSN D NSNQ+ SY   DD+ NA+EFKP
Subjt:  ANIVVTKYHNDPRTMQHISKHFYHEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQKTHENDPKNNVHGNSHHDSSNRDPNSNQSDSYGATDDRENAVEFKP

Query:  VLTTPGTEATTDPLDCIFGSPAREEEIQHSSSSSTSPKSQSRSRRYHRRHRRHNQT
        VLT PGT+AT DPLDC+FG  A+ EEIQHS+SS+T+ KS SRSRRYHRRHRRHNQT
Subjt:  VLTTPGTEATTDPLDCIFGSPAREEEIQHSSSSSTSPKSQSRSRRYHRRHRRHNQT

XP_022138116.1 uncharacterized protein LOC111009363 isoform X2 [Momordica charantia]1.3e-11886.33Show/hide
Query:  MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGALVGGGVTWAGAWRLNKFIRLNLSGGAAALLGLWRFSRSLNACVDHILALDGSRMQKEL
        MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGAL GGGVTWAG WRLNKFIRLNLSGGAAAL GLWRFSRSLN+CVDHILALDGSRMQKEL
Subjt:  MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGALVGGGVTWAGAWRLNKFIRLNLSGGAAALLGLWRFSRSLNACVDHILALDGSRMQKEL

Query:  ANIVVTKYHNDPRTMQHISKHFYHEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQKTHENDPKNNVHGNSHHDSSNRDPNSNQSDSYGATDDRENAVEFKP
        ANIVVTKYHNDPRTMQHISKHFY+EKVFDDSTLDRP+IRWRYRNFFSDDVAH Q+TH+ND KNN+HGNSHH SSN D NSNQ+ SY   DD+ NA+EFKP
Subjt:  ANIVVTKYHNDPRTMQHISKHFYHEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQKTHENDPKNNVHGNSHHDSSNRDPNSNQSDSYGATDDRENAVEFKP

Query:  VLTTPGTEATTDPLDCIFGSPAREEEIQHSSSSSTSPKSQSRSRRYHRRHRRHNQT
        VLT PGT+AT DPLDC+FG  A+ EEIQHS+SS+T+ KS SRSRRYHRRHRRHNQT
Subjt:  VLTTPGTEATTDPLDCIFGSPAREEEIQHSSSSSTSPKSQSRSRRYHRRHRRHNQT

XP_022956077.1 uncharacterized protein LOC111457878 [Cucurbita moschata]1.6e-11683.77Show/hide
Query:  MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGALVGGGVTWAGAWRLNKFIRLNLSGGAAALLGLWRFSRSLNACVDHILALDGSRMQKEL
        MGEALFELEQVLRSKQNSLTIEEA +LQTCKSKAVRDFTFG LVGGGVTWAG WRLNKF+RLNLSGGA AL GL RFSRSL++CVDHILALDGSRMQKEL
Subjt:  MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGALVGGGVTWAGAWRLNKFIRLNLSGGAAALLGLWRFSRSLNACVDHILALDGSRMQKEL

Query:  ANIVVTKYHNDPRTMQHISKHFYHEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQKTHENDPKNNVHGNSHHDSSNRDPNSNQSDSYGATDDRENAVEFKP
        ANIVVTKYHNDPRTMQHISKHF++E+VFDDSTLDRPKIRWRYRNFFSDDVAHAQ+TH NDPK+N+HGN HHDSSNRD N NQSDSYG  DD+ NA EF P
Subjt:  ANIVVTKYHNDPRTMQHISKHFYHEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQKTHENDPKNNVHGNSHHDSSNRDPNSNQSDSYGATDDRENAVEFKP

Query:  VLTTPGTE-ATTDPLDCIFGSPAREEEIQHSSSSSTSPKSQSRSRRYHRRHRRHNQTVPTSFEHV
        VLT PG + AT DPLD IFG+  REEEIQHSS+SS SPKS  RS+RY+RRHRRHNQT+PT FEHV
Subjt:  VLTTPGTE-ATTDPLDCIFGSPAREEEIQHSSSSSTSPKSQSRSRRYHRRHRRHNQTVPTSFEHV

XP_022980008.1 uncharacterized protein LOC111479542 [Cucurbita maxima]1.2e-11482.64Show/hide
Query:  MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGALVGGGVTWAGAWRLNKFIRLNLSGGAAALLGLWRFSRSLNACVDHILALDGSRMQKEL
        MGEALFELEQVLRSKQNSLTIEEA +LQTCKSKAVRDFTFG LVGGGVTWAG WRLNKF+RLNLSGGA AL GL RFSRSL++CVDHILALDGSRMQKEL
Subjt:  MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGALVGGGVTWAGAWRLNKFIRLNLSGGAAALLGLWRFSRSLNACVDHILALDGSRMQKEL

Query:  ANIVVTKYHNDPRTMQHISKHFYHEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQKTHENDPKNNVHGNSHHDSSNRDPNSNQSDSYGATDDRENAVEFKP
        ANI+VTK HNDPRTMQHISKHF++E+VFDDSTLDRPKIRWRYRNFFSDDVAHAQ+ H NDPK+N+HGN HHDSSNRD N NQSDSYG  DD+ NA EF P
Subjt:  ANIVVTKYHNDPRTMQHISKHFYHEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQKTHENDPKNNVHGNSHHDSSNRDPNSNQSDSYGATDDRENAVEFKP

Query:  VLTTPGTE-ATTDPLDCIFGSPAREEEIQHSSSSSTSPKSQSRSRRYHRRHRRHNQTVPTSFEHV
        VLT PG + AT DPLD IFG+  REEEIQHSS+SS SPKS  RS+RY+RRHRRHNQT+PT FEHV
Subjt:  VLTTPGTE-ATTDPLDCIFGSPAREEEIQHSSSSSTSPKSQSRSRRYHRRHRRHNQTVPTSFEHV

XP_023527180.1 uncharacterized protein LOC111790494 isoform X1 [Cucurbita pepo subsp. pepo]1.2e-11683.77Show/hide
Query:  MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGALVGGGVTWAGAWRLNKFIRLNLSGGAAALLGLWRFSRSLNACVDHILALDGSRMQKEL
        MGEALFELEQVLRSKQNSLTIEEA +LQTCKSKAVRDFTFG LVGGGVTWAG WRLNKF+RLNLSGGA AL GL RFSRSL++CVDHILALDGSRMQKEL
Subjt:  MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGALVGGGVTWAGAWRLNKFIRLNLSGGAAALLGLWRFSRSLNACVDHILALDGSRMQKEL

Query:  ANIVVTKYHNDPRTMQHISKHFYHEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQKTHENDPKNNVHGNSHHDSSNRDPNSNQSDSYGATDDRENAVEFKP
        ANIVVTKYHNDPRTMQHISKHF++E+VFDDSTLDRPKIRWRYRNFFSDDVAHAQ+TH NDPK+N+HGN HHDSSNRD N NQSDSYG  DD+ NA EF P
Subjt:  ANIVVTKYHNDPRTMQHISKHFYHEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQKTHENDPKNNVHGNSHHDSSNRDPNSNQSDSYGATDDRENAVEFKP

Query:  VLTTPGTE-ATTDPLDCIFGSPAREEEIQHSSSSSTSPKSQSRSRRYHRRHRRHNQTVPTSFEHV
        VLT PG + AT DPLD IFG+  REEEIQHSS+SS SPKS  RS+RY+RRHRRHNQT+PT FEHV
Subjt:  VLTTPGTE-ATTDPLDCIFGSPAREEEIQHSSSSSTSPKSQSRSRRYHRRHRRHNQTVPTSFEHV

TrEMBL top hitse value%identityAlignment
A0A1S3AWL2 uncharacterized protein LOC103483703 isoform X13.8e-10375.85Show/hide
Query:  MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGALVGGGVTWAGAWRLNKFIRLNLSGGAAALLGLWRFSRSLNACVDHILALDGSRMQKEL
        MGE L ELE VLRSK N LTIEEA LLQTC+SKAVRDFTFG ++GGG+TWAG WRLNKF RLNLSGGAAAL G WRFSRSLN+CVD+IL+LDGSRMQKEL
Subjt:  MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGALVGGGVTWAGAWRLNKFIRLNLSGGAAALLGLWRFSRSLNACVDHILALDGSRMQKEL

Query:  ANIVVTKYHNDPRTMQHISKHFYHEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQKTHENDPKNNVHGNSHHDSSNRDPNSNQSDSYGATDDRENAVEFKP
        ANIVVT+YHNDPR MQ+ISKHF++E+VFDDST DRPKIRWRYRNFFSDDVAH+Q+TH ND  NNVH NSH DSS     ++Q DSYG +DD+ NA EFKP
Subjt:  ANIVVTKYHNDPRTMQHISKHFYHEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQKTHENDPKNNVHGNSHHDSSNRDPNSNQSDSYGATDDRENAVEFKP

Query:  VLTTPGTE-ATTDPLDCIFGSPAREEEIQHSSSSSTSPKSQSRSRRYHRRHRRHNQTVPTSFEHV
        VLT  GT+ AT DPLDCIFG+ AREEEIQHS+ S+ SPK  SRSRRY+RRHR+ NQT PT+FE+V
Subjt:  VLTTPGTE-ATTDPLDCIFGSPAREEEIQHSSSSSTSPKSQSRSRRYHRRHRRHNQTVPTSFEHV

A0A6J1C8I6 uncharacterized protein LOC111009363 isoform X16.4e-11986.33Show/hide
Query:  MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGALVGGGVTWAGAWRLNKFIRLNLSGGAAALLGLWRFSRSLNACVDHILALDGSRMQKEL
        MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGAL GGGVTWAG WRLNKFIRLNLSGGAAAL GLWRFSRSLN+CVDHILALDGSRMQKEL
Subjt:  MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGALVGGGVTWAGAWRLNKFIRLNLSGGAAALLGLWRFSRSLNACVDHILALDGSRMQKEL

Query:  ANIVVTKYHNDPRTMQHISKHFYHEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQKTHENDPKNNVHGNSHHDSSNRDPNSNQSDSYGATDDRENAVEFKP
        ANIVVTKYHNDPRTMQHISKHFY+EKVFDDSTLDRP+IRWRYRNFFSDDVAH Q+TH+ND KNN+HGNSHH SSN D NSNQ+ SY   DD+ NA+EFKP
Subjt:  ANIVVTKYHNDPRTMQHISKHFYHEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQKTHENDPKNNVHGNSHHDSSNRDPNSNQSDSYGATDDRENAVEFKP

Query:  VLTTPGTEATTDPLDCIFGSPAREEEIQHSSSSSTSPKSQSRSRRYHRRHRRHNQT
        VLT PGT+AT DPLDC+FG  A+ EEIQHS+SS+T+ KS SRSRRYHRRHRRHNQT
Subjt:  VLTTPGTEATTDPLDCIFGSPAREEEIQHSSSSSTSPKSQSRSRRYHRRHRRHNQT

A0A6J1C8T0 uncharacterized protein LOC111009363 isoform X26.4e-11986.33Show/hide
Query:  MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGALVGGGVTWAGAWRLNKFIRLNLSGGAAALLGLWRFSRSLNACVDHILALDGSRMQKEL
        MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGAL GGGVTWAG WRLNKFIRLNLSGGAAAL GLWRFSRSLN+CVDHILALDGSRMQKEL
Subjt:  MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGALVGGGVTWAGAWRLNKFIRLNLSGGAAALLGLWRFSRSLNACVDHILALDGSRMQKEL

Query:  ANIVVTKYHNDPRTMQHISKHFYHEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQKTHENDPKNNVHGNSHHDSSNRDPNSNQSDSYGATDDRENAVEFKP
        ANIVVTKYHNDPRTMQHISKHFY+EKVFDDSTLDRP+IRWRYRNFFSDDVAH Q+TH+ND KNN+HGNSHH SSN D NSNQ+ SY   DD+ NA+EFKP
Subjt:  ANIVVTKYHNDPRTMQHISKHFYHEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQKTHENDPKNNVHGNSHHDSSNRDPNSNQSDSYGATDDRENAVEFKP

Query:  VLTTPGTEATTDPLDCIFGSPAREEEIQHSSSSSTSPKSQSRSRRYHRRHRRHNQT
        VLT PGT+AT DPLDC+FG  A+ EEIQHS+SS+T+ KS SRSRRYHRRHRRHNQT
Subjt:  VLTTPGTEATTDPLDCIFGSPAREEEIQHSSSSSTSPKSQSRSRRYHRRHRRHNQT

A0A6J1GVC2 uncharacterized protein LOC1114578787.9e-11783.77Show/hide
Query:  MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGALVGGGVTWAGAWRLNKFIRLNLSGGAAALLGLWRFSRSLNACVDHILALDGSRMQKEL
        MGEALFELEQVLRSKQNSLTIEEA +LQTCKSKAVRDFTFG LVGGGVTWAG WRLNKF+RLNLSGGA AL GL RFSRSL++CVDHILALDGSRMQKEL
Subjt:  MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGALVGGGVTWAGAWRLNKFIRLNLSGGAAALLGLWRFSRSLNACVDHILALDGSRMQKEL

Query:  ANIVVTKYHNDPRTMQHISKHFYHEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQKTHENDPKNNVHGNSHHDSSNRDPNSNQSDSYGATDDRENAVEFKP
        ANIVVTKYHNDPRTMQHISKHF++E+VFDDSTLDRPKIRWRYRNFFSDDVAHAQ+TH NDPK+N+HGN HHDSSNRD N NQSDSYG  DD+ NA EF P
Subjt:  ANIVVTKYHNDPRTMQHISKHFYHEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQKTHENDPKNNVHGNSHHDSSNRDPNSNQSDSYGATDDRENAVEFKP

Query:  VLTTPGTE-ATTDPLDCIFGSPAREEEIQHSSSSSTSPKSQSRSRRYHRRHRRHNQTVPTSFEHV
        VLT PG + AT DPLD IFG+  REEEIQHSS+SS SPKS  RS+RY+RRHRRHNQT+PT FEHV
Subjt:  VLTTPGTE-ATTDPLDCIFGSPAREEEIQHSSSSSTSPKSQSRSRRYHRRHRRHNQTVPTSFEHV

A0A6J1IXZ4 uncharacterized protein LOC1114795425.6e-11582.64Show/hide
Query:  MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGALVGGGVTWAGAWRLNKFIRLNLSGGAAALLGLWRFSRSLNACVDHILALDGSRMQKEL
        MGEALFELEQVLRSKQNSLTIEEA +LQTCKSKAVRDFTFG LVGGGVTWAG WRLNKF+RLNLSGGA AL GL RFSRSL++CVDHILALDGSRMQKEL
Subjt:  MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGALVGGGVTWAGAWRLNKFIRLNLSGGAAALLGLWRFSRSLNACVDHILALDGSRMQKEL

Query:  ANIVVTKYHNDPRTMQHISKHFYHEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQKTHENDPKNNVHGNSHHDSSNRDPNSNQSDSYGATDDRENAVEFKP
        ANI+VTK HNDPRTMQHISKHF++E+VFDDSTLDRPKIRWRYRNFFSDDVAHAQ+ H NDPK+N+HGN HHDSSNRD N NQSDSYG  DD+ NA EF P
Subjt:  ANIVVTKYHNDPRTMQHISKHFYHEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQKTHENDPKNNVHGNSHHDSSNRDPNSNQSDSYGATDDRENAVEFKP

Query:  VLTTPGTE-ATTDPLDCIFGSPAREEEIQHSSSSSTSPKSQSRSRRYHRRHRRHNQTVPTSFEHV
        VLT PG + AT DPLD IFG+  REEEIQHSS+SS SPKS  RS+RY+RRHRRHNQT+PT FEHV
Subjt:  VLTTPGTE-ATTDPLDCIFGSPAREEEIQHSSSSSTSPKSQSRSRRYHRRHRRHNQTVPTSFEHV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05430.1 unknown protein1.1e-1732.35Show/hide
Query:  ALFELEQVLRSK--QNSLTIEEATLLQTCKSKAVRDFTFGALVGGGVTWAGAWRLNK---FIRLNLSGGAAA--LLGLWRFSRSLNA--CVDHILALDGS
        AL +L  VL SK  Q  +T EE+  + +C  KA+    F + VGGG+TW    +L K     R+ L+ G AA   +  W +S S  A   +DHIL+ D +
Subjt:  ALFELEQVLRSK--QNSLTIEEATLLQTCKSKAVRDFTFGALVGGGVTWAGAWRLNK---FIRLNLSGGAAA--LLGLWRFSRSLNA--CVDHILALDGS

Query:  RMQKELANIVVTKYHNDPRTMQHISKHFYHEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQKTHENDPKNNVHGNSHHDSSNRDPNSNQSDSYGATDDREN
        RMQKEL N++V     +    Q +SKHFY E V+ D   D+P++RWR R  F++  +              + + +   S R+PN   + S+       +
Subjt:  RMQKELANIVVTKYHNDPRTMQHISKHFYHEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQKTHENDPKNNVHGNSHHDSSNRDPNSNQSDSYGATDDREN

Query:  AVEFKPVL-----TTPGTEATTDPLDCIFGSPAREEEIQHSSSSSTSPKSQSR-SRRYHRRHRRHNQTVPTS
        A + K  L      + G  A  D LD +FG     E I     S  + K+Q+R  +R  RR R  N+   T+
Subjt:  AVEFKPVL-----TTPGTEATTDPLDCIFGSPAREEEIQHSSSSSTSPKSQSR-SRRYHRRHRRHNQTVPTS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGAAGCTTTATTTGAACTTGAACAAGTTCTCAGGTCCAAACAGAACAGCTTGACGATTGAGGAAGCGACTTTGCTCCAAACATGTAAGTCTAAGGCTGTACGAGA
TTTTACATTTGGAGCTCTCGTTGGAGGTGGTGTGACATGGGCTGGAGCATGGAGGCTAAATAAATTCATCCGGTTAAATCTTTCTGGAGGAGCTGCTGCGCTACTTGGAT
TATGGAGATTTAGCAGGTCCCTCAATGCATGCGTCGATCATATTCTTGCACTGGATGGAAGTAGAATGCAAAAGGAGTTGGCAAATATTGTGGTGACGAAATATCACAAT
GATCCTCGCACAATGCAGCACATATCCAAGCATTTTTATCATGAGAAAGTATTTGATGATTCAACCTTGGATCGTCCAAAAATAAGGTGGCGTTATCGAAATTTCTTTAG
TGATGATGTTGCTCATGCTCAGAAGACACATGAAAATGACCCTAAGAACAACGTGCATGGAAACTCTCACCATGATTCATCCAACCGCGACCCCAATTCCAACCAGAGTG
ACTCCTATGGTGCGACTGATGACAGAGAAAATGCAGTTGAGTTCAAGCCAGTCCTTACTACGCCAGGAACCGAGGCTACCACAGACCCTCTGGATTGTATTTTTGGTTCA
CCAGCAAGAGAAGAAGAAATCCAACACTCGAGTTCCTCTAGCACATCACCCAAATCTCAGTCTCGTAGTAGAAGATACCACCGCCGGCATCGAAGACATAACCAGACAGT
GCCAACAAGCTTTGAACATGTGTAA
mRNA sequenceShow/hide mRNA sequence
CTCCGTTTAATCGTCGTGACTGGTGATCGGAGCTCCGCCATGGGTGAAGCTTTATTTGAACTTGAACAAGTTCTCAGGTCCAAACAGAACAGCTTGACGATTGAGGAAGC
GACTTTGCTCCAAACATGTAAGTCTAAGGCTGTACGAGATTTTACATTTGGAGCTCTCGTTGGAGGTGGTGTGACATGGGCTGGAGCATGGAGGCTAAATAAATTCATCC
GGTTAAATCTTTCTGGAGGAGCTGCTGCGCTACTTGGATTATGGAGATTTAGCAGGTCCCTCAATGCATGCGTCGATCATATTCTTGCACTGGATGGAAGTAGAATGCAA
AAGGAGTTGGCAAATATTGTGGTGACGAAATATCACAATGATCCTCGCACAATGCAGCACATATCCAAGCATTTTTATCATGAGAAAGTATTTGATGATTCAACCTTGGA
TCGTCCAAAAATAAGGTGGCGTTATCGAAATTTCTTTAGTGATGATGTTGCTCATGCTCAGAAGACACATGAAAATGACCCTAAGAACAACGTGCATGGAAACTCTCACC
ATGATTCATCCAACCGCGACCCCAATTCCAACCAGAGTGACTCCTATGGTGCGACTGATGACAGAGAAAATGCAGTTGAGTTCAAGCCAGTCCTTACTACGCCAGGAACC
GAGGCTACCACAGACCCTCTGGATTGTATTTTTGGTTCACCAGCAAGAGAAGAAGAAATCCAACACTCGAGTTCCTCTAGCACATCACCCAAATCTCAGTCTCGTAGTAG
AAGATACCACCGCCGGCATCGAAGACATAACCAGACAGTGCCAACAAGCTTTGAACATGTGTAATTCCAGGATATACAAAGAAGCCATAGCCATCCATAGTGCATGGTGT
AAGGGAGCACTTACAGAGACGATGGTCATTAGACAATAGCGTGGTGGTTGTTGCGATGGGGAGAGCGATATTTCAGAGAACGCGGTAGTGGTTAAAGGTTGGAAACTCCA
ATCTCTTTTCCAGTTACTCAATTTCCTAACATTTTCAGGAAGATTTGTGGGGGCAAAATCTTTGTTTTCACAGATCGTGTGGCGTGTCGAAAGTCTTACACAAGCATAAG
ACCACTAGGAATACGCGTCAAGAGTGGTAAATGCCCTTTTGTCTGAGATTGATTCTTGACTTTTAATCCTATAATAACTAACAAGAAACTTCATGCTTTGTCTTTTTTAC
ATTAATATGAGATTCTCCCCCTCTTTTTTAATGGTTGTGATGGAAAGATGGAAACATTTACTTAGACTAGCGGTCTCTTACATTCAACGCCAG
Protein sequenceShow/hide protein sequence
MGEALFELEQVLRSKQNSLTIEEATLLQTCKSKAVRDFTFGALVGGGVTWAGAWRLNKFIRLNLSGGAAALLGLWRFSRSLNACVDHILALDGSRMQKELANIVVTKYHN
DPRTMQHISKHFYHEKVFDDSTLDRPKIRWRYRNFFSDDVAHAQKTHENDPKNNVHGNSHHDSSNRDPNSNQSDSYGATDDRENAVEFKPVLTTPGTEATTDPLDCIFGS
PAREEEIQHSSSSSTSPKSQSRSRRYHRRHRRHNQTVPTSFEHV