; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016163 (gene) of Snake gourd v1 genome

Gene IDTan0016163
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF1997)
Genome locationLG02:94923214..94927879
RNA-Seq ExpressionTan0016163
SyntenyTan0016163
Gene Ontology termsNA
InterPro domainsIPR018971 - Protein of unknown function DUF1997


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0064985.1 DUF1997 family protein [Cucumis melo var. makuwa]4.2e-11185.6Show/hide
Query:  MVVLTTKWCGQGKGSFPLLANQEFSSPRKKFEVIKLSKATNSETNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLL
        MV+LTTKW GQG GSFPLL   +FSSPRKKFE IK SKATNSETNTK+ANLSV R+EKI+LPSYS  G  RTYHI +FLNHPSGIEAMLNKNAL+SFQLL
Subjt:  MVVLTTKWCGQGKGSFPLLANQEFSSPRKKFEVIKLSKATNSETNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLL

Query:  DANTYRCTLPKLKLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAM
        DANTYRCTLPKL+LLNFEAAPTLDLR+IPTDEDFTVEMLSCKFEGSELVE QND FSA+MINHLTW+T+DSNS+LEVDVKL LSLEIYTLPFTLMPTAA+
Subjt:  DANTYRCTLPKLKLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAM

Query:  ENPGNLMLQALLDNLVPLLLQQLVQDYEKWIRQQQDHSHVSIS
        ENPGNLMLQALLDNLVPLLL+QLVQDYEKWI QQ DHS +SIS
Subjt:  ENPGNLMLQALLDNLVPLLLQQLVQDYEKWIRQQQDHSHVSIS

XP_022131393.1 uncharacterized protein LOC111004622 isoform X1 [Momordica charantia]1.0e-11286.83Show/hide
Query:  MVVLTTKWCGQGKGSFPLLANQEFSSPRKKFEVIKLSKATNSETNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLL
        MVVLT+KWCG+G+  FPLLA Q+ SSPRKKFEVIKLSKATNSETNTKRANL V RKEKIKLPSYSD  G RTYHIS+FL HPSGIEAMLNKNAL+SFQLL
Subjt:  MVVLTTKWCGQGKGSFPLLANQEFSSPRKKFEVIKLSKATNSETNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLL

Query:  DANTYRCTLPKLKLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAM
        DANTYRCTLP L+LLNFEAAPTLDLRVIPTD+DFTVEMLSCKFEGSELVE QND FSALMINHLTW+TVDSNSFLEVDVKL LSLEIYTLPFTLMPTAA+
Subjt:  DANTYRCTLPKLKLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAM

Query:  ENPGNLMLQALLDNLVPLLLQQLVQDYEKWIRQQQDHSHVSIS
        ENPGNLMLQAL+D+LVPLLL+Q+VQDYEKWIRQQ+DHS +S S
Subjt:  ENPGNLMLQALLDNLVPLLLQQLVQDYEKWIRQQQDHSHVSIS

XP_022929609.1 uncharacterized protein LOC111436145 [Cucurbita moschata]7.2e-11187.3Show/hide
Query:  MVVLTTKWCGQGKGSFPLLANQEFSSPRKKFEVIKLSKATNSETNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLL
        MVVL TKW GQGKGSFPLLA  +FSSPRKKFE IK+SKATNSETNTKRANLSV RKEKIKLPSYS  GG RTYHI +FLNHPSGIEAM+NKNAL+SFQ L
Subjt:  MVVLTTKWCGQGKGSFPLLANQEFSSPRKKFEVIKLSKATNSETNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLL

Query:  DANTYRCTLPKLKLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAM
        DANTYRCTL KL+LLNFEAAPTLDLRVIPT+EDFTVEMLSCKFEGSELVE QN+ FSALMINHLTW++V SNS+LEVDVKL LSLEIYTLPFTLMPTAA+
Subjt:  DANTYRCTLPKLKLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAM

Query:  ENPGNLMLQALLDNLVPLLLQQLVQDYEKWIRQQQ-DHSHVSIS
        ENPGNLMLQALLDNLVPLLL+QLVQDYEKWI QQQ DHSHVSIS
Subjt:  ENPGNLMLQALLDNLVPLLLQQLVQDYEKWIRQQQ-DHSHVSIS

XP_022997460.1 uncharacterized protein LOC111492370 [Cucurbita maxima]2.5e-11187.7Show/hide
Query:  MVVLTTKWCGQGKGSFPLLANQEFSSPRKKFEVIKLSKATNSETNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLL
        MVVL TKW GQGKGSFPLLA  +FSSPRKKFE IKLSKATNSETNTKRANLSV RKEKIKLPSYS  GG RTYHI +FLNHPSGIEAM+NKNAL+SFQ L
Subjt:  MVVLTTKWCGQGKGSFPLLANQEFSSPRKKFEVIKLSKATNSETNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLL

Query:  DANTYRCTLPKLKLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAM
        DANTYRCTL KL+LLNFEAAPTLDLRVIPT+EDFTVEMLSCKFEGSELVE QN+ FSALMINHLTW++V SNS+LEVDVKL LSLEIYTLPFTLMPTAA+
Subjt:  DANTYRCTLPKLKLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAM

Query:  ENPGNLMLQALLDNLVPLLLQQLVQDYEKWIRQQQ-DHSHVSIS
        ENPGNLMLQALLDNLVPLLL+QLVQDYEKWI QQQ DHSHVSIS
Subjt:  ENPGNLMLQALLDNLVPLLLQQLVQDYEKWIRQQQ-DHSHVSIS

XP_038885685.1 uncharacterized protein LOC120075989 isoform X1 [Benincasa hispida]4.5e-11388.07Show/hide
Query:  MVVLTTKWCGQGKGSFPLLANQEFSSPRKKFEVIKLSKATNSETNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLL
        MVVLTTKW GQGKGSFPLLA  +FSSPRKKFEVIKLSKATNSETNTKRANLSV R+EKI+LPSYS   G R YHI +FLNHPSGIEAMLNKNAL SFQLL
Subjt:  MVVLTTKWCGQGKGSFPLLANQEFSSPRKKFEVIKLSKATNSETNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLL

Query:  DANTYRCTLPKLKLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAM
        DANTYRCTLPKL+LLNFEAAPTLDLRVIPTD+DFTVEMLSCKFEGSELVE QN+ FSALMINHLTW+TVDSNS+LEVDVKL LSLEIYTLPFTLMPTAA+
Subjt:  DANTYRCTLPKLKLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAM

Query:  ENPGNLMLQALLDNLVPLLLQQLVQDYEKWIRQQQDHSHVSIS
        ENPGNLMLQALLDNLVPLLL+QLVQDYEKWI QQ DHS +SIS
Subjt:  ENPGNLMLQALLDNLVPLLLQQLVQDYEKWIRQQQDHSHVSIS

TrEMBL top hitse value%identityAlignment
A0A1S3BBW5 uncharacterized protein LOC1034882461.0e-11085.19Show/hide
Query:  MVVLTTKWCGQGKGSFPLLANQEFSSPRKKFEVIKLSKATNSETNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLL
        MV+LTTKW GQG GSFPLL   +FSSPRKKFE IK SKATNSETNTK+ANLSV R+EKI+LP YS  G  RTYHI +FLNHPSGIEAMLNKNAL+SFQLL
Subjt:  MVVLTTKWCGQGKGSFPLLANQEFSSPRKKFEVIKLSKATNSETNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLL

Query:  DANTYRCTLPKLKLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAM
        DANTYRCTLPKL+LLNFEAAPTLDLR+IPTDEDFTVEMLSCKFEGSELVE QND FSA+MINHLTW+T+DSNS+LEVDVKL LSLEIYTLPFTLMPTAA+
Subjt:  DANTYRCTLPKLKLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAM

Query:  ENPGNLMLQALLDNLVPLLLQQLVQDYEKWIRQQQDHSHVSIS
        ENPGNLMLQALLDNLVPLLL+QLVQDYEKWI QQ DHS +SIS
Subjt:  ENPGNLMLQALLDNLVPLLLQQLVQDYEKWIRQQQDHSHVSIS

A0A5A7VGG9 DUF1997 family protein2.0e-11185.6Show/hide
Query:  MVVLTTKWCGQGKGSFPLLANQEFSSPRKKFEVIKLSKATNSETNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLL
        MV+LTTKW GQG GSFPLL   +FSSPRKKFE IK SKATNSETNTK+ANLSV R+EKI+LPSYS  G  RTYHI +FLNHPSGIEAMLNKNAL+SFQLL
Subjt:  MVVLTTKWCGQGKGSFPLLANQEFSSPRKKFEVIKLSKATNSETNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLL

Query:  DANTYRCTLPKLKLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAM
        DANTYRCTLPKL+LLNFEAAPTLDLR+IPTDEDFTVEMLSCKFEGSELVE QND FSA+MINHLTW+T+DSNS+LEVDVKL LSLEIYTLPFTLMPTAA+
Subjt:  DANTYRCTLPKLKLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAM

Query:  ENPGNLMLQALLDNLVPLLLQQLVQDYEKWIRQQQDHSHVSIS
        ENPGNLMLQALLDNLVPLLL+QLVQDYEKWI QQ DHS +SIS
Subjt:  ENPGNLMLQALLDNLVPLLLQQLVQDYEKWIRQQQDHSHVSIS

A0A6J1BT83 uncharacterized protein LOC111004622 isoform X14.8e-11386.83Show/hide
Query:  MVVLTTKWCGQGKGSFPLLANQEFSSPRKKFEVIKLSKATNSETNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLL
        MVVLT+KWCG+G+  FPLLA Q+ SSPRKKFEVIKLSKATNSETNTKRANL V RKEKIKLPSYSD  G RTYHIS+FL HPSGIEAMLNKNAL+SFQLL
Subjt:  MVVLTTKWCGQGKGSFPLLANQEFSSPRKKFEVIKLSKATNSETNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLL

Query:  DANTYRCTLPKLKLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAM
        DANTYRCTLP L+LLNFEAAPTLDLRVIPTD+DFTVEMLSCKFEGSELVE QND FSALMINHLTW+TVDSNSFLEVDVKL LSLEIYTLPFTLMPTAA+
Subjt:  DANTYRCTLPKLKLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAM

Query:  ENPGNLMLQALLDNLVPLLLQQLVQDYEKWIRQQQDHSHVSIS
        ENPGNLMLQAL+D+LVPLLL+Q+VQDYEKWIRQQ+DHS +S S
Subjt:  ENPGNLMLQALLDNLVPLLLQQLVQDYEKWIRQQQDHSHVSIS

A0A6J1EP97 uncharacterized protein LOC1114361453.5e-11187.3Show/hide
Query:  MVVLTTKWCGQGKGSFPLLANQEFSSPRKKFEVIKLSKATNSETNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLL
        MVVL TKW GQGKGSFPLLA  +FSSPRKKFE IK+SKATNSETNTKRANLSV RKEKIKLPSYS  GG RTYHI +FLNHPSGIEAM+NKNAL+SFQ L
Subjt:  MVVLTTKWCGQGKGSFPLLANQEFSSPRKKFEVIKLSKATNSETNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLL

Query:  DANTYRCTLPKLKLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAM
        DANTYRCTL KL+LLNFEAAPTLDLRVIPT+EDFTVEMLSCKFEGSELVE QN+ FSALMINHLTW++V SNS+LEVDVKL LSLEIYTLPFTLMPTAA+
Subjt:  DANTYRCTLPKLKLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAM

Query:  ENPGNLMLQALLDNLVPLLLQQLVQDYEKWIRQQQ-DHSHVSIS
        ENPGNLMLQALLDNLVPLLL+QLVQDYEKWI QQQ DHSHVSIS
Subjt:  ENPGNLMLQALLDNLVPLLLQQLVQDYEKWIRQQQ-DHSHVSIS

A0A6J1K9Q3 uncharacterized protein LOC1114923701.2e-11187.7Show/hide
Query:  MVVLTTKWCGQGKGSFPLLANQEFSSPRKKFEVIKLSKATNSETNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLL
        MVVL TKW GQGKGSFPLLA  +FSSPRKKFE IKLSKATNSETNTKRANLSV RKEKIKLPSYS  GG RTYHI +FLNHPSGIEAM+NKNAL+SFQ L
Subjt:  MVVLTTKWCGQGKGSFPLLANQEFSSPRKKFEVIKLSKATNSETNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLL

Query:  DANTYRCTLPKLKLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAM
        DANTYRCTL KL+LLNFEAAPTLDLRVIPT+EDFTVEMLSCKFEGSELVE QN+ FSALMINHLTW++V SNS+LEVDVKL LSLEIYTLPFTLMPTAA+
Subjt:  DANTYRCTLPKLKLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAM

Query:  ENPGNLMLQALLDNLVPLLLQQLVQDYEKWIRQQQ-DHSHVSIS
        ENPGNLMLQALLDNLVPLLL+QLVQDYEKWI QQQ DHSHVSIS
Subjt:  ENPGNLMLQALLDNLVPLLLQQLVQDYEKWIRQQQ-DHSHVSIS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G31115.1 Protein of unknown function (DUF1997)4.1e-5648.92Show/hide
Query:  GKGSFPLLANQEFSSPRKKFEV--IKLSKATNSETNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLLD--ANTYRC
        G     LL +   + PR++  +  +   K     ++ K+AN+S +RK++IKL    +  G +    S+FL HPSG+EA++N  AL+S+ L+D   +TYRC
Subjt:  GKGSFPLLANQEFSSPRKKFEV--IKLSKATNSETNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLLD--ANTYRC

Query:  TLPKLKLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAMENPGNLM
        TLPK++L++FE  P L LRV PT ED TVE+LSCK EGSEL+E+Q++RFSA+M N +TW       FLEVDV+L ++LEI T PFT++P +A+E PGNL+
Subjt:  TLPKLKLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAMENPGNLM

Query:  LQALLDNLVPLLLQQLVQDYEKWIRQQQDHS
        +Q L+D LVPLLLQQL++DY++WI++QQ +S
Subjt:  LQALLDNLVPLLLQQLVQDYEKWIRQQQDHS

AT4G31115.2 Protein of unknown function (DUF1997)5.4e-5654.31Show/hide
Query:  TNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLLD--ANTYRCTLPKLKLLNFEAAPTLDLRVIPTDEDFTVEMLSC
        ++ K+AN+S +RK++IKL    +  G +    S+FL HPSG+EA++N  AL+S+ L+D   +TYRCTLPK++L++FE  P L LRV PT ED TVE+LSC
Subjt:  TNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLLD--ANTYRCTLPKLKLLNFEAAPTLDLRVIPTDEDFTVEMLSC

Query:  KFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAMENPGNLMLQALLDNLVPLLLQQLVQDYEKWIRQQQDHS
        K EGSEL+E+Q++RFSA+M N +TW       FLEVDV+L ++LEI T PFT++P +A+E PGNL++Q L+D LVPLLLQQL++DY++WI++QQ +S
Subjt:  KFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAMENPGNLMLQALLDNLVPLLLQQLVQDYEKWIRQQQDHS

AT5G04440.1 Protein of unknown function (DUF1997)1.1e-1628.64Show/hide
Query:  SFPLLANQEFSSPRKKFEVIKLSKATNSETNTKRANLSVARKEK-IKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLLDANTYRCTLPKLK
        SF + ++  F    K       S AT+S T+  R + S   K + I     S         + ++++ P+   ++L+   +E    +D NT+RC +   K
Subjt:  SFPLLANQEFSSPRKKFEVIKLSKATNSETNTKRANLSVARKEK-IKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLLDANTYRCTLPKLK

Query:  LLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSF---LEVDVKLKLSLEIYTLPFTLMPTAAMENPGNLMLQA
          NFE  P L +RV        +++LSCK EGS +V +QND+F A M+N ++ ++    S    +  D  +++++EI    F + P  A+E  G  +L  
Subjt:  LLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSF---LEVDVKLKLSLEIYTLPFTLMPTAAMENPGNLMLQA

Query:  LLDNLVPLLLQQLVQDYEKW
        +L  ++P  L QL +DY  W
Subjt:  LLDNLVPLLLQQLVQDYEKW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGTTTTAACTACTAAATGGTGTGGGCAAGGAAAAGGTTCGTTTCCATTGCTTGCTAACCAGGAGTTTAGTTCACCAAGGAAGAAATTTGAAGTGATCAAGCTATC
TAAGGCCACCAATTCTGAGACTAATACCAAGAGGGCAAACTTATCTGTCGCAAGGAAGGAAAAGATCAAATTGCCGAGTTACAGCGACTGTGGCGGAAGCAGGACATATC
ATATCAGCAAATTCTTAAATCACCCTTCAGGAATTGAAGCAATGCTGAACAAAAATGCCTTGGAAAGTTTCCAGTTACTTGATGCTAACACATACAGGTGCACTCTGCCA
AAATTGAAACTTTTGAACTTTGAAGCTGCCCCTACACTGGATTTACGAGTGATCCCGACAGACGAAGATTTTACCGTTGAGATGCTTTCATGCAAGTTTGAAGGTTCAGA
ATTGGTGGAAAGCCAAAACGACCGTTTTTCGGCTTTGATGATTAATCACTTAACATGGGAAACAGTTGATTCAAATTCGTTTCTAGAAGTTGATGTGAAGTTGAAACTGT
CTCTGGAGATTTATACACTTCCCTTCACCCTGATGCCTACAGCTGCTATGGAGAATCCAGGAAATTTGATGCTACAAGCTCTCTTAGACAACCTTGTACCTCTGCTGCTA
CAGCAATTAGTGCAAGATTATGAAAAGTGGATACGTCAGCAGCAAGATCATTCCCATGTTTCCATCTCTTGA
mRNA sequenceShow/hide mRNA sequence
CAAAAAAATTGAAAAGAAAAGAAAAAAAAGGTGATGTACTCTTGTCACCAATATTGCCCCAACCATCTAATTGTAAAATAAGGGATTCCACTTAAGAACAAGTACCACGT
GTATCCCATAATTCAGTGCATCGTCTGGATATGGAGTCCCTGAATTTGCTTTTTTGTTGGCATTCAGAATTCCGGTCGCCGCTCTCCGCCGCCCAGGTGAGATTCCGGGC
ATAATTGATACTTGGAGCACAAAATTCCACTCGAAGAGAGGATGGTGGTTTTAACTACTAAATGGTGTGGGCAAGGAAAAGGTTCGTTTCCATTGCTTGCTAACCAGGAG
TTTAGTTCACCAAGGAAGAAATTTGAAGTGATCAAGCTATCTAAGGCCACCAATTCTGAGACTAATACCAAGAGGGCAAACTTATCTGTCGCAAGGAAGGAAAAGATCAA
ATTGCCGAGTTACAGCGACTGTGGCGGAAGCAGGACATATCATATCAGCAAATTCTTAAATCACCCTTCAGGAATTGAAGCAATGCTGAACAAAAATGCCTTGGAAAGTT
TCCAGTTACTTGATGCTAACACATACAGGTGCACTCTGCCAAAATTGAAACTTTTGAACTTTGAAGCTGCCCCTACACTGGATTTACGAGTGATCCCGACAGACGAAGAT
TTTACCGTTGAGATGCTTTCATGCAAGTTTGAAGGTTCAGAATTGGTGGAAAGCCAAAACGACCGTTTTTCGGCTTTGATGATTAATCACTTAACATGGGAAACAGTTGA
TTCAAATTCGTTTCTAGAAGTTGATGTGAAGTTGAAACTGTCTCTGGAGATTTATACACTTCCCTTCACCCTGATGCCTACAGCTGCTATGGAGAATCCAGGAAATTTGA
TGCTACAAGCTCTCTTAGACAACCTTGTACCTCTGCTGCTACAGCAATTAGTGCAAGATTATGAAAAGTGGATACGTCAGCAGCAAGATCATTCCCATGTTTCCATCTCT
TGATGCTACTGATGTTAAAAATAGCCTTCCATTTTCAAGCTACCAGAGCTGAATGCAAAAGGTTTGAGCAGATTCTGCCATTACATTGCATTTAGATCAAGAGAAGCTGA
CCTTTTTTAATTTTATTTTAAGGGAGAGTGTATTTTACATTGAACATTGTTTACTAAGCTTGGGCTCCATCTCCCATGCCTAAGAAAGGGAAATAGTTAGTTCGGGGCAC
AAGATAGTGAAAAGTTGAGGATCTTGTTTGATTTGTGATTCGAAAACAAAGTGTTTAATGGAAAATAGAGTTTGTATTTA
Protein sequenceShow/hide protein sequence
MVVLTTKWCGQGKGSFPLLANQEFSSPRKKFEVIKLSKATNSETNTKRANLSVARKEKIKLPSYSDCGGSRTYHISKFLNHPSGIEAMLNKNALESFQLLDANTYRCTLP
KLKLLNFEAAPTLDLRVIPTDEDFTVEMLSCKFEGSELVESQNDRFSALMINHLTWETVDSNSFLEVDVKLKLSLEIYTLPFTLMPTAAMENPGNLMLQALLDNLVPLLL
QQLVQDYEKWIRQQQDHSHVSIS