; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0010740 (gene) of Snake gourd v1 genome

Gene IDTan0010740
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG05:81124484..81125770
RNA-Seq ExpressionTan0010740
SyntenyTan0010740
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019402.1 hypothetical protein SDJN02_18363, partial [Cucurbita argyrosperma subsp. argyrosperma]3.7e-7375.73Show/hide
Query:  MKNLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCY
        MK LYRRRGTVHPSPPIISDHLSFLP AILTLAAALSPEDRE+LAYLISS       VNN   HRGKA  QKP+      AAK GSDHPP FSC CFRCY
Subjt:  MKNLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCY

Query:  TSYWVRWDSSPNRQLIHEIIDAYEEKLAETKTGKKNKKERKKRNSGSGSGS----SEGKGSEPRSNEEESRVTEMEAAGGGGEEEAEKGSVRKIVNFLGE
        TSYWVRWDSSPNRQ+IHEIIDAYEE LAE+K GK NKKERKKRN+GSGSGS     +GKGSE     EESRVTEMEAA  GGE EAEKGSVR IV+++GE
Subjt:  TSYWVRWDSSPNRQLIHEIIDAYEEKLAETKTGKKNKKERKKRNSGSGSGS----SEGKGSEPRSNEEESRVTEMEAAGGGGEEEAEKGSVRKIVNFLGE

Query:  RIWGGW
        +IWGGW
Subjt:  RIWGGW

XP_022927395.1 uncharacterized protein LOC111434229 [Cucurbita moschata]7.5e-7476.21Show/hide
Query:  MKNLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCY
        MK LYRRRGTVHPSPPIISDHLSFLP AILTLAAALSPEDRE+LAYLISS       VNN  GHRGKA  QKP+      AAK GSDHPP FSC CFRCY
Subjt:  MKNLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCY

Query:  TSYWVRWDSSPNRQLIHEIIDAYEEKLAETKTGKKNKKERKKRNSGSGSGS----SEGKGSEPRSNEEESRVTEMEAAGGGGEEEAEKGSVRKIVNFLGE
        TSYWVRWDSSPNRQ+IHEIIDAYEE LAE+K GK NKKERKKRN+GSGSGS     +GKGSE     EESRVTEMEAA  GGE EAEKGSVR IV+++GE
Subjt:  TSYWVRWDSSPNRQLIHEIIDAYEEKLAETKTGKKNKKERKKRNSGSGSGS----SEGKGSEPRSNEEESRVTEMEAAGGGGEEEAEKGSVRKIVNFLGE

Query:  RIWGGW
        +IWGGW
Subjt:  RIWGGW

XP_023001610.1 uncharacterized protein LOC111495687 [Cucurbita maxima]4.0e-7576.44Show/hide
Query:  MKNLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCY
        M  LYRRRGTVHPSPPIISDHLSFLP AILTLAAALSPEDRE+LAYLISS       VNN  GHRGKA QQKP+      AAK GSDHPPAFSC CFRCY
Subjt:  MKNLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCY

Query:  TSYWVRWDSSPNRQLIHEIIDAYEEKLAETKTGKKNKKERKKRNSGSGSGS----SEGKGSEPRSNEEESRVTEMEAA--GGGGEEEAEKGSVRKIVNFL
        TSYWVRWDSSPNRQ+IHEIIDAYEE LAE+K GK NKKERKKRN+GSGSGS     +GKGSE     EESRVTEMEAA  GGGGE EAEKG+VR IV ++
Subjt:  TSYWVRWDSSPNRQLIHEIIDAYEEKLAETKTGKKNKKERKKRNSGSGSGS----SEGKGSEPRSNEEESRVTEMEAA--GGGGEEEAEKGSVRKIVNFL

Query:  GERIWGGW
        GE+IWGGW
Subjt:  GERIWGGW

XP_023520004.1 uncharacterized protein LOC111783314 [Cucurbita pepo subsp. pepo]2.0e-7475.48Show/hide
Query:  MKNLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCY
        MK LYRRRGTVHPSPPIISDHLSFLP AILTLAAALSPEDRE+LAYLISS       VNN  GHRGKA  QKP+      AA+ GSDHPP FSC CFRCY
Subjt:  MKNLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCY

Query:  TSYWVRWDSSPNRQLIHEIIDAYEEKLAETKTGKKNKKERKKRNSGSGSGS----SEGKGSEPRSNEEESRVTEMEAA--GGGGEEEAEKGSVRKIVNFL
        TSYWVRWDSSPNRQ+IHEIIDAYEE LAE+K GK +KKERKKRN+GSGSGS     +GKGSE     EESRVTEMEAA  GGGGE EAEKGSVR IV+++
Subjt:  TSYWVRWDSSPNRQLIHEIIDAYEEKLAETKTGKKNKKERKKRNSGSGSGS----SEGKGSEPRSNEEESRVTEMEAA--GGGGEEEAEKGSVRKIVNFL

Query:  GERIWGGW
        GE+IWGGW
Subjt:  GERIWGGW

XP_038894832.1 uncharacterized protein LOC120083238 [Benincasa hispida]4.0e-7579.21Show/hide
Query:  MKNLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCY
        MK LYR+RGTVHPSPPIISDHLSFLP AILTLAAALS EDREVLAYLISS      AVNN   HRGKA  QKP A     AAKGGSDHPPAFSC CFRCY
Subjt:  MKNLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCY

Query:  TSYWVRWDSSPNRQLIHEIIDAYEEKLAETKTGKKNKKERKKRNSGSGSGSSEGKGSEPRSNEEESRVTEMEAAGGGGEEEAEKGSVRKIVNFLGERIWG
        TSYWVRWDSSPNRQLIHEIIDAYEEKLAE+K GK NKKERKKRNSG  SG  EGK SEP + EEE RVTE EAA  GGEEE EKGSVR+IV+F+GERIWG
Subjt:  TSYWVRWDSSPNRQLIHEIIDAYEEKLAETKTGKKNKKERKKRNSGSGSGSSEGKGSEPRSNEEESRVTEMEAAGGGGEEEAEKGSVRKIVNFLGERIWG

Query:  GW
         W
Subjt:  GW

TrEMBL top hitse value%identityAlignment
A0A0A0LVV3 Uncharacterized protein8.9e-6572.41Show/hide
Query:  MKNLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCY
        MK LYR+RGTVHPSP IISDHLSFLP  ILTLAAALS  DREVLAYLISS      AV N   HRGKA  QK +      AA GG DHPPAFSC CF+CY
Subjt:  MKNLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCY

Query:  TSYWVRWDSSPNRQLIHEIIDAYEEKLAETKTGKKNKKERKKRNS-GSGSGSSEGKGSEPRSNEEESRVTEMEAAGGGGEEEAEKGSVRKIVNFLGERIW
        TSYWVRWDSSPNRQLIHEIIDAYEEKLAE+K GK NKKERKKRN+ G  SG  EGKGSE  + EEE RVTE E A  GGEE AEKG VR+IV+ LGE+IW
Subjt:  TSYWVRWDSSPNRQLIHEIIDAYEEKLAETKTGKKNKKERKKRNS-GSGSGSSEGKGSEPRSNEEESRVTEMEAAGGGGEEEAEKGSVRKIVNFLGERIW

Query:  GGW
        G W
Subjt:  GGW

A0A6J1EBG8 uncharacterized protein LOC1114326281.1e-6570.85Show/hide
Query:  MKNLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSAVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCYTSYWVR
        MK  YRRRGTVHPSPP ISDHLSFLPAAILTLAAAL+ EDR+VLAYLISSA  N  GHRGK  QQKP+   T+P+AK GSDH PAFSCGCFRCYT YWVR
Subjt:  MKNLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSAVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCYTSYWVR

Query:  WDSSPNRQLIHEIIDAYEEKLAETKTGKKNKKERKKRNSGSGSGSSEGKGSEPRSNEEESRVTEMEAA---GGGGEEEAEKGSVRKIVNFLGERIWGGW
        WDSSPNR++IHEII+AYEEKLAET+ GK+N+KERKKRN G             R  EE+ R+TE E A   GGGGEE AEKG+VRKIV F+GE IWGG+
Subjt:  WDSSPNRQLIHEIIDAYEEKLAETKTGKKNKKERKKRNSGSGSGSSEGKGSEPRSNEEESRVTEMEAA---GGGGEEEAEKGSVRKIVNFLGERIWGGW

A0A6J1ENT0 uncharacterized protein LOC1114342293.6e-7476.21Show/hide
Query:  MKNLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCY
        MK LYRRRGTVHPSPPIISDHLSFLP AILTLAAALSPEDRE+LAYLISS       VNN  GHRGKA  QKP+      AAK GSDHPP FSC CFRCY
Subjt:  MKNLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCY

Query:  TSYWVRWDSSPNRQLIHEIIDAYEEKLAETKTGKKNKKERKKRNSGSGSGS----SEGKGSEPRSNEEESRVTEMEAAGGGGEEEAEKGSVRKIVNFLGE
        TSYWVRWDSSPNRQ+IHEIIDAYEE LAE+K GK NKKERKKRN+GSGSGS     +GKGSE     EESRVTEMEAA  GGE EAEKGSVR IV+++GE
Subjt:  TSYWVRWDSSPNRQLIHEIIDAYEEKLAETKTGKKNKKERKKRNSGSGSGS----SEGKGSEPRSNEEESRVTEMEAAGGGGEEEAEKGSVRKIVNFLGE

Query:  RIWGGW
        +IWGGW
Subjt:  RIWGGW

A0A6J1JYL3 uncharacterized protein LOC1114900451.0e-6874.13Show/hide
Query:  MKNLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSAVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCYTSYWVR
        MK  YRRRGTVHPSPP ISDHLSFLPAAILTLAAAL+ EDREVLAYLISSA  N  GHRGK+ QQK     T+P+AK GSDHPP+FSCGCFRCYT YWVR
Subjt:  MKNLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSAVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCYTSYWVR

Query:  WDSSPNRQLIHEIIDAYEEKLAETKTGKKNKKERKKRNSGSGSGSSEGKGSEPRSNEEESRVTEMEAA-----GGGGEEEAEKGSVRKIVNFLGERIWGG
        WDSSPNR++IHEII+AYEEKLAETKTGKKNKKERKKRN        EG  SE R  EE+ R+TE EAA     GGGGEE AEKG+VRKIV F+GE IWGG
Subjt:  WDSSPNRQLIHEIIDAYEEKLAETKTGKKNKKERKKRNSGSGSGSSEGKGSEPRSNEEESRVTEMEAA-----GGGGEEEAEKGSVRKIVNFLGERIWGG

Query:  W
        +
Subjt:  W

A0A6J1KLN6 uncharacterized protein LOC1114956871.9e-7576.44Show/hide
Query:  MKNLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCY
        M  LYRRRGTVHPSPPIISDHLSFLP AILTLAAALSPEDRE+LAYLISS       VNN  GHRGKA QQKP+      AAK GSDHPPAFSC CFRCY
Subjt:  MKNLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISS------AVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCY

Query:  TSYWVRWDSSPNRQLIHEIIDAYEEKLAETKTGKKNKKERKKRNSGSGSGS----SEGKGSEPRSNEEESRVTEMEAA--GGGGEEEAEKGSVRKIVNFL
        TSYWVRWDSSPNRQ+IHEIIDAYEE LAE+K GK NKKERKKRN+GSGSGS     +GKGSE     EESRVTEMEAA  GGGGE EAEKG+VR IV ++
Subjt:  TSYWVRWDSSPNRQLIHEIIDAYEEKLAETKTGKKNKKERKKRNSGSGSGS----SEGKGSEPRSNEEESRVTEMEAA--GGGGEEEAEKGSVRKIVNFL

Query:  GERIWGGW
        GE+IWGGW
Subjt:  GERIWGGW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G12020.1 unknown protein1.1e-3545.73Show/hide
Query:  MKNLYRRRGTVHPSPPII--SDH-LSFLPAAILTLAAALSPEDREVLAYLISSAVNNCYGHRGKAGQQKPSATATSPAAKGGS---DHPPAFSCGCFRCY
        MK LY R+GTVHPSPP I  +DH L+ LP AI +LAA LSPEDREVLAYLIS+A          +G++ P++      A   +   +H P F C CF CY
Subjt:  MKNLYRRRGTVHPSPPII--SDH-LSFLPAAILTLAAALSPEDREVLAYLISSAVNNCYGHRGKAGQQKPSATATSPAAKGGS---DHPPAFSCGCFRCY

Query:  TSYWVRWDSSPNRQLIHEIIDAYEEKLAETKTGKKN---KKERKKRNSGS----GSGSSEGKGSEPRSNEEESRV--------TEMEAAGGG--------
        TSYWVRWDSSP+RQLIHEIIDA+E+ L + K  KKN   KK+R+KR+  S     S S     SE  S   ES V        +E+   GGG        
Subjt:  TSYWVRWDSSPNRQLIHEIIDAYEEKLAETKTGKKN---KKERKKRNSGS----GSGSSEGKGSEPRSNEEESRV--------TEMEAAGGG--------

Query:  ---------GEEEAEKGSVRKIVNFLGERIWGGW
                  + E EKG+VR+ V+F+GE+++G W
Subjt:  ---------GEEEAEKGSVRKIVNFLGERIWGGW

AT1G24270.1 unknown protein6.6e-2042.07Show/hide
Query:  RRGTVHPSPPIIS-------DHLS---FLPAAILTLAAALSPEDREVLAYLISSAVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCYTS
        ++G VHPSPP+ S       D LS    L +AIL L + LS ED EVLAYLI+ ++N                T      K  S   P   C CF CYTS
Subjt:  RRGTVHPSPPIIS-------DHLS---FLPAAILTLAAALSPEDREVLAYLISSAVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCYTS

Query:  YWVRWDSSPNRQLIHEIIDAYE-----EKLAETKTGKKNKKERKK
        YW +WDSS NR+LI++II+A+E     ++++ + T KKNKK  KK
Subjt:  YWVRWDSSPNRQLIHEIIDAYE-----EKLAETKTGKKNKKERKK

AT1G62422.1 unknown protein4.7e-3448.98Show/hide
Query:  RRGTVHPSPP--IISDH--LSFLPAAILTLAAALSPEDREVLAYLISSAVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCYTSYWVRWD
        R+GTVHPSPP  I +D   LS LP AIL+L AALS EDREVLAYLIS++     G   +  + K +        K  + H P F C CF CYTSYWVRWD
Subjt:  RRGTVHPSPP--IISDH--LSFLPAAILTLAAALSPEDREVLAYLISSAVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCYTSYWVRWD

Query:  SSPNRQLIHEIIDAYEEKLAETKTGKKNKKERKKRNSGSGSGSSEGKGSEPRSNEEESRVTEMEAAG--GGGEEEAEKGSVRKIVNFLGERIWGGW
        +SP RQLIHEIIDAYE+ L E K  KK++++R  + SG  +     + SE  S+  E    + E  G  GG E E EKGSV K+++F+G+R  G W
Subjt:  SSPNRQLIHEIIDAYEEKLAETKTGKKNKKERKKRNSGSGSGSSEGKGSEPRSNEEESRVTEMEAAG--GGGEEEAEKGSVRKIVNFLGERIWGGW

AT5G13090.1 unknown protein4.1e-2238.85Show/hide
Query:  RRRGTVHPSPP---------IISDHLS-----------FLPAAILTLAAALSPEDREVLAYLISSAVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPA
        +++G V+PSPP           S+HL+            LPA IL L + LS E+REVLAYLI+         RG +  +  +   ++ ++K  +  PP 
Subjt:  RRRGTVHPSPP---------IISDHLS-----------FLPAAILTLAAALSPEDREVLAYLISSAVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPA

Query:  FSCGCFRCYTSYWVRWDSSPNRQLIHEIIDAYEEKLAETKTGKKNKKERKKRNSGSG
        F C CF CYT+YW RWDSSPNR+LIHEII+A+E    E  +  ++K +R K+    G
Subjt:  FSCGCFRCYTSYWVRWDSSPNRQLIHEIIDAYEEKLAETKTGKKNKKERKKRNSGSG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAACCTCTACCGGAGAAGAGGAACGGTTCACCCATCCCCGCCGATAATCTCCGACCACCTCTCGTTTCTCCCAGCCGCCATCCTCACCCTCGCGGCGGCGCTCTC
CCCGGAAGACCGAGAGGTTTTGGCCTACCTCATCTCTTCCGCCGTCAACAACTGCTACGGCCACCGCGGCAAGGCTGGCCAACAGAAGCCCTCGGCCACCGCCACCTCCC
CCGCCGCAAAAGGGGGTTCCGATCACCCTCCGGCTTTCTCCTGCGGGTGTTTCCGGTGCTACACGAGCTACTGGGTGAGATGGGACTCGTCTCCAAATCGGCAACTCATA
CACGAAATCATCGACGCGTATGAAGAAAAGTTGGCTGAGACCAAAACCGGTAAGAAGAATAAGAAAGAAAGGAAGAAGAGGAATAGCGGGTCGGGTTCCGGGTCGAGCGA
GGGGAAAGGGTCGGAACCAAGGTCGAATGAAGAAGAGTCGCGGGTGACGGAGATGGAGGCGGCGGGAGGCGGCGGTGAGGAAGAGGCAGAGAAAGGGTCGGTGAGGAAGA
TCGTGAATTTCCTAGGGGAAAGAATCTGGGGAGGTTGGATTAATTAG
mRNA sequenceShow/hide mRNA sequence
ATCGTTTCGAATCCCCACTCGCATCTTTTTCCCACGCGCTTTCACGCGCCGACACCCCCAATAAAATCTCTGCAACTTTTATCATTTCAACCCTTTCGCTTACTTCTCTC
TCTCTCTCTCTCTCTGTTTTAATCATATTCTGCACCGTCGACCTGAAATTTTTTTTGTTCTCCGATGAAGAACCTCTACCGGAGAAGAGGAACGGTTCACCCATCCCCGC
CGATAATCTCCGACCACCTCTCGTTTCTCCCAGCCGCCATCCTCACCCTCGCGGCGGCGCTCTCCCCGGAAGACCGAGAGGTTTTGGCCTACCTCATCTCTTCCGCCGTC
AACAACTGCTACGGCCACCGCGGCAAGGCTGGCCAACAGAAGCCCTCGGCCACCGCCACCTCCCCCGCCGCAAAAGGGGGTTCCGATCACCCTCCGGCTTTCTCCTGCGG
GTGTTTCCGGTGCTACACGAGCTACTGGGTGAGATGGGACTCGTCTCCAAATCGGCAACTCATACACGAAATCATCGACGCGTATGAAGAAAAGTTGGCTGAGACCAAAA
CCGGTAAGAAGAATAAGAAAGAAAGGAAGAAGAGGAATAGCGGGTCGGGTTCCGGGTCGAGCGAGGGGAAAGGGTCGGAACCAAGGTCGAATGAAGAAGAGTCGCGGGTG
ACGGAGATGGAGGCGGCGGGAGGCGGCGGTGAGGAAGAGGCAGAGAAAGGGTCGGTGAGGAAGATCGTGAATTTCCTAGGGGAAAGAATCTGGGGAGGTTGGATTAATTA
GTGGATTGATTGATGAAGAATTTGGAGGTTTACTGTGTTAGTTAATTAATTGCAGTAATTAAGGATGGAGGAATATGGGAATTGTGAAGAAGAAAAAAGAAGGTGATCTC
ATAAACTAATTAAGCTACCTTCTTCTTCTTGTTGTTGTTGATCTTCTTTTTTTGATTATATTCTGGTTCTTAGTTGTAAATAGAATCTTATGTTTATGAATGTAAAAATG
AATGAATTACATATCTATAGAATTGCAGTTAATGAATTTGGGATGTTTATGTATTATATAGAATCTTTAGTTGGTATGTATTTGTTTGTAATTAGTTGTTTTATTGCCAC
TTTGAAAGGCACTTAGATCCTAAAGCAATTTACTCAAGGAATTGTATTCTATTTCAGGAACTTCTAGCAACTCTTTTTTTTTTTGAAGTTTTTTTTATTTGTTAGTTCAC
TTCAAATAACTGAGGAGTTAGAATTAGAAATGAAATACTATGGCGTCGTAAGCTCCAGTGGACCAAGCTTTTGATCA
Protein sequenceShow/hide protein sequence
MKNLYRRRGTVHPSPPIISDHLSFLPAAILTLAAALSPEDREVLAYLISSAVNNCYGHRGKAGQQKPSATATSPAAKGGSDHPPAFSCGCFRCYTSYWVRWDSSPNRQLI
HEIIDAYEEKLAETKTGKKNKKERKKRNSGSGSGSSEGKGSEPRSNEEESRVTEMEAAGGGGEEEAEKGSVRKIVNFLGERIWGGWIN