CuGenDBv2

Tan0005168 (gene) of Snake gourd v1 genome

Gene ID: Tan0005168
Organism: Trichosanthes anguina (Snake gourd v1)
Description: TYR_PHOSPHATASE_2 domain-containing protein
Genome location: LG11:7931049..7933569
RNA-Seq Expression: Tan0005168
Synteny: Tan0005168
Gene Ontology terms:
GO:0006470 - protein dephosphorylation (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0008138 - protein tyrosine/serine/threonine phosphatase activity (molecular function)
InterPro domains:
IPR000340 - Dual specificity phosphatase, catalytic domain
IPR000387 - Tyrosine specific protein phosphatases domain
IPR020422 - Dual specificity protein phosphatase domain
IPR029021 - Protein-tyrosine phosphatase-like
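
The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, the string can be split into its parts and the span length computed as shown here. The names `Location` and `parse_location` are hypothetical and not part of CuGenDB, and the length assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; hypothetical helper type for illustration."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as 'LG11:7931049..7933569'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("LG11:7931049..7933569")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 2521 bases on LG11; with half-open coordinates the length would be one less.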


Homology
GenBank top hits (e value, %identity, alignment)
KAG7032936.1 ynbD [Cucurbita argyrosperma subsp. argyrosperma]  e value: 2.0e-132  %identity: 92.52
Query:  MKPGLSSLIGLKAAALFSLFLFLRFYGFRLLSFPILYASLVSLLVSIASLPSINLPLLLGKKSDGTFPIWSIVIFGPFLFFVRFLPSLRGLFRRDDPYSE
        MKPGLS LIGLKAAALFS FLFLRFYGFRLLS P LYASLVSLLVSIASLPSINLPLLLGKKSDGTFP+WSIVIFGPFLFFVRFLPSLRGL+ RDDPYSE
Subjt:  MKPGLSSLIGLKAAALFSLFLFLRFYGFRLLSFPILYASLVSLLVSIASLPSINLPLLLGKKSDGTFPIWSIVIFGPFLFFVRFLPSLRGLFRRDDPYSE

Query:  ICEGVYVGAWPCSSDRLPPGNPAIVDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWICRKRELKKPLFIHCAYGHGRSVAVTCAVLVALGEA
        ICEGVYVG WPCS DRLPP NPA+VDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWIC+KRELKKP+FIHCAYGHGRSVAVTCAVLVALG A
Subjt:  ICEGVYVGAWPCSSDRLPPGNPAIVDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWICRKRELKKPLFIHCAYGHGRSVAVTCAVLVALGEA

Query:  EDWKNAEKIIKEKRPCIRMNPSHRKALEEWSKLRLSTPKKKKDNDVNSMLLSGS
        +DWKNAEKIIKEKR CIRMNPSHRKALEEWSK RLSTPKKKKDNDV+SML+SGS
Subjt:  EDWKNAEKIIKEKRPCIRMNPSHRKALEEWSKLRLSTPKKKKDNDVNSMLLSGS

XP_022921744.1 uncharacterized protein LOC111429896 [Cucurbita moschata]  e value: 7.2e-130  %identity: 91.73
Query:  MKPGLSSLIGLKAAALFSLFLFLRFYGFRLLSFPILYASLVSLLVSIASLPSINLPLLLGKKSDGTFPIWSIVIFGPFLFFVRFLPSLRGLFRRDDPYSE
        MKPGLS LIGLKAAALFS FLFLRFYGFRLLS P LYASLVSLLVSIASLPSINLPLLLGKKSDGTFP+WSIVIFGPFLFFVRFLPSLRGL+ RDDPYSE
Subjt:  MKPGLSSLIGLKAAALFSLFLFLRFYGFRLLSFPILYASLVSLLVSIASLPSINLPLLLGKKSDGTFPIWSIVIFGPFLFFVRFLPSLRGLFRRDDPYSE

Query:  ICEGVYVGAWPCSSDRLPPGNPAIVDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWICRKRELKKPLFIHCAYGHGRSVAVTCAVLVALGEA
        ICEGVYVG WP S DRLPP NPA+VDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWIC+KRELKKP+FIHCAYGHGRSVAVTCAVLVALG A
Subjt:  ICEGVYVGAWPCSSDRLPPGNPAIVDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWICRKRELKKPLFIHCAYGHGRSVAVTCAVLVALGEA

Query:  EDWKNAEKIIKEKRPCIRMNPSHRKALEEWSKLRLSTPKKKKDNDVNSMLLSGS
        EDWKNAEKIIKEKR CIRMNPSHRKALEEWSK RLS PKKKKD DV+SML+SGS
Subjt:  EDWKNAEKIIKEKRPCIRMNPSHRKALEEWSKLRLSTPKKKKDNDVNSMLLSGS

XP_022990444.1 uncharacterized protein LOC111487299 [Cucurbita maxima]  e value: 9.4e-130  %identity: 91.73
Query:  MKPGLSSLIGLKAAALFSLFLFLRFYGFRLLSFPILYASLVSLLVSIASLPSINLPLLLGKKSDGTFPIWSIVIFGPFLFFVRFLPSLRGLFRRDDPYSE
        MKPGLS LIGLKAAALFS FLFLRFYGFRLLS P LYASLVSLLVSIASLPSINLPLLLGKKSDGTFP+WSIVIFGPFLFFVRFLPSLRGL+ +DDPYSE
Subjt:  MKPGLSSLIGLKAAALFSLFLFLRFYGFRLLSFPILYASLVSLLVSIASLPSINLPLLLGKKSDGTFPIWSIVIFGPFLFFVRFLPSLRGLFRRDDPYSE

Query:  ICEGVYVGAWPCSSDRLPPGNPAIVDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWICRKRELKKPLFIHCAYGHGRSVAVTCAVLVALGEA
        ICEGVYVG WPCS DRLPPG PA+VDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIE AVRWIC+KRELKKP+FIHCAYGHGRSVAVTCAVLVALG A
Subjt:  ICEGVYVGAWPCSSDRLPPGNPAIVDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWICRKRELKKPLFIHCAYGHGRSVAVTCAVLVALGEA

Query:  EDWKNAEKIIKEKRPCIRMNPSHRKALEEWSKLRLSTPKKKKDNDVNSMLLSGS
        EDWKNAEKIIKEKR CIRMNPSHRKALEEWSK RLS P KKKDNDV+SMLLSGS
Subjt:  EDWKNAEKIIKEKRPCIRMNPSHRKALEEWSKLRLSTPKKKKDNDVNSMLLSGS

XP_023533014.1 uncharacterized protein LOC111795019 [Cucurbita pepo subsp. pepo]  e value: 1.9e-130  %identity: 91.7
Query:  MKPGLSSLIGLKAAALFSLFLFLRFYGFRLLSFPILYASLVSLLVSIASLPSINLPLLLGKKSDGTFPIWSIVIFGPFLFFVRFLPSLRGLFRRDDPYSE
        MKPGLS LIGLKAAALFS FLFLRFYGFRLLS P LYASLVSLLVSIASLPSINLPLLLGKKSDGTFP+WSIVIFGPFLFFVRFLPSLRGL+ +DDPYSE
Subjt:  MKPGLSSLIGLKAAALFSLFLFLRFYGFRLLSFPILYASLVSLLVSIASLPSINLPLLLGKKSDGTFPIWSIVIFGPFLFFVRFLPSLRGLFRRDDPYSE

Query:  ICEGVYVGAWPCSSDRLPPGNPAIVDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWICRKRELKKPLFIHCAYGHGRSVAVTCAVLVALGEA
        ICEGVYVG WPCS DRLPPG PA+VDCTCELPR LEVSGTGYLCVPTWDTRSPQPEEIERAVRWIC+KRELKKP+FIHCAYGHGRSVAVTCAVLVALG A
Subjt:  ICEGVYVGAWPCSSDRLPPGNPAIVDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWICRKRELKKPLFIHCAYGHGRSVAVTCAVLVALGEA

Query:  EDWKNAEKIIKEKRPCIRMNPSHRKALEEWSKLRLSTPKKKKDNDVNSMLLSG
        EDWKNAEKIIKEKR CIRMNPSHRKALEEWSK RLS PKKKKDNDV+SML+SG
Subjt:  EDWKNAEKIIKEKRPCIRMNPSHRKALEEWSKLRLSTPKKKKDNDVNSMLLSG

XP_038886707.1 uncharacterized protein YnbD-like [Benincasa hispida]  e value: 6.8e-128  %identity: 88.98
Query:  MKPGLSSLIGLKAAALFSLFLFLRFYGFRLLSFPILYASLVSLLVSIASLPSINLPLLLGKKSDGTFPIWSIVIFGPFLFFVRFLPSLRGLFRRDDPYSE
        MKPGLSSLIGLKAA LFSLFLFLRFYGFRLLSF  LYASLVSLLVS+ASLPSINLPLLLGKKSDGTFPIWSI+IFGPFL+FVR+LPSLRGL+RRDDPYSE
Subjt:  MKPGLSSLIGLKAAALFSLFLFLRFYGFRLLSFPILYASLVSLLVSIASLPSINLPLLLGKKSDGTFPIWSIVIFGPFLFFVRFLPSLRGLFRRDDPYSE

Query:  ICEGVYVGAWPCSSDRLPPGNPAIVDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWICRKRELKKPLFIHCAYGHGRSVAVTCAVLVALGEA
        ICEG++VG WP S DRLPP NPAIVDCTCELPRCL+VSG+GYLCVPTWDTRSPQP+EIE AVRWICRKRE KKP+FIHCAYGHGRSVAVTCA LVALGEA
Subjt:  ICEGVYVGAWPCSSDRLPPGNPAIVDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWICRKRELKKPLFIHCAYGHGRSVAVTCAVLVALGEA

Query:  EDWKNAEKIIKEKRPCIRMNPSHRKALEEWSKLRLSTPKKKKDNDVNSMLLSGS
        EDWK+AEKIIKEKRPCIRMN SHRKALEEWSK RLS PKKK+DNDV+S LLSGS
Subjt:  EDWKNAEKIIKEKRPCIRMNPSHRKALEEWSKLRLSTPKKKKDNDVNSMLLSGS

TrEMBL top hits (e value, %identity, alignment)
A0A0A0KE65 TYR_PHOSPHATASE_2 domain-containing protein  e value: 3.3e-120  %identity: 86.36
Query:  MKPGLSSLIGLKAAALFSLFLFLRFYGFRLLSFPILYASLVSLLVSIASLPSINLPLLLGKKSDGTFPIWSIVIFGPFLFFVRFLPSLRGLFRRDDPYSE
        +KPGLSSLIGLKA ALFSLFLF RFYGFRLLSF  LYASLVS LVS+ASLPSINLPLLLGKKSDGTFPIWS++IFGPFL+FVR+LPSLRGL+R+DDPYSE
Subjt:  MKPGLSSLIGLKAAALFSLFLFLRFYGFRLLSFPILYASLVSLLVSIASLPSINLPLLLGKKSDGTFPIWSIVIFGPFLFFVRFLPSLRGLFRRDDPYSE

Query:  ICEGVYVGAWPCSSDRLPPGNPAIVDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWICRKRELKKPLFIHCAYGHGRSVAVTCAVLVALGEA
        IC+G++VG WPCS DRLPP NPAIVDCTCELPRCLE+SG GYLCVPTWDTRSPQP EIE AVRWICRKRE KKP+FIHCAYGHGRSVAVTCA LVALGEA
Subjt:  ICEGVYVGAWPCSSDRLPPGNPAIVDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWICRKRELKKPLFIHCAYGHGRSVAVTCAVLVALGEA

Query:  EDWKNAEKIIKEKRPCIRMNPSHRKALEEWSKLRLSTPKKKK
        EDWK+AEKI KEKRPCIRMN SHRKALEEWSK RLS PKK++
Subjt:  EDWKNAEKIIKEKRPCIRMNPSHRKALEEWSKLRLSTPKKKK

A0A1S3C3L2 uncharacterized protein YnbD-like  e value: 1.3e-121  %identity: 85.43
Query:  MKPGLSSLIGLKAAALFSLFLFLRFYGFRLLSFPILYASLVSLLVSIASLPSINLPLLLGKKSDGTFPIWSIVIFGPFLFFVRFLPSLRGLFRRDDPYSE
        MKPGLSSLIGLKA ALFSLFLF RFYGFRLLSF  LYASLVS LVS+ASLPSINLPLLLGKKSDGTFPIWS++IFGPFL+FVR+LPSLRGL+R+DDPYSE
Subjt:  MKPGLSSLIGLKAAALFSLFLFLRFYGFRLLSFPILYASLVSLLVSIASLPSINLPLLLGKKSDGTFPIWSIVIFGPFLFFVRFLPSLRGLFRRDDPYSE

Query:  ICEGVYVGAWPCSSDRLPPGNPAIVDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWICRKRELKKPLFIHCAYGHGRSVAVTCAVLVALGEA
        IC+G++VG WPCS DRLPP NPAIVDCTCELPRCLEVSG GYLC+PTWDTRSPQP +IE AVRWICRKRE KKP+FIHCAYGHGRSVAV CA LVALGEA
Subjt:  ICEGVYVGAWPCSSDRLPPGNPAIVDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWICRKRELKKPLFIHCAYGHGRSVAVTCAVLVALGEA

Query:  EDWKNAEKIIKEKRPCIRMNPSHRKALEEWSKLRLSTPKKKKDNDVN
        EDWK+AEKIIKEKRPCIRMN SHRKALEEWSK +LS PKKK+ ND++
Subjt:  EDWKNAEKIIKEKRPCIRMNPSHRKALEEWSKLRLSTPKKKKDNDVN

A0A6J1BZH1 uncharacterized protein LOC111006659  e value: 1.2e-125  %identity: 86.96
Query:  MKPGLSSLIGLKAAALFSLFLFLRFYGFRLLSFPILYASLVSLLVSIASLPSINLPLLLGKKSDGTFPIWSIVIFGPFLFFVRFLPSLRGLFRRDDPYSE
        MKPG+S LIGLKAAALFSLFLFLRFYGFRLLSF  LYASLVSLLVS+ASLPSINLPLLLGK++DG+FPIWSIVIF PFLFFVRFLPSLRGL+RRDDPYSE
Subjt:  MKPGLSSLIGLKAAALFSLFLFLRFYGFRLLSFPILYASLVSLLVSIASLPSINLPLLLGKKSDGTFPIWSIVIFGPFLFFVRFLPSLRGLFRRDDPYSE

Query:  ICEGVYVGAWPCSSDRLPPGNPAIVDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWICRKRELKKPLFIHCAYGHGRSVAVTCAVLVALGEA
        ICEG++VG WP S DRLPP NPAI+DCTCELPRCLEVSG  YLC+PTWDTRSPQPE IE AVRW+CRKRE K+P+FIHCAYGHGRSVAVTCAVLVALGEA
Subjt:  ICEGVYVGAWPCSSDRLPPGNPAIVDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWICRKRELKKPLFIHCAYGHGRSVAVTCAVLVALGEA

Query:  EDWKNAEKIIKEKRPCIRMNPSHRKALEEWSKLRLSTPKKKKDNDVNSMLLSG
        EDWKNAEK+IKEKRPCIRMN SHRKALEEWSK RLS P KK+DNDV+SMLLSG
Subjt:  EDWKNAEKIIKEKRPCIRMNPSHRKALEEWSKLRLSTPKKKKDNDVNSMLLSG

A0A6J1E1D8 uncharacterized protein LOC111429896  e value: 3.5e-130  %identity: 91.73
Query:  MKPGLSSLIGLKAAALFSLFLFLRFYGFRLLSFPILYASLVSLLVSIASLPSINLPLLLGKKSDGTFPIWSIVIFGPFLFFVRFLPSLRGLFRRDDPYSE
        MKPGLS LIGLKAAALFS FLFLRFYGFRLLS P LYASLVSLLVSIASLPSINLPLLLGKKSDGTFP+WSIVIFGPFLFFVRFLPSLRGL+ RDDPYSE
Subjt:  MKPGLSSLIGLKAAALFSLFLFLRFYGFRLLSFPILYASLVSLLVSIASLPSINLPLLLGKKSDGTFPIWSIVIFGPFLFFVRFLPSLRGLFRRDDPYSE

Query:  ICEGVYVGAWPCSSDRLPPGNPAIVDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWICRKRELKKPLFIHCAYGHGRSVAVTCAVLVALGEA
        ICEGVYVG WP S DRLPP NPA+VDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWIC+KRELKKP+FIHCAYGHGRSVAVTCAVLVALG A
Subjt:  ICEGVYVGAWPCSSDRLPPGNPAIVDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWICRKRELKKPLFIHCAYGHGRSVAVTCAVLVALGEA

Query:  EDWKNAEKIIKEKRPCIRMNPSHRKALEEWSKLRLSTPKKKKDNDVNSMLLSGS
        EDWKNAEKIIKEKR CIRMNPSHRKALEEWSK RLS PKKKKD DV+SML+SGS
Subjt:  EDWKNAEKIIKEKRPCIRMNPSHRKALEEWSKLRLSTPKKKKDNDVNSMLLSGS

A0A6J1JS23 uncharacterized protein LOC111487299  e value: 4.6e-130  %identity: 91.73
Query:  MKPGLSSLIGLKAAALFSLFLFLRFYGFRLLSFPILYASLVSLLVSIASLPSINLPLLLGKKSDGTFPIWSIVIFGPFLFFVRFLPSLRGLFRRDDPYSE
        MKPGLS LIGLKAAALFS FLFLRFYGFRLLS P LYASLVSLLVSIASLPSINLPLLLGKKSDGTFP+WSIVIFGPFLFFVRFLPSLRGL+ +DDPYSE
Subjt:  MKPGLSSLIGLKAAALFSLFLFLRFYGFRLLSFPILYASLVSLLVSIASLPSINLPLLLGKKSDGTFPIWSIVIFGPFLFFVRFLPSLRGLFRRDDPYSE

Query:  ICEGVYVGAWPCSSDRLPPGNPAIVDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWICRKRELKKPLFIHCAYGHGRSVAVTCAVLVALGEA
        ICEGVYVG WPCS DRLPPG PA+VDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIE AVRWIC+KRELKKP+FIHCAYGHGRSVAVTCAVLVALG A
Subjt:  ICEGVYVGAWPCSSDRLPPGNPAIVDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWICRKRELKKPLFIHCAYGHGRSVAVTCAVLVALGEA

Query:  EDWKNAEKIIKEKRPCIRMNPSHRKALEEWSKLRLSTPKKKKDNDVNSMLLSGS
        EDWKNAEKIIKEKR CIRMNPSHRKALEEWSK RLS P KKKDNDV+SMLLSGS
Subjt:  EDWKNAEKIIKEKRPCIRMNPSHRKALEEWSKLRLSTPKKKKDNDVNSMLLSGS

SwissProt top hits (e value, %identity, alignment)
P76093 Uncharacterized protein YnbD  e value: 8.9e-14  %identity: 29.77
Query:  LFSLFLFLRFYGFRLLSFPILYASLVSLLVSIASLPSINLPLLLGKKSDGTFPIWSIVIFGPFLFFVRFLPSLRGLFRRDDPYSEICEGVYVGAWPCSSD
        L  L + ++ +    L +P+L  SL+ +      L +I      GK S G  P    V +      +    S+R   RR +P S++  GVY+GA+P    
Subjt:  LFSLFLFLRFYGFRLLSFPILYASLVSLLVSIASLPSINLPLLLGKKSDGTFPIWSIVIFGPFLFFVRFLPSLRGLFRRDDPYSEICEGVYVGAWPCSSD

Query:  RLPPGNPAIVDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWICRKRELKKPLFIHCAYGHGRSVAVTCAVLVALGEAEDWKNAEKIIKEKRP
        R  P   A++D T E PR        Y CVP  D   P+  E+ +AV  +   RE +  + +HCA G  RS  V  A L+  G  +    A   I+ +RP
Subjt:  RLPPGNPAIVDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWICRKRELKKPLFIHCAYGHGRSVAVTCAVLVALGEAEDWKNAEKIIKEKRP

Query:  CIRMNPSHRKALEEW
         I +   H+  L  W
Subjt:  CIRMNPSHRKALEEW

Arabidopsis top hits (e value, %identity, alignment)
No hits found

Sequences
CDS sequence
ATGAAACCAGGATTATCATCATTGATCGGGTTGAAGGCAGCAGCCTTGTTCTCTCTATTTCTATTTCTTAGATTCTATGGCTTCAGATTGTTATCGTTTCCAATCTTGTA
CGCCTCTTTAGTTTCTCTCTTGGTTTCAATCGCTTCTCTTCCATCCATCAATCTTCCCCTGCTTTTGGGGAAGAAATCAGACGGGACTTTTCCAATTTGGTCGATCGTTA
TTTTTGGTCCCTTTTTGTTTTTCGTTCGATTTCTTCCTTCATTGCGTGGATTGTTCCGTAGGGATGATCCTTACTCTGAAATTTGTGAGGGTGTTTATGTTGGGGCATGG
CCTTGTTCATCTGATAGATTGCCTCCTGGTAATCCTGCCATTGTTGATTGTACTTGTGAATTGCCAAGGTGTTTGGAAGTCTCTGGGACTGGTTATTTGTGTGTTCCAAC
TTGGGACACACGTTCTCCTCAGCCTGAAGAAATTGAAAGGGCTGTTCGGTGGATTTGTAGAAAGAGAGAGCTGAAGAAGCCGTTGTTTATCCATTGTGCTTATGGCCATG
GAAGAAGTGTTGCTGTTACATGTGCAGTGTTAGTGGCTCTAGGAGAGGCAGAAGATTGGAAAAATGCAGAGAAGATAATAAAAGAAAAACGACCTTGCATTCGGATGAAT
CCTTCTCATCGTAAAGCTTTGGAAGAATGGTCGAAACTTCGGTTATCCACTCCGAAGAAGAAGAAAGATAATGATGTGAATTCTATGCTTCTTTCTGGTAGCTGA
mRNA sequence
CAGAAATTTCTTCCTGTCATTGTATTGTAATTTACGACAAAACAAAGAAGAAAAAAAATCATCCTTTTTTCCCATTGGCTTCATTTCAACGAGGATTGGAGGACAATTCT
CGATGTTGCTTCTTCCAGTTCTCCCTCGGCTTCCACGATTCATAAGATCGTAATTGATTTCCAAACTGTGAGAACCCTTTTGATTTTCCGATCTGATATTCATCTGGGTT
TTGAATAATCCAACTTTTTGAGGCTCCAAAACGATGAAACCAGGATTATCATCATTGATCGGGTTGAAGGCAGCAGCCTTGTTCTCTCTATTTCTATTTCTTAGATTCTA
TGGCTTCAGATTGTTATCGTTTCCAATCTTGTACGCCTCTTTAGTTTCTCTCTTGGTTTCAATCGCTTCTCTTCCATCCATCAATCTTCCCCTGCTTTTGGGGAAGAAAT
CAGACGGGACTTTTCCAATTTGGTCGATCGTTATTTTTGGTCCCTTTTTGTTTTTCGTTCGATTTCTTCCTTCATTGCGTGGATTGTTCCGTAGGGATGATCCTTACTCT
GAAATTTGTGAGGGTGTTTATGTTGGGGCATGGCCTTGTTCATCTGATAGATTGCCTCCTGGTAATCCTGCCATTGTTGATTGTACTTGTGAATTGCCAAGGTGTTTGGA
AGTCTCTGGGACTGGTTATTTGTGTGTTCCAACTTGGGACACACGTTCTCCTCAGCCTGAAGAAATTGAAAGGGCTGTTCGGTGGATTTGTAGAAAGAGAGAGCTGAAGA
AGCCGTTGTTTATCCATTGTGCTTATGGCCATGGAAGAAGTGTTGCTGTTACATGTGCAGTGTTAGTGGCTCTAGGAGAGGCAGAAGATTGGAAAAATGCAGAGAAGATA
ATAAAAGAAAAACGACCTTGCATTCGGATGAATCCTTCTCATCGTAAAGCTTTGGAAGAATGGTCGAAACTTCGGTTATCCACTCCGAAGAAGAAGAAAGATAATGATGT
GAATTCTATGCTTCTTTCTGGTAGCTGAGGAGCAAAAAAATGAATCTTATAGGCCATTTCAGACCCCTCTTGTAATCTTGCATATCTTACTTATTGCTTGGATGACCATT
TGATGTAATGAATGAACTTTTGAGTACTGCTAATATTAATAGGTAAACAAACAATCAAACTTTGTAGCAAGAACTTTTCATTTTTATTTTTCTAGGATTCAGTCCTTCAA
TTTCTGTGGCTTTCAATTGCCATTATTGCTCCAAAACTGCAAATTGTTAGTTCAGCTTTGAAGTGTGGTTATGGAAATGAGTATACAAACTGTTCAAACT
Protein sequence
MKPGLSSLIGLKAAALFSLFLFLRFYGFRLLSFPILYASLVSLLVSIASLPSINLPLLLGKKSDGTFPIWSIVIFGPFLFFVRFLPSLRGLFRRDDPYSEICEGVYVGAW
PCSSDRLPPGNPAIVDCTCELPRCLEVSGTGYLCVPTWDTRSPQPEEIERAVRWICRKRELKKPLFIHCAYGHGRSVAVTCAVLVALGEAEDWKNAEKIIKEKRPCIRMN
PSHRKALEEWSKLRLSTPKKKKDNDVNSMLLSGS