; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0020556 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0020556
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDUF4283 domain-containing protein
Genome locationchr7:344827..348000
RNA-Seq ExpressionLag0020556
SyntenyLag0020556
Gene Ontology termsGO:0043666 - regulation of phosphoprotein phosphatase activity (biological process)
GO:0019903 - protein phosphatase binding (molecular function)
InterPro domainsIPR007587 - SIT4 phosphatase-associated protein family
IPR025558 - Domain of unknown function DUF4283


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040039.1 hypothetical protein E6C27_scaffold366G00060 [Cucumis melo var. makuwa]1.8e-2927.78Show/hide
Query:  SFRVERKTFSISNDHRNPNL-FRLTETCKDRSVTLSFAKSLLPWLQTCFDRLCSIPLTQKFFNETRTEETVLWVEKISSKRDHSAEIARLGPNGGINKLI
        S  +E+K F +S D+ + +    +TE    +S +++     L WL++ F  L   P T +FF E R EE  LWV+K  +++ + AEI R+   G    ++
Subjt:  SFRVERKTFSISNDHRNPNL-FRLTETCKDRSVTLSFAKSLLPWLQTCFDRLCSIPLTQKFFNETRTEETVLWVEKISSKRDHSAEIARLGPNGGINKLI

Query:  VPVGENRKGWKSLI------------TYIQSLTNQHQRAVIRPQAPTDINRSSSYRDALQKGQEDTPTCNQK-QTADDIAPSQPEFATIFLSSSVIIQR-
        VP G  + GW   +            T  +++ N   R         +  +  SY +A+ KG       N + +       S   F+  +  ++V+ +R 
Subjt:  VPVGENRKGWKSLI------------TYIQSLTNQHQRAVIRPQAPTDINRSSSYRDALQKGQEDTPTCNQK-QTADDIAPSQPEFATIFLSSSVIIQR-

Query:  ----------------------KHYHDNKALLACEDEEQMRILSKIKGWYKVGKYLVRFHPWSVAAMLGEPKVPSYGGWKKIRNLSRDKWNIDNFRKIGD
                              K +H +KAL+  ++EEQ ++L K KGW  VG++ V+F  WS         +PSYGGW K+R +    WN+++F +IGD
Subjt:  ----------------------KHYHDNKALLACEDEEQMRILSKIKGWYKVGKYLVRFHPWSVAAMLGEPKVPSYGGWKKIRNLSRDKWNIDNFRKIGD

Query:  VCGGYIETANNTERAPPIAKSPKK
         CGG++E A  T     I ++  K
Subjt:  VCGGYIETANNTERAPPIAKSPKK

KAA0050054.1 hypothetical protein E6C27_scaffold675G00340 [Cucumis melo var. makuwa]1.4e-2928.35Show/hide
Query:  SFRVERKTFSISNDHRNPNL-FRLTETCKDRSVTLSFAKSLLPWLQTCFDRLCSIPLTQKFFNETRTEETVLWVEKISSKRDHSAEIARLGPNGGINKLI
        S  +E+K F  S D+R+ +L   +TE    +S +++     L WL++ F  L     T +FF E R E+  LWV+K  +++ + AEI R+   G    ++
Subjt:  SFRVERKTFSISNDHRNPNL-FRLTETCKDRSVTLSFAKSLLPWLQTCFDRLCSIPLTQKFFNETRTEETVLWVEKISSKRDHSAEIARLGPNGGINKLI

Query:  VPVGENRKGWKSLITYIQSLTNQHQRAVIRPQAPT------------DINRSSSYRDALQKG-----QEDTPTCNQKQTADDIAPSQPEF-ATIFLSS--
        VP G  + GW   ++ +    +  ++   R    T            D  +  SY +A+ KG     + ++ T N K+T         E+  T+ L+   
Subjt:  VPVGENRKGWKSLITYIQSLTNQHQRAVIRPQAPT------------DINRSSSYRDALQKG-----QEDTPTCNQKQTADDIAPSQPEF-ATIFLSS--

Query:  ----------------SVIIQRKHYHDNKALLACEDEEQMRILSKIKGWYKVGKYLVRFHPWSVAAMLGEPKVPSYGGWKKIRNLSRDKWNIDNFRKIGD
                           ++ K +H +KAL+  ++EEQ  ++ K KGW  VG++ V+F  W+  A      +PSYGGW K+R +    WN+++F +IGD
Subjt:  ----------------SVIIQRKHYHDNKALLACEDEEQMRILSKIKGWYKVGKYLVRFHPWSVAAMLGEPKVPSYGGWKKIRNLSRDKWNIDNFRKIGD

Query:  VCGGYIETANNTERAPPIAKS
         CGG+IE A  T     I ++
Subjt:  VCGGYIETANNTERAPPIAKS

KAA0067710.1 hypothetical protein E6C27_scaffold352G00160 [Cucumis melo var. makuwa]5.5e-2630.47Show/hide
Query:  LTETCKDRSVTLSFAKSLLPWLQTCFDRLCSIPLTQKFFNETRTEETVLWVEKISSKRDHSAEIARLGPNGGINKLIVPVGENRKGWKSLITYIQSLTNQ
        +TE    +S +++     L WL++ F  L   P T +FF + R EE  LWV+K  +++ + AEI R+   G    ++VP G  + GW   ++ +      
Subjt:  LTETCKDRSVTLSFAKSLLPWLQTCFDRLCSIPLTQKFFNETRTEETVLWVEKISSKRDHSAEIARLGPNGGINKLIVPVGENRKGWKSLITYIQSLTNQ

Query:  HQRAVIRPQAPTDINRSSSYRDALQKGQEDTPTCNQKQTADDIAPSQPEFATIFLSSS---VIIQRKHYHDNKALLACEDEEQMRILSKIKGWYKVGKYL
                    D +   +Y+  L    +D    +  ++ DD        A I  SSS        K +H +KAL+  +D EQ  ++ K KGW  VG++ 
Subjt:  HQRAVIRPQAPTDINRSSSYRDALQKGQEDTPTCNQKQTADDIAPSQPEFATIFLSSS---VIIQRKHYHDNKALLACEDEEQMRILSKIKGWYKVGKYL

Query:  VRFHPWSVAAMLGEPKVPSYGGWKKIRNLSRDKWNIDNFRKIGDVCGGYIETANNT
        V+F  W+         VPSYGGW K+R +    WN+++F +IGDV GG++E A  T
Subjt:  VRFHPWSVAAMLGEPKVPSYGGWKKIRNLSRDKWNIDNFRKIGDVCGGYIETANNT

TYK10355.1 hypothetical protein E5676_scaffold367G00330 [Cucumis melo var. makuwa]1.3e-3028.66Show/hide
Query:  SFRVERKTFSISNDHRNPNL-FRLTETCKDRSVTLSFAKSLLPWLQTCFDRLCSIPLTQKFFNETRTEETVLWVEKISSKRDHSAEIARLGPNGGINKLI
        S  +E+K F +S D+++ +L   +TE    +S +++     L WL++ F  L   P T +FF E R E+  LWV+K  +++ + AEI R+   G    ++
Subjt:  SFRVERKTFSISNDHRNPNL-FRLTETCKDRSVTLSFAKSLLPWLQTCFDRLCSIPLTQKFFNETRTEETVLWVEKISSKRDHSAEIARLGPNGGINKLI

Query:  VPVGENRKGWKSLITYI----QSLTNQHQRAVIRPQAPT--------DINRSSSYRDALQKG-----QEDTPTCNQKQTADDIAPSQPEF-ATIFLSS--
        VP G  + GW   ++ +     S T  + R ++  +           D  +  SY +A+ KG     + ++ T N K+T         E+  T+ L+   
Subjt:  VPVGENRKGWKSLITYI----QSLTNQHQRAVIRPQAPT--------DINRSSSYRDALQKG-----QEDTPTCNQKQTADDIAPSQPEF-ATIFLSS--

Query:  ----------------SVIIQRKHYHDNKALLACEDEEQMRILSKIKGWYKVGKYLVRFHPWSVAAMLGEPKVPSYGGWKKIRNLSRDKWNIDNFRKIGD
                           ++ K +H +KAL+  ++EEQ  ++ K KGW  VG++ V+F  W+  A      +PSYGGW K+R +    WN+++F +IGD
Subjt:  ----------------SVIIQRKHYHDNKALLACEDEEQMRILSKIKGWYKVGKYLVRFHPWSVAAMLGEPKVPSYGGWKKIRNLSRDKWNIDNFRKIGD

Query:  VCGGYIETANNTERAPPIAKS
         CGG+IE A  T     I ++
Subjt:  VCGGYIETANNTERAPPIAKS

TYK24535.1 hypothetical protein E5676_scaffold266G00770 [Cucumis melo var. makuwa]1.1e-2629.25Show/hide
Query:  SFRVERKTFSISNDHRNPNL-FRLTETCKDRSVTLSFAKSLLPWLQTCFDRLCSIPLTQKFFNETRTEETVLWVEKISSKRDHSAEIARLGPNGGINKLI
        S  +E+K F +S D+R+ +    +TE    +S +++     L WL++ F  L   P T +FF E R EE  LWV+K  +++ + AEI R+   G    ++
Subjt:  SFRVERKTFSISNDHRNPNL-FRLTETCKDRSVTLSFAKSLLPWLQTCFDRLCSIPLTQKFFNETRTEETVLWVEKISSKRDHSAEIARLGPNGGINKLI

Query:  VPVGENRKGWKSLITYIQSLTNQHQRAVIRPQAPTDINRSSSYRDALQKGQEDTPTCNQKQTADDIAPSQPEFATIFLSSSVIIQRKHYHDNKALLACED
        VP G  +   +++I    S      R V       +I ++           E T    ++   DD    +     +       ++ K +H +KAL+  ++
Subjt:  VPVGENRKGWKSLITYIQSLTNQHQRAVIRPQAPTDINRSSSYRDALQKGQEDTPTCNQKQTADDIAPSQPEFATIFLSSSVIIQRKHYHDNKALLACED

Query:  EEQMRILSKIKGWYKVGKYLVRFHPWSVAAMLGEPKVPSYGGWKKIRNLSRDKWNIDNFRKIGDVCGGYIETANNTERAPPIAKSPKKSQSQKS
        EEQ ++L K KGW  VG++ V+F  WS         +PSYGGW K+R +    WN+++F +IGD CGG++E A  T     I ++  K +   S
Subjt:  EEQMRILSKIKGWYKVGKYLVRFHPWSVAAMLGEPKVPSYGGWKKIRNLSRDKWNIDNFRKIGDVCGGYIETANNTERAPPIAKSPKKSQSQKS

TrEMBL top hitse value%identityAlignment
A0A5A7TFK7 DUF4283 domain-containing protein8.9e-3027.78Show/hide
Query:  SFRVERKTFSISNDHRNPNL-FRLTETCKDRSVTLSFAKSLLPWLQTCFDRLCSIPLTQKFFNETRTEETVLWVEKISSKRDHSAEIARLGPNGGINKLI
        S  +E+K F +S D+ + +    +TE    +S +++     L WL++ F  L   P T +FF E R EE  LWV+K  +++ + AEI R+   G    ++
Subjt:  SFRVERKTFSISNDHRNPNL-FRLTETCKDRSVTLSFAKSLLPWLQTCFDRLCSIPLTQKFFNETRTEETVLWVEKISSKRDHSAEIARLGPNGGINKLI

Query:  VPVGENRKGWKSLI------------TYIQSLTNQHQRAVIRPQAPTDINRSSSYRDALQKGQEDTPTCNQK-QTADDIAPSQPEFATIFLSSSVIIQR-
        VP G  + GW   +            T  +++ N   R         +  +  SY +A+ KG       N + +       S   F+  +  ++V+ +R 
Subjt:  VPVGENRKGWKSLI------------TYIQSLTNQHQRAVIRPQAPTDINRSSSYRDALQKGQEDTPTCNQK-QTADDIAPSQPEFATIFLSSSVIIQR-

Query:  ----------------------KHYHDNKALLACEDEEQMRILSKIKGWYKVGKYLVRFHPWSVAAMLGEPKVPSYGGWKKIRNLSRDKWNIDNFRKIGD
                              K +H +KAL+  ++EEQ ++L K KGW  VG++ V+F  WS         +PSYGGW K+R +    WN+++F +IGD
Subjt:  ----------------------KHYHDNKALLACEDEEQMRILSKIKGWYKVGKYLVRFHPWSVAAMLGEPKVPSYGGWKKIRNLSRDKWNIDNFRKIGD

Query:  VCGGYIETANNTERAPPIAKSPKK
         CGG++E A  T     I ++  K
Subjt:  VCGGYIETANNTERAPPIAKSPKK

A0A5A7U495 DUF4283 domain-containing protein6.8e-3028.35Show/hide
Query:  SFRVERKTFSISNDHRNPNL-FRLTETCKDRSVTLSFAKSLLPWLQTCFDRLCSIPLTQKFFNETRTEETVLWVEKISSKRDHSAEIARLGPNGGINKLI
        S  +E+K F  S D+R+ +L   +TE    +S +++     L WL++ F  L     T +FF E R E+  LWV+K  +++ + AEI R+   G    ++
Subjt:  SFRVERKTFSISNDHRNPNL-FRLTETCKDRSVTLSFAKSLLPWLQTCFDRLCSIPLTQKFFNETRTEETVLWVEKISSKRDHSAEIARLGPNGGINKLI

Query:  VPVGENRKGWKSLITYIQSLTNQHQRAVIRPQAPT------------DINRSSSYRDALQKG-----QEDTPTCNQKQTADDIAPSQPEF-ATIFLSS--
        VP G  + GW   ++ +    +  ++   R    T            D  +  SY +A+ KG     + ++ T N K+T         E+  T+ L+   
Subjt:  VPVGENRKGWKSLITYIQSLTNQHQRAVIRPQAPT------------DINRSSSYRDALQKG-----QEDTPTCNQKQTADDIAPSQPEF-ATIFLSS--

Query:  ----------------SVIIQRKHYHDNKALLACEDEEQMRILSKIKGWYKVGKYLVRFHPWSVAAMLGEPKVPSYGGWKKIRNLSRDKWNIDNFRKIGD
                           ++ K +H +KAL+  ++EEQ  ++ K KGW  VG++ V+F  W+  A      +PSYGGW K+R +    WN+++F +IGD
Subjt:  ----------------SVIIQRKHYHDNKALLACEDEEQMRILSKIKGWYKVGKYLVRFHPWSVAAMLGEPKVPSYGGWKKIRNLSRDKWNIDNFRKIGD

Query:  VCGGYIETANNTERAPPIAKS
         CGG+IE A  T     I ++
Subjt:  VCGGYIETANNTERAPPIAKS

A0A5A7VKI3 DUF4283 domain-containing protein2.7e-2630.47Show/hide
Query:  LTETCKDRSVTLSFAKSLLPWLQTCFDRLCSIPLTQKFFNETRTEETVLWVEKISSKRDHSAEIARLGPNGGINKLIVPVGENRKGWKSLITYIQSLTNQ
        +TE    +S +++     L WL++ F  L   P T +FF + R EE  LWV+K  +++ + AEI R+   G    ++VP G  + GW   ++ +      
Subjt:  LTETCKDRSVTLSFAKSLLPWLQTCFDRLCSIPLTQKFFNETRTEETVLWVEKISSKRDHSAEIARLGPNGGINKLIVPVGENRKGWKSLITYIQSLTNQ

Query:  HQRAVIRPQAPTDINRSSSYRDALQKGQEDTPTCNQKQTADDIAPSQPEFATIFLSSS---VIIQRKHYHDNKALLACEDEEQMRILSKIKGWYKVGKYL
                    D +   +Y+  L    +D    +  ++ DD        A I  SSS        K +H +KAL+  +D EQ  ++ K KGW  VG++ 
Subjt:  HQRAVIRPQAPTDINRSSSYRDALQKGQEDTPTCNQKQTADDIAPSQPEFATIFLSSS---VIIQRKHYHDNKALLACEDEEQMRILSKIKGWYKVGKYL

Query:  VRFHPWSVAAMLGEPKVPSYGGWKKIRNLSRDKWNIDNFRKIGDVCGGYIETANNT
        V+F  W+         VPSYGGW K+R +    WN+++F +IGDV GG++E A  T
Subjt:  VRFHPWSVAAMLGEPKVPSYGGWKKIRNLSRDKWNIDNFRKIGDVCGGYIETANNT

A0A5D3CFS8 DUF4283 domain-containing protein6.1e-3128.66Show/hide
Query:  SFRVERKTFSISNDHRNPNL-FRLTETCKDRSVTLSFAKSLLPWLQTCFDRLCSIPLTQKFFNETRTEETVLWVEKISSKRDHSAEIARLGPNGGINKLI
        S  +E+K F +S D+++ +L   +TE    +S +++     L WL++ F  L   P T +FF E R E+  LWV+K  +++ + AEI R+   G    ++
Subjt:  SFRVERKTFSISNDHRNPNL-FRLTETCKDRSVTLSFAKSLLPWLQTCFDRLCSIPLTQKFFNETRTEETVLWVEKISSKRDHSAEIARLGPNGGINKLI

Query:  VPVGENRKGWKSLITYI----QSLTNQHQRAVIRPQAPT--------DINRSSSYRDALQKG-----QEDTPTCNQKQTADDIAPSQPEF-ATIFLSS--
        VP G  + GW   ++ +     S T  + R ++  +           D  +  SY +A+ KG     + ++ T N K+T         E+  T+ L+   
Subjt:  VPVGENRKGWKSLITYI----QSLTNQHQRAVIRPQAPT--------DINRSSSYRDALQKG-----QEDTPTCNQKQTADDIAPSQPEF-ATIFLSS--

Query:  ----------------SVIIQRKHYHDNKALLACEDEEQMRILSKIKGWYKVGKYLVRFHPWSVAAMLGEPKVPSYGGWKKIRNLSRDKWNIDNFRKIGD
                           ++ K +H +KAL+  ++EEQ  ++ K KGW  VG++ V+F  W+  A      +PSYGGW K+R +    WN+++F +IGD
Subjt:  ----------------SVIIQRKHYHDNKALLACEDEEQMRILSKIKGWYKVGKYLVRFHPWSVAAMLGEPKVPSYGGWKKIRNLSRDKWNIDNFRKIGD

Query:  VCGGYIETANNTERAPPIAKS
         CGG+IE A  T     I ++
Subjt:  VCGGYIETANNTERAPPIAKS

A0A5D3DLP0 DUF4283 domain-containing protein5.4e-2729.25Show/hide
Query:  SFRVERKTFSISNDHRNPNL-FRLTETCKDRSVTLSFAKSLLPWLQTCFDRLCSIPLTQKFFNETRTEETVLWVEKISSKRDHSAEIARLGPNGGINKLI
        S  +E+K F +S D+R+ +    +TE    +S +++     L WL++ F  L   P T +FF E R EE  LWV+K  +++ + AEI R+   G    ++
Subjt:  SFRVERKTFSISNDHRNPNL-FRLTETCKDRSVTLSFAKSLLPWLQTCFDRLCSIPLTQKFFNETRTEETVLWVEKISSKRDHSAEIARLGPNGGINKLI

Query:  VPVGENRKGWKSLITYIQSLTNQHQRAVIRPQAPTDINRSSSYRDALQKGQEDTPTCNQKQTADDIAPSQPEFATIFLSSSVIIQRKHYHDNKALLACED
        VP G  +   +++I    S      R V       +I ++           E T    ++   DD    +     +       ++ K +H +KAL+  ++
Subjt:  VPVGENRKGWKSLITYIQSLTNQHQRAVIRPQAPTDINRSSSYRDALQKGQEDTPTCNQKQTADDIAPSQPEFATIFLSSSVIIQRKHYHDNKALLACED

Query:  EEQMRILSKIKGWYKVGKYLVRFHPWSVAAMLGEPKVPSYGGWKKIRNLSRDKWNIDNFRKIGDVCGGYIETANNTERAPPIAKSPKKSQSQKS
        EEQ ++L K KGW  VG++ V+F  WS         +PSYGGW K+R +    WN+++F +IGD CGG++E A  T     I ++  K +   S
Subjt:  EEQMRILSKIKGWYKVGKYLVRFHPWSVAAMLGEPKVPSYGGWKKIRNLSRDKWNIDNFRKIGDVCGGYIETANNTERAPPIAKSPKKSQSQKS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G07990.1 SIT4 phosphatase-associated family protein8.0e-0742.17Show/hide
Query:  DVVQQDIQISVLEM---GCSASACSEVHSNAPGLLCAITRFAPLGLSAKISSTSFVGSLVHHVLEDSRPTSVLINSLSVFRSL
        DV+Q     ++LEM       S+  EV +NA   LCAI+R AP  L+ ++SS  +V  +  H LEDS   S L++SLSV  SL
Subjt:  DVVQQDIQISVLEM---GCSASACSEVHSNAPGLLCAITRFAPLGLSAKISSTSFVGSLVHHVLEDSRPTSVLINSLSVFRSL

AT1G30470.1 SIT4 phosphatase-associated family protein3.7e-1257.14Show/hide
Query:  ASACSEVHSNAPGLLCAITRFAPLGLSAKISSTSFVGSLVHHVLEDSRPTSVLINSLSVFRSL
        +S   EVH+NA  +LC + R+AP GL+ K+SS S  G L+ H LEDSRP SVL+NSLSV  SL
Subjt:  ASACSEVHSNAPGLLCAITRFAPLGLSAKISSTSFVGSLVHHVLEDSRPTSVLINSLSVFRSL

AT1G30470.2 SIT4 phosphatase-associated family protein3.7e-1257.14Show/hide
Query:  ASACSEVHSNAPGLLCAITRFAPLGLSAKISSTSFVGSLVHHVLEDSRPTSVLINSLSVFRSL
        +S   EVH+NA  +LC + R+AP GL+ K+SS S  G L+ H LEDSRP SVL+NSLSV  SL
Subjt:  ASACSEVHSNAPGLLCAITRFAPLGLSAKISSTSFVGSLVHHVLEDSRPTSVLINSLSVFRSL

AT1G30470.3 SIT4 phosphatase-associated family protein3.7e-1257.14Show/hide
Query:  ASACSEVHSNAPGLLCAITRFAPLGLSAKISSTSFVGSLVHHVLEDSRPTSVLINSLSVFRSL
        +S   EVH+NA  +LC + R+AP GL+ K+SS S  G L+ H LEDSRP SVL+NSLSV  SL
Subjt:  ASACSEVHSNAPGLLCAITRFAPLGLSAKISSTSFVGSLVHHVLEDSRPTSVLINSLSVFRSL

AT2G28360.1 SIT4 phosphatase-associated family protein8.0e-0742.17Show/hide
Query:  DVVQQDIQISVLEM---GCSASACSEVHSNAPGLLCAITRFAPLGLSAKISSTSFVGSLVHHVLEDSRPTSVLINSLSVFRSL
        DV++      +LEM     + S+  EV +NA   LCAITR AP  L+ K+SS  FV  +  H +EDS   S L++SL+V  SL
Subjt:  DVVQQDIQISVLEM---GCSASACSEVHSNAPGLLCAITRFAPLGLSAKISSTSFVGSLVHHVLEDSRPTSVLINSLSVFRSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGTATTTGTTACAACCCTCATAAAGTCCAGCTGTCTTAGGTTGTTGGAATCTCCTACTAAGAATGTTAATGTCGTTAATTGCATATTAAAGGATGTTGTTCAGCA
GGACATTCAAATATCAGTGCTGGAAATGGGTTGTTCAGCTTCAGCTTGTTCAGAAGTTCATTCTAATGCACCAGGATTACTTTGTGCTATTACTCGATTTGCTCCTCTTG
GTCTTTCGGCTAAAATTTCCAGCACAAGCTTTGTAGGAAGTTTGGTTCACCACGTCTTAGAAGATTCTCGTCCAACGTCTGTTTTAATAAACTCGTTATCAGTGTTTAGA
TCCTTGGAGGCTTTCTACTGGATGATCTTTTGTATACTCCGGCCAAATTGCCCTTGGATATTCTGTTTCAGCAACTATGGAGGCAGTGGATGGAAGGTTGAACTTGTATT
CACAATCAAAAGTAGTACAAGCCAACACCGATTTGAGATGGTGAATCTCCCTATAATCCATTCTAGAGTTGATGTGAACCAAGCTACATCTATTGTCATGCCTCAAGGTC
CAATTACAAGGTCGAGAGCCAAGAAGCTACAACAAGCATTAATCACCTATATTCAAGCTATGAGTCATTCAAGTGGTGTTGATCATACCTCTTGTGGAGTGATTCGAATC
ACGAGTTTAGAACGAGTTTTCTTTACTCTTGATCTTGATCGTCAAGCGACCATGGCCAACACTACAAACGACTCTTTCAGGGTAGAAAGGAAAACGTTCTCCATCTCCAA
TGACCACAGAAACCCAAATCTCTTCCGCCTCACCGAAACCTGCAAGGACCGAAGCGTTACCTTATCTTTTGCAAAGTCTCTTCTTCCATGGCTACAGACGTGCTTCGACA
GACTCTGCTCCATCCCCTTAACTCAGAAATTCTTCAATGAAACGAGAACGGAAGAAACAGTATTATGGGTGGAAAAAATATCCAGTAAGAGGGATCACTCTGCAGAAATA
GCTAGACTTGGGCCCAATGGAGGCATTAATAAACTCATTGTTCCGGTTGGTGAGAATAGAAAGGGATGGAAAAGTCTCATTACTTACATTCAATCTCTCACCAACCAACA
CCAGCGTGCAGTTATTAGACCCCAAGCCCCGACTGATATAAATAGGAGCTCCTCCTACAGAGATGCTTTACAGAAAGGACAAGAAGATACCCCTACGTGCAACCAAAAAC
AAACCGCCGATGATATAGCCCCATCCCAGCCAGAGTTCGCAACCATATTCTTATCCTCGTCGGTCATTATTCAAAGGAAACACTACCATGACAACAAAGCTCTCCTAGCT
TGCGAAGATGAAGAACAAATGCGAATTCTATCCAAGATCAAAGGATGGTATAAAGTGGGGAAATACTTGGTTCGATTTCATCCTTGGAGCGTCGCAGCTATGCTTGGAGA
ACCTAAGGTTCCCTCTTATGGAGGATGGAAAAAGATCAGAAATCTTTCGCGGGATAAATGGAACATTGATAATTTCAGGAAAATAGGTGACGTTTGTGGAGGATATATAG
AGACCGCCAATAACACAGAGAGGGCCCCACCGATAGCCAAGTCTCCCAAGAAATCCCAATCCCAAAAATCCCACAACATCGTCAAAGTGGGAACCAAAGAAGGTTACTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATGTATTTGTTACAACCCTCATAAAGTCCAGCTGTCTTAGGTTGTTGGAATCTCCTACTAAGAATGTTAATGTCGTTAATTGCATATTAAAGGATGTTGTTCAGCA
GGACATTCAAATATCAGTGCTGGAAATGGGTTGTTCAGCTTCAGCTTGTTCAGAAGTTCATTCTAATGCACCAGGATTACTTTGTGCTATTACTCGATTTGCTCCTCTTG
GTCTTTCGGCTAAAATTTCCAGCACAAGCTTTGTAGGAAGTTTGGTTCACCACGTCTTAGAAGATTCTCGTCCAACGTCTGTTTTAATAAACTCGTTATCAGTGTTTAGA
TCCTTGGAGGCTTTCTACTGGATGATCTTTTGTATACTCCGGCCAAATTGCCCTTGGATATTCTGTTTCAGCAACTATGGAGGCAGTGGATGGAAGGTTGAACTTGTATT
CACAATCAAAAGTAGTACAAGCCAACACCGATTTGAGATGGTGAATCTCCCTATAATCCATTCTAGAGTTGATGTGAACCAAGCTACATCTATTGTCATGCCTCAAGGTC
CAATTACAAGGTCGAGAGCCAAGAAGCTACAACAAGCATTAATCACCTATATTCAAGCTATGAGTCATTCAAGTGGTGTTGATCATACCTCTTGTGGAGTGATTCGAATC
ACGAGTTTAGAACGAGTTTTCTTTACTCTTGATCTTGATCGTCAAGCGACCATGGCCAACACTACAAACGACTCTTTCAGGGTAGAAAGGAAAACGTTCTCCATCTCCAA
TGACCACAGAAACCCAAATCTCTTCCGCCTCACCGAAACCTGCAAGGACCGAAGCGTTACCTTATCTTTTGCAAAGTCTCTTCTTCCATGGCTACAGACGTGCTTCGACA
GACTCTGCTCCATCCCCTTAACTCAGAAATTCTTCAATGAAACGAGAACGGAAGAAACAGTATTATGGGTGGAAAAAATATCCAGTAAGAGGGATCACTCTGCAGAAATA
GCTAGACTTGGGCCCAATGGAGGCATTAATAAACTCATTGTTCCGGTTGGTGAGAATAGAAAGGGATGGAAAAGTCTCATTACTTACATTCAATCTCTCACCAACCAACA
CCAGCGTGCAGTTATTAGACCCCAAGCCCCGACTGATATAAATAGGAGCTCCTCCTACAGAGATGCTTTACAGAAAGGACAAGAAGATACCCCTACGTGCAACCAAAAAC
AAACCGCCGATGATATAGCCCCATCCCAGCCAGAGTTCGCAACCATATTCTTATCCTCGTCGGTCATTATTCAAAGGAAACACTACCATGACAACAAAGCTCTCCTAGCT
TGCGAAGATGAAGAACAAATGCGAATTCTATCCAAGATCAAAGGATGGTATAAAGTGGGGAAATACTTGGTTCGATTTCATCCTTGGAGCGTCGCAGCTATGCTTGGAGA
ACCTAAGGTTCCCTCTTATGGAGGATGGAAAAAGATCAGAAATCTTTCGCGGGATAAATGGAACATTGATAATTTCAGGAAAATAGGTGACGTTTGTGGAGGATATATAG
AGACCGCCAATAACACAGAGAGGGCCCCACCGATAGCCAAGTCTCCCAAGAAATCCCAATCCCAAAAATCCCACAACATCGTCAAAGTGGGAACCAAAGAAGGTTACTAA
Protein sequenceShow/hide protein sequence
MNVFVTTLIKSSCLRLLESPTKNVNVVNCILKDVVQQDIQISVLEMGCSASACSEVHSNAPGLLCAITRFAPLGLSAKISSTSFVGSLVHHVLEDSRPTSVLINSLSVFR
SLEAFYWMIFCILRPNCPWIFCFSNYGGSGWKVELVFTIKSSTSQHRFEMVNLPIIHSRVDVNQATSIVMPQGPITRSRAKKLQQALITYIQAMSHSSGVDHTSCGVIRI
TSLERVFFTLDLDRQATMANTTNDSFRVERKTFSISNDHRNPNLFRLTETCKDRSVTLSFAKSLLPWLQTCFDRLCSIPLTQKFFNETRTEETVLWVEKISSKRDHSAEI
ARLGPNGGINKLIVPVGENRKGWKSLITYIQSLTNQHQRAVIRPQAPTDINRSSSYRDALQKGQEDTPTCNQKQTADDIAPSQPEFATIFLSSSVIIQRKHYHDNKALLA
CEDEEQMRILSKIKGWYKVGKYLVRFHPWSVAAMLGEPKVPSYGGWKKIRNLSRDKWNIDNFRKIGDVCGGYIETANNTERAPPIAKSPKKSQSQKSHNIVKVGTKEGY