; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0037068 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0037068
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionSodium-coupled neutral amino acid transporter 5-like
Genome locationchr2:3138522..3147314
RNA-Seq ExpressionLag0037068
SyntenyLag0037068
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591445.1 hypothetical protein SDJN03_13791, partial [Cucurbita argyrosperma subsp. sororia]7.0e-10593.09Show/hide
Query:  MVEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCL
        MVEFS SDP LW EALSSYPSQIEALGKPNLVSLD+FYRNELP LLH RNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDES+VK ASQKAFQCL
Subjt:  MVEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCL

Query:  PDVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLMFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS
        PD+SKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYL FA KLQDKAKELSSEGQIFTPSDVERALWSTAIGEK+KGSQS
Subjt:  PDVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLMFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS

Query:  KLDPKNEGKTGNKRKRK
        +LDPKN GKTG+KRKRK
Subjt:  KLDPKNEGKTGNKRKRK

XP_004141302.1 uncharacterized protein LOC101204707 [Cucumis sativus]3.3e-10289.86Show/hide
Query:  MVEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCL
        M+EFS SDP+LWREALS+Y SQIEALGKPNLVSLD+FYRNELPL+LH RNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDES+VK ASQKAFQCL
Subjt:  MVEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCL

Query:  PDVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLMFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS
        PD+SKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYL+FANKL+ KAKELS EG+IFTPSDVERALWS AIGEKLKGS+S
Subjt:  PDVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLMFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS

Query:  KLDPKNEGKTGNKRKRK
        +LDP N GKTG KRKRK
Subjt:  KLDPKNEGKTGNKRKRK

XP_022936826.1 uncharacterized protein LOC111443296 [Cucurbita moschata]4.3e-10291.67Show/hide
Query:  VEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCLP
        +EFS SDP LW EALSSYPS+IEALGKPNLVSLDEFYRNELP LLH RNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDES+VK ASQKAFQCLP
Subjt:  VEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCLP

Query:  DVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLMFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQSK
        D+SKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYL FA KLQDKAKELSSEGQIFTPSDVERALWSTAIGEK+KGSQS+
Subjt:  DVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLMFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQSK

Query:  LDPKNEGKTGNKRKRK
        L  KN GKTG+KRKRK
Subjt:  LDPKNEGKTGNKRKRK

XP_022977105.1 uncharacterized protein LOC111477273 [Cucurbita maxima]3.9e-10391.71Show/hide
Query:  MVEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCL
        MVEF+ SDP LW EALSSYPSQIEALGKPNLVSLD+FYRNELP LLH RNPSPYITTSELSKLMQWKLTRGKWRPRL DFVSSLDES+VK ASQKAFQCL
Subjt:  MVEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCL

Query:  PDVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLMFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS
        PD+SKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYL FA KLQDKAKELSSEGQIFTPSDVERALWSTAIGEK+KGSQS
Subjt:  PDVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLMFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS

Query:  KLDPKNEGKTGNKRKRK
        +LDPKN GK G KRKRK
Subjt:  KLDPKNEGKTGNKRKRK

XP_023536029.1 uncharacterized protein LOC111797290 [Cucurbita pepo subsp. pepo]1.7e-10392.17Show/hide
Query:  MVEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCL
        MVEFS S+P LW EAL SYPSQIEALGKPNLVSLD+FYRNELP LLH RNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDES+VK ASQKAFQCL
Subjt:  MVEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCL

Query:  PDVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLMFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS
        PD+SKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYL FA KLQDKAKELSSEGQIFTPSDVERALWSTAIGEK+KGSQS
Subjt:  PDVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLMFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS

Query:  KLDPKNEGKTGNKRKRK
        +LDPKN GKTG KRKRK
Subjt:  KLDPKNEGKTGNKRKRK

TrEMBL top hitse value%identityAlignment
A0A0A0L3W6 Uncharacterized protein1.6e-10289.86Show/hide
Query:  MVEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCL
        M+EFS SDP+LWREALS+Y SQIEALGKPNLVSLD+FYRNELPL+LH RNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDES+VK ASQKAFQCL
Subjt:  MVEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCL

Query:  PDVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLMFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS
        PD+SKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYL+FANKL+ KAKELS EG+IFTPSDVERALWS AIGEKLKGS+S
Subjt:  PDVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLMFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS

Query:  KLDPKNEGKTGNKRKRK
        +LDP N GKTG KRKRK
Subjt:  KLDPKNEGKTGNKRKRK

A0A1S3BUF7 uncharacterized protein LOC103493629 isoform X11.3e-10189.35Show/hide
Query:  VEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCLP
        ++FS SDP+LWREALSSY SQIEALGKPNLVSLD+FYRNELPL+LH R PSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDES+VK ASQKAFQCLP
Subjt:  VEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCLP

Query:  DVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLMFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQSK
        D+SKAVSELTPLKGVGPATASA+LAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYL+FANKLQ+KAKELS EG+ FTPSDVERALWS AIGEKLKGSQS+
Subjt:  DVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLMFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQSK

Query:  LDPKNEGKTGNKRKRK
        LDP N GKTG KRKRK
Subjt:  LDPKNEGKTGNKRKRK

A0A6J1CCK9 uncharacterized protein LOC1110102562.6e-9788.02Show/hide
Query:  MVEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCL
        MVEF  SDP LW EALSSYPSQIEALGKPNLVSLD FYRNELP LLH RNP+PYITT ELSKLMQWKLTRGKWRPRLLDFVSSLDES+VK ASQKAFQCL
Subjt:  MVEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCL

Query:  PDVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLMFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS
        PD+SKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYL FAN LQ+KAKELSS+GQIFTPSDVERALWS+A GEKLK SQS
Subjt:  PDVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLMFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS

Query:  KLDPKNEGKTGNKRKRK
        +L+PKN   +G KRKRK
Subjt:  KLDPKNEGKTGNKRKRK

A0A6J1F8K0 uncharacterized protein LOC1114432962.1e-10291.67Show/hide
Query:  VEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCLP
        +EFS SDP LW EALSSYPS+IEALGKPNLVSLDEFYRNELP LLH RNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDES+VK ASQKAFQCLP
Subjt:  VEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCLP

Query:  DVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLMFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQSK
        D+SKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYL FA KLQDKAKELSSEGQIFTPSDVERALWSTAIGEK+KGSQS+
Subjt:  DVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLMFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQSK

Query:  LDPKNEGKTGNKRKRK
        L  KN GKTG+KRKRK
Subjt:  LDPKNEGKTGNKRKRK

A0A6J1ILC1 uncharacterized protein LOC1114772731.9e-10391.71Show/hide
Query:  MVEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCL
        MVEF+ SDP LW EALSSYPSQIEALGKPNLVSLD+FYRNELP LLH RNPSPYITTSELSKLMQWKLTRGKWRPRL DFVSSLDES+VK ASQKAFQCL
Subjt:  MVEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCL

Query:  PDVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLMFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS
        PD+SKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYL FA KLQDKAKELSSEGQIFTPSDVERALWSTAIGEK+KGSQS
Subjt:  PDVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLMFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS

Query:  KLDPKNEGKTGNKRKRK
        +LDPKN GK G KRKRK
Subjt:  KLDPKNEGKTGNKRKRK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G12210.1 DNA binding6.4e-5676.81Show/hide
Query:  VEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCLP
        +EF  SD  +W+EALSSY S+IE+L KP LVSLD+FYR +LP LLH+R+P+PY+TTSELS+LM+WKL+RGKWRPRLLDFVSSLD+SVVKSAS+KAF+ LP
Subjt:  VEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCLP

Query:  DVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDE
        D+SKAV ELT LKGVG ATASAVLAAYAPD+APFMSDE
Subjt:  DVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDE

AT3G12210.2 DNA binding3.5e-7870.23Show/hide
Query:  VEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCLP
        +EF  SD  +W+EALSSY S+IE+L KP LVSLD+FYR +LP LLH+R+P+PY+TTSELS+LM+WKL+RGKWRPRLLDFVSSLD+SVVKSAS+KAF+ LP
Subjt:  VEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSPYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCLP

Query:  DVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLMFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQSK
        D+SKAV ELT LKGVG ATASAVLAAYAPD+APFMSDEAME ALGNSKDYSLKQYL+FA KLQDKAKEL  +G+   PSD+ERALWS  +  K +  +S 
Subjt:  DVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLMFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQSK

Query:  LDPKNEGKTGNKRKR
                +G KRKR
Subjt:  LDPKNEGKTGNKRKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTCAAAACCTCTGCCCTCAACGACAGACGACATAGAAGCCGACCTAAAGACCGCTGCATTTCGCGCGAGAATATGCGCAATCTTGTTTCCTACTCGTGATGGACA
AGGAGCTGATGAGGACAATCGGACAGAGGTAGGACCAAAAGACCGACCCAGAGGAAGACTGGACCAATGGGTCGGGCCAAAATGGCCCGACCCATATGGTCGGCCTCGGC
AAAAGGTCGAGGCCGACCATTCGGCCCGTTTGCGCGGGCCGAGCCCGGTGACCTCTTTTCGGTCCCTGATGCCCCAAATCGCCCCGGTTTCGTCTGGTTCGGCCCGAAAC
ACCACCGAATGCCTAGAAACCCTAGGAGGGAAAACAGCTTCTTCTCGGTTTTCTGACTTAGGCATCGGAGGCGGTGTGGCCTACACCACGCCGGTGTGCAGCGGTTTTTG
CTGGTCTTGCAGGTCACGTCTTCCCCAGTTTCTACAAATTCACTGTTGGTGTCACGTGAAGGTCAGGAACAAGAGTGTTCATGGTGAAGTAATTCCATCTACACAAATCA
GAAGCAAATGGATTAGGAATTACCTTGAATTTTGTTTGCGATCAAACTCTAGTAATTCGCCCAGTATGTCGAACATCCATAGTGAGATCCTTTGCGCCTCAGTTGGAATT
GATCGATGGAGTCCACCGAATAGAGGCTTTTGGAAGCTCAACACTGATGTTGCATGTATTTTCAAATCCCTCAATGACAAACATTGGGATGATATCCGCGAAAAAACGCC
CATCTTCAAATATCAAAACGCCGGTGGAGACTGGAGCGGGAACGTCGCATCACAAGCTGCAATGGTGGAGTTCTCAACCTCCGACCCAAGTCTCTGGAGAGAAGCCCTCT
CTTCTTATCCATCTCAAATCGAAGCCCTAGGCAAGCCCAATCTGGTTTCTCTCGACGAATTTTACCGCAACGAACTCCCTCTTCTTCTTCACAACCGAAACCCTAGCCCT
TACATCACTACTTCCGAGCTCTCAAAACTCATGCAATGGAAGCTCACTCGCGGCAAATGGAGGCCGCGTCTCTTGGACTTTGTTTCGTCATTGGATGAATCGGTCGTCAA
ATCGGCCTCTCAGAAGGCCTTCCAGTGCCTTCCTGATGTTTCCAAAGCCGTATCTGAGCTTACGCCGCTCAAAGGCGTTGGTCCGGCCACTGCCTCGGCGGTTCTGGCTG
CTTACGCGCCGGATGTTGCGCCATTTATGTCCGACGAGGCTATGGAGGCGGCTCTAGGCAACTCCAAGGATTATTCGTTGAAGCAGTACTTAATGTTCGCCAATAAGTTG
CAAGATAAAGCCAAGGAATTAAGCTCAGAAGGACAAATTTTCACACCATCTGATGTAGAGAGGGCTTTGTGGAGTACAGCTATAGGGGAAAAATTGAAAGGTTCACAATC
AAAATTAGATCCCAAGAATGAAGGCAAAACTGGCAACAAAAGAAAGAGAAAAGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCTCAAAACCTCTGCCCTCAACGACAGACGACATAGAAGCCGACCTAAAGACCGCTGCATTTCGCGCGAGAATATGCGCAATCTTGTTTCCTACTCGTGATGGACA
AGGAGCTGATGAGGACAATCGGACAGAGGTAGGACCAAAAGACCGACCCAGAGGAAGACTGGACCAATGGGTCGGGCCAAAATGGCCCGACCCATATGGTCGGCCTCGGC
AAAAGGTCGAGGCCGACCATTCGGCCCGTTTGCGCGGGCCGAGCCCGGTGACCTCTTTTCGGTCCCTGATGCCCCAAATCGCCCCGGTTTCGTCTGGTTCGGCCCGAAAC
ACCACCGAATGCCTAGAAACCCTAGGAGGGAAAACAGCTTCTTCTCGGTTTTCTGACTTAGGCATCGGAGGCGGTGTGGCCTACACCACGCCGGTGTGCAGCGGTTTTTG
CTGGTCTTGCAGGTCACGTCTTCCCCAGTTTCTACAAATTCACTGTTGGTGTCACGTGAAGGTCAGGAACAAGAGTGTTCATGGTGAAGTAATTCCATCTACACAAATCA
GAAGCAAATGGATTAGGAATTACCTTGAATTTTGTTTGCGATCAAACTCTAGTAATTCGCCCAGTATGTCGAACATCCATAGTGAGATCCTTTGCGCCTCAGTTGGAATT
GATCGATGGAGTCCACCGAATAGAGGCTTTTGGAAGCTCAACACTGATGTTGCATGTATTTTCAAATCCCTCAATGACAAACATTGGGATGATATCCGCGAAAAAACGCC
CATCTTCAAATATCAAAACGCCGGTGGAGACTGGAGCGGGAACGTCGCATCACAAGCTGCAATGGTGGAGTTCTCAACCTCCGACCCAAGTCTCTGGAGAGAAGCCCTCT
CTTCTTATCCATCTCAAATCGAAGCCCTAGGCAAGCCCAATCTGGTTTCTCTCGACGAATTTTACCGCAACGAACTCCCTCTTCTTCTTCACAACCGAAACCCTAGCCCT
TACATCACTACTTCCGAGCTCTCAAAACTCATGCAATGGAAGCTCACTCGCGGCAAATGGAGGCCGCGTCTCTTGGACTTTGTTTCGTCATTGGATGAATCGGTCGTCAA
ATCGGCCTCTCAGAAGGCCTTCCAGTGCCTTCCTGATGTTTCCAAAGCCGTATCTGAGCTTACGCCGCTCAAAGGCGTTGGTCCGGCCACTGCCTCGGCGGTTCTGGCTG
CTTACGCGCCGGATGTTGCGCCATTTATGTCCGACGAGGCTATGGAGGCGGCTCTAGGCAACTCCAAGGATTATTCGTTGAAGCAGTACTTAATGTTCGCCAATAAGTTG
CAAGATAAAGCCAAGGAATTAAGCTCAGAAGGACAAATTTTCACACCATCTGATGTAGAGAGGGCTTTGTGGAGTACAGCTATAGGGGAAAAATTGAAAGGTTCACAATC
AAAATTAGATCCCAAGAATGAAGGCAAAACTGGCAACAAAAGAAAGAGAAAAGCTTGA
Protein sequenceShow/hide protein sequence
MSSKPLPSTTDDIEADLKTAAFRARICAILFPTRDGQGADEDNRTEVGPKDRPRGRLDQWVGPKWPDPYGRPRQKVEADHSARLRGPSPVTSFRSLMPQIAPVSSGSARN
TTECLETLGGKTASSRFSDLGIGGGVAYTTPVCSGFCWSCRSRLPQFLQIHCWCHVKVRNKSVHGEVIPSTQIRSKWIRNYLEFCLRSNSSNSPSMSNIHSEILCASVGI
DRWSPPNRGFWKLNTDVACIFKSLNDKHWDDIREKTPIFKYQNAGGDWSGNVASQAAMVEFSTSDPSLWREALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSP
YITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESVVKSASQKAFQCLPDVSKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLMFANKL
QDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQSKLDPKNEGKTGNKRKRKA