; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G028150 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G028150
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionKH domain-containing protein
Genome locationchr02:34319366..34323578
RNA-Seq ExpressionLsi02G028150
SyntenyLsi02G028150
Gene Ontology termsGO:0003723 - RNA binding (molecular function)
InterPro domainsIPR004087 - K Homology domain
IPR004088 - K Homology domain, type 1
IPR009210 - Activating signal cointegrator 1 complex subunit 1
IPR036612 - K Homology domain, type 1 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059318.1 activating signal cointegrator 1 complex subunit 1 [Cucumis melo var. makuwa]4.0e-9478.52Show/hide
Query:  NGAIEMSGRGSGANGNNGKRQ----------------------RVNRFLKYTSPYALHQPGQYAYNGWSLNANMTGKKEFQSAADQKKKRKTISQAWRPV
        N AIE+SGRGSGANGNNGKRQ                       V+RFLKYT+PY LHQ G YAYNGW LNANMTGK EF+SAADQKKKRKTISQAWRPV
Subjt:  NGAIEMSGRGSGANGNNGKRQ----------------------RVNRFLKYTSPYALHQPGQYAYNGWSLNANMTGKKEFQSAADQKKKRKTISQAWRPV

Query:  CTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVVEVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKG
        CTH   SEDLS+KDDRVESEDGSQVQEMDCRMH+ST SAQ  VEV EEINVVTELSV      N+GGDTNLEGQSVPSGEKFSVKL+VGSSLIRFVRGKG
Subjt:  CTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVVEVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKG

Query:  GSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKIQSIIDEAS-CPSL
        GSTQE+IEEEMGVKIMIPSSK+EEFVVIEGNSVDSVTKASEKIQSIIDEA+  PSL
Subjt:  GSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKIQSIIDEAS-CPSL

KAG6575524.1 Activating signal cointegrator 1 complex subunit 1, partial [Cucurbita argyrosperma subsp. sororia]3.0e-8985.92Show/hide
Query:  RVNRFLKYTSPYALHQPGQYAYNGWSLNANMTGKKEFQSAADQKKKRKTISQAWRPVCTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVV
        RV RFLKYTSP+   QPG Y YNGWSLNANM GKKEF+  ADQKKKRKTI+QAWRPVCT  SPSEDL +K+DRVESEDGS+VQE    +HTSTVSAQ+VV
Subjt:  RVNRFLKYTSPYALHQPGQYAYNGWSLNANMTGKKEFQSAADQKKKRKTISQAWRPVCTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVV

Query:  EVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKI
        EVAEEINVVT+L VGSSAFPN GGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKI
Subjt:  EVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKI

Query:  QSIIDEA-SCPSL
        QSIIDEA   PSL
Subjt:  QSIIDEA-SCPSL

XP_022954063.1 uncharacterized protein LOC111456441 isoform X1 [Cucurbita moschata]6.0e-9086.38Show/hide
Query:  RVNRFLKYTSPYALHQPGQYAYNGWSLNANMTGKKEFQSAADQKKKRKTISQAWRPVCTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVV
        RV RFLKYTSP+   QPG Y YNGWSLNANM GKKEF+  ADQKKKRKTI+QAWRPVCT  SPSEDL +K+DRVESEDGS+VQE    +HTSTVSAQ+VV
Subjt:  RVNRFLKYTSPYALHQPGQYAYNGWSLNANMTGKKEFQSAADQKKKRKTISQAWRPVCTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVV

Query:  EVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKI
        EVAEEINVVT+LSVGSSAFPN GGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKI
Subjt:  EVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKI

Query:  QSIIDEA-SCPSL
        QSIIDEA   PSL
Subjt:  QSIIDEA-SCPSL

XP_038899990.1 uncharacterized protein LOC120087160 isoform X1 [Benincasa hispida]5.6e-9689.67Show/hide
Query:  RVNRFLKYTSPYALHQPGQYAYNGWSLNANMTGKKEFQSAADQKKKRKTISQAWRPVCTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVV
        RV+RFLKYTSPY LHQPG YAYNGWSLNANMT KKEF+SAADQKKKRKTISQAW+PVCT  SPSEDLS+KDDRVE EDGS+VQEMDCRMHTST SA++VV
Subjt:  RVNRFLKYTSPYALHQPGQYAYNGWSLNANMTGKKEFQSAADQKKKRKTISQAWRPVCTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVV

Query:  EVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKI
        EVAEEINVVTELSV SSAFPNM GD NLEGQSVPS EKFSVKLDVGSSLIRFVRGKGGSTQE+IEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKI
Subjt:  EVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKI

Query:  QSIIDEA-SCPSL
        QSIIDEA   PSL
Subjt:  QSIIDEA-SCPSL

XP_038899991.1 uncharacterized protein LOC120087160 isoform X2 [Benincasa hispida]5.6e-9689.67Show/hide
Query:  RVNRFLKYTSPYALHQPGQYAYNGWSLNANMTGKKEFQSAADQKKKRKTISQAWRPVCTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVV
        RV+RFLKYTSPY LHQPG YAYNGWSLNANMT KKEF+SAADQKKKRKTISQAW+PVCT  SPSEDLS+KDDRVE EDGS+VQEMDCRMHTST SA++VV
Subjt:  RVNRFLKYTSPYALHQPGQYAYNGWSLNANMTGKKEFQSAADQKKKRKTISQAWRPVCTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVV

Query:  EVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKI
        EVAEEINVVTELSV SSAFPNM GD NLEGQSVPS EKFSVKLDVGSSLIRFVRGKGGSTQE+IEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKI
Subjt:  EVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKI

Query:  QSIIDEA-SCPSL
        QSIIDEA   PSL
Subjt:  QSIIDEA-SCPSL

TrEMBL top hitse value%identityAlignment
A0A0A0K963 KH domain-containing protein5.1e-8785.92Show/hide
Query:  RVNRFLKYTSPYALHQPGQYAYNGWSLNANMTGKKEFQSAADQKKKRKTISQAWRPVCTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVV
        RV+RFLKYT+P+ LHQ G YAYNGW LNANMTGK EF+SAADQKKKRKTISQAWRPVCTH  PSEDLS++DDRVESEDGSQVQEMD RMHTST SAQ  V
Subjt:  RVNRFLKYTSPYALHQPGQYAYNGWSLNANMTGKKEFQSAADQKKKRKTISQAWRPVCTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVV

Query:  EVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKI
        EVAEEINVVTELSV      NMGGDTNLEGQSV SGEKFSVKLDVGSSLIRFVRGKGGSTQE+IE+EMGVKIMIPSSK+EEFVVIEGNSVDSVTKASEKI
Subjt:  EVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKI

Query:  QSIIDEA-SCPSL
        QSIIDEA   PSL
Subjt:  QSIIDEA-SCPSL

A0A1S3CGC2 activating signal cointegrator 1 complex subunit 14.2e-8985.92Show/hide
Query:  RVNRFLKYTSPYALHQPGQYAYNGWSLNANMTGKKEFQSAADQKKKRKTISQAWRPVCTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVV
        RV+RFLKYT+PY LHQ G YAYNGW LNANMTGK EF+SAADQKKKRKTISQAWRPVCTH   SEDLS+KDDRVESEDGSQVQEMDCRMH+ST SAQ  V
Subjt:  RVNRFLKYTSPYALHQPGQYAYNGWSLNANMTGKKEFQSAADQKKKRKTISQAWRPVCTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVV

Query:  EVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKI
        EV EEINVVTELSV      N+GGDTNLEGQSVPSGEKFSVKL+VGSSLIRFVRGKGGSTQE+IEEEMGVKIMIPSSK+EEFVVIEGNSVDSVTKASEKI
Subjt:  EVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKI

Query:  QSIIDEAS-CPSL
        QSIIDEA+  PSL
Subjt:  QSIIDEAS-CPSL

A0A5D3C0W5 Activating signal cointegrator 1 complex subunit 11.9e-9478.52Show/hide
Query:  NGAIEMSGRGSGANGNNGKRQ----------------------RVNRFLKYTSPYALHQPGQYAYNGWSLNANMTGKKEFQSAADQKKKRKTISQAWRPV
        N AIE+SGRGSGANGNNGKRQ                       V+RFLKYT+PY LHQ G YAYNGW LNANMTGK EF+SAADQKKKRKTISQAWRPV
Subjt:  NGAIEMSGRGSGANGNNGKRQ----------------------RVNRFLKYTSPYALHQPGQYAYNGWSLNANMTGKKEFQSAADQKKKRKTISQAWRPV

Query:  CTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVVEVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKG
        CTH   SEDLS+KDDRVESEDGSQVQEMDCRMH+ST SAQ  VEV EEINVVTELSV      N+GGDTNLEGQSVPSGEKFSVKL+VGSSLIRFVRGKG
Subjt:  CTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVVEVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKG

Query:  GSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKIQSIIDEAS-CPSL
        GSTQE+IEEEMGVKIMIPSSK+EEFVVIEGNSVDSVTKASEKIQSIIDEA+  PSL
Subjt:  GSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKIQSIIDEAS-CPSL

A0A6J1D7P1 uncharacterized protein LOC111017681 isoform X11.1e-8481.69Show/hide
Query:  RVNRFLKYTSPYALHQPGQYAYNGWSLNANMTGKKEFQSAADQKKKRKTISQAWRPVCTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVV
        RV+R LKYTS Y L QPG YAYNG SL  NMTG+KEF+SAADQKKKRKTISQAWRPVCTH SPSEDLS+KD RVES+DG+Q+Q+MDC    S+VSAQ V 
Subjt:  RVNRFLKYTSPYALHQPGQYAYNGWSLNANMTGKKEFQSAADQKKKRKTISQAWRPVCTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVV

Query:  EVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKI
        EVAEE  VVT+LSVGSSAFPN  GDTN+EGQSV S EKFSVK+DVGSSLIRFVRGK GSTQEKIEEEMG+KIMIPSSKKEEFVVIEGNSVDSVTKASEKI
Subjt:  EVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKI

Query:  QSIIDEA-SCPSL
        QSIIDEA   PSL
Subjt:  QSIIDEA-SCPSL

A0A6J1GQ24 uncharacterized protein LOC111456441 isoform X12.9e-9086.38Show/hide
Query:  RVNRFLKYTSPYALHQPGQYAYNGWSLNANMTGKKEFQSAADQKKKRKTISQAWRPVCTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVV
        RV RFLKYTSP+   QPG Y YNGWSLNANM GKKEF+  ADQKKKRKTI+QAWRPVCT  SPSEDL +K+DRVESEDGS+VQE    +HTSTVSAQ+VV
Subjt:  RVNRFLKYTSPYALHQPGQYAYNGWSLNANMTGKKEFQSAADQKKKRKTISQAWRPVCTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVV

Query:  EVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKI
        EVAEEINVVT+LSVGSSAFPN GGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKI
Subjt:  EVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKI

Query:  QSIIDEA-SCPSL
        QSIIDEA   PSL
Subjt:  QSIIDEA-SCPSL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G16230.1 Predicted eukaryotic LigT1.1e-2040.7Show/hide
Query:  DQKKKRKTISQAWRPVCTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVVEVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSV
        D  KK+K ++  WRP+ T  S           V +E G++VQE              V + ++  +V  E+  G +A             SV S  K SV
Subjt:  DQKKKRKTISQAWRPVCTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVVEVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSV

Query:  KLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKIQSIIDE-ASCPSL
         L+VG+SLI+F+RGK G+TQ K+EEEMGVKI++PSS+ ++ + IEG SVD VTKAS++I +IIDE    PSL
Subjt:  KLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKIQSIIDE-ASCPSL

AT3G16230.2 Predicted eukaryotic LigT1.3e-2136.15Show/hide
Query:  RVNRFLKYTSPYALHQPGQYAYNGWSLNANMTGKKEFQSAADQKKKRKTISQAWRPVCTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVV
        R++R   +TS     +P  ++         ++ +   +SA D  KK+K ++  WRP+ T  S           V +E G++VQE              V 
Subjt:  RVNRFLKYTSPYALHQPGQYAYNGWSLNANMTGKKEFQSAADQKKKRKTISQAWRPVCTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVV

Query:  EVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKI
        + ++  +V  E+  G +A             SV S  K SV L+VG+SLI+F+RGK G+TQ K+EEEMGVKI++PSS+ ++ + IEG SVD VTKAS++I
Subjt:  EVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKI

Query:  QSIIDE-ASCPSL
         +IIDE    PSL
Subjt:  QSIIDE-ASCPSL

AT3G16230.3 Predicted eukaryotic LigT1.1e-2040.7Show/hide
Query:  DQKKKRKTISQAWRPVCTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVVEVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSV
        D  KK+K ++  WRP+ T  S           V +E G++VQE              V + ++  +V  E+  G +A             SV S  K SV
Subjt:  DQKKKRKTISQAWRPVCTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVVEVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSV

Query:  KLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKIQSIIDE-ASCPSL
         L+VG+SLI+F+RGK G+TQ K+EEEMGVKI++PSS+ ++ + IEG SVD VTKAS++I +IIDE    PSL
Subjt:  KLDVGSSLIRFVRGKGGSTQEKIEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKIQSIIDE-ASCPSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGTATAGTGCCATAGTCTATCAAGCGAGATCACAAATGCATACCTTTCAGCGATTGCATGCACCAAATCAGAGATGGAACGCAGCGGCGTCGGCGAGTGGGAGGAA
TGGAGCCATCGAGATGAGCGGGAGAGGGAGTGGAGCTAACGGCAACAACGGTAAGCGGCAGCGAGTCAACCGCTTCCTGAAATATACAAGTCCGTATGCCCTACACCAGC
CGGGACAGTATGCTTATAATGGTTGGAGCTTGAATGCGAACATGACTGGTAAAAAGGAATTTCAGAGTGCTGCTGACCAAAAGAAAAAGCGCAAAACAATCAGCCAAGCA
TGGAGACCAGTGTGTACTCATGGCAGCCCTTCTGAGGATTTGTCGCTTAAGGATGATAGAGTTGAGTCAGAGGATGGAAGTCAAGTTCAAGAAATGGATTGCAGAATGCA
TACCAGTACTGTTAGTGCTCAACATGTCGTAGAAGTAGCTGAAGAAATTAATGTGGTGACTGAGCTAAGTGTTGGTTCAAGTGCATTTCCAAATATGGGTGGAGACACAA
ACTTGGAGGGTCAATCTGTGCCTTCTGGTGAGAAGTTTTCGGTGAAACTGGATGTAGGGAGCTCTCTAATTCGATTTGTCAGAGGGAAAGGTGGATCCACACAGGAAAAG
ATTGAAGAAGAGATGGGAGTCAAGATTATGATTCCATCATCTAAGAAGGAAGAATTTGTTGTTATTGAAGGTAATTCAGTTGATAGTGTAACGAAAGCTTCAGAAAAAAT
ACAATCAATAATTGACGAGGCAAGTTGTCCATCTTTATGTTTGTTTTAG
mRNA sequenceShow/hide mRNA sequence
GATCCCAATGAAGTATAGTGCCATAGTCTATCAAGCGAGATCACAAATGCATACCTTTCAGCGATTGCATGCACCAAATCAGAGATGGAACGCAGCGGCGTCGGCGAGTG
GGAGGAATGGAGCCATCGAGATGAGCGGGAGAGGGAGTGGAGCTAACGGCAACAACGGTAAGCGGCAGCGAGTCAACCGCTTCCTGAAATATACAAGTCCGTATGCCCTA
CACCAGCCGGGACAGTATGCTTATAATGGTTGGAGCTTGAATGCGAACATGACTGGTAAAAAGGAATTTCAGAGTGCTGCTGACCAAAAGAAAAAGCGCAAAACAATCAG
CCAAGCATGGAGACCAGTGTGTACTCATGGCAGCCCTTCTGAGGATTTGTCGCTTAAGGATGATAGAGTTGAGTCAGAGGATGGAAGTCAAGTTCAAGAAATGGATTGCA
GAATGCATACCAGTACTGTTAGTGCTCAACATGTCGTAGAAGTAGCTGAAGAAATTAATGTGGTGACTGAGCTAAGTGTTGGTTCAAGTGCATTTCCAAATATGGGTGGA
GACACAAACTTGGAGGGTCAATCTGTGCCTTCTGGTGAGAAGTTTTCGGTGAAACTGGATGTAGGGAGCTCTCTAATTCGATTTGTCAGAGGGAAAGGTGGATCCACACA
GGAAAAGATTGAAGAAGAGATGGGAGTCAAGATTATGATTCCATCATCTAAGAAGGAAGAATTTGTTGTTATTGAAGGTAATTCAGTTGATAGTGTAACGAAAGCTTCAG
AAAAAATACAATCAATAATTGACGAGGCAAGTTGTCCATCTTTATGTTTGTTTTAGAAAATATATTATGCCCATTTCTTCCAATTTTTGTACTTCATTTTTTTTACCTTT
TTTGTATGAGTTGATGTTAAGGGTGTTTTTTTAAAACAATAAATAAAACTTTTCATTAAAGAAATGAAAAGAGACTAATGCTAACAAGAAAATACAAACTTTGAAAAGGA
GGGAAAGAAAAAACAAAATAAAATAATAAAGAAACTATAAACGCAATCCAAAATCCAAATCAAACAATTGTCTTGTAGAGGGAAACCAATAACAAGATTGGATAAAGA
Protein sequenceShow/hide protein sequence
MKYSAIVYQARSQMHTFQRLHAPNQRWNAAASASGRNGAIEMSGRGSGANGNNGKRQRVNRFLKYTSPYALHQPGQYAYNGWSLNANMTGKKEFQSAADQKKKRKTISQA
WRPVCTHGSPSEDLSLKDDRVESEDGSQVQEMDCRMHTSTVSAQHVVEVAEEINVVTELSVGSSAFPNMGGDTNLEGQSVPSGEKFSVKLDVGSSLIRFVRGKGGSTQEK
IEEEMGVKIMIPSSKKEEFVVIEGNSVDSVTKASEKIQSIIDEASCPSLCLF