; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi05G013860 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi05G013860
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionDNA binding protein
Genome locationchr05:21753065..21754567
RNA-Seq ExpressionLsi05G013860
SyntenyLsi05G013860
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052154.1 DNA binding protein [Cucumis melo var. makuwa]1.1e-10288.65Show/hide
Query:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP
        MNR   NHLQ+NSISHCQECGISQSACWILHNVR KA+FRRLCTNCVLKHNLSRFCPLCFDVY+DSTPPPSH RVMC+RCPSISHLSCVSFRFSSTFLCP
Subjt:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP

Query:  LCSDPCFAFFDGFDSG-GLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAK
        LCSDP F FFDGFDSG  L QSESTVAFLAG+NVD KS KAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALE+LAYLVLQEKDKNGY+K
Subjt:  LCSDPCFAFFDGFDSG-GLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAK

Query:  TNGDAVAG-GTVEEEESKLQEKEVTAIFD
        +NGDAVA    VEEEE KL EKEVTAIFD
Subjt:  TNGDAVAG-GTVEEEESKLQEKEVTAIFD

KAE8650412.1 hypothetical protein Csa_010702 [Cucumis sativus]5.4e-10789.96Show/hide
Query:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP
        MNR H NHLQSNSISHCQ+CGISQSACWILHNVR KA+FRRLCTNCVLKHNLSRFCPLCFDVY+DSTPPPSH RVMCFRCPSISHLSCVSFRFSSTFLCP
Subjt:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP

Query:  LCSDPCFAFFDGFDSGG-LCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAK
        LCSDP F FFDGFDSGG LCQSESTVAFLAG+NVD KS KAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALE+LAYLVLQEKDKNGY+K
Subjt:  LCSDPCFAFFDGFDSGG-LCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAK

Query:  TNGDAV-AGGTVEEEESKLQEKEVTAIFD
        +NGDAV +   VEEEE KLQEK+VTAIFD
Subjt:  TNGDAV-AGGTVEEEESKLQEKEVTAIFD

KAG7018542.1 hypothetical protein SDJN02_20411, partial [Cucurbita argyrosperma subsp. argyrosperma]5.6e-10471.67Show/hide
Query:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP
        MNR H NHLQSNSIS+C ECGISQS CWILHNVR KASFRRLCTNCVLK+NLSRFCPLCFD+YDDSTPP SHQRVMCFRCPSISH+SC SFRFSSTFLCP
Subjt:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP

Query:  LCSDPCFAFFDGFDSGGLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAKT
        +CSDPCFAFFDGFDS GL QSE  VA LAGR  DGKSAKAIVAAARV+AQSMRRAA DARAVAEMKI+NA FAKKQATLALERLA+LVLQEKD+NGYAKT
Subjt:  LCSDPCFAFFDGFDSGGLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAKT

Query:  NGDAVAGGTVEEEESKLQEKEVTAIFD--PSSSFCSLF-------------VDAIVATFELLESPLAHCRMEPQWAFFIFKCQFWQSDGNIKILAAIAAA
        NG+A AG   EEEE++LQ + VTAI +   ++   SLF             V+AIVA F+L ESPL            IFK Q+ ++ G+IK  AAI +A
Subjt:  NGDAVAGGTVEEEESKLQEKEVTAIFD--PSSSFCSLF-------------VDAIVATFELLESPLAHCRMEPQWAFFIFKCQFWQSDGNIKILAAIAAA

XP_004147629.1 uncharacterized protein LOC101204799 [Cucumis sativus]9.5e-10489.69Show/hide
Query:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP
        MNR H NHLQSNSISHCQ+CGISQSACWILHNVR KA+FRRLCTNCVLKHNLSRFCPLCFDVY+DSTPPPSH RVMCFRCPSISHLSCVSFRFSSTFLCP
Subjt:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP

Query:  LCSDPCFAFFDGFDSGG-LCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAK
        LCSDP F FFDGFDSGG LCQSESTVAFLAG+NVD KS KAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALE+LAYLVLQEKDKNGY+K
Subjt:  LCSDPCFAFFDGFDSGG-LCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAK

Query:  TNGDAV-AGGTVEEEESKLQEKE
        +NGDAV +   VEEEE KLQEK+
Subjt:  TNGDAV-AGGTVEEEESKLQEKE

XP_038902666.1 uncharacterized protein LOC120089301 [Benincasa hispida]2.2e-11695.61Show/hide
Query:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP
        MNRSHCNHLQSNS+SHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSC SFRFSSTFLCP
Subjt:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP

Query:  LCSDPCFAFFDGFDSGGLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAKT
        LCSDPCFAFFDGFDSGGLCQSESTVAFLA RNVD KSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKD+NGYAKT
Subjt:  LCSDPCFAFFDGFDSGGLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAKT

Query:  NGDAVAG-GTVEEEESKLQEKEVTAIFD
        NGDAVAG  TVEEE+SKLQEKEVT+IF+
Subjt:  NGDAVAG-GTVEEEESKLQEKEVTAIFD

TrEMBL top hitse value%identityAlignment
A0A0A0L5G4 Uncharacterized protein2.6e-10789.96Show/hide
Query:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP
        MNR H NHLQSNSISHCQ+CGISQSACWILHNVR KA+FRRLCTNCVLKHNLSRFCPLCFDVY+DSTPPPSH RVMCFRCPSISHLSCVSFRFSSTFLCP
Subjt:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP

Query:  LCSDPCFAFFDGFDSGG-LCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAK
        LCSDP F FFDGFDSGG LCQSESTVAFLAG+NVD KS KAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALE+LAYLVLQEKDKNGY+K
Subjt:  LCSDPCFAFFDGFDSGG-LCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAK

Query:  TNGDAV-AGGTVEEEESKLQEKEVTAIFD
        +NGDAV +   VEEEE KLQEK+VTAIFD
Subjt:  TNGDAV-AGGTVEEEESKLQEKEVTAIFD

A0A1S4DSP5 uncharacterized protein LOC1034839242.4e-10086.46Show/hide
Query:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP
        MNR   NHLQ+NSISHCQECGISQSACWILHNVR KA+FRRLCTNCVLKHNLSRFCPLCFDVY+DSTPPPSH RVMC+RCPSISHLSCVSFRFSSTFLCP
Subjt:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP

Query:  LCSDPCFAFFDGFDSG-GLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAK
        LCSDP F FFDGFDSG  L QSESTVAFLAG+NVD KS KAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALE+LAYLVLQEKDKNGY+K
Subjt:  LCSDPCFAFFDGFDSG-GLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAK

Query:  TNGDAVAG-GTVEEEESKLQEKEVTAIFD
        +NGDAVA    VEEEE KL EKE   +F+
Subjt:  TNGDAVAG-GTVEEEESKLQEKEVTAIFD

A0A5A7U9X0 DNA binding protein5.1e-10388.65Show/hide
Query:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP
        MNR   NHLQ+NSISHCQECGISQSACWILHNVR KA+FRRLCTNCVLKHNLSRFCPLCFDVY+DSTPPPSH RVMC+RCPSISHLSCVSFRFSSTFLCP
Subjt:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP

Query:  LCSDPCFAFFDGFDSG-GLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAK
        LCSDP F FFDGFDSG  L QSESTVAFLAG+NVD KS KAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALE+LAYLVLQEKDKNGY+K
Subjt:  LCSDPCFAFFDGFDSG-GLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAK

Query:  TNGDAVAG-GTVEEEESKLQEKEVTAIFD
        +NGDAVA    VEEEE KL EKEVTAIFD
Subjt:  TNGDAVAG-GTVEEEESKLQEKEVTAIFD

A0A6J1IXP5 uncharacterized protein LOC1114794683.1e-10084.14Show/hide
Query:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP
        MNR HCNHLQSNSIS+C ECGISQS CWILHNVR KASFRRLCTNCVLK+NLSRFCPLCFD+YDDSTPP SHQRVMCFRCPSISH+SC SFRFSSTFLCP
Subjt:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP

Query:  LCSDPCFAFFDGFDSGGLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAKT
        LCSDPCFAFFDGFDSG L QSE  V+ LAGR  DGKSAKAIVAAARV AQSMRRAA DARAVAEMK +NA FAKKQATLALERLAYLVLQEKD+NGYAKT
Subjt:  LCSDPCFAFFDGFDSGGLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAKT

Query:  NGDAVAGGTVEEEESKLQEKEVTAIFD
        NG+A AG   EEEE++LQ + VTAI +
Subjt:  NGDAVAGGTVEEEESKLQEKEVTAIFD

E5GC39 DNA binding protein9.0e-10088.34Show/hide
Query:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP
        MNR   NHLQ+NSISHCQECGISQSACWILHNVR KA+FRRLCTNCVLKHNLSRFCPLCFDVY+DSTPPPSH RVMC+RCPSISHLSCVSFRFSSTFLCP
Subjt:  MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCP

Query:  LCSDPCFAFFDGFDSG-GLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAK
        LCSDP F FFDGFDSG  L QSESTVAFLAG+NVD KS KAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALE+LAYLVLQEKDKNGY+K
Subjt:  LCSDPCFAFFDGFDSG-GLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAK

Query:  TNGDAVAG-GTVEEEESKLQEKE
        +NGDAVA    VEEEE KL EKE
Subjt:  TNGDAVAG-GTVEEEESKLQEKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G09520.1 LOCATED IN: chloroplast7.2e-2536.07Show/hide
Query:  SNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCF--RCPSISHLSCVSFRFSSTFLCPLCSDP-CF
        +NS   C +CG S +  W++H VRL+AS R  CT+C+L+++ + FCP CF +YD S  PPS +RV C    C S++H+ C       ++LCP C DP  F
Subjt:  SNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCF--RCPSISHLSCVSFRFSSTFLCPLCSDP-CF

Query:  AFFDGF-DSGGLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAKTN-----
        +FF    D  G             R VD   ++A + AA+++A SM +A + A+   + + K AA AKK+A  ALE++  L  +EK ++   K       
Subjt:  AFFDGF-DSGGLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAKTN-----

Query:  -----GDAVAGGTVEEEES
               A  G TV+E ES
Subjt:  -----GDAVAGGTVEEEES

AT3G17460.1 PHD finger family protein2.3e-1533.15Show/hide
Query:  LHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFS--------------STFLCPLCSDPCFAFFDGFDS
        +H V    +FRRLCT+C+LK     FC +CF+++D++ PP +  R++C  CPS +HLSC +   S              S+F C  CS+P F FF     
Subjt:  LHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFS--------------STFLCPLCSDPCFAFFDGFDS

Query:  GGLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAK
              E+ +          KSA A+VAA  +S  +M +A    +  A  KI  A  AK +A  AL  L  +V+++    G  K
Subjt:  GGLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCGCTCACACTGCAACCATCTTCAATCCAATTCGATTTCTCACTGCCAAGAATGCGGCATCTCTCAGTCCGCTTGTTGGATCCTCCACAATGTCCGTCTCAAAGC
TTCCTTCCGTCGTCTCTGCACCAATTGCGTCCTCAAGCACAACCTCTCCCGTTTCTGTCCTCTTTGCTTCGACGTTTACGACGATTCGACTCCTCCGCCGTCTCATCAGC
GAGTTATGTGCTTCAGATGCCCTTCAATCTCGCATCTCTCTTGCGTTTCCTTTCGATTTTCGTCTACCTTTCTATGCCCTCTCTGCTCCGATCCTTGTTTTGCCTTCTTC
GATGGGTTTGACTCCGGTGGCCTTTGTCAATCGGAGTCCACTGTCGCCTTTTTGGCCGGGAGAAACGTTGATGGTAAATCAGCGAAAGCGATTGTCGCTGCGGCTCGTGT
CTCCGCTCAATCCATGCGGAGAGCAGCTCTTGACGCTAGGGCTGTAGCGGAGATGAAGATCAAGAACGCTGCCTTTGCTAAGAAACAAGCTACTCTCGCATTGGAACGGC
TTGCTTATCTTGTGCTTCAGGAGAAGGACAAAAATGGATATGCTAAAACTAATGGAGATGCTGTTGCTGGTGGGACGGTTGAAGAAGAAGAATCCAAGCTACAGGAGAAA
GAGGTAACAGCCATTTTCGATCCTTCTTCTTCATTTTGCTCTCTCTTCGTGGACGCCATTGTTGCCACCTTTGAGCTACTAGAATCGCCATTGGCCCATTGCAGAATGGA
GCCACAGTGGGCATTTTTTATATTCAAATGCCAGTTCTGGCAGAGCGATGGGAATATCAAGATCCTGGCGGCGATCGCGGCGGCGGTAGATTGTGCTGGTTTTTGTCTCG
TAAAATTGGCTATTTCCCCTTCTTCGTTGTCTCTACAAAACTGCCAGTTTCAATTTCTCGAGCATTCAACCAATAAAGTTTCGTTCTGCTGTGGAGCTCTGAGTTGTTCA
TATGATTTGACCGAGTCCGCTCGACTCGACTCGCTTCGAGGAAACCGAGTTGACTCACTGAATTTCCCCGTCCATCAATAA
mRNA sequenceShow/hide mRNA sequence
ATGAATCGCTCACACTGCAACCATCTTCAATCCAATTCGATTTCTCACTGCCAAGAATGCGGCATCTCTCAGTCCGCTTGTTGGATCCTCCACAATGTCCGTCTCAAAGC
TTCCTTCCGTCGTCTCTGCACCAATTGCGTCCTCAAGCACAACCTCTCCCGTTTCTGTCCTCTTTGCTTCGACGTTTACGACGATTCGACTCCTCCGCCGTCTCATCAGC
GAGTTATGTGCTTCAGATGCCCTTCAATCTCGCATCTCTCTTGCGTTTCCTTTCGATTTTCGTCTACCTTTCTATGCCCTCTCTGCTCCGATCCTTGTTTTGCCTTCTTC
GATGGGTTTGACTCCGGTGGCCTTTGTCAATCGGAGTCCACTGTCGCCTTTTTGGCCGGGAGAAACGTTGATGGTAAATCAGCGAAAGCGATTGTCGCTGCGGCTCGTGT
CTCCGCTCAATCCATGCGGAGAGCAGCTCTTGACGCTAGGGCTGTAGCGGAGATGAAGATCAAGAACGCTGCCTTTGCTAAGAAACAAGCTACTCTCGCATTGGAACGGC
TTGCTTATCTTGTGCTTCAGGAGAAGGACAAAAATGGATATGCTAAAACTAATGGAGATGCTGTTGCTGGTGGGACGGTTGAAGAAGAAGAATCCAAGCTACAGGAGAAA
GAGGTAACAGCCATTTTCGATCCTTCTTCTTCATTTTGCTCTCTCTTCGTGGACGCCATTGTTGCCACCTTTGAGCTACTAGAATCGCCATTGGCCCATTGCAGAATGGA
GCCACAGTGGGCATTTTTTATATTCAAATGCCAGTTCTGGCAGAGCGATGGGAATATCAAGATCCTGGCGGCGATCGCGGCGGCGGTAGATTGTGCTGGTTTTTGTCTCG
TAAAATTGGCTATTTCCCCTTCTTCGTTGTCTCTACAAAACTGCCAGTTTCAATTTCTCGAGCATTCAACCAATAAAGTTTCGTTCTGCTGTGGAGCTCTGAGTTGTTCA
TATGATTTGACCGAGTCCGCTCGACTCGACTCGCTTCGAGGAAACCGAGTTGACTCACTGAATTTCCCCGTCCATCAATAA
Protein sequenceShow/hide protein sequence
MNRSHCNHLQSNSISHCQECGISQSACWILHNVRLKASFRRLCTNCVLKHNLSRFCPLCFDVYDDSTPPPSHQRVMCFRCPSISHLSCVSFRFSSTFLCPLCSDPCFAFF
DGFDSGGLCQSESTVAFLAGRNVDGKSAKAIVAAARVSAQSMRRAALDARAVAEMKIKNAAFAKKQATLALERLAYLVLQEKDKNGYAKTNGDAVAGGTVEEEESKLQEK
EVTAIFDPSSSFCSLFVDAIVATFELLESPLAHCRMEPQWAFFIFKCQFWQSDGNIKILAAIAAAVDCAGFCLVKLAISPSSLSLQNCQFQFLEHSTNKVSFCCGALSCS
YDLTESARLDSLRGNRVDSLNFPVHQ