CuGenDBv2

Lag0030983 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0030983
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: F12P19.7, putative isoform 2
Genome location: chr11:3556304..3561203
RNA-Seq Expression: Lag0030983
Synteny: Lag0030983
Gene Ontology terms: GO:0003677 - DNA binding (molecular function)
InterPro domains: IPR017956 - AT hook, DNA-binding motif


Homology
GenBank top hits (e value, %identity, alignment)
KAA0044364.1 F12P19.7, putative isoform 2 [Cucumis melo var. makuwa]; e value: 2.3e-149; %identity: 87
Query:  FGPVFLWLLGLLESLKGITSERVTSECVLKQYEKGEIQIINKTETRQLAQFSAHFVADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLGVFANLESRATQ
        F    L+++ LL SLKGITSE VTSECVLKQYEKGEIQIINKTET+QLAQF+AHFVADVDQPQSCNFATFLPSSEDTPLQ+AEWIKFLG FAN+E RA Q
Subjt:  FGPVFLWLLGLLESLKGITSERVTSECVLKQYEKGEIQIINKTETRQLAQFSAHFVADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLGVFANLESRATQ

Query:  IYTAIKENYMCLKNIATTRKTFKPIVAWMGYYDGIWSFTKDAFKLKYIEDAGGENVDESINKITYNVSNPDDLDTFHGILCTVEVIIDETFTSDPTAYNV
        IYTAIKENY+CLKNIATTRKTFKPIVAWMGYYDG+WSFTKDA+KLKYIEDAGGENVD+SINKITYNVSNPDDLD FHGILCTVEVIIDETFTSDP AYN+
Subjt:  IYTAIKENYMCLKNIATTRKTFKPIVAWMGYYDGIWSFTKDAFKLKYIEDAGGENVDESINKITYNVSNPDDLDTFHGILCTVEVIIDETFTSDPTAYNV

Query:  STFLELINIQDQSCLSFLSSQSIWRFDKRFHNSTALDWFDGAISQPQLVLADIIEVLFPTANYTTTYFRNLAKEGVKSISSEMCERDSSSALEPTIIACG
        STFL+LINIQDQSCLSFLS+QSIWRFDKRFH+S A DWFDGAISQPQLVLADIIEVLFPT N+TTTYFRNLAKEGV +I SEMCERD ++ALEPTI+ CG
Subjt:  STFLELINIQDQSCLSFLSSQSIWRFDKRFHNSTALDWFDGAISQPQLVLADIIEVLFPTANYTTTYFRNLAKEGVKSISSEMCERDSSSALEPTIIACG

TYK29493.1 uncharacterized protein E5676_scaffold655G00920 [Cucumis melo var. makuwa]; e value: 5.6e-148; %identity: 89.55
Query:  SLKGITSERVTSECVLKQYEKGEIQIINKTETRQLAQFSAHFVADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLGVFANLESRATQIYTAIKENYMCLK
        SLKGITSE VTSECVLKQYEKGEIQIINKTET+QLAQF+AHFVADVDQPQSCNFATFLPSSEDTPLQ+AEWIKFLG FAN+E RA QIYTAIKENY+CLK
Subjt:  SLKGITSERVTSECVLKQYEKGEIQIINKTETRQLAQFSAHFVADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLGVFANLESRATQIYTAIKENYMCLK

Query:  NIATTRKTFKPIVAWMGYYDGIWSFTKDAFKLKYIEDAGGENVDESINKITYNVSNPDDLDTFHGILCTVEVIIDETFTSDPTAYNVSTFLELINIQDQS
        NIATTRKTFKPIVAWMGYYDG+WSFTKDA+KLKYIEDAGGENVD+SINKITYNVSNPDDLD FHGILCTVEVIIDETFTSDP AYN+STFL+LINIQDQS
Subjt:  NIATTRKTFKPIVAWMGYYDGIWSFTKDAFKLKYIEDAGGENVDESINKITYNVSNPDDLDTFHGILCTVEVIIDETFTSDPTAYNVSTFLELINIQDQS

Query:  CLSFLSSQSIWRFDKRFHNSTALDWFDGAISQPQLVLADIIEVLFPTANYTTTYFRNLAKEGVKSISSEMCERDSSSALEPTIIACG
        CLSFLS+QSIWRFDKRFH+S A DWFDGAISQPQLVLADIIEVLFPT N+TTTYFRNLAKEGV +I SEMCERD ++ALEPTI+ CG
Subjt:  CLSFLSSQSIWRFDKRFHNSTALDWFDGAISQPQLVLADIIEVLFPTANYTTTYFRNLAKEGVKSISSEMCERDSSSALEPTIIACG

XP_004152259.1 uncharacterized protein LOC101208429 isoform X1 [Cucumis sativus]; e value: 1.5e-153; %identity: 91.22
Query:  FLWLLGLLESLKGITSERVTSECVLKQYEKGEIQIINKTETRQLAQFSAHFVADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLGVFANLESRATQIYTA
        F  LLGLL SLKGITSE VTSECVLKQYEKGEIQIINKTET+QLAQF+AHF+ADVDQPQSCNFATFLPSSEDTPLQ+AEWIKFLG FAN+E RA QIYTA
Subjt:  FLWLLGLLESLKGITSERVTSECVLKQYEKGEIQIINKTETRQLAQFSAHFVADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLGVFANLESRATQIYTA

Query:  IKENYMCLKNIATTRKTFKPIVAWMGYYDGIWSFTKDAFKLKYIEDAGGENVDESINKITYNVSNPDDLDTFHGILCTVEVIIDETFTSDPTAYNVSTFL
        IKENYMCLKNIATTRKTFKPIVAWMGYYDGIWSFTKDA+KLKYIEDAGGENVD+SINKITYNVSNPDDLD FHGILCTVEVIIDETFTSDPTAYN+STFL
Subjt:  IKENYMCLKNIATTRKTFKPIVAWMGYYDGIWSFTKDAFKLKYIEDAGGENVDESINKITYNVSNPDDLDTFHGILCTVEVIIDETFTSDPTAYNVSTFL

Query:  ELINIQDQSCLSFLSSQSIWRFDKRFHNSTALDWFDGAISQPQLVLADIIEVLFPTANYTTTYFRNLAKEGVKSISSEMCERDSSSALEPTIIACG
        +LINIQDQSCLSFLS+QSIWRFDKRFHNS A DWFDGAISQPQLVLADIIEVLFPT N+TTTYFRNLAKEGV +I SEMCERD SSALEPTIIACG
Subjt:  ELINIQDQSCLSFLSSQSIWRFDKRFHNSTALDWFDGAISQPQLVLADIIEVLFPTANYTTTYFRNLAKEGVKSISSEMCERDSSSALEPTIIACG

XP_031739521.1 uncharacterized protein LOC101208429 isoform X2 [Cucumis sativus]; e value: 1.5e-153; %identity: 91.22
Query:  FLWLLGLLESLKGITSERVTSECVLKQYEKGEIQIINKTETRQLAQFSAHFVADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLGVFANLESRATQIYTA
        F  LLGLL SLKGITSE VTSECVLKQYEKGEIQIINKTET+QLAQF+AHF+ADVDQPQSCNFATFLPSSEDTPLQ+AEWIKFLG FAN+E RA QIYTA
Subjt:  FLWLLGLLESLKGITSERVTSECVLKQYEKGEIQIINKTETRQLAQFSAHFVADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLGVFANLESRATQIYTA

Query:  IKENYMCLKNIATTRKTFKPIVAWMGYYDGIWSFTKDAFKLKYIEDAGGENVDESINKITYNVSNPDDLDTFHGILCTVEVIIDETFTSDPTAYNVSTFL
        IKENYMCLKNIATTRKTFKPIVAWMGYYDGIWSFTKDA+KLKYIEDAGGENVD+SINKITYNVSNPDDLD FHGILCTVEVIIDETFTSDPTAYN+STFL
Subjt:  IKENYMCLKNIATTRKTFKPIVAWMGYYDGIWSFTKDAFKLKYIEDAGGENVDESINKITYNVSNPDDLDTFHGILCTVEVIIDETFTSDPTAYNVSTFL

Query:  ELINIQDQSCLSFLSSQSIWRFDKRFHNSTALDWFDGAISQPQLVLADIIEVLFPTANYTTTYFRNLAKEGVKSISSEMCERDSSSALEPTIIACG
        +LINIQDQSCLSFLS+QSIWRFDKRFHNS A DWFDGAISQPQLVLADIIEVLFPT N+TTTYFRNLAKEGV +I SEMCERD SSALEPTIIACG
Subjt:  ELINIQDQSCLSFLSSQSIWRFDKRFHNSTALDWFDGAISQPQLVLADIIEVLFPTANYTTTYFRNLAKEGVKSISSEMCERDSSSALEPTIIACG

XP_038903360.1 uncharacterized protein LOC120089977 [Benincasa hispida]; e value: 1.1e-154; %identity: 91.89
Query:  FLWLLGLLESLKGITSERVTSECVLKQYEKGEIQIINKTETRQLAQFSAHFVADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLGVFANLESRATQIYTA
        F  LLGLL SLKGITSERVTSECVLKQYEKG+IQIINKTET+QLAQF+AHFVADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLG FANLE+RATQIY+A
Subjt:  FLWLLGLLESLKGITSERVTSECVLKQYEKGEIQIINKTETRQLAQFSAHFVADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLGVFANLESRATQIYTA

Query:  IKENYMCLKNIATTRKTFKPIVAWMGYYDGIWSFTKDAFKLKYIEDAGGENVDESINKITYNVSNPDDLDTFHGILCTVEVIIDETFTSDPTAYNVSTFL
        IKENYMCLKNIATTRKTFKPIVAWMGYYDGIWSFTKDA+KLKYIEDAGGENVD+SINKITYNVSNPDDLD FHGILCTVEVIIDETFTSDP  YN+STFL
Subjt:  IKENYMCLKNIATTRKTFKPIVAWMGYYDGIWSFTKDAFKLKYIEDAGGENVDESINKITYNVSNPDDLDTFHGILCTVEVIIDETFTSDPTAYNVSTFL

Query:  ELINIQDQSCLSFLSSQSIWRFDKRFHNSTALDWFDGAISQPQLVLADIIEVLFPTANYTTTYFRNLAKEGVKSISSEMCERDSSSALEPTIIACG
        +LINIQDQSCLSFLS+QSIWRFDKRFHNS A DWFDGAISQPQLVLADIIEVLFPT N+TTTYFRNLAKEGV +I SEMCERDSSSALEPTIIACG
Subjt:  ELINIQDQSCLSFLSSQSIWRFDKRFHNSTALDWFDGAISQPQLVLADIIEVLFPTANYTTTYFRNLAKEGVKSISSEMCERDSSSALEPTIIACG

TrEMBL top hits (e value, %identity, alignment)
A0A0A0KV51 Uncharacterized protein; e value: 7.4e-154; %identity: 91.22
Query:  FLWLLGLLESLKGITSERVTSECVLKQYEKGEIQIINKTETRQLAQFSAHFVADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLGVFANLESRATQIYTA
        F  LLGLL SLKGITSE VTSECVLKQYEKGEIQIINKTET+QLAQF+AHF+ADVDQPQSCNFATFLPSSEDTPLQ+AEWIKFLG FAN+E RA QIYTA
Subjt:  FLWLLGLLESLKGITSERVTSECVLKQYEKGEIQIINKTETRQLAQFSAHFVADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLGVFANLESRATQIYTA

Query:  IKENYMCLKNIATTRKTFKPIVAWMGYYDGIWSFTKDAFKLKYIEDAGGENVDESINKITYNVSNPDDLDTFHGILCTVEVIIDETFTSDPTAYNVSTFL
        IKENYMCLKNIATTRKTFKPIVAWMGYYDGIWSFTKDA+KLKYIEDAGGENVD+SINKITYNVSNPDDLD FHGILCTVEVIIDETFTSDPTAYN+STFL
Subjt:  IKENYMCLKNIATTRKTFKPIVAWMGYYDGIWSFTKDAFKLKYIEDAGGENVDESINKITYNVSNPDDLDTFHGILCTVEVIIDETFTSDPTAYNVSTFL

Query:  ELINIQDQSCLSFLSSQSIWRFDKRFHNSTALDWFDGAISQPQLVLADIIEVLFPTANYTTTYFRNLAKEGVKSISSEMCERDSSSALEPTIIACG
        +LINIQDQSCLSFLS+QSIWRFDKRFHNS A DWFDGAISQPQLVLADIIEVLFPT N+TTTYFRNLAKEGV +I SEMCERD SSALEPTIIACG
Subjt:  ELINIQDQSCLSFLSSQSIWRFDKRFHNSTALDWFDGAISQPQLVLADIIEVLFPTANYTTTYFRNLAKEGVKSISSEMCERDSSSALEPTIIACG

A0A5A7TMJ6 F12P19.7, putative isoform 2; e value: 1.1e-149; %identity: 87
Query:  FGPVFLWLLGLLESLKGITSERVTSECVLKQYEKGEIQIINKTETRQLAQFSAHFVADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLGVFANLESRATQ
        F    L+++ LL SLKGITSE VTSECVLKQYEKGEIQIINKTET+QLAQF+AHFVADVDQPQSCNFATFLPSSEDTPLQ+AEWIKFLG FAN+E RA Q
Subjt:  FGPVFLWLLGLLESLKGITSERVTSECVLKQYEKGEIQIINKTETRQLAQFSAHFVADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLGVFANLESRATQ

Query:  IYTAIKENYMCLKNIATTRKTFKPIVAWMGYYDGIWSFTKDAFKLKYIEDAGGENVDESINKITYNVSNPDDLDTFHGILCTVEVIIDETFTSDPTAYNV
        IYTAIKENY+CLKNIATTRKTFKPIVAWMGYYDG+WSFTKDA+KLKYIEDAGGENVD+SINKITYNVSNPDDLD FHGILCTVEVIIDETFTSDP AYN+
Subjt:  IYTAIKENYMCLKNIATTRKTFKPIVAWMGYYDGIWSFTKDAFKLKYIEDAGGENVDESINKITYNVSNPDDLDTFHGILCTVEVIIDETFTSDPTAYNV

Query:  STFLELINIQDQSCLSFLSSQSIWRFDKRFHNSTALDWFDGAISQPQLVLADIIEVLFPTANYTTTYFRNLAKEGVKSISSEMCERDSSSALEPTIIACG
        STFL+LINIQDQSCLSFLS+QSIWRFDKRFH+S A DWFDGAISQPQLVLADIIEVLFPT N+TTTYFRNLAKEGV +I SEMCERD ++ALEPTI+ CG
Subjt:  STFLELINIQDQSCLSFLSSQSIWRFDKRFHNSTALDWFDGAISQPQLVLADIIEVLFPTANYTTTYFRNLAKEGVKSISSEMCERDSSSALEPTIIACG

A0A5D3E0B2 Uncharacterized protein; e value: 2.7e-148; %identity: 89.55
Query:  SLKGITSERVTSECVLKQYEKGEIQIINKTETRQLAQFSAHFVADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLGVFANLESRATQIYTAIKENYMCLK
        SLKGITSE VTSECVLKQYEKGEIQIINKTET+QLAQF+AHFVADVDQPQSCNFATFLPSSEDTPLQ+AEWIKFLG FAN+E RA QIYTAIKENY+CLK
Subjt:  SLKGITSERVTSECVLKQYEKGEIQIINKTETRQLAQFSAHFVADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLGVFANLESRATQIYTAIKENYMCLK

Query:  NIATTRKTFKPIVAWMGYYDGIWSFTKDAFKLKYIEDAGGENVDESINKITYNVSNPDDLDTFHGILCTVEVIIDETFTSDPTAYNVSTFLELINIQDQS
        NIATTRKTFKPIVAWMGYYDG+WSFTKDA+KLKYIEDAGGENVD+SINKITYNVSNPDDLD FHGILCTVEVIIDETFTSDP AYN+STFL+LINIQDQS
Subjt:  NIATTRKTFKPIVAWMGYYDGIWSFTKDAFKLKYIEDAGGENVDESINKITYNVSNPDDLDTFHGILCTVEVIIDETFTSDPTAYNVSTFLELINIQDQS

Query:  CLSFLSSQSIWRFDKRFHNSTALDWFDGAISQPQLVLADIIEVLFPTANYTTTYFRNLAKEGVKSISSEMCERDSSSALEPTIIACG
        CLSFLS+QSIWRFDKRFH+S A DWFDGAISQPQLVLADIIEVLFPT N+TTTYFRNLAKEGV +I SEMCERD ++ALEPTI+ CG
Subjt:  CLSFLSSQSIWRFDKRFHNSTALDWFDGAISQPQLVLADIIEVLFPTANYTTTYFRNLAKEGVKSISSEMCERDSSSALEPTIIACG

A0A6J1CNU5 uncharacterized protein LOC111013325 isoform X2; e value: 7.4e-146; %identity: 87.29
Query:  FLWLLGLLESLKGI-TSERVTSECVLKQYEKGEIQIINKTETRQLAQFSAHFVADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLGVFANLESRATQIYT
        F  LLG++ SLKGI TS  + SECVLKQYEKGEIQIIN T+  QLAQFSAHFVADVDQ Q CNFA FLPSSEDTPLQRAEWIKFLGVFANLE+RA+QIYT
Subjt:  FLWLLGLLESLKGI-TSERVTSECVLKQYEKGEIQIINKTETRQLAQFSAHFVADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLGVFANLESRATQIYT

Query:  AIKENYMCLKNIATTRKTFKPIVAWMGYYDGIWSFTKDAFKLKYIEDAGGENVDESINKITYNVSNPDDLDTFHGILCTVEVIIDETFTSDPTAYNVSTF
        A+KENYMCLKNIATTRKTFKPIVAW+GYYDGIWSFTKD++KLKYIEDAGGENVDESINKITYNVSNPDDLD FHGILCTVEV+IDET+ SDPTAY VSTF
Subjt:  AIKENYMCLKNIATTRKTFKPIVAWMGYYDGIWSFTKDAFKLKYIEDAGGENVDESINKITYNVSNPDDLDTFHGILCTVEVIIDETFTSDPTAYNVSTF

Query:  LELINIQDQSCLSFLSSQSIWRFDKRFHNST--ALDWFDGAISQPQLVLADIIEVLFPTANYTTTYFRNLAKEGVKSISSEMCERDSSSALEPTIIACG
        L+L NI+DQSCLSF+SSQSIWRFDKRFHNST  ALDWFDGAISQPQLVLAD+IEVLFPTANYTTTYFRNLAKEGV +ISSEMCERD SSALEPTIIACG
Subjt:  LELINIQDQSCLSFLSSQSIWRFDKRFHNST--ALDWFDGAISQPQLVLADIIEVLFPTANYTTTYFRNLAKEGVKSISSEMCERDSSSALEPTIIACG

A0A6J1F572 uncharacterized protein LOC111442235; e value: 2.5e-146; %identity: 87.46
Query:  FLWLLGLLESLKGITSERVTSECVLKQYEKGEIQIINKTETRQLAQFSAHFVADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLGVFANLESRATQIYTA
        F  LLGL+ +LK ITSERVTSECVLKQYEKGEIQIINKTET+QLAQF+AHF+ADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLGVFANLE+RATQIY+A
Subjt:  FLWLLGLLESLKGITSERVTSECVLKQYEKGEIQIINKTETRQLAQFSAHFVADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLGVFANLESRATQIYTA

Query:  IKENYMCLKNIATTRKTFKPIVAWMGYYDGIWSFTKDAFKLKYIEDAGGENVDESINKITYNVSNPDDLDTFHGILCTVEVIIDETFTSDPTAYNVSTFL
        +KENYMCLKNIATTRKTFKPIVAWMGY DG+WSFTKDA+KLKYIEDAGGENVD+SINKITYNVSNPDDLD FHGILCTVEVIIDETFT DPT YN+STFL
Subjt:  IKENYMCLKNIATTRKTFKPIVAWMGYYDGIWSFTKDAFKLKYIEDAGGENVDESINKITYNVSNPDDLDTFHGILCTVEVIIDETFTSDPTAYNVSTFL

Query:  ELINIQDQSCLSFLSSQSIWRFDKRFHNSTALDWFDGAISQPQLVLADIIEVLFPTANYTTTYFRNLAKEGVKSISSEMCERDSSSALEPTIIAC
        ELI+IQDQSCLSFLS+QSIWRFDKRF +ST LDW DG +SQPQLVLAD+I +LF   NYTTTYFRNLAKEGV  ISSEMCER+SSSALEPTIIAC
Subjt:  ELINIQDQSCLSFLSSQSIWRFDKRFHNSTALDWFDGAISQPQLVLADIIEVLFPTANYTTTYFRNLAKEGVKSISSEMCERDSSSALEPTIIAC

SwissProt top hits (e value, %identity, alignment)
No hits found
Arabidopsis top hits (e value, %identity, alignment)
AT1G65900.1 unknown protein; e value: 7.1e-109; %identity: 63
Query:  FLWLLGLLESLKGITSERVTSECVLKQYEKGEIQIINKTETRQLAQFSAHFVADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLGVFANLESRATQIYTA
        F  LLGLL SLKGITS+ V S C+LK  E GE+  ++K E  QL+QF+AHF++D DQPQ+CNFA F P SE TPLQRAEWIKFLG F NLE++A Q+Y +
Subjt:  FLWLLGLLESLKGITSERVTSECVLKQYEKGEIQIINKTETRQLAQFSAHFVADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLGVFANLESRATQIYTA

Query:  IKENYMCLKNIATTR-KTFKPIVAWMGY--YDGIWSFTKDAFKLKYIEDAGGENVDESINKITYNVSNPDDLDTFHGILCTVEVIIDETFTSDPTAYNVS
        +K +Y CL  +A  + K+FKPIVAWMGY    G+WSFTK++ KLK++EDAGGEN+D+SINK++YNVS+PDDL+  H ILCTV+ +IDET +SDP  Y  +
Subjt:  IKENYMCLKNIATTR-KTFKPIVAWMGY--YDGIWSFTKDAFKLKYIEDAGGENVDESINKITYNVSNPDDLDTFHGILCTVEVIIDETFTSDPTAYNVS

Query:  TFLELINIQDQSCLSFLSSQSIWRFDKRFHNSTALDWFDGAISQPQLVLADIIEVLFPTANYTTTYFRNLAK-EGVKSISSEMCERDSSSALEPTIIACG
        TFL  IN+ D SC +FL++QSIWR+DKR  N T LDW+DGAISQP LVLADI+E LFPT NYTT+YFRN+AK EGV +IS +MC+RD+S  L P+I ACG
Subjt:  TFLELINIQDQSCLSFLSSQSIWRFDKRFHNSTALDWFDGAISQPQLVLADIIEVLFPTANYTTTYFRNLAK-EGVKSISSEMCERDSSSALEPTIIACG


Sequences
CDS sequence
ATGGTTGGCCTTGGAGTTAGGATGTTTGGCCTTGGCATGACGATGCACACACTCTTCTGGGGAAAAGGCTTGGGGAGGCCCAAGAAATTTCCGAGCCTCTCTTCAAGCTG
TCACGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCAAGGAAATTCCCGAGCCTCTCTTCAAGCTGTCACGGTCATCCCAGGGGTGCGTACA
CTCTTCTGGGGAAAGGCTTGGGGAGGCCAAGGAAATTCTCGAGCCTCTCTTTCAAGCTGTCACAGTCTTTTGAGCAAAAGGCCGAGGCCGACCATATGGGTCGGGCCATT
TTGGCCCGACCCTTTGGTCCGGTCTTCCTCTGGCTACTAGGGTTATTGGAAAGCTTGAAGGGCATAACATCGGAGAGGGTGACGTCAGAATGCGTATTGAAGCAATACGA
AAAAGGGGAAATTCAAATTATAAATAAAACGGAAACACGCCAGCTGGCACAGTTTTCGGCTCACTTCGTTGCTGACGTGGACCAACCACAGTCCTGCAATTTTGCCACCT
TTCTCCCTTCCTCCGAGGATACGCCTCTGCAAAGAGCAGAGTGGATAAAGTTTTTGGGAGTTTTTGCAAATCTTGAATCAAGAGCCACTCAAATTTACACTGCGATCAAA
GAAAACTACATGTGTCTGAAGAACATAGCAACCACTAGAAAGACTTTTAAACCTATAGTTGCTTGGATGGGTTACTATGATGGCATATGGTCTTTCACTAAGGACGCCTT
CAAGCTCAAGTACATAGAAGATGCGGGAGGGGAGAATGTGGACGAGTCGATCAACAAAATCACATACAACGTCTCTAATCCCGACGATTTAGACACCTTTCATGGAATCC
TATGCACGGTGGAGGTGATCATCGATGAAACGTTTACGTCAGATCCAACGGCGTACAACGTGTCCACGTTTCTTGAACTCATCAATATTCAAGATCAATCTTGCCTCTCT
TTTCTTTCCTCTCAAAGCATTTGGCGATTCGATAAGCGATTTCACAACTCCACTGCTCTCGATTGGTTCGACGGAGCAATCTCACAGCCCCAATTGGTACTGGCAGACAT
CATAGAGGTTTTGTTCCCTACGGCCAATTACACAACAACCTATTTTAGGAACTTGGCAAAGGAAGGAGTTAAAAGCATTAGTTCAGAAATGTGTGAGAGAGATAGTTCTT
CTGCATTGGAGCCCACCATCATAGCCTGTGGATGA
mRNA sequence
ATGGTTGGCCTTGGAGTTAGGATGTTTGGCCTTGGCATGACGATGCACACACTCTTCTGGGGAAAAGGCTTGGGGAGGCCCAAGAAATTTCCGAGCCTCTCTTCAAGCTG
TCACGGTCATCCCAGGGGTGCGTACACTCTTCTGGGGAAAAGCTTGGGGAGGCCAAGGAAATTCCCGAGCCTCTCTTCAAGCTGTCACGGTCATCCCAGGGGTGCGTACA
CTCTTCTGGGGAAAGGCTTGGGGAGGCCAAGGAAATTCTCGAGCCTCTCTTTCAAGCTGTCACAGTCTTTTGAGCAAAAGGCCGAGGCCGACCATATGGGTCGGGCCATT
TTGGCCCGACCCTTTGGTCCGGTCTTCCTCTGGCTACTAGGGTTATTGGAAAGCTTGAAGGGCATAACATCGGAGAGGGTGACGTCAGAATGCGTATTGAAGCAATACGA
AAAAGGGGAAATTCAAATTATAAATAAAACGGAAACACGCCAGCTGGCACAGTTTTCGGCTCACTTCGTTGCTGACGTGGACCAACCACAGTCCTGCAATTTTGCCACCT
TTCTCCCTTCCTCCGAGGATACGCCTCTGCAAAGAGCAGAGTGGATAAAGTTTTTGGGAGTTTTTGCAAATCTTGAATCAAGAGCCACTCAAATTTACACTGCGATCAAA
GAAAACTACATGTGTCTGAAGAACATAGCAACCACTAGAAAGACTTTTAAACCTATAGTTGCTTGGATGGGTTACTATGATGGCATATGGTCTTTCACTAAGGACGCCTT
CAAGCTCAAGTACATAGAAGATGCGGGAGGGGAGAATGTGGACGAGTCGATCAACAAAATCACATACAACGTCTCTAATCCCGACGATTTAGACACCTTTCATGGAATCC
TATGCACGGTGGAGGTGATCATCGATGAAACGTTTACGTCAGATCCAACGGCGTACAACGTGTCCACGTTTCTTGAACTCATCAATATTCAAGATCAATCTTGCCTCTCT
TTTCTTTCCTCTCAAAGCATTTGGCGATTCGATAAGCGATTTCACAACTCCACTGCTCTCGATTGGTTCGACGGAGCAATCTCACAGCCCCAATTGGTACTGGCAGACAT
CATAGAGGTTTTGTTCCCTACGGCCAATTACACAACAACCTATTTTAGGAACTTGGCAAAGGAAGGAGTTAAAAGCATTAGTTCAGAAATGTGTGAGAGAGATAGTTCTT
CTGCATTGGAGCCCACCATCATAGCCTGTGGATGA
Protein sequence
MVGLGVRMFGLGMTMHTLFWGKGLGRPKKFPSLSSSCHGHPRGAYTLLGKSLGRPRKFPSLSSSCHGHPRGAYTLLGKGLGRPRKFSSLSFKLSQSFEQKAEADHMGRAI
LARPFGPVFLWLLGLLESLKGITSERVTSECVLKQYEKGEIQIINKTETRQLAQFSAHFVADVDQPQSCNFATFLPSSEDTPLQRAEWIKFLGVFANLESRATQIYTAIK
ENYMCLKNIATTRKTFKPIVAWMGYYDGIWSFTKDAFKLKYIEDAGGENVDESINKITYNVSNPDDLDTFHGILCTVEVIIDETFTSDPTAYNVSTFLELINIQDQSCLS
FLSSQSIWRFDKRFHNSTALDWFDGAISQPQLVLADIIEVLFPTANYTTTYFRNLAKEGVKSISSEMCERDSSSALEPTIIACG