; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi07G003630 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi07G003630
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionSANT domain-containing protein
Genome locationchr07:3880756..3894704
RNA-Seq ExpressionLsi07G003630
SyntenyLsi07G003630
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6581997.1 hypothetical protein SDJN03_21999, partial [Cucurbita argyrosperma subsp. sororia]2.6e-7581.58Show/hide
Query:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNH EAGQPSSSFDGGNPSNGNSTPVP ADNSSSA+AMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

Query:  ----------------------ERVSDPSLKSAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQVK
                              ERVSDPS+KSAQVAARPNVPPYGMPMIPMDNDD     AIGGTTG+LLEQNAHAM+QISSNLASFQ++
Subjt:  ----------------------ERVSDPSLKSAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQVK

XP_004152146.1 uncharacterized protein LOC101222201 isoform X2 [Cucumis sativus]1.5e-7582.63Show/hide
Query:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVP ADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

Query:  ----------------------ERVSDPSLKSAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQVK
                              ERVSD S+KSAQVAARPNVPPYGMPMIPMDNDD     AIGGTTGELLEQNAHAM+QISSNLASFQ++
Subjt:  ----------------------ERVSDPSLKSAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQVK

XP_016901549.1 PREDICTED: uncharacterized protein LOC103494613 isoform X2 [Cucumis melo]2.3e-7993.45Show/hide
Query:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVP ADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

Query:  ERVSDPSLKSAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQVK
        ERVSD S+KSAQVAARPNVPPYGMPMIPMDNDD     AIGGTTGELLEQNAHAM+QISSNLASFQ++
Subjt:  ERVSDPSLKSAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQVK

XP_022955472.1 uncharacterized protein LOC111457487 [Cucurbita moschata]2.6e-7581.58Show/hide
Query:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNH EAGQPSSSFDGGNPSNGNSTPVP ADNSSSA+AMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

Query:  ----------------------ERVSDPSLKSAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQVK
                              ERVSDPS+KSAQVAARPNVPPYGMPMIPMDNDD     AIGGTTG+LLEQNAHAM+QISSNLASFQ++
Subjt:  ----------------------ERVSDPSLKSAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQVK

XP_038882972.1 uncharacterized protein LOC120074053 [Benincasa hispida]2.0e-7582.11Show/hide
Query:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNHQEAGQPSSSFDGGNPSNGNSTPV  ADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYA+ESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

Query:  ----------------------ERVSDPSLKSAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQVK
                              ERVSDPS+KSAQVAARPNVPPYGMPMIPMDNDD     AIGGTTGELLEQN HAMHQISSNLASFQ++
Subjt:  ----------------------ERVSDPSLKSAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQVK

TrEMBL top hitse value%identityAlignment
A0A0A0KX84 SANT domain-containing protein7.4e-7682.63Show/hide
Query:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVP ADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

Query:  ----------------------ERVSDPSLKSAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQVK
                              ERVSD S+KSAQVAARPNVPPYGMPMIPMDNDD     AIGGTTGELLEQNAHAM+QISSNLASFQ++
Subjt:  ----------------------ERVSDPSLKSAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQVK

A0A1S3BYL7 uncharacterized protein LOC103494613 isoform X17.4e-7682.63Show/hide
Query:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVP ADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

Query:  ----------------------ERVSDPSLKSAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQVK
                              ERVSD S+KSAQVAARPNVPPYGMPMIPMDNDD     AIGGTTGELLEQNAHAM+QISSNLASFQ++
Subjt:  ----------------------ERVSDPSLKSAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQVK

A0A1S4E017 uncharacterized protein LOC103494613 isoform X21.1e-7993.45Show/hide
Query:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVP ADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

Query:  ERVSDPSLKSAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQVK
        ERVSD S+KSAQVAARPNVPPYGMPMIPMDNDD     AIGGTTGELLEQNAHAM+QISSNLASFQ++
Subjt:  ERVSDPSLKSAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQVK

A0A6J1CVN7 uncharacterized protein LOC1110152032.8e-7582.11Show/hide
Query:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNHQEAGQPSSSFDGGNPSNGNSTPV  ADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYA ESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

Query:  ----------------------ERVSDPSLKSAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQVK
                              ERVSDPS+KSAQVAARPNVPPYGMPMIPMDNDD     AIGGTTGELLEQNAHAM+QISSNLASFQ++
Subjt:  ----------------------ERVSDPSLKSAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQVK

A0A6J1GTR4 uncharacterized protein LOC1114574871.3e-7581.58Show/hide
Query:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNH EAGQPSSSFDGGNPSNGNSTPVP ADNSSSA+AMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

Query:  ----------------------ERVSDPSLKSAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQVK
                              ERVSDPS+KSAQVAARPNVPPYGMPMIPMDNDD     AIGGTTG+LLEQNAHAM+QISSNLASFQ++
Subjt:  ----------------------ERVSDPSLKSAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQVK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10820.1 Protein of unknown function (DUF3755)1.2e-1434.84Show/hide
Query:  VPTADNSSS-ALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWM--NERVSDPSLKSAQVAARPNV---PPYGM
        +PT D S S A  +K    +  DW+ +EQ  LE GL K   E  + +Y KIA  LP+KTVRDVALRCRWM    R  + +  +  ++ R  V   P   M
Subjt:  VPTADNSSS-ALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWM--NERVSDPSLKSAQVAARPNV---PPYGM

Query:  -PMIPMDND---------------DAIGGTTGELLEQNAHAMHQISSNLASFQVK
           +P  N                + +     +LL+QNA A  QIS NL++ +++
Subjt:  -PMIPMDND---------------DAIGGTTGELLEQNAHAMHQISSNLASFQVK

AT3G07565.1 Protein of unknown function (DUF3755)5.3e-3443.94Show/hide
Query:  ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRD
        ANPSGN+QE    +      + +  N   V            ADNS +  A++HNPGISTDWT +EQ  LE+ L KYA E SV RYAKIAM++ +KTVRD
Subjt:  ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRD

Query:  VALRCRWM---------------------NERVSDPSLK-SAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQV
        VALRCRWM                      E+ +D S K S+ +   PN P Y  PM+P+D DD     AIGG +G+LLEQNA   +Q+S+N ++FQ+
Subjt:  VALRCRWM---------------------NERVSDPSLK-SAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQV

AT3G07565.2 Protein of unknown function (DUF3755)5.3e-2643.37Show/hide
Query:  ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRD
        ANPSGN+QE    +      + +  N   V            ADNS +  A++HNPGISTDWT +EQ  LE+ L KYA E SV RYAKIAM++ +KTVRD
Subjt:  ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRD

Query:  VALRCRWM---------------------NERVSDPSLK-SAQVAARPNVPPYGMPMIPMDNDDAI
        VALRCRWM                      E+ +D S K S+ +   PN P Y  PM+P+D DD I
Subjt:  VALRCRWM---------------------NERVSDPSLK-SAQVAARPNVPPYGMPMIPMDNDDAI

AT3G07565.3 Protein of unknown function (DUF3755)7.0e-3443.72Show/hide
Query:  ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRD
        ANPSGN+QE    +      + +  N   V            ADNS +  A++HNPGISTDWT +EQ  LE+ L KYA E SV RYAKIAM++ +KTVRD
Subjt:  ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRD

Query:  VALRCRWM----------------------NERVSDPSLK-SAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQV
        VALRCRWM                       E+ +D S K S+ +   PN P Y  PM+P+D DD     AIGG +G+LLEQNA   +Q+S+N ++FQ+
Subjt:  VALRCRWM----------------------NERVSDPSLK-SAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQV

AT3G07565.4 Protein of unknown function (DUF3755)3.1e-3444.22Show/hide
Query:  ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRD
        ANPSGN+QE    +      + +  N   V            ADNS +  A++HNPGISTDWT +EQ  LE+ L KYA E SV RYAKIAM++ +KTVRD
Subjt:  ANPSGNHQEAGQPSSSFDGGNPSNGNSTPV----------PTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRD

Query:  VALRCRWM----------------------NERVSDPSLK-SAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQV
        VALRCRWM                       E+ +D S K S+ +   PN P Y  PM+P+D DD     AIGG +G+LLEQNA   +Q+S+N ++FQV
Subjt:  VALRCRWM----------------------NERVSDPSLK-SAQVAARPNVPPYGMPMIPMDNDD-----AIGGTTGELLEQNAHAMHQISSNLASFQV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTAACCCATCTGGGAACCATCAAGAAGCTGGCCAACCATCGTCTTCTTTCGATGGAGGGAACCCCAGCAACGGTAATTCGACTCCTGTGCCTACAGCGGATAATTC
GAGTTCGGCTCTTGCGATGAAGCACAACCCAGGTATCTCAACGGATTGGACATCTGATGAGCAGGTCACACTGGAAGAAGGGCTTAAGAAATATGCTGCAGAGTCTAGTG
TTATTCGATATGCAAAGATTGCAATGCAGCTACCGAATAAGACTGTACGAGATGTTGCTTTGCGCTGCAGATGGATGAATGAAAGAGTATCTGACCCTTCATTGAAGTCA
GCACAGGTTGCAGCAAGGCCTAATGTGCCTCCTTATGGAATGCCAATGATTCCCATGGACAATGATGATGCTATTGGTGGTACAACTGGAGAACTTCTTGAACAGAATGC
ACATGCAATGCATCAAATTTCTTCTAATCTTGCATCTTTTCAGGTGAAGTACTTGGTTGTCTGGATGTTGTTTTAG
mRNA sequenceShow/hide mRNA sequence
AAAAAGACAGAGAAAAAGAAAGAAAGAAGGAAGAAAAAGGAAGTTAAAAGAAAAAAGAAAAAAATGTAATGTTGCTTACGAATATGTTGACACTTTCCGACGTCAAACCT
CCAAACCCTCTCTCTAATTGCTGTATACAGAGACATCGGTCTCATTTCTCTCTCTTTTCTGCAACTTTCTTTAGTTTATATTGTTTACCCGAGCTCCCTTCAGTCTTAAT
CCGTACAAATCTTTGACACCATCTATTTTCCCTCCTTTTTTCTCTTTTTGGTTTCTTCATTGATTGACCCATCTATCTTCAATTTTGATTTTTGAGGGTGTTGTTTTGTT
TGTTTGATTTTTTGGGTCTCTTTATTCTGGGTTCATTGATTTTGGTTGAAGGAATCTGAATTGGCGGGAGGTAACGGTCAATTATGTGGTTTTAGGAGGTGGGTTTTGAT
TGGATTTGGATTTGGGATTGGAAAACGTTTGAATTGTTGGGAGTTTTGATGGCTAACCCATCTGGGAACCATCAAGAAGCTGGCCAACCATCGTCTTCTTTCGATGGAGG
GAACCCCAGCAACGGTAATTCGACTCCTGTGCCTACAGCGGATAATTCGAGTTCGGCTCTTGCGATGAAGCACAACCCAGGTATCTCAACGGATTGGACATCTGATGAGC
AGGTCACACTGGAAGAAGGGCTTAAGAAATATGCTGCAGAGTCTAGTGTTATTCGATATGCAAAGATTGCAATGCAGCTACCGAATAAGACTGTACGAGATGTTGCTTTG
CGCTGCAGATGGATGAATGAAAGAGTATCTGACCCTTCATTGAAGTCAGCACAGGTTGCAGCAAGGCCTAATGTGCCTCCTTATGGAATGCCAATGATTCCCATGGACAA
TGATGATGCTATTGGTGGTACAACTGGAGAACTTCTTGAACAGAATGCACATGCAATGCATCAAATTTCTTCTAATCTTGCATCTTTTCAGGTGAAGTACTTGGTTGTCT
GGATGTTGTTTTAGCATTTTTAAGTTTAAGCAAAAATCTAGTCCACCATACCTCAGGTAGTGTTAATTGTAAATTTTATAACCATTTCTAAATGATCCCGAGTAATTTTT
CTTTCAGCCATCTCTGATAGTGATATATTAACTTCTGTACTGGACAGCAAGATTTGCACTCTCTGTAGGTTATGAATCACTCTGGAGACTCCTTTTTCACTTTTGTCCCC
TCGTATAATTGCTGTGAATTAATTTTTCCTTCATATTATTGTGATACCCCTTTTTCGCAGTTGATTTTATGAGCAAGTACATGTTTTTCCTTTCCTTTTATTTTTTCTTC
TTAAATAGTAATTGTTTTTTACTAAAGCCTTTGTAGTTATAATTTTACTTTTCCAATGATTTTTCCCGTTGGAAATCCATTTATATAAGATATCTTCGGTTTTGGGTGAT
TTCTCATCCCTCTCTTTTCTATTTCTCAACCCCCTTGAAACTTCATAACTTTCAAGTTATGCTTAGGGTTGAAACTATTCAAAGTCCCTTTGATTTGTTGGGAGACTCCT
CATCTCCCGTGTTCCCCTCGGCTCCAGTAAAAAAAAAAAGTGTTGTTTTCCATCCAAAAAATGCCTTAGGGTGAAATTCGGTGGCCTGTGGATATTTGTCATGATCTGTG
AGCAGTGGGCACTGGTTCTTTTCTTTCTTAGTACTTGCCTTTTCTTGTCTTCCTTTTTAATCACTTAATTGCATTTACCAAGCCTTTCATGTGTGTGTGTTTCTATGTGT
ATATGTAATATGATATATAGCTGTCCATTCAAACATATTATTTCTAAAACTAAAACATGGGATGTGGCTATAGCGGGCAATATTTGATCATTTGATAATGGGAATTTAGG
TGCACTGGTTTTTTATAATTTTTTGCCAGACAACCTAATGTTCTTTTATAGGAGTAGCCAACATAATGTTTGAACTGATAACCTTTTATTTTATGTAGATACAAGATAAT
ATCAGTCTCTTCTGCCAAACGCGGGATAACATCCTCAAAATAATGAACGACTTAAATGAAATGCCAGAAGTAATGAAGCAGATGCCACCTCTTCCGGTGAAGGTGAACGA
AGAGTTGGCGAACACGATCCTTCCACCGACCGCTCATTCCTTGCAATCATGAAAAATCTTCAACAAAAACTGGTTCAGCAAACAACAATTCTCCCTTTCAAATCACTTCC
TCAAGCAAGGAAAGGAAAGTATATTTTCTCAACTGAACTTACCTTCCCAATTCAGAATTTTCTGCATATCATCAAATTGTTTAGTCTTTCGGTCCACAAAGTTGTTCCCC
TTTAATTATTATTATTATTATTTTTTTTTTGTTTCTTTAGCATATATGGATTCATACATTAGCTTCAAGAGGTAGATTGTAACTTAGATGCTGTGTAATAATTAGGAATG
GTTTTCTTTCTAATGGTCTGGGAAATGAAAAGTAATAAACAATGGTTGATTATTTGT
Protein sequenceShow/hide protein sequence
MANPSGNHQEAGQPSSSFDGGNPSNGNSTPVPTADNSSSALAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMNERVSDPSLKS
AQVAARPNVPPYGMPMIPMDNDDAIGGTTGELLEQNAHAMHQISSNLASFQVKYLVVWMLF