; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi09G017710 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi09G017710
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionDUF2301 domain-containing protein
Genome locationchr09:26480869..26487461
RNA-Seq ExpressionLsi09G017710
SyntenyLsi09G017710
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR019275 - Protein of unknown function DUF2301


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039859.1 DUF2301 domain-containing protein [Cucumis melo var. makuwa]7.1e-10272.51Show/hide
Query:  MACGCLCASILSSPKLLFLNYSAVSRIKLLHRSALSFPSPSKLSALKCKAAGQTSPTSTVYQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAF
        MACGCLCASILS PKL  LNYSA+++ KLL RS +SFPSPSKLSALKCKAAGQTSP+ TVYQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAF
Subjt:  MACGCLCASILSSPKLLFLNYSAVSRIKLLHRSALSFPSPSKLSALKCKAAGQTSPTSTVYQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAF

Query:  LPDNSSLSDTLKQNLDLLYAVGGGGLGLSLVLIHIYVTAIKQTLQALWVLGVAGSLVTYLNLAQPAGDSLVQYVLDNPSAVWFIGPLYAALTGLVFKE--
        LPDNSSL DTLKQNLDLLY +GGGGLGLSL LIHIYVTAIK+TLQALWVLGVAGSLVTY NLAQPAG+SLVQYV+DNPSAVWF+GPLYAALTGLVFKE  
Subjt:  LPDNSSLSDTLKQNLDLLYAVGGGGLGLSLVLIHIYVTAIKQTLQALWVLGVAGSLVTYLNLAQPAGDSLVQYVLDNPSAVWFIGPLYAALTGLVFKE--

Query:  --ESVSCGSLSLL------------------------------------------DDIGDKSVFMFNALGEDEKKALITKLEQQEVSQNAD
            +  G L+ +                                          DDIGDKSVF+FNALGE+EKKALI KLEQQ VSQNAD
Subjt:  --ESVSCGSLSLL------------------------------------------DDIGDKSVFMFNALGEDEKKALITKLEQQEVSQNAD

XP_004140559.1 uncharacterized protein LOC101223108 [Cucumis sativus]1.4e-10272.51Show/hide
Query:  MACGCLCASILSSPKLLFLNYSAVSRIKLLHRSALSFPSPSKLSALKCKAAGQTSPTSTVYQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAF
        MACGCLCASILSSPKL  LNYSA+++ KLL RS +SFP PSKLSA KCKAAGQTSP+ TVYQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAF
Subjt:  MACGCLCASILSSPKLLFLNYSAVSRIKLLHRSALSFPSPSKLSALKCKAAGQTSPTSTVYQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAF

Query:  LPDNSSLSDTLKQNLDLLYAVGGGGLGLSLVLIHIYVTAIKQTLQALWVLGVAGSLVTYLNLAQPAGDSLVQYVLDNPSAVWFIGPLYAALTGLVFKE--
        LPD+SSL DTLKQNLDLLY +GGGGLGLSL LIHIYVTAIK+TLQALWVLGVAGSLVTYLNL+QPAG+SLVQYV+DNPSAVWF+GPLYAALTGLVFKE  
Subjt:  LPDNSSLSDTLKQNLDLLYAVGGGGLGLSLVLIHIYVTAIKQTLQALWVLGVAGSLVTYLNLAQPAGDSLVQYVLDNPSAVWFIGPLYAALTGLVFKE--

Query:  --ESVSCGSLSLL------------------------------------------DDIGDKSVFMFNALGEDEKKALITKLEQQEVSQNAD
            +  G L+ +                                          DDIGDKSVF+FNALGEDEKKALI KLEQQEVSQNAD
Subjt:  --ESVSCGSLSLL------------------------------------------DDIGDKSVFMFNALGEDEKKALITKLEQQEVSQNAD

XP_008459924.1 PREDICTED: uncharacterized protein LOC103498896 [Cucumis melo]9.3e-10272.16Show/hide
Query:  MACGCLCASILSSPKLLFLNYSAVSRIKLLHRSALSFPSPSKLSALKCKAAGQTSPTSTVYQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAF
        MACGCLCASILS PKL  LNYSA+++ +LL RS +SFPSPSKLSALKCKAAGQTSP+ TVYQGIYGPWTVDSSDVREVILYR GLVTAATSFVIASSVAF
Subjt:  MACGCLCASILSSPKLLFLNYSAVSRIKLLHRSALSFPSPSKLSALKCKAAGQTSPTSTVYQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAF

Query:  LPDNSSLSDTLKQNLDLLYAVGGGGLGLSLVLIHIYVTAIKQTLQALWVLGVAGSLVTYLNLAQPAGDSLVQYVLDNPSAVWFIGPLYAALTGLVFKE--
        LPDNSSL DTLKQNLDLLY +GGGGLGLSL LIHIYVTAIK+TLQALWVLGVAGSLVTYLNLAQPAG+SLVQYV+DNPSAVWF+GPLYAALTGLVFKE  
Subjt:  LPDNSSLSDTLKQNLDLLYAVGGGGLGLSLVLIHIYVTAIKQTLQALWVLGVAGSLVTYLNLAQPAGDSLVQYVLDNPSAVWFIGPLYAALTGLVFKE--

Query:  --ESVSCGSLSLL------------------------------------------DDIGDKSVFMFNALGEDEKKALITKLEQQEVSQNAD
            +  G L+ +                                          DDIGDKSVF+FNALGE+EKKALI KLEQQ VSQNAD
Subjt:  --ESVSCGSLSLL------------------------------------------DDIGDKSVFMFNALGEDEKKALITKLEQQEVSQNAD

XP_023004744.1 uncharacterized protein LOC111497955 [Cucurbita maxima]3.4e-10471.15Show/hide
Query:  NRKQCQILWRRQSEMACGCLCASILSSPKLLFLNYSAVSRIKLLHRSALSFPSPSKLSALKCKAAGQTSPTSTVYQGIYGPWTVDSSDVREVILYRAGLV
        N  + +ILWRR+SEMACGCLCASILS   LL LNYSA  + KLL  S ++FPSPSKLSALKCKAAGQ+SPTSTVY+GIYGPWTVD SDVREVILYRAGLV
Subjt:  NRKQCQILWRRQSEMACGCLCASILSSPKLLFLNYSAVSRIKLLHRSALSFPSPSKLSALKCKAAGQTSPTSTVYQGIYGPWTVDSSDVREVILYRAGLV

Query:  TAATSFVIASSVAFLPDNSSLSDTLKQNLDLLYAVGGGGLGLSLVLIHIYVTAIKQTLQALWVLGVAGSLVTYLNLAQPAGDSLVQYVLDNPSAVWFIGP
        TAATSFVIASSVAFLPDNSSLSDTLKQNLDLLYA+GGGGLGLSLVLIHIYVTAIK+TLQA WVLGVAGSLV Y+NLAQPAGDSLVQYV+DNPSAVWFIGP
Subjt:  TAATSFVIASSVAFLPDNSSLSDTLKQNLDLLYAVGGGGLGLSLVLIHIYVTAIKQTLQALWVLGVAGSLVTYLNLAQPAGDSLVQYVLDNPSAVWFIGP

Query:  LYAALTGLVFKE----------------ESVSCGSLS------------------------------LLDDIGDKSVFMFNALGEDEKKALITKLEQQEV
        L+AALTGLVFKE                 ++  G LS                              + DDIGDKSVFMFN LGEDEKKALI KLEQQE+
Subjt:  LYAALTGLVFKE----------------ESVSCGSLS------------------------------LLDDIGDKSVFMFNALGEDEKKALITKLEQQEV

Query:  SQNAD
         QN D
Subjt:  SQNAD

XP_038874651.1 uncharacterized protein LOC120067214 [Benincasa hispida]1.9e-9973.33Show/hide
Query:  MACGCLCASILSSPKLLFLNYSAVSRIKLLHRSALSFPSPSKLSALKCKAAGQTSPTSTVYQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAF
        MACGCLCASILSSPKL  LNYSAV+  KLL RS +S PSPSKLSALKCKAAGQTSP+ TVYQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAF
Subjt:  MACGCLCASILSSPKLLFLNYSAVSRIKLLHRSALSFPSPSKLSALKCKAAGQTSPTSTVYQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAF

Query:  LPDNSSLSDTLKQNLDLLYAVGGGGLGLSLVLIHIYVTAIKQTLQALWVLGVAGSLVTYLNLAQPAGDSLVQYVLDNPSAVWFIGPLYAALTGLVFKE--
        LP+NSSLSD LKQNLDLLYA+GGGGLGLSLVLIHIYVTAIK+TLQALWVLGVAGSLVTYLNLAQPAGDSLVQYV+DNP AVWFIGPL+AALTGLVFKE  
Subjt:  LPDNSSLSDTLKQNLDLLYAVGGGGLGLSLVLIHIYVTAIKQTLQALWVLGVAGSLVTYLNLAQPAGDSLVQYVLDNPSAVWFIGPLYAALTGLVFKE--

Query:  --------------ESVSCGSLS------------------------------LLDDIGDKSVFMFNALGEDEKKALITKLEQQE
                       ++  G L+                              + DDIGDKSVFMFNALGE+EKKALI KLEQ+E
Subjt:  --------------ESVSCGSLS------------------------------LLDDIGDKSVFMFNALGEDEKKALITKLEQQE

TrEMBL top hitse value%identityAlignment
A0A0A0KC45 Uncharacterized protein6.9e-10372.51Show/hide
Query:  MACGCLCASILSSPKLLFLNYSAVSRIKLLHRSALSFPSPSKLSALKCKAAGQTSPTSTVYQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAF
        MACGCLCASILSSPKL  LNYSA+++ KLL RS +SFP PSKLSA KCKAAGQTSP+ TVYQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAF
Subjt:  MACGCLCASILSSPKLLFLNYSAVSRIKLLHRSALSFPSPSKLSALKCKAAGQTSPTSTVYQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAF

Query:  LPDNSSLSDTLKQNLDLLYAVGGGGLGLSLVLIHIYVTAIKQTLQALWVLGVAGSLVTYLNLAQPAGDSLVQYVLDNPSAVWFIGPLYAALTGLVFKE--
        LPD+SSL DTLKQNLDLLY +GGGGLGLSL LIHIYVTAIK+TLQALWVLGVAGSLVTYLNL+QPAG+SLVQYV+DNPSAVWF+GPLYAALTGLVFKE  
Subjt:  LPDNSSLSDTLKQNLDLLYAVGGGGLGLSLVLIHIYVTAIKQTLQALWVLGVAGSLVTYLNLAQPAGDSLVQYVLDNPSAVWFIGPLYAALTGLVFKE--

Query:  --ESVSCGSLSLL------------------------------------------DDIGDKSVFMFNALGEDEKKALITKLEQQEVSQNAD
            +  G L+ +                                          DDIGDKSVF+FNALGEDEKKALI KLEQQEVSQNAD
Subjt:  --ESVSCGSLSLL------------------------------------------DDIGDKSVFMFNALGEDEKKALITKLEQQEVSQNAD

A0A1S3CBS2 uncharacterized protein LOC1034988964.5e-10272.16Show/hide
Query:  MACGCLCASILSSPKLLFLNYSAVSRIKLLHRSALSFPSPSKLSALKCKAAGQTSPTSTVYQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAF
        MACGCLCASILS PKL  LNYSA+++ +LL RS +SFPSPSKLSALKCKAAGQTSP+ TVYQGIYGPWTVDSSDVREVILYR GLVTAATSFVIASSVAF
Subjt:  MACGCLCASILSSPKLLFLNYSAVSRIKLLHRSALSFPSPSKLSALKCKAAGQTSPTSTVYQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAF

Query:  LPDNSSLSDTLKQNLDLLYAVGGGGLGLSLVLIHIYVTAIKQTLQALWVLGVAGSLVTYLNLAQPAGDSLVQYVLDNPSAVWFIGPLYAALTGLVFKE--
        LPDNSSL DTLKQNLDLLY +GGGGLGLSL LIHIYVTAIK+TLQALWVLGVAGSLVTYLNLAQPAG+SLVQYV+DNPSAVWF+GPLYAALTGLVFKE  
Subjt:  LPDNSSLSDTLKQNLDLLYAVGGGGLGLSLVLIHIYVTAIKQTLQALWVLGVAGSLVTYLNLAQPAGDSLVQYVLDNPSAVWFIGPLYAALTGLVFKE--

Query:  --ESVSCGSLSLL------------------------------------------DDIGDKSVFMFNALGEDEKKALITKLEQQEVSQNAD
            +  G L+ +                                          DDIGDKSVF+FNALGE+EKKALI KLEQQ VSQNAD
Subjt:  --ESVSCGSLSLL------------------------------------------DDIGDKSVFMFNALGEDEKKALITKLEQQEVSQNAD

A0A5D3DMA3 DUF2301 domain-containing protein3.4e-10272.51Show/hide
Query:  MACGCLCASILSSPKLLFLNYSAVSRIKLLHRSALSFPSPSKLSALKCKAAGQTSPTSTVYQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAF
        MACGCLCASILS PKL  LNYSA+++ KLL RS +SFPSPSKLSALKCKAAGQTSP+ TVYQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAF
Subjt:  MACGCLCASILSSPKLLFLNYSAVSRIKLLHRSALSFPSPSKLSALKCKAAGQTSPTSTVYQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAF

Query:  LPDNSSLSDTLKQNLDLLYAVGGGGLGLSLVLIHIYVTAIKQTLQALWVLGVAGSLVTYLNLAQPAGDSLVQYVLDNPSAVWFIGPLYAALTGLVFKE--
        LPDNSSL DTLKQNLDLLY +GGGGLGLSL LIHIYVTAIK+TLQALWVLGVAGSLVTY NLAQPAG+SLVQYV+DNPSAVWF+GPLYAALTGLVFKE  
Subjt:  LPDNSSLSDTLKQNLDLLYAVGGGGLGLSLVLIHIYVTAIKQTLQALWVLGVAGSLVTYLNLAQPAGDSLVQYVLDNPSAVWFIGPLYAALTGLVFKE--

Query:  --ESVSCGSLSLL------------------------------------------DDIGDKSVFMFNALGEDEKKALITKLEQQEVSQNAD
            +  G L+ +                                          DDIGDKSVF+FNALGE+EKKALI KLEQQ VSQNAD
Subjt:  --ESVSCGSLSLL------------------------------------------DDIGDKSVFMFNALGEDEKKALITKLEQQEVSQNAD

A0A6J1KX72 uncharacterized protein LOC1114979551.6e-10471.15Show/hide
Query:  NRKQCQILWRRQSEMACGCLCASILSSPKLLFLNYSAVSRIKLLHRSALSFPSPSKLSALKCKAAGQTSPTSTVYQGIYGPWTVDSSDVREVILYRAGLV
        N  + +ILWRR+SEMACGCLCASILS   LL LNYSA  + KLL  S ++FPSPSKLSALKCKAAGQ+SPTSTVY+GIYGPWTVD SDVREVILYRAGLV
Subjt:  NRKQCQILWRRQSEMACGCLCASILSSPKLLFLNYSAVSRIKLLHRSALSFPSPSKLSALKCKAAGQTSPTSTVYQGIYGPWTVDSSDVREVILYRAGLV

Query:  TAATSFVIASSVAFLPDNSSLSDTLKQNLDLLYAVGGGGLGLSLVLIHIYVTAIKQTLQALWVLGVAGSLVTYLNLAQPAGDSLVQYVLDNPSAVWFIGP
        TAATSFVIASSVAFLPDNSSLSDTLKQNLDLLYA+GGGGLGLSLVLIHIYVTAIK+TLQA WVLGVAGSLV Y+NLAQPAGDSLVQYV+DNPSAVWFIGP
Subjt:  TAATSFVIASSVAFLPDNSSLSDTLKQNLDLLYAVGGGGLGLSLVLIHIYVTAIKQTLQALWVLGVAGSLVTYLNLAQPAGDSLVQYVLDNPSAVWFIGP

Query:  LYAALTGLVFKE----------------ESVSCGSLS------------------------------LLDDIGDKSVFMFNALGEDEKKALITKLEQQEV
        L+AALTGLVFKE                 ++  G LS                              + DDIGDKSVFMFN LGEDEKKALI KLEQQE+
Subjt:  LYAALTGLVFKE----------------ESVSCGSLS------------------------------LLDDIGDKSVFMFNALGEDEKKALITKLEQQEV

Query:  SQNAD
         QN D
Subjt:  SQNAD

E5GB55 Uncharacterized protein4.5e-10272.16Show/hide
Query:  MACGCLCASILSSPKLLFLNYSAVSRIKLLHRSALSFPSPSKLSALKCKAAGQTSPTSTVYQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAF
        MACGCLCASILS PKL  LNYSA+++ +LL RS +SFPSPSKLSALKCKAAGQTSP+ TVYQGIYGPWTVDSSDVREVILYR GLVTAATSFVIASSVAF
Subjt:  MACGCLCASILSSPKLLFLNYSAVSRIKLLHRSALSFPSPSKLSALKCKAAGQTSPTSTVYQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAF

Query:  LPDNSSLSDTLKQNLDLLYAVGGGGLGLSLVLIHIYVTAIKQTLQALWVLGVAGSLVTYLNLAQPAGDSLVQYVLDNPSAVWFIGPLYAALTGLVFKE--
        LPDNSSL DTLKQNLDLLY +GGGGLGLSL LIHIYVTAIK+TLQALWVLGVAGSLVTYLNLAQPAG+SLVQYV+DNPSAVWF+GPLYAALTGLVFKE  
Subjt:  LPDNSSLSDTLKQNLDLLYAVGGGGLGLSLVLIHIYVTAIKQTLQALWVLGVAGSLVTYLNLAQPAGDSLVQYVLDNPSAVWFIGPLYAALTGLVFKE--

Query:  --ESVSCGSLSLL------------------------------------------DDIGDKSVFMFNALGEDEKKALITKLEQQEVSQNAD
            +  G L+ +                                          DDIGDKSVF+FNALGE+EKKALI KLEQQ VSQNAD
Subjt:  --ESVSCGSLSLL------------------------------------------DDIGDKSVFMFNALGEDEKKALITKLEQQEVSQNAD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G28140.1 unknown protein1.9e-6054.24Show/hide
Query:  AGQTSPTSTVYQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDTLKQNLDLLYAVGGGGLGLSLVLIHIYVTAIKQTLQALWVL
        + Q +   TVY+G+YGPWT+D +DV+EVILYR+GLVTAA SFV ASS AFLP +S LS+T+KQN DL Y VG  GLGLSL LIHIYVT IK+TLQALW L
Subjt:  AGQTSPTSTVYQGIYGPWTVDSSDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDTLKQNLDLLYAVGGGGLGLSLVLIHIYVTAIKQTLQALWVL

Query:  GVAGSLVTYLNLAQPAGDSLVQYVLDNPSAVWFIGPLYAALTGLVFKE----------------ESVSCGSLS---------------------------
        G  GS  TY  LA+PAGD+LV YV+D+PSAVWF+GPL+A+LTGLVFKE                 SV  G LS                           
Subjt:  GVAGSLVTYLNLAQPAGDSLVQYVLDNPSAVWFIGPLYAALTGLVFKE----------------ESVSCGSLS---------------------------

Query:  ---LLDDIGDKSVFMFNALGEDEKKALITKLEQQEV
           + DDIGDKSVF F +L +DEKKA++ KLEQ+++
Subjt:  ---LLDDIGDKSVFMFNALGEDEKKALITKLEQQEV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTCTAATTATTATTATTATTTTTTTGTTAAACGAAGAAGAAGAAGAAAATATTTACTGGTACGGCGGAATCGGAAACAGTGTCAGATTCTGTGGCGGAGGCAGTC
CGAAATGGCGTGTGGATGTTTATGTGCCTCCATACTTTCTTCCCCGAAGCTTCTTTTTCTTAATTATTCAGCTGTTTCTAGAATTAAGTTGCTGCATCGTTCCGCACTCT
CTTTCCCTTCACCATCTAAATTATCAGCTCTCAAGTGCAAGGCCGCTGGCCAAACCTCCCCCACTTCCACCGTTTATCAGGGAATTTACGGTCCTTGGACTGTTGATTCT
TCCGACGTTCGAGAGGTAATTTTATATAGAGCGGGATTAGTGACAGCTGCTACCTCTTTTGTGATAGCTTCATCAGTTGCCTTTTTACCCGATAACTCTTCATTGAGTGA
CACACTTAAGCAAAATCTTGATTTGCTCTACGCCGTAGGTGGAGGAGGATTAGGCTTATCCCTAGTCCTGATTCACATATATGTAACTGCAATTAAGCAAACTCTTCAAG
CTTTATGGGTGCTTGGTGTTGCTGGATCGTTGGTAACTTACTTAAATCTTGCACAACCAGCTGGGGACAGCTTAGTGCAGTATGTTCTTGATAATCCATCGGCAGTCTGG
TTTATTGGTCCTCTCTATGCAGCACTGACTGGACTTGTCTTCAAAGAAGAATCTGTCTCCTGTGGTTCTTTGTCCCTCCTGGATGATATTGGCGACAAATCTGTTTTCAT
GTTCAATGCACTTGGAGAGGACGAAAAGAAGGCCTTGATTACAAAGCTTGAGCAGCAAGAGGTTAGTCAGAATGCTGATTAG
mRNA sequenceShow/hide mRNA sequence
GTACGCTCTGTAGTTAGTTATGAATTCTAATTATTATTATTATTTTTTTGTTAAACGAAGAAGAAGAAGAAAATATTTACTGGTACGGCGGAATCGGAAACAGTGTCAGA
TTCTGTGGCGGAGGCAGTCCGAAATGGCGTGTGGATGTTTATGTGCCTCCATACTTTCTTCCCCGAAGCTTCTTTTTCTTAATTATTCAGCTGTTTCTAGAATTAAGTTG
CTGCATCGTTCCGCACTCTCTTTCCCTTCACCATCTAAATTATCAGCTCTCAAGTGCAAGGCCGCTGGCCAAACCTCCCCCACTTCCACCGTTTATCAGGGAATTTACGG
TCCTTGGACTGTTGATTCTTCCGACGTTCGAGAGGTAATTTTATATAGAGCGGGATTAGTGACAGCTGCTACCTCTTTTGTGATAGCTTCATCAGTTGCCTTTTTACCCG
ATAACTCTTCATTGAGTGACACACTTAAGCAAAATCTTGATTTGCTCTACGCCGTAGGTGGAGGAGGATTAGGCTTATCCCTAGTCCTGATTCACATATATGTAACTGCA
ATTAAGCAAACTCTTCAAGCTTTATGGGTGCTTGGTGTTGCTGGATCGTTGGTAACTTACTTAAATCTTGCACAACCAGCTGGGGACAGCTTAGTGCAGTATGTTCTTGA
TAATCCATCGGCAGTCTGGTTTATTGGTCCTCTCTATGCAGCACTGACTGGACTTGTCTTCAAAGAAGAATCTGTCTCCTGTGGTTCTTTGTCCCTCCTGGATGATATTG
GCGACAAATCTGTTTTCATGTTCAATGCACTTGGAGAGGACGAAAAGAAGGCCTTGATTACAAAGCTTGAGCAGCAAGAGGTTAGTCAGAATGCTGATTAGATCATCAGA
AAAACACCCTGCAGGAAGAATCGTTCAGCCAGGAGTCTTAGCAGACGATAAGTCCATACCTTCTTACTGCATCTTGTACTATAGTTAATAGTTTATACAGAACTCTACTA
GTAGTAGCTGTTCATGTTATTTTTGCAGTATAATATATATACTAGAAATTTACCAATGATAGGTTGTGGAAGGATACTTGGCAATCGCAATTCCATAGTTACTTGAACAA
GAACAAACATGAAGATATAGACATAAACTTCAGTATCTAATTTATTCAAATTGTACAAGAAGTTCAGCGCGATGGGACTAGGCAGGGAAGAAAACAAAAAAAAAAGGTTT
CAAATAGCAGGGAATAGAGAGCCCCTTAGAAATCCATCCATTACAGGCAAATTCTGGTGGAGAACCTTATTATTATTATCACCAGTTCCCTTGCCGTCAAAGCAGTAGAC
AGGGGCTTGCCATTTCTCATCCTCCAGCACAGCACCCACTTCAAATGTCATCACATGAGCCTGTCTTCCTGCACAATTGGGGAAGGGAAAGGGAAAGGGAAACCTGATTG
AATAATGAATGGCAGGAAATTAAGAATGAAAGTGAAGAGGTAGGGGAAAAAGAGGGTTACCAGTGTAGAAGAGCCAATGAACAGGCCTCTTGGTCTCAACATCCTCGTAA
TACCAGATGAAATCGACCTTCTCCCAGACATTACAGAGGAAGCCATCGACATGGCGTTGACCCAGATAATTGGCTCCGTCGAGCCAGTTGGGTCGGAGAATACCCACCTC
AAGCTGAGCGGAAGAGCAAGTCTTGGAAGAGTCCAAAGTGTAGAAGAAGGAAGTGCCGTTGTTCCATTCGAGGTCGTAGAGAACGTTGCCGAGCTGGTGTTGGATGATGT
TGAAATTCCGACCATTAGGCCAGTCGTACCAGAGGTTGATTATCTGCAGAATTCCGGTGTGATTCATGAGGAGAATGGAGTGAAATTGGAGAGGCCATGGAGTTGGAACG
GGATCCTCCGAAATGGAATAACGTACACTCACACTCAGCAATGAAAGGATAAGGAAGAAGCTCGCCTTGGAAGCCATGGTTTTCTGCCTCTGGGTTCTCTCTCCTCTCTC
GCCCTTTCTCTAATTTATTTCTTTCTCGTCCTATCAACTCAACTCGCTTCAATTTTTTCTTTTGTTCGAATATTTTTTACGAAAAGTACTTAGTGCTGACTACGATTGTG
CTCTATTCAATTGTTAAATCAAAATTTCTCCGGTGACAAATATATTTTTTAAAATTTTTTTTAGATAAATTACAAATTTGGTCCAATCGGAGGAAA
Protein sequenceShow/hide protein sequence
MNSNYYYYFFVKRRRRRKYLLVRRNRKQCQILWRRQSEMACGCLCASILSSPKLLFLNYSAVSRIKLLHRSALSFPSPSKLSALKCKAAGQTSPTSTVYQGIYGPWTVDS
SDVREVILYRAGLVTAATSFVIASSVAFLPDNSSLSDTLKQNLDLLYAVGGGGLGLSLVLIHIYVTAIKQTLQALWVLGVAGSLVTYLNLAQPAGDSLVQYVLDNPSAVW
FIGPLYAALTGLVFKEESVSCGSLSLLDDIGDKSVFMFNALGEDEKKALITKLEQQEVSQNAD