CuGenDBv2

Spg018047 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg018047
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: PHD-type domain-containing protein
Genome location: scaffold9:30913454..30923861
RNA-Seq Expression: Spg018047
Synteny: Spg018047
Gene Ontology terms: GO:0046872 - metal ion binding (molecular function)
InterPro domains: NA
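The genome location above can be split into a scaffold name and coordinates. The following is a minimal sketch, not part of CuGenDB itself: the function name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1. The source page does not confirm this.
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return seqid, start, end, end - start + 1


if __name__ == "__main__":
    seqid, start, end, span = parse_location("scaffold9:30913454..30923861")
    print(seqid, start, end, span)
```

Under that assumption the gene locus spans 10408 bp on scaffold9.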


Homology
GenBank top hits (e value, % identity, alignment)
KAG6592183.1 hypothetical protein SDJN03_14529, partial [Cucurbita argyrosperma subsp. sororia] | e value: 2.3e-99 | % identity: 77.37
Query:  HSYCLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARRSKRKKKKRVKTKRNDDKGQSSSPLQLPEAHCSGKKHDETPGGLGEPVREGGANHG
        H YCLD LPKTFDEYVTWFCEDCE    PIVKS+V+LK+KKL +R K+K K  +K+  NDDK QSS PLQLP+AHC G K+D  PGG GE VREGG NH 
Subjt:  HSYCLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARRSKRKKKKRVKTKRNDDKGQSSSPLQLPEAHCSGKKHDETPGGLGEPVREGGANHG

Query:  EVTTLSDTSNFLGHNCYVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIALYFFAEG
        EV   SDTS+FL HNCYVAQPIVDPVWRGTL  WN+SF  VCVLV HMSSLACSKVYEEAK LPELLS+E+LRR +IWPKGFEKLGPTDQSIALYFFAEG
Subjt:  EVTTLSDTSNFLGHNCYVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIALYFFAEG

Query:  E-SQKAFDLLVNAMMCNDLAMKAVLNNAELLVFTSSVLPMRYW
        E SQKAFDLL+N+MM  DLAMK VL NAELLVFTSSVLPMRYW
Subjt:  E-SQKAFDLLVNAMMCNDLAMKAVLNNAELLVFTSSVLPMRYW

XP_022937265.1 uncharacterized protein LOC111443603 [Cucurbita moschata] | e value: 1.1e-99 | % identity: 77.78
Query:  HSYCLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARRSKRKKKKRVKTKRNDDKGQSSSPLQLPEAHCSGKKHDETPGGLGEPVREGGANHG
        H YCLD LPKTFDEYVTWFCEDCE    PIVKS+V+LK+KKL +R K+K K  +K K NDDK QSS PLQLP+ HCSG ++D  PGG GE VREGG NH 
Subjt:  HSYCLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARRSKRKKKKRVKTKRNDDKGQSSSPLQLPEAHCSGKKHDETPGGLGEPVREGGANHG

Query:  EVTTLSDTSNFLGHNCYVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIALYFFAEG
        EV   SDTS+FL HNCYVAQPIVDPVWRGTL  WN+SF  VCVLV HMSSLACSKVYEEAK+LPELLSVE+LRR +IWPKGFEKLGPTDQSIALYFFAEG
Subjt:  EVTTLSDTSNFLGHNCYVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIALYFFAEG

Query:  E-SQKAFDLLVNAMMCNDLAMKAVLNNAELLVFTSSVLPMRYW
        E SQKAFDLL+N+MM  DLAMK VL NAELLVFTSSVLPMRYW
Subjt:  E-SQKAFDLLVNAMMCNDLAMKAVLNNAELLVFTSSVLPMRYW

XP_023536446.1 uncharacterized protein LOC111797622 [Cucurbita pepo subsp. pepo] | e value: 2.8e-100 | % identity: 78.19
Query:  HSYCLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARRSKRKKKKRVKTKRNDDKGQSSSPLQLPEAHCSGKKHDETPGGLGEPVREGGANHG
        H YCLD LPKTFDEYVTWFCEDCE    PIVKS+V+LK+KKL +R K+K K  +K   NDDK QSS PLQLP+AHCSG ++D  PGG GE VREGG NH 
Subjt:  HSYCLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARRSKRKKKKRVKTKRNDDKGQSSSPLQLPEAHCSGKKHDETPGGLGEPVREGGANHG

Query:  EVTTLSDTSNFLGHNCYVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIALYFFAEG
        EV   SDTS+FL HNCYVAQPIVDPVWRGTL  WN+SF  VCVLV HMSSLACSKVYEEAK LPELLSVE+LRR +IWPKGFEKLGPTDQSIALYFFAEG
Subjt:  EVTTLSDTSNFLGHNCYVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIALYFFAEG

Query:  E-SQKAFDLLVNAMMCNDLAMKAVLNNAELLVFTSSVLPMRYW
        E SQKAFDLL+N+MMC DLAMK VL NAELLVFTSSVLPMRYW
Subjt:  E-SQKAFDLLVNAMMCNDLAMKAVLNNAELLVFTSSVLPMRYW

XP_038899415.1 uncharacterized protein LOC120086715 isoform X1 [Benincasa hispida] | e value: 1.6e-100 | % identity: 76.33
Query:  HSYCLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARR---SKRKKKKRVKTKRNDDKGQSSSPLQLPEAHCSGKKHDETPGGLGEPVREGGA
        H YCLDPLP++FDEYVTW CEDCE   +P V S+VQLKK+KL +R    K+KKKK+ KTKRNDD GQ S+PLQLPEAHCS KK+D TPGG GEPV EGGA
Subjt:  HSYCLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARR---SKRKKKKRVKTKRNDDKGQSSSPLQLPEAHCSGKKHDETPGGLGEPVREGGA

Query:  NHGEVTTLSDTSNFLGHNCYVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIALYFF
        +H +V T SD  N + H+ YVAQPIVDPVWRGTL FWN+S SRVCV+V H+SSLACSKVYEEAK+LPELLSVELLRRC++WP+GF+KLGPTDQSIALYFF
Subjt:  NHGEVTTLSDTSNFLGHNCYVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIALYFF

Query:  AEGESQKAFDLLVNAMMCNDLAMKAVLNNAELLVFTSSVLPMRYW
         + ESQKAFDLLVNAMMC DLAMKAVL NAELLVFTSS+LPMRYW
Subjt:  AEGESQKAFDLLVNAMMCNDLAMKAVLNNAELLVFTSSVLPMRYW

XP_038899416.1 uncharacterized protein LOC120086715 isoform X2 [Benincasa hispida] | e value: 1.6e-100 | % identity: 76.33
Query:  HSYCLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARR---SKRKKKKRVKTKRNDDKGQSSSPLQLPEAHCSGKKHDETPGGLGEPVREGGA
        H YCLDPLP++FDEYVTW CEDCE   +P V S+VQLKK+KL +R    K+KKKK+ KTKRNDD GQ S+PLQLPEAHCS KK+D TPGG GEPV EGGA
Subjt:  HSYCLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARR---SKRKKKKRVKTKRNDDKGQSSSPLQLPEAHCSGKKHDETPGGLGEPVREGGA

Query:  NHGEVTTLSDTSNFLGHNCYVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIALYFF
        +H +V T SD  N + H+ YVAQPIVDPVWRGTL FWN+S SRVCV+V H+SSLACSKVYEEAK+LPELLSVELLRRC++WP+GF+KLGPTDQSIALYFF
Subjt:  NHGEVTTLSDTSNFLGHNCYVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIALYFF

Query:  AEGESQKAFDLLVNAMMCNDLAMKAVLNNAELLVFTSSVLPMRYW
         + ESQKAFDLLVNAMMC DLAMKAVL NAELLVFTSS+LPMRYW
Subjt:  AEGESQKAFDLLVNAMMCNDLAMKAVLNNAELLVFTSSVLPMRYW

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CRJ1 uncharacterized protein LOC111013528 isoform X1 | e value: 2.9e-95 | % identity: 73.2
Query:  HSYCLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARRSKRKKKKRVKTKR-------NDDKGQSSSPLQLPEAHCSGKKHDETPGGLGEPVR
        H YCL  LP+   EYVTWFCEDCE + VP++  +V+ KKKKLA+R K+KK K+ K K           KGQS SP+QLPEA CS K  D TPG  GEPVR
Subjt:  HSYCLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARRSKRKKKKRVKTKR-------NDDKGQSSSPLQLPEAHCSGKKHDETPGGLGEPVR

Query:  EGGANHGEVTTLSDTSNFLGHNCYVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIA
        E G NH E TT S  SNFLGHNCYVAQPIV+PVWRG+LRFWNKSF R+CVLV HMSSLACSKVYEEAKLLP+LLSVELLRRC+IWPKGFEK+GPTDQSIA
Subjt:  EGGANHGEVTTLSDTSNFLGHNCYVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIA

Query:  LYFFAEGE-SQKAFDLLVNAMMCNDLAMKAVLNNAELLVFTSSVLPMRYW
        LYFF EGE S+KAFDLLVN+MMC DLAMKAVL NAELLVFTSS+LPM+YW
Subjt:  LYFFAEGE-SQKAFDLLVNAMMCNDLAMKAVLNNAELLVFTSSVLPMRYW

A0A6J1FG38 uncharacterized protein LOC111443603 | e value: 5.1e-100 | % identity: 77.78
Query:  HSYCLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARRSKRKKKKRVKTKRNDDKGQSSSPLQLPEAHCSGKKHDETPGGLGEPVREGGANHG
        H YCLD LPKTFDEYVTWFCEDCE    PIVKS+V+LK+KKL +R K+K K  +K K NDDK QSS PLQLP+ HCSG ++D  PGG GE VREGG NH 
Subjt:  HSYCLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARRSKRKKKKRVKTKRNDDKGQSSSPLQLPEAHCSGKKHDETPGGLGEPVREGGANHG

Query:  EVTTLSDTSNFLGHNCYVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIALYFFAEG
        EV   SDTS+FL HNCYVAQPIVDPVWRGTL  WN+SF  VCVLV HMSSLACSKVYEEAK+LPELLSVE+LRR +IWPKGFEKLGPTDQSIALYFFAEG
Subjt:  EVTTLSDTSNFLGHNCYVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIALYFFAEG

Query:  E-SQKAFDLLVNAMMCNDLAMKAVLNNAELLVFTSSVLPMRYW
        E SQKAFDLL+N+MM  DLAMK VL NAELLVFTSSVLPMRYW
Subjt:  E-SQKAFDLLVNAMMCNDLAMKAVLNNAELLVFTSSVLPMRYW

A0A6J1IER8 uncharacterized protein LOC111475358 | e value: 1.1e-99 | % identity: 77.37
Query:  HSYCLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARRSKRKKKKRVKTKRNDDKGQSSSPLQLPEAHCSGKKHDETPGGLGEPVREGGANHG
        H YCLD LPKTFDEYVTWFCEDCE    PIVKS+V+LK+KKL +R K+K K  +K K NDD  QS  PLQLP+AHCSG ++D  PGG GE VREGG NH 
Subjt:  HSYCLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARRSKRKKKKRVKTKRNDDKGQSSSPLQLPEAHCSGKKHDETPGGLGEPVREGGANHG

Query:  EVTTLSDTSNFLGHNCYVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIALYFFAEG
        EV   SDTS+FL HNCYVAQPIVDPVWRGTL  WN+SF  VCVLV HMSSLACSKVYEEAK LPELLSVE+LRR +IWPKGFEKLGPTDQSIALYFFAEG
Subjt:  EVTTLSDTSNFLGHNCYVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIALYFFAEG

Query:  E-SQKAFDLLVNAMMCNDLAMKAVLNNAELLVFTSSVLPMRYW
        E SQKAFDLL+N+MMC DLAMK VL NAELLVFTSSVLPM+YW
Subjt:  E-SQKAFDLLVNAMMCNDLAMKAVLNNAELLVFTSSVLPMRYW

A0A6J1IWW2 uncharacterized protein LOC111480637 isoform X2 | e value: 7.9e-93 | % identity: 73.25
Query:  HSYCLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARRSKRKKKKRVKTKRNDDKGQSSSPLQLPEAHCSGKKHDETPGGLGEPVREGGANHG
        H YCLD LPKTFDEYVTWFCEDCE + +P VKS+V+LKKK +            ++K+ND           PEAHCS KK+D TPGGLGE VREGGANHG
Subjt:  HSYCLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARRSKRKKKKRVKTKRNDDKGQSSSPLQLPEAHCSGKKHDETPGGLGEPVREGGANHG

Query:  EVTTLSDTSNFLGHNCYVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIALYFFAEG
        EVTT SD SNFLGHNCYVA+PIVDP+WRGTL+FW K+F+RV V V HMSSLACSKVYEEAK LPE L VELL RCEIWP+GFEKLGPTD SIALYFF E 
Subjt:  EVTTLSDTSNFLGHNCYVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIALYFFAEG

Query:  E-SQKAFDLLVNAMMCNDLAMKAVLNNAELLVFTSSVLPMRYW
        E SQ+AFDLLVNAMMC DLAMKAVL NAELLVFTSS+LPMRYW
Subjt:  E-SQKAFDLLVNAMMCNDLAMKAVLNNAELLVFTSSVLPMRYW

A0A6J1J269 uncharacterized protein LOC111480637 isoform X1 | e value: 7.9e-93 | % identity: 73.25
Query:  HSYCLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARRSKRKKKKRVKTKRNDDKGQSSSPLQLPEAHCSGKKHDETPGGLGEPVREGGANHG
        H YCLD LPKTFDEYVTWFCEDCE + +P VKS+V+LKKK +            ++K+ND           PEAHCS KK+D TPGGLGE VREGGANHG
Subjt:  HSYCLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARRSKRKKKKRVKTKRNDDKGQSSSPLQLPEAHCSGKKHDETPGGLGEPVREGGANHG

Query:  EVTTLSDTSNFLGHNCYVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIALYFFAEG
        EVTT SD SNFLGHNCYVA+PIVDP+WRGTL+FW K+F+RV V V HMSSLACSKVYEEAK LPE L VELL RCEIWP+GFEKLGPTD SIALYFF E 
Subjt:  EVTTLSDTSNFLGHNCYVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIALYFFAEG

Query:  E-SQKAFDLLVNAMMCNDLAMKAVLNNAELLVFTSSVLPMRYW
        E SQ+AFDLLVNAMMC DLAMKAVL NAELLVFTSS+LPMRYW
Subjt:  E-SQKAFDLLVNAMMCNDLAMKAVLNNAELLVFTSSVLPMRYW

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G43770.1 RING/FYVE/PHD zinc finger superfamily protein | e value: 3.8e-15 | % identity: 47.67
Query:  YVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKL-GPTDQSIALYFFAEGE
        Y AQPI  P+WRG +     +   +  +V H+SSLAC KV+E A  L   LS E+L R E+WPK F K  GP D+S+AL+FF   E
Subjt:  YVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKL-GPTDQSIALYFFAEGE

AT1G43770.1 RING/FYVE/PHD zinc finger superfamily protein | e value: 1.8e-01 | % identity: 31.06
Query:  TCSFWSAQHSYCLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARRSKRKKKKRVKTKRNDDKGQSSSPLQLPEAHCSGKKHDET-PGGLGEP
        +C F S  H YCL   P  F EY+TW CEDC+        +EV    KK   + K + +  V    ++   +     +  E+ CS K H+ T   G GE 
Subjt:  TCSFWSAQHSYCLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARRSKRKKKKRVKTKRNDDKGQSSSPLQLPEAHCSGKKHDET-PGGLGEP

Query:  VREGG-----ANHGEVTT----LSDTSNFLGH
        V E        +H   T+    +  T+N LGH
Subjt:  VREGG-----ANHGEVTT----LSDTSNFLGH

AT1G43770.2 RING/FYVE/PHD zinc finger superfamily protein | e value: 1.3e-26 | % identity: 50
Query:  YVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKL-GPTDQSIALYFFAEGES--QKAFDLLVNAM
        Y AQPI  P+WRG +     +   +  +V H+SSLAC KV+E A  L   LS E+L R E+WPK F K  GP D+S+AL+FF   ES  +K FD LV+ M
Subjt:  YVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKL-GPTDQSIALYFFAEGES--QKAFDLLVNAM

Query:  MCNDLAMKAVLNNAELLVFTSSVLPMRYWS
          ND AM+ VLN+AELL+FTS +LP   W+
Subjt:  MCNDLAMKAVLNNAELLVFTSSVLPMRYWS

AT1G43770.2 RING/FYVE/PHD zinc finger superfamily protein | e value: 1.8e-01 | % identity: 31.06
Query:  TCSFWSAQHSYCLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARRSKRKKKKRVKTKRNDDKGQSSSPLQLPEAHCSGKKHDET-PGGLGEP
        +C F S  H YCL   P  F EY+TW CEDC+        +EV    KK   + K + +  V    ++   +     +  E+ CS K H+ T   G GE 
Subjt:  TCSFWSAQHSYCLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARRSKRKKKKRVKTKRNDDKGQSSSPLQLPEAHCSGKKHDET-PGGLGEP

Query:  VREGG-----ANHGEVTT----LSDTSNFLGH
        V E        +H   T+    +  T+N LGH
Subjt:  VREGG-----ANHGEVTT----LSDTSNFLGH

AT3G02890.1 RING/FYVE/PHD zinc finger superfamily protein | e value: 1.5e-11 | % identity: 35.2
Query:  AQPIVDPVWRGTLRFWNKSFSRVCVLVG---HMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIALYFFAEG--ESQKAFDLLVNAM
        A P  + +W+G L    KS +   +  G   ++S+LA  KV E  K  PE +++  + R   WP  F+  G  +Q +AL+FFA+     +K +  LV+ M
Subjt:  AQPIVDPVWRGTLRFWNKSFSRVCVLVG---HMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIALYFFAEG--ESQKAFDLLVNAM

Query:  MCNDLAMKAVLNNAELLVFTSSVLP
        +  DLA+K  L   ELL+F S+ LP
Subjt:  MCNDLAMKAVLNNAELLVFTSSVLP

AT5G16680.1 RING/FYVE/PHD zinc finger superfamily protein | e value: 8.0e-13 | % identity: 37.29
Query:  VWRGTLRFWNKSFSRVCVLVG---HMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIALYFFAEGES--QKAFDLLVNAMMCNDLAM
        +W+G L    K  ++  +  G   H+S+LA  +V E     PE  S+  + R   WP  FEKLG  +  IAL+FFA+     ++ +  LV+ M+ NDLA+
Subjt:  VWRGTLRFWNKSFSRVCVLVG---HMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIALYFFAEGES--QKAFDLLVNAMMCNDLAM

Query:  KAVLNNAELLVFTSSVLP
        K  L+N +LL+F S+ LP
Subjt:  KAVLNNAELLVFTSSVLP


Sequences
CDS sequence
ATGGATAACATTAGAAGGAAAAGAACGATGATTTATGTCCCAGAGGGGTTCAAAGGTAAGGGATGGGGATTGTTAGCAGGGGAAATATCAGATGTGTTATACGGATCCAA
TAAGGCAGGGAAGGCGAAAATGGGGTCCATCGAAAATCCCGACCGAAGGAAAGAAGTAAAGGGATGGGTCATGACCCTTGATATTCCACCATTTCTCCGGTCCAAAAGTT
TGATTTTAGCTGTGGCGAGTCTTTGTGGAGGAATCAAATTGGAGAAGGAGATGATGGAGGCGGATGGAGACAAGTTTGGAAAAGCTGAGAGAATCGTCCTTCATCTGCAG
AAGGATGCAGTCGTAGGGGAACCGATCAATTTGGTTTGTGGGGGTCCCTTAGACTCGGTTCCTGTGAGTAGACAGATGAATAGTTTTGGGGGAAAGGAGAAATTGGAAGA
AGGAACTTATCCCAATCGATGGAAAAATAATGGCTTGAAGTTCCAGCGTTTCGAGAGAGCGAATTTGCAGAGTTTGTTCAGCGAGGAAGGGGCAATCAATCAAAACTCAA
ACCAAAAGAAGAAGAAAGGAAAGGGAGGGGAAGGTCACAAGGGAGAGAACTCTTTGTCTGACGGGTTTAGGGCTATTGAAAGGGCAGTCAATCACGCTAAGCAGATGACT
GAAGTTCAACGAAACATCCAACCAACAGATTCTCATTCTCAGTCCATTATCGAGGTACGCTCTGAAAGGGAAGCGAGTCAGGCGAGACACTTCAGAGACTACTGGGAGAT
TATCCAGGAGTTCCTCCTCTATTCGCCATTCTGCGATAAGGGGAGGTTTTTATGGTTAGCTGGGATGTGGGCTGTTTTATGGGGTCTTTGGGGTGAGAGAAATAATGAGG
TGTTTAGTGGTCTTGAGAGGGGTCCTTCTGATGTGTGGGCCCTCACTAGATTCTATGTTTCTCTTTGGGCCTTGTTTACTTGTTCTTTTTGGAGTGCCCAACACAGCTAT
TGCTTAGATCCTTTGCCAAAAACTTTTGATGAGTATGTTACTTGGTTTTGTGAAGATTGTGAGGAAATGACGGTGCCAATAGTAAAATCTGAAGTTCAATTAAAAAAGAA
AAAGCTTGCTCGACGATCGAAGAGGAAAAAGAAGAAAAGGGTGAAAACGAAAAGGAATGATGATAAAGGTCAAAGTAGTTCTCCTCTTCAACTACCTGAGGCACATTGCA
GTGGAAAGAAGCATGATGAAACACCAGGAGGGCTGGGAGAGCCAGTTCGAGAAGGTGGGGCCAATCATGGTGAAGTCACTACATTATCTGATACATCCAATTTCTTAGGG
CACAACTGTTATGTTGCGCAGCCTATAGTAGATCCGGTCTGGAGGGGAACGTTAAGATTTTGGAACAAAAGTTTTTCCAGAGTTTGTGTACTTGTTGGCCATATGTCTAG
CTTAGCATGCTCAAAAGTGTATGAGGAGGCAAAATTGTTACCTGAGTTACTTTCCGTTGAATTACTTCGTAGATGTGAAATATGGCCCAAGGGATTTGAGAAATTGGGGC
CAACTGATCAAAGTATTGCTCTTTATTTCTTTGCAGAAGGGGAAAGCCAAAAGGCATTTGATCTTCTGGTGAATGCAATGATGTGCAACGACCTAGCCATGAAGGCTGTG
CTGAACAATGCAGAGCTATTAGTTTTTACTTCGTCCGTGTTACCAATGCGATACTGGAGTAAGGCCACAAATGAACTTTTCATAATTTTCAGTTTGTTGATTCAATCTTG
A
mRNA sequence
ATGGATAACATTAGAAGGAAAAGAACGATGATTTATGTCCCAGAGGGGTTCAAAGGTAAGGGATGGGGATTGTTAGCAGGGGAAATATCAGATGTGTTATACGGATCCAA
TAAGGCAGGGAAGGCGAAAATGGGGTCCATCGAAAATCCCGACCGAAGGAAAGAAGTAAAGGGATGGGTCATGACCCTTGATATTCCACCATTTCTCCGGTCCAAAAGTT
TGATTTTAGCTGTGGCGAGTCTTTGTGGAGGAATCAAATTGGAGAAGGAGATGATGGAGGCGGATGGAGACAAGTTTGGAAAAGCTGAGAGAATCGTCCTTCATCTGCAG
AAGGATGCAGTCGTAGGGGAACCGATCAATTTGGTTTGTGGGGGTCCCTTAGACTCGGTTCCTGTGAGTAGACAGATGAATAGTTTTGGGGGAAAGGAGAAATTGGAAGA
AGGAACTTATCCCAATCGATGGAAAAATAATGGCTTGAAGTTCCAGCGTTTCGAGAGAGCGAATTTGCAGAGTTTGTTCAGCGAGGAAGGGGCAATCAATCAAAACTCAA
ACCAAAAGAAGAAGAAAGGAAAGGGAGGGGAAGGTCACAAGGGAGAGAACTCTTTGTCTGACGGGTTTAGGGCTATTGAAAGGGCAGTCAATCACGCTAAGCAGATGACT
GAAGTTCAACGAAACATCCAACCAACAGATTCTCATTCTCAGTCCATTATCGAGGTACGCTCTGAAAGGGAAGCGAGTCAGGCGAGACACTTCAGAGACTACTGGGAGAT
TATCCAGGAGTTCCTCCTCTATTCGCCATTCTGCGATAAGGGGAGGTTTTTATGGTTAGCTGGGATGTGGGCTGTTTTATGGGGTCTTTGGGGTGAGAGAAATAATGAGG
TGTTTAGTGGTCTTGAGAGGGGTCCTTCTGATGTGTGGGCCCTCACTAGATTCTATGTTTCTCTTTGGGCCTTGTTTACTTGTTCTTTTTGGAGTGCCCAACACAGCTAT
TGCTTAGATCCTTTGCCAAAAACTTTTGATGAGTATGTTACTTGGTTTTGTGAAGATTGTGAGGAAATGACGGTGCCAATAGTAAAATCTGAAGTTCAATTAAAAAAGAA
AAAGCTTGCTCGACGATCGAAGAGGAAAAAGAAGAAAAGGGTGAAAACGAAAAGGAATGATGATAAAGGTCAAAGTAGTTCTCCTCTTCAACTACCTGAGGCACATTGCA
GTGGAAAGAAGCATGATGAAACACCAGGAGGGCTGGGAGAGCCAGTTCGAGAAGGTGGGGCCAATCATGGTGAAGTCACTACATTATCTGATACATCCAATTTCTTAGGG
CACAACTGTTATGTTGCGCAGCCTATAGTAGATCCGGTCTGGAGGGGAACGTTAAGATTTTGGAACAAAAGTTTTTCCAGAGTTTGTGTACTTGTTGGCCATATGTCTAG
CTTAGCATGCTCAAAAGTGTATGAGGAGGCAAAATTGTTACCTGAGTTACTTTCCGTTGAATTACTTCGTAGATGTGAAATATGGCCCAAGGGATTTGAGAAATTGGGGC
CAACTGATCAAAGTATTGCTCTTTATTTCTTTGCAGAAGGGGAAAGCCAAAAGGCATTTGATCTTCTGGTGAATGCAATGATGTGCAACGACCTAGCCATGAAGGCTGTG
CTGAACAATGCAGAGCTATTAGTTTTTACTTCGTCCGTGTTACCAATGCGATACTGGAGTAAGGCCACAAATGAACTTTTCATAATTTTCAGTTTGTTGATTCAATCTTG
A
Protein sequence
MDNIRRKRTMIYVPEGFKGKGWGLLAGEISDVLYGSNKAGKAKMGSIENPDRRKEVKGWVMTLDIPPFLRSKSLILAVASLCGGIKLEKEMMEADGDKFGKAERIVLHLQ
KDAVVGEPINLVCGGPLDSVPVSRQMNSFGGKEKLEEGTYPNRWKNNGLKFQRFERANLQSLFSEEGAINQNSNQKKKKGKGGEGHKGENSLSDGFRAIERAVNHAKQMT
EVQRNIQPTDSHSQSIIEVRSEREASQARHFRDYWEIIQEFLLYSPFCDKGRFLWLAGMWAVLWGLWGERNNEVFSGLERGPSDVWALTRFYVSLWALFTCSFWSAQHSY
CLDPLPKTFDEYVTWFCEDCEEMTVPIVKSEVQLKKKKLARRSKRKKKKRVKTKRNDDKGQSSSPLQLPEAHCSGKKHDETPGGLGEPVREGGANHGEVTTLSDTSNFLG
HNCYVAQPIVDPVWRGTLRFWNKSFSRVCVLVGHMSSLACSKVYEEAKLLPELLSVELLRRCEIWPKGFEKLGPTDQSIALYFFAEGESQKAFDLLVNAMMCNDLAMKAV
LNNAELLVFTSSVLPMRYWSKATNELFIIFSLLIQS
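
The relationship between the CDS and protein sequences listed above can be checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code; the page does not say which translation table or tool produced the protein sequence, and the names `CODON_TABLE` and `translate` are illustrative only.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence lines can be pasted in as-is. Codons containing
    characters other than A, C, G, T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nucleotides of the CDS sequence listed above.
    print(translate("ATGGATAACATTAGAAGGAAAAGAACGATG"))  # MDNIRRKRTM
```

The first ten codons of the CDS translate to MDNIRRKRTM, which matches the start of the protein sequence.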