CuGenDBv2

Spg002872 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg002872
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: PH domain-containing protein
Genome location: scaffold6:415692..419359
RNA-Seq Expression: Spg002872
Synteny: Spg002872
Gene Ontology terms: GO:0110165 - cellular anatomical structure (cellular component)
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain
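
The genome location field above packs a scaffold name and two coordinates into one string. A minimal Python sketch for splitting it follows; the helper name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive at both ends, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


scaffold, start, end = parse_location("scaffold6:415692..419359")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(scaffold, start, end, span)  # scaffold6 415692 419359 3668
```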


Homology
GenBank top hits (e value, % identity, alignment)
KAA0035739.1 hypothetical protein E6C27_scaffold403G00100 [Cucumis melo var. makuwa]; e value 6.2e-43; identity 40.23%
Query:  ILGVLDSLVVGSSDDARVWSLENSGQFTVSSLSHQLGSSFPIQSDLFWAIWKSKNPKRINILMLIIFNGSLNTSDVLQRKLPFFSLFPSICPLCLKAGDS
        +L +L +  V +SDD R WS+E+ G+F+  SLS  L ++ P+   LF AI +S +P+RINIL+ I+    + +S++LQ+K P  ++ PSICPLCLKA  +
Subjt:  ILGVLDSLVVGSSDDARVWSLENSGQFTVSSLSHQLGSSFPIQSDLFWAIWKSKNPKRINILMLIIFNGSLNTSDVLQRKLPFFSLFPSICPLCLKAGDS

Query:  ALHLFFECAYSQLCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLAHLESACLKASSWCTLSKS
          H+F  C  S   W + F++FN+ W F +S+  +V+QLL G +    PR++W    KAL+ EIW+ERNQR+F DKA      + +A L A++WC+L K 
Subjt:  ALHLFFECAYSQLCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLAHLESACLKASSWCTLSKS

Query:  FVAFSSQDICFNW---HSFIFPLNVKYVNKEAYWVRKNW---DVLKIDLESSLVVS
        FV +S QDIC NW   H     + +++  K+    R  W     LKI + S L ++
Subjt:  FVAFSSQDICFNW---HSFIFPLNVKYVNKEAYWVRKNW---DVLKIDLESSLVVS

TYK21876.1 hypothetical protein E5676_scaffold494G00090 [Cucumis melo var. makuwa]; e value 5.6e-44; identity 44.24%
Query:  ILGVLDSLVVGSSDDARVWSLENSGQFTVSSLSHQLGSSFPIQSDLFWAIWKSKNPKRINILMLIIFNGSLNTSDVLQRKLPFFSLFPSICPLCLKAGDS
        +L +L +  V +SDD R WS+E+ G+F+  SLS  L ++ P+   LF AI +S +P+RINIL+ I+    +N+S++LQ+K P  ++ PSICPLCLKA  +
Subjt:  ILGVLDSLVVGSSDDARVWSLENSGQFTVSSLSHQLGSSFPIQSDLFWAIWKSKNPKRINILMLIIFNGSLNTSDVLQRKLPFFSLFPSICPLCLKAGDS

Query:  ALHLFFECAYSQLCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLAHLESACLKASSWCTLSKS
          H+F  C  S   W + F++FN+ W F +S+  +V+QLL G +    PR++W    KAL+ EIW+ERNQR+F DKA      + +A L A++WC+L K 
Subjt:  ALHLFFECAYSQLCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLAHLESACLKASSWCTLSKS

Query:  FVAFSSQDICFNWHSFI
        FV +S QDIC NW+ F+
Subjt:  FVAFSSQDICFNWHSFI

XP_038888105.1 uncharacterized protein LOC120078006 isoform X1 [Benincasa hispida]; e value 1.4e-42; identity 58.54%
Query:  DNHPSPSQTESLHLEWSPVVSSSQRFEAVADFQL------------------------IYFGDIIVKGFERPNSSIILHLSGDEELFKFPLDFQLVGQIK
        D+H SPSQTE+LHLE SP+V+S QRFEAV  FQL                        IYFGD+++KGFE PN+SIIL  +GDEEL+K PLDFQLVGQIK
Subjt:  DNHPSPSQTESLHLEWSPVVSSSQRFEAVADFQL------------------------IYFGDIIVKGFERPNSSIILHLSGDEELFKFPLDFQLVGQIK

Query:  QQKGLEVISFWLTKAPPGFVPLGCFA--YKGLLEVFSALGFMQMDLVSWDQFLKVSAWDSSDVR
        +Q+G+E ISFWL +APPGFV LGC A   K  L+ FSALG ++MD+V+WDQF++ SAWDSSD +
Subjt:  QQKGLEVISFWLTKAPPGFVPLGCFA--YKGLLEVFSALGFMQMDLVSWDQFLKVSAWDSSDVR

XP_038888106.1 uncharacterized protein LOC120078006 isoform X2 [Benincasa hispida]; e value 1.4e-42; identity 58.54%
Query:  DNHPSPSQTESLHLEWSPVVSSSQRFEAVADFQL------------------------IYFGDIIVKGFERPNSSIILHLSGDEELFKFPLDFQLVGQIK
        D+H SPSQTE+LHLE SP+V+S QRFEAV  FQL                        IYFGD+++KGFE PN+SIIL  +GDEEL+K PLDFQLVGQIK
Subjt:  DNHPSPSQTESLHLEWSPVVSSSQRFEAVADFQL------------------------IYFGDIIVKGFERPNSSIILHLSGDEELFKFPLDFQLVGQIK

Query:  QQKGLEVISFWLTKAPPGFVPLGCFA--YKGLLEVFSALGFMQMDLVSWDQFLKVSAWDSSDVR
        +Q+G+E ISFWL +APPGFV LGC A   K  L+ FSALG ++MD+V+WDQF++ SAWDSSD +
Subjt:  QQKGLEVISFWLTKAPPGFVPLGCFA--YKGLLEVFSALGFMQMDLVSWDQFLKVSAWDSSDVR

XP_038903695.1 uncharacterized protein LOC120090219 [Benincasa hispida]; e value 1.4e-47; identity 47.32%
Query:  SDDARVWSLENSGQFTVSSLSHQLGSSFPIQSDLFWAIWKSKNPKRINILMLIIFNGSLNTSDVLQRKLPFFSLFPSICPLCLKAGDSALHLFFECAYSQ
        S D RVWS+ N+ Q+TV SL + L    P++  +F  IWK+K+P+R+NIL+ I+  G LN ++VLQ+K P  SL P++CP CL   + +LHLFF C YS 
Subjt:  SDDARVWSLENSGQFTVSSLSHQLGSSFPIQSDLFWAIWKSKNPKRINILMLIIFNGSLNTSDVLQRKLPFFSLFPSICPLCLKAGDSALHLFFECAYSQ

Query:  LCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLAHLESACLKASSWCTLSKSFVAFSSQDICFN
         CW+K    FN+     N  K NV QLL  P+     RLLW N VKAL++++W ERNQR+F +KA      LE+A  +ASSWC LS  F A+S  D   N
Subjt:  LCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLAHLESACLKASSWCTLSKSFVAFSSQDICFN

Query:  WHSFI
        W +FI
Subjt:  WHSFI

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LEI9 PH domain-containing protein; e value 8.8e-43; identity 54.44%
Query:  DNHPSPSQTESLHLEWSPVVSSSQRFEAVADFQL------------------------IYFGDIIVKGFERPNSSIILHLSGDEELFKFPLDFQLVGQIK
        D+H SPSQTE+ HLE SP+V+S QRFEAVA+FQL                        IYFGD+ +KGFE PN+SI+LH +GDEEL+K PLDFQLVGQIK
Subjt:  DNHPSPSQTESLHLEWSPVVSSSQRFEAVADFQL------------------------IYFGDIIVKGFERPNSSIILHLSGDEELFKFPLDFQLVGQIK

Query:  QQKGLEVISFWLTKAPPGFVPLGCFA--YKGLLEVFSALGFMQMDLVSWDQFLKVSAWDSSDVR----SFSIAVLDVFAG
         Q+G+E ISFWL +AP GFV LGC A  +K  L+ FSALG M+MD+V+WDQ ++ SAWDSSD +     FS+ ++ +  G
Subjt:  QQKGLEVISFWLTKAPPGFVPLGCFA--YKGLLEVFSALGFMQMDLVSWDQFLKVSAWDSSDVR----SFSIAVLDVFAG

A0A1S3CPY5 uncharacterized protein LOC103503494 isoform X1; e value 1.9e-42; identity 53.33%
Query:  DNHPSPSQTESLHLEWSPVVSSSQRFEAVADFQL------------------------IYFGDIIVKGFERPNSSIILHLSGDEELFKFPLDFQLVGQIK
        D+H  P QTE+ HLE SP+V+S QRFEAVADFQL                        I+FGD+ +KGFERP++SI+LH +GDEEL++ PLDFQLVGQIK
Subjt:  DNHPSPSQTESLHLEWSPVVSSSQRFEAVADFQL------------------------IYFGDIIVKGFERPNSSIILHLSGDEELFKFPLDFQLVGQIK

Query:  QQKGLEVISFWLTKAPPGFVPLGCFA--YKGLLEVFSALGFMQMDLVSWDQFLKVSAWDSSDVR----SFSIAVLDVFAG
         Q+G+E ISFWL +APPGFV LGC A  +K  L+ FSALG M+MD+V+WDQ ++ SAWDSSD +     FS+ ++ +  G
Subjt:  QQKGLEVISFWLTKAPPGFVPLGCFA--YKGLLEVFSALGFMQMDLVSWDQFLKVSAWDSSDVR----SFSIAVLDVFAG

A0A5A7T2Y0 zf-RVT domain-containing protein; e value 3.0e-43; identity 40.23%
Query:  ILGVLDSLVVGSSDDARVWSLENSGQFTVSSLSHQLGSSFPIQSDLFWAIWKSKNPKRINILMLIIFNGSLNTSDVLQRKLPFFSLFPSICPLCLKAGDS
        +L +L +  V +SDD R WS+E+ G+F+  SLS  L ++ P+   LF AI +S +P+RINIL+ I+    + +S++LQ+K P  ++ PSICPLCLKA  +
Subjt:  ILGVLDSLVVGSSDDARVWSLENSGQFTVSSLSHQLGSSFPIQSDLFWAIWKSKNPKRINILMLIIFNGSLNTSDVLQRKLPFFSLFPSICPLCLKAGDS

Query:  ALHLFFECAYSQLCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLAHLESACLKASSWCTLSKS
          H+F  C  S   W + F++FN+ W F +S+  +V+QLL G +    PR++W    KAL+ EIW+ERNQR+F DKA      + +A L A++WC+L K 
Subjt:  ALHLFFECAYSQLCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLAHLESACLKASSWCTLSKS

Query:  FVAFSSQDICFNW---HSFIFPLNVKYVNKEAYWVRKNW---DVLKIDLESSLVVS
        FV +S QDIC NW   H     + +++  K+    R  W     LKI + S L ++
Subjt:  FVAFSSQDICFNW---HSFIFPLNVKYVNKEAYWVRKNW---DVLKIDLESSLVVS

A0A5A7T4X4 PH domain-containing protein; e value 1.9e-42; identity 53.33%
Query:  DNHPSPSQTESLHLEWSPVVSSSQRFEAVADFQL------------------------IYFGDIIVKGFERPNSSIILHLSGDEELFKFPLDFQLVGQIK
        D+H  P QTE+ HLE SP+V+S QRFEAVADFQL                        I+FGD+ +KGFERP++SI+LH +GDEEL++ PLDFQLVGQIK
Subjt:  DNHPSPSQTESLHLEWSPVVSSSQRFEAVADFQL------------------------IYFGDIIVKGFERPNSSIILHLSGDEELFKFPLDFQLVGQIK

Query:  QQKGLEVISFWLTKAPPGFVPLGCFA--YKGLLEVFSALGFMQMDLVSWDQFLKVSAWDSSDVR----SFSIAVLDVFAG
         Q+G+E ISFWL +APPGFV LGC A  +K  L+ FSALG M+MD+V+WDQ ++ SAWDSSD +     FS+ ++ +  G
Subjt:  QQKGLEVISFWLTKAPPGFVPLGCFA--YKGLLEVFSALGFMQMDLVSWDQFLKVSAWDSSDVR----SFSIAVLDVFAG

A0A5D3DE60 zf-RVT domain-containing protein; e value 2.7e-44; identity 44.24%
Query:  ILGVLDSLVVGSSDDARVWSLENSGQFTVSSLSHQLGSSFPIQSDLFWAIWKSKNPKRINILMLIIFNGSLNTSDVLQRKLPFFSLFPSICPLCLKAGDS
        +L +L +  V +SDD R WS+E+ G+F+  SLS  L ++ P+   LF AI +S +P+RINIL+ I+    +N+S++LQ+K P  ++ PSICPLCLKA  +
Subjt:  ILGVLDSLVVGSSDDARVWSLENSGQFTVSSLSHQLGSSFPIQSDLFWAIWKSKNPKRINILMLIIFNGSLNTSDVLQRKLPFFSLFPSICPLCLKAGDS

Query:  ALHLFFECAYSQLCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLAHLESACLKASSWCTLSKS
          H+F  C  S   W + F++FN+ W F +S+  +V+QLL G +    PR++W    KAL+ EIW+ERNQR+F DKA      + +A L A++WC+L K 
Subjt:  ALHLFFECAYSQLCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLAHLESACLKASSWCTLSKS

Query:  FVAFSSQDICFNWHSFI
        FV +S QDIC NW+ F+
Subjt:  FVAFSSQDICFNWHSFI

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G48090.1 calcium-dependent lipid-binding family protein; e value 3.6e-04; identity 38.81%
Query:  GDIIVKGFERPNSSIILHLSGDEELFKFPLDFQLVGQIKQQKGLEVISFWLTKAPPGFVPLGCFAYK
        GD I +G E P    IL  + D E+   P+ F  V  I   KG + +  W   APPG+V LGC   K
Subjt:  GDIIVKGFERPNSSIILHLSGDEELFKFPLDFQLVGQIKQQKGLEVISFWLTKAPPGFVPLGCFAYK

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein; e value 4.5e-07; identity 29.01%
Query:  DDARVWSLENSGQFTVSSLSHQLGSSFPIQSDLFW--AIW-KSKNPKRINILMLIIFNGSLNTSDVLQRKLPFFSLFPSICPLCLKAGDSALHLFFECAY
        DD+ +W  +        S      +  P    + W  A+W K+  PK   I  ++ +N  L+T D LQ    +    P+ C LC    DS  HLFFEC +
Subjt:  DDARVWSLENSGQFTVSSLSHQLGSSFPIQSDLFW--AIW-KSKNPKRINILMLIIFNGSLNTSDVLQRKLPFFSLFPSICPLCLKAGDSALHLFFECAY

Query:  SQLCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRV
        S + W  F A  N+      +   + L  L+ PS      L+      + +  IW ERNQR+
Subjt:  SQLCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRV

AT4G17140.1 pleckstrin homology (PH) domain-containing protein; e value 1.5e-26; identity 44.44%
Query:  VSSSQRFEAVADFQLI------------------------YFGDIIVKGFERPNSSIILHLSGDEELFKFPLDFQLVGQIKQQKGLEVISFWLTKAPPGF
        V+S  RFEAVA F+LI                        YFGDI V G+E PNS ++LH + D+E+ K  +DFQLVG++K+ +G+E ISFW+ +APPGF
Subjt:  VSSSQRFEAVADFQLI------------------------YFGDIIVKGFERPNSSIILHLSGDEELFKFPLDFQLVGQIKQQKGLEVISFWLTKAPPGF

Query:  VPLGCFAYKGLLEV--FSALGFMQMDLVSWDQFLKVSAWDSSDV
        V LGC A KG  +   F+ L   + D+V+ D F   S WD+SDV
Subjt:  VPLGCFAYKGLLEV--FSALGFMQMDLVSWDQFLKVSAWDSSDV

AT4G17140.2 pleckstrin homology (PH) domain-containing protein; e value 1.5e-26; identity 44.44%
Query:  VSSSQRFEAVADFQLI------------------------YFGDIIVKGFERPNSSIILHLSGDEELFKFPLDFQLVGQIKQQKGLEVISFWLTKAPPGF
        V+S  RFEAVA F+LI                        YFGDI V G+E PNS ++LH + D+E+ K  +DFQLVG++K+ +G+E ISFW+ +APPGF
Subjt:  VSSSQRFEAVADFQLI------------------------YFGDIIVKGFERPNSSIILHLSGDEELFKFPLDFQLVGQIKQQKGLEVISFWLTKAPPGF

Query:  VPLGCFAYKGLLEV--FSALGFMQMDLVSWDQFLKVSAWDSSDV
        V LGC A KG  +   F+ L   + D+V+ D F   S WD+SDV
Subjt:  VPLGCFAYKGLLEV--FSALGFMQMDLVSWDQFLKVSAWDSSDV

AT4G17140.3 pleckstrin homology (PH) domain-containing protein; e value 1.5e-26; identity 44.44%
Query:  VSSSQRFEAVADFQLI------------------------YFGDIIVKGFERPNSSIILHLSGDEELFKFPLDFQLVGQIKQQKGLEVISFWLTKAPPGF
        V+S  RFEAVA F+LI                        YFGDI V G+E PNS ++LH + D+E+ K  +DFQLVG++K+ +G+E ISFW+ +APPGF
Subjt:  VSSSQRFEAVADFQLI------------------------YFGDIIVKGFERPNSSIILHLSGDEELFKFPLDFQLVGQIKQQKGLEVISFWLTKAPPGF

Query:  VPLGCFAYKGLLEV--FSALGFMQMDLVSWDQFLKVSAWDSSDV
        V LGC A KG  +   F+ L   + D+V+ D F   S WD+SDV
Subjt:  VPLGCFAYKGLLEV--FSALGFMQMDLVSWDQFLKVSAWDSSDV
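
The % identity values in the hit lists can be checked against the alignment blocks. In standard BLAST pairwise output the middle line shows a letter where query and subject are identical and `+` where the substitution is conservative, so counting the letters gives the number of identities. The Python sketch below applies this to the AT1G48090.1 alignment above; the function name `percent_identity` is hypothetical, and using the query length as the alignment length assumes the query line holds one character per alignment column (true here, since this alignment has no gaps).

```python
def percent_identity(query: str, midline: str) -> tuple[int, int, float]:
    """Return (identities, alignment length, % identity) for one alignment block."""
    # Letters on the midline mark identical residues; '+' and spaces do not count.
    identities = sum(ch.isalpha() for ch in midline)
    # Assumption: the query line has one character per alignment column.
    length = len(query)
    return identities, length, round(100 * identities / length, 2)


# Query and midline copied from the AT1G48090.1 hit, without the line labels.
query = "GDIIVKGFERPNSSIILHLSGDEELFKFPLDFQLVGQIKQQKGLEVISFWLTKAPPGFVPLGCFAYK"
midline = "GD I +G E P    IL  + D E+   P+ F  V  I   KG + +  W   APPG+V LGC   K"

print(percent_identity(query, midline))  # (26, 67, 38.81)
```

The result, 26 identities over 67 columns, agrees with the 38.81 listed for this hit.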


Sequences
CDS sequence
ATGGTTCAGTTTCGGAATTTTGGGATTGTCAACATAATTCTTGGTGTTTTGGACTCTTTGGTGGTTGGTTCATCTGATGATGCTCGTGTTTGGTCTCTTGAAAATTCTGG
ACAGTTTACCGTTAGCTCTTTGTCTCATCAACTTGGTTCGAGTTTCCCCATTCAATCGGATTTGTTTTGGGCCATTTGGAAATCTAAAAACCCTAAACGAATAAATATTT
TGATGTTGATTATTTTTAATGGTAGTTTGAATACTTCTGATGTTCTTCAAAGAAAATTGCCGTTCTTCAGCTTATTTCCTTCGATTTGTCCTCTTTGTTTGAAGGCAGGA
GACTCCGCATTGCATTTGTTCTTTGAGTGTGCTTATTCACAACTTTGTTGGTCAAAGTTCTTTGCTATTTTCAATATGCAGTGGGTTTTTTCAAATTCCGTAAAAGAGAA
TGTGCTTCAACTGCTTATTGGTCCTTCTTTTTCTTCAAGACCGAGATTATTATGGATTAATGGTGTTAAAGCTTTGATATCAGAAATTTGGTTGGAAAGAAATCAGAGGG
TTTTTGAAGATAAAGCGTGGCATTCTTTAGCTCATCTGGAATCAGCTTGCTTAAAGGCTTCTTCCTGGTGCACTCTTTCGAAATCTTTTGTAGCTTTCTCTTCACAGGAT
ATTTGTTTTAATTGGCATTCTTTTATTTTTCCCCTAAATGTTAAGTACGTCAATAAAGAAGCTTACTGGGTTCGAAAGAATTGGGATGTGCTGAAAATTGATTTGGAAAG
CTCACTTGTTGTTTCTAGATTGATGGCCCATTATTGTTGGAAGGATGTTAAGATTACCCTTGAGAATTTCTTTAAATCTTCAGTCTCGATTAACCCTTTTTTGGATGATA
AAACTTTGATTCAAGTAGCTGTTGGTGGCTTGGATCTTTCTGCGAATGGCAAAAATCTTCCTTTGAATTTATGGCAAAGTGATTCCTTTGAAGCTATTGGAAAGAACCTT
GGAGGGTTGGTTAGTATTTCATCCAATACGCTTAATTTGCTAGATTGTTCTGAAGCCTACATTGAAGTAGAAAAGAATTTTTGTGGATTTATCCCTGCTGATATTAATGT
TAAGATTGGTAATAAGCATGAATTTTCATTAAGATTTGGTGATATTAATGCTTTAGAGGACAAAAATTTGAAGTTTGATTTAAGAAGGAAGTTAGATGTAAATGACTTTT
CGAATTCCTTGGATTTAATTAGGGTTAGGCAGGTGGTATTGGATGAAAATTCGGATCTGCTTATTGGGGGGGAAAGGATGAATGATTTTCCATTTGCTTGTCGGTATCAG
GAGGAATTAAATGGGGAGTTGGCTTCTTCAAAAGATGCCTCGTTGCATGATGATCTAATTAATTATGCTGGCTGTAATGTTTCTTCAAAAAATGCTGGCGGTAATGTTTC
TCCTCCCAAGATGATTATTGACAATAGTTGTAATTTGAATTATGATGTACAGCAGATTCCAAGAGTGAGAGACCAATTTAATGAGGCGTTGGGTTCTCCAATAGGTGCTG
CATTGCATGAAGAGGGTATTAATTACGTGGGAGATAAAGGCATTAAAGATAGCATTAATGAGCCGGTTATTGTTCTCTCTCCATCTAAAGATGATAATGTGTTTAATATG
CTTAGCCCTCAGAAAGTCCAACCGACTCAGTTTTTTGAATCATCTTCTAAGGATTTTAATGCCGTTAATTGCAATTTAATTAATGATGTCCAGCAGGTAGCATTAAAGAC
CTATTCTCGGAAAAAATCTTCTATTTCATTGGTTGCTAAGTCAAGCATTGATGCTGATAATTTGGAGTCTGAATGTACTCATTTAATTGCTGCAAAAAAGGCTTCAGGAT
CTTCAGATGTCAATTCTGGAAATGGGTTGTTTCAGGGTAAGGAATTTAAGGAATCTTCCGTTCAAATTCCAAGGGGAAGTGAGGTTTTCATAAGAGGAATTGGTAGTTCC
TTCAACCATAGTATTCATTCTCCGGGGGATTCAGATGATGAGTCTACGGTTAGTGTAAGTAGTGAGGATTCTGATCCGTTGTTAGAAAAAGATGATTGTGTGGATCTCTT
TTCAGAAGATCAAATTGATAATCATCCTTCTCCAAGTCAAACTGAGTCTTTGCATTTGGAGTGGTCTCCTGTTGTATCTTCTAGTCAACGTTTTGAGGCTGTTGCTGATT
TTCAGTTGATATATTTTGGTGATATTATTGTTAAAGGGTTTGAGCGTCCAAATTCATCTATCATTCTTCATCTCTCTGGAGATGAAGAGTTGTTCAAGTTCCCACTTGAT
TTCCAACTTGTGGGACAAATCAAGCAGCAAAAAGGGTTGGAAGTTATATCTTTCTGGTTGACAAAAGCTCCTCCAGGGTTTGTTCCCTTGGGTTGTTTTGCCTACAAAGG
TTTGCTTGAAGTCTTCAGTGCATTGGGATTCATGCAGATGGATTTGGTTTCGTGGGATCAGTTTTTGAAGGTTAGTGCTTGGGATTCTTCAGATGTCAGATCGTTTTCTA
TAGCAGTTTTAGATGTCTTTGCTGGTTTGGATTCAAAATTTTTCCTCATTTATTTTGCCCGGCTCAAAGCTTCGTCTTGGTGTTCTTTGTCCAAGCTCTTCTCAGGATTC
TCTATTCAAGTTATTTGCCTCAATTGGGAAACTTTCATTTTTCCATCTTAG
mRNA sequence
ATGGTTCAGTTTCGGAATTTTGGGATTGTCAACATAATTCTTGGTGTTTTGGACTCTTTGGTGGTTGGTTCATCTGATGATGCTCGTGTTTGGTCTCTTGAAAATTCTGG
ACAGTTTACCGTTAGCTCTTTGTCTCATCAACTTGGTTCGAGTTTCCCCATTCAATCGGATTTGTTTTGGGCCATTTGGAAATCTAAAAACCCTAAACGAATAAATATTT
TGATGTTGATTATTTTTAATGGTAGTTTGAATACTTCTGATGTTCTTCAAAGAAAATTGCCGTTCTTCAGCTTATTTCCTTCGATTTGTCCTCTTTGTTTGAAGGCAGGA
GACTCCGCATTGCATTTGTTCTTTGAGTGTGCTTATTCACAACTTTGTTGGTCAAAGTTCTTTGCTATTTTCAATATGCAGTGGGTTTTTTCAAATTCCGTAAAAGAGAA
TGTGCTTCAACTGCTTATTGGTCCTTCTTTTTCTTCAAGACCGAGATTATTATGGATTAATGGTGTTAAAGCTTTGATATCAGAAATTTGGTTGGAAAGAAATCAGAGGG
TTTTTGAAGATAAAGCGTGGCATTCTTTAGCTCATCTGGAATCAGCTTGCTTAAAGGCTTCTTCCTGGTGCACTCTTTCGAAATCTTTTGTAGCTTTCTCTTCACAGGAT
ATTTGTTTTAATTGGCATTCTTTTATTTTTCCCCTAAATGTTAAGTACGTCAATAAAGAAGCTTACTGGGTTCGAAAGAATTGGGATGTGCTGAAAATTGATTTGGAAAG
CTCACTTGTTGTTTCTAGATTGATGGCCCATTATTGTTGGAAGGATGTTAAGATTACCCTTGAGAATTTCTTTAAATCTTCAGTCTCGATTAACCCTTTTTTGGATGATA
AAACTTTGATTCAAGTAGCTGTTGGTGGCTTGGATCTTTCTGCGAATGGCAAAAATCTTCCTTTGAATTTATGGCAAAGTGATTCCTTTGAAGCTATTGGAAAGAACCTT
GGAGGGTTGGTTAGTATTTCATCCAATACGCTTAATTTGCTAGATTGTTCTGAAGCCTACATTGAAGTAGAAAAGAATTTTTGTGGATTTATCCCTGCTGATATTAATGT
TAAGATTGGTAATAAGCATGAATTTTCATTAAGATTTGGTGATATTAATGCTTTAGAGGACAAAAATTTGAAGTTTGATTTAAGAAGGAAGTTAGATGTAAATGACTTTT
CGAATTCCTTGGATTTAATTAGGGTTAGGCAGGTGGTATTGGATGAAAATTCGGATCTGCTTATTGGGGGGGAAAGGATGAATGATTTTCCATTTGCTTGTCGGTATCAG
GAGGAATTAAATGGGGAGTTGGCTTCTTCAAAAGATGCCTCGTTGCATGATGATCTAATTAATTATGCTGGCTGTAATGTTTCTTCAAAAAATGCTGGCGGTAATGTTTC
TCCTCCCAAGATGATTATTGACAATAGTTGTAATTTGAATTATGATGTACAGCAGATTCCAAGAGTGAGAGACCAATTTAATGAGGCGTTGGGTTCTCCAATAGGTGCTG
CATTGCATGAAGAGGGTATTAATTACGTGGGAGATAAAGGCATTAAAGATAGCATTAATGAGCCGGTTATTGTTCTCTCTCCATCTAAAGATGATAATGTGTTTAATATG
CTTAGCCCTCAGAAAGTCCAACCGACTCAGTTTTTTGAATCATCTTCTAAGGATTTTAATGCCGTTAATTGCAATTTAATTAATGATGTCCAGCAGGTAGCATTAAAGAC
CTATTCTCGGAAAAAATCTTCTATTTCATTGGTTGCTAAGTCAAGCATTGATGCTGATAATTTGGAGTCTGAATGTACTCATTTAATTGCTGCAAAAAAGGCTTCAGGAT
CTTCAGATGTCAATTCTGGAAATGGGTTGTTTCAGGGTAAGGAATTTAAGGAATCTTCCGTTCAAATTCCAAGGGGAAGTGAGGTTTTCATAAGAGGAATTGGTAGTTCC
TTCAACCATAGTATTCATTCTCCGGGGGATTCAGATGATGAGTCTACGGTTAGTGTAAGTAGTGAGGATTCTGATCCGTTGTTAGAAAAAGATGATTGTGTGGATCTCTT
TTCAGAAGATCAAATTGATAATCATCCTTCTCCAAGTCAAACTGAGTCTTTGCATTTGGAGTGGTCTCCTGTTGTATCTTCTAGTCAACGTTTTGAGGCTGTTGCTGATT
TTCAGTTGATATATTTTGGTGATATTATTGTTAAAGGGTTTGAGCGTCCAAATTCATCTATCATTCTTCATCTCTCTGGAGATGAAGAGTTGTTCAAGTTCCCACTTGAT
TTCCAACTTGTGGGACAAATCAAGCAGCAAAAAGGGTTGGAAGTTATATCTTTCTGGTTGACAAAAGCTCCTCCAGGGTTTGTTCCCTTGGGTTGTTTTGCCTACAAAGG
TTTGCTTGAAGTCTTCAGTGCATTGGGATTCATGCAGATGGATTTGGTTTCGTGGGATCAGTTTTTGAAGGTTAGTGCTTGGGATTCTTCAGATGTCAGATCGTTTTCTA
TAGCAGTTTTAGATGTCTTTGCTGGTTTGGATTCAAAATTTTTCCTCATTTATTTTGCCCGGCTCAAAGCTTCGTCTTGGTGTTCTTTGTCCAAGCTCTTCTCAGGATTC
TCTATTCAAGTTATTTGCCTCAATTGGGAAACTTTCATTTTTCCATCTTAG
Protein sequence
MVQFRNFGIVNIILGVLDSLVVGSSDDARVWSLENSGQFTVSSLSHQLGSSFPIQSDLFWAIWKSKNPKRINILMLIIFNGSLNTSDVLQRKLPFFSLFPSICPLCLKAG
DSALHLFFECAYSQLCWSKFFAIFNMQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLAHLESACLKASSWCTLSKSFVAFSSQD
ICFNWHSFIFPLNVKYVNKEAYWVRKNWDVLKIDLESSLVVSRLMAHYCWKDVKITLENFFKSSVSINPFLDDKTLIQVAVGGLDLSANGKNLPLNLWQSDSFEAIGKNL
GGLVSISSNTLNLLDCSEAYIEVEKNFCGFIPADINVKIGNKHEFSLRFGDINALEDKNLKFDLRRKLDVNDFSNSLDLIRVRQVVLDENSDLLIGGERMNDFPFACRYQ
EELNGELASSKDASLHDDLINYAGCNVSSKNAGGNVSPPKMIIDNSCNLNYDVQQIPRVRDQFNEALGSPIGAALHEEGINYVGDKGIKDSINEPVIVLSPSKDDNVFNM
LSPQKVQPTQFFESSSKDFNAVNCNLINDVQQVALKTYSRKKSSISLVAKSSIDADNLESECTHLIAAKKASGSSDVNSGNGLFQGKEFKESSVQIPRGSEVFIRGIGSS
FNHSIHSPGDSDDESTVSVSSEDSDPLLEKDDCVDLFSEDQIDNHPSPSQTESLHLEWSPVVSSSQRFEAVADFQLIYFGDIIVKGFERPNSSIILHLSGDEELFKFPLD
FQLVGQIKQQKGLEVISFWLTKAPPGFVPLGCFAYKGLLEVFSALGFMQMDLVSWDQFLKVSAWDSSDVRSFSIAVLDVFAGLDSKFFLIYFARLKASSWCSLSKLFSGF
SIQVICLNWETFIFPS
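
The protein sequence above can be reproduced from the CDS by codon-by-codon translation. The following Python sketch assumes the standard nuclear genetic code (the record does not name a translation table) and uses a hypothetical helper `translate`; it is meant as a consistency check on the listings, not as the database's own pipeline.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS listing.
print(translate("ATGGTTCAGTTTCGGAATTTTGGGATTGTC"))  # MVQFRNFGIV
# Final line of the CDS listing, which ends in the TAG stop codon.
print(translate("TCTATTCAAGTTATTTGCCTCAATTGGGAAACTTTCATTTTTCCATCTTAG"))  # SIQVICLNWETFIFPS
```

Both fragments match the start and the end of the protein sequence listed above.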