CuGenDBv2

Spg015046 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg015046
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold3:50096259..50105501
RNA-Seq Expression: Spg015046
Synteny: Spg015046
Gene Ontology terms: GO:0110165 - cellular anatomical structure (cellular component)
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain


Homology
GenBank top hits (e value, % identity, alignment)
CAN68165.1 hypothetical protein VITISV_008538 [Vitis vinifera] (e value 4.8e-29, 33.33% identity)
Query:  TWDLDFRRGLFDREICSWIALVDKIKEVNLVYD-HDLIRWKLEASGKFSTKSMFYKL--VNGSPKLKQPMSRLIWNHKCPKKVKVFLWSLAYRSLNTDEK
        +W+ +FRR L D EI    +L+  +  ++L     D   W L +SG F+ KS F  L  ++G P +    ++L+WN + P K+K F+W +A++ +NT++ 
Subjt:  TWDLDFRRGLFDREICSWIALVDKIKEVNLVYD-HDLIRWKLEASGKFSTKSMFYKL--VNGSPKLKQPMSRLIWNHKCPKKVKVFLWSLAYRSLNTDEK

Query:  LQKFSKW-SVSPSACRLCLKADENLDHLFLHCDFARSAWSLVGRLLGIPFRLPRKIDDWLLEGLNAWNLKSKAMILASCAFKTTLWTLWKERNARTFEDK
        LQ    + ++SP  C LC++  E +DHLFLHC      W  + +L  I +  PR + D +    N +    + ++L   A    LW +W+ERNAR FEDK
Subjt:  LQKFSKW-SVSPSACRLCLKADENLDHLFLHCDFARSAWSLVGRLLGIPFRLPRKIDDWLLEGLNAWNLKSKAMILASCAFKTTLWTLWKERNARTFEDK

Query:  SNSFDSFCDIVQNTAT
        S + ++  D++   A+
Subjt:  SNSFDSFCDIVQNTAT

RVW15141.1 putative ribonuclease H protein [Vitis vinifera] (e value 6.2e-29, 34.58% identity)
Query:  TWDLDFRRGLFDREICSWIALVDKIKEVNLVYD-HDLIRWKLEASGKFSTKSMFYKLVNGSPKLKQPMSRLIWNHKCPKKVKVFLWSLAYRSLNTDEKLQ
        +W+ +FRR L D EI    +L+  +  ++L     D   W L +SG F+ KS F  L   S       ++L+WN + P K+K F+W +A++ +NT++ LQ
Subjt:  TWDLDFRRGLFDREICSWIALVDKIKEVNLVYD-HDLIRWKLEASGKFSTKSMFYKLVNGSPKLKQPMSRLIWNHKCPKKVKVFLWSLAYRSLNTDEKLQ

Query:  KFSKW-SVSPSACRLCLKADENLDHLFLHCDFARSAWSLVGRLLGIPFRLPRKIDDWLLEGLNAWNLKSKAMILASCAFKTTLWTLWKERNARTFEDKSN
            + ++SP  C LC++  E  DHLFLHC      W  + +L  I +  PR I D L    N +    + +IL   A    LW +W+ERNAR FEDKS 
Subjt:  KFSKW-SVSPSACRLCLKADENLDHLFLHCDFARSAWSLVGRLLGIPFRLPRKIDDWLLEGLNAWNLKSKAMILASCAFKTTLWTLWKERNARTFEDKSN

Query:  SFDSFCDIVQNTAT
        + ++  D++   A+
Subjt:  SFDSFCDIVQNTAT

RVW72196.1 Flowering time control protein FY [Vitis vinifera] (e value 6.2e-29, 24.76% identity)
Query:  GWKSDKLKRRRGNRVWVDIAVLYPFFDQFSTLTVKCGTNIRFWEDRWCDNQSLKALIPNLYRLSGKKEATIADCWIVEHQ-TWDLSFRRGLLDREVSSWS
        GW ++ + R      W  IA ++  F  F  L V  G  IRFWED W  NQ+L A    LYR+S  +  T+++        +W+ +FRR L D E+    
Subjt:  GWKSDKLKRRRGNRVWVDIAVLYPFFDQFSTLTVKCGTNIRFWEDRWCDNQSLKALIPNLYRLSGKKEATIADCWIVEHQ-TWDLSFRRGLLDREVSSWS

Query:  VLVEKIHMVNLMEGQDILKWTLDSSACRLCLKAEERMIDEWMFEGLAAWNLKGKAKIIASCAFRASLWLLWKERNSRSSNEDRDDLQPEFLASMEASIAD
         L+  +H V L                                                                S SS++ R                 
Subjt:  VLVEKIHMVNLMEGQDILKWTLDSSACRLCLKAEERMIDEWMFEGLAAWNLKGKAKIIASCAFRASLWLLWKERNSRSSNEDRDDLQPEFLASMEASIAD

Query:  CWSYDSQTWDLDFRRGLFDREICSWIALVDKIKEVNLVYDHDLIRWKLEASGKFSTKSMFYKLVNGSPKLKQPMSRLIWNHKCPKKVKVFLWSLAYRSLN
                                                     W L +SG FS KS FY L   S  L    ++ +W+ K P KVK   W +A+  +N
Subjt:  CWSYDSQTWDLDFRRGLFDREICSWIALVDKIKEVNLVYDHDLIRWKLEASGKFSTKSMFYKLVNGSPKLKQPMSRLIWNHKCPKKVKVFLWSLAYRSLN

Query:  TDEKLQKFSKW-SVSPSACRLCLKADENLDHLFLHCDFARSAWSLVGRLLGIPFRLPRKIDDWLLEGLNAWNLKSKAMILASCAFKTTLWTLWKERNART
        T++KLQ    + ++ P  C LC +  E++DHLFLHC      W  +  L+G+ +  PR I+D L+          +  IL   A  T +W +W+ERN R 
Subjt:  TDEKLQKFSKW-SVSPSACRLCLKADENLDHLFLHCDFARSAWSLVGRLLGIPFRLPRKIDDWLLEGLNAWNLKSKAMILASCAFKTTLWTLWKERNART

Query:  FEDKSNSFDSFCDIVQ
        FEDK  + +   D+++
Subjt:  FEDKSNSFDSFCDIVQ

RVW74148.1 putative ribonuclease H protein [Vitis vinifera] (e value 8.1e-29, 33.33% identity)
Query:  TWDLDFRRGLFDREICSWIALVDKIKEVNLVYD-HDLIRWKLEASGKFSTKSMFYKL--VNGSPKLKQPMSRLIWNHKCPKKVKVFLWSLAYRSLNTDEK
        +W+ +FRR L D EI    +L+  +  ++L     D   W L +SG F+ KS F  L  ++G P +    ++L+WN + P K+K F+W +A++ +NT++ 
Subjt:  TWDLDFRRGLFDREICSWIALVDKIKEVNLVYD-HDLIRWKLEASGKFSTKSMFYKL--VNGSPKLKQPMSRLIWNHKCPKKVKVFLWSLAYRSLNTDEK

Query:  LQKFSKW-SVSPSACRLCLKADENLDHLFLHCDFARSAWSLVGRLLGIPFRLPRKIDDWLLEGLNAWNLKSKAMILASCAFKTTLWTLWKERNARTFEDK
        LQ    + ++SP  C LC++  E +DHLFLHC      W  + +L  I +  PR + D +    N +    + ++L   A    LW +W+ERNAR FEDK
Subjt:  LQKFSKW-SVSPSACRLCLKADENLDHLFLHCDFARSAWSLVGRLLGIPFRLPRKIDDWLLEGLNAWNLKSKAMILASCAFKTTLWTLWKERNARTFEDK

Query:  SNSFDSFCDIVQNTAT
        S + ++  D++   A+
Subjt:  SNSFDSFCDIVQNTAT

RVW90769.1 putative ribonuclease H protein [Vitis vinifera] (e value 8.1e-29, 24.82% identity)
Query:  GWKSDKLKRRRGNRVWVDIAVLYPFFDQFSTLTVKCGTNIRFWEDRWCDNQSLKALIPNLYRLSGKKEATIADCWIVEHQ-TWDLSFRRGLLDREVSSWS
        GW ++ + R      W  IA ++  F  F  L V  G  IRFWED W  NQ+L A    LYR+S  +  T+++        +W+ +FRR L D E+    
Subjt:  GWKSDKLKRRRGNRVWVDIAVLYPFFDQFSTLTVKCGTNIRFWEDRWCDNQSLKALIPNLYRLSGKKEATIADCWIVEHQ-TWDLSFRRGLLDREVSSWS

Query:  VLVEKIHMVNLMEGQDILKWTLDSSACRLCLKAEERMIDEWMFEGLAAWNLKGKAKIIASCAFRASLWLLWKERNSRSSNEDRDDLQPEFLASMEASIAD
         L+  +H V L                                                                S SS++ R                 
Subjt:  VLVEKIHMVNLMEGQDILKWTLDSSACRLCLKAEERMIDEWMFEGLAAWNLKGKAKIIASCAFRASLWLLWKERNSRSSNEDRDDLQPEFLASMEASIAD

Query:  CWSYDSQTWDLDFRRGLFDREICSWIALVDKIKEVNLVYDHDLIRWKLEASGKFSTKSMFYKLVNGSPKLKQPMSRLIWNHKCPKKVKVFLWSLAYRSLN
                                                     W L +SG FS KS FY L   S  L    ++ +W+ K P KVK   W +A+  +N
Subjt:  CWSYDSQTWDLDFRRGLFDREICSWIALVDKIKEVNLVYDHDLIRWKLEASGKFSTKSMFYKLVNGSPKLKQPMSRLIWNHKCPKKVKVFLWSLAYRSLN

Query:  TDEKLQKFSKW-SVSPSACRLCLKADENLDHLFLHCDFARSAWSLVGRLLGIPFRLPRKIDDWLLEGLNAWNLKSKAMILASCAFKTTLWTLWKERNART
        T++KLQ    + ++ P  C LC +  E++DHLFLHC      W  +  L+G+ +  PR I+D L+          +  IL   A  T +W +W+ERN R 
Subjt:  TDEKLQKFSKW-SVSPSACRLCLKADENLDHLFLHCDFARSAWSLVGRLLGIPFRLPRKIDDWLLEGLNAWNLKSKAMILASCAFKTTLWTLWKERNART

Query:  FEDKSNSFDSFCDIV
        FEDK  + +   D++
Subjt:  FEDKSNSFDSFCDIV

TrEMBL top hits (e value, % identity, alignment)
A0A2N9GN16 Reverse transcriptase domain-containing protein (e value 1.2e-33, 25.45% identity)
Query:  SSAGGWKSDKLKRRRGNRVWVDIAVLYPFFDQFSTLTVKCGTNIRFWEDRWCDNQSLKALIPNLYRLSGKKEATIAD--CWIVEHQTWDLSFRRGLLDRE
        S  GGW S+++    G  +W  I   +  F ++++  V  G+ I+FW D WC +  L+   P+L+RL+   EA++AD  C+    + WD++F R + D E
Subjt:  SSAGGWKSDKLKRRRGNRVWVDIAVLYPFFDQFSTLTVKCGTNIRFWEDRWCDNQSLKALIPNLYRLSGKKEATIAD--CWIVEHQTWDLSFRRGLLDRE

Query:  VSSWSVLVEKIHMVNLMEGQ-DILKWTLDSSACRLCLKAEERMIDEWMFEGLAAWNLKGKAKIIAS----CAFRAS------LWLL----WKERNSRSSN
        + + +  +E ++  ++ +G  D L W   S             +  +  E  A W     +K  +     C+ R S      LW      W      +S 
Subjt:  VSSWSVLVEKIHMVNLMEGQ-DILKWTLDSSACRLCLKAEERMIDEWMFEGLAAWNLKGKAKIIAS----CAFRAS------LWLL----WKERNSRSSN

Query:  EDRDDLQPEF---------------------LASMEASIAD--CWSYDSQTWDLDFRRGLFDREICSWIALVDKIKEVNLVYDH-DLIRWKLEASGKFST
           D  + +F                       + EAS+AD  C++   + WD+ F R + D E+ +  AL++ +    +   H D + W+  +   F  
Subjt:  EDRDDLQPEF---------------------LASMEASIAD--CWSYDSQTWDLDFRRGLFDREICSWIALVDKIKEVNLVYDH-DLIRWKLEASGKFST

Query:  KSMFYKLVNGSPKLKQPMSRLIWNHKCPKKVKVFLWSLAYRSLNTDEKLQKFSKWSVSPSACRLCLKADENLDHLFLHCDFARSAWSLVGRLLGIPFRLP
        +S +  L+N S ++  P  R +W  K P +V  F W+     + T + L+K     +    C +C    E+++HL LHC  A+  W+LV  L GI + +P
Subjt:  KSMFYKLVNGSPKLKQPMSRLIWNHKCPKKVKVFLWSLAYRSLNTDEKLQKFSKWSVSPSACRLCLKADENLDHLFLHCDFARSAWSLVGRLLGIPFRLP

Query:  RKIDDWLLEGLNAWNLKSKAMILASCAFKTTLWTLWKERNARTFEDKS
        R +++ L    +     + +  +   A    +W LW+ERN+RTF  +S
Subjt:  RKIDDWLLEGLNAWNLKSKAMILASCAFKTTLWTLWKERNARTFEDKS

A0A2N9GTP9 CRM domain-containing protein (e value 7.7e-33, 24.79% identity)
Query:  HDSVVEESRLS---AIDSRIVKSVWSSRHIAWVALDAVSSAGGWKSDKLKRRRGNRVWVDIAVLYPFFDQFSTLTVKCGTNIRFWEDRWCDNQSLKALIP
        H+   E +RLS    ID++   ++   R   +V    +S       +K++   G  +W DI + +  F+ +   TV  G  +R W DRWC +  LK + P
Subjt:  HDSVVEESRLS---AIDSRIVKSVWSSRHIAWVALDAVSSAGGWKSDKLKRRRGNRVWVDIAVLYPFFDQFSTLTVKCGTNIRFWEDRWCDNQSLKALIP

Query:  NLYRLSGKKEATIADCWIVEH--QTWDLSFRRGLLDREVSSWSVLVEKIHMVNLMEGQDILKWTLDSSACRLCLKAEERMIDEWMFEGLAAWNLKGKAKI
         L+  +  K+A ++   + ++    W+++F R   D E+ + +  +  I     M   +   W       R  +++ +  +  ++ +  +   + G  K+
Subjt:  NLYRLSGKKEATIADCWIVEH--QTWDLSFRRGLLDREVSSWSVLVEKIHMVNLMEGQDILKWTLDSSACRLCLKAEERMIDEWMFEGLAAWNLKGKAKI

Query:  IASCAFRASLW----LLWKERNSRSSNE----------------DRD--DLQPEFLA-------SMEASIADCWSYDSQTWDLDFRRGLFDREICSWIAL
                SLW    + W+  +S  S E                DR   +L P           ++ +++       SQ W+L F R   D E+   +A 
Subjt:  IASCAFRASLW----LLWKERNSRSSNE----------------DRD--DLQPEFLA-------SMEASIADCWSYDSQTWDLDFRRGLFDREICSWIAL

Query:  VDKIKEVNLVYDH-------DLIRWKLEASGKFSTKSMFYKLVNGSPKLKQPMSRLIWNHKCPKKVKVFLWSLAYRSLNTDEKLQKFSKWSVSPSACRLC
               +L++ H       D + W     G F ++S FY +++   +++ P  + IW  K P +V  F+WS A+  + T + L+K  +  V    C +C
Subjt:  VDKIKEVNLVYDH-------DLIRWKLEASGKFSTKSMFYKLVNGSPKLKQPMSRLIWNHKCPKKVKVFLWSLAYRSLNTDEKLQKFSKWSVSPSACRLC

Query:  LKADENLDHLFLHCDFARSAWSLVGRLLGIPFRLPRKIDDWLLEGLNAWNLKSKAM--ILASCAFKTTLWTLWKERNARTFEDK
          ADE +DHL LHC  AR  WS V + +G+ + LP ++ + L    N +   S  +  ++ +C     +WT+W+ERN+RTFEDK
Subjt:  LKADENLDHLFLHCDFARSAWSLVGRLLGIPFRLPRKIDDWLLEGLNAWNLKSKAM--ILASCAFKTTLWTLWKERNARTFEDK

A0A438BVX7 Putative ribonuclease H protein (e value 3.0e-29, 34.58% identity)
Query:  TWDLDFRRGLFDREICSWIALVDKIKEVNLVYD-HDLIRWKLEASGKFSTKSMFYKLVNGSPKLKQPMSRLIWNHKCPKKVKVFLWSLAYRSLNTDEKLQ
        +W+ +FRR L D EI    +L+  +  ++L     D   W L +SG F+ KS F  L   S       ++L+WN + P K+K F+W +A++ +NT++ LQ
Subjt:  TWDLDFRRGLFDREICSWIALVDKIKEVNLVYD-HDLIRWKLEASGKFSTKSMFYKLVNGSPKLKQPMSRLIWNHKCPKKVKVFLWSLAYRSLNTDEKLQ

Query:  KFSKW-SVSPSACRLCLKADENLDHLFLHCDFARSAWSLVGRLLGIPFRLPRKIDDWLLEGLNAWNLKSKAMILASCAFKTTLWTLWKERNARTFEDKSN
            + ++SP  C LC++  E  DHLFLHC      W  + +L  I +  PR I D L    N +    + +IL   A    LW +W+ERNAR FEDKS 
Subjt:  KFSKW-SVSPSACRLCLKADENLDHLFLHCDFARSAWSLVGRLLGIPFRLPRKIDDWLLEGLNAWNLKSKAMILASCAFKTTLWTLWKERNARTFEDKSN

Query:  SFDSFCDIVQNTAT
        + ++  D++   A+
Subjt:  SFDSFCDIVQNTAT

A0A438GJ06 Flowering time control protein FY (e value 3.0e-29, 24.76% identity)
Query:  GWKSDKLKRRRGNRVWVDIAVLYPFFDQFSTLTVKCGTNIRFWEDRWCDNQSLKALIPNLYRLSGKKEATIADCWIVEHQ-TWDLSFRRGLLDREVSSWS
        GW ++ + R      W  IA ++  F  F  L V  G  IRFWED W  NQ+L A    LYR+S  +  T+++        +W+ +FRR L D E+    
Subjt:  GWKSDKLKRRRGNRVWVDIAVLYPFFDQFSTLTVKCGTNIRFWEDRWCDNQSLKALIPNLYRLSGKKEATIADCWIVEHQ-TWDLSFRRGLLDREVSSWS

Query:  VLVEKIHMVNLMEGQDILKWTLDSSACRLCLKAEERMIDEWMFEGLAAWNLKGKAKIIASCAFRASLWLLWKERNSRSSNEDRDDLQPEFLASMEASIAD
         L+  +H V L                                                                S SS++ R                 
Subjt:  VLVEKIHMVNLMEGQDILKWTLDSSACRLCLKAEERMIDEWMFEGLAAWNLKGKAKIIASCAFRASLWLLWKERNSRSSNEDRDDLQPEFLASMEASIAD

Query:  CWSYDSQTWDLDFRRGLFDREICSWIALVDKIKEVNLVYDHDLIRWKLEASGKFSTKSMFYKLVNGSPKLKQPMSRLIWNHKCPKKVKVFLWSLAYRSLN
                                                     W L +SG FS KS FY L   S  L    ++ +W+ K P KVK   W +A+  +N
Subjt:  CWSYDSQTWDLDFRRGLFDREICSWIALVDKIKEVNLVYDHDLIRWKLEASGKFSTKSMFYKLVNGSPKLKQPMSRLIWNHKCPKKVKVFLWSLAYRSLN

Query:  TDEKLQKFSKW-SVSPSACRLCLKADENLDHLFLHCDFARSAWSLVGRLLGIPFRLPRKIDDWLLEGLNAWNLKSKAMILASCAFKTTLWTLWKERNART
        T++KLQ    + ++ P  C LC +  E++DHLFLHC      W  +  L+G+ +  PR I+D L+          +  IL   A  T +W +W+ERN R 
Subjt:  TDEKLQKFSKW-SVSPSACRLCLKADENLDHLFLHCDFARSAWSLVGRLLGIPFRLPRKIDDWLLEGLNAWNLKSKAMILASCAFKTTLWTLWKERNART

Query:  FEDKSNSFDSFCDIVQ
        FEDK  + +   D+++
Subjt:  FEDKSNSFDSFCDIVQ

A5BPI6 Uncharacterized protein (e value 2.3e-29, 33.33% identity)
Query:  TWDLDFRRGLFDREICSWIALVDKIKEVNLVYD-HDLIRWKLEASGKFSTKSMFYKL--VNGSPKLKQPMSRLIWNHKCPKKVKVFLWSLAYRSLNTDEK
        +W+ +FRR L D EI    +L+  +  ++L     D   W L +SG F+ KS F  L  ++G P +    ++L+WN + P K+K F+W +A++ +NT++ 
Subjt:  TWDLDFRRGLFDREICSWIALVDKIKEVNLVYD-HDLIRWKLEASGKFSTKSMFYKL--VNGSPKLKQPMSRLIWNHKCPKKVKVFLWSLAYRSLNTDEK

Query:  LQKFSKW-SVSPSACRLCLKADENLDHLFLHCDFARSAWSLVGRLLGIPFRLPRKIDDWLLEGLNAWNLKSKAMILASCAFKTTLWTLWKERNARTFEDK
        LQ    + ++SP  C LC++  E +DHLFLHC      W  + +L  I +  PR + D +    N +    + ++L   A    LW +W+ERNAR FEDK
Subjt:  LQKFSKW-SVSPSACRLCLKADENLDHLFLHCDFARSAWSLVGRLLGIPFRLPRKIDDWLLEGLNAWNLKSKAMILASCAFKTTLWTLWKERNARTFEDK

Query:  SNSFDSFCDIVQNTAT
        S + ++  D++   A+
Subjt:  SNSFDSFCDIVQNTAT

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G33710.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value 2.3e-05, 26.19% identity)
Query:  SRLIWNHKCPKKVKVFLWSLAYRSLNTDEKLQKFSKWSVS-PSACRLCLKADENLDHLFLHCDFARSAWSLVGRLLGIP---FRLPRKIDDWLLEGLNAW
        ++ +W      K    +W      L T  +L   + W +   + C LC    E+ DHLFL C+FA   W  V   L +P   F +   + DW L+     
Subjt:  SRLIWNHKCPKKVKVFLWSLAYRSLNTDEKLQKFSKWSVS-PSACRLCLKADENLDHLFLHCDFARSAWSLVGRLLGIP---FRLPRKIDDWLLEGLNAW

Query:  NLKSKAMILASCAFKTTLWTLWKERN
          +     L     ++ L+ +WK+RN
Subjt:  NLKSKAMILASCAFKTTLWTLWKERN

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value 2.0e-09, 25.86% identity)
Query:  DLIRWKLEASG---KFSTKSMFYKLVNGSPKLKQPMSRLIWNHKCPKKVKVFLWSLAYRSLNTDEKLQKFSKWSVS-PSACRLCLKADENLDHLFLHCDF
        D   W+  A      FS++  + ++   SP +  P ++++W  +   +  +  W      L T ++L+    W ++ PS+  LC   DE   HLF  C F
Subjt:  DLIRWKLEASG---KFSTKSMFYKLVNGSPKLKQPMSRLIWNHKCPKKVKVFLWSLAYRSLNTDEKLQKFSKWSVS-PSACRLCLKADENLDHLFLHCDF

Query:  ARSAWSLVGRLL--GIPFRLPRKIDDWLLEGLNAWNLKSKAMILASCAFKTTLWTLWKERNARTFEDKSNSFDS
        + + W           PF LP     W+L+      L+S +  +     ++ ++ +WKERNAR F   S+S  S
Subjt:  ARSAWSLVGRLL--GIPFRLPRKIDDWLLEGLNAWNLKSKAMILASCAFKTTLWTLWKERNARTFEDKSNSFDS


Sequences
CDS sequence
ATGGAATCGGATTTATCTTTTTCGAGCCCTGCTAGCATCGAAAGTGATAGCGGAAAAGATTTCAGACTTAGTAGAATGGAAGCCGAAGACAGTGATATACCTGAAGGCTA
TCAGCTATGTTTCTCCAATGATTTCGACCAAGATAGAAGCTCCATTCCGCAAAGCATGGAGGAATCGGTTGAGATGCCGAAGGAAGTACCCAGTAGAAATAAGGATGAAG
AGGTTCAGCTTCTGAGAGCCTCCCCAGAGCTCAGGACCACCGGAAAAGCTGTGCTCGAAAAGCATCAAATGATTTCTCCAGAAGCTCTAGCTATGATCCCGTCAGAAAAT
GAAGAGAGAATATCTACAAATACAGACCATGGAGAGCTCTTTCTAAGTAAAGAGCTAATTCTCACCCTTAGAAGGAACAACTTATGCATTAGGCCGATTGTAGGCTCGAG
TGCTAAGAAAGGCAACTCGAGTAAGAAAAAGCGTAACAGGGAAGTGACCAACCTTCTTAGAACATGGGAAAAGGAAGTAGAGCCTATTATAGAAGAGGAAATTGACCATG
ATAGTGTTGTAGAGGAATCCAGATTATCTGCCATTGATAGTCGTATTGTGAAGTCTGTTTGGAGCTCTAGGCACATAGCTTGGGTGGCCTTAGATGCAGTGTCCTCGGCT
GGAGGGTGGAAGTCCGATAAGCTCAAACGGCGTAGAGGCAACAGAGTTTGGGTGGACATAGCGGTTCTCTACCCTTTTTTTGATCAATTCTCGACCTTAACTGTTAAGTG
TGGGACAAACATCAGATTTTGGGAAGATCGTTGGTGTGATAACCAATCTCTGAAGGCCCTCATCCCGAATTTATATAGGCTTTCTGGTAAGAAAGAGGCTACTATTGCTG
ATTGCTGGATTGTTGAGCACCAAACGTGGGATCTTTCATTTAGAAGGGGGCTCCTAGATAGAGAGGTGAGTAGCTGGTCGGTCCTTGTTGAGAAAATTCATATGGTTAAC
TTGATGGAAGGGCAAGACATTTTAAAATGGACCCTCGATAGTTCCGCGTGCAGACTCTGCCTCAAGGCTGAAGAGAGAATGATTGACGAATGGATGTTTGAAGGCCTGGC
TGCTTGGAATTTGAAAGGAAAGGCCAAGATTATTGCGAGTTGTGCGTTTAGAGCTTCGTTGTGGCTTCTTTGGAAGGAAAGAAACTCAAGATCGTCAAACGAAGATCGGG
ACGACCTACAACCCGAATTTCTCGCATCAATGGAGGCATCCATAGCAGATTGTTGGAGCTATGATTCTCAAACGTGGGATTTGGACTTTAGAAGAGGCCTATTTGATAGG
GAAATTTGCAGCTGGATTGCGTTGGTGGACAAAATTAAAGAGGTGAATTTGGTGTATGACCATGATTTGATTAGATGGAAGTTGGAAGCCTCTGGTAAGTTCTCGACCAA
ATCCATGTTCTATAAGCTGGTCAATGGTTCCCCTAAATTGAAGCAGCCCATGAGTAGACTTATATGGAACCATAAATGCCCTAAAAAAGTTAAAGTTTTCTTATGGTCCC
TAGCCTATAGAAGCTTAAACACTGACGAGAAATTGCAAAAGTTCAGTAAGTGGTCGGTGTCCCCCTCTGCTTGTAGATTGTGCCTTAAAGCTGATGAAAATCTAGACCAC
CTCTTCCTGCACTGTGATTTTGCGAGGTCAGCCTGGAGCTTGGTTGGAAGGCTGCTAGGAATACCCTTTCGTCTGCCTAGGAAAATTGATGATTGGCTCTTGGAAGGTCT
GAATGCGTGGAACCTTAAGAGTAAGGCGATGATTTTGGCTAGTTGTGCTTTCAAGACTACTCTTTGGACCTTATGGAAAGAAAGAAATGCTAGAACCTTTGAAGACAAGA
GTAATAGCTTCGATTCTTTTTGTGATATTGTACAAAATACGGCCACTTGGTAG
mRNA sequence
ATGGAATCGGATTTATCTTTTTCGAGCCCTGCTAGCATCGAAAGTGATAGCGGAAAAGATTTCAGACTTAGTAGAATGGAAGCCGAAGACAGTGATATACCTGAAGGCTA
TCAGCTATGTTTCTCCAATGATTTCGACCAAGATAGAAGCTCCATTCCGCAAAGCATGGAGGAATCGGTTGAGATGCCGAAGGAAGTACCCAGTAGAAATAAGGATGAAG
AGGTTCAGCTTCTGAGAGCCTCCCCAGAGCTCAGGACCACCGGAAAAGCTGTGCTCGAAAAGCATCAAATGATTTCTCCAGAAGCTCTAGCTATGATCCCGTCAGAAAAT
GAAGAGAGAATATCTACAAATACAGACCATGGAGAGCTCTTTCTAAGTAAAGAGCTAATTCTCACCCTTAGAAGGAACAACTTATGCATTAGGCCGATTGTAGGCTCGAG
TGCTAAGAAAGGCAACTCGAGTAAGAAAAAGCGTAACAGGGAAGTGACCAACCTTCTTAGAACATGGGAAAAGGAAGTAGAGCCTATTATAGAAGAGGAAATTGACCATG
ATAGTGTTGTAGAGGAATCCAGATTATCTGCCATTGATAGTCGTATTGTGAAGTCTGTTTGGAGCTCTAGGCACATAGCTTGGGTGGCCTTAGATGCAGTGTCCTCGGCT
GGAGGGTGGAAGTCCGATAAGCTCAAACGGCGTAGAGGCAACAGAGTTTGGGTGGACATAGCGGTTCTCTACCCTTTTTTTGATCAATTCTCGACCTTAACTGTTAAGTG
TGGGACAAACATCAGATTTTGGGAAGATCGTTGGTGTGATAACCAATCTCTGAAGGCCCTCATCCCGAATTTATATAGGCTTTCTGGTAAGAAAGAGGCTACTATTGCTG
ATTGCTGGATTGTTGAGCACCAAACGTGGGATCTTTCATTTAGAAGGGGGCTCCTAGATAGAGAGGTGAGTAGCTGGTCGGTCCTTGTTGAGAAAATTCATATGGTTAAC
TTGATGGAAGGGCAAGACATTTTAAAATGGACCCTCGATAGTTCCGCGTGCAGACTCTGCCTCAAGGCTGAAGAGAGAATGATTGACGAATGGATGTTTGAAGGCCTGGC
TGCTTGGAATTTGAAAGGAAAGGCCAAGATTATTGCGAGTTGTGCGTTTAGAGCTTCGTTGTGGCTTCTTTGGAAGGAAAGAAACTCAAGATCGTCAAACGAAGATCGGG
ACGACCTACAACCCGAATTTCTCGCATCAATGGAGGCATCCATAGCAGATTGTTGGAGCTATGATTCTCAAACGTGGGATTTGGACTTTAGAAGAGGCCTATTTGATAGG
GAAATTTGCAGCTGGATTGCGTTGGTGGACAAAATTAAAGAGGTGAATTTGGTGTATGACCATGATTTGATTAGATGGAAGTTGGAAGCCTCTGGTAAGTTCTCGACCAA
ATCCATGTTCTATAAGCTGGTCAATGGTTCCCCTAAATTGAAGCAGCCCATGAGTAGACTTATATGGAACCATAAATGCCCTAAAAAAGTTAAAGTTTTCTTATGGTCCC
TAGCCTATAGAAGCTTAAACACTGACGAGAAATTGCAAAAGTTCAGTAAGTGGTCGGTGTCCCCCTCTGCTTGTAGATTGTGCCTTAAAGCTGATGAAAATCTAGACCAC
CTCTTCCTGCACTGTGATTTTGCGAGGTCAGCCTGGAGCTTGGTTGGAAGGCTGCTAGGAATACCCTTTCGTCTGCCTAGGAAAATTGATGATTGGCTCTTGGAAGGTCT
GAATGCGTGGAACCTTAAGAGTAAGGCGATGATTTTGGCTAGTTGTGCTTTCAAGACTACTCTTTGGACCTTATGGAAAGAAAGAAATGCTAGAACCTTTGAAGACAAGA
GTAATAGCTTCGATTCTTTTTGTGATATTGTACAAAATACGGCCACTTGGTAG
Protein sequence
MESDLSFSSPASIESDSGKDFRLSRMEAEDSDIPEGYQLCFSNDFDQDRSSIPQSMEESVEMPKEVPSRNKDEEVQLLRASPELRTTGKAVLEKHQMISPEALAMIPSEN
EERISTNTDHGELFLSKELILTLRRNNLCIRPIVGSSAKKGNSSKKKRNREVTNLLRTWEKEVEPIIEEEIDHDSVVEESRLSAIDSRIVKSVWSSRHIAWVALDAVSSA
GGWKSDKLKRRRGNRVWVDIAVLYPFFDQFSTLTVKCGTNIRFWEDRWCDNQSLKALIPNLYRLSGKKEATIADCWIVEHQTWDLSFRRGLLDREVSSWSVLVEKIHMVN
LMEGQDILKWTLDSSACRLCLKAEERMIDEWMFEGLAAWNLKGKAKIIASCAFRASLWLLWKERNSRSSNEDRDDLQPEFLASMEASIADCWSYDSQTWDLDFRRGLFDR
EICSWIALVDKIKEVNLVYDHDLIRWKLEASGKFSTKSMFYKLVNGSPKLKQPMSRLIWNHKCPKKVKVFLWSLAYRSLNTDEKLQKFSKWSVSPSACRLCLKADENLDH
LFLHCDFARSAWSLVGRLLGIPFRLPRKIDDWLLEGLNAWNLKSKAMILASCAFKTTLWTLWKERNARTFEDKSNSFDSFCDIVQNTATW