; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0009211 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0009211
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr9:36854205..36858948
RNA-Seq ExpressionLag0009211
SyntenyLag0009211
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAM93462.1 putative reverse transcriptase [Oryza sativa Japonica Group]1.7e-3733.44Show/hide
Query:  EGKRKGKAGHIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSA-------
        + KR GK G+ AIKLDMSKAYDRVEW F+  +++++GF E W+  IM+C+ TV Y + VNG+  E  KP++GLRQGDP+SPY+F+I AE FSA       
Subjt:  EGKRKGKAGHIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSA-------

Query:  ----LLIREESLNN-------------------------------------------------FKGLK----ALLAKASWRILKNPNSLISQMLRGK---
            LL   ES+ N                                                 F+ L     A+LA+  WR+++NP+SL S++L+ K   
Subjt:  ----LLIREESLNN-------------------------------------------------FKGLK----ALLAKASWRILKNPNSLISQMLRGK---

Query:  ------------------------DLFTKGIRWKVGDGRHIMIDQDPWIALEGSEVPSLTNEELRGKRVCEIID-DHGAWIEDKVRGSFSAIDAEVILNT
                                 L  KG+ W+VG+G HI I  DPW+  +   V +     L   RVCE+ID   G W  + ++  F+  DA++I   
Subjt:  ------------------------DLFTKGIRWKVGDGRHIMIDQDPWIALEGSEVPSLTNEELRGKRVCEIID-DHGAWIEDKVRGSFSAIDAEVILNT

Query:  PVGGEGRRDEIIWNIKKEGL
        P+  EG+ D I W    +G+
Subjt:  PVGGEGRRDEIIWNIKKEGL

GAU47878.1 hypothetical protein TSUD_404500 [Trifolium subterraneum]8.9e-3431.49Show/hide
Query:  KRKGKAGHIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREES--
        K KG+ G +A+K+D+SKAYD+V+W F+R+VM +MGF++ W   +M C+ +V YSVL+N +      P +GLRQG P+SPY+F++V E   AL+ +     
Subjt:  KRKGKAGHIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREES--

Query:  ------------------------------LNNFKGL-KALLAKASWRILKNPNSLISQMLR---------------------------GKDLFTKGIRW
                                        NF+   KA++AK  W I++NPNSL++++++                            + + + G  W
Subjt:  ------------------------------LNNFKGL-KALLAKASWRILKNPNSLISQMLR---------------------------GKDLFTKGIRW

Query:  KVGDGRHIMIDQDPWIALEGSE---VPSLTNEELRGKRVCEII-DDHGAWIEDKVRGSFSAIDAEVILNTPVGGEGRRDEIIWNIKKEG
        ++G G +I + QDPW  L GS+   V SL    +    V +++ +++ AW   KVR  FS   AE IL TP+    R D+++W  ++ G
Subjt:  KVGDGRHIMIDQDPWIALEGSE---VPSLTNEELRGKRVCEII-DDHGAWIEDKVRGSFSAIDAEVILNTPVGGEGRRDEIIWNIKKEG

KAA3477433.1 reverse transcriptase [Gossypium australe]3.1e-3431.37Show/hide
Query:  KRKGKAGHIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREESLN
        KR  K G +A+KLDMSKAYDRVEW F++ V+ QMGFAE+W   +M C+ TV Y+V +N    + F+P +GL+QGDP+SPY+FLI +EG SAL+   +   
Subjt:  KRKGKAGHIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREESLN

Query:  NFKGLKA------------------------------------------------------LLAKASWRILKNPNSLISQMLR-----------------
          KG+KA                                                      LLAK  WRI+ N NSL++++L+                 
Subjt:  NFKGLKA------------------------------------------------------LLAKASWRILKNPNSLISQMLR-----------------

Query:  ----------GKDLFTKGIRWKVGDGRHIMIDQDPWIALEGSEVPSLTNEELRGKRVCEIIDDH-GAWIEDKVRGSFSAIDAEVILNTPVGGEGRRDEII
                   KD   KG+ W+VG G +I ID D WI    +   S   + +R      +ID++   W  + ++ +F+  D E IL  P+  +   D + 
Subjt:  ----------GKDLFTKGIRWKVGDGRHIMIDQDPWIALEGSEVPSLTNEELRGKRVCEIIDDH-GAWIEDKVRGSFSAIDAEVILNTPVGGEGRRDEII

Query:  WNIKKE
        W +  +
Subjt:  WNIKKE

XP_024162452.1 uncharacterized protein LOC112169623 [Rosa chinensis]2.1e-3531.27Show/hide
Query:  KRKGKAGHIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREESL-
        KR+G+ G++A+ LD+SKAYDR+EW F+RK++ + GFA  W   +M C+ +V YS LV G+P+    P++GLRQGDP+SPY+FL+ AEGFS  L +++ L 
Subjt:  KRKGKAGHIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREESL-

Query:  -----------------------NNFKGLK------------ALLAKASWRILKNPNSLISQMLRG---------------------------KDLFTKG
                                 F+ +K             LL+ A+WR++ NP SL++Q+ +                            +D    G
Subjt:  -----------------------NNFKGLK------------ALLAKASWRILKNPNSLISQMLRG---------------------------KDLFTKG

Query:  IRWKVGDGRHIMIDQDPWIALEGSEVP--SLTNEELRGKRVCEIIDDHGAWIEDKVRGSFSAIDAEVILNTPVGGEGRRDEIIWNIKKEGL
          W+VG+G ++ +  D WI    +  P  +L ++      V E+I     W E KVR  F+  D E IL  P+      D + W+ +++G+
Subjt:  IRWKVGDGRHIMIDQDPWIALEGSEVP--SLTNEELRGKRVCEIIDDHGAWIEDKVRGSFSAIDAEVILNTPVGGEGRRDEIIWNIKKEGL

XP_024195622.1 uncharacterized protein LOC112198734 [Rosa chinensis]8.6e-3733.33Show/hide
Query:  KRKGKAGHIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREESLN
        KR+G  G +A+KLD+SKAYDR+E  F+RKVM++ GFA  W   +M C+ +V +S L+ G+P+    P++GLRQGDP+SPY+FLI AEGFSALL +++ L 
Subjt:  KRKGKAGHIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREESLN

Query:  NFKGLKALL-----------------AKASWRILKNPNSLISQMLRGK---------------------------DLFTKGIRWKVGDGRHIMIDQDPWI
           G++A L                  + +WRI+  P SLI+ + + K                           +L   G  W++G G  + +  D W+
Subjt:  NFKGLKALL-----------------AKASWRILKNPNSLISQMLRGK---------------------------DLFTKGIRWKVGDGRHIMIDQDPWI

Query:  ALEGSEVPSLTNEELRGK-RVCEIIDDHGAWIEDKVRGSFSAIDAEVILNTPVGGEGRRDEIIWNIKKEG
            + VP +T  ++     V E++   G W E++VR  F+ ++A+ IL  P+      D + W ++  G
Subjt:  ALEGSEVPSLTNEELRGK-RVCEIIDDHGAWIEDKVRGSFSAIDAEVILNTPVGGEGRRDEIIWNIKKEG

TrEMBL top hitse value%identityAlignment
A0A2N9FNX2 Reverse transcriptase domain-containing protein3.0e-3530Show/hide
Query:  KRKGKAGHIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREE---
        K KGK G +A+KLDMSKAYDRVEWVF+  VM+++GFAE+W   IM C+ TV YSVL+NG     F  ++G+RQGD +SPY+FLI AEG S+LL R +   
Subjt:  KRKGKAGHIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREE---

Query:  ------------------------------------------------------------------------SLNN------------------------
                                                                                 LNN                        
Subjt:  ------------------------------------------------------------------------SLNN------------------------

Query:  ----------FKGLK----ALLAKASWRILKNPNSLISQMLRGK---------------------------DLFTKGIRWKVGDGRHIMIDQDPWIALEG
                  F+ LK    ALLAK  WRIL+ P SL++++ + K                           ++   G+RW +G+GR + I  DPW+ L+ 
Subjt:  ----------FKGLK----ALLAKASWRILKNPNSLISQMLRGK---------------------------DLFTKGIRWKVGDGRHIMIDQDPWIALEG

Query:  S----EVPSLTNEELRGKRVCEIIDDHGAWIEDKVRGSFSAIDAEVILNTPVGGEGRRDEIIWNIKKEGL
        S     VP + + E     +  I DD+  W  +KVR  FS  +A  I++ P+    RRD + W+  K GL
Subjt:  S----EVPSLTNEELRGKRVCEIIDDHGAWIEDKVRGSFSAIDAEVILNTPVGGEGRRDEIIWNIKKEGL

A0A2N9IVJ2 Uncharacterized protein3.9e-4337.37Show/hide
Query:  KRKGKAGHIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREESLN
        K KGK G +A+KLDMSKAYDRVEWVF+  VM+++GFAE+W   IM C+ TV YSVL+NG     F  ++G+RQGD +SPY+FLI  EG S LL R +   
Subjt:  KRKGKAGHIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREESLN

Query:  NFKG---------LK-----------------------ALLAKASWRILKNPNSLISQMLRGK---------------------------DLFTKGIRWK
          KG         LK                       ALLAK  WRIL+ P SL++++ + K                             ++ G+RW 
Subjt:  NFKG---------LK-----------------------ALLAKASWRILKNPNSLISQMLRGK---------------------------DLFTKGIRWK

Query:  VGDGRHIMIDQDPWIALEGS----EVPSLTNEELRGKRVCEIIDDHGAWIEDKVRGSFSAIDAEVILNTPVGGEGRRDEIIWNIKKEGL
        +G+GR + I  DPW+ L+ S     VP + + E     +  I DD+  W  +KVR  FS  +A  I++ P+    RRD + W+  K GL
Subjt:  VGDGRHIMIDQDPWIALEGS----EVPSLTNEELRGKRVCEIIDDHGAWIEDKVRGSFSAIDAEVILNTPVGGEGRRDEIIWNIKKEGL

A0A803NJ60 Uncharacterized protein7.6e-3938.94Show/hide
Query:  KRKGKAGHIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREESLN
        K  G+ G  A+KLDMSKA+DRVEW F++ VM++MGFA+ W+L IM C+ T +++ L+NGE   S  P KGLRQG P+SPY+FLI +EG   LL  EE L 
Subjt:  KRKGKAGHIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREESLN

Query:  NFKGLKALLAKASWRILKNPNSLISQMLRGKDLFTKGIRWKVGDGRHIMIDQDPWIALEGSEVPSLTNEELRGKRVCEIIDDHGAWIEDKVRGSFSAIDA
        N  G+                      L G++L    +RWK+G+GRHI    DPWI    + +P+    +     V  +I +   W +  +   FS+ID 
Subjt:  NFKGLKALLAKASWRILKNPNSLISQMLRGKDLFTKGIRWKVGDGRHIMIDQDPWIALEGSEVPSLTNEELRGKRVCEIIDDHGAWIEDKVRGSFSAIDA

Query:  EVILNTPVGGEGRRDEIIWNIKKEGL
        E IL  P+      D++IW+    G+
Subjt:  EVILNTPVGGEGRRDEIIWNIKKEGL

A0A803NU77 Uncharacterized protein8.7e-3531.05Show/hide
Query:  RKGKAGHIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREESLNN
        ++GK G+ AIKLDMSKA+DRVEW F++ +M  +GF       I  CI +V +S L+N   Q    P++G+RQGDP+SPY+F+I AEG S LL  EE+  N
Subjt:  RKGKAGHIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREESLNN

Query:  FKGLK-------------------------------------------------------ALLAKASWRILKNPNSLISQMLR-----------------
         +GLK                                                       A+LAK +WR+  NP SL+S++L+                 
Subjt:  FKGLK-------------------------------------------------------ALLAKASWRILKNPNSLISQMLR-----------------

Query:  ----------GKDLFTKGIRWKVGDGRHIMIDQDPWIALEGSEVPSLTNEELRGKRVCEIIDDHGAWIEDKVRGSFSAIDAEVILNTPVGGEGRRDEIIW
                  GK+L  KG+RWKVGDG  I+   DPWI    +  P  +      ++V  +I     W  + +  +F   D + IL  P+      D++IW
Subjt:  ----------GKDLFTKGIRWKVGDGRHIMIDQDPWIALEGSEVPSLTNEELRGKRVCEIIDDHGAWIEDKVRGSFSAIDAEVILNTPVGGEGRRDEIIW

Query:  NIKKEG
        + +  G
Subjt:  NIKKEG

Q8LMV3 Putative reverse transcriptase8.4e-3833.44Show/hide
Query:  EGKRKGKAGHIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSA-------
        + KR GK G+ AIKLDMSKAYDRVEW F+  +++++GF E W+  IM+C+ TV Y + VNG+  E  KP++GLRQGDP+SPY+F+I AE FSA       
Subjt:  EGKRKGKAGHIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSA-------

Query:  ----LLIREESLNN-------------------------------------------------FKGLK----ALLAKASWRILKNPNSLISQMLRGK---
            LL   ES+ N                                                 F+ L     A+LA+  WR+++NP+SL S++L+ K   
Subjt:  ----LLIREESLNN-------------------------------------------------FKGLK----ALLAKASWRILKNPNSLISQMLRGK---

Query:  ------------------------DLFTKGIRWKVGDGRHIMIDQDPWIALEGSEVPSLTNEELRGKRVCEIID-DHGAWIEDKVRGSFSAIDAEVILNT
                                 L  KG+ W+VG+G HI I  DPW+  +   V +     L   RVCE+ID   G W  + ++  F+  DA++I   
Subjt:  ------------------------DLFTKGIRWKVGDGRHIMIDQDPWIALEGSEVPSLTNEELRGKRVCEIID-DHGAWIEDKVRGSFSAIDAEVILNT

Query:  PVGGEGRRDEIIWNIKKEGL
        P+  EG+ D I W    +G+
Subjt:  PVGGEGRRDEIIWNIKKEGL

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein6.7e-0827.66Show/hide
Query:  HIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCI-GTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREESLNNFKGLK
        H+ I +D  KA+D+++  F+ K + ++G  +   LKI+  I      ++++NG+  E+F    G RQG P+SP +F IV E  +  + +E+ +   +  K
Subjt:  HIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCI-GTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREESLNNFKGLK

Query:  -----ALLAKASWRILKNPNSLISQMLRGKDLFTKGIRWKV
             +L A      L+NP      +L+    F+K   +K+
Subjt:  -----ALLAKASWRILKNPNSLISQMLRGKDLFTKGIRWKV

P08548 LINE-1 reverse transcriptase homolog7.4e-0734.07Show/hide
Query:  HIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCI-GTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREE
        H+ + +D  KA+D ++  F+ + ++++G  E   LK+++ I      ++++NG   +SF    G RQG P+SP +F IV E   A+ IREE
Subjt:  HIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCI-GTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREE

P11369 LINE-1 retrotransposable element ORF2 protein7.9e-0925.69Show/hide
Query:  HIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREESLNNFKGLK-
        H+ I LD  KA+D+++  F+ KV+++ G    +   I         ++ VNGE  E+     G RQG P+SPY+F IV E  +  + +++ +   +  K 
Subjt:  HIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREESLNNFKGLK-

Query:  ----ALLAKASWRILKNPNSLISQMLRGKDLFTKGIRWKVGDGR
            +LLA      + +P +   ++L   + F + + +K+   +
Subjt:  ----ALLAKASWRILKNPNSLISQMLRGKDLFTKGIRWKVGDGR

P92555 Uncharacterized mitochondrial protein AtMg012502.1e-0632.94Show/hide
Query:  LVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREESLNNFKGLKALLAKASWRILKNPNSLISQMLRGKDLFTKGIRW
        ++NG PQ    P++GLRQGDP+SPY+F++  E  S L  R +      G++          + N +  I+ +L   D  T   RW
Subjt:  LVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREESLNNFKGLKALLAKASWRILKNPNSLISQMLRGKDLFTKGIRW

Q05118 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)6.9e-0530.59Show/hide
Query:  KGKAGHIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAE
        KGK  ++ + LD+ KA+D V    + + M+  G  +     IM  I     +++V G          G++QGDP+SP +F IV +
Subjt:  KGKAGHIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNGEPQESFKPNKGLRQGDPISPYIFLIVAE

Arabidopsis top hitse value%identityAlignment
ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.5e-0732.94Show/hide
Query:  LVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREESLNNFKGLKALLAKASWRILKNPNSLISQMLRGKDLFTKGIRW
        ++NG PQ    P++GLRQGDP+SPY+F++  E  S L  R +      G++          + N +  I+ +L   D  T   RW
Subjt:  LVNGEPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREESLNNFKGLKALLAKASWRILKNPNSLISQMLRGKDLFTKGIRW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGTGGCTAATGAAGAAAGATCTGACGAGGAGGGAGGAGAGACCTCCCACCTCCTCCGTTTCCGGTCGGAGGGGGCGAGCGATCAACCGCCGGCCGTCGGAGATTT
GAAGAAGAAGATGGAGGGGTTGAGGGAAGAAGATGAAGAAGGCAAGAGGAAGGGCAAGGCAGGCCACATCGCTATAAAATTAGATATGAGCAAGGCCTATGACCGGGTGG
AATGGGTGTTCGTGAGAAAGGTTATGCAGCAAATGGGCTTTGCCGAGGATTGGAGTTTGAAGATCATGGATTGCATTGGAACGGTGGAATACTCGGTTTTAGTTAATGGA
GAGCCCCAGGAGTCTTTCAAACCAAACAAGGGTTTACGCCAAGGAGATCCGATATCACCTTATATTTTCCTTATAGTTGCAGAAGGCTTTTCTGCTCTTCTTATCAGGGA
AGAATCTTTGAACAATTTTAAAGGTCTTAAGGCGCTACTTGCCAAGGCTAGCTGGAGGATTTTAAAAAACCCTAATAGCCTCATTTCCCAGATGCTTAGAGGAAAAGACC
TATTCACAAAAGGAATCCGTTGGAAAGTGGGTGATGGCAGGCATATAATGATAGATCAAGACCCCTGGATTGCATTGGAAGGTAGTGAAGTGCCATCCCTGACCAACGAG
GAGCTTAGAGGCAAGAGAGTCTGCGAGATTATCGATGATCATGGAGCTTGGATTGAGGACAAAGTGAGAGGCTCCTTTTCAGCCATAGATGCTGAAGTTATTCTCAATAC
CCCTGTAGGTGGGGAAGGCAGAAGAGATGAGATTATCTGGAATATAAAAAAAGAAGGGCTTGTTCACGGTCCTTCGAGGAGACCACAAAGCACTCTTTATGGGATTGTAA
GCCTAAACCCTAAACAGGGGATCCTGGAATCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGAGTGGCTAATGAAGAAAGATCTGACGAGGAGGGAGGAGAGACCTCCCACCTCCTCCGTTTCCGGTCGGAGGGGGCGAGCGATCAACCGCCGGCCGTCGGAGATTT
GAAGAAGAAGATGGAGGGGTTGAGGGAAGAAGATGAAGAAGGCAAGAGGAAGGGCAAGGCAGGCCACATCGCTATAAAATTAGATATGAGCAAGGCCTATGACCGGGTGG
AATGGGTGTTCGTGAGAAAGGTTATGCAGCAAATGGGCTTTGCCGAGGATTGGAGTTTGAAGATCATGGATTGCATTGGAACGGTGGAATACTCGGTTTTAGTTAATGGA
GAGCCCCAGGAGTCTTTCAAACCAAACAAGGGTTTACGCCAAGGAGATCCGATATCACCTTATATTTTCCTTATAGTTGCAGAAGGCTTTTCTGCTCTTCTTATCAGGGA
AGAATCTTTGAACAATTTTAAAGGTCTTAAGGCGCTACTTGCCAAGGCTAGCTGGAGGATTTTAAAAAACCCTAATAGCCTCATTTCCCAGATGCTTAGAGGAAAAGACC
TATTCACAAAAGGAATCCGTTGGAAAGTGGGTGATGGCAGGCATATAATGATAGATCAAGACCCCTGGATTGCATTGGAAGGTAGTGAAGTGCCATCCCTGACCAACGAG
GAGCTTAGAGGCAAGAGAGTCTGCGAGATTATCGATGATCATGGAGCTTGGATTGAGGACAAAGTGAGAGGCTCCTTTTCAGCCATAGATGCTGAAGTTATTCTCAATAC
CCCTGTAGGTGGGGAAGGCAGAAGAGATGAGATTATCTGGAATATAAAAAAAGAAGGGCTTGTTCACGGTCCTTCGAGGAGACCACAAAGCACTCTTTATGGGATTGTAA
GCCTAAACCCTAAACAGGGGATCCTGGAATCTTAG
Protein sequenceShow/hide protein sequence
MRVANEERSDEEGGETSHLLRFRSEGASDQPPAVGDLKKKMEGLREEDEEGKRKGKAGHIAIKLDMSKAYDRVEWVFVRKVMQQMGFAEDWSLKIMDCIGTVEYSVLVNG
EPQESFKPNKGLRQGDPISPYIFLIVAEGFSALLIREESLNNFKGLKALLAKASWRILKNPNSLISQMLRGKDLFTKGIRWKVGDGRHIMIDQDPWIALEGSEVPSLTNE
ELRGKRVCEIIDDHGAWIEDKVRGSFSAIDAEVILNTPVGGEGRRDEIIWNIKKEGLVHGPSRRPQSTLYGIVSLNPKQGILES