; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028701 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028701
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr8:28665635..28666453
RNA-Seq ExpressionLag0028701
SyntenyLag0028701
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_018821989.1 uncharacterized protein LOC108992010 [Juglans regia]6.9e-7053.39Show/hide
Query:  RAEFHPNRGLRQGDPLSPYLFLICAEGLSSILT-SRGKMDLSGLRINNDSPSISHLFYVDDSLLFFKAVEKDCRTIKSLLHLYERASGQTINFEKSAFMV
        +  F P+RGLRQGDPLSPYLFL+C EGL S++  +  + DLS ++I   +P ISHL + DDSLLF KA  ++ + I+SLL+ YE +SGQ IN  K+  + 
Subjt:  RAEFHPNRGLRQGDPLSPYLFLICAEGLSSILT-SRGKMDLSGLRINNDSPSISHLFYVDDSLLFFKAVEKDCRTIKSLLHLYERASGQTINFEKSAFMV

Query:  SPNTKPDQMEKLKDMLGVEYKSNLGQYLGLPSQIGRSKKEVFDNIKDRVWKALQGWKGKFFFAAGRENLIKSVAQAIPNYAMSCFKLPKSLCNDLNAICA
        S N +    E L +  GVE      +YLGLPS +GRSK   F  IK RVW+ LQGWK K   A G+E LIK+VAQAIP Y+MSCFKLP SLCN+L  + A
Subjt:  SPNTKPDQMEKLKDMLGVEYKSNLGQYLGLPSQIGRSKKEVFDNIKDRVWKALQGWKGKFFFAAGRENLIKSVAQAIPNYAMSCFKLPKSLCNDLNAICA

Query:  RFWWGANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSWRII
        +FWWG     +KIH  SW+K+   K  GG+GF+D+ IFN ALLAKQ W II
Subjt:  RFWWGANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSWRII

XP_021762976.1 uncharacterized protein LOC110727703 [Chenopodium quinoa]1.1e-7049.81Show/hide
Query:  ICKFPGSAQWDTRAE--FHPNRGLRQGDPLSPYLFLICAEGLSSILTSRGKMDL-SGLRINNDSPSISHLFYVDDSLLFFKAVEKDCRTIKSLLHLYERA
        +C    S +W+ +      P RGLRQGDP+SPYLFL+CAE  SS+L      +L SG R+   +P ISH+F+ DDS+LF +A  ++C  I S++  YERA
Subjt:  ICKFPGSAQWDTRAE--FHPNRGLRQGDPLSPYLFLICAEGLSSILTSRGKMDL-SGLRINNDSPSISHLFYVDDSLLFFKAVEKDCRTIKSLLHLYERA

Query:  SGQTINFEKSAFMVSPNTKPDQMEKLKDMLGVEYKSNLGQYLGLPSQIGRSKKEVFDNIKDRVWKALQGWKGKFFFAAGRENLIKSVAQAIPNYAMSCFK
        SGQ INF+KS    S N + D+  ++  +L V+  S   +YLGLP+ IGRSKK VF  +K+R WK LQGWK +    AG+E LIK+VAQAIP Y M  F+
Subjt:  SGQTINFEKSAFMVSPNTKPDQMEKLKDMLGVEYKSNLGQYLGLPSQIGRSKKEVFDNIKDRVWKALQGWKGKFFFAAGRENLIKSVAQAIPNYAMSCFK

Query:  LPKSLCNDLNAICARFWWGANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSWRII
        +P  + N++N++ A+FWWG+  + RKIH  SW+KLC  K +GG+GFRD+  FNQALLAKQ WR++
Subjt:  LPKSLCNDLNAICARFWWGANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSWRII

XP_030931068.1 uncharacterized protein LOC115956947 [Quercus lobata]3.1e-7051.61Show/hide
Query:  FHPNRGLRQGDPLSPYLFLICAEGLSSIL-TSRGKMDLSGLRINNDSPSISHLFYVDDSLLFFKAVEKDCRTIKSLLHLYERASGQTINFEKSAFMVSPN
        FHP+RGLRQGDPLSPYLFL+CAEGL S++  +    +L G+ +  + P I+HLF+ DDSLLF +A E DC+ + ++L +YE AS Q IN  K+    S N
Subjt:  FHPNRGLRQGDPLSPYLFLICAEGLSSIL-TSRGKMDLSGLRINNDSPSISHLFYVDDSLLFFKAVEKDCRTIKSLLHLYERASGQTINFEKSAFMVSPN

Query:  TKPDQMEKLKDMLGVEYKSNLGQYLGLPSQIGRSKKEVFDNIKDRVWKALQGWKGKFFFAAGRENLIKSVAQAIPNYAMSCFKLPKSLCNDLNAICARFW
        T  D   ++K ++GVE  +   +YLG+PS +GR+K+E F  I++RVW  +QGWK K    AGRE LIK+V QA+P + M CFKLPK+LC D+ A+  +FW
Subjt:  TKPDQMEKLKDMLGVEYKSNLGQYLGLPSQIGRSKKEVFDNIKDRVWKALQGWKGKFFFAAGRENLIKSVAQAIPNYAMSCFKLPKSLCNDLNAICARFW

Query:  WGANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSWRII
        WG +   RKIH  +W+KLC  K  GGLGFR++  FN ALL KQ WR+I
Subjt:  WGANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSWRII

XP_030939975.1 uncharacterized protein LOC115964883 [Quercus lobata]3.1e-7049.8Show/hide
Query:  DTRAEFHPNRGLRQGDPLSPYLFLICAEGLSSIL-TSRGKMDLSGLRINNDSPSISHLFYVDDSLLFFKAVEKDCRTIKSLLHLYERASGQTINFEKSAF
        +    F PNRGLRQGDPLSPYLFL+CAEGL S++  +     + G+ + +  P +SHLF+ DDSLLF +A  ++  +I  +L  YE ASGQ IN EK+  
Subjt:  DTRAEFHPNRGLRQGDPLSPYLFLICAEGLSSIL-TSRGKMDLSGLRINNDSPSISHLFYVDDSLLFFKAVEKDCRTIKSLLHLYERASGQTINFEKSAF

Query:  MVSPNTKPDQMEKLKDMLGVEYKSNLGQYLGLPSQIGRSKKEVFDNIKDRVWKALQGWKGKFFFAAGRENLIKSVAQAIPNYAMSCFKLPKSLCNDLNAI
          SPNT P   E++K +LGV   +N  +YLGLPS +GR KK+ F  I++R+W  +QGWK +     GRE LIK+V QA+P + M CFK+PKSLC D+ ++
Subjt:  MVSPNTKPDQMEKLKDMLGVEYKSNLGQYLGLPSQIGRSKKEVFDNIKDRVWKALQGWKGKFFFAAGRENLIKSVAQAIPNYAMSCFKLPKSLCNDLNAI

Query:  CARFWWGANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSWRII
          +FWWG     RKIH   W+KLC  K+ GGLGF+D+ +FN A+L KQ WR+I
Subjt:  CARFWWGANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSWRII

XP_030946812.1 uncharacterized protein LOC115971195 [Quercus lobata]1.8e-7050.59Show/hide
Query:  DTRAEFHPNRGLRQGDPLSPYLFLICAEGLSSILTS-RGKMDLSGLRINNDSPSISHLFYVDDSLLFFKAVEKDCRTIKSLLHLYERASGQTINFEKSAF
        +    F PNRGLRQGDPLSPYLFL+CAEGL S++        + G+ + +  P +SHLF+ DDSLLF +A  ++  +I  +L  YE ASGQ IN EK+  
Subjt:  DTRAEFHPNRGLRQGDPLSPYLFLICAEGLSSILTS-RGKMDLSGLRINNDSPSISHLFYVDDSLLFFKAVEKDCRTIKSLLHLYERASGQTINFEKSAF

Query:  MVSPNTKPDQMEKLKDMLGVEYKSNLGQYLGLPSQIGRSKKEVFDNIKDRVWKALQGWKGKFFFAAGRENLIKSVAQAIPNYAMSCFKLPKSLCNDLNAI
          SPNT P   E++K +LGV   +N  +YLGLPS +GR KK+ F  I++RVW+ +QGWK +     GRE LIK+V QA+P + M CFKLPKSLC D+ ++
Subjt:  MVSPNTKPDQMEKLKDMLGVEYKSNLGQYLGLPSQIGRSKKEVFDNIKDRVWKALQGWKGKFFFAAGRENLIKSVAQAIPNYAMSCFKLPKSLCNDLNAI

Query:  CARFWWGANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSWRII
          +FWWG     RKIH   W+KLC  K+ GGLGF+D+ +FN A+L KQ WR+I
Subjt:  CARFWWGANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSWRII

TrEMBL top hitse value%identityAlignment
A0A2N9EWI8 Uncharacterized protein3.2e-7352.96Show/hide
Query:  DTRAEFHPNRGLRQGDPLSPYLFLICAEGLSSILTSRGKMD-LSGLRINNDSPSISHLFYVDDSLLFFKAVEKDCRTIKSLLHLYERASGQTINFEKSAF
        D +    P+RGLRQGDPLSPYLFLICAEGLS++L    + + + G+ ++   P +SHLF+ DDSL+F +A E DC  ++ +L LYERASGQ IN +K+A 
Subjt:  DTRAEFHPNRGLRQGDPLSPYLFLICAEGLSSILTSRGKMD-LSGLRINNDSPSISHLFYVDDSLLFFKAVEKDCRTIKSLLHLYERASGQTINFEKSAF

Query:  MVSPNTKPDQMEKLKDMLGVEYKSNLGQYLGLPSQIGRSKKEVFDNIKDRVWKALQGWKGKFFFAAGRENLIKSVAQAIPNYAMSCFKLPKSLCNDLNAI
          S N  P     +  M G    +   +YLGLP  IGRSKK+ F  IKDR+W+ LQGWK KF   AG+E LIK+V QAIP YAMSCFKLP  LC++++ +
Subjt:  MVSPNTKPDQMEKLKDMLGVEYKSNLGQYLGLPSQIGRSKKEVFDNIKDRVWKALQGWKGKFFFAAGRENLIKSVAQAIPNYAMSCFKLPKSLCNDLNAI

Query:  CARFWWGANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSWRII
          RFWWG     RKIH  S +KLC  K  GG+GFRD+  FNQALLA+Q WR++
Subjt:  CARFWWGANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSWRII

A0A2N9FRF7 Reverse transcriptase domain-containing protein8.5e-7451.71Show/hide
Query:  ICKFPGSAQWDTRAEFHPNRGLRQGDPLSPYLFLICAEGLSSILTSRGKM-DLSGLRINNDSPSISHLFYVDDSLLFFKAVEKDCRTIKSLLHLYERASG
        +CK   + +W       P RG+RQGDPLSPYLFLICAEGL+++L +      L+GL +N   P ISHLF+ DDSLLF KA  ++CR + S L++YERASG
Subjt:  ICKFPGSAQWDTRAEFHPNRGLRQGDPLSPYLFLICAEGLSSILTSRGKM-DLSGLRINNDSPSISHLFYVDDSLLFFKAVEKDCRTIKSLLHLYERASG

Query:  QTINFEKSAFMVSPNTKPDQMEKLKDMLGVEYKSNLGQYLGLPSQIGRSKKEVFDNIKDRVWKALQGWKGKFFFAAGRENLIKSVAQAIPNYAMSCFKLP
        Q +N+EK++   S NT     + + + L      +LG+YLGLP  IGR KK+ F  IK +V + L GWKGK    AGRE LIKSVAQA+P Y MSCF+LP
Subjt:  QTINFEKSAFMVSPNTKPDQMEKLKDMLGVEYKSNLGQYLGLPSQIGRSKKEVFDNIKDRVWKALQGWKGKFFFAAGRENLIKSVAQAIPNYAMSCFKLP

Query:  KSLCNDLNAICARFWWGANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSWRII
         SLC +LN++  +FWWG   + RKIH + W KLC  K   G+GFRD+ +F+QALLAKQ WR++
Subjt:  KSLCNDLNAICARFWWGANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSWRII

A0A2N9GL12 Reverse transcriptase domain-containing protein1.6e-7252.85Show/hide
Query:  PNRGLRQGDPLSPYLFLICAEGLSSILTSRGK-MDLSGLRINNDSPSISHLFYVDDSLLFFKAVEKDCRTIKSLLHLYERASGQTINFEKSAFMVSPNTK
        P+RG+RQGDPLSPYLFLICAEG S++L    +   L G+ I    P +SHL + DDSLLF +A +++ RT+  +L LYE +SGQ IN  K++   S NTK
Subjt:  PNRGLRQGDPLSPYLFLICAEGLSSILTSRGK-MDLSGLRINNDSPSISHLFYVDDSLLFFKAVEKDCRTIKSLLHLYERASGQTINFEKSAFMVSPNTK

Query:  PDQMEKLKDMLGVEYKSNLGQYLGLPSQIGRSKKEVFDNIKDRVWKALQGWKGKFFFAAGRENLIKSVAQAIPNYAMSCFKLPKSLCNDLNAICARFWWG
         ++ E+LK++ G +  +   +YLG+P+ IGRSK + F  +K+R+ K LQGW  +F   AGRE LIK+V+Q+IPNY MSCF+LPK LC+D+NA+ A FWWG
Subjt:  PDQMEKLKDMLGVEYKSNLGQYLGLPSQIGRSKKEVFDNIKDRVWKALQGWKGKFFFAAGRENLIKSVAQAIPNYAMSCFKLPKSLCNDLNAICARFWWG

Query:  ANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSWRII
         + +GRK+H  +W +LC+ K  GGLGFRD N FN ALLAKQ WR +
Subjt:  ANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSWRII

A0A2N9H9Z1 Reverse transcriptase domain-containing protein4.6e-7252.96Show/hide
Query:  DTRAEFHPNRGLRQGDPLSPYLFLICAEGLSSILTSRGKMDL-SGLRINNDSPSISHLFYVDDSLLFFKAVEKDCRTIKSLLHLYERASGQTINFEKSAF
        + +    P+RGLRQGDPLSPYLFLICAEGLS++L    + +L  G+ I   SP ISHLF+ DDS++F +A   DC  + + L LYERASGQ +N  K+A 
Subjt:  DTRAEFHPNRGLRQGDPLSPYLFLICAEGLSSILTSRGKMDL-SGLRINNDSPSISHLFYVDDSLLFFKAVEKDCRTIKSLLHLYERASGQTINFEKSAF

Query:  MVSPNTKPDQMEKLKDMLGVEYKSNLGQYLGLPSQIGRSKKEVFDNIKDRVWKALQGWKGKFFFAAGRENLIKSVAQAIPNYAMSCFKLPKSLCNDLNAI
          SPNT  D    +  + G    ++  +YLGLP  +GRSKK  F+ IKDRVW+ LQGWK K    AGRE LIK+V QAIP YAMSCFK P  LC+D+ ++
Subjt:  MVSPNTKPDQMEKLKDMLGVEYKSNLGQYLGLPSQIGRSKKEVFDNIKDRVWKALQGWKGKFFFAAGRENLIKSVAQAIPNYAMSCFKLPKSLCNDLNAI

Query:  CARFWWGANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSWRII
          RFWWG  +  RKIH  S  KL   K+ GG+GFRD+ +FNQALLA+Q WR++
Subjt:  CARFWWGANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSWRII

A0A2N9HJV7 Reverse transcriptase domain-containing protein3.6e-7252.63Show/hide
Query:  HPNRGLRQGDPLSPYLFLICAEGLSSILT-SRGKMDLSGLRINNDSPSISHLFYVDDSLLFFKAVEKDCRTIKSLLHLYERASGQTINFEKSAFMVSPNT
        HP+RGLRQGDPLSPY+FL+CAEGL ++L  +  +  + G+++   +P+ISHLF+ DDS+LF +A  + C TI+ +L  YE ASGQ IN EK+    S NT
Subjt:  HPNRGLRQGDPLSPYLFLICAEGLSSILT-SRGKMDLSGLRINNDSPSISHLFYVDDSLLFFKAVEKDCRTIKSLLHLYERASGQTINFEKSAFMVSPNT

Query:  KPDQMEKLKDMLGVEYKSNLGQYLGLPSQIGRSKKEVFDNIKDRVWKALQGWKGKFFFAAGRENLIKSVAQAIPNYAMSCFKLPKSLCNDLNAICARFWW
        KP+Q + LK+ L V+      +YLGLPS IGRSK++VF +IK+RVWK +QGWK K     G+E LIK+VAQA+P YAMS FKLP S+C+DL+ I +RFWW
Subjt:  KPDQMEKLKDMLGVEYKSNLGQYLGLPSQIGRSKKEVFDNIKDRVWKALQGWKGKFFFAAGRENLIKSVAQAIPNYAMSCFKLPKSLCNDLNAICARFWW

Query:  GANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSWRII
        G     R +H   W ++C  +  GGLGFR++  FNQA+LAK  WR++
Subjt:  GANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSWRII

SwissProt top hitse value%identityAlignment
P11369 LINE-1 retrotransposable element ORF2 protein2.9e-1526.23Show/hide
Query:  GLRQGDPLSPYLFLICAEGLSSILTSRGKMDLSGLRINNDSPSISHLFYVDDSLLFFKAVEKDCRTIKSLLHLYERASGQTINFEKS-AFMVSPNTKPDQ
        G RQG PLSPYLF I  E L+  +  R + ++ G++I  +   IS L   DD +++    +   R + +L++ +    G  IN  KS AF+ + N + + 
Subjt:  GLRQGDPLSPYLFLICAEGLSSILTSRGKMDLSGLRINNDSPSISHLFYVDDSLLFFKAVEKDCRTIKSLLHLYERASGQTINFEKS-AFMVSPNTKPDQ

Query:  MEKLKDMLGVEYKSNLGQYLG--LPSQIGRSKKEVFDNIKDRVWKALQGWKGKFFFAAGRENLIKS--VAQAIPNYAMSCFKLPKSLCNDLNAICARFWW
         +++++       +N  +YLG  L  ++     + F ++K  + + L+ WK       GR N++K   + +AI  +     K+P    N+L     +F W
Subjt:  MEKLKDMLGVEYKSNLGQYLG--LPSQIGRSKKEVFDNIKDRVWKALQGWKGKFFFAAGRENLIKS--VAQAIPNYAMSCFKLPKSLCNDLNAICARFWW

Query:  GANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSW
          N+  R   S     L   +  GG+   D+ ++ +A++ K +W
Subjt:  GANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSW

P92555 Uncharacterized mitochondrial protein AtMg012501.3e-1052.63Show/hide
Query:  PNRGLRQGDPLSPYLFLICAEGLSSIL-TSRGKMDLSGLRINNDSPSISHLFYVDDS
        P+RGLRQGDPLSPYLF++C E LS +   ++ +  L G+R++N+SP I+HL + DD+
Subjt:  PNRGLRQGDPLSPYLFLICAEGLSSIL-TSRGKMDLSGLRINNDSPSISHLFYVDDS

P93295 Uncharacterized mitochondrial protein AtMg003101.1e-1758.44Show/hide
Query:  AIPNYAMSCFKLPKSLCNDLNAICARFWWGANEAGRKIHSRSWRKLC-SHKNMGGLGFRDMNIFNQALLAKQSWRII
        A+P YAMSCF+L K LC  L +    FWW + E  RKI   +W+KLC S ++ GGLGFRD+  FNQALLAKQS+RII
Subjt:  AIPNYAMSCFKLPKSLCNDLNAICARFWWGANEAGRKIHSRSWRKLC-SHKNMGGLGFRDMNIFNQALLAKQSWRII

Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein2.1e-1642.11Show/hide
Query:  AIPNYAMSCFKLPKSLCNDLNAICARFWWGANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSWRII
        A+P Y M+CF LPK++C  + ++ A FWW   +  + +H ++W  L  +K  GG+GF+D+  FN ALL KQ WR++
Subjt:  AIPNYAMSCFKLPKSLCNDLNAICARFWWGANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSWRII

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein7.7e-1958.44Show/hide
Query:  AIPNYAMSCFKLPKSLCNDLNAICARFWWGANEAGRKIHSRSWRKLC-SHKNMGGLGFRDMNIFNQALLAKQSWRII
        A+P YAMSCF+L K LC  L +    FWW + E  RKI   +W+KLC S ++ GGLGFRD+  FNQALLAKQS+RII
Subjt:  AIPNYAMSCFKLPKSLCNDLNAICARFWWGANEAGRKIHSRSWRKLC-SHKNMGGLGFRDMNIFNQALLAKQSWRII

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)9.1e-1252.63Show/hide
Query:  PNRGLRQGDPLSPYLFLICAEGLSSIL-TSRGKMDLSGLRINNDSPSISHLFYVDDS
        P+RGLRQGDPLSPYLF++C E LS +   ++ +  L G+R++N+SP I+HL + DD+
Subjt:  PNRGLRQGDPLSPYLFLICAEGLSSIL-TSRGKMDLSGLRINNDSPSISHLFYVDDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGAACAAAATTATGAAGTGTGTGGAATCTGTAAGTTTCCAGGTTCTGCTCAATGGGATACCCGAGCTGAGTTCCACCCGAATAGAGGCCTTAGGCAAGGAGACCC
TTTATCTCCATATTTGTTCTTGATTTGTGCTGAAGGCTTGTCCAGTATTCTCACAAGCAGAGGGAAGATGGACCTATCAGGTTTGCGTATCAATAACGACTCTCCCTCTA
TATCTCACCTCTTTTATGTTGATGACAGTCTTTTGTTCTTTAAAGCTGTGGAAAAAGATTGCAGGACCATTAAAAGTCTCCTTCATCTATATGAAAGGGCTTCGGGGCAG
ACCATTAATTTTGAGAAATCAGCTTTCATGGTTAGCCCAAACACAAAGCCAGATCAGATGGAAAAGCTTAAAGACATGCTTGGAGTAGAATACAAAAGCAATCTGGGGCA
ATACCTAGGACTTCCGTCACAAATAGGGAGGAGCAAGAAGGAAGTTTTTGACAACATCAAAGATCGTGTTTGGAAAGCTTTACAAGGCTGGAAAGGGAAGTTTTTTTTTG
CTGCTGGAAGAGAAAATCTTATAAAATCTGTAGCTCAAGCAATCCCAAACTATGCTATGAGTTGTTTTAAACTTCCTAAGTCTCTTTGTAATGATCTGAATGCTATTTGT
GCCAGGTTCTGGTGGGGAGCAAATGAAGCAGGAAGGAAAATTCACTCGAGAAGTTGGAGAAAACTTTGCTCACATAAGAACATGGGAGGGTTGGGTTTCCGAGATATGAA
CATCTTCAACCAGGCGTTGCTGGCTAAACAGAGCTGGAGAATCATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACGAACAAAATTATGAAGTGTGTGGAATCTGTAAGTTTCCAGGTTCTGCTCAATGGGATACCCGAGCTGAGTTCCACCCGAATAGAGGCCTTAGGCAAGGAGACCC
TTTATCTCCATATTTGTTCTTGATTTGTGCTGAAGGCTTGTCCAGTATTCTCACAAGCAGAGGGAAGATGGACCTATCAGGTTTGCGTATCAATAACGACTCTCCCTCTA
TATCTCACCTCTTTTATGTTGATGACAGTCTTTTGTTCTTTAAAGCTGTGGAAAAAGATTGCAGGACCATTAAAAGTCTCCTTCATCTATATGAAAGGGCTTCGGGGCAG
ACCATTAATTTTGAGAAATCAGCTTTCATGGTTAGCCCAAACACAAAGCCAGATCAGATGGAAAAGCTTAAAGACATGCTTGGAGTAGAATACAAAAGCAATCTGGGGCA
ATACCTAGGACTTCCGTCACAAATAGGGAGGAGCAAGAAGGAAGTTTTTGACAACATCAAAGATCGTGTTTGGAAAGCTTTACAAGGCTGGAAAGGGAAGTTTTTTTTTG
CTGCTGGAAGAGAAAATCTTATAAAATCTGTAGCTCAAGCAATCCCAAACTATGCTATGAGTTGTTTTAAACTTCCTAAGTCTCTTTGTAATGATCTGAATGCTATTTGT
GCCAGGTTCTGGTGGGGAGCAAATGAAGCAGGAAGGAAAATTCACTCGAGAAGTTGGAGAAAACTTTGCTCACATAAGAACATGGGAGGGTTGGGTTTCCGAGATATGAA
CATCTTCAACCAGGCGTTGCTGGCTAAACAGAGCTGGAGAATCATTTGA
Protein sequenceShow/hide protein sequence
MDEQNYEVCGICKFPGSAQWDTRAEFHPNRGLRQGDPLSPYLFLICAEGLSSILTSRGKMDLSGLRINNDSPSISHLFYVDDSLLFFKAVEKDCRTIKSLLHLYERASGQ
TINFEKSAFMVSPNTKPDQMEKLKDMLGVEYKSNLGQYLGLPSQIGRSKKEVFDNIKDRVWKALQGWKGKFFFAAGRENLIKSVAQAIPNYAMSCFKLPKSLCNDLNAIC
ARFWWGANEAGRKIHSRSWRKLCSHKNMGGLGFRDMNIFNQALLAKQSWRII