; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0040968 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0040968
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr13:10364449..10365480
RNA-Seq ExpressionLag0040968
SyntenyLag0040968
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005135 - Endonuclease/exonuclease/phosphatase
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022157437.1 uncharacterized protein LOC111024135 [Momordica charantia]4.5e-6642.65Show/hide
Query:  MKILSWNVQGLGNPRTFRALRHRIRSYHPQLVFLMETKLNGNKGDKLIRDLQFDCGLTVPSEGKSGGLVLLWQNETDVSIRSYSKGHIDATVKERNWQWR
        MK L WNV GLGNP TFR LR+ +R   PQLVFL ETK N     +  R+L FDC ++V S GKSGGL+LLW ++++V I+S S GHID+ + ++   WR
Subjt:  MKILSWNVQGLGNPRTFRALRHRIRSYHPQLVFLMETKLNGNKGDKLIRDLQFDCGLTVPSEGKSGGLVLLWQNETDVSIRSYSKGHIDATVKERNWQWR

Query:  FSGIYGNPNRELHHTTWTLMNRLKGNGNMPWIIGGDFNEIISQSEKKGGRARSEMDMQEFRDTIDRCELYDPGYIVADSTWCNNHFNSEVVWERLDRFLI
        F+G YGNP       +W L+ RL    ++PWIIGGDFNEI+S +EK GG  R+E  M+        C                       +WERLDRFLI
Subjt:  FSGIYGNPNRELHHTTWTLMNRLKGNGNMPWIIGGDFNEIISQSEKKGGRARSEMDMQEFRDTIDRCELYDPGYIVADSTWCNNHFNSEVVWERLDRFLI

Query:  NCDMQNKCSILKVHHLPLIASDHRPIMVEWYEDRSDPRMPVSRRPKRFEEMWVKYEECKDIVQDVWQERREDGSGSIVDKTRICLERLGNWSRYRYGGSI
        N  M NKC  LKV HL L++SDHRPI+  W  +         +R  RFEE W++ + C+DI+   W      G  +   K   CL RL  W++ R   S+
Subjt:  NCDMQNKCSILKVHHLPLIASDHRPIMVEWYEDRSDPRMPVSRRPKRFEEMWVKYEECKDIVQDVWQERREDGSGSIVDKTRICLERLGNWSRYRYGGSI

Query:  RGAIARKETELRQLNSVRDVQSLNVMREKE---KELETLL
        +GAIA KE EL +L  + D  S N +  K+   +E+E  L
Subjt:  RGAIARKETELRQLNSVRDVQSLNVMREKE---KELETLL

XP_030957478.1 uncharacterized protein LOC115979567 [Quercus lobata]9.9e-5837.07Show/hide
Query:  MKILSWNVQGLGNPRTFRALRHRIRSYHPQLVFLMETKLNGNKGDKLIRDLQFDCGLTVPSEGKSGGLVLLWQNETDVSIRSYSKGHIDATVKE-RNWQW
        M++L WN +GLGNPR+ RAL   ++ + P  VFL +TKL  N  DK    + F  GL VPS G+SGGL LLW+ +  V+++SYS  HIDA V E   ++W
Subjt:  MKILSWNVQGLGNPRTFRALRHRIRSYHPQLVFLMETKLNGNKGDKLIRDLQFDCGLTVPSEGKSGGLVLLWQNETDVSIRSYSKGHIDATVKE-RNWQW

Query:  RFSGIYGNPNRELHHTTWTLMNRLKGNGNMPWIIGGDFNEIISQSEKKGGRARSEMDMQEFRDTIDRCELYDPGYIVADSTWCNNHFNSEVVWERLDRFL
        R +G YGNP       +W L+  L     +PW+  GDFNEI+S +EK GG  RS+  M EFR+ I+ C   D G+   + TWCN       ++ RLDR L
Subjt:  RFSGIYGNPNRELHHTTWTLMNRLKGNGNMPWIIGGDFNEIISQSEKKGGRARSEMDMQEFRDTIDRCELYDPGYIVADSTWCNNHFNSEVVWERLDRFL

Query:  INCDMQNKCSILKVHHLPLIASDHRPIMVEWYEDRSDPRMPVSRRPKRFEEMWVKYEECKDIVQDVW-QERREDGSGSIVDKTRICLERLGNWSRYRYGG
        +  +  +    ++VHHL    SDH  +++    D   P+ P  RR  +FE MW + E+C+D++++ W    R +    IV   + C + L  W++  + G
Subjt:  INCDMQNKCSILKVHHLPLIASDHRPIMVEWYEDRSDPRMPVSRRPKRFEEMWVKYEECKDIVQDVW-QERREDGSGSIVDKTRICLERLGNWSRYRYGG

Query:  SIRGAIARKETELRQL----NSVRDVQSLNVMREKEKELETLLADDEV
         +   I +K   L  L     + R+ + +N +R   KE+  LL  +EV
Subjt:  SIRGAIARKETELRQL----NSVRDVQSLNVMREKEKELETLLADDEV

XP_030970584.1 uncharacterized protein LOC115990959 [Quercus lobata]1.1e-5943.7Show/hide
Query:  MKILSWNVQGLGNPRTFRALRHRIRSYHPQLVFLMETKLNGNKGDKLIRDLQFDCGLTVPSEGKSGGLVLLWQNETDVSIRSYSKGHIDATVKER--NWQ
        M  L WN +GLGNPR+ RAL + ++ + P+LVFL ETK+   + +++   + F  GL VP  G+SGGL LLW  ETD+ I+S+S  HIDA + E   N++
Subjt:  MKILSWNVQGLGNPRTFRALRHRIRSYHPQLVFLMETKLNGNKGDKLIRDLQFDCGLTVPSEGKSGGLVLLWQNETDVSIRSYSKGHIDATVKER--NWQ

Query:  WRFSGIYGNPNRELHHTTWTLMNRLKGNGNMPWIIGGDFNEIISQSEKKGGRARSEMDMQEFRDTIDRCELYDPGYIVADSTWCNNHFNSEVVWERLDRF
        WRF+G YG+P   L   +W L++ L+G   +PW   GDFNEI+S  EK GG  RS+  M +FR+ ++ C   D GY+  D TWCN       V+ RLDR 
Subjt:  WRFSGIYGNPNRELHHTTWTLMNRLKGNGNMPWIIGGDFNEIISQSEKKGGRARSEMDMQEFRDTIDRCELYDPGYIVADSTWCNNHFNSEVVWERLDRF

Query:  LINCDMQNKCSILKVHHLPLIASDHRPIMVEWYEDRSDPRMPVSRRPKR--FEEMWVKYEECKDIVQDVW
           CD  NK   ++V+HL    SDH  + V       DP+ P   R +R  FE MW K EECKDI++  W
Subjt:  LINCDMQNKCSILKVHHLPLIASDHRPIMVEWYEDRSDPRMPVSRRPKR--FEEMWVKYEECKDIVQDVW

XP_042972796.1 uncharacterized protein LOC122304603 [Carya illinoinensis]2.1e-6040.67Show/hide
Query:  MKILSWNVQGLGNPRTFRALRHRIRSYHPQLVFLMETKLNGNKGDKLIRDLQFDCGLTVPSEGKSGGLVLLWQNETDVSIRSYSKGHIDATVK--ERNWQ
        MK +SWN +GLGNPR  RAL   +R   P ++FLMETKL   + +++   + F+C   V S+G+ GG+ LLW+NE  +SI+S+S  HIDA +   +   +
Subjt:  MKILSWNVQGLGNPRTFRALRHRIRSYHPQLVFLMETKLNGNKGDKLIRDLQFDCGLTVPSEGKSGGLVLLWQNETDVSIRSYSKGHIDATVK--ERNWQ

Query:  WRFSGIYGNPNRELHHTTWTLMNRLKGNGNMPWIIGGDFNEIISQSEKKGGRARSEMDMQEFRDTIDRCELYDPGYIVADSTWCNNHFNSEVVWERLDRF
        W+F+G+YG+   E    TW+L+  L+  GN+PW++ GDFNE++SQ EK GGR R EM MQ FR  +D C L D G+     TWCN  +   VV ERLDRF
Subjt:  WRFSGIYGNPNRELHHTTWTLMNRLKGNGNMPWIIGGDFNEIISQSEKKGGRARSEMDMQEFRDTIDRCELYDPGYIVADSTWCNNHFNSEVVWERLDRF

Query:  LINCDMQNKCSILKVHHLPLIASDHRPIMVEWYEDRSDPRMPVSRRPKRFEEMWVKYEECKDIVQDVWQERREDGS-GSIVDKTRICLERLGNWSRYRYG
        +   + +    + +V H    +SDH PI++      ++ R     +  R E MWV+ +EC D++   WQ   E+   GSI+D+  IC + L  W+R ++G
Subjt:  LINCDMQNKCSILKVHHLPLIASDHRPIMVEWYEDRSDPRMPVSRRPKRFEEMWVKYEECKDIVQDVWQERREDGS-GSIVDKTRICLERLGNWSRYRYG

XP_042988712.1 uncharacterized protein LOC122316247 [Carya illinoinensis]3.3e-6138.33Show/hide
Query:  MKILSWNVQGLGNPRTFRALRHRIRSYHPQLVFLMETKLNGNKGDKLIRDLQFDCGLTVPSEGKSGGLVLLWQNETDVSIRSYSKGHIDATV--KERNWQ
        M  +SWN +GLGNPR  RALR  +R   P ++FL ETKL+  K + + R L ++C   V SEG+SGGL L+WQ ET+++++SYSK HIDA +   E + Q
Subjt:  MKILSWNVQGLGNPRTFRALRHRIRSYHPQLVFLMETKLNGNKGDKLIRDLQFDCGLTVPSEGKSGGLVLLWQNETDVSIRSYSKGHIDATV--KERNWQ

Query:  WRFSGIYGNPNRELHHTTWTLMNRLKGNGNMPWIIGGDFNEIISQSEKKGGRARSEMDMQEFRDTIDRCELYDPGYIVADSTWCNNHFNSEVVWERLDRF
        W+F+G+YG+P+ EL   TW  +  L+G  ++PW++ GDFNE++   EK+GGR R E  M+ FR  +D C   D G++    TWCN       V ERLDR+
Subjt:  WRFSGIYGNPNRELHHTTWTLMNRLKGNGNMPWIIGGDFNEIISQSEKKGGRARSEMDMQEFRDTIDRCELYDPGYIVADSTWCNNHFNSEVVWERLDRF

Query:  LINCDMQNKCSILKVHHLPLIASDHRPIMVEWYEDRSDPRMPVSRRPKRFEEMWVKYEECKDIVQDVWQERREDGSGSIVDKTRI--CLERLGNWSRYRY
        L N    +     +V H    +SDH PI++    ++S  R     +  RFE MW +  E ++IV+D W  R      + + KTRI  C +RL  W++ +Y
Subjt:  LINCDMQNKCSILKVHHLPLIASDHRPIMVEWYEDRSDPRMPVSRRPKRFEEMWVKYEECKDIVQDVWQERREDGSGSIVDKTRI--CLERLGNWSRYRY

Query:  GGSIRGAIARKETELRQLNSVRDVQS-LNVMREKEKELETLLADDEV
         G+++  I +  T L+Q+       S + + +E   +L+  L  +E+
Subjt:  GGSIRGAIARKETELRQLNSVRDVQS-LNVMREKEKELETLLADDEV

TrEMBL top hitse value%identityAlignment
A0A2N9G656 Reverse transcriptase domain-containing protein2.6e-5938.26Show/hide
Query:  MKILSWNVQGLGNPRTFRALRHRIRSYHPQLVFLMETKLNGNKGDKLIRDLQFDCGLTVPSEGKSGGLVLLWQNETDVSIRSYSKGHIDATVKE-RNWQW
        M  L WN +GLGN RT + L   +RS  P++VFL+ET  N  + + L   L+F+  L V ++G+ GGL L WQ+E ++SIRS+S  HIDA +    N  W
Subjt:  MKILSWNVQGLGNPRTFRALRHRIRSYHPQLVFLMETKLNGNKGDKLIRDLQFDCGLTVPSEGKSGGLVLLWQNETDVSIRSYSKGHIDATVKE-RNWQW

Query:  RFSGIYGNPNRELHHTTWTLMNRLKGNGNMPWIIGGDFNEIISQSEKKGGRARSEMDMQEFRDTIDRCELYDPGYIVADSTWCNNHFNSEVVWERLDRFL
        RF+G YG P+      +W L+  L    ++PW+  GDFNEI    EK+G   R E  M+ FR+ +D CEL D GY  A  TWCNN  ++  VW RLDR +
Subjt:  RFSGIYGNPNRELHHTTWTLMNRLKGNGNMPWIIGGDFNEIISQSEKKGGRARSEMDMQEFRDTIDRCELYDPGYIVADSTWCNNHFNSEVVWERLDRFL

Query:  INCDMQNKCSILKVHHLPLIASDHRPIMVEWYEDRSDPRMPVSRRPKRFEEMWVKYEECKDIVQDVW-QERREDGSGSIVDKTRICLERLGNWSRYRYGG
         + D  NK    +V HL   +SDH P+++  Y ++  P  P+ ++P RFE+MW     C + V + W Q     G   ++ K ++C   L +WS+ ++ G
Subjt:  INCDMQNKCSILKVHHLPLIASDHRPIMVEWYEDRSDPRMPVSRRPKRFEEMWVKYEECKDIVQDVW-QERREDGSGSIVDKTRICLERLGNWSRYRYGG

Query:  SIRGAIARKETELRQLNSVRDVQSLNVMREK--EKELETLLADDE
        S+R  +  K  +LRQ   V  +Q  +  + K  +KE+  L+  DE
Subjt:  SIRGAIARKETELRQLNSVRDVQSLNVMREK--EKELETLLADDE

A0A2N9GII4 Uncharacterized protein3.3e-5937.57Show/hide
Query:  MKILSWNVQGLGNPRTFRALRHRIRSYHPQLVFLMETKLNGNKGDKLIRDLQFDCGLTVPSEGKSGGLVLLWQNETDVSIRSYSKGHIDATVKERNWQ-W
        MK+LSWN QGLGNP T  +L   ++S  PQ++FLMETKL   K + +   L F     VPS G+S GL LLWQ E  + +++++  HID+ +   N   W
Subjt:  MKILSWNVQGLGNPRTFRALRHRIRSYHPQLVFLMETKLNGNKGDKLIRDLQFDCGLTVPSEGKSGGLVLLWQNETDVSIRSYSKGHIDATVKERNWQ-W

Query:  RFSGIYGNPNRELHHTTWTLMNRLKGNGNMPWIIGGDFNEIISQSEKKGGRARSEMDMQEFRDTIDRCELYDPGYIVADSTWCNNHFNSEVVWERLDRFL
        R  G YG P  +    +W L+ +L    ++PW+  GDFNEI+ Q+EK+G R R    M EFR+ ++RC+  D GY     TW NN      V ERLDR +
Subjt:  RFSGIYGNPNRELHHTTWTLMNRLKGNGNMPWIIGGDFNEIISQSEKKGGRARSEMDMQEFRDTIDRCELYDPGYIVADSTWCNNHFNSEVVWERLDRFL

Query:  INCDMQNKCSILKVHHLPLIASDHRPIMVEWYEDRSDPRMPVSRRPKRFEEMWVKYEECKDIVQDVWQERREDGSG--SIVDKTRICLERLGNWSRYRYG
              N  +I+ V HL +  SDH PI+VE    RS  R    RR  RFEE W  + +C+ +++ +W+E   +GS    + +K + C   L  WS+  +G
Subjt:  INCDMQNKCSILKVHHLPLIASDHRPIMVEWYEDRSDPRMPVSRRPKRFEEMWVKYEECKDIVQDVWQERREDGSG--SIVDKTRICLERLGNWSRYRYG

Query:  GSIRGAIARKETELRQLNSVRDVQSLNVMREKEKELETLLADDEVY
        GS     AR E  +  L +    Q+ +++ + ++E+ +LL  DE++
Subjt:  GSIRGAIARKETELRQLNSVRDVQSLNVMREKEKELETLLADDEVY

A0A2N9HE04 Reverse transcriptase domain-containing protein3.3e-5937.57Show/hide
Query:  MKILSWNVQGLGNPRTFRALRHRIRSYHPQLVFLMETKLNGNKGDKLIRDLQFDCGLTVPSEGKSGGLVLLWQNETDVSIRSYSKGHIDATVKERNWQ-W
        MK+LSWN QGLGNP T  +L   ++S  PQ++FLMETKL   K + +   L F     VPS G+S GL LLWQ E  + +++++  HID+ +   N   W
Subjt:  MKILSWNVQGLGNPRTFRALRHRIRSYHPQLVFLMETKLNGNKGDKLIRDLQFDCGLTVPSEGKSGGLVLLWQNETDVSIRSYSKGHIDATVKERNWQ-W

Query:  RFSGIYGNPNRELHHTTWTLMNRLKGNGNMPWIIGGDFNEIISQSEKKGGRARSEMDMQEFRDTIDRCELYDPGYIVADSTWCNNHFNSEVVWERLDRFL
        R  G YG P  +    +W L+ +L    ++PW+  GDFNEI+ Q+EK+G R R    M EFR+ ++RC+  D GY     TW NN      V ERLDR +
Subjt:  RFSGIYGNPNRELHHTTWTLMNRLKGNGNMPWIIGGDFNEIISQSEKKGGRARSEMDMQEFRDTIDRCELYDPGYIVADSTWCNNHFNSEVVWERLDRFL

Query:  INCDMQNKCSILKVHHLPLIASDHRPIMVEWYEDRSDPRMPVSRRPKRFEEMWVKYEECKDIVQDVWQERREDGSG--SIVDKTRICLERLGNWSRYRYG
              N  +I+ V HL +  SDH PI+VE    RS  R    RR  RFEE W  + +C+ +++ +W+E   +GS    + +K + C   L  WS+  +G
Subjt:  INCDMQNKCSILKVHHLPLIASDHRPIMVEWYEDRSDPRMPVSRRPKRFEEMWVKYEECKDIVQDVWQERREDGSG--SIVDKTRICLERLGNWSRYRYG

Query:  GSIRGAIARKETELRQLNSVRDVQSLNVMREKEKELETLLADDEVY
        GS     AR E  +  L +    Q+ +++ + ++E+ +LL  DE++
Subjt:  GSIRGAIARKETELRQLNSVRDVQSLNVMREKEKELETLLADDEVY

A0A2N9IJF6 Uncharacterized protein3.3e-5937.57Show/hide
Query:  MKILSWNVQGLGNPRTFRALRHRIRSYHPQLVFLMETKLNGNKGDKLIRDLQFDCGLTVPSEGKSGGLVLLWQNETDVSIRSYSKGHIDATVKERNWQ-W
        MK+LSWN QGLGNP T  +L   ++S  PQ++FLMETKL   K + +   L F     VPS G+S GL LLWQ E  + +++++  HID+ +   N   W
Subjt:  MKILSWNVQGLGNPRTFRALRHRIRSYHPQLVFLMETKLNGNKGDKLIRDLQFDCGLTVPSEGKSGGLVLLWQNETDVSIRSYSKGHIDATVKERNWQ-W

Query:  RFSGIYGNPNRELHHTTWTLMNRLKGNGNMPWIIGGDFNEIISQSEKKGGRARSEMDMQEFRDTIDRCELYDPGYIVADSTWCNNHFNSEVVWERLDRFL
        R  G YG P  +    +W L+ +L    ++PW+  GDFNEI+ Q+EK+G R R    M EFR+ ++RC+  D GY     TW NN      V ERLDR +
Subjt:  RFSGIYGNPNRELHHTTWTLMNRLKGNGNMPWIIGGDFNEIISQSEKKGGRARSEMDMQEFRDTIDRCELYDPGYIVADSTWCNNHFNSEVVWERLDRFL

Query:  INCDMQNKCSILKVHHLPLIASDHRPIMVEWYEDRSDPRMPVSRRPKRFEEMWVKYEECKDIVQDVWQERREDGSG--SIVDKTRICLERLGNWSRYRYG
              N  +I+ V HL +  SDH PI+VE    RS  R    RR  RFEE W  + +C+ +++ +W+E   +GS    + +K + C   L  WS+  +G
Subjt:  INCDMQNKCSILKVHHLPLIASDHRPIMVEWYEDRSDPRMPVSRRPKRFEEMWVKYEECKDIVQDVWQERREDGSG--SIVDKTRICLERLGNWSRYRYG

Query:  GSIRGAIARKETELRQLNSVRDVQSLNVMREKEKELETLLADDEVY
        GS     AR E  +  L +    Q+ +++ + ++E+ +LL  DE++
Subjt:  GSIRGAIARKETELRQLNSVRDVQSLNVMREKEKELETLLADDEVY

A0A6J1DUG8 uncharacterized protein LOC1110241352.2e-6642.65Show/hide
Query:  MKILSWNVQGLGNPRTFRALRHRIRSYHPQLVFLMETKLNGNKGDKLIRDLQFDCGLTVPSEGKSGGLVLLWQNETDVSIRSYSKGHIDATVKERNWQWR
        MK L WNV GLGNP TFR LR+ +R   PQLVFL ETK N     +  R+L FDC ++V S GKSGGL+LLW ++++V I+S S GHID+ + ++   WR
Subjt:  MKILSWNVQGLGNPRTFRALRHRIRSYHPQLVFLMETKLNGNKGDKLIRDLQFDCGLTVPSEGKSGGLVLLWQNETDVSIRSYSKGHIDATVKERNWQWR

Query:  FSGIYGNPNRELHHTTWTLMNRLKGNGNMPWIIGGDFNEIISQSEKKGGRARSEMDMQEFRDTIDRCELYDPGYIVADSTWCNNHFNSEVVWERLDRFLI
        F+G YGNP       +W L+ RL    ++PWIIGGDFNEI+S +EK GG  R+E  M+        C                       +WERLDRFLI
Subjt:  FSGIYGNPNRELHHTTWTLMNRLKGNGNMPWIIGGDFNEIISQSEKKGGRARSEMDMQEFRDTIDRCELYDPGYIVADSTWCNNHFNSEVVWERLDRFLI

Query:  NCDMQNKCSILKVHHLPLIASDHRPIMVEWYEDRSDPRMPVSRRPKRFEEMWVKYEECKDIVQDVWQERREDGSGSIVDKTRICLERLGNWSRYRYGGSI
        N  M NKC  LKV HL L++SDHRPI+  W  +         +R  RFEE W++ + C+DI+   W      G  +   K   CL RL  W++ R   S+
Subjt:  NCDMQNKCSILKVHHLPLIASDHRPIMVEWYEDRSDPRMPVSRRPKRFEEMWVKYEECKDIVQDVWQERREDGSGSIVDKTRICLERLGNWSRYRYGGSI

Query:  RGAIARKETELRQLNSVRDVQSLNVMREKE---KELETLL
        +GAIA KE EL +L  + D  S N +  K+   +E+E  L
Subjt:  RGAIARKETELRQLNSVRDVQSLNVMREKE---KELETLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATCCTTAGTTGGAATGTCCAGGGGTTGGGGAACCCCAGGACATTCAGAGCGCTACGACACCGTATTCGTAGCTACCATCCCCAACTAGTTTTCTTGATGGAAAC
TAAACTAAATGGTAATAAAGGTGATAAGCTCATAAGGGATTTACAGTTTGACTGTGGTTTGACCGTTCCAAGTGAGGGTAAGAGTGGAGGTTTGGTGCTTCTCTGGCAAA
ATGAGACTGATGTCTCCATAAGATCTTACTCTAAAGGTCATATAGATGCAACGGTAAAGGAAAGAAATTGGCAATGGAGATTTTCCGGCATTTATGGAAATCCGAATAGG
GAGCTTCATCATACCACATGGACTTTGATGAATCGCTTGAAAGGAAATGGTAACATGCCTTGGATTATCGGAGGAGATTTTAACGAGATCATAAGTCAGTCAGAAAAGAA
GGGGGGCAGAGCTAGATCGGAGATGGATATGCAAGAGTTTAGAGATACAATTGATCGATGCGAGCTATATGATCCCGGCTATATTGTCGCTGACTCCACTTGGTGCAATA
ACCATTTTAATAGTGAGGTGGTATGGGAACGTCTTGATCGATTTTTGATTAATTGTGACATGCAAAATAAGTGTAGTATTCTAAAGGTTCATCATCTCCCTCTCATCGCC
TCAGACCATAGACCAATAATGGTCGAATGGTATGAGGACAGAAGTGACCCGAGGATGCCAGTTTCCAGGAGGCCTAAAAGATTTGAAGAGATGTGGGTTAAATATGAGGA
ATGCAAAGATATTGTGCAAGATGTTTGGCAGGAGCGGAGGGAGGATGGTTCGGGCTCTATTGTTGATAAAACTAGGATTTGTCTCGAACGCCTTGGTAACTGGAGTAGGT
ATAGATATGGTGGATCGATTAGGGGAGCAATAGCCAGGAAGGAAACTGAATTACGTCAACTCAACAGCGTAAGAGATGTGCAAAGTCTAAATGTGATGAGAGAGAAAGAA
AAGGAGTTGGAGACATTGCTTGCAGACGATGAGGTCTATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAATCCTTAGTTGGAATGTCCAGGGGTTGGGGAACCCCAGGACATTCAGAGCGCTACGACACCGTATTCGTAGCTACCATCCCCAACTAGTTTTCTTGATGGAAAC
TAAACTAAATGGTAATAAAGGTGATAAGCTCATAAGGGATTTACAGTTTGACTGTGGTTTGACCGTTCCAAGTGAGGGTAAGAGTGGAGGTTTGGTGCTTCTCTGGCAAA
ATGAGACTGATGTCTCCATAAGATCTTACTCTAAAGGTCATATAGATGCAACGGTAAAGGAAAGAAATTGGCAATGGAGATTTTCCGGCATTTATGGAAATCCGAATAGG
GAGCTTCATCATACCACATGGACTTTGATGAATCGCTTGAAAGGAAATGGTAACATGCCTTGGATTATCGGAGGAGATTTTAACGAGATCATAAGTCAGTCAGAAAAGAA
GGGGGGCAGAGCTAGATCGGAGATGGATATGCAAGAGTTTAGAGATACAATTGATCGATGCGAGCTATATGATCCCGGCTATATTGTCGCTGACTCCACTTGGTGCAATA
ACCATTTTAATAGTGAGGTGGTATGGGAACGTCTTGATCGATTTTTGATTAATTGTGACATGCAAAATAAGTGTAGTATTCTAAAGGTTCATCATCTCCCTCTCATCGCC
TCAGACCATAGACCAATAATGGTCGAATGGTATGAGGACAGAAGTGACCCGAGGATGCCAGTTTCCAGGAGGCCTAAAAGATTTGAAGAGATGTGGGTTAAATATGAGGA
ATGCAAAGATATTGTGCAAGATGTTTGGCAGGAGCGGAGGGAGGATGGTTCGGGCTCTATTGTTGATAAAACTAGGATTTGTCTCGAACGCCTTGGTAACTGGAGTAGGT
ATAGATATGGTGGATCGATTAGGGGAGCAATAGCCAGGAAGGAAACTGAATTACGTCAACTCAACAGCGTAAGAGATGTGCAAAGTCTAAATGTGATGAGAGAGAAAGAA
AAGGAGTTGGAGACATTGCTTGCAGACGATGAGGTCTATTGA
Protein sequenceShow/hide protein sequence
MKILSWNVQGLGNPRTFRALRHRIRSYHPQLVFLMETKLNGNKGDKLIRDLQFDCGLTVPSEGKSGGLVLLWQNETDVSIRSYSKGHIDATVKERNWQWRFSGIYGNPNR
ELHHTTWTLMNRLKGNGNMPWIIGGDFNEIISQSEKKGGRARSEMDMQEFRDTIDRCELYDPGYIVADSTWCNNHFNSEVVWERLDRFLINCDMQNKCSILKVHHLPLIA
SDHRPIMVEWYEDRSDPRMPVSRRPKRFEEMWVKYEECKDIVQDVWQERREDGSGSIVDKTRICLERLGNWSRYRYGGSIRGAIARKETELRQLNSVRDVQSLNVMREKE
KELETLLADDEVY