; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036631 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036631
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:49719585..49730647
RNA-Seq ExpressionLag0036631
SyntenyLag0036631
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBN69855.1 transposable element gene [Prunus dulcis]4.5e-6264.65Show/hide
Query:  VSPDQCVPKKGGVTVVTNKDNELIPTRFI-------IAHEDQEKTTFTFPYETFAFKRMPFDLCNAPATFQRCMLAIFSDMIES------------TVEA
        VSP Q VPKK G+TVV N+ NEL+PTR I       IA EDQEKTTFT P+ TFA++RMPF LCNAPATFQRCM+AIFSDM+E              +EA
Subjt:  VSPDQCVPKKGGVTVVTNKDNELIPTRFI-------IAHEDQEKTTFTFPYETFAFKRMPFDLCNAPATFQRCMLAIFSDMIES------------TVEA

Query:  FETLKAALISAPILCAPNWSLSFEVMCDANDVAIGVMLGQKLDKVIHPIYYASRVLNEAQINYTTTEKELLAIVFAFEKFRPYLVGSKVTVFTDHATI
        F TLK  L SAPI+ AP+WSL FE+MCDA+D AIG +LGQK +K+ H I+YASR LN+AQ+NY+TTEKELLA+VFA EKFRPYLVGSKV V++DHA +
Subjt:  FETLKAALISAPILCAPNWSLSFEVMCDANDVAIGVMLGQKLDKVIHPIYYASRVLNEAQINYTTTEKELLAIVFAFEKFRPYLVGSKVTVFTDHATI

GEV44874.1 reverse transcriptase domain-containing protein [Tanacetum cinerariifolium]7.5e-5758.16Show/hide
Query:  VSPDQCVPKKGGVTVVTNKDNELIPTRFIIAHE-------------------DQEKTTFTFPYETFAFKRMPFDLCNAPATFQRCMLAIFSDMIESTVEA
        VSP QCVPKKGG TVV NKDNELIPTR +  ++                   DQ KTTFT PY TF ++RMPF LCNAP  FQRCM+AIF+DMIE T+EA
Subjt:  VSPDQCVPKKGGVTVVTNKDNELIPTRFIIAHE-------------------DQEKTTFTFPYETFAFKRMPFDLCNAPATFQRCMLAIFSDMIESTVEA

Query:  FETLKAALISAPILCAPNWSLSFEVMCDANDVAIGVMLGQKLDKVIHPIYYASRVLNEAQINYTTTEKELLAIVFAFEKFRPYLVGSKVTVFTDHA
        F+TLK  L  APIL AP+W + FE+MCDA+D AIG +LGQ+ DK   PI+YAS+ + EA+ NYTTTEKE LA+V+A EKF+ YL+ +K  V+TD++
Subjt:  FETLKAALISAPILCAPNWSLSFEVMCDANDVAIGVMLGQKLDKVIHPIYYASRVLNEAQINYTTTEKELLAIVFAFEKFRPYLVGSKVTVFTDHA

GFA47621.1 hypothetical protein [Tanacetum cinerariifolium]1.0e-5860.22Show/hide
Query:  VSPDQCVPKKGGVTVVTNKDNELIPTRFIIAH-------EDQEKTTFTFPYETFAFKRMPFDLCNAPATFQRCMLAIFSDMIESTVEAFETLKAALISAP
        VSP  CVPKKGG+TV+ N +NEL+PTR +  +       +DQEKTTFT PY TFA+KRMPF LCNAP TFQRCM+AIF DMIE T+EAF TLK  L  AP
Subjt:  VSPDQCVPKKGGVTVVTNKDNELIPTRFIIAH-------EDQEKTTFTFPYETFAFKRMPFDLCNAPATFQRCMLAIFSDMIESTVEAFETLKAALISAP

Query:  ILCAPNWSLSFEVMCDANDVAIGVMLGQKLDKVIHPIYYASRVLNEAQINYTTTEKELLAIVFAFEKFRPYLVGSKVTVFTDHATI
        IL A NW   FE+MCDA+D A+G +LGQ+++K   PI+Y S+ +N+ + NYTTTEKE+LA+V+AFEKFR YL+  K  V+TDH+ +
Subjt:  ILCAPNWSLSFEVMCDANDVAIGVMLGQKLDKVIHPIYYASRVLNEAQINYTTTEKELLAIVFAFEKFRPYLVGSKVTVFTDHATI

RVW43526.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]4.4e-6540.76Show/hide
Query:  CFLGNHTALECWNRFDHSYQSEEIPKALAAMNLN-EGVDPLMYAYSGATSHMVNDPGKVSTLQPYKGLDKIFVGNGNQLEISHVGQSKIVTDDNELILKN
        C   NH+AL+CWNRFD+ YQ EEIP+ALAAM L+ E  DP  Y  SGAT+H+ NDPGK+S + PYKG D IFVGNG  L ISH+G++++ T   +L LK 
Subjt:  CFLGNHTALECWNRFDHSYQSEEIPKALAAMNLN-EGVDPLMYAYSGATSHMVNDPGKVSTLQPYKGLDKIFVGNGNQLEISHVGQSKIVTDDNELILKN

Query:  VLVVPEIKKNLISVSKLTRDNDCSITFFANDFLVKNRQRKILAKVTRTASLYALEDQHSEKNEACVATSTNNKAPYSIWHMRMGH---------LNEKF-
        +LVVPEIKKNL+S+ +LT DN CSI F +  F++K++ +++LA+ T+   LYALE+   +     +  + ++KA   +WH RMGH         L++KF 
Subjt:  VLVVPEIKKNLISVSKLTRDNDCSITFFANDFLVKNRQRKILAKVTRTASLYALEDQHSEKNEACVATSTNNKAPYSIWHMRMGH---------LNEKF-

Query:  --------------------------------------------------------------------TWIYPLKRKSEFISCFKKFKLMIENQVDKKIK
                                                                            TW+YPL+RKS+F  CF KF++++ENQ++++IK
Subjt:  --------------------------------------------------------------------TWIYPLKRKSEFISCFKKFKLMIENQVDKKIK

Query:  VLQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVE
        + QSDGGGEF S++ +  L + GI+ Q+SCP TP+QNGV E
Subjt:  VLQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVE

XP_026435521.1 uncharacterized protein LOC113333225 [Papaver somniferum]7.7e-5461.38Show/hide
Query:  VSPDQCVPKKGGVTVVTNKDNELIPTR-------FIIAHEDQEKTTFTFPYETFAFKRMPFDLCNAPATFQRCML---AIFSDMIESTVEAFETLKAALI
        VSP Q VPKK G+TVV N+DN+L+PTR        +IA EDQEKTTFT P+ TFA++RMPF LCNAPATFQRCM     +  D  +   EAF+TLK  L 
Subjt:  VSPDQCVPKKGGVTVVTNKDNELIPTR-------FIIAHEDQEKTTFTFPYETFAFKRMPFDLCNAPATFQRCML---AIFSDMIESTVEAFETLKAALI

Query:  SAPILCAPNWSLSFEVMCDANDVAIGVMLGQKLDKVIHPIYYASRVLNEAQINYTTTEKELLAIVFAFEKFRPYLVGSKVTVFTDHATI
        +API+  P WS  FE+MCD +D A+G +LGQ+  KV + IYYAS  LN AQINY+TTEKELLAIVFA EKFR YLVG+KV VF+DHA +
Subjt:  SAPILCAPNWSLSFEVMCDANDVAIGVMLGQKLDKVIHPIYYASRVLNEAQINYTTTEKELLAIVFAFEKFRPYLVGSKVTVFTDHATI

TrEMBL top hitse value%identityAlignment
A0A2N9FIQ8 Integrase catalytic domain-containing protein3.4e-5538.12Show/hide
Query:  CFLGNHTALECWNRFDHSYQSEEIPKALAAMNLNEGVDPLMYAYSGATSHMVNDPGKVSTLQPYKGLDKIFVGNGNQLEISHVGQSKIVTDDNELILKNV
        C    H AL+CWNRF+H++Q+ +IP+ALAA+ L  G +      +GA++H+  DPG +  + PY G DK+ VG+GN L ISH+G ++I      +ILKNV
Subjt:  CFLGNHTALECWNRFDHSYQSEEIPKALAAMNLNEGVDPLMYAYSGATSHMVNDPGKVSTLQPYKGLDKIFVGNGNQLEISHVGQSKIVTDDNELILKNV

Query:  LVVPEIKKNLISVSKLTRDNDCSITFFANDFLVKN-RQRKILAKVTRTASLYALEDQHSEKNEACVATSTNNKAPYSIWHMRMGH--------LNEK---
        L+VP +KKNLIS+S+LT D  C + F +  FL+K+ + RKILA  T+   LY L D H    EA  +T     +  ++WH R+GH        LN K   
Subjt:  LVVPEIKKNLISVSKLTRDNDCSITFFANDFLVKN-RQRKILAKVTRTASLYALEDQHSEKNEACVATSTNNKAPYSIWHMRMGH--------LNEK---

Query:  -------------------------------------------------------------------FTWIYPLKRKSEFISCFKKFKLMIENQVDKKIK
                                                                           FTWIYPLKRKS+F  CF +F+ M+ENQ DKKIK
Subjt:  -------------------------------------------------------------------FTWIYPLKRKSEFISCFKKFKLMIENQVDKKIK

Query:  VLQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVE
        + QSDGGGE++  +  + L + GI+HQ+SC  TP+QNGV E
Subjt:  VLQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVE

A0A2N9H612 Integrase catalytic domain-containing protein3.4e-5538.12Show/hide
Query:  CFLGNHTALECWNRFDHSYQSEEIPKALAAMNLNEGVDPLMYAYSGATSHMVNDPGKVSTLQPYKGLDKIFVGNGNQLEISHVGQSKIVTDDNELILKNV
        C    H AL+CWNRF+H++Q+ +IP+ALAA+ L  G +      +GA++H+  DPG +  + PY G DK+ VG+GN L ISH+G ++I      +ILKNV
Subjt:  CFLGNHTALECWNRFDHSYQSEEIPKALAAMNLNEGVDPLMYAYSGATSHMVNDPGKVSTLQPYKGLDKIFVGNGNQLEISHVGQSKIVTDDNELILKNV

Query:  LVVPEIKKNLISVSKLTRDNDCSITFFANDFLVKN-RQRKILAKVTRTASLYALEDQHSEKNEACVATSTNNKAPYSIWHMRMGH--------LNEK---
        L+VP +KKNLIS+S+LT D  C + F +  FL+K+ + RKILA  T+   LY L D H    EA  +T     +  ++WH R+GH        LN K   
Subjt:  LVVPEIKKNLISVSKLTRDNDCSITFFANDFLVKN-RQRKILAKVTRTASLYALEDQHSEKNEACVATSTNNKAPYSIWHMRMGH--------LNEK---

Query:  -------------------------------------------------------------------FTWIYPLKRKSEFISCFKKFKLMIENQVDKKIK
                                                                           FTWIYPLKRKS+F  CF +F+ M+ENQ DKKIK
Subjt:  -------------------------------------------------------------------FTWIYPLKRKSEFISCFKKFKLMIENQVDKKIK

Query:  VLQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVE
        + QSDGGGE++  +  + L + GI+HQ+SC  TP+QNGV E
Subjt:  VLQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVE

A0A438E6Z5 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-6540.76Show/hide
Query:  CFLGNHTALECWNRFDHSYQSEEIPKALAAMNLN-EGVDPLMYAYSGATSHMVNDPGKVSTLQPYKGLDKIFVGNGNQLEISHVGQSKIVTDDNELILKN
        C   NH+AL+CWNRFD+ YQ EEIP+ALAAM L+ E  DP  Y  SGAT+H+ NDPGK+S + PYKG D IFVGNG  L ISH+G++++ T   +L LK 
Subjt:  CFLGNHTALECWNRFDHSYQSEEIPKALAAMNLN-EGVDPLMYAYSGATSHMVNDPGKVSTLQPYKGLDKIFVGNGNQLEISHVGQSKIVTDDNELILKN

Query:  VLVVPEIKKNLISVSKLTRDNDCSITFFANDFLVKNRQRKILAKVTRTASLYALEDQHSEKNEACVATSTNNKAPYSIWHMRMGH---------LNEKF-
        +LVVPEIKKNL+S+ +LT DN CSI F +  F++K++ +++LA+ T+   LYALE+   +     +  + ++KA   +WH RMGH         L++KF 
Subjt:  VLVVPEIKKNLISVSKLTRDNDCSITFFANDFLVKNRQRKILAKVTRTASLYALEDQHSEKNEACVATSTNNKAPYSIWHMRMGH---------LNEKF-

Query:  --------------------------------------------------------------------TWIYPLKRKSEFISCFKKFKLMIENQVDKKIK
                                                                            TW+YPL+RKS+F  CF KF++++ENQ++++IK
Subjt:  --------------------------------------------------------------------TWIYPLKRKSEFISCFKKFKLMIENQVDKKIK

Query:  VLQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVE
        + QSDGGGEF S++ +  L + GI+ Q+SCP TP+QNGV E
Subjt:  VLQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVE

A0A438H0L2 Reverse transcriptase3.7e-5455.92Show/hide
Query:  VSPDQCVPKKGGVTVVTNKDNELIPT----------------------RFIIAHEDQEKTTFTFPYETFAFKRMPFDLCNAPATFQRCMLAIFSDMIEST
        VSP Q VPKK G+TV+ N+  E + T                      +  I  ED+EKTTFT P+ TFA++RMPF LCNAPATFQRCML+IFSDM+E  
Subjt:  VSPDQCVPKKGGVTVVTNKDNELIPT----------------------RFIIAHEDQEKTTFTFPYETFAFKRMPFDLCNAPATFQRCMLAIFSDMIEST

Query:  VE----------AFETLKAALISAPILCAPNWSLSFEVMCDANDVAIGVMLGQKLDKVIHPIYYASRVLNEAQINYTTTEKELLAIVFAFEKFRPYLVGS
        +E          +FE LK  L +API+ APNW L FEVMCDAND+A+GV+LGQ+ D   + IYYAS+ LNEAQ NYTTTEKELLA+VFA +KFR YLVGS
Subjt:  VE----------AFETLKAALISAPILCAPNWSLSFEVMCDANDVAIGVMLGQKLDKVIHPIYYASRVLNEAQINYTTTEKELLAIVFAFEKFRPYLVGS

Query:  KVTVFTDHATI
         + VFTDH+T+
Subjt:  KVTVFTDHATI

A0A5H2XVG6 Transposable element protein2.2e-6264.65Show/hide
Query:  VSPDQCVPKKGGVTVVTNKDNELIPTRFI-------IAHEDQEKTTFTFPYETFAFKRMPFDLCNAPATFQRCMLAIFSDMIES------------TVEA
        VSP Q VPKK G+TVV N+ NEL+PTR I       IA EDQEKTTFT P+ TFA++RMPF LCNAPATFQRCM+AIFSDM+E              +EA
Subjt:  VSPDQCVPKKGGVTVVTNKDNELIPTRFI-------IAHEDQEKTTFTFPYETFAFKRMPFDLCNAPATFQRCMLAIFSDMIES------------TVEA

Query:  FETLKAALISAPILCAPNWSLSFEVMCDANDVAIGVMLGQKLDKVIHPIYYASRVLNEAQINYTTTEKELLAIVFAFEKFRPYLVGSKVTVFTDHATI
        F TLK  L SAPI+ AP+WSL FE+MCDA+D AIG +LGQK +K+ H I+YASR LN+AQ+NY+TTEKELLA+VFA EKFRPYLVGSKV V++DHA +
Subjt:  FETLKAALISAPILCAPNWSLSFEVMCDANDVAIGVMLGQKLDKVIHPIYYASRVLNEAQINYTTTEKELLAIVFAFEKFRPYLVGSKVTVFTDHATI

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.3e-1446.88Show/hide
Query:  AFETLKAALISAPILCAPNWSLSFEVMCDANDVAIGVMLGQKLDKVIHPIYYASRVLNEAQINYTTTEKELLAIVFAFEKFRPYLVGSKVTVFTDH
        AF+ LK  +   PIL  P+++  F +  DA+DVA+G +L Q      HP+ Y SR LNE +INY+T EKELLAIV+A + FR YL+G    + +DH
Subjt:  AFETLKAALISAPILCAPNWSLSFEVMCDANDVAIGVMLGQKLDKVIHPIYYASRVLNEAQINYTTTEKELLAIVFAFEKFRPYLVGSKVTVFTDH

P20825 Retrovirus-related Pol polyprotein from transposon 2975.6e-1544.9Show/hide
Query:  VEAFETLKAALISAPILCAPNWSLSFEVMCDANDVAIGVMLGQKLDKVIHPIYYASRVLNEAQINYTTTEKELLAIVFAFEKFRPYLVGSKVTVFTDH
        +EAFE LKA +I  PIL  P++   F +  DA+++A+G +L Q      HPI + SR LN+ ++NY+  EKELLAIV+A + FR YL+G +  + +DH
Subjt:  VEAFETLKAALISAPILCAPNWSLSFEVMCDANDVAIGVMLGQKLDKVIHPIYYASRVLNEAQINYTTTEKELLAIVFAFEKFRPYLVGSKVTVFTDH

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus2.0e-1240.2Show/hide
Query:  ESTVEAFETLKAALISAPILCAPNWSLSFEVMCDANDVAIGVMLGQKLDKVIHPIYYASRVLNEAQINYTTTEKELLAIVFAFEKFRPYLVGS-KVTVFT
        E+ +++F  LK+ L S+ IL  P ++  F +  DA++ AIG +L Q       PI Y SR LN+ + NY T EKE+LAI+++ +  R YL G+  + V+T
Subjt:  ESTVEAFETLKAALISAPILCAPNWSLSFEVMCDANDVAIGVMLGQKLDKVIHPIYYASRVLNEAQINYTTTEKELLAIVFAFEKFRPYLVGS-KVTVFT

Query:  DH
        DH
Subjt:  DH

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.5e-2530.07Show/hide
Query:  SGATSHMVNDPGKVSTLQPYKGLDKIFVGNGNQLEISHVGQSKIVTDDNELILKNVLVVPEIKKNLISVSKLTRDNDCSITFFANDFLVKNRQRKI-LAK
        SGAT H+ +D   +S  QPY G D + V +G+ + ISH G + + T    L L N+L VP I KNLISV +L   N  S+ FF   F VK+    + L +
Subjt:  SGATSHMVNDPGKVSTLQPYKGLDKIFVGNGNQLEISHVGQSKIVTDDNELILKNVLVVPEIKKNLISVSKLTRDNDCSITFFANDFLVKNRQRKI-LAK

Query:  VTRTASLYALEDQHSEKNEACVATSTNNKAPYSIWHMRMGH-----LNE---------------------------------------------------
              LY  E   +      +  S ++KA +S WH R+GH     LN                                                    
Subjt:  VTRTASLYALEDQHSEKNEACVATSTNNKAPYSIWHMRMGH-----LNE---------------------------------------------------

Query:  ---------------------KFTWIYPLKRKSEFISCFKKFKLMIENQVDKKIKVLQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVE
                             ++TW+YPLK+KS+    F  FK ++EN+   +I    SD GGEF +  L E   Q GI H  S PHTP+ NG+ E
Subjt:  ---------------------KFTWIYPLKRKSEFISCFKKFKLMIENQVDKKIKVLQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.3e-2429.73Show/hide
Query:  SGATSHMVNDPGKVSTLQPYKGLDKIFVGNGNQLEISHVGQSKIVTDDNELILKNVLVVPEIKKNLISVSKLTRDNDCSITFFANDFLVKNRQRKI-LAK
        SGAT H+ +D   +S  QPY G D + + +G+ + I+H G + + T    L L  VL VP I KNLISV +L   N  S+ FF   F VK+    + L +
Subjt:  SGATSHMVNDPGKVSTLQPYKGLDKIFVGNGNQLEISHVGQSKIVTDDNELILKNVLVVPEIKKNLISVSKLTRDNDCSITFFANDFLVKNRQRKI-LAK

Query:  VTRTASLYALEDQHSEKNEACVATSTNNKAPYSIWHMRMGH-----LNE---------------------------------------------------
              LY  E   +      +  S  +KA +S WH R+GH     LN                                                    
Subjt:  VTRTASLYALEDQHSEKNEACVATSTNNKAPYSIWHMRMGH-----LNE---------------------------------------------------

Query:  ---------------------KFTWIYPLKRKSEFISCFKKFKLMIENQVDKKIKVLQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVE
                             ++TW+YPLK+KS+    F  FK ++EN+   +I  L SD GGEF  + L++ L Q GI H  S PHTP+ NG+ E
Subjt:  ---------------------KFTWIYPLKRKSEFISCFKKFKLMIENQVDKKIKVLQSDGGGEFTSLELKELLEQSGIVHQLSCPHTPQQNGVVE

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGGATTGTCCTTTGATTTGTACGGGTGAGAGTGGCCTGTTCGCCGACTCAATAAGCCTACCATTTTGGGGACAAGACCGAGTGGGGAGCTGGGAACGTAGTCTTAC
AAGATGGAATTCACTCCTTCCCGATATTAGGGTAAGTAGAGGTCCATTAGGTCCCACCGGTAGCTCATTCAGGGCGTTGAGCAAAGACAAGGTGATTTTGCTGCTGTTTT
TTCAACTTTATCAAAGGTTTTTGGCGAAGCAATTCAAGAAATCTTCAAGGGTTCTGATCGAGGTAACCACTTCTGTTATGGGCTACTCCATTGATGCATCTTTCCCAATA
TGTTCTGAAAGTTTAAGAAATTTCCAGCGAAGAAACTACGAGAGGGCTGCTGCATTTTTGTTCATTGAAGCATCGTTGGCGAAGAACGGTCAAGTCTACAACGAAGTAAG
CCCCGACCAATGTGTTCCTAAGAAAGGAGGTGTTACGGTAGTGACTAATAAGGACAATGAGTTGATCCCCACCAGATTTATCATTGCTCATGAGGACCAGGAAAAAACAA
CTTTCACCTTCCCGTATGAGACATTTGCTTTCAAGCGAATGCCTTTCGATCTTTGCAATGCTCCAGCAACATTTCAACGGTGTATGTTAGCTATTTTTTCTGATATGATT
GAGTCCACTGTTGAGGCTTTTGAGACTTTAAAAGCTGCTTTGATCTCAGCGCCCATTCTTTGTGCACCGAACTGGAGTTTGTCGTTCGAGGTAATGTGTGATGCCAACGA
TGTGGCAATAGGTGTAATGTTGGGGCAAAAACTTGACAAAGTTATCCATCCTATTTATTATGCAAGCAGGGTTTTGAATGAAGCACAAATCAACTACACAACTACTGAGA
AGGAGTTGTTGGCGATTGTGTTTGCCTTTGAGAAATTCCGCCCATACTTGGTGGGATCTAAAGTCACAGTGTTTACGGATCATGCAACAATAATGCCTTTTACTCCTACA
GAGGAAGGGTTAGAGGCCGAAGAAGATATTTCTCCTCACGAGGAAGAGGATTCCCTCAAGGTAGAGCAGTACCCTCCTCAAATCAGTCCACTGCCTTCACGCAAGACATA
TTTGCAACACCTCAAAAATCAAGAATCCACCCTGCTGCAACCATTGACCCAAAGCAACAAAGTGGTGAAATTGGTAAGCCCACCTTCAATAAATTGGTGTATCGCCGTGT
CCTGTCCTATCCGTGTCCGTGTCCCGTGTCCGTGTCCGTGCTTCTTAGGAAATCACACAGCATTGGAGTGTTGGAATCGTTTTGATCACTCCTACCAATCAGAGGAGATC
CCTAAAGCACTTGCTGCCATGAACCTAAATGAAGGGGTTGATCCATTGATGTATGCTTATTCAGGAGCCACTTCCCACATGGTTAATGATCCTGGTAAAGTTTCCACCTT
ACAACCTTATAAAGGATTAGACAAAATATTTGTTGGAAATGGAAACCAACTTGAAATTTCACATGTTGGTCAAAGTAAAATAGTCACCGATGATAATGAACTAATCCTTA
AAAATGTGCTTGTTGTGCCTGAAATTAAAAAGAATCTTATCTCAGTTAGTAAGCTCACTCGAGACAACGATTGTTCCATAACATTCTTTGCTAATGATTTTCTTGTGAAG
AATCGACAGAGGAAGATACTTGCTAAAGTCACAAGGACAGCAAGTTTATATGCATTAGAAGATCAACATTCAGAGAAAAATGAAGCCTGTGTTGCAACTAGTACAAATAA
TAAAGCTCCATACTCCATTTGGCATATGAGAATGGGGCATTTGAATGAAAAATTCACATGGATATATCCTTTAAAAAGAAAGTCTGAATTCATCTCTTGTTTCAAAAAGT
TTAAACTCATGATTGAAAATCAAGTTGACAAGAAGATAAAAGTTTTGCAAAGTGATGGTGGTGGAGAATTTACCTCACTTGAACTCAAAGAACTGTTGGAACAAAGTGGC
ATTGTGCACCAGTTATCATGTCCACACACTCCACAACAAAATGGAGTAGTGGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGGATTGTCCTTTGATTTGTACGGGTGAGAGTGGCCTGTTCGCCGACTCAATAAGCCTACCATTTTGGGGACAAGACCGAGTGGGGAGCTGGGAACGTAGTCTTAC
AAGATGGAATTCACTCCTTCCCGATATTAGGGTAAGTAGAGGTCCATTAGGTCCCACCGGTAGCTCATTCAGGGCGTTGAGCAAAGACAAGGTGATTTTGCTGCTGTTTT
TTCAACTTTATCAAAGGTTTTTGGCGAAGCAATTCAAGAAATCTTCAAGGGTTCTGATCGAGGTAACCACTTCTGTTATGGGCTACTCCATTGATGCATCTTTCCCAATA
TGTTCTGAAAGTTTAAGAAATTTCCAGCGAAGAAACTACGAGAGGGCTGCTGCATTTTTGTTCATTGAAGCATCGTTGGCGAAGAACGGTCAAGTCTACAACGAAGTAAG
CCCCGACCAATGTGTTCCTAAGAAAGGAGGTGTTACGGTAGTGACTAATAAGGACAATGAGTTGATCCCCACCAGATTTATCATTGCTCATGAGGACCAGGAAAAAACAA
CTTTCACCTTCCCGTATGAGACATTTGCTTTCAAGCGAATGCCTTTCGATCTTTGCAATGCTCCAGCAACATTTCAACGGTGTATGTTAGCTATTTTTTCTGATATGATT
GAGTCCACTGTTGAGGCTTTTGAGACTTTAAAAGCTGCTTTGATCTCAGCGCCCATTCTTTGTGCACCGAACTGGAGTTTGTCGTTCGAGGTAATGTGTGATGCCAACGA
TGTGGCAATAGGTGTAATGTTGGGGCAAAAACTTGACAAAGTTATCCATCCTATTTATTATGCAAGCAGGGTTTTGAATGAAGCACAAATCAACTACACAACTACTGAGA
AGGAGTTGTTGGCGATTGTGTTTGCCTTTGAGAAATTCCGCCCATACTTGGTGGGATCTAAAGTCACAGTGTTTACGGATCATGCAACAATAATGCCTTTTACTCCTACA
GAGGAAGGGTTAGAGGCCGAAGAAGATATTTCTCCTCACGAGGAAGAGGATTCCCTCAAGGTAGAGCAGTACCCTCCTCAAATCAGTCCACTGCCTTCACGCAAGACATA
TTTGCAACACCTCAAAAATCAAGAATCCACCCTGCTGCAACCATTGACCCAAAGCAACAAAGTGGTGAAATTGGTAAGCCCACCTTCAATAAATTGGTGTATCGCCGTGT
CCTGTCCTATCCGTGTCCGTGTCCCGTGTCCGTGTCCGTGCTTCTTAGGAAATCACACAGCATTGGAGTGTTGGAATCGTTTTGATCACTCCTACCAATCAGAGGAGATC
CCTAAAGCACTTGCTGCCATGAACCTAAATGAAGGGGTTGATCCATTGATGTATGCTTATTCAGGAGCCACTTCCCACATGGTTAATGATCCTGGTAAAGTTTCCACCTT
ACAACCTTATAAAGGATTAGACAAAATATTTGTTGGAAATGGAAACCAACTTGAAATTTCACATGTTGGTCAAAGTAAAATAGTCACCGATGATAATGAACTAATCCTTA
AAAATGTGCTTGTTGTGCCTGAAATTAAAAAGAATCTTATCTCAGTTAGTAAGCTCACTCGAGACAACGATTGTTCCATAACATTCTTTGCTAATGATTTTCTTGTGAAG
AATCGACAGAGGAAGATACTTGCTAAAGTCACAAGGACAGCAAGTTTATATGCATTAGAAGATCAACATTCAGAGAAAAATGAAGCCTGTGTTGCAACTAGTACAAATAA
TAAAGCTCCATACTCCATTTGGCATATGAGAATGGGGCATTTGAATGAAAAATTCACATGGATATATCCTTTAAAAAGAAAGTCTGAATTCATCTCTTGTTTCAAAAAGT
TTAAACTCATGATTGAAAATCAAGTTGACAAGAAGATAAAAGTTTTGCAAAGTGATGGTGGTGGAGAATTTACCTCACTTGAACTCAAAGAACTGTTGGAACAAAGTGGC
ATTGTGCACCAGTTATCATGTCCACACACTCCACAACAAAATGGAGTAGTGGAATGA
Protein sequenceShow/hide protein sequence
MRDCPLICTGESGLFADSISLPFWGQDRVGSWERSLTRWNSLLPDIRVSRGPLGPTGSSFRALSKDKVILLLFFQLYQRFLAKQFKKSSRVLIEVTTSVMGYSIDASFPI
CSESLRNFQRRNYERAAAFLFIEASLAKNGQVYNEVSPDQCVPKKGGVTVVTNKDNELIPTRFIIAHEDQEKTTFTFPYETFAFKRMPFDLCNAPATFQRCMLAIFSDMI
ESTVEAFETLKAALISAPILCAPNWSLSFEVMCDANDVAIGVMLGQKLDKVIHPIYYASRVLNEAQINYTTTEKELLAIVFAFEKFRPYLVGSKVTVFTDHATIMPFTPT
EEGLEAEEDISPHEEEDSLKVEQYPPQISPLPSRKTYLQHLKNQESTLLQPLTQSNKVVKLVSPPSINWCIAVSCPIRVRVPCPCPCFLGNHTALECWNRFDHSYQSEEI
PKALAAMNLNEGVDPLMYAYSGATSHMVNDPGKVSTLQPYKGLDKIFVGNGNQLEISHVGQSKIVTDDNELILKNVLVVPEIKKNLISVSKLTRDNDCSITFFANDFLVK
NRQRKILAKVTRTASLYALEDQHSEKNEACVATSTNNKAPYSIWHMRMGHLNEKFTWIYPLKRKSEFISCFKKFKLMIENQVDKKIKVLQSDGGGEFTSLELKELLEQSG
IVHQLSCPHTPQQNGVVE