CuGenDBv2

Lag0042132 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0042132
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: DNA-directed DNA polymerase
Genome location: chr13:36950917..36961525
RNA-Seq Expression: Lag0042132
Synteny: Lag0042132
Gene Ontology terms: NA
InterPro domains:
  IPR021109 - Aspartic peptidase domain superfamily
  IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
  IPR043502 - DNA/RNA polymerase superfamily
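
The genome location above uses a `chrom:start..end` notation. As a minimal sketch, the snippet below splits that string into its parts and computes the span of the gene on the chromosome. The helper name `parse_location` is hypothetical, and the span assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end).

    Hypothetical helper for illustration; not part of CuGenDB.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr13:36950917..36961525")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 10,609 bp, several times longer than the CDS listed under Sequences.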


Homology
GenBank top hits | e value | % identity | Alignment
XP_022951570.1 uncharacterized protein LOC111454344 [Cucurbita moschata] | 4.5e-56 | 51.09
Query:  MGQLASELKARPQGKLPADIEVPKREGKEQVQTIELRSGR--PLTGPKILIK--------------------QKQKSKEFV---VQPE------------
        +GQLA+EL+ RP GKLPAD E PKREGKEQ Q IELRSG+  P  G K   +                    QK+ SK++     QP+            
Subjt:  MGQLASELKARPQGKLPADIEVPKREGKEQVQTIELRSGR--PLTGPKILIK--------------------QKQKSKEFV---VQPE------------

Query:  -----------------------------------------------VNFLKDILPKKKRFGEYESVSLTEECSAILKNKLPPKVKDPGSFTIPVSICGK
                                                       V FLKD+L  +++F E++ VSL EECSAILKNK+P K KDPGSFTIPVSI GK
Subjt:  -----------------------------------------------VNFLKDILPKKKRFGEYESVSLTEECSAILKNKLPPKVKDPGSFTIPVSICGK

Query:  ELGHALCDLGASINLMPLSIYKKLGIGEARPTMDTLKLVDRSITYLEGKIEDILVQLDKFIFPIDFIILDYEADKE
        ELG ALCDLGASINLMPLSIYKKLGIGEARPT  TL+L DRSITY EGKIEDIL+Q+DKFIFP DFIILDYEAD +
Subjt:  ELGHALCDLGASINLMPLSIYKKLGIGEARPTMDTLKLVDRSITYLEGKIEDILVQLDKFIFPIDFIILDYEADKE

XP_022960431.1 uncharacterized protein LOC111461167 [Cucurbita moschata] | 8.0e-53 | 45.67
Query:  MGQLASELKARPQGKLPADIEVPKREGKEQVQTIELRSGRPLTGPKILIKQ------------KQKSKEFVVQPE-------------------------
        +GQLA+EL+ RP GKLP+D E+PKREG EQ Q IELRSG+ ++  +  IK+            +Q+ +E VVQ E                         
Subjt:  MGQLASELKARPQGKLPADIEVPKREGKEQVQTIELRSGRPLTGPKILIKQ------------KQKSKEFVVQPE-------------------------

Query:  ------------------------------------------------------------VNFLKDILPKKKRFGEYESVSLTEECSAILKNKLPPKVKD
                                                                    V FLKD+L  +++F E++ V L EECSAILKNK+P K KD
Subjt:  ------------------------------------------------------------VNFLKDILPKKKRFGEYESVSLTEECSAILKNKLPPKVKD

Query:  PGSFTIPVSICGKELGHALCDLGASINLMPLSIYKKLGIGEARPTMDTLKLVDRSITYLEGKIEDILVQLDKFIFPIDFIILDYEADKE
        PGSFTIP+SI GK+LG ALCDLG+SINLMPLSIYKKLGIGEARPT  TL+L DRS T+ EGKIEDIL+Q+DKFIFP DFIILDYEAD +
Subjt:  PGSFTIPVSICGKELGHALCDLGASINLMPLSIYKKLGIGEARPTMDTLKLVDRSITYLEGKIEDILVQLDKFIFPIDFIILDYEADKE

XP_024022362.1 uncharacterized protein LOC112091881 [Morus notabilis] | 1.6e-53 | 54.15
Query:  MGQLASELKARPQGKLPADIEVPKREGKEQ----VQTIELRSGRPLTGP----------KILIKQKQ----KSKEFVVQPE-------------------
        +GQLA+ L  RPQG LP+D E P+R+GKEQ     + I LR+GR +  P           I  ++ Q    +S++ V+  +                   
Subjt:  MGQLASELKARPQGKLPADIEVPKREGKEQ----VQTIELRSGRPLTGP----------KILIKQKQ----KSKEFVVQPE-------------------

Query:  VNFLKDILPKKKRFGEYESVSLTEECSAILKNKLPPKVKDPGSFTIPVSICGKELGHALCDLGASINLMPLSIYKKLGIGEARPTMDTLKLVDRSITYLE
        V F+K IL KK+R GE+E+V+LTEECSAILKN+LPPK+KDPGSFTIP SI  + +G ALCDLGASINLMP+SI++KLGIGEARPT  TL+L DRS  + E
Subjt:  VNFLKDILPKKKRFGEYESVSLTEECSAILKNKLPPKVKDPGSFTIPVSICGKELGHALCDLGASINLMPLSIYKKLGIGEARPTMDTLKLVDRSITYLE

Query:  GKIEDILVQLDKFIFPIDFIILDYEADKE
        GKIED+LV++DKFIFP DFI+LDYEADKE
Subjt:  GKIEDILVQLDKFIFPIDFIILDYEADKE

XP_030505532.1 uncharacterized protein LOC115720524 [Cannabis sativa] | 7.2e-54 | 51.02
Query:  MGQLASELKARPQGKLPADIEVPKREGKEQVQTIELRSGR---------------------------------PLTGPKILIKQKQ--KSKEFV------
        +G LA+ELKARPQG LP+D + P+R+GKEQ ++I+LRSG+                                 PL  P+   KQ+Q  + K+F+      
Subjt:  MGQLASELKARPQGKLPADIEVPKREGKEQVQTIELRSGR---------------------------------PLTGPKILIKQKQ--KSKEFV------

Query:  ------------VQPEVNFLKDILPKKKRFGEYESVSLTEECSAILKNKLPPKVKDPGSFTIPVSICGKELGHALCDLGASINLMPLSIYKKLGIGEARP
                    +   V FLKDIL KK+R GE+E+V+LTE C+A+LK+K+PPK+KDPGSFTIP SI G+++G AL DLGASINLMP+SI+K LGIGEARP
Subjt:  ------------VQPEVNFLKDILPKKKRFGEYESVSLTEECSAILKNKLPPKVKDPGSFTIPVSICGKELGHALCDLGASINLMPLSIYKKLGIGEARP

Query:  TMDTLKLVDRSITYLEGKIEDILVQLDKFIFPIDFIILDYEADKE
        T  TL+L DRS+ + EGKIED+LVQ+DKFI P DFIILDYEAD++
Subjt:  TMDTLKLVDRSITYLEGKIEDILVQLDKFIFPIDFIILDYEADKE

XP_030509265.1 uncharacterized protein LOC115723943 [Cannabis sativa] | 2.5e-54 | 48.15
Query:  MGQLASELKARPQGKLPADIEVPKREGKEQVQTIELRSGR----------------------------------------------------------PL
        +G LA+ELKARPQG LP+D E P+R+GKEQ ++I LRSG+                                                          PL
Subjt:  MGQLASELKARPQGKLPADIEVPKREGKEQVQTIELRSGR----------------------------------------------------------PL

Query:  TGPKILIKQKQ--KSKEFV------------------VQPEVNFLKDILPKKKRFGEYESVSLTEECSAILKNKLPPKVKDPGSFTIPVSICGKELGHAL
          P+   KQ+Q  + K+F+                  +   V FLKDIL KK+R GE+E+V+LTE CSA+LK+K+PPK+KDPGSFTIP SI G+++G AL
Subjt:  TGPKILIKQKQ--KSKEFV------------------VQPEVNFLKDILPKKKRFGEYESVSLTEECSAILKNKLPPKVKDPGSFTIPVSICGKELGHAL

Query:  CDLGASINLMPLSIYKKLGIGEARPTMDTLKLVDRSITYLEGKIEDILVQLDKFIFPIDFIILDYEADKE
        CDLGASINLMP+SI+KKLGIGEARPT  TL+L DRS+ + EGKIED+LVQ+DKFIFP DFIILDYEAD++
Subjt:  CDLGASINLMPLSIYKKLGIGEARPTMDTLKLVDRSITYLEGKIEDILVQLDKFIFPIDFIILDYEADKE

TrEMBL top hits | e value | % identity | Alignment
A0A2G9HWL8 DNA-directed DNA polymerase | 1.4e-50 | 53.74
Query:  MGQLASELKARPQGKLPADIEV-PKREGKEQVQTIELRSGRPLTGPKILIKQKQKSKEFVVQPE---------------------VNFLKDILPKKKRFG
        +GQLA+ +  RPQG LP++IE  P+++GK Q Q + LR+GR L   + ++K+ +KSKE  V  E                     V F+KDIL KK+R G
Subjt:  MGQLASELKARPQGKLPADIEV-PKREGKEQVQTIELRSGRPLTGPKILIKQKQKSKEFVVQPE---------------------VNFLKDILPKKKRFG

Query:  EYESVSLTEECSAILKNKLPPKVKDPGSFTIPVSICGKELGHALCDLGASINLMPLSIYKKLGIGEARPTMDTLKLVDRSITYLEGKIEDILVQLDKFIF
        +YE+V+LTEECSAI++NKLPPK+KDPGSFTIP +I     G ALCDLGASINLMP SIY+ LG+GE + T  TL+L DRS+TY +G IEDILV++DKFIF
Subjt:  EYESVSLTEECSAILKNKLPPKVKDPGSFTIPVSICGKELGHALCDLGASINLMPLSIYKKLGIGEARPTMDTLKLVDRSITYLEGKIEDILVQLDKFIF

Query:  PIDFIILDYEADKE
        P +F++LD E D E
Subjt:  PIDFIILDYEADKE

A0A2G9HWL8 DNA-directed DNA polymerase | 1.4e-26 | 47.47
Query:  HLKYMYLWEGETLPIIVASDLMPEDEKTLITLLQQYK-DRSFRSIEQQRRLNPAMKKVVKKEMTKWLDARIIYPNA------------------------
        HL Y YL E +TLP+I++S L     + L+ +L+ +K D    S+E QRRLNP MK+VVKKE+ KWLD  IIYP +                        
Subjt:  HLKYMYLWEGETLPIIVASDLMPEDEKTLITLLQQYK-DRSFRSIEQQRRLNPAMKKVVKKEMTKWLDARIIYPNA------------------------

Query:  --------IGRLNKATHKDHFPLPFIDQMLDRLVDQAYYYFLDGYSGYNQITIAPEDQ
                  +LNKAT KD+F LPFIDQM DRL    +Y FLDGYSGYNQI IAPEDQ
Subjt:  --------IGRLNKATHKDHFPLPFIDQMLDRLVDQAYYYFLDGYSGYNQITIAPEDQ

A0A2G9HWL8 DNA-directed DNA polymerase | 1.8e-50 | 40.69
Query:  GIGEARPTTITLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRLFLAT------------------------------------
        GIGEARP T+TLQLADRSITY EGKIEDVLV+VDKFIFP DFIILDYEADK++PIILGR FL+T                                    
Subjt:  GIGEARPTTITLQLADRSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRLFLAT------------------------------------

Query:  ---------------------------------------------------------DHLKYMYLWEGETLPIIVASDLMPEDEKTLITLLQQYK-----
                                                                  HLKY YL E ETLP+ +A+DL  E E  LI +L+ +K     
Subjt:  ---------------------------------------------------------DHLKYMYLWEGETLPIIVASDLMPEDEKTLITLLQQYK-----

Query:  --------------------DRSFRSIEQQRRLNPAMKKVVKKEMTKWLDARIIYPNAIG----------------------------------------
                            +    SIE QRRLNPAMK+VVKKE+ KWLDA IIYP A G                                        
Subjt:  --------------------DRSFRSIEQQRRLNPAMKKVVKKEMTKWLDARIIYPNAIG----------------------------------------

Query:  -RLNKATHKDHFPLPFIDQMLDRLVDQAYYYFLDGYSGYNQITIAPEDQ
         +LNKAT KDHFPLPFIDQMLD LV Q YYY LDGY+GYNQITI P+DQ
Subjt:  -RLNKATHKDHFPLPFIDQMLDRLVDQAYYYFLDGYSGYNQITIAPEDQ

A0A6J1DV77 uncharacterized protein LOC111023818 | 2.2e-48 | 78.29
Query:  VNFLKDILPKKKRFGEYESVSLTEECSAILKNKLPPKVKDPGSFTIPVSICGKELGHALCDLGASINLMPLSIYKKLGIGEARPTMDTLKLVDRSITYLE
        V FLKDIL KK+R GE+E V+LT+E SAIL  KLP K+ DPGSFTIPV I GK +GHALCDLGASINLMPLS+Y+KLGIGEARP   TL+L DRSITYLE
Subjt:  VNFLKDILPKKKRFGEYESVSLTEECSAILKNKLPPKVKDPGSFTIPVSICGKELGHALCDLGASINLMPLSIYKKLGIGEARPTMDTLKLVDRSITYLE

Query:  GKIEDILVQLDKFIFPIDFIILDYEADKE
        GKIED+LVQ+DKFIFP DFIILDYEADKE
Subjt:  GKIEDILVQLDKFIFPIDFIILDYEADKE

A0A6J1DZC3 uncharacterized protein LOC111024449 | 2.5e-52 | 58.05
Query:  MGQLASELKARPQGKLPADIEVPKREGKEQVQTIELRSGRPLTGPKILIKQKQ---KSKEFVVQPE----------VNFLKDILPKKKRFGEYESVSLTE
        MGQLASELK RP+G LP+  E PK EG+E  +TI  RSG     PK+  +      K KE   +P+            FLKDI+ +KK+ GEYE+V+LTE
Subjt:  MGQLASELKARPQGKLPADIEVPKREGKEQVQTIELRSGRPLTGPKILIKQKQ---KSKEFVVQPE----------VNFLKDILPKKKRFGEYESVSLTE

Query:  ECSAILKNKLPPKVKDPGSFTIPVSICGKELGHALCDLGASINLMPLSIYKKLGIGEARPTMDTLKLVDRSITYLEGKIEDILVQLDKFIFPIDFIILDY
          S + K+K+ PK+KDPGSFTIP SI GK++G ALCDL ASINLMPLSI+KKL IG+A PT  TL+L DRSIT  EGKIED+LV++DKFIFP DFIIL+ 
Subjt:  ECSAILKNKLPPKVKDPGSFTIPVSICGKELGHALCDLGASINLMPLSIYKKLGIGEARPTMDTLKLVDRSITYLEGKIEDILVQLDKFIFPIDFIILDY

Query:  EADKE
        EADK+
Subjt:  EADKE

A0A6J1GJ68 uncharacterized protein LOC111454344 | 2.2e-56 | 51.09
Query:  MGQLASELKARPQGKLPADIEVPKREGKEQVQTIELRSGR--PLTGPKILIK--------------------QKQKSKEFV---VQPE------------
        +GQLA+EL+ RP GKLPAD E PKREGKEQ Q IELRSG+  P  G K   +                    QK+ SK++     QP+            
Subjt:  MGQLASELKARPQGKLPADIEVPKREGKEQVQTIELRSGR--PLTGPKILIK--------------------QKQKSKEFV---VQPE------------

Query:  -----------------------------------------------VNFLKDILPKKKRFGEYESVSLTEECSAILKNKLPPKVKDPGSFTIPVSICGK
                                                       V FLKD+L  +++F E++ VSL EECSAILKNK+P K KDPGSFTIPVSI GK
Subjt:  -----------------------------------------------VNFLKDILPKKKRFGEYESVSLTEECSAILKNKLPPKVKDPGSFTIPVSICGK

Query:  ELGHALCDLGASINLMPLSIYKKLGIGEARPTMDTLKLVDRSITYLEGKIEDILVQLDKFIFPIDFIILDYEADKE
        ELG ALCDLGASINLMPLSIYKKLGIGEARPT  TL+L DRSITY EGKIEDIL+Q+DKFIFP DFIILDYEAD +
Subjt:  ELGHALCDLGASINLMPLSIYKKLGIGEARPTMDTLKLVDRSITYLEGKIEDILVQLDKFIFPIDFIILDYEADKE

A0A6J1H7K8 uncharacterized protein LOC111461167 | 3.9e-53 | 45.67
Query:  MGQLASELKARPQGKLPADIEVPKREGKEQVQTIELRSGRPLTGPKILIKQ------------KQKSKEFVVQPE-------------------------
        +GQLA+EL+ RP GKLP+D E+PKREG EQ Q IELRSG+ ++  +  IK+            +Q+ +E VVQ E                         
Subjt:  MGQLASELKARPQGKLPADIEVPKREGKEQVQTIELRSGRPLTGPKILIKQ------------KQKSKEFVVQPE-------------------------

Query:  ------------------------------------------------------------VNFLKDILPKKKRFGEYESVSLTEECSAILKNKLPPKVKD
                                                                    V FLKD+L  +++F E++ V L EECSAILKNK+P K KD
Subjt:  ------------------------------------------------------------VNFLKDILPKKKRFGEYESVSLTEECSAILKNKLPPKVKD

Query:  PGSFTIPVSICGKELGHALCDLGASINLMPLSIYKKLGIGEARPTMDTLKLVDRSITYLEGKIEDILVQLDKFIFPIDFIILDYEADKE
        PGSFTIP+SI GK+LG ALCDLG+SINLMPLSIYKKLGIGEARPT  TL+L DRS T+ EGKIEDIL+Q+DKFIFP DFIILDYEAD +
Subjt:  PGSFTIPVSICGKELGHALCDLGASINLMPLSIYKKLGIGEARPTMDTLKLVDRSITYLEGKIEDILVQLDKFIFPIDFIILDYEADKE

SwissProt top hits | e value | % identity | Alignment
Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | 5.4e-04 | 31.36
Query:  GETLPIIVASDLMPEDEKTLITLLQQYKDRSFRSIEQQRRLNPAMKKVVKKEMTKWL--DARIIYPNAIGRLNKATHKDHFPLPFIDQMLDRLVDQAYYY
        G  LP +    +  ++E+ +  ++Q+  D  F  +  +   +  +  V KK+ T  L  D R         LNKAT  D FPLP ID +L R+ +   + 
Subjt:  GETLPIIVASDLMPEDEKTLITLLQQYKDRSFRSIEQQRRLNPAMKKVVKKEMTKWL--DARIIYPNAIGRLNKATHKDHFPLPFIDQMLDRLVDQAYYY

Query:  FLDGYSGYNQITIAPEDQ
         LD +SGY+QI + P+D+
Subjt:  FLDGYSGYNQITIAPEDQ

Q99315 Transposon Ty3-G Gag-Pol polyprotein | 5.4e-04 | 31.36
Query:  GETLPIIVASDLMPEDEKTLITLLQQYKDRSFRSIEQQRRLNPAMKKVVKKEMTKWL--DARIIYPNAIGRLNKATHKDHFPLPFIDQMLDRLVDQAYYY
        G  LP +    +  ++E+ +  ++Q+  D  F  +  +   +  +  V KK+ T  L  D R         LNKAT  D FPLP ID +L R+ +   + 
Subjt:  GETLPIIVASDLMPEDEKTLITLLQQYKDRSFRSIEQQRRLNPAMKKVVKKEMTKWL--DARIIYPNAIGRLNKATHKDHFPLPFIDQMLDRLVDQAYYY

Query:  FLDGYSGYNQITIAPEDQ
         LD +SGY+QI + P+D+
Subjt:  FLDGYSGYNQITIAPEDQ

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGGGACAATTGGCAAGTGAACTGAAGGCTAGACCCCAAGGAAAATTACCTGCTGATATTGAGGTTCCAAAGAGAGAAGGTAAGGAGCAAGTCCAAACGATTGAATTGAG
AAGTGGTAGACCTCTGACTGGACCTAAAATTTTGATCAAGCAAAAACAGAAAAGTAAAGAGTTTGTTGTACAACCAGAAGTGAATTTTCTCAAGGATATCTTACCCAAGA
AGAAAAGGTTTGGGGAGTATGAGTCTGTCTCTTTAACCGAGGAGTGTAGTGCTATATTGAAAAATAAACTCCCACCTAAGGTTAAAGATCCTGGTTCTTTTACTATTCCT
GTTTCTATATGTGGAAAAGAGCTAGGGCATGCTCTATGTGACCTTGGTGCTAGTATAAACCTTATGCCATTATCTATATACAAGAAATTGGGCATAGGTGAAGCTAGACC
CACAATGGATACATTAAAACTTGTTGATAGATCCATAACTTACCTGGAAGGAAAGATTGAAGATATTTTGGTTCAATTAGATAAGTTCATATTCCCAATTGATTTTATTA
TTTTGGATTATGAAGCGGACAAAGAGGGCCATACAAGGGTTAAACTCGGGATATGGGTCCCTGCAGCAGGTGTGTGGGAGCTGCAACCTTCGTTAAAATGCAAGATATCC
GCACGCGATGTGATTCCAGGATGGCTAATCAAGCCCCATCGCTTAGTTGGAGCGTTGCAATTTCTGACTAAGTGTAGATCGTCACAATTGGGGACTTCTAATCAGACCCA
CGCCGTTGCCGCTCCCTCATCGTCGCCAGCTGCCTCGTGCCGCCTCCTCACGTCGCCAGTCTTGTCTCGCGCTGCCGCCACTGTTGCCCTTGGCTCAGTAAGTACTCTTC
CCTCTCGTTTTCTCTCTTCCATTCAGCAAGCATGCAATTTCTTCTCCCTCTCATTCTCGTTCCTGTCAGGCAGCATGTGGGTTCTCCAATCTCGGCCTACTCGTCCGACC
ACGAGCATGACGACATGGAACCCATGGAAATCTGTGATTTTCAGCAGGAAAGTAGTGGTTTCAGTGCGTTTTCTTCCTCCATTCGGCCTCAATTCGCGAGCTTCAACATG
CACCTTCTTCTCTTTAGCGTTTTGGCCGAACAACCTTGAAATTGGATACCCATATGTTAGGAGGTTCGGATTAGTGATTTCGAACTCCTTCGTCCTTCGGACAACAAGCA
TTCGAGATCATTTAGATGCTGAATGGTTCCCGCCCCTGTTCGATTATGCAGGTGATGAACTTTATGTGCCCGGTGATGTAGACGAGGAGATAAGCTTAAAAGAATTGATA
GATACTACCTATACCAACAAGGTGCACCTTCCTTTTCGGTGGCTCAATCATAGGAACTTTGAAGTTAAGCGTGCTTTGCTTGGAGAAGTTCTATGTAGGGCCAATGAAAA
GGCCAAAGTCTATATCTTGGCGATTCTGTCAAACGTTCTTGCCAAGAAGCATGAGGTCATGGTCTCGACTCGAGAGATCATGGAGTCCCTCCATGAGATGTTTGGACAAC
CGTCATCTCAGCTCCACCACAAAACCCACAAGTACATTTATAATGTGCACATAAAGAAAGACCAGTGTGTTCGAGACCACGTTCTTGGCATGATGCTTCACTCCAATGTC
ACCGAGACAAATGGGGTGGTTATAAATGAGCGTAGGCAGGTGTTGTTCATTATGGAGTCTCTTTTGAAGACCTTCCTTCAATTCCGTAGTAATGCAATCCAAAAGAAGAT
AGGAGGAAAGGGGAAGGCTCTTGCTGTTGTTGATAAAGGCAAGGGTAAGGCCAAGGTGCCTGATAAGGGTAAGGCCAAGGAGCCTCCTTGTTCTTCTCCGCTGGAAAATT
CTTCTCAAGAACACAATAACCAAAGAGGTTACACCGGTATAGGTGAAGCTAGGCCCACTACAATCACCCTCCAACTAGCTGATAGATCCATTACTTATCCAGAGGGGAAA
ATTGAGGATGTCTTGGTGAAGGTAGATAAATTTATATTTCCTGTTGATTTCATTATTTTAGATTATGAGGCAGATAAAGATGTCCCAATTATCCTTGGTCGTCTATTTTT
AGCTACTGATCATCTAAAGTACATGTATCTCTGGGAAGGAGAAACGTTGCCTATTATTGTTGCATCAGATTTGATGCCAGAGGATGAGAAAACACTAATAACATTGCTAC
AGCAATACAAGGACAGATCCTTTAGAAGTATTGAGCAGCAAAGAAGGCTTAATCCCGCTATGAAGAAGGTTGTCAAAAAGGAAATGACAAAGTGGTTAGATGCAAGGATA
ATCTACCCGAATGCAATCGGGAGGCTTAATAAAGCCACCCATAAAGACCACTTCCCTTTACCGTTCATTGACCAGATGTTGGACAGATTGGTCGACCAAGCATATTACTA
CTTCCTAGATGGTTATTCTGGATATAACCAGATCACAATTGCTCCAGAGGACCAGTAA
mRNA sequence
ATGGGACAATTGGCAAGTGAACTGAAGGCTAGACCCCAAGGAAAATTACCTGCTGATATTGAGGTTCCAAAGAGAGAAGGTAAGGAGCAAGTCCAAACGATTGAATTGAG
AAGTGGTAGACCTCTGACTGGACCTAAAATTTTGATCAAGCAAAAACAGAAAAGTAAAGAGTTTGTTGTACAACCAGAAGTGAATTTTCTCAAGGATATCTTACCCAAGA
AGAAAAGGTTTGGGGAGTATGAGTCTGTCTCTTTAACCGAGGAGTGTAGTGCTATATTGAAAAATAAACTCCCACCTAAGGTTAAAGATCCTGGTTCTTTTACTATTCCT
GTTTCTATATGTGGAAAAGAGCTAGGGCATGCTCTATGTGACCTTGGTGCTAGTATAAACCTTATGCCATTATCTATATACAAGAAATTGGGCATAGGTGAAGCTAGACC
CACAATGGATACATTAAAACTTGTTGATAGATCCATAACTTACCTGGAAGGAAAGATTGAAGATATTTTGGTTCAATTAGATAAGTTCATATTCCCAATTGATTTTATTA
TTTTGGATTATGAAGCGGACAAAGAGGGCCATACAAGGGTTAAACTCGGGATATGGGTCCCTGCAGCAGGTGTGTGGGAGCTGCAACCTTCGTTAAAATGCAAGATATCC
GCACGCGATGTGATTCCAGGATGGCTAATCAAGCCCCATCGCTTAGTTGGAGCGTTGCAATTTCTGACTAAGTGTAGATCGTCACAATTGGGGACTTCTAATCAGACCCA
CGCCGTTGCCGCTCCCTCATCGTCGCCAGCTGCCTCGTGCCGCCTCCTCACGTCGCCAGTCTTGTCTCGCGCTGCCGCCACTGTTGCCCTTGGCTCAGTAAGTACTCTTC
CCTCTCGTTTTCTCTCTTCCATTCAGCAAGCATGCAATTTCTTCTCCCTCTCATTCTCGTTCCTGTCAGGCAGCATGTGGGTTCTCCAATCTCGGCCTACTCGTCCGACC
ACGAGCATGACGACATGGAACCCATGGAAATCTGTGATTTTCAGCAGGAAAGTAGTGGTTTCAGTGCGTTTTCTTCCTCCATTCGGCCTCAATTCGCGAGCTTCAACATG
CACCTTCTTCTCTTTAGCGTTTTGGCCGAACAACCTTGAAATTGGATACCCATATGTTAGGAGGTTCGGATTAGTGATTTCGAACTCCTTCGTCCTTCGGACAACAAGCA
TTCGAGATCATTTAGATGCTGAATGGTTCCCGCCCCTGTTCGATTATGCAGGTGATGAACTTTATGTGCCCGGTGATGTAGACGAGGAGATAAGCTTAAAAGAATTGATA
GATACTACCTATACCAACAAGGTGCACCTTCCTTTTCGGTGGCTCAATCATAGGAACTTTGAAGTTAAGCGTGCTTTGCTTGGAGAAGTTCTATGTAGGGCCAATGAAAA
GGCCAAAGTCTATATCTTGGCGATTCTGTCAAACGTTCTTGCCAAGAAGCATGAGGTCATGGTCTCGACTCGAGAGATCATGGAGTCCCTCCATGAGATGTTTGGACAAC
CGTCATCTCAGCTCCACCACAAAACCCACAAGTACATTTATAATGTGCACATAAAGAAAGACCAGTGTGTTCGAGACCACGTTCTTGGCATGATGCTTCACTCCAATGTC
ACCGAGACAAATGGGGTGGTTATAAATGAGCGTAGGCAGGTGTTGTTCATTATGGAGTCTCTTTTGAAGACCTTCCTTCAATTCCGTAGTAATGCAATCCAAAAGAAGAT
AGGAGGAAAGGGGAAGGCTCTTGCTGTTGTTGATAAAGGCAAGGGTAAGGCCAAGGTGCCTGATAAGGGTAAGGCCAAGGAGCCTCCTTGTTCTTCTCCGCTGGAAAATT
CTTCTCAAGAACACAATAACCAAAGAGGTTACACCGGTATAGGTGAAGCTAGGCCCACTACAATCACCCTCCAACTAGCTGATAGATCCATTACTTATCCAGAGGGGAAA
ATTGAGGATGTCTTGGTGAAGGTAGATAAATTTATATTTCCTGTTGATTTCATTATTTTAGATTATGAGGCAGATAAAGATGTCCCAATTATCCTTGGTCGTCTATTTTT
AGCTACTGATCATCTAAAGTACATGTATCTCTGGGAAGGAGAAACGTTGCCTATTATTGTTGCATCAGATTTGATGCCAGAGGATGAGAAAACACTAATAACATTGCTAC
AGCAATACAAGGACAGATCCTTTAGAAGTATTGAGCAGCAAAGAAGGCTTAATCCCGCTATGAAGAAGGTTGTCAAAAAGGAAATGACAAAGTGGTTAGATGCAAGGATA
ATCTACCCGAATGCAATCGGGAGGCTTAATAAAGCCACCCATAAAGACCACTTCCCTTTACCGTTCATTGACCAGATGTTGGACAGATTGGTCGACCAAGCATATTACTA
CTTCCTAGATGGTTATTCTGGATATAACCAGATCACAATTGCTCCAGAGGACCAGTAA
Protein sequence
MGQLASELKARPQGKLPADIEVPKREGKEQVQTIELRSGRPLTGPKILIKQKQKSKEFVVQPEVNFLKDILPKKKRFGEYESVSLTEECSAILKNKLPPKVKDPGSFTIP
VSICGKELGHALCDLGASINLMPLSIYKKLGIGEARPTMDTLKLVDRSITYLEGKIEDILVQLDKFIFPIDFIILDYEADKEGHTRVKLGIWVPAAGVWELQPSLKCKIS
ARDVIPGWLIKPHRLVGALQFLTKCRSSQLGTSNQTHAVAAPSSSPAASCRLLTSPVLSRAAATVALGSVSTLPSRFLSSIQQACNFFSLSFSFLSGSMWVLQSRPTRPT
TSMTTWNPWKSVIFSRKVVVSVRFLPPFGLNSRASTCTFFSLAFWPNNLEIGYPYVRRFGLVISNSFVLRTTSIRDHLDAEWFPPLFDYAGDELYVPGDVDEEISLKELI
DTTYTNKVHLPFRWLNHRNFEVKRALLGEVLCRANEKAKVYILAILSNVLAKKHEVMVSTREIMESLHEMFGQPSSQLHHKTHKYIYNVHIKKDQCVRDHVLGMMLHSNV
TETNGVVINERRQVLFIMESLLKTFLQFRSNAIQKKIGGKGKALAVVDKGKGKAKVPDKGKAKEPPCSSPLENSSQEHNNQRGYTGIGEARPTTITLQLADRSITYPEGK
IEDVLVKVDKFIFPVDFIILDYEADKDVPIILGRLFLATDHLKYMYLWEGETLPIIVASDLMPEDEKTLITLLQQYKDRSFRSIEQQRRLNPAMKKVVKKEMTKWLDARI
IYPNAIGRLNKATHKDHFPLPFIDQMLDRLVDQAYYYFLDGYSGYNQITIAPEDQ
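
The protein sequence above should be the conceptual translation of the CDS sequence. As a minimal sketch of how that could be checked, the snippet below translates a nucleotide string with the standard genetic code. The `translate` helper is hypothetical, and the use of the standard nuclear code is an assumption that this page does not state. Only the first 36 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon.

    Hypothetical helper for illustration. Whitespace and line breaks are
    ignored; codons containing unknown characters become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 36 nucleotides of the CDS sequence listed above.
cds_prefix = "ATGGGACAATTGGCAAGTGAACTGAAGGCTAGACCC"
print(translate(cds_prefix))  # MGQLASELKARP
```

The output matches the first twelve residues of the protein sequence. Passing the full CDS, with its line breaks left in, should allow the same comparison over the whole record.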