CuGenDBv2

Lag0006490 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0006490
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Transposon TX1 uncharacterized 149 kDa protein
Genome location: chr6:42966976..42969664
RNA-Seq Expression: Lag0006490
Synteny: Lag0006490
Gene Ontology terms:
  GO:0006281 - DNA repair (biological process)
  GO:0110165 - cellular anatomical structure (cellular component)
  GO:0004518 - nuclease activity (molecular function)
InterPro domains:
  IPR004808 - AP endonuclease 1
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
CAN78049.1 hypothetical protein VITISV_015861 [Vitis vinifera] (e value 1.7e-44, identity 32.58%)
Query:  VNYNTSAANRQMGRALIKDFLSSHNPALVILQETKLSTIDQKIVKSVWSSRNIAWAAINAIGASGGITMLWNESTFDVLEVVEGNFSLSIHLSLADGFSF
        +++NT     +  R  ++ FLS+ NP +V+LQETK    D++ V SVW  +++ W A+ A  ASGGI +LW+   F   E V G+FS+++ L+  +   F
Subjt:  VNYNTSAANRQMGRALIKDFLSSHNPALVILQETKLSTIDQKIVKSVWSSRNIAWAAINAIGASGGITMLWNESTFDVLEVVEGNFSLSIHLSLADGFSF

Query:  WITGVYGPSSSHDRKFFWKELIDLQALCIPNWVLGGDFNITRWSWEKSTFSAPT----------------------------------------------
        W+T VYGP+ +  RK FW EL DL  L  P W +GGDFN+ R   EK   S  T                                              
Subjt:  WITGVYGPSSSHDRKFFWKELIDLQALCIPNWVLGGDFNITRWSWEKSTFSAPT----------------------------------------------

Query:  --------------RVQGWSGHGLIQKLKGLKKELKSWNQEIFGHQKEIKIQLGKELSIIDSKEENDLLTEQDCIRRSNIKAELLVLAANEEVMWRQKWK
                       V+GW GH  ++KLK +K +LK WN  +FG  +E K  +  +L  ID  E+   L       R+  + EL  L   EEV WRQK +
Subjt:  --------------RVQGWSGHGLIQKLKGLKKELKSWNQEIFGHQKEIKIQLGKELSIIDSKEENDLLTEQDCIRRSNIKAELLVLAANEEVMWRQKWK

Query:  FKWFVEGDVNFAFFHRIVASNRRKSVITEILSADGHSLMTDVDIEKEFLLFISNYF
         KW  EGD N  FFHR+    R +  I  ++S  G +L     I +E + F  N +
Subjt:  FKWFVEGDVNFAFFHRIVASNRRKSVITEILSADGHSLMTDVDIEKEFLLFISNYF

RVW70784.1 hypothetical protein CK203_058030 [Vitis vinifera] (e value 1.7e-47, identity 33.43%)
Query:  VNYNTSAANRQMGRALIKDFLSSHNPALVILQETKLSTIDQKIVKSVWSSRNIAWAAINAIGASGGITMLWNESTFDVLEVVEGNFSLSIHLSLADGFSF
        +++NT     +  R  ++ FLS+ NP +V+LQETK  T D++ V SVW  + + WAA+ A GASGGI +LW+ S F+  + V G+FS+++  +  +  SF
Subjt:  VNYNTSAANRQMGRALIKDFLSSHNPALVILQETKLSTIDQKIVKSVWSSRNIAWAAINAIGASGGITMLWNESTFDVLEVVEGNFSLSIHLSLADGFSF

Query:  WITGVYGPSSSHDRKFFWKELIDLQALCIPNWVLGGDFNI-----------------------TRWSWEKSTF----------SAPTRVQ----------
        W+T VY P +   RK FW EL DL  L  P W +GGDFN+                        RW+ + S            S P R Q          
Subjt:  WITGVYGPSSSHDRKFFWKELIDLQALCIPNWVLGGDFNI-----------------------TRWSWEKSTF----------SAPTRVQ----------

Query:  -------------GWSGHGLIQKLKGLKKELKSWNQEIFGHQKEIKIQLGKELSIIDSKEENDLLTEQDCIRRSNIKAELLVLAANEEVMWRQKWKFKWF
                     GW GH  ++KLK +K +LK WN   FG  KE K  +  +LS ID  E+   L     + R+  + EL  +   EEV WRQK + KW 
Subjt:  -------------GWSGHGLIQKLKGLKKELKSWNQEIFGHQKEIKIQLGKELSIIDSKEENDLLTEQDCIRRSNIKAELLVLAANEEVMWRQKWKFKWF

Query:  VEGDVNFAFFHRIVASNRRKSVITEILSADGHSLMTDVDIEKEFLLFISNYFLRKI
         EGD N  +FHR+    R +  I  ++S  G +L +   I +E + F  N + + +
Subjt:  VEGDVNFAFFHRIVASNRRKSVITEILSADGHSLMTDVDIEKEFLLFISNYFLRKI

RVW83303.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] (e value 1.9e-46, identity 32.03%)
Query:  VNYNTSAANRQMGRALIKDFLSSHNPALVILQETKLSTIDQKIVKSVWSSRNIAWAAINAIGASGGITMLWNESTFDVLEVVEGNFSLSIHLSLADGFSF
        +++NT     +  R  ++ FLS+ NP +V+LQETK    D+++V S+W  +++ W A+ A GASGGI +LW+   F+  E V G+FS+++ L+  +  SF
Subjt:  VNYNTSAANRQMGRALIKDFLSSHNPALVILQETKLSTIDQKIVKSVWSSRNIAWAAINAIGASGGITMLWNESTFDVLEVVEGNFSLSIHLSLADGFSF

Query:  WITGVYGPSSSHDRKFFWKELIDLQALCIPNWVLGGDFNI------------------------------TRWSWEKSTFSAPTR---------------
        W+T VYGP+ +  R+ FW EL DL  L  P W +GGDFN+                               RW+ + S     T                
Subjt:  WITGVYGPSSSHDRKFFWKELIDLQALCIPNWVLGGDFNI------------------------------TRWSWEKSTFSAPTR---------------

Query:  ------------------VQGWSGHGLIQKLKGLKKELKSWNQEIFGHQKEIKIQLGKELSIIDSKEENDLLTEQDCIRRSNIKAELLVLAANEEVMWRQ
                          V+GW GH  ++KLK +K +LK WN  +FG  +E K  +  +L  ID  E+   L  +    R   + EL  L   EEV WRQ
Subjt:  ------------------VQGWSGHGLIQKLKGLKKELKSWNQEIFGHQKEIKIQLGKELSIIDSKEENDLLTEQDCIRRSNIKAELLVLAANEEVMWRQ

Query:  KWKFKWFVEGDVNFAFFHRIVASNRRKSVITEILSADGHSLMTDVDIEKEFLLFISNYF
        K + KW  EGD N  FFHR+    R +  I  ++S  G +L     I +E + F  N +
Subjt:  KWKFKWFVEGDVNFAFFHRIVASNRRKSVITEILSADGHSLMTDVDIEKEFLLFISNYF

RVX05281.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] (e value 1.7e-44, identity 32.49%)
Query:  VNYNTSAANRQMGRALIKDFLSSHNPALVILQETKLSTIDQKIVKSVWSSRNIAWAAINAIGASGGITMLWNESTFDVLEVVEGNFSLSIHLSLADGFSF
        +++N      +  R ++KDFL S NP +V++QETK    D++ V SVW+ RN  W A+ A GAS GI ++W+       EVV  +FS+S+  SL      
Subjt:  VNYNTSAANRQMGRALIKDFLSSHNPALVILQETKLSTIDQKIVKSVWSSRNIAWAAINAIGASGGITMLWNESTFDVLEVVEGNFSLSIHLSLADGFSF

Query:  WITGVYGPSSSHDRKFFWKELIDLQALCIPNWVLGGDFNITRWSWEKSTFSAPT----------------------------------------------
        WI+ VYGP+S   RK FW EL D+  L  P W +GGDFN+ R S EK   S+ T                                              
Subjt:  WITGVYGPSSSHDRKFFWKELIDLQALCIPNWVLGGDFNITRWSWEKSTFSAPT----------------------------------------------

Query:  -----------RVQGWSGHGLIQKLKGLKKELKSWNQEIFGHQKEIKIQLGKELSIIDSKEENDLLTEQDCIRRSNIKAELLVLAANEEVMWRQKWKFKW
                   +  GW GH  +++L+ +K +LK WN+  FG  KE K  +  +L+I D+ E+   L      +R++ K EL VL   EE+ WRQK K KW
Subjt:  -----------RVQGWSGHGLIQKLKGLKKELKSWNQEIFGHQKEIKIQLGKELSIIDSKEENDLLTEQDCIRRSNIKAELLVLAANEEVMWRQKWKFKW

Query:  FVEGDVNFAFFHRIVASNRRKSVITEILSADGHSLMTDVDIEKEFLLFISNYFLRKI
          EGD N  F+H++    R +  I E+ +  G  L     I +E L +   ++   I
Subjt:  FVEGDVNFAFFHRIVASNRRKSVITEILSADGHSLMTDVDIEKEFLLFISNYFLRKI

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia] (e value 1.1e-54, identity 35.7%)
Query:  ALIKDFLSSHNPALVILQETKLSTIDQKIVKSVWSSRNIAWAAINAIGASGGITMLWNESTFDVLEVVEGNFSLSIHLSLADGFSFWITGVYGPSSSHDR
        ALIK F+S  NP +VILQETKLS +D  IVKS+WS+  I W+A++A G + GI +LWN+      E++EG FSL+I+  L+DGF FW++G+YGPS++   
Subjt:  ALIKDFLSSHNPALVILQETKLSTIDQKIVKSVWSSRNIAWAAINAIGASGGITMLWNESTFDVLEVVEGNFSLSIHLSLADGFSFWITGVYGPSSSHDR

Query:  KFFWKELIDLQALCIPNWVLGGDFNITRWSWEKS-------------------------------TFSAPTR----------------------------
          FW+EL+DL  LC  +W+L GDFN+TRWSWEKS                               T+S  T                             
Subjt:  KFFWKELIDLQALCIPNWVLGGDFNITRWSWEKS-------------------------------TFSAPTR----------------------------

Query:  ---------------------------------------------VQGWSGHGLIQKLKGLKKELKSWNQEIFGHQKEIKIQLGKELSIIDSKEENDLLT
                                                     + GW GHGL+ KLK LK  +K W  E F      K  L   ++ +D  E +  +T
Subjt:  ---------------------------------------------VQGWSGHGLIQKLKGLKKELKSWNQEIFGHQKEIKIQLGKELSIIDSKEENDLLT

Query:  EQDCIRRSNIKAELLVLAANEEVMWRQKWKFKWFVEGDVNFAFFHRIVASNRRKSVITEILSADGHSLMTDVDIEKEFLLF
              R   K +LL + A EE  WRQ+ K KW  EGD N  FFHR +A+ RR+S+ITEILS  G  L    DIE+EF+ F
Subjt:  EQDCIRRSNIKAELLVLAANEEVMWRQKWKFKWFVEGDVNFAFFHRIVASNRRKSVITEILSADGHSLMTDVDIEKEFLLF

TrEMBL top hits (e value, % identity, alignment)
A0A438GEZ6 Uncharacterized protein (e value 8.2e-48, identity 33.43%)
Query:  VNYNTSAANRQMGRALIKDFLSSHNPALVILQETKLSTIDQKIVKSVWSSRNIAWAAINAIGASGGITMLWNESTFDVLEVVEGNFSLSIHLSLADGFSF
        +++NT     +  R  ++ FLS+ NP +V+LQETK  T D++ V SVW  + + WAA+ A GASGGI +LW+ S F+  + V G+FS+++  +  +  SF
Subjt:  VNYNTSAANRQMGRALIKDFLSSHNPALVILQETKLSTIDQKIVKSVWSSRNIAWAAINAIGASGGITMLWNESTFDVLEVVEGNFSLSIHLSLADGFSF

Query:  WITGVYGPSSSHDRKFFWKELIDLQALCIPNWVLGGDFNI-----------------------TRWSWEKSTF----------SAPTRVQ----------
        W+T VY P +   RK FW EL DL  L  P W +GGDFN+                        RW+ + S            S P R Q          
Subjt:  WITGVYGPSSSHDRKFFWKELIDLQALCIPNWVLGGDFNI-----------------------TRWSWEKSTF----------SAPTRVQ----------

Query:  -------------GWSGHGLIQKLKGLKKELKSWNQEIFGHQKEIKIQLGKELSIIDSKEENDLLTEQDCIRRSNIKAELLVLAANEEVMWRQKWKFKWF
                     GW GH  ++KLK +K +LK WN   FG  KE K  +  +LS ID  E+   L     + R+  + EL  +   EEV WRQK + KW 
Subjt:  -------------GWSGHGLIQKLKGLKKELKSWNQEIFGHQKEIKIQLGKELSIIDSKEENDLLTEQDCIRRSNIKAELLVLAANEEVMWRQKWKFKWF

Query:  VEGDVNFAFFHRIVASNRRKSVITEILSADGHSLMTDVDIEKEFLLFISNYFLRKI
         EGD N  +FHR+    R +  I  ++S  G +L +   I +E + F  N + + +
Subjt:  VEGDVNFAFFHRIVASNRRKSVITEILSADGHSLMTDVDIEKEFLLFISNYFLRKI

A0A438HFR2 Transposon TX1 uncharacterized 149 kDa protein (e value 9.0e-47, identity 32.03%)
Query:  VNYNTSAANRQMGRALIKDFLSSHNPALVILQETKLSTIDQKIVKSVWSSRNIAWAAINAIGASGGITMLWNESTFDVLEVVEGNFSLSIHLSLADGFSF
        +++NT     +  R  ++ FLS+ NP +V+LQETK    D+++V S+W  +++ W A+ A GASGGI +LW+   F+  E V G+FS+++ L+  +  SF
Subjt:  VNYNTSAANRQMGRALIKDFLSSHNPALVILQETKLSTIDQKIVKSVWSSRNIAWAAINAIGASGGITMLWNESTFDVLEVVEGNFSLSIHLSLADGFSF

Query:  WITGVYGPSSSHDRKFFWKELIDLQALCIPNWVLGGDFNI------------------------------TRWSWEKSTFSAPTR---------------
        W+T VYGP+ +  R+ FW EL DL  L  P W +GGDFN+                               RW+ + S     T                
Subjt:  WITGVYGPSSSHDRKFFWKELIDLQALCIPNWVLGGDFNI------------------------------TRWSWEKSTFSAPTR---------------

Query:  ------------------VQGWSGHGLIQKLKGLKKELKSWNQEIFGHQKEIKIQLGKELSIIDSKEENDLLTEQDCIRRSNIKAELLVLAANEEVMWRQ
                          V+GW GH  ++KLK +K +LK WN  +FG  +E K  +  +L  ID  E+   L  +    R   + EL  L   EEV WRQ
Subjt:  ------------------VQGWSGHGLIQKLKGLKKELKSWNQEIFGHQKEIKIQLGKELSIIDSKEENDLLTEQDCIRRSNIKAELLVLAANEEVMWRQ

Query:  KWKFKWFVEGDVNFAFFHRIVASNRRKSVITEILSADGHSLMTDVDIEKEFLLFISNYF
        K + KW  EGD N  FFHR+    R +  I  ++S  G +L     I +E + F  N +
Subjt:  KWKFKWFVEGDVNFAFFHRIVASNRRKSVITEILSADGHSLMTDVDIEKEFLLFISNYF

A0A438J8L0 Transposon TX1 uncharacterized 149 kDa protein (e value 8.5e-45, identity 32.49%)
Query:  VNYNTSAANRQMGRALIKDFLSSHNPALVILQETKLSTIDQKIVKSVWSSRNIAWAAINAIGASGGITMLWNESTFDVLEVVEGNFSLSIHLSLADGFSF
        +++N      +  R ++KDFL S NP +V++QETK    D++ V SVW+ RN  W A+ A GAS GI ++W+       EVV  +FS+S+  SL      
Subjt:  VNYNTSAANRQMGRALIKDFLSSHNPALVILQETKLSTIDQKIVKSVWSSRNIAWAAINAIGASGGITMLWNESTFDVLEVVEGNFSLSIHLSLADGFSF

Query:  WITGVYGPSSSHDRKFFWKELIDLQALCIPNWVLGGDFNITRWSWEKSTFSAPT----------------------------------------------
        WI+ VYGP+S   RK FW EL D+  L  P W +GGDFN+ R S EK   S+ T                                              
Subjt:  WITGVYGPSSSHDRKFFWKELIDLQALCIPNWVLGGDFNITRWSWEKSTFSAPT----------------------------------------------

Query:  -----------RVQGWSGHGLIQKLKGLKKELKSWNQEIFGHQKEIKIQLGKELSIIDSKEENDLLTEQDCIRRSNIKAELLVLAANEEVMWRQKWKFKW
                   +  GW GH  +++L+ +K +LK WN+  FG  KE K  +  +L+I D+ E+   L      +R++ K EL VL   EE+ WRQK K KW
Subjt:  -----------RVQGWSGHGLIQKLKGLKKELKSWNQEIFGHQKEIKIQLGKELSIIDSKEENDLLTEQDCIRRSNIKAELLVLAANEEVMWRQKWKFKW

Query:  FVEGDVNFAFFHRIVASNRRKSVITEILSADGHSLMTDVDIEKEFLLFISNYFLRKI
          EGD N  F+H++    R +  I E+ +  G  L     I +E L +   ++   I
Subjt:  FVEGDVNFAFFHRIVASNRRKSVITEILSADGHSLMTDVDIEKEFLLFISNYFLRKI

A0A6J1E2G6 uncharacterized protein LOC111025405 (e value 5.3e-55, identity 35.7%)
Query:  ALIKDFLSSHNPALVILQETKLSTIDQKIVKSVWSSRNIAWAAINAIGASGGITMLWNESTFDVLEVVEGNFSLSIHLSLADGFSFWITGVYGPSSSHDR
        ALIK F+S  NP +VILQETKLS +D  IVKS+WS+  I W+A++A G + GI +LWN+      E++EG FSL+I+  L+DGF FW++G+YGPS++   
Subjt:  ALIKDFLSSHNPALVILQETKLSTIDQKIVKSVWSSRNIAWAAINAIGASGGITMLWNESTFDVLEVVEGNFSLSIHLSLADGFSFWITGVYGPSSSHDR

Query:  KFFWKELIDLQALCIPNWVLGGDFNITRWSWEKS-------------------------------TFSAPTR----------------------------
          FW+EL+DL  LC  +W+L GDFN+TRWSWEKS                               T+S  T                             
Subjt:  KFFWKELIDLQALCIPNWVLGGDFNITRWSWEKS-------------------------------TFSAPTR----------------------------

Query:  ---------------------------------------------VQGWSGHGLIQKLKGLKKELKSWNQEIFGHQKEIKIQLGKELSIIDSKEENDLLT
                                                     + GW GHGL+ KLK LK  +K W  E F      K  L   ++ +D  E +  +T
Subjt:  ---------------------------------------------VQGWSGHGLIQKLKGLKKELKSWNQEIFGHQKEIKIQLGKELSIIDSKEENDLLT

Query:  EQDCIRRSNIKAELLVLAANEEVMWRQKWKFKWFVEGDVNFAFFHRIVASNRRKSVITEILSADGHSLMTDVDIEKEFLLF
              R   K +LL + A EE  WRQ+ K KW  EGD N  FFHR +A+ RR+S+ITEILS  G  L    DIE+EF+ F
Subjt:  EQDCIRRSNIKAELLVLAANEEVMWRQKWKFKWFVEGDVNFAFFHRIVASNRRKSVITEILSADGHSLMTDVDIEKEFLLF

A5ALP8 Reverse transcriptase domain-containing protein (e value 8.5e-45, identity 32.58%)
Query:  VNYNTSAANRQMGRALIKDFLSSHNPALVILQETKLSTIDQKIVKSVWSSRNIAWAAINAIGASGGITMLWNESTFDVLEVVEGNFSLSIHLSLADGFSF
        +++NT     +  R  ++ FLS+ NP +V+LQETK    D++ V SVW  +++ W A+ A  ASGGI +LW+   F   E V G+FS+++ L+  +   F
Subjt:  VNYNTSAANRQMGRALIKDFLSSHNPALVILQETKLSTIDQKIVKSVWSSRNIAWAAINAIGASGGITMLWNESTFDVLEVVEGNFSLSIHLSLADGFSF

Query:  WITGVYGPSSSHDRKFFWKELIDLQALCIPNWVLGGDFNITRWSWEKSTFSAPT----------------------------------------------
        W+T VYGP+ +  RK FW EL DL  L  P W +GGDFN+ R   EK   S  T                                              
Subjt:  WITGVYGPSSSHDRKFFWKELIDLQALCIPNWVLGGDFNITRWSWEKSTFSAPT----------------------------------------------

Query:  --------------RVQGWSGHGLIQKLKGLKKELKSWNQEIFGHQKEIKIQLGKELSIIDSKEENDLLTEQDCIRRSNIKAELLVLAANEEVMWRQKWK
                       V+GW GH  ++KLK +K +LK WN  +FG  +E K  +  +L  ID  E+   L       R+  + EL  L   EEV WRQK +
Subjt:  --------------RVQGWSGHGLIQKLKGLKKELKSWNQEIFGHQKEIKIQLGKELSIIDSKEENDLLTEQDCIRRSNIKAELLVLAANEEVMWRQKWK

Query:  FKWFVEGDVNFAFFHRIVASNRRKSVITEILSADGHSLMTDVDIEKEFLLFISNYF
         KW  EGD N  FFHR+    R +  I  ++S  G +L     I +E + F  N +
Subjt:  FKWFVEGDVNFAFFHRIVASNRRKSVITEILSADGHSLMTDVDIEKEFLLFISNYF

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein (e value 5.0e-05, identity 27.61%)
Query:  QKLKGLKKELKSWNQEIFGHQKEIKIQLGKELSIIDSKEENDLLTEQDCIRRSN--IKAELLVLAANEEVMWRQKWKFKWFVEGDVNFAFFHRIVASNRR
        + LK  KK  K  N++ FG+   I+ +  + L  ++S +   L    D + R     + +    AA  E  +RQK + KW  +GD N  FFH+++ +N+ 
Subjt:  QKLKGLKKELKSWNQEIFGHQKEIKIQLGKELSIIDSKEENDLLTEQDCIRRSN--IKAELLVLAANEEVMWRQKWKFKWFVEGDVNFAFFHRIVASNRR

Query:  KSVITEILSADGHSLMTDVDIEKEFLLFISNYFL
        K++I + L  D    + +V   KE ++    + L
Subjt:  KSVITEILSADGHSLMTDVDIEKEFLLFISNYFL


Sequences
CDS sequence
ATGGACATGATGGAGGTAGGGATAAAGGTAAAGAATAATCACACAGGCTTCATCCCTGCCGAGGTGCATATTCCTTCATCTTCTAATAGCCCAATTAAGGTTGTTATCGA
CCCATTCTTTGTGGAAGATTACAATATTGGTTATATCACCGGCATCCATGGAAAAATTCCGGCAGCCTCGACGACAAAAGAAAACCCACGCGCCGGAATAAATAAAAAAG
AAGATGGTAAAAAGGTTATGTCCGGTCCACGCGCTGTCGTTGGAAAGGGGAATGGAAAAGGGGTTGATCCCCAAATCTCGTACAGTTTCGCTGCCCAAGAAATATCAGAT
TCGGAAGGTGATTTCTCTTCCCCGTGTTCTCCAGATGTGAATGACTCTCCTATCGTGCCCCATCCTAGAGAGGCACACAAAGTTGATTCTCCCCCAACCATTTCCAATCT
CTTTGGTACTCAAGAAGAACAAACCACCAAAATAGAATCTCCTATCCCTTTGAGGATCGAGGAACCAGTGGAGGAAGCTATACAACAGAACCTGGCCATGGAGAATTCTA
TGTTGATTGACATAAATGTTGAGGAATTCGAAGAGACAGATACGAATCTCGGCACTCCACACCAACCGAAAGATCCCACTTTATGTTTACCCATCATTTTCCCTTGGTTG
GCCGAACATGGCATGTGTATCATGCCAATACCAAACAGACAGAAGCTCACAAACACAGCAAAAAAGAAGAAGAATTGGGTGAAAGAGTTAGAGAATCTACACTCCACAGT
AAACTACAACACATCAGCAGCTAATCGCCAAATGGGAAGAGCCCTTATAAAAGATTTTCTCTCCTCTCACAATCCAGCATTGGTGATTCTACAAGAGACAAAGCTTTCCA
CTATCGACCAGAAGATTGTTAAATCTGTTTGGAGCTCTCGAAACATTGCTTGGGCAGCTATTAATGCCATTGGTGCTTCCGGAGGCATTACCATGTTGTGGAATGAATCC
ACCTTTGATGTTTTGGAGGTTGTCGAAGGTAATTTCTCCCTCTCTATACATCTCTCCCTTGCCGATGGATTCTCTTTTTGGATCACAGGAGTATATGGGCCTAGTTCTTC
TCATGATAGGAAGTTTTTTTGGAAAGAACTTATAGATTTGCAAGCTCTCTGCATCCCAAATTGGGTCTTAGGGGGAGATTTCAACATCACTAGGTGGTCATGGGAGAAAT
CGACTTTTTCAGCTCCCACTCGGGTGCAAGGGTGGTCGGGTCACGGACTCATCCAAAAATTAAAGGGGCTGAAAAAGGAATTAAAATCATGGAATCAGGAGATTTTCGGA
CACCAGAAGGAGATCAAAATACAGCTGGGCAAAGAACTCTCTATCATTGACAGCAAAGAAGAAAATGATCTTTTAACTGAGCAAGACTGCATTAGAAGATCGAATATCAA
GGCTGAGTTATTGGTTTTAGCAGCAAATGAAGAAGTCATGTGGAGGCAAAAGTGGAAATTCAAGTGGTTTGTCGAAGGAGACGTAAACTTTGCCTTCTTCCATCGTATAG
TTGCATCAAACAGAAGGAAAAGTGTCATTACCGAAATTTTATCTGCTGATGGCCATAGCCTTATGACCGATGTTGATATAGAAAAGGAATTCCTCCTTTTTATAAGCAAC
TATTTTCTAAGAAAGATTATAGGCAACCTCTTCCATCCATTGATGATTGGAATCCCATTTCGGCGGAACAGAATACAATGCTTGAATCTCCATTTACTGAGGCAGAAATC
TTTAGAGCAGTGTGTGATCTTGGCTCGAATAAGACTCCCGAACCTGACGGTTTCACAGCTGAATTCCTTAAAAAATCTTGGAACATCCTTAAAGCAGATATTATAG
mRNA sequence
ATGGACATGATGGAGGTAGGGATAAAGGTAAAGAATAATCACACAGGCTTCATCCCTGCCGAGGTGCATATTCCTTCATCTTCTAATAGCCCAATTAAGGTTGTTATCGA
CCCATTCTTTGTGGAAGATTACAATATTGGTTATATCACCGGCATCCATGGAAAAATTCCGGCAGCCTCGACGACAAAAGAAAACCCACGCGCCGGAATAAATAAAAAAG
AAGATGGTAAAAAGGTTATGTCCGGTCCACGCGCTGTCGTTGGAAAGGGGAATGGAAAAGGGGTTGATCCCCAAATCTCGTACAGTTTCGCTGCCCAAGAAATATCAGAT
TCGGAAGGTGATTTCTCTTCCCCGTGTTCTCCAGATGTGAATGACTCTCCTATCGTGCCCCATCCTAGAGAGGCACACAAAGTTGATTCTCCCCCAACCATTTCCAATCT
CTTTGGTACTCAAGAAGAACAAACCACCAAAATAGAATCTCCTATCCCTTTGAGGATCGAGGAACCAGTGGAGGAAGCTATACAACAGAACCTGGCCATGGAGAATTCTA
TGTTGATTGACATAAATGTTGAGGAATTCGAAGAGACAGATACGAATCTCGGCACTCCACACCAACCGAAAGATCCCACTTTATGTTTACCCATCATTTTCCCTTGGTTG
GCCGAACATGGCATGTGTATCATGCCAATACCAAACAGACAGAAGCTCACAAACACAGCAAAAAAGAAGAAGAATTGGGTGAAAGAGTTAGAGAATCTACACTCCACAGT
AAACTACAACACATCAGCAGCTAATCGCCAAATGGGAAGAGCCCTTATAAAAGATTTTCTCTCCTCTCACAATCCAGCATTGGTGATTCTACAAGAGACAAAGCTTTCCA
CTATCGACCAGAAGATTGTTAAATCTGTTTGGAGCTCTCGAAACATTGCTTGGGCAGCTATTAATGCCATTGGTGCTTCCGGAGGCATTACCATGTTGTGGAATGAATCC
ACCTTTGATGTTTTGGAGGTTGTCGAAGGTAATTTCTCCCTCTCTATACATCTCTCCCTTGCCGATGGATTCTCTTTTTGGATCACAGGAGTATATGGGCCTAGTTCTTC
TCATGATAGGAAGTTTTTTTGGAAAGAACTTATAGATTTGCAAGCTCTCTGCATCCCAAATTGGGTCTTAGGGGGAGATTTCAACATCACTAGGTGGTCATGGGAGAAAT
CGACTTTTTCAGCTCCCACTCGGGTGCAAGGGTGGTCGGGTCACGGACTCATCCAAAAATTAAAGGGGCTGAAAAAGGAATTAAAATCATGGAATCAGGAGATTTTCGGA
CACCAGAAGGAGATCAAAATACAGCTGGGCAAAGAACTCTCTATCATTGACAGCAAAGAAGAAAATGATCTTTTAACTGAGCAAGACTGCATTAGAAGATCGAATATCAA
GGCTGAGTTATTGGTTTTAGCAGCAAATGAAGAAGTCATGTGGAGGCAAAAGTGGAAATTCAAGTGGTTTGTCGAAGGAGACGTAAACTTTGCCTTCTTCCATCGTATAG
TTGCATCAAACAGAAGGAAAAGTGTCATTACCGAAATTTTATCTGCTGATGGCCATAGCCTTATGACCGATGTTGATATAGAAAAGGAATTCCTCCTTTTTATAAGCAAC
TATTTTCTAAGAAAGATTATAGGCAACCTCTTCCATCCATTGATGATTGGAATCCCATTTCGGCGGAACAGAATACAATGCTTGAATCTCCATTTACTGAGGCAGAAATC
TTTAGAGCAGTGTGTGATCTTGGCTCGAATAAGACTCCCGAACCTGACGGTTTCACAGCTGAATTCCTTAAAAAATCTTGGAACATCCTTAAAGCAGATATTATAG
Protein sequence
MDMMEVGIKVKNNHTGFIPAEVHIPSSSNSPIKVVIDPFFVEDYNIGYITGIHGKIPAASTTKENPRAGINKKEDGKKVMSGPRAVVGKGNGKGVDPQISYSFAAQEISD
SEGDFSSPCSPDVNDSPIVPHPREAHKVDSPPTISNLFGTQEEQTTKIESPIPLRIEEPVEEAIQQNLAMENSMLIDINVEEFEETDTNLGTPHQPKDPTLCLPIIFPWL
AEHGMCIMPIPNRQKLTNTAKKKKNWVKELENLHSTVNYNTSAANRQMGRALIKDFLSSHNPALVILQETKLSTIDQKIVKSVWSSRNIAWAAINAIGASGGITMLWNES
TFDVLEVVEGNFSLSIHLSLADGFSFWITGVYGPSSSHDRKFFWKELIDLQALCIPNWVLGGDFNITRWSWEKSTFSAPTRVQGWSGHGLIQKLKGLKKELKSWNQEIFG
HQKEIKIQLGKELSIIDSKEENDLLTEQDCIRRSNIKAELLVLAANEEVMWRQKWKFKWFVEGDVNFAFFHRIVASNRRKSVITEILSADGHSLMTDVDIEKEFLLFISN
YFLRKIIGNLFHPLMIGIPFRRNRIQCLNLHLLRQKSLEQCVILARIRLPNLTVSQLNSLKNLGTSLKQIL