; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026307 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026307
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr10:34329842..34332045
RNA-Seq ExpressionLag0026307
SyntenyLag0026307
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR005135 - Endonuclease/exonuclease/phosphatase
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN69126.1 hypothetical protein VITISV_008195 [Vitis vinifera]5.1e-5834.82Show/hide
Query:  KKRVIKNLLSSHNPALVIIQETKMVGIDRKIIKSLWSSRNIAWSSIDAVGASGGILILWNESFFDVKEIVEGLFSLSLQLSLADGFTFWITGVYGPNSSK
        K+RV+KN LSS  P +V+IQETK    DR+++ S+WS RN  W+++ A GASGGILI+W+      +E+V G FS+S++ ++    + W++ VYGPN+S 
Subjt:  KKRVIKNLLSSHNPALVIIQETKMVGIDRKIIKSLWSSRNIAWSSIDAVGASGGILILWNESFFDVKEIVEGLFSLSLQLSLADGFTFWITGVYGPNSSK

Query:  DRRLFWKELADLQALCPPNWILGGDFNVTRWTWEK--------------------SSFTAPTRATKNSTATSSH-----RLDRV----------------
         R+ FW EL+D+  L  P W +GGDFNV R + EK                        +P R+   + +         RLDR                 
Subjt:  DRRLFWKELADLQALCPPNWILGGDFNVTRWTWEK--------------------SSFTAPTRATKNSTATSSH-----RLDRV----------------

Query:  -----TSYHYPICLNLGKEKWGPAPFKLNNAWLSHHSFLKTVESWWKNTPSQGWSGHGFIQKLKGLKLELKQWNHTVYGQQKGERSRLQTEHADLDKREE
             TS H+PI L     KWGP PF+  N WL H SF +    WW      GW GH F++KL+ +K +LK+WN T +G+   ++  +    A+ D  E+
Subjt:  -----TSYHYPICLNLGKEKWGPAPFKLNNAWLSHHSFLKTVESWWKNTPSQGWSGHGFIQKLKGLKLELKQWNHTVYGQQKGERSRLQTEHADLDKREE

Query:  LGMLSELDASRRSKIKAHLILSSANEETLWRQKCKLKCLSERDVNTAFFHRIMAAHRRKSSITELVSDTRTSITCDENIEKE
         G LS+    +R+  K  L      EE  WRQK ++K + E D N+ FFH++    R +  I EL +++   +   E+I++E
Subjt:  LGMLSELDASRRSKIKAHLILSSANEETLWRQKCKLKCLSERDVNTAFFHRIMAAHRRKSSITELVSDTRTSITCDENIEKE

RVW83303.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]1.9e-6038.05Show/hide
Query:  KKRVIKNLLSSHNPALVIIQETKMVGIDRKIIKSLWSSRNIAWSSIDAVGASGGILILWNESFFDVKEIVEGLFSLSLQLSLADGFTFWITGVYGPNSSK
        K+R ++  LS+ NP +V++QETK    DR+++ S+W  +++ W ++ A GASGGI+ILW+   F+  E V G FS++++L+  +  +FW+T VYGPN + 
Subjt:  KKRVIKNLLSSHNPALVIIQETKMVGIDRKIIKSLWSSRNIAWSSIDAVGASGGILILWNESFFDVKEIVEGLFSLSLQLSLADGFTFWITGVYGPNSSK

Query:  DRRLFWKELADLQALCPPNWILGGDFNVTRWTWEK---SSFTAPTRATKNSTATSSHRLDRVTSYHYPICLNLGKEKWGPAPFKLNNAWLSHHSFLKTVE
         R  FW EL DL  L  P W +GGDFNV R   EK   S  T   R   +     S  L R TS H PICL      WGP PF+  N WL H  F +   
Subjt:  DRRLFWKELADLQALCPPNWILGGDFNVTRWTWEK---SSFTAPTRATKNSTATSSHRLDRVTSYHYPICLNLGKEKWGPAPFKLNNAWLSHHSFLKTVE

Query:  SWWKNTPSQGWSGHGFIQKLKGLKLELKQWNHTVYGQQKGERSRLQTEHADLDKREELGMLSELDASRRSKIKAHLILSSANEETLWRQKCKLKCLSERD
         WW+    +GW GH F++KLK +K +LK+WN  V+G  +  +  + T+   +D+ E+ G L+    S R   +  L      EE  WRQK ++K + E D
Subjt:  SWWKNTPSQGWSGHGFIQKLKGLKLELKQWNHTVYGQQKGERSRLQTEHADLDKREELGMLSELDASRRSKIKAHLILSSANEETLWRQKCKLKCLSERD

Query:  VNTAFFHRIMAAHRRKSSITELVSDTRTSITCDENIEKE
         N+ FFHR+    R +  I  L+S+   ++   E I +E
Subjt:  VNTAFFHRIMAAHRRKSSITELVSDTRTSITCDENIEKE

RVW91038.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]5.1e-5834.82Show/hide
Query:  KKRVIKNLLSSHNPALVIIQETKMVGIDRKIIKSLWSSRNIAWSSIDAVGASGGILILWNESFFDVKEIVEGLFSLSLQLSLADGFTFWITGVYGPNSSK
        K+RV+KN LSS  P +V+IQETK    DR+++ S+WS RN  W+++ A GASGGILI+W+      +E+V G FS+S++ ++    + W++ VYGPN+S 
Subjt:  KKRVIKNLLSSHNPALVIIQETKMVGIDRKIIKSLWSSRNIAWSSIDAVGASGGILILWNESFFDVKEIVEGLFSLSLQLSLADGFTFWITGVYGPNSSK

Query:  DRRLFWKELADLQALCPPNWILGGDFNVTRWTWEK--------------------SSFTAPTRATKNSTATSSH-----RLDRV----------------
         R+ FW EL+D+  L  P W +GGDFNV R + EK                        +P R+   + +         RLDR                 
Subjt:  DRRLFWKELADLQALCPPNWILGGDFNVTRWTWEK--------------------SSFTAPTRATKNSTATSSH-----RLDRV----------------

Query:  -----TSYHYPICLNLGKEKWGPAPFKLNNAWLSHHSFLKTVESWWKNTPSQGWSGHGFIQKLKGLKLELKQWNHTVYGQQKGERSRLQTEHADLDKREE
             TS H+PI L     KWGP PF+  N WL H SF +    WW      GW GH F++KL+ +K +LK+WN T +G+   ++  +    A+ D  E+
Subjt:  -----TSYHYPICLNLGKEKWGPAPFKLNNAWLSHHSFLKTVESWWKNTPSQGWSGHGFIQKLKGLKLELKQWNHTVYGQQKGERSRLQTEHADLDKREE

Query:  LGMLSELDASRRSKIKAHLILSSANEETLWRQKCKLKCLSERDVNTAFFHRIMAAHRRKSSITELVSDTRTSITCDENIEKE
         G LS+    +R+  K  L      EE  WRQK ++K + E D N+ FFH++    R +  I EL +++   +   E+I++E
Subjt:  LGMLSELDASRRSKIKAHLILSSANEETLWRQKCKLKCLSERDVNTAFFHRIMAAHRRKSSITELVSDTRTSITCDENIEKE

RVW99790.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]3.0e-5835.08Show/hide
Query:  KKRVIKNLLSSHNPALVIIQETKMVGIDRKIIKSLWSSRNIAWSSIDAVGASGGILILWNESFFDVKEIVEGLFSLSLQLSLADGFTFWITGVYGPNSSK
        K+RV+KN LSS  P +V+IQETK    DR+++ S+WS RN  W+++ A GASGGILI+W+      +E+V G FS+S++ ++    + W++ VYGPN+S 
Subjt:  KKRVIKNLLSSHNPALVIIQETKMVGIDRKIIKSLWSSRNIAWSSIDAVGASGGILILWNESFFDVKEIVEGLFSLSLQLSLADGFTFWITGVYGPNSSK

Query:  DRRLFWKELADLQALCPPNWILGGDFNVTRWTWEK--------------------SSFTAPTRATKNSTATSSH-----RLDRV----------------
         R+ FW EL+D+  L  P W +GGDFNV R + EK                        +P R+   + +         RLDR                 
Subjt:  DRRLFWKELADLQALCPPNWILGGDFNVTRWTWEK--------------------SSFTAPTRATKNSTATSSH-----RLDRV----------------

Query:  -----TSYHYPICLNLGKEKWGPAPFKLNNAWLSHHSFLKTVESWWKNTPSQGWSGHGFIQKLKGLKLELKQWNHTVYGQQKGERSRLQTEHADLDKREE
             TS H+PI L     KWGP PFK  N WL H SF +    WW      GW GH F++KL+ +K +LK+WN T +G+   ++  +    A+ D  E+
Subjt:  -----TSYHYPICLNLGKEKWGPAPFKLNNAWLSHHSFLKTVESWWKNTPSQGWSGHGFIQKLKGLKLELKQWNHTVYGQQKGERSRLQTEHADLDKREE

Query:  LGMLSELDASRRSKIKAHLILSSANEETLWRQKCKLKCLSERDVNTAFFHRIMAAHRRKSSITELVSDTRTSITCDENIEKE
         G LS     +R+  K  L      EE  WRQK ++K + E D N+ FFH++    R +  I EL +++   +   E+I++E
Subjt:  LGMLSELDASRRSKIKAHLILSSANEETLWRQKCKLKCLSERDVNTAFFHRIMAAHRRKSSITELVSDTRTSITCDENIEKE

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]6.2e-7239.27Show/hide
Query:  WKK-RVIKNLLSSHNPALVIIQETKMVGIDRKIIKSLWSSRNIAWSSIDAVGASGGILILWNESFFDVKEIVEGLFSLSLQLSLADGFTFWITGVYGPNS
        WKK  +IK  +S  NP +VI+QETK+  +D  I+KSLWS+  I WS++DA G + GILILWN+      E++EG+FSL++   L+DGF FW++G+YGP++
Subjt:  WKK-RVIKNLLSSHNPALVIIQETKMVGIDRKIIKSLWSSRNIAWSSIDAVGASGGILILWNESFFDVKEIVEGLFSLSLQLSLADGFTFWITGVYGPNS

Query:  SKDRRLFWKELADLQALCPPNWILGGDFNVTRWTWEKSSFTAPTRA-------------------------TKNSTAT------------------SSHR
        ++   LFW+EL DL  LC  +WIL GDFNVTRW+WEKS+    T++                         ++N++ +                   + R
Subjt:  SKDRRLFWKELADLQALCPPNWILGGDFNVTRWTWEKSSFTAPTRA-------------------------TKNSTAT------------------SSHR

Query:  LDRVTSYHYPICLNLGKEKWGPAPFKLNNAWLSHHSFLKTVESWWKNTPSQGWSGHGFIQKLKGLKLELKQWNHTVYGQQKGERSRLQTEHADLDKREEL
        + R TS H+PI L+ G+  WG  PF+  N WLSH +F   +E+WW N P  GW GHG + KLK LK  +K W    +     ++  L      LD  E  
Subjt:  LDRVTSYHYPICLNLGKEKWGPAPFKLNNAWLSHHSFLKTVESWWKNTPSQGWSGHGFIQKLKGLKLELKQWNHTVYGQQKGERSRLQTEHADLDKREEL

Query:  GMLSELDASRRSKIKAHLILSSANEETLWRQKCKLKCLSERDVNTAFFHRIMAAHRRKSSITELVSDTRTSITCDENIEKEF
          ++   +  R + K  L+   A EE  WRQ+CK K L E D NT FFHR +A  RR+S ITE++S     +T  ++IE+EF
Subjt:  GMLSELDASRRSKIKAHLILSSANEETLWRQKCKLKCLSERDVNTAFFHRIMAAHRRKSSITELVSDTRTSITCDENIEKEF

TrEMBL top hitse value%identityAlignment
A0A438HFR2 Transposon TX1 uncharacterized 149 kDa protein9.1e-6138.05Show/hide
Query:  KKRVIKNLLSSHNPALVIIQETKMVGIDRKIIKSLWSSRNIAWSSIDAVGASGGILILWNESFFDVKEIVEGLFSLSLQLSLADGFTFWITGVYGPNSSK
        K+R ++  LS+ NP +V++QETK    DR+++ S+W  +++ W ++ A GASGGI+ILW+   F+  E V G FS++++L+  +  +FW+T VYGPN + 
Subjt:  KKRVIKNLLSSHNPALVIIQETKMVGIDRKIIKSLWSSRNIAWSSIDAVGASGGILILWNESFFDVKEIVEGLFSLSLQLSLADGFTFWITGVYGPNSSK

Query:  DRRLFWKELADLQALCPPNWILGGDFNVTRWTWEK---SSFTAPTRATKNSTATSSHRLDRVTSYHYPICLNLGKEKWGPAPFKLNNAWLSHHSFLKTVE
         R  FW EL DL  L  P W +GGDFNV R   EK   S  T   R   +     S  L R TS H PICL      WGP PF+  N WL H  F +   
Subjt:  DRRLFWKELADLQALCPPNWILGGDFNVTRWTWEK---SSFTAPTRATKNSTATSSHRLDRVTSYHYPICLNLGKEKWGPAPFKLNNAWLSHHSFLKTVE

Query:  SWWKNTPSQGWSGHGFIQKLKGLKLELKQWNHTVYGQQKGERSRLQTEHADLDKREELGMLSELDASRRSKIKAHLILSSANEETLWRQKCKLKCLSERD
         WW+    +GW GH F++KLK +K +LK+WN  V+G  +  +  + T+   +D+ E+ G L+    S R   +  L      EE  WRQK ++K + E D
Subjt:  SWWKNTPSQGWSGHGFIQKLKGLKLELKQWNHTVYGQQKGERSRLQTEHADLDKREELGMLSELDASRRSKIKAHLILSSANEETLWRQKCKLKCLSERD

Query:  VNTAFFHRIMAAHRRKSSITELVSDTRTSITCDENIEKE
         N+ FFHR+    R +  I  L+S+   ++   E I +E
Subjt:  VNTAFFHRIMAAHRRKSSITELVSDTRTSITCDENIEKE

A0A438I2T6 Transposon TX1 uncharacterized 149 kDa protein2.5e-5834.82Show/hide
Query:  KKRVIKNLLSSHNPALVIIQETKMVGIDRKIIKSLWSSRNIAWSSIDAVGASGGILILWNESFFDVKEIVEGLFSLSLQLSLADGFTFWITGVYGPNSSK
        K+RV+KN LSS  P +V+IQETK    DR+++ S+WS RN  W+++ A GASGGILI+W+      +E+V G FS+S++ ++    + W++ VYGPN+S 
Subjt:  KKRVIKNLLSSHNPALVIIQETKMVGIDRKIIKSLWSSRNIAWSSIDAVGASGGILILWNESFFDVKEIVEGLFSLSLQLSLADGFTFWITGVYGPNSSK

Query:  DRRLFWKELADLQALCPPNWILGGDFNVTRWTWEK--------------------SSFTAPTRATKNSTATSSH-----RLDRV----------------
         R+ FW EL+D+  L  P W +GGDFNV R + EK                        +P R+   + +         RLDR                 
Subjt:  DRRLFWKELADLQALCPPNWILGGDFNVTRWTWEK--------------------SSFTAPTRATKNSTATSSH-----RLDRV----------------

Query:  -----TSYHYPICLNLGKEKWGPAPFKLNNAWLSHHSFLKTVESWWKNTPSQGWSGHGFIQKLKGLKLELKQWNHTVYGQQKGERSRLQTEHADLDKREE
             TS H+PI L     KWGP PF+  N WL H SF +    WW      GW GH F++KL+ +K +LK+WN T +G+   ++  +    A+ D  E+
Subjt:  -----TSYHYPICLNLGKEKWGPAPFKLNNAWLSHHSFLKTVESWWKNTPSQGWSGHGFIQKLKGLKLELKQWNHTVYGQQKGERSRLQTEHADLDKREE

Query:  LGMLSELDASRRSKIKAHLILSSANEETLWRQKCKLKCLSERDVNTAFFHRIMAAHRRKSSITELVSDTRTSITCDENIEKE
         G LS+    +R+  K  L      EE  WRQK ++K + E D N+ FFH++    R +  I EL +++   +   E+I++E
Subjt:  LGMLSELDASRRSKIKAHLILSSANEETLWRQKCKLKCLSERDVNTAFFHRIMAAHRRKSSITELVSDTRTSITCDENIEKE

A0A438ISU8 Transposon TX1 uncharacterized 149 kDa protein1.4e-5835.08Show/hide
Query:  KKRVIKNLLSSHNPALVIIQETKMVGIDRKIIKSLWSSRNIAWSSIDAVGASGGILILWNESFFDVKEIVEGLFSLSLQLSLADGFTFWITGVYGPNSSK
        K+RV+KN LSS  P +V+IQETK    DR+++ S+WS RN  W+++ A GASGGILI+W+      +E+V G FS+S++ ++    + W++ VYGPN+S 
Subjt:  KKRVIKNLLSSHNPALVIIQETKMVGIDRKIIKSLWSSRNIAWSSIDAVGASGGILILWNESFFDVKEIVEGLFSLSLQLSLADGFTFWITGVYGPNSSK

Query:  DRRLFWKELADLQALCPPNWILGGDFNVTRWTWEK--------------------SSFTAPTRATKNSTATSSH-----RLDRV----------------
         R+ FW EL+D+  L  P W +GGDFNV R + EK                        +P R+   + +         RLDR                 
Subjt:  DRRLFWKELADLQALCPPNWILGGDFNVTRWTWEK--------------------SSFTAPTRATKNSTATSSH-----RLDRV----------------

Query:  -----TSYHYPICLNLGKEKWGPAPFKLNNAWLSHHSFLKTVESWWKNTPSQGWSGHGFIQKLKGLKLELKQWNHTVYGQQKGERSRLQTEHADLDKREE
             TS H+PI L     KWGP PFK  N WL H SF +    WW      GW GH F++KL+ +K +LK+WN T +G+   ++  +    A+ D  E+
Subjt:  -----TSYHYPICLNLGKEKWGPAPFKLNNAWLSHHSFLKTVESWWKNTPSQGWSGHGFIQKLKGLKLELKQWNHTVYGQQKGERSRLQTEHADLDKREE

Query:  LGMLSELDASRRSKIKAHLILSSANEETLWRQKCKLKCLSERDVNTAFFHRIMAAHRRKSSITELVSDTRTSITCDENIEKE
         G LS     +R+  K  L      EE  WRQK ++K + E D N+ FFH++    R +  I EL +++   +   E+I++E
Subjt:  LGMLSELDASRRSKIKAHLILSSANEETLWRQKCKLKCLSERDVNTAFFHRIMAAHRRKSSITELVSDTRTSITCDENIEKE

A0A6J1E2G6 uncharacterized protein LOC1110254053.0e-7239.27Show/hide
Query:  WKK-RVIKNLLSSHNPALVIIQETKMVGIDRKIIKSLWSSRNIAWSSIDAVGASGGILILWNESFFDVKEIVEGLFSLSLQLSLADGFTFWITGVYGPNS
        WKK  +IK  +S  NP +VI+QETK+  +D  I+KSLWS+  I WS++DA G + GILILWN+      E++EG+FSL++   L+DGF FW++G+YGP++
Subjt:  WKK-RVIKNLLSSHNPALVIIQETKMVGIDRKIIKSLWSSRNIAWSSIDAVGASGGILILWNESFFDVKEIVEGLFSLSLQLSLADGFTFWITGVYGPNS

Query:  SKDRRLFWKELADLQALCPPNWILGGDFNVTRWTWEKSSFTAPTRA-------------------------TKNSTAT------------------SSHR
        ++   LFW+EL DL  LC  +WIL GDFNVTRW+WEKS+    T++                         ++N++ +                   + R
Subjt:  SKDRRLFWKELADLQALCPPNWILGGDFNVTRWTWEKSSFTAPTRA-------------------------TKNSTAT------------------SSHR

Query:  LDRVTSYHYPICLNLGKEKWGPAPFKLNNAWLSHHSFLKTVESWWKNTPSQGWSGHGFIQKLKGLKLELKQWNHTVYGQQKGERSRLQTEHADLDKREEL
        + R TS H+PI L+ G+  WG  PF+  N WLSH +F   +E+WW N P  GW GHG + KLK LK  +K W    +     ++  L      LD  E  
Subjt:  LDRVTSYHYPICLNLGKEKWGPAPFKLNNAWLSHHSFLKTVESWWKNTPSQGWSGHGFIQKLKGLKLELKQWNHTVYGQQKGERSRLQTEHADLDKREEL

Query:  GMLSELDASRRSKIKAHLILSSANEETLWRQKCKLKCLSERDVNTAFFHRIMAAHRRKSSITELVSDTRTSITCDENIEKEF
          ++   +  R + K  L+   A EE  WRQ+CK K L E D NT FFHR +A  RR+S ITE++S     +T  ++IE+EF
Subjt:  GMLSELDASRRSKIKAHLILSSANEETLWRQKCKLKCLSERDVNTAFFHRIMAAHRRKSSITELVSDTRTSITCDENIEKEF

A5AK27 Reverse transcriptase domain-containing protein2.5e-5834.82Show/hide
Query:  KKRVIKNLLSSHNPALVIIQETKMVGIDRKIIKSLWSSRNIAWSSIDAVGASGGILILWNESFFDVKEIVEGLFSLSLQLSLADGFTFWITGVYGPNSSK
        K+RV+KN LSS  P +V+IQETK    DR+++ S+WS RN  W+++ A GASGGILI+W+      +E+V G FS+S++ ++    + W++ VYGPN+S 
Subjt:  KKRVIKNLLSSHNPALVIIQETKMVGIDRKIIKSLWSSRNIAWSSIDAVGASGGILILWNESFFDVKEIVEGLFSLSLQLSLADGFTFWITGVYGPNSSK

Query:  DRRLFWKELADLQALCPPNWILGGDFNVTRWTWEK--------------------SSFTAPTRATKNSTATSSH-----RLDRV----------------
         R+ FW EL+D+  L  P W +GGDFNV R + EK                        +P R+   + +         RLDR                 
Subjt:  DRRLFWKELADLQALCPPNWILGGDFNVTRWTWEK--------------------SSFTAPTRATKNSTATSSH-----RLDRV----------------

Query:  -----TSYHYPICLNLGKEKWGPAPFKLNNAWLSHHSFLKTVESWWKNTPSQGWSGHGFIQKLKGLKLELKQWNHTVYGQQKGERSRLQTEHADLDKREE
             TS H+PI L     KWGP PF+  N WL H SF +    WW      GW GH F++KL+ +K +LK+WN T +G+   ++  +    A+ D  E+
Subjt:  -----TSYHYPICLNLGKEKWGPAPFKLNNAWLSHHSFLKTVESWWKNTPSQGWSGHGFIQKLKGLKLELKQWNHTVYGQQKGERSRLQTEHADLDKREE

Query:  LGMLSELDASRRSKIKAHLILSSANEETLWRQKCKLKCLSERDVNTAFFHRIMAAHRRKSSITELVSDTRTSITCDENIEKE
         G LS+    +R+  K  L      EE  WRQK ++K + E D N+ FFH++    R +  I EL +++   +   E+I++E
Subjt:  LGMLSELDASRRSKIKAHLILSSANEETLWRQKCKLKCLSERDVNTAFFHRIMAAHRRKSSITELVSDTRTSITCDENIEKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGATTACAGCATTGGATACATAGTCGGAATACACGACAAAATTCCATCCAAAAAGGTTGACTGCGATGTTGTTGTGGAAGAAGAAGTTGGCCCACGCGCCGCCCC
TGCGAAGATGGAAACAAATGTGGAAAAATCTCATTGTTCGTTCAAATGGACTGACCAACATGCCCCAACTGAAGAATTGTTCTCTGACACTGATTTGACCCCAGCCGTAT
ATACAAAGAGGGCCCCACCGAAAACCTCATCTCCCAAGAAACCTCAATCCCAAAAATCCCACCAAATCTACCAGGGTGGGACCCATCCTCCAAAAGCCCCCATTGAACCT
TTACCACCTGTCCCTATCCATAAAAGCCCAAGCCCACCAGAGCTTCCTAATAAGCCAAACACCTCTACCCTAGATAATCCCGTCGTGTTCATTGACTATCATCCCAGCAT
GAAAAGGCCAATCACCATTAATGACAAAGAAACCTTTCTTCTTACGGGTACAAAATTCTCCATGCATACTGAGCTTCCCCTGTCAGAATCTGAAGGTGGGGCATCCTCGC
CCTGCTCAACAGCCATGGAAGAATCACCTCTGACTAACCGCAGAATTATCCAGGCCGACTCTCCTCCATCGATAGGCAAGCTTTTTGAAGATATCCACGACCATCAGCAA
CAAACTAAGAAGCCCCTCCCTCTTTGTATTGAGGAACCAGATAACAACGATTTTTATCACCCCAATGACAAGGAAGACAAAGCCCTCATTGATATTAATGTGGAGGAGGA
AGAGACAGATGACTTGTTATCCGACAACGATACTACCGATCCAGTGATTTATCTCCCTATCTTATTCCCTTGGCTTGCCGAACATGGCATGTGTATCATGCCAATGCCAA
GTAGGCATAAGCTATCCAAGGCCACCAAGAAGAAAATTCAATGGGCCAAGGAATTACAGAATCTCCACACCAATGTGAATTATGATAAATCTCCCGCAGAGGCTTGGGCT
CTTTGGAAAAAGAGAGTTATTAAAAATCTTCTATCCTCCCACAACCCTGCTTTGGTGATCATCCAAGAAACTAAGATGGTTGGTATTGACAGAAAAATTATCAAATCTCT
CTGGAGTTCGAGAAATATCGCTTGGTCCTCTATTGATGCTGTTGGAGCCTCCGGGGGCATTCTAATCCTTTGGAACGAATCTTTCTTTGATGTTAAAGAGATCGTTGAAG
GTTTGTTCTCTCTATCCCTTCAACTCTCTTTAGCTGATGGCTTCACCTTCTGGATTACAGGAGTTTATGGTCCAAATTCCTCAAAGGATAGGCGTTTATTCTGGAAAGAG
TTGGCGGATCTCCAAGCCTTATGTCCCCCTAATTGGATTTTGGGTGGCGATTTCAATGTGACTCGATGGACATGGGAGAAATCTTCATTCACGGCCCCAACCCGAGCTAC
GAAAAATTCAACAGCAACCTCTTCCCATAGACTGGACAGAGTTACATCTTATCACTACCCTATTTGTCTCAATTTAGGGAAAGAAAAATGGGGACCGGCCCCTTTCAAGC
TCAACAATGCTTGGCTTTCCCATCATTCTTTCCTTAAAACGGTCGAATCTTGGTGGAAGAACACTCCTTCTCAAGGATGGTCGGGACATGGATTCATTCAGAAACTTAAA
GGCCTCAAGTTGGAGCTCAAACAGTGGAACCACACTGTATATGGTCAGCAAAAGGGGGAACGGTCTCGGTTACAAACAGAACATGCTGATCTCGACAAGAGGGAGGAACT
TGGTATGCTATCCGAACTAGATGCCAGCCGTAGATCTAAGATTAAGGCCCATCTTATTCTGTCATCAGCCAACGAGGAGACTTTGTGGAGACAAAAATGCAAATTGAAAT
GCCTCAGCGAAAGGGATGTTAACACAGCCTTCTTCCACAGAATTATGGCAGCCCATAGACGAAAAAGTTCCATCACGGAATTGGTTTCTGATACAAGAACCAGCATCACT
TGTGATGAAAATATTGAGAAGGAATTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGATTACAGCATTGGATACATAGTCGGAATACACGACAAAATTCCATCCAAAAAGGTTGACTGCGATGTTGTTGTGGAAGAAGAAGTTGGCCCACGCGCCGCCCC
TGCGAAGATGGAAACAAATGTGGAAAAATCTCATTGTTCGTTCAAATGGACTGACCAACATGCCCCAACTGAAGAATTGTTCTCTGACACTGATTTGACCCCAGCCGTAT
ATACAAAGAGGGCCCCACCGAAAACCTCATCTCCCAAGAAACCTCAATCCCAAAAATCCCACCAAATCTACCAGGGTGGGACCCATCCTCCAAAAGCCCCCATTGAACCT
TTACCACCTGTCCCTATCCATAAAAGCCCAAGCCCACCAGAGCTTCCTAATAAGCCAAACACCTCTACCCTAGATAATCCCGTCGTGTTCATTGACTATCATCCCAGCAT
GAAAAGGCCAATCACCATTAATGACAAAGAAACCTTTCTTCTTACGGGTACAAAATTCTCCATGCATACTGAGCTTCCCCTGTCAGAATCTGAAGGTGGGGCATCCTCGC
CCTGCTCAACAGCCATGGAAGAATCACCTCTGACTAACCGCAGAATTATCCAGGCCGACTCTCCTCCATCGATAGGCAAGCTTTTTGAAGATATCCACGACCATCAGCAA
CAAACTAAGAAGCCCCTCCCTCTTTGTATTGAGGAACCAGATAACAACGATTTTTATCACCCCAATGACAAGGAAGACAAAGCCCTCATTGATATTAATGTGGAGGAGGA
AGAGACAGATGACTTGTTATCCGACAACGATACTACCGATCCAGTGATTTATCTCCCTATCTTATTCCCTTGGCTTGCCGAACATGGCATGTGTATCATGCCAATGCCAA
GTAGGCATAAGCTATCCAAGGCCACCAAGAAGAAAATTCAATGGGCCAAGGAATTACAGAATCTCCACACCAATGTGAATTATGATAAATCTCCCGCAGAGGCTTGGGCT
CTTTGGAAAAAGAGAGTTATTAAAAATCTTCTATCCTCCCACAACCCTGCTTTGGTGATCATCCAAGAAACTAAGATGGTTGGTATTGACAGAAAAATTATCAAATCTCT
CTGGAGTTCGAGAAATATCGCTTGGTCCTCTATTGATGCTGTTGGAGCCTCCGGGGGCATTCTAATCCTTTGGAACGAATCTTTCTTTGATGTTAAAGAGATCGTTGAAG
GTTTGTTCTCTCTATCCCTTCAACTCTCTTTAGCTGATGGCTTCACCTTCTGGATTACAGGAGTTTATGGTCCAAATTCCTCAAAGGATAGGCGTTTATTCTGGAAAGAG
TTGGCGGATCTCCAAGCCTTATGTCCCCCTAATTGGATTTTGGGTGGCGATTTCAATGTGACTCGATGGACATGGGAGAAATCTTCATTCACGGCCCCAACCCGAGCTAC
GAAAAATTCAACAGCAACCTCTTCCCATAGACTGGACAGAGTTACATCTTATCACTACCCTATTTGTCTCAATTTAGGGAAAGAAAAATGGGGACCGGCCCCTTTCAAGC
TCAACAATGCTTGGCTTTCCCATCATTCTTTCCTTAAAACGGTCGAATCTTGGTGGAAGAACACTCCTTCTCAAGGATGGTCGGGACATGGATTCATTCAGAAACTTAAA
GGCCTCAAGTTGGAGCTCAAACAGTGGAACCACACTGTATATGGTCAGCAAAAGGGGGAACGGTCTCGGTTACAAACAGAACATGCTGATCTCGACAAGAGGGAGGAACT
TGGTATGCTATCCGAACTAGATGCCAGCCGTAGATCTAAGATTAAGGCCCATCTTATTCTGTCATCAGCCAACGAGGAGACTTTGTGGAGACAAAAATGCAAATTGAAAT
GCCTCAGCGAAAGGGATGTTAACACAGCCTTCTTCCACAGAATTATGGCAGCCCATAGACGAAAAAGTTCCATCACGGAATTGGTTTCTGATACAAGAACCAGCATCACT
TGTGATGAAAATATTGAGAAGGAATTTTGA
Protein sequenceShow/hide protein sequence
MEDYSIGYIVGIHDKIPSKKVDCDVVVEEEVGPRAAPAKMETNVEKSHCSFKWTDQHAPTEELFSDTDLTPAVYTKRAPPKTSSPKKPQSQKSHQIYQGGTHPPKAPIEP
LPPVPIHKSPSPPELPNKPNTSTLDNPVVFIDYHPSMKRPITINDKETFLLTGTKFSMHTELPLSESEGGASSPCSTAMEESPLTNRRIIQADSPPSIGKLFEDIHDHQQ
QTKKPLPLCIEEPDNNDFYHPNDKEDKALIDINVEEEETDDLLSDNDTTDPVIYLPILFPWLAEHGMCIMPMPSRHKLSKATKKKIQWAKELQNLHTNVNYDKSPAEAWA
LWKKRVIKNLLSSHNPALVIIQETKMVGIDRKIIKSLWSSRNIAWSSIDAVGASGGILILWNESFFDVKEIVEGLFSLSLQLSLADGFTFWITGVYGPNSSKDRRLFWKE
LADLQALCPPNWILGGDFNVTRWTWEKSSFTAPTRATKNSTATSSHRLDRVTSYHYPICLNLGKEKWGPAPFKLNNAWLSHHSFLKTVESWWKNTPSQGWSGHGFIQKLK
GLKLELKQWNHTVYGQQKGERSRLQTEHADLDKREELGMLSELDASRRSKIKAHLILSSANEETLWRQKCKLKCLSERDVNTAFFHRIMAAHRRKSSITELVSDTRTSIT
CDENIEKEF