; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg025738 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg025738
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold13:32315536..32319765
RNA-Seq ExpressionSpg025738
SyntenySpg025738
Gene Ontology termsGO:0050789 - regulation of biological process (biological process)
GO:0090304 - nucleic acid metabolic process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005135 - Endonuclease/exonuclease/phosphatase
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]7.5e-5639.35Show/hide
Query:  MWNESTFTVLEVIEGLFSLTIHLSLADDYSFWITGIYGPNISRDRKLFWRELADLEALCLPKWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAA
        MWN+  F++L V +G FS++I +   +   +W++ IYGP   ++R LFW EL  L+++CLP WI+GGDFN+ RW  E +        + R+FN FI++  
Subjt:  MWNESTFTVLEVIEGLFSLTIHLSLADDYSFWITGIYGPNISRDRKLFWRELADLEALCLPKWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAA

Query:  LQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPG
        L D PLSN KYTWS+ R   ++S +DRFL T +    F     + L R TSDHFPI L      WGP PFRF NA+L    +   +E WW      G+ G
Subjt:  LQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPG

Query:  HSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRVQIKAELISIVANEEVLWHQR
        +SF+++LK L   +K W +   G+   +K     E+  IDK+E  GS T+    +R  +KA+L  I   E  +W Q+
Subjt:  HSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRVQIKAELISIVANEEVLWHQR

RVW54885.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]5.2e-5739.5Show/hide
Query:  GIVIMWNESTFTVLEVIEGLFSLTIHLSLADDYSFWITGIYGPNISRDRKLFWRELADLEALCLPKWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFI
        GIVI+W+   F   E + G FS+T+ L+  ++ SFW+T +YGPN +  R+ FW EL DL  L  P+W +GGDFN+ R  +E+   S   T   R+F+ FI
Subjt:  GIVIMWNESTFTVLEVIEGLFSLTIHLSLADDYSFWITGIYGPNISRDRKLFWRELADLEALCLPKWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFI

Query:  ASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLS
          + L D PL N  +TWS+ + +P    +DRFL + E  + F   +   L R TSDH PICL      WGP PFRF N WL H  F      WW+   + 
Subjt:  ASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLS

Query:  GWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRVQIKAELISIVANEEVLWHQR
        GW  H F++KLK +K +LK+WN +VFG     K  +  +L  ID++E+ G++    ++ R+  + EL  ++  EEV W Q+
Subjt:  GWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRVQIKAELISIVANEEVLWHQR

RVW67743.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]1.5e-5639.86Show/hide
Query:  GIVIMWNESTFTVLEVIEGLFSLTIHLSLADDYSFWITGIYGPNISRDRKLFWRELADLEALCLPKWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFI
        GIVI+W+   F   E + G FS+T+ L+  ++  FW+T +YGPN +  RK FW EL DL  L  P+W +GGDFN+ R   E+   S   T   R F+ FI
Subjt:  GIVIMWNESTFTVLEVIEGLFSLTIHLSLADDYSFWITGIYGPNISRDRKLFWRELADLEALCLPKWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFI

Query:  ASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLS
          + L D PL N  +TWS+ + +P    +DRFL + E  + F   +   L R TSDH PICL      WGP PFRF N WL H  F      WW+   + 
Subjt:  ASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLS

Query:  GWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRVQIKAELISIVANEEVLWHQR
        GW GH F++KLK +K +LK+WN +VFG     K  +  +L  ID++E+ G++    ++ R   + EL  ++  EEV W Q+
Subjt:  GWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRVQIKAELISIVANEEVLWHQR

RVX12042.1 Splicing factor 3A subunit 2 [Vitis vinifera]1.8e-5739.86Show/hide
Query:  GIVIMWNESTFTVLEVIEGLFSLTIHLSLADDYSFWITGIYGPNISRDRKLFWRELADLEALCLPKWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFI
        GIVI+W+   F   E + G FS+T+ L+  ++ SFW+T +YGPN +  R+ FW EL DL  L  P+W +GGDFN+ R   E+   S   T   R F+ FI
Subjt:  GIVIMWNESTFTVLEVIEGLFSLTIHLSLADDYSFWITGIYGPNISRDRKLFWRELADLEALCLPKWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFI

Query:  ASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLS
          + L D PL N  +TWS+ + +P    +DRFL + E  + F   +   L R TSDH PICL      WGP PFRF N WL H  F      WW+   + 
Subjt:  ASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLS

Query:  GWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRVQIKAELISIVANEEVLWHQR
        GW GH F++KLK +K +LK+WN +VFG     K  +  +L  ID++E+ G++    ++ R+  + EL  ++  EEV W Q+
Subjt:  GWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRVQIKAELISIVANEEVLWHQR

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]8.5e-6841.84Show/hide
Query:  LIEVFVEEDILEELYTEDTKIDPAVYLPMIFPWLTEHGICIMPMPSKQKTSLTGIVIMWNESTFTVLEVIEGLFSLTIHLSLADDYSFWITGIYGPNISR
        LI+ F+       +  ++TK+     L +   W + HGI    + +    S  GI+I+WN+      E+IEG+FSLTI+  L+D + FW++GIYGP+ + 
Subjt:  LIEVFVEEDILEELYTEDTKIDPAVYLPMIFPWLTEHGICIMPMPSKQKTSLTGIVIMWNESTFTVLEVIEGLFSLTIHLSLADDYSFWITGIYGPNISR

Query:  DRKLFWRELADLEALCLPKWIIGGDFNITRWSWERS---PLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGT
           LFW+EL DL  LC   WI+ GDFN+TRWSWE+S   PL    T++   FN FI  ++L D+PL+NG++TWS    N S SLID FL+T+    K G 
Subjt:  DRKLFWRELADLEALCLPKWIIGGDFNITRWSWERS---PLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGT

Query:  IVVRKLDRATSDHFPICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANID
         + +++ R TSDHFPI L  G + WG  PFRF N WLSH +F   +E+WW   PL GWPGH  + KLK LK  +K W  + F      K  L   + ++D
Subjt:  IVVRKLDRATSDHFPICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANID

Query:  KVEESGSITKADINRRVQIKAELISIVANEEVLWHQR
         +E S  +T      R+Q K +L+S+VA EE  W QR
Subjt:  KVEESGSITKADINRRVQIKAELISIVANEEVLWHQR

TrEMBL top hitse value%identityAlignment
A0A438F4H2 Transposon TX1 uncharacterized 149 kDa protein2.5e-5739.5Show/hide
Query:  GIVIMWNESTFTVLEVIEGLFSLTIHLSLADDYSFWITGIYGPNISRDRKLFWRELADLEALCLPKWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFI
        GIVI+W+   F   E + G FS+T+ L+  ++ SFW+T +YGPN +  R+ FW EL DL  L  P+W +GGDFN+ R  +E+   S   T   R+F+ FI
Subjt:  GIVIMWNESTFTVLEVIEGLFSLTIHLSLADDYSFWITGIYGPNISRDRKLFWRELADLEALCLPKWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFI

Query:  ASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLS
          + L D PL N  +TWS+ + +P    +DRFL + E  + F   +   L R TSDH PICL      WGP PFRF N WL H  F      WW+   + 
Subjt:  ASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLS

Query:  GWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRVQIKAELISIVANEEVLWHQR
        GW  H F++KLK +K +LK+WN +VFG     K  +  +L  ID++E+ G++    ++ R+  + EL  ++  EEV W Q+
Subjt:  GWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRVQIKAELISIVANEEVLWHQR

A0A438G6A4 Transposon TX1 uncharacterized 149 kDa protein7.3e-5739.86Show/hide
Query:  GIVIMWNESTFTVLEVIEGLFSLTIHLSLADDYSFWITGIYGPNISRDRKLFWRELADLEALCLPKWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFI
        GIVI+W+   F   E + G FS+T+ L+  ++  FW+T +YGPN +  RK FW EL DL  L  P+W +GGDFN+ R   E+   S   T   R F+ FI
Subjt:  GIVIMWNESTFTVLEVIEGLFSLTIHLSLADDYSFWITGIYGPNISRDRKLFWRELADLEALCLPKWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFI

Query:  ASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLS
          + L D PL N  +TWS+ + +P    +DRFL + E  + F   +   L R TSDH PICL      WGP PFRF N WL H  F      WW+   + 
Subjt:  ASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLS

Query:  GWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRVQIKAELISIVANEEVLWHQR
        GW GH F++KLK +K +LK+WN +VFG     K  +  +L  ID++E+ G++    ++ R   + EL  ++  EEV W Q+
Subjt:  GWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRVQIKAELISIVANEEVLWHQR

A0A438JSU9 Splicing factor 3A subunit 28.6e-5839.86Show/hide
Query:  GIVIMWNESTFTVLEVIEGLFSLTIHLSLADDYSFWITGIYGPNISRDRKLFWRELADLEALCLPKWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFI
        GIVI+W+   F   E + G FS+T+ L+  ++ SFW+T +YGPN +  R+ FW EL DL  L  P+W +GGDFN+ R   E+   S   T   R F+ FI
Subjt:  GIVIMWNESTFTVLEVIEGLFSLTIHLSLADDYSFWITGIYGPNISRDRKLFWRELADLEALCLPKWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFI

Query:  ASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLS
          + L D PL N  +TWS+ + +P    +DRFL + E  + F   +   L R TSDH PICL      WGP PFRF N WL H  F      WW+   + 
Subjt:  ASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLS

Query:  GWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRVQIKAELISIVANEEVLWHQR
        GW GH F++KLK +K +LK+WN +VFG     K  +  +L  ID++E+ G++    ++ R+  + EL  ++  EEV W Q+
Subjt:  GWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRVQIKAELISIVANEEVLWHQR

A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein3.6e-5639.35Show/hide
Query:  MWNESTFTVLEVIEGLFSLTIHLSLADDYSFWITGIYGPNISRDRKLFWRELADLEALCLPKWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAA
        MWN+  F++L V +G FS++I +   +   +W++ IYGP   ++R LFW EL  L+++CLP WI+GGDFN+ RW  E +        + R+FN FI++  
Subjt:  MWNESTFTVLEVIEGLFSLTIHLSLADDYSFWITGIYGPNISRDRKLFWRELADLEALCLPKWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAA

Query:  LQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPG
        L D PLSN KYTWS+ R   ++S +DRFL T +    F     + L R TSDHFPI L      WGP PFRF NA+L    +   +E WW      G+ G
Subjt:  LQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGTIVVRKLDRATSDHFPICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPG

Query:  HSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRVQIKAELISIVANEEVLWHQR
        +SF+++LK L   +K W +   G+   +K     E+  IDK+E  GS T+    +R  +KA+L  I   E  +W Q+
Subjt:  HSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESGSITKADINRRVQIKAELISIVANEEVLWHQR

A0A6J1E2G6 uncharacterized protein LOC1110254054.1e-6841.84Show/hide
Query:  LIEVFVEEDILEELYTEDTKIDPAVYLPMIFPWLTEHGICIMPMPSKQKTSLTGIVIMWNESTFTVLEVIEGLFSLTIHLSLADDYSFWITGIYGPNISR
        LI+ F+       +  ++TK+     L +   W + HGI    + +    S  GI+I+WN+      E+IEG+FSLTI+  L+D + FW++GIYGP+ + 
Subjt:  LIEVFVEEDILEELYTEDTKIDPAVYLPMIFPWLTEHGICIMPMPSKQKTSLTGIVIMWNESTFTVLEVIEGLFSLTIHLSLADDYSFWITGIYGPNISR

Query:  DRKLFWRELADLEALCLPKWIIGGDFNITRWSWERS---PLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGT
           LFW+EL DL  LC   WI+ GDFN+TRWSWE+S   PL    T++   FN FI  ++L D+PL+NG++TWS    N S SLID FL+T+    K G 
Subjt:  DRKLFWRELADLEALCLPKWIIGGDFNITRWSWERS---PLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELSTKFGT

Query:  IVVRKLDRATSDHFPICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANID
         + +++ R TSDHFPI L  G + WG  PFRF N WLSH +F   +E+WW   PL GWPGH  + KLK LK  +K W  + F      K  L   + ++D
Subjt:  IVVRKLDRATSDHFPICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANID

Query:  KVEESGSITKADINRRVQIKAELISIVANEEVLWHQR
         +E S  +T      R+Q K +L+S+VA EE  W QR
Subjt:  KVEESGSITKADINRRVQIKAELISIVANEEVLWHQR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAATACAACATAGGTTATGTCGTCGGTATTCATGGCAAGATACCATCATCTTCGATGGCTTCCAGTGCTACACGCGCCGACGAGAAAGTATTTGTTGCGGATAA
CGAACGGGTTCATAACCCACGCGCCACCACCCTGAATGATGAAACAAAAGGGGAAAAGAAACAGTCCATATACAGAAACGACTTTCCACAGGCCCCAAATGATGTATTGG
ACACATCCACTGCTCTGATGTCTGCGTCTTTATCTGACAGGGACCCTCTGGCGCCATCTATCACCACAGAGCCCCAATCCCCAAAATCCTCTCTTGATAAGCACTCGGAG
CCTTTTATCGAAGATCCTATTCCCTTGCAAATAGAAGAGCCTCAATCCGATCAGCAGGGAATAGGTTTACAATACACAGATCTCATTGAGGTTTTTGTGGAAGAAGACAT
CTTGGAAGAGTTGTATACAGAGGACACCAAAATTGACCCAGCTGTATATCTTCCCATGATCTTCCCCTGGCTGACTGAGCACGGAATATGCATTATGCCCATGCCTAGTA
AACAGAAGACATCTCTTACTGGTATTGTCATAATGTGGAACGAATCCACCTTCACTGTGTTAGAGGTTATTGAAGGTCTTTTCTCTCTCACCATCCACCTTTCTCTCGCT
GATGACTACTCTTTTTGGATTACAGGGATTTATGGACCTAACATCTCTCGAGACAGGAAGCTTTTTTGGCGAGAGCTTGCTGATTTAGAGGCCCTATGCTTACCTAAATG
GATTATCGGTGGTGATTTCAACATCACTCGCTGGTCTTGGGAACGGTCCCCTCTCTCCTTCACCCCAACCCGTGCCACAAGGAAATTCAATCGCTTCATTGCCTCTGCCG
CCCTACAAGACCTTCCCCTCTCAAACGGCAAGTACACTTGGTCTAGTTTCAGGCCAAATCCATCGATGTCGCTTATTGATAGGTTCTTGATCACAGACGAATTATCTACA
AAATTTGGAACAATTGTGGTTCGCAAGTTAGATAGAGCTACATCTGATCATTTCCCGATTTGCCTCACTCTGGGGAATGATCGTTGGGGTCCTCCTCCATTCAGATTTGT
CAATGCTTGGCTATCTCATGCCTCCTTCCTTCATACTGTTGAGTCATGGTGGAAGGCAAACCCATTGTCTGGATGGCCTGGCCATAGTTTTATTCAAAAGCTAAAAGGTC
TTAAAAAGGAGTTGAAGCAATGGAACCAACAAGTTTTTGGCCAGCAATCGGTTAATAAAAACAGGCTGGGGCTAGAACTTGCGAATATCGATAAGGTAGAAGAAAGTGGT
TCCATTACCAAAGCCGATATTAACAGAAGGGTACAGATTAAGGCTGAATTGATCTCTATTGTGGCAAATGAAGAAGTTTTATGGCATCAAAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAATACAACATAGGTTATGTCGTCGGTATTCATGGCAAGATACCATCATCTTCGATGGCTTCCAGTGCTACACGCGCCGACGAGAAAGTATTTGTTGCGGATAA
CGAACGGGTTCATAACCCACGCGCCACCACCCTGAATGATGAAACAAAAGGGGAAAAGAAACAGTCCATATACAGAAACGACTTTCCACAGGCCCCAAATGATGTATTGG
ACACATCCACTGCTCTGATGTCTGCGTCTTTATCTGACAGGGACCCTCTGGCGCCATCTATCACCACAGAGCCCCAATCCCCAAAATCCTCTCTTGATAAGCACTCGGAG
CCTTTTATCGAAGATCCTATTCCCTTGCAAATAGAAGAGCCTCAATCCGATCAGCAGGGAATAGGTTTACAATACACAGATCTCATTGAGGTTTTTGTGGAAGAAGACAT
CTTGGAAGAGTTGTATACAGAGGACACCAAAATTGACCCAGCTGTATATCTTCCCATGATCTTCCCCTGGCTGACTGAGCACGGAATATGCATTATGCCCATGCCTAGTA
AACAGAAGACATCTCTTACTGGTATTGTCATAATGTGGAACGAATCCACCTTCACTGTGTTAGAGGTTATTGAAGGTCTTTTCTCTCTCACCATCCACCTTTCTCTCGCT
GATGACTACTCTTTTTGGATTACAGGGATTTATGGACCTAACATCTCTCGAGACAGGAAGCTTTTTTGGCGAGAGCTTGCTGATTTAGAGGCCCTATGCTTACCTAAATG
GATTATCGGTGGTGATTTCAACATCACTCGCTGGTCTTGGGAACGGTCCCCTCTCTCCTTCACCCCAACCCGTGCCACAAGGAAATTCAATCGCTTCATTGCCTCTGCCG
CCCTACAAGACCTTCCCCTCTCAAACGGCAAGTACACTTGGTCTAGTTTCAGGCCAAATCCATCGATGTCGCTTATTGATAGGTTCTTGATCACAGACGAATTATCTACA
AAATTTGGAACAATTGTGGTTCGCAAGTTAGATAGAGCTACATCTGATCATTTCCCGATTTGCCTCACTCTGGGGAATGATCGTTGGGGTCCTCCTCCATTCAGATTTGT
CAATGCTTGGCTATCTCATGCCTCCTTCCTTCATACTGTTGAGTCATGGTGGAAGGCAAACCCATTGTCTGGATGGCCTGGCCATAGTTTTATTCAAAAGCTAAAAGGTC
TTAAAAAGGAGTTGAAGCAATGGAACCAACAAGTTTTTGGCCAGCAATCGGTTAATAAAAACAGGCTGGGGCTAGAACTTGCGAATATCGATAAGGTAGAAGAAAGTGGT
TCCATTACCAAAGCCGATATTAACAGAAGGGTACAGATTAAGGCTGAATTGATCTCTATTGTGGCAAATGAAGAAGTTTTATGGCATCAAAGATGA
Protein sequenceShow/hide protein sequence
MEEYNIGYVVGIHGKIPSSSMASSATRADEKVFVADNERVHNPRATTLNDETKGEKKQSIYRNDFPQAPNDVLDTSTALMSASLSDRDPLAPSITTEPQSPKSSLDKHSE
PFIEDPIPLQIEEPQSDQQGIGLQYTDLIEVFVEEDILEELYTEDTKIDPAVYLPMIFPWLTEHGICIMPMPSKQKTSLTGIVIMWNESTFTVLEVIEGLFSLTIHLSLA
DDYSFWITGIYGPNISRDRKLFWRELADLEALCLPKWIIGGDFNITRWSWERSPLSFTPTRATRKFNRFIASAALQDLPLSNGKYTWSSFRPNPSMSLIDRFLITDELST
KFGTIVVRKLDRATSDHFPICLTLGNDRWGPPPFRFVNAWLSHASFLHTVESWWKANPLSGWPGHSFIQKLKGLKKELKQWNQQVFGQQSVNKNRLGLELANIDKVEESG
SITKADINRRVQIKAELISIVANEEVLWHQR