CuGenDBv2

CSPI05G13600 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI05G13600
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: Chr5:13411586..13413376
RNA-Seq Expression: CSPI05G13600
Synteny: CSPI05G13600
Gene Ontology terms: NA
InterPro domains: IPR000477 - Reverse transcriptase domain
                  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
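
The genome location above uses a `chromosome:start..end` string. As a minimal sketch, it could be split into its parts as follows. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'Chr5:13411586..13413376' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr5:13411586..13413376")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 1791 bp on Chr5; with 0-based half-open coordinates the result would be one base shorter.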


Homology
GenBank top hits (e value, % identity, alignment)
RVW16209.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] (e value: 6.0e-80; % identity: 33.33)
Query:  MKIISWNTRGLKDPNKRSALKKFIKNHHPDMVLIQESKMEILEVNFIKTIWSSMDIGWESVESYGASGGILTLWDKSKITVVETIRRHYSLSIKCVTLCK
        MKIISWN RGL   NKR  +K F+++ +PD+V+IQE+K E  +  F+ ++W+  +  W ++ + GASGGIL +WD   ++  E +   +S+S+K      
Subjt:  MKIISWNTRGLKDPNKRSALKKFIKNHHPDMVLIQESKMEILEVNFIKTIWSSMDIGWESVESYGASGGILTLWDKSKITVVETIRRHYSLSIKCVTLCK

Query:  KCCWVTNVYGPCGYRERKLVWPELLTIAECGEEAWSLGGDFNITRWVYERFPVGRSTRGMRQFNAFIDSANLMEISLQNGKFTWSREDRSAARSLLDRFF
           W++ VYGP     RK  W EL  I       W +GGDFN+ R   E+      T  MR F++FI    L++  L+N  FTWS    S     LDRF 
Subjt:  KCCWVTNVYGPCGYRERKLVWPELLTIAECGEEAWSLGGDFNITRWVYERFPVGRSTRGMRQFNAFIDSANLMEISLQNGKFTWSREDRSAARSLLDRFF

Query:  INSKW----------------------------------DDEFENSRIKECNKVIEEVLK---IALPDIGWAKEERLLRELEII---------DGFAESV
         +++W                                     FEN  ++  N   +E  +        IGW +  + +R L+ +           F E  
Subjt:  INSKW----------------------------------DDEFENSRIKECNKVIEEVLK---IALPDIGWAKEERLLRELEII---------DGFAESV

Query:  GLNEVELA-----------------------------HKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILWFYEDLYKKILGVRYLPFIADWPCVSVS
           + EL                              HK  N ++ +  I +L +E+G++ K+   I   IL ++E LY    G  +     DW  +S  
Subjt:  GLNEVELA-----------------------------HKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILWFYEDLYKKILGVRYLPFIADWPCVSVS

Query:  QKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLIQKKEDAIHVRDFRLISLTTITYKVVAK
            L + F+  EI KAI  L  +KAP PDGFT   F + W V+KE L+R+F EFHR+G +N     +FI LI KK  +  + DFR ISL T  YK++AK
Subjt:  QKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLIQKKEDAIHVRDFRLISLTTITYKVVAK

Query:  VLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDYQAKGKKGWILKLDLEKAFDRVDWGFL
        VL+ RL+GV+   I   Q AF++GRQ+LD +LIANE+V++ +  G++G + K+D EKA+D V W FL
Subjt:  VLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDYQAKGKKGWILKLDLEKAFDRVDWGFL

RVW27595.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] (e value: 7.1e-81; % identity: 32.16)
Query:  MKIISWNTRGLKDPNKRSALKKFIKNHHPDMVLIQESKMEILEVNFIKTIWSSMDIGWESVESYGASGGILTLWDKSKITVVETIRRHYSLSIKCVTLCK
        MKIISWN RGL   NKR  +K F++   PD+V+IQE+K E  +   + ++W+  +  W+ + + GASGGIL +WD  K++  E +   +S+S+K      
Subjt:  MKIISWNTRGLKDPNKRSALKKFIKNHHPDMVLIQESKMEILEVNFIKTIWSSMDIGWESVESYGASGGILTLWDKSKITVVETIRRHYSLSIKCVTLCK

Query:  KCCWVTNVYGPCGYRERKLVWPELLTIAECGEEAWSLGGDFNITRWVYERFPVGRSTRGMRQFNAFIDSANLMEISLQNGKFTWSREDRSAARSLLDRFF
           W++ VYGP     RK  W EL  I       W +GGDFN+ R   E+    R T  MR F++FI  + L++  L+N  FTWS    S     LDRF 
Subjt:  KCCWVTNVYGPCGYRERKLVWPELLTIAECGEEAWSLGGDFNITRWVYERFPVGRSTRGMRQFNAFIDSANLMEISLQNGKFTWSREDRSAARSLLDRFF

Query:  INSKWDD-----------------------------------------------------------------------EFENSRIKECNK----VIEEVL
         +++W                                                                         +F  ++ KE NK    V+ E  
Subjt:  INSKWDD-----------------------------------------------------------------------EFENSRIKECNK----VIEEVL

Query:  KIALPDI----------GWAKE---ERLLRELEIIDGFAE--------------SVGLNEVELAHKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILW
        K  L D+          G   E   +R LR+ E+ D                    G    +  HK  N ++ +  I  L +E G++  + + I   IL 
Subjt:  KIALPDI----------GWAKE---ERLLRELEIIDGFAE--------------SVGLNEVELAHKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILW

Query:  FYEDLYKKILGVRYLPFIADWPCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLI
        ++E LY   +G  +     DW  +S     SL+  F+  EI KAI  +  +KAP PDGFT   F   W V+KE L+R+F EFHR+G +N     +FI L+
Subjt:  FYEDLYKKILGVRYLPFIADWPCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLI

Query:  QKKEDAIHVRDFRLISLTTITYKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDYQAKGKKGWILKLDLEKAFDRVDWGFLVKV
         KK     + DFR ISL T  YK++AKVL+ RL+GV+   I   Q AF++GRQ++D +LIANE+V++ +  G++G + K+D EKA+D V W FL +V
Subjt:  QKKEDAIHVRDFRLISLTTITYKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDYQAKGKKGWILKLDLEKAFDRVDWGFLVKV

RVW43689.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] (e value: 3.9e-79; % identity: 31.99)
Query:  MKIISWNTRGLKDPNKRSALKKFIKNHHPDMVLIQESKMEILEVNFIKTIWSSMDIGWESVESYGASGGILTLWDKSKITVVETIRRHYSLSIKCVTLCK
        MKIISWN RGL   NKR  +K F+K   PD+V+IQE+K E  +   + ++W+  +  W  + + GASGGIL +WD  K++  E +   +S+ +K      
Subjt:  MKIISWNTRGLKDPNKRSALKKFIKNHHPDMVLIQESKMEILEVNFIKTIWSSMDIGWESVESYGASGGILTLWDKSKITVVETIRRHYSLSIKCVTLCK

Query:  KCCWVTNVYGPCGYRERKLVWPELLTIAECGEEAWSLGGDFNITRWVYERFPVGRSTRGMRQFNAFIDSANLMEISLQNGKFTWSREDRSAARSLLDRFF
           W++ VYGP     RK  W EL  I       W +GGDFN+ R   E+    R T  MR F++FI  + L++  L+N  FTWS    S     LD F 
Subjt:  KCCWVTNVYGPCGYRERKLVWPELLTIAECGEEAWSLGGDFNITRWVYERFPVGRSTRGMRQFNAFIDSANLMEISLQNGKFTWSREDRSAARSLLDRFF

Query:  INSKWDD-----------------------------------------------------------------------EFENSRIKECNK----VIEEVL
         +++W                                                                         +F  ++ KE NK    V+ E  
Subjt:  INSKWDD-----------------------------------------------------------------------EFENSRIKECNK----VIEEVL

Query:  KIALPDI----------GWAKE---ERLLRELEIIDGFAE--------------SVGLNEVELAHKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILW
        K  L D+          G   E   +R LR+ E+ D                    G    +  HK  N ++ +  I  L +E G++  +   I   IL 
Subjt:  KIALPDI----------GWAKE---ERLLRELEIIDGFAE--------------SVGLNEVELAHKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILW

Query:  FYEDLYKKILGVRYLPFIADWPCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLI
        ++E LY   +G  +     DW  +S     SL+  F+  EI KAI  +  +KAP PDGFT   F   W V+KE L+R+F EFHR+G +N     +FI L+
Subjt:  FYEDLYKKILGVRYLPFIADWPCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLI

Query:  QKKEDAIHVRDFRLISLTTITYKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDYQAKGKKGWILKLDLEKAFDRVDWGFLVKV
         KK     + DFR ISL T  YK++AKVL+ RL+GV+   I   Q AF++GRQ++D +LIANE+V++ +  G++G + K+D EKA+D V W FL +V
Subjt:  QKKEDAIHVRDFRLISLTTITYKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDYQAKGKKGWILKLDLEKAFDRVDWGFLVKV

RVW64408.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] (e value: 3.5e-80; % identity: 31.65)
Query:  MKIISWNTRGLKDPNKRSALKKFIKNHHPDMVLIQESKMEILEVNFIKTIWSSMDIGWESVESYGASGGILTLWDKSKITVVETIRRHYSLSIKCVTLCK
        MKI+SWNTRGL    KR  +++F+   +PD+V++QE+K E  +  F+ ++W    + W ++ + GASGGI+ LWD SK+   E +   +S+++K  +  +
Subjt:  MKIISWNTRGLKDPNKRSALKKFIKNHHPDMVLIQESKMEILEVNFIKTIWSSMDIGWESVESYGASGGILTLWDKSKITVVETIRRHYSLSIKCVTLCK

Query:  KCCWVTNVYGPCGYRERKLVWPELLTIAECGEEAWSLGGDFNITRWVYERFPVGRSTRGMRQFNAFIDSANLMEISLQNGKFTWSREDRSAARSLLDRFF
           W+T+VYGP     RK  W EL  +       W +GGDFN+ R + E+    R T  MR F+ FI  + L++  L+N  FTWS          LDRF 
Subjt:  KCCWVTNVYGPCGYRERKLVWPELLTIAECGEEAWSLGGDFNITRWVYERFPVGRSTRGMRQFNAFIDSANLMEISLQNGKFTWSREDRSAARSLLDRFF

Query:  INSKWDD----------------------------------EFEN-------------------------------------SRIKECNKV----IEEVL
         +S+WD                                    FEN                                     S++KE N +    ++E  
Subjt:  INSKWDD----------------------------------EFEN-------------------------------------SRIKECNKV----IEEVL

Query:  KIALPDIGWAK-------------EERLLRELEIIDGFAE--------------SVGLNEVELAHKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILW
        K+ L D+                  ER L+  E+ D   +                G    +  H+    ++ +  I  L+ E+G    +  DI   I+ 
Subjt:  KIALPDIGWAK-------------EERLLRELEIIDGFAE--------------SVGLNEVELAHKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILW

Query:  FYEDLYKKILGVRYLPFIADWPCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLI
        F+ +LY K +G  +     DW  +S      L   F+  E+ +A+  L   KAP PDGFT   + + W V+KE LMR+F EFH NG +N      FI L+
Subjt:  FYEDLYKKILGVRYLPFIADWPCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLI

Query:  QKKEDAIHVRDFRLISLTTITYKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDYQAKGKKGWILKLDLEKAFDRVDWGFL
         KK  ++ + D+R ISL T  YK++AKVL+ RL+ V+   IS  Q AF+EGR +LD +LIANEVV++ +  G++G + K+D EKA+D VDWGFL
Subjt:  QKKEDAIHVRDFRLISLTTITYKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDYQAKGKKGWILKLDLEKAFDRVDWGFL

RVX20328.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] (e value: 1.6e-80; % identity: 31.83)
Query:  MKIISWNTRGLKDPNKRSALKKFIKNHHPDMVLIQESKMEILEVNFIKTIWSSMDIGWESVESYGASGGILTLWDKSKITVVETIRRHYSLSIKCVTLCK
        MKIISWN RGL   NKR  +K F++   PD+V+IQE+K E  +   + ++W+  +  W  + + GASGGIL +WD  K++  E +   +S+S+K V    
Subjt:  MKIISWNTRGLKDPNKRSALKKFIKNHHPDMVLIQESKMEILEVNFIKTIWSSMDIGWESVESYGASGGILTLWDKSKITVVETIRRHYSLSIKCVTLCK

Query:  KCCWVTNVYGPCGYRERKLVWPELLTIAECGEEAWSLGGDFNITRWVYERFPVGRSTRGMRQFNAFIDSANLMEISLQNGKFTWSREDRSAARSLLDRFF
           W++ VYGP     RK  W EL  I       W +GGDFN+ R   E+    R T  MR F++FI  + L++  L+N  FTWS    S     LDRF 
Subjt:  KCCWVTNVYGPCGYRERKLVWPELLTIAECGEEAWSLGGDFNITRWVYERFPVGRSTRGMRQFNAFIDSANLMEISLQNGKFTWSREDRSAARSLLDRFF

Query:  INSKWDD-----------------------------------------------------------------------EFENSRIKECNK----VIEEVL
         +++W                                                                         +F  ++ KE NK    V+ E  
Subjt:  INSKWDD-----------------------------------------------------------------------EFENSRIKECNK----VIEEVL

Query:  KIALPDIG--------WAKEERLLRELEIIDGFAESVGLNE-------------------VELAHKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILW
        K  L D+               LL +  +  G  E + L E                    +  HK  N ++ +  I  L +E G++  + + I   IL 
Subjt:  KIALPDIG--------WAKEERLLRELEIIDGFAESVGLNE-------------------VELAHKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILW

Query:  FYEDLYKKILGVRYLPFIADWPCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLI
        ++E LY   +G  +     DW  +S     SL+  F+  EI KAI  +  +KAP PDGFT   F   W V+KE L+R+F EFHR+G +N     +FI L+
Subjt:  FYEDLYKKILGVRYLPFIADWPCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLI

Query:  QKKEDAIHVRDFRLISLTTITYKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDYQAKGKKGWILKLDLEKAFDRVDWGFLVKV
         KK     + DFR ISL T  YK++AKVL+ RL+GV+   I   Q AF++GRQ++D +LIANE+V++ +  G++G + K+D EKA+D + W FL +V
Subjt:  QKKEDAIHVRDFRLISLTTITYKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDYQAKGKKGWILKLDLEKAFDRVDWGFLVKV

TrEMBL top hits (e value, % identity, alignment)
A0A438CWL6 Transposon TX1 uncharacterized 149 kDa protein (e value: 3.4e-81; % identity: 32.16)
Query:  MKIISWNTRGLKDPNKRSALKKFIKNHHPDMVLIQESKMEILEVNFIKTIWSSMDIGWESVESYGASGGILTLWDKSKITVVETIRRHYSLSIKCVTLCK
        MKIISWN RGL   NKR  +K F++   PD+V+IQE+K E  +   + ++W+  +  W+ + + GASGGIL +WD  K++  E +   +S+S+K      
Subjt:  MKIISWNTRGLKDPNKRSALKKFIKNHHPDMVLIQESKMEILEVNFIKTIWSSMDIGWESVESYGASGGILTLWDKSKITVVETIRRHYSLSIKCVTLCK

Query:  KCCWVTNVYGPCGYRERKLVWPELLTIAECGEEAWSLGGDFNITRWVYERFPVGRSTRGMRQFNAFIDSANLMEISLQNGKFTWSREDRSAARSLLDRFF
           W++ VYGP     RK  W EL  I       W +GGDFN+ R   E+    R T  MR F++FI  + L++  L+N  FTWS    S     LDRF 
Subjt:  KCCWVTNVYGPCGYRERKLVWPELLTIAECGEEAWSLGGDFNITRWVYERFPVGRSTRGMRQFNAFIDSANLMEISLQNGKFTWSREDRSAARSLLDRFF

Query:  INSKWDD-----------------------------------------------------------------------EFENSRIKECNK----VIEEVL
         +++W                                                                         +F  ++ KE NK    V+ E  
Subjt:  INSKWDD-----------------------------------------------------------------------EFENSRIKECNK----VIEEVL

Query:  KIALPDI----------GWAKE---ERLLRELEIIDGFAE--------------SVGLNEVELAHKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILW
        K  L D+          G   E   +R LR+ E+ D                    G    +  HK  N ++ +  I  L +E G++  + + I   IL 
Subjt:  KIALPDI----------GWAKE---ERLLRELEIIDGFAE--------------SVGLNEVELAHKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILW

Query:  FYEDLYKKILGVRYLPFIADWPCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLI
        ++E LY   +G  +     DW  +S     SL+  F+  EI KAI  +  +KAP PDGFT   F   W V+KE L+R+F EFHR+G +N     +FI L+
Subjt:  FYEDLYKKILGVRYLPFIADWPCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLI

Query:  QKKEDAIHVRDFRLISLTTITYKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDYQAKGKKGWILKLDLEKAFDRVDWGFLVKV
         KK     + DFR ISL T  YK++AKVL+ RL+GV+   I   Q AF++GRQ++D +LIANE+V++ +  G++G + K+D EKA+D V W FL +V
Subjt:  QKKEDAIHVRDFRLISLTTITYKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDYQAKGKKGWILKLDLEKAFDRVDWGFLVKV

A0A438FWU5 LINE-1 retrotransposable element ORF2 protein (e value: 1.7e-80; % identity: 31.65)
Query:  MKIISWNTRGLKDPNKRSALKKFIKNHHPDMVLIQESKMEILEVNFIKTIWSSMDIGWESVESYGASGGILTLWDKSKITVVETIRRHYSLSIKCVTLCK
        MKI+SWNTRGL    KR  +++F+   +PD+V++QE+K E  +  F+ ++W    + W ++ + GASGGI+ LWD SK+   E +   +S+++K  +  +
Subjt:  MKIISWNTRGLKDPNKRSALKKFIKNHHPDMVLIQESKMEILEVNFIKTIWSSMDIGWESVESYGASGGILTLWDKSKITVVETIRRHYSLSIKCVTLCK

Query:  KCCWVTNVYGPCGYRERKLVWPELLTIAECGEEAWSLGGDFNITRWVYERFPVGRSTRGMRQFNAFIDSANLMEISLQNGKFTWSREDRSAARSLLDRFF
           W+T+VYGP     RK  W EL  +       W +GGDFN+ R + E+    R T  MR F+ FI  + L++  L+N  FTWS          LDRF 
Subjt:  KCCWVTNVYGPCGYRERKLVWPELLTIAECGEEAWSLGGDFNITRWVYERFPVGRSTRGMRQFNAFIDSANLMEISLQNGKFTWSREDRSAARSLLDRFF

Query:  INSKWDD----------------------------------EFEN-------------------------------------SRIKECNKV----IEEVL
         +S+WD                                    FEN                                     S++KE N +    ++E  
Subjt:  INSKWDD----------------------------------EFEN-------------------------------------SRIKECNKV----IEEVL

Query:  KIALPDIGWAK-------------EERLLRELEIIDGFAE--------------SVGLNEVELAHKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILW
        K+ L D+                  ER L+  E+ D   +                G    +  H+    ++ +  I  L+ E+G    +  DI   I+ 
Subjt:  KIALPDIGWAK-------------EERLLRELEIIDGFAE--------------SVGLNEVELAHKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILW

Query:  FYEDLYKKILGVRYLPFIADWPCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLI
        F+ +LY K +G  +     DW  +S      L   F+  E+ +A+  L   KAP PDGFT   + + W V+KE LMR+F EFH NG +N      FI L+
Subjt:  FYEDLYKKILGVRYLPFIADWPCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLI

Query:  QKKEDAIHVRDFRLISLTTITYKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDYQAKGKKGWILKLDLEKAFDRVDWGFL
         KK  ++ + D+R ISL T  YK++AKVL+ RL+ V+   IS  Q AF+EGR +LD +LIANEVV++ +  G++G + K+D EKA+D VDWGFL
Subjt:  QKKEDAIHVRDFRLISLTTITYKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDYQAKGKKGWILKLDLEKAFDRVDWGFL

A0A438KGJ2 Transposon TX1 uncharacterized 149 kDa protein (e value: 7.6e-81; % identity: 31.83)
Query:  MKIISWNTRGLKDPNKRSALKKFIKNHHPDMVLIQESKMEILEVNFIKTIWSSMDIGWESVESYGASGGILTLWDKSKITVVETIRRHYSLSIKCVTLCK
        MKIISWN RGL   NKR  +K F++   PD+V+IQE+K E  +   + ++W+  +  W  + + GASGGIL +WD  K++  E +   +S+S+K V    
Subjt:  MKIISWNTRGLKDPNKRSALKKFIKNHHPDMVLIQESKMEILEVNFIKTIWSSMDIGWESVESYGASGGILTLWDKSKITVVETIRRHYSLSIKCVTLCK

Query:  KCCWVTNVYGPCGYRERKLVWPELLTIAECGEEAWSLGGDFNITRWVYERFPVGRSTRGMRQFNAFIDSANLMEISLQNGKFTWSREDRSAARSLLDRFF
           W++ VYGP     RK  W EL  I       W +GGDFN+ R   E+    R T  MR F++FI  + L++  L+N  FTWS    S     LDRF 
Subjt:  KCCWVTNVYGPCGYRERKLVWPELLTIAECGEEAWSLGGDFNITRWVYERFPVGRSTRGMRQFNAFIDSANLMEISLQNGKFTWSREDRSAARSLLDRFF

Query:  INSKWDD-----------------------------------------------------------------------EFENSRIKECNK----VIEEVL
         +++W                                                                         +F  ++ KE NK    V+ E  
Subjt:  INSKWDD-----------------------------------------------------------------------EFENSRIKECNK----VIEEVL

Query:  KIALPDIG--------WAKEERLLRELEIIDGFAESVGLNE-------------------VELAHKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILW
        K  L D+               LL +  +  G  E + L E                    +  HK  N ++ +  I  L +E G++  + + I   IL 
Subjt:  KIALPDIG--------WAKEERLLRELEIIDGFAESVGLNE-------------------VELAHKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILW

Query:  FYEDLYKKILGVRYLPFIADWPCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLI
        ++E LY   +G  +     DW  +S     SL+  F+  EI KAI  +  +KAP PDGFT   F   W V+KE L+R+F EFHR+G +N     +FI L+
Subjt:  FYEDLYKKILGVRYLPFIADWPCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLI

Query:  QKKEDAIHVRDFRLISLTTITYKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDYQAKGKKGWILKLDLEKAFDRVDWGFLVKV
         KK     + DFR ISL T  YK++AKVL+ RL+GV+   I   Q AF++GRQ++D +LIANE+V++ +  G++G + K+D EKA+D + W FL +V
Subjt:  QKKEDAIHVRDFRLISLTTITYKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDYQAKGKKGWILKLDLEKAFDRVDWGFLVKV

A0A803P8A0 Uncharacterized protein (e value: 2.1e-83; % identity: 32.32)
Query:  MKIISWNTRGLKDPNKRSALKKFIKNHHPDMVLIQESKMEILEVNFIKTIWSSMDIGWESVESYGASGGILTLWDKSKITVVETIRRHYSLSIKCVTLCK
        MKI++WN RG  D  KR+A+K  I   +PDMV++QE K   ++  FI +IW S    W  + + G SGG L +WD   I+V++++   +S+S+      K
Subjt:  MKIISWNTRGLKDPNKRSALKKFIKNHHPDMVLIQESKMEILEVNFIKTIWSSMDIGWESVESYGASGGILTLWDKSKITVVETIRRHYSLSIKCVTLCK

Query:  KCCWVTNVYGPCGYRERKLVWPELLTIAECGEEAWSLGGDFNITRWVYERFPVGRSTRGMRQFNAFIDSANLMEISLQNGKFTWSREDRSAARSLLDRFF
        +  W + VYGPC Y+ R + W EL  ++    E+W +GGDFN+TR V E+     STR M+ F+  I    L++  L+NG FTWS        S LDRF 
Subjt:  KCCWVTNVYGPCGYRERKLVWPELLTIAECGEEAWSLGGDFNITRWVYERFPVGRSTRGMRQFNAFIDSANLMEISLQNGKFTWSREDRSAARSLLDRFF

Query:  INSKWDDEFENSRIKECNKVIEE----VLKIALPDIG----------------------WAKEE--------RLLRELEIIDGFAE-----SVGLNEV--
          + W+  F   R +   +++ +    V+    P  G                      W +EE        + +++L+ + G A+     + G N+   
Subjt:  INSKWDDEFENSRIKECNKVIEE----VLKIALPDIG----------------------WAKEE--------RLLRELEIIDGFAE-----SVGLNEV--

Query:  -------------------------------------------------------------ELAHKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILW
                                                                        H  LNA+K +N I+++  + G I  S  +I   ++ 
Subjt:  -------------------------------------------------------------ELAHKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILW

Query:  FYEDLYKKILGVRYLPFIADWPCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLI
        F+  LY     +       +W  ++      L   F   E+   + +  G+KAP PDGF+   F  +W V+K  LM +F  FH  G++   I + FICLI
Subjt:  FYEDLYKKILGVRYLPFIADWPCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLI

Query:  QKKEDAIHVRDFRLISLTTITYKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDYQAKGKKGWILKLDLEKAFDRVDWGFL
         K+ ++  V+DFR ISL T  YK++AK LA RL+GV+   IS  QSAF+EGRQ+LD +L+ANE VEDY+++GKKG++LK+D EKA+DRVDWGFL
Subjt:  QKKEDAIHVRDFRLISLTTITYKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDYQAKGKKGWILKLDLEKAFDRVDWGFL

A0A803QEA6 Uncharacterized protein (e value: 5.3e-82; % identity: 31.53)
Query:  KIISWNTRGLKDPNKRSALKKFIKNHHPDMVLIQESKMEILEVNFIKTIWSSMDIGWESVESYGASGGILTLWDKSKITVVETIRRHYSLSIKCVTLCKK
        +I++WN RG  D  KR+A+K  I   +PD+V++QE K   ++  FI +IW S    W  + + G SGG L +WD   I+V++++   +S+S+      K+
Subjt:  KIISWNTRGLKDPNKRSALKKFIKNHHPDMVLIQESKMEILEVNFIKTIWSSMDIGWESVESYGASGGILTLWDKSKITVVETIRRHYSLSIKCVTLCKK

Query:  CCWVTNVYGPCGYRERKLVWPELLTIAECGEEAWSLGGDFNITRWVYERFPVGRSTRGMRQFNAFIDSANLMEISLQNGKFTWSREDRSAARSLLDRFFI
          W + VYGPC Y+ R   W EL  ++    ++W + GDFN+TR V E+      TR M+ F+  I    L++  L+NG FTWS    S   S LDRF  
Subjt:  CCWVTNVYGPCGYRERKLVWPELLTIAECGEEAWSLGGDFNITRWVYERFPVGRSTRGMRQFNAFIDSANLMEISLQNGKFTWSREDRSAARSLLDRFFI

Query:  NSKWDDEFENSRIKECNKVIEEVLKIAL----PDIG----------------------WAKEE--------RLLRELEIIDGFAE-----SVGLNEVE--
         + W+  F   R +   +++ +   + +    P  G                      W KEE        + +++L+I+ G  +     + G N  +  
Subjt:  NSKWDDEFENSRIKECNKVIEEVLKIAL----PDIG----------------------WAKEE--------RLLRELEIIDGFAE-----SVGLNEVE--

Query:  -------------------------------------------------------------LAHKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILWF
                                                                       H  LNA+K +N I+++  E G I  +  +I   ++ F
Subjt:  -------------------------------------------------------------LAHKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILWF

Query:  YEDLYKKILGVRYLPFIADWPCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLIQ
        +  LY     +       +W  ++ S    L   F   E+   + +  GNKAP PDGF+      +W  +K  LM +F  FHR G++   I + FICLI 
Subjt:  YEDLYKKILGVRYLPFIADWPCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLIQ

Query:  KKEDAIHVRDFRLISLTTITYKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDYQAKGKKGWILKLDLEKAFDRVDWGFL
        K+ ++  V+DFR ISL T  YK++AK LA RL+GV+   IS  QSAF+EGRQ+LD +L+ANE VEDY+++G+KG++LK+D EKA+DRVDWGFL
Subjt:  KKEDAIHVRDFRLISLTTITYKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDYQAKGKKGWILKLDLEKAFDRVDWGFL

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 5.0e-21; % identity: 28.53)
Query:  RSAARSLLDRFFINSKWDDEFENSRIKECNKVIEEVLKIALPDIGWAKEERLLRELEIIDGFAESVGLNEVELAHKFLNAKKRKNLITKLVDEQGVITKS
        R   RS +D      K  ++ E +  K   +  +E+ KI         ++ L +  E    F E +   +  LA + +  K+ KN I  + +++G IT  
Subjt:  RSAARSLLDRFFINSKWDDEFENSRIKECNKVIEEVLKIALPDIGWAKEERLLRELEIIDGFAESVGLNEVELAHKFLNAKKRKNLITKLVDEQGVITKS

Query:  FLDIERVILWFYEDLY-KKILGVRYLPFIAD---WPCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNG
          +I+  I  +Y+ LY  K+  +  +    D    P ++  +  SL    +  EI   I +L   K+P PDGFT EF+ ++   L   L++LF+   + G
Subjt:  FLDIERVILWFYEDLY-KKILGVRYLPFIAD---WPCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNG

Query:  KLNACIQENFICLIQKK-EDAIHVRDFRLISLTTITYKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDY-QAKGKKGWILKLDLEK
         L     E  I LI K   D     +FR ISL  I  K++ K+LA+R++  +  +I   Q  FI G Q    I  +  V++   +AK K   I+ +D EK
Subjt:  KLNACIQENFICLIQKK-EDAIHVRDFRLISLTTITYKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDY-QAKGKKGWILKLDLEK

Query:  AFDRVDWGFLVK
        AFD++   F++K
Subjt:  AFDRVDWGFLVK

P08548 LINE-1 reverse transcriptase homolog (e value: 1.2e-17; % identity: 28.52)
Query:  SRIKECNKVIEEVLKIALPDIGWAKEERLLREL-EIIDGFAESVGLNEVELAHKFLNAKKR-KNLITKLVDEQGVITKSFLDIERVILWFYEDL----YK
        SR KE  K+  E+ +I        + +R+++++ +    F E +   +  LA+  L  KKR K+LI+ + +    IT    +I++++  +Y+ L    Y+
Subjt:  SRIKECNKVIEEVLKIALPDIGWAKEERLLREL-EIIDGFAESVGLNEVELAHKFLNAKKR-KNLITKLVDEQGVITKSFLDIERVILWFYEDL----YK

Query:  KILGVRYLPFIADWPCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLIQKK-EDA
         +  +         P +S  +   L    S+ EI   I+ L   K+P PDGFT+EF+      L   L+ LF+   + G L     E  I LI K  +D 
Subjt:  KILGVRYLPFIADWPCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLIQKK-EDA

Query:  IHVRDFRLISLTTITYKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDY-QAKGKKGWILKLDLEKAFDRVDWGFLVK
            ++R ISL  I  K++ K+L +R++  +  II   Q  FI G Q    I  +  V++   + K K   IL +D EKAFD +   F+++
Subjt:  IHVRDFRLISLTTITYKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDY-QAKGKKGWILKLDLEKAFDRVDWGFLVK

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 5.6e-20; % identity: 30.6)
Query:  EEVLKIALPDIGWAKEERLLREL-EIIDGFAESVGLNEVELAHKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILWFYEDLYK-KILGVRYLPFIAD-
        +E++K+   +I   +  R ++ + +    F E +   +  LA +     + K LI K+ +E+G IT    +I+  I  FY+ LY  K+  +  +    D 
Subjt:  EEVLKIALPDIGWAKEERLLREL-EIIDGFAESVGLNEVELAHKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILWFYEDLYK-KILGVRYLPFIAD-

Query:  --WPCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLIQK-KEDAIHVRDFRLISL
           P ++  Q   L +  S  EI   I +L   K+P PDGF+ EF+      L   L +LF +    G L     E  I LI K ++D   + +FR ISL
Subjt:  --WPCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLIQK-KEDAIHVRDFRLISL

Query:  TTITYKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDY-QAKGKKGWILKLDLEKAFDRVDWGFLVKV
          I  K++ K+LA+R++  + +II P Q  FI G Q    I  +  V+    + K K   I+ LD EKAFD++   F++KV
Subjt:  TTITYKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDY-QAKGKKGWILKLDLEKAFDRVDWGFLVKV

P14381 Transposon TX1 uncharacterized 149 kDa protein (e value: 1.3e-21; % identity: 33.33)
Query:  PCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLIQKKEDAIHVRDFRLISLTTIT
        P VS  +K  L    +  E+ +A++ +  NK+P  DG T EFF   W  L     R+  E  + G+L    +   + L+ KK D   ++++R +SL +  
Subjt:  PCVSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLIQKKEDAIHVRDFRLISLTTIT

Query:  YKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDYQAKGKKGWILKLDLEKAFDRVDWGFLV
        YK+VAK ++ RLK V+  +I P QS  + GR + D + +  +++   +  G     L LD EKAFDRVD  +L+
Subjt:  YKVVAKVLADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDYQAKGKKGWILKLDLEKAFDRVDWGFLV

Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein (e value: 1.3e-11; % identity: 30.95)
Query:  HKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILWFYEDLY---KKIL---GVRYLPFIADWPC-VSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPD
        HK + A + KNLI  L  +  V  ++   ++ +I+ +Y  L      IL    V+ +  I  + C  +++ + S + S    EI  A+ A+  NKAP PD
Subjt:  HKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILWFYEDLY---KKIL---GVRYLPFIADWPC-VSVSQKFSLINSFSAVEIFKAIKALGGNKAPRPD

Query:  GFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLIQKKEDAIHVRDFRLISLTTITYKVV
         FT EFF + W V+K+  +   +EF R G L        I LI K      +  FR +S  T+ YK++
Subjt:  GFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLIQKKEDAIHVRDFRLISLTTITYKVV

AT4G20520.1 RNA binding;RNA-directed DNA polymerases (e value: 3.1e-10; % identity: 47.06)
Query:  LADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDY-QAKGKKGW-ILKLDLEKAFDRVDWGFL
        + +RLK +M ++I P Q++FI GR   D I+   E V    + KG KGW +LKLDLEKA+DR+ W +L
Subjt:  LADRLKGVMDSIISPFQSAFIEGRQVLDPILIANEVVEDY-QAKGKKGW-ILKLDLEKAFDRVDWGFL


Sequences
CDS sequence
ATGAAGATCATTTCCTGGAACACAAGAGGCCTAAAAGATCCAAATAAACGTTCCGCTCTTAAGAAGTTTATAAAAAATCATCACCCGGACATGGTGCTAATCCAAGAATC
AAAAATGGAAATTCTGGAAGTAAATTTCATTAAAACAATTTGGAGTTCTATGGATATAGGATGGGAATCAGTGGAATCTTATGGCGCTTCTGGAGGCATTCTTACCCTAT
GGGATAAAAGTAAAATCACAGTGGTGGAAACCATAAGAAGACATTATTCCCTCTCAATAAAATGTGTAACTTTGTGCAAGAAGTGTTGTTGGGTTACCAATGTTTATGGT
CCATGTGGTTACAGAGAGAGGAAACTTGTTTGGCCAGAATTATTAACAATTGCAGAATGTGGAGAAGAGGCTTGGTCTTTGGGAGGAGATTTTAATATCACTAGATGGGT
CTATGAGAGGTTTCCAGTTGGCAGAAGCACAAGAGGAATGAGACAGTTTAATGCCTTTATAGATTCTGCCAATCTAATGGAAATTTCCCTTCAAAATGGCAAATTTACTT
GGTCAAGAGAGGATCGCAGTGCTGCAAGATCTCTGTTGGACAGATTTTTTATTAACAGTAAATGGGATGATGAATTTGAAAACTCAAGAATAAAAGAGTGCAACAAGGTT
ATTGAGGAAGTCTTGAAAATCGCCCTCCCAGACATTGGGTGGGCTAAAGAGGAAAGGCTTTTGAGAGAGTTAGAAATCATTGATGGATTTGCAGAAAGTGTTGGTCTGAA
TGAAGTGGAATTGGCTCACAAATTCCTTAATGCAAAAAAAAGGAAAAACCTAATCACAAAATTGGTGGATGAGCAAGGGGTGATAACGAAGTCTTTTCTCGACATTGAAA
GGGTGATATTGTGGTTCTATGAAGACCTGTATAAAAAAATTCTAGGAGTCAGATATCTTCCTTTTATTGCAGATTGGCCTTGTGTCTCTGTTTCTCAAAAATTTTCACTA
ATCAACAGTTTCTCTGCGGTGGAGATTTTCAAGGCAATTAAAGCATTAGGAGGCAATAAAGCACCCAGACCGGATGGTTTTACTACAGAATTTTTTGTGAAACACTGGCC
AGTTCTAAAAGAAGGTTTGATGAGATTGTTTGAAGAATTCCACAGAAATGGTAAGCTCAATGCTTGCATTCAAGAAAACTTCATTTGCTTAATTCAGAAGAAAGAAGATG
CGATTCATGTTAGAGACTTCAGACTCATTAGTCTCACAACCATCACTTACAAGGTGGTTGCCAAGGTACTAGCAGATAGGCTTAAAGGGGTAATGGATTCAATAATCAGC
CCTTTTCAAAGCGCGTTCATTGAAGGTAGACAGGTTTTAGACCCCATTCTTATAGCTAACGAAGTTGTAGAAGACTATCAAGCCAAAGGGAAAAAAGGGTGGATTTTGAA
ACTAGACCTTGAAAAGGCCTTTGATAGGGTGGATTGGGGCTTCCTCGTAAAAGTTTAG
mRNA sequence
ATGAAGATCATTTCCTGGAACACAAGAGGCCTAAAAGATCCAAATAAACGTTCCGCTCTTAAGAAGTTTATAAAAAATCATCACCCGGACATGGTGCTAATCCAAGAATC
AAAAATGGAAATTCTGGAAGTAAATTTCATTAAAACAATTTGGAGTTCTATGGATATAGGATGGGAATCAGTGGAATCTTATGGCGCTTCTGGAGGCATTCTTACCCTAT
GGGATAAAAGTAAAATCACAGTGGTGGAAACCATAAGAAGACATTATTCCCTCTCAATAAAATGTGTAACTTTGTGCAAGAAGTGTTGTTGGGTTACCAATGTTTATGGT
CCATGTGGTTACAGAGAGAGGAAACTTGTTTGGCCAGAATTATTAACAATTGCAGAATGTGGAGAAGAGGCTTGGTCTTTGGGAGGAGATTTTAATATCACTAGATGGGT
CTATGAGAGGTTTCCAGTTGGCAGAAGCACAAGAGGAATGAGACAGTTTAATGCCTTTATAGATTCTGCCAATCTAATGGAAATTTCCCTTCAAAATGGCAAATTTACTT
GGTCAAGAGAGGATCGCAGTGCTGCAAGATCTCTGTTGGACAGATTTTTTATTAACAGTAAATGGGATGATGAATTTGAAAACTCAAGAATAAAAGAGTGCAACAAGGTT
ATTGAGGAAGTCTTGAAAATCGCCCTCCCAGACATTGGGTGGGCTAAAGAGGAAAGGCTTTTGAGAGAGTTAGAAATCATTGATGGATTTGCAGAAAGTGTTGGTCTGAA
TGAAGTGGAATTGGCTCACAAATTCCTTAATGCAAAAAAAAGGAAAAACCTAATCACAAAATTGGTGGATGAGCAAGGGGTGATAACGAAGTCTTTTCTCGACATTGAAA
GGGTGATATTGTGGTTCTATGAAGACCTGTATAAAAAAATTCTAGGAGTCAGATATCTTCCTTTTATTGCAGATTGGCCTTGTGTCTCTGTTTCTCAAAAATTTTCACTA
ATCAACAGTTTCTCTGCGGTGGAGATTTTCAAGGCAATTAAAGCATTAGGAGGCAATAAAGCACCCAGACCGGATGGTTTTACTACAGAATTTTTTGTGAAACACTGGCC
AGTTCTAAAAGAAGGTTTGATGAGATTGTTTGAAGAATTCCACAGAAATGGTAAGCTCAATGCTTGCATTCAAGAAAACTTCATTTGCTTAATTCAGAAGAAAGAAGATG
CGATTCATGTTAGAGACTTCAGACTCATTAGTCTCACAACCATCACTTACAAGGTGGTTGCCAAGGTACTAGCAGATAGGCTTAAAGGGGTAATGGATTCAATAATCAGC
CCTTTTCAAAGCGCGTTCATTGAAGGTAGACAGGTTTTAGACCCCATTCTTATAGCTAACGAAGTTGTAGAAGACTATCAAGCCAAAGGGAAAAAAGGGTGGATTTTGAA
ACTAGACCTTGAAAAGGCCTTTGATAGGGTGGATTGGGGCTTCCTCGTAAAAGTTTAG
Protein sequence
MKIISWNTRGLKDPNKRSALKKFIKNHHPDMVLIQESKMEILEVNFIKTIWSSMDIGWESVESYGASGGILTLWDKSKITVVETIRRHYSLSIKCVTLCKKCCWVTNVYG
PCGYRERKLVWPELLTIAECGEEAWSLGGDFNITRWVYERFPVGRSTRGMRQFNAFIDSANLMEISLQNGKFTWSREDRSAARSLLDRFFINSKWDDEFENSRIKECNKV
IEEVLKIALPDIGWAKEERLLRELEIIDGFAESVGLNEVELAHKFLNAKKRKNLITKLVDEQGVITKSFLDIERVILWFYEDLYKKILGVRYLPFIADWPCVSVSQKFSL
INSFSAVEIFKAIKALGGNKAPRPDGFTTEFFVKHWPVLKEGLMRLFEEFHRNGKLNACIQENFICLIQKKEDAIHVRDFRLISLTTITYKVVAKVLADRLKGVMDSIIS
PFQSAFIEGRQVLDPILIANEVVEDYQAKGKKGWILKLDLEKAFDRVDWGFLVKV
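
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The sketch below is one possible way to do that, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The `translate` helper and the `CODON_TABLE` name are illustrative and not part of CuGenDBv2.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; a single terminal stop codon is dropped."""
    cds = "".join(cds.split()).upper()  # remove line breaks from the wrapped record
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First ten codons of the CDS listed above.
print(translate("ATGAAGATCATTTCCTGGAACACAAGAGGC"))
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence gives a quick consistency check. Ambiguous bases such as `N` are not handled here and would raise a `KeyError`.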