; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg019031 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg019031
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold12:25119927..25133727
RNA-Seq ExpressionSpg019031
SyntenySpg019031
Gene Ontology termsGO:0007064 - mitotic sister chromatid cohesion (biological process)
GO:0050789 - regulation of biological process (biological process)
GO:0031390 - Ctf18 RFC-like complex (cellular component)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005135 - Endonuclease/exonuclease/phosphatase
IPR018607 - Chromosome transmission fidelity protein 8
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW64408.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]1.8e-8744.81Show/hide
Query:  ILMIIISWNVRGMGSWKKRALIKDFLSSQNPTLVIIQETKMAGIDRKIIKSVWSSRNIAWASIDAVDASGGILILWNESSFAVKEIVEGIFSLSIHLSLA
        + M I+SWN RG+GS KKR +++ FLS+QNP +V++QETK    DR+ + SVW  + + WA++ A  ASGGI+ILW+ S     E V G FS+++  +  
Subjt:  ILMIIISWNVRGMGSWKKRALIKDFLSSQNPTLVIIQETKMAGIDRKIIKSVWSSRNIAWASIDAVDASGGILILWNESSFAVKEIVEGIFSLSIHLSLA

Query:  DDFSFWITGVYGPNSSKERRIFWKELTDLQALCPPNWILGGDFNVIRWTWEKSSYTVPTRAMKKFNRFIVNKGLIDIPLANGKYTWSSYRPNPTMTLIDR
        ++ SFW+T VYGP +   R+ FW EL DL  L  P W +GGDFNVIR   EK   T  T  M+ F+ FI   GLID PL N  +TWS+ + +P    +DR
Subjt:  DDFSFWITGVYGPNSSKERRIFWKELTDLQALCPPNWILGGDFNVIRWTWEKSSYTVPTRAMKKFNRFIVNKGLIDIPLANGKYTWSSYRPNPTMTLIDR

Query:  FLITDNIATKFQAALVRRLPRITSDHFPISLSLGKEKWGPPPYRFINAWLNHHSFLRTVDQWWKSVPSLWWPSHGFIQKLKGLKKELKQWNQSTFGQQKE
        FL +    T F  +    LPR TSDH PI L     KWGP P+RF N WL H  F      WW       W  H F++KLK +K +LK+WN  TFG  KE
Subjt:  FLITDNIATKFQAALVRRLPRITSDHFPISLSLGKEKWGPPPYRFINAWLNHHSFLRTVDQWWKSVPSLWWPSHGFIQKLKGLKKELKQWNQSTFGQQKE

Query:  KKSCLSQELSCLDLKEEQGNLTEQESYRRTEIKADLLSISANEEMLWRQSCKVKWLKEGDVNSAFF
        +K  +  +LS +DL E++GNL       RT  + +L  +   EE+ WRQ  +VKW+KEGD NS FF
Subjt:  KKSCLSQELSCLDLKEEQGNLTEQESYRRTEIKADLLSISANEEMLWRQSCKVKWLKEGDVNSAFF

RVW64408.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]5.4e-2328.14Show/hide
Query:  PWRAIHKSKYLIYDHIDIRVGKGDKTLFWEDIWLGSSPLQSKYPSLFS-LSLKKDALIVDLWEPTNGAWNLHLRRHLQDSEILEWALLSHQLSSFS----
        PW+AI +        + + VG G++  FWED+W G+  L S++  L+  +S+K   +   L      AWNL+ RR+L DSEI    LL   +SS S    
Subjt:  PWRAIHKSKYLIYDHIDIRVGKGDKTLFWEDIWLGSSPLQSKYPSLFS-LSLKKDALIVDLWEPTNGAWNLHLRRHLQDSEILEWALLSHQLSSFS----

Query:  FNNIEDT---------------------------------WLWKGLMPKKVKFFMWELSHRCINTADVIQRRFPNSSLSPRYCCLCNKAAESQIHIFSRC
        F ++ D+                                 +LW   +P KVK   W ++H  +NT D +Q R P  SL P++C LC    ES  H+F  C
Subjt:  FNNIEDT---------------------------------WLWKGLMPKKVKFFMWELSHRCINTADVIQRRFPNSSLSPRYCCLCNKAAESQIHIFSRC

Query:  EYAAALWDHIQSAFG--WQFARSGDVLSLLQFTLLGHPFKNDTKVQWRNFLYAFFGNLWLERNARIFNNQQQNVYAFIESTSYLAMYWSSHISPF
             LW+ +    G  W   RS + + ++ F  LG+  +   K  W+         +W ERN RIF ++ ++     +   + +  W+S  + F
Subjt:  EYAAALWDHIQSAFG--WQFARSGDVLSLLQFTLLGHPFKNDTKVQWRNFLYAFFGNLWLERNARIFNNQQQNVYAFIESTSYLAMYWSSHISPF

RVW64408.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]1.2e-8644.78Show/hide
Query:  MIIISWNVRGMGSWKKRALIKDFLSSQNPTLVIIQETKMAGIDRKIIKSVWSSRNIAWASIDAVDASGGILILWNESSFAVKEIVEGIFSLSIHLSLADD
        M IISWN RG+GS KKR ++K+FLSS+ P +V+IQETK    DR+++ SVWS RN  WA++ A  ASGGILI+W+      +E+V G FS+SI  ++   
Subjt:  MIIISWNVRGMGSWKKRALIKDFLSSQNPTLVIIQETKMAGIDRKIIKSVWSSRNIAWASIDAVDASGGILILWNESSFAVKEIVEGIFSLSIHLSLADD

Query:  FSFWITGVYGPNSSKERRIFWKELTDLQALCPPNWILGGDFNVIRWTWEKSSYTVPTRAMKKFNRFIVNKGLIDIPLANGKYTWSSYRPNPTMTLIDRFL
         S W++ VYGPN+S  R+ FW EL+D+  L  P W +GGDFNVIR + EK   +  T  MK F+ FI +  LID PL +  YTWS+ + NP    +DRFL
Subjt:  FSFWITGVYGPNSSKERRIFWKELTDLQALCPPNWILGGDFNVIRWTWEKSSYTVPTRAMKKFNRFIVNKGLIDIPLANGKYTWSSYRPNPTMTLIDRFL

Query:  ITDNIATKFQAALVRRLPRITSDHFPISLSLGKEKWGPPPYRFINAWLNHHSFLRTVDQWWKSVPSLWWPSHGFIQKLKGLKKELKQWNQSTFGQQKEKK
         ++     F  +L   LPR TSDH+PI L     KWGP P+RF N WL H SF     +WW       W  H F++KL+ +K +LK+WN+++FG+  +KK
Subjt:  ITDNIATKFQAALVRRLPRITSDHFPISLSLGKEKWGPPPYRFINAWLNHHSFLRTVDQWWKSVPSLWWPSHGFIQKLKGLKKELKQWNQSTFGQQKEKK

Query:  SCLSQELSCLDLKEEQGNLTEQESYRRTEIKADLLSISANEEMLWRQSCKVKWLKEGDVNSAFF
          +   L+  D  E++G L+++   +R   K +L  +   EE+ WRQ  +VKW+K+GD NS FF
Subjt:  SCLSQELSCLDLKEEQGNLTEQESYRRTEIKADLLSISANEEMLWRQSCKVKWLKEGDVNSAFF

RVX15530.1 putative ribonuclease H protein [Vitis vinifera]8.2e-8845.03Show/hide
Query:  IISWNVRGMGSWKKRALIKDFLSSQNPTLVIIQETKMAGIDRKIIKSVWSSRNIAWASIDAVDASGGILILWNESSFAVKEIVEGIFSLSIHLSLADDFS
        I+SWN RG+GS KKR +++ FLS+QNP +V++QETK    DR+ + SVW  + + WA++ A  ASGGI+ILW+ S F   E V G FS+++  +  ++ S
Subjt:  IISWNVRGMGSWKKRALIKDFLSSQNPTLVIIQETKMAGIDRKIIKSVWSSRNIAWASIDAVDASGGILILWNESSFAVKEIVEGIFSLSIHLSLADDFS

Query:  FWITGVYGPNSSKERRIFWKELTDLQALCPPNWILGGDFNVIRWTWEKSSYTVPTRAMKKFNRFIVNKGLIDIPLANGKYTWSSYRPNPTMTLIDRFLIT
        FW+T VYGP +   R+ FW EL DL  L  P W +GGDFNVIR   EK   T  T  M+ F+ FI   GL+D PL N  +TWS+ + +P    +DRFL +
Subjt:  FWITGVYGPNSSKERRIFWKELTDLQALCPPNWILGGDFNVIRWTWEKSSYTVPTRAMKKFNRFIVNKGLIDIPLANGKYTWSSYRPNPTMTLIDRFLIT

Query:  DNIATKFQAALVRRLPRITSDHFPISLSLGKEKWGPPPYRFINAWLNHHSFLRTVDQWWKSVPSLWWPSHGFIQKLKGLKKELKQWNQSTFGQQKEKKSC
            T F  +    LPR TSDH PI L     KWGP P+RF N WL H  F      WW+      W  H F++KLK +K +LK+WN  TFG  KE+K  
Subjt:  DNIATKFQAALVRRLPRITSDHFPISLSLGKEKWGPPPYRFINAWLNHHSFLRTVDQWWKSVPSLWWPSHGFIQKLKGLKKELKQWNQSTFGQQKEKKSC

Query:  LSQELSCLDLKEEQGNLTEQESYRRTEIKADLLSISANEEMLWRQSCKVKWLKEGDVNSAFF
        +  +LS +DL E++GNL       RT  + +L  +   EE+ WRQ  +VKW+KEGD NS FF
Subjt:  LSQELSCLDLKEEQGNLTEQESYRRTEIKADLLSISANEEMLWRQSCKVKWLKEGDVNSAFF

RVX15530.1 putative ribonuclease H protein [Vitis vinifera]9.6e-1228.57Show/hide
Query:  ILEWALLSHQLSS-----FSFNNIED-------TWLWKGLMPKKVKFFMWELSHRCINTADVIQRRFPNSSLSPRYCCLCNKAAESQIHIFSRCEYAAAL
        ILEW+L S  L S      + + + +        +LW   +P KVK   W ++H  +NT D +Q R P  SL P++C LC    ES  H+F  C     L
Subjt:  ILEWALLSHQLSS-----FSFNNIED-------TWLWKGLMPKKVKFFMWELSHRCINTADVIQRRFPNSSLSPRYCCLCNKAAESQIHIFSRCEYAAAL

Query:  WDHIQSAFG--WQFARSGDVLSLLQFTLLGHPFKNDTKVQWRNFLYAFFGNLWLERNARIFNNQQQNVYAFIESTSYLAMYWSSHISPF
        W+ +    G  W   RS + + ++ F  LG+  +   K  W+         +W ERN RIF ++ ++     +   + +  W+S  + F
Subjt:  WDHIQSAFG--WQFARSGDVLSLLQFTLLGHPFKNDTKVQWRNFLYAFFGNLWLERNARIFNNQQQNVYAFIESTSYLAMYWSSHISPF

RVX15530.1 putative ribonuclease H protein [Vitis vinifera]1.1e-8744.21Show/hide
Query:  CEKGRLSYG-KVDCILMIIISWNVRGMGSWKKRALIKDFLSSQNPTLVIIQETKMAGIDRKIIKSVWSSRNIAWASIDAVDASGGILILWNESSFAVKEI
        CE   +  G +V    M IISWN RG+GS KKR ++K+FLSS+ P +V+IQETK    DR+++ SVWS RN  WA++ A  ASGGILI+W+      +E+
Subjt:  CEKGRLSYG-KVDCILMIIISWNVRGMGSWKKRALIKDFLSSQNPTLVIIQETKMAGIDRKIIKSVWSSRNIAWASIDAVDASGGILILWNESSFAVKEI

Query:  VEGIFSLSIHLSLADDFSFWITGVYGPNSSKERRIFWKELTDLQALCPPNWILGGDFNVIRWTWEKSSYTVPTRAMKKFNRFIVNKGLIDIPLANGKYTW
        V G FS+SI  ++    S W++ VYGPN+S  R+ FW EL+D+  L  P W +GGDFNVIR + EK   +  T  MK F+ FI +  LID PL +  YTW
Subjt:  VEGIFSLSIHLSLADDFSFWITGVYGPNSSKERRIFWKELTDLQALCPPNWILGGDFNVIRWTWEKSSYTVPTRAMKKFNRFIVNKGLIDIPLANGKYTW

Query:  SSYRPNPTMTLIDRFLITDNIATKFQAALVRRLPRITSDHFPISLSLGKEKWGPPPYRFINAWLNHHSFLRTVDQWWKSVPSLWWPSHGFIQKLKGLKKE
        S+ + NP    +DRFL ++     F  +L   LPR TSDH+PI L     KWGP P+RF N WL H SF     +WW       W  H F++KL+ +K +
Subjt:  SSYRPNPTMTLIDRFLITDNIATKFQAALVRRLPRITSDHFPISLSLGKEKWGPPPYRFINAWLNHHSFLRTVDQWWKSVPSLWWPSHGFIQKLKGLKKE

Query:  LKQWNQSTFGQQKEKKSCLSQELSCLDLKEEQGNLTEQESYRRTEIKADLLSISANEEMLWRQSCKVKWLKEGDVNSAFF
        LK+WN+++FG+  +KK  +   L+  D  E++G L+++   +R   K +L  +   EE+ WRQ  +VKW+KEGD NS FF
Subjt:  LKQWNQSTFGQQKEKKSCLSQELSCLDLKEEQGNLTEQESYRRTEIKADLLSISANEEMLWRQSCKVKWLKEGDVNSAFF

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]1.4e-9546.43Show/hide
Query:  MIIISWNVRGMGSWKKRALIKDFLSSQNPTLVIIQETKMAGIDRKIIKSVWSSRNIAWASIDAVDASGGILILWNESSFAVKEIVEGIFSLSIHLSLADD
        M  ++WNVRG+ SWKK ALIK F+S  NP +VI+QETK++ +D  I+KS+WS+  I W+++DA   + GILILWN+      E++EG+FSL+I+  L+D 
Subjt:  MIIISWNVRGMGSWKKRALIKDFLSSQNPTLVIIQETKMAGIDRKIIKSVWSSRNIAWASIDAVDASGGILILWNESSFAVKEIVEGIFSLSIHLSLADD

Query:  FSFWITGVYGPNSSKERRIFWKELTDLQALCPPNWILGGDFNVIRWTWEKSSYTVPTRAMKKFNRFIVNKGLIDIPLANGKYTWSSYRPNPTMTLIDRFL
        F FW++G+YGP++++   +FW+EL DL  LC  +WIL GDFNV RW+WEKS+    T++M  FN FI +  LID+PL NG++TWS    N + +LID FL
Subjt:  FSFWITGVYGPNSSKERRIFWKELTDLQALCPPNWILGGDFNVIRWTWEKSSYTVPTRAMKKFNRFIVNKGLIDIPLANGKYTWSSYRPNPTMTLIDRFL

Query:  ITDNIATKFQAALVRRLPRITSDHFPISLSLGKEKWGPPPYRFINAWLNHHSFLRTVDQWWKSVPSLWWPSHGFIQKLKGLKKELKQWNQSTFGQQKEKK
        +T+    K    + +R+ R TSDHFPI L  G+  WG  P+RF N WL+H +F   ++ WW + P   WP HG + KLK LK  +K W    F     +K
Subjt:  ITDNIATKFQAALVRRLPRITSDHFPISLSLGKEKWGPPPYRFINAWLNHHSFLRTVDQWWKSVPSLWWPSHGFIQKLKGLKKELKQWNQSTFGQQKEKK

Query:  SCLSQELSCLDLKEEQGNLTEQESYRRTEIKADLLSISANEEMLWRQSCKVKWLKEGDVNSAFF
          L+  ++ LD  E    +T  +S  R + K DLLS+ A EE  WRQ CK KWL EGD N+ FF
Subjt:  SCLSQELSCLDLKEEQGNLTEQESYRRTEIKADLLSISANEEMLWRQSCKVKWLKEGDVNSAFF

TrEMBL top hitse value%identityAlignment
A0A438FWU5 LINE-1 retrotransposable element ORF2 protein8.9e-8844.81Show/hide
Query:  ILMIIISWNVRGMGSWKKRALIKDFLSSQNPTLVIIQETKMAGIDRKIIKSVWSSRNIAWASIDAVDASGGILILWNESSFAVKEIVEGIFSLSIHLSLA
        + M I+SWN RG+GS KKR +++ FLS+QNP +V++QETK    DR+ + SVW  + + WA++ A  ASGGI+ILW+ S     E V G FS+++  +  
Subjt:  ILMIIISWNVRGMGSWKKRALIKDFLSSQNPTLVIIQETKMAGIDRKIIKSVWSSRNIAWASIDAVDASGGILILWNESSFAVKEIVEGIFSLSIHLSLA

Query:  DDFSFWITGVYGPNSSKERRIFWKELTDLQALCPPNWILGGDFNVIRWTWEKSSYTVPTRAMKKFNRFIVNKGLIDIPLANGKYTWSSYRPNPTMTLIDR
        ++ SFW+T VYGP +   R+ FW EL DL  L  P W +GGDFNVIR   EK   T  T  M+ F+ FI   GLID PL N  +TWS+ + +P    +DR
Subjt:  DDFSFWITGVYGPNSSKERRIFWKELTDLQALCPPNWILGGDFNVIRWTWEKSSYTVPTRAMKKFNRFIVNKGLIDIPLANGKYTWSSYRPNPTMTLIDR

Query:  FLITDNIATKFQAALVRRLPRITSDHFPISLSLGKEKWGPPPYRFINAWLNHHSFLRTVDQWWKSVPSLWWPSHGFIQKLKGLKKELKQWNQSTFGQQKE
        FL +    T F  +    LPR TSDH PI L     KWGP P+RF N WL H  F      WW       W  H F++KLK +K +LK+WN  TFG  KE
Subjt:  FLITDNIATKFQAALVRRLPRITSDHFPISLSLGKEKWGPPPYRFINAWLNHHSFLRTVDQWWKSVPSLWWPSHGFIQKLKGLKKELKQWNQSTFGQQKE

Query:  KKSCLSQELSCLDLKEEQGNLTEQESYRRTEIKADLLSISANEEMLWRQSCKVKWLKEGDVNSAFF
        +K  +  +LS +DL E++GNL       RT  + +L  +   EE+ WRQ  +VKW+KEGD NS FF
Subjt:  KKSCLSQELSCLDLKEEQGNLTEQESYRRTEIKADLLSISANEEMLWRQSCKVKWLKEGDVNSAFF

A0A438FWU5 LINE-1 retrotransposable element ORF2 protein2.6e-2328.14Show/hide
Query:  PWRAIHKSKYLIYDHIDIRVGKGDKTLFWEDIWLGSSPLQSKYPSLFS-LSLKKDALIVDLWEPTNGAWNLHLRRHLQDSEILEWALLSHQLSSFS----
        PW+AI +        + + VG G++  FWED+W G+  L S++  L+  +S+K   +   L      AWNL+ RR+L DSEI    LL   +SS S    
Subjt:  PWRAIHKSKYLIYDHIDIRVGKGDKTLFWEDIWLGSSPLQSKYPSLFS-LSLKKDALIVDLWEPTNGAWNLHLRRHLQDSEILEWALLSHQLSSFS----

Query:  FNNIEDT---------------------------------WLWKGLMPKKVKFFMWELSHRCINTADVIQRRFPNSSLSPRYCCLCNKAAESQIHIFSRC
        F ++ D+                                 +LW   +P KVK   W ++H  +NT D +Q R P  SL P++C LC    ES  H+F  C
Subjt:  FNNIEDT---------------------------------WLWKGLMPKKVKFFMWELSHRCINTADVIQRRFPNSSLSPRYCCLCNKAAESQIHIFSRC

Query:  EYAAALWDHIQSAFG--WQFARSGDVLSLLQFTLLGHPFKNDTKVQWRNFLYAFFGNLWLERNARIFNNQQQNVYAFIESTSYLAMYWSSHISPF
             LW+ +    G  W   RS + + ++ F  LG+  +   K  W+         +W ERN RIF ++ ++     +   + +  W+S  + F
Subjt:  EYAAALWDHIQSAFG--WQFARSGDVLSLLQFTLLGHPFKNDTKVQWRNFLYAFFGNLWLERNARIFNNQQQNVYAFIESTSYLAMYWSSHISPF

A0A438FWU5 LINE-1 retrotransposable element ORF2 protein5.8e-8744.78Show/hide
Query:  MIIISWNVRGMGSWKKRALIKDFLSSQNPTLVIIQETKMAGIDRKIIKSVWSSRNIAWASIDAVDASGGILILWNESSFAVKEIVEGIFSLSIHLSLADD
        M IISWN RG+GS KKR ++K+FLSS+ P +V+IQETK    DR+++ SVWS RN  WA++ A  ASGGILI+W+      +E+V G FS+SI  ++   
Subjt:  MIIISWNVRGMGSWKKRALIKDFLSSQNPTLVIIQETKMAGIDRKIIKSVWSSRNIAWASIDAVDASGGILILWNESSFAVKEIVEGIFSLSIHLSLADD

Query:  FSFWITGVYGPNSSKERRIFWKELTDLQALCPPNWILGGDFNVIRWTWEKSSYTVPTRAMKKFNRFIVNKGLIDIPLANGKYTWSSYRPNPTMTLIDRFL
         S W++ VYGPN+S  R+ FW EL+D+  L  P W +GGDFNVIR + EK   +  T  MK F+ FI +  LID PL +  YTWS+ + NP    +DRFL
Subjt:  FSFWITGVYGPNSSKERRIFWKELTDLQALCPPNWILGGDFNVIRWTWEKSSYTVPTRAMKKFNRFIVNKGLIDIPLANGKYTWSSYRPNPTMTLIDRFL

Query:  ITDNIATKFQAALVRRLPRITSDHFPISLSLGKEKWGPPPYRFINAWLNHHSFLRTVDQWWKSVPSLWWPSHGFIQKLKGLKKELKQWNQSTFGQQKEKK
         ++     F  +L   LPR TSDH+PI L     KWGP P+RF N WL H SF     +WW       W  H F++KL+ +K +LK+WN+++FG+  +KK
Subjt:  ITDNIATKFQAALVRRLPRITSDHFPISLSLGKEKWGPPPYRFINAWLNHHSFLRTVDQWWKSVPSLWWPSHGFIQKLKGLKKELKQWNQSTFGQQKEKK

Query:  SCLSQELSCLDLKEEQGNLTEQESYRRTEIKADLLSISANEEMLWRQSCKVKWLKEGDVNSAFF
          +   L+  D  E++G L+++   +R   K +L  +   EE+ WRQ  +VKW+K+GD NS FF
Subjt:  SCLSQELSCLDLKEEQGNLTEQESYRRTEIKADLLSISANEEMLWRQSCKVKWLKEGDVNSAFF

A0A438K2W1 Putative ribonuclease H protein4.0e-8845.03Show/hide
Query:  IISWNVRGMGSWKKRALIKDFLSSQNPTLVIIQETKMAGIDRKIIKSVWSSRNIAWASIDAVDASGGILILWNESSFAVKEIVEGIFSLSIHLSLADDFS
        I+SWN RG+GS KKR +++ FLS+QNP +V++QETK    DR+ + SVW  + + WA++ A  ASGGI+ILW+ S F   E V G FS+++  +  ++ S
Subjt:  IISWNVRGMGSWKKRALIKDFLSSQNPTLVIIQETKMAGIDRKIIKSVWSSRNIAWASIDAVDASGGILILWNESSFAVKEIVEGIFSLSIHLSLADDFS

Query:  FWITGVYGPNSSKERRIFWKELTDLQALCPPNWILGGDFNVIRWTWEKSSYTVPTRAMKKFNRFIVNKGLIDIPLANGKYTWSSYRPNPTMTLIDRFLIT
        FW+T VYGP +   R+ FW EL DL  L  P W +GGDFNVIR   EK   T  T  M+ F+ FI   GL+D PL N  +TWS+ + +P    +DRFL +
Subjt:  FWITGVYGPNSSKERRIFWKELTDLQALCPPNWILGGDFNVIRWTWEKSSYTVPTRAMKKFNRFIVNKGLIDIPLANGKYTWSSYRPNPTMTLIDRFLIT

Query:  DNIATKFQAALVRRLPRITSDHFPISLSLGKEKWGPPPYRFINAWLNHHSFLRTVDQWWKSVPSLWWPSHGFIQKLKGLKKELKQWNQSTFGQQKEKKSC
            T F  +    LPR TSDH PI L     KWGP P+RF N WL H  F      WW+      W  H F++KLK +K +LK+WN  TFG  KE+K  
Subjt:  DNIATKFQAALVRRLPRITSDHFPISLSLGKEKWGPPPYRFINAWLNHHSFLRTVDQWWKSVPSLWWPSHGFIQKLKGLKKELKQWNQSTFGQQKEKKSC

Query:  LSQELSCLDLKEEQGNLTEQESYRRTEIKADLLSISANEEMLWRQSCKVKWLKEGDVNSAFF
        +  +LS +DL E++GNL       RT  + +L  +   EE+ WRQ  +VKW+KEGD NS FF
Subjt:  LSQELSCLDLKEEQGNLTEQESYRRTEIKADLLSISANEEMLWRQSCKVKWLKEGDVNSAFF

A0A438K2W1 Putative ribonuclease H protein4.7e-1228.57Show/hide
Query:  ILEWALLSHQLSS-----FSFNNIED-------TWLWKGLMPKKVKFFMWELSHRCINTADVIQRRFPNSSLSPRYCCLCNKAAESQIHIFSRCEYAAAL
        ILEW+L S  L S      + + + +        +LW   +P KVK   W ++H  +NT D +Q R P  SL P++C LC    ES  H+F  C     L
Subjt:  ILEWALLSHQLSS-----FSFNNIED-------TWLWKGLMPKKVKFFMWELSHRCINTADVIQRRFPNSSLSPRYCCLCNKAAESQIHIFSRCEYAAAL

Query:  WDHIQSAFG--WQFARSGDVLSLLQFTLLGHPFKNDTKVQWRNFLYAFFGNLWLERNARIFNNQQQNVYAFIESTSYLAMYWSSHISPF
        W+ +    G  W   RS + + ++ F  LG+  +   K  W+         +W ERN RIF ++ ++     +   + +  W+S  + F
Subjt:  WDHIQSAFG--WQFARSGDVLSLLQFTLLGHPFKNDTKVQWRNFLYAFFGNLWLERNARIFNNQQQNVYAFIESTSYLAMYWSSHISPF

A0A438K2W1 Putative ribonuclease H protein5.2e-8844.21Show/hide
Query:  CEKGRLSYG-KVDCILMIIISWNVRGMGSWKKRALIKDFLSSQNPTLVIIQETKMAGIDRKIIKSVWSSRNIAWASIDAVDASGGILILWNESSFAVKEI
        CE   +  G +V    M IISWN RG+GS KKR ++K+FLSS+ P +V+IQETK    DR+++ SVWS RN  WA++ A  ASGGILI+W+      +E+
Subjt:  CEKGRLSYG-KVDCILMIIISWNVRGMGSWKKRALIKDFLSSQNPTLVIIQETKMAGIDRKIIKSVWSSRNIAWASIDAVDASGGILILWNESSFAVKEI

Query:  VEGIFSLSIHLSLADDFSFWITGVYGPNSSKERRIFWKELTDLQALCPPNWILGGDFNVIRWTWEKSSYTVPTRAMKKFNRFIVNKGLIDIPLANGKYTW
        V G FS+SI  ++    S W++ VYGPN+S  R+ FW EL+D+  L  P W +GGDFNVIR + EK   +  T  MK F+ FI +  LID PL +  YTW
Subjt:  VEGIFSLSIHLSLADDFSFWITGVYGPNSSKERRIFWKELTDLQALCPPNWILGGDFNVIRWTWEKSSYTVPTRAMKKFNRFIVNKGLIDIPLANGKYTW

Query:  SSYRPNPTMTLIDRFLITDNIATKFQAALVRRLPRITSDHFPISLSLGKEKWGPPPYRFINAWLNHHSFLRTVDQWWKSVPSLWWPSHGFIQKLKGLKKE
        S+ + NP    +DRFL ++     F  +L   LPR TSDH+PI L     KWGP P+RF N WL H SF     +WW       W  H F++KL+ +K +
Subjt:  SSYRPNPTMTLIDRFLITDNIATKFQAALVRRLPRITSDHFPISLSLGKEKWGPPPYRFINAWLNHHSFLRTVDQWWKSVPSLWWPSHGFIQKLKGLKKE

Query:  LKQWNQSTFGQQKEKKSCLSQELSCLDLKEEQGNLTEQESYRRTEIKADLLSISANEEMLWRQSCKVKWLKEGDVNSAFF
        LK+WN+++FG+  +KK  +   L+  D  E++G L+++   +R   K +L  +   EE+ WRQ  +VKW+KEGD NS FF
Subjt:  LKQWNQSTFGQQKEKKSCLSQELSCLDLKEEQGNLTEQESYRRTEIKADLLSISANEEMLWRQSCKVKWLKEGDVNSAFF

A0A6J1E2G6 uncharacterized protein LOC1110254056.8e-9646.43Show/hide
Query:  MIIISWNVRGMGSWKKRALIKDFLSSQNPTLVIIQETKMAGIDRKIIKSVWSSRNIAWASIDAVDASGGILILWNESSFAVKEIVEGIFSLSIHLSLADD
        M  ++WNVRG+ SWKK ALIK F+S  NP +VI+QETK++ +D  I+KS+WS+  I W+++DA   + GILILWN+      E++EG+FSL+I+  L+D 
Subjt:  MIIISWNVRGMGSWKKRALIKDFLSSQNPTLVIIQETKMAGIDRKIIKSVWSSRNIAWASIDAVDASGGILILWNESSFAVKEIVEGIFSLSIHLSLADD

Query:  FSFWITGVYGPNSSKERRIFWKELTDLQALCPPNWILGGDFNVIRWTWEKSSYTVPTRAMKKFNRFIVNKGLIDIPLANGKYTWSSYRPNPTMTLIDRFL
        F FW++G+YGP++++   +FW+EL DL  LC  +WIL GDFNV RW+WEKS+    T++M  FN FI +  LID+PL NG++TWS    N + +LID FL
Subjt:  FSFWITGVYGPNSSKERRIFWKELTDLQALCPPNWILGGDFNVIRWTWEKSSYTVPTRAMKKFNRFIVNKGLIDIPLANGKYTWSSYRPNPTMTLIDRFL

Query:  ITDNIATKFQAALVRRLPRITSDHFPISLSLGKEKWGPPPYRFINAWLNHHSFLRTVDQWWKSVPSLWWPSHGFIQKLKGLKKELKQWNQSTFGQQKEKK
        +T+    K    + +R+ R TSDHFPI L  G+  WG  P+RF N WL+H +F   ++ WW + P   WP HG + KLK LK  +K W    F     +K
Subjt:  ITDNIATKFQAALVRRLPRITSDHFPISLSLGKEKWGPPPYRFINAWLNHHSFLRTVDQWWKSVPSLWWPSHGFIQKLKGLKKELKQWNQSTFGQQKEKK

Query:  SCLSQELSCLDLKEEQGNLTEQESYRRTEIKADLLSISANEEMLWRQSCKVKWLKEGDVNSAFF
          L+  ++ LD  E    +T  +S  R + K DLLS+ A EE  WRQ CK KWL EGD N+ FF
Subjt:  SCLSQELSCLDLKEEQGNLTEQESYRRTEIKADLLSISANEEMLWRQSCKVKWLKEGDVNSAFF

SwissProt top hitse value%identityAlignment
P0CG13 Chromosome transmission fidelity protein 8 homolog1.4e-0533.01Show/hide
Query:  EWAVVELQGIVEAQPSFQDRLQNLEIGILCRSSAQEVYTFTVGYHELTGNKVPLKKPLLVLKKKRCVDEDQSGDTNSATAELEVIGIIRQRILFKSRPKA
        EW ++ELQG +EA+  +   L    +G L   + + +    VG+H L G  + L+KP  VL K    D+D             V  +I+ +ILFK+RPK 
Subjt:  EWAVVELQGIVEAQPSFQDRLQNLEIGILCRSSAQEVYTFTVGYHELTGNKVPLKKPLLVLKKKRCVDEDQSGDTNSATAELEVIGIIRQRILFKSRPKA

Query:  LIS
        +I+
Subjt:  LIS

P0CG15 Chromosome transmission fidelity protein 8 homolog1.4e-0532.04Show/hide
Query:  EWAVVELQGIVEAQPSFQDRLQNLEIGILCRSSAQEVYTFTVGYHELTGNKVPLKKPLLVLKKKRCVDEDQSGDTNSATAELEVIGIIRQRILFKSRPKA
        EW ++ELQG +EA+  +   L    +G L   + + +    VG+H L G  + L+KP  VL K     +D          +  V  +I+ +ILFK+RPK 
Subjt:  EWAVVELQGIVEAQPSFQDRLQNLEIGILCRSSAQEVYTFTVGYHELTGNKVPLKKPLLVLKKKRCVDEDQSGDTNSATAELEVIGIIRQRILFKSRPKA

Query:  LIS
        +I+
Subjt:  LIS

P11369 LINE-1 retrotransposable element ORF2 protein1.8e-0824.78Show/hide
Query:  IISWNVRGMGSWKKRALIKDFLSSQNPTLVIIQETKMAGIDRKIIKSVWSSRNIAWASI---DAVDASGGILILWNE----SSFAVKEIVEGIFSLSIHL
        +IS N+ G+ S  KR  + D+L  Q+PT   +QET +   DR  +      R   W +I   + +    G+ IL ++        +K+  EG F L    
Subjt:  IISWNVRGMGSWKKRALIKDFLSSQNPTLVIIQETKMAGIDRKIIKSVWSSRNIAWASI---DAVDASGGILILWNE----SSFAVKEIVEGIFSLSIHL

Query:  SLADDFSFWITGVYGPNSSKERRIFWKELTDLQALCPPNWILGGDFNVIRWTWEKSSYTVPTRAMKKFNRFIVNKGLIDI-----PLANGKYTWSSYRPN
         L ++ S  I  +Y PN ++        L  L+A   P+ I+ GDFN    + ++S      R   K    +    L DI     P   G YT+ S  P+
Subjt:  SLADDFSFWITGVYGPNSSKERRIFWKELTDLQALCPPNWILGGDFNVIRWTWEKSSYTVPTRAMKKFNRFIVNKGLIDI-----PLANGKYTWSSYRPN

Query:  PTMTLIDRFLITDNIATKFQAALVRRLPRITSDHFPISLSLGKEKWGPPP---YRFINAWLNH-----------HSFLRTVDQWWKSVPSLWWPSHGFIQ
         T + ID  +       +++   +  +P I SDH  + L          P   ++  N  LN              FL   +    + P+LW     F++
Subjt:  PTMTLIDRFLITDNIATKFQAALVRRLPRITSDHFPISLSLGKEKWGPPP---YRFINAWLNH-----------HSFLRTVDQWWKSVPSLWWPSHGFIQ

Query:  -KLKGLKKELKQWNQSTFGQQKEKKSCLSQELSCLDLKE
         KL  L    K+       ++    S L+  L  L+ KE
Subjt:  -KLKGLKKELKQWNQSTFGQQKEKKSCLSQELSCLDLKE

Q54JL4 Putative uncharacterized protein DDB_G02879756.7e-0831.06Show/hide
Query:  MQIKVKCNCGETKCLEWAVVELQGIVEAQPSFQDRLQNLEIGILCRS-SAQEVYTFTVGYHELTGNKVPLKKPLLVLKKKRCVDEDQSGDTNSAT-----
        MQI +K     T   EW +++LQG +E    +   L+N  +GIL +  + ++ ++F +G   L G +VPLKKPLLV+KK +  + + S + N        
Subjt:  MQIKVKCNCGETKCLEWAVVELQGIVEAQPSFQDRLQNLEIGILCRS-SAQEVYTFTVGYHELTGNKVPLKKPLLVLKKKRCVDEDQSGDTNSAT-----

Query:  ---------AELEVIGIIRQRILFKSRPKALI
                  E  + GI   +I F +RP   I
Subjt:  ---------AELEVIGIIRQRILFKSRPKALI

Q65ZA6 Chromosome transmission fidelity protein 82.6e-0436.56Show/hide
Query:  IVEAQPSFQDRLQNLEIG----ILCRSSAQEVYTFTVGYHELTGNKVPLKKPLLVLKKKRCVDEDQSGDTNSATAELEVIGIIRQRILFKSRP
        +VE Q + + +  +L IG    I  ++S ++  T TVG   + G    LKKPL VL+K    + D     +S + EL+   IIR+RI F SRP
Subjt:  IVEAQPSFQDRLQNLEIG----ILCRSSAQEVYTFTVGYHELTGNKVPLKKPLLVLKKKRCVDEDQSGDTNSATAELEVIGIIRQRILFKSRP

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein4.9e-1426.69Show/hide
Query:  ILGGDFNVIRWTWEKSSY---TVPTRAMKKFNRFIVNKGLIDIPLANGKYTWSSYR-PNPTMTLIDRFLITDNIATKFQAALVRRLPRITSDHFPISLSL
        IL GDF+ I  T +  S    ++P R +++F   + +  L+DIP     YTWS+++  NP +  +DR +   +  + F +A+        SDH P  + L
Subjt:  ILGGDFNVIRWTWEKSSY---TVPTRAMKKFNRFIVNKGLIDIPLANGKYTWSSYR-PNPTMTLIDRFLITDNIATKFQAALVRRLPRITSDHFPISLSL

Query:  -GKEKWGPPPYRFINAWLNHHSFLRTVDQWWKSVPSLWWPSHGFIQKLKGLKKELKQWNQSTFGQQKEKKSCLSQELSCLDLKEEQGNLTEQESYRRTE-
            K     +R+ +    H +FL ++   W+    +        + LK  KK  K  N+  FG  + K     + L  L+  + Q      +S  R E 
Subjt:  -GKEKWGPPPYRFINAWLNHHSFLRTVDQWWKSVPSLWWPSHGFIQKLKGLKKELKQWNQSTFGQQKEKKSCLSQELSCLDLKEEQGNLTEQESYRRTE-

Query:  -IKADLLSISANEEMLWRQSCKVKWLKEGDVNSAFF
          +      +A  E  +RQ  ++KWL++GD N+ FF
Subjt:  -IKADLLSISANEEMLWRQSCKVKWLKEGDVNSAFF

AT5G52220.1 CONTAINS InterPro DOMAIN/s: Chromosome transmission fidelity protein 8 (InterPro:IPR018607); Has 127 Blast hits to 127 proteins in 63 species: Archae - 0; Bacteria - 0; Metazoa - 70; Fungi - 17; Plants - 31; Viruses - 0; Other Eukaryotes - 9 (source: NCBI BLink).1.7e-3563.87Show/hide
Query:  MQIKVKCNCGETKCLEWAVVELQGIVEAQPSFQDRLQNLEIGILCRS-SAQEVYTFTVGYHELTGNKVPLKKPLLVLKKKRCVDEDQSGDTNSATAELEV
        M+I+VKC CGE +C EWA+VELQG+VE Q SFQ  +QNLEIG LC S S+Q  YTFTVGYHEL G+KV LKKPLLVLKK       Q  + +    ELEV
Subjt:  MQIKVKCNCGETKCLEWAVVELQGIVEAQPSFQDRLQNLEIGILCRS-SAQEVYTFTVGYHELTGNKVPLKKPLLVLKKKRCVDEDQSGDTNSATAELEV

Query:  IGIIRQRILFKSRPKALIS
        +GIIR +ILFK+RPK LIS
Subjt:  IGIIRQRILFKSRPKALIS

AT5G52220.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 18 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Chromosome transmission fidelity protein 8 (InterPro:IPR018607).1.7e-3563.87Show/hide
Query:  MQIKVKCNCGETKCLEWAVVELQGIVEAQPSFQDRLQNLEIGILCRS-SAQEVYTFTVGYHELTGNKVPLKKPLLVLKKKRCVDEDQSGDTNSATAELEV
        M+I+VKC CGE +C EWA+VELQG+VE Q SFQ  +QNLEIG LC S S+Q  YTFTVGYHEL G+KV LKKPLLVLKK       Q  + +    ELEV
Subjt:  MQIKVKCNCGETKCLEWAVVELQGIVEAQPSFQDRLQNLEIGILCRS-SAQEVYTFTVGYHELTGNKVPLKKPLLVLKKKRCVDEDQSGDTNSATAELEV

Query:  IGIIRQRILFKSRPKALIS
        +GIIR +ILFK+RPK LIS
Subjt:  IGIIRQRILFKSRPKALIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTCTTCCAGGAGTTTGGCTTTTGTCTCGTCAGAGCTTAGGCTTTCAGGATGGCTTTCGAGGAGCTTCTTCTTTATCTGCGCTTCTGGGACAGATGGGGCTTTTTGT
GGCAGAGGAAGACCTCAGCCATTTCCTATGGAGCTATGATTTTGTCCGTTCTGTCTGGGAATTTTTCTTTGATGCGTTTGGTTTGCAATTTGTTAGTCATAAAGATCATA
GGGAGATGATTGAGGAGTTCCTTCTCCATCCGTCATTTTGTGAGAAAGGGAGGTTATCTTATGGGAAGGTCGACTGTATCCTCATGATTATTATCTCCTGGAATGTGCGT
GGCATGGGCTCATGGAAAAAGAGAGCCCTTATTAAAGATTTTCTCTCTTCACAGAACCCCACTCTGGTCATTATTCAAGAAACCAAGATGGCCGGTATTGACCGAAAGAT
CATTAAATCTGTTTGGAGCTCGAGGAACATAGCCTGGGCTTCCATTGATGCCGTCGATGCCTCTGGAGGGATTTTGATCCTTTGGAATGAATCCTCTTTTGCCGTTAAGG
AGATTGTTGAAGGTATTTTCTCTCTCTCCATACATCTTTCCCTAGCTGATGATTTCTCCTTTTGGATCACAGGAGTTTATGGCCCCAATTCCTCCAAAGAGAGGCGTATA
TTCTGGAAGGAATTAACAGATCTCCAGGCTCTTTGTCCCCCTAATTGGATTTTGGGTGGCGACTTTAATGTCATCCGATGGACATGGGAAAAATCCTCTTATACAGTGCC
AACCCGAGCCATGAAGAAATTCAACCGTTTCATAGTCAACAAGGGCCTTATAGACATTCCCCTCGCCAATGGAAAGTATACTTGGTCCAGTTATCGACCCAACCCCACAA
TGACCCTCATTGATAGATTCCTCATCACTGATAATATTGCAACCAAGTTTCAAGCAGCCTTGGTCCGAAGATTGCCTAGAATCACCTCTGACCATTTCCCTATTAGCCTT
TCGTTGGGGAAAGAGAAATGGGGGCCTCCCCCTTACAGATTCATTAATGCGTGGCTCAATCATCATTCCTTCCTCCGAACGGTTGATCAATGGTGGAAGAGCGTTCCATC
TCTATGGTGGCCGAGTCACGGCTTTATCCAGAAATTAAAAGGCCTTAAAAAGGAATTGAAGCAATGGAACCAATCTACTTTTGGACAACAAAAAGAAAAAAAGTCCTGTT
TGAGTCAAGAGTTATCATGCTTAGATCTCAAAGAGGAACAAGGCAATCTAACTGAGCAAGAATCCTATAGAAGAACTGAGATTAAAGCTGATTTGCTCTCGATATCTGCC
AATGAGGAGATGCTGTGGCGACAATCTTGCAAAGTTAAGTGGCTCAAGGAGGGGGATGTAAACTCGGCCTTTTTTCCACCGAATTATGGCAGCCCACAGAAGAAAGAGCA
CAATAGTGGAGATTGTATCAGATTTGGAGGCCCATGGCGTGCTATCCACAAATCAAAATACCTTATATATGACCACATTGACATTCGAGTTGGGAAAGGTGACAAAACGC
TTTTTTGGGAGGACATTTGGCTGGGGTCTTCTCCTTTACAGTCCAAGTACCCCTCTTTATTCAGTCTCTCGCTCAAAAAGGATGCTCTTATAGTTGATTTATGGGAACCA
ACCAATGGGGCATGGAACCTCCATTTAAGAAGACACCTCCAAGATTCGGAAATTCTGGAATGGGCTTTATTATCTCACCAACTGTCCTCGTTCTCCTTCAACAATATCGA
AGATACATGGCTTTGGAAGGGTCTTATGCCGAAGAAAGTTAAGTTCTTCATGTGGGAGCTCAGCCACAGATGTATTAATACTGCAGATGTCATACAGAGGCGATTTCCCA
ACTCCTCATTATCTCCTCGCTATTGCTGCTTGTGTAACAAGGCTGCCGAATCACAGATTCATATTTTCAGTCGTTGTGAATATGCTGCAGCTCTCTGGGATCACATACAA
AGCGCTTTTGGTTGGCAATTTGCTCGTTCGGGTGATGTCCTTTCCCTTCTTCAATTCACTCTTCTTGGACATCCTTTTAAAAATGATACTAAGGTTCAATGGCGGAATTT
TTTGTACGCATTCTTCGGGAACTTATGGCTTGAAAGAAATGCCAGAATCTTCAATAATCAGCAGCAAAATGTCTATGCCTTTATTGAATCTACTTCCTATCTTGCCATGT
ATTGGAGTAGTCATATCTCCCCATTTTTAGAGGAATCAGAAATCGAAAGCCAGAAGATGCAGATTAAGGTAAAATGTAACTGCGGCGAAACCAAATGCTTAGAATGGGCC
GTTGTTGAGTTGCAAGGCATAGTTGAAGCTCAACCCTCCTTCCAAGATCGCCTTCAAAACCTCGAAATCGGCATTCTATGTCGGTCCTCCGCTCAGGAAGTTTATACTTT
TACGGTAGGGTATCATGAACTGACAGGAAACAAAGTGCCATTGAAGAAGCCGCTGTTGGTGTTGAAGAAAAAGAGGTGCGTGGATGAGGATCAGAGTGGAGATACCAATT
CCGCTACTGCAGAGTTGGAAGTCATTGGGATAATCAGGCAGCGGATTTTGTTTAAGAGCAGACCAAAGGCCCTCATATCCAAACCACAGCCATTGGTGAAGGAGAGATCC
CGCGCTTCAGGATCTGCAGTGACAGGCCAATCGACATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTCTTCCAGGAGTTTGGCTTTTGTCTCGTCAGAGCTTAGGCTTTCAGGATGGCTTTCGAGGAGCTTCTTCTTTATCTGCGCTTCTGGGACAGATGGGGCTTTTTGT
GGCAGAGGAAGACCTCAGCCATTTCCTATGGAGCTATGATTTTGTCCGTTCTGTCTGGGAATTTTTCTTTGATGCGTTTGGTTTGCAATTTGTTAGTCATAAAGATCATA
GGGAGATGATTGAGGAGTTCCTTCTCCATCCGTCATTTTGTGAGAAAGGGAGGTTATCTTATGGGAAGGTCGACTGTATCCTCATGATTATTATCTCCTGGAATGTGCGT
GGCATGGGCTCATGGAAAAAGAGAGCCCTTATTAAAGATTTTCTCTCTTCACAGAACCCCACTCTGGTCATTATTCAAGAAACCAAGATGGCCGGTATTGACCGAAAGAT
CATTAAATCTGTTTGGAGCTCGAGGAACATAGCCTGGGCTTCCATTGATGCCGTCGATGCCTCTGGAGGGATTTTGATCCTTTGGAATGAATCCTCTTTTGCCGTTAAGG
AGATTGTTGAAGGTATTTTCTCTCTCTCCATACATCTTTCCCTAGCTGATGATTTCTCCTTTTGGATCACAGGAGTTTATGGCCCCAATTCCTCCAAAGAGAGGCGTATA
TTCTGGAAGGAATTAACAGATCTCCAGGCTCTTTGTCCCCCTAATTGGATTTTGGGTGGCGACTTTAATGTCATCCGATGGACATGGGAAAAATCCTCTTATACAGTGCC
AACCCGAGCCATGAAGAAATTCAACCGTTTCATAGTCAACAAGGGCCTTATAGACATTCCCCTCGCCAATGGAAAGTATACTTGGTCCAGTTATCGACCCAACCCCACAA
TGACCCTCATTGATAGATTCCTCATCACTGATAATATTGCAACCAAGTTTCAAGCAGCCTTGGTCCGAAGATTGCCTAGAATCACCTCTGACCATTTCCCTATTAGCCTT
TCGTTGGGGAAAGAGAAATGGGGGCCTCCCCCTTACAGATTCATTAATGCGTGGCTCAATCATCATTCCTTCCTCCGAACGGTTGATCAATGGTGGAAGAGCGTTCCATC
TCTATGGTGGCCGAGTCACGGCTTTATCCAGAAATTAAAAGGCCTTAAAAAGGAATTGAAGCAATGGAACCAATCTACTTTTGGACAACAAAAAGAAAAAAAGTCCTGTT
TGAGTCAAGAGTTATCATGCTTAGATCTCAAAGAGGAACAAGGCAATCTAACTGAGCAAGAATCCTATAGAAGAACTGAGATTAAAGCTGATTTGCTCTCGATATCTGCC
AATGAGGAGATGCTGTGGCGACAATCTTGCAAAGTTAAGTGGCTCAAGGAGGGGGATGTAAACTCGGCCTTTTTTCCACCGAATTATGGCAGCCCACAGAAGAAAGAGCA
CAATAGTGGAGATTGTATCAGATTTGGAGGCCCATGGCGTGCTATCCACAAATCAAAATACCTTATATATGACCACATTGACATTCGAGTTGGGAAAGGTGACAAAACGC
TTTTTTGGGAGGACATTTGGCTGGGGTCTTCTCCTTTACAGTCCAAGTACCCCTCTTTATTCAGTCTCTCGCTCAAAAAGGATGCTCTTATAGTTGATTTATGGGAACCA
ACCAATGGGGCATGGAACCTCCATTTAAGAAGACACCTCCAAGATTCGGAAATTCTGGAATGGGCTTTATTATCTCACCAACTGTCCTCGTTCTCCTTCAACAATATCGA
AGATACATGGCTTTGGAAGGGTCTTATGCCGAAGAAAGTTAAGTTCTTCATGTGGGAGCTCAGCCACAGATGTATTAATACTGCAGATGTCATACAGAGGCGATTTCCCA
ACTCCTCATTATCTCCTCGCTATTGCTGCTTGTGTAACAAGGCTGCCGAATCACAGATTCATATTTTCAGTCGTTGTGAATATGCTGCAGCTCTCTGGGATCACATACAA
AGCGCTTTTGGTTGGCAATTTGCTCGTTCGGGTGATGTCCTTTCCCTTCTTCAATTCACTCTTCTTGGACATCCTTTTAAAAATGATACTAAGGTTCAATGGCGGAATTT
TTTGTACGCATTCTTCGGGAACTTATGGCTTGAAAGAAATGCCAGAATCTTCAATAATCAGCAGCAAAATGTCTATGCCTTTATTGAATCTACTTCCTATCTTGCCATGT
ATTGGAGTAGTCATATCTCCCCATTTTTAGAGGAATCAGAAATCGAAAGCCAGAAGATGCAGATTAAGGTAAAATGTAACTGCGGCGAAACCAAATGCTTAGAATGGGCC
GTTGTTGAGTTGCAAGGCATAGTTGAAGCTCAACCCTCCTTCCAAGATCGCCTTCAAAACCTCGAAATCGGCATTCTATGTCGGTCCTCCGCTCAGGAAGTTTATACTTT
TACGGTAGGGTATCATGAACTGACAGGAAACAAAGTGCCATTGAAGAAGCCGCTGTTGGTGTTGAAGAAAAAGAGGTGCGTGGATGAGGATCAGAGTGGAGATACCAATT
CCGCTACTGCAGAGTTGGAAGTCATTGGGATAATCAGGCAGCGGATTTTGTTTAAGAGCAGACCAAAGGCCCTCATATCCAAACCACAGCCATTGGTGAAGGAGAGATCC
CGCGCTTCAGGATCTGCAGTGACAGGCCAATCGACATGA
Protein sequenceShow/hide protein sequence
MFLPGVWLLSRQSLGFQDGFRGASSLSALLGQMGLFVAEEDLSHFLWSYDFVRSVWEFFFDAFGLQFVSHKDHREMIEEFLLHPSFCEKGRLSYGKVDCILMIIISWNVR
GMGSWKKRALIKDFLSSQNPTLVIIQETKMAGIDRKIIKSVWSSRNIAWASIDAVDASGGILILWNESSFAVKEIVEGIFSLSIHLSLADDFSFWITGVYGPNSSKERRI
FWKELTDLQALCPPNWILGGDFNVIRWTWEKSSYTVPTRAMKKFNRFIVNKGLIDIPLANGKYTWSSYRPNPTMTLIDRFLITDNIATKFQAALVRRLPRITSDHFPISL
SLGKEKWGPPPYRFINAWLNHHSFLRTVDQWWKSVPSLWWPSHGFIQKLKGLKKELKQWNQSTFGQQKEKKSCLSQELSCLDLKEEQGNLTEQESYRRTEIKADLLSISA
NEEMLWRQSCKVKWLKEGDVNSAFFPPNYGSPQKKEHNSGDCIRFGGPWRAIHKSKYLIYDHIDIRVGKGDKTLFWEDIWLGSSPLQSKYPSLFSLSLKKDALIVDLWEP
TNGAWNLHLRRHLQDSEILEWALLSHQLSSFSFNNIEDTWLWKGLMPKKVKFFMWELSHRCINTADVIQRRFPNSSLSPRYCCLCNKAAESQIHIFSRCEYAAALWDHIQ
SAFGWQFARSGDVLSLLQFTLLGHPFKNDTKVQWRNFLYAFFGNLWLERNARIFNNQQQNVYAFIESTSYLAMYWSSHISPFLEESEIESQKMQIKVKCNCGETKCLEWA
VVELQGIVEAQPSFQDRLQNLEIGILCRSSAQEVYTFTVGYHELTGNKVPLKKPLLVLKKKRCVDEDQSGDTNSATAELEVIGIIRQRILFKSRPKALISKPQPLVKERS
RASGSAVTGQST