CuGenDBv2

Spg002364 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg002364
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: scaffold6:3711380..3720756
RNA-Seq Expression: Spg002364
Synteny: Spg002364
Gene Ontology terms:
GO:0006260 - DNA replication (biological process)
GO:0050789 - regulation of biological process (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003824 - catalytic activity (molecular function)
InterPro domains:
IPR005135 - Endonuclease/exonuclease/phosphatase
IPR007257 - DNA replication complex GINS protein Psf2
IPR021151 - GINS subunit, domain A
IPR025558 - Domain of unknown function DUF4283
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036224 - GINS, helical bundle-like domain superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
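
The genome location above can be turned into a span length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention); the helper names `parse_location` and `span_length` are hypothetical and not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


scaffold, start, end = parse_location("scaffold6:3711380..3720756")
print(scaffold, span_length(start, end))  # scaffold6 9377
```

Under that assumption the gene spans 9377 bp on scaffold6; with 0-based, half-open coordinates the length would be one less.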


Homology
GenBank top hits (e value, % identity, alignment)
RVW65579.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] (e value: 9.2e-67; % identity: 44.64)
Query:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSQICSRIIKSLWSSSHIGWTSLDSVGASGGILTMWSEPEFSVKETTQGLFTLSIHIYMANN
        MK ++WN RGLGS KKR ++K  +  + P +V+IQETKK +   R++ S+WS  +  W +L + GASGGIL +W   +   +E   G F++SI   M   
Subjt:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSQICSRIIKSLWSSSHIGWTSLDSVGASGGILTMWSEPEFSVKETTQGLFTLSIHIYMANN

Query:  FSFWLSAIYGPSRHADRSDFWNELHDLAGLGGNNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIDNYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL
         S WLSA+YGP+  A R DFW EL D+AGL    W +GGDFNV R S EK  G  +T  M+ F+++I +  LID+PL++  YTWS+  EN  C  +DRFL
Subjt:  FSFWLSAIYGPSRHADRSDFWNELHDLAGLGGNNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIDNYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL

Query:  MTDTCLNKFGAARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENAWLKIDSFRGLMDNWWSQNTVQGWPGHGFMMKLKGLKSELRKWN
         ++     F  +    L R TSDH+P  L      WGP PFRFEN WL+  SF+     WWS+    GW GH FM KL+ +K++L++WN
Subjt:  MTDTCLNKFGAARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENAWLKIDSFRGLMDNWWSQNTVQGWPGHGFMMKLKGLKSELRKWN

RVX11275.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] (e value: 8.3e-68; % identity: 44.37)
Query:  RECYMKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSQICSRIIKSLWSSSHIGWTSLDSVGASGGILTMWSEPEFSVKETTQGLFTLSIHIY
        R  +MK ++WN RGLGS KKR ++K  +  + P +V+IQETKK +   R++ S+WS  +  W +L + GASGGIL +W   +   +E   G F++SI   
Subjt:  RECYMKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSQICSRIIKSLWSSSHIGWTSLDSVGASGGILTMWSEPEFSVKETTQGLFTLSIHIY

Query:  MANNFSFWLSAIYGPSRHADRSDFWNELHDLAGLGGNNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIDNYHLIDTPLQNGCYTWSSCGENHYCSLI
        M    S WLSA+YGP+  A R DFW EL D+AGL    W +GGDFNV R S EK  G  +T  M+ F+++I +  LID+PL++  YTWS+  EN  C  +
Subjt:  MANNFSFWLSAIYGPSRHADRSDFWNELHDLAGLGGNNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIDNYHLIDTPLQNGCYTWSSCGENHYCSLI

Query:  DRFLMTDTCLNKFGAARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENAWLKIDSFRGLMDNWWSQNTVQGWPGHGFMMKLKGLKSELRKWN
        DRFL ++     F  +    L R TSDH+P  L      WGP PFRFEN WL+  SF+     WWS+    GW GH FM KL+ +K++L++WN
Subjt:  DRFLMTDTCLNKFGAARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENAWLKIDSFRGLMDNWWSQNTVQGWPGHGFMMKLKGLKSELRKWN

RVX11275.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] (e value: 1.3e-17; % identity: 31.19)
Query:  WNLSQRSSA--AQLPSLVTQLKLLDDTE-DRNRNDSWMWPLESSNIFSVKSLMEDLVDYSNMANDL-YKAIWTDFYPKKIKIFLWELSHGAINTVDRLQR
        WN + R +   +++  L + ++ LD      +  D   W +  S +F+VKS    L  Y         K +W    P K+K F+W ++H  +NT D LQ 
Subjt:  WNLSQRSSA--AQLPSLVTQLKLLDDTE-DRNRNDSWMWPLESSNIFSVKSLMEDLVDYSNMANDL-YKAIWTDFYPKKIKIFLWELSHGAINTVDRLQR

Query:  RMPHFHLSPFWCIMCAASSEYPGHLFVHCTFASRYWSEILDAFGWSTVFPNCINDVLNLIFVDHPFHGEKK---ILWLALNIVFFWFLWGERNSRIFRDS
        R PH  LSP  C +C    E   HLF+HC+     W  +        V P  I+D   + F +    G  K   +LW    I   W +W ERN+RIF D 
Subjt:  RMPHFHLSPFWCIMCAASSEYPGHLFVHCTFASRYWSEILDAFGWSTVFPNCINDVLNLIFVDHPFHGEKK---ILWLALNIVFFWFLWGERNSRIFRDS

Query:  FS
        F+
Subjt:  FS

RVX11275.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] (e value: 2.4e-67; % identity: 44.41)
Query:  RGRECYMKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSQICSRIIKSLWSSSHIGWTSLDSVGASGGILTMWSEPEFSVKETTQGLFTLSIH
        R R  +MK ++WN RGLGS KKR ++K  +  + P +V+IQETKK +   R++ S+WS  +  W +L + GASGGIL +W   +   +E   G F++SI 
Subjt:  RGRECYMKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSQICSRIIKSLWSSSHIGWTSLDSVGASGGILTMWSEPEFSVKETTQGLFTLSIH

Query:  IYMANNFSFWLSAIYGPSRHADRSDFWNELHDLAGLGGNNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIDNYHLIDTPLQNGCYTWSSCGENHYCS
          M    S WLSA+YGP+  A R DFW EL D+AGL    W +GGDFNV R S EK  G  +T  M+ F+++I +  LID+PL++  YTWS+  EN  C 
Subjt:  IYMANNFSFWLSAIYGPSRHADRSDFWNELHDLAGLGGNNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIDNYHLIDTPLQNGCYTWSSCGENHYCS

Query:  LIDRFLMTDTCLNKFGAARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENAWLKIDSFRGLMDNWWSQNTVQGWPGHGFMMKLKGLKSELRKWN
         +DRFL ++     F  +    L R TSDH+P  L      WGP PFRFEN WL+  SF+     WWS+    GW GH FM KL+ +K++L++WN
Subjt:  LIDRFLMTDTCLNKFGAARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENAWLKIDSFRGLMDNWWSQNTVQGWPGHGFMMKLKGLKSELRKWN

XP_021820446.1 uncharacterized protein LOC110762145 [Prunus avium] (e value: 5.9e-66; % identity: 44.98)
Query:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSQICSRIIKSLWSSSHIGWTSLDSVGASGGILTMWSEPEFSVKETTQGLFTLSIHIYMANN
        MK ++WN+RGLGS +KR ++K+ + +  P IV++QETKK +I  R++ S+W S    W  + S G SGGI+ +W+    SV ++    F++SI I   + 
Subjt:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSQICSRIIKSLWSSSHIGWTSLDSVGASGGILTMWSEPEFSVKETTQGLFTLSIHIYMANN

Query:  FSFWLSAIYGPSRHADRSDFWNELHDLAGLGGNNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIDNYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL
          +WLS IYGP R  DR  FW EL  L GL G NW +GGDFNV R+  +KS+G  VT SMR FN +I++ +L D  L N  +TWS+  EN  C  +DRFL
Subjt:  FSFWLSAIYGPSRHADRSDFWNELHDLAGLGGNNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIDNYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL

Query:  MTDTCLNKFGAARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENAWLKIDSFRGLMDNWWSQNTVQGWPGHGFMMKLKGLKSELRKWN
         T    + F   R   L RVTSDH P  L    L WGP PFRFEN WL+   F     +WW++  + GWPG  FM +LK +K +LR W+
Subjt:  MTDTCLNKFGAARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENAWLKIDSFRGLMDNWWSQNTVQGWPGHGFMMKLKGLKSELRKWN

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia] (e value: 1.5e-85; % identity: 49.68)
Query:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSQICSRIIKSLWSSSHIGWTSLDSVGASGGILTMWSEPEFSVKETTQGLFTLSIHIYMANN
        MKFLTWNVRGL SWKK ALIK+ I + NP +V++QETK S +   I+KSLWS+  I W++LD+ G + GIL +W++P+    E  +G+F+L+I+  +++ 
Subjt:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSQICSRIIKSLWSSSHIGWTSLDSVGASGGILTMWSEPEFSVKETTQGLFTLSIHIYMANN

Query:  FSFWLSAIYGPSRHADRSDFWNELHDLAGLGGNNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIDNYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL
        F FW+S IYGPS       FW EL DL+ L  N+WIL GDFNVTRWSWEKS+GRP+T+SM +FN +I++  LID PL NG +TWS    N   SLID FL
Subjt:  FSFWLSAIYGPSRHADRSDFWNELHDLAGLGGNNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIDNYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL

Query:  MTDTCLNKFGAARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENAWLKIDSFRGLMDNWWSQNTVQGWPGHGFMMKLKGLKSELRKWNLSQ-RSSAAQL
        +T+ C++K G     R+ R TSDH+P  L FG  +WG  PFRFEN WL   +F+  ++ WW    + GWPGHG MMKLK LK  ++ W     R   +Q 
Subjt:  MTDTCLNKFGAARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENAWLKIDSFRGLMDNWWSQNTVQGWPGHGFMMKLKGLKSELRKWNLSQ-RSSAAQL

Query:  PSLVTQLKLLDDTE
          L   +  LDD E
Subjt:  PSLVTQLKLLDDTE

TrEMBL top hits (e value, % identity, alignment)
A0A438G038 Transposon TX1 uncharacterized 149 kDa protein (e value: 4.4e-67; % identity: 44.64)
Query:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSQICSRIIKSLWSSSHIGWTSLDSVGASGGILTMWSEPEFSVKETTQGLFTLSIHIYMANN
        MK ++WN RGLGS KKR ++K  +  + P +V+IQETKK +   R++ S+WS  +  W +L + GASGGIL +W   +   +E   G F++SI   M   
Subjt:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSQICSRIIKSLWSSSHIGWTSLDSVGASGGILTMWSEPEFSVKETTQGLFTLSIHIYMANN

Query:  FSFWLSAIYGPSRHADRSDFWNELHDLAGLGGNNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIDNYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL
         S WLSA+YGP+  A R DFW EL D+AGL    W +GGDFNV R S EK  G  +T  M+ F+++I +  LID+PL++  YTWS+  EN  C  +DRFL
Subjt:  FSFWLSAIYGPSRHADRSDFWNELHDLAGLGGNNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIDNYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL

Query:  MTDTCLNKFGAARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENAWLKIDSFRGLMDNWWSQNTVQGWPGHGFMMKLKGLKSELRKWN
         ++     F  +    L R TSDH+P  L      WGP PFRFEN WL+  SF+     WWS+    GW GH FM KL+ +K++L++WN
Subjt:  MTDTCLNKFGAARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENAWLKIDSFRGLMDNWWSQNTVQGWPGHGFMMKLKGLKSELRKWN

A0A438JQQ0 LINE-1 retrotransposable element ORF2 protein (e value: 4.0e-68; % identity: 44.37)
Query:  RECYMKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSQICSRIIKSLWSSSHIGWTSLDSVGASGGILTMWSEPEFSVKETTQGLFTLSIHIY
        R  +MK ++WN RGLGS KKR ++K  +  + P +V+IQETKK +   R++ S+WS  +  W +L + GASGGIL +W   +   +E   G F++SI   
Subjt:  RECYMKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSQICSRIIKSLWSSSHIGWTSLDSVGASGGILTMWSEPEFSVKETTQGLFTLSIHIY

Query:  MANNFSFWLSAIYGPSRHADRSDFWNELHDLAGLGGNNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIDNYHLIDTPLQNGCYTWSSCGENHYCSLI
        M    S WLSA+YGP+  A R DFW EL D+AGL    W +GGDFNV R S EK  G  +T  M+ F+++I +  LID+PL++  YTWS+  EN  C  +
Subjt:  MANNFSFWLSAIYGPSRHADRSDFWNELHDLAGLGGNNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIDNYHLIDTPLQNGCYTWSSCGENHYCSLI

Query:  DRFLMTDTCLNKFGAARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENAWLKIDSFRGLMDNWWSQNTVQGWPGHGFMMKLKGLKSELRKWN
        DRFL ++     F  +    L R TSDH+P  L      WGP PFRFEN WL+  SF+     WWS+    GW GH FM KL+ +K++L++WN
Subjt:  DRFLMTDTCLNKFGAARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENAWLKIDSFRGLMDNWWSQNTVQGWPGHGFMMKLKGLKSELRKWN

A0A438JQQ0 LINE-1 retrotransposable element ORF2 protein (e value: 6.5e-18; % identity: 31.19)
Query:  WNLSQRSSA--AQLPSLVTQLKLLDDTE-DRNRNDSWMWPLESSNIFSVKSLMEDLVDYSNMANDL-YKAIWTDFYPKKIKIFLWELSHGAINTVDRLQR
        WN + R +   +++  L + ++ LD      +  D   W +  S +F+VKS    L  Y         K +W    P K+K F+W ++H  +NT D LQ 
Subjt:  WNLSQRSSA--AQLPSLVTQLKLLDDTE-DRNRNDSWMWPLESSNIFSVKSLMEDLVDYSNMANDL-YKAIWTDFYPKKIKIFLWELSHGAINTVDRLQR

Query:  RMPHFHLSPFWCIMCAASSEYPGHLFVHCTFASRYWSEILDAFGWSTVFPNCINDVLNLIFVDHPFHGEKK---ILWLALNIVFFWFLWGERNSRIFRDS
        R PH  LSP  C +C    E   HLF+HC+     W  +        V P  I+D   + F +    G  K   +LW    I   W +W ERN+RIF D 
Subjt:  RMPHFHLSPFWCIMCAASSEYPGHLFVHCTFASRYWSEILDAFGWSTVFPNCINDVLNLIFVDHPFHGEKK---ILWLALNIVFFWFLWGERNSRIFRDS

Query:  FS
        F+
Subjt:  FS

A0A438JQQ0 LINE-1 retrotransposable element ORF2 protein (e value: 1.2e-67; % identity: 44.41)
Query:  RGRECYMKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSQICSRIIKSLWSSSHIGWTSLDSVGASGGILTMWSEPEFSVKETTQGLFTLSIH
        R R  +MK ++WN RGLGS KKR ++K  +  + P +V+IQETKK +   R++ S+WS  +  W +L + GASGGIL +W   +   +E   G F++SI 
Subjt:  RGRECYMKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSQICSRIIKSLWSSSHIGWTSLDSVGASGGILTMWSEPEFSVKETTQGLFTLSIH

Query:  IYMANNFSFWLSAIYGPSRHADRSDFWNELHDLAGLGGNNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIDNYHLIDTPLQNGCYTWSSCGENHYCS
          M    S WLSA+YGP+  A R DFW EL D+AGL    W +GGDFNV R S EK  G  +T  M+ F+++I +  LID+PL++  YTWS+  EN  C 
Subjt:  IYMANNFSFWLSAIYGPSRHADRSDFWNELHDLAGLGGNNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIDNYHLIDTPLQNGCYTWSSCGENHYCS

Query:  LIDRFLMTDTCLNKFGAARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENAWLKIDSFRGLMDNWWSQNTVQGWPGHGFMMKLKGLKSELRKWN
         +DRFL ++     F  +    L R TSDH+P  L      WGP PFRFEN WL+  SF+     WWS+    GW GH FM KL+ +K++L++WN
Subjt:  LIDRFLMTDTCLNKFGAARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENAWLKIDSFRGLMDNWWSQNTVQGWPGHGFMMKLKGLKSELRKWN

A0A6J1E2G6 uncharacterized protein LOC111025405 (e value: 7.3e-86; % identity: 49.68)
Query:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSQICSRIIKSLWSSSHIGWTSLDSVGASGGILTMWSEPEFSVKETTQGLFTLSIHIYMANN
        MKFLTWNVRGL SWKK ALIK+ I + NP +V++QETK S +   I+KSLWS+  I W++LD+ G + GIL +W++P+    E  +G+F+L+I+  +++ 
Subjt:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSQICSRIIKSLWSSSHIGWTSLDSVGASGGILTMWSEPEFSVKETTQGLFTLSIHIYMANN

Query:  FSFWLSAIYGPSRHADRSDFWNELHDLAGLGGNNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIDNYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL
        F FW+S IYGPS       FW EL DL+ L  N+WIL GDFNVTRWSWEKS+GRP+T+SM +FN +I++  LID PL NG +TWS    N   SLID FL
Subjt:  FSFWLSAIYGPSRHADRSDFWNELHDLAGLGGNNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIDNYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL

Query:  MTDTCLNKFGAARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENAWLKIDSFRGLMDNWWSQNTVQGWPGHGFMMKLKGLKSELRKWNLSQ-RSSAAQL
        +T+ C++K G     R+ R TSDH+P  L FG  +WG  PFRFEN WL   +F+  ++ WW    + GWPGHG MMKLK LK  ++ W     R   +Q 
Subjt:  MTDTCLNKFGAARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENAWLKIDSFRGLMDNWWSQNTVQGWPGHGFMMKLKGLKSELRKWNLSQ-RSSAAQL

Query:  PSLVTQLKLLDDTE
          L   +  LDD E
Subjt:  PSLVTQLKLLDDTE

A0A6P5T1U8 uncharacterized protein LOC110762145 (e value: 2.9e-66; % identity: 44.98)
Query:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSQICSRIIKSLWSSSHIGWTSLDSVGASGGILTMWSEPEFSVKETTQGLFTLSIHIYMANN
        MK ++WN+RGLGS +KR ++K+ + +  P IV++QETKK +I  R++ S+W S    W  + S G SGGI+ +W+    SV ++    F++SI I   + 
Subjt:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSQICSRIIKSLWSSSHIGWTSLDSVGASGGILTMWSEPEFSVKETTQGLFTLSIHIYMANN

Query:  FSFWLSAIYGPSRHADRSDFWNELHDLAGLGGNNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIDNYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL
          +WLS IYGP R  DR  FW EL  L GL G NW +GGDFNV R+  +KS+G  VT SMR FN +I++ +L D  L N  +TWS+  EN  C  +DRFL
Subjt:  FSFWLSAIYGPSRHADRSDFWNELHDLAGLGGNNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIDNYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL

Query:  MTDTCLNKFGAARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENAWLKIDSFRGLMDNWWSQNTVQGWPGHGFMMKLKGLKSELRKWN
         T    + F   R   L RVTSDH P  L    L WGP PFRFEN WL+   F     +WW++  + GWPG  FM +LK +K +LR W+
Subjt:  MTDTCLNKFGAARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENAWLKIDSFRGLMDNWWSQNTVQGWPGHGFMMKLKGLKSELRKWN

SwissProt top hits (e value, % identity, alignment)
Q54BL9 Probable DNA replication complex GINS protein PSF2 (e value: 1.0e-15; % identity: 51.52)
Query:  EVEFVAEDEMVEIVPNMRMEPLNLICGNYGPFYPQIATEVPLWLAIALKKRGKCAVRPPEWMSVGK
        ++EF+AED  + +VPN +ME L  + G YGPF P    E+PLWLAI+LKK+ KC + PP+WM+  K
Subjt:  EVEFVAEDEMVEIVPNMRMEPLNLICGNYGPFYPQIATEVPLWLAIALKKRGKCAVRPPEWMSVGK
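
The % identity figure can be related to the alignment display. The sketch below assumes, as in BLAST-style pairwise output, that a letter on the middle line marks an identical residue, `+` marks a similar but non-identical one, and a space marks a mismatch, and that identity is the count of identical positions divided by the alignment length. The function name `percent_identity` is hypothetical; the page does not say how its values were computed.

```python
def percent_identity(query_row: str, match_row: str) -> float:
    """Percentage of aligned columns whose middle-line symbol is a letter.

    query_row: the aligned query line (gaps, if any, included as '-').
    match_row: the middle line of the alignment display.
    """
    identical = sum(1 for symbol in match_row if symbol.isalpha())
    return 100.0 * identical / len(query_row)


# Query line and middle line of the Q54BL9 alignment shown above.
query = "EVEFVAEDEMVEIVPNMRMEPLNLICGNYGPFYPQIATEVPLWLAIALKKRGKCAVRPPEWMSVGK"
middle = "++EF+AED  + +VPN +ME L  + G YGPF P    E+PLWLAI+LKK+ KC + PP+WM+  K"

print(round(percent_identity(query, middle), 2))  # 51.52
```

Here 34 of 66 aligned positions are identical, which reproduces the 51.52 listed for this hit. Only letters are counted, so the result does not depend on the exact spacing of the middle line.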

Q7ZT46 DNA replication complex GINS protein PSF2 (e value: 5.0e-15; % identity: 55.88)
Query:  AEEVEFVAEDEMVEIVPNMRMEPLNLICGNYGPFYPQIATEVPLWLAIALKKRGKCAVRPPEWMSVGK
        A EVEF+AE E V ++PN  ++ + LI G+ GPF P +  EVPLWLAI LK+R KC + PPEWM V K
Subjt:  AEEVEFVAEDEMVEIVPNMRMEPLNLICGNYGPFYPQIATEVPLWLAIALKKRGKCAVRPPEWMSVGK

Q7ZT46 DNA replication complex GINS protein PSF2 (e value: 3.8e-07; % identity: 31.37)
Query:  LEKLTQILEAERESQGSFQILPFHYVEIAKLLFDHARDDVPDIYLVRSLIEDIRDVRFHKVETSLES-IDTRTSAVKIKDLSAMEVNIVRPFVGRALQAI
        +EKL  I + ER  +    +   +Y+E+ KLL +HA D++P    +R+L++D  D R  K+  S +S +  + +  K+ +L+ ME+N +  F   +L  +
Subjt:  LEKLTQILEAERESQGSFQILPFHYVEIAKLLFDHARDDVPDIYLVRSLIEDIRDVRFHKVETSLES-IDTRTSAVKIKDLSAMEVNIVRPFVGRALQAI

Query:  YK
        YK
Subjt:  YK

Q9C7A8 DNA replication complex GINS protein PSF2 (e value: 6.4e-39; % identity: 65.41)
Query:  LEKLTQILEAERESQGSFQILPFHYVEIAKLLFDHARDDVPDIYLVRSLIEDIRDVRFHKVETSLESIDTRTSAVKIKDLSAMEVNIVRPFVGRALQAIY
        ++ LTQILEAERESQ +FQ LPF YVEIA+LLFDHARDD+PD+Y+VRSL+EDIRDVR HK+ET+L S    TSAVKI ++SAMEVNIVRPFV RAL+A Y
Subjt:  LEKLTQILEAERESQGSFQILPFHYVEIAKLLFDHARDDVPDIYLVRSLIEDIRDVRFHKVETSLESIDTRTSAVKIKDLSAMEVNIVRPFVGRALQAIY

Query:  KHGNPELVPDQERTASVQPQGHDHGQRRPLRRR
        KH  PE   D++  +S Q +  ++  RRPLR+R
Subjt:  KHGNPELVPDQERTASVQPQGHDHGQRRPLRRR

Q9C7A8 DNA replication complex GINS protein PSF2 (e value: 5.1e-28; % identity: 72.15)
Query:  MAGQSDPHLTLFSAEEVEFVAEDEMVEIVPNMRMEPLNLICGNYGPFYPQIATEVPLWLAIALKKRGKCAVRPPEWMSV
        MAGQ+DPH++LFS +EVEF+AEDE+VEIVPNM ME LN I G++G F PQI T+VPLWLA+ALK+RGKC  RPP WMSV
Subjt:  MAGQSDPHLTLFSAEEVEFVAEDEMVEIVPNMRMEPLNLICGNYGPFYPQIATEVPLWLAIALKKRGKCAVRPPEWMSV

Q9D600 DNA replication complex GINS protein PSF2 (e value: 1.1e-14; % identity: 55.88)
Query:  AEEVEFVAEDEMVEIVPNMRMEPLNLICGNYGPFYPQIATEVPLWLAIALKKRGKCAVRPPEWMSVGK
        A EVEF+AE E+V I+PN  ++ + LI G+ GPF P +  +VPLWLAI LK+R KC + PPEWM V K
Subjt:  AEEVEFVAEDEMVEIVPNMRMEPLNLICGNYGPFYPQIATEVPLWLAIALKKRGKCAVRPPEWMSVGK

Q9D600 DNA replication complex GINS protein PSF2 (e value: 4.1e-09; % identity: 35.29)
Query:  LEKLTQILEAERESQGSFQILPFHYVEIAKLLFDHARDDVPDIYLVRSLIEDIRDVRFHKVETSLESIDTRTSA-VKIKDLSAMEVNIVRPFVGRALQAI
        +EKL Q+ + ER+ +    +   HY+EI KLL +HA D++P    +R+LI+D+ D R  K+  S +S   +  A  K+ +L+ ME++    F+ +AL  +
Subjt:  LEKLTQILEAERESQGSFQILPFHYVEIAKLLFDHARDDVPDIYLVRSLIEDIRDVRFHKVETSLESIDTRTSA-VKIKDLSAMEVNIVRPFVGRALQAI

Query:  YK
        YK
Subjt:  YK

Q9Y248 DNA replication complex GINS protein PSF2 (e value: 5.0e-15; % identity: 57.35)
Query:  AEEVEFVAEDEMVEIVPNMRMEPLNLICGNYGPFYPQIATEVPLWLAIALKKRGKCAVRPPEWMSVGK
        A EVEF+AE E+V I+PN  ++ + LI G+ GPF P +  EVPLWLAI LK+R KC + PPEWM V K
Subjt:  AEEVEFVAEDEMVEIVPNMRMEPLNLICGNYGPFYPQIATEVPLWLAIALKKRGKCAVRPPEWMSVGK

Q9Y248 DNA replication complex GINS protein PSF2 (e value: 1.3e-07; % identity: 32.35)
Query:  LEKLTQILEAERESQGSFQILPFHYVEIAKLLFDHARDDVPDIYLVRSLIEDIRDVRFHKVETSLESIDTRTSA-VKIKDLSAMEVNIVRPFVGRALQAI
        +EKL ++ + ER+ +    +   +Y+E+ KLL +HA D++P    +R+L++D+ D R  K+  S +S   +  A  K+ +L+ ME+N    F+ +AL  +
Subjt:  LEKLTQILEAERESQGSFQILPFHYVEIAKLLFDHARDDVPDIYLVRSLIEDIRDVRFHKVETSLESIDTRTSA-VKIKDLSAMEVNIVRPFVGRALQAI

Query:  YK
        YK
Subjt:  YK

Arabidopsis top hits (e value, % identity, alignment)
AT3G12530.1 PSF2 (e value: 4.6e-40; % identity: 65.41)
Query:  LEKLTQILEAERESQGSFQILPFHYVEIAKLLFDHARDDVPDIYLVRSLIEDIRDVRFHKVETSLESIDTRTSAVKIKDLSAMEVNIVRPFVGRALQAIY
        ++ LTQILEAERESQ +FQ LPF YVEIA+LLFDHARDD+PD+Y+VRSL+EDIRDVR HK+ET+L S    TSAVKI ++SAMEVNIVRPFV RAL+A Y
Subjt:  LEKLTQILEAERESQGSFQILPFHYVEIAKLLFDHARDDVPDIYLVRSLIEDIRDVRFHKVETSLESIDTRTSAVKIKDLSAMEVNIVRPFVGRALQAIY

Query:  KHGNPELVPDQERTASVQPQGHDHGQRRPLRRR
        KH  PE   D++  +S Q +  ++  RRPLR+R
Subjt:  KHGNPELVPDQERTASVQPQGHDHGQRRPLRRR

AT3G12530.1 PSF2 (e value: 3.6e-29; % identity: 72.15)
Query:  MAGQSDPHLTLFSAEEVEFVAEDEMVEIVPNMRMEPLNLICGNYGPFYPQIATEVPLWLAIALKKRGKCAVRPPEWMSV
        MAGQ+DPH++LFS +EVEF+AEDE+VEIVPNM ME LN I G++G F PQI T+VPLWLA+ALK+RGKC  RPP WMSV
Subjt:  MAGQSDPHLTLFSAEEVEFVAEDEMVEIVPNMRMEPLNLICGNYGPFYPQIATEVPLWLAIALKKRGKCAVRPPEWMSV

AT3G12530.2 PSF2 (e value: 4.6e-40; % identity: 65.41)
Query:  LEKLTQILEAERESQGSFQILPFHYVEIAKLLFDHARDDVPDIYLVRSLIEDIRDVRFHKVETSLESIDTRTSAVKIKDLSAMEVNIVRPFVGRALQAIY
        ++ LTQILEAERESQ +FQ LPF YVEIA+LLFDHARDD+PD+Y+VRSL+EDIRDVR HK+ET+L S    TSAVKI ++SAMEVNIVRPFV RAL+A Y
Subjt:  LEKLTQILEAERESQGSFQILPFHYVEIAKLLFDHARDDVPDIYLVRSLIEDIRDVRFHKVETSLESIDTRTSAVKIKDLSAMEVNIVRPFVGRALQAIY

Query:  KHGNPELVPDQERTASVQPQGHDHGQRRPLRRR
        KH  PE   D++  +S Q +  ++  RRPLR+R
Subjt:  KHGNPELVPDQERTASVQPQGHDHGQRRPLRRR

AT3G12530.2 PSF2 (e value: 6.9e-20; % identity: 71.67)
Query:  VAEDEMVEIVPNMRMEPLNLICGNYGPFYPQIATEVPLWLAIALKKRGKCAVRPPEWMSV
        +AEDE+VEIVPNM ME LN I G++G F PQI T+VPLWLA+ALK+RGKC  RPP WMSV
Subjt:  VAEDEMVEIVPNMRMEPLNLICGNYGPFYPQIATEVPLWLAIALKKRGKCAVRPPEWMSV


Sequences
CDS sequence
GACTCTTCTTCTTCACGCCGTTTCCTCCTGCAGCTCAGTGCCGCCGCCACCTCTCTCCCGTTGGTGCCTCCGCCGCCGTCGTCTTTCTTCCGCTCGTGCCACCGCAGCAT
CCCTCCTCGCGGTTTTGCACTTATTTTCCAGTCTCTCGTTGTTAGTGAAATGGCTGGTCAATCAGATCCTCACCTGACCCTATTTTCCGCAGAAGAGGTCGAGTTTGTCG
CTGAAGACGAAATGGTGGAGATTGTTCCTAATATGAGAATGGAACCTCTGAATTTGATTTGTGGGAATTATGGTCCGTTCTATCCCCAAATAGCAACTGAAGTTCCATTG
TGGCTAGCGATTGCTCTGAAAAAAAGAGGGAAATGTGCAGTTAGGCCTCCAGAGTGGATGTCAGTGGGTAAGTTCTATTTGTTATATGGGGTAAAATTTCTTTTATGCAT
ATGTTCTTTGGACCTACGTGAAGGATTGACAGTTAAAGAGGAAACTCTGTTTCCTAGATATAGTCATGCACTAAAGAGCCCTAAGATTGTGGCATCTCAGAAGGGAAAAA
TGTTTTGTAGTTTGGCACTTTGGCCACACATGAAAGAACTGAATGTTATGGTGACATGTAGGGGAAAAGTGTTGGTGTTATCACACTACTTTCTGGCATACATATTTAAT
TGCGTCAAATTTAAAGATAATGCTACTGGTTTCATCAAGAACTTGATCAGTCTCATTAAAAAGCCTCCAATAGATACTATTACTCCTCCCTCGAAAGAGTCCTCGAAAGA
GCCTTCAGCCTCCACAATTGATGAAGAATGGAATGAGATCATTGTTCTCCAACGCAGCAATCTTCATGATGACTGGCCGAGCATTCATCAATCACTTATTGCCGGGCAAG
CCATTCGATGTAGCATCAATCCGTTCCAGGCAAATAAAGCCATGCTCCATGTGTATGATCGAGCCATTGCTACAAATTTATGCTCTCACTCTGATTGGACCTTTCTTGGT
AAGCATAAGTTGAAATTTTATCCTTTAACCACTACTTCTGCACAACAGGATATTATGACACCATCTTATGGAGGTTGGATTGAGATCTCTTCTCTTCCCCCTACCTTGTG
GACTGAGCGTATTTTCCGGTTCATTGGGGATTCCTGCGGCGGTTTTGTGGAGACTTCTAACCTCACTAATAGGATGATTATCGCAACTGAGGCTAGGATAAAAATTCGGC
CAAATACTTCTGGTTTCATTCCCGCCGCCGTAAAGCTCACATCAGACCTTGCCGGCGTTGAACTCACGGTGCAGACAAAAGGCATTTCCGGCAACCCTCACAGAATCGGC
CTCATTAAAGATGACAAACCGAATATGGAATTTCAGGATATTGAATTAAAGAAGAAAGAGGAATCGGAAAAAGAGAATTCGAATTTTAATTCGAAAAGGAAATCTCCACC
AGCTAATTTCCCAAAAATCTCGGTACCAAATTTTATCACCTCCAGTGCACCGCTTTTATCTGATAAAATCGACAAAGGAAAGAATTATCTCCCACCGCCTCCTCCTGATT
CATCGGTTAGTCAACTGCCTGGGCCCACAATTCTTAAATTCGGCCACATTGGATCTACATCAAGGAATGAATTGAACGTTGGATCCGACACTGAAGTTTTTCTCTCCAGC
CCATCTACAAACCCTACGGCCCACAACTCAACTCAAGACCCAACATCTCCTCGACCGTTGGACCTCACCATCTTTAATGATCCACTAATTGAAGGCCCAATTGATCCGAG
CCAACCGTACCAGAACTCTCCATCCCCGATAGACATCATGCCCCCACTGCAACAGAATCCTACCCATAATACCTCCTCTCCAAATCCATTGGAACCTCCACAAATCCCAC
CCTACCATTCCCCACGGCTTTCTCCAGTACCAAATATGAAGTCTCCAACACCGAACACATTTCCCAATTGCCTTCAACATTTAGCCCCGATCTTAAGTAAACATGGCCTT
TGTATTATGGCTCTACCAACAGTACCAAAGTCAAGGGGCCGAGAATGTTATATGAAATTTTTGACATGGAATGTGCGTGGTTTGGGATCATGGAAGAAAAGGGCTTTAAT
TAAGAAAACTATTCAGCAGCAAAACCCGGGCATTGTTCTTATTCAGGAGACTAAAAAATCTCAGATTTGTAGTAGGATTATTAAATCCCTATGGAGCTCTTCTCATATTG
GTTGGACTTCTCTTGACTCTGTGGGTGCCTCTGGAGGCATTCTTACTATGTGGAGTGAACCAGAATTTTCAGTAAAGGAGACTACTCAAGGTCTTTTCACTCTCTCTATT
CATATCTATATGGCTAATAACTTCTCTTTTTGGCTATCGGCTATTTATGGCCCCTCTAGGCATGCTGACAGATCGGACTTTTGGAATGAACTTCACGACTTGGCTGGTTT
AGGTGGTAACAATTGGATTCTTGGAGGAGATTTTAATGTCACCCGCTGGTCATGGGAAAAATCGCATGGCCGGCCCGTGACTAGGAGTATGCGTATTTTCAACCAATGGA
TCGATAATTACCATCTCATAGACACTCCTTTACAGAATGGATGCTACACGTGGTCCAGTTGTGGTGAAAATCATTATTGCTCATTGATTGATCGATTCTTAATGACAGAT
ACCTGTCTCAATAAATTTGGTGCAGCTCGTTTTCTTCGTCTTGATAGGGTTACATCTGACCATTACCCATGTACTCTATCATTTGGGGATCTCTCTTGGGGCCCTTGTCC
CTTTAGATTCGAGAATGCTTGGCTGAAAATAGACTCTTTTCGTGGTCTTATGGATAATTGGTGGTCTCAAAACACTGTTCAGGGTTGGCCAGGCCATGGGTTTATGATGA
AACTTAAAGGGTTGAAATCTGAGCTCAGAAAATGGAATTTATCTCAGCGATCATCTGCTGCTCAACTTCCATCTCTTGTTACACAATTGAAATTGTTGGATGATACAGAG
GACAGGAACCGTAATGATTCCTGGATGTGGCCTTTGGAATCGTCCAATATTTTTTCTGTTAAATCTCTCATGGAAGATTTAGTAGACTATTCGAATATGGCAAATGATCT
ATATAAGGCCATTTGGACAGATTTCTATCCAAAAAAGATCAAGATTTTTCTATGGGAGCTTAGTCATGGTGCTATTAATACAGTTGATCGACTTCAACGACGAATGCCTC
ATTTTCACTTGTCTCCATTTTGGTGCATAATGTGTGCTGCTAGCTCAGAATATCCTGGGCATTTATTTGTTCATTGTACCTTCGCATCCAGATATTGGTCAGAGATTCTT
GATGCTTTTGGATGGTCCACCGTTTTTCCAAATTGCATTAACGATGTTCTTAATCTCATTTTTGTGGATCATCCCTTTCATGGAGAAAAGAAGATTTTGTGGCTTGCCTT
GAACATAGTCTTCTTTTGGTTTTTATGGGGCGAACGAAATTCTAGAATTTTCAGGGATTCTTTCTCTTCCTTTCATAAATTTATGGATCTAATTCTCTTTCATGCTTTGT
ATTGGTGTAAATGTAAACACCCCTTCTCTGATTATATCGAAGTAGATAGTTCGCAATCTCATTGTTTTTTGCATTGTGGATTAGAAAAGTTGACACAAATTTTGGAGGCA
GAACGAGAGTCTCAAGGATCTTTCCAGATTCTACCCTTCCATTATGTGGAAATAGCAAAACTTTTGTTTGACCATGCACGAGATGACGTTCCTGACATATATTTGGTGAG
GTCTCTTATTGAAGATATCAGGGATGTTAGGTTTCACAAAGTTGAAACCAGCTTGGAGTCAATTGATACACGCACATCTGCAGTAAAGATTAAAGATCTATCTGCCATGG
AAGTGAATATAGTTCGACCATTTGTCGGTAGAGCGTTGCAGGCAATTTACAAGCATGGAAATCCGGAGTTGGTTCCAGATCAAGAAAGGACGGCCAGTGTGCAGCCACAA
GGACACGATCACGGACAAAGACGACCTCTTCGGAGACGGTAG
mRNA sequence
GACTCTTCTTCTTCACGCCGTTTCCTCCTGCAGCTCAGTGCCGCCGCCACCTCTCTCCCGTTGGTGCCTCCGCCGCCGTCGTCTTTCTTCCGCTCGTGCCACCGCAGCAT
CCCTCCTCGCGGTTTTGCACTTATTTTCCAGTCTCTCGTTGTTAGTGAAATGGCTGGTCAATCAGATCCTCACCTGACCCTATTTTCCGCAGAAGAGGTCGAGTTTGTCG
CTGAAGACGAAATGGTGGAGATTGTTCCTAATATGAGAATGGAACCTCTGAATTTGATTTGTGGGAATTATGGTCCGTTCTATCCCCAAATAGCAACTGAAGTTCCATTG
TGGCTAGCGATTGCTCTGAAAAAAAGAGGGAAATGTGCAGTTAGGCCTCCAGAGTGGATGTCAGTGGGTAAGTTCTATTTGTTATATGGGGTAAAATTTCTTTTATGCAT
ATGTTCTTTGGACCTACGTGAAGGATTGACAGTTAAAGAGGAAACTCTGTTTCCTAGATATAGTCATGCACTAAAGAGCCCTAAGATTGTGGCATCTCAGAAGGGAAAAA
TGTTTTGTAGTTTGGCACTTTGGCCACACATGAAAGAACTGAATGTTATGGTGACATGTAGGGGAAAAGTGTTGGTGTTATCACACTACTTTCTGGCATACATATTTAAT
TGCGTCAAATTTAAAGATAATGCTACTGGTTTCATCAAGAACTTGATCAGTCTCATTAAAAAGCCTCCAATAGATACTATTACTCCTCCCTCGAAAGAGTCCTCGAAAGA
GCCTTCAGCCTCCACAATTGATGAAGAATGGAATGAGATCATTGTTCTCCAACGCAGCAATCTTCATGATGACTGGCCGAGCATTCATCAATCACTTATTGCCGGGCAAG
CCATTCGATGTAGCATCAATCCGTTCCAGGCAAATAAAGCCATGCTCCATGTGTATGATCGAGCCATTGCTACAAATTTATGCTCTCACTCTGATTGGACCTTTCTTGGT
AAGCATAAGTTGAAATTTTATCCTTTAACCACTACTTCTGCACAACAGGATATTATGACACCATCTTATGGAGGTTGGATTGAGATCTCTTCTCTTCCCCCTACCTTGTG
GACTGAGCGTATTTTCCGGTTCATTGGGGATTCCTGCGGCGGTTTTGTGGAGACTTCTAACCTCACTAATAGGATGATTATCGCAACTGAGGCTAGGATAAAAATTCGGC
CAAATACTTCTGGTTTCATTCCCGCCGCCGTAAAGCTCACATCAGACCTTGCCGGCGTTGAACTCACGGTGCAGACAAAAGGCATTTCCGGCAACCCTCACAGAATCGGC
CTCATTAAAGATGACAAACCGAATATGGAATTTCAGGATATTGAATTAAAGAAGAAAGAGGAATCGGAAAAAGAGAATTCGAATTTTAATTCGAAAAGGAAATCTCCACC
AGCTAATTTCCCAAAAATCTCGGTACCAAATTTTATCACCTCCAGTGCACCGCTTTTATCTGATAAAATCGACAAAGGAAAGAATTATCTCCCACCGCCTCCTCCTGATT
CATCGGTTAGTCAACTGCCTGGGCCCACAATTCTTAAATTCGGCCACATTGGATCTACATCAAGGAATGAATTGAACGTTGGATCCGACACTGAAGTTTTTCTCTCCAGC
CCATCTACAAACCCTACGGCCCACAACTCAACTCAAGACCCAACATCTCCTCGACCGTTGGACCTCACCATCTTTAATGATCCACTAATTGAAGGCCCAATTGATCCGAG
CCAACCGTACCAGAACTCTCCATCCCCGATAGACATCATGCCCCCACTGCAACAGAATCCTACCCATAATACCTCCTCTCCAAATCCATTGGAACCTCCACAAATCCCAC
CCTACCATTCCCCACGGCTTTCTCCAGTACCAAATATGAAGTCTCCAACACCGAACACATTTCCCAATTGCCTTCAACATTTAGCCCCGATCTTAAGTAAACATGGCCTT
TGTATTATGGCTCTACCAACAGTACCAAAGTCAAGGGGCCGAGAATGTTATATGAAATTTTTGACATGGAATGTGCGTGGTTTGGGATCATGGAAGAAAAGGGCTTTAAT
TAAGAAAACTATTCAGCAGCAAAACCCGGGCATTGTTCTTATTCAGGAGACTAAAAAATCTCAGATTTGTAGTAGGATTATTAAATCCCTATGGAGCTCTTCTCATATTG
GTTGGACTTCTCTTGACTCTGTGGGTGCCTCTGGAGGCATTCTTACTATGTGGAGTGAACCAGAATTTTCAGTAAAGGAGACTACTCAAGGTCTTTTCACTCTCTCTATT
CATATCTATATGGCTAATAACTTCTCTTTTTGGCTATCGGCTATTTATGGCCCCTCTAGGCATGCTGACAGATCGGACTTTTGGAATGAACTTCACGACTTGGCTGGTTT
AGGTGGTAACAATTGGATTCTTGGAGGAGATTTTAATGTCACCCGCTGGTCATGGGAAAAATCGCATGGCCGGCCCGTGACTAGGAGTATGCGTATTTTCAACCAATGGA
TCGATAATTACCATCTCATAGACACTCCTTTACAGAATGGATGCTACACGTGGTCCAGTTGTGGTGAAAATCATTATTGCTCATTGATTGATCGATTCTTAATGACAGAT
ACCTGTCTCAATAAATTTGGTGCAGCTCGTTTTCTTCGTCTTGATAGGGTTACATCTGACCATTACCCATGTACTCTATCATTTGGGGATCTCTCTTGGGGCCCTTGTCC
CTTTAGATTCGAGAATGCTTGGCTGAAAATAGACTCTTTTCGTGGTCTTATGGATAATTGGTGGTCTCAAAACACTGTTCAGGGTTGGCCAGGCCATGGGTTTATGATGA
AACTTAAAGGGTTGAAATCTGAGCTCAGAAAATGGAATTTATCTCAGCGATCATCTGCTGCTCAACTTCCATCTCTTGTTACACAATTGAAATTGTTGGATGATACAGAG
GACAGGAACCGTAATGATTCCTGGATGTGGCCTTTGGAATCGTCCAATATTTTTTCTGTTAAATCTCTCATGGAAGATTTAGTAGACTATTCGAATATGGCAAATGATCT
ATATAAGGCCATTTGGACAGATTTCTATCCAAAAAAGATCAAGATTTTTCTATGGGAGCTTAGTCATGGTGCTATTAATACAGTTGATCGACTTCAACGACGAATGCCTC
ATTTTCACTTGTCTCCATTTTGGTGCATAATGTGTGCTGCTAGCTCAGAATATCCTGGGCATTTATTTGTTCATTGTACCTTCGCATCCAGATATTGGTCAGAGATTCTT
GATGCTTTTGGATGGTCCACCGTTTTTCCAAATTGCATTAACGATGTTCTTAATCTCATTTTTGTGGATCATCCCTTTCATGGAGAAAAGAAGATTTTGTGGCTTGCCTT
GAACATAGTCTTCTTTTGGTTTTTATGGGGCGAACGAAATTCTAGAATTTTCAGGGATTCTTTCTCTTCCTTTCATAAATTTATGGATCTAATTCTCTTTCATGCTTTGT
ATTGGTGTAAATGTAAACACCCCTTCTCTGATTATATCGAAGTAGATAGTTCGCAATCTCATTGTTTTTTGCATTGTGGATTAGAAAAGTTGACACAAATTTTGGAGGCA
GAACGAGAGTCTCAAGGATCTTTCCAGATTCTACCCTTCCATTATGTGGAAATAGCAAAACTTTTGTTTGACCATGCACGAGATGACGTTCCTGACATATATTTGGTGAG
GTCTCTTATTGAAGATATCAGGGATGTTAGGTTTCACAAAGTTGAAACCAGCTTGGAGTCAATTGATACACGCACATCTGCAGTAAAGATTAAAGATCTATCTGCCATGG
AAGTGAATATAGTTCGACCATTTGTCGGTAGAGCGTTGCAGGCAATTTACAAGCATGGAAATCCGGAGTTGGTTCCAGATCAAGAAAGGACGGCCAGTGTGCAGCCACAA
GGACACGATCACGGACAAAGACGACCTCTTCGGAGACGGTAG
Protein sequence
DSSSSRRFLLQLSAAATSLPLVPPPPSSFFRSCHRSIPPRGFALIFQSLVVSEMAGQSDPHLTLFSAEEVEFVAEDEMVEIVPNMRMEPLNLICGNYGPFYPQIATEVPL
WLAIALKKRGKCAVRPPEWMSVGKFYLLYGVKFLLCICSLDLREGLTVKEETLFPRYSHALKSPKIVASQKGKMFCSLALWPHMKELNVMVTCRGKVLVLSHYFLAYIFN
CVKFKDNATGFIKNLISLIKKPPIDTITPPSKESSKEPSASTIDEEWNEIIVLQRSNLHDDWPSIHQSLIAGQAIRCSINPFQANKAMLHVYDRAIATNLCSHSDWTFLG
KHKLKFYPLTTTSAQQDIMTPSYGGWIEISSLPPTLWTERIFRFIGDSCGGFVETSNLTNRMIIATEARIKIRPNTSGFIPAAVKLTSDLAGVELTVQTKGISGNPHRIG
LIKDDKPNMEFQDIELKKKEESEKENSNFNSKRKSPPANFPKISVPNFITSSAPLLSDKIDKGKNYLPPPPPDSSVSQLPGPTILKFGHIGSTSRNELNVGSDTEVFLSS
PSTNPTAHNSTQDPTSPRPLDLTIFNDPLIEGPIDPSQPYQNSPSPIDIMPPLQQNPTHNTSSPNPLEPPQIPPYHSPRLSPVPNMKSPTPNTFPNCLQHLAPILSKHGL
CIMALPTVPKSRGRECYMKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSQICSRIIKSLWSSSHIGWTSLDSVGASGGILTMWSEPEFSVKETTQGLFTLSI
HIYMANNFSFWLSAIYGPSRHADRSDFWNELHDLAGLGGNNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIDNYHLIDTPLQNGCYTWSSCGENHYCSLIDRFLMTD
TCLNKFGAARFLRLDRVTSDHYPCTLSFGDLSWGPCPFRFENAWLKIDSFRGLMDNWWSQNTVQGWPGHGFMMKLKGLKSELRKWNLSQRSSAAQLPSLVTQLKLLDDTE
DRNRNDSWMWPLESSNIFSVKSLMEDLVDYSNMANDLYKAIWTDFYPKKIKIFLWELSHGAINTVDRLQRRMPHFHLSPFWCIMCAASSEYPGHLFVHCTFASRYWSEIL
DAFGWSTVFPNCINDVLNLIFVDHPFHGEKKILWLALNIVFFWFLWGERNSRIFRDSFSSFHKFMDLILFHALYWCKCKHPFSDYIEVDSSQSHCFLHCGLEKLTQILEA
ERESQGSFQILPFHYVEIAKLLFDHARDDVPDIYLVRSLIEDIRDVRFHKVETSLESIDTRTSAVKIKDLSAMEVNIVRPFVGRALQAIYKHGNPELVPDQERTASVQPQ
GHDHGQRRPLRRR
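
The relationship between the CDS and protein sequences listed above can be checked by translating the CDS. This is a minimal sketch, assuming the standard genetic code and translation in frame from the first base, stopping at the first stop codon; the names `CODON_TABLE` and `translate` are hypothetical, and the two short strings are the first and last codons of the CDS as listed.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering,
# with the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("GACTCTTCTTCTTCACGCCGTTTCCTCCTGCAGCTC"))  # DSSSSRRFLLQL
print(translate("CGGAGACGGTAG"))  # RRR
```

Both fragments agree with the start (`DSSSSRRFLLQL...`) and end (`...RRR`) of the protein sequence listed above. To translate the full CDS, join its lines into one string before calling `translate`.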