; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G41350 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G41350
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionLactamase_B domain-containing protein
Genome locationChr3:35460351..35472817
RNA-Seq ExpressionCSPI03G41350
SyntenyCSPI03G41350
Gene Ontology termsGO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR001279 - Metallo-beta-lactamase
IPR036866 - Ribonuclease Z/Hydroxyacylglutathione hydrolase-like
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036687.1 putative hydrolase [Cucumis melo var. makuwa]4.5e-3584.62Show/hide
Query:  MVLFVGTTLPPTSMACLAALRTQVQLRSSAFSIARKGFSSFQRIAQSRLQS-----------VSGGEIGGTISANESEIIFIGTGTSEGIPRVSCLTDPV
        MVLFVGTTLPPTSMACLAALRTQVQLRSSAF IARKGFS FQRIAQSRLQS           VS GEIGGT+SANESEIIFIGTGTSEGIPRVSCLTDP+
Subjt:  MVLFVGTTLPPTSMACLAALRTQVQLRSSAFSIARKGFSSFQRIAQSRLQS-----------VSGGEIGGTISANESEIIFIGTGTSEGIPRVSCLTDPV

Query:  KKCP
        KKCP
Subjt:  KKCP

KAA0036687.1 putative hydrolase [Cucumis melo var. makuwa]1.0e-14096.08Show/hide
Query:  VCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK
        VCFKAAEPGNKNRRLNTSILVRY GPSG+RNILIDVGKFFYHSALRWFPAF IRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVM+K
Subjt:  VCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK

Query:  THYYLVDTSVILPGAAVSELQFNIIPEEPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGLPR
        THYYLVDTSVILPGAAVSELQFNIIP EPFVV+DLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCE+LILDALRPDRSSSTHFGLPR
Subjt:  THYYLVDTSVILPGAAVSELQFNIIPEEPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGLPR

Query:  ALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL
        ALEEVRKIQPKRT FTGMMHLMDHEEVNSYL+KLKETEG+DAQLSYDGLRIPVTL
Subjt:  ALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL

XP_004146818.1 putative hydrolase C777.06c isoform X1 [Cucumis sativus]2.4e-14599.61Show/hide
Query:  VCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK
        VCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK
Subjt:  VCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK

Query:  THYYLVDTSVILPGAAVSELQFNIIPEEPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGLPR
        THYYLVDTSVILPGAAVSELQFNIIPEEPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGLPR
Subjt:  THYYLVDTSVILPGAAVSELQFNIIPEEPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGLPR

Query:  ALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL
        ALEEVRKIQPKRTLFTGMMHLMDHEEVN YLLKLKETEGLDAQLSYDGLRIPVTL
Subjt:  ALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL

XP_004146818.1 putative hydrolase C777.06c isoform X1 [Cucumis sativus]1.6e-3791.18Show/hide
Query:  MVLFVGTTLPPTSMACLAALRTQVQLRSSAFSIARKGFSSFQRIAQSRLQS---------VSGGEIGGTISANESEIIFIGTGTSEGIPRVSCLTDPVKK
        MVLFVGTTLPPTSMACLAALRTQVQLRSSAFSIARKGFSSFQRIAQSRLQS         VSGGEIGGTISANESEIIFIGTGTSEGIPRVSCLTDPVKK
Subjt:  MVLFVGTTLPPTSMACLAALRTQVQLRSSAFSIARKGFSSFQRIAQSRLQS---------VSGGEIGGTISANESEIIFIGTGTSEGIPRVSCLTDPVKK

Query:  CP
        CP
Subjt:  CP

XP_004146818.1 putative hydrolase C777.06c isoform X1 [Cucumis sativus]2.4e-14599.61Show/hide
Query:  VCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK
        VCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK
Subjt:  VCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK

Query:  THYYLVDTSVILPGAAVSELQFNIIPEEPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGLPR
        THYYLVDTSVILPGAAVSELQFNIIPEEPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGLPR
Subjt:  THYYLVDTSVILPGAAVSELQFNIIPEEPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGLPR

Query:  ALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL
        ALEEVRKIQPKRTLFTGMMHLMDHEEVN YLLKLKETEGLDAQLSYDGLRIPVTL
Subjt:  ALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL

XP_008447639.1 PREDICTED: putative hydrolase C777.06c [Cucumis melo]1.3e-3794.62Show/hide
Query:  MVLFVGTTLPPTSMACLAALRTQVQLRSSAFSIARKGFSSFQRIAQSRLQSVSGGEIGGTISANESEIIFIGTGTSEGIPRVSCLTDPVKKCP
        MVLFVGTTLPPTSMACLAALRTQVQLRSSAF IARKGFS FQRIAQSRLQSVS GEIGGT+SANESEIIFIGTGTSEGIPRVSCLTDP+KKCP
Subjt:  MVLFVGTTLPPTSMACLAALRTQVQLRSSAFSIARKGFSSFQRIAQSRLQSVSGGEIGGTISANESEIIFIGTGTSEGIPRVSCLTDPVKKCP

XP_008447639.1 PREDICTED: putative hydrolase C777.06c [Cucumis melo]1.7e-14398.04Show/hide
Query:  VCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK
        VCFKAAEPGNKNRRLNTSILVRY GPSGNRNILIDVGKFFYHSALRWFPAF IRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK
Subjt:  VCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK

Query:  THYYLVDTSVILPGAAVSELQFNIIPEEPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGLPR
        THYYLVDTSVILPGAAVSELQFNIIPEEPFVV+DLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCE+LILDALRPDRSSSTHFGLPR
Subjt:  THYYLVDTSVILPGAAVSELQFNIIPEEPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGLPR

Query:  ALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL
        ALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEG+DAQLSYDGLRIPVTL
Subjt:  ALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL

XP_011652347.1 putative hydrolase C777.06c isoform X2 [Cucumis sativus]7.9e-40100Show/hide
Query:  MVLFVGTTLPPTSMACLAALRTQVQLRSSAFSIARKGFSSFQRIAQSRLQSVSGGEIGGTISANESEIIFIGTGTSEGIPRVSCLTDPVKKCP
        MVLFVGTTLPPTSMACLAALRTQVQLRSSAFSIARKGFSSFQRIAQSRLQSVSGGEIGGTISANESEIIFIGTGTSEGIPRVSCLTDPVKKCP
Subjt:  MVLFVGTTLPPTSMACLAALRTQVQLRSSAFSIARKGFSSFQRIAQSRLQSVSGGEIGGTISANESEIIFIGTGTSEGIPRVSCLTDPVKKCP

XP_011652347.1 putative hydrolase C777.06c isoform X2 [Cucumis sativus]1.7e-14398.04Show/hide
Query:  VCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK
        VCFKAAEPGNKNRRLNTSILVRY GPSGNRNILIDVGKFFYHSALRWFPAF IRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK
Subjt:  VCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK

Query:  THYYLVDTSVILPGAAVSELQFNIIPEEPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGLPR
        THYYLVDTSVILPGAAVSELQFNIIPEEPFVV+DLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCE+LILDALRPDRSSSTHFGLPR
Subjt:  THYYLVDTSVILPGAAVSELQFNIIPEEPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGLPR

Query:  ALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL
        ALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEG+DAQLSYDGLRIPVTL
Subjt:  ALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL

XP_022148298.1 putative hydrolase C777.06c [Momordica charantia]1.9e-2577.08Show/hide
Query:  MVLFVGTTLPPTSMACLAAL--RTQVQLRSSAFSIARKGFSSFQRIAQSRLQSVSGGEIG-GTISANESEIIFIGTGTSEGIPRVSCLTDPVKKCP
        MVL VGT LP TSMA LAAL  R Q+QLRSSAFSI+RKGFS F+RIAQ+RLQSVS  E    T+ +NESEIIFIGTGTSEGIPRVSCLT+PVKKCP
Subjt:  MVLFVGTTLPPTSMACLAAL--RTQVQLRSSAFSIARKGFSSFQRIAQSRLQSVSGGEIG-GTISANESEIIFIGTGTSEGIPRVSCLTDPVKKCP

TrEMBL top hitse value%identityAlignment
A0A0A0LFS7 Lactamase_B domain-containing protein1.1e-14599.61Show/hide
Query:  VCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK
        VCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK
Subjt:  VCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK

Query:  THYYLVDTSVILPGAAVSELQFNIIPEEPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGLPR
        THYYLVDTSVILPGAAVSELQFNIIPEEPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGLPR
Subjt:  THYYLVDTSVILPGAAVSELQFNIIPEEPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGLPR

Query:  ALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL
        ALEEVRKIQPKRTLFTGMMHLMDHEEVN YLLKLKETEGLDAQLSYDGLRIPVTL
Subjt:  ALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL

A0A1S3BIJ3 putative hydrolase C777.06c6.1e-3894.62Show/hide
Query:  MVLFVGTTLPPTSMACLAALRTQVQLRSSAFSIARKGFSSFQRIAQSRLQSVSGGEIGGTISANESEIIFIGTGTSEGIPRVSCLTDPVKKCP
        MVLFVGTTLPPTSMACLAALRTQVQLRSSAF IARKGFS FQRIAQSRLQSVS GEIGGT+SANESEIIFIGTGTSEGIPRVSCLTDP+KKCP
Subjt:  MVLFVGTTLPPTSMACLAALRTQVQLRSSAFSIARKGFSSFQRIAQSRLQSVSGGEIGGTISANESEIIFIGTGTSEGIPRVSCLTDPVKKCP

A0A1S3BIJ3 putative hydrolase C777.06c5.0e-14196.08Show/hide
Query:  VCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK
        VCFKAAEPGNKNRRLNTSILVRY GPSG+RNILIDVGKFFYHSALRWFPAF IRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVM+K
Subjt:  VCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK

Query:  THYYLVDTSVILPGAAVSELQFNIIPEEPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGLPR
        THYYLVDTSVILPGAAVSELQFNIIP EPFVV+DLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCE+LILDALRPDRSSSTHFGLPR
Subjt:  THYYLVDTSVILPGAAVSELQFNIIPEEPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGLPR

Query:  ALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL
        ALEEVRKIQPKRT FTGMMHLMDHEEVNSYL+KLKETEG+DAQLSYDGLRIPVTL
Subjt:  ALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL

A0A5D3BVB8 Putative hydrolase8.2e-14498.04Show/hide
Query:  VCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK
        VCFKAAEPGNKNRRLNTSILVRY GPSGNRNILIDVGKFFYHSALRWFPAF IRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK
Subjt:  VCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK

Query:  THYYLVDTSVILPGAAVSELQFNIIPEEPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGLPR
        THYYLVDTSVILPGAAVSELQFNIIPEEPFVV+DLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCE+LILDALRPDRSSSTHFGLPR
Subjt:  THYYLVDTSVILPGAAVSELQFNIIPEEPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGLPR

Query:  ALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL
        ALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEG+DAQLSYDGLRIPVTL
Subjt:  ALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL

A0A5D3BVB8 Putative hydrolase2.2e-3584.62Show/hide
Query:  MVLFVGTTLPPTSMACLAALRTQVQLRSSAFSIARKGFSSFQRIAQSRLQS-----------VSGGEIGGTISANESEIIFIGTGTSEGIPRVSCLTDPV
        MVLFVGTTLPPTSMACLAALRTQVQLRSSAF IARKGFS FQRIAQSRLQS           VS GEIGGT+SANESEIIFIGTGTSEGIPRVSCLTDP+
Subjt:  MVLFVGTTLPPTSMACLAALRTQVQLRSSAFSIARKGFSSFQRIAQSRLQS-----------VSGGEIGGTISANESEIIFIGTGTSEGIPRVSCLTDPV

Query:  KKCP
        KKCP
Subjt:  KKCP

A0A5D3BVB8 Putative hydrolase8.2e-14498.04Show/hide
Query:  VCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK
        VCFKAAEPGNKNRRLNTSILVRY GPSGNRNILIDVGKFFYHSALRWFPAF IRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK
Subjt:  VCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK

Query:  THYYLVDTSVILPGAAVSELQFNIIPEEPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGLPR
        THYYLVDTSVILPGAAVSELQFNIIPEEPFVV+DLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCE+LILDALRPDRSSSTHFGLPR
Subjt:  THYYLVDTSVILPGAAVSELQFNIIPEEPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGLPR

Query:  ALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL
        ALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEG+DAQLSYDGLRIPVTL
Subjt:  ALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL

A0A6J1D3Q7 putative hydrolase C777.06c9.1e-2677.08Show/hide
Query:  MVLFVGTTLPPTSMACLAAL--RTQVQLRSSAFSIARKGFSSFQRIAQSRLQSVSGGEIG-GTISANESEIIFIGTGTSEGIPRVSCLTDPVKKCP
        MVL VGT LP TSMA LAAL  R Q+QLRSSAFSI+RKGFS F+RIAQ+RLQSVS  E    T+ +NESEIIFIGTGTSEGIPRVSCLT+PVKKCP
Subjt:  MVLFVGTTLPPTSMACLAAL--RTQVQLRSSAFSIARKGFSSFQRIAQSRLQSVSGGEIG-GTISANESEIIFIGTGTSEGIPRVSCLTDPVKKCP

A0A6J1D3Q7 putative hydrolase C777.06c5.7e-13792.55Show/hide
Query:  VCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK
        VCFKAAEPGNKNRRLNT ILVRY GPSG RNILIDVGKFFYHSAL+WFP F IRTIDAVIITHSHADAIGGLDDLRDWTNNVQPS+PIYVAQRDFEVM+K
Subjt:  VCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQK

Query:  THYYLVDTSVILPGAAVSELQFNIIPEEPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGLPR
        THYYLVDTSVI PGAAVS+LQFNIIPEEPFVV+ LKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCE+LI+DALRPDRSSSTHFGLPR
Subjt:  THYYLVDTSVILPGAAVSELQFNIIPEEPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGLPR

Query:  ALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL
        ALEEVRKIQPKRTLFTGMMHLMDH+EVN YLLKLKE+EG+DAQLSYDGLRI VTL
Subjt:  ALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL

A0A6J1GUY3 putative hydrolase C777.06c1.4e-2979.57Show/hide
Query:  MVLFVGTTLPPTSMACLAALRTQVQLRSSAFSIARKGFSSFQRIAQSRLQSVSGGEIGGTISANESEIIFIGTGTSEGIPRVSCLTDPVKKCP
        MVLFVGTTLPPTSMA LAAL T+V L +S+FSI RKGFS+F++IA+ RLQSVSGG+IG  +SANESEIIF G+GTSEGIPRVSCLTDPVKKCP
Subjt:  MVLFVGTTLPPTSMACLAALRTQVQLRSSAFSIARKGFSSFQRIAQSRLQSVSGGEIGGTISANESEIIFIGTGTSEGIPRVSCLTDPVKKCP

SwissProt top hitse value%identityAlignment
O74545 Putative hydrolase C777.06c6.5e-4540Show/hide
Query:  CFKAAEP-GNKNRRLNTSILVRYVGPSGNR--NILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWT-NNVQPSVPIYVAQRDFEV
        C  +  P G KN R NTS+L++    SG+R  NILID GK FY SAL+ F   +IR +DAVI+TH HADAI G+DDLR+WT   +QPSV IY+ +R ++V
Subjt:  CFKAAEP-GNKNRRLNTSILVRYVGPSGNR--NILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWT-NNVQPSVPIYVAQRDFEV

Query:  MQKTHYYLVDTSVILPGAAVSELQFNII-PEEPFVVN--DLKVTPLPVWHG---------RGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDA
        ++++  Y+V+      G +V    F++  P++PF ++  D+ VTPLPV HG         + Y  +GFR G++ YISD + +P  T  L++   V+++DA
Subjt:  MQKTHYYLVDTSVILPGAAVSELQFNII-PEEPFVVN--DLKVTPLPVWHG---------RGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDA

Query:  LRPDRSSSTHFGLPRALEEVRKIQ--PKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDG
        L+ +   S HF   +A E +  ++  P R L+TG  H ++H E    L  LK    +  + +YDG
Subjt:  LRPDRSSSTHFGLPRALEEVRKIQ--PKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDG

P04323 Retrovirus-related Pol polyprotein from transposon 17.63.4e-3027.52Show/hide
Query:  VNRRKLERLNRLGKKF--IKDKESDILDLVVAGHSSPDCSKHVPLKVQELLDQMSPQEYEILHQRMEKLLKKGHIQPSINPCVVPALLTPKKDDS-----
        +N  + +RL  L +K+  I+  E D L        + +   ++PL  +    Q   QE E    +++ +L +G I+ S +P   P  + PKK D+     
Subjt:  VNRRKLERLNRLGKKF--IKDKESDILDLVVAGHSSPDCSKHVPLKVQELLDQMSPQEYEILHQRMEKLLKKGHIQPSINPCVVPALLTPKKDDS-----

Query:  WRMSL------------------------------HFSKVDLKSGYHQIQIRPEDEWKTVFKTNEGPFEWLVMPFGLSNARSTFMHLMQ-----------
        +R+ +                              +F+ +DL  G+HQI++ PE   KT F T  G +E+L MPFGL NA +TF   M            
Subjt:  WRMSL------------------------------HFSKVDLKSGYHQIQIRPEDEWKTVFKTNEGPFEWLVMPFGLSNARSTFMHLMQ-----------

Query:  ------------------QHLKLVFEALEKNEMYINLKKCTSCTGKVSLLGFLICEHQVRMDESNVEAVVNWPTSNSVKEVQAFMGLASFYRKFIRNFSS
                          Q L LVFE L K  + + L KC     + + LG ++    ++ +   +EA+  +P     KE++AF+GL  +YRKFI NF+ 
Subjt:  ------------------QHLKLVFEALEKNEMYINLKKCTSCTGKVSLLGFLICEHQVRMDESNVEAVVNWPTSNSVKEVQAFMGLASFYRKFIRNFSS

Query:  IRAPITDCLEKG--AFTWEKNQQQSF----------------DLIKKKLNTVDASGAGIGAVLSQGRHPMKKENRV-----VDAQSRKGTLLTVLSAEIR
        I  P+T CL+K     T       +F                D  KK   T DAS   +GAVLSQ  HP+   +R      ++  + +  LL ++ A  +
Subjt:  IRAPITDCLEKG--AFTWEKNQQQSF----------------DLIKKKLNTVDASGAGIGAVLSQGRHPMKKENRV-----VDAQSRKGTLLTVLSAEIR

Query:  VFNHGLL
         F H LL
Subjt:  VFNHGLL

P20825 Retrovirus-related Pol polyprotein from transposon 2977.2e-2827.3Show/hide
Query:  DQERWRYTDKWDLLGAFVNRRKLERLNRLGKKF--IKDKESD-------ILDLVVAGHSSPDCSKHVPLKVQELLDQMSPQEYEI-LHQRMEKLLKKGHI
        DQE  +  D        +N+ +  +L  L  KF  ++ KE +       I  ++   H+SP  SK  PL           Q +EI +  +++++L +G I
Subjt:  DQERWRYTDKWDLLGAFVNRRKLERLNRLGKKF--IKDKESD-------ILDLVVAGHSSPDCSKHVPLKVQELLDQMSPQEYEI-LHQRMEKLLKKGHI

Query:  QPSINPCVVPALLTPKKDDS-----WRMSL------------------------------HFSKVDLKSGYHQIQIRPEDEWKTVFKTNEGPFEWLVMPF
        + S +P   P  + PKK D+     +R+ +                              +F+ +DL  G+HQI++  E   KT F T  G +E+L MPF
Subjt:  QPSINPCVVPALLTPKKDDS-----WRMSL------------------------------HFSKVDLKSGYHQIQIRPEDEWKTVFKTNEGPFEWLVMPF

Query:  GLSNARSTFMHLMQ--------------------------QHL---KLVFEALEKNEMYINLKKCTSCTGKVSLLGFLICEHQVRMDESNVEAVVNWPTS
        GL NA +TF   M                           +HL   +LVF  L    + + L KC     + + LG ++    ++ +   V+A+V++P  
Subjt:  GLSNARSTFMHLMQ--------------------------QHL---KLVFEALEKNEMYINLKKCTSCTGKVSLLGFLICEHQVRMDESNVEAVVNWPTS

Query:  NSVKEVQAFMGLASFYRKFIRNFSSIRAPITDCLEKGA----------FTWEKNQQ--------QSFDLIKKKLNTVDASGAGIGAVLSQGRHPMKKENR
           KE++AF+GL  +YRKFI N++ I  P+T CL+K              +EK +         Q  D  KK + T DAS   +GAVLSQ  HP+   +R
Subjt:  NSVKEVQAFMGLASFYRKFIRNFSSIRAPITDCLEKGA----------FTWEKNQQ--------QSFDLIKKKLNTVDASGAGIGAVLSQGRHPMKKENR

Query:  VVD
         ++
Subjt:  VVD

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein5.5e-2829.15Show/hide
Query:  LHQRMEKLLKKGHIQPSINPCVVPALLTPKKDDSWRMSLH------------------------------FSKVDLKSGYHQIQIRPEDEWKTVFKTNEG
        +++ ++KLL    I PS +PC  P +L PKKD ++R+ +                               F+ +DL SGYHQI + P+D +KT F T  G
Subjt:  LHQRMEKLLKKGHIQPSINPCVVPALLTPKKDDSWRMSLH------------------------------FSKVDLKSGYHQIQIRPEDEWKTVFKTNEG

Query:  PFEWLVMPFGLSNARSTFMHLMQ---------------------------QHLKLVFEALEKNEMYINLKKCTSCTGKVSLLGFLICEHQVRMDESNVEA
         +E+ VMPFGL NA STF   M                            +HL  V E L+   + +  KKC   + +   LG+ I   ++   +    A
Subjt:  PFEWLVMPFGLSNARSTFMHLMQ---------------------------QHLKLVFEALEKNEMYINLKKCTSCTGKVSLLGFLICEHQVRMDESNVEA

Query:  VVNWPTSNSVKEVQAFMGLASFYRKFIRNFSSIRAPITDCLEKGAFTWEKNQQQSFDLIKKKL----------------NTVDASGAGIGAVLSQ
        + ++PT  +VK+ Q F+G+ ++YR+FI N S I  PI          W + Q ++ + +K  L                 T DAS  GIGAVL +
Subjt:  VVNWPTSNSVKEVQAFMGLASFYRKFIRNFSSIRAPITDCLEKGAFTWEKNQQQSFDLIKKKL----------------NTVDASGAGIGAVLSQ

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.9e-2829.49Show/hide
Query:  LHQRMEKLLKKGHIQPSINPCVVPALLTPKKDDSWRMSLH------------------------------FSKVDLKSGYHQIQIRPEDEWKTVFKTNEG
        +++ ++KLL    I PS +PC  P +L PKKD ++R+ +                               F+ +DL SGYHQI + P+D +KT F T  G
Subjt:  LHQRMEKLLKKGHIQPSINPCVVPALLTPKKDDSWRMSLH------------------------------FSKVDLKSGYHQIQIRPEDEWKTVFKTNEG

Query:  PFEWLVMPFGLSNARSTFMHLMQ---------------------------QHLKLVFEALEKNEMYINLKKCTSCTGKVSLLGFLICEHQVRMDESNVEA
         +E+ VMPFGL NA STF   M                            +HL  V E L+   + +  KKC   + +   LG+ I   ++   +    A
Subjt:  PFEWLVMPFGLSNARSTFMHLMQ---------------------------QHLKLVFEALEKNEMYINLKKCTSCTGKVSLLGFLICEHQVRMDESNVEA

Query:  VVNWPTSNSVKEVQAFMGLASFYRKFIRNFSSIRAPITDCLEKGAFTWEKNQQQSFDLIKKKL----------------NTVDASGAGIGAVLSQ
        + ++PT  +VK+ Q F+G+ ++YR+FI N S I  PI          W + Q ++ D +K  L                 T DAS  GIGAVL +
Subjt:  VVNWPTSNSVKEVQAFMGLASFYRKFIRNFSSIRAPITDCLEKGAFTWEKNQQQSFDLIKKKL----------------NTVDASGAGIGAVLSQ

Arabidopsis top hitse value%identityAlignment
AT1G30300.1 Metallo-hydrolase/oxidoreductase superfamily protein2.7e-4640.38Show/hide
Query:  NKNRRLNTSILVRYVGPSG-NRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDW-----TNNVQPSVPIYVAQRDFEVMQKTHY
        N N R NTS+L+ Y    G ++ I IDVGK F    LRWF   +I  +D++I+TH HADA+ GLDD+R       TN++ P+ PI+V+Q   E +     
Subjt:  NKNRRLNTSILVRYVGPSG-NRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDW-----TNNVQPSVPIYVAQRDFEVMQKTHY

Query:  YLVDTSVI--LPGAAVSELQFNIIPEE---PFVVNDLKVTPLPVWHGRGYRSLGFRFG---NVCYISDVSEIPEET-YPLLK----DCEVLILDALRPDR
        YLV   +        V++L + +I E+   PFV + L  TPLPV HG  Y  LGF FG    V YISDVS  P  T Y + K      ++LILD L    
Subjt:  YLVDTSVI--LPGAAVSELQFNIIPEE---PFVVNDLKVTPLPVWHGRGYRSLGFRFG---NVCYISDVSEIPEET-YPLLK----DCEVLILDALRPDR

Query:  SSSTHFGLPRALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL
        S +TH   P+ L+ ++++ PKR L  GM H  DH + N +L +  + EG+  +L++DGLR+P+ L
Subjt:  SSSTHFGLPRALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL

AT3G13800.1 Metallo-hydrolase/oxidoreductase superfamily protein1.5e-12681.32Show/hide
Query:  AVCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQ
        +VC KA EPGN+NRRLNTSILVRY+ PSG  NILID GKFFYHSALRWFP F +RT+DAV+ITHSHADAIGGLDDLRDWTNNVQP +PIY A RD EVM+
Subjt:  AVCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQ

Query:  KTHYYLVDTSVILPGAAVSELQFNIIPE-EPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGL
        KTHYYLVDTSVI+PGAAVSEL+F +I E +PFVVNDLK+TPLPVWHG  YRSLGFRFGNVCYISDVS+IPEETYPLLKDC++LI+DALRPDRSS+THFGL
Subjt:  KTHYYLVDTSVILPGAAVSELQFNIIPE-EPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGL

Query:  PRALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL
        PRALEEVRKI+PKRTLFTGMMHLMDHE+V+  L KL+ TEGLD QLSYDGLR+P+++
Subjt:  PRALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL

AT3G13800.1 Metallo-hydrolase/oxidoreductase superfamily protein5.5e-0743.27Show/hide
Query:  LFVGTTLPP-TSMACLAALRTQVQ--LRSSAFSIARKGF-SSFQRIAQSRLQSVS-GGEIGGTISANESEIIFIGTGTSEGIPRVSCLTDPVKKCPQRWT
        L +GT  P   S++C  +LR      LR     I+R    S   +I Q+ LQS S  G+   + S   SEI+F+GTGTSEGIPRVSCLT+P+K C     
Subjt:  LFVGTTLPP-TSMACLAALRTQVQ--LRSSAFSIARKGF-SSFQRIAQSRLQSVS-GGEIGGTISANESEIIFIGTGTSEGIPRVSCLTDPVKKCPQRWT

Query:  CSKS
        C+K+
Subjt:  CSKS

AT3G13800.2 Metallo-hydrolase/oxidoreductase superfamily protein1.5e-12681.32Show/hide
Query:  AVCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQ
        +VC KA EPGN+NRRLNTSILVRY+ PSG  NILID GKFFYHSALRWFP F +RT+DAV+ITHSHADAIGGLDDLRDWTNNVQP +PIY A RD EVM+
Subjt:  AVCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQ

Query:  KTHYYLVDTSVILPGAAVSELQFNIIPE-EPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGL
        KTHYYLVDTSVI+PGAAVSEL+F +I E +PFVVNDLK+TPLPVWHG  YRSLGFRFGNVCYISDVS+IPEETYPLLKDC++LI+DALRPDRSS+THFGL
Subjt:  KTHYYLVDTSVILPGAAVSELQFNIIPE-EPFVVNDLKVTPLPVWHGRGYRSLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGL

Query:  PRALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL
        PRALEEVRKI+PKRTLFTGMMHLMDHE+V+  L KL+ TEGLD QLSYDGLR+P+++
Subjt:  PRALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL

AT3G13800.2 Metallo-hydrolase/oxidoreductase superfamily protein9.7e-0464.52Show/hide
Query:  IGTGTSEGIPRVSCLTDPVKKCPQRWTCSKS
        +GTGTSEGIPRVSCLT+P+K C     C+K+
Subjt:  IGTGTSEGIPRVSCLTDPVKKCPQRWTCSKS

AT4G03610.1 Metallo-hydrolase/oxidoreductase superfamily protein1.9e-3638.01Show/hide
Query:  NKNRRLNTSILVRYVG---PSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQP---------SVPIYVAQRDFEV
        N N R NTS+L+ Y        ++ ILIDVGK F                + +I+TH HADA+ GLD++R    ++QP          +P++++Q   E 
Subjt:  NKNRRLNTSILVRYVG---PSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQP---------SVPIYVAQRDFEV

Query:  MQKTHYYLVDTSVILPGAAVSELQFNIIPE---EPFVVNDLKVTPLPVWHGRGYRSLGFRFGN---VCYISDVSEIPEET-YPLLK----DCEVLILDAL
        +     YLV+  V      VS L +  I E   EPF  + L  TPLPV HG  Y +LGF FG+   V YISDVS IP  T Y + K      ++LILD  
Subjt:  MQKTHYYLVDTSVILPGAAVSELQFNIIPE---EPFVVNDLKVTPLPVWHGRGYRSLGFRFGN---VCYISDVSEIPEET-YPLLK----DCEVLILDAL

Query:  RPDRSS--STHFGLPRALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL
         P +     TH     ALE ++++ PKR L TGM H  DH E N  L +    EG+  QL++DGLR+P+ L
Subjt:  RPDRSS--STHFGLPRALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL

AT4G03610.2 Metallo-hydrolase/oxidoreductase superfamily protein2.0e-3339.48Show/hide
Query:  NKNRRLNTSILVRYVG---PSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQP---------SVPIYVAQRDFEV
        N N R NTS+L+ Y        ++ ILIDVGK F    LRWF  ++I  +D++I+TH HADA+ GLD++R    ++QP          +P++++Q   E 
Subjt:  NKNRRLNTSILVRYVG---PSGNRNILIDVGKFFYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQP---------SVPIYVAQRDFEV

Query:  MQKTHYYLVDTSVILPGAAVSELQFNIIPE---EPFVVNDLKVTPLPVWHGRGYRSLGFRFGN---VCYISDVSEIPEET-YPLLK----DCEVLILDAL
        +     YLV+  V      VS L +  I E   EPF  + L  TPLPV HG  Y +LGF FG+   V YISDVS IP  T Y + K      ++LILD  
Subjt:  MQKTHYYLVDTSVILPGAAVSELQFNIIPE---EPFVVNDLKVTPLPVWHGRGYRSLGFRFGN---VCYISDVSEIPEET-YPLLK----DCEVLILDAL

Query:  RPDRSS--STHFGLPRALEEVRKIQPKRTLFTG
         P +     TH     ALE ++++ PKR L TG
Subjt:  RPDRSS--STHFGLPRALEEVRKIQPKRTLFTG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTTGTTTGTGGGTACGACTCTTCCACCCACTTCTATGGCCTGTCTTGCTGCTCTCAGAACCCAAGTTCAACTTCGAAGCTCTGCATTTTCGATTGCTCGAAAAGG
GTTTTCCTCCTTTCAACGCATTGCCCAATCTCGCCTTCAATCCGTTTCTGGTGGTGAGATTGGAGGGACGATATCTGCCAATGAATCAGAAATAATATTTATAGGTACGG
GTACCAGCGAAGGAATCCCACGAGTGAGCTGCCTGACTGATCCTGTAAAGAAATGTCCGCAGCGGTGGACCTGCTCAAAATCAAAGGTAGACGAAAACCCTGTCCTCTCT
GCAAGATCTACCATGCGTCGCTTGCTGCCCATGGACGCATCTGTGGAGGAGAGAATTGAAGCAACACTTCAAGAATTGAAAGCTAGAGCATTAGCTTGGTGGGAGCAAAT
TGAAGTCAATAGGAGAAGAAATGGTAAAAGATCTATTGCTTCTTGGCAGAAAATGAAGAAATTGAAGAAAGCTCGATTCCTTCCACCCAACTATGAACAAACCTTCTACA
ACCAATATCAAAGCTGCTGGCAAGGAACTCAAAGATTAAAGTTTGATATAGAAGAAAAAGTTAAACTTCAACCTTTCCTAACCCTATCTGATGCCATCACTTATGCAGAA
TCTGTTGAGGAGATTATTGAACCCAGTCAAGAAATACTACCAGAAGAAGCCCGTGGAATCCACCTCTCCAATGCTTGCCCTAAAAGGAGGGCTGTAGCTTTTCTAGAAGA
AGATGAAGACCTCATTGAGGAGCAAGAAGGAGATTTTGATCAAAATGAAGAGATACTGGAGCCTGATGAAGGGGAAAGACTCTCTTGTGTACTTCAAAGAGTGCTAATTG
CACCTAAAATTTATAACTCGCACCAGCATAAGTCTTTTCAAGACCCGATGGATCAAGAAAGGTGGAGATACACAGATAAATGGGATTTGCTCGGTGCCTTTGTCAATCGA
AGGAAGCTAGAAAGACTAAATCGTCTTGGTAAGAAGTTTATTAAAGACAAAGAATCTGATATCTTAGACCTTGTTGTTGCTGGCCATTCTTCCCCAGATTGCTCCAAACA
TGTGCCACTGAAAGTTCAAGAATTGTTGGATCAGATGAGTCCACAGGAGTACGAAATCTTGCATCAACGTATGGAAAAGCTACTGAAAAAGGGTCATATCCAGCCGAGTA
TCAACCCTTGTGTAGTTCCAGCCCTATTGACTCCAAAGAAAGATGACAGTTGGAGAATGAGCCTTCATTTTTCAAAGGTAGACTTGAAGAGTGGCTACCATCAAATCCAA
ATAAGGCCGGAAGATGAATGGAAAACGGTTTTCAAGACAAATGAAGGGCCATTTGAGTGGCTAGTGATGCCTTTCGGCTTGTCCAATGCACGAAGCACTTTCATGCATCT
AATGCAACAGCACCTAAAATTGGTTTTTGAAGCCCTAGAAAAGAATGAGATGTATATTAACTTGAAGAAGTGTACTTCCTGCACAGGGAAAGTTTCACTCCTAGGGTTTC
TTATTTGTGAACACCAAGTAAGGATGGATGAGAGCAATGTTGAAGCTGTGGTAAATTGGCCAACTTCTAATTCAGTGAAGGAAGTCCAAGCCTTCATGGGGCTAGCTTCA
TTTTACAGGAAATTCATTAGAAATTTCAGCTCCATAAGAGCCCCAATAACTGATTGTTTGGAGAAAGGAGCTTTTACTTGGGAGAAGAATCAACAACAAAGCTTTGATTT
AATAAAAAAGAAGCTCAACACAGTAGATGCCAGTGGAGCTGGAATTGGGGCAGTCCTATCACAAGGGAGGCATCCAATGAAGAAAGAAAATAGGGTTGTCGATGCTCAAA
GTAGAAAAGGAACCTTATTGACTGTTCTTTCAGCCGAGATTAGAGTCTTCAATCATGGCCTTTTAAGGTTCGCTACAATGAAGATTTGGGCTGATGTTGAACCCTTCAGG
CAGTTCAATCTTGTAGGCATTTGGCTCATATTTGGCCATTATTCTACATGGGCCAATCTGTTTATCTTTCATCTTTCCATAAGTTTCAGTTGGGAACCTCTTATTCTTCA
AGTGGGCCATTACAAGATCTCGAATTTGGAAATGAACCTCCAATTTGATGTGGAACTACAAGAAGAAGTAGAAGCCATGGCTGAAAGAATCCAAAAACTTCACACAAAAG
TCATAGAACATCTAACTAAGACTGCAGAATCTTACAAAGAGGAGAAAGACAAAAAGAGAAGGGAAGTCCGTTTTCAACTTGGTGACCTGGTAATGGCCCATTTGAAGAAG
AAGAAGAGGTTCCTAGCTGGAACCTATGAAAAGTTAAAAGACAGACAAATTGGCCCGTGTAGAATAATAGCTAAATATAAGCCAAACGCCTATAATCTTGATCTGCCTGA
AGGGATCAACATTAGCCTAGTTTTCAACATTGCAGATCTGAAAAGCGCACACCTCTCCAGTGGCAAGAAGATTCTCACGGCTTCTGCCTTCTCTTCTTCAGCGGTATGCT
TCAAAGCTGCAGAACCGGGCAATAAAAACAGGAGGCTTAATACCAGTATCCTAGTTCGATATGTTGGACCTTCTGGAAATCGTAATATTCTTATCGACGTTGGAAAGTTT
TTCTACCACAGTGCTCTCCGATGGTTTCCAGCATTTGAGATAAGAACAATTGATGCAGTTATTATTACACATTCTCATGCCGATGCAATTGGAGGTCTTGATGATCTTCG
CGATTGGACAAACAATGTCCAGCCTTCAGTCCCAATTTATGTGGCTCAACGTGATTTTGAGGTGATGCAGAAGACTCACTATTATTTGGTAGACACAAGTGTAATTTTAC
CTGGTGCTGCAGTTTCAGAATTGCAGTTCAATATCATACCTGAGGAGCCTTTTGTAGTCAACGATTTGAAGGTTACACCTTTACCAGTTTGGCACGGTCGTGGCTATCGT
TCATTGGGTTTTCGATTTGGGAATGTTTGTTACATTAGTGATGTTAGTGAAATACCTGAAGAAACTTATCCATTGTTGAAAGATTGTGAAGTTCTCATACTGGATGCATT
ACGGCCTGATCGATCTTCTTCCACCCACTTTGGGCTTCCAAGAGCTTTAGAGGAAGTAAGGAAAATACAACCAAAGAGAACTCTATTCACTGGTATGATGCATTTAATGG
ATCATGAAGAAGTAAACAGTTATCTGTTGAAGCTGAAGGAAACTGAAGGTCTTGATGCTCAACTAAGCTACGATGGGCTTCGAATACCAGTAACCCTC
mRNA sequenceShow/hide mRNA sequence
ATGGTTTTGTTTGTGGGTACGACTCTTCCACCCACTTCTATGGCCTGTCTTGCTGCTCTCAGAACCCAAGTTCAACTTCGAAGCTCTGCATTTTCGATTGCTCGAAAAGG
GTTTTCCTCCTTTCAACGCATTGCCCAATCTCGCCTTCAATCCGTTTCTGGTGGTGAGATTGGAGGGACGATATCTGCCAATGAATCAGAAATAATATTTATAGGTACGG
GTACCAGCGAAGGAATCCCACGAGTGAGCTGCCTGACTGATCCTGTAAAGAAATGTCCGCAGCGGTGGACCTGCTCAAAATCAAAGGTAGACGAAAACCCTGTCCTCTCT
GCAAGATCTACCATGCGTCGCTTGCTGCCCATGGACGCATCTGTGGAGGAGAGAATTGAAGCAACACTTCAAGAATTGAAAGCTAGAGCATTAGCTTGGTGGGAGCAAAT
TGAAGTCAATAGGAGAAGAAATGGTAAAAGATCTATTGCTTCTTGGCAGAAAATGAAGAAATTGAAGAAAGCTCGATTCCTTCCACCCAACTATGAACAAACCTTCTACA
ACCAATATCAAAGCTGCTGGCAAGGAACTCAAAGATTAAAGTTTGATATAGAAGAAAAAGTTAAACTTCAACCTTTCCTAACCCTATCTGATGCCATCACTTATGCAGAA
TCTGTTGAGGAGATTATTGAACCCAGTCAAGAAATACTACCAGAAGAAGCCCGTGGAATCCACCTCTCCAATGCTTGCCCTAAAAGGAGGGCTGTAGCTTTTCTAGAAGA
AGATGAAGACCTCATTGAGGAGCAAGAAGGAGATTTTGATCAAAATGAAGAGATACTGGAGCCTGATGAAGGGGAAAGACTCTCTTGTGTACTTCAAAGAGTGCTAATTG
CACCTAAAATTTATAACTCGCACCAGCATAAGTCTTTTCAAGACCCGATGGATCAAGAAAGGTGGAGATACACAGATAAATGGGATTTGCTCGGTGCCTTTGTCAATCGA
AGGAAGCTAGAAAGACTAAATCGTCTTGGTAAGAAGTTTATTAAAGACAAAGAATCTGATATCTTAGACCTTGTTGTTGCTGGCCATTCTTCCCCAGATTGCTCCAAACA
TGTGCCACTGAAAGTTCAAGAATTGTTGGATCAGATGAGTCCACAGGAGTACGAAATCTTGCATCAACGTATGGAAAAGCTACTGAAAAAGGGTCATATCCAGCCGAGTA
TCAACCCTTGTGTAGTTCCAGCCCTATTGACTCCAAAGAAAGATGACAGTTGGAGAATGAGCCTTCATTTTTCAAAGGTAGACTTGAAGAGTGGCTACCATCAAATCCAA
ATAAGGCCGGAAGATGAATGGAAAACGGTTTTCAAGACAAATGAAGGGCCATTTGAGTGGCTAGTGATGCCTTTCGGCTTGTCCAATGCACGAAGCACTTTCATGCATCT
AATGCAACAGCACCTAAAATTGGTTTTTGAAGCCCTAGAAAAGAATGAGATGTATATTAACTTGAAGAAGTGTACTTCCTGCACAGGGAAAGTTTCACTCCTAGGGTTTC
TTATTTGTGAACACCAAGTAAGGATGGATGAGAGCAATGTTGAAGCTGTGGTAAATTGGCCAACTTCTAATTCAGTGAAGGAAGTCCAAGCCTTCATGGGGCTAGCTTCA
TTTTACAGGAAATTCATTAGAAATTTCAGCTCCATAAGAGCCCCAATAACTGATTGTTTGGAGAAAGGAGCTTTTACTTGGGAGAAGAATCAACAACAAAGCTTTGATTT
AATAAAAAAGAAGCTCAACACAGTAGATGCCAGTGGAGCTGGAATTGGGGCAGTCCTATCACAAGGGAGGCATCCAATGAAGAAAGAAAATAGGGTTGTCGATGCTCAAA
GTAGAAAAGGAACCTTATTGACTGTTCTTTCAGCCGAGATTAGAGTCTTCAATCATGGCCTTTTAAGGTTCGCTACAATGAAGATTTGGGCTGATGTTGAACCCTTCAGG
CAGTTCAATCTTGTAGGCATTTGGCTCATATTTGGCCATTATTCTACATGGGCCAATCTGTTTATCTTTCATCTTTCCATAAGTTTCAGTTGGGAACCTCTTATTCTTCA
AGTGGGCCATTACAAGATCTCGAATTTGGAAATGAACCTCCAATTTGATGTGGAACTACAAGAAGAAGTAGAAGCCATGGCTGAAAGAATCCAAAAACTTCACACAAAAG
TCATAGAACATCTAACTAAGACTGCAGAATCTTACAAAGAGGAGAAAGACAAAAAGAGAAGGGAAGTCCGTTTTCAACTTGGTGACCTGGTAATGGCCCATTTGAAGAAG
AAGAAGAGGTTCCTAGCTGGAACCTATGAAAAGTTAAAAGACAGACAAATTGGCCCGTGTAGAATAATAGCTAAATATAAGCCAAACGCCTATAATCTTGATCTGCCTGA
AGGGATCAACATTAGCCTAGTTTTCAACATTGCAGATCTGAAAAGCGCACACCTCTCCAGTGGCAAGAAGATTCTCACGGCTTCTGCCTTCTCTTCTTCAGCGGTATGCT
TCAAAGCTGCAGAACCGGGCAATAAAAACAGGAGGCTTAATACCAGTATCCTAGTTCGATATGTTGGACCTTCTGGAAATCGTAATATTCTTATCGACGTTGGAAAGTTT
TTCTACCACAGTGCTCTCCGATGGTTTCCAGCATTTGAGATAAGAACAATTGATGCAGTTATTATTACACATTCTCATGCCGATGCAATTGGAGGTCTTGATGATCTTCG
CGATTGGACAAACAATGTCCAGCCTTCAGTCCCAATTTATGTGGCTCAACGTGATTTTGAGGTGATGCAGAAGACTCACTATTATTTGGTAGACACAAGTGTAATTTTAC
CTGGTGCTGCAGTTTCAGAATTGCAGTTCAATATCATACCTGAGGAGCCTTTTGTAGTCAACGATTTGAAGGTTACACCTTTACCAGTTTGGCACGGTCGTGGCTATCGT
TCATTGGGTTTTCGATTTGGGAATGTTTGTTACATTAGTGATGTTAGTGAAATACCTGAAGAAACTTATCCATTGTTGAAAGATTGTGAAGTTCTCATACTGGATGCATT
ACGGCCTGATCGATCTTCTTCCACCCACTTTGGGCTTCCAAGAGCTTTAGAGGAAGTAAGGAAAATACAACCAAAGAGAACTCTATTCACTGGTATGATGCATTTAATGG
ATCATGAAGAAGTAAACAGTTATCTGTTGAAGCTGAAGGAAACTGAAGGTCTTGATGCTCAACTAAGCTACGATGGGCTTCGAATACCAGTAACCCTC
Protein sequenceShow/hide protein sequence
MVLFVGTTLPPTSMACLAALRTQVQLRSSAFSIARKGFSSFQRIAQSRLQSVSGGEIGGTISANESEIIFIGTGTSEGIPRVSCLTDPVKKCPQRWTCSKSKVDENPVLS
ARSTMRRLLPMDASVEERIEATLQELKARALAWWEQIEVNRRRNGKRSIASWQKMKKLKKARFLPPNYEQTFYNQYQSCWQGTQRLKFDIEEKVKLQPFLTLSDAITYAE
SVEEIIEPSQEILPEEARGIHLSNACPKRRAVAFLEEDEDLIEEQEGDFDQNEEILEPDEGERLSCVLQRVLIAPKIYNSHQHKSFQDPMDQERWRYTDKWDLLGAFVNR
RKLERLNRLGKKFIKDKESDILDLVVAGHSSPDCSKHVPLKVQELLDQMSPQEYEILHQRMEKLLKKGHIQPSINPCVVPALLTPKKDDSWRMSLHFSKVDLKSGYHQIQ
IRPEDEWKTVFKTNEGPFEWLVMPFGLSNARSTFMHLMQQHLKLVFEALEKNEMYINLKKCTSCTGKVSLLGFLICEHQVRMDESNVEAVVNWPTSNSVKEVQAFMGLAS
FYRKFIRNFSSIRAPITDCLEKGAFTWEKNQQQSFDLIKKKLNTVDASGAGIGAVLSQGRHPMKKENRVVDAQSRKGTLLTVLSAEIRVFNHGLLRFATMKIWADVEPFR
QFNLVGIWLIFGHYSTWANLFIFHLSISFSWEPLILQVGHYKISNLEMNLQFDVELQEEVEAMAERIQKLHTKVIEHLTKTAESYKEEKDKKRREVRFQLGDLVMAHLKK
KKRFLAGTYEKLKDRQIGPCRIIAKYKPNAYNLDLPEGINISLVFNIADLKSAHLSSGKKILTASAFSSSAVCFKAAEPGNKNRRLNTSILVRYVGPSGNRNILIDVGKF
FYHSALRWFPAFEIRTIDAVIITHSHADAIGGLDDLRDWTNNVQPSVPIYVAQRDFEVMQKTHYYLVDTSVILPGAAVSELQFNIIPEEPFVVNDLKVTPLPVWHGRGYR
SLGFRFGNVCYISDVSEIPEETYPLLKDCEVLILDALRPDRSSSTHFGLPRALEEVRKIQPKRTLFTGMMHLMDHEEVNSYLLKLKETEGLDAQLSYDGLRIPVTL