; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC05g0194 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC05g0194
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionRNA-directed DNA polymerase (reverse transcriptase)-related family protein
Genome locationMC05:1383848..1394600
RNA-Seq ExpressionMC05g0194
SyntenyMC05g0194
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004133985.1 uncharacterized protein LOC101211137 [Cucumis sativus]6.55e-24183.74Show/hide
Query:  MPESMEATPSVPPILDLQAVRCELEEFQRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVERMKEELVMVEAESSKIS
        MPESMEATPSVPP LDLQAVR ELEE QRSLEE+E S+TDSLGSEKLL+ECALHLESR+QQ+LSE+SNVDSFLGIDDLDAYVE MKEELV VEAESSKIS
Subjt:  MPESMEATPSVPPILDLQAVRCELEEFQRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVERMKEELVMVEAESSKIS

Query:  NEIEAIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQVED
        NEIE +KR NIEDSN+LKMDLEVLKLSLDR  S+DPE+ATFNC S +GED +N+I  RECNAFEVL+L+SQIE+NK+ILKSLQE+DEIFKSLDVIEQVE 
Subjt:  NEIEAIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQVED

Query:  TIGGLKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRIVLC
        TIGG+KVIDVA+N IRLSL THIPN+ED S+LQRLEG+IE SEL+HEL+IEVL+GTMELKNAEIFP DVHLHDIINASKS SNSSLEWFVRKVQDRIVLC
Subjt:  TIGGLKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRIVLC

Query:  TLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIEKIL
        TLRRF VK+ANKS HSFEYLD+D  I+C+MIGGI A IKVSQGWPL+DSPLKLISLK+SDHYTKG+SLSLICKVEKMANSLD  IR NLSSFADA+EKIL
Subjt:  TLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIEKIL

Query:  KEQMHLELQSDG
        KEQMHLELQ+D 
Subjt:  KEQMHLELQSDG

XP_022147070.1 uncharacterized protein LOC111016098 [Momordica charantia]1.90e-28298.08Show/hide
Query:  MPESMEATPSVPPILDLQAVRC---ELEEFQRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVERMKEELVMVEAESS
        MPESMEATPSVPPILDLQAVRC   ELEE QRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVE MKEELVMVEAESS
Subjt:  MPESMEATPSVPPILDLQAVRC---ELEEFQRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVERMKEELVMVEAESS

Query:  KISNEIEAIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQ
        KISNEIEAIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQ
Subjt:  KISNEIEAIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQ

Query:  VEDTIGGLKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRI
        VEDTIGGLKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRI
Subjt:  VEDTIGGLKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRI

Query:  VLCTLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIE
        VLCTLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVI+VSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIE
Subjt:  VLCTLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIE

Query:  KILKEQMHLELQSDGAL
        KILKE+MHLEL SDGAL
Subjt:  KILKEQMHLELQSDGAL

XP_023538691.1 uncharacterized protein LOC111799561 [Cucurbita pepo subsp. pepo]3.63e-24184.17Show/hide
Query:  MPESMEATPSVPPILDLQAVRC---ELEEFQRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVERMKEELVMVEAESS
        M E MEATPSV P +D+QAVR    E+EE QRSLEEDEA +TDSLGSEKLLKEC+L LESRLQQ LSE+SNVDSFLGIDDLDAYVERMKEEL+ VEAESS
Subjt:  MPESMEATPSVPPILDLQAVRC---ELEEFQRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVERMKEELVMVEAESS

Query:  KISNEIEAIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQ
        KIS+EIE +KR +IEDSN+L+MDLEVLKLSLDR  S+DPEKATFNC S +GEDQ++MI +RECNAFEVL+LDSQIE+N+R LKSLQELDEIFKSLDVIEQ
Subjt:  KISNEIEAIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQ

Query:  VEDTIGGLKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRI
        VEDTIGGLKVIDV +NF+R+SL +HIPNLE+ SSLQRLEG+IEPSEL+HELLIEVLEGTMEL NAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRI
Subjt:  VEDTIGGLKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRI

Query:  VLCTLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIE
        VLC LRRFVVK+ANKSSHSFEYLD+D TIICTMIGGI AVIKVSQGWPLSDSPLKLISLK+SDHY KG+SLSLICKVEKMANSLDV +RHNLSSFADA+E
Subjt:  VLCTLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIE

Query:  KILKEQMHLELQSDGAL
        KI+KEQMHLELQ D AL
Subjt:  KILKEQMHLELQSDGAL

XP_038897559.1 uncharacterized protein LOC120085576 isoform X1 [Benincasa hispida]2.33e-24585.92Show/hide
Query:  MPESMEATPSVPPILDLQAVRCELEEFQRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVERMKEELVMVEAESSKIS
        MPESMEATPSVPP LDLQ+VR ELEE QRSLEE+EA + DSLGSEKLLKECALHLESRLQQILSE+SNVDSFLGIDDLDAYVE MKEELV VEAESSKIS
Subjt:  MPESMEATPSVPPILDLQAVRCELEEFQRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVERMKEELVMVEAESSKIS

Query:  NEIEAIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQVED
        NEIE +KR NIEDSN+LKMDLEVLKLSLDR TS+DPE+ATFNC S +GEDQ+N+++ RECNAFEVL+L+ QIEQNK+ILKSLQE+D+IFKSLDVIEQVED
Subjt:  NEIEAIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQVED

Query:  TIGGLKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRIVLC
        TIGG+KVIDVA+NFIRLSLRTHIPNLED S+LQRLEG+IE S L+HELLIEVLEGTMELKNAEIFPGDVHLHDIINASKS SNSSLEWFVRKVQDRIVLC
Subjt:  TIGGLKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRIVLC

Query:  TLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIEKIL
        TLRRFVVK+ANKSSHSFEY D+D  IIC+MIGGI A IKVSQGWPL+DSPLKLISLK+SDHYTKG+SLSLICKVEKMANSLD RI  NLSSFADA+EKIL
Subjt:  TLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIEKIL

Query:  KEQMHLELQSDG
        KEQMHLELQ+D 
Subjt:  KEQMHLELQSDG

XP_038897565.1 uncharacterized protein LOC120085576 isoform X2 [Benincasa hispida]4.85e-24285.44Show/hide
Query:  MPESMEATPSVPPILDLQAVRCELEEFQRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVERMKEELVMVEAESSKIS
        MPESMEATPSVPP LDLQ+VR ELEE QRSLEE+EA + DSLGSEKLLKECALHLESRLQQILSE+SNVDSFLGIDDLDAYVE MKEELV VEAESSKIS
Subjt:  MPESMEATPSVPPILDLQAVRCELEEFQRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVERMKEELVMVEAESSKIS

Query:  NEIEAIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQVED
        NEIE +KR NIEDSN+LKMDLEVLKLSLDR TS+DPE+ATFNC S +GEDQ+N+++ RECNAFEVL+L+ QIEQNK+ILKSLQE+D+IFKSLDVIEQVED
Subjt:  NEIEAIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQVED

Query:  TIGGLKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRIVLC
        TIGG+KVIDVA+NFIRLSLRTHIPNLED S+LQRLEG+IE S L+HELLIEVLEGTMELKNAEIFPGDVHLHDIINASKS SNSSLEWFVRKVQDRIVLC
Subjt:  TIGGLKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRIVLC

Query:  TLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIEKIL
        TLRRFVVK+ANKSSHSFEY D+D  IIC+MIGGI A IKVSQGWPL+DSPLKLISLK+SDHYTKG+SLSLICK  KMANSLD RI  NLSSFADA+EKIL
Subjt:  TLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIEKIL

Query:  KEQMHLELQSDG
        KEQMHLELQ+D 
Subjt:  KEQMHLELQSDG

TrEMBL top hitse value%identityAlignment
A0A0A0L6Q3 Uncharacterized protein3.17e-24183.74Show/hide
Query:  MPESMEATPSVPPILDLQAVRCELEEFQRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVERMKEELVMVEAESSKIS
        MPESMEATPSVPP LDLQAVR ELEE QRSLEE+E S+TDSLGSEKLL+ECALHLESR+QQ+LSE+SNVDSFLGIDDLDAYVE MKEELV VEAESSKIS
Subjt:  MPESMEATPSVPPILDLQAVRCELEEFQRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVERMKEELVMVEAESSKIS

Query:  NEIEAIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQVED
        NEIE +KR NIEDSN+LKMDLEVLKLSLDR  S+DPE+ATFNC S +GED +N+I  RECNAFEVL+L+SQIE+NK+ILKSLQE+DEIFKSLDVIEQVE 
Subjt:  NEIEAIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQVED

Query:  TIGGLKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRIVLC
        TIGG+KVIDVA+N IRLSL THIPN+ED S+LQRLEG+IE SEL+HEL+IEVL+GTMELKNAEIFP DVHLHDIINASKS SNSSLEWFVRKVQDRIVLC
Subjt:  TIGGLKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRIVLC

Query:  TLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIEKIL
        TLRRF VK+ANKS HSFEYLD+D  I+C+MIGGI A IKVSQGWPL+DSPLKLISLK+SDHYTKG+SLSLICKVEKMANSLD  IR NLSSFADA+EKIL
Subjt:  TLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIEKIL

Query:  KEQMHLELQSDG
        KEQMHLELQ+D 
Subjt:  KEQMHLELQSDG

A0A5A7U6L2 Uncharacterized protein2.13e-23983.74Show/hide
Query:  MPESMEATPSVPPILDLQAVRCELEEFQRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVERMKEELVMVEAESSKIS
        MPESME TPSVPP LDLQAVR ELEE QRSLEE+E SS DSLGSEKLL+ECALHLESR+QQ+LSE+SNVDSFLGIDDLDAYVE MKEELV VEAESSKIS
Subjt:  MPESMEATPSVPPILDLQAVRCELEEFQRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVERMKEELVMVEAESSKIS

Query:  NEIEAIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQVED
        NEIE +KR  IEDSN+LKMDLEVLKLSLDR  S+DPE+ATFNC S +GED++N+I  RECNAFEVL+L+SQIE+NK+ILKSLQE+DEIFKSLDVIEQVE 
Subjt:  NEIEAIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQVED

Query:  TIGGLKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRIVLC
        TIGG+KVIDVA+N IRLSL THIPN+ED S+LQRLEG+IE SEL+HEL+IEV  GTMELKNAEIFP DVHLHDIINASKS SNSSLEWFVRKVQDRIVLC
Subjt:  TIGGLKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRIVLC

Query:  TLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIEKIL
        TLRRF VK+ANKSSHSFEYLD+D  I+C+MIGGI A IKVSQGWPL+DSPLKLISLK+SDHYTKGISLSLICKVEKMANSLD RIR NLSSFADA+EKIL
Subjt:  TLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIEKIL

Query:  KEQMHLELQSDG
        KEQMHLELQ+D 
Subjt:  KEQMHLELQSDG

A0A6J1CZ44 uncharacterized protein LOC1110160989.18e-28398.08Show/hide
Query:  MPESMEATPSVPPILDLQAVRC---ELEEFQRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVERMKEELVMVEAESS
        MPESMEATPSVPPILDLQAVRC   ELEE QRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVE MKEELVMVEAESS
Subjt:  MPESMEATPSVPPILDLQAVRC---ELEEFQRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVERMKEELVMVEAESS

Query:  KISNEIEAIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQ
        KISNEIEAIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQ
Subjt:  KISNEIEAIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQ

Query:  VEDTIGGLKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRI
        VEDTIGGLKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRI
Subjt:  VEDTIGGLKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRI

Query:  VLCTLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIE
        VLCTLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVI+VSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIE
Subjt:  VLCTLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIE

Query:  KILKEQMHLELQSDGAL
        KILKE+MHLEL SDGAL
Subjt:  KILKEQMHLELQSDGAL

A0A6J1F0V9 uncharacterized protein LOC1114413631.18e-23984.17Show/hide
Query:  MPESMEATPSVPPILDLQAVRC---ELEEFQRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVERMKEELVMVEAESS
        M E MEATPSV P +D+QAVR    ELEE QRSLEEDEA +TDSLGS KLLKEC+L LESRLQQ LSE+SNVDSFLGIDDLDAYVERMKEEL+ VEAESS
Subjt:  MPESMEATPSVPPILDLQAVRC---ELEEFQRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVERMKEELVMVEAESS

Query:  KISNEIEAIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQ
        KISNEIE +KR +IEDSN+L+MDLEVLKLSLDR  S+DPEKAT NC S +GEDQ++MI +RECNAFEVL+LDSQIE+N+R LKSLQELDEIFKSLDVIEQ
Subjt:  KISNEIEAIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQ

Query:  VEDTIGGLKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRI
        VEDTIGGLKVIDV +NFIRLSL +HIPNLE+ SSLQRLEG+IEPSEL+HELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRI
Subjt:  VEDTIGGLKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRI

Query:  VLCTLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIE
        VLC LRRFVVK+ANKSSHSFEYLD+D TIICTMIGGI AVIKV QGWPLSDSPLKLISLK+SDHY  G+SLSLICKVEKMANSLDV +RH+LSSFADA+E
Subjt:  VLCTLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIE

Query:  KILKEQMHLELQSDGAL
        KI+KEQMHLELQ D AL
Subjt:  KILKEQMHLELQSDGAL

A0A6J1IHB4 uncharacterized protein LOC1114734981.18e-23983.93Show/hide
Query:  MPESMEATPSVPPILDLQAVRC---ELEEFQRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVERMKEELVMVEAESS
        M E MEATPSV   +DLQAVR    E+EE QRSLEEDEA +TDSLGSEKLLKEC+L LESRLQQ LSE+SNVDS LGIDDLDAYVERMKEEL+ VEAESS
Subjt:  MPESMEATPSVPPILDLQAVRC---ELEEFQRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVERMKEELVMVEAESS

Query:  KISNEIEAIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQ
        KISNEIE +KR +IEDSN+LKMDLEVLKLSLDR  S+DPEKATFNC S +GEDQ++M  +RECNAFEVL+LDSQIE+N+R LKSLQELDEIFKSLDVIEQ
Subjt:  KISNEIEAIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQ

Query:  VEDTIGGLKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRI
        VEDTIGGLKVIDV +NF+RLSL +H+PNLE+ SSLQRLEG+IEPSEL+HELLIEVLEGTM+LKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRI
Subjt:  VEDTIGGLKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRI

Query:  VLCTLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIE
        VLC LRRFVVK+ANKSSHSFEYLD+D TIICTMIGGI AVIKVSQGWPLSDSPLKLISLK+SDHY KG+SLSLICKVEKMANSLDV +RH+LSSFADA+E
Subjt:  VLCTLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIE

Query:  KILKEQMHLELQSDGAL
        KI+KEQMHLELQ D AL
Subjt:  KILKEQMHLELQSDGAL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G23910.1 BEST Arabidopsis thaliana protein match is: RNA-directed DNA polymerase (reverse transcriptase)-related family protein (TAIR:AT3G24255.2)4.4e-9045.26Show/hide
Query:  LDLQAVR---CELEEFQRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVERMKEELVMVEAESSKISNEIEAIKRNNI
        LDLQ +R    EL+ F R+  E+   S  S     ++++  L  E ++++I+ E+ +VD  L ++D DAY+E ++ EL  VEAES+K+S EIE + +++ 
Subjt:  LDLQAVR---CELEEFQRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVERMKEELVMVEAESSKISNEIEAIKRNNI

Query:  EDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQVEDTIGGLKVIDVA
        +DS+RL+ DLE L LSLD ++S+D EK+  N  S    +   +ID    + F++ +L++Q+E+ + ILKSL++LD + K  D  EQVED + GLKV++  
Subjt:  EDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQVEDTIGGLKVIDVA

Query:  ENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSF-----------SNSSLEWFVRKVQDRIVLC
         NFIRL LRT+I  L+      + + + EPSEL HELLI + + T E+   E+FP D+++ DII A+ SF           + SS++W V KVQD+I+  
Subjt:  ENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSF-----------SNSSLEWFVRKVQDRIVLC

Query:  TLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIEKIL
        TLR+++V ++    ++FEY D+D TI+  + GGI A +KVS GWPL ++PLKL SLKNSD+ +KGISLSLICKVE++ANSLD+  R NLS F DAIEKIL
Subjt:  TLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIEKIL

Query:  KEQMHLELQSD
         EQ   ELQS+
Subjt:  KEQMHLELQSD

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.9e-7846.51Show/hide
Query:  DAYVERMKEELVMVEAESSKISNEIEAIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRI
        DAY+E ++ EL  VEAES+K+S EIE + +++  DS+RL+ DLE L LSLD ++S+D EK+  N  S    +   +ID    + F++ +L++Q+E+ + I
Subjt:  DAYVERMKEELVMVEAESSKISNEIEAIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRI

Query:  LKSLQELDEIFKSLDVIEQVEDTIGGLKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINAS
        LKSL++LD + K  D  EQVED + GLKV++   NFIRL LRT+I  L+      + + + EPSEL HELLI + + T E+   E+FP D+++ DII A+
Subjt:  LKSLQELDEIFKSLDVIEQVEDTIGGLKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINAS

Query:  KSF-----------SNSSLEWFVRKVQDRIVLCTLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGIS
         SF           + SS++W V KVQD+I+  TLR+  V ++    ++FEY D+D TI+  + GGI A +KVS GWPL ++PLKL SLKNSD+ +KG S
Subjt:  KSF-----------SNSSLEWFVRKVQDRIVLCTLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGIS

Query:  LSLICKVEKMANSLDVRIRHNLSSFADAIEKILKEQMHLELQSD
        LSLI K+E++ANSLD+  R NLS F DA+EKIL +Q   EL+S+
Subjt:  LSLICKVEKMANSLDVRIRHNLSSFADAIEKILKEQMHLELQSD

AT3G24255.2 RNA-directed DNA polymerase (reverse transcriptase)-related family protein8.8e-8342.58Show/hide
Query:  LDLQAVRCELEEFQ---RSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDD-------LDAYVERMKEELVMVEAESSKISNEIE
        LDLQ +R  ++EF    R+  E+   S  S     ++++  L  E ++++I+ ++ +VD  L +D         DAY+E ++ EL  VEAES+K+S EIE
Subjt:  LDLQAVRCELEEFQ---RSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDD-------LDAYVERMKEELVMVEAESSKISNEIE

Query:  AIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQVEDTIGG
         + +++  DS+RL+ DLE L LSLD ++S+D EK+  N  S    +   +ID    + F++ +L++Q+E+ + ILKSL++LD + K  D  EQVED + G
Subjt:  AIKRNNIEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQVEDTIGG

Query:  LKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSF-----------SNSSLEWFVRKV
        LKV++   NFIRL LRT+I  L+      + + + EPSEL HELLI + + T E+   E+FP D+++ DII A+ SF           + SS++W V KV
Subjt:  LKVIDVAENFIRLSLRTHIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSF-----------SNSSLEWFVRKV

Query:  QDRIVLCTLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFA
        QD+I+  TLR+  V ++    ++FEY D+D TI+  + GGI A +KVS GWPL ++PLKL SLKNSD+ +KG SLSLI K+E++ANSLD+  R NLS F 
Subjt:  QDRIVLCTLRRFVVKTANKSSHSFEYLDEDNTIICTMIGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFA

Query:  DAIEKILKEQMHLELQSD
        DA+EKIL +Q   EL+S+
Subjt:  DAIEKILKEQMHLELQSD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGGAATCCATGGAAGCAACACCGTCTGTACCTCCAATCCTCGATCTCCAAGCAGTTCGCTGCGAGCTAGAAGAGTTTCAGAGATCTTTGGAGGAAGATGAAGCTTC
TTCGACGGATTCATTAGGTTCTGAGAAGTTGTTGAAGGAGTGTGCTCTCCATCTCGAGAGCAGACTGCAGCAAATTCTGTCAGAATTCTCTAACGTTGATAGTTTCTTGG
GGATTGATGATCTAGATGCGTACGTTGAACGTATGAAAGAGGAACTTGTCATGGTGGAAGCTGAAAGCAGCAAAATCTCCAATGAGATTGAGGCTATTAAGAGAAACAAC
ATAGAAGATTCTAATAGATTGAAGATGGATCTCGAAGTATTAAAATTGTCATTAGATCGTCTTACATCAGAGGATCCAGAAAAGGCAACATTTAATTGCCGCTCTCCGGA
TGGTGAAGATCAAATAAACATGATAGACAAGCGCGAATGCAATGCTTTTGAGGTATTGCAACTTGATAGTCAGATTGAGCAGAACAAAAGAATTCTAAAATCTTTACAGG
AACTGGATGAGATATTTAAAAGTTTGGATGTTATAGAACAGGTTGAGGACACAATTGGTGGCCTGAAGGTCATTGACGTTGCTGAAAATTTCATTAGATTGTCACTACGA
ACACACATTCCAAACTTGGAAGATCTATCAAGTCTACAGAGACTAGAAGGTGTGATTGAGCCATCTGAATTGAATCACGAGTTGCTAATAGAAGTTTTGGAGGGGACAAT
GGAGTTAAAGAATGCTGAGATCTTTCCTGGTGATGTCCACTTGCACGATATCATCAATGCTTCAAAGTCATTCAGCAATTCTTCGTTGGAATGGTTTGTGAGAAAAGTAC
AAGATAGGATTGTTTTGTGTACTCTTAGGCGATTCGTTGTGAAGACTGCAAACAAATCGAGTCATTCCTTTGAGTATTTAGATGAAGACAATACGATAATATGTACTATG
ATTGGAGGAATTGGCGCAGTTATTAAGGTGTCTCAAGGTTGGCCATTATCTGATTCTCCTTTGAAGCTTATATCTCTTAAGAACTCCGACCATTACACAAAAGGAATTTC
CTTGAGCCTCATTTGCAAGGTGGAGAAAATGGCAAACTCCTTGGACGTACGCATTCGCCACAATTTATCAAGCTTTGCAGACGCTATTGAAAAAATATTGAAGGAGCAAA
TGCATTTAGAACTCCAATCGGACGGTGCTCTCTGA
mRNA sequenceShow/hide mRNA sequence
GCCAGGGTTAAACTTAATTTTAATCGAAAATAGTGAGTTATTATTATTATTTGTAATTATTTTCTTTAATTTAGGAAAATAGAATGTTTGAACTTTGAACCAAAAATAAC
TTTTGAACGCTTTAATAAAATTGAAGATTTATTACGAAAATTGATGGCAAAAATGGCAACTTGAACATCGTCTGGGCGGCCCACTAAAGCCCACTGCATCATGGGCTGTA
AAGCCCAATGGGCCTGAGCAGAGCCCATGGCGGGCAGGGCAGGAGAGAGAGTACGGGCAGCAGCGAAGAGGACAAGAGCAGCGAAGATGGCAAATAAAGAATAATTTCGA
GGAAAGGAGGATCGCAAGGATGTAAAGTTTTTCAAAGAAACACCTCTACATATATGGAATGGAAGCCAAGATTACTTTTATATCGATTGCCGCCTCCTCGCCCTTCTCCC
ACTTTCTTCCCCCATTTCTTCGTCTCTGCAAAATTCGTGCGTTTCCTCTTTCTATCGCTCCGCTCCAGCCGCAGTTTGCCGCTCCGTTCCGTAGCAGTTCTGATAGATAA
GAGAGGCTGTTGGAATGAGCATTCAAGGTAGAAATGATCTACATGAATGGAATGCAAGAGTTGTTGAGAGAAGAACAGTAATGGACTTCGACCTCAATTGTCCACCTCCA
GATGAGTGCATCAATCCAACTGGCCCTCGCGAGGAAGCAGCACAGTTCTTTAATCATCACCAACGACAAGCTACAGACAATCCTGACGTTGTTGATGAGGACATTGCTAT
AATATCCCCTAGGAAATTTGCTGAAGCCAGAAAAAATTTTCGAAGAAACCACTTTGACAGTAGCTGTGGTGTAGCCGTCAGACTTAATGGCAACACAGAAGTTTATGGTG
CTCTCTCAGATGTAACAAGTTGGCCCCCTTTTACAATTTGGCCGCCTCTTACAATTAACAATAACGTTTCCATACGGGAACAAACAATTCACAACTTGGACCTTTGCTTG
AGCTCTGAAAGCAGTAGCAGGGCCAGGACTAAGGCAACTGACACTGACATTCCTTCTGCACTTGCACAAAGTAGTAGCATCATCCCGCCTGCAGCCCCCAGTTTGCGGTG
TGCAATCTGCATAGAACCGTTGGTTGAAGAAACAACAACGAAATGTGGGCACGTCTTCTGCAGGAACTGCATCGAGACCGCCATAGCTACGCAGCACAGATGTCCTATAT
GTCGGCGTAAGCTTAGAAAACGAGACATTATCCGGATTTACCTTCCTTTCTCAAGTTAAAAAAAAAATAAAAAAGAAAAAAAAAAGAAGATGAGCACAACACACAAGAGA
ATATTGCATCTGTATTCTTTTTCTGATGATTTGATGACTAGGTTTGTACTTGCACACAGTCCCATTAACAAGTATAGAATTCTTCTTTCTTCTTGTTGTTGTTAATCTCG
TTTTGTATTGAACTATAAAGAATCAAAAAAATTGGTTTGTATTTGACTGAAAAGACAGGCGATTGCGGTTGGTTCATTCATTAAATAGAGTTTATTGATCAGTGAATTAG
AATGGGGAATTGGTTCTCTTTTTGGGCTGTATTTTCTAATGCTTTCCTGTCTTCTTACACTAAGGATACCTTCTGCTCTGCTCTTATGGTGCTTCCTTACTTCCTCCACG
GCGGAAAAGAATCTGGAACCAGGTGGGGTGGTGCTTTTGTCGCTACATATTCTCGAATTTTTTTATTTTTTTCTTTTTGGCGCAAAATATACGAGGTTTCGGGATTCAAA
CTTTTAACTCAGCTCTCGCGACATTTACATCTACATAGAAGATATTAATATAGTTTACAAGTCATAGAAGATATTAATATAGTTTACAAGTCATCTACATCAATGATGAA
CGGAGTAACTATTTTTTGTTGGTTTGATGGTCAAGTGACTTTAGGAACTTAATTTATTTTCCAAATTTTAGTGATTAAAAATATATATTTTGTAGCTCTATAAATGAATA
CTGAATTTAAACTTATTAAAAGTGTATTGGAAATTGTTTTTTTTTTTAAATTTTTGGGAGGAATGTTTATAAGTTTGGTTGTTAAAAATATAAAATGTAAAATATAACTT
TTTGTATCGAGAGAGAATGGATGCAAAATTTTGTAATTTAACGTAATTATGTGGTTTTTGCTATTTTTTAAGTTAAATATTTAAGGTTTAAAAAATCAATCCGGTTCTCA
ATTTCGCGGGCTAAAGCTTGCGGCCTAAGCCCAATTTTCAAGCGCCGGGCTTCATTTTCGGGTTTGGTTAAAAGAAATCCCCTGCGTGAAGTGACGAATACTCCCGCTTC
TCTCTCAATTTGATTTTTCGCGCCGCGAGGAATCTCTCTCGATTCTGTGCAAATTCCGGCAGAGGAAGGAGAAAACAATGCCGGAATCCATGGAAGCAACACCGTCTGTA
CCTCCAATCCTCGATCTCCAAGCAGTTCGCTGCGAGCTAGAAGAGTTTCAGAGATCTTTGGAGGAAGATGAAGCTTCTTCGACGGATTCATTAGGTTCTGAGAAGTTGTT
GAAGGAGTGTGCTCTCCATCTCGAGAGCAGACTGCAGCAAATTCTGTCAGAATTCTCTAACGTTGATAGTTTCTTGGGGATTGATGATCTAGATGCGTACGTTGAACGTA
TGAAAGAGGAACTTGTCATGGTGGAAGCTGAAAGCAGCAAAATCTCCAATGAGATTGAGGCTATTAAGAGAAACAACATAGAAGATTCTAATAGATTGAAGATGGATCTC
GAAGTATTAAAATTGTCATTAGATCGTCTTACATCAGAGGATCCAGAAAAGGCAACATTTAATTGCCGCTCTCCGGATGGTGAAGATCAAATAAACATGATAGACAAGCG
CGAATGCAATGCTTTTGAGGTATTGCAACTTGATAGTCAGATTGAGCAGAACAAAAGAATTCTAAAATCTTTACAGGAACTGGATGAGATATTTAAAAGTTTGGATGTTA
TAGAACAGGTTGAGGACACAATTGGTGGCCTGAAGGTCATTGACGTTGCTGAAAATTTCATTAGATTGTCACTACGAACACACATTCCAAACTTGGAAGATCTATCAAGT
CTACAGAGACTAGAAGGTGTGATTGAGCCATCTGAATTGAATCACGAGTTGCTAATAGAAGTTTTGGAGGGGACAATGGAGTTAAAGAATGCTGAGATCTTTCCTGGTGA
TGTCCACTTGCACGATATCATCAATGCTTCAAAGTCATTCAGCAATTCTTCGTTGGAATGGTTTGTGAGAAAAGTACAAGATAGGATTGTTTTGTGTACTCTTAGGCGAT
TCGTTGTGAAGACTGCAAACAAATCGAGTCATTCCTTTGAGTATTTAGATGAAGACAATACGATAATATGTACTATGATTGGAGGAATTGGCGCAGTTATTAAGGTGTCT
CAAGGTTGGCCATTATCTGATTCTCCTTTGAAGCTTATATCTCTTAAGAACTCCGACCATTACACAAAAGGAATTTCCTTGAGCCTCATTTGCAAGGTGGAGAAAATGGC
AAACTCCTTGGACGTACGCATTCGCCACAATTTATCAAGCTTTGCAGACGCTATTGAAAAAATATTGAAGGAGCAAATGCATTTAGAACTCCAATCGGACGGTGCTCTCT
GATGATGATTAAGAACCAAAGTTCTTCATCATGCAATTCAGTCAGATTTCTACATTGTATCTATTGGATATTTCGGATTGGAACAAATCGAGAGATCATTTTACCAGGTT
AGAAAACTAAACACTTCTCTACTTGTACTAGAGCTCAGTATAAAAGTTGTAGTCTGATGATTTTACATCATCATAATTTTAGCTCGATTATTAATTGCTAATTTGTTTTT
GATAACTGAAATGTTGAAAATCCCACCTTAATAAAAATCCAGAGGCATAAAGAATTTTAATTTATTAAGAACATGTTTAGGAATGATTTTTGAACAAGTAAATTTAGCCT
AAGTGATTTAATTATAGGTAATTTTCTAAAGGGGTGGGTACTAATAAACCATTTATATATTT
Protein sequenceShow/hide protein sequence
MPESMEATPSVPPILDLQAVRCELEEFQRSLEEDEASSTDSLGSEKLLKECALHLESRLQQILSEFSNVDSFLGIDDLDAYVERMKEELVMVEAESSKISNEIEAIKRNN
IEDSNRLKMDLEVLKLSLDRLTSEDPEKATFNCRSPDGEDQINMIDKRECNAFEVLQLDSQIEQNKRILKSLQELDEIFKSLDVIEQVEDTIGGLKVIDVAENFIRLSLR
THIPNLEDLSSLQRLEGVIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFSNSSLEWFVRKVQDRIVLCTLRRFVVKTANKSSHSFEYLDEDNTIICTM
IGGIGAVIKVSQGWPLSDSPLKLISLKNSDHYTKGISLSLICKVEKMANSLDVRIRHNLSSFADAIEKILKEQMHLELQSDGAL