CuGenDBv2

Lag0020965 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0020965
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: RNA-directed DNA polymerase (reverse transcriptase)-related family protein
Genome location: chr7:3507021..3517146
RNA-Seq Expression: Lag0020965
Synteny: Lag0020965
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: IPR001841 - Zinc finger, RING-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017907 - Zinc finger, RING-type, conserved site
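The genome location above is written as `chromosome:start..end`. As a minimal sketch, and assuming the coordinates are 1-based and inclusive (the record does not state this), the string can be split and the genomic span computed as follows. The function names `parse_location` and `span_length` are hypothetical helpers introduced only for this illustration.

```python
import re


def parse_location(text):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr7:3507021..3517146")
print(chrom, start, end, span_length(start, end))
```

Under that coordinate assumption the gene spans 10126 bases on chr7.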


Homology
GenBank top hits (e value, % identity, alignment)
KAG7018870.1 hypothetical protein SDJN02_20743, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.3e-253 | % identity: 67.89
Query:  AGVVERRTVMDFDLNCPPPDECIDPTVPHEEAAQYYNRYPGQAMDNPEFVDEDIAIISPRKFAEARKNFRRNHFESSCGVAVRRNGNTEVYGSLSDVTSW
        A VVERRT +D DLNCPPPDECIDPT PH+EAAQY N +  QA D    VDEDIAIISPRKFAEARKNFRRNHFESS GV VRRNG+TEVY +L+DV+SW
Subjt:  AGVVERRTVMDFDLNCPPPDECIDPTVPHEEAAQYYNRYPGQAMDNPEFVDEDIAIISPRKFAEARKNFRRNHFESSCGVAVRRNGNTEVYGSLSDVTSW

Query:  PPFTIWPPLTISNNVSIQE-QAIHNLDLCLSCESSSRAIKA-TDTDIPSALAQNSSIPPAAPSLRCAICIEPLVEETTTKCGHVFCRNCIETAIATQHRC
        PPFTIW P T  N+VSIQE Q  HNLDL LSCE+SSRA K  TD  IPSA A NSSI PA   LRCAICIEPLVEETTTKCGH+FCRNCIE AIATQH+C
Subjt:  PPFTIWPPLTISNNVSIQE-QAIHNLDLCLSCESSSRAIKA-TDTDIPSALAQNSSIPPAAPSLRCAICIEPLVEETTTKCGHVFCRNCIETAIATQHRC

Query:  PICRRFGFGFYNAFLSSYSKDISVSQPEGVSNWPASIDLAFVFLVHHAGKESENRWLCLYLFCLFSSLSFIKAAFEVDFGGHQVRFCANSGQRKEKTMPE
        PICRR                        +  W +S       L  H G ES+ R +   +                           + G  +   + E
Subjt:  PICRRFGFGFYNAFLSSYSKDISVSQPEGVSNWPASIDLAFVFLVHHAGKESENRWLCLYLFCLFSSLSFIKAAFEVDFGGHQVRFCANSGQRKEKTMPE

Query:  SMEATPSVSPSLDLQAVRS-RISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDDLDAYVERMKEELVMVEAESSKI
           A    S ++ +   RS   S+LEEL RSL EDE+ +TDSLGSEKLLKECA+HLESRLQQVLSECSNV+SFLGIDDLDAYVE MKEELV VEAESSKI
Subjt:  SMEATPSVSPSLDLQAVRS-RISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDDLDAYVERMKEELVMVEAESSKI

Query:  SNEIE------VLKRTNMEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEIFKSL
        SNEIE      VL    +  SNKL++DLE+L  S+D  FTSQD EK TFN CS+NGEDQMN+IV+ ECNAFEVLELDS IEKNKRILKSLQE+DEIFKS+
Subjt:  SNEIE------VLKRTNMEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEIFKSL

Query:  ----------------DVIEQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIIN
                        +  + VEDTIGGLKVI VADNFIRLSL THIPNLE FSSLQRLEGMIEPSEL+HELLIEVLEGTMELKNAEIFPGDVHLHDIIN
Subjt:  ----------------DVIEQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIIN

Query:  ASKSFRNSSLEWFVRKVQDRIVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGVSLSLICKVEK
        ASKS  N SLEWFV+KVQDRIVLCTLRR VVKSANKSSHSF+Y+DQDETI+C MIGGIDA IKV QGWPL+DSPLKL+SLK+SDHYTKG SLSL+CKVEK
Subjt:  ASKSFRNSSLEWFVRKVQDRIVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGVSLSLICKVEK

Query:  MANSLDARVRQNLSTFADAIEKILKEQMHLELQADSAL
        MANSLDAR+RQNLS+FADA+EKILKEQMHLELQAD  L
Subjt:  MANSLDARVRQNLSTFADAIEKILKEQMHLELQADSAL

KGN56746.2 hypothetical protein Csa_011800 [Cucumis sativus] | e value: 8.6e-277 | % identity: 72.04
Query:  MSIHSTNDIREWNAGVVERRTVMDFDLNCPPPDECIDPTVPHEEAAQYYNRYPGQAMDNPEFVDEDIAIISPRKFAEARKNFRRNHFESSCGVAVRRNGN
        MSI S+ND+ +W++ V+ERRTV+DFDLNCPPPDECIDPT   +EAAQYYN Y GQA D    +DEDIAIISPRKFAEARKNFRRNHFES CG  +RRNGN
Subjt:  MSIHSTNDIREWNAGVVERRTVMDFDLNCPPPDECIDPTVPHEEAAQYYNRYPGQAMDNPEFVDEDIAIISPRKFAEARKNFRRNHFESSCGVAVRRNGN

Query:  TEVYGSLSDVTSWPPFTIWPPLTISNNVSIQE-QAIHNLDLCLSCESSSRAIKA-TDTDIPSALAQNSSIPPAAPSLRCAICIEPLVEETTTKCGHVFCR
        TEVYG+LSDVT+WPPFTIW PLTISNNVS+QE Q IHNLDL LSCESSSRA KA TDTDIPS LA +SSIPP   +LRCAICIEPLVEETTTKCGH    
Subjt:  TEVYGSLSDVTSWPPFTIWPPLTISNNVSIQE-QAIHNLDLCLSCESSSRAIKA-TDTDIPSALAQNSSIPPAAPSLRCAICIEPLVEETTTKCGHVFCR

Query:  NCIETAIATQHRCPICRRFGFGFYNAFLSSYSKDISVSQPEGVSNWPASIDLAFVFLVHHAGKESENRWLCLYLFCLFSSLSFIKAAFEVDFGGHQVRFC
                                                                        +EN+ L                           RF 
Subjt:  NCIETAIATQHRCPICRRFGFGFYNAFLSSYSKDISVSQPEGVSNWPASIDLAFVFLVHHAGKESENRWLCLYLFCLFSSLSFIKAAFEVDFGGHQVRFC

Query:  ANSGQRKEKTMPESMEATPSVSPSLDLQAVRSRISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDDLDAYVERMKE
        ANSG+ + K MPESMEATPSV PSLDLQAVR   SELEEL RSLEE+E STTDSLGSEKLL+ECA+HLESR+QQVLSE SNV+SFLGIDDLDAYVE MKE
Subjt:  ANSGQRKEKTMPESMEATPSVSPSLDLQAVRSRISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDDLDAYVERMKE

Query:  ELVMVEAESSKISNEIEVLKRTNMEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELD
        ELV VEAESSKISNEIEVLKRTN+EDSNKLKMDLEVLK S+D  F SQDPE+ATFNC S+NGED MN+IV RECNAFEVLEL+SQIEKNK+ILKSLQE+D
Subjt:  ELVMVEAESSKISNEIEVLKRTNMEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELD

Query:  EIFKSLDVIEQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFRNSSL
        EIFKSLDVIEQVE TIGG+KVIDVADN IRLSLHTHIPN+E FS+LQRLEG+IE SELDHEL+IEVL+GTMELKNAEIFP DVHLHDIINASKS  NSSL
Subjt:  EIFKSLDVIEQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFRNSSL

Query:  EWFVRKVQDRIVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGVSLSLICKVEKMANSLDARVR
        EWFVRKVQDRIVLCTLRR  VKSANKS HSFEYLDQDE I+C+MIGGIDA IKV QGWPL+DSPLKLISLK+SDHYTKGVSLSLICKVEKMANSLDA +R
Subjt:  EWFVRKVQDRIVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGVSLSLICKVEKMANSLDARVR

Query:  QNLSTFADAIEKILKEQMHLELQADS
        +NLS+FADA+EKILKEQMHLELQADS
Subjt:  QNLSTFADAIEKILKEQMHLELQADS

XP_022147070.1 uncharacterized protein LOC111016098 [Momordica charantia] | e value: 5.7e-196 | % identity: 87.08
Query:  MPESMEATPSVSPSLDLQAVRSRISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDDLDAYVERMKEELVMVEAESS
        MPESMEATPSV P LDLQAVR RISELEEL RSLEEDE+S+TDSLGSEKLLKECA+HLESRLQQ+LSE SNV+SFLGIDDLDAYVE MKEELVMVEAESS
Subjt:  MPESMEATPSVSPSLDLQAVRSRISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDDLDAYVERMKEELVMVEAESS

Query:  KISNEIEVLKRTNMEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEIFKSLDVIE
        KISNEIE +KR N+EDSN+LKMDLEVLK S+D   TS+DPEKATFNC S +GEDQ+NMI +RECNAFEVL+LDSQIE+NKRILKSLQELDEIFKSLDVIE
Subjt:  KISNEIEVLKRTNMEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEIFKSLDVIE

Query:  QVEDTIGGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFRNSSLEWFVRKVQDR
        QVEDTIGGLKVIDVA+NFIRLSL THIPNLE  SSLQRLEG+IEPSEL+HELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSF NSSLEWFVRKVQDR
Subjt:  QVEDTIGGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFRNSSLEWFVRKVQDR

Query:  IVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGVSLSLICKVEKMANSLDARVRQNLSTFADAI
        IVLCTLRR VVK+ANKSSHSFEYLD+D TIICTMIGGI A+I+V QGWPLSDSPLKLISLKNSDHYTKG+SLSLICKVEKMANSLD R+R NLS+FADAI
Subjt:  IVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGVSLSLICKVEKMANSLDARVRQNLSTFADAI

Query:  EKILKEQMHLELQADSAL
        EKILKE+MHLEL +D AL
Subjt:  EKILKEQMHLELQADSAL

XP_023528067.1 uncharacterized protein LOC111791098 isoform X1 [Cucurbita pepo subsp. pepo] | e value: 3.0e-197 | % identity: 87.17
Query:  EKTMPESMEATPSVSPSLDLQAVRSRISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDDLDAYVERMKEELVMVEA
        EKTMPESMEATPSVS SLDLQAVRSRISELEEL RSLEEDE+  TDSLGSEKLLKECA+HLESRLQQVLSECSNV+SFL IDDLDAYVE MKEELV VEA
Subjt:  EKTMPESMEATPSVSPSLDLQAVRSRISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDDLDAYVERMKEELVMVEA

Query:  ESSKISNEIEVLKRTNMEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEIFKSLD
        ESSKISNEIEVLKRTN+E SNKL++DLE+L  S+D  FTSQDPE  TFN CS+NGEDQMN+IV+ ECNAFEVLELDS IEKNKRILKSLQE+DEIFKSLD
Subjt:  ESSKISNEIEVLKRTNMEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEIFKSLD

Query:  VIEQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFRNSSLEWFVRKV
        V+EQVEDTIGGLKVIDVADNFIRLSL THIPNLE FSSLQ+LEGMIEPSEL+HELLIEVLEGTMELKNAEIFPGDVHLHDIINASKS  N SLEWFV+KV
Subjt:  VIEQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFRNSSLEWFVRKV

Query:  QDRIVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGVSLSLICKVEKMANSLDARVRQNLSTFA
        QDRIVLCTLRR VVKSANKSSHSF+Y+DQDETI+C+MIGGIDA IKV QGWPL+DSPLKL+SLK+SDHYTKG SLSL+CKVEKMANSLDAR+RQNLS+FA
Subjt:  QDRIVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGVSLSLICKVEKMANSLDARVRQNLSTFA

Query:  DAIEKILKEQMHLELQADSAL
        DA+EKILKEQMHLELQADS L
Subjt:  DAIEKILKEQMHLELQADSAL

XP_023538691.1 uncharacterized protein LOC111799561 [Cucurbita pepo subsp. pepo] | e value: 1.7e-195 | % identity: 87.56
Query:  MPESMEATPSVSPSLDLQAVRSRISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDDLDAYVERMKEELVMVEAESS
        M E MEATPSVSPS+D+QAVRS ISE+EEL RSLEEDE+ TTDSLGSEKLLKEC++ LESRLQQ LSE SNV+SFLGIDDLDAYVERMKEEL+ VEAESS
Subjt:  MPESMEATPSVSPSLDLQAVRSRISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDDLDAYVERMKEELVMVEAESS

Query:  KISNEIEVLKRTNMEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEIFKSLDVIE
        KIS+EIEVLKRT++EDSNKL+MDLEVLK S+D    SQDPEKATFNC SVNGEDQM+MIVERECNAFEVLELDSQIEKN+R LKSLQELDEIFKSLDVIE
Subjt:  KISNEIEVLKRTNMEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEIFKSLDVIE

Query:  QVEDTIGGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFRNSSLEWFVRKVQDR
        QVEDTIGGLKVIDV DNF+R+SLH+HIPNLE FSSLQRLEGMIEPSELDHELLIEVLEGTMEL NAEIFPGDVHLHDIINASKSF NSSLEWFVRKVQDR
Subjt:  QVEDTIGGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFRNSSLEWFVRKVQDR

Query:  IVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGVSLSLICKVEKMANSLDARVRQNLSTFADAI
        IVLC LRR VVKSANKSSHSFEYLDQDETIICTMIGGIDA+IKV QGWPLSDSPLKLISLK+SDHY KGVSLSLICKVEKMANSLD  +R NLS+FADA+
Subjt:  IVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGVSLSLICKVEKMANSLDARVRQNLSTFADAI

Query:  EKILKEQMHLELQADSAL
        EKI+KEQMHLELQ DSAL
Subjt:  EKILKEQMHLELQADSAL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L6Q3 Uncharacterized protein | e value: 3.2e-192 | % identity: 86.3
Query:  MPESMEATPSVSPSLDLQAVRSRISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDDLDAYVERMKEELVMVEAESS
        MPESMEATPSV PSLDLQAVR   SELEEL RSLEE+E STTDSLGSEKLL+ECA+HLESR+QQVLSE SNV+SFLGIDDLDAYVE MKEELV VEAESS
Subjt:  MPESMEATPSVSPSLDLQAVRSRISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDDLDAYVERMKEELVMVEAESS

Query:  KISNEIEVLKRTNMEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEIFKSLDVIE
        KISNEIEVLKRTN+EDSNKLKMDLEVLK S+D  F SQDPE+ATFNC S+NGED MN+IV RECNAFEVLEL+SQIEKNK+ILKSLQE+DEIFKSLDVIE
Subjt:  KISNEIEVLKRTNMEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEIFKSLDVIE

Query:  QVEDTIGGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFRNSSLEWFVRKVQDR
        QVE TIGG+KVIDVADN IRLSLHTHIPN+E FS+LQRLEG+IE SELDHEL+IEVL+GTMELKNAEIFP DVHLHDIINASKS  NSSLEWFVRKVQDR
Subjt:  QVEDTIGGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFRNSSLEWFVRKVQDR

Query:  IVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGVSLSLICKVEKMANSLDARVRQNLSTFADAI
        IVLCTLRR  VKSANKS HSFEYLDQDE I+C+MIGGIDA IKV QGWPL+DSPLKLISLK+SDHYTKGVSLSLICKVEKMANSLDA +R+NLS+FADA+
Subjt:  IVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGVSLSLICKVEKMANSLDARVRQNLSTFADAI

Query:  EKILKEQMHLELQADS
        EKILKEQMHLELQADS
Subjt:  EKILKEQMHLELQADS

A0A6J1CZ44 uncharacterized protein LOC111016098 | e value: 2.8e-196 | % identity: 87.08
Query:  MPESMEATPSVSPSLDLQAVRSRISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDDLDAYVERMKEELVMVEAESS
        MPESMEATPSV P LDLQAVR RISELEEL RSLEEDE+S+TDSLGSEKLLKECA+HLESRLQQ+LSE SNV+SFLGIDDLDAYVE MKEELVMVEAESS
Subjt:  MPESMEATPSVSPSLDLQAVRSRISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDDLDAYVERMKEELVMVEAESS

Query:  KISNEIEVLKRTNMEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEIFKSLDVIE
        KISNEIE +KR N+EDSN+LKMDLEVLK S+D   TS+DPEKATFNC S +GEDQ+NMI +RECNAFEVL+LDSQIE+NKRILKSLQELDEIFKSLDVIE
Subjt:  KISNEIEVLKRTNMEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEIFKSLDVIE

Query:  QVEDTIGGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFRNSSLEWFVRKVQDR
        QVEDTIGGLKVIDVA+NFIRLSL THIPNLE  SSLQRLEG+IEPSEL+HELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSF NSSLEWFVRKVQDR
Subjt:  QVEDTIGGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFRNSSLEWFVRKVQDR

Query:  IVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGVSLSLICKVEKMANSLDARVRQNLSTFADAI
        IVLCTLRR VVK+ANKSSHSFEYLD+D TIICTMIGGI A+I+V QGWPLSDSPLKLISLKNSDHYTKG+SLSLICKVEKMANSLD R+R NLS+FADAI
Subjt:  IVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGVSLSLICKVEKMANSLDARVRQNLSTFADAI

Query:  EKILKEQMHLELQADSAL
        EKILKE+MHLEL +D AL
Subjt:  EKILKEQMHLELQADSAL

A0A6J1E9V8 uncharacterized protein LOC111432106 isoform X3 | e value: 1.3e-193 | % identity: 85.75
Query:  EKTMPESMEATPSVSPSLDLQAVRSRISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDDLDAYVERMKEELVMVEA
        +K MPESMEATPSVS SLDLQAVRSRI ELEEL RSL EDE+ +TDSLGSEKLLKECA+HLESRLQQVLSECSNV+SFLGIDDLDAYVE MKEELV VEA
Subjt:  EKTMPESMEATPSVSPSLDLQAVRSRISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDDLDAYVERMKEELVMVEA

Query:  ESSKISNEIEVLKRTNMEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEIFKSLD
        ESS+ISNEIEVLKRTN+E SNKL+++LE+L  S+D  FTSQDPEK TFN CS+NGEDQMN+IV+RE NAFEVLELDS IEKNKRILKSLQE+DEIFKSLD
Subjt:  ESSKISNEIEVLKRTNMEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEIFKSLD

Query:  VIEQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFRNSSLEWFVRKV
        V+EQVEDTIGGLKVI VADNFIRLSL THIPNLE FSSLQRLEGMIEPSEL+HELLIEVLEGTMELKNAEIFPGDVHLHDIINASKS  N SLEWFV+KV
Subjt:  VIEQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFRNSSLEWFVRKV

Query:  QDRIVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGVSLSLICKVEKMANSLDARVRQNLSTFA
        QDRIVLCTLRR VVKSANKSSHSF+Y+DQDETI+C MIGGIDA IKV QGWPL+DSPLKL+SLK+SDHYTKG SLSL+CKVEKMANSLDAR+RQNLS+FA
Subjt:  QDRIVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGVSLSLICKVEKMANSLDARVRQNLSTFA

Query:  DAIEKILKEQMHLELQADSAL
        DA+EKILKEQMHLEL+AD  L
Subjt:  DAIEKILKEQMHLELQADSAL

A0A6J1F0V9 uncharacterized protein LOC111441363 | e value: 8.9e-195 | % identity: 87.8
Query:  MPESMEATPSVSPSLDLQAVRSRISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDDLDAYVERMKEELVMVEAESS
        M E MEATPSVSPS+D+QAVRS ISELEEL RSLEEDE+ TTDSLGS KLLKEC++ LESRLQQ LSE SNV+SFLGIDDLDAYVERMKEEL+ VEAESS
Subjt:  MPESMEATPSVSPSLDLQAVRSRISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDDLDAYVERMKEELVMVEAESS

Query:  KISNEIEVLKRTNMEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEIFKSLDVIE
        KISNEIEVLKRT++EDSNKL+MDLEVLK S+D    SQDPEKAT NC SVNGEDQM+MIVERECNAFEVLELDSQIEKN+R LKSLQELDEIFKSLDVIE
Subjt:  KISNEIEVLKRTNMEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEIFKSLDVIE

Query:  QVEDTIGGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFRNSSLEWFVRKVQDR
        QVEDTIGGLKVIDV DNFIRLSLH+HIPNLE FSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSF NSSLEWFVRKVQDR
Subjt:  QVEDTIGGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFRNSSLEWFVRKVQDR

Query:  IVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGVSLSLICKVEKMANSLDARVRQNLSTFADAI
        IVLC LRR VVKSANKSSHSFEYLDQDETIICTMIGGIDA+IKV QGWPLSDSPLKLISLK+SDHY  GVSLSLICKVEKMANSLD  +R +LS+FADA+
Subjt:  IVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGVSLSLICKVEKMANSLDARVRQNLSTFADAI

Query:  EKILKEQMHLELQADSAL
        EKI+KEQMHLELQ DSAL
Subjt:  EKILKEQMHLELQADSAL

A0A6J1IHB4 uncharacterized protein LOC111473498 | e value: 2.0e-194 | % identity: 87.32
Query:  MPESMEATPSVSPSLDLQAVRSRISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDDLDAYVERMKEELVMVEAESS
        M E MEATPSVS S+DLQAVRS ISE+EEL RSLEEDE+ TTDSLGSEKLLKEC++ LESRLQQ LSE SNV+S LGIDDLDAYVERMKEEL+ VEAESS
Subjt:  MPESMEATPSVSPSLDLQAVRSRISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDDLDAYVERMKEELVMVEAESS

Query:  KISNEIEVLKRTNMEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEIFKSLDVIE
        KISNEIEVLKRT++EDSNKLKMDLEVLK S+D    SQDPEKATFNC SVNGEDQM+M VERECNAFEVLELDSQIEKN+R LKSLQELDEIFKSLDVIE
Subjt:  KISNEIEVLKRTNMEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEIFKSLDVIE

Query:  QVEDTIGGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFRNSSLEWFVRKVQDR
        QVEDTIGGLKVIDV DNF+RLSLH+H+PNLE FSSLQRLEGMIEPSELDHELLIEVLEGTM+LKNAEIFPGDVHLHDIINASKSF NSSLEWFVRKVQDR
Subjt:  QVEDTIGGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFRNSSLEWFVRKVQDR

Query:  IVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGVSLSLICKVEKMANSLDARVRQNLSTFADAI
        IVLC LRR VVKSANKSSHSFEYLDQDETIICTMIGGIDA+IKV QGWPLSDSPLKLISLK+SDHY KGVSLSLICKVEKMANSLD  +R +LS+FADA+
Subjt:  IVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGVSLSLICKVEKMANSLDARVRQNLSTFADAI

Query:  EKILKEQMHLELQADSAL
        EKI+KEQMHLELQ DSAL
Subjt:  EKILKEQMHLELQADSAL

SwissProt top hits (e value, % identity, alignment)
Q14258 E3 ubiquitin/ISG15 ligase TRIM25 | e value: 5.7e-05 | % identity: 46.94
Query:  PAAPSLRCAICIEPLVEETTTKCGHVFCRNCIETAIATQ---HRCPICR
        P A  L C+IC+EP  E  TT CGH FC +C+    A Q   + CP CR
Subjt:  PAAPSLRCAICIEPLVEETTTKCGHVFCRNCIETAIATQ---HRCPICR

Q61510 E3 ubiquitin/ISG15 ligase TRIM25 | e value: 1.6e-04 | % identity: 38.89
Query:  SSIPPAAPSLRCAICIEPLVEETTTKCGHVFCRNCIETAIATQ---HRCPICRR
        + + P A  L C++C+E   E  TT CGH FC +C++     Q   +RCP CR+
Subjt:  SSIPPAAPSLRCAICIEPLVEETTTKCGHVFCRNCIETAIATQ---HRCPICRR

Q95KF1 E3 ubiquitin-protein ligase RNF125 | e value: 1.9e-05 | % identity: 36.99
Query:  SSSRAIKATDTDIPSALAQNSSIPPAAPSLRCAICIEPLVEETTTKCGHVFCRNCIETAIA-TQHRCPICRRF
        SS     A  +  P AL +         S  CA+C+E L +   T+CGHVFCR+CI T++   +  CP CR +
Subjt:  SSSRAIKATDTDIPSALAQNSSIPPAAPSLRCAICIEPLVEETTTKCGHVFCRNCIETAIA-TQHRCPICRRF

Q96EQ8 E3 ubiquitin-protein ligase RNF125 | e value: 2.5e-05 | % identity: 46.67
Query:  SLRCAICIEPLVEETTTKCGHVFCRNCIETAIA-TQHRCPICRRF
        S  CA+C+E L +   T+CGHVFCR+CI T++   +  CP CR +
Subjt:  SLRCAICIEPLVEETTTKCGHVFCRNCIETAIA-TQHRCPICRRF

Q9D9R0 E3 ubiquitin-protein ligase RNF125 | e value: 1.9e-05 | % identity: 35.9
Query:  LSCESSSRAIKATDTDIPSALAQNSSIPPAAPSLRCAICIEPLVEETTTKCGHVFCRNCIETAIATQHR--CPICRRF
        L    SS++  A+ T  P  L ++        S  C++C+E L +   T+CGHVFCR+CI T+I   ++  CP CR +
Subjt:  LSCESSSRAIKATDTDIPSALAQNSSIPPAAPSLRCAICIEPLVEETTTKCGHVFCRNCIETAIATQHR--CPICRRF
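The % identity figures in these hit lists can be cross-checked against the alignments themselves. The sketch below assumes the conventional BLAST definition (identical columns divided by total aligned columns, gaps included), which this record does not state explicitly, and assumes that letters on the middle row mark identities while `+` marks conservative substitutions. The function name `percent_identity` is a hypothetical helper. The input is the Q96EQ8 (RNF125) alignment from the SwissProt hits above.

```python
def percent_identity(query_row, match_row):
    """Percent identity from an aligned query row and its match (middle) row.

    Letters in the match row are counted as identical columns; '+' and
    spaces are not. The denominator is the full alignment length.
    """
    aligned_columns = len(query_row)
    identical = sum(1 for ch in match_row if ch.isalpha())
    return round(100.0 * identical / aligned_columns, 2)


# Query row and middle row of the Q96EQ8 alignment, copied from the record.
query_row = "SLRCAICIEPLVEETTTKCGHVFCRNCIETAIA-TQHRCPICRRF"
match_row = "S  CA+C+E L +   T+CGHVFCR+CI T++   +  CP CR +"

print(percent_identity(query_row, match_row))
```

This alignment has 21 identical positions over 45 columns, which gives 46.67 and agrees with the value listed for Q96EQ8.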

Arabidopsis top hits (e value, % identity, alignment)
AT3G23910.1 BEST Arabidopsis thaliana protein match is: RNA-directed DNA polymerase (reverse transcriptase)-related family protein (TAIR:AT3G24255.2) | e value: 4.2e-88 | % identity: 44.34
Query:  SLDLQAVRSRISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDDLDAYVERMKEELVMVEAESSKISNEIEVLKRTN
        SLDLQ +R R+ EL+   R+  E+   +  S     ++++  +  E ++++++ E  +V+  L ++D DAY+E ++ EL  VEAES+K+S EIE L +++
Subjt:  SLDLQAVRSRISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDDLDAYVERMKEELVMVEAESSKISNEIEVLKRTN

Query:  MEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEIFKSLDVIEQVEDTIGGLKVID
         +DS++L+ DLE L  S+D   +SQD EK+  N  S +  +   +I   + + F++ EL++Q+E+ + ILKSL++LD + K  D  EQVED + GLKV++
Subjt:  MEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEIFKSLDVIEQVEDTIGGLKVID

Query:  VADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFR-----------NSSLEWFVRKVQDRIV
           NFIRL L T+I  L+ F    + + + EPSEL HELLI + + T E+   E+FP D+++ DII A+ SFR            SS++W V KVQD+I+
Subjt:  VADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFR-----------NSSLEWFVRKVQDRIV

Query:  LCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGVSLSLICKVEKMANSLDARVRQNLSTFADAIEK
          TLR+ +V S+    ++FEY D+DETI+  + GGIDA +KV  GWPL ++PLKL SLKNSD+ +KG+SLSLICKVE++ANSLD   RQNLS F DAIEK
Subjt:  LCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGVSLSLICKVEKMANSLDARVRQNLSTFADAIEK

Query:  ILKEQMHLELQADSA
        IL EQ   ELQ++ +
Subjt:  ILKEQMHLELQADSA

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 1.3e-76 | % identity: 46.69
Query:  DAYVERMKEELVMVEAESSKISNEIEVLKRTNMEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKR
        DAY+E ++ EL  VEAES+K+S EIE L +++  DS++L+ DLE L  S+D   +SQD EK+  N  S +  +   +I   + + F++ EL++Q+E+ + 
Subjt:  DAYVERMKEELVMVEAESSKISNEIEVLKRTNMEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKR

Query:  ILKSLQELDEIFKSLDVIEQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINA
        ILKSL++LD + K  D  EQVED + GLKV++   NFIRL L T+I  L+ F    + + + EPSEL HELLI + + T E+   E+FP D+++ DII A
Subjt:  ILKSLQELDEIFKSLDVIEQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINA

Query:  SKSFR-----------NSSLEWFVRKVQDRIVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGV
        + SFR            SS++W V KVQD+I+  TLR+  V S+    ++FEY D+DETI+  + GGIDA +KV  GWPL ++PLKL SLKNSD+ +KG 
Subjt:  SKSFR-----------NSSLEWFVRKVQDRIVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGV

Query:  SLSLICKVEKMANSLDARVRQNLSTFADAIEKILKEQMHLELQADSA
        SLSLI K+E++ANSLD   RQNLS F DA+EKIL +Q   EL+++ +
Subjt:  SLSLICKVEKMANSLDARVRQNLSTFADAIEKILKEQMHLELQADSA

AT3G24255.2 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 8.5e-81 | % identity: 41.94
Query:  SLDLQAVRSRISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDD-------LDAYVERMKEELVMVEAESSKISNEI
        SLDLQ +R R+ E +   R+  E+   +  S     ++++  +  E ++++++ +  +V+  L +D         DAY+E ++ EL  VEAES+K+S EI
Subjt:  SLDLQAVRSRISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDD-------LDAYVERMKEELVMVEAESSKISNEI

Query:  EVLKRTNMEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEIFKSLDVIEQVEDTI
        E L +++  DS++L+ DLE L  S+D   +SQD EK+  N  S +  +   +I   + + F++ EL++Q+E+ + ILKSL++LD + K  D  EQVED +
Subjt:  EVLKRTNMEDSNKLKMDLEVLKFSVDHHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEIFKSLDVIEQVEDTI

Query:  GGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFR-----------NSSLEWFVR
         GLKV++   NFIRL L T+I  L+ F    + + + EPSEL HELLI + + T E+   E+FP D+++ DII A+ SFR            SS++W V 
Subjt:  GGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGMIEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFR-----------NSSLEWFVR

Query:  KVQDRIVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGVSLSLICKVEKMANSLDARVRQNLST
        KVQD+I+  TLR+  V S+    ++FEY D+DETI+  + GGIDA +KV  GWPL ++PLKL SLKNSD+ +KG SLSLI K+E++ANSLD   RQNLS 
Subjt:  KVQDRIVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSDSPLKLISLKNSDHYTKGVSLSLICKVEKMANSLDARVRQNLST

Query:  FADAIEKILKEQMHLELQADSA
        F DA+EKIL +Q   EL+++ +
Subjt:  FADAIEKILKEQMHLELQADSA

AT5G48655.1 RING/U-box superfamily protein | e value: 9.9e-13 | % identity: 28
Query:  RRTVMDFDLNCPPPDECIDPTVPHEEAAQYYNRYPGQAMDNPEFVDEDIAIISPRKFAEARKNFRRNHFESSCGVAVRRNGNTEVYGSLSDVTSWPPFTI
        RR     DLN  P D+                  P   M + + +++D+   S   FAEA K+  RN       V V   G T                 
Subjt:  RRTVMDFDLNCPPPDECIDPTVPHEEAAQYYNRYPGQAMDNPEFVDEDIAIISPRKFAEARKNFRRNHFESSCGVAVRRNGNTEVYGSLSDVTSWPPFTI

Query:  WPPLTISNNVSIQEQAIHNLDLCLSCESSSRAIKATDTDIPSALAQNS--SIPPAAPSLRCAICIEPLVEETTTKCGHVFCRNCIETAIATQHRCPICRR
          P  ISN    + + I + +  + CE +S      + ++ S ++++   + PP  P   C IC+ P  EE +TKCGH+FC+ CI+ AI+ Q +CP CR+
Subjt:  WPPLTISNNVSIQEQAIHNLDLCLSCESSSRAIKATDTDIPSALAQNS--SIPPAAPSLRCAICIEPLVEETTTKCGHVFCRNCIETAIATQHRCPICRR

AT5G48655.2 RING/U-box superfamily protein | e value: 9.9e-13 | % identity: 28
Query:  RRTVMDFDLNCPPPDECIDPTVPHEEAAQYYNRYPGQAMDNPEFVDEDIAIISPRKFAEARKNFRRNHFESSCGVAVRRNGNTEVYGSLSDVTSWPPFTI
        RR     DLN  P D+                  P   M + + +++D+   S   FAEA K+  RN       V V   G T                 
Subjt:  RRTVMDFDLNCPPPDECIDPTVPHEEAAQYYNRYPGQAMDNPEFVDEDIAIISPRKFAEARKNFRRNHFESSCGVAVRRNGNTEVYGSLSDVTSWPPFTI

Query:  WPPLTISNNVSIQEQAIHNLDLCLSCESSSRAIKATDTDIPSALAQNS--SIPPAAPSLRCAICIEPLVEETTTKCGHVFCRNCIETAIATQHRCPICRR
          P  ISN    + + I + +  + CE +S      + ++ S ++++   + PP  P   C IC+ P  EE +TKCGH+FC+ CI+ AI+ Q +CP CR+
Subjt:  WPPLTISNNVSIQEQAIHNLDLCLSCESSSRAIKATDTDIPSALAQNS--SIPPAAPSLRCAICIEPLVEETTTKCGHVFCRNCIETAIATQHRCPICRR


Sequences
CDS sequence
ATGAGCATTCATAGTACAAATGATATACGGGAGTGGAATGCAGGAGTTGTTGAGAGGAGAACAGTAATGGACTTCGACCTAAATTGCCCACCTCCAGATGAGTGCATCGA
TCCAACTGTCCCTCATGAGGAAGCAGCACAGTACTACAATCGTTACCCAGGACAAGCAATGGACAACCCCGAGTTTGTCGATGAGGATATTGCTATAATCTCCCCAAGGA
AATTTGCCGAAGCCAGGAAGAATTTTCGAAGAAACCACTTTGAGAGTAGCTGTGGAGTAGCCGTCAGACGTAATGGCAACACAGAAGTTTATGGTTCTCTCTCAGATGTA
ACAAGTTGGCCCCCTTTTACAATATGGCCGCCTCTTACAATTAGCAATAACGTATCCATACAAGAACAAGCAATTCACAATTTGGACCTTTGCTTAAGCTGTGAAAGCAG
CAGTAGGGCCATTAAGGCAACTGACACTGACATTCCTTCTGCACTTGCACAAAATAGTAGCATCCCACCAGCAGCCCCGAGTTTGCGGTGTGCGATCTGCATAGAACCAT
TGGTCGAAGAAACAACGACAAAATGCGGGCACGTTTTCTGCAGGAACTGCATCGAAACAGCGATAGCTACCCAGCACAGATGTCCCATATGTCGGCGCTTTGGCTTTGGC
TTTTACAATGCTTTTCTGTCTTCTTACTCTAAGGATATTTCTGTATCTCAGCCTGAAGGCGTTTCAAATTGGCCTGCCTCTATAGACTTGGCTTTTGTGTTCCTTGTTCA
TCATGCTGGAAAAGAATCTGAGAACAGGTGGCTTTGCCTCTACTTATTCTGCCTTTTTTCTTCCTTGTCTTTCATTAAGGCTGCGTTTGAAGTTGATTTTGGAGGCCATC
AAGTTCGGTTCTGTGCAAATTCCGGCCAAAGGAAGGAGAAAACAATGCCAGAATCGATGGAAGCTACTCCGTCTGTATCTCCAAGCCTTGATCTCCAAGCAGTTCGCAGT
CGTATAAGCGAGCTAGAAGAGTTGCATAGATCTTTGGAGGAAGATGAATCTTCTACGACAGATTCATTAGGTTCTGAGAAGTTACTGAAGGAGTGTGCTGTCCATCTCGA
GAGTAGACTACAGCAGGTTCTGTCAGAATGCTCTAACGTTGAGAGTTTCTTGGGGATTGATGATTTAGATGCGTACGTTGAACGTATGAAAGAGGAACTTGTCATGGTGG
AAGCTGAAAGCAGCAAAATCTCCAATGAGATTGAGGTTCTTAAGAGAACCAATATGGAAGATTCTAATAAATTAAAGATGGATCTTGAAGTATTAAAATTTTCGGTAGAT
CATCATTTTACATCACAGGATCCAGAAAAGGCAACATTTAATTGCTGCTCTGTGAATGGTGAAGATCAAATGAACATGATAGTCGAGCGTGAATGCAATGCGTTTGAGGT
ATTGGAACTTGATAGTCAGATTGAGAAGAACAAAAGAATTCTTAAATCTTTGCAGGAACTAGATGAGATATTTAAAAGTTTGGATGTTATCGAACAGGTTGAGGACACAA
TTGGTGGTCTGAAGGTCATTGATGTTGCTGATAATTTCATTCGATTGTCATTACATACACATATTCCAAACTTGGAATATTTTTCAAGCTTACAGAGACTCGAAGGTATG
ATTGAGCCATCTGAATTGGATCACGAGTTGCTAATAGAAGTATTGGAGGGGACAATGGAGTTAAAGAATGCTGAGATCTTTCCTGGTGATGTCCACTTGCACGATATCAT
CAATGCTTCAAAGTCATTCCGAAATTCTTCATTGGAGTGGTTTGTGAGAAAAGTACAAGATAGGATTGTTTTGTGTACTCTCAGGCGAGTTGTTGTGAAGAGTGCAAACA
AATCGAGTCATTCTTTTGAGTATTTAGATCAAGATGAAACGATAATATGTACTATGATCGGAGGAATTGATGCACTTATCAAGGTGTTTCAAGGTTGGCCATTATCCGAT
TCTCCTTTGAAACTTATATCTCTCAAGAACTCAGATCATTATACGAAAGGAGTTTCTTTAAGCCTCATTTGCAAGGTGGAGAAAATGGCAAATTCCTTGGACGCACGCGT
TCGTCAAAATCTATCAACCTTTGCAGACGCTATTGAAAAAATATTGAAGGAGCAAATGCATTTAGAGCTCCAAGCTGACAGTGCTCTCTGA
mRNA sequence
ATGAGCATTCATAGTACAAATGATATACGGGAGTGGAATGCAGGAGTTGTTGAGAGGAGAACAGTAATGGACTTCGACCTAAATTGCCCACCTCCAGATGAGTGCATCGA
TCCAACTGTCCCTCATGAGGAAGCAGCACAGTACTACAATCGTTACCCAGGACAAGCAATGGACAACCCCGAGTTTGTCGATGAGGATATTGCTATAATCTCCCCAAGGA
AATTTGCCGAAGCCAGGAAGAATTTTCGAAGAAACCACTTTGAGAGTAGCTGTGGAGTAGCCGTCAGACGTAATGGCAACACAGAAGTTTATGGTTCTCTCTCAGATGTA
ACAAGTTGGCCCCCTTTTACAATATGGCCGCCTCTTACAATTAGCAATAACGTATCCATACAAGAACAAGCAATTCACAATTTGGACCTTTGCTTAAGCTGTGAAAGCAG
CAGTAGGGCCATTAAGGCAACTGACACTGACATTCCTTCTGCACTTGCACAAAATAGTAGCATCCCACCAGCAGCCCCGAGTTTGCGGTGTGCGATCTGCATAGAACCAT
TGGTCGAAGAAACAACGACAAAATGCGGGCACGTTTTCTGCAGGAACTGCATCGAAACAGCGATAGCTACCCAGCACAGATGTCCCATATGTCGGCGCTTTGGCTTTGGC
TTTTACAATGCTTTTCTGTCTTCTTACTCTAAGGATATTTCTGTATCTCAGCCTGAAGGCGTTTCAAATTGGCCTGCCTCTATAGACTTGGCTTTTGTGTTCCTTGTTCA
TCATGCTGGAAAAGAATCTGAGAACAGGTGGCTTTGCCTCTACTTATTCTGCCTTTTTTCTTCCTTGTCTTTCATTAAGGCTGCGTTTGAAGTTGATTTTGGAGGCCATC
AAGTTCGGTTCTGTGCAAATTCCGGCCAAAGGAAGGAGAAAACAATGCCAGAATCGATGGAAGCTACTCCGTCTGTATCTCCAAGCCTTGATCTCCAAGCAGTTCGCAGT
CGTATAAGCGAGCTAGAAGAGTTGCATAGATCTTTGGAGGAAGATGAATCTTCTACGACAGATTCATTAGGTTCTGAGAAGTTACTGAAGGAGTGTGCTGTCCATCTCGA
GAGTAGACTACAGCAGGTTCTGTCAGAATGCTCTAACGTTGAGAGTTTCTTGGGGATTGATGATTTAGATGCGTACGTTGAACGTATGAAAGAGGAACTTGTCATGGTGG
AAGCTGAAAGCAGCAAAATCTCCAATGAGATTGAGGTTCTTAAGAGAACCAATATGGAAGATTCTAATAAATTAAAGATGGATCTTGAAGTATTAAAATTTTCGGTAGAT
CATCATTTTACATCACAGGATCCAGAAAAGGCAACATTTAATTGCTGCTCTGTGAATGGTGAAGATCAAATGAACATGATAGTCGAGCGTGAATGCAATGCGTTTGAGGT
ATTGGAACTTGATAGTCAGATTGAGAAGAACAAAAGAATTCTTAAATCTTTGCAGGAACTAGATGAGATATTTAAAAGTTTGGATGTTATCGAACAGGTTGAGGACACAA
TTGGTGGTCTGAAGGTCATTGATGTTGCTGATAATTTCATTCGATTGTCATTACATACACATATTCCAAACTTGGAATATTTTTCAAGCTTACAGAGACTCGAAGGTATG
ATTGAGCCATCTGAATTGGATCACGAGTTGCTAATAGAAGTATTGGAGGGGACAATGGAGTTAAAGAATGCTGAGATCTTTCCTGGTGATGTCCACTTGCACGATATCAT
CAATGCTTCAAAGTCATTCCGAAATTCTTCATTGGAGTGGTTTGTGAGAAAAGTACAAGATAGGATTGTTTTGTGTACTCTCAGGCGAGTTGTTGTGAAGAGTGCAAACA
AATCGAGTCATTCTTTTGAGTATTTAGATCAAGATGAAACGATAATATGTACTATGATCGGAGGAATTGATGCACTTATCAAGGTGTTTCAAGGTTGGCCATTATCCGAT
TCTCCTTTGAAACTTATATCTCTCAAGAACTCAGATCATTATACGAAAGGAGTTTCTTTAAGCCTCATTTGCAAGGTGGAGAAAATGGCAAATTCCTTGGACGCACGCGT
TCGTCAAAATCTATCAACCTTTGCAGACGCTATTGAAAAAATATTGAAGGAGCAAATGCATTTAGAGCTCCAAGCTGACAGTGCTCTCTGA
Protein sequence
MSIHSTNDIREWNAGVVERRTVMDFDLNCPPPDECIDPTVPHEEAAQYYNRYPGQAMDNPEFVDEDIAIISPRKFAEARKNFRRNHFESSCGVAVRRNGNTEVYGSLSDV
TSWPPFTIWPPLTISNNVSIQEQAIHNLDLCLSCESSSRAIKATDTDIPSALAQNSSIPPAAPSLRCAICIEPLVEETTTKCGHVFCRNCIETAIATQHRCPICRRFGFG
FYNAFLSSYSKDISVSQPEGVSNWPASIDLAFVFLVHHAGKESENRWLCLYLFCLFSSLSFIKAAFEVDFGGHQVRFCANSGQRKEKTMPESMEATPSVSPSLDLQAVRS
RISELEELHRSLEEDESSTTDSLGSEKLLKECAVHLESRLQQVLSECSNVESFLGIDDLDAYVERMKEELVMVEAESSKISNEIEVLKRTNMEDSNKLKMDLEVLKFSVD
HHFTSQDPEKATFNCCSVNGEDQMNMIVERECNAFEVLELDSQIEKNKRILKSLQELDEIFKSLDVIEQVEDTIGGLKVIDVADNFIRLSLHTHIPNLEYFSSLQRLEGM
IEPSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSFRNSSLEWFVRKVQDRIVLCTLRRVVVKSANKSSHSFEYLDQDETIICTMIGGIDALIKVFQGWPLSD
SPLKLISLKNSDHYTKGVSLSLICKVEKMANSLDARVRQNLSTFADAIEKILKEQMHLELQADSAL
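
The protein sequence can be reproduced from the CDS by codon-by-codon translation. The following is a minimal sketch, assuming the standard nuclear genetic code (the record does not name the translation table used). `translate` is a hypothetical helper, not part of any database tooling. It stops at the first stop codon and raises `KeyError` on ambiguous bases such as N.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (third base varies fastest): TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 18 bases of the CDS listed above.
print(translate("ATGAGCATTCATAGTACAAATGATATACGG"))
print(translate("GCTGACAGTGCTCTCTGA"))
```

The two fragments translate to MSIHSTNDIR and ADSAL, matching the start and end of the protein sequence listed above.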