; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi05G018550 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi05G018550
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionRNA-directed DNA polymerase (reverse transcriptase)-related family protein
Genome locationchr05:25784852..25797237
RNA-Seq ExpressionLsi05G018550
SyntenyLsi05G018550
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR001841 - Zinc finger, RING-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017907 - Zinc finger, RING-type, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7018870.1 hypothetical protein SDJN02_20743, partial [Cucurbita argyrosperma subsp. argyrosperma]7.3e-24967.31Show/hide
Query:  ARVVERRTVLDFDLNCPPPDECIDPTGPRDEAAQYYSHYQGQATDAIDEDIAIISPRKFAEARKNFRRNHFESSCGVVIRRNGNTEVYGALSDVTSWPPF
        ARVVERRT LD DLNCPPPDECIDPTGP DEAAQY +H++ QATDA+DEDIAIISPRKFAEARKNFRRNHFESS GVV+RRNG+TEVY AL+DV+SWPPF
Subjt:  ARVVERRTVLDFDLNCPPPDECIDPTGPRDEAAQYYSHYQGQATDAIDEDIAIISPRKFAEARKNFRRNHFESSCGVVIRRNGNTEVYGALSDVTSWPPF

Query:  TIWSPLTISNNVSIQE-QTLHNLDLRLSCESSSRANKATTD--SDTAHALNSSIQPTDRTLRCAICIEPLVEETTTKCGHVFCRNCIETAIATQHRCPIC
        TIWSP T  N+VSIQE QT HNLDLRLSCE+SSRA K  TD    +A ALNSSI P DR LRCAICIEPLVEETTTKCGH+FCRNCIE AIATQH+CPIC
Subjt:  TIWSPLTISNNVSIQE-QTLHNLDLRLSCESSSRANKATTD--SDTAHALNSSIQPTDRTLRCAICIEPLVEETTTKCGHVFCRNCIETAIATQHRCPIC

Query:  RPKAQ-SSSSHF----RTNNARNLSRFCANSGRRRKMMPE----SMEATPSVPPSLDLQTVRSELEELQRSLEENEAYTTDSLGSEKLLKECALHLESRL
        R   +  SSS F      +  R ++R  +    R   + E     ++ + +V          S+LEELQRSL E+EAY+TDSLGSEKLLKECALHLESRL
Subjt:  RPKAQ-SSSSHF----RTNNARNLSRFCANSGRRRKMMPE----SMEATPSVPPSLDLQTVRSELEELQRSLEENEAYTTDSLGSEKLLKECALHLESRL

Query:  QQILSEYSNVDSFLGIDDLDAYIEHMKEELVAVEAESSKISNEIEDP------------------------------------EEATLNCSSMNGEDQMN
        QQ+LSE SNVDSFLGIDDLDAY+EHMKEELVAVEAESSKISNEIE P                                    E+ T N  SMNGEDQMN
Subjt:  QQILSEYSNVDSFLGIDDLDAYIEHMKEELVAVEAESSKISNEIEDP------------------------------------EEATLNCSSMNGEDQMN

Query:  MIVNRECNAFEVLELDSQIEKNKKVLKSLQEVDEIFKREVPWCHSQLREKLEVWSAMIFQGSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAIP
        +IV+ ECNAFEVLELDS IEKNK++LKSLQEVDEIFK                               AP C      S  FL     P  C+  P    
Subjt:  MIVNRECNAFEVLELDSQIEKNKKVLKSLQEVDEIFKREVPWCHSQLREKLEVWSAMIFQGSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAIP

Query:  SDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTHIPNLEDFSTLQRLEGTIEKSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSIG
                     + VEDTIGG+ VI VADNFIRLSLRTHIPNLEDFS+LQRLEG IE SEL+HELLIEVLEGTMELKNAEIFPGDVHLHDII ASKS+ 
Subjt:  SDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTHIPNLEDFSTLQRLEGTIEKSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSIG

Query:  NSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIICSMIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSLD
        N SL+WFV+KVQDRIVLCTLRRFVVKSANKSSHSF+Y+DQDETI+C MIGG+DA IKV+QGWPLA+SPLKLVSLKSSDHYTKG SLSL+CKVEKMANSLD
Subjt:  NSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIICSMIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSLD

Query:  ARIRRNLSSFADAVEKILKAQMHLELQADSG
        ARIR+NLSSFADAVEKILK QMHLELQAD G
Subjt:  ARIRRNLSSFADAVEKILKAQMHLELQADSG

KGN56746.2 hypothetical protein Csa_011800 [Cucumis sativus]1.6e-28073.36Show/hide
Query:  MSIQSSNDLRDWSARVVERRTVLDFDLNCPPPDECIDPTGPRDEAAQYYSHYQGQATDAIDEDIAIISPRKFAEARKNFRRNHFESSCGVVIRRNGNTEV
        MSIQSSNDL DWS+RV+ERRTVLDFDLNCPPPDECIDPTG  DEAAQYY+HYQGQATDAIDEDIAIISPRKFAEARKNFRRNHFES CG VIRRNGNTEV
Subjt:  MSIQSSNDLRDWSARVVERRTVLDFDLNCPPPDECIDPTGPRDEAAQYYSHYQGQATDAIDEDIAIISPRKFAEARKNFRRNHFESSCGVVIRRNGNTEV

Query:  YGALSDVTSWPPFTIWSPLTISNNVSIQE-QTLHNLDLRLSCESSSRANKATTDSD--TAHALNSSIQPTDRTLRCAICIEPLVEETTTKCGHVFCRNCI
        YGALSDVT+WPPFTIWSPLTISNNVS+QE QT+HNLDLRLSCESSSRA KA TD+D  +  AL+SSI PTDRTLRCAICIEPLVEETTTKCGH       
Subjt:  YGALSDVTSWPPFTIWSPLTISNNVSIQE-QTLHNLDLRLSCESSSRANKATTDSD--TAHALNSSIQPTDRTLRCAICIEPLVEETTTKCGHVFCRNCI

Query:  ETAIATQHRCPICRPKAQSSSSHFRTNNA--RNLSRFCANSGR-RRKMMPESMEATPSVPPSLDLQTVRSELEELQRSLEENEAYTTDSLGSEKLLKECA
                                 TNNA  +NLSRF ANSGR RRKMMPESMEATPSVPPSLDLQ VRSELEELQRSLEENE  TTDSLGSEKLL+ECA
Subjt:  ETAIATQHRCPICRPKAQSSSSHFRTNNA--RNLSRFCANSGR-RRKMMPESMEATPSVPPSLDLQTVRSELEELQRSLEENEAYTTDSLGSEKLLKECA

Query:  LHLESRLQQILSEYSNVDSFLGIDDLDAYIEHMKEELVAVEAESSKISNEIE------------------------------DPEEATLNCSSMNGEDQM
        LHLESR+QQ+LSEYSNVDSFLGIDDLDAY+EHMKEELVAVEAESSKISNEIE                              DPEEAT NCSSMNGED M
Subjt:  LHLESRLQQILSEYSNVDSFLGIDDLDAYIEHMKEELVAVEAESSKISNEIE------------------------------DPEEATLNCSSMNGEDQM

Query:  NMIVNRECNAFEVLELDSQIEKNKKVLKSLQEVDEIFKREVPWCHSQLREKLEVWSAMIFQGSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAI
        N+IVNRECNAFEVLEL+SQIEKNKK+LKSLQEVDEIFK                                                              
Subjt:  NMIVNRECNAFEVLELDSQIEKNKKVLKSLQEVDEIFKREVPWCHSQLREKLEVWSAMIFQGSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAI

Query:  PSDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTHIPNLEDFSTLQRLEGTIEKSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSI
                  LDVIEQVE TIGGM VIDVADN IRLSL THIPN+EDFSTLQRLEG IEKSELDHEL+IEVL+GTMELKNAEIFP DVHLHDII ASKSI
Subjt:  PSDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTHIPNLEDFSTLQRLEGTIEKSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSI

Query:  GNSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIICSMIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSL
         NSSL+WFVRKVQDRIVLCTLRRF VKSANKS HSFEYLDQDE I+CSMIGG+DACIKV+QGWPLA+SPLKL+SLKSSDHYTKGVSLSLICKVEKMANSL
Subjt:  GNSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIICSMIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSL

Query:  DARIRRNLSSFADAVEKILKAQMHLELQADSG
        DA IRRNLSSFADAVEKILK QMHLELQADSG
Subjt:  DARIRRNLSSFADAVEKILKAQMHLELQADSG

XP_004133985.1 uncharacterized protein LOC101211137 [Cucumis sativus]1.7e-17671.96Show/hide
Query:  MMPESMEATPSVPPSLDLQTVRSELEELQRSLEENEAYTTDSLGSEKLLKECALHLESRLQQILSEYSNVDSFLGIDDLDAYIEHMKEELVAVEAESSKI
        MMPESMEATPSVPPSLDLQ VRSELEELQRSLEENE  TTDSLGSEKLL+ECALHLESR+QQ+LSEYSNVDSFLGIDDLDAY+EHMKEELVAVEAESSKI
Subjt:  MMPESMEATPSVPPSLDLQTVRSELEELQRSLEENEAYTTDSLGSEKLLKECALHLESRLQQILSEYSNVDSFLGIDDLDAYIEHMKEELVAVEAESSKI

Query:  SNEIE------------------------------DPEEATLNCSSMNGEDQMNMIVNRECNAFEVLELDSQIEKNKKVLKSLQEVDEIFKREVPWCHSQ
        SNEIE                              DPEEAT NCSSMNGED MN+IVNRECNAFEVLEL+SQIEKNKK+LKSLQEVDEIFK         
Subjt:  SNEIE------------------------------DPEEATLNCSSMNGEDQMNMIVNRECNAFEVLELDSQIEKNKKVLKSLQEVDEIFKREVPWCHSQ

Query:  LREKLEVWSAMIFQGSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAIPSDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTHIPNLED
                                                                       LDVIEQVE TIGGM VIDVADN IRLSL THIPN+ED
Subjt:  LREKLEVWSAMIFQGSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAIPSDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTHIPNLED

Query:  FSTLQRLEGTIEKSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSIGNSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIIC
        FSTLQRLEG IEKSELDHEL+IEVL+GTMELKNAEIFP DVHLHDII ASKSI NSSL+WFVRKVQDRIVLCTLRRF VKSANKS HSFEYLDQDE I+C
Subjt:  FSTLQRLEGTIEKSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSIGNSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIIC

Query:  SMIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSLDARIRRNLSSFADAVEKILKAQMHLELQADSG
        SMIGG+DACIKV+QGWPLA+SPLKL+SLKSSDHYTKGVSLSLICKVEKMANSLDA IRRNLSSFADAVEKILK QMHLELQADSG
Subjt:  SMIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSLDARIRRNLSSFADAVEKILKAQMHLELQADSG

XP_038897559.1 uncharacterized protein LOC120085576 isoform X1 [Benincasa hispida]3.6e-17973.97Show/hide
Query:  MPESMEATPSVPPSLDLQTVRSELEELQRSLEENEAYTTDSLGSEKLLKECALHLESRLQQILSEYSNVDSFLGIDDLDAYIEHMKEELVAVEAESSKIS
        MPESMEATPSVPPSLDLQ+VRSELEELQRSLEENEAYT DSLGSEKLLKECALHLESRLQQILSEYSNVDSFLGIDDLDAY+EHMKEELVAVEAESSKIS
Subjt:  MPESMEATPSVPPSLDLQTVRSELEELQRSLEENEAYTTDSLGSEKLLKECALHLESRLQQILSEYSNVDSFLGIDDLDAYIEHMKEELVAVEAESSKIS

Query:  NEIE------------------------------DPEEATLNCSSMNGEDQMNMIVNRECNAFEVLELDSQIEKNKKVLKSLQEVDEIFKREVPWCHSQL
        NEIE                              DPEEAT NCSSMNGEDQMN IVNRECNAFEVLEL+ QIE+NKK+LKSLQEVD+IFK          
Subjt:  NEIE------------------------------DPEEATLNCSSMNGEDQMNMIVNRECNAFEVLELDSQIEKNKKVLKSLQEVDEIFKREVPWCHSQL

Query:  REKLEVWSAMIFQGSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAIPSDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTHIPNLEDF
                                                                      LDVIEQVEDTIGGM VIDVADNFIRLSLRTHIPNLEDF
Subjt:  REKLEVWSAMIFQGSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAIPSDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTHIPNLEDF

Query:  STLQRLEGTIEKSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSIGNSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIICS
        STLQRLEG IEKS LDHELLIEVLEGTMELKNAEIFPGDVHLHDII ASKSI NSSL+WFVRKVQDRIVLCTLRRFVVKSANKSSHSFEY DQDE IICS
Subjt:  STLQRLEGTIEKSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSIGNSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIICS

Query:  MIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSLDARIRRNLSSFADAVEKILKAQMHLELQADSG
        MIGG+DACIKV+QGWPLA+SPLKL+SLKSSDHYTKGVSLSLICKVEKMANSLDARI RNLSSFADAVEKILK QMHLELQADSG
Subjt:  MIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSLDARIRRNLSSFADAVEKILKAQMHLELQADSG

XP_038897565.1 uncharacterized protein LOC120085576 isoform X2 [Benincasa hispida]1.7e-17673.55Show/hide
Query:  MPESMEATPSVPPSLDLQTVRSELEELQRSLEENEAYTTDSLGSEKLLKECALHLESRLQQILSEYSNVDSFLGIDDLDAYIEHMKEELVAVEAESSKIS
        MPESMEATPSVPPSLDLQ+VRSELEELQRSLEENEAYT DSLGSEKLLKECALHLESRLQQILSEYSNVDSFLGIDDLDAY+EHMKEELVAVEAESSKIS
Subjt:  MPESMEATPSVPPSLDLQTVRSELEELQRSLEENEAYTTDSLGSEKLLKECALHLESRLQQILSEYSNVDSFLGIDDLDAYIEHMKEELVAVEAESSKIS

Query:  NEIE------------------------------DPEEATLNCSSMNGEDQMNMIVNRECNAFEVLELDSQIEKNKKVLKSLQEVDEIFKREVPWCHSQL
        NEIE                              DPEEAT NCSSMNGEDQMN IVNRECNAFEVLEL+ QIE+NKK+LKSLQEVD+IFK          
Subjt:  NEIE------------------------------DPEEATLNCSSMNGEDQMNMIVNRECNAFEVLELDSQIEKNKKVLKSLQEVDEIFKREVPWCHSQL

Query:  REKLEVWSAMIFQGSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAIPSDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTHIPNLEDF
                                                                      LDVIEQVEDTIGGM VIDVADNFIRLSLRTHIPNLEDF
Subjt:  REKLEVWSAMIFQGSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAIPSDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTHIPNLEDF

Query:  STLQRLEGTIEKSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSIGNSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIICS
        STLQRLEG IEKS LDHELLIEVLEGTMELKNAEIFPGDVHLHDII ASKSI NSSL+WFVRKVQDRIVLCTLRRFVVKSANKSSHSFEY DQDE IICS
Subjt:  STLQRLEGTIEKSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSIGNSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIICS

Query:  MIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSLDARIRRNLSSFADAVEKILKAQMHLELQADSG
        MIGG+DACIKV+QGWPLA+SPLKL+SLKSSDHYTKGVSLSLICK  KMANSLDARI RNLSSFADAVEKILK QMHLELQADSG
Subjt:  MIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSLDARIRRNLSSFADAVEKILKAQMHLELQADSG

TrEMBL top hitse value%identityAlignment
A0A0A0L6Q3 Uncharacterized protein8.0e-17771.96Show/hide
Query:  MMPESMEATPSVPPSLDLQTVRSELEELQRSLEENEAYTTDSLGSEKLLKECALHLESRLQQILSEYSNVDSFLGIDDLDAYIEHMKEELVAVEAESSKI
        MMPESMEATPSVPPSLDLQ VRSELEELQRSLEENE  TTDSLGSEKLL+ECALHLESR+QQ+LSEYSNVDSFLGIDDLDAY+EHMKEELVAVEAESSKI
Subjt:  MMPESMEATPSVPPSLDLQTVRSELEELQRSLEENEAYTTDSLGSEKLLKECALHLESRLQQILSEYSNVDSFLGIDDLDAYIEHMKEELVAVEAESSKI

Query:  SNEIE------------------------------DPEEATLNCSSMNGEDQMNMIVNRECNAFEVLELDSQIEKNKKVLKSLQEVDEIFKREVPWCHSQ
        SNEIE                              DPEEAT NCSSMNGED MN+IVNRECNAFEVLEL+SQIEKNKK+LKSLQEVDEIFK         
Subjt:  SNEIE------------------------------DPEEATLNCSSMNGEDQMNMIVNRECNAFEVLELDSQIEKNKKVLKSLQEVDEIFKREVPWCHSQ

Query:  LREKLEVWSAMIFQGSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAIPSDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTHIPNLED
                                                                       LDVIEQVE TIGGM VIDVADN IRLSL THIPN+ED
Subjt:  LREKLEVWSAMIFQGSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAIPSDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTHIPNLED

Query:  FSTLQRLEGTIEKSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSIGNSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIIC
        FSTLQRLEG IEKSELDHEL+IEVL+GTMELKNAEIFP DVHLHDII ASKSI NSSL+WFVRKVQDRIVLCTLRRF VKSANKS HSFEYLDQDE I+C
Subjt:  FSTLQRLEGTIEKSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSIGNSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIIC

Query:  SMIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSLDARIRRNLSSFADAVEKILKAQMHLELQADSG
        SMIGG+DACIKV+QGWPLA+SPLKL+SLKSSDHYTKGVSLSLICKVEKMANSLDA IRRNLSSFADAVEKILK QMHLELQADSG
Subjt:  SMIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSLDARIRRNLSSFADAVEKILKAQMHLELQADSG

A0A1S3AVT7 uncharacterized protein LOC1034834692.9e-17170.1Show/hide
Query:  MMPESMEATPSVPPSLDLQTVRSELEELQRSLEENEAYTTDSLGSEKLLKECALHLESRLQQILSEYSNVDSFLGIDDLDAYIEHMKEELVAVEAESSKI
        MM ES+E TPSVPPSLDLQ VRSELEELQRSLEENE  + DSLGSEKLL+ECALHLESR+QQ+LSEYSNVDSFLGIDDLDAY+E+MKEELVAVEAESSKI
Subjt:  MMPESMEATPSVPPSLDLQTVRSELEELQRSLEENEAYTTDSLGSEKLLKECALHLESRLQQILSEYSNVDSFLGIDDLDAYIEHMKEELVAVEAESSKI

Query:  SNEIE------------------------------DPEEATLNCSSMNGEDQMNMIVNRECNAFEVLELDSQIEKNKKVLKSLQEVDEIFKREVPWCHSQ
        SNEIE                              DPEEAT NCSSMNGED+MN+IV+RECNAFEVLEL+SQIEKNKK+LKSLQEVDEIFK         
Subjt:  SNEIE------------------------------DPEEATLNCSSMNGEDQMNMIVNRECNAFEVLELDSQIEKNKKVLKSLQEVDEIFKREVPWCHSQ

Query:  LREKLEVWSAMIFQGSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAIPSDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTHIPNLED
                                                                       LDVIEQVE TIGGM VIDVADN IRLSL THIPN+ED
Subjt:  LREKLEVWSAMIFQGSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAIPSDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTHIPNLED

Query:  FSTLQRLEGTIEKSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSIGNSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIIC
        FSTLQRLEG IEKSE DHEL+IEV  GTMELKNAEIFP DVHLHDII ASKSI NSSL+WFVRKVQDRIVLCTLRRF VKSANKSSHSFEYLDQDE I+C
Subjt:  FSTLQRLEGTIEKSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSIGNSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIIC

Query:  SMIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSLDARIRRNLSSFADAVEKILKAQMHLELQADSG
        SMIGG+DACIKV+QGWPLA+SPLKL+SLKSSDHYTKG+SLSLICKVEKMANSLDARIR+NLSSFADAVEKILK QMHLELQADSG
Subjt:  SMIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSLDARIRRNLSSFADAVEKILKAQMHLELQADSG

A0A5A7U6L2 Uncharacterized protein6.3e-17470.72Show/hide
Query:  MMPESMEATPSVPPSLDLQTVRSELEELQRSLEENEAYTTDSLGSEKLLKECALHLESRLQQILSEYSNVDSFLGIDDLDAYIEHMKEELVAVEAESSKI
        MMPESME TPSVPPSLDLQ VRSELEELQRSLEENE  + DSLGSEKLL+ECALHLESR+QQ+LSEYSNVDSFLGIDDLDAY+EHMKEELVAVEAESSKI
Subjt:  MMPESMEATPSVPPSLDLQTVRSELEELQRSLEENEAYTTDSLGSEKLLKECALHLESRLQQILSEYSNVDSFLGIDDLDAYIEHMKEELVAVEAESSKI

Query:  SNEIE------------------------------DPEEATLNCSSMNGEDQMNMIVNRECNAFEVLELDSQIEKNKKVLKSLQEVDEIFKREVPWCHSQ
        SNEIE                              DPEEAT NCSSMNGED+MN+IV+RECNAFEVLEL+SQIEKNKK+LKSLQEVDEIFK         
Subjt:  SNEIE------------------------------DPEEATLNCSSMNGEDQMNMIVNRECNAFEVLELDSQIEKNKKVLKSLQEVDEIFKREVPWCHSQ

Query:  LREKLEVWSAMIFQGSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAIPSDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTHIPNLED
                                                                       LDVIEQVE TIGGM VIDVADN IRLSL THIPN+ED
Subjt:  LREKLEVWSAMIFQGSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAIPSDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTHIPNLED

Query:  FSTLQRLEGTIEKSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSIGNSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIIC
        FSTLQRLEG IEKSELDHEL+IEV  GTMELKNAEIFP DVHLHDII ASKSI NSSL+WFVRKVQDRIVLCTLRRF VKSANKSSHSFEYLDQDE I+C
Subjt:  FSTLQRLEGTIEKSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSIGNSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIIC

Query:  SMIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSLDARIRRNLSSFADAVEKILKAQMHLELQADSG
        SMIGG+DACIKV+QGWPLA+SPLKL+SLKSSDHYTKG+SLSLICKVEKMANSLD RIR+NLSSFADAVEKILK QMHLELQADSG
Subjt:  SMIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSLDARIRRNLSSFADAVEKILKAQMHLELQADSG

A0A6J1E9V8 uncharacterized protein LOC111432106 isoform X39.2e-16568.16Show/hide
Query:  RKMMPESMEATPSVPPSLDLQTVRS---ELEELQRSLEENEAYTTDSLGSEKLLKECALHLESRLQQILSEYSNVDSFLGIDDLDAYIEHMKEELVAVEA
        +K+MPESMEATPSV  SLDLQ VRS   ELEELQRSL E+EAY+TDSLGSEKLLKECALHLESRLQQ+LSE SNVDSFLGIDDLDAY+EHMKEELVAVEA
Subjt:  RKMMPESMEATPSVPPSLDLQTVRS---ELEELQRSLEENEAYTTDSLGSEKLLKECALHLESRLQQILSEYSNVDSFLGIDDLDAYIEHMKEELVAVEA

Query:  ESSKISNEIE------------------------------DPEEATLNCSSMNGEDQMNMIVNRECNAFEVLELDSQIEKNKKVLKSLQEVDEIFKREVP
        ESS+ISNEIE                              DPE+ T N  SMNGEDQMN+IV+RE NAFEVLELDS IEKNK++LKSLQEVDEIFK    
Subjt:  ESSKISNEIE------------------------------DPEEATLNCSSMNGEDQMNMIVNRECNAFEVLELDSQIEKNKKVLKSLQEVDEIFKREVP

Query:  WCHSQLREKLEVWSAMIFQGSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAIPSDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTHI
                                                                            LDV+EQVEDTIGG+ VI VADNFIRLSLRTHI
Subjt:  WCHSQLREKLEVWSAMIFQGSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAIPSDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTHI

Query:  PNLEDFSTLQRLEGTIEKSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSIGNSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQD
        PNLEDFS+LQRLEG IE SEL+HELLIEVLEGTMELKNAEIFPGDVHLHDII ASKS+ N SL+WFV+KVQDRIVLCTLRRFVVKSANKSSHSF+Y+DQD
Subjt:  PNLEDFSTLQRLEGTIEKSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSIGNSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQD

Query:  ETIICSMIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSLDARIRRNLSSFADAVEKILKAQMHLELQADSG
        ETI+C MIGG+DA IKV+QGWPLA+SPLKLVSLKSSDHYTKG SLSL+CKVEKMANSLDARIR+NLSSFADAVEKILK QMHLEL+AD G
Subjt:  ETIICSMIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSLDARIRRNLSSFADAVEKILKAQMHLELQADSG

A0A6J1ED63 uncharacterized protein LOC111432106 isoform X24.5e-16467.82Show/hide
Query:  RKMMPESMEATPSVPPSLDLQTVRSELEELQRSLEENEAYTTDSLGSEKLLKECALHLESRLQQILSEYSNVDSFLGIDD----LDAYIEHMKEELVAVE
        +K+MPESMEATPSV  SLDLQ VR ELEELQRSL E+EAY+TDSLGSEKLLKECALHLESRLQQ+LSE SNVDSFLGIDD    LDAY+EHMKEELVAVE
Subjt:  RKMMPESMEATPSVPPSLDLQTVRSELEELQRSLEENEAYTTDSLGSEKLLKECALHLESRLQQILSEYSNVDSFLGIDD----LDAYIEHMKEELVAVE

Query:  AESSKISNEIE------------------------------DPEEATLNCSSMNGEDQMNMIVNRECNAFEVLELDSQIEKNKKVLKSLQEVDEIFKREV
        AESS+ISNEIE                              DPE+ T N  SMNGEDQMN+IV+RE NAFEVLELDS IEKNK++LKSLQEVDEIFK   
Subjt:  AESSKISNEIE------------------------------DPEEATLNCSSMNGEDQMNMIVNRECNAFEVLELDSQIEKNKKVLKSLQEVDEIFKREV

Query:  PWCHSQLREKLEVWSAMIFQGSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAIPSDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTH
                                                                             LDV+EQVEDTIGG+ VI VADNFIRLSLRTH
Subjt:  PWCHSQLREKLEVWSAMIFQGSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAIPSDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTH

Query:  IPNLEDFSTLQRLEGTIEKSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSIGNSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQ
        IPNLEDFS+LQRLEG IE SEL+HELLIEVLEGTMELKNAEIFPGDVHLHDII ASKS+ N SL+WFV+KVQDRIVLCTLRRFVVKSANKSSHSF+Y+DQ
Subjt:  IPNLEDFSTLQRLEGTIEKSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSIGNSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQ

Query:  DETIICSMIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSLDARIRRNLSSFADAVEKILKAQMHLELQADSG
        DETI+C MIGG+DA IKV+QGWPLA+SPLKLVSLKSSDHYTKG SLSL+CKVEKMANSLDARIR+NLSSFADAVEKILK QMHLEL+AD G
Subjt:  DETIICSMIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSLDARIRRNLSSFADAVEKILKAQMHLELQADSG

SwissProt top hitse value%identityAlignment
Q1L5Z9 LON peptidase N-terminal domain and RING finger protein 27.1e-0537.7Show/hide
Query:  DSDTAHALNSSIQPTDRTLRCAICIEPLVEETTTKCGHVFCRNCIETAIATQHRCPICRPK
        +S+T  +   S+  TD    CA+C+  L E  TT CGH FC  C+E  +     CP+C+ K
Subjt:  DSDTAHALNSSIQPTDRTLRCAICIEPLVEETTTKCGHVFCRNCIETAIATQHRCPICRPK

Q810I1 E3 ubiquitin-protein ligase TRIM506.0e-0436.67Show/hide
Query:  LRCAICIEPLVEETTTKCGHVFCRNCIETA---IATQHRCPICRPKAQSSSSHFRTNNAR
        L+C IC+E   E    +CGH +C+NC+++    + ++ RCP+CR     SSS    + AR
Subjt:  LRCAICIEPLVEETTTKCGHVFCRNCIETA---IATQHRCPICRPKAQSSSSHFRTNNAR

Q95KF1 E3 ubiquitin-protein ligase RNF1252.7e-0439.44Show/hide
Query:  SSRANKATTDSDTAHALNSSIQP--TDRTLRCAICIEPLVEETTTKCGHVFCRNCIETAIA-TQHRCPICR
        SS + K+   S T  AL     P     +  CA+C+E L +   T+CGHVFCR+CI T++   +  CP CR
Subjt:  SSRANKATTDSDTAHALNSSIQP--TDRTLRCAICIEPLVEETTTKCGHVFCRNCIETAIA-TQHRCPICR

Q96EQ8 E3 ubiquitin-protein ligase RNF1251.6e-0439.44Show/hide
Query:  SSRANKATTDSDTAHALNSSIQP--TDRTLRCAICIEPLVEETTTKCGHVFCRNCIETAIA-TQHRCPICR
        S+ + K+   S TA AL     P     +  CA+C+E L +   T+CGHVFCR+CI T++   +  CP CR
Subjt:  SSRANKATTDSDTAHALNSSIQP--TDRTLRCAICIEPLVEETTTKCGHVFCRNCIETAIA-TQHRCPICR

Q9D9R0 E3 ubiquitin-protein ligase RNF1251.2e-0437.33Show/hide
Query:  LSCESSSRANKATTDSDTAHALNSSIQPTDRTLRCAICIEPLVEETTTKCGHVFCRNCIETAIATQHR--CPICR
        LS +SS  A  + T      + +S +  T  +  C++C+E L +   T+CGHVFCR+CI T+I   ++  CP CR
Subjt:  LSCESSSRANKATTDSDTAHALNSSIQPTDRTLRCAICIEPLVEETTTKCGHVFCRNCIETAIATQHR--CPICR

Arabidopsis top hitse value%identityAlignment
AT3G23910.1 BEST Arabidopsis thaliana protein match is: RNA-directed DNA polymerase (reverse transcriptase)-related family protein (TAIR:AT3G24255.2)2.3e-6734.79Show/hide
Query:  SLDLQTVRSELEELQ---RSLEENEAYTTDSLGSEKLLKECALHLESRLQQILSEYSNVDSFLGIDDLDAYIEHMKEELVAVEAESSKISNEIE------
        SLDLQ +R  ++EL    R+  E    +  S     ++++  L  E ++++I+ EY +VD  L ++D DAY+E+++ EL +VEAES+K+S EIE      
Subjt:  SLDLQTVRSELEELQ---RSLEENEAYTTDSLGSEKLLKECALHLESRLQQILSEYSNVDSFLGIDDLDAYIEHMKEELVAVEAESSKISNEIE------

Query:  ---------DPEEATLNCSSMNGEDQMNMIVNRECNA------------FEVLELDSQIEKNKKVLKSLQEVDEIFKREVPWCHSQLREKLEVWSAMIFQ
                 D E   L+  SM+ +D      N+  ++            F++ EL++Q+E+ + +LKSL+++D + KR                      
Subjt:  ---------DPEEATLNCSSMNGEDQMNMIVNRECNA------------FEVLELDSQIEKNKKVLKSLQEVDEIFKREVPWCHSQLREKLEVWSAMIFQ

Query:  GSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAIPSDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTHIPNLEDFSTLQRLEGTIEKS
                                                          D  EQVED + G+ V++   NFIRL LRT+I  L+ F    + +   E S
Subjt:  GSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAIPSDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTHIPNLEDFSTLQRLEGTIEKS

Query:  ELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSI-----------GNSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIICSMI
        EL HELLI + + T E+   E+FP D+++ DII A+ S              SS+QW V KVQD+I+  TLR+++V S+    ++FEY D+DETI+  + 
Subjt:  ELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSI-----------GNSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIICSMI

Query:  GGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSLDARIRRNLSSFADAVEKILKAQMHLELQAD
        GG+DA +KV+ GWPL  +PLKL SLK+SD+ +KG+SLSLICKVE++ANSLD   R+NLS F DA+EKIL  Q   ELQ++
Subjt:  GGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSLDARIRRNLSSFADAVEKILKAQMHLELQAD

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.4e-5635.19Show/hide
Query:  DAYIEHMKEELVAVEAESSKISNEIE---------------DPEEATLNCSSMNGEDQMNMIVNRECNA------------FEVLELDSQIEKNKKVLKS
        DAY+E+++ EL +VEAES+K+S EIE               D E   L+  SM+ +D      N+  ++            F++ EL++Q+E+ + +LKS
Subjt:  DAYIEHMKEELVAVEAESSKISNEIE---------------DPEEATLNCSSMNGEDQMNMIVNRECNA------------FEVLELDSQIEKNKKVLKS

Query:  LQEVDEIFKREVPWCHSQLREKLEVWSAMIFQGSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAIPSDIKLLPSLLDVIEQVEDTIGGMMVIDV
        L+++D + KR                                                                        D  EQVED + G+ V++ 
Subjt:  LQEVDEIFKREVPWCHSQLREKLEVWSAMIFQGSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAIPSDIKLLPSLLDVIEQVEDTIGGMMVIDV

Query:  ADNFIRLSLRTHIPNLEDFSTLQRLEGTIEKSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSI-----------GNSSLQWFVRKVQDRIVL
          NFIRL LRT+I  L+ F    + +   E SEL HELLI + + T E+   E+FP D+++ DII A+ S              SS+QW V KVQD+I+ 
Subjt:  ADNFIRLSLRTHIPNLEDFSTLQRLEGTIEKSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSI-----------GNSSLQWFVRKVQDRIVL

Query:  CTLRRFVVKSANKSSHSFEYLDQDETIICSMIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSLDARIRRNLSSFADAVEKI
         TLR+  V S+    ++FEY D+DETI+  + GG+DA +KV+ GWPL  +PLKL SLK+SD+ +KG SLSLI K+E++ANSLD   R+NLS F DAVEKI
Subjt:  CTLRRFVVKSANKSSHSFEYLDQDETIICSMIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSLDARIRRNLSSFADAVEKI

Query:  LKAQMHLELQAD
        L  Q   EL+++
Subjt:  LKAQMHLELQAD

AT3G24255.2 RNA-directed DNA polymerase (reverse transcriptase)-related family protein9.3e-6133.47Show/hide
Query:  SLDLQTVRSELEELQ---RSLEENEAYTTDSLGSEKLLKECALHLESRLQQILSEYSNVDSFLGIDD-------LDAYIEHMKEELVAVEAESSKISNEI
        SLDLQ +R  ++E     R+  E    +  S     ++++  L  E ++++I+ +Y +VD  L +D         DAY+E+++ EL +VEAES+K+S EI
Subjt:  SLDLQTVRSELEELQ---RSLEENEAYTTDSLGSEKLLKECALHLESRLQQILSEYSNVDSFLGIDD-------LDAYIEHMKEELVAVEAESSKISNEI

Query:  E---------------DPEEATLNCSSMNGEDQMNMIVNRECNA------------FEVLELDSQIEKNKKVLKSLQEVDEIFKREVPWCHSQLREKLEV
        E               D E   L+  SM+ +D      N+  ++            F++ EL++Q+E+ + +LKSL+++D + KR               
Subjt:  E---------------DPEEATLNCSSMNGEDQMNMIVNRECNA------------FEVLELDSQIEKNKKVLKSLQEVDEIFKREVPWCHSQLREKLEV

Query:  WSAMIFQGSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAIPSDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTHIPNLEDFSTLQRL
                                                                 D  EQVED + G+ V++   NFIRL LRT+I  L+ F    + 
Subjt:  WSAMIFQGSDMVCAVAPLCLVVAPFSCFFLRSAIAPRFCNKHPGAIPSDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTHIPNLEDFSTLQRL

Query:  EGTIEKSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSI-----------GNSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQDE
        +   E SEL HELLI + + T E+   E+FP D+++ DII A+ S              SS+QW V KVQD+I+  TLR+  V S+    ++FEY D+DE
Subjt:  EGTIEKSELDHELLIEVLEGTMELKNAEIFPGDVHLHDIITASKSI-----------GNSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQDE

Query:  TIICSMIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSLDARIRRNLSSFADAVEKILKAQMHLELQAD
        TI+  + GG+DA +KV+ GWPL  +PLKL SLK+SD+ +KG SLSLI K+E++ANSLD   R+NLS F DAVEKIL  Q   EL+++
Subjt:  TIICSMIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKMANSLDARIRRNLSSFADAVEKILKAQMHLELQAD

AT5G48655.1 RING/U-box superfamily protein1.4e-1130.13Show/hide
Query:  DAIDEDIAIISPRKFAEARKNFRRNHFESSCGVVIRRNGNTEVYGALSDVTSWPPFTIWSPLTISNNVSIQEQTLHNLDLRLSCESSSRANKATTDSDTA
        DAI++D+   S   FAEA K+  RN       V +   G T                   P  ISN    + + + + +  + CE +S  ++    S  +
Subjt:  DAIDEDIAIISPRKFAEARKNFRRNHFESSCGVVIRRNGNTEVYGALSDVTSWPPFTIWSPLTISNNVSIQEQTLHNLDLRLSCESSSRANKATTDSDTA

Query:  HALNSSIQPTDRTLRCAICIEPLVEETTTKCGHVFCRNCIETAIATQHRCPICRPK
         +   +  P +    C IC+ P  EE +TKCGH+FC+ CI+ AI+ Q +CP CR K
Subjt:  HALNSSIQPTDRTLRCAICIEPLVEETTTKCGHVFCRNCIETAIATQHRCPICRPK

AT5G48655.2 RING/U-box superfamily protein1.4e-1130.13Show/hide
Query:  DAIDEDIAIISPRKFAEARKNFRRNHFESSCGVVIRRNGNTEVYGALSDVTSWPPFTIWSPLTISNNVSIQEQTLHNLDLRLSCESSSRANKATTDSDTA
        DAI++D+   S   FAEA K+  RN       V +   G T                   P  ISN    + + + + +  + CE +S  ++    S  +
Subjt:  DAIDEDIAIISPRKFAEARKNFRRNHFESSCGVVIRRNGNTEVYGALSDVTSWPPFTIWSPLTISNNVSIQEQTLHNLDLRLSCESSSRANKATTDSDTA

Query:  HALNSSIQPTDRTLRCAICIEPLVEETTTKCGHVFCRNCIETAIATQHRCPICRPK
         +   +  P +    C IC+ P  EE +TKCGH+FC+ CI+ AI+ Q +CP CR K
Subjt:  HALNSSIQPTDRTLRCAICIEPLVEETTTKCGHVFCRNCIETAIATQHRCPICRPK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCATTCAAAGTTCAAATGATCTACGTGATTGGAGTGCAAGAGTTGTTGAGAGAAGAACGGTATTGGACTTCGACTTAAATTGTCCACCCCCAGATGAGTGCATCGA
TCCAACTGGCCCTCGTGACGAAGCAGCACAATACTATAGTCATTACCAAGGACAAGCTACAGACGCTATCGATGAGGACATTGCTATAATCTCCCCTAGGAAGTTTGCTG
AAGCCAGGAAGAATTTTCGAAGAAATCACTTTGAGAGTAGTTGCGGTGTAGTCATCAGACGTAACGGCAACACAGAAGTTTATGGTGCTCTCTCAGATGTAACAAGTTGG
CCCCCTTTCACAATTTGGTCGCCTCTTACAATTAGCAATAATGTATCCATACAGGAACAAACACTTCACAACTTGGACCTTCGCCTTAGCTGTGAAAGCAGTAGTAGGGC
CAATAAGGCAACAACTGACTCTGACACTGCACATGCACTAAATAGTAGTATCCAACCTACAGATCGGACTTTGCGGTGTGCGATCTGCATAGAACCATTGGTCGAAGAAA
CGACAACGAAATGTGGGCACGTTTTCTGCAGGAATTGCATCGAAACGGCCATAGCTACCCAGCATAGATGTCCCATATGTCGGCCCAAAGCCCAATCTTCAAGCTCTCAT
TTTCGGACGAATAACGCGAGGAATCTCTCTCGGTTCTGTGCAAATTCCGGCAGAAGGAGGAAAATGATGCCAGAATCGATGGAAGCTACACCGTCTGTACCTCCAAGCCT
CGATCTCCAAACAGTTCGCAGCGAGCTAGAAGAGTTGCAGAGATCTTTGGAGGAAAATGAAGCTTATACGACGGATTCATTAGGTTCTGAGAAGTTACTGAAGGAGTGTG
CTCTCCATCTCGAGAGCAGACTACAGCAGATTCTGTCCGAATACTCTAACGTTGATAGTTTCTTGGGAATTGATGATTTAGATGCGTACATTGAACACATGAAAGAGGAG
CTTGTCGCGGTGGAAGCTGAAAGCAGCAAAATCTCCAATGAGATCGAGGATCCTGAAGAGGCAACACTTAATTGCAGCTCTATGAATGGTGAAGATCAGATGAACATGAT
AGTCAACCGTGAATGCAATGCGTTTGAGGTTTTGGAACTTGATAGTCAGATCGAAAAGAACAAAAAAGTCCTAAAATCTTTGCAGGAAGTAGATGAGATATTTAAAAGGG
AGGTGCCATGGTGCCACTCTCAATTAAGGGAGAAATTAGAAGTTTGGAGCGCCATGATTTTTCAAGGAAGCGACATGGTGTGTGCTGTAGCACCATTGTGCCTTGTGGTA
GCACCCTTTTCTTGTTTTTTTCTTCGTTCTGCCATAGCACCACGGTTTTGTAACAAGCACCCAGGTGCTATTCCCTCGGATATTAAGCTTCTACCATCTCTTTTGGATGT
TATTGAGCAGGTTGAGGACACAATTGGTGGTATGATGGTCATTGATGTTGCTGATAATTTCATTAGATTGTCATTACGTACACACATTCCGAACTTGGAAGATTTTTCAA
CCTTACAAAGACTTGAAGGTACGATTGAGAAATCTGAATTGGATCACGAGTTGCTAATAGAAGTTTTGGAGGGGACAATGGAGTTAAAGAATGCTGAGATCTTTCCTGGT
GATGTCCATTTGCACGATATTATCACTGCTTCAAAGTCAATCGGCAATTCTTCATTGCAATGGTTTGTGAGAAAAGTACAAGATAGGATTGTTTTGTGTACTCTTAGGCG
GTTTGTTGTGAAGAGTGCAAACAAATCAAGTCATTCCTTTGAGTATTTAGATCAAGACGAAACGATAATATGTAGTATGATTGGAGGGGTTGATGCGTGTATTAAGGTGA
CTCAAGGTTGGCCATTAGCCGAATCTCCTCTGAAACTTGTATCACTCAAGAGCTCAGATCATTATACAAAAGGAGTTTCTTTAAGCCTCATTTGCAAGGTGGAGAAAATG
GCAAATTCCTTGGATGCTCGTATTCGCCGAAATCTATCCAGCTTTGCGGACGCTGTTGAAAAAATATTGAAGGCGCAAATGCATTTAGAACTCCAAGCTGACAGTGGTTA
A
mRNA sequenceShow/hide mRNA sequence
AGAGGACAAAGAGCAACGAAGATGGCAAATAAAGAATAATTTCGAGGAAAGGAGGATCGCAAGGATGTAAAGTTTTTCAAAGAAAACACCTCTACATATATGGAATGGAA
GCCAAGATTGCTTTTATATCGATTGCCGCCCCCTCTTTCTCTTCCTCTTTCTCCCTTTTTTTCCCTCATTTCTTCCTCCTCCCCTTCGTCTCTGCAAATTTCTTGCGTTT
TCTCTTTCGATCGGTCTGCTCAACCCGCAGTTCGCCCTTTCGTTTTCCACCAGGGCTAAAGGACAGAAAGGGCTGTCGGAATGAGCATTCAAAGTTCAAATGATCTACGT
GATTGGAGTGCAAGAGTTGTTGAGAGAAGAACGGTATTGGACTTCGACTTAAATTGTCCACCCCCAGATGAGTGCATCGATCCAACTGGCCCTCGTGACGAAGCAGCACA
ATACTATAGTCATTACCAAGGACAAGCTACAGACGCTATCGATGAGGACATTGCTATAATCTCCCCTAGGAAGTTTGCTGAAGCCAGGAAGAATTTTCGAAGAAATCACT
TTGAGAGTAGTTGCGGTGTAGTCATCAGACGTAACGGCAACACAGAAGTTTATGGTGCTCTCTCAGATGTAACAAGTTGGCCCCCTTTCACAATTTGGTCGCCTCTTACA
ATTAGCAATAATGTATCCATACAGGAACAAACACTTCACAACTTGGACCTTCGCCTTAGCTGTGAAAGCAGTAGTAGGGCCAATAAGGCAACAACTGACTCTGACACTGC
ACATGCACTAAATAGTAGTATCCAACCTACAGATCGGACTTTGCGGTGTGCGATCTGCATAGAACCATTGGTCGAAGAAACGACAACGAAATGTGGGCACGTTTTCTGCA
GGAATTGCATCGAAACGGCCATAGCTACCCAGCATAGATGTCCCATATGTCGGCCCAAAGCCCAATCTTCAAGCTCTCATTTTCGGACGAATAACGCGAGGAATCTCTCT
CGGTTCTGTGCAAATTCCGGCAGAAGGAGGAAAATGATGCCAGAATCGATGGAAGCTACACCGTCTGTACCTCCAAGCCTCGATCTCCAAACAGTTCGCAGCGAGCTAGA
AGAGTTGCAGAGATCTTTGGAGGAAAATGAAGCTTATACGACGGATTCATTAGGTTCTGAGAAGTTACTGAAGGAGTGTGCTCTCCATCTCGAGAGCAGACTACAGCAGA
TTCTGTCCGAATACTCTAACGTTGATAGTTTCTTGGGAATTGATGATTTAGATGCGTACATTGAACACATGAAAGAGGAGCTTGTCGCGGTGGAAGCTGAAAGCAGCAAA
ATCTCCAATGAGATCGAGGATCCTGAAGAGGCAACACTTAATTGCAGCTCTATGAATGGTGAAGATCAGATGAACATGATAGTCAACCGTGAATGCAATGCGTTTGAGGT
TTTGGAACTTGATAGTCAGATCGAAAAGAACAAAAAAGTCCTAAAATCTTTGCAGGAAGTAGATGAGATATTTAAAAGGGAGGTGCCATGGTGCCACTCTCAATTAAGGG
AGAAATTAGAAGTTTGGAGCGCCATGATTTTTCAAGGAAGCGACATGGTGTGTGCTGTAGCACCATTGTGCCTTGTGGTAGCACCCTTTTCTTGTTTTTTTCTTCGTTCT
GCCATAGCACCACGGTTTTGTAACAAGCACCCAGGTGCTATTCCCTCGGATATTAAGCTTCTACCATCTCTTTTGGATGTTATTGAGCAGGTTGAGGACACAATTGGTGG
TATGATGGTCATTGATGTTGCTGATAATTTCATTAGATTGTCATTACGTACACACATTCCGAACTTGGAAGATTTTTCAACCTTACAAAGACTTGAAGGTACGATTGAGA
AATCTGAATTGGATCACGAGTTGCTAATAGAAGTTTTGGAGGGGACAATGGAGTTAAAGAATGCTGAGATCTTTCCTGGTGATGTCCATTTGCACGATATTATCACTGCT
TCAAAGTCAATCGGCAATTCTTCATTGCAATGGTTTGTGAGAAAAGTACAAGATAGGATTGTTTTGTGTACTCTTAGGCGGTTTGTTGTGAAGAGTGCAAACAAATCAAG
TCATTCCTTTGAGTATTTAGATCAAGACGAAACGATAATATGTAGTATGATTGGAGGGGTTGATGCGTGTATTAAGGTGACTCAAGGTTGGCCATTAGCCGAATCTCCTC
TGAAACTTGTATCACTCAAGAGCTCAGATCATTATACAAAAGGAGTTTCTTTAAGCCTCATTTGCAAGGTGGAGAAAATGGCAAATTCCTTGGATGCTCGTATTCGCCGA
AATCTATCCAGCTTTGCGGACGCTGTTGAAAAAATATTGAAGGCGCAAATGCATTTAGAACTCCAAGCTGACAGTGGTTAAGAACTTTGGTTCTTCATCATGCGATTCAG
GTCGTTTCGGGGACCTAACGACAATTGGGCCATCGGAGGACGGAGAACTCCCCATATCCCTTGTTTTATGTAGGAGAAAAGTTCCGGAAAGAATGGTAACAAAACCGCAT
AGTTCGGTTGCAATTTGTGATGCATTTTGTGAATCCCAGTCCTGTAGGTAAAACAATCAGAGCAGAGAGAATGG
Protein sequenceShow/hide protein sequence
MSIQSSNDLRDWSARVVERRTVLDFDLNCPPPDECIDPTGPRDEAAQYYSHYQGQATDAIDEDIAIISPRKFAEARKNFRRNHFESSCGVVIRRNGNTEVYGALSDVTSW
PPFTIWSPLTISNNVSIQEQTLHNLDLRLSCESSSRANKATTDSDTAHALNSSIQPTDRTLRCAICIEPLVEETTTKCGHVFCRNCIETAIATQHRCPICRPKAQSSSSH
FRTNNARNLSRFCANSGRRRKMMPESMEATPSVPPSLDLQTVRSELEELQRSLEENEAYTTDSLGSEKLLKECALHLESRLQQILSEYSNVDSFLGIDDLDAYIEHMKEE
LVAVEAESSKISNEIEDPEEATLNCSSMNGEDQMNMIVNRECNAFEVLELDSQIEKNKKVLKSLQEVDEIFKREVPWCHSQLREKLEVWSAMIFQGSDMVCAVAPLCLVV
APFSCFFLRSAIAPRFCNKHPGAIPSDIKLLPSLLDVIEQVEDTIGGMMVIDVADNFIRLSLRTHIPNLEDFSTLQRLEGTIEKSELDHELLIEVLEGTMELKNAEIFPG
DVHLHDIITASKSIGNSSLQWFVRKVQDRIVLCTLRRFVVKSANKSSHSFEYLDQDETIICSMIGGVDACIKVTQGWPLAESPLKLVSLKSSDHYTKGVSLSLICKVEKM
ANSLDARIRRNLSSFADAVEKILKAQMHLELQADSG