; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg24772 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg24772
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionRNA-directed DNA polymerase (reverse transcriptase)-related family protein
Genome locationCarg_Chr14:13540030..13548969
RNA-Seq ExpressionCarg24772
SyntenyCarg24772
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR001841 - Zinc finger, RING-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017907 - Zinc finger, RING-type, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582484.1 hypothetical protein SDJN03_22486, partial [Cucurbita argyrosperma subsp. sororia]3.2e-20291.77Show/hide
Query:  QLEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDDLDAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPITGSNK
        +LEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDDLDAYVEHMKEELVAVEAESSKISNEIE      VL    I GSNK
Subjt:  QLEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDDLDAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPITGSNK

Query:  LEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHDSVED
        LEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKS+                +  + VED
Subjt:  LEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHDSVED

Query:  TIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCT
        TIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCT
Subjt:  TIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCT

Query:  LRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVEKILK
        LRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVEKILK
Subjt:  LRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVEKILK

Query:  EQMHLELQADGGL
        EQMHLELQADGGL
Subjt:  EQMHLELQADGGL

KAG7018870.1 hypothetical protein SDJN02_20743, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  RIGKDVKFFKENTSTYMEWKPRLLLYRLPLPLSLPLSPFLFPISSSSASSRQVSWVSSFDRPAPAAVRRSLLLQRARVVERRTRLDIDLNCPPPDECIDP
        RIGKDVKFFKENTSTYMEWKPRLLLYRLPLPLSLPLSPFLFPISSSSASSRQVSWVSSFDRPAPAAVRRSLLLQRARVVERRTRLDIDLNCPPPDECIDP
Subjt:  RIGKDVKFFKENTSTYMEWKPRLLLYRLPLPLSLPLSPFLFPISSSSASSRQVSWVSSFDRPAPAAVRRSLLLQRARVVERRTRLDIDLNCPPPDECIDP

Query:  TGPHDEAAQYNNHHREQATDAVDEDIAIISPRKFAEARKNFRRNHFESSYGVVVRRNGSTEVYSALTDVSSWPPFTIWSPRTNRNSVSIQEQQTTHNLDL
        TGPHDEAAQYNNHHREQATDAVDEDIAIISPRKFAEARKNFRRNHFESSYGVVVRRNGSTEVYSALTDVSSWPPFTIWSPRTNRNSVSIQEQQTTHNLDL
Subjt:  TGPHDEAAQYNNHHREQATDAVDEDIAIISPRKFAEARKNFRRNHFESSYGVVVRRNGSTEVYSALTDVSSWPPFTIWSPRTNRNSVSIQEQQTTHNLDL

Query:  RLSCETSSRATKVKTDIVIPSAPALNSSITPADRKLRCAICIEPLVEETTTKCGHIFCRNCIEIAIATQHKCPICRRLYEWLSSSLFIHGGNESQTRGIT
        RLSCETSSRATKVKTDIVIPSAPALNSSITPADRKLRCAICIEPLVEETTTKCGHIFCRNCIEIAIATQHKCPICRRLYEWLSSSLFIHGGNESQTRGIT
Subjt:  RLSCETSSRATKVKTDIVIPSAPALNSSITPADRKLRCAICIEPLVEETTTKCGHIFCRNCIEIAIATQHKCPICRRLYEWLSSSLFIHGGNESQTRGIT

Query:  RGISLGSVRIPAVDEENNAEIDGSYTVCVFKPRSPSSSQLEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDDLDAYVEH
        RGISLGSVRIPAVDEENNAEIDGSYTVCVFKPRSPSSSQLEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDDLDAYVEH
Subjt:  RGISLGSVRIPAVDEENNAEIDGSYTVCVFKPRSPSSSQLEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDDLDAYVEH

Query:  MKEELVAVEAESSKISNEIEPPICIDVLLLLPITGSNKLEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRI
        MKEELVAVEAESSKISNEIEPPICIDVLLLLPITGSNKLEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRI
Subjt:  MKEELVAVEAESSKISNEIEPPICIDVLLLLPITGSNKLEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRI

Query:  LKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHDSVEDTIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNA
        LKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHDSVEDTIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNA
Subjt:  LKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHDSVEDTIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNA

Query:  EIFPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYT
        EIFPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYT
Subjt:  EIFPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYT

Query:  KGASLSLVCKVEKMANSLDARIRQNLSSFADAVEKILKEQMHLELQADGGL
        KGASLSLVCKVEKMANSLDARIRQNLSSFADAVEKILKEQMHLELQADGGL
Subjt:  KGASLSLVCKVEKMANSLDARIRQNLSSFADAVEKILKEQMHLELQADGGL

KGN56746.2 hypothetical protein Csa_011800 [Cucumis sativus]7.1e-25070.41Show/hide
Query:  ARVVERRTRLDIDLNCPPPDECIDPTGPHDEAAQYNNHHREQATDAVDEDIAIISPRKFAEARKNFRRNHFESSYGVVVRRNGSTEVYSALTDVSSWPPF
        +RV+ERRT LD DLNCPPPDECIDPTG  DEAAQY NH++ QATDA+DEDIAIISPRKFAEARKNFRRNHFES  G V+RRNG+TEVY AL+DV++WPPF
Subjt:  ARVVERRTRLDIDLNCPPPDECIDPTGPHDEAAQYNNHHREQATDAVDEDIAIISPRKFAEARKNFRRNHFESSYGVVVRRNGSTEVYSALTDVSSWPPF

Query:  TIWSPRTNRNSVSIQEQQTTHNLDLRLSCETSSRATKVKTDIVIPSAPALNSSITPADRKLRCAICIEPLVEETTTKCGHIFCRNCIEIAIATQHKCPIC
        TIWSP T  N+VS+QEQQT HNLDLRLSCE+SSRATK KTD  IPS  AL+SSI P DR LRCAICIEPLVEETTTKCGH    N  E    ++ +    
Subjt:  TIWSPRTNRNSVSIQEQQTTHNLDLRLSCETSSRATKVKTDIVIPSAPALNSSITPADRKLRCAICIEPLVEETTTKCGHIFCRNCIEIAIATQHKCPIC

Query:  RRLYEWLSSSLFIHGGNESQTRGITRGISLGSVRIPAVDEENNAEIDGSYTVCVFKPRSPSSSQLEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRL
        R   + +  S+         T  +   + L +VR                            S+LEELQRSL E+E  +TDSLGSEKLL+ECALHLESR+
Subjt:  RRLYEWLSSSLFIHGGNESQTRGITRGISLGSVRIPAVDEENNAEIDGSYTVCVFKPRSPSSSQLEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRL

Query:  QQVLSECSNVDSFLGIDDLDAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPITGSNKLEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMN
        QQVLSE SNVDSFLGIDDLDAYVEHMKEELVAVEAESSKISNEIE      VL    I  SNKL++DLE+L +SLDRF SQD E+ TFN  SMNGED MN
Subjt:  QQVLSECSNVDSFLGIDDLDAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPITGSNKLEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMN

Query:  VIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHDSVEDTIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEG
        VIV+ ECNAFEVLEL+S IEKNK+ILKSLQEVDEIFKS+                +  + VE TIGG+KVI VADN IRLSL THIPN+EDFS+LQRLEG
Subjt:  VIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHDSVEDTIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEG

Query:  MIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSN-SLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAF
        +IE SEL+HEL+IEVL+GTMELKNAEIFP DVHLHDIINASKS+SN SLEWFV+KVQDRIVLCTLRRF VKSANKS HSF+Y+DQDE I+C MIGGIDA 
Subjt:  MIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSN-SLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAF

Query:  IKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVEKILKEQMHLELQADGG
        IKVSQGWPLADSPLKL+SLKSSDHYTKG SLSL+CKVEKMANSLDA IR+NLSSFADAVEKILKEQMHLELQAD G
Subjt:  IKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVEKILKEQMHLELQADGG

XP_022924672.1 uncharacterized protein LOC111432106 isoform X1 [Cucurbita moschata]1.3e-19589.45Show/hide
Query:  QLEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDD----LDAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPIT
        +LEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDD    LDAYVEHMKEELVAVEAESS+ISNEIE      VL    I 
Subjt:  QLEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDD----LDAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPIT

Query:  GSNKLEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHD
        GSNKLEV+LELLDVSLDRFTSQD EKETFNFCSMNGEDQMNVIVD E NAFEVLELDSHIEKNKRILKSLQEVDEIFKS+                +  +
Subjt:  GSNKLEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHD

Query:  SVEDTIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSNSLEWFVKKVQDRI
         VEDTIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSNSLEWFVKKVQDRI
Subjt:  SVEDTIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSNSLEWFVKKVQDRI

Query:  VLCTLRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVE
        VLCTLRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVE
Subjt:  VLCTLRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVE

Query:  KILKEQMHLELQADGGL
        KILKEQMHLEL+ADGGL
Subjt:  KILKEQMHLELQADGGL

XP_022924674.1 uncharacterized protein LOC111432106 isoform X3 [Cucurbita moschata]2.4e-19790.31Show/hide
Query:  QLEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDDLDAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPITGSNK
        +LEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDDLDAYVEHMKEELVAVEAESS+ISNEIE      VL    I GSNK
Subjt:  QLEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDDLDAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPITGSNK

Query:  LEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHDSVED
        LEV+LELLDVSLDRFTSQD EKETFNFCSMNGEDQMNVIVD E NAFEVLELDSHIEKNKRILKSLQEVDEIFKS+                +  + VED
Subjt:  LEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHDSVED

Query:  TIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCT
        TIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCT
Subjt:  TIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCT

Query:  LRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVEKILK
        LRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVEKILK
Subjt:  LRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVEKILK

Query:  EQMHLELQADGGL
        EQMHLEL+ADGGL
Subjt:  EQMHLELQADGGL

TrEMBL top hitse value%identityAlignment
A0A0A0L6Q3 Uncharacterized protein5.1e-16978.26Show/hide
Query:  SQLEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDDLDAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPITGSN
        S+LEELQRSL E+E  +TDSLGSEKLL+ECALHLESR+QQVLSE SNVDSFLGIDDLDAYVEHMKEELVAVEAESSKISNEIE      VL    I  SN
Subjt:  SQLEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDDLDAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPITGSN

Query:  KLEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHDSVE
        KL++DLE+L +SLDRF SQD E+ TFN  SMNGED MNVIV+ ECNAFEVLEL+S IEKNK+ILKSLQEVDEIFKS+                +  + VE
Subjt:  KLEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHDSVE

Query:  DTIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSN-SLEWFVKKVQDRIVL
         TIGG+KVI VADN IRLSL THIPN+EDFS+LQRLEG+IE SEL+HEL+IEVL+GTMELKNAEIFP DVHLHDIINASKS+SN SLEWFV+KVQDRIVL
Subjt:  DTIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSN-SLEWFVKKVQDRIVL

Query:  CTLRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVEKI
        CTLRRF VKSANKS HSF+Y+DQDE I+C MIGGIDA IKVSQGWPLADSPLKL+SLKSSDHYTKG SLSL+CKVEKMANSLDA IR+NLSSFADAVEKI
Subjt:  CTLRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVEKI

Query:  LKEQMHLELQADGG
        LKEQMHLELQAD G
Subjt:  LKEQMHLELQADGG

A0A5A7U6L2 Uncharacterized protein1.0e-16978.74Show/hide
Query:  SQLEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDDLDAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPITGSN
        S+LEELQRSL E+E  S DSLGSEKLL+ECALHLESR+QQVLSE SNVDSFLGIDDLDAYVEHMKEELVAVEAESSKISNEIE      VL    I  SN
Subjt:  SQLEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDDLDAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPITGSN

Query:  KLEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHDSVE
        KL++DLE+L +SLDRF SQD E+ TFN  SMNGED+MNVIVD ECNAFEVLEL+S IEKNK+ILKSLQEVDEIFKS+                +  + VE
Subjt:  KLEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHDSVE

Query:  DTIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSN-SLEWFVKKVQDRIVL
         TIGG+KVI VADN IRLSL THIPN+EDFS+LQRLEG+IE SEL+HEL+IEV  GTMELKNAEIFP DVHLHDIINASKS+SN SLEWFV+KVQDRIVL
Subjt:  DTIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSN-SLEWFVKKVQDRIVL

Query:  CTLRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVEKI
        CTLRRF VKSANKSSHSF+Y+DQDE I+C MIGGIDA IKVSQGWPLADSPLKL+SLKSSDHYTKG SLSL+CKVEKMANSLD RIRQNLSSFADAVEKI
Subjt:  CTLRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVEKI

Query:  LKEQMHLELQADGG
        LKEQMHLELQAD G
Subjt:  LKEQMHLELQADGG

A0A6J1E9M5 uncharacterized protein LOC111432106 isoform X16.3e-19689.45Show/hide
Query:  QLEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDD----LDAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPIT
        +LEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDD    LDAYVEHMKEELVAVEAESS+ISNEIE      VL    I 
Subjt:  QLEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDD----LDAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPIT

Query:  GSNKLEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHD
        GSNKLEV+LELLDVSLDRFTSQD EKETFNFCSMNGEDQMNVIVD E NAFEVLELDSHIEKNKRILKSLQEVDEIFKS+                +  +
Subjt:  GSNKLEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHD

Query:  SVEDTIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSNSLEWFVKKVQDRI
         VEDTIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSNSLEWFVKKVQDRI
Subjt:  SVEDTIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSNSLEWFVKKVQDRI

Query:  VLCTLRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVE
        VLCTLRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVE
Subjt:  VLCTLRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVE

Query:  KILKEQMHLELQADGGL
        KILKEQMHLEL+ADGGL
Subjt:  KILKEQMHLELQADGGL

A0A6J1E9V8 uncharacterized protein LOC111432106 isoform X31.2e-19790.31Show/hide
Query:  QLEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDDLDAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPITGSNK
        +LEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDDLDAYVEHMKEELVAVEAESS+ISNEIE      VL    I GSNK
Subjt:  QLEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDDLDAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPITGSNK

Query:  LEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHDSVED
        LEV+LELLDVSLDRFTSQD EKETFNFCSMNGEDQMNVIVD E NAFEVLELDSHIEKNKRILKSLQEVDEIFKS+                +  + VED
Subjt:  LEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHDSVED

Query:  TIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCT
        TIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCT
Subjt:  TIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCT

Query:  LRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVEKILK
        LRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVEKILK
Subjt:  LRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVEKILK

Query:  EQMHLELQADGGL
        EQMHLEL+ADGGL
Subjt:  EQMHLELQADGGL

A0A6J1ED63 uncharacterized protein LOC111432106 isoform X26.3e-19689.45Show/hide
Query:  QLEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDD----LDAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPIT
        +LEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDD    LDAYVEHMKEELVAVEAESS+ISNEIE      VL    I 
Subjt:  QLEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDD----LDAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPIT

Query:  GSNKLEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHD
        GSNKLEV+LELLDVSLDRFTSQD EKETFNFCSMNGEDQMNVIVD E NAFEVLELDSHIEKNKRILKSLQEVDEIFKS+                +  +
Subjt:  GSNKLEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHD

Query:  SVEDTIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSNSLEWFVKKVQDRI
         VEDTIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSNSLEWFVKKVQDRI
Subjt:  SVEDTIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSNSLEWFVKKVQDRI

Query:  VLCTLRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVE
        VLCTLRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVE
Subjt:  VLCTLRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVE

Query:  KILKEQMHLELQADGGL
        KILKEQMHLEL+ADGGL
Subjt:  KILKEQMHLELQADGGL

SwissProt top hitse value%identityAlignment
P87176 E3 ubiquitin-protein ligase complex slx8-rfp subunit slx85.8e-0529.06Show/hide
Query:  PFTIWSPRTNRNSVSIQEQQTTHNLDLRLSCETSSRATKVK-------TDIVIPSAPALNSSITPADRKL---RCAICIEPLVEETTTKCGHIFCRNCIE
        P ++ +  TN NS+  Q    +  +DL        R  K +       T+  I         + P+ ++L   +C IC++     + T CGHIFC  CI 
Subjt:  PFTIWSPRTNRNSVSIQEQQTTHNLDLRLSCETSSRATKVK-------TDIVIPSAPALNSSITPADRKL---RCAICIEPLVEETTTKCGHIFCRNCIE

Query:  IAIAT---QHKCPICRR
         A+ T     KCP+CRR
Subjt:  IAIAT---QHKCPICRR

Q14258 E3 ubiquitin/ISG15 ligase TRIM259.0e-0640.35Show/hide
Query:  SSITPADRKLRCAICIEPLVEETTTKCGHIFCRNCIEIAIATQ---HKCPICRRLYE
        + + P   +L C+IC+EP  E  TT CGH FC +C+    A Q   + CP CR +Y+
Subjt:  SSITPADRKLRCAICIEPLVEETTTKCGHIFCRNCIEIAIATQ---HKCPICRRLYE

Q496Y0 LON peptidase N-terminal domain and RING finger protein 37.6e-0540Show/hide
Query:  PALNSSITPADRK-LRCAICIEPLVEETTTKCGHIFCRNCIEIAIATQHKCPICR
        PAL+  +   D   L CA+C+    E  TT CGH FC  C+E  +    KCP+C+
Subjt:  PALNSSITPADRK-LRCAICIEPLVEETTTKCGHIFCRNCIEIAIATQHKCPICR

Q61510 E3 ubiquitin/ISG15 ligase TRIM252.6e-0535.09Show/hide
Query:  SSITPADRKLRCAICIEPLVEETTTKCGHIFCRNCIEIAIATQ---HKCPICRRLYE
        + + P   +L C++C+E   E  TT CGH FC +C++     Q   ++CP CR++Y+
Subjt:  SSITPADRKLRCAICIEPLVEETTTKCGHIFCRNCIEIAIATQ---HKCPICRRLYE

Q8HXH0 LON peptidase N-terminal domain and RING finger protein 37.6e-0540Show/hide
Query:  PALNSSITPADRK-LRCAICIEPLVEETTTKCGHIFCRNCIEIAIATQHKCPICR
        PAL+  +   D   L CA+C+    E  TT CGH FC  C+E  +    KCP+C+
Subjt:  PALNSSITPADRK-LRCAICIEPLVEETTTKCGHIFCRNCIEIAIATQHKCPICR

Arabidopsis top hitse value%identityAlignment
AT3G23910.1 BEST Arabidopsis thaliana protein match is: RNA-directed DNA polymerase (reverse transcriptase)-related family protein (TAIR:AT3G24255.2)6.3e-7941.23Show/hide
Query:  QLEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDDLDAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPITGSNK
        +L+   R+  E+   S  S     ++++  L  E ++++++ E  +VD  L ++D DAY+E+++ EL +VEAES+K+S EIE       L       S++
Subjt:  QLEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDDLDAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPITGSNK

Query:  LEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHDSVED
        L+ DLE L +SLD  +SQD EK   N  S +  +   VI D   + F++ EL++ +E+ + ILKSL+++D + K   A                 + VED
Subjt:  LEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHDSVED

Query:  TIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKS------------VSNSLEWF
         + GLKV+    NFIRL LRT+I  L+ F    + + + EPSEL HELLI + + T E+   E+FP D+++ DII A+ S              +S++W 
Subjt:  TIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKS------------VSNSLEWF

Query:  VKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNL
        V KVQD+I+  TLR+++V S+    ++F+Y D+DETIV  + GGIDAF+KVS GWPL ++PLKL SLK+SD+ +KG SLSL+CKVE++ANSLD   RQNL
Subjt:  VKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNL

Query:  SSFADAVEKILKEQMHLELQAD
        S F DA+EKIL EQ   ELQ++
Subjt:  SSFADAVEKILKEQMHLELQAD

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.8e-7143.44Show/hide
Query:  DAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPITGSNKLEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHI
        DAY+E+++ EL +VEAES+K+S EIE       L       S++L+ DLE L +SLD  +SQD EK   N  S +  +   VI D   + F++ EL++ +
Subjt:  DAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPITGSNKLEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHI

Query:  EKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHDSVEDTIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGT
        E+ + ILKSL+++D + K   A                 + VED + GLKV+    NFIRL LRT+I  L+ F    + + + EPSEL HELLI + + T
Subjt:  EKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHDSVEDTIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGT

Query:  MELKNAEIFPGDVHLHDIINASKS------------VSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWP
         E+   E+FP D+++ DII A+ S              +S++W V KVQD+I+  TLR+  V S+    ++F+Y D+DETIV  + GGIDAF+KVS GWP
Subjt:  MELKNAEIFPGDVHLHDIINASKS------------VSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWP

Query:  LADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVEKILKEQMHLELQAD
        L ++PLKL SLK+SD+ +KG SLSL+ K+E++ANSLD   RQNLS F DAVEKIL +Q   EL+++
Subjt:  LADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVEKILKEQMHLELQAD

AT3G24255.2 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.0e-7239.95Show/hide
Query:  RSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDD-------LDAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPITGSN
        R+  E+   S  S     ++++  L  E ++++++ +  +VD  L +D         DAY+E+++ EL +VEAES+K+S EIE       L       S+
Subjt:  RSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDD-------LDAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPITGSN

Query:  KLEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHDSVE
        +L+ DLE L +SLD  +SQD EK   N  S +  +   VI D   + F++ EL++ +E+ + ILKSL+++D + K   A                 + VE
Subjt:  KLEVDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHDSVE

Query:  DTIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKS------------VSNSLEW
        D + GLKV+    NFIRL LRT+I  L+ F    + + + EPSEL HELLI + + T E+   E+FP D+++ DII A+ S              +S++W
Subjt:  DTIGGLKVIGVADNFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKS------------VSNSLEW

Query:  FVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQN
         V KVQD+I+  TLR+  V S+    ++F+Y D+DETIV  + GGIDAF+KVS GWPL ++PLKL SLK+SD+ +KG SLSL+ K+E++ANSLD   RQN
Subjt:  FVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQDETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQN

Query:  LSSFADAVEKILKEQMHLELQAD
        LS F DAVEKIL +Q   EL+++
Subjt:  LSSFADAVEKILKEQMHLELQAD

AT5G48655.1 RING/U-box superfamily protein1.5e-1130.58Show/hide
Query:  RRTRLDIDLNCPPPDE---------CIDPTGPHDEAAQYNNHHREQATDAVDEDIAIISPRKFAEARKNFRRNHFESSYGVVVRRNGSTEVYSALTDVSS
        RR +  IDLN  P D+            P  P   A    +       DA+++D+   S   FAEA K+  RN       V V   G+T           
Subjt:  RRTRLDIDLNCPPPDE---------CIDPTGPHDEAAQYNNHHREQATDAVDEDIAIISPRKFAEARKNFRRNHFESSYGVVVRRNGSTEVYSALTDVSS

Query:  WPPFTIWSPRTNRNSVSIQEQQTTHNLDLRLSCETSSRATKVKTDIVIPSAPALNSSITPADRKLRCAICIEPLVEETTTKCGHIFCRNCIEIAIATQHK
          P  I + R  R   S +      +  +      SSR ++ K     P+ P       P + K  C IC+ P  EE +TKCGHIFC+ CI++AI+ Q K
Subjt:  WPPFTIWSPRTNRNSVSIQEQQTTHNLDLRLSCETSSRATKVKTDIVIPSAPALNSSITPADRKLRCAICIEPLVEETTTKCGHIFCRNCIEIAIATQHK

Query:  CPICRR
        CP CR+
Subjt:  CPICRR

AT5G48655.2 RING/U-box superfamily protein1.5e-1130.58Show/hide
Query:  RRTRLDIDLNCPPPDE---------CIDPTGPHDEAAQYNNHHREQATDAVDEDIAIISPRKFAEARKNFRRNHFESSYGVVVRRNGSTEVYSALTDVSS
        RR +  IDLN  P D+            P  P   A    +       DA+++D+   S   FAEA K+  RN       V V   G+T           
Subjt:  RRTRLDIDLNCPPPDE---------CIDPTGPHDEAAQYNNHHREQATDAVDEDIAIISPRKFAEARKNFRRNHFESSYGVVVRRNGSTEVYSALTDVSS

Query:  WPPFTIWSPRTNRNSVSIQEQQTTHNLDLRLSCETSSRATKVKTDIVIPSAPALNSSITPADRKLRCAICIEPLVEETTTKCGHIFCRNCIEIAIATQHK
          P  I + R  R   S +      +  +      SSR ++ K     P+ P       P + K  C IC+ P  EE +TKCGHIFC+ CI++AI+ Q K
Subjt:  WPPFTIWSPRTNRNSVSIQEQQTTHNLDLRLSCETSSRATKVKTDIVIPSAPALNSSITPADRKLRCAICIEPLVEETTTKCGHIFCRNCIEIAIATQHK

Query:  CPICRR
        CP CR+
Subjt:  CPICRR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AGGATCGGTAAGGATGTAAAGTTTTTCAAAGAAAACACCTCTACATATATGGAATGGAAGCCAAGATTGCTTTTATATCGATTGCCGCTCCCTCTTTCCCTTCCCCTTTC
TCCCTTTCTTTTCCCCATTTCTTCATCTTCCGCTTCGTCTCGCCAAGTTTCTTGGGTTTCCTCTTTCGATCGGCCTGCTCCAGCCGCAGTCCGCCGTTCCCTTCTCCTCC
AGAGAGCAAGAGTTGTTGAGAGAAGAACAAGATTGGACATTGACTTAAATTGTCCACCCCCGGATGAGTGCATTGACCCAACTGGCCCTCACGACGAAGCGGCACAGTAC
AATAATCATCACCGAGAACAAGCTACAGACGCGGTTGATGAGGACATTGCTATAATCTCCCCTCGGAAATTTGCTGAAGCCAGGAAGAATTTTCGAAGAAACCACTTTGA
GAGTAGCTATGGTGTAGTCGTCAGACGTAATGGCAGCACAGAAGTTTATAGTGCTCTCACAGATGTATCAAGTTGGCCCCCCTTTACAATTTGGTCGCCCCGTACGAATA
GGAATAGTGTATCCATACAGGAACAACAAACAACTCACAACTTGGACCTCCGCTTAAGCTGTGAAACCAGTAGTAGGGCCACTAAGGTAAAAACTGACATTGTTATTCCT
TCTGCACCTGCACTAAATAGTAGCATCACACCTGCAGATCGGAAGTTGCGGTGTGCGATCTGCATAGAACCATTGGTCGAAGAAACGACGACAAAATGCGGGCACATTTT
CTGCAGGAATTGCATTGAAATAGCCATTGCTACTCAACACAAATGTCCCATATGTCGGCGCCTTTATGAATGGCTTTCTTCTTCCTTGTTTATCCATGGCGGTAACGAAT
CTCAAACCAGGGGAATAACGCGAGGAATCTCTCTCGGTTCTGTGCGAATTCCGGCTGTCGATGAAGAAAATAATGCCGAAATCGATGGAAGCTACACCGTCTGTGTCTTC
AAGCCTCGATCTCCAAGCAGTTCGCAGCTAGAAGAGTTGCAGAGATCTTTGGCGGAAGATGAAGCTTATAGCACGGATTCATTAGGTTCTGAGAAGTTACTGAAGGAATG
CGCTCTCCATCTCGAGAGCAGACTACAGCAGGTTCTGTCAGAATGCTCTAACGTTGATAGTTTCTTGGGGATTGATGATTTAGATGCATATGTGGAACACATGAAAGAGG
AACTCGTTGCGGTGGAAGCTGAAAGCAGCAAAATCTCCAATGAGATAGAGCCTCCTATTTGTATTGACGTCCTTCTTTTGCTTCCGATCACAGGTTCTAATAAATTAGAG
GTGGATCTCGAATTATTAGACGTGTCGTTAGATCGTTTTACATCACAGGATACTGAGAAGGAAACATTTAATTTCTGCTCTATGAATGGTGAAGACCAAATGAACGTGAT
AGTTGACTGTGAATGCAATGCTTTTGAGGTTTTGGAACTTGATAGTCATATCGAGAAGAATAAAAGAATTCTGAAATCTTTGCAGGAAGTAGATGAGATATTTAAAAGCA
TCACTGCGCCTTCTTGTAGTGCTCTTTTCTTGCCTTTTCTTTGTTCTGCAGAGCCCCACGATTCTGTTGAGGACACAATTGGGGGTCTGAAGGTCATTGGTGTTGCTGAT
AATTTCATTAGATTGTCATTACGTACACACATTCCAAACTTGGAAGATTTTTCAAGCTTACAGAGACTTGAAGGAATGATCGAGCCATCGGAATTGAATCACGAGTTGCT
AATAGAAGTTTTGGAAGGAACAATGGAGCTAAAGAATGCTGAGATCTTTCCTGGTGATGTCCACTTGCACGATATCATCAATGCTTCAAAGTCAGTCAGCAATTCATTGG
AATGGTTTGTGAAAAAAGTACAAGATAGAATTGTTTTGTGTACTCTTAGGCGATTTGTTGTGAAGAGTGCAAACAAATCAAGTCATTCCTTTGATTACATAGACCAAGAC
GAAACGATAGTATGTTGTATGATTGGAGGGATTGATGCGTTTATTAAGGTGTCTCAAGGCTGGCCACTAGCCGATTCTCCACTGAAACTTGTATCACTCAAGAGCTCAGA
CCATTATACAAAAGGTGCTTCCTTAAGCCTCGTTTGCAAGGTGGAGAAAATGGCAAATTCCTTGGACGCACGTATTCGCCAAAATCTATCAAGCTTTGCAGACGCTGTTG
AAAAAATATTGAAGGAGCAAATGCATTTAGAACTCCAAGCTGACGGCGGTCTTTGA
mRNA sequenceShow/hide mRNA sequence
AGGATCGGTAAGGATGTAAAGTTTTTCAAAGAAAACACCTCTACATATATGGAATGGAAGCCAAGATTGCTTTTATATCGATTGCCGCTCCCTCTTTCCCTTCCCCTTTC
TCCCTTTCTTTTCCCCATTTCTTCATCTTCCGCTTCGTCTCGCCAAGTTTCTTGGGTTTCCTCTTTCGATCGGCCTGCTCCAGCCGCAGTCCGCCGTTCCCTTCTCCTCC
AGAGAGCAAGAGTTGTTGAGAGAAGAACAAGATTGGACATTGACTTAAATTGTCCACCCCCGGATGAGTGCATTGACCCAACTGGCCCTCACGACGAAGCGGCACAGTAC
AATAATCATCACCGAGAACAAGCTACAGACGCGGTTGATGAGGACATTGCTATAATCTCCCCTCGGAAATTTGCTGAAGCCAGGAAGAATTTTCGAAGAAACCACTTTGA
GAGTAGCTATGGTGTAGTCGTCAGACGTAATGGCAGCACAGAAGTTTATAGTGCTCTCACAGATGTATCAAGTTGGCCCCCCTTTACAATTTGGTCGCCCCGTACGAATA
GGAATAGTGTATCCATACAGGAACAACAAACAACTCACAACTTGGACCTCCGCTTAAGCTGTGAAACCAGTAGTAGGGCCACTAAGGTAAAAACTGACATTGTTATTCCT
TCTGCACCTGCACTAAATAGTAGCATCACACCTGCAGATCGGAAGTTGCGGTGTGCGATCTGCATAGAACCATTGGTCGAAGAAACGACGACAAAATGCGGGCACATTTT
CTGCAGGAATTGCATTGAAATAGCCATTGCTACTCAACACAAATGTCCCATATGTCGGCGCCTTTATGAATGGCTTTCTTCTTCCTTGTTTATCCATGGCGGTAACGAAT
CTCAAACCAGGGGAATAACGCGAGGAATCTCTCTCGGTTCTGTGCGAATTCCGGCTGTCGATGAAGAAAATAATGCCGAAATCGATGGAAGCTACACCGTCTGTGTCTTC
AAGCCTCGATCTCCAAGCAGTTCGCAGCTAGAAGAGTTGCAGAGATCTTTGGCGGAAGATGAAGCTTATAGCACGGATTCATTAGGTTCTGAGAAGTTACTGAAGGAATG
CGCTCTCCATCTCGAGAGCAGACTACAGCAGGTTCTGTCAGAATGCTCTAACGTTGATAGTTTCTTGGGGATTGATGATTTAGATGCATATGTGGAACACATGAAAGAGG
AACTCGTTGCGGTGGAAGCTGAAAGCAGCAAAATCTCCAATGAGATAGAGCCTCCTATTTGTATTGACGTCCTTCTTTTGCTTCCGATCACAGGTTCTAATAAATTAGAG
GTGGATCTCGAATTATTAGACGTGTCGTTAGATCGTTTTACATCACAGGATACTGAGAAGGAAACATTTAATTTCTGCTCTATGAATGGTGAAGACCAAATGAACGTGAT
AGTTGACTGTGAATGCAATGCTTTTGAGGTTTTGGAACTTGATAGTCATATCGAGAAGAATAAAAGAATTCTGAAATCTTTGCAGGAAGTAGATGAGATATTTAAAAGCA
TCACTGCGCCTTCTTGTAGTGCTCTTTTCTTGCCTTTTCTTTGTTCTGCAGAGCCCCACGATTCTGTTGAGGACACAATTGGGGGTCTGAAGGTCATTGGTGTTGCTGAT
AATTTCATTAGATTGTCATTACGTACACACATTCCAAACTTGGAAGATTTTTCAAGCTTACAGAGACTTGAAGGAATGATCGAGCCATCGGAATTGAATCACGAGTTGCT
AATAGAAGTTTTGGAAGGAACAATGGAGCTAAAGAATGCTGAGATCTTTCCTGGTGATGTCCACTTGCACGATATCATCAATGCTTCAAAGTCAGTCAGCAATTCATTGG
AATGGTTTGTGAAAAAAGTACAAGATAGAATTGTTTTGTGTACTCTTAGGCGATTTGTTGTGAAGAGTGCAAACAAATCAAGTCATTCCTTTGATTACATAGACCAAGAC
GAAACGATAGTATGTTGTATGATTGGAGGGATTGATGCGTTTATTAAGGTGTCTCAAGGCTGGCCACTAGCCGATTCTCCACTGAAACTTGTATCACTCAAGAGCTCAGA
CCATTATACAAAAGGTGCTTCCTTAAGCCTCGTTTGCAAGGTGGAGAAAATGGCAAATTCCTTGGACGCACGTATTCGCCAAAATCTATCAAGCTTTGCAGACGCTGTTG
AAAAAATATTGAAGGAGCAAATGCATTTAGAACTCCAAGCTGACGGCGGTCTTTGACGATTAAGAACTTTGGTTCATCATGCAATTCAGGTTTCTCAATTCTACCTCCTC
TACTTGTATAACGTGATATTGCTGCTGATGATTTTTCATGCCGAAAATTTTAGCTCGATTATTGATTGCTATTATTATTGTTGTTATTTGCCCTTTTGTGTGTATAGGCT
TAAAACATTGTTGTTTTGTTTATTTTAT
Protein sequenceShow/hide protein sequence
RIGKDVKFFKENTSTYMEWKPRLLLYRLPLPLSLPLSPFLFPISSSSASSRQVSWVSSFDRPAPAAVRRSLLLQRARVVERRTRLDIDLNCPPPDECIDPTGPHDEAAQY
NNHHREQATDAVDEDIAIISPRKFAEARKNFRRNHFESSYGVVVRRNGSTEVYSALTDVSSWPPFTIWSPRTNRNSVSIQEQQTTHNLDLRLSCETSSRATKVKTDIVIP
SAPALNSSITPADRKLRCAICIEPLVEETTTKCGHIFCRNCIEIAIATQHKCPICRRLYEWLSSSLFIHGGNESQTRGITRGISLGSVRIPAVDEENNAEIDGSYTVCVF
KPRSPSSSQLEELQRSLAEDEAYSTDSLGSEKLLKECALHLESRLQQVLSECSNVDSFLGIDDLDAYVEHMKEELVAVEAESSKISNEIEPPICIDVLLLLPITGSNKLE
VDLELLDVSLDRFTSQDTEKETFNFCSMNGEDQMNVIVDCECNAFEVLELDSHIEKNKRILKSLQEVDEIFKSITAPSCSALFLPFLCSAEPHDSVEDTIGGLKVIGVAD
NFIRLSLRTHIPNLEDFSSLQRLEGMIEPSELNHELLIEVLEGTMELKNAEIFPGDVHLHDIINASKSVSNSLEWFVKKVQDRIVLCTLRRFVVKSANKSSHSFDYIDQD
ETIVCCMIGGIDAFIKVSQGWPLADSPLKLVSLKSSDHYTKGASLSLVCKVEKMANSLDARIRQNLSSFADAVEKILKEQMHLELQADGGL