; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008064 (gene) of Snake gourd v1 genome

Gene IDTan0008064
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase
Genome locationLG06:17829341..17833449
RNA-Seq ExpressionTan0008064
SyntenyTan0008064
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576069.1 hypothetical protein SDJN03_26708, partial [Cucurbita argyrosperma subsp. sororia]2.2e-24375.73Show/hide
Query:  MKRNLKPSISNDSSCWKRIKVRDLDSDRPLRCRRDTSPMSLKERRVANSNNAKTSEFAFFKKFKEDANLRFSSSLSRQKELQSKKFNSSDRFRERVSPVE
        MKR+LKPSIS DS+C KRIKV DLD  RPL CRRDTSP+SLKE  V   NNAKTSEFAFFKKFKEDAN RFSSSL RQKELQSKKFNS+D FRER   VE
Subjt:  MKRNLKPSISNDSSCWKRIKVRDLDSDRPLRCRRDTSPMSLKERRVANSNNAKTSEFAFFKKFKEDANLRFSSSLSRQKELQSKKFNSSDRFRERVSPVE

Query:  NCYKDFTSHHLVENVTPVNFNSMHLPLGNSSKISGVDVKYAHKTLEDIQSEQRNVENDDIFSRKRQKLRQFIQNMSFRGID---EKGYGVVSMLLSRLIP
        N  +DFTSH  VENVTP+NFNSMHLPLGNSSKIS VDVK+AHKT +DIQS+QRNVENDDIFSRKRQKLRQFIQNMSF G     EK YGV+S LLSRLIP
Subjt:  NCYKDFTSHHLVENVTPVNFNSMHLPLGNSSKISGVDVKYAHKTLEDIQSEQRNVENDDIFSRKRQKLRQFIQNMSFRGID---EKGYGVVSMLLSRLIP

Query:  ESNQY-------DFDNNLEQQQRLPGRCFPRLDYEHHLNNSSSPCRLNKSRGRVSYHSDFSTNSDDDNFDVKYRTKESDCELEGKMTLPDANGLPLTAAV
        ESNQY        F+NNLE+ Q LPGRC+PRLDYEH LNNSSSPCRLNKSRGRV +HSDFSTN+DDDNF VKYRTKE D ++EGKMTL DAN  P TAAV
Subjt:  ESNQY-------DFDNNLEQQQRLPGRCFPRLDYEHHLNNSSSPCRLNKSRGRVSYHSDFSTNSDDDNFDVKYRTKESDCELEGKMTLPDANGLPLTAAV

Query:  GNYRTLISRHFKRQYDLYDQGEPLHLRKQELEPLLLGWDDTNNIKDESSSSQLTELSTFAEPPILFTDDHQPNLHKSFGAVALCSSPFPSGNHRNLYSSP
         NYR LIS  F +QY  YDQGEPLH+RKQE+EPLLLGW DT++IKD+  SS+ TE  TFAEPPI F DDHQPNLH+SFGAVALCSSPFPS NHR+LYS P
Subjt:  GNYRTLISRHFKRQYDLYDQGEPLHLRKQELEPLLLGWDDTNNIKDESSSSQLTELSTFAEPPILFTDDHQPNLHKSFGAVALCSSPFPSGNHRNLYSSP

Query:  YSSLASYQVHGLSRHNVEKEEDIDATFNNVHLNFSSVPKFLGQFENYVKDRGSQSHDFFCAQSAHWLMNNVLDDERQDSSVESLCASGVVFDFGWKYFSG
        YSSLASYQ+HGLSR NVEKEE IDATFNNVHLNFSSVPK L Q +NYV DRG     FFCAQSA+W MN  LDDE +  S++S+CASG VFDFGWKY SG
Subjt:  YSSLASYQVHGLSRHNVEKEEDIDATFNNVHLNFSSVPKFLGQFENYVKDRGSQSHDFFCAQSAHWLMNNVLDDERQDSSVESLCASGVVFDFGWKYFSG

Query:  SNEQCQTAYHMLKYPLDEMRPAAPVNEECIHGSSDDVLVEYRPPFFIQPESFFQDGKVCSLLTDKLSWDAARSEINVNITEIDYI
        S E CQTAYH+L+YPLDEMRP +PVNEEC   SS     EY  PFFIQPESFFQ+GKV SLLTDKLSWD  RSEINV ITE+DY+
Subjt:  SNEQCQTAYHMLKYPLDEMRPAAPVNEECIHGSSDDVLVEYRPPFFIQPESFFQDGKVCSLLTDKLSWDAARSEINVNITEIDYI

XP_008461985.1 PREDICTED: uncharacterized protein LOC103500465 [Cucumis melo]3.6e-23875.13Show/hide
Query:  MKRNLKPSISNDSSCWKRIKVRDLDSDRPLRCRRDTSPMSLKERRVANSNNAKTSEFAFFKKFKEDANLRFSSSLSRQKELQSKKFNSSDRFRERVSPVE
        MKRN +P+ISNDSS  KR KV DLD DRPL CRRD+SP+SLKER V  + NAKTSEFAFFKKFKEDA+ RFSSSL RQKELQSKKFNSSD FRE  SPVE
Subjt:  MKRNLKPSISNDSSCWKRIKVRDLDSDRPLRCRRDTSPMSLKERRVANSNNAKTSEFAFFKKFKEDANLRFSSSLSRQKELQSKKFNSSDRFRERVSPVE

Query:  NCYKDFTSHHLVENVTPVNFNSMHLPLGNSSKISGVDVKYAHKTLEDIQSEQRNVENDDIFSRKRQKLRQFIQNMSFRGID---EKGYGVVSMLLSRLIP
        N  KDFTSHHLVE VTPVNFNS+HLPLGN SKIS VDVK+AHKT EDIQS+QRNVENDDIFSRKRQKLRQFIQNMSFRG     EKGYGV+S LLSRLIP
Subjt:  NCYKDFTSHHLVENVTPVNFNSMHLPLGNSSKISGVDVKYAHKTLEDIQSEQRNVENDDIFSRKRQKLRQFIQNMSFRGID---EKGYGVVSMLLSRLIP

Query:  ESNQYDFDNNLEQQQRLPGRCFPRLDYEHHLNNSSSPCRLNKSRGRVSYHSDFSTNSDDDNFDVKYRTKESDCELEGKMTLPDANGLPLTAAVGNYRTLI
        E N Y F+NNLE+ Q+L GRC+PRLDYEHHLNNS SPCRLN SRGR S+HSDFSTNS+D+NF VKYRTKE DC+++ KMTL D NG PLTAAV NYR+ I
Subjt:  ESNQYDFDNNLEQQQRLPGRCFPRLDYEHHLNNSSSPCRLNKSRGRVSYHSDFSTNSDDDNFDVKYRTKESDCELEGKMTLPDANGLPLTAAVGNYRTLI

Query:  SRHFKRQYDLYDQGEPLHLRKQELEPLLLGWDDTNNIKDESSSSQLTELSTFAEPPILFTDDHQPNLHKSFGAVALCSSPFPSGNHRNLYSSPYSSLASY
        S  F  QY LYDQ E LHLRKQ+LEPLLLGW DT+ IKDE SSSQLTEL+TFA+ PI F DDHQP LH+SFGAVALCSSPFPS N  N  S PYS+LASY
Subjt:  SRHFKRQYDLYDQGEPLHLRKQELEPLLLGWDDTNNIKDESSSSQLTELSTFAEPPILFTDDHQPNLHKSFGAVALCSSPFPSGNHRNLYSSPYSSLASY

Query:  QVHGLSRHNVEKEEDIDATFNNVHLNFSSVPKFLGQFENYVKDRGSQSHDFFCAQSAHWLMNNVLDDERQDSSVESLCASGVVFDFGWKYFSGSNEQCQT
        Q+ GLS  NV KEEDIDATFNN+HLNFSSVPK L Q  +YV D G   HD  CAQ+A W+MNNV++DE Q  SVESLCASG+VFDFGWKY SGS EQCQT
Subjt:  QVHGLSRHNVEKEEDIDATFNNVHLNFSSVPKFLGQFENYVKDRGSQSHDFFCAQSAHWLMNNVLDDERQDSSVESLCASGVVFDFGWKYFSGSNEQCQT

Query:  AYHMLKYPLDEMRPAAPVNEECIHGSSDDVLVEYRPPFFIQPESFFQDGKVCSLLTDKLS-WDAARSEINV-NITEIDY
        +YH+LKYPLDE++P A +NEE  + SSDDVLV+Y PPF+IQPESFFQ+GKV S+LTDKLS WD  RSEINV +ITE++Y
Subjt:  AYHMLKYPLDEMRPAAPVNEECIHGSSDDVLVEYRPPFFIQPESFFQDGKVCSLLTDKLS-WDAARSEINV-NITEIDY

XP_022954289.1 uncharacterized protein LOC111456585 isoform X2 [Cucurbita moschata]9.0e-23771.85Show/hide
Query:  MKRNLKPSISNDSSCWKRIKVRDLDSDRPLRCRRDTSPMSLKERRVANSNNAKTSEFAFFKKFKEDANLRFSSSLSRQKELQSKKFNSSDRFRERVSPVE
        MKR+LKPSIS DS+C KRIKV DLD  RPL CRRDTSP+SLK   V   NNAKTSEFAFFKKFKEDAN RFSSSL RQKELQ KKFNS+D FRER   VE
Subjt:  MKRNLKPSISNDSSCWKRIKVRDLDSDRPLRCRRDTSPMSLKERRVANSNNAKTSEFAFFKKFKEDANLRFSSSLSRQKELQSKKFNSSDRFRERVSPVE

Query:  NCYKDFTSHHLVENVTPVNFNSMHLPLGNSSKISGVDVKYAHKTLEDIQSEQRNVENDDIFSRKRQKLRQFIQNMSFRGIDE------------------
        N  +DFTSH  VENVTP+NFNSMHLPLGNSSKIS VDVK+AHKT +DIQS+QRNVENDDIFSRKRQKLRQFIQNMSF G  E                  
Subjt:  NCYKDFTSHHLVENVTPVNFNSMHLPLGNSSKISGVDVKYAHKTLEDIQSEQRNVENDDIFSRKRQKLRQFIQNMSFRGIDE------------------

Query:  -----------KGYGVVSMLLSRLIPESNQY-------DFDNNLEQQQRLPGRCFPRLDYEHHLNNSSSPCRLNKSRGRVSYHSDFSTNSDDDNFDVKYR
                     YGV+S LLSRLIPESNQY        F+NNLE+ Q LPGRC+PRLDYEH LNNSSSPCRLNKSRGRV +HSDFSTN+DDDNF VKYR
Subjt:  -----------KGYGVVSMLLSRLIPESNQY-------DFDNNLEQQQRLPGRCFPRLDYEHHLNNSSSPCRLNKSRGRVSYHSDFSTNSDDDNFDVKYR

Query:  TKESDCELEGKMTLPDANGLPLTAAVGNYRTLISRHFKRQYDLYDQGEPLHLRKQELEPLLLGWDDTNNIKDESSSSQLTELSTFAEPPILFTDDHQPNL
        TKE D ++EGKMTL DAN  P TAAV NYR LIS  F +QY  YDQGEPLH+RKQE+EPLLLGW DT++IKD+  SS+ TE  TFAEPPI F DDHQPNL
Subjt:  TKESDCELEGKMTLPDANGLPLTAAVGNYRTLISRHFKRQYDLYDQGEPLHLRKQELEPLLLGWDDTNNIKDESSSSQLTELSTFAEPPILFTDDHQPNL

Query:  HKSFGAVALCSSPFPSGNHRNLYSSPYSSLASYQVHGLSRHNVEKEEDIDATFNNVHLNFSSVPKFLGQFENYVKDRGSQSHDFFCAQSAHWLMNNVLDD
        H+SFGAVALCSSPFPS NHR+LYS PYSSLASYQ+HGLSR NVEKEE IDAT NNVHLNFSSVPK L Q +NYV DRG     FFCAQSA+W MN  LDD
Subjt:  HKSFGAVALCSSPFPSGNHRNLYSSPYSSLASYQVHGLSRHNVEKEEDIDATFNNVHLNFSSVPKFLGQFENYVKDRGSQSHDFFCAQSAHWLMNNVLDD

Query:  ERQDSSVESLCASGVVFDFGWKYFSGSNEQCQTAYHMLKYPLDEMRPAAPVNEECIHGSSDDVLVEYRPPFFIQPESFFQDGKVCSLLTDKLSWDAARSE
        E +  S++S+CASG VFDFGWKY SGS E CQTAYH+L+YPLDEMRP +PVNEEC   SS     EY  PFFIQPESFFQ+GKV SLLTDKLSWD  RSE
Subjt:  ERQDSSVESLCASGVVFDFGWKYFSGSNEQCQTAYHMLKYPLDEMRPAAPVNEECIHGSSDDVLVEYRPPFFIQPESFFQDGKVCSLLTDKLSWDAARSE

Query:  INVNITEIDYI
        INV ITE+DY+
Subjt:  INVNITEIDYI

XP_022954290.1 uncharacterized protein LOC111456585 isoform X3 [Cucurbita moschata]5.6e-23975.09Show/hide
Query:  MKRNLKPSISNDSSCWKRIKVRDLDSDRPLRCRRDTSPMSLKERRVANSNNAKTSEFAFFKKFKEDANLRFSSSLSRQKELQSKKFNSSDRFR-ERVSPV
        MKR+LKPSIS DS+C KRIKV DLD  RPL CRRDTSP+SLK   V   NNAKTSEFAFFKKFKEDAN RFSSSL RQKELQ KKFNS+D FR ER   V
Subjt:  MKRNLKPSISNDSSCWKRIKVRDLDSDRPLRCRRDTSPMSLKERRVANSNNAKTSEFAFFKKFKEDANLRFSSSLSRQKELQSKKFNSSDRFR-ERVSPV

Query:  ENCYKDFTSHHLVENVTPVNFNSMHLPLGNSSKISGVDVKYAHKTLEDIQSEQRNVENDDIFSRKRQKLRQFIQNMSFRGID---EKGYGVVSMLLSRLI
        EN  +DFTSH  VENVTP+NFNSMHLPLGNSSKIS VDVK+AHKT +DIQS+QRNVENDDIFSRKRQKLRQFIQNMSF G     EK YGV+S LLSRLI
Subjt:  ENCYKDFTSHHLVENVTPVNFNSMHLPLGNSSKISGVDVKYAHKTLEDIQSEQRNVENDDIFSRKRQKLRQFIQNMSFRGID---EKGYGVVSMLLSRLI

Query:  PESNQY-------DFDNNLEQQQRLPGRCFPRLDYEHHLNNSSSPCRLNKSRGRVSYHSDFSTNSDDDNFDVKYRTKESDCELEGKMTLPDANGLPLTAA
        PESNQY        F+NNLE+ Q LPGRC+PRLDYEH LNNSSSPCRLNKSRGRV +HSDFSTN+DDDNF VKYRTKE D ++EGKMTL DAN  P TAA
Subjt:  PESNQY-------DFDNNLEQQQRLPGRCFPRLDYEHHLNNSSSPCRLNKSRGRVSYHSDFSTNSDDDNFDVKYRTKESDCELEGKMTLPDANGLPLTAA

Query:  VGNYRTLISRHFKRQYDLYDQGEPLHLRKQELEPLLLGWDDTNNIKDESSSSQLTELSTFAEPPILFTDDHQPNLHKSFGAVALCSSPFPSGNHRNLYSS
        V NYR LIS  F +QY  YDQGEPLH+RKQE+EPLLLGW DT++IKD+  SS+ TE  TFAEPPI F DDHQPNLH+SFGAVALCSSPFPS NHR+LYS 
Subjt:  VGNYRTLISRHFKRQYDLYDQGEPLHLRKQELEPLLLGWDDTNNIKDESSSSQLTELSTFAEPPILFTDDHQPNLHKSFGAVALCSSPFPSGNHRNLYSS

Query:  PYSSLASYQVHGLSRHNVEKEEDIDATFNNVHLNFSSVPKFLGQFENYVKDRGSQSHDFFCAQSAHWLMNNVLDDERQDSSVESLCASGVVFDFGWKYFS
        PYSSLASYQ+HGLSR NVEKEE IDAT NNVHLNFSSVPK L Q +NYV DRG     FFCAQSA+W MN  LDDE +  S++S+CASG VFDFGWKY S
Subjt:  PYSSLASYQVHGLSRHNVEKEEDIDATFNNVHLNFSSVPKFLGQFENYVKDRGSQSHDFFCAQSAHWLMNNVLDDERQDSSVESLCASGVVFDFGWKYFS

Query:  GSNEQCQTAYHMLKYPLDEMRPAAPVNEECIHGSSDDVLVEYRPPFFIQPESFFQDGKVCSLLTDKLSWDAARSEINVNITEIDYI
        GS E CQTAYH+L+YPLDEMRP +PVNEEC   SS     EY  PFFIQPESFFQ+GKV SLLTDKLSWD  RSEINV ITE+DY+
Subjt:  GSNEQCQTAYHMLKYPLDEMRPAAPVNEECIHGSSDDVLVEYRPPFFIQPESFFQDGKVCSLLTDKLSWDAARSEINVNITEIDYI

XP_022991296.1 uncharacterized protein LOC111487990 isoform X4 [Cucurbita maxima]7.6e-23674.41Show/hide
Query:  MKRNLKPSISNDSSCWKRIKVRDLDSDRPLRCRRDTSPMSLKERRVA---NSNNAKTSEFAFFKKFKEDANLRFSSSLSRQKELQSKKFNSSDRFR-ERV
        MKRNLKPSIS DS C KRIKV DLD DRPL CRRDTSP+SLKE  V     +NNAKTSEFAFFKKFK DAN RFSSSL RQKELQSK+FNS+D FR ER 
Subjt:  MKRNLKPSISNDSSCWKRIKVRDLDSDRPLRCRRDTSPMSLKERRVA---NSNNAKTSEFAFFKKFKEDANLRFSSSLSRQKELQSKKFNSSDRFR-ERV

Query:  SPVENCYKDFTSHHLVENVTPVNFNSMHLPLGNSSKISGVDVKYAHKTLEDIQSEQRNVENDDIFSRKRQKLRQFIQNMSFRGID---EKGYGVVSMLLS
          VEN  +DF SH  VENVTP+NFNSMHLPLGNSSKIS VDVK+AHKT +DIQS+QRNVENDDIFSRKRQKLRQFIQNMSF G     EK YGV+S LLS
Subjt:  SPVENCYKDFTSHHLVENVTPVNFNSMHLPLGNSSKISGVDVKYAHKTLEDIQSEQRNVENDDIFSRKRQKLRQFIQNMSFRGID---EKGYGVVSMLLS

Query:  RLIPESNQY-------DFDNNLEQQQRLPGRCFPRLDYEHHLNNSSSPCRLNKSRGRVSYHSDFSTNSDDDNFDVKYRTKESDCELEGKMTLPDANGLPL
        RLIPESNQY        F+NNLE+ Q LPGRC+PRLDYEH LNNSSSPCRLNKSRGRV +HSDFSTN+DDDNF VKYRTK+ D ++EGKMTL DAN  P 
Subjt:  RLIPESNQY-------DFDNNLEQQQRLPGRCFPRLDYEHHLNNSSSPCRLNKSRGRVSYHSDFSTNSDDDNFDVKYRTKESDCELEGKMTLPDANGLPL

Query:  TAAVGNYRTLISRHFKRQYDLYDQGEPLHLRKQELEPLLLGWDDTNNIKDESSSSQLTELSTFAEPPILFTDDHQPNLHKSFGAVALCSSPFPSGNHRNL
        TAAV NYR+LI+  F  QY  YDQGEPLH+RKQE+EPLLLGW DT++IKD+  SS++TE  TFAEPPI F DDHQPNL +SFGAVALCSSPFPS  HRNL
Subjt:  TAAVGNYRTLISRHFKRQYDLYDQGEPLHLRKQELEPLLLGWDDTNNIKDESSSSQLTELSTFAEPPILFTDDHQPNLHKSFGAVALCSSPFPSGNHRNL

Query:  YSSPYSSLASYQVHGLSRHNVEKEEDIDATFNNVHLNFSSVPKFLGQFENYVKDRGSQSHDFFCAQSAHWLMNNVLDDERQDSSVESLCASGVVFDFGWK
        Y  PYSSL SYQ+HGLSRHNVEKEE IDATFNNVHLNFSSVPK L Q +NYV+DRG     FFCAQSA+WLMN   +DE +  S+ES+CASG VFDFGWK
Subjt:  YSSPYSSLASYQVHGLSRHNVEKEEDIDATFNNVHLNFSSVPKFLGQFENYVKDRGSQSHDFFCAQSAHWLMNNVLDDERQDSSVESLCASGVVFDFGWK

Query:  YFSGSNEQCQTAYHMLKYPLDEMRPAAPVNEECIHGSSDDVLVEYRPPFFIQPESFFQDG-KVCSLLTDKLSWDAARSEINVNITEIDYI
        Y SGS E CQTAYH+L+YPLDEMRP +PVNEEC   SS     EYR PFFIQPESFFQ+G KV SLLTDKLSWD  RSEINV ITE+DY+
Subjt:  YFSGSNEQCQTAYHMLKYPLDEMRPAAPVNEECIHGSSDDVLVEYRPPFFIQPESFFQDG-KVCSLLTDKLSWDAARSEINVNITEIDYI

TrEMBL top hitse value%identityAlignment
A0A1S3CFT3 uncharacterized protein LOC1035004651.8e-23875.13Show/hide
Query:  MKRNLKPSISNDSSCWKRIKVRDLDSDRPLRCRRDTSPMSLKERRVANSNNAKTSEFAFFKKFKEDANLRFSSSLSRQKELQSKKFNSSDRFRERVSPVE
        MKRN +P+ISNDSS  KR KV DLD DRPL CRRD+SP+SLKER V  + NAKTSEFAFFKKFKEDA+ RFSSSL RQKELQSKKFNSSD FRE  SPVE
Subjt:  MKRNLKPSISNDSSCWKRIKVRDLDSDRPLRCRRDTSPMSLKERRVANSNNAKTSEFAFFKKFKEDANLRFSSSLSRQKELQSKKFNSSDRFRERVSPVE

Query:  NCYKDFTSHHLVENVTPVNFNSMHLPLGNSSKISGVDVKYAHKTLEDIQSEQRNVENDDIFSRKRQKLRQFIQNMSFRGID---EKGYGVVSMLLSRLIP
        N  KDFTSHHLVE VTPVNFNS+HLPLGN SKIS VDVK+AHKT EDIQS+QRNVENDDIFSRKRQKLRQFIQNMSFRG     EKGYGV+S LLSRLIP
Subjt:  NCYKDFTSHHLVENVTPVNFNSMHLPLGNSSKISGVDVKYAHKTLEDIQSEQRNVENDDIFSRKRQKLRQFIQNMSFRGID---EKGYGVVSMLLSRLIP

Query:  ESNQYDFDNNLEQQQRLPGRCFPRLDYEHHLNNSSSPCRLNKSRGRVSYHSDFSTNSDDDNFDVKYRTKESDCELEGKMTLPDANGLPLTAAVGNYRTLI
        E N Y F+NNLE+ Q+L GRC+PRLDYEHHLNNS SPCRLN SRGR S+HSDFSTNS+D+NF VKYRTKE DC+++ KMTL D NG PLTAAV NYR+ I
Subjt:  ESNQYDFDNNLEQQQRLPGRCFPRLDYEHHLNNSSSPCRLNKSRGRVSYHSDFSTNSDDDNFDVKYRTKESDCELEGKMTLPDANGLPLTAAVGNYRTLI

Query:  SRHFKRQYDLYDQGEPLHLRKQELEPLLLGWDDTNNIKDESSSSQLTELSTFAEPPILFTDDHQPNLHKSFGAVALCSSPFPSGNHRNLYSSPYSSLASY
        S  F  QY LYDQ E LHLRKQ+LEPLLLGW DT+ IKDE SSSQLTEL+TFA+ PI F DDHQP LH+SFGAVALCSSPFPS N  N  S PYS+LASY
Subjt:  SRHFKRQYDLYDQGEPLHLRKQELEPLLLGWDDTNNIKDESSSSQLTELSTFAEPPILFTDDHQPNLHKSFGAVALCSSPFPSGNHRNLYSSPYSSLASY

Query:  QVHGLSRHNVEKEEDIDATFNNVHLNFSSVPKFLGQFENYVKDRGSQSHDFFCAQSAHWLMNNVLDDERQDSSVESLCASGVVFDFGWKYFSGSNEQCQT
        Q+ GLS  NV KEEDIDATFNN+HLNFSSVPK L Q  +YV D G   HD  CAQ+A W+MNNV++DE Q  SVESLCASG+VFDFGWKY SGS EQCQT
Subjt:  QVHGLSRHNVEKEEDIDATFNNVHLNFSSVPKFLGQFENYVKDRGSQSHDFFCAQSAHWLMNNVLDDERQDSSVESLCASGVVFDFGWKYFSGSNEQCQT

Query:  AYHMLKYPLDEMRPAAPVNEECIHGSSDDVLVEYRPPFFIQPESFFQDGKVCSLLTDKLS-WDAARSEINV-NITEIDY
        +YH+LKYPLDE++P A +NEE  + SSDDVLV+Y PPF+IQPESFFQ+GKV S+LTDKLS WD  RSEINV +ITE++Y
Subjt:  AYHMLKYPLDEMRPAAPVNEECIHGSSDDVLVEYRPPFFIQPESFFQDGKVCSLLTDKLS-WDAARSEINV-NITEIDY

A0A6J1GQP2 uncharacterized protein LOC111456585 isoform X11.1e-23571.73Show/hide
Query:  MKRNLKPSISNDSSCWKRIKVRDLDSDRPLRCRRDTSPMSLKERRVANSNNAKTSEFAFFKKFKEDANLRFSSSLSRQKELQSKKFNSSDRFR-ERVSPV
        MKR+LKPSIS DS+C KRIKV DLD  RPL CRRDTSP+SLK   V   NNAKTSEFAFFKKFKEDAN RFSSSL RQKELQ KKFNS+D FR ER   V
Subjt:  MKRNLKPSISNDSSCWKRIKVRDLDSDRPLRCRRDTSPMSLKERRVANSNNAKTSEFAFFKKFKEDANLRFSSSLSRQKELQSKKFNSSDRFR-ERVSPV

Query:  ENCYKDFTSHHLVENVTPVNFNSMHLPLGNSSKISGVDVKYAHKTLEDIQSEQRNVENDDIFSRKRQKLRQFIQNMSFRGIDE-----------------
        EN  +DFTSH  VENVTP+NFNSMHLPLGNSSKIS VDVK+AHKT +DIQS+QRNVENDDIFSRKRQKLRQFIQNMSF G  E                 
Subjt:  ENCYKDFTSHHLVENVTPVNFNSMHLPLGNSSKISGVDVKYAHKTLEDIQSEQRNVENDDIFSRKRQKLRQFIQNMSFRGIDE-----------------

Query:  ------------KGYGVVSMLLSRLIPESNQY-------DFDNNLEQQQRLPGRCFPRLDYEHHLNNSSSPCRLNKSRGRVSYHSDFSTNSDDDNFDVKY
                      YGV+S LLSRLIPESNQY        F+NNLE+ Q LPGRC+PRLDYEH LNNSSSPCRLNKSRGRV +HSDFSTN+DDDNF VKY
Subjt:  ------------KGYGVVSMLLSRLIPESNQY-------DFDNNLEQQQRLPGRCFPRLDYEHHLNNSSSPCRLNKSRGRVSYHSDFSTNSDDDNFDVKY

Query:  RTKESDCELEGKMTLPDANGLPLTAAVGNYRTLISRHFKRQYDLYDQGEPLHLRKQELEPLLLGWDDTNNIKDESSSSQLTELSTFAEPPILFTDDHQPN
        RTKE D ++EGKMTL DAN  P TAAV NYR LIS  F +QY  YDQGEPLH+RKQE+EPLLLGW DT++IKD+  SS+ TE  TFAEPPI F DDHQPN
Subjt:  RTKESDCELEGKMTLPDANGLPLTAAVGNYRTLISRHFKRQYDLYDQGEPLHLRKQELEPLLLGWDDTNNIKDESSSSQLTELSTFAEPPILFTDDHQPN

Query:  LHKSFGAVALCSSPFPSGNHRNLYSSPYSSLASYQVHGLSRHNVEKEEDIDATFNNVHLNFSSVPKFLGQFENYVKDRGSQSHDFFCAQSAHWLMNNVLD
        LH+SFGAVALCSSPFPS NHR+LYS PYSSLASYQ+HGLSR NVEKEE IDAT NNVHLNFSSVPK L Q +NYV DRG     FFCAQSA+W MN  LD
Subjt:  LHKSFGAVALCSSPFPSGNHRNLYSSPYSSLASYQVHGLSRHNVEKEEDIDATFNNVHLNFSSVPKFLGQFENYVKDRGSQSHDFFCAQSAHWLMNNVLD

Query:  DERQDSSVESLCASGVVFDFGWKYFSGSNEQCQTAYHMLKYPLDEMRPAAPVNEECIHGSSDDVLVEYRPPFFIQPESFFQDGKVCSLLTDKLSWDAARS
        DE +  S++S+CASG VFDFGWKY SGS E CQTAYH+L+YPLDEMRP +PVNEEC   SS     EY  PFFIQPESFFQ+GKV SLLTDKLSWD  RS
Subjt:  DERQDSSVESLCASGVVFDFGWKYFSGSNEQCQTAYHMLKYPLDEMRPAAPVNEECIHGSSDDVLVEYRPPFFIQPESFFQDGKVCSLLTDKLSWDAARS

Query:  EINVNITEIDYI
        EINV ITE+DY+
Subjt:  EINVNITEIDYI

A0A6J1GS11 uncharacterized protein LOC111456585 isoform X32.7e-23975.09Show/hide
Query:  MKRNLKPSISNDSSCWKRIKVRDLDSDRPLRCRRDTSPMSLKERRVANSNNAKTSEFAFFKKFKEDANLRFSSSLSRQKELQSKKFNSSDRFR-ERVSPV
        MKR+LKPSIS DS+C KRIKV DLD  RPL CRRDTSP+SLK   V   NNAKTSEFAFFKKFKEDAN RFSSSL RQKELQ KKFNS+D FR ER   V
Subjt:  MKRNLKPSISNDSSCWKRIKVRDLDSDRPLRCRRDTSPMSLKERRVANSNNAKTSEFAFFKKFKEDANLRFSSSLSRQKELQSKKFNSSDRFR-ERVSPV

Query:  ENCYKDFTSHHLVENVTPVNFNSMHLPLGNSSKISGVDVKYAHKTLEDIQSEQRNVENDDIFSRKRQKLRQFIQNMSFRGID---EKGYGVVSMLLSRLI
        EN  +DFTSH  VENVTP+NFNSMHLPLGNSSKIS VDVK+AHKT +DIQS+QRNVENDDIFSRKRQKLRQFIQNMSF G     EK YGV+S LLSRLI
Subjt:  ENCYKDFTSHHLVENVTPVNFNSMHLPLGNSSKISGVDVKYAHKTLEDIQSEQRNVENDDIFSRKRQKLRQFIQNMSFRGID---EKGYGVVSMLLSRLI

Query:  PESNQY-------DFDNNLEQQQRLPGRCFPRLDYEHHLNNSSSPCRLNKSRGRVSYHSDFSTNSDDDNFDVKYRTKESDCELEGKMTLPDANGLPLTAA
        PESNQY        F+NNLE+ Q LPGRC+PRLDYEH LNNSSSPCRLNKSRGRV +HSDFSTN+DDDNF VKYRTKE D ++EGKMTL DAN  P TAA
Subjt:  PESNQY-------DFDNNLEQQQRLPGRCFPRLDYEHHLNNSSSPCRLNKSRGRVSYHSDFSTNSDDDNFDVKYRTKESDCELEGKMTLPDANGLPLTAA

Query:  VGNYRTLISRHFKRQYDLYDQGEPLHLRKQELEPLLLGWDDTNNIKDESSSSQLTELSTFAEPPILFTDDHQPNLHKSFGAVALCSSPFPSGNHRNLYSS
        V NYR LIS  F +QY  YDQGEPLH+RKQE+EPLLLGW DT++IKD+  SS+ TE  TFAEPPI F DDHQPNLH+SFGAVALCSSPFPS NHR+LYS 
Subjt:  VGNYRTLISRHFKRQYDLYDQGEPLHLRKQELEPLLLGWDDTNNIKDESSSSQLTELSTFAEPPILFTDDHQPNLHKSFGAVALCSSPFPSGNHRNLYSS

Query:  PYSSLASYQVHGLSRHNVEKEEDIDATFNNVHLNFSSVPKFLGQFENYVKDRGSQSHDFFCAQSAHWLMNNVLDDERQDSSVESLCASGVVFDFGWKYFS
        PYSSLASYQ+HGLSR NVEKEE IDAT NNVHLNFSSVPK L Q +NYV DRG     FFCAQSA+W MN  LDDE +  S++S+CASG VFDFGWKY S
Subjt:  PYSSLASYQVHGLSRHNVEKEEDIDATFNNVHLNFSSVPKFLGQFENYVKDRGSQSHDFFCAQSAHWLMNNVLDDERQDSSVESLCASGVVFDFGWKYFS

Query:  GSNEQCQTAYHMLKYPLDEMRPAAPVNEECIHGSSDDVLVEYRPPFFIQPESFFQDGKVCSLLTDKLSWDAARSEINVNITEIDYI
        GS E CQTAYH+L+YPLDEMRP +PVNEEC   SS     EY  PFFIQPESFFQ+GKV SLLTDKLSWD  RSEINV ITE+DY+
Subjt:  GSNEQCQTAYHMLKYPLDEMRPAAPVNEECIHGSSDDVLVEYRPPFFIQPESFFQDGKVCSLLTDKLSWDAARSEINVNITEIDYI

A0A6J1GSJ6 uncharacterized protein LOC111456585 isoform X24.3e-23771.85Show/hide
Query:  MKRNLKPSISNDSSCWKRIKVRDLDSDRPLRCRRDTSPMSLKERRVANSNNAKTSEFAFFKKFKEDANLRFSSSLSRQKELQSKKFNSSDRFRERVSPVE
        MKR+LKPSIS DS+C KRIKV DLD  RPL CRRDTSP+SLK   V   NNAKTSEFAFFKKFKEDAN RFSSSL RQKELQ KKFNS+D FRER   VE
Subjt:  MKRNLKPSISNDSSCWKRIKVRDLDSDRPLRCRRDTSPMSLKERRVANSNNAKTSEFAFFKKFKEDANLRFSSSLSRQKELQSKKFNSSDRFRERVSPVE

Query:  NCYKDFTSHHLVENVTPVNFNSMHLPLGNSSKISGVDVKYAHKTLEDIQSEQRNVENDDIFSRKRQKLRQFIQNMSFRGIDE------------------
        N  +DFTSH  VENVTP+NFNSMHLPLGNSSKIS VDVK+AHKT +DIQS+QRNVENDDIFSRKRQKLRQFIQNMSF G  E                  
Subjt:  NCYKDFTSHHLVENVTPVNFNSMHLPLGNSSKISGVDVKYAHKTLEDIQSEQRNVENDDIFSRKRQKLRQFIQNMSFRGIDE------------------

Query:  -----------KGYGVVSMLLSRLIPESNQY-------DFDNNLEQQQRLPGRCFPRLDYEHHLNNSSSPCRLNKSRGRVSYHSDFSTNSDDDNFDVKYR
                     YGV+S LLSRLIPESNQY        F+NNLE+ Q LPGRC+PRLDYEH LNNSSSPCRLNKSRGRV +HSDFSTN+DDDNF VKYR
Subjt:  -----------KGYGVVSMLLSRLIPESNQY-------DFDNNLEQQQRLPGRCFPRLDYEHHLNNSSSPCRLNKSRGRVSYHSDFSTNSDDDNFDVKYR

Query:  TKESDCELEGKMTLPDANGLPLTAAVGNYRTLISRHFKRQYDLYDQGEPLHLRKQELEPLLLGWDDTNNIKDESSSSQLTELSTFAEPPILFTDDHQPNL
        TKE D ++EGKMTL DAN  P TAAV NYR LIS  F +QY  YDQGEPLH+RKQE+EPLLLGW DT++IKD+  SS+ TE  TFAEPPI F DDHQPNL
Subjt:  TKESDCELEGKMTLPDANGLPLTAAVGNYRTLISRHFKRQYDLYDQGEPLHLRKQELEPLLLGWDDTNNIKDESSSSQLTELSTFAEPPILFTDDHQPNL

Query:  HKSFGAVALCSSPFPSGNHRNLYSSPYSSLASYQVHGLSRHNVEKEEDIDATFNNVHLNFSSVPKFLGQFENYVKDRGSQSHDFFCAQSAHWLMNNVLDD
        H+SFGAVALCSSPFPS NHR+LYS PYSSLASYQ+HGLSR NVEKEE IDAT NNVHLNFSSVPK L Q +NYV DRG     FFCAQSA+W MN  LDD
Subjt:  HKSFGAVALCSSPFPSGNHRNLYSSPYSSLASYQVHGLSRHNVEKEEDIDATFNNVHLNFSSVPKFLGQFENYVKDRGSQSHDFFCAQSAHWLMNNVLDD

Query:  ERQDSSVESLCASGVVFDFGWKYFSGSNEQCQTAYHMLKYPLDEMRPAAPVNEECIHGSSDDVLVEYRPPFFIQPESFFQDGKVCSLLTDKLSWDAARSE
        E +  S++S+CASG VFDFGWKY SGS E CQTAYH+L+YPLDEMRP +PVNEEC   SS     EY  PFFIQPESFFQ+GKV SLLTDKLSWD  RSE
Subjt:  ERQDSSVESLCASGVVFDFGWKYFSGSNEQCQTAYHMLKYPLDEMRPAAPVNEECIHGSSDDVLVEYRPPFFIQPESFFQDGKVCSLLTDKLSWDAARSE

Query:  INVNITEIDYI
        INV ITE+DY+
Subjt:  INVNITEIDYI

A0A6J1JQC1 uncharacterized protein LOC111487990 isoform X43.7e-23674.41Show/hide
Query:  MKRNLKPSISNDSSCWKRIKVRDLDSDRPLRCRRDTSPMSLKERRVA---NSNNAKTSEFAFFKKFKEDANLRFSSSLSRQKELQSKKFNSSDRFR-ERV
        MKRNLKPSIS DS C KRIKV DLD DRPL CRRDTSP+SLKE  V     +NNAKTSEFAFFKKFK DAN RFSSSL RQKELQSK+FNS+D FR ER 
Subjt:  MKRNLKPSISNDSSCWKRIKVRDLDSDRPLRCRRDTSPMSLKERRVA---NSNNAKTSEFAFFKKFKEDANLRFSSSLSRQKELQSKKFNSSDRFR-ERV

Query:  SPVENCYKDFTSHHLVENVTPVNFNSMHLPLGNSSKISGVDVKYAHKTLEDIQSEQRNVENDDIFSRKRQKLRQFIQNMSFRGID---EKGYGVVSMLLS
          VEN  +DF SH  VENVTP+NFNSMHLPLGNSSKIS VDVK+AHKT +DIQS+QRNVENDDIFSRKRQKLRQFIQNMSF G     EK YGV+S LLS
Subjt:  SPVENCYKDFTSHHLVENVTPVNFNSMHLPLGNSSKISGVDVKYAHKTLEDIQSEQRNVENDDIFSRKRQKLRQFIQNMSFRGID---EKGYGVVSMLLS

Query:  RLIPESNQY-------DFDNNLEQQQRLPGRCFPRLDYEHHLNNSSSPCRLNKSRGRVSYHSDFSTNSDDDNFDVKYRTKESDCELEGKMTLPDANGLPL
        RLIPESNQY        F+NNLE+ Q LPGRC+PRLDYEH LNNSSSPCRLNKSRGRV +HSDFSTN+DDDNF VKYRTK+ D ++EGKMTL DAN  P 
Subjt:  RLIPESNQY-------DFDNNLEQQQRLPGRCFPRLDYEHHLNNSSSPCRLNKSRGRVSYHSDFSTNSDDDNFDVKYRTKESDCELEGKMTLPDANGLPL

Query:  TAAVGNYRTLISRHFKRQYDLYDQGEPLHLRKQELEPLLLGWDDTNNIKDESSSSQLTELSTFAEPPILFTDDHQPNLHKSFGAVALCSSPFPSGNHRNL
        TAAV NYR+LI+  F  QY  YDQGEPLH+RKQE+EPLLLGW DT++IKD+  SS++TE  TFAEPPI F DDHQPNL +SFGAVALCSSPFPS  HRNL
Subjt:  TAAVGNYRTLISRHFKRQYDLYDQGEPLHLRKQELEPLLLGWDDTNNIKDESSSSQLTELSTFAEPPILFTDDHQPNLHKSFGAVALCSSPFPSGNHRNL

Query:  YSSPYSSLASYQVHGLSRHNVEKEEDIDATFNNVHLNFSSVPKFLGQFENYVKDRGSQSHDFFCAQSAHWLMNNVLDDERQDSSVESLCASGVVFDFGWK
        Y  PYSSL SYQ+HGLSRHNVEKEE IDATFNNVHLNFSSVPK L Q +NYV+DRG     FFCAQSA+WLMN   +DE +  S+ES+CASG VFDFGWK
Subjt:  YSSPYSSLASYQVHGLSRHNVEKEEDIDATFNNVHLNFSSVPKFLGQFENYVKDRGSQSHDFFCAQSAHWLMNNVLDDERQDSSVESLCASGVVFDFGWK

Query:  YFSGSNEQCQTAYHMLKYPLDEMRPAAPVNEECIHGSSDDVLVEYRPPFFIQPESFFQDG-KVCSLLTDKLSWDAARSEINVNITEIDYI
        Y SGS E CQTAYH+L+YPLDEMRP +PVNEEC   SS     EYR PFFIQPESFFQ+G KV SLLTDKLSWD  RSEINV ITE+DY+
Subjt:  YFSGSNEQCQTAYHMLKYPLDEMRPAAPVNEECIHGSSDDVLVEYRPPFFIQPESFFQDG-KVCSLLTDKLSWDAARSEINVNITEIDYI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G20250.1 unknown protein5.1e-0430.43Show/hide
Query:  LKERRVANSNNAKTSEFAFFKKFKEDANLRFSSSLSRQKELQSKKFNSSDRFRERVSPVENC----YKDFTSHHLVENVTPVN-----FNSMHLPL----
        L E +   S +AKTSEFAFFKK K  +N    S  S  K   +K       F  R  P + C      D          TP++      + +H  L    
Subjt:  LKERRVANSNNAKTSEFAFFKKFKEDANLRFSSSLSRQKELQSKKFNSSDRFRERVSPVENC----YKDFTSHHLVENVTPVN-----FNSMHLPL----

Query:  -----GNSSKISGVDVKYAHKTLEDIQSEQRNV--ENDDIFSRKRQKLRQFIQNMSFRGIDE---KGYGVVSMLLSRLIPESNQ
             G SS     D +Y   + ++++SE   +  E  DIFS KR+KL Q++++     I E    G+ +VS+LL+RL P + +
Subjt:  -----GNSSKISGVDVKYAHKTLEDIQSEQRNV--ENDDIFSRKRQKLRQFIQNMSFRGIDE---KGYGVVSMLLSRLIPESNQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCGCAATCTCAAACCTTCCATTTCTAATGATAGTAGCTGTTGGAAGAGAATAAAAGTTAGAGATCTTGATTCTGATAGACCTTTGAGATGCAGGAGAGATACTTC
TCCCATGTCCTTGAAAGAAAGACGTGTTGCAAACTCAAATAATGCAAAAACTTCTGAGTTTGCATTTTTTAAGAAGTTCAAGGAAGATGCAAACCTTAGATTCAGTTCCT
CTCTTTCACGTCAAAAGGAACTTCAATCAAAGAAGTTCAACTCGAGTGATCGTTTCAGAGAGAGAGTCAGCCCTGTTGAAAACTGCTATAAAGATTTCACATCACATCAT
CTTGTTGAGAACGTCACTCCTGTTAACTTTAACTCGATGCATTTACCACTGGGTAATTCATCCAAAATTTCAGGGGTAGATGTGAAATACGCTCATAAAACATTGGAGGA
TATACAGAGCGAACAGAGAAACGTGGAAAATGATGATATTTTTAGTAGGAAGAGGCAGAAATTGCGTCAGTTCATTCAGAATATGTCATTCCGTGGAATTGATGAGAAGG
GGTATGGTGTTGTTTCCATGCTACTTAGCCGGCTTATACCTGAGAGCAATCAGTATGATTTTGATAATAACTTGGAACAACAACAACGGTTGCCTGGAAGGTGCTTCCCA
AGACTTGATTATGAACATCATTTGAATAATAGTTCATCACCTTGTCGTTTGAATAAATCAAGAGGAAGAGTTTCTTACCATTCTGATTTCTCAACCAATAGCGATGATGA
CAACTTCGACGTTAAGTACAGAACCAAGGAGTCGGACTGTGAACTAGAAGGAAAAATGACTTTGCCGGATGCCAATGGTTTGCCTCTTACTGCTGCAGTTGGAAACTATA
GAACACTTATTTCCCGCCATTTCAAACGGCAATATGATTTATATGATCAAGGTGAACCTTTGCACCTAAGAAAGCAAGAGCTAGAACCTCTTCTGTTGGGTTGGGACGAT
ACCAACAACATAAAAGATGAAAGCTCTTCTTCTCAACTTACAGAGTTGAGCACATTTGCCGAGCCACCAATTTTGTTCACTGATGATCATCAGCCAAACTTGCACAAGAG
TTTTGGTGCTGTTGCACTGTGTTCATCCCCTTTCCCTTCCGGTAATCATAGAAACTTATACTCATCACCATACTCCAGTTTAGCTAGCTATCAAGTTCATGGGTTAAGTA
GGCATAATGTAGAAAAGGAGGAAGATATAGATGCCACTTTCAACAACGTGCATTTGAATTTCTCATCTGTACCCAAATTTCTTGGTCAGTTCGAAAACTATGTCAAAGAC
AGAGGCAGCCAAAGCCATGACTTCTTCTGTGCACAAAGTGCTCATTGGCTTATGAATAATGTGTTGGATGACGAACGCCAAGATTCTTCTGTAGAAAGTCTGTGTGCTTC
TGGCGTGGTCTTTGATTTTGGATGGAAATACTTCTCAGGCTCAAATGAGCAATGCCAAACAGCTTATCATATGCTTAAATACCCACTGGATGAAATGAGACCTGCAGCCC
CTGTCAATGAAGAATGTATTCATGGAAGTTCAGATGATGTCCTCGTGGAATATCGACCGCCCTTCTTTATCCAACCCGAGTCATTCTTTCAAGATGGGAAGGTATGCTCC
TTACTGACTGATAAACTTAGCTGGGATGCAGCCAGAAGTGAAATAAATGTTAATATAACTGAAATAGATTACATATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGCGCAATCTCAAACCTTCCATTTCTAATGATAGTAGCTGTTGGAAGAGAATAAAAGTTAGAGATCTTGATTCTGATAGACCTTTGAGATGCAGGAGAGATACTTC
TCCCATGTCCTTGAAAGAAAGACGTGTTGCAAACTCAAATAATGCAAAAACTTCTGAGTTTGCATTTTTTAAGAAGTTCAAGGAAGATGCAAACCTTAGATTCAGTTCCT
CTCTTTCACGTCAAAAGGAACTTCAATCAAAGAAGTTCAACTCGAGTGATCGTTTCAGAGAGAGAGTCAGCCCTGTTGAAAACTGCTATAAAGATTTCACATCACATCAT
CTTGTTGAGAACGTCACTCCTGTTAACTTTAACTCGATGCATTTACCACTGGGTAATTCATCCAAAATTTCAGGGGTAGATGTGAAATACGCTCATAAAACATTGGAGGA
TATACAGAGCGAACAGAGAAACGTGGAAAATGATGATATTTTTAGTAGGAAGAGGCAGAAATTGCGTCAGTTCATTCAGAATATGTCATTCCGTGGAATTGATGAGAAGG
GGTATGGTGTTGTTTCCATGCTACTTAGCCGGCTTATACCTGAGAGCAATCAGTATGATTTTGATAATAACTTGGAACAACAACAACGGTTGCCTGGAAGGTGCTTCCCA
AGACTTGATTATGAACATCATTTGAATAATAGTTCATCACCTTGTCGTTTGAATAAATCAAGAGGAAGAGTTTCTTACCATTCTGATTTCTCAACCAATAGCGATGATGA
CAACTTCGACGTTAAGTACAGAACCAAGGAGTCGGACTGTGAACTAGAAGGAAAAATGACTTTGCCGGATGCCAATGGTTTGCCTCTTACTGCTGCAGTTGGAAACTATA
GAACACTTATTTCCCGCCATTTCAAACGGCAATATGATTTATATGATCAAGGTGAACCTTTGCACCTAAGAAAGCAAGAGCTAGAACCTCTTCTGTTGGGTTGGGACGAT
ACCAACAACATAAAAGATGAAAGCTCTTCTTCTCAACTTACAGAGTTGAGCACATTTGCCGAGCCACCAATTTTGTTCACTGATGATCATCAGCCAAACTTGCACAAGAG
TTTTGGTGCTGTTGCACTGTGTTCATCCCCTTTCCCTTCCGGTAATCATAGAAACTTATACTCATCACCATACTCCAGTTTAGCTAGCTATCAAGTTCATGGGTTAAGTA
GGCATAATGTAGAAAAGGAGGAAGATATAGATGCCACTTTCAACAACGTGCATTTGAATTTCTCATCTGTACCCAAATTTCTTGGTCAGTTCGAAAACTATGTCAAAGAC
AGAGGCAGCCAAAGCCATGACTTCTTCTGTGCACAAAGTGCTCATTGGCTTATGAATAATGTGTTGGATGACGAACGCCAAGATTCTTCTGTAGAAAGTCTGTGTGCTTC
TGGCGTGGTCTTTGATTTTGGATGGAAATACTTCTCAGGCTCAAATGAGCAATGCCAAACAGCTTATCATATGCTTAAATACCCACTGGATGAAATGAGACCTGCAGCCC
CTGTCAATGAAGAATGTATTCATGGAAGTTCAGATGATGTCCTCGTGGAATATCGACCGCCCTTCTTTATCCAACCCGAGTCATTCTTTCAAGATGGGAAGGTATGCTCC
TTACTGACTGATAAACTTAGCTGGGATGCAGCCAGAAGTGAAATAAATGTTAATATAACTGAAATAGATTACATATGA
Protein sequenceShow/hide protein sequence
MKRNLKPSISNDSSCWKRIKVRDLDSDRPLRCRRDTSPMSLKERRVANSNNAKTSEFAFFKKFKEDANLRFSSSLSRQKELQSKKFNSSDRFRERVSPVENCYKDFTSHH
LVENVTPVNFNSMHLPLGNSSKISGVDVKYAHKTLEDIQSEQRNVENDDIFSRKRQKLRQFIQNMSFRGIDEKGYGVVSMLLSRLIPESNQYDFDNNLEQQQRLPGRCFP
RLDYEHHLNNSSSPCRLNKSRGRVSYHSDFSTNSDDDNFDVKYRTKESDCELEGKMTLPDANGLPLTAAVGNYRTLISRHFKRQYDLYDQGEPLHLRKQELEPLLLGWDD
TNNIKDESSSSQLTELSTFAEPPILFTDDHQPNLHKSFGAVALCSSPFPSGNHRNLYSSPYSSLASYQVHGLSRHNVEKEEDIDATFNNVHLNFSSVPKFLGQFENYVKD
RGSQSHDFFCAQSAHWLMNNVLDDERQDSSVESLCASGVVFDFGWKYFSGSNEQCQTAYHMLKYPLDEMRPAAPVNEECIHGSSDDVLVEYRPPFFIQPESFFQDGKVCS
LLTDKLSWDAARSEINVNITEIDYI