; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0021481 (gene) of Chayote v1 genome

Gene IDSed0021481
OrganismSechium edule (Chayote v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationLG05:9858977..9862293
RNA-Seq ExpressionSed0021481
SyntenySed0021481
Gene Ontology termsNA
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588985.1 Retrovirus-related Pol polyprotein from transposon RE1, partial [Cucurbita argyrosperma subsp. sororia]1.3e-5348.31Show/hide
Query:  SSIHLLTNICNLVSVRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEASTKSAEYELWFEKDQALITLLNATLSQTALSFAIRSKTSKA
        S I LL+NICNL+S++LDSTNY+LW++Q+++LLKAHKL+G+IDG    P             +  Y+ WF KDQAL+T++NATLS  AL++ + S TSK 
Subjt:  SSIHLLTNICNLVSVRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEASTKSAEYELWFEKDQALITLLNATLSQTALSFAIRSKTSKA

Query:  LWETLEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTSLRTRATTISFEELHSLLRIEE
        +W  L K YSSS+R+N+V LKS+LQ+ISKK  ESID Y++RI  I  KLA V   ++ EDL+IY +NG P+ YN F+TS+RTR+T ++FEELH LL+ EE
Subjt:  LWETLEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTSLRTRATTISFEELHSLLRIEE

Query:  NELDIQSKKDD--------SAAIQTLAMNMYTNNHGGWRSSNRGRGRNNG-------NRGGRGQYNQ
        + L  QSK+DD         A+ Q+L     T N+   R   RGRG  +G        RGG     Q
Subjt:  NELDIQSKKDD--------SAAIQTLAMNMYTNNHGGWRSSNRGRGRNNG-------NRGGRGQYNQ

KAG7015254.1 hypothetical protein SDJN02_22888, partial [Cucurbita argyrosperma subsp. argyrosperma]1.3e-5348.31Show/hide
Query:  SSIHLLTNICNLVSVRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEASTKSAEYELWFEKDQALITLLNATLSQTALSFAIRSKTSKA
        S I LL+NICNL+S++LDSTNY+LW++Q+++LLKAHKL+G+IDG    P             +  Y+ WF KDQAL+T++NATLS  AL++ + S TSK 
Subjt:  SSIHLLTNICNLVSVRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEASTKSAEYELWFEKDQALITLLNATLSQTALSFAIRSKTSKA

Query:  LWETLEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTSLRTRATTISFEELHSLLRIEE
        +W  L K YSSS+R+N+V LKS+LQ+ISKK  ESID Y++RI  I  KLA V   ++ EDL+IY +NG P+ YN F+TS+RTR+T ++FEELH LL+ EE
Subjt:  LWETLEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTSLRTRATTISFEELHSLLRIEE

Query:  NELDIQSKKDD--------SAAIQTLAMNMYTNNHGGWRSSNRGRGRNNG-------NRGGRGQYNQ
        + L  QSK+DD         A+ Q+L     T N+   R   RGRG  +G        RGG     Q
Subjt:  NELDIQSKKDD--------SAAIQTLAMNMYTNNHGGWRSSNRGRGRNNG-------NRGGRGQYNQ

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]2.0e-5146.72Show/hide
Query:  MANITFVPSSS--------IHLLTNICNLVSVRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEAST----KSAEYELWFEKDQALITL
        M++ T +PSSS        I LL+NICNL+S+RLDSTN++LW++Q++++LKAHKLYG+IDG    P   T      ST     +  YE W  KDQAL+T+
Subjt:  MANITFVPSSS--------IHLLTNICNLVSVRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEAST----KSAEYELWFEKDQALITL

Query:  LNATLSQTALSFAIRSKTSKALWETLEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTS
        +NATLS  AL++ + S +SK +W+ L K YSS +R+N+V LKS+LQ+I KK  ESID Y++RI  I  KLA V   I++EDL+IY +NG P+ YN F+TS
Subjt:  LNATLSQTALSFAIRSKTSKALWETLEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTS

Query:  LRTRATTISFEELHSLLRIEENELDIQSKKDDSAAIQTLAMNMYTN--NHGGWRSSNRGRGRNNGNRGGRGQYN
        +RTR+  ++FEELH LLR EE+ L  QSK DDS    T+ ++   +  +      +N  RG  +G   G G+++
Subjt:  LRTRATTISFEELHSLLRIEENELDIQSKKDDSAAIQTLAMNMYTN--NHGGWRSSNRGRGRNNGNRGGRGQYN

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]2.0e-5146.72Show/hide
Query:  MANITFVPSSS--------IHLLTNICNLVSVRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEAST----KSAEYELWFEKDQALITL
        M++ T +PSSS        I LL+NICNL+S+RLDSTN++LW++Q++++LKAHKLYG+IDG    P   T      ST     +  YE W  KDQAL+T+
Subjt:  MANITFVPSSS--------IHLLTNICNLVSVRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEAST----KSAEYELWFEKDQALITL

Query:  LNATLSQTALSFAIRSKTSKALWETLEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTS
        +NATLS  AL++ + S +SK +W+ L K YSS +R+N+V LKS+LQ+I KK  ESID Y++RI  I  KLA V   I++EDL+IY +NG P+ YN F+TS
Subjt:  LNATLSQTALSFAIRSKTSKALWETLEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTS

Query:  LRTRATTISFEELHSLLRIEENELDIQSKKDDSAAIQTLAMNMYTN--NHGGWRSSNRGRGRNNGNRGGRGQYN
        +RTR+  ++FEELH LLR EE+ L  QSK DDS    T+ ++   +  +      +N  RG  +G   G G+++
Subjt:  LRTRATTISFEELHSLLRIEENELDIQSKKDDSAAIQTLAMNMYTN--NHGGWRSSNRGRGRNNGNRGGRGQYN

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia]2.8e-5347.43Show/hide
Query:  SSIHLLTNICNLVSVRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEASTK----------SAEYELWFEKDQALITLLNATLSQTALS
        S I LL+NICNLVS+RLDST++ILW++Q++++LKAHKL+G+IDG ++ P        E  ++          +  +E W  KDQAL+TL+NATLS  AL+
Subjt:  SSIHLLTNICNLVSVRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEASTK----------SAEYELWFEKDQALITLLNATLSQTALS

Query:  FAIRSKTSKALWETLEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTSLRTRATTISFE
        + +RS TSK +WE LEK YSS++RTN+V LKS+LQSI KK  ESID YV+RI  I  K A V I I+ E L+IY +NG  + YN   TS+RTRA ++SFE
Subjt:  FAIRSKTSKALWETLEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTSLRTRATTISFE

Query:  ELHSLLRIEENELDIQSKKDDSA-------AIQTLAMNMYTNNHGGWRSSNRGRGRNNGNRGGRGQYNQTPT
        ELH  ++ EE+ ++ Q K++D         A    + N  +  H   +S +RGRG+NN    GRG+ N  PT
Subjt:  ELHSLLRIEENELDIQSKKDDSA-------AIQTLAMNMYTNNHGGWRSSNRGRGRNNGNRGGRGQYNQTPT

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X29.8e-5246.72Show/hide
Query:  MANITFVPSSS--------IHLLTNICNLVSVRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEAST----KSAEYELWFEKDQALITL
        M++ T +PSSS        I LL+NICNL+S+RLDSTN++LW++Q++++LKAHKLYG+IDG    P   T      ST     +  YE W  KDQAL+T+
Subjt:  MANITFVPSSS--------IHLLTNICNLVSVRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEAST----KSAEYELWFEKDQALITL

Query:  LNATLSQTALSFAIRSKTSKALWETLEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTS
        +NATLS  AL++ + S +SK +W+ L K YSS +R+N+V LKS+LQ+I KK  ESID Y++RI  I  KLA V   I++EDL+IY +NG P+ YN F+TS
Subjt:  LNATLSQTALSFAIRSKTSKALWETLEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTS

Query:  LRTRATTISFEELHSLLRIEENELDIQSKKDDSAAIQTLAMNMYTN--NHGGWRSSNRGRGRNNGNRGGRGQYN
        +RTR+  ++FEELH LLR EE+ L  QSK DDS    T+ ++   +  +      +N  RG  +G   G G+++
Subjt:  LRTRATTISFEELHSLLRIEENELDIQSKKDDSAAIQTLAMNMYTN--NHGGWRSSNRGRGRNNGNRGGRGQYN

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X39.8e-5246.72Show/hide
Query:  MANITFVPSSS--------IHLLTNICNLVSVRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEAST----KSAEYELWFEKDQALITL
        M++ T +PSSS        I LL+NICNL+S+RLDSTN++LW++Q++++LKAHKLYG+IDG    P   T      ST     +  YE W  KDQAL+T+
Subjt:  MANITFVPSSS--------IHLLTNICNLVSVRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEAST----KSAEYELWFEKDQALITL

Query:  LNATLSQTALSFAIRSKTSKALWETLEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTS
        +NATLS  AL++ + S +SK +W+ L K YSS +R+N+V LKS+LQ+I KK  ESID Y++RI  I  KLA V   I++EDL+IY +NG P+ YN F+TS
Subjt:  LNATLSQTALSFAIRSKTSKALWETLEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTS

Query:  LRTRATTISFEELHSLLRIEENELDIQSKKDDSAAIQTLAMNMYTN--NHGGWRSSNRGRGRNNGNRGGRGQYN
        +RTR+  ++FEELH LLR EE+ L  QSK DDS    T+ ++   +  +      +N  RG  +G   G G+++
Subjt:  LRTRATTISFEELHSLLRIEENELDIQSKKDDSAAIQTLAMNMYTN--NHGGWRSSNRGRGRNNGNRGGRGQYN

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X19.8e-5246.72Show/hide
Query:  MANITFVPSSS--------IHLLTNICNLVSVRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEAST----KSAEYELWFEKDQALITL
        M++ T +PSSS        I LL+NICNL+S+RLDSTN++LW++Q++++LKAHKLYG+IDG    P   T      ST     +  YE W  KDQAL+T+
Subjt:  MANITFVPSSS--------IHLLTNICNLVSVRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEAST----KSAEYELWFEKDQALITL

Query:  LNATLSQTALSFAIRSKTSKALWETLEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTS
        +NATLS  AL++ + S +SK +W+ L K YSS +R+N+V LKS+LQ+I KK  ESID Y++RI  I  KLA V   I++EDL+IY +NG P+ YN F+TS
Subjt:  LNATLSQTALSFAIRSKTSKALWETLEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTS

Query:  LRTRATTISFEELHSLLRIEENELDIQSKKDDSAAIQTLAMNMYTN--NHGGWRSSNRGRGRNNGNRGGRGQYN
        +RTR+  ++FEELH LLR EE+ L  QSK DDS    T+ ++   +  +      +N  RG  +G   G G+++
Subjt:  LRTRATTISFEELHSLLRIEENELDIQSKKDDSAAIQTLAMNMYTN--NHGGWRSSNRGRGRNNGNRGGRGQYN

A0A5D3CLI6 T4.59.8e-5246.72Show/hide
Query:  MANITFVPSSS--------IHLLTNICNLVSVRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEAST----KSAEYELWFEKDQALITL
        M++ T +PSSS        I LL+NICNL+S+RLDSTN++LW++Q++++LKAHKLYG+IDG    P   T      ST     +  YE W  KDQAL+T+
Subjt:  MANITFVPSSS--------IHLLTNICNLVSVRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEAST----KSAEYELWFEKDQALITL

Query:  LNATLSQTALSFAIRSKTSKALWETLEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTS
        +NATLS  AL++ + S +SK +W+ L K YSS +R+N+V LKS+LQ+I KK  ESID Y++RI  I  KLA V   I++EDL+IY +NG P+ YN F+TS
Subjt:  LNATLSQTALSFAIRSKTSKALWETLEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTS

Query:  LRTRATTISFEELHSLLRIEENELDIQSKKDDSAAIQTLAMNMYTN--NHGGWRSSNRGRGRNNGNRGGRGQYN
        +RTR+  ++FEELH LLR EE+ L  QSK DDS    T+ ++   +  +      +N  RG  +G   G G+++
Subjt:  LRTRATTISFEELHSLLRIEENELDIQSKKDDSAAIQTLAMNMYTN--NHGGWRSSNRGRGRNNGNRGGRGQYN

A0A6J1D9L6 uncharacterized protein LOC1110188921.4e-5347.43Show/hide
Query:  SSIHLLTNICNLVSVRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEASTK----------SAEYELWFEKDQALITLLNATLSQTALS
        S I LL+NICNLVS+RLDST++ILW++Q++++LKAHKL+G+IDG ++ P        E  ++          +  +E W  KDQAL+TL+NATLS  AL+
Subjt:  SSIHLLTNICNLVSVRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEASTK----------SAEYELWFEKDQALITLLNATLSQTALS

Query:  FAIRSKTSKALWETLEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTSLRTRATTISFE
        + +RS TSK +WE LEK YSS++RTN+V LKS+LQSI KK  ESID YV+RI  I  K A V I I+ E L+IY +NG  + YN   TS+RTRA ++SFE
Subjt:  FAIRSKTSKALWETLEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTSLRTRATTISFE

Query:  ELHSLLRIEENELDIQSKKDDSA-------AIQTLAMNMYTNNHGGWRSSNRGRGRNNGNRGGRGQYNQTPT
        ELH  ++ EE+ ++ Q K++D         A    + N  +  H   +S +RGRG+NN    GRG+ N  PT
Subjt:  ELHSLLRIEENELDIQSKKDDSA-------AIQTLAMNMYTNNHGGWRSSNRGRGRNNGNRGGRGQYNQTPT

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.8e-0924.78Show/hide
Query:  WRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEASTKSAEYELWFEKDQALITLLNATLSQTALSFAIRSKTSKALWETLEKCYSSSTRTNIVGLKSEL
        W+ ++  LL    L+  +D    +P     E+            W + D+   + +   LS   ++  I   T++ +W  LE  Y S T TN + LK +L
Subjt:  WRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEASTKSAEYELWFEKDQALITLLNATLSQTALSFAIRSKTSKALWETLEKCYSSSTRTNIVGLKSEL

Query:  QSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTSLRTRATTISFEELHSLLRIEENELDIQSKKDDSAAIQTLAMNMY-
         ++   +  +  +++     ++ +LA + +KI++ED  I  +N  PSSY+   T++    TTI  +++ S L + E ++  + +    A I       Y 
Subjt:  QSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTSLRTRATTISFEELHSLLRIEENELDIQSKKDDSAAIQTLAMNMY-

Query:  --TNNHGGWRSSNRGRGRNNGNRGGRGQYN
          +NN+G  RS  RG+ +N      R  YN
Subjt:  --TNNHGGWRSSNRGRGRNNGNRGGRGQYN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.6e-1424.91Show/hide
Query:  LLTNICNLVSVRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEASTKSAEYELWFEKDQALITLLNATLSQTALSFAIRSKTSKALWET
        L  N+ N+   +L STNY++W  Q+ +L   ++L G++DG    P  + G +  A   + +Y  W  +D+ + + +   +S +      R+ T+  +WET
Subjt:  LLTNICNLVSVRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEASTKSAEYELWFEKDQALITLLNATLSQTALSFAIRSKTSKALWET

Query:  LEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTSLRTRATTISFEELHSLLRIEENELD
        L K Y++ +  ++  L+++L+  + K +++ID+Y+Q ++T   +LA +   +D ++ V   +   P  Y      +  + T  +  E+H  L   E+++ 
Subjt:  LEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTSLRTRATTISFEELHSLLRIEENELD

Query:  IQSKKDDSAAIQTLAMNMY-------TNNHGGWRSSNRGRGRNNGNRGGRGQYNQT---PTSKVKTPIL
          S    SA +  +  N         TNN+     +NR   RNN N     Q + T   P +    P L
Subjt:  IQSKKDDSAAIQTLAMNMY-------TNNHGGWRSSNRGRGRNNGNRGGRGQYNQT---PTSKVKTPIL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.6e-0923.53Show/hide
Query:  TNICNLVS---VRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEASTKSAEYELWFEKDQALITLLNATLSQTALSFAIRSKTSKALWE
        TNI N+      +L STNY++W  Q+ +L   ++L G++DG    P  + G +      + +Y  W  +D+ + + +   +S +      R+ T+  +WE
Subjt:  TNICNLVS---VRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEASTKSAEYELWFEKDQALITLLNATLSQTALSFAIRSKTSKALWE

Query:  TLEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTSLRTRATTISFEELHSLLRIEENEL
        TL K Y++ +  ++  L                    R +T   +LA +   +D ++ V   +   P  Y      +  + T  S  E+H  L   E++L
Subjt:  TLEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTSLRTRATTISFEELHSLLRIEENEL

Query:  DIQSKKDDSAAIQTLAMNMYT--NNHGGWRSSNRGRGRNNGNRGGRGQYNQTPTS
               +SA +  +  N+ T  N +     +NRG  RN  N   R    Q  +S
Subjt:  DIQSKKDDSAAIQTLAMNMYT--NNHGGWRSSNRGRGRNNGNRGGRGQYNQTPTS

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).7.5e-0420.3Show/hide
Query:  DSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEASTKSAEYELWFEKDQALITLLNATLSQTALSFAIRSKTSKALWETLEKCYSSSTRTNI
        D  NY+ W+ +  S L+  K +G+IDG + +P             S  Y+ W + +  ++  L  +++   L   + ++T+  +WE L + +       I
Subjt:  DSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEASTKSAEYELWFEKDQALITLLNATLSQTALSFAIRSKTSKALWETLEKCYSSSTRTNI

Query:  VGLKSELQSISKKQSESIDNYVQRILTIVHKLA
          L+  L ++ ++  +S++ Y  ++  +  +L+
Subjt:  VGLKSELQSISKKQSESIDNYVQRILTIVHKLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAATATCACTTTTGTTCCTTCCTCTTCGATTCATCTATTAACAAATATCTGCAATCTTGTGTCTGTAAGATTGGATTCCACAAACTACATCCTCTGGAGGTATCA
AATTTCTTCTCTGCTCAAAGCCCATAAGCTGTATGGTTACATAGATGGTAAAATCGCTGAACCAAAAGTCTCAACAGGAGAAGAAGGTGAAGCTTCAACAAAATCTGCAG
AATATGAACTCTGGTTTGAGAAAGATCAAGCGCTTATCACTCTACTGAATGCGACGCTCTCACAAACGGCCCTTTCCTTTGCGATCAGATCCAAAACTTCTAAAGCATTA
TGGGAGACGCTGGAAAAATGCTACTCTTCTTCTACAAGAACGAATATTGTTGGGCTGAAATCTGAACTTCAGAGTATTTCCAAGAAACAAAGTGAGTCAATTGATAACTA
TGTTCAAAGAATATTAACTATCGTTCATAAGCTAGCTGCAGTAGAAATCAAGATTGATCAAGAAGATCTAGTTATTTACACAATCAATGGTTTTCCTTCATCTTATAACA
TTTTCAAGACTTCTTTACGAACTCGTGCAACCACTATTTCTTTTGAAGAACTACACTCATTATTGAGAATAGAGGAAAATGAGTTAGATATTCAATCCAAGAAAGATGAT
TCTGCAGCAATTCAGACTCTTGCAATGAACATGTATACAAACAATCATGGAGGATGGCGCAGTTCCAATCGGGGAAGAGGACGAAACAATGGAAATCGTGGTGGTCGTGG
ACAGTACAATCAAACTCCAACTTCAAAAGTCAAAACTCCTATTCTAATAATTCAAACTCCAACCTCAATAATCAAGTCTCCTACTCGAACAATCAGTTTTCTTATCTGA
mRNA sequenceShow/hide mRNA sequence
ATTCTCTTTCTCTCTCAATTCACATGGTATCAAAGCAGCACATGCCTGAACCTAATTGGCTTCTCTTGCCCACTAATCTTCTCTCTCATCAATCTACGCTTCACTTTCTT
AAAAATTCCCAATTTCTTCTCCTTCATTGAAATCCCTAATTTCTGCTTTTTAAAATCCCTAAAATTTCTTGCGATCTTCTTGCGTTTCCATGGCGAATATCACTTTTGTT
CCTTCCTCTTCGATTCATCTATTAACAAATATCTGCAATCTTGTGTCTGTAAGATTGGATTCCACAAACTACATCCTCTGGAGGTATCAAATTTCTTCTCTGCTCAAAGC
CCATAAGCTGTATGGTTACATAGATGGTAAAATCGCTGAACCAAAAGTCTCAACAGGAGAAGAAGGTGAAGCTTCAACAAAATCTGCAGAATATGAACTCTGGTTTGAGA
AAGATCAAGCGCTTATCACTCTACTGAATGCGACGCTCTCACAAACGGCCCTTTCCTTTGCGATCAGATCCAAAACTTCTAAAGCATTATGGGAGACGCTGGAAAAATGC
TACTCTTCTTCTACAAGAACGAATATTGTTGGGCTGAAATCTGAACTTCAGAGTATTTCCAAGAAACAAAGTGAGTCAATTGATAACTATGTTCAAAGAATATTAACTAT
CGTTCATAAGCTAGCTGCAGTAGAAATCAAGATTGATCAAGAAGATCTAGTTATTTACACAATCAATGGTTTTCCTTCATCTTATAACATTTTCAAGACTTCTTTACGAA
CTCGTGCAACCACTATTTCTTTTGAAGAACTACACTCATTATTGAGAATAGAGGAAAATGAGTTAGATATTCAATCCAAGAAAGATGATTCTGCAGCAATTCAGACTCTT
GCAATGAACATGTATACAAACAATCATGGAGGATGGCGCAGTTCCAATCGGGGAAGAGGACGAAACAATGGAAATCGTGGTGGTCGTGGACAGTACAATCAAACTCCAAC
TTCAAAAGTCAAAACTCCTATTCTAATAATTCAAACTCCAACCTCAATAATCAAGTCTCCTACTCGAACAATCAGTTTTCTTATCTGAATTCATATTCATCTGGTAATAT
CTCGACAGTGAATACGGCGGTTCCAAATGCTTATCCGAGTGCCTTTCCAGAATCATTGCCTTGTCAAATATGTGGAAAACTTGGGCATAATGCTCTTGACTGTTACAATC
GTATGAGTTTCTCTTACCAAGGACGCATTCCTCCATCCAAATTAGCTGTCATGGCTGCCTCGTCATCAAACTCAGATCCGAACACTCAACCTACTGTGATAGTACCAGAC
TCCTGTTTTGCTGGTCAAAATCCTCTGTCTCCATTTTCTCTGCCAACTATACAGTCTTCATCTCCTAGTAGTAATCTGGTGGATCAAACTACTTCTACCCTTCACACTGA
AGGTTTGTCTCCTGAGGTAACACCTTCCATTTCTATTCCTACTTCATTTGTACAGGAAGCTACATCTTTTGATTTATCATCAGAACCAACAAGTGAAATTGGTGTTCCTT
CTCCAACACCTTCTATTCCTCCACTCTCTGTCTCTATAAATCCTATAACCAATATACATCCTATGCAAACAAGGGGTTGTAACGGCCCAGTTTTCTTTTCCGGTTCTCGA
GGTGCTTCCGGTTCAGTTTCAGTGGTTCGGTTGGGCCGTTAGAGCTGTTTTGTCCCTTAAGGGCATTTTGGTCTTTTGCTCCCCGGGCAGGATTTGTGGATTTTCAGGGC
TTCGAGGCAGCTCCTGCCTTTGTTGAGGTTTGTTTGTTGGTGGCTGCGAGTCTCGAGATGTTTGGGCTTTTCGTTTGGGTGTTTCCTTCGGGATTCGAGCCCGGGGCCCT
TAGTGTTGCTTTGGGAGTTGGGTTTAAGTGAGATTTATTTATTGGAAAATGTTGCTTCAGTCGGGTCTCGAAGGAGGAAGCCGGGAAATGGGATTTTAAGGATTTTAAGG
ATTTTAAGGATTTT
Protein sequenceShow/hide protein sequence
MANITFVPSSSIHLLTNICNLVSVRLDSTNYILWRYQISSLLKAHKLYGYIDGKIAEPKVSTGEEGEASTKSAEYELWFEKDQALITLLNATLSQTALSFAIRSKTSKAL
WETLEKCYSSSTRTNIVGLKSELQSISKKQSESIDNYVQRILTIVHKLAAVEIKIDQEDLVIYTINGFPSSYNIFKTSLRTRATTISFEELHSLLRIEENELDIQSKKDD
SAAIQTLAMNMYTNNHGGWRSSNRGRGRNNGNRGGRGQYNQTPTSKVKTPILIIQTPTSIIKSPTRTISFLI