CuGenDBv2

CmoCh05G000030 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh05G000030
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Retrotransposon protein, putative, Ty1-copia subclass
Genome location: Cmo_Chr05:13032..15153
RNA-Seq Expression: CmoCh05G000030
Synteny: CmoCh05G000030
Gene Ontology terms: GO:0044237 - cellular metabolic process (biological process)
                     GO:0005488 - binding (molecular function)
InterPro domains: IPR025314 - Domain of unknown function DUF4219
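
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, the snippet below splits such a string and reports the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(loc):
    """Split a 'seqid:start..end' string into (seqid, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate is smaller than start coordinate")
    return seqid, start, end, end - start + 1


print(parse_location("Cmo_Chr05:13032..15153"))
# ('Cmo_Chr05', 13032, 15153, 2122)
```

Under that assumption the gene spans 2122 bp on Cmo_Chr05.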


Homology
GenBank top hits (e value, % identity, alignment)
BAF23632.1 Os08g0389500 [Oryza sativa Japonica Group] | e value: 6.9e-73 | % identity: 53.29
Query:  SLTPPRRQVPASPSPPRRGRGRHRDDGRRVVVERVIRETTTAVVQYPMLTRSNYNEWALLMRVNLQAQGLWHAVEPEEGETIEYREDRLAFAAILRSVPP
        S TPPRR+  +    P RGRG     G +++V R I+E   A +QYP LTR+NY+EWA+LM+VN++A G+W+AVEP   E +EYR+DRLA AAILRSVP 
Subjt:  SLTPPRRQVPASPSPPRRGRGRHRDDGRRVVVERVIRETTTAVVQYPMLTRSNYNEWALLMRVNLQAQGLWHAVEPEEGETIEYREDRLAFAAILRSVPP

Query:  EMLASLSTKRTAQSAWEGIKSRRVGVQRVRESNIEQLRKELSEIRFKDGESVDDFSMRITGLANSITTLGGGISETEIVKKMLQVVPDHLEQVAISIETL
        EML +L  KR+A++AW+ IK+ RVGV+RVRES  +QLR+E + + +K+GE+ +DFS+RITGLAN++ TLG  IS+ ++V+KML VVP+HLEQ+A+++ETL
Subjt:  EMLASLSTKRTAQSAWEGIKSRRVGVQRVRESNIEQLRKELSEIRFKDGESVDDFSMRITGLANSITTLGGGISETEIVKKMLQVVPDHLEQVAISIETL

Query:  LDVNDLTVEEVTGRLRNVEQRKKNITSAVDKEGRLLLTEEEWLARLKLRDNTGESNGSSSSRKGG-KKPWMSHGRTRGKDGNQKKESTG
        LD+N ++VEEVTGRLR VEQR+      V+ +GRLLLT+EEW+A+LK    +GE    S S  GG K+   S    RGK G+++    G
Subjt:  LDVNDLTVEEVTGRLRNVEQRKKNITSAVDKEGRLLLTEEEWLARLKLRDNTGESNGSSSSRKGG-KKPWMSHGRTRGKDGNQKKESTG

CAE03285.2 OSJNBb0046P18.1 [Oryza sativa Japonica Group] | e value: 7.6e-72 | % identity: 55.99
Query:  PRRGRGRHRDDGRRVVVERVIRETTTAVVQYPMLTRSNYNEWALLMRVNLQAQGLWHAVEPEEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSA
        P   RGRH   G +VVV RV+RE TT+ V YP LTR+NY +W L+MRVN+QAQGLW AVEPE  + ++YR+DR A  AILR+VP EMLA+L+ K T Q A
Subjt:  PRRGRGRHRDDGRRVVVERVIRETTTAVVQYPMLTRSNYNEWALLMRVNLQAQGLWHAVEPEEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSA

Query:  WEGIKSRRVGVQRVRESNIEQLRKELSEIRFKDGESVDDFSMRITGLANSITTLGGGISETEIVKKMLQVVPDHLEQVAISIETLLDVNDLTVEEVTGRL
        WE IK+RR+GVQ VRE+N +QLR+E  +I FKDGE+VDDFSMRI  LAN++ TLG  I++ E+V+K+LQVVP+HL+Q+AISIETLLDVN+L++EEVTG L
Subjt:  WEGIKSRRVGVQRVRESNIEQLRKELSEIRFKDGESVDDFSMRITGLANSITTLGGGISETEIVKKMLQVVPDHLEQVAISIETLLDVNDLTVEEVTGRL

Query:  RNVEQRKKN----ITSAVDKEGRLLLTEEEWLARLK----LRDNTGESNGSSSSRKGGKKPWMSHGRTRGKDGNQKKESTGRSN
        R+VEQRK+      +S VD  G LL TEEEWLA+ +    L+D    S+G +S R+G        GR    DG  K+     +N
Subjt:  RNVEQRKKN----ITSAVDKEGRLLLTEEEWLARLK----LRDNTGESNGSSSSRKGGKKPWMSHGRTRGKDGNQKKESTGRSN

CAE03692.2 OSJNBb0026E15.10 [Oryza sativa Japonica Group] | e value: 1.8e-76 | % identity: 58.14
Query:  SLTPPRRQVPASPSPPRRGRGRHRDDGRRVVVERVIRETTTAVVQYPMLTRSNYNEWALLMRVNLQAQGLWHAVEPEEGETIEYREDRLAFAAILRSVPP
        S +PPRR+   SPSP    RGRH   G +VVVERV+RE TT+ V YP LTR+NY +WAL+MRVN+QAQ LW AVEPE  + ++YR+D+ A AAILR+VP 
Subjt:  SLTPPRRQVPASPSPPRRGRGRHRDDGRRVVVERVIRETTTAVVQYPMLTRSNYNEWALLMRVNLQAQGLWHAVEPEEGETIEYREDRLAFAAILRSVPP

Query:  EMLASLSTKRTAQSAWEGIKSRRVGVQRVRESNIEQLRKELSEIRFKDGESVDDFSMRITGLANSITTLGGGISETEIVKKMLQVVPDHLEQVAISIETL
        EMLA+L+ K TAQ AWE IK+RR+GVQRVRE+N +QLR+E  +I FKD E+VDDFSMRI GLAN++ TLG  I+E E+V+K+LQVVP+HL+Q+AISIETL
Subjt:  EMLASLSTKRTAQSAWEGIKSRRVGVQRVRESNIEQLRKELSEIRFKDGESVDDFSMRITGLANSITTLGGGISETEIVKKMLQVVPDHLEQVAISIETL

Query:  LDVNDLTVEEVTGRLRNVEQRKKNITSA----VDKEGRLLLTEEEWLARLK----LRDNTGESNGSSSSRKGGKKPWMSHGRTRGK--DGNQKKESTGRS
        LDVN+L++EEVTGRLR+VEQRK+  T+A    VD  GRLL TEEEWLA+ +    L+D    S+G+   R          GR RGK  DG  K+     +
Subjt:  LDVNDLTVEEVTGRLRNVEQRKKNITSA----VDKEGRLLLTEEEWLARLK----LRDNTGESNGSSSSRKGGKKPWMSHGRTRGK--DGNQKKESTGRS

Query:  N
        N
Subjt:  N

CAH68021.1 H0807C06-H0308C08.8 [Oryza sativa] | e value: 2.0e-72 | % identity: 56.99
Query:  PRRGRGRHRDDGRRVVVERVIRETTTAVVQYPMLTRSNYNEWALLMRVNLQAQGLWHAVEPEEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSA
        P   RGRH   G +VVV+RV+RE TT+ V YP LTR+NY +W L+MRVN+QAQGLW AVEPE  + ++YR+DR A  AILR+VP EMLA+L+ K T Q A
Subjt:  PRRGRGRHRDDGRRVVVERVIRETTTAVVQYPMLTRSNYNEWALLMRVNLQAQGLWHAVEPEEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSA

Query:  WEGIKSRRVGVQRVRESNIEQLRKELSEIRFKDGESVDDFSMRITGLANSITTLGGGISETEIVKKMLQVVPDHLEQVAISIETLLDVNDLTVEEVTGRL
        WE IK+RR+GVQ VRE+N +QLR+E  +I FKDGE+VDDFSMRI  LAN++ TLG  I+E E+V+K+LQVVP+HL+Q+AISIETLLDVN+L++EEVTGRL
Subjt:  WEGIKSRRVGVQRVRESNIEQLRKELSEIRFKDGESVDDFSMRITGLANSITTLGGGISETEIVKKMLQVVPDHLEQVAISIETLLDVNDLTVEEVTGRL

Query:  RNVEQRKKN----ITSAVDKEGRLLLTEEEWLARLK----LRDNTGESNGSSSSRKGGKKPWMSHGRTRGKDGNQKKES
        R+VEQ K+      +S VD  G LL TEEEWLA+ +    L+D    S+G +S  +G       HG    KD    KE+
Subjt:  RNVEQRKKN----ITSAVDKEGRLLLTEEEWLARLK----LRDNTGESNGSSSSRKGGKKPWMSHGRTRGKDGNQKKES

XP_006648554.1 uncharacterized protein LOC102717319 [Oryza brachyantha] | e value: 7.4e-75 | % identity: 62.7
Query:  SLTPPRRQVPASPSPPRRGRGRHRDDGRRVVVERVIRETTTAVVQYPMLTRSNYNEWALLMRVNLQAQGLWHAVEPEEGETIEYREDRLAFAAILRSVPP
        S TPPRR+ P SP    RGRG            R++RE++ + +QYP+LTR+NY EWALLM VNLQAQGLWHAVEP E E +EYR+DRLA AAILR+VPP
Subjt:  SLTPPRRQVPASPSPPRRGRGRHRDDGRRVVVERVIRETTTAVVQYPMLTRSNYNEWALLMRVNLQAQGLWHAVEPEEGETIEYREDRLAFAAILRSVPP

Query:  EMLASLSTKRTAQSAWEGIKSRRVGVQRVRESNIEQLRKELSEIRFKDGESVDDFSMRITGLANSITTLGGGISETEIVKKMLQVVPDHLEQVAISIETL
        EML SLS KRT+QSAWE IK+ RVG +RVRE+N + LR+E  + RFKDGE+VDDFSMR+ G+AN+I TLGG + E ++V K+L VVP+HL Q+AISIETL
Subjt:  EMLASLSTKRTAQSAWEGIKSRRVGVQRVRESNIEQLRKELSEIRFKDGESVDDFSMRITGLANSITTLGGGISETEIVKKMLQVVPDHLEQVAISIETL

Query:  LDVNDLTVEEVTGRLRNVEQRKKNITSAVDKEGRLLLTEEEWLA
        LD  DL++EEVTGRLR  E+RK      VD  GRLLLTEE+W A
Subjt:  LDVNDLTVEEVTGRLRNVEQRKKNITSAVDKEGRLLLTEEEWLA
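
The alignment blocks above pair each query line with a match line. As a rough sketch, the snippet below counts identities and positives from one such pair. It assumes the usual BLAST-style convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap), which this page does not spell out; `midline_stats` is a hypothetical helper. Counts taken from a single block will not necessarily reproduce the % identity reported for the whole hit.

```python
def midline_stats(query, midline):
    """Return (identical, positives, length) for one alignment block.

    Assumption: BLAST-style match line, where a letter means an identical
    residue and '+' means a conservative substitution.
    """
    # Trailing spaces of the match line are often lost in copied text,
    # so pad it back to the length of the query line.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("match line is longer than the query line")
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    positives = identical + midline.count("+")
    return identical, positives, len(query)


identical, positives, length = midline_stats("MAGRRL", "MA+R L")
print(identical, positives, length)  # 4 5 6
print(round(100 * identical / length, 2))  # 66.67
```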

TrEMBL top hits (e value, % identity, alignment)
Q01HC5 H0807C06-H0308C08.8 protein | e value: 9.7e-73 | % identity: 56.99
Query:  PRRGRGRHRDDGRRVVVERVIRETTTAVVQYPMLTRSNYNEWALLMRVNLQAQGLWHAVEPEEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSA
        P   RGRH   G +VVV+RV+RE TT+ V YP LTR+NY +W L+MRVN+QAQGLW AVEPE  + ++YR+DR A  AILR+VP EMLA+L+ K T Q A
Subjt:  PRRGRGRHRDDGRRVVVERVIRETTTAVVQYPMLTRSNYNEWALLMRVNLQAQGLWHAVEPEEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSA

Query:  WEGIKSRRVGVQRVRESNIEQLRKELSEIRFKDGESVDDFSMRITGLANSITTLGGGISETEIVKKMLQVVPDHLEQVAISIETLLDVNDLTVEEVTGRL
        WE IK+RR+GVQ VRE+N +QLR+E  +I FKDGE+VDDFSMRI  LAN++ TLG  I+E E+V+K+LQVVP+HL+Q+AISIETLLDVN+L++EEVTGRL
Subjt:  WEGIKSRRVGVQRVRESNIEQLRKELSEIRFKDGESVDDFSMRITGLANSITTLGGGISETEIVKKMLQVVPDHLEQVAISIETLLDVNDLTVEEVTGRL

Query:  RNVEQRKKN----ITSAVDKEGRLLLTEEEWLARLK----LRDNTGESNGSSSSRKGGKKPWMSHGRTRGKDGNQKKES
        R+VEQ K+      +S VD  G LL TEEEWLA+ +    L+D    S+G +S  +G       HG    KD    KE+
Subjt:  RNVEQRKKN----ITSAVDKEGRLLLTEEEWLARLK----LRDNTGESNGSSSSRKGGKKPWMSHGRTRGKDGNQKKES

Q0J5Y3 Os08g0389500 protein | e value: 3.3e-73 | % identity: 53.29
Query:  SLTPPRRQVPASPSPPRRGRGRHRDDGRRVVVERVIRETTTAVVQYPMLTRSNYNEWALLMRVNLQAQGLWHAVEPEEGETIEYREDRLAFAAILRSVPP
        S TPPRR+  +    P RGRG     G +++V R I+E   A +QYP LTR+NY+EWA+LM+VN++A G+W+AVEP   E +EYR+DRLA AAILRSVP 
Subjt:  SLTPPRRQVPASPSPPRRGRGRHRDDGRRVVVERVIRETTTAVVQYPMLTRSNYNEWALLMRVNLQAQGLWHAVEPEEGETIEYREDRLAFAAILRSVPP

Query:  EMLASLSTKRTAQSAWEGIKSRRVGVQRVRESNIEQLRKELSEIRFKDGESVDDFSMRITGLANSITTLGGGISETEIVKKMLQVVPDHLEQVAISIETL
        EML +L  KR+A++AW+ IK+ RVGV+RVRES  +QLR+E + + +K+GE+ +DFS+RITGLAN++ TLG  IS+ ++V+KML VVP+HLEQ+A+++ETL
Subjt:  EMLASLSTKRTAQSAWEGIKSRRVGVQRVRESNIEQLRKELSEIRFKDGESVDDFSMRITGLANSITTLGGGISETEIVKKMLQVVPDHLEQVAISIETL

Query:  LDVNDLTVEEVTGRLRNVEQRKKNITSAVDKEGRLLLTEEEWLARLKLRDNTGESNGSSSSRKGG-KKPWMSHGRTRGKDGNQKKESTG
        LD+N ++VEEVTGRLR VEQR+      V+ +GRLLLT+EEW+A+LK    +GE    S S  GG K+   S    RGK G+++    G
Subjt:  LDVNDLTVEEVTGRLRNVEQRKKNITSAVDKEGRLLLTEEEWLARLKLRDNTGESNGSSSSRKGG-KKPWMSHGRTRGKDGNQKKESTG

Q7X7K3 OSJNBb0046P18.1 protein | e value: 3.7e-72 | % identity: 55.99
Query:  PRRGRGRHRDDGRRVVVERVIRETTTAVVQYPMLTRSNYNEWALLMRVNLQAQGLWHAVEPEEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSA
        P   RGRH   G +VVV RV+RE TT+ V YP LTR+NY +W L+MRVN+QAQGLW AVEPE  + ++YR+DR A  AILR+VP EMLA+L+ K T Q A
Subjt:  PRRGRGRHRDDGRRVVVERVIRETTTAVVQYPMLTRSNYNEWALLMRVNLQAQGLWHAVEPEEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSA

Query:  WEGIKSRRVGVQRVRESNIEQLRKELSEIRFKDGESVDDFSMRITGLANSITTLGGGISETEIVKKMLQVVPDHLEQVAISIETLLDVNDLTVEEVTGRL
        WE IK+RR+GVQ VRE+N +QLR+E  +I FKDGE+VDDFSMRI  LAN++ TLG  I++ E+V+K+LQVVP+HL+Q+AISIETLLDVN+L++EEVTG L
Subjt:  WEGIKSRRVGVQRVRESNIEQLRKELSEIRFKDGESVDDFSMRITGLANSITTLGGGISETEIVKKMLQVVPDHLEQVAISIETLLDVNDLTVEEVTGRL

Query:  RNVEQRKKN----ITSAVDKEGRLLLTEEEWLARLK----LRDNTGESNGSSSSRKGGKKPWMSHGRTRGKDGNQKKESTGRSN
        R+VEQRK+      +S VD  G LL TEEEWLA+ +    L+D    S+G +S R+G        GR    DG  K+     +N
Subjt:  RNVEQRKKN----ITSAVDKEGRLLLTEEEWLARLK----LRDNTGESNGSSSSRKGGKKPWMSHGRTRGKDGNQKKESTGRSN

Q7XPB1 OSJNBb0026E15.10 protein | e value: 8.5e-77 | % identity: 58.14
Query:  SLTPPRRQVPASPSPPRRGRGRHRDDGRRVVVERVIRETTTAVVQYPMLTRSNYNEWALLMRVNLQAQGLWHAVEPEEGETIEYREDRLAFAAILRSVPP
        S +PPRR+   SPSP    RGRH   G +VVVERV+RE TT+ V YP LTR+NY +WAL+MRVN+QAQ LW AVEPE  + ++YR+D+ A AAILR+VP 
Subjt:  SLTPPRRQVPASPSPPRRGRGRHRDDGRRVVVERVIRETTTAVVQYPMLTRSNYNEWALLMRVNLQAQGLWHAVEPEEGETIEYREDRLAFAAILRSVPP

Query:  EMLASLSTKRTAQSAWEGIKSRRVGVQRVRESNIEQLRKELSEIRFKDGESVDDFSMRITGLANSITTLGGGISETEIVKKMLQVVPDHLEQVAISIETL
        EMLA+L+ K TAQ AWE IK+RR+GVQRVRE+N +QLR+E  +I FKD E+VDDFSMRI GLAN++ TLG  I+E E+V+K+LQVVP+HL+Q+AISIETL
Subjt:  EMLASLSTKRTAQSAWEGIKSRRVGVQRVRESNIEQLRKELSEIRFKDGESVDDFSMRITGLANSITTLGGGISETEIVKKMLQVVPDHLEQVAISIETL

Query:  LDVNDLTVEEVTGRLRNVEQRKKNITSA----VDKEGRLLLTEEEWLARLK----LRDNTGESNGSSSSRKGGKKPWMSHGRTRGK--DGNQKKESTGRS
        LDVN+L++EEVTGRLR+VEQRK+  T+A    VD  GRLL TEEEWLA+ +    L+D    S+G+   R          GR RGK  DG  K+     +
Subjt:  LDVNDLTVEEVTGRLRNVEQRKKNITSA----VDKEGRLLLTEEEWLARLK----LRDNTGESNGSSSSRKGGKKPWMSHGRTRGK--DGNQKKESTGRS

Query:  N
        N
Subjt:  N

Q8S5D4 Putative copia-type pol polyprotein | e value: 5.7e-65 | % identity: 52.82
Query:  SISGSLTPPRRQVP------------ASPSPPRRGRGRHRDDGRR---VVVERVIRETTTAVVQYPMLTRSNYNEWALLMRVNLQAQGLWHAVEPEEGET
        S S   TPPR + P             + SPP RGRGR    G R       RV+RE ++A V YP+LTR+NY E AL M VN QAQGLWHAVEP E E 
Subjt:  SISGSLTPPRRQVP------------ASPSPPRRGRGRHRDDGRR---VVVERVIRETTTAVVQYPMLTRSNYNEWALLMRVNLQAQGLWHAVEPEEGET

Query:  IEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGVQRVRESNIEQLRKELSEIRFKDGESVDDFSMRITGLANSITTLGGGISETEIVKK
        IEYR+DRL  AAILR VPPEML SLSTKRT +SAWE IK+  VG +RVRE+N + LR+E ++ RFKDGE++DDFSMRI G+ANSI TLGG + E ++   
Subjt:  IEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGVQRVRESNIEQLRKELSEIRFKDGESVDDFSMRITGLANSITTLGGGISETEIVKK

Query:  MLQVVPDHLEQVAISIETLLDVNDLTVEEVTGRLRNVEQRKKNITSAVDKEGRLLLTEEEWLARLKLRDNTGESNGS---SSSRKGGKKPWMSHGRTRGK
                   +AISIETLLD+ DL+++EVT RLR+VE+ K      VD  GRL+LTEE+WLARLK R+  GES+G+   +++   GKK      R RG+
Subjt:  MLQVVPDHLEQVAISIETLLDVNDLTVEEVTGRLRNVEQRKKNITSAVDKEGRLLLTEEEWLARLKLRDNTGESNGS---SSSRKGGKKPWMSHGRTRGK

Query:  D
        +
Subjt:  D

SwissProt top hits (e value, % identity, alignment)
C7A2A0 Benzaldehyde dehydrogenase, mitochondrial | e value: 4.4e-06 | % identity: 48
Query:  MAGRRLSSLLSRSLASSPALLSKGRRPSGGTTTGKYST--SAAVETPITPSVKVNYTQLLINGQFLDSISGSLTP
        MA  R SSLLSRS+   P L   G++   G    +Y T  +AA+E PI P V V Y +LLINGQF+D+ SG   P
Subjt:  MAGRRLSSLLSRSLASSPALLSKGRRPSGGTTTGKYST--SAAVETPITPSVKVNYTQLLINGQFLDSISGSLTP

Q8S528 Aldehyde dehydrogenase family 2 member B7, mitochondrial | e value: 5.2e-07 | % identity: 51.35
Query:  MAGRRLSSLLSRSLASSPALLSKGRRPSGGTTTGKYST-SAAVETPITPSVKVNYTQLLINGQFLDSISGSLTP
        MA RR+SSLLSRS  SS   +   R  + G    +YS  +AAVE  ITP VKV +TQLLI G+F+D++SG   P
Subjt:  MAGRRLSSLLSRSLASSPALLSKGRRPSGGTTTGKYST-SAAVETPITPSVKVNYTQLLINGQFLDSISGSLTP

Q9SU63 Aldehyde dehydrogenase family 2 member B4, mitochondrial | e value: 4.3e-09 | % identity: 40
Query:  MAGRRLSSLLSRSLASSPALL--SKGRRPSGGTTTGKYST-SAAVETPITPSVKVNYTQLLINGQFLDSISGSLTPPRRQVPASPSPPRRGRG-RHRDDG
        MA RR+SSLLSRS ++S  LL  S+GR    G    ++ T SAA E  I PSV+V++TQLLING F+DS SG   P           PR G    H  +G
Subjt:  MAGRRLSSLLSRSLASSPALL--SKGRRPSGGTTTGKYST-SAAVETPITPSVKVNYTQLLINGQFLDSISGSLTPPRRQVPASPSPPRRGRG-RHRDDG

Query:  RRVVVERVIRETTTAVVQYPMLTRSNYNEWALLMR
            + R ++   TA  + P    S Y    +L+R
Subjt:  RRVVVERVIRETTTAVVQYPMLTRSNYNEWALLMR

Arabidopsis top hits (e value, % identity, alignment)
AT1G23800.1 aldehyde dehydrogenase 2B7 | e value: 3.7e-08 | % identity: 51.35
Query:  MAGRRLSSLLSRSLASSPALLSKGRRPSGGTTTGKYST-SAAVETPITPSVKVNYTQLLINGQFLDSISGSLTP
        MA RR+SSLLSRS  SS   +   R  + G    +YS  +AAVE  ITP VKV +TQLLI G+F+D++SG   P
Subjt:  MAGRRLSSLLSRSLASSPALLSKGRRPSGGTTTGKYST-SAAVETPITPSVKVNYTQLLINGQFLDSISGSLTP

AT3G48000.1 aldehyde dehydrogenase 2B4 | e value: 3.0e-10 | % identity: 40
Query:  MAGRRLSSLLSRSLASSPALL--SKGRRPSGGTTTGKYST-SAAVETPITPSVKVNYTQLLINGQFLDSISGSLTPPRRQVPASPSPPRRGRG-RHRDDG
        MA RR+SSLLSRS ++S  LL  S+GR    G    ++ T SAA E  I PSV+V++TQLLING F+DS SG   P           PR G    H  +G
Subjt:  MAGRRLSSLLSRSLASSPALL--SKGRRPSGGTTTGKYST-SAAVETPITPSVKVNYTQLLINGQFLDSISGSLTPPRRQVPASPSPPRRGRG-RHRDDG

Query:  RRVVVERVIRETTTAVVQYPMLTRSNYNEWALLMR
            + R ++   TA  + P    S Y    +L+R
Subjt:  RRVVVERVIRETTTAVVQYPMLTRSNYNEWALLMR


Sequences
CDS sequence
ATGGCGGGTCGGAGGCTATCCTCGCTCTTATCTCGTTCCCTTGCTTCTTCGCCTGCTCTCCTTTCCAAAGGGAGGAGGCCTTCGGGAGGCACAACAACTGGCAAATATAG
CACTTCGGCTGCTGTTGAAACCCCGATTACTCCATCCGTGAAAGTGAATTACACCCAGCTATTAATCAACGGACAGTTTCTGGATTCAATCTCAGGCTCGCTCACTCCTC
CACGCCGCCAAGTGCCGGCATCTCCATCACCACCACGAAGAGGCCGTGGCCGACATCGTGATGACGGGCGTCGAGTAGTTGTTGAGCGAGTGATCAGAGAGACAACGACC
GCCGTCGTTCAGTACCCGATGTTGACAAGGTCCAACTACAACGAGTGGGCCTTGTTGATGCGCGTTAACCTACAGGCGCAAGGGTTATGGCACGCTGTCGAGCCAGAGGA
AGGAGAAACAATCGAGTACCGGGAGGATCGGCTGGCGTTCGCCGCCATATTACGATCTGTGCCCCCAGAGATGCTGGCTTCTCTTTCCACCAAGCGCACCGCACAATCGG
CCTGGGAGGGAATCAAATCCCGTCGGGTCGGTGTACAGCGAGTGCGGGAATCCAACATCGAGCAACTTCGGAAGGAGCTCTCGGAGATTCGCTTTAAGGACGGCGAATCC
GTCGATGATTTTTCAATGCGGATCACGGGGCTCGCCAACAGCATCACCACCCTCGGTGGTGGCATCAGCGAGACAGAAATCGTGAAGAAGATGTTGCAGGTCGTTCCCGA
TCACCTCGAGCAAGTCGCGATTTCAATCGAGACATTGCTTGACGTGAACGACCTAACAGTGGAAGAGGTGACCGGAAGACTACGTAACGTCGAGCAGCGGAAGAAGAATA
TCACTTCAGCTGTTGACAAGGAGGGCCGTCTACTCCTCACGGAGGAGGAATGGCTTGCACGCCTAAAGCTCCGCGACAACACCGGCGAGAGCAACGGGTCCTCCTCCAGC
CGCAAAGGCGGTAAGAAGCCATGGATGTCCCACGGGCGCACGCGCGGGAAAGATGGGAATCAAAAGAAGGAGTCGACCGGCCGATCCAATGCTCGAACTGTGGGAAGAGA
GGACACCTGA
mRNA sequence
AAAAAAAAAAAAAAAACAGTGCAAAAGCAATCATTCAACACCCACTCCCCGAGCTTTCTCATTGGGTAACTATTTCAAAAGGAAAGATTTTTTTTGGTGATTCTTCACGG
TCAGGTCATGGCGGGTCGGAGGCTATCCTCGCTCTTATCTCGTTCCCTTGCTTCTTCGCCTGCTCTCCTTTCCAAAGGGAGGAGGCCTTCGGGAGGCACAACAACTGGCA
AATATAGCACTTCGGCTGCTGTTGAAACCCCGATTACTCCATCCGTGAAAGTGAATTACACCCAGCTATTAATCAACGGACAGTTTCTGGATTCAATCTCAGGCTCGCTC
ACTCCTCCACGCCGCCAAGTGCCGGCATCTCCATCACCACCACGAAGAGGCCGTGGCCGACATCGTGATGACGGGCGTCGAGTAGTTGTTGAGCGAGTGATCAGAGAGAC
AACGACCGCCGTCGTTCAGTACCCGATGTTGACAAGGTCCAACTACAACGAGTGGGCCTTGTTGATGCGCGTTAACCTACAGGCGCAAGGGTTATGGCACGCTGTCGAGC
CAGAGGAAGGAGAAACAATCGAGTACCGGGAGGATCGGCTGGCGTTCGCCGCCATATTACGATCTGTGCCCCCAGAGATGCTGGCTTCTCTTTCCACCAAGCGCACCGCA
CAATCGGCCTGGGAGGGAATCAAATCCCGTCGGGTCGGTGTACAGCGAGTGCGGGAATCCAACATCGAGCAACTTCGGAAGGAGCTCTCGGAGATTCGCTTTAAGGACGG
CGAATCCGTCGATGATTTTTCAATGCGGATCACGGGGCTCGCCAACAGCATCACCACCCTCGGTGGTGGCATCAGCGAGACAGAAATCGTGAAGAAGATGTTGCAGGTCG
TTCCCGATCACCTCGAGCAAGTCGCGATTTCAATCGAGACATTGCTTGACGTGAACGACCTAACAGTGGAAGAGGTGACCGGAAGACTACGTAACGTCGAGCAGCGGAAG
AAGAATATCACTTCAGCTGTTGACAAGGAGGGCCGTCTACTCCTCACGGAGGAGGAATGGCTTGCACGCCTAAAGCTCCGCGACAACACCGGCGAGAGCAACGGGTCCTC
CTCCAGCCGCAAAGGCGGTAAGAAGCCATGGATGTCCCACGGGCGCACGCGCGGGAAAGATGGGAATCAAAAGAAGGAGTCGACCGGCCGATCCAATGCTCGAACTGTGG
GAAGAGAGGACACCTGA
Protein sequence
MAGRRLSSLLSRSLASSPALLSKGRRPSGGTTTGKYSTSAAVETPITPSVKVNYTQLLINGQFLDSISGSLTPPRRQVPASPSPPRRGRGRHRDDGRRVVVERVIRETTT
AVVQYPMLTRSNYNEWALLMRVNLQAQGLWHAVEPEEGETIEYREDRLAFAAILRSVPPEMLASLSTKRTAQSAWEGIKSRRVGVQRVRESNIEQLRKELSEIRFKDGES
VDDFSMRITGLANSITTLGGGISETEIVKKMLQVVPDHLEQVAISIETLLDVNDLTVEEVTGRLRNVEQRKKNITSAVDKEGRLLLTEEEWLARLKLRDNTGESNGSSSS
RKGGKKPWMSHGRTRGKDGNQKKESTGRSNARTVGREDT
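
The CDS and protein sequences listed above can be cross-checked by translation. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1); the page itself does not state which table was used, and `translate` is a hypothetical helper rather than a CuGenDB function. The two calls use only the first 30 and the last 12 nucleotides of the CDS.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in TCAG order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence; '*' marks a stop, 'X' an unknown codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3))


# First 30 nt and last 12 nt of the CDS sequence listed above.
print(translate("ATGGCGGGTCGGAGGCTATCCTCGCTCTTA"))  # MAGRRLSSLL
print(translate("GAGGACACCTGA"))  # EDT*
```

Both fragments agree with the start (`MAGRRLSSLL`) and end (`EDT`) of the protein sequence, with the final `TGA` read as the stop codon.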