; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002877 (gene) of Snake gourd v1 genome

Gene IDTan0002877
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptiontRNA_int_end_N2 domain-containing protein
Genome locationLG10:66603816..66605901
RNA-Seq ExpressionTan0002877
SyntenyTan0002877
Gene Ontology termsGO:0000379 - tRNA-type intron splice site recognition and cleavage (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0000214 - tRNA-intron endonuclease complex (cellular component)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsIPR024337 - tRNA-splicing endonuclease, subunit Sen54


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596967.1 Chromatin assembly factor 1 subunit FAS1, partial [Cucurbita argyrosperma subsp. sororia]1.2e-11075.94Show/hide
Query:  MEPTDWE-SSSGASGDDDIYEQDIRDEEECLCSFGYMRKLQFRKHASTARWNDQMGMAEVVENKGSLWTTT--------------------VGALHLLDY
        ME TDWE SS GAS DDD +EQDI++EEECLCS G MRKLQFRKHASTARWND+MGMAEV+ENKGSLWTT+                    VGALHLLD+
Subjt:  MEPTDWE-SSSGASGDDDIYEQDIRDEEECLCSFGYMRKLQFRKHASTARWNDQMGMAEVVENKGSLWTTT--------------------VGALHLLDY

Query:  DNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFL
        DN +LSLKDVYKKVAEGK+ C+WE FEVYRHLKSLG+IVGKHKVPWSVKG RNG DISS+SSI +NKG+T+F SED++SI EL+D+ QLNEVTPIFDV+L
Subjt:  DNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFL

Query:  PHSKFRKSSPGDPNFMVCLTRGYPPPRKNIEVIDRTSRGIQMKYCHVEHGRVCFFSLDKVELPVLP
        PHSKFRKSSPGDPNFMVCLTRGYPPP+ +IEVI+R S GI MKYCHVEHGRVCFFS DKVELPVLP
Subjt:  PHSKFRKSSPGDPNFMVCLTRGYPPPRKNIEVIDRTSRGIQMKYCHVEHGRVCFFSLDKVELPVLP

KAG7028441.1 tRNA-splicing endonuclease subunit Sen54 [Cucurbita argyrosperma subsp. argyrosperma]1.2e-11075.94Show/hide
Query:  MEPTDWE-SSSGASGDDDIYEQDIRDEEECLCSFGYMRKLQFRKHASTARWNDQMGMAEVVENKGSLWTTT--------------------VGALHLLDY
        ME TDWE SS GAS DDD +EQDI++EEECLCS G MRKLQFRKHASTARWND+MGMAEV+ENKGSLWTT+                    VGALHLLD+
Subjt:  MEPTDWE-SSSGASGDDDIYEQDIRDEEECLCSFGYMRKLQFRKHASTARWNDQMGMAEVVENKGSLWTTT--------------------VGALHLLDY

Query:  DNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFL
        DN +LSLKDVYKKVAEGK+ C+WE FEVYRHLKSLG+IVGKHKVPWSVKG RNG DISS+SSI +NKG+T+F SED++SI EL+D+ QLNEVTPIFDV+L
Subjt:  DNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFL

Query:  PHSKFRKSSPGDPNFMVCLTRGYPPPRKNIEVIDRTSRGIQMKYCHVEHGRVCFFSLDKVELPVLP
        PHSKFRKSSPGDPNFMVCLTRGYPPP+ +IEVI+R S GI MKYCHVEHGRVCFFS DKVELPVLP
Subjt:  PHSKFRKSSPGDPNFMVCLTRGYPPPRKNIEVIDRTSRGIQMKYCHVEHGRVCFFSLDKVELPVLP

XP_022156818.1 uncharacterized protein LOC111023660 [Momordica charantia]5.4e-11477.82Show/hide
Query:  MEPTDWESSS-GASGDDDIYEQDIRDEEECLCSFGYMRKLQFRKHASTARWNDQMGMAEVVENKGSLWTTT--------------------VGALHLLDY
        ME TDWESSS GASGDD+IYEQD+ DEEECLC+ G MRKLQFRKHASTARWNDQMGMAEV+EN+GSLWTTT                    VGALHLLD+
Subjt:  MEPTDWESSS-GASGDDDIYEQDIRDEEECLCSFGYMRKLQFRKHASTARWNDQMGMAEVVENKGSLWTTT--------------------VGALHLLDY

Query:  DNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFL
        DN +LSLKDVYKKVAEGKNGCLWE FEVYRHLKSLGFIVGKHKVPWSVKGVRNG+DIS +SSI +N+GA + ES+D+RSISELL S QL++V PIFDVFL
Subjt:  DNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFL

Query:  PHSKFRKSSPGDPNFMVCLTRGYPPPRKNIEVIDRTSRGIQMKYCHVEHGRVCFFSLDKVELPVLP
        PHSKFRKSSPGDPNFMVCLTRGYPPP+K+IE ++RTSRGI +KYCHVEHGRVCFFS DK+ELPVLP
Subjt:  PHSKFRKSSPGDPNFMVCLTRGYPPPRKNIEVIDRTSRGIQMKYCHVEHGRVCFFSLDKVELPVLP

XP_022940787.1 tRNA-splicing endonuclease subunit Sen54 [Cucurbita moschata]2.7e-11075.56Show/hide
Query:  MEPTDWE-SSSGASGDDDIYEQDIRDEEECLCSFGYMRKLQFRKHASTARWNDQMGMAEVVENKGSLWTTT--------------------VGALHLLDY
        ME TDWE SS GAS DDD +EQDI++EEECLCS G MRKLQFRKHASTARWND+MGMAEV+ENKGSLWTT+                    VGALHLLD+
Subjt:  MEPTDWE-SSSGASGDDDIYEQDIRDEEECLCSFGYMRKLQFRKHASTARWNDQMGMAEVVENKGSLWTTT--------------------VGALHLLDY

Query:  DNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFL
        DN +LSLKDVYKKVAEGK+ C+WE FEVYRHLKSLG+IVGKHKVPWSVKG +NG DISS+SSI +NKG+T+F SED++SI EL+D+ QLNEVTPIFDV+L
Subjt:  DNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFL

Query:  PHSKFRKSSPGDPNFMVCLTRGYPPPRKNIEVIDRTSRGIQMKYCHVEHGRVCFFSLDKVELPVLP
        PHSKFRKSSPGDPNFMVCLTRGYPPP+ +IEVI+R S GI MKYCHVEHGRVCFFS DKVELPVLP
Subjt:  PHSKFRKSSPGDPNFMVCLTRGYPPPRKNIEVIDRTSRGIQMKYCHVEHGRVCFFSLDKVELPVLP

XP_023538818.1 tRNA-splicing endonuclease subunit Sen54-like [Cucurbita pepo subsp. pepo]4.2e-11176.32Show/hide
Query:  MEPTDWE-SSSGASGDDDIYEQDIRDEEECLCSFGYMRKLQFRKHASTARWNDQMGMAEVVENKGSLWTTT--------------------VGALHLLDY
        ME TDWE SS GAS DDD +EQDI++EEECLCS G MRKLQFRKHASTARWND+MGMAEV+ENKGSLWTT+                    VGALHLLD+
Subjt:  MEPTDWE-SSSGASGDDDIYEQDIRDEEECLCSFGYMRKLQFRKHASTARWNDQMGMAEVVENKGSLWTTT--------------------VGALHLLDY

Query:  DNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFL
        DN +LSLKDVYKKVAEGK+ C+WE FEVYRHLKSLG+IVGKHKVPWSVKG RNG DISSRSSI +NKG+T+F SED++SI EL+D+ QLNEVTPIFDV+L
Subjt:  DNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFL

Query:  PHSKFRKSSPGDPNFMVCLTRGYPPPRKNIEVIDRTSRGIQMKYCHVEHGRVCFFSLDKVELPVLP
        PHSKFRKSSPGDPNFMVCLTRGYPPP+ +IEVI+R S GI MKYCHVEHGRVCFFS DKVELPVLP
Subjt:  PHSKFRKSSPGDPNFMVCLTRGYPPPRKNIEVIDRTSRGIQMKYCHVEHGRVCFFSLDKVELPVLP

TrEMBL top hitse value%identityAlignment
A0A6J1DRN3 uncharacterized protein LOC1110236602.6e-11477.82Show/hide
Query:  MEPTDWESSS-GASGDDDIYEQDIRDEEECLCSFGYMRKLQFRKHASTARWNDQMGMAEVVENKGSLWTTT--------------------VGALHLLDY
        ME TDWESSS GASGDD+IYEQD+ DEEECLC+ G MRKLQFRKHASTARWNDQMGMAEV+EN+GSLWTTT                    VGALHLLD+
Subjt:  MEPTDWESSS-GASGDDDIYEQDIRDEEECLCSFGYMRKLQFRKHASTARWNDQMGMAEVVENKGSLWTTT--------------------VGALHLLDY

Query:  DNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFL
        DN +LSLKDVYKKVAEGKNGCLWE FEVYRHLKSLGFIVGKHKVPWSVKGVRNG+DIS +SSI +N+GA + ES+D+RSISELL S QL++V PIFDVFL
Subjt:  DNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFL

Query:  PHSKFRKSSPGDPNFMVCLTRGYPPPRKNIEVIDRTSRGIQMKYCHVEHGRVCFFSLDKVELPVLP
        PHSKFRKSSPGDPNFMVCLTRGYPPP+K+IE ++RTSRGI +KYCHVEHGRVCFFS DK+ELPVLP
Subjt:  PHSKFRKSSPGDPNFMVCLTRGYPPPRKNIEVIDRTSRGIQMKYCHVEHGRVCFFSLDKVELPVLP

A0A6J1FKK4 tRNA-splicing endonuclease subunit Sen541.3e-11075.56Show/hide
Query:  MEPTDWE-SSSGASGDDDIYEQDIRDEEECLCSFGYMRKLQFRKHASTARWNDQMGMAEVVENKGSLWTTT--------------------VGALHLLDY
        ME TDWE SS GAS DDD +EQDI++EEECLCS G MRKLQFRKHASTARWND+MGMAEV+ENKGSLWTT+                    VGALHLLD+
Subjt:  MEPTDWE-SSSGASGDDDIYEQDIRDEEECLCSFGYMRKLQFRKHASTARWNDQMGMAEVVENKGSLWTTT--------------------VGALHLLDY

Query:  DNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFL
        DN +LSLKDVYKKVAEGK+ C+WE FEVYRHLKSLG+IVGKHKVPWSVKG +NG DISS+SSI +NKG+T+F SED++SI EL+D+ QLNEVTPIFDV+L
Subjt:  DNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFL

Query:  PHSKFRKSSPGDPNFMVCLTRGYPPPRKNIEVIDRTSRGIQMKYCHVEHGRVCFFSLDKVELPVLP
        PHSKFRKSSPGDPNFMVCLTRGYPPP+ +IEVI+R S GI MKYCHVEHGRVCFFS DKVELPVLP
Subjt:  PHSKFRKSSPGDPNFMVCLTRGYPPPRKNIEVIDRTSRGIQMKYCHVEHGRVCFFSLDKVELPVLP

A0A6J1I794 tRNA-splicing endonuclease subunit Sen54 isoform X18.6e-11075.56Show/hide
Query:  MEPTDWE-SSSGASGDDDIYEQDIRDEEECLCSFGYMRKLQFRKHASTARWNDQMGMAEVVENKGSLWTTT--------------------VGALHLLDY
        ME TDWE SS GAS DDD +EQDI++EEECL S G MRKLQFRKHASTARWND+MGMAEV+ENKGSLWTT+                    VGALHLLD+
Subjt:  MEPTDWE-SSSGASGDDDIYEQDIRDEEECLCSFGYMRKLQFRKHASTARWNDQMGMAEVVENKGSLWTTT--------------------VGALHLLDY

Query:  DNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFL
        DN +LSLKDVYKKVAEGK+ C+WE FEVYRHLKSLG+IVGKHKVPWSVKG RNG DISSRSSI +NKG+T+FESED++SI ELL++ QLNE+TPIFDV+L
Subjt:  DNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFL

Query:  PHSKFRKSSPGDPNFMVCLTRGYPPPRKNIEVIDRTSRGIQMKYCHVEHGRVCFFSLDKVELPVLP
        PHSKFRKSSPGDPNFMVCLTRGYPPP+ +IEVI+R S GI MKYCHVEHGRVCFFS DKV+LPVLP
Subjt:  PHSKFRKSSPGDPNFMVCLTRGYPPPRKNIEVIDRTSRGIQMKYCHVEHGRVCFFSLDKVELPVLP

A0A6J1I8D9 tRNA-splicing endonuclease subunit Sen54 isoform X38.6e-11075.56Show/hide
Query:  MEPTDWE-SSSGASGDDDIYEQDIRDEEECLCSFGYMRKLQFRKHASTARWNDQMGMAEVVENKGSLWTTT--------------------VGALHLLDY
        ME TDWE SS GAS DDD +EQDI++EEECL S G MRKLQFRKHASTARWND+MGMAEV+ENKGSLWTT+                    VGALHLLD+
Subjt:  MEPTDWE-SSSGASGDDDIYEQDIRDEEECLCSFGYMRKLQFRKHASTARWNDQMGMAEVVENKGSLWTTT--------------------VGALHLLDY

Query:  DNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFL
        DN +LSLKDVYKKVAEGK+ C+WE FEVYRHLKSLG+IVGKHKVPWSVKG RNG DISSRSSI +NKG+T+FESED++SI ELL++ QLNE+TPIFDV+L
Subjt:  DNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFL

Query:  PHSKFRKSSPGDPNFMVCLTRGYPPPRKNIEVIDRTSRGIQMKYCHVEHGRVCFFSLDKVELPVLP
        PHSKFRKSSPGDPNFMVCLTRGYPPP+ +IEVI+R S GI MKYCHVEHGRVCFFS DKV+LPVLP
Subjt:  PHSKFRKSSPGDPNFMVCLTRGYPPPRKNIEVIDRTSRGIQMKYCHVEHGRVCFFSLDKVELPVLP

A0A6J1ICR8 tRNA-splicing endonuclease subunit Sen54 isoform X28.6e-11075.56Show/hide
Query:  MEPTDWE-SSSGASGDDDIYEQDIRDEEECLCSFGYMRKLQFRKHASTARWNDQMGMAEVVENKGSLWTTT--------------------VGALHLLDY
        ME TDWE SS GAS DDD +EQDI++EEECL S G MRKLQFRKHASTARWND+MGMAEV+ENKGSLWTT+                    VGALHLLD+
Subjt:  MEPTDWE-SSSGASGDDDIYEQDIRDEEECLCSFGYMRKLQFRKHASTARWNDQMGMAEVVENKGSLWTTT--------------------VGALHLLDY

Query:  DNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFL
        DN +LSLKDVYKKVAEGK+ C+WE FEVYRHLKSLG+IVGKHKVPWSVKG RNG DISSRSSI +NKG+T+FESED++SI ELL++ QLNE+TPIFDV+L
Subjt:  DNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFL

Query:  PHSKFRKSSPGDPNFMVCLTRGYPPPRKNIEVIDRTSRGIQMKYCHVEHGRVCFFSLDKVELPVLP
        PHSKFRKSSPGDPNFMVCLTRGYPPP+ +IEVI+R S GI MKYCHVEHGRVCFFS DKV+LPVLP
Subjt:  PHSKFRKSSPGDPNFMVCLTRGYPPPRKNIEVIDRTSRGIQMKYCHVEHGRVCFFSLDKVELPVLP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G02370.1 unknown protein1.5e-2941.21Show/hide
Query:  MAEVVENKGSLWTTT--------------------VGALHLL-DYDNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGN
        MAEV   +G LWTTT                    +G L LL D D++ +SLKD+Y ++AEGK GC WE +EVYR+LK LG+I+G+H VPW+ K   N  
Subjt:  MAEVVENKGSLWTTT--------------------VGALHLL-DYDNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGN

Query:  DISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDPNFMVCLT
              S+     A  F+  D  S+++LL    + +  P+FDV+LP+S+F+KSSPG+P+F+ C +
Subjt:  DISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDPNFMVCLT

AT3G02370.2 unknown protein2.1e-3140.34Show/hide
Query:  MAEVVENKGSLWTTT--------------------VGALHLL-DYDNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGN
        MAEV   +G LWTTT                    +G L LL D D++ +SLKD+Y ++AEGK GC WE +EVYR+LK LG+I+G+H VPW+ K   N  
Subjt:  MAEVVENKGSLWTTT--------------------VGALHLL-DYDNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGN

Query:  DISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDPNFMVCLTRGYPPPRKNIE
              S+     A  F+  D  S+++LL    + +  P+FDV+LP+S+F+KSSPG+P+F+ C +   PP ++ I+
Subjt:  DISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDPNFMVCLTRGYPPPRKNIE

AT3G57360.1 unknown protein1.5e-4536.23Show/hide
Query:  MEPTDWESSSGASGDDDIYEQDIRDEEECLCSFGYMRKLQFRKHASTARWNDQMGMAEVVENKGSLWTTT--------------------VGALHLL-DY
        ME  DWE+SS +  +         D++E   S G + KLQFR  +S ARW  ++GMAEV   +G LWTTT                    +G L +L + 
Subjt:  MEPTDWESSSGASGDDDIYEQDIRDEEECLCSFGYMRKLQFRKHASTARWNDQMGMAEVVENKGSLWTTT--------------------VGALHLL-DY

Query:  DNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFL
        D++ + LKD+Y+K+AE K+GC WE +EVYR+LK LG+I+G+H V W++K        ++R +  +          D+ ++++LL   Q+ +   +FDV+L
Subjt:  DNLNLSLKDVYKKVAEGKNGCLWELFEVYRHLKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFL

Query:  PHSKFRKSSPGDPNFMVCLTRGYPPPRKNIEVIDRTSRGIQMKYCHVEHGRVCFFSLDKVELPVL
        P+S+F+KSSPG+P+F+ C +   PP +++I+V+ +      + +CH+  GR  FFS   ++LPVL
Subjt:  PHSKFRKSSPGDPNFMVCLTRGYPPPRKNIEVIDRTSRGIQMKYCHVEHGRVCFFSLDKVELPVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCCTACGGATTGGGAAAGCTCTTCAGGAGCTAGTGGCGATGACGACATTTACGAGCAAGACATAAGAGATGAAGAAGAATGTCTTTGTTCATTTGGGTACATGCG
CAAGTTGCAATTTAGGAAGCATGCTTCGACTGCTCGATGGAATGATCAGATGGGAATGGCAGAAGTTGTAGAGAACAAGGGCAGCCTTTGGACGACGACTGTTGGGGCCT
TGCATCTTCTTGATTATGATAATTTAAATCTTTCTTTGAAAGATGTATATAAAAAGGTAGCTGAAGGAAAAAATGGATGTCTTTGGGAGTTGTTTGAGGTTTATAGGCAC
CTCAAATCTCTTGGTTTCATTGTTGGAAAGCATAAAGTTCCTTGGTCTGTTAAGGGTGTTAGGAATGGAAATGACATTTCTTCTCGAAGTTCTATATTTAAGAACAAAGG
AGCGACAAATTTTGAATCAGAAGATGATAGGTCGATCTCTGAGCTATTAGATTCCACTCAACTCAATGAAGTGACACCCATTTTTGATGTTTTTCTTCCACATAGCAAGT
TTAGAAAATCTTCTCCGGGTGACCCAAATTTTATGGTATGCTTGACTAGGGGATACCCACCTCCAAGAAAAAATATTGAAGTTATTGATAGAACATCGAGAGGCATTCAA
ATGAAATATTGTCATGTTGAACATGGACGTGTTTGTTTCTTCTCACTTGATAAGGTGGAGTTGCCCGTCTTACCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCCTACGGATTGGGAAAGCTCTTCAGGAGCTAGTGGCGATGACGACATTTACGAGCAAGACATAAGAGATGAAGAAGAATGTCTTTGTTCATTTGGGTACATGCG
CAAGTTGCAATTTAGGAAGCATGCTTCGACTGCTCGATGGAATGATCAGATGGGAATGGCAGAAGTTGTAGAGAACAAGGGCAGCCTTTGGACGACGACTGTTGGGGCCT
TGCATCTTCTTGATTATGATAATTTAAATCTTTCTTTGAAAGATGTATATAAAAAGGTAGCTGAAGGAAAAAATGGATGTCTTTGGGAGTTGTTTGAGGTTTATAGGCAC
CTCAAATCTCTTGGTTTCATTGTTGGAAAGCATAAAGTTCCTTGGTCTGTTAAGGGTGTTAGGAATGGAAATGACATTTCTTCTCGAAGTTCTATATTTAAGAACAAAGG
AGCGACAAATTTTGAATCAGAAGATGATAGGTCGATCTCTGAGCTATTAGATTCCACTCAACTCAATGAAGTGACACCCATTTTTGATGTTTTTCTTCCACATAGCAAGT
TTAGAAAATCTTCTCCGGGTGACCCAAATTTTATGGTATGCTTGACTAGGGGATACCCACCTCCAAGAAAAAATATTGAAGTTATTGATAGAACATCGAGAGGCATTCAA
ATGAAATATTGTCATGTTGAACATGGACGTGTTTGTTTCTTCTCACTTGATAAGGTGGAGTTGCCCGTCTTACCGTGA
Protein sequenceShow/hide protein sequence
MEPTDWESSSGASGDDDIYEQDIRDEEECLCSFGYMRKLQFRKHASTARWNDQMGMAEVVENKGSLWTTTVGALHLLDYDNLNLSLKDVYKKVAEGKNGCLWELFEVYRH
LKSLGFIVGKHKVPWSVKGVRNGNDISSRSSIFKNKGATNFESEDDRSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDPNFMVCLTRGYPPPRKNIEVIDRTSRGIQ
MKYCHVEHGRVCFFSLDKVELPVLP