CuGenDBv2

Tan0012421 (gene) of Snake gourd v1 genome

Gene ID: Tan0012421
Organism: Trichosanthes anguina (Snake gourd v1)
Description: tRNA_int_end_N2 domain-containing protein
Genome location: LG10:66656377..66658422
RNA-Seq Expression: Tan0012421
Synteny: Tan0012421
Gene Ontology terms:
  GO:0000379 - tRNA-type intron splice site recognition and cleavage (biological process)
  GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
  GO:0000214 - tRNA-intron endonuclease complex (cellular component)
  GO:0004519 - endonuclease activity (molecular function)
InterPro domains: IPR024337 - tRNA-splicing endonuclease, subunit Sen54
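
The genome location field in this record has the form `scaffold:start..end`. As a minimal sketch, the snippet below splits that string into its parts and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state. `Location` and `parse_location` are hypothetical names introduced for illustration, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval on a named scaffold or linkage group."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: both coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("LG10:66656377..66658422")
print(loc.seqid, loc.start, loc.end, loc.length)  # LG10 66656377 66658422 2046
```

Under that inclusive-coordinate assumption the gene spans 2,046 bp on LG10.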


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6596967.1 Chromatin assembly factor 1 subunit FAS1, partial [Cucurbita argyrosperma subsp. sororia] | e-value 2.2e-88 | identity 68.67%
Query:  EQNMRDEEECLCSFLNFYMHEVAIQKHASTTRWNDQMGMAEVVENKATFER--------RRHCTLWQDLLFLIEVGALHLLDYDNLNLSLKDVYKKVAEG
        EQ++++EEECLCS  N  M ++  +KHAST RWND+MGMAEV+ENK +           + +C++ ++ LFLIEVGALHLLD+DN +LSLKDVYKKVAEG
Subjt:  EQNMRDEEECLCSFLNFYMHEVAIQKHASTTRWNDQMGMAEVVENKATFER--------RRHCTLWQDLLFLIEVGALHLLDYDNLNLSLKDVYKKVAEG

Query:  KNGCIWEQFEVYRHLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFM-
        K+ CIWEQFEVYRHLKSLG+I+G HKVPWSVKG RNG DISS+SSI ENKG+ +F SEDE+SI EL+D+ QLNEVTPIFDV+LPHSKFRKSSPGD NFM 
Subjt:  KNGCIWEQFEVYRHLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFM-

Query:  ----GIPTSKKNIEVIDRTSRGNRMKYCHVEHGRVCFFSLDKVELPVLP
            G P  K +IEVI+R S G  MKYCHVEHGRVCFFS DKVELPVLP
Subjt:  ----GIPTSKKNIEVIDRTSRGNRMKYCHVEHGRVCFFSLDKVELPVLP

KAG7028441.1 tRNA-splicing endonuclease subunit Sen54 [Cucurbita argyrosperma subsp. argyrosperma] | e-value 2.2e-88 | identity 68.67%
Query:  EQNMRDEEECLCSFLNFYMHEVAIQKHASTTRWNDQMGMAEVVENKATFER--------RRHCTLWQDLLFLIEVGALHLLDYDNLNLSLKDVYKKVAEG
        EQ++++EEECLCS  N  M ++  +KHAST RWND+MGMAEV+ENK +           + +C++ ++ LFLIEVGALHLLD+DN +LSLKDVYKKVAEG
Subjt:  EQNMRDEEECLCSFLNFYMHEVAIQKHASTTRWNDQMGMAEVVENKATFER--------RRHCTLWQDLLFLIEVGALHLLDYDNLNLSLKDVYKKVAEG

Query:  KNGCIWEQFEVYRHLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFM-
        K+ CIWEQFEVYRHLKSLG+I+G HKVPWSVKG RNG DISS+SSI ENKG+ +F SEDE+SI EL+D+ QLNEVTPIFDV+LPHSKFRKSSPGD NFM 
Subjt:  KNGCIWEQFEVYRHLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFM-

Query:  ----GIPTSKKNIEVIDRTSRGNRMKYCHVEHGRVCFFSLDKVELPVLP
            G P  K +IEVI+R S G  MKYCHVEHGRVCFFS DKVELPVLP
Subjt:  ----GIPTSKKNIEVIDRTSRGNRMKYCHVEHGRVCFFSLDKVELPVLP

XP_022156818.1 uncharacterized protein LOC111023660 [Momordica charantia] | e-value 3.4e-89 | identity 68.67%
Query:  EQNMRDEEECLCSFLNFYMHEVAIQKHASTTRWNDQMGMAEVVENKATFER--------RRHCTLWQDLLFLIEVGALHLLDYDNLNLSLKDVYKKVAEG
        EQ++ DEEECLC+  N  M ++  +KHAST RWNDQMGMAEV+EN+ +           + +C+ +++ LFL+EVGALHLLD+DN +LSLKDVYKKVAEG
Subjt:  EQNMRDEEECLCSFLNFYMHEVAIQKHASTTRWNDQMGMAEVVENKATFER--------RRHCTLWQDLLFLIEVGALHLLDYDNLNLSLKDVYKKVAEG

Query:  KNGCIWEQFEVYRHLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFM-
        KNGC+WEQFEVYRHLKSLGFI+G HKVPWSVKGVRNG+DIS +SSI EN+GA + ES+DERSISELL S QL++V PIFDVFLPHSKFRKSSPGD NFM 
Subjt:  KNGCIWEQFEVYRHLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFM-

Query:  ----GIPTSKKNIEVIDRTSRGNRMKYCHVEHGRVCFFSLDKVELPVLP
            G P  KK+IE ++RTSRG  +KYCHVEHGRVCFFS DK+ELPVLP
Subjt:  ----GIPTSKKNIEVIDRTSRGNRMKYCHVEHGRVCFFSLDKVELPVLP

XP_022940787.1 tRNA-splicing endonuclease subunit Sen54 [Cucurbita moschata] | e-value 5.0e-88 | identity 68.27%
Query:  EQNMRDEEECLCSFLNFYMHEVAIQKHASTTRWNDQMGMAEVVENKATFER--------RRHCTLWQDLLFLIEVGALHLLDYDNLNLSLKDVYKKVAEG
        EQ++++EEECLCS  N  M ++  +KHAST RWND+MGMAEV+ENK +           + +C++ ++ LFLIEVGALHLLD+DN +LSLKDVYKKVAEG
Subjt:  EQNMRDEEECLCSFLNFYMHEVAIQKHASTTRWNDQMGMAEVVENKATFER--------RRHCTLWQDLLFLIEVGALHLLDYDNLNLSLKDVYKKVAEG

Query:  KNGCIWEQFEVYRHLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFM-
        K+ CIWEQFEVYRHLKSLG+I+G HKVPWSVKG +NG DISS+SSI ENKG+ +F SEDE+SI EL+D+ QLNEVTPIFDV+LPHSKFRKSSPGD NFM 
Subjt:  KNGCIWEQFEVYRHLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFM-

Query:  ----GIPTSKKNIEVIDRTSRGNRMKYCHVEHGRVCFFSLDKVELPVLP
            G P  K +IEVI+R S G  MKYCHVEHGRVCFFS DKVELPVLP
Subjt:  ----GIPTSKKNIEVIDRTSRGNRMKYCHVEHGRVCFFSLDKVELPVLP

XP_023538818.1 tRNA-splicing endonuclease subunit Sen54-like [Cucurbita pepo subsp. pepo] | e-value 7.7e-89 | identity 69.08%
Query:  EQNMRDEEECLCSFLNFYMHEVAIQKHASTTRWNDQMGMAEVVENKATFER--------RRHCTLWQDLLFLIEVGALHLLDYDNLNLSLKDVYKKVAEG
        EQ++++EEECLCS  N  M ++  +KHAST RWND+MGMAEV+ENK +           + +C++ ++ LFLIEVGALHLLD+DN +LSLKDVYKKVAEG
Subjt:  EQNMRDEEECLCSFLNFYMHEVAIQKHASTTRWNDQMGMAEVVENKATFER--------RRHCTLWQDLLFLIEVGALHLLDYDNLNLSLKDVYKKVAEG

Query:  KNGCIWEQFEVYRHLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFM-
        K+ CIWEQFEVYRHLKSLG+I+G HKVPWSVKG RNG DISSRSSI ENKG+ +F SEDE+SI EL+D+ QLNEVTPIFDV+LPHSKFRKSSPGD NFM 
Subjt:  KNGCIWEQFEVYRHLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFM-

Query:  ----GIPTSKKNIEVIDRTSRGNRMKYCHVEHGRVCFFSLDKVELPVLP
            G P  K +IEVI+R S G  MKYCHVEHGRVCFFS DKVELPVLP
Subjt:  ----GIPTSKKNIEVIDRTSRGNRMKYCHVEHGRVCFFSLDKVELPVLP

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1DRN3 uncharacterized protein LOC111023660 | e-value 1.7e-89 | identity 68.67%
Query:  EQNMRDEEECLCSFLNFYMHEVAIQKHASTTRWNDQMGMAEVVENKATFER--------RRHCTLWQDLLFLIEVGALHLLDYDNLNLSLKDVYKKVAEG
        EQ++ DEEECLC+  N  M ++  +KHAST RWNDQMGMAEV+EN+ +           + +C+ +++ LFL+EVGALHLLD+DN +LSLKDVYKKVAEG
Subjt:  EQNMRDEEECLCSFLNFYMHEVAIQKHASTTRWNDQMGMAEVVENKATFER--------RRHCTLWQDLLFLIEVGALHLLDYDNLNLSLKDVYKKVAEG

Query:  KNGCIWEQFEVYRHLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFM-
        KNGC+WEQFEVYRHLKSLGFI+G HKVPWSVKGVRNG+DIS +SSI EN+GA + ES+DERSISELL S QL++V PIFDVFLPHSKFRKSSPGD NFM 
Subjt:  KNGCIWEQFEVYRHLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFM-

Query:  ----GIPTSKKNIEVIDRTSRGNRMKYCHVEHGRVCFFSLDKVELPVLP
            G P  KK+IE ++RTSRG  +KYCHVEHGRVCFFS DK+ELPVLP
Subjt:  ----GIPTSKKNIEVIDRTSRGNRMKYCHVEHGRVCFFSLDKVELPVLP

A0A6J1FKK4 tRNA-splicing endonuclease subunit Sen54 | e-value 2.4e-88 | identity 68.27%
Query:  EQNMRDEEECLCSFLNFYMHEVAIQKHASTTRWNDQMGMAEVVENKATFER--------RRHCTLWQDLLFLIEVGALHLLDYDNLNLSLKDVYKKVAEG
        EQ++++EEECLCS  N  M ++  +KHAST RWND+MGMAEV+ENK +           + +C++ ++ LFLIEVGALHLLD+DN +LSLKDVYKKVAEG
Subjt:  EQNMRDEEECLCSFLNFYMHEVAIQKHASTTRWNDQMGMAEVVENKATFER--------RRHCTLWQDLLFLIEVGALHLLDYDNLNLSLKDVYKKVAEG

Query:  KNGCIWEQFEVYRHLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFM-
        K+ CIWEQFEVYRHLKSLG+I+G HKVPWSVKG +NG DISS+SSI ENKG+ +F SEDE+SI EL+D+ QLNEVTPIFDV+LPHSKFRKSSPGD NFM 
Subjt:  KNGCIWEQFEVYRHLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFM-

Query:  ----GIPTSKKNIEVIDRTSRGNRMKYCHVEHGRVCFFSLDKVELPVLP
            G P  K +IEVI+R S G  MKYCHVEHGRVCFFS DKVELPVLP
Subjt:  ----GIPTSKKNIEVIDRTSRGNRMKYCHVEHGRVCFFSLDKVELPVLP

A0A6J1I794 tRNA-splicing endonuclease subunit Sen54 isoform X1 | e-value 1.6e-87 | identity 68.27%
Query:  EQNMRDEEECLCSFLNFYMHEVAIQKHASTTRWNDQMGMAEVVENKATFER--------RRHCTLWQDLLFLIEVGALHLLDYDNLNLSLKDVYKKVAEG
        EQ++++EEECL S  N  M ++  +KHAST RWND+MGMAEV+ENK +           + +C++ ++ LFLIEVGALHLLD+DN +LSLKDVYKKVAEG
Subjt:  EQNMRDEEECLCSFLNFYMHEVAIQKHASTTRWNDQMGMAEVVENKATFER--------RRHCTLWQDLLFLIEVGALHLLDYDNLNLSLKDVYKKVAEG

Query:  KNGCIWEQFEVYRHLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFM-
        K+ CIWEQFEVYRHLKSLG+I+G HKVPWSVKG RNG DISSRSSI ENKG+ +FESEDE+SI ELL++ QLNE+TPIFDV+LPHSKFRKSSPGD NFM 
Subjt:  KNGCIWEQFEVYRHLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFM-

Query:  ----GIPTSKKNIEVIDRTSRGNRMKYCHVEHGRVCFFSLDKVELPVLP
            G P  K +IEVI+R S G  MKYCHVEHGRVCFFS DKV+LPVLP
Subjt:  ----GIPTSKKNIEVIDRTSRGNRMKYCHVEHGRVCFFSLDKVELPVLP

A0A6J1I8D9 tRNA-splicing endonuclease subunit Sen54 isoform X3 | e-value 1.6e-87 | identity 68.27%
Query:  EQNMRDEEECLCSFLNFYMHEVAIQKHASTTRWNDQMGMAEVVENKATFER--------RRHCTLWQDLLFLIEVGALHLLDYDNLNLSLKDVYKKVAEG
        EQ++++EEECL S  N  M ++  +KHAST RWND+MGMAEV+ENK +           + +C++ ++ LFLIEVGALHLLD+DN +LSLKDVYKKVAEG
Subjt:  EQNMRDEEECLCSFLNFYMHEVAIQKHASTTRWNDQMGMAEVVENKATFER--------RRHCTLWQDLLFLIEVGALHLLDYDNLNLSLKDVYKKVAEG

Query:  KNGCIWEQFEVYRHLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFM-
        K+ CIWEQFEVYRHLKSLG+I+G HKVPWSVKG RNG DISSRSSI ENKG+ +FESEDE+SI ELL++ QLNE+TPIFDV+LPHSKFRKSSPGD NFM 
Subjt:  KNGCIWEQFEVYRHLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFM-

Query:  ----GIPTSKKNIEVIDRTSRGNRMKYCHVEHGRVCFFSLDKVELPVLP
            G P  K +IEVI+R S G  MKYCHVEHGRVCFFS DKV+LPVLP
Subjt:  ----GIPTSKKNIEVIDRTSRGNRMKYCHVEHGRVCFFSLDKVELPVLP

A0A6J1ICR8 tRNA-splicing endonuclease subunit Sen54 isoform X2 | e-value 1.6e-87 | identity 68.27%
Query:  EQNMRDEEECLCSFLNFYMHEVAIQKHASTTRWNDQMGMAEVVENKATFER--------RRHCTLWQDLLFLIEVGALHLLDYDNLNLSLKDVYKKVAEG
        EQ++++EEECL S  N  M ++  +KHAST RWND+MGMAEV+ENK +           + +C++ ++ LFLIEVGALHLLD+DN +LSLKDVYKKVAEG
Subjt:  EQNMRDEEECLCSFLNFYMHEVAIQKHASTTRWNDQMGMAEVVENKATFER--------RRHCTLWQDLLFLIEVGALHLLDYDNLNLSLKDVYKKVAEG

Query:  KNGCIWEQFEVYRHLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFM-
        K+ CIWEQFEVYRHLKSLG+I+G HKVPWSVKG RNG DISSRSSI ENKG+ +FESEDE+SI ELL++ QLNE+TPIFDV+LPHSKFRKSSPGD NFM 
Subjt:  KNGCIWEQFEVYRHLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFM-

Query:  ----GIPTSKKNIEVIDRTSRGNRMKYCHVEHGRVCFFSLDKVELPVLP
            G P  K +IEVI+R S G  MKYCHVEHGRVCFFS DKV+LPVLP
Subjt:  ----GIPTSKKNIEVIDRTSRGNRMKYCHVEHGRVCFFSLDKVELPVLP

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT3G02370.1 unknown protein | e-value 4.0e-27 | identity 44.78%
Query:  QDLLFLIEVGALHLL-DYDNLNLSLKDVYKKVAEGKNGCIWEQFEVYRHLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISE
        ++ L+L E+G L LL D D++ +SLKD+Y ++AEGK GC WE +EVYR+LK LG+I+G H VPW+ K   N    ++ S   E+  A  F  +D  S+++
Subjt:  QDLLFLIEVGALHLL-DYDNLNLSLKDVYKKVAEGKNGCIWEQFEVYRHLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISE

Query:  LLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFM
        LL    + +  P+FDV+LP+S+F+KSSPG+ +F+
Subjt:  LLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFM

AT3G02370.2 unknown protein | e-value 3.1e-27 | identity 42.95%
Query:  QDLLFLIEVGALHLL-DYDNLNLSLKDVYKKVAEGKNGCIWEQFEVYRHLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISE
        ++ L+L E+G L LL D D++ +SLKD+Y ++AEGK GC WE +EVYR+LK LG+I+G H VPW+ K   N    ++ S   E+  A  F  +D  S+++
Subjt:  QDLLFLIEVGALHLL-DYDNLNLSLKDVYKKVAEGKNGCIWEQFEVYRHLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISE

Query:  LLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFM-----GIPTSKKNIE
        LL    + +  P+FDV+LP+S+F+KSSPG+ +F+       P SK+ I+
Subjt:  LLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFM-----GIPTSKKNIE

AT3G57360.1 unknown protein | e-value 8.9e-35 | identity 35.43%
Query:  ASTTRWNDQMGMAEVVENKATF--------ERRRHCTLWQDLLFLIEVGALHLL-DYDNLNLSLKDVYKKVAEGKNGCIWEQFEVYRHLKSLGFIIGNHK
        +S  RW  ++GMAEV   +             + +C + ++ L+L E+G L +L + D++ + LKD+Y+K+AE K+GC WE +EVYR+LK LG+I+G H 
Subjt:  ASTTRWNDQMGMAEVVENKATF--------ERRRHCTLWQDLLFLIEVGALHLL-DYDNLNLSLKDVYKKVAEGKNGCIWEQFEVYRHLKSLGFIIGNHK

Query:  VPWSVKGVRNGNDISSRSSIFENKGAINFE-SEDERSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFMGI-----PTSKKNIEVIDRTSRGNRM
        V W++K     N         E + A   E   D  ++++LL   Q+ +   +FDV+LP+S+F+KSSPG+ +F+       P SK++I+V+ +      +
Subjt:  VPWSVKGVRNGNDISSRSSIFENKGAINFE-SEDERSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFMGI-----PTSKKNIEVIDRTSRGNRM

Query:  KYCHVEHGRVCFFSLDKVELPVL
         +CH+  GR  FFS   ++LPVL
Subjt:  KYCHVEHGRVCFFSLDKVELPVL


Sequences
CDS sequence
ATGACGACATTTACTGAGCAAAACATGAGAGATGAAGAAGAATGTCTTTGTTCATTTTTAAATTTTTACATGCATGAAGTTGCAATTCAGAAGCATGCTTCGACTACTCG
ATGGAATGATCAGATGGGAATGGCAGAAGTTGTAGAGAACAAGGCGACCTTTGAGCGACGGCGGCATTGTACGTTGTGGCAAGATTTATTATTTCTTATTGAAGTTGGGG
CCTTGCATCTTCTTGATTATGATAATTTAAATCTTTCTTTGAAAGACGTATACAAGAAGGTAGCTGAAGGAAAAAATGGATGTATTTGGGAGCAGTTTGAGGTTTATAGG
CACCTCAAATCTCTTGGTTTCATTATTGGAAATCATAAAGTTCCTTGGTCTGTGAAGGGTGTTAGGAATGGAAATGACATTTCTTCTCGAAGTTCTATATTTGAGAACAA
AGGAGCGATAAATTTTGAATCAGAAGATGAGAGGTCGATCTCTGAGCTATTAGATTCCACTCAACTCAATGAAGTGACACCCATTTTTGATGTTTTTCTTCCACATAGCA
AGTTTAGAAAATCTTCTCCTGGTGACACAAATTTTATGGGGATACCCACCTCCAAGAAAAATATTGAAGTTATTGATAGAACATCGAGAGGCAATCGAATGAAATATTGT
CATGTTGAACATGGACGTGTTTGTTTCTTCTCACTTGATAAGGTGGAGTTGCCCGTCTTACCGTGA
mRNA sequence
ATGACGACATTTACTGAGCAAAACATGAGAGATGAAGAAGAATGTCTTTGTTCATTTTTAAATTTTTACATGCATGAAGTTGCAATTCAGAAGCATGCTTCGACTACTCG
ATGGAATGATCAGATGGGAATGGCAGAAGTTGTAGAGAACAAGGCGACCTTTGAGCGACGGCGGCATTGTACGTTGTGGCAAGATTTATTATTTCTTATTGAAGTTGGGG
CCTTGCATCTTCTTGATTATGATAATTTAAATCTTTCTTTGAAAGACGTATACAAGAAGGTAGCTGAAGGAAAAAATGGATGTATTTGGGAGCAGTTTGAGGTTTATAGG
CACCTCAAATCTCTTGGTTTCATTATTGGAAATCATAAAGTTCCTTGGTCTGTGAAGGGTGTTAGGAATGGAAATGACATTTCTTCTCGAAGTTCTATATTTGAGAACAA
AGGAGCGATAAATTTTGAATCAGAAGATGAGAGGTCGATCTCTGAGCTATTAGATTCCACTCAACTCAATGAAGTGACACCCATTTTTGATGTTTTTCTTCCACATAGCA
AGTTTAGAAAATCTTCTCCTGGTGACACAAATTTTATGGGGATACCCACCTCCAAGAAAAATATTGAAGTTATTGATAGAACATCGAGAGGCAATCGAATGAAATATTGT
CATGTTGAACATGGACGTGTTTGTTTCTTCTCACTTGATAAGGTGGAGTTGCCCGTCTTACCGTGA
Protein sequence
MTTFTEQNMRDEEECLCSFLNFYMHEVAIQKHASTTRWNDQMGMAEVVENKATFERRRHCTLWQDLLFLIEVGALHLLDYDNLNLSLKDVYKKVAEGKNGCIWEQFEVYR
HLKSLGFIIGNHKVPWSVKGVRNGNDISSRSSIFENKGAINFESEDERSISELLDSTQLNEVTPIFDVFLPHSKFRKSSPGDTNFMGIPTSKKNIEVIDRTSRGNRMKYC
HVEHGRVCFFSLDKVELPVLP
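
The protein sequence above should be recoverable from the CDS by translating it codon by codon. The following is a minimal sketch of that check, assuming the standard nuclear genetic code applies to this gene (this page does not name a translation table). `translate` is a hypothetical helper written for illustration, not the method CuGenDB uses. The example input is the final line of the CDS, which starts on a codon boundary because the six preceding lines total 660 bases.

```python
from itertools import product

# Standard genetic code with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Final line of the CDS listed above.
tail = "CATGTTGAACATGGACGTGTTTGTTTCTTCTCACTTGATAAGGTGGAGTTGCCCGTCTTACCGTGA"
print(translate(tail))  # HVEHGRVCFFSLDKVELPVLP
```

The translated tail matches the last line of the protein sequence, and passing the full CDS (all seven lines joined) to the same function allows the whole protein to be compared.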