CuGenDBv2

Spg030374 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg030374
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Transposase
Genome location: scaffold6:24739797..24742359
RNA-Seq Expression: Spg030374
Synteny: Spg030374
Gene Ontology terms: NA
InterPro domains: IPR004264 - Transposase, Tnp1/En/Spm-like


Homology
GenBank top hits (e value, % identity, alignment)
XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]; e value: 1.0e-76; % identity: 54.61
Query:  ELTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRWRVRGVGEFVTPSVYYNVAREKSKLSQQLQSEASSVK--TK
        +L+ DPSNRAILWKEARKGKN EY D+ T     RIDELAA+++G+DILTEALGT EH  RVRGVGEFV+PS+Y+NV + KSK +Q+LQ   S+ +    
Subjt:  ELTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRWRVRGVGEFVTPSVYYNVAREKSKLSQQLQSEASSVK--TK

Query:  PLDKSNHKATLRV---------------PRIKTMGSMDNIVAVGTMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQTVGNFVG
           KS  K  + V               P    + S+DNIVAVGT++++  Q  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++TL+QT+G FV 
Subjt:  PLDKSNHKATLRV---------------PRIKTMGSMDNIVAVGTMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQTVGNFVG

Query:  WPRKLVITVDDKEEPPVK-AKPIVQSSKHTDIHVTIRLLNRYAMLSMQQEDTLTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCILTYI
        WPR+LVI  ++K     + ++   Q SKHTD+HV+I+LLNRY MLSMQ EDT+ IN+ + I GKE +I+L R DIMQYC  +EIGYSCILTYI
Subjt:  WPRKLVITVDDKEEPPVK-AKPIVQSSKHTDIHVTIRLLNRYAMLSMQQEDTLTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCILTYI

XP_022136079.1 uncharacterized protein LOC111007859 isoform X3 [Momordica charantia]; e value: 1.0e-76; % identity: 54.61
Query:  ELTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRWRVRGVGEFVTPSVYYNVAREKSKLSQQLQSEASSVK--TK
        +L+ DPSNRAILWKEARKGKN EY D+ T     RIDELAA+++G+DILTEALGT EH  RVRGVGEFV+PS+Y+NV + KSK +Q+LQ   S+ +    
Subjt:  ELTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRWRVRGVGEFVTPSVYYNVAREKSKLSQQLQSEASSVK--TK

Query:  PLDKSNHKATLRV---------------PRIKTMGSMDNIVAVGTMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQTVGNFVG
           KS  K  + V               P    + S+DNIVAVGT++++  Q  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++TL+QT+G FV 
Subjt:  PLDKSNHKATLRV---------------PRIKTMGSMDNIVAVGTMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQTVGNFVG

Query:  WPRKLVITVDDKEEPPVK-AKPIVQSSKHTDIHVTIRLLNRYAMLSMQQEDTLTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCILTYI
        WPR+LVI  ++K     + ++   Q SKHTD+HV+I+LLNRY MLSMQ EDT+ IN+ + I GKE +I+L R DIMQYC  +EIGYSCILTYI
Subjt:  WPRKLVITVDDKEEPPVK-AKPIVQSSKHTDIHVTIRLLNRYAMLSMQQEDTLTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCILTYI

XP_022136080.1 uncharacterized protein LOC111007859 isoform X4 [Momordica charantia]; e value: 1.0e-76; % identity: 54.61
Query:  ELTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRWRVRGVGEFVTPSVYYNVAREKSKLSQQLQSEASSVK--TK
        +L+ DPSNRAILWKEARKGKN EY D+ T     RIDELAA+++G+DILTEALGT EH  RVRGVGEFV+PS+Y+NV + KSK +Q+LQ   S+ +    
Subjt:  ELTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRWRVRGVGEFVTPSVYYNVAREKSKLSQQLQSEASSVK--TK

Query:  PLDKSNHKATLRV---------------PRIKTMGSMDNIVAVGTMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQTVGNFVG
           KS  K  + V               P    + S+DNIVAVGT++++  Q  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++TL+QT+G FV 
Subjt:  PLDKSNHKATLRV---------------PRIKTMGSMDNIVAVGTMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQTVGNFVG

Query:  WPRKLVITVDDKEEPPVK-AKPIVQSSKHTDIHVTIRLLNRYAMLSMQQEDTLTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCILTYI
        WPR+LVI  ++K     + ++   Q SKHTD+HV+I+LLNRY MLSMQ EDT+ IN+ + I GKE +I+L R DIMQYC  +EIGYSCILTYI
Subjt:  WPRKLVITVDDKEEPPVK-AKPIVQSSKHTDIHVTIRLLNRYAMLSMQQEDTLTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCILTYI

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]; e value: 7.7e-80; % identity: 51.08
Query:  ELTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRWRVRGVGEFVTPSVYYNVAREKSKLSQQLQSEASSVKTKPL
        EL+ DP NRA LWKEARK KN EY D  T     RIDELAA+ +G+DILTEALGTPEHR R+RGVGEFV+P+++YNVA+ K KL Q+ Q+EA + +++  
Subjt:  ELTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRWRVRGVGEFVTPSVYYNVAREKSKLSQQLQSEASSVKTKPL

Query:  DKS------------------------------NHKATLRVPRIK-----------------TMGSMDNIVAVGTMYESPSQNATIHGVPLGVENVRVVV
        D++                              N +   +VP+ K                  +GS+DNIVAVGTM+ES +Q  +I+ +PLG +NVR +V
Subjt:  DKS------------------------------NHKATLRVPRIK-----------------TMGSMDNIVAVGTMYESPSQNATIHGVPLGVENVRVVV

Query:  DMVIGDDCALPIPVNDELQTLHQTVGNFVGWPRKLVITVDDKEEP-PVKAKPIVQSSKHTDIHVTIRLLNRYAMLSMQQEDTLTINMHERIVGKEASIFL
        D+V+G+D ALPIP  D+++TL Q +GNFV WPRKLVIT  +K+ P P  +K I QSSK+TD+HVTI+LLNRYAM SMQ +D + IN+ E+I+GKE +I+L
Subjt:  DMVIGDDCALPIPVNDELQTLHQTVGNFVGWPRKLVITVDDKEEP-PVKAKPIVQSSKHTDIHVTIRLLNRYAMLSMQQEDTLTINMHERIVGKEASIFL

Query:  NREDIMQYCGNVEIGYSCILTYI
         R+DI+QYCG  EIGYSCIL YI
Subjt:  NREDIMQYCGNVEIGYSCILTYI

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]; e value: 7.7e-80; % identity: 51.08
Query:  ELTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRWRVRGVGEFVTPSVYYNVAREKSKLSQQLQSEASSVKTKPL
        EL+ DP NRA LWKEARK KN EY D  T     RIDELAA+ +G+DILTEALGTPEHR R+RGVGEFV+P+++YNVA+ K KL Q+ Q+EA + +++  
Subjt:  ELTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRWRVRGVGEFVTPSVYYNVAREKSKLSQQLQSEASSVKTKPL

Query:  DKS------------------------------NHKATLRVPRIK-----------------TMGSMDNIVAVGTMYESPSQNATIHGVPLGVENVRVVV
        D++                              N +   +VP+ K                  +GS+DNIVAVGTM+ES +Q  +I+ +PLG +NVR +V
Subjt:  DKS------------------------------NHKATLRVPRIK-----------------TMGSMDNIVAVGTMYESPSQNATIHGVPLGVENVRVVV

Query:  DMVIGDDCALPIPVNDELQTLHQTVGNFVGWPRKLVITVDDKEEP-PVKAKPIVQSSKHTDIHVTIRLLNRYAMLSMQQEDTLTINMHERIVGKEASIFL
        D+V+G+D ALPIP  D+++TL Q +GNFV WPRKLVIT  +K+ P P  +K I QSSK+TD+HVTI+LLNRYAM SMQ +D + IN+ E+I+GKE +I+L
Subjt:  DMVIGDDCALPIPVNDELQTLHQTVGNFVGWPRKLVITVDDKEEP-PVKAKPIVQSSKHTDIHVTIRLLNRYAMLSMQQEDTLTINMHERIVGKEASIFL

Query:  NREDIMQYCGNVEIGYSCILTYI
         R+DI+QYCG  EIGYSCIL YI
Subjt:  NREDIMQYCGNVEIGYSCILTYI

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X1; e value: 8.9e-74; % identity: 48.82
Query:  ELTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRWRVRGVGEFVTPSVYYNVAREKSKLSQQLQ-----------
        EL+ DP NRA LWKEARK KN    D+ T   V RIDELAA+ +G+DILTEALGTPEHR R+RGVGEFV+P+++ NVAR   KLSQQ Q           
Subjt:  ELTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRWRVRGVGEFVTPSVYYNVAREKSKLSQQLQ-----------

Query:  -------------------------SEASSVKTK----------PLDK---SNHKATLRV-------------PRIKTMGSMDNIVAVGTMYESPSQNAT
                                 S  S  KTK          P  K      + TL V             P    +GS+DN+VAVG M+ES  Q  T
Subjt:  -------------------------SEASSVKTK----------PLDK---SNHKATLRV-------------PRIKTMGSMDNIVAVGTMYESPSQNAT

Query:  IHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQTVGNFVGWPRKLVITVDDKEEPPVKA-KPIVQSSKHTDIHVTIRLLNRYAMLSMQQEDTLTI
        IHG+PLG EN+RV VD+ + +D ALPIP+  +++TL+Q +GNFV WPRKLVI   +K+ P + A +   QSSK+TD+HVTI+LLNRYAM +MQ ED + I
Subjt:  IHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQTVGNFVGWPRKLVITVDDKEEPPVKA-KPIVQSSKHTDIHVTIRLLNRYAMLSMQQEDTLTI

Query:  NMHERIVGKEASIFLNREDIMQYCGNVEIGYSCILTYI
        ++ E I GKE +I+L R+DI+QYCG  EIGYSCILTYI
Subjt:  NMHERIVGKEASIFLNREDIMQYCGNVEIGYSCILTYI

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X1; e value: 5.0e-77; % identity: 54.61
Query:  ELTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRWRVRGVGEFVTPSVYYNVAREKSKLSQQLQSEASSVK--TK
        +L+ DPSNRAILWKEARKGKN EY D+ T     RIDELAA+++G+DILTEALGT EH  RVRGVGEFV+PS+Y+NV + KSK +Q+LQ   S+ +    
Subjt:  ELTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRWRVRGVGEFVTPSVYYNVAREKSKLSQQLQSEASSVK--TK

Query:  PLDKSNHKATLRV---------------PRIKTMGSMDNIVAVGTMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQTVGNFVG
           KS  K  + V               P    + S+DNIVAVGT++++  Q  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++TL+QT+G FV 
Subjt:  PLDKSNHKATLRV---------------PRIKTMGSMDNIVAVGTMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQTVGNFVG

Query:  WPRKLVITVDDKEEPPVK-AKPIVQSSKHTDIHVTIRLLNRYAMLSMQQEDTLTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCILTYI
        WPR+LVI  ++K     + ++   Q SKHTD+HV+I+LLNRY MLSMQ EDT+ IN+ + I GKE +I+L R DIMQYC  +EIGYSCILTYI
Subjt:  WPRKLVITVDDKEEPPVK-AKPIVQSSKHTDIHVTIRLLNRYAMLSMQQEDTLTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCILTYI

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X4; e value: 5.0e-77; % identity: 54.61
Query:  ELTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRWRVRGVGEFVTPSVYYNVAREKSKLSQQLQSEASSVK--TK
        +L+ DPSNRAILWKEARKGKN EY D+ T     RIDELAA+++G+DILTEALGT EH  RVRGVGEFV+PS+Y+NV + KSK +Q+LQ   S+ +    
Subjt:  ELTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRWRVRGVGEFVTPSVYYNVAREKSKLSQQLQSEASSVK--TK

Query:  PLDKSNHKATLRV---------------PRIKTMGSMDNIVAVGTMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQTVGNFVG
           KS  K  + V               P    + S+DNIVAVGT++++  Q  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++TL+QT+G FV 
Subjt:  PLDKSNHKATLRV---------------PRIKTMGSMDNIVAVGTMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQTVGNFVG

Query:  WPRKLVITVDDKEEPPVK-AKPIVQSSKHTDIHVTIRLLNRYAMLSMQQEDTLTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCILTYI
        WPR+LVI  ++K     + ++   Q SKHTD+HV+I+LLNRY MLSMQ EDT+ IN+ + I GKE +I+L R DIMQYC  +EIGYSCILTYI
Subjt:  WPRKLVITVDDKEEPPVK-AKPIVQSSKHTDIHVTIRLLNRYAMLSMQQEDTLTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCILTYI

A0A6J1C398 uncharacterized protein LOC111007859 isoform X3; e value: 5.0e-77; % identity: 54.61
Query:  ELTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRWRVRGVGEFVTPSVYYNVAREKSKLSQQLQSEASSVK--TK
        +L+ DPSNRAILWKEARKGKN EY D+ T     RIDELAA+++G+DILTEALGT EH  RVRGVGEFV+PS+Y+NV + KSK +Q+LQ   S+ +    
Subjt:  ELTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRWRVRGVGEFVTPSVYYNVAREKSKLSQQLQSEASSVK--TK

Query:  PLDKSNHKATLRV---------------PRIKTMGSMDNIVAVGTMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQTVGNFVG
           KS  K  + V               P    + S+DNIVAVGT++++  Q  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++TL+QT+G FV 
Subjt:  PLDKSNHKATLRV---------------PRIKTMGSMDNIVAVGTMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQTVGNFVG

Query:  WPRKLVITVDDKEEPPVK-AKPIVQSSKHTDIHVTIRLLNRYAMLSMQQEDTLTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCILTYI
        WPR+LVI  ++K     + ++   Q SKHTD+HV+I+LLNRY MLSMQ EDT+ IN+ + I GKE +I+L R DIMQYC  +EIGYSCILTYI
Subjt:  WPRKLVITVDDKEEPPVK-AKPIVQSSKHTDIHVTIRLLNRYAMLSMQQEDTLTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCILTYI

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X2; e value: 5.0e-77; % identity: 54.61
Query:  ELTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRWRVRGVGEFVTPSVYYNVAREKSKLSQQLQSEASSVK--TK
        +L+ DPSNRAILWKEARKGKN EY D+ T     RIDELAA+++G+DILTEALGT EH  RVRGVGEFV+PS+Y+NV + KSK +Q+LQ   S+ +    
Subjt:  ELTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRWRVRGVGEFVTPSVYYNVAREKSKLSQQLQSEASSVK--TK

Query:  PLDKSNHKATLRV---------------PRIKTMGSMDNIVAVGTMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQTVGNFVG
           KS  K  + V               P    + S+DNIVAVGT++++  Q  T+HGVPLGV+NVRV+VD+VI +   +PIPV  E++TL+QT+G FV 
Subjt:  PLDKSNHKATLRV---------------PRIKTMGSMDNIVAVGTMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQTVGNFVG

Query:  WPRKLVITVDDKEEPPVK-AKPIVQSSKHTDIHVTIRLLNRYAMLSMQQEDTLTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCILTYI
        WPR+LVI  ++K     + ++   Q SKHTD+HV+I+LLNRY MLSMQ EDT+ IN+ + I GKE +I+L R DIMQYC  +EIGYSCILTYI
Subjt:  WPRKLVITVDDKEEPPVK-AKPIVQSSKHTDIHVTIRLLNRYAMLSMQQEDTLTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCILTYI

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGAATTCACTCCTTCCCCTATTAAGGAATTGACAGACGATCCTTCCAATCGTGCAATTCTATGGAAGGAAGCACGGAAGGGAAAAAATAAAGAATATTGCGATGAGGT
TACTGTAGCACGTGTCAATCGAATTGACGAATTAGCTGCATTGAATGAAGGTAAGGACATCTTGACTGAAGCGTTGGGCACCCCAGAACACAGATGGCGTGTAAGGGGAG
TGGGCGAGTTCGTAACGCCCTCTGTGTACTACAATGTTGCAAGAGAGAAGTCAAAATTGAGTCAGCAACTACAAAGCGAAGCTTCGAGTGTCAAGACGAAGCCCCTCGAC
AAAAGCAACCACAAAGCGACGCTTCGAGTGCCACGCATAAAAACGATGGGCTCTATGGATAACATTGTTGCCGTAGGCACAATGTACGAGTCGCCTTCACAAAATGCAAC
CATCCATGGAGTTCCATTAGGAGTCGAAAATGTTCGAGTTGTGGTGGACATGGTCATAGGTGATGATTGTGCATTACCGATTCCTGTGAACGATGAACTACAAACGTTGC
ATCAAACGGTCGGTAATTTTGTGGGATGGCCTCGCAAGCTTGTTATTACTGTAGATGACAAAGAGGAGCCTCCTGTCAAAGCTAAGCCCATAGTACAATCAAGCAAACAT
ACAGATATCCATGTTACTATTAGGCTCTTAAATAGATACGCGATGCTTTCGATGCAACAAGAAGATACACTAACGATCAATATGCACGAGCGTATCGTGGGAAAGGAAGC
ATCAATATTTTTAAATCGCGAAGACATCATGCAATATTGTGGGAATGTTGAGATAGGTTACTCATGCATACTCACGTACATTACGTAA
mRNA sequence
ATGGAATTCACTCCTTCCCCTATTAAGGAATTGACAGACGATCCTTCCAATCGTGCAATTCTATGGAAGGAAGCACGGAAGGGAAAAAATAAAGAATATTGCGATGAGGT
TACTGTAGCACGTGTCAATCGAATTGACGAATTAGCTGCATTGAATGAAGGTAAGGACATCTTGACTGAAGCGTTGGGCACCCCAGAACACAGATGGCGTGTAAGGGGAG
TGGGCGAGTTCGTAACGCCCTCTGTGTACTACAATGTTGCAAGAGAGAAGTCAAAATTGAGTCAGCAACTACAAAGCGAAGCTTCGAGTGTCAAGACGAAGCCCCTCGAC
AAAAGCAACCACAAAGCGACGCTTCGAGTGCCACGCATAAAAACGATGGGCTCTATGGATAACATTGTTGCCGTAGGCACAATGTACGAGTCGCCTTCACAAAATGCAAC
CATCCATGGAGTTCCATTAGGAGTCGAAAATGTTCGAGTTGTGGTGGACATGGTCATAGGTGATGATTGTGCATTACCGATTCCTGTGAACGATGAACTACAAACGTTGC
ATCAAACGGTCGGTAATTTTGTGGGATGGCCTCGCAAGCTTGTTATTACTGTAGATGACAAAGAGGAGCCTCCTGTCAAAGCTAAGCCCATAGTACAATCAAGCAAACAT
ACAGATATCCATGTTACTATTAGGCTCTTAAATAGATACGCGATGCTTTCGATGCAACAAGAAGATACACTAACGATCAATATGCACGAGCGTATCGTGGGAAAGGAAGC
ATCAATATTTTTAAATCGCGAAGACATCATGCAATATTGTGGGAATGTTGAGATAGGTTACTCATGCATACTCACGTACATTACGTAA
Protein sequence
MEFTPSPIKELTDDPSNRAILWKEARKGKNKEYCDEVTVARVNRIDELAALNEGKDILTEALGTPEHRWRVRGVGEFVTPSVYYNVAREKSKLSQQLQSEASSVKTKPLD
KSNHKATLRVPRIKTMGSMDNIVAVGTMYESPSQNATIHGVPLGVENVRVVVDMVIGDDCALPIPVNDELQTLHQTVGNFVGWPRKLVITVDDKEEPPVKAKPIVQSSKH
TDIHVTIRLLNRYAMLSMQQEDTLTINMHERIVGKEASIFLNREDIMQYCGNVEIGYSCILTYIT
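
The protein sequence above should correspond to a standard-code translation of the CDS sequence listed for this gene. The following is a minimal Python sketch of such a check, not part of the CuGenDB record itself. It assumes the standard nuclear genetic code; the function name `translate` and the variables `cds_start` and `cds_end` are hypothetical and introduced only for illustration. The two fragments are the first 30 and last 15 nucleotides of the CDS as shown on this page.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order (assumption:
# the gene is translated with the standard nuclear code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and normalise case
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# Hypothetical illustration variables: fragments copied from the CDS above.
cds_start = "ATGGAATTCACTCCTTCCCCTATTAAGGAA"  # first 10 codons
cds_end = "ACGTACATTACGTAA"                   # last 5 codons, including the stop

print(translate(cds_start))  # MEFTPSPIKE
print(translate(cds_end))    # TYIT
```

Passing the full CDS (with the line breaks left in) to `translate` would allow the whole protein sequence to be compared in the same way.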