; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi03G015550 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi03G015550
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptiontRNA-splicing endonuclease subunit Sen54
Genome locationchr03:26523179..26527984
RNA-Seq ExpressionLsi03G015550
SyntenyLsi03G015550
Gene Ontology termsGO:0000379 - tRNA-type intron splice site recognition and cleavage (biological process)
GO:0000214 - tRNA-intron endonuclease complex (cellular component)
InterPro domainsIPR024337 - tRNA-splicing endonuclease, subunit Sen54


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011653726.1 uncharacterized protein LOC101210680 isoform X2 [Cucumis sativus]7.0e-9568.33Show/hide
Query:  MEATDWESSSGGAS--GDDDNYEQDIKDEEECLCASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYES
        MEAT+WESSSGGAS  GD+DNYEQDI DEEECL ASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVR         K++ S  + +    
Subjt:  MEATDWESSSGGAS--GDDDNYEQDIKDEEECLCASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYES

Query:  RIRFLIEVGALDLLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSLKSVRNDSDVSSPSSIENKGASDVKSEDE--------
           FLIEVGAL LLDHDNSSLSLKDVYKKVAEG+SG LWEQFEVYRHLKSLG+IVGKH+VPWSLK+VRND D+SSPSS ENKGASDVKS+DE        
Subjt:  RIRFLIEVGALDLLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSLKSVRNDSDVSSPSSIENKGASDVKSEDE--------

Query:  -----------------------------------RGYPPSKKDIEVLERTSRCIPVKYCHVEHGRVCFFSFDNVELPILP
                                           RGYPPSK++IEVLERTSR IP+KYCHVEHGRVCFFSFD VELP+LP
Subjt:  -----------------------------------RGYPPSKKDIEVLERTSRCIPVKYCHVEHGRVCFFSFDNVELPILP

XP_016902999.1 PREDICTED: uncharacterized protein LOC103501474 [Cucumis melo]2.0e-8966.67Show/hide
Query:  MEATDWESSSGGAS--GDDDNYEQDIKDEEECLCASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYES
        MEAT+W+SSSGGAS  GD+DNYEQDI DEEECL AS CLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVR               G+      
Subjt:  MEATDWESSSGGAS--GDDDNYEQDIKDEEECLCASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYES

Query:  RIRFLIEVGALDLLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSLKSVRNDSDVSSPSSIENKGASDVKSEDE--------
           FLIEVGAL LLDHDNS+LSLKDVYKKVAEG+SGC+WEQFEVYRHLKSLG+IVGKHKVPWSLK+VRND D+SSPSS E KG SDVKSEDE        
Subjt:  RIRFLIEVGALDLLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSLKSVRNDSDVSSPSSIENKGASDVKSEDE--------

Query:  -----------------------------------RGYPPSKKDIEVLERTSRCIPVKYCHVEHGRVCFFSFD
                                           RGYPPSK++IEVLERTSR IP+KYCHVEHGRVCFFSFD
Subjt:  -----------------------------------RGYPPSKKDIEVLERTSRCIPVKYCHVEHGRVCFFSFD

XP_022156818.1 uncharacterized protein LOC111023660 [Momordica charantia]9.5e-9265.95Show/hide
Query:  MEATDWESSSGGASGDDDNYEQDIKDEEECLCASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYESRI
        MEATDWESSSGGASGDD+ YEQD++DEEECLCASG +RKLQFRKHASTARWNDQMGMAEVLEN+GSLWTTTGIVR         K++ S      +E  +
Subjt:  MEATDWESSSGGASGDDDNYEQDIKDEEECLCASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYESRI

Query:  RFLIEVGALDLLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSLKSVRNDSDVSSPSSIENKGASDVKSEDE----------
         FL+EVGAL LLDHDNSSLSLKDVYKKVAEGK+GCLWEQFEVYRHLKSLGFIVGKHKVPWS+K VRN SD+S  SSIEN+GA D++S+DE          
Subjt:  RFLIEVGALDLLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSLKSVRNDSDVSSPSSIENKGASDVKSEDE----------

Query:  ---------------------------------RGYPPSKKDIEVLERTSRCIPVKYCHVEHGRVCFFSFDNVELPILP
                                         RGYPP KKDIE LERTSR I +KYCHVEHGRVCFFSFD +ELP+LP
Subjt:  ---------------------------------RGYPPSKKDIEVLERTSRCIPVKYCHVEHGRVCFFSFDNVELPILP

XP_031739606.1 uncharacterized protein LOC101210680 isoform X1 [Cucumis sativus]1.1e-9569.04Show/hide
Query:  MEATDWESSSGGAS--GDDDNYEQDIKDEEECLCASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYES
        MEAT+WESSSGGAS  GD+DNYEQDI DEEECL ASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGI     +K +  K F    Q  A   
Subjt:  MEATDWESSSGGAS--GDDDNYEQDIKDEEECLCASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYES

Query:  RIRFLIEVGALDLLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSLKSVRNDSDVSSPSSIENKGASDVKSEDE--------
         +RFLIEVGAL LLDHDNSSLSLKDVYKKVAEG+SG LWEQFEVYRHLKSLG+IVGKH+VPWSLK+VRND D+SSPSS ENKGASDVKS+DE        
Subjt:  RIRFLIEVGALDLLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSLKSVRNDSDVSSPSSIENKGASDVKSEDE--------

Query:  -----------------------------------RGYPPSKKDIEVLERTSRCIPVKYCHVEHGRVCFFSFDNVELPILP
                                           RGYPPSK++IEVLERTSR IP+KYCHVEHGRVCFFSFD VELP+LP
Subjt:  -----------------------------------RGYPPSKKDIEVLERTSRCIPVKYCHVEHGRVCFFSFDNVELPILP

XP_038883355.1 uncharacterized protein LOC120074337 [Benincasa hispida]1.7e-9668.82Show/hide
Query:  MEATDWESSSGGASGDDDNYEQDIKDEEECLCASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYESRI
        MEA DWESSSGGASGD+DNYE+DI +EEECLCASG LRKLQFRKHASTARWND+MGMAEVLENKGSLWTTTGIVR         K++ S  + +      
Subjt:  MEATDWESSSGGASGDDDNYEQDIKDEEECLCASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYESRI

Query:  RFLIEVGALDLLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSLKSVRNDSDVSSPSSIENKGASDVKSEDE----------
         FLIEVGAL LLDHDNSSLSL+DVYKK+AEGK+GCLWEQFEVYRHLKSLGFIVGKHKVPWSLKSVR+ S++SSPSSIENKGASDVKSEDE          
Subjt:  RFLIEVGALDLLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSLKSVRNDSDVSSPSSIENKGASDVKSEDE----------

Query:  ---------------------------------RGYPPSKKDIEVLERTSRCIPVKYCHVEHGRVCFFSFDNVELPILP
                                         RGYPPSKKD+EVL RTSR IP+KYCHVEHGRVCFFSFD VELPILP
Subjt:  ---------------------------------RGYPPSKKDIEVLERTSRCIPVKYCHVEHGRVCFFSFDNVELPILP

TrEMBL top hitse value%identityAlignment
A0A0A0KXV5 tRNA_int_end_N2 domain-containing protein3.4e-9568.33Show/hide
Query:  MEATDWESSSGGAS--GDDDNYEQDIKDEEECLCASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYES
        MEAT+WESSSGGAS  GD+DNYEQDI DEEECL ASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVR         K++ S  + +    
Subjt:  MEATDWESSSGGAS--GDDDNYEQDIKDEEECLCASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYES

Query:  RIRFLIEVGALDLLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSLKSVRNDSDVSSPSSIENKGASDVKSEDE--------
           FLIEVGAL LLDHDNSSLSLKDVYKKVAEG+SG LWEQFEVYRHLKSLG+IVGKH+VPWSLK+VRND D+SSPSS ENKGASDVKS+DE        
Subjt:  RIRFLIEVGALDLLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSLKSVRNDSDVSSPSSIENKGASDVKSEDE--------

Query:  -----------------------------------RGYPPSKKDIEVLERTSRCIPVKYCHVEHGRVCFFSFDNVELPILP
                                           RGYPPSK++IEVLERTSR IP+KYCHVEHGRVCFFSFD VELP+LP
Subjt:  -----------------------------------RGYPPSKKDIEVLERTSRCIPVKYCHVEHGRVCFFSFDNVELPILP

A0A1S4E447 uncharacterized protein LOC1035014749.6e-9066.67Show/hide
Query:  MEATDWESSSGGAS--GDDDNYEQDIKDEEECLCASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYES
        MEAT+W+SSSGGAS  GD+DNYEQDI DEEECL AS CLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVR               G+      
Subjt:  MEATDWESSSGGAS--GDDDNYEQDIKDEEECLCASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYES

Query:  RIRFLIEVGALDLLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSLKSVRNDSDVSSPSSIENKGASDVKSEDE--------
           FLIEVGAL LLDHDNS+LSLKDVYKKVAEG+SGC+WEQFEVYRHLKSLG+IVGKHKVPWSLK+VRND D+SSPSS E KG SDVKSEDE        
Subjt:  RIRFLIEVGALDLLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSLKSVRNDSDVSSPSSIENKGASDVKSEDE--------

Query:  -----------------------------------RGYPPSKKDIEVLERTSRCIPVKYCHVEHGRVCFFSFD
                                           RGYPPSK++IEVLERTSR IP+KYCHVEHGRVCFFSFD
Subjt:  -----------------------------------RGYPPSKKDIEVLERTSRCIPVKYCHVEHGRVCFFSFD

A0A6J1DRN3 uncharacterized protein LOC1110236604.6e-9265.95Show/hide
Query:  MEATDWESSSGGASGDDDNYEQDIKDEEECLCASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYESRI
        MEATDWESSSGGASGDD+ YEQD++DEEECLCASG +RKLQFRKHASTARWNDQMGMAEVLEN+GSLWTTTGIVR         K++ S      +E  +
Subjt:  MEATDWESSSGGASGDDDNYEQDIKDEEECLCASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYESRI

Query:  RFLIEVGALDLLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSLKSVRNDSDVSSPSSIENKGASDVKSEDE----------
         FL+EVGAL LLDHDNSSLSLKDVYKKVAEGK+GCLWEQFEVYRHLKSLGFIVGKHKVPWS+K VRN SD+S  SSIEN+GA D++S+DE          
Subjt:  RFLIEVGALDLLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSLKSVRNDSDVSSPSSIENKGASDVKSEDE----------

Query:  ---------------------------------RGYPPSKKDIEVLERTSRCIPVKYCHVEHGRVCFFSFDNVELPILP
                                         RGYPP KKDIE LERTSR I +KYCHVEHGRVCFFSFD +ELP+LP
Subjt:  ---------------------------------RGYPPSKKDIEVLERTSRCIPVKYCHVEHGRVCFFSFDNVELPILP

A0A6J1FKK4 tRNA-splicing endonuclease subunit Sen543.6e-8963.8Show/hide
Query:  MEATDWESSSGGASGDDDNYEQDIKDEEECLCASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYESRI
        MEATDWE SSGGAS DDDN+EQDIK+EEECLC+SG +RKLQFRKHASTARWND+MGMAEVLENKGSLWTT+GIVR         K++ S  + +      
Subjt:  MEATDWESSSGGASGDDDNYEQDIKDEEECLCASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYESRI

Query:  RFLIEVGALDLLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSLKSVRNDSDVSSPSSIENKGASDVKSEDE----------
         FLIEVGAL LLDHDNSSLSLKDVYKKVAEGKS C+WEQFEVYRHLKSLG+IVGKHKVPWS+K  +N  D+SS SSIENKG++D  SEDE          
Subjt:  RFLIEVGALDLLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSLKSVRNDSDVSSPSSIENKGASDVKSEDE----------

Query:  ---------------------------------RGYPPSKKDIEVLERTSRCIPVKYCHVEHGRVCFFSFDNVELPILP
                                         RGYPP K DIEV+ER S  IP+KYCHVEHGRVCFFSFD VELP+LP
Subjt:  ---------------------------------RGYPPSKKDIEVLERTSRCIPVKYCHVEHGRVCFFSFDNVELPILP

A0A6J1I8D9 tRNA-splicing endonuclease subunit Sen54 isoform X36.8e-8863.44Show/hide
Query:  MEATDWESSSGGASGDDDNYEQDIKDEEECLCASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYESRI
        MEATDWE SSGGAS DDDN+EQDIK+EEECL +SG +RKLQFRKHASTARWND+MGMAEVLENKGSLWTT+GIVR         K++ S  + +      
Subjt:  MEATDWESSSGGASGDDDNYEQDIKDEEECLCASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYESRI

Query:  RFLIEVGALDLLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSLKSVRNDSDVSSPSSIENKGASDVKSEDE----------
         FLIEVGAL LLDHDNSSLSLKDVYKKVAEGKS C+WEQFEVYRHLKSLG+IVGKHKVPWS+K  RN  D+SS SSIENKG++D +SEDE          
Subjt:  RFLIEVGALDLLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSLKSVRNDSDVSSPSSIENKGASDVKSEDE----------

Query:  ---------------------------------RGYPPSKKDIEVLERTSRCIPVKYCHVEHGRVCFFSFDNVELPILP
                                         RGYPP K DIEV+ER S  IP+KYCHVEHGRVCFFSFD V+LP+LP
Subjt:  ---------------------------------RGYPPSKKDIEVLERTSRCIPVKYCHVEHGRVCFFSFDNVELPILP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G02370.1 unknown protein6.4e-2242.62Show/hide
Query:  MAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYESRIRFLIEVGALDLL-DHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGK
        MAEV   +G LWTTTGI+R+              G+   +     +L E+G L LL D D+  +SLKD+Y ++AEGK GC WE +EVYR+LK LG+I+G+
Subjt:  MAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYESRIRFLIEVGALDLL-DHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGK

Query:  HKVPWSLKSVRNDSDVSSPSSI
        H VPW+ K   N +      S+
Subjt:  HKVPWSLKSVRNDSDVSSPSSI

AT3G02370.2 unknown protein6.4e-2242.62Show/hide
Query:  MAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYESRIRFLIEVGALDLL-DHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGK
        MAEV   +G LWTTTGI+R+              G+   +     +L E+G L LL D D+  +SLKD+Y ++AEGK GC WE +EVYR+LK LG+I+G+
Subjt:  MAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYESRIRFLIEVGALDLL-DHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGK

Query:  HKVPWSLKSVRNDSDVSSPSSI
        H VPW+ K   N +      S+
Subjt:  HKVPWSLKSVRNDSDVSSPSSI

AT3G57360.1 unknown protein7.1e-3736.63Show/hide
Query:  MEATDWESSSGGASGDDDNYEQDIKDEEECLCASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYESRI
        ME  DWE+SS  +S ++  +  D  D+EE   + G + KLQFR  +S ARW  ++GMAEV   +G LWTTTGI+RS              G+   +    
Subjt:  MEATDWESSSGGASGDDDNYEQDIKDEEECLCASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYESRI

Query:  RFLIEVGALDLL-DHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSLKSVR-----------------NDS-------------
         +L E+G L +L + D+  + LKD+Y+K+AE KSGC WE +EVYR+LK LG+I+G+H V W+LK                    ND+             
Subjt:  RFLIEVGALDLL-DHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSLKSVR-----------------NDS-------------

Query:  ----DVSSPSSIENK---GASDVKSEDERGYPPSKKDIEVLERTSRCIPVKYCHVEHGRVCFFSFDNVELPIL
            DV  P+S   K   G     +      PPSK+DI+VL++     P+ +CH+  GR  FFSF +++LP+L
Subjt:  ----DVSSPSSIENK---GASDVKSEDERGYPPSKKDIEVLERTSRCIPVKYCHVEHGRVCFFSFDNVELPIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGCTACGGATTGGGAAAGCTCTTCAGGAGGAGCCAGTGGTGATGACGACAATTACGAGCAAGACATAAAAGATGAAGAAGAATGTCTCTGCGCATCTGGGTGCTT
GCGCAAGTTGCAATTCAGGAAGCATGCTTCAACTGCTCGATGGAACGATCAAATGGGAATGGCAGAAGTTTTAGAGAACAAGGGCAGCCTTTGGACGACAACTGGCATTG
TACGGTCATATCATGACAAAGAGAAGAAATTGAAAGTGTTTCCAAGCCAAGGCCAGGAAATGGCCTATGAGAGCCGCATTAGATTTCTTATTGAAGTTGGGGCCTTAGAT
CTTTTGGATCATGATAATTCAAGTCTTTCTTTGAAAGATGTATATAAGAAGGTAGCTGAAGGAAAAAGTGGGTGTCTTTGGGAGCAGTTCGAGGTTTATAGGCACCTCAA
ATCTCTTGGTTTCATTGTTGGAAAGCATAAAGTTCCTTGGTCTCTGAAGAGTGTTAGGAATGACAGTGACGTTTCATCTCCAAGTTCTATTGAAAACAAAGGAGCATCTG
ATGTCAAATCAGAAGATGAGAGGGGATATCCACCTTCAAAAAAAGATATTGAAGTTCTTGAGAGAACATCCAGATGCATTCCAGTGAAATATTGTCATGTTGAACATGGA
CGTGTCTGTTTCTTTTCATTCGATAATGTGGAGCTCCCCATCTTACCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGCTACGGATTGGGAAAGCTCTTCAGGAGGAGCCAGTGGTGATGACGACAATTACGAGCAAGACATAAAAGATGAAGAAGAATGTCTCTGCGCATCTGGGTGCTT
GCGCAAGTTGCAATTCAGGAAGCATGCTTCAACTGCTCGATGGAACGATCAAATGGGAATGGCAGAAGTTTTAGAGAACAAGGGCAGCCTTTGGACGACAACTGGCATTG
TACGGTCATATCATGACAAAGAGAAGAAATTGAAAGTGTTTCCAAGCCAAGGCCAGGAAATGGCCTATGAGAGCCGCATTAGATTTCTTATTGAAGTTGGGGCCTTAGAT
CTTTTGGATCATGATAATTCAAGTCTTTCTTTGAAAGATGTATATAAGAAGGTAGCTGAAGGAAAAAGTGGGTGTCTTTGGGAGCAGTTCGAGGTTTATAGGCACCTCAA
ATCTCTTGGTTTCATTGTTGGAAAGCATAAAGTTCCTTGGTCTCTGAAGAGTGTTAGGAATGACAGTGACGTTTCATCTCCAAGTTCTATTGAAAACAAAGGAGCATCTG
ATGTCAAATCAGAAGATGAGAGGGGATATCCACCTTCAAAAAAAGATATTGAAGTTCTTGAGAGAACATCCAGATGCATTCCAGTGAAATATTGTCATGTTGAACATGGA
CGTGTCTGTTTCTTTTCATTCGATAATGTGGAGCTCCCCATCTTACCTTGACATTATTGTGGATTGCTCTGCTATTTTATTTTTGATATGTTATTTGAATTCTATTTTAT
GTTAAAAAAGAGTGTTTGGTCTGGTGGGCAAATTGATAAGTTGAAAGTGTACTTATTTTTAAATCTGATTTTACAGTCCTC
Protein sequenceShow/hide protein sequence
MEATDWESSSGGASGDDDNYEQDIKDEEECLCASGCLRKLQFRKHASTARWNDQMGMAEVLENKGSLWTTTGIVRSYHDKEKKLKVFPSQGQEMAYESRIRFLIEVGALD
LLDHDNSSLSLKDVYKKVAEGKSGCLWEQFEVYRHLKSLGFIVGKHKVPWSLKSVRNDSDVSSPSSIENKGASDVKSEDERGYPPSKKDIEVLERTSRCIPVKYCHVEHG
RVCFFSFDNVELPILP