; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg028712 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg028712
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUlp1-like peptidase
Genome locationscaffold7:11355467..11356973
RNA-Seq ExpressionSpg028712
SyntenySpg028712
Gene Ontology termsNA
InterPro domainsIPR015410 - Domain of unknown function DUF1985


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022146372.1 uncharacterized protein LOC111015600 [Momordica charantia]3.8e-4942.66Show/hide
Query:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRANVISFELLGKKVSFGKSEFDLITGLRYAITPTRRHSAGNRLRETYLNNSINMRCEDLDNLYPNLE
        MFRQT FGP+LD+ ++FNG L+H++LL EV E R +VISF+L  K+VSFGK EFDLITGL + +     H  G RLR  Y  +S+ ++C +L+ ++    
Subjt:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRANVISFELLGKKVSFGKSEFDLITGLRYAITPTRRHSAGNRLRETYLNNSINMRCEDLDNLYPNLE

Query:  FQTEEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIIDDWVAFCNEDWSNMIFQKTIKSLKKALKGKAESYKAKGSDSKKQV-TYSLYGFPFAFQVWAYE
        F  +ED VK+ I YFIEL MMG+E++Q IDT  + ++D W AFCN DWS+MIF +TI SLK  LK K  +Y+ K +     V TYSLYGFP+        
Subjt:  FQTEEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIIDDWVAFCNEDWSNMIFQKTIKSLKKALKGKAESYKAKGSDSKKQV-TYSLYGFPFAFQVWAYE

Query:  TISSLTGRVANRMSDTAMPRIRRWSCSHSPSYTRLETEVFASMAAVVTINLVPTDEEREFMSRTLEAPHVE--PDLPPLP--AAVP
                           R+RR           L +EVF +  + V  +L+ TD E + M R +  P V   PD P +P  A VP
Subjt:  TISSLTGRVANRMSDTAMPRIRRWSCSHSPSYTRLETEVFASMAAVVTINLVPTDEEREFMSRTLEAPHVE--PDLPPLP--AAVP

XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia]1.5e-6948.71Show/hide
Query:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRANVISFELLGKKVSFGKSEFDLITGLRYAITPTRRHSAGNRLRETYLNNSINMRCEDLDNLYPNLE
        MFRQT FGP+LD+ ++FNG L+H++LLREV E R +VISF+L GK+VSFGK EFDLITGL + +     H  G RLR  Y  + + ++C +L+ ++    
Subjt:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRANVISFELLGKKVSFGKSEFDLITGLRYAITPTRRHSAGNRLRETYLNNSINMRCEDLDNLYPNLE

Query:  FQTEEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIIDDWVAFCNEDWSNMIFQKTIKSLKKALKGKAESYKAKGSDSKKQV-TYSLYGFPFAFQVWAYE
        F  +ED VK+ I YFIEL MMG+E++Q IDT+LL ++D W  FCN DWS+MIF +TI SLK ALK K   Y+ K +     V TYSLYGFP+AFQVWAYE
Subjt:  FQTEEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIIDDWVAFCNEDWSNMIFQKTIKSLKKALKGKAESYKAKGSDSKKQV-TYSLYGFPFAFQVWAYE

Query:  TISSLTGRVANRMSDTAMPRIRRWSCSHSPSYTRLETEVFASMAAVVTINLVPTDEEREFMSRTLEAPHVE--PDLPPLP--AAVP----QVEGGAGLD-
        TIS+L        SD A+PR+ RWSC +S  +  L +EVF +  + V  +L+ TD + + M R +  P V   PD P +P  A VP      E  A  D 
Subjt:  TISSLTGRVANRMSDTAMPRIRRWSCSHSPSYTRLETEVFASMAAVVTINLVPTDEEREFMSRTLEAPHVE--PDLPPLP--AAVP----QVEGGAGLD-

Query:  --DMELDPLE
          D+E+ PLE
Subjt:  --DMELDPLE

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia]1.2e-6851.55Show/hide
Query:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRANVISFELLGKKVSFGKSEFDLITGLRYAITPTRRHSAGNRLRETYLNNSINMRCEDLDNLYPNLE
        MF QT FGP+L ++++FNG L+H++LLREV E + ++ISF L G +VSFGK EFDLITGLR+ +          RLR  Y  +  +++C +L+ ++    
Subjt:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRANVISFELLGKKVSFGKSEFDLITGLRYAITPTRRHSAGNRLRETYLNNSINMRCEDLDNLYPNLE

Query:  FQTEEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIIDDWVAFCNEDWSNMIFQKTIKSLKKALKGKAESYKAK-GSDSKKQVTYSLYGFPFAFQVWAYE
        F+ +ED VK++I YFIEL MMG+E++  +DTSLL I+D W  FCN DWS+MIF++T+ SLK ALK K E YK K   DS    TYSLY FP+AFQVWAYE
Subjt:  FQTEEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIIDDWVAFCNEDWSNMIFQKTIKSLKKALKGKAESYKAK-GSDSKKQVTYSLYGFPFAFQVWAYE

Query:  TISSLTGRVANRMSDTAMPRIRRWSCSHSPSYTRLETEVFASMAAVVTINLVPTDEER
        TIS+L+ RVA R++D A+PR+ RWSC++S ++  LE EVF ++ + V + L  TD ER
Subjt:  TISSLTGRVANRMSDTAMPRIRRWSCSHSPSYTRLETEVFASMAAVVTINLVPTDEER

XP_022157199.1 uncharacterized protein LOC111023969 [Momordica charantia]5.8e-5045.59Show/hide
Query:  MFR-QTVFGPLLDLSMIFNGQLVHYILLREVNETRANVISFELLGKKVSFGKSEFDLITGL-RYAITPTRRHSAGNRLRETYLNNSINMRCEDLDNLYPN
        MFR +T+FG  +DL M+F   LVHY LLREV +TR +V+ F++LG  V+F K+EF L+TGL R +    ++  + NRLR  Y  + +++R E+ +  Y  
Subjt:  MFR-QTVFGPLLDLSMIFNGQLVHYILLREVNETRANVISFELLGKKVSFGKSEFDLITGL-RYAITPTRRHSAGNRLRETYLNNSINMRCEDLDNLYPN

Query:  LEFQTEEDGVKMSIFYFIELVMMGREK-RQLIDTSLLNIIDDWVAFCNEDWSNMIFQKTIKSLKKALKGKAESYKAKGSDSKK-QVTYSLYGFPFAFQVW
        + F  ++D VK+S+ Y+ E+VMMG+ K +  +D  L   ++D   F N DW   I+Q+T+K L+ A+K K  +YK K + +KK QV YSL GFP AFQVW
Subjt:  LEFQTEEDGVKMSIFYFIELVMMGREK-RQLIDTSLLNIIDDWVAFCNEDWSNMIFQKTIKSLKKALKGKAESYKAKGSDSKK-QVTYSLYGFPFAFQVW

Query:  AYETISSLTGRVANRMSDTAMPRIRRWSCSHSPSYTRLETEVFASMAAVVTINLVPTDEER
        AYE I SL     NR+SDTAMPRI R+SCS S +   LE +VF S    +T  LV ++ ER
Subjt:  AYETISSLTGRVANRMSDTAMPRIRRWSCSHSPSYTRLETEVFASMAAVVTINLVPTDEER

XP_022158673.1 uncharacterized protein LOC111025136 [Momordica charantia]8.7e-4644.19Show/hide
Query:  ILLREVNETRANVISFELLGKKVSFGKSEFDLITGLRYAITPTRRHSAGNRLRETYLNNSINMRCEDLDNLYPNLEFQTEEDGVKMSIFYFIELVMMGRE
        +LLRE+  +R +VI+ ++LG +VSFG SEF LITGL+Y+  P R+ ++  RLR+ Y ++ +++   + +  Y  ++F+ + D VK+S+  F+ELV+ GR+
Subjt:  ILLREVNETRANVISFELLGKKVSFGKSEFDLITGLRYAITPTRRHSAGNRLRETYLNNSINMRCEDLDNLYPNLEFQTEEDGVKMSIFYFIELVMMGRE

Query:  KRQLIDTSLLNIIDDWVAFCNEDWSNMIFQKTIKSLKKALKGKAESYKAKGSDSKKQVTYSLYGFPFAFQVWAYETISSLTGRVANRMSDTAMPRIRRWS
        +   +D SLL ++DD    CN  W+ M F+KTI+SLK+AL         KG D   + TYSLYGFP+AFQVW YETIS LT RVA+ +    +PRI +W 
Subjt:  KRQLIDTSLLNIIDDWVAFCNEDWSNMIFQKTIKSLKKALKGKAESYKAKGSDSKKQVTYSLYGFPFAFQVWAYETISSLTGRVANRMSDTAMPRIRRWS

Query:  CSHSPSYTRLETEVF
        C +SP++  +E E+F
Subjt:  CSHSPSYTRLETEVF

TrEMBL top hitse value%identityAlignment
A0A6J1CZE8 uncharacterized protein LOC1110156001.8e-4942.66Show/hide
Query:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRANVISFELLGKKVSFGKSEFDLITGLRYAITPTRRHSAGNRLRETYLNNSINMRCEDLDNLYPNLE
        MFRQT FGP+LD+ ++FNG L+H++LL EV E R +VISF+L  K+VSFGK EFDLITGL + +     H  G RLR  Y  +S+ ++C +L+ ++    
Subjt:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRANVISFELLGKKVSFGKSEFDLITGLRYAITPTRRHSAGNRLRETYLNNSINMRCEDLDNLYPNLE

Query:  FQTEEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIIDDWVAFCNEDWSNMIFQKTIKSLKKALKGKAESYKAKGSDSKKQV-TYSLYGFPFAFQVWAYE
        F  +ED VK+ I YFIEL MMG+E++Q IDT  + ++D W AFCN DWS+MIF +TI SLK  LK K  +Y+ K +     V TYSLYGFP+        
Subjt:  FQTEEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIIDDWVAFCNEDWSNMIFQKTIKSLKKALKGKAESYKAKGSDSKKQV-TYSLYGFPFAFQVWAYE

Query:  TISSLTGRVANRMSDTAMPRIRRWSCSHSPSYTRLETEVFASMAAVVTINLVPTDEEREFMSRTLEAPHVE--PDLPPLP--AAVP
                           R+RR           L +EVF +  + V  +L+ TD E + M R +  P V   PD P +P  A VP
Subjt:  TISSLTGRVANRMSDTAMPRIRRWSCSHSPSYTRLETEVFASMAAVVTINLVPTDEEREFMSRTLEAPHVE--PDLPPLP--AAVP

A0A6J1DJX9 uncharacterized protein LOC1110207577.1e-7048.71Show/hide
Query:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRANVISFELLGKKVSFGKSEFDLITGLRYAITPTRRHSAGNRLRETYLNNSINMRCEDLDNLYPNLE
        MFRQT FGP+LD+ ++FNG L+H++LLREV E R +VISF+L GK+VSFGK EFDLITGL + +     H  G RLR  Y  + + ++C +L+ ++    
Subjt:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRANVISFELLGKKVSFGKSEFDLITGLRYAITPTRRHSAGNRLRETYLNNSINMRCEDLDNLYPNLE

Query:  FQTEEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIIDDWVAFCNEDWSNMIFQKTIKSLKKALKGKAESYKAKGSDSKKQV-TYSLYGFPFAFQVWAYE
        F  +ED VK+ I YFIEL MMG+E++Q IDT+LL ++D W  FCN DWS+MIF +TI SLK ALK K   Y+ K +     V TYSLYGFP+AFQVWAYE
Subjt:  FQTEEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIIDDWVAFCNEDWSNMIFQKTIKSLKKALKGKAESYKAKGSDSKKQV-TYSLYGFPFAFQVWAYE

Query:  TISSLTGRVANRMSDTAMPRIRRWSCSHSPSYTRLETEVFASMAAVVTINLVPTDEEREFMSRTLEAPHVE--PDLPPLP--AAVP----QVEGGAGLD-
        TIS+L        SD A+PR+ RWSC +S  +  L +EVF +  + V  +L+ TD + + M R +  P V   PD P +P  A VP      E  A  D 
Subjt:  TISSLTGRVANRMSDTAMPRIRRWSCSHSPSYTRLETEVFASMAAVVTINLVPTDEEREFMSRTLEAPHVE--PDLPPLP--AAVP----QVEGGAGLD-

Query:  --DMELDPLE
          D+E+ PLE
Subjt:  --DMELDPLE

A0A6J1DP34 uncharacterized protein LOC1110218023.6e-4535.62Show/hide
Query:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRANVISFELLGKKVSFGKSEFDLITGLRYAITPTRRHSAGNRLRETYLNNSINMRCEDLDNLYPNLE
        MFR+T F  LLD+ ++FNG L+H ILLREV E+  N ISF L  +++SF +++F LI+GL+Y  TP R ++  +RL   Y N+  ++   D + +Y    
Subjt:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRANVISFELLGKKVSFGKSEFDLITGLRYAITPTRRHSAGNRLRETYLNNSINMRCEDLDNLYPNLE

Query:  FQTEEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIIDDWVAFCNEDWSNMIFQKTIKSLKKALKGKAESYKAKGSDSKKQVTYSLYGFPFAFQVWAYET
        F+ + D VK+ I Y + + ++GRE+    D +LL I+DDW   CN +W+++ F+KTI SL+   +G  +  K    D K + +YSLYGFP+ FQVWAY+T
Subjt:  FQTEEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIIDDWVAFCNEDWSNMIFQKTIKSLKKALKGKAESYKAKGSDSKKQVTYSLYGFPFAFQVWAYET

Query:  ISSLTGRVANRMSDTAMPRIRRWSCSHSPSYTRLETEVFASMAAVVTINLVPTDEEREFMSRTLEAPHVEPDLPPLPAAVPQVEGGAGLDDMELDPLEVG
        ISSL+ RVAN++    +P I +W   HS ++  L+ ++F S     T  L  TD E  F++R+ + P  + D                 D ME    E G
Subjt:  ISSLTGRVANRMSDTAMPRIRRWSCSHSPSYTRLETEVFASMAAVVTINLVPTDEEREFMSRTLEAPHVEPDLPPLPAAVPQVEGGAGLDDMELDPLEVG

Query:  DYLG---VEEGSFGSTHFIPQEVEMTKEKPNDEMEIVKEKD-IKGENGKDKVVDEEVIEQEKNKKKKKKEKEKEVETEK
        D  G   V EGS         +VEM ++    E E  K K+ +    G+ K V++ +   +K   ++  + E E+++ K
Subjt:  DYLG---VEEGSFGSTHFIPQEVEMTKEKPNDEMEIVKEKD-IKGENGKDKVVDEEVIEQEKNKKKKKKEKEKEVETEK

A0A6J1DRZ7 uncharacterized protein LOC1110238476.1e-6951.55Show/hide
Query:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRANVISFELLGKKVSFGKSEFDLITGLRYAITPTRRHSAGNRLRETYLNNSINMRCEDLDNLYPNLE
        MF QT FGP+L ++++FNG L+H++LLREV E + ++ISF L G +VSFGK EFDLITGLR+ +          RLR  Y  +  +++C +L+ ++    
Subjt:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRANVISFELLGKKVSFGKSEFDLITGLRYAITPTRRHSAGNRLRETYLNNSINMRCEDLDNLYPNLE

Query:  FQTEEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIIDDWVAFCNEDWSNMIFQKTIKSLKKALKGKAESYKAK-GSDSKKQVTYSLYGFPFAFQVWAYE
        F+ +ED VK++I YFIEL MMG+E++  +DTSLL I+D W  FCN DWS+MIF++T+ SLK ALK K E YK K   DS    TYSLY FP+AFQVWAYE
Subjt:  FQTEEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIIDDWVAFCNEDWSNMIFQKTIKSLKKALKGKAESYKAK-GSDSKKQVTYSLYGFPFAFQVWAYE

Query:  TISSLTGRVANRMSDTAMPRIRRWSCSHSPSYTRLETEVFASMAAVVTINLVPTDEER
        TIS+L+ RVA R++D A+PR+ RWSC++S ++  LE EVF ++ + V + L  TD ER
Subjt:  TISSLTGRVANRMSDTAMPRIRRWSCSHSPSYTRLETEVFASMAAVVTINLVPTDEER

A0A6J1DSS5 uncharacterized protein LOC1110239692.8e-5045.59Show/hide
Query:  MFR-QTVFGPLLDLSMIFNGQLVHYILLREVNETRANVISFELLGKKVSFGKSEFDLITGL-RYAITPTRRHSAGNRLRETYLNNSINMRCEDLDNLYPN
        MFR +T+FG  +DL M+F   LVHY LLREV +TR +V+ F++LG  V+F K+EF L+TGL R +    ++  + NRLR  Y  + +++R E+ +  Y  
Subjt:  MFR-QTVFGPLLDLSMIFNGQLVHYILLREVNETRANVISFELLGKKVSFGKSEFDLITGL-RYAITPTRRHSAGNRLRETYLNNSINMRCEDLDNLYPN

Query:  LEFQTEEDGVKMSIFYFIELVMMGREK-RQLIDTSLLNIIDDWVAFCNEDWSNMIFQKTIKSLKKALKGKAESYKAKGSDSKK-QVTYSLYGFPFAFQVW
        + F  ++D VK+S+ Y+ E+VMMG+ K +  +D  L   ++D   F N DW   I+Q+T+K L+ A+K K  +YK K + +KK QV YSL GFP AFQVW
Subjt:  LEFQTEEDGVKMSIFYFIELVMMGREK-RQLIDTSLLNIIDDWVAFCNEDWSNMIFQKTIKSLKKALKGKAESYKAKGSDSKK-QVTYSLYGFPFAFQVW

Query:  AYETISSLTGRVANRMSDTAMPRIRRWSCSHSPSYTRLETEVFASMAAVVTINLVPTDEER
        AYE I SL     NR+SDTAMPRI R+SCS S +   LE +VF S    +T  LV ++ ER
Subjt:  AYETISSLTGRVANRMSDTAMPRIRRWSCSHSPSYTRLETEVFASMAAVVTINLVPTDEER

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTAGGCAAACGGTTTTTGGACCTCTGTTGGATCTGTCGATGATTTTTAATGGGCAACTTGTTCATTACATTCTACTTAGAGAAGTTAATGAGACTAGGGCAAATGT
AATTAGTTTTGAGTTGTTGGGGAAGAAAGTCTCATTTGGTAAGAGTGAGTTTGACCTAATCACCGGTCTTAGATATGCAATTACACCGACTAGGAGACACTCAGCGGGTA
ATAGGCTTAGAGAAACTTACTTAAATAATAGCATAAACATGAGATGTGAGGACTTAGATAATTTATACCCTAATTTAGAGTTCCAAACTGAGGAGGATGGAGTGAAGATG
TCCATATTTTACTTTATTGAGCTCGTGATGATGGGGAGAGAGAAAAGACAGTTAATTGACACATCCCTGTTGAATATCATCGACGATTGGGTTGCTTTCTGTAATGAGGA
TTGGAGCAACATGATATTCCAAAAGACTATAAAGAGCCTCAAGAAAGCATTGAAAGGAAAGGCAGAGTCGTACAAGGCAAAAGGATCGGATTCAAAGAAGCAAGTGACTT
ATAGTTTATATGGATTTCCTTTCGCGTTTCAGGTTTGGGCGTATGAGACCATTTCCTCATTGACTGGAAGGGTTGCAAATCGCATGAGTGACACGGCCATGCCGCGCATT
CGAAGATGGTCATGCTCACACTCTCCTTCGTACACCCGTCTTGAAACTGAGGTGTTTGCATCGATGGCGGCTGTTGTCACGATAAATCTTGTTCCCACCGACGAAGAGAG
AGAGTTTATGTCTCGAACGTTGGAGGCTCCACATGTAGAACCTGACCTCCCCCCTCTCCCTGCCGCTGTCCCTCAGGTGGAGGGGGGTGCAGGGTTGGATGATATGGAGC
TGGATCCACTCGAAGTGGGGGATTACTTGGGTGTGGAAGAAGGAAGCTTTGGATCCACTCACTTTATCCCTCAGGAGGTCGAGATGACGAAAGAGAAACCAAATGACGAG
ATGGAGATAGTGAAAGAGAAAGATATTAAAGGAGAAAATGGTAAAGATAAAGTAGTGGATGAAGAAGTGATCGAACAAGAAAAGAATAAGAAGAAGAAGAAGAAAGAGAA
AGAGAAGGAAGTGGAGACGGAGAAAGTGAAAGAAAAAGAAATTGAAGGAGATAAGGGCAAAGAGAAAGTAGTGGATGAACAAGTGATCGAAGGAGAAGAGAAGAAGAAGA
AGAAGAAGAAGCGGAGTTGCGAATGTACGGAGATTCTATTAAGGATGGAGGCGGAGTTACTCAAGTTCACACCCAACTCTTGTCGAACATCTCGTATAATGTCTTTTGGT
CTATATGTAGTACCGACATTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTAGGCAAACGGTTTTTGGACCTCTGTTGGATCTGTCGATGATTTTTAATGGGCAACTTGTTCATTACATTCTACTTAGAGAAGTTAATGAGACTAGGGCAAATGT
AATTAGTTTTGAGTTGTTGGGGAAGAAAGTCTCATTTGGTAAGAGTGAGTTTGACCTAATCACCGGTCTTAGATATGCAATTACACCGACTAGGAGACACTCAGCGGGTA
ATAGGCTTAGAGAAACTTACTTAAATAATAGCATAAACATGAGATGTGAGGACTTAGATAATTTATACCCTAATTTAGAGTTCCAAACTGAGGAGGATGGAGTGAAGATG
TCCATATTTTACTTTATTGAGCTCGTGATGATGGGGAGAGAGAAAAGACAGTTAATTGACACATCCCTGTTGAATATCATCGACGATTGGGTTGCTTTCTGTAATGAGGA
TTGGAGCAACATGATATTCCAAAAGACTATAAAGAGCCTCAAGAAAGCATTGAAAGGAAAGGCAGAGTCGTACAAGGCAAAAGGATCGGATTCAAAGAAGCAAGTGACTT
ATAGTTTATATGGATTTCCTTTCGCGTTTCAGGTTTGGGCGTATGAGACCATTTCCTCATTGACTGGAAGGGTTGCAAATCGCATGAGTGACACGGCCATGCCGCGCATT
CGAAGATGGTCATGCTCACACTCTCCTTCGTACACCCGTCTTGAAACTGAGGTGTTTGCATCGATGGCGGCTGTTGTCACGATAAATCTTGTTCCCACCGACGAAGAGAG
AGAGTTTATGTCTCGAACGTTGGAGGCTCCACATGTAGAACCTGACCTCCCCCCTCTCCCTGCCGCTGTCCCTCAGGTGGAGGGGGGTGCAGGGTTGGATGATATGGAGC
TGGATCCACTCGAAGTGGGGGATTACTTGGGTGTGGAAGAAGGAAGCTTTGGATCCACTCACTTTATCCCTCAGGAGGTCGAGATGACGAAAGAGAAACCAAATGACGAG
ATGGAGATAGTGAAAGAGAAAGATATTAAAGGAGAAAATGGTAAAGATAAAGTAGTGGATGAAGAAGTGATCGAACAAGAAAAGAATAAGAAGAAGAAGAAGAAAGAGAA
AGAGAAGGAAGTGGAGACGGAGAAAGTGAAAGAAAAAGAAATTGAAGGAGATAAGGGCAAAGAGAAAGTAGTGGATGAACAAGTGATCGAAGGAGAAGAGAAGAAGAAGA
AGAAGAAGAAGCGGAGTTGCGAATGTACGGAGATTCTATTAAGGATGGAGGCGGAGTTACTCAAGTTCACACCCAACTCTTGTCGAACATCTCGTATAATGTCTTTTGGT
CTATATGTAGTACCGACATTCTGA
Protein sequenceShow/hide protein sequence
MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRANVISFELLGKKVSFGKSEFDLITGLRYAITPTRRHSAGNRLRETYLNNSINMRCEDLDNLYPNLEFQTEEDGVKM
SIFYFIELVMMGREKRQLIDTSLLNIIDDWVAFCNEDWSNMIFQKTIKSLKKALKGKAESYKAKGSDSKKQVTYSLYGFPFAFQVWAYETISSLTGRVANRMSDTAMPRI
RRWSCSHSPSYTRLETEVFASMAAVVTINLVPTDEEREFMSRTLEAPHVEPDLPPLPAAVPQVEGGAGLDDMELDPLEVGDYLGVEEGSFGSTHFIPQEVEMTKEKPNDE
MEIVKEKDIKGENGKDKVVDEEVIEQEKNKKKKKKEKEKEVETEKVKEKEIEGDKGKEKVVDEQVIEGEEKKKKKKKRSCECTEILLRMEAELLKFTPNSCRTSRIMSFG
LYVVPTF