; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg028677 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg028677
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUlp1 peptidase-like
Genome locationscaffold7:13636354..13638920
RNA-Seq ExpressionSpg028677
SyntenySpg028677
Gene Ontology termsNA
InterPro domainsIPR015410 - Domain of unknown function DUF1985


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022146372.1 uncharacterized protein LOC111015600 [Momordica charantia]2.0e-4846.43Show/hide
Query:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRADVISFELSGKKVSFGKSEFDLITSLRYAITPTRRHSASNRLRETYINNSITMRCEDLENLYPNLE
        MFRQT FGP+LD+ ++FNG L+H++LL EV E R DVISF+L  K+VSFGK EFDLIT L + +     H    RLR  Y  +S+ ++C +LE ++    
Subjt:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRADVISFELSGKKVSFGKSEFDLITSLRYAITPTRRHSASNRLRETYINNSITMRCEDLENLYPNLE

Query:  FQTKEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIFDDWVAFCNEDWSNMIFQKTIKSLKKALKGKTESYKGKG-SDSKKQATYSLYGFPF--------
        F   ED VK+ I YFIEL MMG+E++Q IDT  + + D W AFCN DWS+MIF +TI SLK  LK K  +Y+ K  +D     TYSLYGFP+        
Subjt:  FQTKEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIFDDWVAFCNEDWSNMIFQKTIKSLKKALKGKTESYKGKG-SDSKKQATYSLYGFPF--------

Query:  -AFQVV------VTIHLIPTDEEREFMSRRLETPHIE--PDLPPLP--AAVP
         A +V       V  HL+ TD E + M R +  P +   PD P +P  A VP
Subjt:  -AFQVV------VTIHLIPTDEEREFMSRRLETPHIE--PDLPPLP--AAVP

XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia]5.0e-3936.16Show/hide
Query:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRADVISFELSGKKVSFGKSEFDLITSLRYAITPTRRHSASNRLRETYINNSITMRCEDLENLYPNLE
        MFRQT FGP+LD+ ++FNG L+H++LLREV E R DVISF+L GK+VSFGK EFDLIT L + +     H    RLR  Y  + + ++C +LE ++    
Subjt:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRADVISFELSGKKVSFGKSEFDLITSLRYAITPTRRHSASNRLRETYINNSITMRCEDLENLYPNLE

Query:  FQTKEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIFDDWVAFCNEDWSNMIFQKTIKSLKKALKGKTESYKGKG-SDSKKQATYSLYGFPFAFQV----
        F   ED VK+ I YFIEL MMG+E++Q IDT+LL + D W  FCN DWS+MIF +TI SLK ALK K   Y+ K  +D     TYSLYGFP+AFQV    
Subjt:  FQTKEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIFDDWVAFCNEDWSNMIFQKTIKSLKKALKGKTESYKGKG-SDSKKQATYSLYGFPFAFQV----

Query:  -------------------------------------VVTIHLIPTDEEREFMSRRLETPHIE--PDLPPLP--AAVP----QVEGGAGLD---DMELDP
                                              V  HL+ TD + + M R +  P +   PD P +P  A VP      E  A  D   D+E+ P
Subjt:  -------------------------------------VVTIHLIPTDEEREFMSRRLETPHIE--PDLPPLP--AAVP----QVEGGAGLD---DMELDP

Query:  LK--VGNYLGVEE--------ESFEAEMMKGKANDEM-EIVKEKDIKGEKGKEKVGDEKVIEQEKNKKKKKKEKEKEVETEKVKEKNVKGKKGKEKVVDE
        L+  V +   V+E        E  E  + K K    +   +K  D      ++K+GD  V  +      KK  K K  ++ K       G  G +   D+
Subjt:  LK--VGNYLGVEE--------ESFEAEMMKGKANDEM-EIVKEKDIKGEKGKEKVGDEKVIEQEKNKKKKKKEKEKEVETEKVKEKNVKGKKGKEKVVDE

Query:  EVIEQEKNKMKK--GKEKELETEKVKDKDIEGDKDKE
           +Q  ++  K  G  K ++ ++  D+D   D+D E
Subjt:  EVIEQEKNKMKK--GKEKELETEKVKDKDIEGDKDKE

XP_022155158.1 uncharacterized protein LOC111022300 [Momordica charantia]1.9e-4350Show/hide
Query:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRADVISFELSGKKVSFGKSEFDLITSLRYAITPTRRHSASNRLRETYINNSITMRCEDLENLYPNLE
        MFRQT FGP+LD+ ++FNG L+H++LLREV E R D+ISF+L GK+VSFGK EFDLIT L Y +          RLR  Y  +S+ ++C +LE ++    
Subjt:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRADVISFELSGKKVSFGKSEFDLITSLRYAITPTRRHSASNRLRETYINNSITMRCEDLENLYPNLE

Query:  FQTKEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIFDDWVAFCNEDWSNMIFQKTIKSLKKALKGKTESYK
        F   ED VK+ I YF+EL MMG+E++Q ID +LL + D W  FCN DWS++IF++T+ SLK A+  K  +Y+
Subjt:  FQTKEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIFDDWVAFCNEDWSNMIFQKTIKSLKKALKGKTESYK

XP_022156465.1 uncharacterized protein LOC111023353 [Momordica charantia]6.5e-3943.61Show/hide
Query:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRADVISFELSGKKVSFGKSEFDLITSLRYAITPTRRHSASNRLRETYINNSITMRCEDLENLYPNLE
        MFR+T+FG LLD+ ++FNG L+H ILLREV ++  + ISF L G++VSFG+ EFDLI+ L Y  +P R+ + S++LR  Y N+       D   LY    
Subjt:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRADVISFELSGKKVSFGKSEFDLITSLRYAITPTRRHSASNRLRETYINNSITMRCEDLENLYPNLE

Query:  FQTKEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIFDDWVAFCNEDWSNMIFQKTIKSLKKALKGKTESYKGKGSDSKKQATYSLYGFPFAFQVVV---
        F+   D +K+SI Y +ELV++GRE     D  LL + DDW   CN D +++ F KTI+SL    +G T   K    D   + +YSLYGFP+ FQV     
Subjt:  FQTKEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIFDDWVAFCNEDWSNMIFQKTIKSLKKALKGKTESYKGKGSDSKKQATYSLYGFPFAFQVVV---

Query:  -TIHLIPTDEEREFMSRRLETPHIEPD
         T  L  TD E  FM R  E P  E D
Subjt:  -TIHLIPTDEEREFMSRRLETPHIEPD

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia]1.5e-4852.04Show/hide
Query:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRADVISFELSGKKVSFGKSEFDLITSLRYAITPTRRHSASNRLRETYINNSITMRCEDLENLYPNLE
        MF QT FGP+L ++++FNG L+H++LLREV E + D+ISF L G +VSFGK EFDLIT LR+ +        + RLR  Y  +  +++C +LE ++    
Subjt:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRADVISFELSGKKVSFGKSEFDLITSLRYAITPTRRHSASNRLRETYINNSITMRCEDLENLYPNLE

Query:  FQTKEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIFDDWVAFCNEDWSNMIFQKTIKSLKKALKGKTESYKGK-GSDSKKQATYSLYGFPFAFQV
        F+  ED VK++I YFIEL MMG+E++  +DTSLL I D W  FCN DWS+MIF++T+ SLK ALK K E YK K   DS    TYSLY FP+AFQV
Subjt:  FQTKEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIFDDWVAFCNEDWSNMIFQKTIKSLKKALKGKTESYKGK-GSDSKKQATYSLYGFPFAFQV

TrEMBL top hitse value%identityAlignment
A0A6J1CZE8 uncharacterized protein LOC1110156009.7e-4946.43Show/hide
Query:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRADVISFELSGKKVSFGKSEFDLITSLRYAITPTRRHSASNRLRETYINNSITMRCEDLENLYPNLE
        MFRQT FGP+LD+ ++FNG L+H++LL EV E R DVISF+L  K+VSFGK EFDLIT L + +     H    RLR  Y  +S+ ++C +LE ++    
Subjt:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRADVISFELSGKKVSFGKSEFDLITSLRYAITPTRRHSASNRLRETYINNSITMRCEDLENLYPNLE

Query:  FQTKEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIFDDWVAFCNEDWSNMIFQKTIKSLKKALKGKTESYKGKG-SDSKKQATYSLYGFPF--------
        F   ED VK+ I YFIEL MMG+E++Q IDT  + + D W AFCN DWS+MIF +TI SLK  LK K  +Y+ K  +D     TYSLYGFP+        
Subjt:  FQTKEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIFDDWVAFCNEDWSNMIFQKTIKSLKKALKGKTESYKGKG-SDSKKQATYSLYGFPF--------

Query:  -AFQVV------VTIHLIPTDEEREFMSRRLETPHIE--PDLPPLP--AAVP
         A +V       V  HL+ TD E + M R +  P +   PD P +P  A VP
Subjt:  -AFQVV------VTIHLIPTDEEREFMSRRLETPHIE--PDLPPLP--AAVP

A0A6J1DJX9 uncharacterized protein LOC1110207572.4e-3936.16Show/hide
Query:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRADVISFELSGKKVSFGKSEFDLITSLRYAITPTRRHSASNRLRETYINNSITMRCEDLENLYPNLE
        MFRQT FGP+LD+ ++FNG L+H++LLREV E R DVISF+L GK+VSFGK EFDLIT L + +     H    RLR  Y  + + ++C +LE ++    
Subjt:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRADVISFELSGKKVSFGKSEFDLITSLRYAITPTRRHSASNRLRETYINNSITMRCEDLENLYPNLE

Query:  FQTKEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIFDDWVAFCNEDWSNMIFQKTIKSLKKALKGKTESYKGKG-SDSKKQATYSLYGFPFAFQV----
        F   ED VK+ I YFIEL MMG+E++Q IDT+LL + D W  FCN DWS+MIF +TI SLK ALK K   Y+ K  +D     TYSLYGFP+AFQV    
Subjt:  FQTKEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIFDDWVAFCNEDWSNMIFQKTIKSLKKALKGKTESYKGKG-SDSKKQATYSLYGFPFAFQV----

Query:  -------------------------------------VVTIHLIPTDEEREFMSRRLETPHIE--PDLPPLP--AAVP----QVEGGAGLD---DMELDP
                                              V  HL+ TD + + M R +  P +   PD P +P  A VP      E  A  D   D+E+ P
Subjt:  -------------------------------------VVTIHLIPTDEEREFMSRRLETPHIE--PDLPPLP--AAVP----QVEGGAGLD---DMELDP

Query:  LK--VGNYLGVEE--------ESFEAEMMKGKANDEM-EIVKEKDIKGEKGKEKVGDEKVIEQEKNKKKKKKEKEKEVETEKVKEKNVKGKKGKEKVVDE
        L+  V +   V+E        E  E  + K K    +   +K  D      ++K+GD  V  +      KK  K K  ++ K       G  G +   D+
Subjt:  LK--VGNYLGVEE--------ESFEAEMMKGKANDEM-EIVKEKDIKGEKGKEKVGDEKVIEQEKNKKKKKKEKEKEVETEKVKEKNVKGKKGKEKVVDE

Query:  EVIEQEKNKMKK--GKEKELETEKVKDKDIEGDKDKE
           +Q  ++  K  G  K ++ ++  D+D   D+D E
Subjt:  EVIEQEKNKMKK--GKEKELETEKVKDKDIEGDKDKE

A0A6J1DM82 uncharacterized protein LOC1110223009.4e-4450Show/hide
Query:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRADVISFELSGKKVSFGKSEFDLITSLRYAITPTRRHSASNRLRETYINNSITMRCEDLENLYPNLE
        MFRQT FGP+LD+ ++FNG L+H++LLREV E R D+ISF+L GK+VSFGK EFDLIT L Y +          RLR  Y  +S+ ++C +LE ++    
Subjt:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRADVISFELSGKKVSFGKSEFDLITSLRYAITPTRRHSASNRLRETYINNSITMRCEDLENLYPNLE

Query:  FQTKEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIFDDWVAFCNEDWSNMIFQKTIKSLKKALKGKTESYK
        F   ED VK+ I YF+EL MMG+E++Q ID +LL + D W  FCN DWS++IF++T+ SLK A+  K  +Y+
Subjt:  FQTKEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIFDDWVAFCNEDWSNMIFQKTIKSLKKALKGKTESYK

A0A6J1DQC8 uncharacterized protein LOC1110233533.1e-3943.61Show/hide
Query:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRADVISFELSGKKVSFGKSEFDLITSLRYAITPTRRHSASNRLRETYINNSITMRCEDLENLYPNLE
        MFR+T+FG LLD+ ++FNG L+H ILLREV ++  + ISF L G++VSFG+ EFDLI+ L Y  +P R+ + S++LR  Y N+       D   LY    
Subjt:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRADVISFELSGKKVSFGKSEFDLITSLRYAITPTRRHSASNRLRETYINNSITMRCEDLENLYPNLE

Query:  FQTKEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIFDDWVAFCNEDWSNMIFQKTIKSLKKALKGKTESYKGKGSDSKKQATYSLYGFPFAFQVVV---
        F+   D +K+SI Y +ELV++GRE     D  LL + DDW   CN D +++ F KTI+SL    +G T   K    D   + +YSLYGFP+ FQV     
Subjt:  FQTKEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIFDDWVAFCNEDWSNMIFQKTIKSLKKALKGKTESYKGKGSDSKKQATYSLYGFPFAFQVVV---

Query:  -TIHLIPTDEEREFMSRRLETPHIEPD
         T  L  TD E  FM R  E P  E D
Subjt:  -TIHLIPTDEEREFMSRRLETPHIEPD

A0A6J1DRZ7 uncharacterized protein LOC1110238477.4e-4952.04Show/hide
Query:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRADVISFELSGKKVSFGKSEFDLITSLRYAITPTRRHSASNRLRETYINNSITMRCEDLENLYPNLE
        MF QT FGP+L ++++FNG L+H++LLREV E + D+ISF L G +VSFGK EFDLIT LR+ +        + RLR  Y  +  +++C +LE ++    
Subjt:  MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRADVISFELSGKKVSFGKSEFDLITSLRYAITPTRRHSASNRLRETYINNSITMRCEDLENLYPNLE

Query:  FQTKEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIFDDWVAFCNEDWSNMIFQKTIKSLKKALKGKTESYKGK-GSDSKKQATYSLYGFPFAFQV
        F+  ED VK++I YFIEL MMG+E++  +DTSLL I D W  FCN DWS+MIF++T+ SLK ALK K E YK K   DS    TYSLY FP+AFQV
Subjt:  FQTKEDGVKMSIFYFIELVMMGREKRQLIDTSLLNIFDDWVAFCNEDWSNMIFQKTIKSLKKALKGKTESYKGK-GSDSKKQATYSLYGFPFAFQV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTAGGCAAACTGTTTTTGGACCTCTATTGGATCTATCGATGATTTTTAATGGGCAACTTGTTCATTACATTCTACTTAGAGAAGTTAATGAAACTAGGGCAGATGT
AATTAGTTTTGAGTTGTCTGGGAAGAAAGTCTCATTTGGTAAGAGTGAGTTTGACCTAATCACCAGTCTTAGATATGCAATTACACCGACTAGGAGACACTCAGCGAGTA
ATAGGCTTAGAGAAACTTACATAAATAATAGCATAACCATGAGATGTGAGGACTTAGAAAATTTATACCCTAATTTAGAGTTCCAAACTAAGGAGGATGGAGTGAAGATG
TCCATATTTTACTTTATTGAGCTCGTGATGATGGGGAGAGAGAAAAGACAATTAATTGACACATCCCTGTTGAATATCTTCGACGATTGGGTTGCTTTCTGTAATGAGGA
TTGGAGCAATATGATATTCCAAAAGACTATAAAGAGCCTCAAGAAAGCATTGAAAGGAAAGACAGAGTCGTACAAGGGAAAAGGATCGGATTCAAAGAAGCAGGCGACTT
ATAGTTTATATGGATTTCCTTTCGCGTTCCAGGTTGTTGTCACGATACATCTTATTCCCACCGACGAAGAGAGAGAGTTTATGTCTCGACGGCTGGAGACTCCACATATA
GAACCTGACCTTCCCCCTCTCCCTGCCGCTGTCCCTCAGGTGGAGGGGGGTGCAGGGTTGGATGATATGGAGCTGGATCCACTCAAAGTGGGGAATTACTTGGGTGTGGA
AGAAGAAAGCTTTGAGGCCGAGATGATGAAAGGGAAAGCAAATGATGAGATGGAGATAGTTAAAGAGAAAGATATTAAAGGAGAAAAAGGTAAAGAGAAAGTAGGGGATG
AAAAAGTGATCGAACAAGAAAAGAATAAGAAGAAGAAGAAGAAAGAGAAAGAGAAGGAGGTGGAGACGGAGAAAGTGAAAGAAAAAAATGTTAAAGGAAAAAAGGGTAAA
GAGAAAGTAGTGGATGAAGAAGTGATCGAACAAGAAAAGAATAAGATGAAGAAAGGGAAAGAGAAGGAGTTGGAGACGGAGAAAGTGAAAGATAAAGATATTGAAGGAGA
TAAGGACAAAGAGAAAGTAGTGGATGAACAAGTGATCGAAGGAGAAGAGAAGAAGAAGAAGAAGAAGAAGCGGAGTTGCGAATGTACGGAGATTCTATTAAGGATGGAGG
CGGAGTTACACGACATGCGTAGATTGTTACGGAAGCTTGCTAAGACCTGTCTAAGTACATTGGACCACGATGATGCGAGTAATGGGGGTCCATCCACTAAAAAACATGAT
GACGAGGGTAATGGGGGTCCATCCACCAAAAACCATGATGACGAGGGTAATGAGCGTGACGTCGAGGACAACGTACCTGGTAGTGGGAAGAGTCCATCCACCGAAAAACA
TGATGACACGACCGGGGAACGTGACACCGATGACGACATAGGAGTCAGTGGGAAGGTGGATGACATCGTGGTACCTAAACTGGAGCTTGTTGAGTTTGAGGACGGGGAGG
AGGACAAAGTGGACGTAATGGGACCGGACGAACCCATCATGCGTCGTGGAAAGCGTGTTCGTCAGATATGGACACTCGTCCCGACTTATGTCGTTGGAGGTTCGTTACCG
TTCTTAATGCGAGGGGACGTGTATGAAGAACTTGTAGGAGGCAACCCCGAGTCCTTCGACTGGAGTAGGTTCAAGTCCGTCCTCAAATACGTTTGGGGCGAGCACACGGA
TTATAATGTTCCATGGAGTACGGTAGATGCCATGTACATGCCATTTGGGGCGACGCAGGTAGATAGTGGAGATAACAAAACCCAGCTGGGGAACTCCAATCCTCCTCTTG
AATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTAGGCAAACTGTTTTTGGACCTCTATTGGATCTATCGATGATTTTTAATGGGCAACTTGTTCATTACATTCTACTTAGAGAAGTTAATGAAACTAGGGCAGATGT
AATTAGTTTTGAGTTGTCTGGGAAGAAAGTCTCATTTGGTAAGAGTGAGTTTGACCTAATCACCAGTCTTAGATATGCAATTACACCGACTAGGAGACACTCAGCGAGTA
ATAGGCTTAGAGAAACTTACATAAATAATAGCATAACCATGAGATGTGAGGACTTAGAAAATTTATACCCTAATTTAGAGTTCCAAACTAAGGAGGATGGAGTGAAGATG
TCCATATTTTACTTTATTGAGCTCGTGATGATGGGGAGAGAGAAAAGACAATTAATTGACACATCCCTGTTGAATATCTTCGACGATTGGGTTGCTTTCTGTAATGAGGA
TTGGAGCAATATGATATTCCAAAAGACTATAAAGAGCCTCAAGAAAGCATTGAAAGGAAAGACAGAGTCGTACAAGGGAAAAGGATCGGATTCAAAGAAGCAGGCGACTT
ATAGTTTATATGGATTTCCTTTCGCGTTCCAGGTTGTTGTCACGATACATCTTATTCCCACCGACGAAGAGAGAGAGTTTATGTCTCGACGGCTGGAGACTCCACATATA
GAACCTGACCTTCCCCCTCTCCCTGCCGCTGTCCCTCAGGTGGAGGGGGGTGCAGGGTTGGATGATATGGAGCTGGATCCACTCAAAGTGGGGAATTACTTGGGTGTGGA
AGAAGAAAGCTTTGAGGCCGAGATGATGAAAGGGAAAGCAAATGATGAGATGGAGATAGTTAAAGAGAAAGATATTAAAGGAGAAAAAGGTAAAGAGAAAGTAGGGGATG
AAAAAGTGATCGAACAAGAAAAGAATAAGAAGAAGAAGAAGAAAGAGAAAGAGAAGGAGGTGGAGACGGAGAAAGTGAAAGAAAAAAATGTTAAAGGAAAAAAGGGTAAA
GAGAAAGTAGTGGATGAAGAAGTGATCGAACAAGAAAAGAATAAGATGAAGAAAGGGAAAGAGAAGGAGTTGGAGACGGAGAAAGTGAAAGATAAAGATATTGAAGGAGA
TAAGGACAAAGAGAAAGTAGTGGATGAACAAGTGATCGAAGGAGAAGAGAAGAAGAAGAAGAAGAAGAAGCGGAGTTGCGAATGTACGGAGATTCTATTAAGGATGGAGG
CGGAGTTACACGACATGCGTAGATTGTTACGGAAGCTTGCTAAGACCTGTCTAAGTACATTGGACCACGATGATGCGAGTAATGGGGGTCCATCCACTAAAAAACATGAT
GACGAGGGTAATGGGGGTCCATCCACCAAAAACCATGATGACGAGGGTAATGAGCGTGACGTCGAGGACAACGTACCTGGTAGTGGGAAGAGTCCATCCACCGAAAAACA
TGATGACACGACCGGGGAACGTGACACCGATGACGACATAGGAGTCAGTGGGAAGGTGGATGACATCGTGGTACCTAAACTGGAGCTTGTTGAGTTTGAGGACGGGGAGG
AGGACAAAGTGGACGTAATGGGACCGGACGAACCCATCATGCGTCGTGGAAAGCGTGTTCGTCAGATATGGACACTCGTCCCGACTTATGTCGTTGGAGGTTCGTTACCG
TTCTTAATGCGAGGGGACGTGTATGAAGAACTTGTAGGAGGCAACCCCGAGTCCTTCGACTGGAGTAGGTTCAAGTCCGTCCTCAAATACGTTTGGGGCGAGCACACGGA
TTATAATGTTCCATGGAGTACGGTAGATGCCATGTACATGCCATTTGGGGCGACGCAGGTAGATAGTGGAGATAACAAAACCCAGCTGGGGAACTCCAATCCTCCTCTTG
AATGA
Protein sequenceShow/hide protein sequence
MFRQTVFGPLLDLSMIFNGQLVHYILLREVNETRADVISFELSGKKVSFGKSEFDLITSLRYAITPTRRHSASNRLRETYINNSITMRCEDLENLYPNLEFQTKEDGVKM
SIFYFIELVMMGREKRQLIDTSLLNIFDDWVAFCNEDWSNMIFQKTIKSLKKALKGKTESYKGKGSDSKKQATYSLYGFPFAFQVVVTIHLIPTDEEREFMSRRLETPHI
EPDLPPLPAAVPQVEGGAGLDDMELDPLKVGNYLGVEEESFEAEMMKGKANDEMEIVKEKDIKGEKGKEKVGDEKVIEQEKNKKKKKKEKEKEVETEKVKEKNVKGKKGK
EKVVDEEVIEQEKNKMKKGKEKELETEKVKDKDIEGDKDKEKVVDEQVIEGEEKKKKKKKRSCECTEILLRMEAELHDMRRLLRKLAKTCLSTLDHDDASNGGPSTKKHD
DEGNGGPSTKNHDDEGNERDVEDNVPGSGKSPSTEKHDDTTGERDTDDDIGVSGKVDDIVVPKLELVEFEDGEEDKVDVMGPDEPIMRRGKRVRQIWTLVPTYVVGGSLP
FLMRGDVYEELVGGNPESFDWSRFKSVLKYVWGEHTDYNVPWSTVDAMYMPFGATQVDSGDNKTQLGNSNPPLE