; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg030583 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg030583
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionGag protease polyprotein
Genome locationscaffold6:34445759..34449948
RNA-Seq ExpressionSpg030583
SyntenySpg030583
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151688.1 uncharacterized protein LOC111019603 [Momordica charantia]5.7e-6255.94Show/hide
Query:  MLITSEALQTMDFKRYGPPSFDGQSENPLAAERWIADLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHASRPISWERFKDLLYDYYFPETV
        + + SEA    DF+RYGPP+F+G+SE     E WI +LEAL+  + C+D LK++GAVFML+ +   WW  VA  EDH + PI+W   KDLLYDYYFP+T+
Subjt:  MLITSEALQTMDFKRYGPPSFDGQSENPLAAERWIADLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHASRPISWERFKDLLYDYYFPETV

Query:  KDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLHEEIRGSIALSRPATFAEALTGALIMDKNVSKKAQPHLEKGLAFGIKRKV
        KD+KE EFLHL Q ++ V QYE+KFT  SRFA DL+ T  RKIKRF++GL + I+G I L RP T+AEA+ GAL+MDK+V +KAQP  + GL+ G+KRKV
Subjt:  KDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLHEEIRGSIALSRPATFAEALTGALIMDKNVSKKAQPHLEKGLAFGIKRKV

Query:  SP
         P
Subjt:  SP

XP_022155000.1 uncharacterized protein LOC111022144 [Momordica charantia]3.3e-6260Show/hide
Query:  AVPMLITSEALQTMDFKRYGPPSFDGQSENPLAAERWIADLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHASRPISWERFKDLLYDYYFP
        AV  +  SEA    DFKRYGPP+FDG+SE   AAE WI +LEA +  + C D  K++GAVFML+ +   WW S+AAAEDHA+  I W RFKDLLYDYY+ 
Subjt:  AVPMLITSEALQTMDFKRYGPPSFDGQSENPLAAERWIADLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHASRPISWERFKDLLYDYYFP

Query:  ETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLHEEIRGSIALSRPATFAEALTGALIMDKNVSKKAQPHLEKGLAFGIK
        ETVKD KEAEFLHL QG++SV QYERKFT LSRFA +L+     KIKRF+KGL + IRG + L RPA++AEA+ GALIMDK+VS KA    E G + G+K
Subjt:  ETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLHEEIRGSIALSRPATFAEALTGALIMDKNVSKKAQPHLEKGLAFGIK

Query:  RKVSP
        RK  P
Subjt:  RKVSP

XP_022155925.1 uncharacterized protein LOC111022925 [Momordica charantia]4.1e-5253.03Show/hide
Query:  ITSEALQTM-DFKRYGPPSFDGQSENPLAAERWIADLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHASRPISWERFKDLLYDYYFPETVK
        I+ E +Q + DFKR+GPP F+G SE P AAE W+ +LEAL+  + C+D  K+RGAVFML+ +   WW+SVAAAEDHA+ P++W RFKDLLY+YYFP TV+
Subjt:  ITSEALQTM-DFKRYGPPSFDGQSENPLAAERWIADLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHASRPISWERFKDLLYDYYFPETVK

Query:  DDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLHEEIRGSIALSRPATFAEALTGALIMDKNVSKKAQPHLEKGLAFGIKRK
        ++K AEFL L Q S+ V QYERKFT LSRF    + T + KI +FI GL  EI+G + L  P T+A A+  AL+MDK + ++ Q     G + G+KRK
Subjt:  DDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLHEEIRGSIALSRPATFAEALTGALIMDKNVSKKAQPHLEKGLAFGIKRK

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]1.5e-6258.37Show/hide
Query:  PPGQHKVDPPPPPPPPAPSAVPMLITSEALQTMDFKRYGPPSFDGQSENPLAAERWIADLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHA
        P G   V  PPP     P        SEA    DFKRYGPP+FDG+SE   A E WI +LEAL+  + C D  K++GAVFML+ +   WW SVAAAED+A
Subjt:  PPGQHKVDPPPPPPPPAPSAVPMLITSEALQTMDFKRYGPPSFDGQSENPLAAERWIADLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHA

Query:  SRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLHEEIRGSIALSRPATFAEALTGALIMDK
        + PI W RFK+LLYDYY+PETVKD KEAEFLHL QG++SV QYERKFT LSRFA +L+ T   KIKRF+KGL + IRG + L RP T+AEA+ GAL+MDK
Subjt:  SRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLHEEIRGSIALSRPATFAEALTGALIMDK

Query:  NVSKKAQPHLEKGLAFGIKRK
        +VS KA P  E G + G+KRK
Subjt:  NVSKKAQPHLEKGLAFGIKRK

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia]9.6e-6250.38Show/hide
Query:  EVNPLIPPGQHKVDPPPPP--------PPPAPSAVPMLI----------------------TSEALQTMDFKRYGPPSFDGQSENPLAAERWIADLEALF
        +V+P  P G++  DPPPPP        PP  P+A   L                        SEA    DFKRYGPP+F G SE    AE W+ +LEAL+
Subjt:  EVNPLIPPGQHKVDPPPPP--------PPPAPSAVPMLI----------------------TSEALQTMDFKRYGPPSFDGQSENPLAAERWIADLEALF

Query:  DLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHASRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERK
          + C D  K++GAVFML+ +   WW SVAA EDHA+ P+ W RFK+LLYD+Y+ ETV+D KE EFLHL QG+++V QYERKFT LS FA +L+ T   K
Subjt:  DLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHASRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERK

Query:  IKRFIKGLHEEIRGSIALSRPATFAEALTGALIMDKNVSKKAQPHLEKGLAFGIKRKVSP
        IKRF+KGLH+ IRGS+ L RP T+AEA+ G LIMDK+VS + QP +E G + G+KRKV P
Subjt:  IKRFIKGLHEEIRGSIALSRPATFAEALTGALIMDKNVSKKAQPHLEKGLAFGIKRKVSP

TrEMBL top hitse value%identityAlignment
A0A6J1DCW8 uncharacterized protein LOC1110196032.7e-6255.94Show/hide
Query:  MLITSEALQTMDFKRYGPPSFDGQSENPLAAERWIADLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHASRPISWERFKDLLYDYYFPETV
        + + SEA    DF+RYGPP+F+G+SE     E WI +LEAL+  + C+D LK++GAVFML+ +   WW  VA  EDH + PI+W   KDLLYDYYFP+T+
Subjt:  MLITSEALQTMDFKRYGPPSFDGQSENPLAAERWIADLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHASRPISWERFKDLLYDYYFPETV

Query:  KDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLHEEIRGSIALSRPATFAEALTGALIMDKNVSKKAQPHLEKGLAFGIKRKV
        KD+KE EFLHL Q ++ V QYE+KFT  SRFA DL+ T  RKIKRF++GL + I+G I L RP T+AEA+ GAL+MDK+V +KAQP  + GL+ G+KRKV
Subjt:  KDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLHEEIRGSIALSRPATFAEALTGALIMDKNVSKKAQPHLEKGLAFGIKRKV

Query:  SP
         P
Subjt:  SP

A0A6J1DL73 uncharacterized protein LOC1110221441.6e-6260Show/hide
Query:  AVPMLITSEALQTMDFKRYGPPSFDGQSENPLAAERWIADLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHASRPISWERFKDLLYDYYFP
        AV  +  SEA    DFKRYGPP+FDG+SE   AAE WI +LEA +  + C D  K++GAVFML+ +   WW S+AAAEDHA+  I W RFKDLLYDYY+ 
Subjt:  AVPMLITSEALQTMDFKRYGPPSFDGQSENPLAAERWIADLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHASRPISWERFKDLLYDYYFP

Query:  ETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLHEEIRGSIALSRPATFAEALTGALIMDKNVSKKAQPHLEKGLAFGIK
        ETVKD KEAEFLHL QG++SV QYERKFT LSRFA +L+     KIKRF+KGL + IRG + L RPA++AEA+ GALIMDK+VS KA    E G + G+K
Subjt:  ETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLHEEIRGSIALSRPATFAEALTGALIMDKNVSKKAQPHLEKGLAFGIK

Query:  RKVSP
        RK  P
Subjt:  RKVSP

A0A6J1DNV8 uncharacterized protein LOC1110229252.0e-5253.03Show/hide
Query:  ITSEALQTM-DFKRYGPPSFDGQSENPLAAERWIADLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHASRPISWERFKDLLYDYYFPETVK
        I+ E +Q + DFKR+GPP F+G SE P AAE W+ +LEAL+  + C+D  K+RGAVFML+ +   WW+SVAAAEDHA+ P++W RFKDLLY+YYFP TV+
Subjt:  ITSEALQTM-DFKRYGPPSFDGQSENPLAAERWIADLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHASRPISWERFKDLLYDYYFPETVK

Query:  DDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLHEEIRGSIALSRPATFAEALTGALIMDKNVSKKAQPHLEKGLAFGIKRK
        ++K AEFL L Q S+ V QYERKFT LSRF    + T + KI +FI GL  EI+G + L  P T+A A+  AL+MDK + ++ Q     G + G+KRK
Subjt:  DDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLHEEIRGSIALSRPATFAEALTGALIMDKNVSKKAQPHLEKGLAFGIKRK

A0A6J1DUM2 uncharacterized protein LOC1110232477.2e-6358.37Show/hide
Query:  PPGQHKVDPPPPPPPPAPSAVPMLITSEALQTMDFKRYGPPSFDGQSENPLAAERWIADLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHA
        P G   V  PPP     P        SEA    DFKRYGPP+FDG+SE   A E WI +LEAL+  + C D  K++GAVFML+ +   WW SVAAAED+A
Subjt:  PPGQHKVDPPPPPPPPAPSAVPMLITSEALQTMDFKRYGPPSFDGQSENPLAAERWIADLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHA

Query:  SRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLHEEIRGSIALSRPATFAEALTGALIMDK
        + PI W RFK+LLYDYY+PETVKD KEAEFLHL QG++SV QYERKFT LSRFA +L+ T   KIKRF+KGL + IRG + L RP T+AEA+ GAL+MDK
Subjt:  SRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLHEEIRGSIALSRPATFAEALTGALIMDK

Query:  NVSKKAQPHLEKGLAFGIKRK
        +VS KA P  E G + G+KRK
Subjt:  NVSKKAQPHLEKGLAFGIKRK

A0A6J1DVA0 uncharacterized protein LOC1110234244.7e-6250.38Show/hide
Query:  EVNPLIPPGQHKVDPPPPP--------PPPAPSAVPMLI----------------------TSEALQTMDFKRYGPPSFDGQSENPLAAERWIADLEALF
        +V+P  P G++  DPPPPP        PP  P+A   L                        SEA    DFKRYGPP+F G SE    AE W+ +LEAL+
Subjt:  EVNPLIPPGQHKVDPPPPP--------PPPAPSAVPMLI----------------------TSEALQTMDFKRYGPPSFDGQSENPLAAERWIADLEALF

Query:  DLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHASRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERK
          + C D  K++GAVFML+ +   WW SVAA EDHA+ P+ W RFK+LLYD+Y+ ETV+D KE EFLHL QG+++V QYERKFT LS FA +L+ T   K
Subjt:  DLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHASRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERK

Query:  IKRFIKGLHEEIRGSIALSRPATFAEALTGALIMDKNVSKKAQPHLEKGLAFGIKRKVSP
        IKRF+KGLH+ IRGS+ L RP T+AEA+ G LIMDK+VS + QP +E G + G+KRKV P
Subjt:  IKRFIKGLHEEIRGSIALSRPATFAEALTGALIMDKNVSKKAQPHLEKGLAFGIKRKVSP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGCACCCAAGGGTGCCCTTTTATAGGCCTGGAATAGGGGTAGCGTCGCGACGCTACCTTGCTCAGCGTTGCGACGCTGCTGCGCACGCGGCTTGGGCAACACGAAA
GGGTAGCGTCGCGACGCTGCCTTACTTAGCGTCTCGACGCTGTCCCAAAATTCCAGATTTTTCAGCTTCCTTTTGGGCTGATTTTTGGGTCTTCTTCTTCATTTCTTTTG
GTCATTTCTTCATGGTTGACTTCTTTGGGCCTCCAATTGCTTCAAGCTTTGATTTGGGTTTCTCTTCCATCGTCTCTCTCTCCCTCTTCCGTGTCTTGTCGCCGACAGCA
GCAGTTCAGAGGCGCCGTCTTCGCGTGCGTCGGCTAGCACTCACAGTTCCTTTTTTTCTCCTTCAAGCTCGCAACGTGAGATTCCTTCGCTGTTCGGAGCCGTTTTGGGT
TCGAAATCATCCCTGTTTGGTGACCCATCGTCCGTGGAGCCTTAAACTTGAACACCCAGTGCTTAGGGCCTCTAAGCAAGTGGTTTTAGACCTCTTCGCTTGCTGGACAG
CACGTGTTCGAGGTCGTTCGGACAGTGAACGCGTGTTGGTTCGGTTTTGCATTCAAGAGTCAAACATGAGTTTAGAGATGTGGTCATGTGAATGCTTAGTGGGCGTGAAT
CATGAGTCTAGAAGCATGTTGCAGGGCATATGCATTCTAGGAGATGTGAGATATAGGGTCGGTGTGATGAGGCTTGAGGCAAGTACTGAGGCCTTGGGTATAAATGGTCA
AGGGTCAATATGTTGTTTAGTCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGAATGCCGAGCTCTGTAGAGAAGTGTCGAGGCCCTGGGTAGGGGCTGCTTACCAGT
ACCTTAGTGTTAAAGTAAAATTTTGGGTTAGTCCAATGTACGTCGTTATGCTGCCGAAATTTTTGGTGCGCACGGTTTGTTTTGGTCTAGAAGTTAGTAATGTCACTGGG
TTAGCTTTTAAAATCCTGGGGCGTTACAGTTGGTATCAGAGTAGAGTTGTTCCTGTAGACTGGCCTAGGAAATCTAGGTTGTTTAGGAGTTTAGGGTTATGGTCTTCCTT
GTTCTCCTCTCCATCACCAGTACCACCTTCTCAGGCAATGTCCCGTGGTCATGATCCTGAGGTTCCAATTGTAAGGCAAGATGACCAAGCAGAGGAAGTTACTACACAGC
AGAGGGTCGATCCTTTGGCTCCCCCTCTGCAGGAGGTTAATCCCCTGATTCCTCCCGGTCAGCACAAAGTTGATCCTCCTCCTCCTCCGCCTCCTCCGGCCCCTTCTGCA
GTTCCTATGCTGATCACTTCGGAAGCCCTCCAGACCATGGACTTCAAGCGCTACGGGCCTCCCTCCTTTGATGGGCAATCCGAAAATCCGTTGGCAGCAGAGCGATGGAT
CGCTGATTTAGAGGCACTGTTTGACCTCATGAATTGTAATGATTCCTTGAAGATCAGAGGAGCAGTCTTCATGCTCAAGGATGACGTTCGCACGTGGTGGCAATCGGTGG
CAGCAGCCGAAGACCATGCTAGTCGACCGATCTCGTGGGAAAGGTTCAAGGATCTGTTGTACGATTATTACTTCCCGGAGACAGTCAAGGATGACAAAGAAGCAGAATTC
CTTCATTTGGCCCAGGGAAGTATGTCTGTAGTGCAGTATGAGAGGAAGTTCACTGCACTATCACGCTTTGCTCCTGACCTAGTCAGCACGCCAGAGCGGAAGATCAAGAG
GTTCATTAAAGGTCTCCATGAGGAAATTCGTGGCTCTATAGCCCTGAGCAGGCCTGCGACCTTTGCTGAAGCACTCACGGGGGCATTGATCATGGATAAGAATGTTTCCA
AAAAGGCACAACCTCATCTTGAAAAGGGATTAGCTTTTGGAATTAAAAGGAAAGTCTCCCCCAAGGAACCCACCTATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGCACCCAAGGGTGCCCTTTTATAGGCCTGGAATAGGGGTAGCGTCGCGACGCTACCTTGCTCAGCGTTGCGACGCTGCTGCGCACGCGGCTTGGGCAACACGAAA
GGGTAGCGTCGCGACGCTGCCTTACTTAGCGTCTCGACGCTGTCCCAAAATTCCAGATTTTTCAGCTTCCTTTTGGGCTGATTTTTGGGTCTTCTTCTTCATTTCTTTTG
GTCATTTCTTCATGGTTGACTTCTTTGGGCCTCCAATTGCTTCAAGCTTTGATTTGGGTTTCTCTTCCATCGTCTCTCTCTCCCTCTTCCGTGTCTTGTCGCCGACAGCA
GCAGTTCAGAGGCGCCGTCTTCGCGTGCGTCGGCTAGCACTCACAGTTCCTTTTTTTCTCCTTCAAGCTCGCAACGTGAGATTCCTTCGCTGTTCGGAGCCGTTTTGGGT
TCGAAATCATCCCTGTTTGGTGACCCATCGTCCGTGGAGCCTTAAACTTGAACACCCAGTGCTTAGGGCCTCTAAGCAAGTGGTTTTAGACCTCTTCGCTTGCTGGACAG
CACGTGTTCGAGGTCGTTCGGACAGTGAACGCGTGTTGGTTCGGTTTTGCATTCAAGAGTCAAACATGAGTTTAGAGATGTGGTCATGTGAATGCTTAGTGGGCGTGAAT
CATGAGTCTAGAAGCATGTTGCAGGGCATATGCATTCTAGGAGATGTGAGATATAGGGTCGGTGTGATGAGGCTTGAGGCAAGTACTGAGGCCTTGGGTATAAATGGTCA
AGGGTCAATATGTTGTTTAGTCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGAATGCCGAGCTCTGTAGAGAAGTGTCGAGGCCCTGGGTAGGGGCTGCTTACCAGT
ACCTTAGTGTTAAAGTAAAATTTTGGGTTAGTCCAATGTACGTCGTTATGCTGCCGAAATTTTTGGTGCGCACGGTTTGTTTTGGTCTAGAAGTTAGTAATGTCACTGGG
TTAGCTTTTAAAATCCTGGGGCGTTACAGTTGGTATCAGAGTAGAGTTGTTCCTGTAGACTGGCCTAGGAAATCTAGGTTGTTTAGGAGTTTAGGGTTATGGTCTTCCTT
GTTCTCCTCTCCATCACCAGTACCACCTTCTCAGGCAATGTCCCGTGGTCATGATCCTGAGGTTCCAATTGTAAGGCAAGATGACCAAGCAGAGGAAGTTACTACACAGC
AGAGGGTCGATCCTTTGGCTCCCCCTCTGCAGGAGGTTAATCCCCTGATTCCTCCCGGTCAGCACAAAGTTGATCCTCCTCCTCCTCCGCCTCCTCCGGCCCCTTCTGCA
GTTCCTATGCTGATCACTTCGGAAGCCCTCCAGACCATGGACTTCAAGCGCTACGGGCCTCCCTCCTTTGATGGGCAATCCGAAAATCCGTTGGCAGCAGAGCGATGGAT
CGCTGATTTAGAGGCACTGTTTGACCTCATGAATTGTAATGATTCCTTGAAGATCAGAGGAGCAGTCTTCATGCTCAAGGATGACGTTCGCACGTGGTGGCAATCGGTGG
CAGCAGCCGAAGACCATGCTAGTCGACCGATCTCGTGGGAAAGGTTCAAGGATCTGTTGTACGATTATTACTTCCCGGAGACAGTCAAGGATGACAAAGAAGCAGAATTC
CTTCATTTGGCCCAGGGAAGTATGTCTGTAGTGCAGTATGAGAGGAAGTTCACTGCACTATCACGCTTTGCTCCTGACCTAGTCAGCACGCCAGAGCGGAAGATCAAGAG
GTTCATTAAAGGTCTCCATGAGGAAATTCGTGGCTCTATAGCCCTGAGCAGGCCTGCGACCTTTGCTGAAGCACTCACGGGGGCATTGATCATGGATAAGAATGTTTCCA
AAAAGGCACAACCTCATCTTGAAAAGGGATTAGCTTTTGGAATTAAAAGGAAAGTCTCCCCCAAGGAACCCACCTATTGA
Protein sequenceShow/hide protein sequence
MRHPRVPFYRPGIGVASRRYLAQRCDAAAHAAWATRKGSVATLPYLASRRCPKIPDFSASFWADFWVFFFISFGHFFMVDFFGPPIASSFDLGFSSIVSLSLFRVLSPTA
AVQRRRLRVRRLALTVPFFLLQARNVRFLRCSEPFWVRNHPCLVTHRPWSLKLEHPVLRASKQVVLDLFACWTARVRGRSDSERVLVRFCIQESNMSLEMWSCECLVGVN
HESRSMLQGICILGDVRYRVGVMRLEASTEALGINGQGSICCLVIEALGINGQGSNAELCREVSRPWVGAAYQYLSVKVKFWVSPMYVVMLPKFLVRTVCFGLEVSNVTG
LAFKILGRYSWYQSRVVPVDWPRKSRLFRSLGLWSSLFSSPSPVPPSQAMSRGHDPEVPIVRQDDQAEEVTTQQRVDPLAPPLQEVNPLIPPGQHKVDPPPPPPPPAPSA
VPMLITSEALQTMDFKRYGPPSFDGQSENPLAAERWIADLEALFDLMNCNDSLKIRGAVFMLKDDVRTWWQSVAAAEDHASRPISWERFKDLLYDYYFPETVKDDKEAEF
LHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLHEEIRGSIALSRPATFAEALTGALIMDKNVSKKAQPHLEKGLAFGIKRKVSPKEPTY