; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041174 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041174
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRNase H domain-containing protein
Genome locationchr13:13326361..13333677
RNA-Seq ExpressionLag0041174
SyntenyLag0041174
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143317.1 uncharacterized protein LOC111013216 [Momordica charantia]1.3e-1930.89Show/hide
Query:  NWSPRDFWNWMVDHLNNEEIAKGSIIMWSIWNHRNKI--------QASSNRGAAEFLIKDVQRN--INDWESSYLKNQNPERLRNHVS--HVIWKRPKPN
        +W+ +D WNW+V+ L++EE+A   +I W IW  RN+         +   +R    F+  ++ +   I+    S   +    R R +++   V W  P  N
Subjt:  NWSPRDFWNWMVDHLNNEEIAKGSIIMWSIWNHRNKI--------QASSNRGAAEFLIKDVQRN--INDWESSYLKNQNPERLRNHVS--HVIWKRPKPN

Query:  SWKLNADATWFEQRSVGGLGWVVHDSNGSLICFGLQKTDRNWEIKIMEAKAILMGIKAIRQTCLQLNLGLEIESDALEVIKVLVGDEEDLS
         WKLN DA+W E+R VGG+GW++ D  G ++  G  K     EI  +E   I+ G++ I    +Q    + +ESD++EVI+++  ++ DL+
Subjt:  SWKLNADATWFEQRSVGGLGWVVHDSNGSLICFGLQKTDRNWEIKIMEAKAILMGIKAIRQTCLQLNLGLEIESDALEVIKVLVGDEEDLS

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia]1.1e-1834.09Show/hide
Query:  VDHLNNEEIAKGSIIMWSIWNHRNK------------IQASSNRGAAEFLIKDVQRNINDWESSYLKNQNPERLRNHVSHVIWKRPKPNSWKLNADATWF
        +D    EE  +  II W IW  RNK            IQ + +R    ++I    RN N    S  K+ +  R     +   WK P  NSWKLN +A W 
Subjt:  VDHLNNEEIAKGSIIMWSIWNHRNK------------IQASSNRGAAEFLIKDVQRNINDWESSYLKNQNPERLRNHVSHVIWKRPKPNSWKLNADATWF

Query:  EQRSVGGLGWVVHDSNGSLICFGLQKTDRNWEIKIMEAKAILMGIKAIRQT-CLQLNLGLEIESDALEVIKVLVGDEEDLSELKAIAETIVSSSKDLREV
           + GG+GW++ D  G +I    +       I  +E  AI  G++AIRQ  C  ++L    ESD+LE I +L    +D +E+  + E I    KD+  V
Subjt:  EQRSVGGLGWVVHDSNGSLICFGLQKTDRNWEIKIMEAKAILMGIKAIRQT-CLQLNLGLEIESDALEVIKVLVGDEEDLSELKAIAETIVSSSKDLREV

Query:  SFIHCNRLANSTAHWLARHA
        S  H +R AN  AH LAR A
Subjt:  SFIHCNRLANSTAHWLARHA

XP_022154991.1 uncharacterized protein LOC111022134 isoform X2 [Momordica charantia]2.3e-1631.98Show/hide
Query:  KIHIDFSEL---SNWSPRDFWNWMVDHLNNEEIAKGSIIMWSIWNHRNK------------IQASSNRGAAEFLIKDVQRNINDWESSYLKNQNPERLRN
        + + +F+EL   +NW+ +++W W++D    EE  +  II   IW  RNK            IQ + +R    ++I    ++ N    S  K+ +P R   
Subjt:  KIHIDFSEL---SNWSPRDFWNWMVDHLNNEEIAKGSIIMWSIWNHRNK------------IQASSNRGAAEFLIKDVQRNINDWESSYLKNQNPERLRN

Query:  HVSHVIWKRPKPNSWKLNADATWFEQRSVGGLGWVVHDSNGSLICFGLQKTDRNWEIKIMEAKAILMGIKAIRQT-CLQLNLGLEIESDALEVIKVL
          +   WK P  NSWKLN DA W    +  G+GW++ D  G +I  G +       I  +E  AI  G++AIRQ  C  ++L    ESD+LE I +L
Subjt:  HVSHVIWKRPKPNSWKLNADATWFEQRSVGGLGWVVHDSNGSLICFGLQKTDRNWEIKIMEAKAILMGIKAIRQT-CLQLNLGLEIESDALEVIKVL

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia]7.8e-1729.52Show/hide
Query:  MVDHLNNEEIAKGSIIMWSIWNHRNKIQASSNRGAAEFLIKDVQRNINDWESSYLKNQNPERLRNHVSHVI-WKRPKPNSWKLNADATWFEQRSVGGLGW
        M+D  ++E++    I  W IWNHRN +       +   +I+ + + +   ESSY    +   L   +++ + W+ P  + W LNADA+W +    GG+GW
Subjt:  MVDHLNNEEIAKGSIIMWSIWNHRNKIQASSNRGAAEFLIKDVQRNINDWESSYLKNQNPERLRNHVSHVI-WKRPKPNSWKLNADATWFEQRSVGGLGW

Query:  VVHDSNGSLICFGLQKTDRNWEIKIMEAKAILMGIKAIRQTCLQLNLGLEIESDALEVIKVLVGDEEDLSELKAIAETIVSSSKDLREVSFIHCNRLANS
        ++   +G ++  G +  +    +K++EA AIL G++ +  T L +   L IE+D+ EV  +L    EDL++   + E I++       ++F    R  N 
Subjt:  VVHDSNGSLICFGLQKTDRNWEIKIMEAKAILMGIKAIRQTCLQLNLGLEIESDALEVIKVLVGDEEDLSELKAIAETIVSSSKDLREVSFIHCNRLANS

Query:  TAHWLARHAS
         AH LA+ AS
Subjt:  TAHWLARHAS

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]2.1e-1733.63Show/hide
Query:  VDHLNNEEIAKGSIIMWSIWNHRNK------------IQASSNRGAAEFLIKDVQRNINDWESSYLKNQNPERLRNHVSHVIWKRPKPNSWKLNADATWF
        +D    EE  +  II W IW  RNK            IQ   +R    ++I    R+ N    S  K+ +  R     +   WK P  NSWKLN DA W 
Subjt:  VDHLNNEEIAKGSIIMWSIWNHRNK------------IQASSNRGAAEFLIKDVQRNINDWESSYLKNQNPERLRNHVSHVIWKRPKPNSWKLNADATWF

Query:  EQRSVGGLGWVVHDSNGSLICFGLQ--KTDRNWEIKIMEAKAILMGIKAIRQT-CLQLN----LGLEIESDALEVIKVLVGDEEDLSELKAIAETIVSSS
           + GG+GW++ D  G +I    +  +T+RN  I  +E  AI  G++AIRQ  C  +       + +ESD+LE I +L    +D +E+  + E I    
Subjt:  EQRSVGGLGWVVHDSNGSLICFGLQ--KTDRNWEIKIMEAKAILMGIKAIRQT-CLQLN----LGLEIESDALEVIKVLVGDEEDLSELKAIAETIVSSS

Query:  KDLREVSFIHCNRLANSTAHWLARHA
        +D++ VS  H +R AN  AH LAR A
Subjt:  KDLREVSFIHCNRLANSTAHWLARHA

TrEMBL top hitse value%identityAlignment
A0A6J1CP26 uncharacterized protein LOC1110134125.3e-1934.09Show/hide
Query:  VDHLNNEEIAKGSIIMWSIWNHRNK------------IQASSNRGAAEFLIKDVQRNINDWESSYLKNQNPERLRNHVSHVIWKRPKPNSWKLNADATWF
        +D    EE  +  II W IW  RNK            IQ + +R    ++I    RN N    S  K+ +  R     +   WK P  NSWKLN +A W 
Subjt:  VDHLNNEEIAKGSIIMWSIWNHRNK------------IQASSNRGAAEFLIKDVQRNINDWESSYLKNQNPERLRNHVSHVIWKRPKPNSWKLNADATWF

Query:  EQRSVGGLGWVVHDSNGSLICFGLQKTDRNWEIKIMEAKAILMGIKAIRQT-CLQLNLGLEIESDALEVIKVLVGDEEDLSELKAIAETIVSSSKDLREV
           + GG+GW++ D  G +I    +       I  +E  AI  G++AIRQ  C  ++L    ESD+LE I +L    +D +E+  + E I    KD+  V
Subjt:  EQRSVGGLGWVVHDSNGSLICFGLQKTDRNWEIKIMEAKAILMGIKAIRQT-CLQLNLGLEIESDALEVIKVLVGDEEDLSELKAIAETIVSSSKDLREV

Query:  SFIHCNRLANSTAHWLARHA
        S  H +R AN  AH LAR A
Subjt:  SFIHCNRLANSTAHWLARHA

A0A6J1CQG0 uncharacterized protein LOC1110132166.2e-2030.89Show/hide
Query:  NWSPRDFWNWMVDHLNNEEIAKGSIIMWSIWNHRNKI--------QASSNRGAAEFLIKDVQRN--INDWESSYLKNQNPERLRNHVS--HVIWKRPKPN
        +W+ +D WNW+V+ L++EE+A   +I W IW  RN+         +   +R    F+  ++ +   I+    S   +    R R +++   V W  P  N
Subjt:  NWSPRDFWNWMVDHLNNEEIAKGSIIMWSIWNHRNKI--------QASSNRGAAEFLIKDVQRN--INDWESSYLKNQNPERLRNHVS--HVIWKRPKPN

Query:  SWKLNADATWFEQRSVGGLGWVVHDSNGSLICFGLQKTDRNWEIKIMEAKAILMGIKAIRQTCLQLNLGLEIESDALEVIKVLVGDEEDLS
         WKLN DA+W E+R VGG+GW++ D  G ++  G  K     EI  +E   I+ G++ I    +Q    + +ESD++EVI+++  ++ DL+
Subjt:  SWKLNADATWFEQRSVGGLGWVVHDSNGSLICFGLQKTDRNWEIKIMEAKAILMGIKAIRQTCLQLNLGLEIESDALEVIKVLVGDEEDLS

A0A6J1DNV9 uncharacterized protein LOC1110224033.8e-1729.52Show/hide
Query:  MVDHLNNEEIAKGSIIMWSIWNHRNKIQASSNRGAAEFLIKDVQRNINDWESSYLKNQNPERLRNHVSHVI-WKRPKPNSWKLNADATWFEQRSVGGLGW
        M+D  ++E++    I  W IWNHRN +       +   +I+ + + +   ESSY    +   L   +++ + W+ P  + W LNADA+W +    GG+GW
Subjt:  MVDHLNNEEIAKGSIIMWSIWNHRNKIQASSNRGAAEFLIKDVQRNINDWESSYLKNQNPERLRNHVSHVI-WKRPKPNSWKLNADATWFEQRSVGGLGW

Query:  VVHDSNGSLICFGLQKTDRNWEIKIMEAKAILMGIKAIRQTCLQLNLGLEIESDALEVIKVLVGDEEDLSELKAIAETIVSSSKDLREVSFIHCNRLANS
        ++   +G ++  G +  +    +K++EA AIL G++ +  T L +   L IE+D+ EV  +L    EDL++   + E I++       ++F    R  N 
Subjt:  VVHDSNGSLICFGLQKTDRNWEIKIMEAKAILMGIKAIRQTCLQLNLGLEIESDALEVIKVLVGDEEDLSELKAIAETIVSSSKDLREVSFIHCNRLANS

Query:  TAHWLARHAS
         AH LA+ AS
Subjt:  TAHWLARHAS

A0A6J1DQC9 uncharacterized protein LOC111022134 isoform X21.1e-1631.98Show/hide
Query:  KIHIDFSEL---SNWSPRDFWNWMVDHLNNEEIAKGSIIMWSIWNHRNK------------IQASSNRGAAEFLIKDVQRNINDWESSYLKNQNPERLRN
        + + +F+EL   +NW+ +++W W++D    EE  +  II   IW  RNK            IQ + +R    ++I    ++ N    S  K+ +P R   
Subjt:  KIHIDFSEL---SNWSPRDFWNWMVDHLNNEEIAKGSIIMWSIWNHRNK------------IQASSNRGAAEFLIKDVQRNINDWESSYLKNQNPERLRN

Query:  HVSHVIWKRPKPNSWKLNADATWFEQRSVGGLGWVVHDSNGSLICFGLQKTDRNWEIKIMEAKAILMGIKAIRQT-CLQLNLGLEIESDALEVIKVL
          +   WK P  NSWKLN DA W    +  G+GW++ D  G +I  G +       I  +E  AI  G++AIRQ  C  ++L    ESD+LE I +L
Subjt:  HVSHVIWKRPKPNSWKLNADATWFEQRSVGGLGWVVHDSNGSLICFGLQKTDRNWEIKIMEAKAILMGIKAIRQT-CLQLNLGLEIESDALEVIKVL

A0A6J1DSV1 uncharacterized protein LOC1110236081.0e-1733.63Show/hide
Query:  VDHLNNEEIAKGSIIMWSIWNHRNK------------IQASSNRGAAEFLIKDVQRNINDWESSYLKNQNPERLRNHVSHVIWKRPKPNSWKLNADATWF
        +D    EE  +  II W IW  RNK            IQ   +R    ++I    R+ N    S  K+ +  R     +   WK P  NSWKLN DA W 
Subjt:  VDHLNNEEIAKGSIIMWSIWNHRNK------------IQASSNRGAAEFLIKDVQRNINDWESSYLKNQNPERLRNHVSHVIWKRPKPNSWKLNADATWF

Query:  EQRSVGGLGWVVHDSNGSLICFGLQ--KTDRNWEIKIMEAKAILMGIKAIRQT-CLQLN----LGLEIESDALEVIKVLVGDEEDLSELKAIAETIVSSS
           + GG+GW++ D  G +I    +  +T+RN  I  +E  AI  G++AIRQ  C  +       + +ESD+LE I +L    +D +E+  + E I    
Subjt:  EQRSVGGLGWVVHDSNGSLICFGLQ--KTDRNWEIKIMEAKAILMGIKAIRQT-CLQLN----LGLEIESDALEVIKVLVGDEEDLSELKAIAETIVSSS

Query:  KDLREVSFIHCNRLANSTAHWLARHA
        +D++ VS  H +R AN  AH LAR A
Subjt:  KDLREVSFIHCNRLANSTAHWLARHA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein1.7e-0624.1Show/hide
Query:  IMWSIWNHRNKIQASSNRGAAEFLIKDVQRNINDW---ESSYLKNQNPERLRNHVSHVIWKRPKPNSWKLNADATWFEQRSVGGLGWVVHDSNGSLICFG
        ++W IW  RN +  +  R +    +   +   +DW     S+ K  +P R +   + + W+ P     K N DA +  Q+     GW++ +  G+ I +G
Subjt:  IMWSIWNHRNKIQASSNRGAAEFLIKDVQRNINDW---ESSYLKNQNPERLRNHVSHVIWKRPKPNSWKLNADATWFEQRSVGGLGWVVHDSNGSLICFG

Query:  LQKTDRNWEIKIMEAKAILMGIKAIRQTCLQLNLGLEIESDALEVIKVLVGDEEDLSELKAIAETIVSSSKDLREVSFIHCNRLANSTAHWLARH
          K          E KA+L    A++QT ++    + +E D   +I ++ G     S L    E I   +     + F    R  N  AH LA++
Subjt:  LQKTDRNWEIKIMEAKAILMGIKAIRQTCLQLNLGLEIESDALEVIKVLVGDEEDLSELKAIAETIVSSSKDLREVSFIHCNRLANSTAHWLARH

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.1e-0823.27Show/hide
Query:  IMWSIWNHRNKIQASSNRGAAEFLIKDVQRNINDWESSYLKN--QNPERLRNHVSHVIWKRPKPNSWKLNADATWFEQRSVGGLGWVVHDSNGSLICFGL
        +MW IW   N +  +  R   +  ++    +  +W  + + N  QN  R  +   +  W  P  +  K N DA+  E+ +V GLGW++ +S G++I  G+
Subjt:  IMWSIWNHRNKIQASSNRGAAEFLIKDVQRNINDWESSYLKN--QNPERLRNHVSHVIWKRPKPNSWKLNADATWFEQRSVGGLGWVVHDSNGSLICFGL

Query:  QKTDRNWEIKIMEAKAILMGIKAIRQTCLQLNLG---LEIESDALEVIKVLVGDEEDLSELKAIAETIVSSSKDLREVSFIHCNRLANSTAHWLARHASS
         K       +  E   ++  I+A          G   +  E D  + I  ++  +     L+   +TI S       + F   +R  N  A +LA+ A  
Subjt:  QKTDRNWEIKIMEAKAILMGIKAIRQTCLQLNLG---LEIESDALEVIKVLVGDEEDLSELKAIAETIVSSSKDLREVSFIHCNRLANSTAHWLARHASS

Query:  VN
         N
Subjt:  VN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGTCATTGCCACTGCTGCAACCCACATAGTGTCGCCGCCGCTACCACGTCCCGCACAGTTCCGCCGCTGCCGCTCTAGTTTGAAGATCCATATTGATTTCAGTGA
GTTGTCAAATTGGAGCCCCAGAGACTTCTGGAATTGGATGGTGGATCATCTTAACAATGAGGAGATAGCAAAGGGGTCAATCATCATGTGGAGTATATGGAATCACAGAA
ACAAAATTCAGGCATCAAGCAATCGGGGAGCAGCAGAATTTCTCATCAAAGATGTACAAAGGAACATCAACGATTGGGAGAGTTCCTACCTTAAGAATCAAAATCCGGAA
AGGCTGAGGAACCACGTGAGTCATGTCATTTGGAAGAGACCGAAACCAAACTCTTGGAAGCTAAACGCAGACGCGACTTGGTTCGAGCAAAGAAGTGTCGGAGGCCTCGG
GTGGGTGGTGCACGACTCGAATGGGTCCTTAATCTGTTTCGGATTACAAAAAACCGATCGGAATTGGGAAATCAAAATCATGGAAGCCAAAGCAATTCTCATGGGAATCA
AGGCAATTCGACAAACCTGCCTTCAATTGAATCTAGGGCTGGAAATAGAATCAGACGCCCTGGAAGTGATCAAGGTCCTGGTCGGAGACGAAGAAGACCTGTCGGAGCTC
AAGGCCATCGCTGAGACGATCGTGTCTTCCTCCAAGGATCTGCGTGAAGTTTCTTTCATCCACTGTAACCGTCTAGCTAATTCAACAGCCCACTGGTTGGCTAGGCACGC
TTCTTCTGTAAATTTTTGTTCTAAAAATTTTGATTTCGATCAGGGGAATCCTCTTTGCGAGGAATCTGGGCTTTCTTTTTGGGCGCCTGATCTTCCCTCCTGGTTCTCCC
CTCCTTTTTTAGAGGATATGAGACAAAAGAACTCATGTACAACCTATGACAGAGTTGAGATAATTTCTCAAATTCTTGAGCTCATTGTCGGCGAGCTTGAGCAGCAACCA
GCATCAGAGCTCGAGCAACATCCATTGCAGCAGGTCATCAGCGATAGGCGTTCAGCAACGAATCCGATGACCAAGAGCAGACGCGCACGAACAGGCAGCAGTGGGTCTTG
TTATTGGGTGATTTCCGGTCGATTTCTACTCCACTCACATCTCCTCTGCGTCATCCATGGTGGTCCAGCGGCTGGACGAGTTTTTGGCGAGTTCAAGGCAGCAGCGGCAG
GTTGGATCTTCTATGGTAGGCGCCTCAGTCCAGCGGCGGTGCAAAGACTTTTTCCGACGATTTGGGTTTTTATCGAGTGTGGGCGAGTTCAAGCCGATTTCCAACGGAGT
TTTCGCGTGGGTATCTTCCTTTGGCGTTTTCTGACGAGGTCTAGTAAGCCTTTAAGTTTCAGCAAAATTAGAGTTCTCTTCGCATTTGAAAGTTTAGTTGTCGATTTGGA
GCGTTTTCGGGCCCTCGGACATATGCCTCCACGTGGTCAAGGTCGAGGACGGGGACGCAGGCGTGGTCGTGATAGGGGTGGTAGAGGTCTGACACTCCTGGAACAAGTAG
ATCCTCCTATGGATCAGCATGATGAAGATCTCCCTGATGAGAAAGATCTTGCGTCGCCTGCACCTCCAGCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACGTCATTGCCACTGCTGCAACCCACATAGTGTCGCCGCCGCTACCACGTCCCGCACAGTTCCGCCGCTGCCGCTCTAGTTTGAAGATCCATATTGATTTCAGTGA
GTTGTCAAATTGGAGCCCCAGAGACTTCTGGAATTGGATGGTGGATCATCTTAACAATGAGGAGATAGCAAAGGGGTCAATCATCATGTGGAGTATATGGAATCACAGAA
ACAAAATTCAGGCATCAAGCAATCGGGGAGCAGCAGAATTTCTCATCAAAGATGTACAAAGGAACATCAACGATTGGGAGAGTTCCTACCTTAAGAATCAAAATCCGGAA
AGGCTGAGGAACCACGTGAGTCATGTCATTTGGAAGAGACCGAAACCAAACTCTTGGAAGCTAAACGCAGACGCGACTTGGTTCGAGCAAAGAAGTGTCGGAGGCCTCGG
GTGGGTGGTGCACGACTCGAATGGGTCCTTAATCTGTTTCGGATTACAAAAAACCGATCGGAATTGGGAAATCAAAATCATGGAAGCCAAAGCAATTCTCATGGGAATCA
AGGCAATTCGACAAACCTGCCTTCAATTGAATCTAGGGCTGGAAATAGAATCAGACGCCCTGGAAGTGATCAAGGTCCTGGTCGGAGACGAAGAAGACCTGTCGGAGCTC
AAGGCCATCGCTGAGACGATCGTGTCTTCCTCCAAGGATCTGCGTGAAGTTTCTTTCATCCACTGTAACCGTCTAGCTAATTCAACAGCCCACTGGTTGGCTAGGCACGC
TTCTTCTGTAAATTTTTGTTCTAAAAATTTTGATTTCGATCAGGGGAATCCTCTTTGCGAGGAATCTGGGCTTTCTTTTTGGGCGCCTGATCTTCCCTCCTGGTTCTCCC
CTCCTTTTTTAGAGGATATGAGACAAAAGAACTCATGTACAACCTATGACAGAGTTGAGATAATTTCTCAAATTCTTGAGCTCATTGTCGGCGAGCTTGAGCAGCAACCA
GCATCAGAGCTCGAGCAACATCCATTGCAGCAGGTCATCAGCGATAGGCGTTCAGCAACGAATCCGATGACCAAGAGCAGACGCGCACGAACAGGCAGCAGTGGGTCTTG
TTATTGGGTGATTTCCGGTCGATTTCTACTCCACTCACATCTCCTCTGCGTCATCCATGGTGGTCCAGCGGCTGGACGAGTTTTTGGCGAGTTCAAGGCAGCAGCGGCAG
GTTGGATCTTCTATGGTAGGCGCCTCAGTCCAGCGGCGGTGCAAAGACTTTTTCCGACGATTTGGGTTTTTATCGAGTGTGGGCGAGTTCAAGCCGATTTCCAACGGAGT
TTTCGCGTGGGTATCTTCCTTTGGCGTTTTCTGACGAGGTCTAGTAAGCCTTTAAGTTTCAGCAAAATTAGAGTTCTCTTCGCATTTGAAAGTTTAGTTGTCGATTTGGA
GCGTTTTCGGGCCCTCGGACATATGCCTCCACGTGGTCAAGGTCGAGGACGGGGACGCAGGCGTGGTCGTGATAGGGGTGGTAGAGGTCTGACACTCCTGGAACAAGTAG
ATCCTCCTATGGATCAGCATGATGAAGATCTCCCTGATGAGAAAGATCTTGCGTCGCCTGCACCTCCAGCGTAG
Protein sequenceShow/hide protein sequence
MNVIATAATHIVSPPLPRPAQFRRCRSSLKIHIDFSELSNWSPRDFWNWMVDHLNNEEIAKGSIIMWSIWNHRNKIQASSNRGAAEFLIKDVQRNINDWESSYLKNQNPE
RLRNHVSHVIWKRPKPNSWKLNADATWFEQRSVGGLGWVVHDSNGSLICFGLQKTDRNWEIKIMEAKAILMGIKAIRQTCLQLNLGLEIESDALEVIKVLVGDEEDLSEL
KAIAETIVSSSKDLREVSFIHCNRLANSTAHWLARHASSVNFCSKNFDFDQGNPLCEESGLSFWAPDLPSWFSPPFLEDMRQKNSCTTYDRVEIISQILELIVGELEQQP
ASELEQHPLQQVISDRRSATNPMTKSRRARTGSSGSCYWVISGRFLLHSHLLCVIHGGPAAGRVFGEFKAAAAGWIFYGRRLSPAAVQRLFPTIWVFIECGRVQADFQRS
FRVGIFLWRFLTRSSKPLSFSKIRVLFAFESLVVDLERFRALGHMPPRGQGRGRGRRRGRDRGGRGLTLLEQVDPPMDQHDEDLPDEKDLASPAPPA